Time Series Analysis is a specialized branch of data science that deals with data points indexed in time order. Unlike standard regression where we assume observations are independent, in Time Series, the order of data matters because today’s value usually depends on yesterday’s.
To analyze a time series, we first decompose it into four fundamental parts:
Most statistical forecasting models (like ARIMA) require the data to be Stationary.
ARIMA is the “Gold Standard” for non-seasonal time series forecasting. It combines three parts:
Notation: $ARIMA(p, d, q)$. For example, $ARIMA(1, 1, 1)$ means we use 1 lag, 1 difference, and 1 error term.
SARIMA is an extension of ARIMA that explicitly supports univariate time series data with a seasonal component.
Beyond ARIMA/SARIMA, there are several modern ways to predict the future:
Assigns exponentially decreasing weights to past observations. The most recent data is weighted more heavily than older data.
An open-source tool designed for business forecasting.
Long Short-Term Memory (LSTM) networks are a type of Recurrent Neural Network (RNN) specifically designed to “remember” long-term dependencies in sequences.
When choosing between multiple time series models, we use:
| Technique | Best For | Complexity |
| Naive Forecast | Stable data with no change. | Very Low |
| ARIMA | Data with a trend but no seasonality. | Medium |
| SARIMA | Data with strong seasonal patterns. | High |
| Prophet | Business data with holidays/missing points. | Low (Automated) |
| LSTM | Massive datasets with non-linear patterns. | Very High |