Module 3.1 Time Series Forecasting ARIMA Model
Supervised regression–based machine learning is a predictive form of modeling in which the goal is to model the
relationship between a target and the predictor variable(s) in order to estimate a continuous set of possible outcomes.
The first part of Module 3 will cover time series models. In its broadest form, time series analysis is about inferring
what has happened to a series of data points in the past and attempting to predict what will happen to it in the future.
There have been a lot of comparisons and debates in academia and the industry regarding the differences between
supervised regression and time series models. Most time series models are parametric (i.e., a known function is
assumed to represent the data), while the majority of supervised regression models are nonparametric.
The biggest difference between statistical modelling (SM) and machine learning (ML) is their purposes. While SM is
used for finding and explaining the relationships between variables, ML models are built for providing accurate
predictions without explicit programming.
Statistical models explicitly specify a probabilistic model or function for the data and identify variables that are usually
interpretable and of special interest, such as effects of predictor variables. In addition to identifying relationships
between variables, statistical models establish both the scale and significance of the relationship. For example, consider
an exponential smoothing model: Y’(t) = αY(t−1) + (1−α)Y’(t−1) + ε(t), with Y(t) the actual value and Y’(t) the predicted
value. We just need to fit the right parameters (in this case α) to that function.
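To make "fitting the right parameter" concrete, here is a minimal R sketch that applies the smoothing recursion above to a toy series. The value α = 0.3 and the data are made up for illustration and are not part of the module data.
alpha = 0.3                                  #illustrative smoothing parameter
y = c(100, 102, 101, 105, 107, 110)          #toy series, not the module data
y_hat = numeric(length(y))
y_hat[1] = y[1]                              #initialize the first predicted value at the first observation
for (t in 2:length(y)) {
  y_hat[t] = alpha * y[t - 1] + (1 - alpha) * y_hat[t - 1]
}
y_hat                                        #the predicted values Y'(t)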
Meanwhile, in ML, we don’t make any assumptions about the shape of the function that represents our data. We rely
on the universal approximation properties of our algorithm to find the best fit for our data.
Statistical modelling vs. machine learning:
▪ Statistical models are more interpretable as compared to machine learning; machine learning models are less interpretable and more complex.
▪ Statistical models are not best suited to large amounts of data; machine learning models can handle data sets ranging from small to large.
An Autoregressive Integrated Moving Average (ARIMA) model is one of the most popular and widely used statistical
models for time series forecasting. It is a class of statistical algorithms that captures the standard temporal dependencies
that are unique to time series data.
Before we introduce ARIMA models, we must first recall the concept of stationarity and autocorrelation. ARIMA models
are, in theory, the most general class of models for forecasting a time series which can be made to be stationary by
differencing (if necessary). A stationary series has no trend, its variations around its mean have a constant amplitude,
and it wiggles in a consistent fashion (i.e., its short-term random time patterns always look the same in a statistical
sense).
The latter condition means that its autocorrelations (correlations with its own prior deviations from the mean) remain
constant over time, or equivalently, that its power spectrum remains constant over time. A time series of this form
can be viewed as a combination of signal and noise, and the signal (if one is apparent) could be a pattern of fast or
slow mean reversion, or sinusoidal oscillation, or rapid alternation in sign. It could also have a seasonal component.
An ARIMA model can be viewed as a “filter” that tries to separate the signal from the noise, and the signal is then
extrapolated into the future to obtain forecasts.
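As a quick, hedged illustration of checking stationarity and the amount of differencing needed, the sketch below uses the built-in monthly AirPassengers series rather than the module data; adf.test() is from the tseries package and ndiffs() is from forecast.
library(tseries)
library(forecast)
adf.test(AirPassengers)    #a large p-value is consistent with non-stationarity (unit root not rejected)
ndiffs(AirPassengers)      #suggested order of non-seasonal differencing to stationarize the series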
Before we dive into the parameter tuning of an ARIMA model, let us first discuss what an Autoregressive (AR) model
is and how it differs from a Moving Average (MA) model. An autoregressive model of order p can be written as:
y_t = μ + φ_1 y_{t−1} + φ_2 y_{t−2} + ⋯ + φ_p y_{t−p} + ε_t,
where μ is the intercept or constant, φ_p is the AR coefficient at lag p, and ε_t is white noise. This is like a multiple
regression but with lagged values of y_t as predictors. We refer to this as an AR(p) model, an autoregressive model of
order p.
A moving average model of order q can be written as:
y_t = μ + ε_t + θ_1 ε_{t−1} + θ_2 ε_{t−2} + ⋯ + θ_q ε_{t−q},
where μ is the intercept or constant, ε_t is white noise, θ_q is the MA coefficient at lag q, and ε_{t−q} is the forecast error that
was made at period t − q. We refer to this as an MA(q) model, a moving average model of order q. A moving average
process states that the current or predicted value is linearly dependent on the current and past error terms. Again, the
error terms are assumed to be mutually independent and normally distributed, just like white noise.
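To make the two definitions concrete, the sketch below simulates an AR(2) and an MA(1) process with arbitrary coefficients (purely illustrative, not the module data) and plots their ACF and PACF, which previews the identification rules discussed next.
set.seed(123)
ar2 = arima.sim(model = list(ar = c(0.6, 0.3)), n = 500)    #AR(2) with φ_1 = 0.6, φ_2 = 0.3
ma1 = arima.sim(model = list(ma = 0.7), n = 500)            #MA(1) with θ_1 = 0.7
par(mfrow = c(2, 2))
acf(ar2);  pacf(ar2)     #ACF tails off, PACF cuts off after lag 2
acf(ma1);  pacf(ma1)     #ACF cuts off after lag 1, PACF tails off
par(mfrow = c(1, 1))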
If we combine differencing with autoregression and a moving average model, we obtain a non-seasonal ARIMA model, which in its differenced form can be written as:
y′_t = μ + φ_1 y′_{t−1} + ⋯ + φ_p y′_{t−p} + θ_1 ε_{t−1} + ⋯ + θ_q ε_{t−q} + ε_t,
where y′_t is the differenced series (it may have been differenced more than once or not at all). The “predictors” on
the right-hand side include both lagged values of y_t and lagged errors. We call this an ARIMA(p,d,q) model, where p = the number of AR terms, d = the number of (non-seasonal) differences applied, and q = the number of MA terms.
Notes on Non-Seasonal ARIMA Models
The following rules of thumb use the ACF and PACF of the stationarized series to choose p and q:
▪ If the PACF displays a sharp cutoff while the ACF is exponentially decaying or sinusoidal, we say that the
stationarized series displays an "AR signature," meaning that the autocorrelation pattern can be explained
more easily by adding AR terms than by adding MA terms. The lag at which the PACF cuts off is the indicated
number of AR terms. An AR(1) model has a single spike in the PACF and an ACF with the pattern ρ_k = φ_1^k.
An AR(2) model has two spikes in the PACF and a sinusoidal ACF that converges to 0.
▪ If the ACF of the differenced series displays a sharp cutoff while the PACF is exponentially decaying or
sinusoidal, the series displays an “MA signature,” meaning that the autocorrelation pattern can be explained
more easily by adding MA terms than by adding AR terms. The lag at which the ACF cuts off is the indicated
number of MA terms. Below is a sample of an MA(1) model.
▪ In most cases, the best model turns out to be a model that uses either only AR terms or only MA terms, although in
some cases a "mixed" model with both AR and MA terms may provide the best fit to the data.
▪ It is possible for an AR term and an MA term to cancel each other's effects. So, if a mixed ARMA model seems
to fit the data, try a model with one fewer AR term and one fewer MA term – particularly if the parameter
estimates in the original model require more than 10 iterations to converge. BEWARE OF USING MULTIPLE AR
TERMS AND MULTIPLE MA TERMS IN THE SAME MODEL.
▪ ARMA models (including both AR and MA terms) have ACFs and PACFs that both tail off to 0. These are the
trickiest because the order will not be particularly obvious. Basically, you just have to guesstimate that one or
two terms of each type may be needed and then see what happens when you estimate the model. Below is a
sample of an ARIMA(1,1,1) model.
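One hedged way to see the "try one fewer AR and one fewer MA term" advice in practice is to simulate a simple ARMA(1,1) series and compare an over-parameterized ARMA(2,2) fit against the simpler ARMA(1,1) fit by AIC. Everything below (series, coefficients, orders) is illustrative.
library(forecast)
set.seed(1)
x = arima.sim(model = list(ar = 0.5, ma = 0.4), n = 300)    #toy ARMA(1,1) series
fit_22 = Arima(x, order = c(2, 0, 2))                       #mixed model with redundant terms
fit_11 = Arima(x, order = c(1, 0, 1))                       #one fewer AR and one fewer MA term
c(AIC(fit_22), AIC(fit_11))                                 #the simpler fit is typically competitive or better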
Notes on Seasonal ARIMA Models
Seasonality in a time series is a regular pattern of changes that repeats over S time periods, where S defines the number
of time periods until the pattern repeats again.
For example, there is seasonality in monthly data for which high values tend always to occur in particular months and
low values tend always to occur in other particular months. In this case, S = 12 (months per year) is the span of the
periodic seasonal behavior. For quarterly data, S = 4 time periods per year.
A seasonal ARIMA (SARIMA) model is formed by including additional seasonal terms in the ARIMA models we have
seen so far. It is written as ARIMA(p,d,q)(P,D,Q)[m],
where m = the seasonal period (e.g., the number of observations per year). We use uppercase notation for the seasonal
parts of the model, and lowercase notation for the non-seasonal parts of the model. Here, P = number of seasonal
autoregressive (SAR) terms, D = number of seasonal differences, Q = number of seasonal moving average (SMA) terms.
In a SARIMA model, seasonal AR and MA terms predict 𝑦𝑡 using data values and errors at times with lags that are
multiples of S (the span of the seasonality).
▪ With monthly data (and S = 12), a seasonal first order autoregressive model would use 𝑦𝑡 −12 to predict 𝑦𝑡 . For
instance, if we were selling cooling fans, we might predict this March's sales using last March's sales. (This
relationship of predicting using last year's data would hold for any month of the year.)
▪ A seasonal second order autoregressive model would use 𝑦𝑡−12 and 𝑦𝑡−24 to predict 𝑦𝑡 . Here we would predict
this March’s values from the past two years’ March values.
▪ A seasonal first order MA(1) model (with S = 12) would use 𝜀𝑡−12 as a predictor. A seasonal second order MA(2)
model would use 𝜀𝑡−12 and 𝜀𝑡−24 as predictors.
In identifying a seasonal model, the first step is to determine whether a seasonal difference is needed. If the series has
a strong and consistent seasonal pattern, then consider using an order of seasonal differencing – but never use more
than one order of seasonal differencing or more than 2 orders of total differencing (non-seasonal and seasonal
combined).
The seasonal part of an AR or MA model will be seen in the seasonal lags of the PACF and ACF. For example, an
ARIMA(0,0,0)(0,0,1)[12] model will show:
▪ a spike at lag 12 in the ACF but no other significant spikes;
▪ exponential decay in the seasonal lags of the PACF (i.e., at lags 12, 24, 36, …).
In considering the appropriate seasonal orders for a seasonal ARIMA model, restrict attention to the seasonal lags. If
the autocorrelation at the seasonal period is positive, consider adding an SAR term to the model. If the autocorrelation
at the seasonal period is negative, consider adding an SMA term to the model. Try to avoid mixing SAR and SMA terms
in the same model, and avoid using more than one of either kind.
Usually an SAR(1) or SMA(1) term is sufficient. You will rarely encounter a genuine SAR(2) or SMA(2) process, and even
more rarely have enough data to estimate 2 or more seasonal coefficients without the estimation algorithm getting
into a "feedback loop."
ARIMAX is an extension of the traditional ARIMA model that allows for the inclusion of additional variables, known as
exogenous variables, which may have an effect on the time series being forecasted. These exogenous variables can be
any type of data, but for our purposes, we will only consider time-varying measurements: economic indicators such as
inflation rate or price indices, weather data, inventory turnover, etc.
By incorporating these external factors, ARIMAX models can provide more accurate and comprehensive predictions.
Additionally, ARIMAX models can also be used for causal analysis, where the relationship between the exogenous
variables and the time series data can be examined. Overall, ARIMAX models offer a powerful tool for forecasting and
analyzing time series data in a multivariate context.
We can see how this ARIMAX model compares with the standard ARIMA. For simplicity let’s first consider an
ARIMA(1,1,1): y′_t = μ + φ_1 y′_{t−1} + θ_1 ε_{t−1} + ε_t
Adding the exogenous term gives the ARIMAX(1,1,1): y′_t = μ + φ_1 y′_{t−1} + θ_1 ε_{t−1} + β X_t + ε_t
The new term consists of the ARIMAX coefficient β, fitted based on the model and data, and the exogenous variable X.
It is important to remark that this exogenous variable must be available for every time period. Make sure to include
exogenous variables with a strong correlation with the variable of interest/time series data.
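A minimal ARIMAX sketch with a simulated series and a made-up exogenous regressor (everything here is illustrative); in forecast's Arima() the regressor is passed through the xreg argument.
library(forecast)
set.seed(42)
x_exo = rnorm(120)                                            #made-up exogenous variable
y = 10 + 0.8 * x_exo + arima.sim(model = list(ar = 0.5), n = 120)
fit_arimax = Arima(y, order = c(1, 0, 0), xreg = x_exo)       #β is reported as the xreg coefficient
summary(fit_arimax)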
Residual Diagnostics
A good forecasting method will yield model residuals with the following properties:
1. The residuals are uncorrelated. If there are correlations between the residuals, then there is information left
in the residuals which should be used in computing forecasts.
2. The residuals have zero mean. If they have a mean other than zero, then the forecasts are biased.
3. The residuals have constant variance. This is known as “homoscedasticity”.
4. The residuals are normally distributed.
The first two properties can be checked by performing an ADF test for stationarity and a Ljung-Box test for serial
autocorrelation on the model residuals. Recall that:
▪ ADF test: p-value < .05 to reject H0 that a series of residuals contains a unit root and is non-stationary
▪ Ljung-Box test: p-value > .05 to fail to reject H0 that residuals are independently distributed
The last two properties make the calculation of prediction intervals easier.
▪ ARCH test: p-value > .05 to fail to reject H0 that a series of residuals exhibits no conditional heteroscedasticity
(ARCH effects)
▪ Jarque-Bera test: p-value > .05 to fail to reject H0 that the residuals follow a normal distribution
The first prediction interval is easy to calculate. If σ̂ is the standard deviation of the residuals, then a 95% prediction
interval for the one-step-ahead forecast is given by ŷ_{T+1|T} ± 1.96σ̂. This result is true for all ARIMA models regardless of their parameters and orders.
More general results, and other special cases of multi-step prediction intervals for an ARIMA(p,d,q) model, are given
in more advanced textbooks such as Brockwell & Davis (2016).¹
¹ Brockwell, P. J., & Davis, R. A. (2016). Introduction to Time Series and Forecasting (3rd ed.). Springer.
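For instance, here is a minimal sketch of that one-step-ahead interval on a built-in series (WWWusage; the model order is arbitrary), compared with what forecast() reports.
library(forecast)
fit = Arima(WWWusage, order = c(1, 1, 1))
sigma_hat = sqrt(fit$sigma2)                                       #residual standard deviation
fc = forecast(fit, h = 1, level = 95)
c(fc$mean[1] - 1.96 * sigma_hat, fc$mean[1] + 1.96 * sigma_hat)    #manual 95% interval
c(fc$lower[1], fc$upper[1])                                        #interval reported by forecast()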
The prediction intervals for ARIMA models are based on assumptions that the residuals are uncorrelated and normally
distributed. If either of these assumptions does not hold, then the prediction intervals may be incorrect. For this
reason, always plot the ACF and histogram of the residuals to check the assumptions before producing prediction
intervals and perform the Jarque-Bera test on the model residuals.
If the residuals are uncorrelated but not normally distributed, then bootstrapped intervals can be obtained instead. In
R, this is easily achieved by simply adding bootstrap=TRUE in the forecast() function.
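A short sketch of that option, re-using the same illustrative WWWusage fit as above (not the module data).
library(forecast)
fit = Arima(WWWusage, order = c(1, 1, 1))            #illustrative fit
fc_boot = forecast(fit, h = 12, bootstrap = TRUE)    #intervals simulated from resampled residuals instead of normality
fc_boot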
In general, prediction intervals from ARIMA models increase as the forecast horizon increases. For stationary models
(i.e., with d = 0) they will converge, so that prediction intervals for long horizons are all essentially the same. For d ≥ 1,
the prediction intervals will continue to grow into the future.
We will be using ARIMA models to forecast local inflation. Our data consists of monthly Philippine Consumer Price
Index (CPI) and the Rice Price Index from January 2005 to August 2023. Our target variable is the CPI data.
CODE BLOCK #1: Loading of libraries, data importation, train-test split, data integrity check
#load libraries
library(forecast)
library(FinTS)
library(tseries)
library(urca)
library(tidyverse)
#data importation
data = read.csv(file.choose(), header = T)
head(data)
tail(data)
#train-test split (95-5): hold out the final 12 months (Sep 2022 - Aug 2023) as the test set
#(assumes the CPI column is named `CPI`; adjust to the actual column name in the CSV)
ts_data = ts(data$CPI, start = c(2005, 1), frequency = 12)
ts_train = window(ts_data, end = c(2022, 8))
ts_test = window(ts_data, start = c(2022, 9))
Instead of the standard 70-30 or 80-20 train-test split, I chose a 95-5 split so that the training set captures how
the time series behaved during the pandemic and the “back-to-normal” period from late 2021 to early 2022. This
assumption in the data is important as we are forecasting a macroeconomic variable.
CODE BLOCK #2: Time series visuals
#decompose the training series into trend, seasonal, and random components
decompose_train = decompose(ts_train)
par(mfrow = c(4,1))
plot(as.ts(decompose_train$trend))
plot(as.ts(decompose_train$seasonal))
plot(as.ts(decompose_train$random))
plot(as.ts(decompose_train$x))        #the observed series itself
par(mfrow = c(1,1))
The time series has an observable upward trend, while it’s difficult to detect any sort of seasonal pattern. We’ll have
to check the ACF and PACF plots to inspect it better.
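The plots discussed below can be reproduced along these lines (a sketch; plotting the differenced series is an assumption made here to strip out the trend, and whether to difference first is a judgment call).
par(mfrow = c(1, 2))
acf(diff(ts_train), main = "ACF of differenced series")
pacf(diff(ts_train), main = "PACF of differenced series")
par(mfrow = c(1, 1))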
The ACF plot shows significant lags up to lag 3, with a spike at lag 1 and a significant drop at lags 2 and 3. The PACF plot
shows a significant spike at lag 1 that drops off immediately at lag 2; the PACF also has a more sinusoidal
pattern.
The time series displays a stronger AR signature, but it is not strong enough to merit a pure AR(p) model. We will have
to work with a mixed ARMA model, while being careful not to overfit by using too high an order for the AR(p) and MA(q)
terms.
#model 1 -> default auto.arima() fit on the training series
mod1 = auto.arima(ts_train)
summary(mod1)
jarque.bera.test(mod1$residuals)
ArchTest(mod1$residuals)
adf.test(mod1$residuals)
Box.test(mod1$residuals, type = "Ljung-Box")
auto.arima() is a statistical algorithm used for time series forecasting. It automatically determines the optimal
parameters for an ARIMA model, such as the order of differencing, autoregressive (AR) terms, and moving average
(MA) terms. It searches through different combinations of these parameters to find the best fit for the given time series
data. This automated process saves time and effort, making it easier for users to generate accurate forecasts without
requiring extensive knowledge of time series analysis.
The auto.arima() process fitted an ARIMA(1,1,0)(0,0,1)[12] with drift model. As expected, it applied a differencing
transformation to the time series with d = 1. The AR(1) term is something we expected from the ACF and PACF plots.
The algorithm also saw an MA signature, but included it in the seasonal parameters instead. The model diagnostics
also show that the model residuals pass all the regression assumptions.
#model 2 -> auto.arima() without seasonal terms (the call is reconstructed from the discussion below)
mod2 = auto.arima(ts_train, seasonal = FALSE)
summary(mod2)
jarque.bera.test(mod2$residuals)
ArchTest(mod2$residuals)
adf.test(mod2$residuals)
Box.test(mod2$residuals, type = "Ljung-Box")
#model 3 -> manually tuned ARIMA(1,1,3) (orders taken from the discussion below)
mod3 = Arima(ts_train, order = c(1, 1, 3))
summary(mod3)
jarque.bera.test(mod3$residuals)
ArchTest(mod3$residuals)
adf.test(mod3$residuals)
Box.test(mod3$residuals, type = "Ljung-Box")
#model 4 -> SARIMAX: (p,d,q) from Model #3, seasonal (P,D,Q) from Model #1, plus the rice
#price index as exogenous regressor (the column name `Rice` is assumed; align it with ts_train)
rice_train = window(ts(data$Rice, start = c(2005, 1), frequency = 12), end = c(2022, 8))
mod4 = Arima(ts_train, order = c(1, 1, 3), seasonal = c(0, 0, 1), xreg = rice_train)
jarque.bera.test(mod4$residuals)
ArchTest(mod4$residuals)
adf.test(mod4$residuals)
Box.test(mod4$residuals, type = "Ljung-Box")
Model #2 shows how to fit an auto.arima() model without seasonality. It fitted a mixed ARMA model (with d = 1
differencing, as expected) with p = 2 AR terms and q = 2 MA terms. A high order of AR and MA terms at the same time poses
stability and accuracy concerns for our forecasts, even if the model diagnostics pass all assumptions. The AR(2) fit is due
to the significant lag 2 in our ACF plot. We will have to check our actual forecasts against the test set to assess the accuracy
of the mixed ARMA(2,2) fit.
Model #3 shows how we can manually tune the (p,d,q) parameters of the ARIMA model. Again, we manually set
d = 1 given our knowledge from the earlier ADF test that we need to difference the time series to make
it stationary. I fitted an AR(1) parameter here given the strong AR signature of the time series, especially at lag 1 as
seen in the ACF plot. The MA term was trickier to tune; it was honestly fitted with guesswork. The (1,1,1) and (1,1,2)
fits both produced residuals that violated more than one assumption, and the (1,1,3) fit still violated the autocorrelation
assumption. I stopped there because, as with linear regression models, a less parsimonious fit (i.e., an AR or MA order
greater than 3) will be penalized during estimation.
Model #4 shows a SARIMAX fit. The exogenous variable here is the local price index for rice. Rice, being a staple
commodity locally, is one of the most heavily weighted items in the CPI basket. As in Model #3, we manually fitted
the parameters by combining the (p,d,q) parameters from Model #3 with the (P,D,Q) fit from Model #1. The model
residuals pass all regression assumptions, but we have to check the forecasts to assess their accuracy.
#model 1 forecasts
mod1_fcst = forecast(mod1, h = 12)
mod1_fcst
#model 2 forecasts
mod2_fcst = forecast(mod2, h = 12)
mod2_fcst
#model 3 forecasts
mod3_fcst = forecast(mod3, h = 12)
mod3_fcst
#model 4 forecasts (forecasting with xreg needs future values of the regressor; `Rice` column name assumed)
rice_test = window(ts(data$Rice, start = c(2005, 1), frequency = 12), start = c(2022, 9))
mod4_fcst = forecast(mod4, xreg = rice_test, h = 12)
mod4_fcst
The h argument is the number of time steps ahead that you want to forecast. Since our test set is 12 months long, we set
h = 12.
We did not use the forecasts from Model #3 because the residual diagnostics failed the autocorrelation assumption.
Here, we can see that the best model that we fitted is Model #1, the auto.arima() fit.
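One hedged way to make this comparison explicit is to score each model's forecasts against the held-out test set with forecast::accuracy(); the objects below are the ones defined in the earlier code blocks (Model #3 is excluded because its residual diagnostics failed).
accuracy(mod1_fcst, ts_test)    #RMSE/MAE/MAPE on both the training and test sets
accuracy(mod2_fcst, ts_test)
accuracy(mod4_fcst, ts_test)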
It’s not always the case that the auto.arima() fit will produce the best forecast. Here, we hoped that the exogenous
variable would help stabilize the forecast. Instead, it acted as a dampener on the Model #4 forecasts, hence the understated
forecast values.
You’ll also notice that the ARIMA models we fitted barely reached the levels in the test set. A couple of things
contributed to this.
First, the ARIMA model, like most regression models based on a linear function, tends to underestimate
forecasts over the long term because the stationarity assumption forces the forecast to be mean-reverting. Second, the
ARIMA model has a recency bias (we set the AR(p) lag to 1), and the latest data in our train set covered the pandemic (when
prices dropped significantly) and the first few months of the economy reopening. Our test set shows a significant increase
from Q4 2022 onward that the ARIMA models were not able to chase. You’ll notice in the table below that there was a
significant month-on-month jump in October 2022 that the ARIMA models were not able to consider.
So what are the next steps? We can try to re-fit our ARIMA models (as stated in the notes on non-seasonal
ARIMA, parameter tuning for a mixed ARMA model takes a lot of guesswork). We can also try transforming the actual data
itself and winsorizing the 2020 and 2021 observations to force the data to follow the pre-pandemic growth rate. This
means we would model the train set as if the pandemic did not happen.
We can also fit other models suited to time series forecasting. For further reading, you can check out Vector
Autoregression and the ARCH family of models, which is better suited to modelling volatility. The other alternative is, of
course, to use other machine learning models that do not depend on Gaussian assumptions, such as recurrent neural network
models.
CODE BLOCK #7: Forecasting 12-month ahead using the best ARIMA model
# 12-month ahead forecast on the entire dataset using best ARIMA model
#re-fit Model #1's parameters (ARIMA(1,1,0)(0,0,1)[12] with drift) on the full series from Code Block #1
mod1.series = Arima(ts_data, order = c(1, 1, 0), seasonal = c(0, 0, 1), include.drift = TRUE)
jarque.bera.test(mod1.series$residuals)
ArchTest(mod1.series$residuals)
adf.test(mod1.series$residuals)
Box.test(mod1.series$residuals, type = "Ljung-Box")
#series forecasts
series_fcst = forecast(mod1.series, h = 12)
series_fcst
Assuming that we will use ARIMA as our time series model, we would eventually want to produce the 12-month-ahead
forecasts for inflation using the entire dataset and the parameters of the best ARIMA model.
In our case, we will use Model #1. Recall that Model #1 is the auto.arima() fit and returned an ARIMA(1,1,0)(0,0,1)[12]
with drift. In the code block above, I did not re-run auto.arima() but instead fitted those exact parameters using the entire
dataset this time.