Econometrics__2__Notes (1)
Econometrics__2__Notes (1)
Yi = β1 + β2 Xi + ui (1)
Properties of LPM
1. E(Yi |Xi ) = β1 + β2 Xi , which is the conditional probability P (Yi =
1|Xi ).
2. Disadvantages of LPM:
• Non-normality of the error term ui .
• Heteroskedasticity in ui .
• Predicted probabilities can fall outside the range [0,1].
• R2 may not be a meaningful measure of fit.
2 Logit Model
The logit model uses the logistic distribution function for modeling binary
outcomes:
1
Pi = , Zi = β1 + β2 Xi (2)
1 + e−Zi
The log of the odds ratio is linear:
Pi
Li = ln = Zi = β1 + β2 Xi (3)
1 − Pi
1
Form of Y Form of X Interpretation
Y X When X changes by 1 unit, Y will change by β1 units.
ln Y ln X When X changes by 1%, Y will change by β1 % (elasticity).
Y ln X When X changes by 1%, Y will change by β1 /100 units.
ln Y X When X changes by 1 unit, Y will change by (β1 × 100)%.
3 Probit Model
The probit model assumes a cumulative normal distribution for binary out-
comes:
P (Y = 1|X) = Φ(β1 + β2 X) (4)
where Φ is the cumulative distribution function (CDF) of the standard nor-
mal distribution: Z X
1 t2
Φ(X) = √ e− 2 dt (5)
−∞ 2π
• Equation: ut ∼ IIDN(0, σ 2 ).
• Properties:
2
– ACF is zero at all lags except lag 0.
odds ratio = eZ
• Properties:
• Properties:
– Always stationary.
– The ACF cuts off after lag q.
– The PACF decays gradually.
3
Autoregressive Moving Average (ARMA) Model
• Definition: Combines the AR and MA models to explain a time series
using both past values and past error terms.
• Properties:
• Properties:
• Covariance depends only on the lag (distance) between two time peri-
ods, not the actual time.
4
Key Properties:
• Mean Reversion: The series tends to return to its mean over time.
• Constant Fluctuations: Variance remains stable, indicating consis-
tent amplitude of fluctuations.
Importance of Stationarity:
• Nonstationary series are specific to the observed time period and un-
suitable for generalization or forecasting.
5
Key Concepts
• Random walks (with or without drift) are nonstationary processes.
• Random walks exhibit stochastic trends.
Types of Trends
Deterministic Trend:
• Predictable and constant over time.
• Equation: Yt = β1 + β2 t + ut
• Subtracting the trend (β1 + β2 t) results in a stationary series. This
process is called detrending.
Stochastic Trend:
• Unpredictable and nonstationary.
• Found in random walks with or without drift.
Integrated Processes
A stochastic process requiring differencing d times to achieve stationarity is
said to be integrated of order d, denoted as I(d).
• Example:
– I(0): Stationary time series.
– I(1): Requires first differencing.
– I(2): Requires second differencing.
• Range: −1 ≤ ρk ≤ 1.
Correlogram:
• A plot of ρk against k (lags).
• High and slowly decaying ρk : Indicates nonstationarity.
6
Testing Autocorrelation Significance
Box–Pierce Q Statistic:
• Approximation: Q ∼ χ2 (m).
• LB ∼ χ2 (m).
Yt = ρYt−1 + ut where − 1 ≤ ρ ≤ 1
Yt − Yt−1 = (ρ − 1)Yt−1 + ut
7
Dickey-Fuller (DF) Test
• If H0 is true (δ = 0), the t-statistic for the coefficient of Yt−1 in the
regression follows the τ -statistic distribution (critical values available
in specialized tables).
∆Yt = δYt−1 + ut
∆Yt = β1 + δYt−1 + ut
∆Yt = β1 + β2 t + δYt−1 + ut
8
The F-statistic is given by the equation:
SSR
Explained Variation per Degree of Freedom k−1
F = = SSE
(6)
Unexplained Variation per Degree of Freedom n−k
where:
• n: Number of observations
9
Engle–Granger (EG) or Augmented Engle–Granger (AEG) Test
• The DF or ADF unit root tests can be applied by estimating a regres-
sion of the form:
Yt = β1 + β2 Xt + ut ,
obtaining the residuals ut , and applying the DF or ADF tests to ut .
• Critical values for this test were provided by Sargan and Bhargava.
• In the short run, disequilibrium may occur. The error term in the
cointegrating equation can be treated as the equilibrium error.
10
– Each endogenous variable is explained by its own lagged values
and the lagged values of all other endogenous variables in the
model.
– Typically, there are no exogenous variables in the model.
7. Regression of one time series variable on one or more time series vari-
ables often can give nonsensical or spurious results. This phenomenon
is known as spurious regression. One way to guard against it is to find
out if the time series are cointegrated.
11
Engle–Granger (EG), Augmented Engle–Granger (AEG), and Cointe-
grating Regression Durbin–Watson (CRDW) tests can be used to find
out if two or more time series are cointegrated.
9. Cointegration of two (or more) time series suggests that there is a long-
run, or equilibrium, relationship between them.
10. The error correction mechanism (ECM) developed by Engle and Granger
is a means of reconciling the short-run behavior of an economic variable
with its long-run behavior.
11. The field of time series econometrics is evolving. The established results
and tests are in some cases tentative, and a lot more work remains. An
important question that needs an answer is why some economic time
series are stationary and some are nonstationary.
12