Cointegration

Many time series appear to be $I (1)$ and have useful long-run equilibrium relationships. We can difference to avoid the problem of spurious regression, but this loses the useful relationships!

Sometimes we can carry out meaningful regressions involving $I (1)$ variables

Definition: Two $I (1)$ time series ${Y_{t}}$ and ${X_{t}}$ are cointegrated if there exists a linear combination $Y_{t} - λ X_{t} = (1, - λ) (Y_{t}, X_{t})^{'}$ such that ${Y_{t} - λ X_{t}} \sim I (0)$

$(1, - λ)^{'}$ is called the cointegrating vector and the two variables share a common stochastic trend

Example:
Let $Y_{t} = a R_{t} + Z_{t}$ and $X_{t} = b R_{t} + W_{t}$ where $a, b \neq = 0$ , ${Z_{t}}$ and ${W_{t}}$ are weakly stationary and mutually independent, and ${R_{t}}$ is a random walk, $R_{t} = R_{t - 1} + u_{t}$ , ${u_{t}} \sim WN (0, σ_{u}^{2})$

The cointegrating vector is $(1, - a / b)^{'}$ ,
$(1, - a / b) (Y_{t}, X_{t})^{'} = Y_{t} - a X_{t} / b = a R_{t} + Z_{t} - a R_{t} - a W_{t} / b$
$= Z_{t} - a W_{t} / b$

Definition: A vector $X_{t} = (X_{1, t}, \dots, X_{m, t})$ is cointegrated of order $(r, k)$ , denoted $X_{t} \sim CI (r, k)$ , if each element of $X_{t}$ is $I (r)$ and there exists a linear combination $c^{'} X_{t} \sim I (r - k)$ for $k \geq 1$

How do we test if two variables are cointegrated?

We can reformulate cointegration to make this easier
Let $Y_{t} = λ X_{t} + Z_{t}$ where ${X_{t}}$ and ${Z_{t}}$ are mutually independent
If ${X_{t}} \sim I (1)$ and ${Z_{t}} \sim I (0)$ then $(Y_{t}, X_{t}) \sim CI (1, 1)$

Now testing for cointegration is equivalent to testing ${Z_{t}}$ is $I (0)$

First we perform a regression and obtain parameter estimates $\hat{δ}$ and $\hat{λ}$ for $Y_{t} = δ + λ X_{t} + Z_{t}$

The residuals are $\hat{Z}_{t} = Y_{t} - \hat{δ} - \hat{λ} X_{t} \approx Z_{t}$
So we test if ${\hat{Z}_{t}} \sim I (0)$

Definition: The Dickey-Fuller test for no-cointegration between ${Y_{t}}$ and ${X_{t}}$ makes use of the $t$ -ratio test statistic $DFNC = (\hat{ϕ} - 1) / SE (\hat{ϕ})$ where $\hat{ϕ}$ is the estimated parameter from the $AR (1)$ model $\hat{Z}_{t} = ϕ \hat{Z}_{t - 1} + ϵ_{t}$ and ${\hat{Z}_{t}}$ are residuals from the static regression $Y_{t} = δ + λ X_{t} + Z_{t}$

The null hypothesis is $H_{0} : ϕ = 1$ (no-cointegration) against $H_{1} : ∣ ϕ ∣ < 1$ (cointegration)

Critical values are taken from a special table, which accounts for kinds of trends, sample size, and number of time series being tested

Instead of single numbers, they fit a more complex function to accommodate different values of $T$ , $DFNC_{crit} = β_{\infty} + β_{1} / T + β_{2} / T^{2} + β_{3} / T^{3}$

In practice,

Start with performing ADF unit root tests to determine the integration order of ${Y_{t}}$ and ${X_{t}}$ (if either is stationary then we should stop testing)
Perform regression and obtain $\hat{Z}_{t}$ residuals
Perform unit root test on residuals and check against the special critical values

Error Correct Models

The ECM helps us model relationships between cointegrated non-stationary time series

$Δ Y_{t} = γ [Y_{t - 1} - δ - λ X_{t - 1}] + ϵ_{t}$
If we fix for long-term equilibriums, we get $\overset{ˉ}{Y} = δ + λ \overset{ˉ}{X}$

$Y_{t - 1} - δ - λ X_{t - 1}$ is called the error correction term, since it corrects $Y_{t}$ towards the long-run equilibrium (if $γ < 0$ )

Granger’s Representation Theorem: If $(Y_{t}, X_{t}) \sim CI (1, 1)$ then ${Y_{t}}$ admits the error-correction representation $Δ Y_{t} = γ [Y_{t - 1} - δ - λ X_{t - 1}] + ϵ_{t}$

In fact, the theorem is a bit more general and allows for for lags of the differences of our time series,
$Δ Y_{t} = γ [Y_{t - 1} - δ - λ X_{t - 1}] + \sum_{i = 1}^{p} ϕ_{i} Δ Y_{t - i} + \sum_{j = 1}^{q} β_{j} Δ X_{t - j} + ϵ_{t}$ (in practice, $ϵ_{t}$ can be zero-mean, weakly stationary, which is more general than white-noise)

Written in another way,
$Δ Y_{t} = γ Z_{t - 1} + \sum_{i = 1}^{p} ϕ_{i} Δ Y_{t - i} + \sum_{j = 1}^{q} β_{j} Δ X_{t - j} + ϵ_{t}$
where $Z_{t - 1} = Y_{t - 1} - δ - λ X_{t - 1}$

So our steps to estimate these parameters are,

Regress $X_{t}$ on $Y_{t}$ to obtain $\hat{Z}_{t}$
Perform a second regression for the final model parameters

This works because our estimator $(\hat{δ}, \hat{λ})^{'}$ is super-consistent and is therefore unusually accurate and suitable for a second regression

Engle and Granger showed that $\hat{ϕ}_{i}$ and $\hat{β}_{j}$ have the same asymptotic distribution as if ${Z_{t}}$ is known

Binyamin's Notes

Explorer

Error Correct Models

Table of Contents