Many time series appear to be and have useful long-run equilibrium relationships. We can difference to avoid the problem of spurious regression, but this loses the useful relationships!

Sometimes we can carry out meaningful regressions involving variables

Definition: Two time series and are cointegrated if there exists a linear combination such that

is called the cointegrating vector and the two variables share a common stochastic trend

Example:
Let and where , and are weakly stationary and mutually independent, and is a random walk, ,

The cointegrating vector is ,

Definition: A vector is cointegrated of order , denoted , if each element of is and there exists a linear combination for

How do we test if two variables are cointegrated?

We can reformulate cointegration to make this easier
Let where and are mutually independent
If and then

Now testing for cointegration is equivalent to testing is

First we perform a regression and obtain parameter estimates and for

The residuals are
So we test if

Definition: The Dickey-Fuller test for no-cointegration between and makes use of the -ratio test statistic where is the estimated parameter from the model and are residuals from the static regression

The null hypothesis is (no-cointegration) against (cointegration)

Critical values are taken from a special table, which accounts for kinds of trends, sample size, and number of time series being tested

  • Instead of single numbers, they fit a more complex function to accommodate different values of ,

In practice,

  1. Start with performing ADF unit root tests to determine the integration order of and (if either is stationary then we should stop testing)
  2. Perform regression and obtain residuals
  3. Perform unit root test on residuals and check against the special critical values

Error Correct Models

The ECM helps us model relationships between cointegrated non-stationary time series


If we fix for long-term equilibriums, we get

is called the error correction term, since it corrects towards the long-run equilibrium (if )

Granger’s Representation Theorem: If then admits the error-correction representation

In fact, the theorem is a bit more general and allows for for lags of the differences of our time series,
(in practice, can be zero-mean, weakly stationary, which is more general than white-noise)

Written in another way,

where

So our steps to estimate these parameters are,

  1. Regress on to obtain
  2. Perform a second regression for the final model parameters

This works because our estimator is super-consistent and is therefore unusually accurate and suitable for a second regression

Engle and Granger showed that and have the same asymptotic distribution as if is known