Many time series appear to be and have useful long-run equilibrium relationships. We can difference to avoid the problem of spurious regression, but this loses the useful relationships!
Sometimes we can carry out meaningful regressions involving variables
Definition: Two time series and are cointegrated if there exists a linear combination such that
is called the cointegrating vector and the two variables share a common stochastic trend
Example:
Let and where , and are weakly stationary and mutually independent, and is a random walk, ,
The cointegrating vector is ,
Definition: A vector is cointegrated of order , denoted , if each element of is and there exists a linear combination for
How do we test if two variables are cointegrated?
We can reformulate cointegration to make this easier
Let where and are mutually independent
If and then
Now testing for cointegration is equivalent to testing is
First we perform a regression and obtain parameter estimates and for
The residuals are
So we test if
Definition: The Dickey-Fuller test for no-cointegration between and makes use of the -ratio test statistic where is the estimated parameter from the model and are residuals from the static regression
The null hypothesis is (no-cointegration) against (cointegration)
Critical values are taken from a special table, which accounts for kinds of trends, sample size, and number of time series being tested
- Instead of single numbers, they fit a more complex function to accommodate different values of ,
In practice,
- Start with performing ADF unit root tests to determine the integration order of and (if either is stationary then we should stop testing)
- Perform regression and obtain residuals
- Perform unit root test on residuals and check against the special critical values
Error Correct Models
The ECM helps us model relationships between cointegrated non-stationary time series
If we fix for long-term equilibriums, we get
is called the error correction term, since it corrects towards the long-run equilibrium (if )
Granger’s Representation Theorem: If then admits the error-correction representation
In fact, the theorem is a bit more general and allows for for lags of the differences of our time series,
(in practice, can be zero-mean, weakly stationary, which is more general than white-noise)
Written in another way,
where
So our steps to estimate these parameters are,
- Regress on to obtain
- Perform a second regression for the final model parameters
This works because our estimator is super-consistent and is therefore unusually accurate and suitable for a second regression
Engle and Granger showed that and have the same asymptotic distribution as if is known