Definition: A time series is a sequence of random variables ordered by time, denoted as
Definition: A sample of size from a time series is a realized path of the time series
Definition: A time series model for a sample is a set of joint distributions for the corresponding time series
We often omit the interval in our notation, sometimes assuming the series is infinite
We often focus only on the first two moments of the data (means, variances, covariances)
The simplest model is that of independent and identically distributed points, denoted as
We can be more specific and specify the distribution , in which case we denote the series as
In real life, we expect dependence in time series data between nearby points, however IID sequences are still important components of more realistic data
Definition: A white noise process, denoted , is a sequence of uncorrelated random variables with zero mean and constant variance
This is more general than the IID conditions (with mean 0)
Example:
Consider if is odd and where if is even
is the skewness, which is 0 for a normal variable (it’s symmetric)
We can also see that variance is constant, ( accomplishes this on the even terms)
For a general time series, we define functions for the first two moments,
An important classification is whether these first moments are time-invariant (dependent on )
Definition: A time series is weakly stationary (or covariance stationary) if the mean and autocovariance are constant in , and the variance is finite.
- , which implies
is weakly stationary
This is a powerful condition
We can easily define estimation functions for these moments for a given sample
Definition: The autocorrelation function (ACF) for a weakly stationary function is
This is analogous to how we scale covariance to obtain the correlation coefficient
Of course, plenty of time series are not weakly stationary! Consider
is not constant in
Definition: Trend-stationary processes combine a stationary process with a deterministic trend (such as )
These are non-stationary
Definition: A random walk is a time series defined as where is a white noise process
This is often written as with
This is not weakly stationary because the variance increases with
This is said to have a stochastic trend which will be discussed more later
Definition: The autoregressive model of order one, AR(1), is defined by the process , where
This is probably the most widely used time series in practice
We can also write this in closed form,
considers all indices over
If (random walk), the process is non-stationary
An AR(1) process with is called stable
Assuming a stable process is stationary, we have , which means
Using , we can derive
Exercise: Show
We can also solve this more precisely, or write out a recursive formula
It follows that
This reveals that directly controls the decay of correlation as increases
Definition: The lag operator maps to ,
Definition: The difference operator is defined as ,
The difference operator plays an important role in detrending data
Example: Let with
Is this weakly stationary?
Yes
If , this is
If , this is
If , this is 0
So yes
In general, we can take to detrend a polynomial trend of degree
The approach of differencing until a time series is stationary is called the Box-Jenkins approach
Differencing is also useful for stochastic trends, like the random walk
is weakly stationary
Definition: The first-order lag polynomial is defined as ,
This lets us write AR(1) as
Why would we do this?
If the inverse of this operator exists, we can now say
This is more abstract but looks interesting
Lemma: Let and suppose is weakly stationary. Then where the equality is in mean square.
Definition: Two random variables and are equal in mean square if
This lemma allows us to easily derive our earlier expression