25 Spring 439/639 TSA: Lecture 5

Author

Dr Sergey Kushnarev

1 Causality condition for an AR(\(p\)) process

1.1 Derive the GLP form

Recall last time, from the AR(\(p\)) \[ Y_t - \phi_1 Y_{t-1} - \phi_2 Y_{t-2} - \cdots - \phi_p Y_{t-p} = e_t, \quad e_t \sim \mathrm{iid}(0, \sigma_e^2), \] we got the reformulated form \(\Phi(B) Y_t = e_t\), where the AR polynomial is \(\Phi(x) = 1 - \phi_1 x^1 - \phi_2 x^2 - \cdots - \phi_p x^p\). If all the \(p\) complex roots of the AR polynomial satisfy \(|z_i| >1\) for all \(i=1,...,p\), i.e., all the \(p\) roots are outside the unit disc in \(\mathbb{C}\), then we showed that the AR(\(p\)) equation became \[ e_t = \left(1-\frac{B}{z_1}\right) \cdots \left(1-\frac{B}{z_p}\right) Y_t . \] Since each \(|\frac{1}{z_i}|<1\), we have \(\left(1-\frac{B}{z_i}\right) ^{-1} = \sum_{j=0}^\infty z_i^{-j} B^j\). Then \[ \begin{split} Y_t &= \left(1-\frac{B}{z_1}\right)^{-1} \cdots \left(1-\frac{B}{z_p}\right)^{-1} e_t \\ &= \left(\sum_{j=0}^\infty z_1^{-j} B^j \right) \cdots \left(\sum_{j=0}^\infty z_p^{-j} B^j \right) e_t . \end{split} \] As we mentioned last time, this product can be written as a GLP. To see this, we first consider the simple case \(p=2\). Suppose \(a_j = z_1^{-j}\) and \(b_j = z_2^{-j}\). Then \[ \begin{split} & \left(\sum_{j=0}^\infty z_1^{-j} B^j \right) \left(\sum_{j=0}^\infty z_2^{-j} B^j \right) = \left(\sum_{j=0}^\infty a_j B^j \right) \left(\sum_{j=0}^\infty b_j B^j \right) \\ &= \left(a_0 + a_1 B^1 + a_2 B^2 + \cdots \right) \left(b_0 + b_1 B^1 + b_2 B^2 + \cdots \right) \\ &= a_0 b_0 + (a_0 b_1 + a_1 b_0) B^1 + (a_0 b_2 + a_1 b_1 + a_2 b_0) B^2 + (a_0 b_3 + a_1 b_2 + a_2 b_1 + a_3 b_0) B^3 + \cdots \\ &= \sum_{n=0}^\infty \underbrace{\left( \sum_{j_1+j_2 = n} a_{j_1} b_{j_2} \right)}_{c_n} B^n . \end{split} \] For general \(p\), we have the similar result \[ \left(\sum_{j=0}^\infty a_{1,j} B^j \right) \left(\sum_{j=0}^\infty a_{2,j} B^j \right) \cdots \left(\sum_{j=0}^\infty a_{p,j} B^j \right) = \sum_{n=0}^\infty c_n B^n \] where \(c_n = \sum_{j_1+j_2+\cdots + j_p = n} a_{1,j_1} a_{2,j_2} \cdots a_{p,j_p}\). Note: this can be seen as a \(p\)-fold convolution. Using this result, \[ \begin{split} & \Phi(B)^{-1} = \left(\sum_{j=0}^\infty z_1^{-j} B^j \right) \cdots \left(\sum_{j=0}^\infty z_p^{-j} B^j \right) = \sum_{n=0}^\infty \psi_n B^n ,\\ & \text{ where}\quad \psi_n = \sum_{j_1+j_2+\cdots + j_p = n} z_1^{-j_i} z_2^{-j_2} \cdots z_p^{-j_p} . \end{split} \] So the AR(\(p\)) can be written as \[ \begin{split} Y_t &= \left(\sum_{j=0}^\infty z_1^{-j} B^j \right) \cdots \left(\sum_{j=0}^\infty z_p^{-j} B^j \right) e_t = \left(\sum_{n=0}^\infty \psi_n B^n \right) e_t = \sum_{n=0}^\infty \psi_n e_{t-n} \end{split} \] which looks like a GLP, with the \(\{\psi_n\}\) specified above. To ensure this is a GLP, we still need to verify \(\sum_{n=0}^\infty |\psi_n| < \infty\) (see the definition of GLP).

1.2 Sketch of the remaining proof

The sketch of the proof for \(\sum_{n=0}^\infty |\psi_n| < \infty\): \[ \begin{split} \sum_{n=0}^\infty |\psi_n| &= \sum_{n=0}^\infty \left| \sum_{j_1+j_2+\cdots + j_p = n} z_1^{-j_i} z_2^{-j_2} \cdots z_p^{-j_p} \right| \le \sum_{n=0}^\infty \sum_{j_1+j_2+\cdots + j_p = n} |z_1|^{-j_i} |z_2|^{-j_2} \cdots |z_p|^{-j_p} \\ &\le \sum_{n=0}^\infty \sum_{j_1+j_2+\cdots + j_p = n} |z^*|^{-j_1-j_2-\cdots - j_p} \le \sum_{n=0}^\infty c(p) n^p |z^*|^{-n} < \infty . \end{split} \] Notes on the missing details (without proof):

  • Let \(z^*\) be the root among \(\{z_1,\dots, z_p\}\) with the smallest modulus, so \(1<|z^*| \le \min\{ |z_1|,\dots, |z_p| \}\).
  • The number of terms in the summation \(\sum_{j_1+j_2+\cdots + j_p = n}\) can be upper bounded by \(c(p) n^p\) where \(c(p)\) is a constant that only depends on \(p\). (Since \({n+p-1\choose p-1} = \frac{(n+p-1)(n+p-2) \cdots (n+1)}{ (p-1)!}\) is a polynomial of \(n\) with degree less than \(p\) and coefficients only depend on \(p\).)
  • The last step is because \(\sum_{n=0}^\infty n^p |z^*|^{-n}\) converges for \(|z^*| >1\).

So we just showed \(\sum_{n=0}^\infty |\psi_n| < \infty\), which finishes the proof that \(Y_t = \Phi(B)^{-1} e_t\) is a GLP (as long as all roots of \(\Phi(x)\) are outside the unit disc in \(\mathbb{C}\)).

1.3 Discussion on other cases (without proof)

In summary, for AR(\(p\)) process, we have the following results. (Look similar to the three cases for AR(\(1\)) from last lecture.)

  • If all roots of the AR polynomial are outside the unit disc (\(|z_i| > 1\) for all \(i\)), then \((Y_t)\) is causal and stationary.
  • If at least one of the roots is inside the unit disc (\(|z_i| < 1\) for one \(z_i\)), and none of the roots is on the unit circle (\(|z_i| \neq 1\) for all \(i\)), then \((Y_t)\) is non-causal (future-dependent) and stationary.
  • If at least one root is a unit root (i.e. \(|z_i| = 1\) for one \(z_i\)), then \((Y_t)\) is not stationary.

1.4 Example: AR(\(2\))

We start with a concrete AR(\(2\)) example. Suppose the AR(\(2\)) equation is \[ Y_t = Y_{t-1} + Y_{t-2} + e_t . \] We can use the previous results to find whether this process is causal or stationary. The AR polynomial for this example is \(\Phi(x) = 1 - x - x^2\). Solving \(\Phi(x)=0\), we get two roots \[ z_{1,2} = \frac{-1 \pm \sqrt{1 + 4}}{2} = \frac{-1 \pm \sqrt{5}}{2} . \] Since \(\left| \frac{-1 - \sqrt{5}}{2} \right| >1\) and \(\left| \frac{-1 + \sqrt{5}}{2} \right| <1\), this process is stationary but non-causal. Note: in this example \(z_{1,2}\) are both real, so the modulus \(|z|\) reduces to the absolute value of real number.

In general, for a generic AR(\(2\)) equation \[ Y_t = \phi_1 Y_{t-1} + \phi_2 Y_{t-2} + e_t , \] the AR polynomial is \(\Phi(x) = 1 - \phi_1 x - \phi_2 x^2\), which has two roots \[ z_{1,2} = \frac{-\phi_1 \pm \sqrt{\phi_1^2 + 4\phi_2}}{2\phi_2} . \] The causality condition can be explicitly characterized by \[ |z_{1,2}| > 1 \iff \begin{cases} \phi_1 + \phi_2 < 1 \\ \phi_2 - \phi_1 < 1 \\ |\phi_2| < 1 \end{cases} \]

2 Remark on different definitions with the textbook

In the textbook (Cryan and Chan), the definition of GLP is \(Y_t = \sum_{j=0}^{\infty} \psi_j\, e_{t-j}\). So their stationary GLP means causal and stationary GLP in our notations.

In this course TSA, we defined GLP as \(Y_t = \sum_{j=-\infty}^{\infty} \psi_j\, e_{t-j}\). And a stationary GLP can be either causal or non-causal in our setting.

3 Finding the coefficients \(\psi\) in the GLP representation of AR(\(p\))

3.1 Method 1: using convolution

We can use the formula \(\psi_n = \sum_{j_1+j_2+\cdots + j_p = n} z_1^{-j_i} z_2^{-j_2} \cdots z_p^{-j_p}\) from the first part of this lecture.

Example: consider the AR(\(2\)) equation \(Y_t = \frac{1}{6} Y_{t-1} + \frac{1}{6} Y_{t-2} + e_t\).

Exercise: verify this AR(\(2\)) process is causal, and the roots of the AR polynomial are \(\{-3, 2\}\).

Then we can use the formula above to calculate \(\psi_n\): \[ \begin{split} \psi_0 &= z_1^0 z_2^0 = 1 \\ \psi_1 &= z_1^0 z_2^{-1} + z_1^{-1} z_2^0 = 1 \times \frac{1}{2} + \frac{1}{-3} \times 1 = \frac{1}{2} + \left(-\frac{1}{3}\right) = \frac{1}{6} \\ \psi_2 &= z_1^0 z_2^{-2} + z_1^{-1} z_2^{-1} + z_1^{-2} z_2^0 = \frac{1}{4} + \frac{1}{-3} \cdot \frac{1}{2} + \frac{1}{9} = \frac{7}{36} \\ \cdots \end{split} \]

3.2 Method 2: using AR(\(p\)) equation

We can use the AR(\(p\)) equation and directly solve a system for \(\{\psi_n\}\). Consider the same example \[ Y_t = \frac{1}{6} Y_{t-1} + \frac{1}{6} Y_{t-2} + e_t . \] Since we want to get the GLP form \(Y_t = \psi_0 e_t + \psi_1 e_{t-1} + \psi_2 e_{t-2} + \cdots\), we can plug it into the equation above: \[ \psi_0 e_t + \psi_1 e_{t-1} + \psi_2 e_{t-2} + \cdots = \frac{1}{6} (\psi_0 e_{t-1} + \psi_1 e_{t-2} + \psi_2 e_{t-3} + \cdots) + \frac{1}{6} (\psi_0 e_{t-2} + \psi_1 e_{t-3} + \psi_2 e_{t-4} + \cdots) + e_t . \] For this to hold, the coefficients for each \(e_{t-n}\) term (\(n=0,1,\dots\)) on both sides should match. So we get the following system of equations \[ \begin{split} \psi_0 &= 1 \\ \psi_1 &= \frac{1}{6} \psi_0 \\ \psi_n &= \frac{1}{6} \psi_{n-1} + \frac{1}{6} \psi_{n-2} \quad\text{for } n\ge 2 \end{split} \] So \(\psi_0 = 1\), \(\psi_1 = \frac{1}{6} \psi_0 = \frac{1}{6}\), \(\psi_2 = \frac{1}{6} \psi_1 + \frac{1}{6} \psi_0 = \frac{1}{36} + \frac{1}{6} = \frac{7}{36}\). These results are same as the earlier convolution method.

4 Yule-Walker method for AR(\(p\))

The general idea is similar to the AR(\(2\)) example from the last lecture. For YW method to work, we still need to require the process \((Y_t)\) is mean zero, stationary, and causal.

The AR(\(p\)) equation is \[ Y_t - \phi_1 Y_{t-1} - \cdots - \phi_p Y_{t-p} = e_t . \] Step 1: multiply by \(Y_{t-k}\) (for some \(k\ge 0\)) \[ Y_t Y_{t-k} - \phi_1 Y_{t-1} Y_{t-k} - \cdots - \phi_p Y_{t-p} Y_{t-k} = e_t Y_{t-k}. \] Step 2: take expectation, and we call the following ``the \(k\)-th YW equation” \[ \mathbb{E}[Y_t Y_{t-k}] - \phi_1 \mathbb{E}[Y_{t-1} Y_{t-k}] - \cdots - \phi_p \mathbb{E}[Y_{t-p} Y_{t-k}] = \mathbb{E}[e_t Y_{t-k}] . \] Since \((Y_t)\) is mean zero and stationary, the terms \(\mathbb{E}[Y_{t-i} Y_{t-k}] = \gamma_{k-i}\). By causality, \(\mathbb{E}[e_t Y_{t-k}] = \sigma_e^2\) if \(k=0\), and \(\mathbb{E}[e_t Y_{t-k}] = 0\) if \(k>0\). So the \(k\)-th YW equation is \[ \gamma_k - \phi_1 \gamma_{k-1} - \cdots - \phi_p \gamma_{k-p} = \begin{cases} \sigma_e^2, & k = 0 \\ 0, & k \geq 1 \end{cases} \] Exercise: verify the statement above about \(\mathbb{E}[e_t Y_{t-k}]\).

Note that \(\gamma_{k-p} = \gamma_{p-k}\), so the first (\(p\)+1) YW equations can be written together as a system for \(\gamma_0,\gamma_1, \dots, \gamma_p\): \[ \begin{cases} \gamma_0 - \phi_1 \gamma_1 - \phi_2 \gamma_2 - \cdots - \phi_p \gamma_p = \sigma_e^2 ,\quad &\text{0th YW eq}\\ \gamma_1 - \phi_1 \gamma_0 - \phi_2 \gamma_1 - \cdots - \phi_p \gamma_{p-1} = 0 ,\quad &\text{1st YW eq}\\ \cdots \\ \gamma_p - \phi_1 \gamma_{p-1} - \phi_2 \gamma_{p-2} - \cdots - \phi_p \gamma_0 = 0 ,\quad & p\text{-th YW eq} \end{cases} \] So we can solve \((\gamma_0,\gamma_1, \dots, \gamma_p)\) from this system (\(p+1\) equations and \(p+1\) unknown variables). This can be seen as the initial conditions for the following recursion.

For \(k\ge p\), the \(k\)-th YW equation is simply \[ \gamma_k = \phi_1 \gamma_{k-1} + \phi_2 \gamma_{k-2} + \cdots + \phi_p \gamma_{k-p} . \] Then we can use this to calculate \(\gamma_{p+1}, \gamma_{p+2}, \dots\) recursively.

4.1 A property of the recursion part

If \(k\) is large, solving \(\gamma_k\) recursively from \((\gamma_0,\gamma_1, \dots, \gamma_p)\) may be hard. We have the following useful property.

Claim: If the roots of the AR polynomial \(\Phi(x)\) are distinct, then there exist (complex) numbers \(A_1, \dots, A_p\), such that the solution to \[ \begin{cases} \text{initial conditions for } (\gamma_0,\gamma_1, \dots, \gamma_p)\\ \text{recursion equation: } \gamma_k = \phi_1 \gamma_{k-1} + \phi_2 \gamma_{k-2} + \cdots + \phi_p \gamma_{k-p}, \quad \forall k\ge p \end{cases} \] is given by \[ \gamma_k = A_1 z_1^{-k} + A_2 z_2^{-k} + \cdots + A_p z_p^{-k}, \quad \forall k \ge 0 . \]

This gives another idea to calculate \(\gamma_k\) in the final step of the YW method (useful when we are targeting for a large \(k\)): after getting the values of \((\gamma_0,\gamma_1, \dots, \gamma_p)\), we can solve \((A_1,...,A_p)\) such that \(A_1 z_1^{-k} + A_2 z_2^{-k} + \cdots + A_p z_p^{-k} = \gamma_k\) hold for all \(0\le k\le p-1\). Then for this solution \((A_1,...,A_p)\), the formula \(\gamma_k = A_1 z_1^{-k} + A_2 z_2^{-k} + \cdots + A_p z_p^{-k}\) (for \(k\ge 0\)) gives the ACVF we wanted.