How do the error terms in ARMA or MA models actually work and How do we estimate them?

Question

I asked a similar question before but i did not get any answers. Should i delete the old question?

$x_t = c + \theta_{1}x_{t-1} + \phi_1\varepsilon_{t-1}+ \phi_2\varepsilon_{t-2} + \varepsilon_t$
$\hat{x_t} = c + \theta_{1}x_{t-1} + \phi_1\varepsilon_{t-1}+ \phi_2\varepsilon_{t-2}$

     t X
[1,] 1 44
[2,] 2 55
[3,] 3 66
[4,] 4 77

I want to predict x5. How should the equation be like? lets say c=5 and $\theta_{1} =2$ and $\phi_1=3$ and $\phi_2=4$
I am sure that $x_{t-1}$ is 77 but the part that confuses me is $\varepsilon_{t-1}$ and $\varepsilon_{t-2}$! Should we predict the $x_{t-1}$ to calculated its error/residual first then put the residuals values in the equation to forecast? If that's the case then How we can calculate x1 (the first data point at time 1)?

I am talking about MA and ARMA. Even thinking about the estimation of $\phi$ is weird to me. in AR process is just like linear regression where we make another $x_{t-1}$ variable and regress it on $x_t$, but in the case of MA or ARMA models i can't get it.
PS. i know the $x_t$ is not stationary. It's just an example.

I've asked the same question myself https://stats.stackexchange.com/questions/324877/why-are-the-error-terms-in-an-maq-model-considered-unobserved — Skander H., Jun 25 '18 at 19:07

How do the error terms in ARMA or MA models actually work and How do we estimate them?

0 Answers0