auto.arima() is essentially a wrapper around arima(), so it behaves like arima() in several respects. When covariates are entered through xreg, the arima() help page states: "...If an xreg term is included, a linear regression (with a constant term if include.mean is true and there is no differencing) is fitted with an ARMA model for the error term...". That is, it fits a linear model on the x's while imposing an ARMA structure on the errors. This differs from the ARMAX framework, so auto.arima() will not fit an ARMAX model.
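To make the distinction explicit (a sketch, using the notation of the equation below): the regression-with-AR(1)-errors model that arima() fits with xreg is

$$Y_t = \beta X_t + \eta_t, \qquad \eta_t = \theta\,\eta_{t-1} + \varepsilon_t,$$

which, after substituting out $\eta_t$, becomes $Y_t = \theta Y_{t-1} + \beta X_t - \theta\beta X_{t-1} + \varepsilon_t$. An ARMAX-type specification instead places the lagged response directly in the mean, $Y_t = \theta Y_{t-1} + \beta X_t + \varepsilon_t$, with no $-\theta\beta X_{t-1}$ term, so the two models are genuinely different.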
To fit an ARMAX(p, q) model, or any sub-class of it, you may want to try vector generalized linear models (VGLMs) applied to time series in R, part of my PhD work. In particular, my family function ARXff() estimates ARX models of the form
$Y_t - \theta Y_{t - 1} = \beta X_t + \varepsilon_t$ ...[1], by MLE using Fisher scoring. The following example assumes normal errors:
library(VGAM)       # provides vglm()
library(VGAMextra)  # provides ARXff()

set.seed(201802)
nn <- 140
x2 <- rnorm(nn)
y <- numeric(nn); y[1] <- 0
theta <- 0.25
beta <- 1.5
for (ii in 2:nn)
  y[ii] <- theta * y[ii - 1] + beta * x2[ii] + rnorm(1)

# Remove warm-up values.
ts.data <- data.frame(y = y[-c(1:100)], x2 = x2[-c(1:100)])

# Modelling function: vglm(); family function: ARXff()
fit1 <- vglm(y ~ x2, ARXff(order = 1, zero = c("coeff", "Var"),
                           type.EIM = "exact"),
             data = ts.data, trace = TRUE)
VGLM linear loop 1 : loglikelihood = -67.168821
VGLM linear loop 2 : loglikelihood = -61.822737
VGLM linear loop 3 : loglikelihood = -60.963229
VGLM linear loop 4 : loglikelihood = -60.943982
VGLM linear loop 5 : loglikelihood = -60.943972
VGLM linear loop 6 : loglikelihood = -60.943972
Checks on stationarity / invertibility successfully performed.
No roots lying inside the unit circle.
Further details within the 'summary' output.
> coef(fit1, matrix = TRUE)
ARdrift1 loge(noiseVar1) ARcoeff11
(Intercept) -0.086297 0.20932 0.26196
x2 1.339758 0.00000 0.00000
For comparison, this is what arima() returns:

with(ts.data, arima(y, order = c(1, 0, 0), xreg = x2))
Call:
arima(x = y, order = c(1, 0, 0), xreg = x2)
Coefficients:
ar1 intercept x2
0.386 -0.174 1.292
s.e. 0.170 0.290 0.187
sigma^2 estimated as 1.3: log likelihood = -62.02, aic = 132.05
Given the normality assumption, fit1 is equivalent to fitting a normal distribution whose mean is conditional on $x_2$ and $Y_{t - 1}$, as in [1] above. To see this, use the family function uninormal() from the VGAM package, as follows:
# WN.lags(), from VGAMextra, appends the lagged response as a covariate.
ts.data <- transform(ts.data, ARcoeff = WN.lags(cbind(y), lags = 1))
> fit2 <- vglm(y ~ x2 + ARcoeff, uninormal(var.arg = TRUE),
               data = ts.data, trace = TRUE)
VGLM linear loop 1 : loglikelihood = -69.732118
VGLM linear loop 2 : loglikelihood = -62.620483
VGLM linear loop 3 : loglikelihood = -61.012317
VGLM linear loop 4 : loglikelihood = -60.944089
VGLM linear loop 5 : loglikelihood = -60.943972
VGLM linear loop 6 : loglikelihood = -60.943972
> coef(fit2, matrix = TRUE)
mean loge(sd)
(Intercept) -0.086297 0.10466
x2 1.339758 0.00000
ARcoeff 0.261963 0.00000
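As a sanity check that needs only base R (the simulation below re-creates the same data as above; lm() gives the conditional least-squares fit, which under normal errors coincides with the conditional MLE for the mean coefficients):

```r
# Re-create the simulated data from the example above.
set.seed(201802)
nn <- 140
x2 <- rnorm(nn)
y <- numeric(nn); y[1] <- 0
theta <- 0.25; beta <- 1.5
for (ii in 2:nn)
  y[ii] <- theta * y[ii - 1] + beta * x2[ii] + rnorm(1)

# Conditional least-squares fit of [1]: regress y on x2 and its own lag,
# keeping only the post-warm-up observations.
keep <- 101:nn
d <- data.frame(y = y[keep], x2 = x2[keep], ylag = y[keep - 1])
fit.ls <- lm(y ~ x2 + ylag, data = d)
coef(fit.ls)
```

The coefficients on x2 and on the lagged response should land close to those reported by fit1 and fit2 above.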
But ARXff() is even broader: for example, you can model $\theta$ through VGLM link functions, such as logit(), if needed. I have also implemented ARXff() to work with the exact expected information matrices for the ARX model (see type.EIM = "exact" above).
However, the covariate effects in ARX models as above may not be as simple to interpret as in ordinary LMs; this is the downside of ARX models. For ease of interpretation, the regression model with ARMA errors is probably the most convenient choice, and this is how arima() seems to work in the presence of covariates. By the way, I have also implemented this approach through my family function ARIMAX.errors.ff().
ARMAXff() and ARIMAXff() have also been implemented accordingly, so you can fit ARMAX models similar to fit1. For regression models with ARMAX errors, however, ARIMAX.errors.ff() is the choice. All of these are incorporated in my package VGAMextra, an extension of VGAM in a few directions, including time series analysis. At the moment, VGAMextra concentrates on modelling and estimation; I will later incorporate features such as automatic forecasting.