2

I am very new to time-series analysis and have got some time-series data regarding product prices. The data set is monthly data collect since 1993 to 2014.

I have tried plotting the ACF and PACF but I do not really understand the meaning behind these plots. Furthermore, I am not sure if I need to convert the data series by differencing of order 1, then proceed to plot ACF and PACF.

My ACF and PACF plot

I plot a lag.max of 250, since there are alot of data point, and from the ACF, there appears too many lag that are above the confidence interval. However, for PACF, there are only 2 lag that are above the condidence interval.

What is the meaning behind this? Or do I need to do differencing before the acf plot? In addition, how do I further evaluate my data in time-series plot?

My Data Set plot

Thanks

Ted
  • 303
  • 2
  • 3
  • 11

1 Answers1

2

The ACF and PACF are descriptive statistics showing simple correlation and conditional correlation. They are sometimes useful in identifying ARIMA models that 1) have no Pulses (your series does) and 2) have no deterministic time trends or level shifts ( your series seems to have no trend followed by a period that has trend) and whose parameters and model error variances are constant over time. If you actually post your data in an excel format I will try and help you understand more BUT my initial visual assessment is that this is a time series that may have complications/opportunities that will challenge simple (1960 type !) ARIMA analysis or simple model selection (list-based) schemes that use an AIC/BIC criteria to select/identify a model. We should aspire to keeping things/models simple but not too simple so much so that they are of little value !

EDIT AFTER RECEIPT OF DATA:

I took the 269 monthly values and analyzed it with the automatic option in AUTOBOX. After identifying a global model taking into account various pulse effects a significant difference was found between the first 169 values and the most recent 99 values enter image description here . This is visually obvious.The best model for the most recent 99 values was enter image description here and here enter image description here . The residuals from the model appear well behaved [enter image description here] .  [4] The ACF of the residuals suggest a slight possibility of a minor seasonal effect but most likely not important .enter image description here . The plot of the actual , fit and forecast using standard ARIMA procedures tp compute uncertainty limits is here enter image description here . Note however from http://www.autobox.com/cms/index.php/blog/entry/you-should-have-50-confidence-in-your-confidence-limits that the forecast limits are flawed on two accounts. Taking into account the possibility of future values being effected by pulses i.e.unusual values and that the estimated parameters are not necessarily the poulation parameters we get a more realistic picture enter image description here

IrishStat
  • 27,906
  • 5
  • 29
  • 55
  • http://www.filedropper.com/product_1 – Ted Aug 13 '15 at 10:17
  • The above is the file link as I do not have enough rep points to post it on my question. I have a few questions to ask as well. So to see the pulses, we just look at the data set to see if it fluctuates across the yearly data series? And to see the model error variance, you check whether the differences/residual are "slightly constant/fluctuates" along 0? – Ted Aug 13 '15 at 10:30
  • To detect pulses you build an ARIMA model and then follow TSAY http://www.unc.edu/~jbhill/tsay.pdf and the flow diagram in http://www.autobox.com/cms/index.php/blog/entry/build-or-make-your-own-arima-forecasting-model. To tests for constant error variance you evaluate alternative break-points as detailed by TSAY – IrishStat Aug 13 '15 at 11:34
  • As I do not have Autobox, and the version that can be downloaded is a demo version, so through using R, how do I identify structural break in the series? By structural break, does it means that for the first 170 observations there are no trend, thus it is side aside? Through the use of ACF and PACF, can I detect pulses as well? Do I need to do differencing in my data, as it seems that there are alot of points that falls outside the Confidence interval in the ACF? or Since there are only 2 points that falls outside the Confidence interval in PACF, It means there is no need for differencing. – Ted Aug 14 '15 at 01:06
  • Structural breaks ( detecting a change in parameters over time)requires identify an appropriate ARIMA model and Intervention series.Use the intervention series to cleanse the data of pulses,level shifts,time trends.Take the adjusted series and estimate parameters for different time sections. Find the greatest difference in parameters in terms of two distinct sections and conclude that the parameters have changed over time. That would be 1-169 and 170-269 . Now take the observations from 170-269 and model them in terms of an ARIMA model and Intervention series. – IrishStat Aug 14 '15 at 02:13
  • So in time series analysis, the approach to it would be 1. Determine the structural break through intervention series 2. Then use the recent observation and plot it to ARIMA model to do the forecasting? Do you know of any R packages regarding intervention analysis? and how do you check/detect for seasonality in your data? Can intervention analysis or ARIMA allow us to "see" it, for example, from certain spike in ARIMA, this implies there is seasonality in our data set. Sorry for so many doubts posted. – Ted Aug 14 '15 at 08:52
  • ARIMA identification includes looking for spikes and decay patterns in both the acf and the pacf as suggestions for the type of ARIMA model that might be adequate. – IrishStat Aug 14 '15 at 11:07
  • In terms of free software available in R you might look at "tsoutliers" . It assumes that the ARIMA portion is identified first and the the identification of the interventions proceed. There are many cases where this assumption is flat out wrong and that one needs to identify the interventions first and then the ARIMA structure secondarily. – IrishStat Aug 14 '15 at 12:09
  • What do you mean by "tsoutliers" assume ARIMA portion is identified first, do you mean that we should already determine the ARIMA model, then proceed with using tsoutliers, which will then identify the intervention? And so how do we see seasonality in the dataset? Is it just through visualisation through eyes? Thanks for your guidance – Ted Aug 18 '15 at 08:58
  • What I was doing was suggesting that the current practice of identifying an ARIMA model first (assuming no outliers/level shifts/seasonal pulses/local time trends) just doesn't work all the time as it is a non sequitir http://www.merriam-webster.com/dictionary/non%20sequitur. Alternatively identifying the outliers/level shifts/seasonal pulses.local time trends first doesn't work all the time either.Both procedures need to be considered in the analysis.Furthermore time varying parameters need to be considered which leads to data segmentation as in your case.Seasonlityis part of the final model. – IrishStat Aug 18 '15 at 10:02
  • I am trying out tsoutliers from R and SPSS statistics, however both output of the arima model is different. In addition, there is slight difference in detection of outliers. Is there any problem with it? In addition, how do I determine the point of structural break where I only take the recent data, as SPSS statistics and R doesn't seem to do that. – Ted Aug 20 '15 at 08:25
  • Neither of those programs can detect structural breaks (as you need ), I can only suggest that you investigate http://www.autobox.com/cms/ (which I have helped develop). Also I don't believe SAS has this handy feature . – IrishStat Aug 20 '15 at 11:07
  • I have tried out tsoutliers and have some doubts with it, I would appreciate your help to look into this link [here](http://stats.stackexchange.com/questions/168655/how-to-interpret-and-do-forecasting-using-tsoutliers-package-and-auto-arima) – Ted Aug 26 '15 at 02:43