3

I have been obsessed with trying to conduct an analysis in the “correct” way. I read that no time series analysis should be conducted on non stationary series. I found both my series have a unit root and therefore are non stationary, so I’ve been running analyses on the first differenced data.

My final question here is the following: if I am purely interested in describing the past relationship, with no intent of predicting the future, is is “okay” if I run correlation analysis on two non stationary series to identify time lags? Or should I cross correlate the first differenced data and stop asking so many questions?

I have zero interest in inferring a causal “relationship.” I only want to explain the co-movement. I’m interested to know the lag which maximizes the correlation between two interest rates. I.e. interest rate A changes now, that is correlated most with changes 2 months from now for interest rate B.

Richard Hardy
  • 54,375
  • 10
  • 95
  • 219
user10136297
  • 359
  • 6
  • As far I understand, there is no statistical estimation in plotting $y_t$ against $x_t$. It might give you a richer description than a correlation coefficient. While it doesn't answer your question, it may be a suitable alternative. For a line plot I would recommend putting a decay function on the drawing transparency if it ends up looking like a messy ball. – DifferentialPleiometry Dec 13 '21 at 07:18

2 Answers2

5

Imagine that you have 2 independent series $y_t$ and $x_t$, both random walks (non-stationary series with a unit root). You run the following regression: $y_t=\beta_1+\beta_2x_t+\epsilon_t$. Davidson & MacKinnon (see the red line in snapshot below from their textbook "Econometric Theory and Methods") did this simulation 1 million times for increasingly large sample sizes. With a sample size of 20, almost 50% of the time the t-statistic for $\beta_2$ rejected the null hypothesis $\beta_2=0$. We found a relationship that doesn't really exist because we know that $y_t$ and $x_t$ are independent by design; therefore we found a spurious relationship. What's more alarming, the larger the sample size, the higher the proportion of time that the regression result will be spurious, and this proportion converges to 1.

The phenomenon of spurious regressions is not something that afflicts non-stationary series only. Imagine repeating the above simulation, this time taking $y_t$ and $x_t$ to be independent stationary AR(1) processes with the autoregressive parameter $\rho_1=0.8$. Davidson & MacKinnon did just that; see the results depicted by the blue line. For most sample sizes, some 35% of the time we will find a spurious relationship. Fortunately, the problem does not get worse the larger the sample sizes, as it did when the 2 series were random walks. If we repeated the above experiment but this time using a lower value for the autoregressive parameter $\rho_1$ we can expect to find a lower proportion of spurious relationships. For low enough a value of the autoregressive parameter the proportion of spurious relationships should be around 0.05 (the significance level).

Whether you are looking to explain the co-movement of the 2 series, as you put it, or to predict one using the other, you will want to estimate $\beta_2$ consistently. If $y_t$ and $x_t$ are both random walks but instead of being independent (as in the first experiment above), they happen to be cointegrated then the OLS estimator of $\beta_2$ will be super-consistent (consistent and converges to the true value even faster).

So, in summary, if the 2 series are both Integrated of Order 1 and they are cointegrated, then yes you can regress one on the other to estimate their co-movement. If they are not cointegrated, you shouldn't (think of the first of the above experiments). In this case, you should difference them and regress the two in first-differences. Even then, keep in mind that if the true process (of the first-differenced series) was an AR(1) process with a root close to 1, spuriousness will still show up. Time series analysis is a tricky business.

Keep in mind that if $y_t$ is cointegrated with $x_t$ then $y_t$ will also be cointegrated with any lag of $x_t$. So, if you conclude that $x_t$ and $y_t$ are cointegrated, you are in business - and can run binary OLS regressions of $y_t$ on lagged $x_t$, and $\beta_2$ will be consistent. You can then use a fit statistic (such as $R^2$ or $AIC$) to compare the results of these binary regressions for different lags of $x_t$.

enter image description here

ColorStatistics
  • 2,699
  • 1
  • 10
  • 26
  • My series are non stationary, and not cointegrated. I suppose the best way forward is to run my correlations on first differenced data to be safe about not reporting wrong information. – user10136297 Dec 12 '21 at 22:44
  • 3
    Sounds like a good plan. Make sure the first-differenced series themselves are stationary. – ColorStatistics Dec 12 '21 at 22:49
  • I tested my first differenced data using ADF and they are stationary at the sub 1% level. – user10136297 Dec 13 '21 at 00:45
  • 4
    Interesting answer. While time series analysis is clearly interested in 'time', we can have sequences indexed by other parameters. Is my speculation correct that cointegration is important for OLS regardless of whether the data is indexed by time or some other parameter? – DifferentialPleiometry Dec 13 '21 at 03:13
  • Reading this answer again. I think you’re under the impression still that I want to forecast but I don’t. I only want to understand co-movement. The two series I’m looking at are widely understood to have a clear relationship. Is first differencing still the way to go??? – user10136297 Dec 13 '21 at 05:39
  • 2
    @Galen, yes, that must be the case. Changing the title of the index should not change the statistical conclusions. – Richard Hardy Dec 13 '21 at 07:10
  • 1
    @user10136297: Please see the answer again. I stated "Whether you are looking to explain the co-movement of the 2 series, as you put it, or to predict one using the other, you will want to estimate $\beta_2$ consistently". – ColorStatistics Dec 13 '21 at 10:31
  • 2
    @Galen: That is a great question. I think your speculation is correct. – ColorStatistics Dec 13 '21 at 10:33
  • Oh I understand. Using non stationary data leads to a misestimation of beta. I think I keep forgetting Thais is inferential… we are estimating the regression coefficient for the entire dataset. But, that is harder to think about when dealing with time series because I keep thinking “well here’s all the data… just run the correlations and slopes and that’s the value.” – user10136297 Dec 13 '21 at 16:05
  • 1
    @user10136297: a time series is also just a sample (a subset of a particular realization of a process); it is no more of a "here's all the data" than is a cross-sectional sample; in fact with time series, unless you assume stationarity, you do not have a sample with multiple observations but a sequence of samples of size 1 – ColorStatistics Dec 13 '21 at 16:14
  • Can you expand briefly on the last part? – user10136297 Dec 13 '21 at 16:16
  • @user10136297, it could be helpful to extend the fomulation from ColorStatistics' last comment to *unless you assume stationarity **for the time series itself or a transformation thereof***. We usually do that (i.e. assume stationarity), as otherwise we cannot make any useful inferences. – Richard Hardy Dec 13 '21 at 16:19
  • And would you both agree we can use the estimates from a first differenced regression as estimates of the DGP for the level relationship? In other words, if my FD coefficient is 2, I don’t need to interpret in the context of changes of changes, but rather as changes in levels for the DGP. – user10136297 Dec 13 '21 at 16:24
  • 1
    @ColorStatistics, your example is for an. i.i.d. time series, not a stationary time series in general (though it conveys the intuition well!). – Richard Hardy Dec 13 '21 at 16:28
  • I do not think so. i.i.d. implies (strict and weak) stationarity but not the other way around. An MA(1) process is stationary but not i.i.d. Given an MA(1) process, we cannot say we have a single random variable observed many times, while given an i.i.d. process we can. – Richard Hardy Dec 13 '21 at 16:48
1

That is an interesting and nontrivial question. Based on some recent related threads by the OP, let me focus on the case where the correlation is spurious and its population counterpart is ill defined. This would be the case with a pair of unit-root, noncointegrated time series.

Whether dealing with description, prediction or something else, we routinely use statistical estimates to infer something about the corresponding estimands / population counterparts, thus to generalize.$^{*}$,$^{**}$ That shapes how we react to statistical estimates when presented with them. Given a sample correlation, people would often implicitly think about generalization and picture themselves a well-defined population counterpart.$^{***}$

Therefore, I would avoid reporting sample correlation when its population counterpart is ill defined – unless I knew very well what I am doing and had ample opportunity to explain myself in detail to the audience. Otherwise, the message that gets through might be quite different from the one you may be trying to send.

$^{*}$An exception could be information compression where we only care about the sample, trying to compress it without much loss in fidelity.
$^{**}$In the case of prediction, we do not focus on the estimands directly but employ knowledge about them to obtain predictions.
$^{***}$Most people would probably instinctively consider causation, too, but that is another topic.

Richard Hardy
  • 54,375
  • 10
  • 95
  • 219
  • I think you highlight my misunderstanding here… I’m struggling with the idea of the “population” of data of a time series. I’ll go read up on that. If I understand that we are estimating a population slope, then it makes sense why we wouldn’t use non stationary data… because that may lead to a bad estimate of the population beta. – user10136297 Dec 13 '21 at 16:08
  • 1
    @user10136297, that is what I though. I am glad I got the gist of your question. In time series, it is often more convenient to think about data generating processes (DGPs) than populations. I used the word *population* for simplicity (so that also people who are not used to time series can get the intuition), but it would be more appropriate to talk about data generating processes here. – Richard Hardy Dec 13 '21 at 16:10
  • And there it is… that answers a lot of my questions. We are trying to estimate the parameters for the data generating process… not the particular sample we get. Thanks a ton. – user10136297 Dec 13 '21 at 16:15
  • 1
    @user10136297, exactly. – Richard Hardy Dec 13 '21 at 16:17
  • And if I use non stationary data, my parameter estimates for the DGP are not reliable for a multitude of reasons. That’s why we transform to make stationary and then estimate the parameter. What’s weird, though, is I’ve gotten mixed answers on whether or not I can use the parameters from the first differenced model to describe the DGP. I can right? I don’t need to stick to interpreting “a change in the change of x of 1 means a change in the change of y by 2” for a coefficient of 2, for example. – user10136297 Dec 13 '21 at 16:21
  • 1
    @user10136297, I think you have a separate thread for your last question from before, so let us keep that discussion there. Briefly, your model for a transformed series yields direct conclusions for that transformation, but since you know the transformation, you can back the conclusions up to the original DGP. Sometimes this is straightforward, sometimes not. When the DGP contains a unit root and the transformation is first differencing, interpretation of coefficients in levels vs. first differences is not entirely straightforward. – Richard Hardy Dec 13 '21 at 16:23
  • Thanks Richard!! – user10136297 Dec 13 '21 at 16:26
  • Let us [continue this discussion in chat](https://chat.stackexchange.com/rooms/132294/discussion-between-user10136297-and-richard-hardy). – user10136297 Dec 13 '21 at 16:29
  • 1
    Regarding the downvote, consider following it up with some constructive criticism. Thank you. – Richard Hardy Dec 13 '21 at 17:30
  • I upvoted!!!!!! – user10136297 Dec 13 '21 at 17:54
  • @user10136297, no worries! The comment was not for you but for the one who downvoted. – Richard Hardy Dec 13 '21 at 17:56