0

Could anyone provide intuition on why for y ~ β1x1 + β2x2 + β3x3, β1 β2 and β3 can be significant in a multiple variable model (p range 7x10-3 to 8x10-4), but the βs are not significant in separate univariate regressions (p range 0.02 to 0.3)?

My intuition is that it has something to do with correlations among the βs, but not quite clear how. Other info in my case:

  • variance inflation factors are <1.5 in combined model
  • cor(β1, β2) = -0.23, cor(β1, β3) = 0.02, cor(β2, β3) = 0.53
  • n=171, so should be enough for 3 coefficients
  • The change in estimates from single to multiple variable regression is: β1=-0.03→-0.04, β2=-0.02→-0.05, β3=0.05→0.18 (i.e., directionality does not change but magnitude does)

Thanks!

1 Answers1

2

This looks like the phenomenon of the suppressor variable. Such a variable has low correlation with the outcome (which seems to be the case here for some of them since you quote p = 0.02 to 0.3 for the individual $\beta$) but does correlate with one or more of the other predictors (which also seems to be true from the correlations you quote). Including the suppressor variable in the model then accounts for some of the variance in the other predictor(s) which was uncorrelated with the outcome hence making them appear at a higher level of statistical significance. It has suppressed that variance, hence the name.

mdewey
  • 16,541
  • 22
  • 30
  • 57
  • Thank you @mdewey! I am now reading up on [suppressor variables](https://stats.stackexchange.com/q/73869/227447). I'll mark your answer as accepted – Felix Richter Nov 19 '18 at 18:25