0

Let’s say I want to explain one variable with six regressors. I don’t know if the relationship between my dependent variable and each regressor is linear, if any. I have six regressors, so I can’t look graphically at these relationships, as the best I can do is a 3D plot. But, based on scientific literature, it’s likely that these relationships are all linear, so, I choose to fit a linear regression (let’s admit that my data comes from a Gaussian distribution). All my variables are kept by my method of selection. I check normality of my residuals, homoscedasticity, independence, all is okay. Now, I plot my fitted values against my observed data.

Possibility one: my model fits my data well, in the sense that there’s a good linear relationship between my fitted values and my observed data. Can I thus validate my hypothesis of linearity between my dependent variable and my regressors? Does such a good fit always necessarily confirm this hypothesis?

Possibility two: my model fits my data poorly; my fitted values aren’t aligned with my observed data, but they show a non-linear trend. So I reject my hypothesis of linearity and try to fit a non-linear model (GAM,…) Am I right?

Is this method ok or are there some parameters I’m not taking into account? If this method is okay, are there some extra precautions to take for mixed linear models, for generalized linear models or for GLMM?

Methods that I saw to check linearity often imply smoothing splines, loess etc., but I’m wondering if it is possible to use such methods if you have more than two predictors? Or is there a way than I can plot a variable against another while "correcting" the effects of the others without creating a new model?

Nick Cox
  • 48,377
  • 8
  • 110
  • 156
Kjian
  • 253
  • 1
  • 9
  • 3
    Added variable plots are a direct answer to your question. See for example http://stats.stackexchange.com/questions/125561/what-does-an-added-variable-plot-partial-regression-plot-explain-in-a-multiple It doesn't follow that nonlinearity as visible graphically calls for nonlinear models. Powers, logarithms and other transformations may be enough to keep a model in the linear model framework. – Nick Cox Apr 11 '16 at 18:34
  • Thank you for your answer, that was exactly what I was looking for. If there is a relationship between my fitted values and my observed values, can I directly conclude that all the relationships are linear or can it hide a non-linear relationship in a way ? – Kjian Apr 11 '16 at 18:49
  • 2
    There are no guarantees here, but more positively, if you see linearity everywhere you look, then that's reassuring. Note that each projection to two dimensions necessarily ignores or distorts some of the detail. – Nick Cox Apr 11 '16 at 18:52

0 Answers0