Diagnostics for General Linear Models

Question

Pearson residuals follow a normal distribution. We plot them against the predicted values to check whether the model fits well.

Why would we plot deviance residuals against predicted values? Deviance residuals don't follow normal distribution, right? What's the point of plotting them?

What leads you to claim Pearson residuals *follow a normal distribution*? — Glen_b, Sep 09 '17 at 08:48
I am wrong. It does not follow a normal distribution unless the response variable is normally distributed. What I meant is homoskedastic. — Eli.Z, Sep 09 '17 at 08:52
I have rolled back your question because both answers responded to your original question (about normality) not your updated Q about homoskedasticity. The lengthy answers were both in progress when you updated and were each posted before they saw your change (indeed I didn't see it show up until just now). It would be a pity to invalidate those answers. Could you post a new question that relates to the issue about homoskedasticity? — Glen_b, Sep 09 '17 at 09:50
Incidentally, if you click on the "edited ... ago" link under your tags you can scroll to the edit where you made the question about homoskedasticity, select the text of that version, copy it to your clipboard and then paste it into the edit-window for a new question — Glen_b, Sep 09 '17 at 10:32
@Eli.Z , maybe this is helpful https://cran.r-project.org/web/packages/DHARMa/vignettes/DHARMa.html — Florian Hartig, Oct 03 '17 at 13:45

score 2 · Answer 1 · answered Sep 09 '17 at 09:11

A number of texts plot both Pearson and deviance residuals (or their standardized versions; see, for example, Sheather (2009)) against predicted values. When the two types of residuals are compared, deviance residuals are generally preferred over Pearson residuals. As an explanation of why this is the case, Simonoff argues that

The Pearson residuals are probably the most commonly used residuals, but the deviance residuals (or standardized deviance residuals) are actually preferred, since their distribution is closer to that of least squares residuals (Simonoff, 2003: 133).

In an earlier and widely cited article, Pierce & Schafer (1986) argued for the superior performance of deviance residuals, concluding that:

The deviance-based residuals are remarkably appropriate, and if continuity corrections are made the discreteness presents no real problem for their use in individual data point diagnostics (Pierce & Schafer, 1986: 985).

They refer to McCullagh & Nelder's classic text on GLMs (1983), which compares adjusted deviance residuals with Pearson residuals in the case of binary data and reaches a similar conclusion (p. 87).

Both of these texts emphasize how similar Anscombe and (adjusted) deviance residuals are, and how well they approximate normality. They are considered successful in "producing residual structures that are: centered at zero, have standard error of one, and are approximately normal" (Gill, 2001: 59). As far as I understand, this is not the case for Pearson residuals, which are more skewed than deviance residuals. However, @Scortchi's answer to this question highlights an important point: "the deviance (or Pearson) residuals are not expected to have a normal distribution except for a Gaussian model."
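
For reference, the residuals compared above have the following standard definitions for a GLM with fitted mean \hat\mu_i and variance function V. The notation is an assumption chosen for this sketch rather than taken from the cited texts, and the dispersion parameter is taken to be 1:

```latex
r^{P}_i = \frac{y_i - \hat\mu_i}{\sqrt{V(\hat\mu_i)}}, \qquad
r^{D}_i = \operatorname{sign}(y_i - \hat\mu_i)\,\sqrt{d_i}, \qquad
r^{A}_i = \frac{A(y_i) - A(\hat\mu_i)}{A'(\hat\mu_i)\,\sqrt{V(\hat\mu_i)}},
\quad\text{where } A(\mu) = \int^{\mu} \frac{dt}{V(t)^{1/3}}
\text{ and } \sum_i d_i = D \text{ (the deviance).}
```

In the Poisson case, V(\mu) = \mu, d_i = 2[y_i \log(y_i/\hat\mu_i) - (y_i - \hat\mu_i)] and A(\mu) = (3/2)\mu^{2/3}, so the Anscombe residual reduces to (3/2)(y_i^{2/3} - \hat\mu_i^{2/3}) / \hat\mu_i^{1/6}.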

Gill, J., 2001. Generalized Linear Models: A Unified Approach. Sage Publications.

McCullagh, P., Nelder, J.A., 1983. Generalized Linear Models. Chapman and Hall.

Pierce, D.A., Schafer, D.W., 1986. Residuals in Generalized Linear Models. J. Am. Stat. Assoc. 81, 977–986.

Sheather, S.J., 2009. A Modern Approach to Regression with R. Springer.

Simonoff, J.S., 2003. Analyzing Categorical Data. Springer.

score 2 · Accepted Answer · answered Sep 09 '17 at 09:23

Pearson residuals in general do not follow a normal distribution.

Deviance residuals don't follow normal distribution, right?

They don't, but they will typically be much closer to being normally distributed than Pearson residuals.

Here's an example with a Poisson model fitted to data that are actually Poisson (normal Q-Q plots of the working, Pearson, deviance and Anscombe residuals):

Clearly the working residuals are the least normal-looking. The Pearson residuals are better but still show clear curvature. The straightest Q-Q plots are those for the deviance and Anscombe residuals. This is fairly typical across a range of GLMs.

None of them are actually normal, but the Pearson residuals are clearly skewed, while the deviance residuals are much more nearly symmetric.
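
The skewness comparison can be checked with a rough simulation. The sketch below is not the code behind the example above; it assumes an intercept-only Poisson model with true mean 2 (so the fitted mean is just the sample mean), uses only the Python standard library, and all function names are illustrative. It compares sample skewness rather than drawing Q-Q plots.

```python
import math
import random


def rpois(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def pearson(y, mu):
    # (y - mu) / sqrt(V(mu)), with V(mu) = mu for the Poisson family
    return (y - mu) / math.sqrt(mu)


def deviance(y, mu):
    # signed square root of the unit deviance; y*log(y/mu) is taken as 0 at y = 0
    ylogy = y * math.log(y / mu) if y > 0 else 0.0
    d = 2.0 * (ylogy - (y - mu))
    return math.copysign(math.sqrt(max(d, 0.0)), y - mu)


def anscombe(y, mu):
    # Poisson form of the Anscombe residual
    return 1.5 * (y ** (2 / 3) - mu ** (2 / 3)) / mu ** (1 / 6)


def skewness(x):
    """Sample skewness: third central moment over variance to the power 3/2."""
    n = len(x)
    m = sum(x) / n
    m2 = sum((v - m) ** 2 for v in x) / n
    m3 = sum((v - m) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


rng = random.Random(1)  # fixed seed so the run is reproducible
y = [rpois(2.0, rng) for _ in range(5000)]  # assumed true mean of 2
mu_hat = sum(y) / len(y)  # MLE of the mean in an intercept-only Poisson model

residual_types = [("pearson", pearson), ("deviance", deviance), ("anscombe", anscombe)]
skew = {name: skewness([f(v, mu_hat) for v in y]) for name, f in residual_types}

for name, s in skew.items():
    print(f"{name:9s} skewness = {s:+.3f}")
```

Skewness is only a crude stand-in for the Q-Q plots. To draw those, sort each residual vector and plot it against normal quantiles, for instance from `statistics.NormalDist().inv_cdf`.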
