Diagnostic measures (such as residuals or some summary statistics calculated from residuals) are used to evaluate some aspect of quality of model fit to data.
Questions tagged [diagnostic]
346 questions
78
votes
3 answers
Diagnostics for logistic regression?
For linear regression, we can check the diagnostic plots (residuals plots, Normal QQ plots, etc) to check if the assumptions of linear regression are violated.
For logistic regression, I am having trouble finding resources that explain how to…

ialm
- 1,707
- 2
- 19
- 19
53
votes
2 answers
How to read Cook's distance plots?
Does anyone know how to work out whether points 7, 16 and 29 are influential points or not?
I read somewhere that because Cook's distance is lower than 1, they are not. Am, I right?

Platypezid
- 1,197
- 3
- 13
- 16
39
votes
3 answers
Interpreting residual diagnostic plots for glm models?
I am looking for guidelines on how to interpret residual plots of glm models. Especially poisson, negative binomial, binomial models. What can we expect from these plots when the models are "correct"? (for example, we expect the variance to grow…

Tal Galili
- 19,935
- 32
- 133
- 195
32
votes
2 answers
Likelihood ratio test in R
Suppose I am going to do a univariate logistic regression on several independent variables, like this:
mod.a <- glm(x ~ a, data=z, family=binominal("logistic"))
mod.b <- glm(x ~ b, data=z, family=binominal("logistic"))
I did a model comparison…

lokheart
- 2,999
- 7
- 37
- 47
30
votes
5 answers
Why does my bootstrap interval have terrible coverage?
I wanted to do a class demonstration where I compare a t-interval to a bootstrap interval and calculate the coverage probability of both. I wanted the data to come from a skewed distribution so I chose to generate the data as exp(rnorm(10, 0, 2)) +…

Flounderer
- 9,575
- 1
- 32
- 43
26
votes
3 answers
Is my model any good, based on the diagnostic metric ($R^2$/ AUC/ accuracy/ RMSE etc.) value?
I've fitted my model and am trying to understand whether it's any good. I've calculated the recommended metrics to assess it ($R^2$/ AUC / accuracy / prediction error / etc) but do not know how to interpret them. In short, how do I tell if my model…

mkt
- 11,770
- 9
- 51
- 125
26
votes
1 answer
What diagnostic plots exists for quantile regression?
Following on my question for OLS, I wonder: what diagnostic plots exists for quantile regression? (and are there R implementation of them?)
A quick google search already came up with the worm plot (which I have never heard about before), and I'd be…

Tal Galili
- 19,935
- 32
- 133
- 195
14
votes
3 answers
Is it valid to analyze signal detection data without employing metrics derived from signal detection theory?
A signal detection experiment typically presents the observer (or diagnostic system) with either a signal or a non-signal, and the observer is asked to report whether they think the presented item is a signal or non-signal. Such experiments yield…

Mike Lawrence
- 12,691
- 8
- 40
- 65
14
votes
2 answers
MCMC Geweke diagnostic
I'm running a Metropolis sampler (C++) and want to use the previous samples to estimate the convergence rate.
One easy to implement diagnostic I found is the Geweke diagnostic, which computes the difference between the two sample means divided by…

Yang
- 263
- 2
- 8
14
votes
2 answers
Prove the relation between Mahalanobis distance and Leverage?
I have seen formulas on Wikipedia. that relate Mahalanobis distance and Leverage:
Mahalanobis distance is closely related to the leverage statistic, $h$, but has a different scale: $$D^2 = (N - 1)(h - \tfrac{1}{N}).$$
In a linked article,…

dave2d
- 303
- 2
- 4
13
votes
1 answer
Why are diagnostics based on residuals?
In simple linear regression one often wants to verify if certain assumptions are met to be able to do inference (e.g. residuals are normally distributed).
Is it reasonable to check the assumptions by checking if the fitted values are normally…

bdeonovic
- 8,507
- 1
- 24
- 49
12
votes
4 answers
Is there a test for omitted variable bias in OLS?
I am aware of the Ramsey Reset test which may detect nonlinear dependencies. However, if you just throw out one of the regression coefficients (merely linear dependencies), you may get a bias, depending on the correlations. This is obviously not…

user13655
- 377
- 1
- 2
- 8
11
votes
2 answers
What is gained from a scale-location plot?
The plot function in R provides four diagnostic plots for linear regression:
It seems like the residuals vs fitted plot and the scale-location plot are basically providing the same exact information. So why provide this seemingly redundant…

user166625
- 459
- 5
- 13
11
votes
1 answer
When/why not to use studentized residuals for regression diagnostics?
Consider a linear regression
$$
y=X\beta+\varepsilon.
$$
Residuals $e:=y-X\hat\beta$ are often used as substitutes for the unobserved model errors $\varepsilon$ for validating assumptions such as homoskedasticity of $\varepsilon$, normality of…

Richard Hardy
- 54,375
- 10
- 95
- 219
11
votes
2 answers
What are some good exploratory analysis and diagnostic plots for count data?
Does anyone know of good reference material on exploratory analysis and diagnostic plots for count data?

user12397
- 91
- 1
- 4