Questions tagged [heteroscedasticity]

Non-constant variance along some continuum in a random process.

Heteroscedasticity refers to the property of a random process that has non-constant variance along some continuum. This most commonly presents in regression where the error variance increases as a function of one or more predictors, but also commonly refers to a time series whose variance changes over time. The Greek skedasis means "dispersion".

Random data showing heteroscedasticity: . . . . . and heteroscedastic vs. homoscedastic residuals:
by Q9. . . . . . . . . . by Protonk.

Heteroscedasticity may be intrinsically interesting, as in this example from Wikipedia:

A classic example of heteroscedasticity is that of income versus expenditure on meals...A poorer person will spend a rather constant amount by always eating inexpensive food; a wealthier person may occasionally buy inexpensive food and at other times eat expensive meals. Those with higher incomes display a greater variability of food consumption. [Emphasis added.]

Heteroscedasticity may complicate predictive/explanatory modeling, as in the other example:

Imagine you are watching a rocket take off nearby and measuring the distance it has traveled once each second. In the first couple of seconds your measurements may be accurate to the nearest centimeter, say. However, 5 minutes later as the rocket recedes into space, the accuracy of your measurements may only be good to 100 m, because of the increased distance, atmospheric distortion and a variety of other factors. The data you collect would exhibit heteroscedasticity. [Emphasis added.]

Questions that should use this tag:

Questions about variables for which the variance depends on another variable
Questions involving analyses of datasets with problematic heteroscedasticity
- E.g., ols estimation of general-linear-models assumes sphericity, which entails homo- scedasticity and nonautocorrelation. Heteroscedasticity can bias standard-errors in:
  - anova, particularly with repeated-measures
  - linear-regression, particularly of time-series

See Wikipedia also for:

Further definition of heteroscedasticity, including the multivariate case
Consequences of heteroscedasticity
Detection methods, including levenes-test
Fixes, including robust methods like WLS estimation, HCSEs, and ARCH modeling
References and further reading

1054 questions

votes

2 answers

What does having "constant variance" in a linear regression model mean?

What does having "constant variance" in the error term mean? As I see it, we have a data with one dependent variable and one independent variable. Constant variance is one of the assumptions of linear regression. I am wondering what homoscedasticity…

regression heteroscedasticity

asked Mar 13 '13 at 12:51

Mukul

votes

7 answers

When conducting a t-test why would one prefer to assume (or test for) equal variances rather than always use a Welch approximation of the df?

It seems like when the assumption of homogeneity of variance is met that the results from a Welch adjusted t-test and a standard t-test are approximately the same. Why not simply always use the Welch adjusted t?

variance t-test heteroscedasticity

asked Jul 20 '10 at 14:19

russellpierce

17,079
16
67
98

votes

1 answer

Alternatives to one-way ANOVA for heteroskedastic data

I have data from 3 groups of algae biomass ($A$, $B$, $C$) which contain unequal sample sizes ($n_A=15$, $n_B=13$, $n_C=12$) and I would like compare if these groups are from the same population. One-way ANOVA would definitely be the way to go,…

r anova data-transformation heteroscedasticity

asked Mar 30 '14 at 04:21

Rick L.

votes

5 answers

Why are there two spellings of "heteroskedastic" or "heteroscedastic"?

I frequently see both the spellings "heteroskedastic" and "heteroscedastic", and similarly for "homoscedastic" and "homoskedastic". There seems to be no difference in meaning between the "c" and the "k" variants, simply an orthographic difference…

terminology heteroscedasticity etymology

asked May 22 '15 at 08:47

Silverfish

20,678
23
92
180

votes

5 answers

What are the dangers of violating the homoscedasticity assumption for linear regression?

As an example, consider the ChickWeight data set in R. The variance obviously grows over time, so if I use a simple linear regression like: m <- lm(weight ~ Time*Diet, data=ChickWeight) My questions: Which aspects of the model will be…

r regression heteroscedasticity assumptions

asked Feb 14 '12 at 15:50

Dan M.

votes

2 answers

How do you find weights for weighted least squares regression?

I am a bit lost in the process of WLS regression. I have been given dataset and my task is to test whether there is heteroscedascity, and if so I should run WLS regression. I have carried out the test and found evidence for heteroscedascity, so I…

regression heteroscedasticity weighted-regression

asked May 15 '14 at 12:44

m3div0

votes

4 answers

Best way to deal with heteroscedasticity?

I have a plot of residual values of a linear model in function of the fitted values where the heteroscedasticity is very clear. However I'm not sure how I should proceed now because as far as I understand this heteroscedasticity makes my linear…

r generalized-linear-model heteroscedasticity lm

asked Apr 18 '15 at 22:29

TristanDM

votes

1 answer

Sandwich estimator intuition

Wikipedia and the R sandwich package vignette give good information about the assumptions supporting OLS coefficient standard errors and the mathematical background of the sandwich estimators. I'm still not clear how the problem of residuals…

multiple-regression residuals heteroscedasticity robust-standard-error

asked Feb 25 '13 at 00:25

Robert Kubrick

4,078
8
38
55

votes

6 answers

Always Report Robust (White) Standard Errors?

It has been suggested by Angrist and Pischke that Robust (i.e. robust to heteroskedasticity or unequal variances) Standard Errors are reported as a matter of course rather than testing for it. Two questions: What is impact on the standard errors of…

regression standard-error heteroscedasticity robust-standard-error

asked Jul 21 '10 at 17:45

Graham Cookson

7,543
6
41
35

votes

1 answer

Why Levene test of equality of variances rather than F ratio?

SPSS uses the Levene test to evaluate homogeneity of variances in the independent group t-test procedure. Why is the Levene test better than a simple F ratio of the ratio of the variances of the two groups?

hypothesis-testing anova variance t-test heteroscedasticity

asked Mar 02 '12 at 21:59

Joel W.

3,096
3
31
45

votes

3 answers

Regression modelling with unequal variance

I would like to fit a linear model (lm) where the residuals variance is clearly dependent on the explanatory variable. The way I know to do this is by using glm with the Gamma family to model the variance, and then put its inverse into the weights…

r generalized-linear-model linear-model heteroscedasticity gamlss

asked Aug 14 '12 at 21:28

Tal Galili

19,935
32
133
195

votes

4 answers

Practically speaking, how do people handle ANOVA when the data doesn't quite meet assumptions?

This isn't a strictly stats question--I can read all the textbooks about ANOVA assumptions--I'm trying to figure out how actual working analysts handle data that doesn't quite meet the assumptions. I've gone through a lot of questions on this site…

anova heteroscedasticity assumptions

asked May 09 '14 at 18:59

Jas Max

votes

2 answers

Transforming proportion data: when arcsin square root is not enough

Is there a (stronger?) alternative to the arcsin square root transformation for percentage/proportion data? In the data set I'm working on at the moment, marked heteroscedasticity remains after I apply this transformation, i.e. the plot of…

data-transformation generalized-linear-model heteroscedasticity

asked May 19 '11 at 13:48

Freya Harrison

3,212
4
25
31

votes

2 answers

How do I interpret this fitted vs residuals plot?

I don't really understand heteroscedasticity. I would like to know whether my model is appropriate or not according to this plot.

r regression residuals heteroscedasticity independence

asked Jan 18 '14 at 18:11

kanbhold

votes

2 answers

How to run two-way ANOVA on data with neither normality nor equality of variance in R?

I am working on my master thesis at the moment and planned on running the statistics with SigmaPlot. However, after spending some time with my data I came to the conclusion that SigmaPlot might not be fit for my problem (I may be mistaken) so I…

r anova nonparametric heteroscedasticity

asked May 16 '12 at 13:21

Sabine

2 3

…

70 71 Next