What are the weakest assumptions about the errors needed to derive the ordinary least squares estimator?

Asked Jun 15 '19 at 11:54

Active Jun 15 '19 at 12:11

Viewed 20 times

This is a question from a past paper. The answer given in the mark scheme is that the minimal assumption is just $\textrm{Var}(\epsilon)=\sigma^2I_n$. I'm struggling to understand why this is the case, based off the standard derivation of the estimator. I would have assumed that the assumptions were $\mathbb{E}(\epsilon)=0$ so that the sum of squares is given by $S(\beta)=(y-Z\beta)^T(y-Z\beta)$ and that $Z$ is of full rank so that $(Z^TZ)^{-1}$ exists.

I am aware that the assumption of equal variance and no correlation is required for the Gauss-Markov theorem to hold, but I can't see its relevance in this case. Could someone please cast some light on this issue?

edited Jun 15 '19 at 12:11

Richard Hardy

54,375
10
95
219

asked Jun 15 '19 at 11:54

Tim Hargreaves

1

OLS as an optimization algorithm does not assume anything about error terms. – Michael M Jun 15 '19 at 12:17
Michael M: That's true but the result might not be minimum variance if the error term is not gaussian. – mlofton Jun 15 '19 at 12:59

What are the weakest assumptions about the errors needed to derive the ordinary least squares estimator?

0 Answers0