To answer your question directly: the accepted answer in the thread you reference does indeed implicitly assume normally distributed errors:
$$
\begin{aligned}
\mathbf{y} &= \mathbf{X} \boldsymbol{\beta} + \boldsymbol{\epsilon} \\
\boldsymbol{\epsilon} &\sim N(\mathbf{0}, \sigma^2 \mathbf{I}).
\end{aligned}
$$
This is not necessary. Note, for example, that the Gauss-Markov theorem only assumes that the errors $\epsilon_{i}$:
- Have zero mean: $\mathbf{E}(\epsilon_{i}) = 0$
- Have constant variance: $\text{Var}(\epsilon_{i}) = \sigma^{2} < \infty $
- Are uncorrelated with each other: $\text{Cov}(\epsilon_{i}, \epsilon_{j}) = 0$ for $i \neq j$
Yet the proof of the theorem involves deriving the variance of $\hat{\beta}$, namely $\text{Var}(\hat{\beta}) = \sigma^{2} (\mathbf{X}^{\top}\mathbf{X})^{-1}$, and standard errors can be defined from there; none of this requires normality.
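To make this concrete, here is a minimal NumPy sketch (simulated data; the variable names and the uniform error distribution are my own illustrative choices, not from the thread) that computes $\hat{\beta}$ and its standard errors purely from the Gauss-Markov variance formula:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated design and response; the errors are deliberately
# non-normal (uniform) but have zero mean and constant variance,
# so the Gauss-Markov conditions still hold.
n, p = 100, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
beta = np.array([1.0, 2.0, -0.5])
y = X @ beta + rng.uniform(-1.0, 1.0, size=n)

# OLS estimate: beta_hat = (X'X)^{-1} X'y
XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ y

# Unbiased estimate of sigma^2 from the residuals
resid = y - X @ beta_hat
sigma2_hat = resid @ resid / (n - p)

# Var(beta_hat) = sigma^2 (X'X)^{-1}; the standard errors are the
# square roots of its diagonal
se = np.sqrt(sigma2_hat * np.diag(XtX_inv))
print(beta_hat)
print(se)
```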
The assumption of normality of the errors makes the least squares solution also the maximum likelihood estimate. From there, the distributions of the parameter estimates can be derived and confidence intervals defined. See this nice explanation or the one from this site. Furthermore, since in the least squares case we arrive at a $t$ pivot, the intervals are “exact”, while for the generalized linear model we would rely on the asymptotic covariance matrix of $\hat{\beta}$.
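Continuing the sketch above (reusing `beta_hat`, `se`, `n`, and `p`), the exact $t$-based 95% intervals would look like this:

```python
from scipy import stats

# Exact 95% interval from the t pivot with n - p degrees of freedom:
# beta_hat_j +/- t_{0.975, n-p} * se_j (exact only under normal errors)
t_crit = stats.t.ppf(0.975, df=n - p)
print(np.column_stack([beta_hat - t_crit * se, beta_hat + t_crit * se]))
```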