
The normal linear regression model

by Marco Taboga, PhD

This lecture discusses the main properties of the Normal Linear Regression Model (NLRM), a linear regression model in which the vector of errors of the regression is assumed to have a multivariate normal distribution conditional on the matrix of regressors. The assumption of multivariate normality, together with other assumptions (mainly concerning the covariance matrix of the errors), allows us to derive analytically the distributions of the Ordinary Least Squares (OLS) estimators of the regression coefficients and of several other statistics.


Setting

We use the same notation as in the lecture entitled Properties of the OLS estimator (to which you can refer for more details): the $N \times 1$ vector of observations of the dependent variable is denoted by $y$, the $N \times K$ matrix of regressors (called design matrix) is denoted by $X$, the $N \times 1$ vector of errors is denoted by $\varepsilon$ and the $K \times 1$ vector of regression coefficients is denoted by $\beta$, so that the regression equations can be written in matrix form as
$$y = X\beta + \varepsilon.$$
The OLS estimator $\widehat{\beta}$ is the value of $b$ that minimizes the sum of squared residuals
$$(y - Xb)^{\top}(y - Xb)$$
and, if the design matrix $X$ has full rank, it can be computed as
$$\widehat{\beta} = (X^{\top}X)^{-1}X^{\top}y.$$
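For readers who want to experiment numerically, here is a minimal NumPy sketch of the computation above; the simulated design matrix, coefficient values, and variable names are illustrative, not part of the lecture.

```python
# Minimal sketch: computing the OLS estimator beta_hat = (X'X)^{-1} X'y.
# All names and simulated values below are illustrative.
import numpy as np

rng = np.random.default_rng(0)
N, K = 100, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, K - 1))])  # design matrix with intercept
beta = np.array([1.0, 2.0, -0.5])                               # true coefficients
y = X @ beta + rng.normal(scale=0.5, size=N)                    # regression equation y = X beta + epsilon

# Solve the normal equations (X'X) b = X'y instead of inverting X'X explicitly
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(beta_hat)  # close to beta
```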

Assumptions

The assumptions made in a normal linear regression model are:

  1. the design matrix $X$ has full rank (as a consequence, $X^{\top}X$ is invertible and the OLS estimator is $\widehat{\beta} = (X^{\top}X)^{-1}X^{\top}y$);

  2. conditional on $X$, the vector of errors $\varepsilon$ has a multivariate normal distribution with mean equal to $0$ and covariance matrix equal to
$$\operatorname{Var}[\varepsilon \mid X] = \sigma^{2}I,$$
where $\sigma^{2}$ is a positive constant and $I$ is the $N \times N$ identity matrix.

Note that the assumption that the covariance matrix of $\varepsilon$ is diagonal implies that the entries of $\varepsilon$ are mutually independent, that is, $\varepsilon_{i}$ is independent of $\varepsilon_{j}$ for $i \neq j$. Moreover, the assumption that all diagonal entries of the covariance matrix are equal implies that all the entries of $\varepsilon$ have the same variance, that is,
$$\operatorname{Var}[\varepsilon_{i} \mid X] = \sigma^{2}$$
for any $i$. The latter assumption is often referred to as the "homoscedasticity assumption": when it is satisfied, we say that the errors are homoscedastic; when it fails, we say that the errors are heteroscedastic.
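As a numerical aside, a multivariate normal vector with covariance $\sigma^{2}I$ is the same distribution as $N$ independent normals with common variance $\sigma^{2}$. The following sketch (names and values are illustrative) makes this concrete.

```python
# Minimal sketch: epsilon ~ N(0, sigma^2 I) is equivalent to N i.i.d. N(0, sigma^2) draws.
import numpy as np

rng = np.random.default_rng(1)
N, sigma2 = 5, 0.25
cov = sigma2 * np.eye(N)                             # diagonal covariance: independence + equal variances
eps_mv = rng.multivariate_normal(mean=np.zeros(N), cov=cov)
eps_iid = rng.normal(scale=np.sqrt(sigma2), size=N)  # same distribution, drawn componentwise
print(eps_mv)
print(eps_iid)
```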

Distribution of the OLS estimator

Under the assumptions made in the previous section, the OLS estimator has a multivariate normal distribution, conditional on the design matrix.

Proposition In a Normal Linear Regression Model, the OLS estimator $\widehat{\beta}$ has a multivariate normal distribution, conditional on $X$, with mean
$$E[\widehat{\beta} \mid X] = \beta$$
and covariance matrix
$$\operatorname{Var}[\widehat{\beta} \mid X] = \sigma^{2}(X^{\top}X)^{-1}.$$

Proof

First of all, note that
$$\widehat{\beta} = (X^{\top}X)^{-1}X^{\top}y = (X^{\top}X)^{-1}X^{\top}(X\beta + \varepsilon) = \beta + (X^{\top}X)^{-1}X^{\top}\varepsilon.$$
The fact that we are conditioning on $X$ means that we can treat $X$ as a constant matrix. Therefore, conditional on $X$, the OLS estimator $\widehat{\beta}$ is a linear transformation of a multivariate normal random vector (the vector $\varepsilon$). This implies that $\widehat{\beta}$ is also multivariate normal, with mean
$$E[\widehat{\beta} \mid X] = \beta + (X^{\top}X)^{-1}X^{\top}E[\varepsilon \mid X] = \beta$$
and variance
$$\operatorname{Var}[\widehat{\beta} \mid X] = (X^{\top}X)^{-1}X^{\top}\operatorname{Var}[\varepsilon \mid X]\,X(X^{\top}X)^{-1} = \sigma^{2}(X^{\top}X)^{-1}X^{\top}X(X^{\top}X)^{-1} = \sigma^{2}(X^{\top}X)^{-1}.$$

Note that $E[\widehat{\beta} \mid X] = \beta$ means that the OLS estimator is unbiased, not only conditionally, but also unconditionally, because by the Law of Iterated Expectations we have that
$$E[\widehat{\beta}] = E\left[E[\widehat{\beta} \mid X]\right] = E[\beta] = \beta.$$
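A quick Monte Carlo experiment can illustrate the proposition: holding a design matrix fixed (since we condition on $X$), the simulated OLS estimates should be centered at $\beta$ with sample covariance close to $\sigma^{2}(X^{\top}X)^{-1}$. The sketch below uses illustrative names and values.

```python
# Minimal sketch: Monte Carlo check of the conditional distribution of beta_hat.
import numpy as np

rng = np.random.default_rng(2)
N, K, sigma = 50, 2, 0.5
X = np.column_stack([np.ones(N), rng.normal(size=N)])  # fixed design matrix
beta = np.array([1.0, 2.0])

draws = np.empty((10_000, K))
for r in range(draws.shape[0]):
    y = X @ beta + rng.normal(scale=sigma, size=N)     # new errors each replication
    draws[r] = np.linalg.solve(X.T @ X, X.T @ y)

print(draws.mean(axis=0))                   # close to beta (unbiasedness)
print(np.cov(draws, rowvar=False))          # close to sigma^2 (X'X)^{-1}
print(sigma**2 * np.linalg.inv(X.T @ X))    # theoretical covariance matrix
```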

Estimation of the variance of the error terms

The variance of the error terms $\sigma^{2}$ is usually not known. A commonly used estimator of $\sigma^{2}$ is the adjusted sample variance of the residuals:
$$\widehat{\sigma}^{2} = \frac{1}{N-K}\sum_{i=1}^{N}e_{i}^{2},$$
where the regression residuals are the entries of the $N \times 1$ vector
$$e = y - X\widehat{\beta}.$$
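As an illustration, the estimator can be coded in a few lines of NumPy; the helper function and the simulated data below are illustrative, not part of the lecture.

```python
# Minimal sketch: the adjusted sample variance of the residuals, e'e / (N - K).
import numpy as np

def adjusted_residual_variance(X, y):
    """Illustrative helper: returns the adjusted sample variance of the residuals."""
    N, K = X.shape
    beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
    e = y - X @ beta_hat          # residuals e = y - X beta_hat
    return e @ e / (N - K)        # divide by N - K, not N

rng = np.random.default_rng(0)
X = np.column_stack([np.ones(200), rng.normal(size=200)])
y = X @ np.array([1.0, 2.0]) + rng.normal(scale=0.5, size=200)
print(adjusted_residual_variance(X, y))  # close to the true variance 0.25
```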

The properties enjoyed by $\widehat{\sigma}^{2}$ are summarized by the following proposition.

Proposition In a Normal Linear Regression Model, the adjusted sample variance of the residuals $\widehat{\sigma}^{2}$ is a conditionally unbiased estimator of $\sigma^{2}$:
$$E[\widehat{\sigma}^{2} \mid X] = \sigma^{2}.$$
Furthermore, conditional on $X$, $\widehat{\sigma}^{2}$ has a Gamma distribution with parameters $N-K$ and $\sigma^{2}$, and it is independent of $\widehat{\beta}$.

Proof

Denote by $e$ the $N \times 1$ vector of residuals. Remember from the previous proof that the OLS estimator can be written as
$$\widehat{\beta} = \beta + (X^{\top}X)^{-1}X^{\top}\varepsilon.$$
As a consequence, we have
$$e = y - X\widehat{\beta} = X\beta + \varepsilon - X\beta - X(X^{\top}X)^{-1}X^{\top}\varepsilon = \left[I - X(X^{\top}X)^{-1}X^{\top}\right]\varepsilon = M\varepsilon.$$
The matrix
$$M = I - X(X^{\top}X)^{-1}X^{\top}$$
is clearly symmetric (verify it by taking its transpose). It is also idempotent because
$$MM = \left[I - X(X^{\top}X)^{-1}X^{\top}\right]\left[I - X(X^{\top}X)^{-1}X^{\top}\right] = I - 2X(X^{\top}X)^{-1}X^{\top} + X(X^{\top}X)^{-1}X^{\top}X(X^{\top}X)^{-1}X^{\top} = M.$$
Therefore,
$$e = M\varepsilon = \sigma M z,$$
where $z = \varepsilon/\sigma$ has a standard multivariate normal distribution, that is, a multivariate normal distribution with zero mean and unit covariance matrix. Since the matrix $M$ is symmetric and idempotent, the quadratic form
$$Q = z^{\top}Mz$$
has a Chi-square distribution with a number of degrees of freedom equal to the trace of the matrix $M$ (see the lecture Normal distribution - Quadratic forms). But the trace of $M$ is
$$\operatorname{tr}(M) = \operatorname{tr}(I_{N}) - \operatorname{tr}\left(X(X^{\top}X)^{-1}X^{\top}\right) = N - \operatorname{tr}\left((X^{\top}X)^{-1}X^{\top}X\right) = N - \operatorname{tr}(I_{K}) = N - K.$$
Note also that, by the symmetry and idempotency of $M$,
$$\widehat{\sigma}^{2} = \frac{1}{N-K}e^{\top}e = \frac{\sigma^{2}}{N-K}z^{\top}M^{\top}Mz = \frac{\sigma^{2}}{N-K}Q.$$
Since the expected value of a Chi-square random variable is equal to its number of degrees of freedom, we have
$$E[\widehat{\sigma}^{2} \mid X] = \frac{\sigma^{2}}{N-K}E[Q \mid X] = \frac{\sigma^{2}}{N-K}(N-K) = \sigma^{2}.$$
Moreover, the fact that the quadratic form $Q$ has a Chi-square distribution with $N-K$ degrees of freedom implies that the sample variance
$$\widehat{\sigma}^{2} = \frac{\sigma^{2}}{N-K}Q$$
has a Gamma distribution with parameters $N-K$ and $\sigma^{2}$ (see the lecture on the Gamma distribution for a proof of this fact). To conclude, we need to prove that $\widehat{\sigma}^{2}$ is independent of $\widehat{\beta}$. Since
$$e = M\varepsilon$$
and
$$\widehat{\beta} = \beta + (X^{\top}X)^{-1}X^{\top}\varepsilon,$$
we have that $\widehat{\sigma}^{2}$ and $\widehat{\beta}$ are functions of the same multivariate normal random vector $\varepsilon$. Therefore, by standard results on the independence of quadratic forms involving normal vectors, $\widehat{\sigma}^{2}$ and $\widehat{\beta}$ are independent if $M\varepsilon$ and $(X^{\top}X)^{-1}X^{\top}\varepsilon$ are orthogonal. In order to check their orthogonality, we only need to verify that the product between $(X^{\top}X)^{-1}X^{\top}$ and $M$ is zero:
$$(X^{\top}X)^{-1}X^{\top}M = (X^{\top}X)^{-1}X^{\top}\left[I - X(X^{\top}X)^{-1}X^{\top}\right] = (X^{\top}X)^{-1}X^{\top} - (X^{\top}X)^{-1}X^{\top} = 0.$$
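The matrix algebra in this proof is easy to check numerically. The following sketch (illustrative names, random design matrix) verifies that $M$ is symmetric and idempotent, that its trace equals $N-K$, and that the orthogonality condition used at the end of the proof holds.

```python
# Minimal sketch: numerically checking the properties of M = I - X(X'X)^{-1}X'.
import numpy as np

rng = np.random.default_rng(3)
N, K = 20, 4
X = rng.normal(size=(N, K))
P = X @ np.linalg.solve(X.T @ X, X.T)   # projection onto the column space of X
M = np.eye(N) - P

print(np.allclose(M, M.T))              # symmetric
print(np.allclose(M @ M, M))            # idempotent
print(np.isclose(np.trace(M), N - K))   # trace equals N - K
A = np.linalg.solve(X.T @ X, X.T)       # the matrix (X'X)^{-1} X'
print(np.allclose(A @ M, 0))            # orthogonality used in the proof
```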

Note that, also in this case, the proposed estimator is unbiased not only conditionally, but also unconditionally, because, by the Law of Iterated Expectations, we have that
$$E[\widehat{\sigma}^{2}] = E\left[E[\widehat{\sigma}^{2} \mid X]\right] = \sigma^{2}.$$

Estimation of the covariance matrix of the OLS estimator

We have already proved that in the Normal Linear Regression Model the covariance matrix of the OLS estimator, conditional on $X$, is
$$\operatorname{Var}[\widehat{\beta} \mid X] = \sigma^{2}(X^{\top}X)^{-1}.$$

In practice this quantity is not known exactly, because the variance of the error terms $\sigma^{2}$ is unknown. However, we can replace its unknown value with the estimator proposed above (the adjusted sample variance of the residuals), so as to obtain an estimator of the covariance matrix of $\widehat{\beta}$:
$$\widehat{\operatorname{Var}}[\widehat{\beta} \mid X] = \widehat{\sigma}^{2}(X^{\top}X)^{-1}.$$

This estimator is often employed to construct test statistics that allow us to conduct tests of hypotheses about the regression coefficients.
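As an illustration, a minimal NumPy sketch of this plug-in estimator might look as follows; the helper name ols_with_standard_errors and the simulated data are illustrative, not from the lecture. The square roots of the diagonal entries of the estimated covariance matrix are the usual standard errors of the coefficient estimates.

```python
# Minimal sketch: estimated covariance matrix of beta_hat and standard errors.
import numpy as np

def ols_with_standard_errors(X, y):
    """Illustrative helper: OLS estimates, estimated covariance, standard errors."""
    N, K = X.shape
    XtX_inv = np.linalg.inv(X.T @ X)
    beta_hat = XtX_inv @ X.T @ y
    e = y - X @ beta_hat
    sigma2_hat = e @ e / (N - K)        # adjusted sample variance of the residuals
    V_hat = sigma2_hat * XtX_inv        # estimated covariance matrix of beta_hat
    return beta_hat, V_hat, np.sqrt(np.diag(V_hat))

rng = np.random.default_rng(0)
X = np.column_stack([np.ones(100), rng.normal(size=100)])
y = X @ np.array([1.0, -0.5]) + rng.normal(size=100)
beta_hat, V_hat, se = ols_with_standard_errors(X, y)
print(beta_hat, se)
```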

Maximum likelihood estimation

It can be proved that the OLS estimators of the coefficients of a Normal Linear Regression Model coincide with the maximum likelihood estimators. By contrast, the maximum likelihood estimator of the variance of the error terms differs from the unbiased estimator derived above. For proofs of these two facts, see the lecture entitled Linear Regression - Maximum likelihood estimation.
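The difference is simple to state: the maximum likelihood estimator of $\sigma^{2}$ divides the sum of squared residuals by $N$, while the adjusted sample variance divides it by $N-K$. The following sketch (with illustrative names and values) compares the two.

```python
# Minimal sketch: ML variance estimator (divide by N) vs. the unbiased one (divide by N - K).
import numpy as np

rng = np.random.default_rng(4)
N, K = 30, 3
X = rng.normal(size=(N, K))
y = X @ np.ones(K) + rng.normal(size=N)

e = y - X @ np.linalg.solve(X.T @ X, X.T @ y)  # residuals
print(e @ e / N)        # maximum likelihood estimator (biased)
print(e @ e / (N - K))  # adjusted sample variance (unbiased)
```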

Hypothesis testing

In the lecture on Linear regressions and hypothesis testing we explain how to perform hypothesis tests on the coefficients of a normal linear regression model.

How to cite

Please cite as:

Taboga, Marco (2021). "The normal linear regression model", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/fundamentals-of-statistics/normal-linear-regression-model.
