Wishart distribution

The Wishart distribution is a multivariate continuous distribution which generalizes the Gamma distribution.

Table of contents

How the distribution is derived
Definition
Relation to the multivariate normal distribution
Expected value
Covariance matrix
Review of matrix algebra
References

How the distribution is derived

In previous lectures we have explained that:

1. a Chi-square random variable with $n$ degrees of freedom can be seen as a sum of squares of $n$ independent normal random variables having mean $0$ and variance $1$;
2. a Gamma random variable with parameters $n$ and $h$ can be seen as a sum of squares of $n$ independent normal random variables having mean $0$ and variance $h/n$.

A Wishart random matrix with parameters $n$ and $H$ can be seen as a sum of outer products of $n$ independent multivariate normal random vectors having mean $0$ and covariance matrix $\frac{1}{n}H$.

In this sense, the Wishart distribution can be considered a generalization of the Gamma distribution (take point 2 above and substitute normal random variables with multivariate normal random vectors, squares with outer products and the variance with the covariance matrix $\frac{1}{n}H$).

At the bottom of this page you can find a brief review of some basic concepts in matrix algebra that will be helpful in understanding the remainder of this lecture.

Definition

Wishart random matrices are characterized as follows.

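Definition Let $W$ be a $K \times K$ continuous random matrix. Let its support be the set of all $K \times K$ symmetric and positive definite real matrices:

$$R_W = \left\{ w \in \mathbb{R}^{K \times K} : w \text{ is symmetric and positive definite} \right\}$$

Let $H$ be a $K \times K$ symmetric and positive definite matrix and $n > K - 1$. We say that $W$ has a Wishart distribution with parameters $n$ and $H$ if its joint probability density function is

$$f_W(w) = c \, [\det(w)]^{n/2 - (K+1)/2} \exp\left( -\frac{n}{2} \operatorname{tr}\left( H^{-1} w \right) \right)$$

where

$$c = \frac{n^{nK/2}}{2^{nK/2} \, [\det(H)]^{n/2} \, \pi^{K(K-1)/4} \prod_{j=1}^{K} \Gamma\left( \frac{n}{2} + \frac{1-j}{2} \right)}$$

and $\Gamma(\cdot)$ is the Gamma function.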

The parameter $n$ need not be an integer, but, when $n$ is not an integer, $W$ can no longer be interpreted as a sum of outer products of multivariate normal random vectors.
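As a quick sanity check of the claim that the Wishart distribution generalizes the Gamma distribution, one can specialize the density to $K = 1$. The sketch below assumes the standard form of the Wishart density with scale matrix $\frac{1}{n}H$; the scalar symbols $w$ and $h$ (the $1 \times 1$ versions of the matrix argument and of $H$) are notation introduced here for illustration.

```latex
\begin{aligned}
f_W(w) &= c \, w^{n/2 - 1} \exp\left( -\frac{n}{2h} \, w \right), \qquad w > 0, \\
c &= \frac{(n/h)^{n/2}}{2^{n/2} \, \Gamma(n/2)} .
\end{aligned}
```

This is the density of a Gamma random variable with shape $n/2$ and rate $n/(2h)$, whose mean is $h$, in line with the expected value $H$ of the matrix case.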

Relation to the multivariate normal distribution

The following proposition provides the link between the multivariate normal distribution and the Wishart distribution.

Proposition Let $X_1, \ldots, X_n$ be $n$ independent $K \times 1$ random vectors all having a multivariate normal distribution with mean $0$ and covariance matrix $\frac{1}{n}H$. Let $K \leq n$. Define

$$W = \sum_{s=1}^{n} X_s X_s^{\top}$$

Then $W$ has a Wishart distribution with parameters $n$ and $H$.

Proof

The proof of this proposition is quite lengthy and complicated. The interested reader might have a look at Ghosh and Sinha (2002).
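The construction in the proposition can be sketched in code. The following is a minimal illustration, not a reference implementation: it uses only the Python standard library, works for integer $n$ only, and the helper names `cholesky` and `sample_wishart`, as well as the example matrix `H`, are hypothetical choices made here.

```python
import math
import random


def cholesky(a):
    """Lower-triangular factor low with low * low^T = a (a symmetric positive definite)."""
    k = len(a)
    low = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1):
            s = sum(low[i][m] * low[j][m] for m in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def sample_wishart(n, h, rng):
    """Draw W = sum_{s=1}^n X_s X_s^T with X_s ~ N(0, H / n), for integer n."""
    k = len(h)
    # Cholesky factor of the covariance matrix H / n.
    low = cholesky([[h[r][c] / n for c in range(k)] for r in range(k)])
    w = [[0.0] * k for _ in range(k)]
    for _ in range(n):
        z = [rng.gauss(0.0, 1.0) for _ in range(k)]
        # x = low * z has mean 0 and covariance low * low^T = H / n.
        x = [sum(low[r][m] * z[m] for m in range(r + 1)) for r in range(k)]
        # Accumulate the outer product x x^T.
        for r in range(k):
            for c in range(k):
                w[r][c] += x[r] * x[c]
    return w


rng = random.Random(0)          # fixed seed, chosen arbitrarily
H = [[2.0, 1.0], [1.0, 3.0]]    # example parameter matrix (an assumption)
W = sample_wishart(5, H, rng)
print(W)
```

With $n \geq K$ the simulated matrix is symmetric and, with probability one, positive definite, which matches the support given in the definition.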

Expected value

The expected value of a Wishart random matrix $W$ with parameters $n$ and $H$ is

$$\operatorname{E}[W] = H$$

Proof

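We do not provide a fully general proof; we prove the result only for the special case in which $n$ is an integer and $W$ can be written as

$$W = \sum_{s=1}^{n} X_s X_s^{\top}$$

(see the subsection above). In this case, we have that

$$\operatorname{E}[W] = \sum_{s=1}^{n} \operatorname{E}\left[ X_s X_s^{\top} \right] = \sum_{s=1}^{n} \left( \operatorname{Var}[X_s] + \operatorname{E}[X_s] \operatorname{E}[X_s]^{\top} \right) = \sum_{s=1}^{n} \frac{1}{n} H = H$$

where we have used the fact that the covariance matrix of $X_s$ can be written as

$$\operatorname{Var}[X_s] = \operatorname{E}\left[ X_s X_s^{\top} \right] - \operatorname{E}[X_s] \operatorname{E}[X_s]^{\top}$$

(see the lecture entitled Covariance matrix) and that $\operatorname{E}[X_s] = 0$.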
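The result $\operatorname{E}[W] = H$ can also be checked numerically. The sketch below is a Monte Carlo illustration under stated assumptions: integer $n$, a $2 \times 2$ example matrix `H` chosen here, and hypothetical helper names (`cholesky`, `monte_carlo_mean`). It averages many simulated sums of outer products and compares the average with $H$.

```python
import math
import random


def cholesky(a):
    """Lower-triangular factor low with low * low^T = a (a symmetric positive definite)."""
    k = len(a)
    low = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1):
            s = sum(low[i][m] * low[j][m] for m in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def monte_carlo_mean(n, h, draws, rng):
    """Average `draws` simulated matrices W = sum_s X_s X_s^T, X_s ~ N(0, H / n)."""
    k = len(h)
    low = cholesky([[h[r][c] / n for c in range(k)] for r in range(k)])
    acc = [[0.0] * k for _ in range(k)]
    # Each W uses n vectors, so draws * n vectors are simulated in total.
    for _ in range(draws * n):
        z = [rng.gauss(0.0, 1.0) for _ in range(k)]
        x = [sum(low[r][m] * z[m] for m in range(r + 1)) for r in range(k)]
        for r in range(k):
            for c in range(k):
                acc[r][c] += x[r] * x[c] / draws
    return acc


rng = random.Random(1)          # fixed seed, chosen arbitrarily
H = [[2.0, 1.0], [1.0, 3.0]]    # example parameter matrix (an assumption)
mean_W = monte_carlo_mean(5, H, 4000, rng)
print(mean_W)                   # entries should be close to those of H
```

The agreement is only approximate, since the average is itself random; increasing `draws` tightens it.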

Covariance matrix

The concept of covariance matrix is well-defined only for random vectors. However, when dealing with a random matrix, one might want to compute the covariance matrix of its associated vectorization (if you are not familiar with the concept of vectorization, see the review of matrix algebra below for a definition). Therefore, in the case of a Wishart random matrix $W$, we might want to compute the following covariance matrix:

$$\operatorname{Var}[\operatorname{vec}(W)]$$

Since $\operatorname{vec}(W)$, the vectorization of $W$, is a $K^{2} \times 1$ random vector, $\operatorname{Var}[\operatorname{vec}(W)]$ is a $K^{2} \times K^{2}$ matrix.

It is possible to prove that

$$\operatorname{Var}[\operatorname{vec}(W)] = \frac{1}{n} (I + P)(H \otimes H)$$

where $I$ is the $K^{2} \times K^{2}$ identity matrix, $\otimes$ denotes the Kronecker product and $P$ is the transposition-permutation matrix associated to $\operatorname{vec}(W)$ (see the review of matrix algebra below for a definition).

Proof

The proof of this formula can be found in Muirhead (2005).
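To make the covariance formula concrete, the sketch below builds the matrix $\frac{1}{n}(I + P)(H \otimes H)$ for a small example. It assumes that this is the form of the formula (it is what the standard Wishart covariance gives when the scale matrix is $\frac{1}{n}H$); the function names and the example values of `H` and `n` are hypothetical.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


def transposition_permutation(k):
    """K^2 x K^2 matrix P such that P vec(A) = vec(A^T) for any K x K matrix A."""
    p = [[0.0] * (k * k) for _ in range(k * k)]
    for i in range(k):
        for j in range(k):
            # vec stacks columns: A[i][j] sits at position j*k + i of vec(A)
            # and at position i*k + j of vec(A^T).
            p[i * k + j][j * k + i] = 1.0
    return p


def wishart_vec_covariance(n, h):
    """Return (1/n) (I + P) (H kron H) as a list of rows."""
    k = len(h)
    size = k * k
    hh = kron(h, h)
    p = transposition_permutation(k)
    out = []
    for r in range(size):
        # Row r of the matrix I + P.
        ip_row = [(1.0 if r == m else 0.0) + p[r][m] for m in range(size)]
        out.append(
            [sum(ip_row[m] * hh[m][c] for m in range(size)) / n for c in range(size)]
        )
    return out


H = [[2.0, 1.0], [1.0, 3.0]]    # example parameter matrix (an assumption)
n = 5
V = wishart_vec_covariance(n, H)
for row in V:
    print(row)
```

For $K = 2$, $\operatorname{vec}(W) = (W_{11}, W_{21}, W_{12}, W_{22})^{\top}$, so `V[0][3]` is the covariance between $W_{11}$ and $W_{22}$ and `V[0][0]` is the variance of $W_{11}$.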

There is a simpler expression for the covariances between the diagonal entries of $W$:

$$\operatorname{Cov}[W_{ii}, W_{jj}] = \frac{2}{n} H_{ij}^{2}$$

Proof

Again, we do not provide a fully general proof; we prove the result only for the special case in which $n$ is an integer and $W$ can be written as

$$W = \sum_{s=1}^{n} X_s X_s^{\top}$$

(see above). To compute this covariance, we first need to compute the following fourth cross-moment:

$$\operatorname{E}\left[ X_{si}^{2} X_{sj}^{2} \right]$$

where $X_{si}$ denotes the $i$-th component ($i = 1, \ldots, K$) of the random vector $X_{s}$ ($s = 1, \ldots, n$). This cross-moment can be computed by taking the fourth cross-partial derivative of the joint moment generating function of $X_{si}$ and $X_{sj}$ and evaluating it at zero (see the lecture entitled Joint moment generating function). While this is not complicated, the algebra is quite tedious. I recommend doing it with computer algebra, for example using the MATLAB Symbolic Math Toolbox and the following four commands:

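% s1, s2: standard deviations of X_si and X_sj; s12: their covariance
syms t1 t2 s1 s2 s12;

% joint moment generating function of a zero-mean bivariate normal vector
f=exp(0.5*(s1^2)*(t1^2)+0.5*(s2^2)*(t2^2)+s12*t1*t2);

% fourth cross-partial derivative: twice in t1 and twice in t2
d4f=diff(diff(f,t1,2),t2,2);

% evaluate at t1 = t2 = 0 to obtain E[X_si^2 * X_sj^2]
subs(d4f,{t1,t2},{0,0})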

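The result of the computations is

$$\operatorname{E}\left[ X_{si}^{2} X_{sj}^{2} \right] = \operatorname{Var}[X_{si}] \operatorname{Var}[X_{sj}] + 2 \operatorname{Cov}[X_{si}, X_{sj}]^{2} = \frac{1}{n^{2}} \left( H_{ii} H_{jj} + 2 H_{ij}^{2} \right)$$

Using this result, the covariance between $W_{ii}$ and $W_{jj}$ is derived as follows:

$$\begin{aligned}
\operatorname{Cov}[W_{ii}, W_{jj}] &= \operatorname{Cov}\left[ \sum_{s=1}^{n} X_{si}^{2}, \sum_{t=1}^{n} X_{tj}^{2} \right] \\
&= \sum_{s=1}^{n} \sum_{t=1}^{n} \operatorname{Cov}\left[ X_{si}^{2}, X_{tj}^{2} \right] \\
&= \sum_{s=1}^{n} \operatorname{Cov}\left[ X_{si}^{2}, X_{sj}^{2} \right] \qquad \text{(since } X_s \text{ and } X_t \text{ are independent for } s \neq t \text{)} \\
&= \sum_{s=1}^{n} \left( \operatorname{E}\left[ X_{si}^{2} X_{sj}^{2} \right] - \operatorname{E}\left[ X_{si}^{2} \right] \operatorname{E}\left[ X_{sj}^{2} \right] \right) \\
&= \sum_{s=1}^{n} \left( \frac{1}{n^{2}} \left( H_{ii} H_{jj} + 2 H_{ij}^{2} \right) - \frac{1}{n^{2}} H_{ii} H_{jj} \right) \\
&= \frac{2}{n} H_{ij}^{2}
\end{aligned}$$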
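The covariance between diagonal entries can also be checked by simulation. The sketch below is a Monte Carlo illustration for the $2 \times 2$ case, assuming the formula $\operatorname{Cov}[W_{ii}, W_{jj}] = \frac{2}{n} H_{ij}^{2}$ and integer $n$; the helper names and the example values of `H` and `n` are hypothetical.

```python
import math
import random


def draw_diagonal(n, h, rng):
    """Return (W_11, W_22) for one 2 x 2 draw W = sum_s X_s X_s^T, X_s ~ N(0, H / n)."""
    # Cholesky factor of H / n, written out by hand for the 2 x 2 case.
    a = math.sqrt(h[0][0] / n)
    b = (h[0][1] / n) / a
    c = math.sqrt(h[1][1] / n - b * b)
    w11 = w22 = 0.0
    for _ in range(n):
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        x1, x2 = a * z1, b * z1 + c * z2
        # Diagonal entries of the outer product are the squared components.
        w11 += x1 * x1
        w22 += x2 * x2
    return w11, w22


def sample_covariance(pairs):
    """Unbiased sample covariance of a list of (u, v) pairs."""
    m = len(pairs)
    mean_u = sum(p[0] for p in pairs) / m
    mean_v = sum(p[1] for p in pairs) / m
    return sum((p[0] - mean_u) * (p[1] - mean_v) for p in pairs) / (m - 1)


rng = random.Random(2)          # fixed seed, chosen arbitrarily
H = [[2.0, 1.0], [1.0, 3.0]]    # example parameter matrix (an assumption)
n = 4
pairs = [draw_diagonal(n, H, rng) for _ in range(20000)]
estimate = sample_covariance(pairs)
theory = 2.0 * H[0][1] ** 2 / n
print(estimate, theory)
```

The simulated value only approximates the theoretical one; the gap shrinks as the number of draws grows.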

Review of matrix algebra

This section reviews some results from matrix algebra that are used to deal with the Wishart distribution.

Outer products

As the Wishart distribution involves outer products of multivariate normal random vectors, we briefly review here the concept of outer product.

If $x$ is a $K \times 1$ column vector, the outer product of $x$ with itself is the $K \times K$ matrix obtained from the multiplication of $x$ with its transpose:

$$x x^{\top}$$

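Example If $X$ is the $2 \times 1$ random vector

$$X = \begin{bmatrix} X_{1} \\ X_{2} \end{bmatrix}$$

then its outer product $XX^{\top}$ is the $2 \times 2$ random matrix

$$XX^{\top} = \begin{bmatrix} X_{1}^{2} & X_{1} X_{2} \\ X_{2} X_{1} & X_{2}^{2} \end{bmatrix}$$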

Symmetric matrices

A $K \times K$ matrix $A$ is symmetric if and only if $A = A^{\top}$, i.e. if and only if $A$ equals its transpose.

Positive definite matrices

A $K \times K$ matrix $A$ is said to be positive definite if and only if $x^{\top} A x > 0$ for any $K \times 1$ real vector $x$ such that $x \neq 0$.

All positive definite matrices are also invertible.

Proof

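The proof is by contradiction. Suppose a positive definite matrix $A$ were not invertible. Then $A$ would not be full rank, i.e. there would be a vector $x \neq 0$ such that $Ax = 0$ which, premultiplied by $x^{\top}$, would yield $x^{\top} A x = 0$. But this is a contradiction, because positive definiteness requires $x^{\top} A x > 0$ for every $x \neq 0$.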

Trace of a matrix

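Let $A$ be a $K \times K$ matrix and denote by $A_{ij}$ the $(i,j)$-th entry of $A$ (i.e. the entry at the intersection of the $i$-th row and the $j$-th column). The trace of $A$, denoted by $\operatorname{tr}(A)$, is the sum of all the diagonal entries of $A$:

$$\operatorname{tr}(A) = \sum_{i=1}^{K} A_{ii}$$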

Vectorization of a matrix

Given a $K \times L$ matrix $A$, its vectorization, denoted by $\operatorname{vec}(A)$, is the $KL \times 1$ vector obtained by stacking the columns of $A$ on top of each other.

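Example If $A$ is a $2 \times 2$ matrix

$$A = \begin{bmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{bmatrix}$$

the vectorization of $A$ is the $4 \times 1$ vector

$$\operatorname{vec}(A) = \begin{bmatrix} A_{11} \\ A_{21} \\ A_{12} \\ A_{22} \end{bmatrix}$$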

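For a given $K \times L$ matrix $A$, the vectorization of $A$ will in general be different from the vectorization of its transpose $A^{\top}$. The transposition-permutation matrix associated to $\operatorname{vec}(A)$ is the $KL \times KL$ matrix $P$ such that

$$\operatorname{vec}\left( A^{\top} \right) = P \operatorname{vec}(A)$$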
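As an illustration, the display below works out the transposition-permutation matrix for a generic $2 \times 2$ matrix $A$ with entries $A_{ij}$. It is a small worked case, assuming the column-stacking convention for the vectorization described above and the defining relation $\operatorname{vec}(A^{\top}) = P \operatorname{vec}(A)$.

```latex
\operatorname{vec}(A) = \begin{bmatrix} A_{11} \\ A_{21} \\ A_{12} \\ A_{22} \end{bmatrix},
\qquad
\operatorname{vec}\left( A^{\top} \right) = \begin{bmatrix} A_{11} \\ A_{12} \\ A_{21} \\ A_{22} \end{bmatrix},
\qquad
P = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}
```

Multiplying by $P$ swaps the second and third entries of $\operatorname{vec}(A)$, which are exactly the two off-diagonal entries of $A$.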

Kronecker product

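Given a $K \times L$ matrix $A$ and an $M \times N$ matrix $B$, the Kronecker product of $A$ and $B$, denoted by $A \otimes B$, is a $KM \times LN$ matrix having the following structure:

$$A \otimes B = \begin{bmatrix} A_{11} B & \cdots & A_{1L} B \\ \vdots & \ddots & \vdots \\ A_{K1} B & \cdots & A_{KL} B \end{bmatrix}$$

where $A_{ij}$ is the $(i,j)$-th entry of $A$.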
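A small numerical case may help fix the block structure of the Kronecker product. The two $2 \times 2$ matrices below are arbitrary values chosen here for illustration: each entry of $A$ is replaced by that entry times the whole matrix $B$.

```latex
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix},
\qquad
B = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix},
\qquad
A \otimes B = \begin{bmatrix} 1B & 2B \\ 3B & 4B \end{bmatrix}
= \begin{bmatrix} 0 & 1 & 0 & 2 \\ 1 & 0 & 2 & 0 \\ 0 & 3 & 0 & 4 \\ 3 & 0 & 4 & 0 \end{bmatrix}
```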

References

Ghosh, M. and Sinha, B. K. (2002) "A simple derivation of the Wishart distribution", The American Statistician, 56, 100-101.

Muirhead, R.J. (2005) Aspects of multivariate statistical theory, Wiley.

How to cite

Please cite as:

Taboga, Marco (2021). "Wishart distribution", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/probability-distributions/wishart-distribution.