Conditional expectation

The conditional expectation (or conditional expected value, or conditional mean) is the expected value of a random variable, computed with respect to a conditional probability distribution.

Table of contents

A pragmatic approach
Definition
Conditional expectation of a discrete random variable
Conditional expectation of a continuous random variable
Conditional expectation in general
Properties of conditional expectation
Law of iterated expectations
Solved exercises

A pragmatic approach

As in the case of the expected value, a completely rigorous definition of the conditional expectation requires a complicated mathematical apparatus.

To keep things simple, we do not give a completely rigorous definition in this lecture. Instead, we give an informal definition and show how the conditional expectation can be computed.

In particular, we discuss how to calculate the conditional expected value of a random variable $X$ when we observe the realization of another random variable $Y$, that is, when we receive the information that $Y=y$.

Definition

The following informal definition is very similar to our previous definition of the expected value.

Definition Let $X$ and $Y$ be two random variables. The conditional expectation of $X$ given $Y=y$ is the weighted average of the values that $X$ can take on, where each possible value is weighted by its respective conditional probability (conditional on the information that $Y=y$).

The expectation of a random variable $X$ conditional on $Y=y$ is denoted by $$\mathrm{E}[X\mid Y=y].$$

Conditional expectation of a discrete random variable

We start with the case in which $X$ and $Y$ are two discrete random variables and, considered together, they form a discrete random vector.

The formula for the conditional mean of $X$ given $Y=y$ is a straightforward implementation of the above informal definition: the weights of the average are given by the conditional probability mass function of $X$.

Definition Let $X$ and $Y$ be two discrete random variables. Let $R_{X}$ be the support of $X$ and let $p_{X\mid Y=y}(x)$ be the conditional probability mass function of $X$ given $Y=y$. The conditional expectation of $X$ given $Y=y$ is $$\mathrm{E}[X\mid Y=y]=\sum_{x\in R_{X}}x\,p_{X\mid Y=y}(x)$$ provided that $$\sum_{x\in R_{X}}|x|\,p_{X\mid Y=y}(x)<\infty.$$

If you do not understand the symbol $\sum_{x\in R_{X}}$ or the finiteness condition above (absolute summability), go back to the lecture on the Expected value, where they are explained.

Example Let the support of the random vector be [eq6] and its joint probability mass function be [eq7] Let us compute the conditional probability mass function of given . The marginal probability mass function of evaluated at is [eq8] The support of isThus, the conditional probability mass function of given is [eq10] The conditional expectation of given is [eq11]
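The discrete formula can also be checked numerically. The following is a minimal Python sketch, writing $X$ for the variable being averaged and $Y$ for the conditioning variable. The joint probability mass function `joint_pmf` is a hypothetical one chosen only for illustration (it is not the one used in the example above), and the function name `conditional_expectation` is likewise an assumption.

```python
from fractions import Fraction

# Hypothetical joint pmf of (X, Y): keys are (x, y) pairs, values are probabilities.
joint_pmf = {
    (1, 0): Fraction(1, 4),
    (2, 0): Fraction(1, 4),
    (0, 3): Fraction(1, 2),
}


def conditional_expectation(pmf, y):
    """Return E[X | Y = y] for a discrete random vector with joint pmf `pmf`."""
    # Marginal pmf of Y evaluated at y: sum the joint pmf over all x.
    p_y = sum(p for (_, y_val), p in pmf.items() if y_val == y)
    if p_y == 0:
        raise ValueError("the conditioning event has zero probability")
    # Weighted average of x, with weights p(x, y) / p_Y(y).
    return sum(x * p / p_y for (x, y_val), p in pmf.items() if y_val == y)


e_given_0 = conditional_expectation(joint_pmf, 0)
e_given_3 = conditional_expectation(joint_pmf, 3)
print(e_given_0, e_given_3)
```

Exact fractions are used so that the weights sum to one without rounding error. With this hypothetical pmf, conditioning on $Y=0$ leaves the values 1 and 2 equally likely, so the conditional mean is 3/2.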

Conditional expectation of a continuous random variable

Let us now tackle the case in which $X$ and $Y$ are continuous random variables, forming a continuous random vector.

The formula for the conditional mean of $X$ given $Y=y$ involves an integral, which can be thought of as the limiting case of the summation $\sum_{x\in R_{X}}x\,p_{X\mid Y=y}(x)$ found in the discrete case above.

Definition Let $X$ and $Y$ be two continuous random variables. Let $R_{X}$ be the support of $X$ and let $f_{X\mid Y=y}(x)$ be the conditional probability density function of $X$ given $Y=y$. The conditional expectation of $X$ given $Y=y$ is $$\mathrm{E}[X\mid Y=y]=\int_{-\infty}^{\infty}x\,f_{X\mid Y=y}(x)\,dx$$ provided that $$\int_{-\infty}^{\infty}|x|\,f_{X\mid Y=y}(x)\,dx<\infty.$$

If you do not understand why an integration is required and why the finiteness condition above (absolute integrability) is imposed, you can find an explanation in the lecture entitled Expected value.

Example Let the support of the random vector be and its joint probability density function be [eq18] Let us compute the conditional probability density function of given . The support of isWhen , the marginal probability density function of is ; when , the marginal probability density function is [eq22] Thus, the marginal probability density function of is [eq23] When evaluated at , it is [eq24] The support of isThus, the conditional probability density function of given is [eq26] The conditional expected value of given is [eq27]
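The continuous case can be approximated numerically in the same spirit. The sketch below writes $X$ for the variable being averaged and $Y$ for the conditioning variable, and assumes a hypothetical joint density equal to $x+y$ on the unit square (not the density of the example above). It divides the integral of $x$ times the joint density by the marginal density of $Y$, using a simple midpoint rule. The helper names `joint_density`, `integrate` and `conditional_expectation` are illustrative.

```python
def joint_density(x, y):
    """Hypothetical joint density: f(x, y) = x + y on the unit square, 0 elsewhere."""
    return x + y if 0.0 <= x <= 1.0 and 0.0 <= y <= 1.0 else 0.0


def integrate(g, a, b, n=10_000):
    """Midpoint-rule approximation of the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def conditional_expectation(y):
    """Approximate E[X | Y = y] as (integral of x f(x, y) dx) / f_Y(y)."""
    # Marginal density of Y at y, obtained by integrating x out of the joint density.
    marginal_y = integrate(lambda x: joint_density(x, y), 0.0, 1.0)
    if marginal_y == 0.0:
        raise ValueError("the marginal density of Y is zero at this point")
    numerator = integrate(lambda x: x * joint_density(x, y), 0.0, 1.0)
    return numerator / marginal_y


approx = conditional_expectation(0.5)
print(approx)
```

For this hypothetical density the result can be checked by hand: the marginal density of $Y$ at $y$ is $1/2+y$ and the numerator is $1/3+y/2$, so at $y=1/2$ the conditional expectation is $7/12$.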

Conditional expectation in general

The general formula for the conditional expectation of $X$ given $Y=y$ does not require that the two variables form a discrete or a continuous random vector; it is applicable to any random vector.

Definition Let $F_{X\mid Y=y}(x)$ be the conditional distribution function of $X$ given $Y=y$. The conditional expectation of $X$ given $Y=y$ is $$\mathrm{E}[X\mid Y=y]=\int_{-\infty}^{\infty}x\,dF_{X\mid Y=y}(x)$$ where the integral is a Riemann-Stieltjes integral and the expected value exists and is well-defined only as long as the integral is well-defined.

The above formula follows the same logic as the formula for the expected value, with the only difference that the unconditional distribution function $F_{X}(x)$ has now been replaced with the conditional distribution function $F_{X\mid Y=y}(x)$.

If you are puzzled by these formulae, you can go back to the lecture on the Expected value, which provides an intuitive introduction to the Riemann-Stieltjes integral.
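To see how the general formula covers distributions that are neither discrete nor continuous, the following Python sketch approximates the Riemann-Stieltjes integral of $x$ with respect to a conditional distribution function by a finite sum. The distribution function `conditional_cdf` is a hypothetical mixed one (probability 1/2 on the point 0, the rest spread uniformly over the interval from 0 to 1), and the function names and the truncation of the integration range to $[-1,2]$ are assumptions made for illustration.

```python
def conditional_cdf(x):
    """Hypothetical conditional distribution function of X given Y = y.

    Mixed distribution: probability 1/2 on the point 0, and the remaining
    1/2 spread uniformly over the interval (0, 1).
    """
    if x < 0.0:
        return 0.0
    if x < 1.0:
        return 0.5 + 0.5 * x
    return 1.0


def stieltjes_expectation(cdf, a, b, n=30_000):
    """Approximate the integral of x dF(x) over [a, b] by a Riemann-Stieltjes sum."""
    h = (b - a) / n
    total = 0.0
    prev = cdf(a)
    for i in range(1, n + 1):
        x_i = a + i * h
        cur = cdf(x_i)
        # Right endpoint of each sub-interval times the increment of F over it.
        total += x_i * (cur - prev)
        prev = cur
    return total


approx = stieltjes_expectation(conditional_cdf, -1.0, 2.0)
print(approx)
```

For this hypothetical distribution the exact value is $0\cdot\tfrac{1}{2}+\tfrac{1}{2}\int_{0}^{1}x\,dx=\tfrac{1}{4}$: the jump of the distribution function at 0 contributes like a probability mass, and the smooth part contributes like a density.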

Properties of conditional expectation

From the above sections, it should be clear that the conditional expectation is computed exactly as the expected value, with the only difference that probabilities and probability densities are replaced by conditional probabilities and conditional probability densities.

Therefore, the properties enjoyed by the expected value, such as linearity, are also enjoyed by the conditional expectation.
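As an illustration of linearity, the sketch below checks that $\mathrm{E}[aX+b\mid Y=y]=a\,\mathrm{E}[X\mid Y=y]+b$ on a small discrete case, where $X$ is the variable being averaged and $Y$ the conditioning variable. The joint probability mass function, the constants `a` and `b`, and the helper name `cond_exp` are all hypothetical choices made for this check.

```python
from fractions import Fraction

# Hypothetical joint pmf of (X, Y): keys are (x, y) pairs, values are probabilities.
joint_pmf = {
    (1, 0): Fraction(1, 4),
    (2, 0): Fraction(1, 4),
    (0, 3): Fraction(1, 2),
}


def cond_exp(g, y):
    """Return E[g(X) | Y = y] under the hypothetical joint pmf above."""
    p_y = sum(p for (_, y_val), p in joint_pmf.items() if y_val == y)
    return sum(g(x) * p / p_y for (x, y_val), p in joint_pmf.items() if y_val == y)


a, b = 3, -2
lhs = cond_exp(lambda x: a * x + b, 0)   # E[aX + b | Y = 0]
rhs = a * cond_exp(lambda x: x, 0) + b   # a E[X | Y = 0] + b
print(lhs, rhs)
```

Both sides are computed with exact fractions, so they can be compared with `==` rather than with a tolerance.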

Law of iterated expectations

Before knowing the realization of $Y$, the conditional expectation of $X$ given $Y$ is unknown and can itself be regarded as a random variable. We denote it by $\mathrm{E}[X\mid Y]$.

In other words, $\mathrm{E}[X\mid Y]$ is a random variable such that its realization equals $\mathrm{E}[X\mid Y=y]$ when $y$ is the realization of $Y$.

This random variable satisfies a very important property, known as the law of iterated expectations (or tower property): $$\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]=\mathrm{E}[X].$$

Proof

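For discrete random variables this is proved as follows, where $R_{Y}$ denotes the support of $Y$, $p_{Y}$ its marginal probability mass function and $p_{XY}$ the joint probability mass function: $$\begin{aligned}\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]&=\sum_{y\in R_{Y}}\mathrm{E}[X\mid Y=y]\,p_{Y}(y)\\&=\sum_{y\in R_{Y}}\Big(\sum_{x\in R_{X}}x\,p_{X\mid Y=y}(x)\Big)p_{Y}(y)\\&=\sum_{y\in R_{Y}}\sum_{x\in R_{X}}x\,p_{XY}(x,y)\\&=\sum_{x\in R_{X}}x\sum_{y\in R_{Y}}p_{XY}(x,y)\\&=\sum_{x\in R_{X}}x\,p_{X}(x)=\mathrm{E}[X].\end{aligned}$$ For continuous random variables the proof is analogous, with densities in place of probability mass functions and integrals in place of sums: $$\begin{aligned}\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]&=\int_{-\infty}^{\infty}\mathrm{E}[X\mid Y=y]\,f_{Y}(y)\,dy\\&=\int_{-\infty}^{\infty}\Big(\int_{-\infty}^{\infty}x\,f_{X\mid Y=y}(x)\,dx\Big)f_{Y}(y)\,dy\\&=\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}x\,f_{XY}(x,y)\,dx\,dy\\&=\int_{-\infty}^{\infty}x\Big(\int_{-\infty}^{\infty}f_{XY}(x,y)\,dy\Big)dx\\&=\int_{-\infty}^{\infty}x\,f_{X}(x)\,dx=\mathrm{E}[X].\end{aligned}$$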
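The law of iterated expectations can be verified on a small discrete case. The sketch below writes $X$ for the variable being averaged and $Y$ for the conditioning variable, and uses a hypothetical joint probability mass function. It computes the conditional expectation for each value of $Y$, averages those values with the marginal probabilities of $Y$, and compares the result with the unconditional mean of $X$. All names (`joint_pmf`, `marginal_y`, `cond_exp`) are illustrative.

```python
from fractions import Fraction

# Hypothetical joint pmf of (X, Y): keys are (x, y) pairs, values are probabilities.
joint_pmf = {
    (1, 0): Fraction(1, 4),
    (2, 0): Fraction(1, 4),
    (0, 3): Fraction(1, 2),
}

# Support of Y: every y that appears with positive probability.
support_y = {y for (_, y) in joint_pmf}


def marginal_y(y):
    """Marginal pmf of Y evaluated at y."""
    return sum(p for (_, y_val), p in joint_pmf.items() if y_val == y)


def cond_exp(y):
    """E[X | Y = y], computed as sum_x x p(x, y) / p_Y(y)."""
    return sum(x * p for (x, y_val), p in joint_pmf.items() if y_val == y) / marginal_y(y)


# E[E[X | Y]]: average the conditional means using the marginal pmf of Y.
iterated = sum(cond_exp(y) * marginal_y(y) for y in support_y)

# E[X]: computed directly from the joint pmf.
unconditional = sum(x * p for (x, _), p in joint_pmf.items())

print(iterated, unconditional)
```

With this hypothetical pmf the conditional means are 3/2 (when $Y=0$) and 0 (when $Y=3$), each occurring with probability 1/2, so both quantities equal 3/4.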

Solved exercises

Below you can find some exercises with explained solutions.

Exercise 1

Let be a discrete random vector with support and joint probability mass function [eq41]

What is the conditional expectation of given ?

Solution

Let us compute the conditional probability mass function of given . The marginal probability mass function of evaluated at is [eq42] The support of isThus, the conditional probability mass function of given is [eq44] The conditional expectation of given is [eq45]

Exercise 2

Suppose that is a continuous random vector with support and joint probability density function [eq48]

Compute the expected value of conditional on .

Solution

We first need to compute the conditional probability density function of given , by using the formula [eq49] Note that, by using indicator functions, we can writeThe marginal probability density function is obtained by marginalizing the joint density: [eq52] When evaluated at , it isFurthermore, Thus, the conditional probability density function of given is [eq55] The conditional expectation of given is [eq56]

Exercise 3

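Let $X$ and $Y$ be two random variables.

Remember that the variance of $X$ can be computed as $$\mathrm{Var}[X]=\mathrm{E}[X^{2}]-\mathrm{E}[X]^{2}.$$

In a similar manner, the conditional variance of $X$, given $Y=y$, can be defined as $$\mathrm{Var}[X\mid Y=y]=\mathrm{E}[X^{2}\mid Y=y]-\mathrm{E}[X\mid Y=y]^{2}.$$

Use the law of iterated expectations to prove that $$\mathrm{Var}[X]=\mathrm{E}\big[\mathrm{Var}[X\mid Y]\big]+\mathrm{Var}\big[\mathrm{E}[X\mid Y]\big].$$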

Solution

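This is proved as follows:

$$\begin{aligned}\mathrm{Var}[X]&=\mathrm{E}[X^{2}]-\mathrm{E}[X]^{2}\\&\overset{(A)}{=}\mathrm{E}\big[\mathrm{E}[X^{2}\mid Y]\big]-\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]^{2}\\&\overset{(B)}{=}\mathrm{E}\big[\mathrm{Var}[X\mid Y]+\mathrm{E}[X\mid Y]^{2}\big]-\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]^{2}\\&\overset{(C)}{=}\mathrm{E}\big[\mathrm{Var}[X\mid Y]\big]+\mathrm{E}\big[\mathrm{E}[X\mid Y]^{2}\big]-\mathrm{E}\big[\mathrm{E}[X\mid Y]\big]^{2}\\&\overset{(D)}{=}\mathrm{E}\big[\mathrm{Var}[X\mid Y]\big]+\mathrm{Var}\big[\mathrm{E}[X\mid Y]\big]\end{aligned}$$ where: in step (A) we have used the law of iterated expectations; in step (B) we have used the formula $\mathrm{Var}[X\mid Y]=\mathrm{E}[X^{2}\mid Y]-\mathrm{E}[X\mid Y]^{2}$; in step (C) we have used the linearity of the expected value; in step (D) we have used the formula $\mathrm{Var}[Z]=\mathrm{E}[Z^{2}]-\mathrm{E}[Z]^{2}$ with $Z=\mathrm{E}[X\mid Y]$.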
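The decomposition proved in this exercise (the variance equals the expected conditional variance plus the variance of the conditional expectation) can be checked numerically. The sketch below does so for a hypothetical joint probability mass function, writing $X$ for the variable whose variance is decomposed and $Y$ for the conditioning variable. The helper names are illustrative assumptions.

```python
from fractions import Fraction

# Hypothetical joint pmf of (X, Y): keys are (x, y) pairs, values are probabilities.
joint_pmf = {
    (1, 0): Fraction(1, 4),
    (2, 0): Fraction(1, 4),
    (0, 3): Fraction(1, 2),
}

support_y = {y for (_, y) in joint_pmf}


def marginal_y(y):
    """Marginal pmf of Y evaluated at y."""
    return sum(p for (_, y_val), p in joint_pmf.items() if y_val == y)


def cond_exp(g, y):
    """E[g(X) | Y = y]."""
    return sum(g(x) * p for (x, y_val), p in joint_pmf.items() if y_val == y) / marginal_y(y)


def cond_var(y):
    """Var[X | Y = y] = E[X^2 | Y = y] - E[X | Y = y]^2."""
    return cond_exp(lambda x: x * x, y) - cond_exp(lambda x: x, y) ** 2


# Left-hand side: Var[X] = E[X^2] - E[X]^2, from the joint pmf.
mean_x = sum(x * p for (x, _), p in joint_pmf.items())
var_x = sum(x * x * p for (x, _), p in joint_pmf.items()) - mean_x ** 2

# Right-hand side, first term: E[Var[X | Y]].
expected_cond_var = sum(cond_var(y) * marginal_y(y) for y in support_y)

# Right-hand side, second term: Var[E[X | Y]] = E[E[X | Y]^2] - E[E[X | Y]]^2.
mean_of_cond_exp = sum(cond_exp(lambda x: x, y) * marginal_y(y) for y in support_y)
var_of_cond_exp = (
    sum(cond_exp(lambda x: x, y) ** 2 * marginal_y(y) for y in support_y)
    - mean_of_cond_exp ** 2
)

print(var_x, expected_cond_var + var_of_cond_exp)
```

For this hypothetical pmf the two terms on the right-hand side are 1/8 and 9/16, which add up to the variance 11/16 computed directly.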

How to cite

Please cite as:

Taboga, Marco (2021). "Conditional expectation", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/fundamentals-of-probability/conditional-expectation.