Search for probability and statistics terms on Statlect
StatLect

Distribution function

by , PhD

What is the probability that the realization of a random variable will be less than or equal to a certain threshold value?

The distribution function of a random variable allows us to answer exactly this question.

Its value at a given point is equal to the probability of observing a realization of the random variable below that point or equal to that point.

Table of Contents

Synonyms

The distribution function is also often called cumulative distribution function (abbreviated as cdf).

Definition

The following is a formal definition.

Definition If X is a random variable, its distribution function is a function [eq1] such that[eq2]where [eq3] is the probability that X is less than or equal to x.

Example

Suppose that a random variable can take only two values (0 and 1), each with probability 1/2.

Its distribution function is[eq4]

Here is a plot of the function.

Plot of a distribution function highlighting the points where the function jumps.

Properties

Every distribution function enjoys the following four properties:

  1. Increasing. [eq5] is increasing, i.e., [eq6]

  2. Right-continuous. [eq7] is right-continuous, i.e.,[eq8]for any $xin U{211d} $;

  3. Limit at minus infinity. [eq7] satisfies [eq10]

  4. Limit at plus infinity. [eq7] satisfies [eq12]

Concise proofs of these properties can be found here and in Williams (1991).

Proper distribution function

Any distribution function enjoys the four properties above.

Moreover, for any given function enjoying these four properties, it is possible to define a random variable that has the given function as its distribution function (for a proof, see Williams 1991, Sec. 3.11).

The practical consequence of this fact is that, when we need to check whether a given function is a proper distribution function, we just need to verify that it satisfies the four properties above.

How to derive the cdf in the discrete case

When the random variable X is discrete, the cdf can be derived as[eq13]where:

This can be quickly done with a table.

Example

Suppose that the probability mass function of X is[eq15]

Then, we can set up a table that has three rows.

In the first row, we write the possible values of X, sorted from smallest to largest.

In the second row, we write the probabilities of the single values.

The third row contains the values of the cdf.

The leftmost cell in the third row is equal to the cell immediately above.

Then, we go from left to right and the value in each cell is set equal to the sum of:

  1. the probability in the cell immediately to the left;

  2. the probability in the cell immediately above.

[eq16]

Thus, the distribution function is[eq17]

How to derive the cdf in the continuous case

When the random variable is continuous, its cdf can be computed as[eq18]where [eq19] is the probability density function of X.

Example

The simplest example is probably the cdf of the uniform distribution.

The probability density function of a random variable having uniform distribution on the interval $left[ 0,1
ight] $ is[eq20]where [eq21] is an indicator function that takes value 1 on the interval $left[ 0,1
ight] $ and value 0 everywhere else.

There are three cases:

  1. if $x<0$, then[eq22]

  2. if [eq23], then[eq24]

  3. if $xgeq 1$, then[eq25]

Therefore, the cdf is[eq26]

More details

More details about the distribution function can be found in the lecture on Random variables.

References

Williams, D., 1991. Probability with martingales. Cambridge university press.

Keep reading the glossary

Previous entry: Discrete random vector

Next entry: Estimator

How to cite

Please cite as:

Taboga, Marco (2021). "Distribution function", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/glossary/distribution-function.

The books

Most of the learning materials found on this website are now available in a traditional textbook format.