StatlectThe Digital Textbook

# Binomial distribution

Consider an experiment having two possible outcomes: either success or failure. Suppose the experiment is repeated several times and the repetitions are independent of each other. The total number of experiments where the outcome turns out to be a success is a random variable whose distribution is called binomial distribution. The distribution has two parameters: the number of repetitions of the experiment and the probability of success of an individual experiment.

A binomial distribution can be seen as a sum of mutually independent Bernoulli random variables that take value 1 in case of success of the experiment and value 0 otherwise. This connection between the binomial and Bernoulli distributions will be illustrated in detail in the remainder of this lecture and will be used to prove several properties of the binomial distribution.

## Definition

The binomial distribution is characterized as follows:

Definition Let be a discrete random variable. Let and . Let the support of be:We say that has a binomial distribution with parameters and if its probability mass function is:where is a binomial coefficient.

The following is a proof that is a legitimate probability mass function.

Proof

Non-negativity is obvious. We need to prove that the sum of over its support equals . This is proved as follows:where we have used the usual formula for binomial expansions:

## Relation to the Bernoulli distribution

The binomial distribution is intimately related to the Bernoulli distribution. The following propositions show how.

Proposition If a random variable has a binomial distribution with parameters and , with , then has a Bernoulli distribution with parameter .

Proof

The probability mass function of is:but:and:Therefore, the probability mass function can be written as:which is the probability mass function of a Bernoulli random variable.

Proposition If a random variable has a binomial distribution with parameters and , then is a sum of jointly independent Bernoulli random variables with parameter .

Proof

We prove it by induction. So, we have to prove that it is true for and for a generic , given that it is true for . For , it has been proved in the proposition above (the binomial distribution with parameter is a Bernoulli distribution). Now, suppose the claim is true for a generic . We have to verify that is a binomial random variable, where:and , , , are independent Bernoulli random variables. Since the claim is true for , this is tantamount to verifying that:is a binomial random variable, where has a binomial distribution with parameters and Using the convolution formula, we can compute the probability mass function of : If , then:where the last equality is the recursive formula for binomial coefficients. If , then:Finally, if , then:Therefore, for we have:and:which is the probability mass function of a binomial random variable with parameters and . This completes the proof.

## Expected value

The expected value of a binomial random variable is:

Proof

It can be derived as follows:

## Variance

The variance of a binomial random variable is:

Proof

Representing as a sum of jointly independent Bernoulli random variables:

## Moment generating function

The moment generating function of a binomial random variable is defined for any :

Proof

This is proved as follows:Since the moment generating function of a Bernoulli random variable exists for any , also the moment generating function of a binomial random variable exists for any .

## Characteristic function

The characteristic function of a binomial random variable is:

Proof

Again, we are going to use the fact that a binomial random variable with parameter is a sum of independent Bernoulli random variables:

## Distribution function

The distribution function of a binomial random variable is:where is the floor of , i.e. the largest integer not greater than .

Proof

For , , because cannot be smaller than . For , , because is always smaller than or equal to . For :

Values of are usually computed by computer algorithms. For example, the MATLAB command:

`binocdf(x,n,p)`

returns the value of the distribution function at the point `x` when the parameters of the distribution are `n` and `p`.

## Solved exercises

Below you can find some exercises with explained solutions:

1. Exercise set 1 (compute quantities related to the binomial distribution)

The book

Most learning materials found on this website are now available in a traditional textbook format.

Glossary entries
Share