Geometric Distribution, Definition and Solved Examples

Written by Prerit Jain

Updated on: 29 Jul 2023

The geometric distribution is a discrete probability distribution that gives the probability of the number of Bernoulli trials needed to obtain the first success, that is, a run of successive failures followed by a single success.

Numerous situations in everyday life utilize geometric distribution. For instance, in the financial sector, a cost-benefit analysis is performed using geometric distribution to determine the financial rewards of a certain course of action.

 What is geometric distribution?

The probability of achieving success for the first time after a series of failures can be expressed as a discrete probability distribution called a geometric distribution. There is no upper limit on the number of trials: they continue, in principle indefinitely, until the first success occurs.

A similar distribution is the binomial distribution. In both, each trial has only two outcomes (success or failure) and the probability of success is the same for each trial. The difference between the geometric distribution and the binomial distribution is as follows:

  • Geometric Distribution: The only success that matters in a geometric distribution is the first one. The random variable, X, keeps track of how many attempts are necessary to get the first success.
  • Binomial Distribution: The random variable, X, counts the number of successes in the fixed number of trials that make up a binomial distribution.
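
To make the contrast concrete, here is a minimal Python sketch. The recorded outcomes and the helper names `trials_until_first_success` and `count_successes` are illustrative assumptions, not part of any library.

```python
# Illustrative record of six trials: True = success, False = failure.
outcomes = [False, False, False, True, False, True]


def trials_until_first_success(results):
    """Geometric-style count: 1-based position of the first success."""
    for trial_number, success in enumerate(results, start=1):
        if success:
            return trial_number
    return None  # no success was observed in this record


def count_successes(results):
    """Binomial-style count: number of successes in a fixed set of trials."""
    return sum(1 for success in results if success)


print(trials_until_first_success(outcomes))  # 4
print(count_successes(outcomes))             # 2
```

The same record of trials yields two different random variables: the geometric one stops caring after the first success, while the binomial one looks at all six trials.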

Characteristics of geometric distribution

A geometric distribution is the discrete probability distribution of a random variable X when the following conditions hold:

  • The experiment consists of a series of independent trials.
  • Each trial has only two outcomes, success or failure.
  • The probability of success is the same for each trial.

Some of the real-life applications of geometric distribution are:

  • In sports, namely baseball, a geometric distribution may be used to determine the likelihood that a hitter would receive a hit before receiving three strikes; in this case, the objective is to succeed within three tries.
  • In time management, the goal is to complete a task before some set amount of time.
  • In cost-benefit analyses, such as a company deciding whether to fund research trials that, if successful, will earn the company some estimated profit, the goal is to reach success before the cost outweighs the potential gain.

The number of Bernoulli trials necessary to achieve the first success is shown by the random variable in the discrete probability distribution known as the geometric distribution.

The outcomes of events that can be counted or are finite are included in a discrete probability distribution. This differs from a continuous distribution, in which results can occur anywhere along a continuum. The binomial, Poisson, and Bernoulli distributions are some examples of discrete distributions.

Properties of geometric distribution

We take the probability of success to be p.

As defined above, the geometric distribution models the number of trials needed to obtain the first success in a series of independent trials.

  • Mean

The mean of the geometric distribution is its expected value. The expected value of a random variable, X, is the weighted average of all the values X can take, with each value weighted by its probability.

The formula of the mean is

                                                      

    \[E\left[ X \right]=\frac{1}{p}\]
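
A short derivation sketch of this formula, writing \(q=1-p\) for the probability of failure and assuming the standard power-series identity \(\sum_{x=1}^{\infty }x{{q}^{x-1}}=\frac{1}{{{(1-q)}^{2}}}\) for \(0\le q<1\):

    \[E\left[ X \right]=\sum\limits_{x=1}^{\infty }{x\,{{q}^{x-1}}p}=p\cdot \frac{1}{{{(1-q)}^{2}}}=\frac{p}{{{p}^{2}}}=\frac{1}{p}\]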

  • Variance

The variance is the measure of the dispersion from the mean.

The formula for the variance of geometric distribution is

     

    \[Var\left[ X \right]=\frac{\left( 1-p \right)}{{{p}^{2}}}\]
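
The mean and variance formulas can be checked numerically. The sketch below is only an illustration: the function name `geometric_moments`, the cut-off `max_x`, and the value p = 0.2 are assumptions chosen for the example. It sums the probabilities of the first success occurring on trial 1, 2, 3, and so on, and compares the result with the closed forms.

```python
def geometric_moments(p, max_x=10_000):
    """Approximate E[X] and Var[X] by summing over trials 1..max_x."""
    q = 1 - p
    mean = 0.0
    second_moment = 0.0
    weight = p  # P(X = 1) = p; each further trial multiplies by q
    for x in range(1, max_x + 1):
        mean += x * weight
        second_moment += x * x * weight
        weight *= q
    return mean, second_moment - mean ** 2


p = 0.2  # assumed example value
mean, variance = geometric_moments(p)
print(mean, 1 / p)                # both close to 5
print(variance, (1 - p) / p ** 2)  # both close to 20
```

The two numbers on each printed line should agree up to floating-point rounding, because the part of the sum beyond `max_x` is negligible for this value of p.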

  • Probability mass function

The probability mass function gives the probability that a discrete random variable, X, is exactly equal to some value, x.

The probability mass function formula is

    \[P\left( X=x \right)={{(1-p)}^{x-1}}p\]
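
As a minimal sketch, the formula translates directly into a small Python function. The name `geometric_pmf`, the input checks, and the example value p = 0.25 are assumptions made for illustration.

```python
def geometric_pmf(x, p):
    """P(X = x): the first success occurs on trial x."""
    if not 0 < p <= 1:
        raise ValueError("p must satisfy 0 < p <= 1")
    if x < 1:
        return 0.0  # X counts trials, so the smallest possible value is 1
    return (1 - p) ** (x - 1) * p


# Assumed example: p = 0.25, probabilities for the first four trials.
for x in range(1, 5):
    print(x, geometric_pmf(x, 0.25))
```

Each extra failure multiplies the probability by (1 - p), so the values shrink geometrically, which is where the distribution gets its name.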

  • Negative binomial distribution relation

The geometric distribution is a special case of the negative binomial distribution. It deals with the number of trials required for a single success. Thus, the geometric distribution is a negative binomial distribution where the number of successes (\(r\)) is equal to 1.
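
This special case can be checked with a short sketch. It assumes the trial-counting form of the negative binomial distribution, in which the probability that the r-th success occurs on trial n is C(n-1, r-1) p^r (1-p)^(n-r); the function names and the value p = 0.3 are illustrative.

```python
from math import comb


def negative_binomial_pmf(n, r, p):
    """P(the r-th success occurs on trial n), for n >= r >= 1."""
    if n < r:
        return 0.0
    return comb(n - 1, r - 1) * p ** r * (1 - p) ** (n - r)


def geometric_pmf(x, p):
    """P(the first success occurs on trial x)."""
    return (1 - p) ** (x - 1) * p if x >= 1 else 0.0


p = 0.3  # assumed example value
for x in range(1, 6):
    # With r = 1 the binomial coefficient is 1, so both formulas coincide.
    print(x, negative_binomial_pmf(x, 1, p), geometric_pmf(x, p))
```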

Applications of geometric distribution

Some of the main examples of geometric distribution are:

  1. Cost-benefit applications
    • Organizations can use a geometric probability distribution in cost-benefit analysis. The goal of a cost-benefit analysis is to calculate the financial gain that an organization would experience from taking a given course of action, after deducting the associated costs. This lessens the likelihood of capital loss and supports the organization’s decision-making.
  2. Tossing a coin
    • Tossing a coin is a classic Bernoulli trial. Let’s say that when a coin is tossed, getting heads is deemed a success and getting tails is considered a failure. The geometric distribution then gives the probability of the number of tosses needed before the coin first lands heads.
  3. Other applications:
  • Feedback from Customers
  • Number of Supporters of a Law
  • Number of Faulty Products Manufactured in an Industry
  • Number of Bugs in a Code 
  • A Teacher Examining Test Records
  • Playing a Game
  • Throwing Darts at a Dartboard 
  • Number of Network Failures 
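
The coin-tossing application above can be simulated in a few lines. This is a rough sketch under stated assumptions: a fair coin (probability of heads 0.5), a seed of 0 chosen arbitrarily so the run is repeatable, and a hypothetical helper name `tosses_until_first_head`.

```python
import random


def tosses_until_first_head(rng, p_heads=0.5):
    """Toss a coin until it lands heads; return the number of tosses made."""
    tosses = 1
    while rng.random() >= p_heads:  # this toss was tails, so toss again
        tosses += 1
    return tosses


rng = random.Random(0)  # fixed seed (an arbitrary choice) for repeatability
samples = [tosses_until_first_head(rng) for _ in range(100_000)]

average = sum(samples) / len(samples)
share_first_toss = samples.count(1) / len(samples)
print(average)           # should be close to 1 / 0.5 = 2
print(share_first_toss)  # should be close to P(X = 1) = 0.5
```

With this many simulated experiments, the average number of tosses should settle near the mean 1/p of the geometric distribution.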

Identifying the geometric distribution

A geometric distribution can be identified by three key conditions:

  • There are only two possible outcomes for each trial (success or failure).
  • The trials are independent.
  • The probability of success is the same for each trial.

If an experiment satisfies all of the above conditions, and the quantity of interest is the number of trials up to the first success, it can be modeled by a geometric distribution.

The geometric distribution is only concerned with the first success in the trials made.

The geometric probability distribution

 The geometric distribution is a special case of the negative binomial distribution. It deals with the number of trials required for a single success. Thus, the geometric distribution is a negative binomial distribution where the number of successes (r) is equal to 1.

The formula is:

                                                      

    \[P(X=x)=p\times {{q}^{x-1}}\]

p is the probability of success for each trial

q is the probability of failure for each trial, which is (1-p)

x is the number of the trial on which the first success occurs, so it is preceded by x-1 failures

P(X=x) is the probability that the first success occurs on trial x.

Standard deviation in geometric distribution

The standard deviation is the square root of the variance. Like the variance, it measures how far the distribution spreads from its mean.

The following is the formula for a geometric distribution’s standard deviation:

                                                         

    \[S.D=\frac{\sqrt{1-p}}{p}\]
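
A small sketch of this formula in Python follows. The function name `geometric_sd` and the sample values of p are assumptions for illustration only.

```python
from math import sqrt


def geometric_sd(p):
    """Standard deviation sqrt(1 - p) / p of a geometric distribution."""
    if not 0 < p <= 1:
        raise ValueError("p must satisfy 0 < p <= 1")
    return sqrt(1 - p) / p


# Assumed example values of p.
for p in (0.1, 0.2, 0.5):
    print(p, round(geometric_sd(p), 4))
```

Smaller success probabilities give a larger standard deviation, reflecting that the wait for the first success becomes both longer and less predictable.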

The probability of the geometric random variable

The geometric random variable is used when modeling a series of independent experiments, each with one of two possible outcomes: success or failure. The geometric random variable gives the number of experiments performed up to and including the first success.

The probability mass function for the geometric random variable is given by:

    \[{{f}_{X}}(x)=P(X=x)={{(1-p)}^{x-1}}p\]

The variance for this variable is

    \[Var\left[ X \right]=\frac{\left( 1-p \right)}{{{p}^{2}}}\]

The cumulative geometric distribution

The cumulative distribution function (CDF) of a random variable, X, evaluated at a point, x, is the probability that X takes a value less than or equal to x. It is also called the distribution function.

The formula for the geometric distribution CDF is given as follows:

    \[P(X\le x)=1-{{(1-p)}^{x}}\]
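
One way to see why the CDF has this form is to add up the probability mass function for trials 1 through x and compare the total with the closed form. The sketch below does this; the function names and the value p = 0.2 are assumptions for the example.

```python
def geometric_pmf(k, p):
    """P(X = k): the first success occurs on trial k."""
    return (1 - p) ** (k - 1) * p


def geometric_cdf(x, p):
    """P(X <= x) = 1 - (1 - p)^x: at least one success in the first x trials."""
    if x < 1:
        return 0.0
    return 1 - (1 - p) ** x


p = 0.2  # assumed example value
for x in range(1, 6):
    running_total = sum(geometric_pmf(k, p) for k in range(1, x + 1))
    print(x, round(running_total, 6), round(geometric_cdf(x, p), 6))
```

The two printed columns should match, since (1 - p)^x is the probability that all of the first x trials fail.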

Solved Examples

Example 1: Suppose we are playing a game of soccer and the probability of scoring a goal on any attempt is 0.2. What is the probability that we score our first goal on the third try?

Solution:

Given, \(p=0.2\) and \(x=3\).

Using the formula

    \[P(X=x)=p\times {{q}^{x-1}}\]

    \[q=(1-p)=(1-0.2)=0.8\]

Substituting the values, we get

    \[P(X=3)=0.2\times {{0.8}^{3-1}}\]

    \begin{align*}
      & P(X=3)=0.2\times {{0.8}^{2}} \\
      & P(X=3)=0.2\times 0.64 \\
      & P(X=3)=0.128 \\
    \end{align*}

Hence the probability we get a goal in the third try is 0.128

Example 2: In a cart of apples, 4 in every 40 apples are found to be rotten. When a person buying apples picks them one at a time, what is the probability that the first rotten apple is the 5th apple picked?

Solution 2:

We see that four out of every forty are rotten hence, \(p=\frac{4}{40}=\frac{1}{10}=0.1\)

Given, \(x=5\)

    \[q=(1-p)=(1-0.1)=0.9\]

Using the formula and substituting the values we get,

    \[P(X=5)=0.1\times {{0.9}^{5-1}}\]

    \begin{align*}
      & P(X=5)=0.1\times {{0.9}^{4}} \\
      & P(X=5)=0.1\times 0.6561 \\
      & P(X=5)=0.06561 \\
    \end{align*}

Hence the probability that the person would pick a rotten apple on the 5th try is 0.06561
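
The arithmetic in Examples 1 and 2 can be double-checked with a few lines of Python. This is only a convenience sketch; the function name `geometric_pmf` and the variable names are assumptions.

```python
def geometric_pmf(x, p):
    """P(X = x) = p * q^(x - 1), with q = 1 - p."""
    q = 1 - p
    return p * q ** (x - 1)


example_1 = geometric_pmf(3, 0.2)  # first goal on the third try
example_2 = geometric_pmf(5, 0.1)  # first rotten apple on the fifth pick
print(round(example_1, 5))  # 0.128
print(round(example_2, 5))  # 0.06561
```

The results are rounded before printing because decimal values such as 0.2 and 0.1 are not stored exactly in binary floating point.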

Example 3: A person selling t-shirts by phone observes that 3 out of 63 people who answer want to buy a t-shirt. On average, how many calls does another person selling the same t-shirts need to make to get a first sale?

Solution 3:

As we want the number of calls needed to make the first sale, we need to calculate the expected value of the variable, which is also the mean of the distribution.

We have \(p=\frac{3}{63}=\frac{1}{21}\)

And \(E\left[ X \right]=\frac{1}{p}\)

Substituting the value, we get \(E\left[ X \right]=\frac{1}{\frac{1}{21}}=21\)

Hence the expected number of calls needed to make the first sale is 21.

Example 4: A die is rolled until a 6 occurs. What is the resulting geometric distribution for the first 3 throws?

Solution 4:

The probability of getting a 6 is \(\frac{1}{6}\)

Hence, \(p=\frac{1}{6}\) and \(q=1-p=\frac{5}{6}\)

Using the formula \(P(X=x)=p\times {{q}^{x-1}}\)

    \begin{align*}
      & P(X=1)=\frac{1}{6}\times {{\left( \frac{5}{6} \right)}^{0}} \\
      & P(X=1)=\frac{1}{6}\times 1 \\
      & P(X=1)=\frac{1}{6} \\
    \end{align*}

    \begin{align*}
      & P(X=2)=\frac{1}{6}\times {{\left( \frac{5}{6} \right)}^{1}} \\
      & P(X=2)=\frac{1}{6}\times \frac{5}{6} \\
      & P(X=2)=\frac{5}{36} \\
    \end{align*}

    \begin{align*}
      & P(X=3)=\frac{1}{6}\times {{\left( \frac{5}{6} \right)}^{2}} \\
      & P(X=3)=\frac{1}{6}\times \frac{25}{36} \\
      & P(X=3)=\frac{25}{216} \\
    \end{align*}

Therefore, the geometric distribution for the first three throws is

    \[P(X=1)=\frac{1}{6},\quad P(X=2)=\frac{5}{36},\quad P(X=3)=\frac{25}{216}\]
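
The same table of values can be reproduced with exact fractions. The sketch below uses Python's standard `fractions` module; the variable names are illustrative assumptions.

```python
from fractions import Fraction

p = Fraction(1, 6)  # probability of rolling a 6
q = 1 - p           # probability of any other face

# P(X = x) = p * q^(x - 1) for the first three throws, kept as exact fractions.
distribution = {x: p * q ** (x - 1) for x in range(1, 4)}
for x, probability in distribution.items():
    print(x, probability)  # 1 1/6, then 2 5/36, then 3 25/216

# Probability that a 6 appears within the first three throws.
print(sum(distribution.values()))  # 91/216
```

Working with fractions avoids rounding, and the final sum agrees with the cumulative formula 1 - (5/6)^3 = 91/216.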

Example 5: Suppose each voter you meet voted for the winning candidate with probability 0.5, independently of the others. What is the probability that the first voter you meet who voted for the winning candidate is the fourth voter you ask?

Solution 5:

We have, p=0.5

And x=4

    \[q=(1-p)=(1-0.5)=0.5\]

Substituting the values in the equation \(P(X=x)=p\times {{q}^{x-1}}\)

    \begin{align*}
      & P(X=4)=0.5\times {{0.5}^{3}} \\
      & P(X=4)=0.5\times 0.125 \\
      & P(X=4)=0.0625 \\
    \end{align*}

Therefore, the probability is 0.0625.

Conclusion

The geometric distribution is a discrete probability distribution where the random variable indicates the number of Bernoulli trials required to get the first success.

 Frequently asked questions (FAQs)

What is a discrete random variable?

A discrete variable is one that can take on finitely many, or countably infinitely many values.

What does independent trial mean?

Trials in an experiment are independent if the outcome of one trial does not affect the probabilities of the outcomes of any other trial.

What is negative binomial distribution?

A negative binomial distribution models the number of independent Bernoulli trials needed to obtain a fixed number, r, of successes. The rth success occurs on trial n when the preceding n - 1 trials contain exactly r - 1 successes.

What is the formula of the binomial distribution?

The binomial distribution gives the probability of obtaining exactly k successes in n independent trials, each with two possible outcomes and the same success probability p. Its formula is \(P(X=k)=\binom{n}{k}{{p}^{k}}{{(1-p)}^{n-k}}\).

What is probability?

A probability is a number that reflects the chance or likelihood that a particular event will occur. Probabilities can be expressed as proportions that range from 0 to 1.

