Geometric Distribution

Modeling the number of trials needed to get the first success.

"How Long Until It Hits?"

The Geometric distribution answers the question: "How many times do I have to try until I get my first success?" It models the number of independent Bernoulli trials required to achieve the first success.

In finance, this could model the number of trades you need to make until you have your first profitable one, or how many quarters it will take for a startup in your portfolio to finally turn a profit. The distribution is always right-skewed, because each additional trial required multiplies the probability by another factor of 1-p, so a small number of trials is always more likely than a large number.
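As a toy illustration of the trading example, the sketch below simulates a single "number of trades until the first profitable one" draw; the 25% win probability is an arbitrary assumption chosen for the example.

```python
import random

random.seed(42)
p = 0.25  # assumed probability that any single trade is profitable (illustrative only)

# Count trades until the first profitable one: a single Geometric(p) draw.
trades = 1
while random.random() >= p:  # a draw below p counts as a "success"
    trades += 1
print(trades)
```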

Interactive Geometric Distribution
Adjust the probability of success (p) to see how it affects the likelihood of achieving the first success on a given trial.
Mean (μ): 4.00
Variance (σ²): 12.00

Core Concepts

Probability Mass Function (PMF)
The PMF gives the probability that the first success occurs on exactly the k-th trial.
P(X = k) = (1-p)^{k-1} p

This formula is very intuitive. For the first success to occur on trial k, two things must happen:

  • You must have exactly k-1 failures in a row first. The probability of one failure is 1-p, so the probability of k-1 independent failures is (1-p)^{k-1}.
  • The k-th trial itself must be a success, which has a probability of p.

Multiplying these probabilities together gives the formula.
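This translates directly into code. Below is a minimal Python sketch of the PMF (the geometric_pmf helper is hypothetical); the cross-check assumes SciPy is installed, whose scipy.stats.geom uses the same "number of trials" convention with support starting at k = 1.

```python
from scipy.stats import geom  # optional cross-check; assumes SciPy is installed


def geometric_pmf(k: int, p: float) -> float:
    """P(X = k): exactly k-1 independent failures followed by one success."""
    if k < 1:
        return 0.0
    return (1 - p) ** (k - 1) * p


p = 0.25
for k in range(1, 6):
    print(f"k={k}: {geometric_pmf(k, p):.4f}  (scipy: {geom.pmf(k, p):.4f})")
```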

Cumulative Distribution Function (CDF)
The CDF gives the probability that the first success occurs on or before the k-th trial.
F(k) = P(X \le k) = 1 - (1-p)^k

The CDF can be derived from its complementary event: the probability of needing more than k trials for the first success. This only happens if the first k trials are all failures, an event with probability (1-p)^k. Therefore, the probability of needing k or fewer trials is 1 - (1-p)^k.
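The complement argument is easy to verify numerically. A short sketch (the geometric_cdf helper is hypothetical) comparing the closed form against a running sum of PMF values:

```python
def geometric_pmf(k: int, p: float) -> float:
    """P(X = k) = (1-p)^(k-1) * p for k >= 1."""
    return (1 - p) ** (k - 1) * p


def geometric_cdf(k: int, p: float) -> float:
    """P(X <= k) = 1 - (1-p)^k, i.e. 1 minus P(first k trials all fail)."""
    return 1 - (1 - p) ** k


p = 0.25
for k in (1, 2, 5, 10):
    summed = sum(geometric_pmf(i, p) for i in range(1, k + 1))
    print(f"k={k}: closed form {geometric_cdf(k, p):.6f}, summed PMF {summed:.6f}")
```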


Key Derivations

Deriving the Mean and Variance

Deriving the Expected Value (Mean)

Step 1: Set up the Infinite Series for E[X]

The expected value is the sum of each outcome k multiplied by its probability P(X=k). Let q = 1-p.

E[X] = \sum_{k=1}^{\infty} k \cdot P(X=k) = \sum_{k=1}^{\infty} k \, q^{k-1} p

We can pull the constant p out:

E[X] = p \sum_{k=1}^{\infty} k q^{k-1} = p(1 + 2q + 3q^2 + 4q^3 + \dots)

Step 2: Use the Geometric Series Derivative Trick

Recall the formula for an infinite geometric series (valid for |q| < 1): \sum_{k=0}^{\infty} q^k = \frac{1}{1-q}.

If we take the derivative of both sides with respect to q, we get:

\frac{d}{dq} \left( \sum_{k=0}^{\infty} q^k \right) = \sum_{k=1}^{\infty} k q^{k-1} = \frac{d}{dq} \left( \frac{1}{1-q} \right) = \frac{1}{(1-q)^2}

This gives us the value of the summation from Step 1.
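If the derivative trick feels abstract, a quick numerical check for a specific q (here q = 0.75, chosen arbitrarily) shows a truncated version of the series matching the closed form:

```python
# Check sum_{k>=1} k * q^(k-1) = 1/(1-q)^2 by truncating the infinite series.
q = 0.75
partial_sum = sum(k * q ** (k - 1) for k in range(1, 2000))  # the tail is negligible
closed_form = 1 / (1 - q) ** 2
print(partial_sum, closed_form)  # both ≈ 16.0
```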

Step 3: Substitute and Solve

Substitute this result back into the equation for E[X]:

E[X] = p \cdot \frac{1}{(1-q)^2}

Since q = 1-p, we have 1-q = p.

E[X] = p \cdot \frac{1}{p^2}
Final Mean Formula
E[X] = \frac{1}{p}
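A quick way to sanity-check this result is simulation. The sketch below assumes NumPy, whose geometric sampler also counts the number of trials up to and including the first success:

```python
import numpy as np

rng = np.random.default_rng(0)
p = 0.25

# Draw waiting times (number of trials until the first success, k = 1, 2, ...).
samples = rng.geometric(p, size=200_000)

print(samples.mean(), 1 / p)  # sample mean should be close to 1/p = 4.0
```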

Deriving the Variance

We use Var(X) = E[X^2] - (E[X])^2. A common trick is to first find E[X(X-1)] and then use it to find E[X^2].

Step 1: Calculate E[X(X-1)]

We set up another series:

E[X(X-1)] = \sum_{k=1}^{\infty} k(k-1) q^{k-1} p = p \sum_{k=2}^{\infty} k(k-1) q^{k-1}

This sum is the second derivative of the geometric series formula with respect to q, multiplied by q.

\frac{d^2}{dq^2} \left( \sum_{k=0}^{\infty} q^k \right) = \sum_{k=2}^{\infty} k(k-1) q^{k-2} = \frac{2}{(1-q)^3}

Therefore, the summation part is \sum_{k=2}^{\infty} k(k-1) q^{k-1} = \frac{2q}{(1-q)^3}.

E[X(X-1)] = p \cdot \frac{2q}{(1-q)^3} = p \cdot \frac{2q}{p^3} = \frac{2q}{p^2}
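As with the mean, this intermediate identity can be checked by truncating the series for a concrete value; for p = 0.25 (so q = 0.75), the target value 2q/p^2 is 24:

```python
# Check sum_{k>=2} k*(k-1)*q^(k-1) = 2q/(1-q)^3 and E[X(X-1)] = 2q/p^2.
p = 0.25
q = 1 - p

series = sum(k * (k - 1) * q ** (k - 1) for k in range(2, 3000))
print(series, 2 * q / (1 - q) ** 3)  # both ≈ 96.0
print(p * series, 2 * q / p ** 2)    # E[X(X-1)] ≈ 24.0
```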

Step 2: Find E[X²]

Using the property E[X(X-1)] = E[X^2 - X] = E[X^2] - E[X], we can rearrange to solve for E[X^2].

E[X^2] = E[X(X-1)] + E[X] = \frac{2q}{p^2} + \frac{1}{p}

Step 3: Calculate the Variance

Var(X) = E[X^2] - (E[X])^2 = \left(\frac{2q}{p^2} + \frac{1}{p}\right) - \left(\frac{1}{p}\right)^2
= \frac{2q}{p^2} + \frac{p}{p^2} - \frac{1}{p^2} = \frac{2q + p - 1}{p^2}

Substitute p - 1 = -q:

= \frac{2q - q}{p^2} = \frac{q}{p^2}
Final Variance Formula
Var(X) = \frac{1-p}{p^2}
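For example, at p = 0.25 these formulas give E[X] = 1/0.25 = 4 and Var(X) = 0.75/0.0625 = 12, the values shown in the interactive example above. A simulation-based check (assuming NumPy):

```python
import numpy as np

rng = np.random.default_rng(1)
p = 0.25

x = rng.geometric(p, size=500_000)
print(x.mean(), 1 / p)           # ≈ 4.0
print(x.var(), (1 - p) / p**2)   # ≈ 12.0
```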

Applications