Geometric Distribution
Modeling the number of trials needed to get the first success.
The Geometric distribution answers the question: "How many times do I have to try until I get my first success?" It models the number of independent Bernoulli trials required to achieve the first success.
In finance, this could model the number of trades you need to make until you have your first profitable one, or how many quarters it will take for a startup in your portfolio to finally turn a profit. It's always right-skewed, because a small number of trials is always more likely than a large number.
Core Concepts
This formula is very intuitive. For the first success to occur on trial , two things must happen:
- You must have exactly failures in a row first. The probability of one failure is , so the probability of independent failures is .
- The -th trial itself must be a success, which has a probability of .
Multiplying these probabilities together gives the formula.
The CDF can be derived from its complementary event: the probability of needing *more* than `k` trials for the first success. This only happens if the first `k` trials are all failures, an event with probability . Therefore, the probability of needing `k` or fewer trials is .
Key Derivations
Deriving the Expected Value (Mean)
Step 1: Set up the Infinite Series for E[X]
The expected value is the sum of each outcome multiplied by its probability . Let .
We can pull the constant out:
Step 2: Use the Geometric Series Derivative Trick
Recall the formula for an infinite geometric series: .
If we take the derivative of both sides with respect to , we get:
This gives us the value of the summation from Step 1.
Step 3: Substitute and Solve
Substitute this result back into the equation for :
Since , we have .
Deriving the Variance
We use . A common trick is to first find and then use it to find .
Step 1: Calculate E[X(X-1)]
We set up another series:
This sum is the second derivative of the geometric series formula with respect to , multiplied by .
Therefore, the summation part is .
Step 2: Find E[X²]
Using the property , we can rearrange to solve for .
Step 3: Calculate the Variance
Substitute :
Applications
Quantitative Finance: Modeling Time to First Default
An analyst models the probability of a company defaulting in any given year as `p=0.05`. They can use the Geometric distribution to answer questions like:
- "What is the probability the company defaults for the first time in year 5?" ()
- "What is the average number of years we expect this company to survive without defaulting?" ( years)