Negative Binomial Distribution
Modeling the number of trials needed to achieve a specified number of successes.
The Negative Binomial distribution answers the question: "How many trials will it take to get my -th success?" It is a generalization of the Geometric distribution, which is just the special case where .
In finance, a trader might use this to model how many trades it will take to achieve 10 winning trades. A venture capitalist could model how many startups they need to fund to get 3 successful exits.
Core Concepts
For the -th success to happen on trial , two things must be true:
- In the first trials, there must have been exactly successes. The number of ways this can happen is .
- The -th trial itself must be a success (with probability ).
- The overall probability combines the ways the previous successes could happen with the probabilities of those successes and failures: , and then all of that is multiplied by the probability of the final success on trial , which gives .
Key Derivations
Deriving the Expected Value (Mean)
Step 1: Decompose into Geometric Variables
Let be the total number of trials to get successes. We can think of as the sum of independent random variables, where each is the number of trials to get the next success after the previous one.
Each follows a Geometric distribution with probability . We know from the Geometric distribution page that .
Step 2: Use Linearity of Expectation
The expectation of a sum is the sum of the expectations.
Step 3: Sum the Geometric Means
Since each has the same mean, we are just adding to itself times.
Deriving the Variance
We use the same decomposition as above. The variance of a sum of *independent* random variables is the sum of their variances.
Step 1: Sum the Variances of Geometric Variables
The variance of a Geometric distribution is .
Step 2: Final Result
We are adding the same variance to itself times.
Applications
Quantitative Finance: Structuring Products
An investment bank is structuring a "first-to-default" credit-linked note on a basket of 10 bonds. They need to achieve their first (`r=1`) default to trigger a payout. If the annual probability of default for any bond is `p`, they can use the Geometric distribution (a special case of Negative Binomial with r=1) to model the waiting time for this event and price the note accordingly.