Poisson Distribution
Modeling the number of events occurring in a fixed interval of time or space.
The Poisson Distribution is used to model the number of times an event occurs within a specified interval. The key assumptions are that events are independent, the average rate of events is constant, and two events cannot occur at the exact same instant.
In finance, it's particularly useful for modeling rare events. For example, a credit analyst might use it to model the number of defaults in a large portfolio of loans over a month, or a trader might use it to model the number of times a stock's price jumps by more than 5% in a single day.
Core Concepts
- is the number of occurrences of an event (0, 1, 2, ...).
- (lambda) is the average number of events per interval.
- is Euler's number (approximately 2.71828).
- The (k factorial) term is crucial because it accounts for the fact that the events can happen in any order within the interval.
The CDF accumulates the probabilities from the PMF. For example, the probability of observing 2 or fewer events, , is the sum of the probabilities of observing 0, 1, and 2 events: .
Key Derivations
Deriving the Expected Value (Mean)
Step 1: The Moment-Generating Function (MGF)
The MGF of a Poisson() random variable is defined as . It can be shown to be:
Step 2: Find the First Derivative
The mean, , is the first derivative of the MGF evaluated at . We use the chain rule:
Step 3: Evaluate at t=0
Now, we substitute into the first derivative.
Deriving the Variance
We use . We need the second moment, , which is the second derivative of the MGF at .
Step 1: Find the Second Derivative
We differentiate using the product rule:
Step 2: Evaluate at t=0
Substitute into the second derivative to find .
Step 3: Calculate the Variance
Now we use the variance formula:
Applications
Quantitative Finance: Operational Risk
Banks use the Poisson distribution to model operational risk, such as the number of fraudulent transactions or system failures per week. If a bank's system typically has failures per week, and one week they experience 8 failures, they can use the PMF to calculate how unlikely that event was. This triggers an investigation to see if something has fundamentally changed in their system's stability.
Machine Learning: Feature Engineering
In analyzing customer behavior data (e.g., website visits), the number of times a user performs a certain action in a day (like 'add to cart') might follow a Poisson distribution. Understanding this can help in feature engineering. Instead of just using the raw count, a data scientist might create a feature like "Is this user's activity a rare event?" by calculating the Poisson probability .