# AP Statistics Curriculum 2007 Distrib Poisson

## General Advance-Placement (AP) Statistics Curriculum - Poisson Random Variables and Experiments

### Poisson Random Variables and Experiments

• Definition: Poisson Distribution is a discrete probability distribution that expresses the probability of the number of events occurring in a fixed period of time. These events occur with a known average rate and are independent of the time since the last event. The Poisson Distribution can also be used for the number of events in other specified intervals such as distance, area or volume.
• Mass function: For X~Poisson(λ), the Poisson mass function is given by $P(X=k)=\frac{\lambda^k e^{-\lambda}}{k!},\,\!$ where
• e is the natural number (e = 2.71828...)
• k is the number of occurrences of an event - the probability of which is given by the mass function
• $k! = 1\times 2\times 3\times \cdots \times k$
• λ is a positive real number, equal to the expected number of occurrences that occur during the given interval. For instance, if the events occur on average every 4 minutes, and you are interested in the number of events occurring in a 10 minute interval, you would use as model a Poisson distribution with λ=10/4=2.5.
• Expectation: The expected value of a Poisson distributed random variable X is λ.

### Example

Suppose one expects to see 40 shooting stars per hour at night (by observing at a specific location and not using visual enhancers). Assume that the process of observing arrivals (shooting stars) is Poisson(λ = 40).

• Compute the probability that in a 1 hour timeframe, the number of observed shooting stars will be between 44 and 52! We can compute this as follows: Let $X\sim Poisson(\lambda)$.

$P(44 \le X \le 52) = \sum_{x=44}^{52} \frac{40^x e^{-40}}{x!}=0.255719.$ The figure below shows this result using SOCR distributions:

• Note: We observe that this distribution is bell-shaped. We can also use the Normal distribution to approximate this probability. Using $Y\sim N(\mu=40, \sigma=\sqrt{40}=6.325)$, together with the continuity correction for better approximation we obtain $P(44 \le X \le 52)=P(44 \le Y \le 52)=0.234655$, which is close to the exact that was found earlier. The figure below shows this probability.

### Applications

The Poisson Distribution arises in many different discrete situations when the probability of the observed phenomenon is constant in time or space. Examples of events that may be modeled by Poisson Distribution include:

• The number of cars that pass through a certain point on a road (sufficiently distant from traffic lights) during a given period of time.
• The number of spelling mistakes one makes while typing a single page.
• The number of phone calls at a call center per minute.
• The number of times a web server is accessed per minute.
• The number of road kill (animals killed) found per unit length of road.
• The number of mutations in a given stretch of DNA after a certain amount of radiation exposure.
• The number of unstable atomic nuclei that decayed within a given period of time in a piece of radioactive substance.
• The number of pine trees per unit area of mixed forest.
• The number of stars in a given volume of space.
• The distribution of visual receptor cells in the retina of the human eye.
• The number of light bulbs that burn out in a certain amount of time.
• The number of viruses that can infect a cell in cell culture.
• The number of hematopoietic stem cells in a sample of unfractionated bone marrow cells.
• The number of inventions of an inventor over their career.
• The number of particles that "scatter" off of a target in a nuclear or high energy physics experiment.

### Poisson as a Limiting Case of Binomial Distribution

In several of the above examples, the events being counted are actually the outcomes of discrete trials, and would more precisely be modeled using the Binomial Distribution with parameters n (number of trials) and $p={\lambda \over n}$ (probability of success). That is, Binomial(n,p) approaches the Poisson(λ) distribution, where expected value $\lambda=p\times n$, as n approaches infinity. This provides the means to approximate Binomial random variables (for large n) using the Poisson distribution. The details of this approximation are below:

• $\lim_{n\to\infty}\left(1-{\lambda \over n}\right)^n=e^{-\lambda}.$

Let $p = \frac{\lambda}{n}$ and $X \sim Binomial(n, p)$. Then we have

$\lim_{n\to\infty} P(X=k)=\lim_{n\to\infty}{n \choose k} p^k (1-p)^{n-k} =\lim_{n\to\infty}{n! \over (n-k)!k!} \left({\lambda \over n}\right)^k \left(1-{\lambda\over n}\right)^{n-k}$
$=\lim_{n\to\infty} \underbrace{\left({n \over n}\right)\left({n-1 \over n}\right)\left({n-2 \over n}\right) \cdots \left({n-k+1 \over n}\right)}\ \underbrace{\left({\lambda^k \over k!}\right)}\ \underbrace{\left(1-{\lambda \over n}\right)^n}\ \underbrace{\left(1-{\lambda \over n}\right)^{-k}}$
• As $n \longrightarrow \infty$ the first term approaches 1; the second remains constant since "n" does not appear in it at all; the third approaches e; and the fourth expression approaches 1.
• Consequently the limit is
${\lambda^k e^{-\lambda} \over k!}.\,\!$
• More generally, whenever a sequence of binomial random variables with parameters n and pn is such that
$\lim_{n\rightarrow\infty} np_n = \lambda,$

the sequence converges (in distribution) to a Poisson random variable with mean λ.