Some probability distributions are extremely versatile and can be used to model a wide variety of natural phenomena
The binomial, hypergeometric, and uniform models were already explored in Probability. Some other key distributions are explored more in Statistics for their wide applicability.
Poisson Distribution
The binomial distribution is awesome, until you want to use it on scales. Then you’re stuck wondering, how do I calculate ?
The Poisson limit is an approximation used when as , i.e. when is large, is small:
How large does have to be and how small does have to be before the Poisson limit becomes a good approximation? Without trying to define “good approximation”, empirical calculations show that this can become remarkably good by ().
If you were clever and/or had previously heard about Poisson in your Computational Neuroscience class, you might realize that the binomial distribution taken asymptotically can be viewed as observing the occurrence of events over continuous time. The Poisson distribution was quickly (well it took 50 years), found to be very good at modeling situations that had neither an explicit binomial random variable nor defined values for and .
Definition: The Poisson distribution is defined as where . We find
Let the random variable denote the interval between consecutive events: is an exponential distribution
Normal Distribution
De Moivre first derived another approximation for the binomial distribution,
where is a binomial random variable, is large, and
Laplace extended this to the De Moivre-Laplace theorem,
where is a binomial random variable
Definition: is referred to as the standard normal (Gaussian) curve, and is the convention for a random variable. We see that and
One way of proving this result is showing that the LHS’s moment-generating function is and that the RHS evaluates to the same quantity
There is no closed form for the CDF . Instead, we either find values with computers or consult a normal table
Continuity correction deals with the fact that binomial random variables are discrete by improving our estimation with
As a general rule of thumb, this limit is best used when and
Central Limit Theorem: Let be an infinite sequence of i.i.d. random variables, with finite and . For any and ,
This is also often written as (where is the average of ).
The amazing thing is this theorem works for any sequence of i.i.d. random variables
Many individual random variables can also be approximated with a standard curve. One could view this as a sum over error factors. A random variable is normally distributed if . One could also write .
Using , we find that the sum of two normal variables and is another normal variable
Proof of the Central Limit Theorem
Proving the Central Limit Theorem in full generality is difficult, but a slightly weaker proof assuming moment-generating functions exist for is succinct
Lemma: If then . I.e. if the moment generating functions match, then the CDFs match.
We must show
We define
Taylor’s theorem gives us where is some remainder
,
The existence of implies the existence of all derivatives, and therefore is continuous for all
Since , as , and
So
Geometric Distribution
Consider a series of independent trials, each succeeding or failing. If is the trial at which the first success occurs, .
This is called a geometric distribution,
The negative binomial distribution generalizes this to waiting for the th success. This is equivalent to saying there are successes in the first trials and a success on the th trial,
, , all result immediately from linearity of expectation
Gamma Distribution
Theorem: Suppose that Poisson events occur with a rate of per unit time. Let denote the waiting time for the th event,
This implies that converges for any real number, which justifies considering the gamma function
This is a funky function! , , so if is an integer then
We extend our equation above and define the gamma pdf as
We get and
Gamma random variables are additive, just like Poisson and Normal
The Gamma distribution is used in tests, based on the distribution, which is a special case of the gamma pdf where and