Central Limit Theorem

Markov’s inequality: If $X$ is a random variable that takes only nonnegative values, then for any value $a > 0$, $P(X \geq a) \leq \frac{E[X]}{a}$.

This is simple to prove. Let $I$ indicate the event $X \geq a$, i.e. $I = 1$ if $X \geq a$ and $I = 0$ otherwise. Since $X$ takes only nonnegative values, we see $I \leq \frac{X}{a}$. Therefore, $P(X \geq a) = E[I] \leq E\left[\frac{X}{a}\right] = \frac{E[X]}{a}$.
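As a quick numerical sanity check (a sketch, not part of the proof; the Exponential(1) distribution here is an arbitrary choice of nonnegative random variable):

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.exponential(scale=1.0, size=1_000_000)  # E[X] = 1, X >= 0

for a in [1, 2, 4, 8]:
    empirical = np.mean(samples >= a)  # Monte Carlo estimate of P(X >= a)
    bound = samples.mean() / a         # Markov bound E[X]/a
    print(f"a={a}: P(X >= a) ~ {empirical:.4f} <= bound {bound:.4f}")
```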

As a corollary,
Chebyshev’s inequality: If $X$ is a random variable with finite mean $\mu$ and variance $\sigma^2$, then for any $k > 0$, $P(|X - \mu| \geq k) \leq \frac{\sigma^2}{k^2}$.

To get this, we apply Markov’s inequality to the nonnegative random variable $(X - \mu)^2$ with $a = k^2$, obtaining $P\left((X - \mu)^2 \geq k^2\right) \leq \frac{E[(X - \mu)^2]}{k^2} = \frac{\sigma^2}{k^2}$. Since $(X - \mu)^2 \geq k^2$ exactly when $|X - \mu| \geq k$, this is directly equivalent to Chebyshev’s.
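A minimal sketch in the same spirit, assuming a Uniform(0, 1) distribution purely for illustration (so $\mu = 1/2$, $\sigma^2 = 1/12$):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=1_000_000)
mu, var = 0.5, 1 / 12

for k in [0.2, 0.3, 0.4, 0.45]:
    empirical = np.mean(np.abs(x - mu) >= k)  # P(|X - mu| >= k)
    bound = var / k**2                        # Chebyshev bound sigma^2 / k^2
    print(f"k={k}: P(|X - mu| >= k) ~ {empirical:.4f} <= bound {bound:.4f}")
```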

These bounds are important when we know little about a distribution besides its mean (and variance).

Central Limit Theorem: Let $X_1, X_2, \ldots$ be a sequence of independent and identically distributed random variables, each with mean $\mu$ and variance $\sigma^2$. The distribution of $\frac{X_1 + \cdots + X_n - n\mu}{\sigma\sqrt{n}}$ tends towards the standard normal as $n \to \infty$.

That is, for $-\infty < a < \infty$, $P\left(\frac{X_1 + \cdots + X_n - n\mu}{\sigma\sqrt{n}} \leq a\right) \to \Phi(a) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{a} e^{-x^2/2}\,dx$ as $n \to \infty$.
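Before the proof, a small simulation sketch of the statement; taking the $X_i$ to be Exponential(1) (so $\mu = \sigma = 1$) is an assumption made only for illustration:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
n, trials = 100, 200_000
mu, sigma = 1.0, 1.0                        # Exponential(1) has mu = sigma = 1

sums = rng.exponential(1.0, size=(trials, n)).sum(axis=1)
z = (sums - n * mu) / (sigma * np.sqrt(n))  # standardized sums

for a in [-1.0, 0.0, 1.0, 2.0]:
    print(f"a={a:+.0f}: empirical {np.mean(z <= a):.4f} vs Phi(a) {norm.cdf(a):.4f}")
```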

The proof leverages the following,
Lemma: Let $Z_1, Z_2, \ldots$ be a sequence of random variables with distribution functions $F_{Z_n}$ and moment generating functions $M_{Z_n}$, and let $Z$ be a random variable having distribution function $F_Z$ and moment generating function $M_Z$. If $M_{Z_n}(t) \to M_Z(t)$ for all $t$, then $F_{Z_n}(t) \to F_Z(t)$ for all $t$ at which $F_Z$ is continuous. This makes sense with our previous intuition that the moment generating function is fully representative of a random variable.

If we let $Z$ be a standard normal random variable, $M_Z(t) = e^{t^2/2}$. Therefore, we just need to show that the moment generating functions of our sequence of standardized sums tend towards $e^{t^2/2}$.

Suppose first that $\mu = 0$ and $\sigma = 1$. Assuming the moment generating function of the $X_i$, $M(t) = E[e^{tX_i}]$, exists and is finite, the moment generating function of $\frac{X_i}{\sqrt{n}}$ is $E\left[e^{tX_i/\sqrt{n}}\right] = M\left(\frac{t}{\sqrt{n}}\right)$.

Since the $X_i$ are independent, we can say that the moment generating function of $\sum_{i=1}^{n} \frac{X_i}{\sqrt{n}}$ is $\left[M\left(\frac{t}{\sqrt{n}}\right)\right]^n$.
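As a numerical sketch of this step, assume (for illustration only) that the $X_i$ are Rademacher variables ($\pm 1$ with equal probability), so $\mu = 0$, $\sigma^2 = 1$, and $M(t) = \cosh(t)$:

```python
import numpy as np

rng = np.random.default_rng(0)
n, trials, t = 50, 500_000, 0.7
x = rng.choice([-1.0, 1.0], size=(trials, n))  # Rademacher samples
s = x.sum(axis=1) / np.sqrt(n)

empirical = np.mean(np.exp(t * s))             # E[exp(t * sum_i X_i / sqrt(n))]
closed_form = np.cosh(t / np.sqrt(n)) ** n     # [M(t/sqrt(n))]^n with M = cosh
print(empirical, closed_form)                  # both ~ 1.277 ~ exp(t^2/2)
```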

Let $L(t) = \log M(t)$. Note that $L(0) = \log M(0) = 0$, $L'(0) = \frac{M'(0)}{M(0)} = \mu = 0$, and $L''(0) = E[X_i^2] - \mu^2 = \sigma^2 = 1$.

We must show that $\left[M\left(\frac{t}{\sqrt{n}}\right)\right]^n \to e^{t^2/2}$, equivalently that $n L\left(\frac{t}{\sqrt{n}}\right) \to \frac{t^2}{2}$.

Using L’Hôpital (twice), after substituting $x = 1/\sqrt{n}$ so that $n L\left(\frac{t}{\sqrt{n}}\right) = \frac{L(tx)}{x^2}$ with $x \to 0$: $\lim_{x \to 0} \frac{L(tx)}{x^2} = \lim_{x \to 0} \frac{t L'(tx)}{2x} = \lim_{x \to 0} \frac{t^2 L''(tx)}{2} = \frac{t^2}{2}$, where the first application uses $L'(0) = 0$ and the final step uses $L''(0) = 1$.
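A symbolic sanity check of this limit for the same assumed Rademacher MGF $M(t) = \cosh(t)$; sympy effectively carries out the same series/L’Hôpital reasoning:

```python
import sympy as sp

t = sp.symbols('t', positive=True)   # positivity assumption simplifies the limit
n = sp.symbols('n', positive=True)

L = sp.log(sp.cosh(t / sp.sqrt(n)))  # L(t/sqrt(n)) with L = log M, M(t) = cosh(t)
print(sp.limit(n * L, n, sp.oo))     # prints t**2/2
```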

This proves the central limit theorem on standard variables ($\mu = 0$, $\sigma = 1$). The same result can be applied to any variable by considering its standardized version $X_i^* = \frac{X_i - \mu}{\sigma}$.

This theorem states that for each individual $a$, $P\left(\frac{X_1 + \cdots + X_n - n\mu}{\sigma\sqrt{n}} \leq a\right) \to \Phi(a)$.
It can also be shown that this convergence is uniform in $a$, meaning for all $\epsilon > 0$, there is an $N$ such that $\left|P\left(\frac{X_1 + \cdots + X_n - n\mu}{\sigma\sqrt{n}} \leq a\right) - \Phi(a)\right| < \epsilon$ for all $a$ and all $n \geq N$.

Example:
Suppose an astronomer wants to measure the distance $d$ to a star. He has a measuring technique, but he knows there’s a variance of $4$ light-years$^2$ in his observations. He wants to make $n$ observations and take the average $\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$ as his estimate. How many measurements does he need to be at least $95\%$ sure $\bar{X}_n$ is within $\pm 0.5$ light-years of $d$?


The central limit theorem tells us $Z_n = \frac{\sum_{i=1}^{n} X_i - nd}{2\sqrt{n}}$ is approximately a standard normal random variable.

So we need to find the smallest $n$ with $P\left(-0.5 \leq \bar{X}_n - d \leq 0.5\right) = P\left(-\frac{\sqrt{n}}{4} \leq Z_n \leq \frac{\sqrt{n}}{4}\right) \approx 2\Phi\left(\frac{\sqrt{n}}{4}\right) - 1 \geq 0.95$, or $\Phi\left(\frac{\sqrt{n}}{4}\right) \geq 0.975$.
The inverse CDF does not have a closed form, but a solver would tell us $\Phi^{-1}(0.975) \approx 1.96$, so $\frac{\sqrt{n}}{4} \geq 1.96$ gives $n \geq (7.84)^2 \approx 61.5$, which means $n = 62$ observations.
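A sketch of that solver step using scipy, assuming the $95\%$ target above:

```python
import math
from scipy.stats import norm

z = norm.ppf(0.975)          # inverse CDF Phi^{-1}(0.975) ~ 1.96
n = math.ceil((4 * z) ** 2)  # need sqrt(n)/4 >= z, i.e. n >= (4z)^2
print(z, n)                  # 1.959..., 62
```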

Technically, we don’t know how large $n$ must be before the normal approximation becomes a good approximation of this distribution. If we are especially unsure, we can use Chebyshev’s inequality for a guaranteed (though more conservative) bound.

Since $E[\bar{X}_n] = d$ and $\mathrm{Var}(\bar{X}_n) = \frac{4}{n}$, we get $P\left(|\bar{X}_n - d| > 0.5\right) \leq \frac{4/n}{(0.5)^2} = \frac{16}{n} \leq 0.05$, so $n = 320$ observations.
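As a cross-check of the CLT answer, a quick simulation; assuming normally distributed measurement errors with variance $4$ is an extra assumption made purely for this sketch:

```python
import numpy as np

rng = np.random.default_rng(0)
d, n, trials = 10.0, 62, 200_000                 # d is an arbitrary "true" distance
errors = rng.normal(0.0, 2.0, size=(trials, n))  # sd 2 = sqrt(variance 4)
estimates = d + errors.mean(axis=1)
print(np.mean(np.abs(estimates - d) <= 0.5))     # ~0.95 with only n = 62
```

Chebyshev’s $n = 320$ is far more conservative, but it holds regardless of how the errors are distributed.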

General Central Limit Theorem: Let $X_1, X_2, \ldots$ be a sequence of independent random variables with $\mu_i = E[X_i]$ and $\sigma_i^2 = \mathrm{Var}(X_i)$. If (a) the $X_i$ are uniformly bounded and (b) $\sum_{i=1}^{\infty} \sigma_i^2 = \infty$, then $P\left(\frac{\sum_{i=1}^{n}(X_i - \mu_i)}{\sqrt{\sum_{i=1}^{n} \sigma_i^2}} \leq a\right) \to \Phi(a)$ as $n \to \infty$.
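A quick simulation sketch of this more general statement; the independent but non-identical Uniform$(-a_i, a_i)$ variables below are chosen (as an assumption for illustration) to satisfy both conditions:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
n, trials = 500, 100_000
a = 1.0 + (np.arange(1, n + 1) % 3)       # bounded amplitudes cycling through {1, 2, 3}
var = a**2 / 3                            # Var(Uniform(-a, a)) = a^2/3; the sum diverges

x = rng.uniform(-a, a, size=(trials, n))  # independent, mu_i = 0, not identically distributed
z = x.sum(axis=1) / np.sqrt(var.sum())    # standardized sum

for q in [-1.0, 0.0, 1.0]:
    print(f"a={q:+.0f}: empirical {np.mean(z <= q):.4f} vs Phi(a) {norm.cdf(q):.4f}")
```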

It really is a remarkable fact.

Strong Law of Large Numbers

Theorem: Let $X_1, X_2, \ldots$ be a sequence of i.i.d. random variables, with finite mean $\mu = E[X_i]$. With probability $1$, $\frac{X_1 + X_2 + \cdots + X_n}{n} \to \mu$ as $n \to \infty$.

This implies we can approximate the probability of an event by the fraction of times it occurs over many repeated trials.
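A short sketch of the strong law in action, with fair die rolls as an arbitrary example ($\mu = 3.5$):

```python
import numpy as np

rng = np.random.default_rng(0)
rolls = rng.integers(1, 7, size=100_000)                  # fair die, mu = 3.5
running_mean = np.cumsum(rolls) / np.arange(1, rolls.size + 1)

for n in [10, 100, 10_000, 100_000]:
    print(f"n={n}: average = {running_mean[n - 1]:.4f}")  # settles near 3.5
```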

Proof:
We assume the fourth moment of the $X_i$ is finite, i.e. $E[X_i^4] = K < \infty$ (however, the theorem can be proven without this).

Assume first that $\mu = 0$. Let $S_n = \sum_{i=1}^{n} X_i$. Consider $E[S_n^4]$. Expanding $(X_1 + \cdots + X_n)^4$ yields terms in the form $X_i^4$, $X_i^3 X_j$, $X_i^2 X_j^2$, $X_i^2 X_j X_k$, and $X_i X_j X_k X_l$, where $i, j, k, l$ are distinct. The terms with a single unpaired $X_i$ have mean $0$ by independence, e.g. $E[X_i^3 X_j] = E[X_i^3]E[X_j] = 0$; and each of the $\binom{n}{2}$ products $X_i^2 X_j^2$ appears $\binom{4}{2} = 6$ times. So expanding yields $E[S_n^4] = n E[X_i^4] + 3n(n-1) E[X_i^2 X_j^2] = nK + 3n(n-1)\left(E[X_i^2]\right)^2$,

so, since $0 \leq \mathrm{Var}(X_i^2) = E[X_i^4] - (E[X_i^2])^2$ gives $(E[X_i^2])^2 \leq K$, we get $E[S_n^4] \leq nK + 3n(n-1)K$.
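As an exact sanity check of this expansion, take the $X_i$ to be Rademacher (an assumption for illustration): then $K = (E[X_i^2])^2 = 1$ and the bound holds with equality, $E[S_n^4] = n + 3n(n-1) = 3n^2 - 2n$, which small cases confirm by exhaustive enumeration:

```python
from itertools import product

for n in range(1, 8):
    # average S_n^4 over all 2^n equally likely sign vectors (exact expectation)
    fourth = sum(sum(signs) ** 4 for signs in product([-1, 1], repeat=n)) / 2**n
    print(n, fourth, 3 * n**2 - 2 * n)  # the two columns agree
```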

Dividing by $n^4$, $E\left[\frac{S_n^4}{n^4}\right] \leq \frac{K}{n^3} + \frac{3K}{n^2}$, which implies

$E\left[\sum_{n=1}^{\infty} \frac{S_n^4}{n^4}\right] = \sum_{n=1}^{\infty} E\left[\frac{S_n^4}{n^4}\right] < \infty$, since the series $\sum_{n} \left(\frac{K}{n^3} + \frac{3K}{n^2}\right)$ converges. This implies that the series $\sum_{n=1}^{\infty} \frac{S_n^4}{n^4}$ is finite with probability $1$ (since the expectation would otherwise be infinite). The convergence of the series implies its terms go to $0$, i.e. $\frac{S_n^4}{n^4} \to 0$, which implies $\frac{S_n}{n} \to 0$ with probability $1$.

When $\mu \neq 0$, we can apply this argument to the random variables $X_i - \mu$ to obtain that, with probability $1$, $\lim_{n \to \infty} \frac{\sum_{i=1}^{n} (X_i - \mu)}{n} = 0$, i.e. $\lim_{n \to \infty} \frac{\sum_{i=1}^{n} X_i}{n} = \mu$.