Sampling Distributions

The normal distribution is pretty rad. We’ve already talked about the usefulness of $Z$ -tests for Hypothesis Testing.md

The $Z$ -tests we’ve considered so far have required us knowing the true variance $σ^{2}$ . But what if we don’t know this?

The typical MLE estimator is $S^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} (Y_{i} - \overset{ˉ}{Y})^{2}$
The logical question is whether we can then use $\frac{Y ˉ - μ}{S / n}$ for our decision rules

When $n$ is large, we can basically treat them the same. However, when $n$ is smaller, changing $σ$ to $S$ makes a difference.

William Sealy Gossett graduated from Oxford in 1899 with First Class degrees in chemistry and mathematics. He took a position at Guinness and was tasked with making their recipes more scientific. This task had inherently small sample sizes and he became convinced that $\frac{Y ˉ - μ}{S / n}$ had a different pdf. He derived the proper pdf and published it anonymously (under the name Student) in 1908, since Guinness forbid employees from publishing papers.

It took a while for anyone to realize the importance of this work. We now recognize it as the Student t distribution

In deriving this distribution, we will encounter several other sampling distributions, distributions that model the behavior of functions based on sets of random variables, used for inference

Theorem: $U = \sum_{j = 1}^{m} Z_{j}^{2}$ where $Z_{i}$ is a standard normal RV has a gamma distribution with $r = \frac{m}{2}$ and $λ = \frac{1}{2}$ . $f_{U} (u) = \frac{1}{2 ^{m /2} Γ ( \frac{m}{2} )} u^{m /2 - 1} e^{- u /2}$

Consider $m = 1$ , $F_{Z^{2}} (u) = P (Z^{2} \leq u) = 2 P (0 \leq Z \leq u)$
$= \frac{2}{2 π} \int_{0}^{u} e^{- z^{2} /2} d z$
$f_{Z^{2}} (u) = \frac{2}{2 π} \frac{1}{2 u} e^{- u /2} = \frac{1}{2 ^{1/2} Γ ( \frac{1}{2} )} u^{1/2 - 1} e^{- u /2}$
This is a gamma pdf with $r = \frac{1}{2}$ , $λ = \frac{1}{2}$
The sum of $m$ gamma pdfs like this has $r = \frac{m}{2}$ , $λ = \frac{1}{2}$

Definition: The pdf of $U = \sum_{j = 1}^{m} Z_{j}^{2}$ is called the chi square distribution with m degrees of freedom

Theorem: $S^{2}$ and $\overset{ˉ}{Y}$ are independent and $\frac{( n - 1 ) S ^{2}}{σ ^{2}} = \frac{1}{σ ^{2}} \sum_{i = 1}^{n} (Y_{i} - \overset{ˉ}{Y})^{2}$ has a chi square distribution with $n - 1$ degrees of freedom

Definition: Suppose that $U$ and $V$ are independent chi square random variables with $n$ and $m$ degrees of freedom. $\frac{V / m}{U / n}$ is said to have an F distribution with m and n degrees of freedom

$F$ commemorates Sir Ronald Fisher (also involved with the Student $t$ distribution)

Theorem: $f_{F_{m, n}} (w) = \frac{Γ ( \frac{m + n}{2} ) m ^{m /2} n ^{n /2} w ^{m /2 - 1}}{Γ ( \frac{m}{2} ) Γ ( \frac{n}{2} ) ( n + m w ) ^{(m + n) /2}}$ , $w \geq 0$
We derive this with two equations,
$f_{V / U} (w) = \int_{0}^{\infty} ∣ u ∣ f_{U} (u) f_{V} (u w) d u$ and
$f_{\frac{V / m}{U / n}} (w) = \frac{m}{n} f_{V / U} (\frac{m}{n} w)$

Definition: Let $Z$ be a standard normal random variable and let $U$ be a chi square random variable independent of $Z$ with $n$ degrees of freedom. The Student t ratio with n degrees of freedom is $T_{n} = \frac{Z}{U / n}$

We often abbreviate degrees of freedom as df

$f_{T_{n}} (t) = f_{T_{n}} (- t)$

Theorem: The pdf for a Student $t$ random variable with $n$ degrees of freedom is $f_{T_{n}} (t) = \frac{Γ ( \frac{n + 1}{2} )}{nπ Γ ( \frac{n}{2} ) ( 1 + \frac{t ^{2}}{n} ) ^{(n + 1) /2}}$ for $- \infty < t < \infty$ . This is often denoted as $f_{t} (t)$ .

We use $T_{n}^{2} = \frac{Z ^{2}}{U / n}$ (an $F$ distribution) to derive the pdf

Theorem: $T_{n - 1} = \frac{Y ˉ - μ}{S / n}$

The proof is simple, $\frac{Y ˉ - μ}{S / n} = \frac{\frac{Y ˉ - μ}{σ / n}}{\frac{( n - 1 ) S ^{2}}{σ ^{2} ( n - 1 )}} = \frac{Z}{U / ( n - 1 )}$

Both $T_{n}$ and $Z$ are bell shaped and symmetric around zero. $T_{n}$ is flatter and has thicker tails. As $n$ increases, $T_{n}$ approaches $Z$ .

Drawing Inferences

Now we can draw inferences about $μ$ when $σ$ is not known

Theorem: Let $y_{1}, y_{2}, \dots, y_{n}$ be a randoms ample from a normal distribution with unknown mean $μ$ . A $100 (1 - α) %$ confidence interval for $μ$ is $(\overset{y}{ˉ} - t_{α /2, n - 1} \cdot \frac{s}{n}, \overset{y}{ˉ} + t_{α /2, n - 1} \cdot \frac{s}{n})$

The procedure for testing $H_{0} : μ = μ_{0}$ for unknown $σ$ is called the one-sampled t test

Theorem: Let $y_{1}, y_{2}, \dots, y_{n}$ be a random sample from a normal distribution with unknown $σ$ . $H_{0} := μ = μ_{0}$ . Let $t = \frac{y ˉ - μ _{0}}{s / n}$

Accept $H_{1} := μ > μ_{0}$ if $t \geq t_{α, n - 1}$
Accept $H_{1} := μ < μ_{0}$ if $t \leq - t_{α, n - 1}$
Accept $H_{1} : μ \neq = μ_{0}$ if $t \leq - t_{α /2, n - 1}$ or $t \geq t_{α /2, n - 1}$

We prove this by showing that $t$ is a monotonic function of $λ$ , satisfying GLRT

Of course, $t$ tests make the assumption that our samples are normally distributed. However,

The distribution of $\frac{Y ˉ - μ}{S / n}$ is relatively unaffected by the pdf of $y_{i}$ , provided $f_{Y} (y)$ is not too skewed and $n$ is not too small
As $n$ increases, $\frac{Y ˉ - μ}{S / n}$ becomes increasingly similar to $f_{T_{n - 1}} (t)$

This is awesome. Our $t$ test is robust, meaning it is not heavily dependent on its assumptions. Departures from normality are acceptable.

Sometimes we’d like to estimate $σ^{2}$ instead of $μ$ . We now have the tools to do this.

$\frac{( n - 1 ) S ^{2}}{σ ^{2}}$ has a chi square distribution with df $= n - 1$
$P [χ_{α /2, n - 1}^{2} \leq \frac{( n - 1 ) S ^{2}}{σ ^{2}} \leq χ_{1 - α /2, n - 1}^{2}]$

Theorem: Let $s^{2}$ denote the sample variance from $n$ observations drawn from a normal distribution. A $100 (1 - α) %$ confidence interval for $σ^{2}$ is $(\frac{( n - 1 ) s ^{2}}{χ _{1 - α /2, n - 1}^{2}}, \frac{( n - 1 ) s ^{2}}{χ _{α /2, n - 1}^{2}})$

We can create a corresponding decision test

Theorem: Let $s^{2}$ be the sample variance from $n$ observations drawn from a normal distribution. $H_{0} : σ^{2} = σ_{0}^{2}$ . Let $χ^{2} = (n - 1) s^{2} / σ_{0}^{2}$ .

Accept $H_{1} := σ^{2} > σ_{0}^{2}$ if $χ^{2} \geq χ_{1 - α, n - 1}^{2}$
Accept $H_{1} := σ^{2} < σ_{0}^{2}$ if $χ^{2} \leq χ_{1 - α, n - 1}^{2}$
Accept $H_{1} : σ^{2} \neq = σ_{0}^{2}$ if $χ^{2} \leq - χ_{α /2, n - 1}^{2}$ or $χ^{2} \geq χ_{α /2, n - 1}^{2}$

Working with Type II error under these new sampling distributions is quite more involved and involves working with noncentral distributions

Binyamin's Notes

Explorer

Drawing Inferences

Table of Contents