The normal distribution is pretty rad. We’ve already talked about the usefulness of -tests for Hypothesis Testing.md

The -tests we’ve considered so far have required us knowing the true variance . But what if we don’t know this?

The typical MLE estimator is
The logical question is whether we can then use for our decision rules

When is large, we can basically treat them the same. However, when is smaller, changing to makes a difference.

William Sealy Gossett graduated from Oxford in 1899 with First Class degrees in chemistry and mathematics. He took a position at Guinness and was tasked with making their recipes more scientific. This task had inherently small sample sizes and he became convinced that had a different pdf. He derived the proper pdf and published it anonymously (under the name Student) in 1908, since Guinness forbid employees from publishing papers.

It took a while for anyone to realize the importance of this work. We now recognize it as the Student t distribution

In deriving this distribution, we will encounter several other sampling distributions, distributions that model the behavior of functions based on sets of random variables, used for inference

Theorem: where is a standard normal RV has a gamma distribution with and .

Consider ,


This is a gamma pdf with ,
The sum of gamma pdfs like this has ,

Definition: The pdf of is called the chi square distribution with m degrees of freedom

Theorem: and are independent and has a chi square distribution with degrees of freedom

Definition: Suppose that and are independent chi square random variables with and degrees of freedom. is said to have an F distribution with m and n degrees of freedom

commemorates Sir Ronald Fisher (also involved with the Student distribution)

Theorem: ,
We derive this with two equations,
and

Definition: Let be a standard normal random variable and let be a chi square random variable independent of with degrees of freedom. The Student t ratio with n degrees of freedom is

We often abbreviate degrees of freedom as df

Theorem: The pdf for a Student random variable with degrees of freedom is for . This is often denoted as .

We use (an distribution) to derive the pdf

Theorem:

The proof is simple,

Both and are bell shaped and symmetric around zero. is flatter and has thicker tails. As increases, approaches .

Drawing Inferences

Now we can draw inferences about when is not known

Theorem: Let be a randoms ample from a normal distribution with unknown mean . A confidence interval for is

The procedure for testing for unknown is called the one-sampled t test

Theorem: Let be a random sample from a normal distribution with unknown . . Let

  • Accept if
  • Accept if
  • Accept if or

We prove this by showing that is a monotonic function of , satisfying GLRT

Of course, tests make the assumption that our samples are normally distributed. However,

  1. The distribution of is relatively unaffected by the pdf of , provided is not too skewed and is not too small
  2. As increases, becomes increasingly similar to

This is awesome. Our test is robust, meaning it is not heavily dependent on its assumptions. Departures from normality are acceptable.

Sometimes we’d like to estimate instead of . We now have the tools to do this.

has a chi square distribution with df

Theorem: Let denote the sample variance from observations drawn from a normal distribution. A confidence interval for is

We can create a corresponding decision test

Theorem: Let be the sample variance from observations drawn from a normal distribution. . Let .

  • Accept if
  • Accept if
  • Accept if or

Working with Type II error under these new sampling distributions is quite more involved and involves working with noncentral distributions