The normal distribution is pretty rad. We’ve already talked about the usefulness of -tests for Hypothesis Testing.md
The -tests we’ve considered so far have required us knowing the true variance . But what if we don’t know this?
The typical MLE estimator is
The logical question is whether we can then use for our decision rules
When is large, we can basically treat them the same. However, when is smaller, changing to makes a difference.
William Sealy Gossett graduated from Oxford in 1899 with First Class degrees in chemistry and mathematics. He took a position at Guinness and was tasked with making their recipes more scientific. This task had inherently small sample sizes and he became convinced that had a different pdf. He derived the proper pdf and published it anonymously (under the name Student) in 1908, since Guinness forbid employees from publishing papers.
It took a while for anyone to realize the importance of this work. We now recognize it as the Student t distribution
In deriving this distribution, we will encounter several other sampling distributions, distributions that model the behavior of functions based on sets of random variables, used for inference
Theorem: where is a standard normal RV has a gamma distribution with and .
Consider ,
This is a gamma pdf with ,
The sum of gamma pdfs like this has ,
Definition: The pdf of is called the chi square distribution with m degrees of freedom
Theorem: and are independent and has a chi square distribution with degrees of freedom
Definition: Suppose that and are independent chi square random variables with and degrees of freedom. is said to have an F distribution with m and n degrees of freedom
commemorates Sir Ronald Fisher (also involved with the Student distribution)
Theorem: ,
We derive this with two equations,
and
Definition: Let be a standard normal random variable and let be a chi square random variable independent of with degrees of freedom. The Student t ratio with n degrees of freedom is
We often abbreviate degrees of freedom as df
Theorem: The pdf for a Student random variable with degrees of freedom is for . This is often denoted as .
We use (an distribution) to derive the pdf
Theorem:
The proof is simple,
Both and are bell shaped and symmetric around zero. is flatter and has thicker tails. As increases, approaches .
Drawing Inferences
Now we can draw inferences about when is not known
Theorem: Let be a randoms ample from a normal distribution with unknown mean . A confidence interval for is
The procedure for testing for unknown is called the one-sampled t test
Theorem: Let be a random sample from a normal distribution with unknown . . Let
- Accept if
- Accept if
- Accept if or
We prove this by showing that is a monotonic function of , satisfying GLRT
Of course, tests make the assumption that our samples are normally distributed. However,
- The distribution of is relatively unaffected by the pdf of , provided is not too skewed and is not too small
- As increases, becomes increasingly similar to
This is awesome. Our test is robust, meaning it is not heavily dependent on its assumptions. Departures from normality are acceptable.
Sometimes we’d like to estimate instead of . We now have the tools to do this.
has a chi square distribution with df
Theorem: Let denote the sample variance from observations drawn from a normal distribution. A confidence interval for is
We can create a corresponding decision test
Theorem: Let be the sample variance from observations drawn from a normal distribution. . Let .
- Accept if
- Accept if
- Accept if or
Working with Type II error under these new sampling distributions is quite more involved and involves working with noncentral distributions