Hypothesis testing aims to choose between a null hypothesis $H_0$ and an alternative hypothesis $H_1$.
A function of the observed data that dictates the outcome of our hypothesis test is called a test statistic. The set of values that result in our null hypothesis’ rejection is the critical region, denoted $C$. The point separating $C$ from the acceptance region is the critical value.
The probability a test statistic rejects $H_0$ when $H_0$ is true is the level of significance, denoted $\alpha$. Values like $\alpha = 0.05$ and $\alpha = 0.01$ are common.
Theorem: Let $y_1, \ldots, y_n$ be a random sample from a normal distribution with known standard deviation $\sigma$. To test $H_0: \mu = \mu_0$ at the $\alpha$ level of significance, let $z = \frac{\bar{y} - \mu_0}{\sigma / \sqrt{n}}$.
- Accept $H_1: \mu > \mu_0$ if $z \geq z_{\alpha}$
- Accept $H_1: \mu < \mu_0$ if $z \leq -z_{\alpha}$
- Accept $H_1: \mu \neq \mu_0$ if $z \geq z_{\alpha/2}$ or $z \leq -z_{\alpha/2}$
An alternative way of formulating the same idea: the $P$-value of a test statistic is the probability of getting a value at least as extreme as the observed statistic, assuming $H_0$ is true.
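A minimal sketch of this test in Python (assuming `numpy` and `scipy`; the data and parameter values are invented for illustration):

```python
import numpy as np
from scipy.stats import norm

# Hypothetical data and parameters, for illustration only.
rng = np.random.default_rng(0)
mu_0, sigma, alpha = 50.0, 10.0, 0.05
y = rng.normal(loc=53.0, scale=sigma, size=40)  # true mean differs from mu_0

# Test statistic: z = (ybar - mu_0) / (sigma / sqrt(n))
n = len(y)
z = (y.mean() - mu_0) / (sigma / np.sqrt(n))

# Two-sided rule: accept H1 if |z| >= z_{alpha/2}
z_crit = norm.ppf(1 - alpha / 2)  # inverse CDF of the standard normal
p_value = 2 * norm.sf(abs(z))     # P-value: P(|Z| >= |z|) assuming H0
print(f"z = {z:.3f}, critical value = {z_crit:.3f}, P-value = {p_value:.4f}")
print("reject H0" if abs(z) >= z_crit else "fail to reject H0")
```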
We can run binomial hypothesis tests much like the test above. When we have a “large” number of samples, we can approximate the binomial distribution with a normal distribution.
We use the condition $0 < np_0 - 3\sqrt{np_0(1 - p_0)}$ and $np_0 + 3\sqrt{np_0(1 - p_0)} < n$ to identify large samples. This holds when the range of 3 standard deviations around the mean falls within the valid values of the success count.
Theorem: Let $x_1, \ldots, x_n$ be a random sample of Bernoulli random variables for which the condition above holds, and let $k = \sum_{i=1}^{n} x_i$ be the number of successes. We use $z = \frac{k - np_0}{\sqrt{np_0(1 - p_0)}}$ and test as above.
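A sketch of the large-sample version, with invented counts (`scipy` assumed):

```python
import numpy as np
from scipy.stats import norm

# Hypothetical values, for illustration only.
n, k, p_0, alpha = 200, 116, 0.5, 0.05

# Large-sample check: n*p_0 +/- 3 standard deviations stays inside (0, n)
sd = np.sqrt(n * p_0 * (1 - p_0))
assert 0 < n * p_0 - 3 * sd and n * p_0 + 3 * sd < n

z = (k - n * p_0) / sd
p_value = 2 * norm.sf(abs(z))  # two-sided P-value
print(f"z = {z:.3f}, P-value = {p_value:.4f}")
```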
If our inequality doesn’t hold, we use the exact binomial distribution. Let $X \sim \text{Binomial}(n, p_0)$ and let $k$ be the observed number of successes.
- Accept $H_1: p < p_0$ if $P(X \leq k) \leq \alpha$
- Accept $H_1: p > p_0$ if $P(X \geq k) \leq \alpha$
- Accept $H_1: p \neq p_0$ if $P(X \leq k) \leq \alpha/2$ or $P(X \geq k) \leq \alpha/2$
This is equivalent to our previous theorems, but stated in terms of the binomial CDF and $\alpha$ instead of the critical value and $z_\alpha$ (which is the inverse-CDF of the standard normal).
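A sketch of the exact test, again with invented numbers:

```python
from scipy.stats import binom

# Hypothetical values, for illustration only.
n, k, p_0, alpha = 19, 14, 0.5, 0.05

lower = binom.cdf(k, n, p_0)     # P(X <= k) assuming H0
upper = binom.sf(k - 1, n, p_0)  # P(X >= k) assuming H0

# Two-sided rule: accept H1 if either tail probability is <= alpha / 2
print(f"P(X <= k) = {lower:.4f}, P(X >= k) = {upper:.4f}")
print("reject H0" if min(lower, upper) <= alpha / 2 else "fail to reject H0")
```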
Type I and II Errors
We already defined $\alpha$, the probability of incorrectly rejecting $H_0$. Call this the Type I error.
$\beta$, the probability of incorrectly accepting $H_0$, is called the Type II error.
$\beta$ is a function of the presumed true value of the parameter. If the true $\mu$ is really close to $\mu_0$, then $\beta$ will be high.
Definition: $1 - \beta$ is the power of a decision test, as a function of the parameter being tested. A power curve graphs this relation.
The power of a test diminishes as $\mu \to \mu_0$. At $\mu = \mu_0$, the power equals $\alpha$.
A steeper power curve is a stronger test.
The power of the test is a function of $\alpha$, $\sigma$, and $n$. We can improve our power by increasing $\alpha$, decreasing $\sigma$, or increasing $n$.
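A sketch of a power calculation for the two-sided $z$-test, with invented parameters; note that plugging in $\mu = \mu_0$ recovers $\alpha$:

```python
import numpy as np
from scipy.stats import norm

# Hypothetical parameters, for illustration only.
mu_0, sigma, n, alpha = 50.0, 10.0, 25, 0.05
z_crit = norm.ppf(1 - alpha / 2)

def power(mu):
    """P(reject H0) for the two-sided z-test when the true mean is mu."""
    shift = (mu - mu_0) / (sigma / np.sqrt(n))  # mean of Z under the true mu
    return norm.sf(z_crit - shift) + norm.cdf(-z_crit - shift)

for mu in [50.0, 52.0, 54.0, 56.0]:
    print(f"mu = {mu}: power = {power(mu):.3f}")  # mu = mu_0 gives alpha
```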
Example: Say we’d like to test $H_0: \mu = \mu_0$ versus $H_1: \mu > \mu_0$ with level $\alpha$ and power $1 - \beta$ when $\mu = \mu_1$. What is the smallest sample size that achieves this objective? Assume a normal distribution with known $\sigma$.
We solve this by writing a system of equations with the critical value $\bar{y}^*$: we need $P(\bar{Y} \geq \bar{y}^* \mid \mu = \mu_0) = \alpha$ and $P(\bar{Y} \geq \bar{y}^* \mid \mu = \mu_1) = 1 - \beta$. Standardizing both gives $\frac{\bar{y}^* - \mu_0}{\sigma/\sqrt{n}} = z_\alpha$ and $\frac{\bar{y}^* - \mu_1}{\sigma/\sqrt{n}} = -z_\beta$, so $n = \left(\frac{(z_\alpha + z_\beta)\sigma}{\mu_1 - \mu_0}\right)^2$, rounded up.
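A sketch that evaluates this formula for invented values of the parameters:

```python
import math
from scipy.stats import norm

# Hypothetical values, for illustration only.
mu_0, mu_1, sigma, alpha, beta = 50.0, 55.0, 10.0, 0.05, 0.10

# n = ((z_alpha + z_beta) * sigma / (mu_1 - mu_0))^2, rounded up
z_a, z_b = norm.ppf(1 - alpha), norm.ppf(1 - beta)
n = ((z_a + z_b) * sigma / (mu_1 - mu_0)) ** 2
print(f"smallest n = {math.ceil(n)}")
```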
Nonnormal Decision Rules
Decision rules on general pdfs work much the same.
To test $H_0: \theta = \theta_0$, we define a decision rule in terms of a sufficient statistic $\hat{\theta}$. The critical region $C$ is the set of values of $\hat{\theta}$ least compatible with $H_0$ but admissible under $H_1$, whose total probability when $H_0$ is true is $\alpha$.
Example: $f_Y(y; \theta) = 1/\theta$ for $0 \leq y \leq \theta$. We test $H_0: \theta = \theta_0$ versus $H_1: \theta < \theta_0$.
Suppose we base the decision rule on $Y_{\max}$, the largest order statistic. What’s the critical region when the level of significance is $\alpha$?
Some thought about the uniform distribution makes it clear we are looking for some value $y^*$ such that we reject $H_0$ when $Y_{\max} \leq y^*$.
$P(Y_{\max} \leq y^* \mid H_0) = \prod_{i=1}^{n} P(Y_i \leq y^*) = \left(\frac{y^*}{\theta_0}\right)^n = \alpha$, so $y^* = \theta_0 \alpha^{1/n}$.
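A quick check of this critical value by simulation, with $\theta_0$, $n$, and $\alpha$ invented; the estimated rejection rate under $H_0$ should be close to $\alpha$:

```python
import numpy as np

# Hypothetical values, for illustration only.
theta_0, n, alpha = 1.0, 10, 0.05
y_star = theta_0 * alpha ** (1 / n)  # critical value for Y_max

# Simulate under H0 and estimate P(Y_max <= y*)
rng = np.random.default_rng(0)
samples = rng.uniform(0, theta_0, size=(100_000, n))
print(f"y* = {y_star:.4f}")
print(f"estimated Type I error = {(samples.max(axis=1) <= y_star).mean():.4f}")
```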
The Generalized Likelihood Ratio
What is the best decision rule for choosing between $H_0$ and $H_1$, and how do we show it is optimal?
Define $\omega$ as the set of unknown parameter values admissible under $H_0$.
Define $\Omega$ as the set of all possible values of all unknown parameters.
Definition: Let $y_1, \ldots, y_n$ be a random sample from $f_Y(y; \theta)$. The generalized likelihood ratio $\lambda$ is
$$\lambda = \frac{\max_{\theta \in \omega} L(\theta)}{\max_{\theta \in \Omega} L(\theta)}$$
Maximizing under $\Omega$ (no restrictions) is accomplished by substituting the MLE $\hat{\theta}$ into $L$.
Values of $\lambda$ closer to 1 show that the data is more compatible with $H_0$.
Definition: A generalized likelihood ratio test (GLRT) rejects $H_0$ whenever $\lambda \leq \lambda^*$, where $P(\Lambda \leq \lambda^* \mid H_0 \text{ is true}) = \alpha$ ($\Lambda$ is $\lambda$ expressed as a random variable).
Expressed similarly, $\alpha = P(0 < \Lambda \leq \lambda^* \mid H_0 \text{ is true})$.
In many situations, the distribution of $\Lambda$ is not known, so we must show $\lambda$ is a monotonic function of a quantity $w$, where the distribution of $W$ is known.
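As a concrete illustration of this monotonicity, for a normal mean with known $\sigma$ the ratio works out to $\lambda = e^{-z^2/2}$, so the GLRT reduces to the $z$-test from earlier. A numeric sketch with invented data:

```python
import numpy as np

# Hypothetical data and parameters, for illustration only.
rng = np.random.default_rng(0)
mu_0, sigma = 50.0, 10.0
y = rng.normal(loc=53.0, scale=sigma, size=40)
n = len(y)

def log_likelihood(mu):
    return -np.sum((y - mu) ** 2) / (2 * sigma**2)  # constants dropped

# lambda = (max L over omega) / (max L over Omega); ybar is the MLE over Omega
lam = np.exp(log_likelihood(mu_0) - log_likelihood(y.mean()))

z = (y.mean() - mu_0) / (sigma / np.sqrt(n))
print(f"lambda = {lam:.6f}, exp(-z^2/2) = {np.exp(-(z**2) / 2):.6f}")  # equal
```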
Example: Say we work with the uniform distribution above and test $H_0: \theta = \theta_0$ versus $H_1: \theta \neq \theta_0$. Then $\omega = \{\theta_0\}$ and $\Omega = \{\theta : \theta > 0\}$.
We have $\max_{\Omega} L(\theta) = L(y_{\max}) = 1/y_{\max}^n$ (the MLE is $y_{\max}$) and $\max_{\omega} L(\theta) = 1/\theta_0^n$, provided $y_{\max} \leq \theta_0$ (otherwise the numerator is 0 and we reject outright). So $\lambda = (y_{\max}/\theta_0)^n$.
Instead of trying to solve for $\lambda^*$ directly, we take $w = \lambda^{1/n} = y_{\max}/\theta_0$ and find the distribution of $W$:
$$f_W(w) = n w^{n-1} \quad \text{for } 0 \leq w \leq 1$$
We conclude that the GLRT calls to reject $H_0$ if $y_{\max}/\theta_0 \leq \alpha^{1/n}$, since $P(W \leq \alpha^{1/n} \mid H_0) = \int_0^{\alpha^{1/n}} n w^{n-1}\, dw = \alpha$.
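A sketch of this GLRT applied to a made-up sample:

```python
import numpy as np

# Hypothetical values, for illustration only.
theta_0, n, alpha = 1.0, 10, 0.05
rng = np.random.default_rng(1)
y = rng.uniform(0, 0.75, size=n)  # true theta = 0.75, not theta_0

y_max = y.max()
lam = (y_max / theta_0) ** n if y_max <= theta_0 else 0.0
# Reject for small lambda: y_max outside [0, theta_0], or ratio below alpha^(1/n)
reject = y_max > theta_0 or y_max / theta_0 <= alpha ** (1 / n)
print(f"lambda = {lam:.4f}, reject H0: {reject}")
```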
Fancy!