Conditional Expectation

Suppose $X$ and $Y$ are discrete random variables, $P(Y = y) > 0$, and $E[|X|] < \infty$.
We define
$$E[X \mid Y = y] = \sum_x x\, P(X = x \mid Y = y)$$

If $X, Y$ have a joint probability density function $f(x, y)$, and $f_Y(y) = \int f(x, y)\, dx > 0$,

We can easily define continuous analogues,
$$f_{X \mid Y}(x \mid y) = \frac{f(x, y)}{f_Y(y)},$$

And
$$E[X \mid Y = y] = \int x\, f_{X \mid Y}(x \mid y)\, dx$$

The conditional expectation $E[X \mid Y]$ is the random variable taking the value $E[X \mid Y = y]$ on the event $\{Y = y\}$. It is characterized by two properties,

  1. $E[X \mid Y]$ depends only on the value of $Y$ (we can write it as $\phi(Y)$ for some function $\phi$). We call such a random variable measurable with respect to $Y$.
  2. Suppose $A$ is an event that depends only on $Y$ and $1_A$ is the indicator function of $A$. Then $E[\,E[X \mid Y]\, 1_A\,] = E[X\, 1_A]$.

Measure-theoretic treatments of probability define conditional expectation as the unique random variable satisfying these two properties

As shorthand, we write $E[X \mid \mathcal{F}]$, where $\mathcal{F}$ denotes the information contained in $Y$

Taking $A$ to be the entire sample space, this gives us
$$E[\,E[X \mid Y]\,] = E[X]$$
And, by linearity,
$$E[aX_1 + bX_2 \mid Y] = a\,E[X_1 \mid Y] + b\,E[X_2 \mid Y]$$

If $X$ is a function of $Y$, then $E[X \mid Y] = X$

For any $Y_1, Y_2$, if the information in $Y_1$ is contained in the information in $Y_2$, then
$$E[\,E[X \mid Y_2] \mid Y_1\,] = E[X \mid Y_1] \qquad \text{(the tower property)}$$

If $X$ is independent of $Y$, then $E[X \mid Y] = E[X]$

If $X$ is any random variable and $Z$ is a random variable that is measurable with respect to $Y$, then
$$E[ZX \mid Y] = Z\, E[X \mid Y]$$

Example:
Suppose $X_1, X_2, \ldots$ are i.i.d. with $E[X_j] = \mu$ and let $S_n = X_1 + \cdots + X_n$.
Let $\mathcal{F}_m$ denote the information in $X_1, \ldots, X_m$ and suppose $m < n$.

$$E[S_n \mid \mathcal{F}_m] = E[S_m \mid \mathcal{F}_m] + E[S_n - S_m \mid \mathcal{F}_m]$$
$$= S_m + E[S_n - S_m]$$
$$= S_m + (n - m)\mu$$

So $E[S_n \mid \mathcal{F}_m] = S_m + (n - m)\mu$
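This identity is easy to sanity-check numerically: fix one realization of the first $m$ steps and resample only the continuation. A minimal Monte Carlo sketch, where the die-valued step distribution (uniform on $\{1,\ldots,6\}$, so $\mu = 3.5$) is an illustrative assumption, not from the notes:

```python
import random

# Check E[S_n | F_m] = S_m + (n - m) * mu by fixing X_1..X_m once
# and resampling only the "future" steps X_{m+1}..X_n.
random.seed(0)

m, n, mu = 5, 20, 3.5
prefix = [random.randint(1, 6) for _ in range(m)]   # one fixed realization
S_m = sum(prefix)

trials = 200_000
total = 0.0
for _ in range(trials):
    total += S_m + sum(random.randint(1, 6) for _ in range(n - m))
estimate = total / trials

print(S_m + (n - m) * mu)      # theoretical conditional expectation
print(round(estimate, 1))      # Monte Carlo estimate, should agree closely
```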

Example:
Assume the same as above, but with $\mu = 0$ and $E[X_j^2] = \sigma^2$. For $m < n$,

$$E[S_n^2 \mid \mathcal{F}_m] = E[(S_m + (S_n - S_m))^2 \mid \mathcal{F}_m]$$
$$= S_m^2 + 2 S_m\, E[S_n - S_m \mid \mathcal{F}_m] + E[(S_n - S_m)^2 \mid \mathcal{F}_m]$$
$$= S_m^2 + 0 + (n - m)\sigma^2$$

Altogether, $E[S_n^2 \mid \mathcal{F}_m] = S_m^2 + (n - m)\sigma^2$

Example:
Consider the first example where $X_j$ is Bernoulli and we condition on $S_n$ rather than on the individual variables. By symmetry, $E[X_1 \mid S_n] = E[X_2 \mid S_n] = \cdots = E[X_n \mid S_n]$, and these sum to $E[S_n \mid S_n] = S_n$, so $E[X_1 \mid S_n] = S_n / n$


Consider $E[X \mid \mathcal{F}]$ where $\mathcal{F}$ is derived from an infinite collection of random variables $X_1, X_2, \ldots$
Let $\mathcal{F}$ denote the information in $X_1, X_2, \ldots$
$Y$ is $\mathcal{F}$-measurable if knowledge of $X_1, X_2, \ldots$ determines $Y$, i.e. $Y = \phi(X_{j_1}, \ldots, X_{j_k})$ for some function $\phi$ and some finite subcollection, or if $Y$ is a limit of such random variables

As an example, say $X_1, X_2, \ldots$ are independent random variables of the form $X_j = \theta + Z_j$, where the $Z_j$ are i.i.d. with the standard normal distribution while $\theta$ is unknown; define $Y = \theta$, and let $\mathcal{F}$ denote the information in $X_1, X_2, \ldots$

One cannot determine $Y$ given any finite set $X_1, \ldots, X_n$; however, $Y$ is $\mathcal{F}$-measurable since
$$Y = \lim_{n \to \infty} \frac{X_1 + \cdots + X_n}{n}$$

An event $A$ is $\mathcal{F}$-measurable if $1_A$ is an $\mathcal{F}$-measurable random variable
$E[X \mid \mathcal{F}]$ is defined as the unique $\mathcal{F}$-measurable random variable $Y$ such that $E[Y 1_A] = E[X 1_A]$ for all $\mathcal{F}$-measurable events $A$

I.e.,
$$E[\,E[X \mid \mathcal{F}]\, 1_A\,] = E[X 1_A]$$

Martingales

A martingale is a model of a fair game

We let $\{\mathcal{F}_n\}$ denote an increasing collection of information, e.g. the information in a collection of random variables $X_1, \ldots, X_n$, such that $\mathcal{F}_m \subset \mathcal{F}_n$ if $m \le n$, meaning we don't lose information

The increasing sequence $\mathcal{F}_0 \subset \mathcal{F}_1 \subset \cdots$ is known as a filtration

A sequence of random variables $M_0, M_1, \ldots$ with $E[|M_n|] < \infty$ is a martingale with respect to $\{\mathcal{F}_n\}$ if

  1. Each $M_n$ is measurable with respect to $\mathcal{F}_n$, and
  2. $E[M_{n+1} \mid \mathcal{F}_n] = M_n$ for all $n$

The second condition is equivalent to $E[M_n \mid \mathcal{F}_m] = M_m$ for all $m < n$, and can be proven by showing
$$E[M_n \mid \mathcal{F}_m] = E[\,E[M_n \mid \mathcal{F}_{n-1}] \mid \mathcal{F}_m\,] = E[M_{n-1} \mid \mathcal{F}_m] = \cdots = M_m$$

Example:
$S_n = X_1 + \cdots + X_n$, where the $X_j$ are i.i.d. with $E[X_j] = 0$, is a martingale with respect to $\mathcal{F}_n$, the information in $X_1, \ldots, X_n$

Example:
Suppose $X_1, X_2, \ldots$ are independent with
$$P(X_j = 1) = P(X_j = -1) = \tfrac{1}{2}$$

Say we double our bet every instance and stop when we win, and let $W_n$ denote the winnings or losses up through $n$ flips of the coin, $W_0 = 0$

If the first $n$ flips have turned up tails,
$$W_n = -(1 + 2 + \cdots + 2^{n-1}) = -(2^n - 1)$$

We can verify $E[W_{n+1} \mid \mathcal{F}_n] = W_n$, so $W_n$ is a martingale with respect to $\mathcal{F}_n$

We can generalize this, such that we make a bet $B_n$ on the $n$th flip, where $B_n$ is measurable with respect to $\mathcal{F}_{n-1}$, and
$$W_n = W_{n-1} + B_n X_n$$

We can let $B_n$ be negative, corresponding to betting the coin will come up tails

$$E[W_n \mid \mathcal{F}_{n-1}] = W_{n-1} + B_n\, E[X_n] = W_{n-1}$$

So the general form is also a martingale
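The doubling strategy above is easy to simulate. A minimal sketch (the trial count is an arbitrary choice) estimating $E[W_n]$, which should be 0 for every fixed $n$ even though the strategy wins 1 with high probability:

```python
import random

# Doubling ("martingale") betting on fair coin flips: bet 1, double
# after each loss, stop after the first win. W_n = wealth after n flips.
random.seed(1)

def winnings(n_flips):
    bet, wealth = 1, 0
    for _ in range(n_flips):
        if random.random() < 0.5:   # heads: win the current bet, stop betting
            return wealth + bet
        wealth -= bet               # tails: lose the bet and double it
        bet *= 2
    return wealth                   # n straight tails: wealth = -(2^n - 1)

n, trials = 10, 300_000
avg = sum(winnings(n) for _ in range(trials)) / trials
print(round(avg, 2))   # near 0: the rare large loss cancels the frequent +1
```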

Example:
Consider an urn with a red ball and a green ball. Every time one draws a ball (uniformly at random), it is returned along with another ball of the same color. Let $R_n$ denote the number of red balls in the urn after $n$ draws, so $R_0 = 1$ and the urn holds $n + 2$ balls after $n$ draws

This is a Markov chain,
$$P(R_{n+1} = k + 1 \mid R_n = k) = \frac{k}{n+2}, \qquad P(R_{n+1} = k \mid R_n = k) = 1 - \frac{k}{n+2}$$

$M_n = R_n / (n+2)$, the fraction of red balls, is a martingale

$$E[R_{n+1} \mid \mathcal{F}_n] = R_n \left(1 + \frac{1}{n+2}\right) = R_n\, \frac{n+3}{n+2}, \qquad E[M_{n+1} \mid \mathcal{F}_n] = \frac{E[R_{n+1} \mid \mathcal{F}_n]}{n+3} = \frac{R_n}{n+2} = M_n$$

Since this is a Markov chain, all relevant information in $\mathcal{F}_n$ is contained in $R_n$,
$$E[M_{n+1} \mid \mathcal{F}_n] = E[M_{n+1} \mid R_n]$$
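A short simulation of the urn confirms the martingale property in expectation, i.e. $E[M_n] = E[M_0] = 1/2$ for all $n$ (the horizon and trial count are arbitrary choices):

```python
import random

# Polya's urn: start with 1 red and 1 green ball; each draw returns the
# ball plus one more of the same color. M_n = fraction of red balls.
random.seed(2)

def red_fraction(n_draws):
    red, total = 1, 2
    for _ in range(n_draws):
        if random.random() < red / total:   # drew a red ball
            red += 1
        total += 1
    return red / total

n, trials = 50, 50_000
avg = sum(red_fraction(n) for _ in range(trials)) / trials
print(round(avg, 3))   # near E[M_50] = 1/2
```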


$M_n$ is called a submartingale if $E[M_{n+1} \mid \mathcal{F}_n] \ge M_n$ and a supermartingale if $E[M_{n+1} \mid \mathcal{F}_n] \le M_n$

Example:
Let $X_n$ be a finite Markov chain with payoff function $f$, and let $v(x) = E[f(X_T) \mid X_0 = x]$ where $T$ is the optimal stopping rule

$v(X_n)$ is a supermartingale with respect to $\mathcal{F}_n$

Optional Sampling Theorem

The optional sampling theorem states that you cannot beat a fair game; however, it's a bit subtle

We say $T$ is a stopping time with respect to $\{\mathcal{F}_n\}$ if for each $n$, the event $\{T \le n\}$ is measurable with respect to $\mathcal{F}_n$

Under certain conditions, $E[M_T] = E[M_0]$

Suppose $M_n$ is a martingale with respect to $\mathcal{F}_n$ and suppose $T$ is a stopping time which is bounded, say $T \le K$. Then $E[M_T] = E[M_0]$

$$E[M_T] = E[M_T 1\{T \le K - 1\}] + E[M_K 1\{T = K\}]$$

The event $\{T = K\} = \{T \le K - 1\}^c$ is $\mathcal{F}_{K-1}$-measurable, so
$$E[M_K 1\{T = K\}] = E[\,E[M_K \mid \mathcal{F}_{K-1}]\, 1\{T = K\}\,] = E[M_{K-1} 1\{T = K\}]$$

Putting it together, we get $E[M_T] = E[M_{T \wedge (K-1)}]$

We can do this argument again conditioning on $\mathcal{F}_{K-2}$ to get $E[M_T] = E[M_{T \wedge (K-2)}]$

This can be iterated until we arrive at $E[M_T] = E[M_{T \wedge 0}] = E[M_0]$

However, $T$ is not always bounded, such as in the betting game we first introduced. So when is the optional sampling theorem valid?

Consider
$$E[M_T] = E[M_{T \wedge n}] + E[M_T 1\{T > n\}] - E[M_n 1\{T > n\}]$$

$E[M_{T \wedge n}] = E[M_0]$ since $T \wedge n$ is bounded, but what about the other terms? The first one approaches 0 essentially because $P(T > n)$ approaches 0 as $n \to \infty$.

So the problematic term for our nice result is $E[M_n 1\{T > n\}]$, which does not necessarily approach 0. In our doubling betting example, $M_n 1\{T > n\} = -(2^n - 1)$ and $P(T > n) = 2^{-n}$, so we get $E[M_n 1\{T > n\}] = -(2^n - 1)2^{-n} \to -1$, which does not approach 0.

Optional Sampling Theorem: Suppose $M_n$ is a martingale with respect to $\mathcal{F}_n$ and $T$ is a stopping time satisfying $P(T < \infty) = 1$, $E[|M_T|] < \infty$, and $\lim_{n \to \infty} E[|M_n| 1\{T > n\}] = 0$. Then $E[M_T] = E[M_0]$.

Example:
Let $S_n$ be a simple random walk on $\{0, 1, \ldots, N\}$ with absorbing boundaries, let $S_0 = x$, and let $T = \min\{n : S_n \in \{0, N\}\}$

Since $S_{T \wedge n}$ is bounded, we know immediately $E[|S_T|] < \infty$ and $E[|S_n| 1\{T > n\}] \le N\, P(T > n) \to 0$

Therefore,
$$x = E[S_T] = N \cdot P(S_T = N), \qquad P(S_T = N) = \frac{x}{N}$$

Now let $M_n = S_n^2 - n$
$$E[M_{n+1} \mid \mathcal{F}_n] = S_n^2 + 1 - (n + 1) = M_n$$
(using the second example above), so this is a martingale

Using the same $T$ as before, $M_n$ is no longer bounded; however, we can show there exist $C < \infty$ and $\rho < 1$ such that $P(T > n) \le C \rho^n$

Since $|M_n| \le N^2 + n$, we can show $E[|M_T|] < \infty$ and
$$E[|M_n| 1\{T > n\}] \le (N^2 + n)\, C \rho^n \to 0$$

Hence,
$$E[S_T^2] - E[T] = E[M_T] = E[M_0] = x^2$$

Since $E[S_T^2] = N^2 \cdot \frac{x}{N} = Nx$, we see $E[T] = Nx - x^2 = x(N - x)$
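Both optional-sampling conclusions are easy to check by simulation. A sketch with the illustrative choices $x = 3$, $N = 10$ (so $P(S_T = N) = 0.3$ and $E[T] = 21$):

```python
import random

# Gambler's ruin: simple random walk on {0,...,N}, absorbed at 0 and N.
random.seed(3)

def run(x, N):
    pos, steps = x, 0
    while 0 < pos < N:
        pos += 1 if random.random() < 0.5 else -1
        steps += 1
    return pos, steps

x, N, trials = 3, 10, 50_000
hits, total_T = 0, 0
for _ in range(trials):
    end, steps = run(x, N)
    hits += (end == N)
    total_T += steps

p_hat = hits / trials    # estimates P(S_T = N) = x/N = 0.3
t_hat = total_T / trials # estimates E[T] = x(N - x) = 21
print(round(p_hat, 3), round(t_hat, 2))
```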

Uniform Integrability

The condition $\lim_{n \to \infty} E[|M_n| 1\{T > n\}] = 0$ is hard to verify, so we'd like to find some easier conditions

Suppose we have random variables $X_1, X_2, \ldots$ with $E[|X_n|] < \infty$

We say $\{X_n\}$ is uniformly integrable if for every $\epsilon > 0$ there is some $\delta > 0$ with $E[|X_n| 1_A] < \epsilon$ for each $n$ whenever $P(A) < \delta$, where $\delta$ depends on $\epsilon$ (not $n$)

If $\{X_n\}$ is uniformly integrable then for every $\epsilon > 0$ there is some $K < \infty$ where $E[|X_n| 1\{|X_n| \ge K\}] \le \epsilon$ for each $n$

To show this, let $K = C/\delta$ where $C = \sup_n E[|X_n|]$; so if $A = \{|X_n| \ge K\}$ then $P(A) \le E[|X_n|]/K \le \delta$

Example:
Consider our martingale betting strategy, with random variables $W_n$, and the event $A_n = \{T > n\}$ (the first $n$ flips are tails)

$$P(A_n) = 2^{-n} \to 0$$
$$E[|W_n| 1_{A_n}] = (2^n - 1)\, 2^{-n} \to 1$$

This cannot satisfy the conditions for uniform integrability for any $\delta$

Now suppose $M_n$ is a uniformly integrable martingale (with respect to $\mathcal{F}_n$) and $T$ is a stopping time with $P(T < \infty) = 1$

We have $P(T > n) \to 0$, so given $\epsilon > 0$, $P(T > n) < \delta$ for all large $n$, hence $E[|M_n| 1\{T > n\}] < \epsilon$
So we therefore have $\lim_{n \to \infty} E[|M_n| 1\{T > n\}] = 0$

Optional Sampling Theorem (again): Suppose $M_n$ is a uniformly integrable martingale with respect to $\mathcal{F}_n$ and $T$ is a stopping time with $P(T < \infty) = 1$ and $E[|M_T|] < \infty$. Then $E[M_T] = E[M_0]$

If $X_n$ is a sequence of random variables and there is $C < \infty$ such that $E[X_n^2] \le C$ for each $n$, then the sequence is uniformly integrable

To show this with our previous definition, let $\delta = \epsilon^2 / C$ and suppose $P(A) < \delta$. By the Cauchy–Schwarz inequality,
$$E[|X_n| 1_A] \le \sqrt{E[X_n^2]\, P(A)} \le \sqrt{C \delta} = \epsilon$$

Example:
What if we set the signs of the harmonic series randomly? Let
$$M_n = \sum_{j=1}^n \frac{X_j}{j}, \qquad P(X_j = 1) = P(X_j = -1) = \tfrac{1}{2}$$

This is a martingale, since each term has mean 0

$$E[M_n^2] = \sum_{j=1}^n \frac{1}{j^2} \le \frac{\pi^2}{6} < \infty$$

So this is uniformly integrable
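The uniform second-moment bound can be checked empirically; a sketch (truncation depth and trial count are arbitrary choices) estimating $E[M_n^2]$ against the bound $\pi^2/6$:

```python
import random
import math

# Random harmonic series M_n = sum_{j<=n} X_j / j with X_j = +-1.
# Its second moment sum_{j<=n} 1/j^2 stays below pi^2/6 for every n.
random.seed(4)

def sample_M(n):
    return sum((1 if random.random() < 0.5 else -1) / j
               for j in range(1, n + 1))

n, trials = 400, 10_000
second_moment = sum(sample_M(n) ** 2 for _ in range(trials)) / trials
bound = math.pi ** 2 / 6
print(round(second_moment, 3), round(bound, 3))
```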

Martingale Convergence Theorem

This theorem states when a martingale $M_n$ converges to a limiting random variable $M_\infty$. Let's use Polya's urn as an example

Let $M_n$ be the fraction of red balls, suppose that $M_0 = a < b$, let $T = \min\{n : M_n \ge b\}$, and note that $M_{T \wedge n}$ is a bounded martingale
For each $n$, the optional sampling theorem says $E[M_{T \wedge n}] = a$
But $E[M_{T \wedge n}] \ge b\, P(T \le n)$
So $P(T \le n) \le a/b$

This is true for all $n$, so $P(T < \infty) \le a/b$
In other words, with probability at least $1 - a/b$ the proportion of red balls never gets as high as $b$
If it does go up to $b$ (with probability at most $a/b$), the same argument applied to the fraction of green balls says it'll drop back down to $a$ with probability at most $\frac{1-b}{1-a}$

The probability this continues happening forever is $\lim_{k \to \infty} \left[\frac{a}{b} \cdot \frac{1-b}{1-a}\right]^k = 0$
This tells us the proportion cannot fluctuate infinitely often between any two numbers $a < b$, i.e. $M_\infty = \lim_{n \to \infty} M_n$ exists
It happens that for this setup we can also prove directly that the limit has a uniform distribution

Martingale Convergence Theorem: Suppose $M_n$ is a martingale with respect to $\mathcal{F}_n$ such that there exists $C < \infty$ with $E[|M_n|] \le C$ for all $n$. Then there exists a random variable $M_\infty$ such that $M_n \to M_\infty$ with probability one.

Fix $a < b$ and think of $M_n$ as representing the cumulative results of a fair game
Whenever $M_n \le a$, keep betting 1 until $M_n \ge b$, then stop until $M_n \le a$ again

Our winnings are then $W_n = \sum_{j=1}^n B_j (M_j - M_{j-1})$, where $B_j$ is 0 or 1 and is $\mathcal{F}_{j-1}$-measurable

Note that $W_n$ is a martingale and $W_n \ge (b - a)\, U_n - (M_n - a)^-$, where $U_n$ denotes the number of "upcrossings" (passes between $a$ and $b$) completed by time $n$

$$0 = E[W_n] \ge (b - a)\, E[U_n] - E[(M_n - a)^-]$$
$$E[U_n] \le \frac{E[(M_n - a)^-]}{b - a} \le \frac{C + |a|}{b - a}$$

This holds for all $n$, so the expected number of upcrossings is bounded, meaning the number of upcrossings is finite with probability one

Note that the martingale property does not imply that $E[M_\infty] = E[M_0]$, such as in the martingale betting strategy, where $W_\infty = 1$ but $E[W_n] = 0$ for every $n$

If $M_n$ is a uniformly integrable martingale then $M_\infty = \lim_{n \to \infty} M_n$ exists and $E[M_\infty] = E[M_0]$

Example:
Let $M_n$ be the proportion of red balls in Polya's urn and suppose that at time $0$ there are $r$ red balls and $g$ green balls. Since $M_n$ is bounded, it is uniformly integrable, and $E[M_n]$ approaches $E[M_\infty]$ with $E[M_\infty] = E[M_0] = \frac{r}{r+g}$

Example:
Let $X_1, X_2, \ldots$ be independent random variables with
$$P(X_j = 2) = P(X_j = 0) = \tfrac{1}{2}$$

Let $M_0 = 1$ and for $n \ge 1$, let $M_n = X_1 X_2 \cdots X_n$
Note that $E[M_{n+1} \mid \mathcal{F}_n] = M_n\, E[X_{n+1}] = M_n$

Therefore, $M_n$ is a martingale with respect to $\mathcal{F}_n$
Since $E[|M_n|] = 1$, the conditions of the martingale convergence theorem hold and $M_n \to M_\infty$ for some $M_\infty$

Is $M_n$ uniformly integrable?

$$P(M_n = 2^n) = 2^{-n}, \qquad P(M_n = 0) = 1 - 2^{-n}$$

Therefore $M_\infty = 0$ with probability one

$E[M_\infty] = 0 \ne 1 = E[M_0]$, i.e. $M_n$ is not uniformly integrable
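No sampling is needed here; the distribution of $M_n$ can be tabulated exactly, making the failure of uniform integrability visible: the mean stays 1 while the probability of being nonzero vanishes.

```python
# Product martingale M_n = X_1 * ... * X_n with P(X_j = 2) = P(X_j = 0) = 1/2.
# M_n = 2^n with probability 2^{-n}, otherwise 0, so E[M_n] = 1 for all n
# while M_n -> 0 with probability one.
for n in [1, 5, 20]:
    p_nonzero = 0.5 ** n          # P(M_n = 2^n), shrinks to 0
    mean = (2 ** n) * p_nonzero   # E[M_n], always exactly 1
    print(n, p_nonzero, mean)
```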

Example:
A normal problem in statistics is estimating the mean $\theta$ of a distribution given i.i.d. samples $X_1, X_2, \ldots$

In Bayesian statistics, the parameter $\theta$ is taken to be a random variable drawn from a prior distribution

Under the prior distribution, $\theta$ has a given density; let $\mathcal{F}_n$ denote the information in $X_1, \ldots, X_n$ and $M_n = E[\theta \mid \mathcal{F}_n]$

$M_n$ is a martingale and the conditional distribution of $\theta$ given $\mathcal{F}_n$ is called the posterior distribution

We also know $M_n \to M_\infty$
The strong law of large numbers says $\frac{X_1 + \cdots + X_n}{n} \to \theta$
So $M_\infty = \theta$

Assume these samples are from a Bernoulli distribution with $\theta$ uniform on $[0, 1]$. Given $S_n = X_1 + \cdots + X_n = k$, the posterior density is proportional to $\theta^k (1 - \theta)^{n-k}$

This is the beta distribution with parameters $k + 1$ and $n - k + 1$
The mean happens to be $M_n = \frac{k+1}{n+2}$
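The posterior mean $\frac{k+1}{n+2}$ can be verified by brute-force conditioning: sample $(\theta, \text{data})$ pairs from the joint distribution and average $\theta$ over the samples with exactly $k$ successes. The specific values $n = 6$, $k = 4$ are illustrative choices.

```python
import random

# Beta-Bernoulli check: uniform prior on theta, n Bernoulli(theta) flips.
# Estimate E[theta | S_n = k] by conditioning, compare to (k+1)/(n+2).
random.seed(5)

n, k = 6, 4
total, count = 0.0, 0
for _ in range(300_000):
    theta = random.random()                                   # uniform prior
    successes = sum(random.random() < theta for _ in range(n))
    if successes == k:                                        # keep only S_n = k
        total += theta
        count += 1

posterior_mean = total / count
print(round(posterior_mean, 3))   # exact answer: (k+1)/(n+2) = 5/8 = 0.625
```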


Maximal Inequalities

If $X_0, X_1, \ldots$ is a sequence of random variables, define the maximal processes as
$$\overline{X}_n = \max_{0 \le j \le n} X_j, \qquad \overline{X}_\infty = \sup_{j \ge 0} X_j$$

Maximal inequalities relate probabilities or expectations for $\overline{X}_n$ to those for $X_n$

Reflection Principle: Suppose $X_1, X_2, \ldots$ are independent random variables whose distributions are symmetric about the origin, and let $S_n = X_1 + \cdots + X_n$ and $\overline{S}_n = \max_{1 \le j \le n} S_j$. For every $a > 0$,
$$P(\overline{S}_n \ge a) \le 2\, P(S_n \ge a)$$

To prove this, let $T = \min\{j : S_j \ge a\}$

$$P(S_n \ge a) \ge \sum_{j=1}^n P(T = j)\, P(S_n - S_j \ge 0 \mid T = j)$$

The independence and symmetry of the $X_j$ shows that $P(S_n - S_j \ge 0 \mid T = j) \ge \frac{1}{2}$, so $P(S_n \ge a) \ge \frac{1}{2} P(T \le n) = \frac{1}{2} P(\overline{S}_n \ge a)$ (noting that $S_T$ can be greater than $a$ makes this easier to understand)

Doob’s Maximal Inequality: Suppose $M_n$ is a nonnegative submartingale with respect to $\mathcal{F}_n$. Then for every $a > 0$,
$$P(\overline{M}_n \ge a) \le \frac{E[M_n]}{a}$$

Let $A_j$ denote the $\mathcal{F}_j$-measurable event $\{T = j\}$ (same $T$ as before, with $M$ in place of $S$),
$$E[M_n] \ge \sum_{j=0}^n E[M_n 1_{A_j}] \ge \sum_{j=0}^n E[M_j 1_{A_j}] \ge a \sum_{j=0}^n P(A_j) = a\, P(\overline{M}_n \ge a)$$

So $P(\overline{M}_n \ge a) \le E[M_n]/a$

If $M_n$ is not necessarily nonnegative, we cannot immediately use the inequality, but there's still an extension

If $M_n$ is a martingale then $|M_n|$ is a submartingale,
$$E[|M_{n+1}| \mid \mathcal{F}_n] \ge |E[M_{n+1} \mid \mathcal{F}_n]| = |M_n|$$

Likewise, if $E[M_n^2] < \infty$, $M_n^2$ is a nonnegative submartingale, so
$$P\left(\max_{0 \le j \le n} M_j^2 \ge a^2\right) \le \frac{E[M_n^2]}{a^2}$$
for all $a > 0$

Doob’s Maximal Inequality (again): Suppose $M_n$ is a martingale with respect to $\mathcal{F}_n$. Then for every $a > 0$ and $n$,
$$P\left(\max_{0 \le j \le n} |M_j| \ge a\right) \le \frac{E[M_n^2]}{a^2}$$

Example:
Let $S_n$ denote a simple random walk in $\mathbb{Z}$ starting at the origin and let $\overline{S}_n = \max_{0 \le j \le n} |S_j|$

$S_n$ is a martingale so we have,
$$P(\overline{S}_n \ge a) \le \frac{E[S_n^2]}{a^2} = \frac{n}{a^2}$$

so the walk typically stays within distance of order $\sqrt{n}$ of the origin. Taylor series (via Stirling's formula) shows that
$$P(S_{2n} = 0) = \binom{2n}{n} 2^{-2n} \sim \frac{1}{\sqrt{\pi n}}$$

Hence,
$$\sum_{n=1}^\infty P(S_{2n} = 0) = \infty$$
so the expected number of visits to the origin is infinite, i.e. the random walk in $\mathbb{Z}$ is recurrent
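The $L^2$ maximal bound can be compared with simulation; with the illustrative choices $n = 100$, $a = 25$ below, the bound $n/a^2 = 0.16$ is loose but valid, and the observed frequency sits well under it.

```python
import random

# Compare P(max_{j<=n} |S_j| >= a) for simple random walk against
# Doob's bound E[S_n^2] / a^2 = n / a^2.
random.seed(6)

def max_abs(n):
    s, m = 0, 0
    for _ in range(n):
        s += 1 if random.random() < 0.5 else -1
        m = max(m, abs(s))
    return m

n, a, trials = 100, 25, 20_000
p_hat = sum(max_abs(n) >= a for _ in range(trials)) / trials
bound = n / a ** 2
print(round(p_hat, 4), round(bound, 4))   # observed frequency vs. 0.16 bound
```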