Bisection Method
Suppose $f : [a, b] \to \mathbb{R}$ is continuous, and suppose we want to find $x^* \in [a, b]$ such that $f(x^*) = 0$.
For special $f$ we can get an exact solution in closed form, but this is the exception rather than the rule. In many cases, we can't even write $f$ in closed form and instead treat it as a black box that we can only evaluate pointwise.
To find a root in the general case, we will work with an algorithm that generates a sequence $x_0, x_1, x_2, \dots$ where $x_n \to x^*$ as $n \to \infty$, minimizing the error $|x_n - x^*|$.
In practice, we're looking to satisfy $|x_n - x^*| \le \epsilon_n$, where $\epsilon_n$ is a computable error bound.
We want an algorithm that is:
- Robust - always gives an approximate solution
- Accurate - gets closer to $x^*$ with more computation, with a computable error bound
- Efficient - reaches a given accuracy in as few evaluations of $f$ as possible
The bisection method is a good example algorithm (though it is rarely used in practice). It works because of the Intermediate Value Theorem: if $f$ is negative at one end of an interval and positive at the other, then $f$ must have a zero somewhere inside. We iteratively cut our search interval in half, much like binary search, until the length of the interval satisfies our error bound.
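As a concrete illustration, here is a minimal sketch of bisection in Python (the function name `bisect`, the tolerance parameter `tol`, and the example are our choices, not from the notes):

```python
def bisect(f, a, b, tol=1e-10):
    """Approximate a root of f on [a, b] by repeated halving.

    Assumes f is continuous and f(a) * f(b) <= 0, so the
    Intermediate Value Theorem guarantees a zero in [a, b].
    """
    if f(a) * f(b) > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    while (b - a) / 2 > tol:       # |midpoint - root| <= (b - a) / 2
        mid = (a + b) / 2
        if f(a) * f(mid) <= 0:     # sign change in the left half: keep [a, mid]
            b = mid
        else:                      # otherwise the zero is in [mid, b]
            a = mid
    return (a + b) / 2

# Example: sqrt(2) is the root of x^2 - 2 on [1, 2]
print(bisect(lambda x: x * x - 2, 1.0, 2.0))  # ~1.4142135623
```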
How do we prove this formally? We have a few useful properties to work with:
- $a_n \le a_{n+1} \le b_{n+1} \le b_n$ (the intervals are nested)
- $b_n - a_n = \frac{b - a}{2^n}$ (each step halves the interval)
- $f(a_n) f(b_n) \le 0$ (each interval retains a sign change)
It doesn't take much experimenting to come up with the bound $|x_n - x^*| \le \frac{b - a}{2^{n+1}}$, where $x_n = \frac{a_n + b_n}{2}$ is the midpoint of the $n$-th interval.
We'd like to prove this bound formally.
Theorem: Suppose $f : [a, b] \to \mathbb{R}$ is a continuous function and $f(a) f(b) \le 0$. Then the sequence $(x_n)$ defined by the bisection algorithm converges to a limit $x^*$ such that $f(x^*) = 0$, and $|x_n - x^*| \le \frac{b - a}{2^{n+1}}$.
To prove this, note that $(a_n)$ is bounded above (by $b$) and monotonically increasing, and $(b_n)$ is bounded below (by $a$) and monotonically decreasing,
and so both $\lim_{n \to \infty} a_n$ and $\lim_{n \to \infty} b_n$ exist. Since $b_n - a_n = \frac{b - a}{2^n} \to 0$, the two limits agree.
So let's define $x^* = \lim_{n \to \infty} a_n = \lim_{n \to \infty} b_n$.
The Pinching Theorem gives us $x_n \to x^*$, since $a_n \le x_n \le b_n$.
Since $f$ is continuous, we can use the inequalities $f(a_n) f(b_n) \le 0$ to show that $f(x^*)^2 = \lim_{n \to \infty} f(a_n) f(b_n) \le 0$,
so $f(x^*) = 0$.
To get the bound, note that $x^*$ lies in $[a_n, b_n]$ and $x_n$ is its midpoint, so $|x_n - x^*| \le \frac{1}{2}(b_n - a_n) = \frac{b - a}{2^{n+1}}$.
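As a quick sanity check of the bound (our example, mirroring the proof's $a_n$, $b_n$, and midpoint $x_n$, with $\sqrt{2}$ as the known root):

```python
import math

f = lambda x: x * x - 2          # root at sqrt(2)
a, b = 1.0, 2.0
root = math.sqrt(2)

for n in range(10):
    x_n = (a + b) / 2                    # midpoint of the n-th interval
    bound = (b - a) / 2                  # equals (b0 - a0) / 2^(n+1)
    print(f"n={n}: error={abs(x_n - root):.2e}  bound={bound:.2e}")
    if f(a) * f(x_n) <= 0:               # keep the half with the sign change
        b = x_n
    else:
        a = x_n
```

The printed error never exceeds the bound, and the bound halves at every step.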
Simple Iteration
Another way to find $x^*$ is by rewriting $f(x) = 0$ as $x = g(x)$ and finding a fixed point of $g$, i.e. a point $x^*$ with $g(x^*) = x^*$.
Theorem: Suppose $g : [a, b] \to [a, b]$ is continuous and a contraction ($|g(x) - g(y)| \le L |x - y|$ for some $L < 1$), then $g$ has a unique fixed point $x^* \in [a, b]$. Furthermore, the sequence defined by $x_{n+1} = g(x_n)$ converges to the unique fixed point $x^*$ as $n \to \infty$, with $|x_n - x^*| \le L^n |x_0 - x^*|$.
This method is robust, accurate, and efficient
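A minimal sketch of simple iteration in Python (the name `fixed_point`, the stopping rule, and the example map are our choices):

```python
import math

def fixed_point(g, x0, tol=1e-12, max_iter=200):
    """Iterate x_{n+1} = g(x_n) until successive iterates agree to tol."""
    x = x0
    for _ in range(max_iter):
        x_next = g(x)
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    raise RuntimeError("did not converge")

# g(x) = cos(x) maps [0, 1] into itself and |g'(x)| = |sin x| <= sin(1) < 1
# there, so it is a contraction with a unique fixed point x* ≈ 0.739085.
print(fixed_point(math.cos, 0.5))
```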
Sometimes the conditions of our contraction mapping theorem are hard to check, and we'd still like to know whether convergence is possible.
Theorem: Suppose $g$ is a continuous function, let $x^*$ be a fixed point of $g$, and assume that $g$ has a continuous derivative in some neighborhood of $x^*$ with $|g'(x^*)| < 1$. Then the sequence defined by $x_{n+1} = g(x_n)$ converges to $x^*$ as $n \to \infty$, provided that $x_0$ is sufficiently close to $x^*$.
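For instance (our example, not from the notes), $g(x) = e^{-x}$ has $|g'(x^*)| = e^{-x^*} \approx 0.567 < 1$ at its fixed point, so the local theorem guarantees convergence from nearby starting points without us having to exhibit a contraction interval:

```python
import math

g = lambda x: math.exp(-x)   # fixed point x* ≈ 0.567143 satisfies e^{-x*} = x*
x = 1.0                      # a "sufficiently close" starting point
for _ in range(60):
    x = g(x)
print(x, abs(g(x) - x))      # x ≈ 0.567143, residual near 0
```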
Newton’s Method
Newton's method is what we use in practice, because of its convergence rate: $x_{n+1} = x_n - \frac{f(x_n)}{f'(x_n)}$.
Where does this equation come from? You can gain an intuition for it geometrically: $x_{n+1}$ is where the tangent line to $f$ at $x_n$ crosses the $x$-axis. We can also derive it as a simple iteration.
Consider the iteration function $g(x) = x - \lambda(x) f(x)$, where $\lambda(x)$ is unknown and $\lambda(x) \ne 0$ (i.e. $g(x) = x$ exactly when $f(x) = 0$).
We'd like to pick $\lambda$ such that $g'(x^*) = 0$, yielding super-linear convergence. Differentiating and using $f(x^*) = 0$ gives $g'(x^*) = 1 - \lambda(x^*) f'(x^*)$, which vanishes when $\lambda(x^*) = \frac{1}{f'(x^*)}$.
So we hope that in a neighborhood of $x^*$ we can get away with using $\lambda(x) = \frac{1}{f'(x)}$, which is exactly Newton's method.
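A minimal Newton's method sketch in Python (the name `newton`, the stopping rule, and passing the derivative explicitly are our choices; a production version would guard against $f'(x_n) = 0$):

```python
def newton(f, fprime, x0, tol=1e-12, max_iter=50):
    """Newton iteration x_{n+1} = x_n - f(x_n) / f'(x_n)."""
    x = x0
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < tol:        # stop when the residual is tiny
            return x
        x = x - fx / fprime(x)   # follow the tangent line to the x-axis
    raise RuntimeError("did not converge")

# sqrt(2) as the root of x^2 - 2: the number of correct digits
# roughly doubles each step (quadratic convergence).
print(newton(lambda x: x * x - 2, lambda x: 2 * x, 1.0))
```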
Theorem: Assume $f \in C^2([a, b])$, $f(x^*) = 0$, $f'(x^*) \ne 0$, $f'(x) \ne 0$ for all $x \in I = [x^* - \delta, x^* + \delta]$, and $|x_0 - x^*| \le \delta$ where $\delta$ is small enough that $M\delta < 1$ for $M = \frac{\max_{x \in I} |f''(x)|}{2 \min_{x \in I} |f'(x)|}$. Then $x_n \to x^*$ as $n \to \infty$, where $|x_{n+1} - x^*| \le M |x_n - x^*|^2$, and the convergence is at least quadratic, and precisely quadratic if $f''(x^*) \ne 0$.
How do we make sense of it? Let's sketch the proof.
$|x_0 - x^*| \le \delta$ is true by assumption, so working by induction we need to show that $|x_n - x^*| \le \delta$ implies $|x_{n+1} - x^*| \le M |x_n - x^*|^2 \le \delta$.
We use Taylor's Theorem:
$$0 = f(x^*) = f(x_n) + (x^* - x_n) f'(x_n) + \frac{(x^* - x_n)^2}{2} f''(\xi_n)$$
for some $\xi_n$ between $x_n$ and $x^*$.
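To finish the step (our completion of the standard argument, under the assumptions above): dividing the Taylor expansion by $f'(x_n)$ and using $x_{n+1} = x_n - \frac{f(x_n)}{f'(x_n)}$ gives

$$x^* - x_{n+1} = -\frac{f''(\xi_n)}{2 f'(x_n)} (x^* - x_n)^2,$$

so $|x_{n+1} - x^*| \le M |x_n - x^*|^2 \le (M\delta)\,|x_n - x^*| \le \delta$. This closes the induction, shows $x_n \to x^*$ (the error shrinks by at least the factor $M\delta < 1$ each step), and gives the quadratic rate; the rate is precisely quadratic when $f''(x^*) \ne 0$, since the ratio of errors tends to $\frac{|f''(x^*)|}{2|f'(x^*)|} \ne 0$.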