Gaussian Elimination
Solving linear systems $Ax = b$ is really important; how do we do it quickly?
The most trivial way to solve is with our explicit formula for $A^{-1}$ (calculating determinants recursively by cofactor expansion), but this is uselessly slow ($O(n!)$ if done completely naively)
While the inverse method is nice in theory, Gaussian elimination (the manual method, named after the same dude but unrelated to the distribution) is faster
Let’s be precise about the properties of this algorithm
- It works for systems of equations with non-singular coefficient matrices
- We can achieve good accuracy by picking “stronger” pivots (larger in magnitude)
This algorithm runs in $O(n^3)$ and gives an exact solution (barring floating-point errors)
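A minimal sketch of the idea in Python (my own illustration, assuming NumPy; the function name and details are not from these notes): reduce the system to upper-triangular form with partial pivoting, then back-substitute.

```python
import numpy as np

def gaussian_elimination(A, b):
    """Solve Ax = b by Gaussian elimination with partial pivoting (O(n^3) sketch)."""
    A = A.astype(float)
    b = b.astype(float)
    n = len(b)

    for k in range(n - 1):
        # Partial pivoting: pick the "strongest" (largest-magnitude) pivot in column k.
        p = k + np.argmax(np.abs(A[k:, k]))
        if p != k:
            A[[k, p]] = A[[p, k]]
            b[[k, p]] = b[[p, k]]
        # Eliminate the entries below the pivot.
        for i in range(k + 1, n):
            m = A[i, k] / A[k, k]
            A[i, k:] -= m * A[k, k:]
            b[i] -= m * b[k]

    # Back substitution on the resulting upper-triangular system.
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (b[i] - A[i, i + 1:] @ x[i + 1:]) / A[i, i]
    return x
```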
Iterative Methods
Iterative methods are promising because they improve an approximation bit by bit, which gives flexible trade-offs between time and accuracy
Assume we pick $M$ such that $M$ is invertible and one can write $A = M - N$
We can then rewrite $Ax = b$ as $x = Tx + c$ where $T = M^{-1}N$ and $c = M^{-1}b$
This looks like a fixed-point problem, so we attempt the scheme $x^{(k+1)} = Tx^{(k)} + c$ and hope that the error decreases (a code sketch follows the list below)
- This scheme relies only on matrix-vector multiplication and addition
- If $A$ is sparse then we can do this faster
- This is only helpful if $M$ is cheap to invert
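As mentioned above, here is a sketch of the generic splitting scheme (the names `M`, `N`, `x0` are my own; the solve against $M$ stands in for a cheap application of $M^{-1}$):

```python
import numpy as np

def splitting_iteration(M, N, b, x0, tol=1e-10, max_iter=1000):
    """Iterate x_{k+1} = M^{-1}(N x_k + b) for the splitting A = M - N."""
    x = x0.astype(float)
    for _ in range(max_iter):
        # One cheap solve with M plus one matrix-vector product per step.
        x_new = np.linalg.solve(M, N @ x + b)
        if np.linalg.norm(x_new - x) < tol:
            return x_new
        x = x_new
    return x
```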
We can quickly derive $e^{(k+1)} = Te^{(k)}$ (where $e^{(k)} = x^{(k)} - x$), so we need to show $e^{(k)} = T^k e^{(0)} \to 0$ as $k \to \infty$
To do this, we need to first define a matrix norm
We note that all norms on $\mathbb{R}^n$ are equivalent:
Theorem: Let $\|\cdot\|_a$, $\|\cdot\|_b$ be norms for vectors in $\mathbb{R}^n$, then there exist constants $c_1, c_2 > 0$ such that $c_1\|x\|_b \le \|x\|_a \le c_2\|x\|_b$ for all $x \in \mathbb{R}^n$
Corollary: $x^{(k)} \to x$ as $k \to \infty$ in one norm if and only if it converges in every norm
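As a concrete instance (a standard fact, not from these notes): for every $x \in \mathbb{R}^n$,

$$\|x\|_\infty \le \|x\|_2 \le \sqrt{n}\,\|x\|_\infty,$$

so a sequence of vectors converges in the $2$-norm exactly when it converges in the $\infty$-norm.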
Example: the operator norm induced by a vector norm, $\|A\| = \max_{x \ne 0} \frac{\|Ax\|}{\|x\|}$
We can prove each norm property (positivity, homogeneity, the triangle inequality) relatively easily from the corresponding property of the underlying vector norm
We can further derive some useful identities for this norm, e.g. $\|Ax\| \le \|A\|\,\|x\|$, $\|AB\| \le \|A\|\,\|B\|$, and $\|A\|_2 = \sqrt{\lambda_{\max}(A^T A)}$, where $\lambda_{\max}(A^T A)$ is the largest eigenvalue of $A^T A$
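A quick numerical sanity check of the eigenvalue identity (my own sketch, assuming NumPy):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))

via_eig = np.sqrt(np.max(np.linalg.eigvalsh(A.T @ A)))  # sqrt(lambda_max(A^T A))
via_norm = np.linalg.norm(A, 2)                          # NumPy's induced 2-norm

print(via_eig, via_norm)  # agree up to floating-point error
```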
Now let’s build up what we need
Does $T^k \to 0$? If $T$ is diagonalizable, we can write $T = PDP^{-1}$ where $D$ is diagonal and contains the eigenvalues $\lambda_1, \dots, \lambda_n$, so $T^k = PD^kP^{-1} \to 0$ whenever $|\lambda_i| < 1$ for all $i$
I.e. at least linear convergence if the spectral radius $\rho(T) = \max_i |\lambda_i|$ satisfies $\rho(T) < 1$
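A small demonstration of this behaviour (the matrix and starting error are my own example):

```python
import numpy as np

T = np.array([[0.5, 0.2],
              [0.1, 0.4]])

rho = np.max(np.abs(np.linalg.eigvals(T)))  # spectral radius, < 1 here
print("rho(T) =", rho)

e = np.array([1.0, 1.0])                    # some initial error e_0
for k in range(5):
    e = T @ e                               # e_k = T^k e_0
    print(k + 1, np.linalg.norm(e))         # norms shrink geometrically
```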
To obtain our convergence theorem’s error bound, we use a few important properties,
- $\rho(T) \le \|T\|$ for any induced norm
- $I - T$ is invertible whenever $\rho(T) < 1$
- If $\|T\| < 1$, then $\|(I - T)^{-1}\| \le \frac{1}{1 - \|T\|}$
Deriving it from here is pretty simple, and I’m going to omit it
Theorem: Let $A = M - N$ where $M$ is non-singular and define $(x^{(k)})$ by $x^{(k+1)} = Tx^{(k)} + c$ where $T = M^{-1}N$ and $c = M^{-1}b$. Then $x^{(k)} \to x$ where $Ax = b$ (for any $x^{(0)}$) if and only if $\rho(T) < 1$. Furthermore, if $\|T\| < 1$ then $\|x - x^{(k)}\| \le \frac{\|T\|^k}{1 - \|T\|}\,\|x^{(1)} - x^{(0)}\|$.
As we’ve said, $\|e^{(k)}\| \le \|T\|^k\,\|e^{(0)}\|$, where $e^{(k)} = x^{(k)} - x$ is the error after $k$ steps
This is great because it gives us an error bound; however, we still require an easy-to-compute $M^{-1}$ for the scheme to be practical
One way begins by splitting $A = L + D + U$ (strictly lower triangular, diagonal, strictly upper triangular) and defines $M = D$, $N = -(L + U)$
This is the most trivial choice, since diagonal matrices are very easy to invert, and is called the Jacobi scheme: $x^{(k+1)} = D^{-1}\big(b - (L + U)x^{(k)}\big)$
However, we note that the order of the rows of $A$ affects the convergence of our method under the Jacobi scheme! This is bad
We can find that convergence of $x^{(k)}$ under the Jacobi scheme rests on strict diagonal dominance of $A$: the magnitude of each diagonal entry is greater than the sum of the magnitudes of the remaining entries in its row, i.e. $|a_{ii}| > \sum_{j \ne i} |a_{ij}|$ for all $i$
Theorem: Let $A$ be strictly diagonally dominant; then the Jacobi method for solving $Ax = b$ converges
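A sketch of the Jacobi scheme in Python (my own implementation and example system, assuming NumPy; not the exact code from class):

```python
import numpy as np

def jacobi(A, b, x0, tol=1e-10, max_iter=10_000):
    """Jacobi iteration: x_{k+1} = D^{-1}(b - (L + U) x_k)."""
    D = np.diag(A)          # diagonal entries of A
    R = A - np.diag(D)      # L + U: everything off the diagonal
    x = x0.astype(float)
    for _ in range(max_iter):
        x_new = (b - R @ x) / D
        if np.linalg.norm(x_new - x) < tol:
            return x_new
        x = x_new
    return x

# Example: a strictly diagonally dominant system, so the theorem guarantees convergence.
A = np.array([[4.0, 1.0, 1.0],
              [1.0, 5.0, 2.0],
              [1.0, 2.0, 6.0]])
b = np.array([6.0, 8.0, 9.0])
print(jacobi(A, b, np.zeros(3)))  # agrees with np.linalg.solve(A, b)
```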
An alternative to the Jacobi method is the Gauss-Seidel method, which splits $A$ into $M = D + L$ and $N = -U$
Lower triangular matrices are easy to invert (by forward substitution), and our same convergence theorem applies
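A corresponding sketch of Gauss-Seidel (again my own illustration), which never forms $(D + L)^{-1}$ explicitly but applies it via forward substitution, using each freshly updated component immediately:

```python
import numpy as np

def gauss_seidel(A, b, x0, tol=1e-10, max_iter=10_000):
    """Gauss-Seidel iteration: solve (D + L) x_{k+1} = b - U x_k by forward substitution."""
    n = len(b)
    x = x0.astype(float)
    for _ in range(max_iter):
        x_old = x.copy()
        for i in range(n):
            # Components 0..i-1 are already updated (the D + L part);
            # components i+1..n-1 still come from the previous iterate (the U part).
            s = A[i, :i] @ x[:i] + A[i, i + 1:] @ x_old[i + 1:]
            x[i] = (b[i] - s) / A[i, i]
        if np.linalg.norm(x - x_old) < tol:
            return x
    return x
```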