14  M Matrices and Stability in ∞-norm

Lecture 14

Published

March 30, 2026

Note

I encourage you to play around with the juptyer notebook for this lecture - you can copy the code with the this notebook button on the side of this page.

Warning

These notes are mainly a record of what we discussed and are not a substitute for attending the lectures and reading books! If anything is unclear/wrong, let me know and I will update the notes.

Tip

This chapter is mostly based on

  • Burden, Faires, and Burden (2015) Chapter 11,
  • Driscoll and Braun (2018) Chapter 10

14.1 Recap

We are considering boundary value problems of the following form: Suppose we have a differential operator \mathcal L, a “nice” domain \Omega \subset \mathbb R^d, and data f. Seek u : \Omega \to \mathbb R such that

\begin{align} \mathcal L[u] &= f &&\text{in } \Omega \nonumber\\ u &= 0 &&\text{on } \partial \Omega . \tag{BVP} \end{align}

This is a problem with homogeneous Dirichlet boundary conditions (the values of the solution are prescribed on the boundary (Dirichlet) and are zero (homogeneous)).

We also consider the specific problem (in 1d):

Example 14.1 (Linear Advection-Diffusion-Reaction Equations) Consider

\begin{align} \mathcal L[u] := -\Delta u + p \cdot \nabla u + q u \end{align}

In the context of semiconductor physics, u(x) represents the concentration of charge carriers at a position x \in \Omega. Each of these terms relates to the physics in question:

  • -\Delta : diffusion; electrons move from areas of high concentration to low,
  • p \cdot \nabla u : advection (drift); the solution is driven with velocity p (due to e.g. an external electric field),
  • q u : reaction; electrons and electron holes are being created or destroyed.

The same equation also models e.g pollutant transport in a river, drug delivery in a blood vessel, asset price in finance, heat transfer in fluids, ……

By approximating the continuous problem with finite differences, we obtain a discrete problem: find \bm U \in \mathbb R^{n+1} (with index starting at 0 or equivalently U: \Omega_h \to \mathbb R where \Omega_h := \{x_j\}_{j=0}^n \to \mathbb R with U_j := U(x_j)) such that

\begin{align} \begin{split} &\mathcal L_h [\bm U] = \bm f \\ &U_0 = U_n = 0 \end{split}\tag{$\text{BVP}_h$} \end{align}

where \mathcal L_h depends on the choice of finite difference scheme.

Last time, we approximated the ADR equation with the following finite difference scheme on the mesh \{x_j\}_{j=0}^n with x_j = \frac{j}{n} and mesh size h= \frac{1}{n}:

\begin{align} \mathcal L_h &= -L + p D^0 + qI \nonumber\\ % &= -\frac{1}{h^2} \begin{pmatrix} -2 & 1 & & & \\ 1 & -2 & 1 & & \\ & 1 & -2 & \ddots \\ & & \ddots & \ddots & 1\\ & & & 1 & -2 \end{pmatrix} % + \frac{p}{2h} \begin{pmatrix} 0 & 1 & & & \\ -1 & 0 & 1 & & \\ & -1 & 0 & \ddots \\ & & \ddots & \ddots & 1\\ & & & -1 & 0 \end{pmatrix} % + q I, \nonumber \end{align}

a (n-1)\times(n-1) matrix (acting on the interior points of the grid).

We saw that this method is 2nd order consistent:

Definition 14.1 (Consistency) Suppose that u : \Omega \to \mathbb R is the exact solution to \text{(BVP)} and let \bm u = \{ u(x_j) \}_{j=0}^n be the exact solution evaluated on the mesh. We say the numerical approximation (\text{BVP}_h) is consistent if \| \mathcal L_h[\bm u] - \bm f\|_{\infty} \to 0 as h \to 0 where f_j = f(x_j).

We say the numerical approximation is order-p accurate (or has order of accuracy p) if \| \mathcal L_h[\bm u] - \bm f\|_{\infty} = O(h^p) as h \to 0.

and, for q>0, stable in the following sense:

Definition 14.2 (Stability) We say that (\text{BVP}_h) is stable if there exists C>0 such that

\begin{align} \| \bm U_h - \bm W_h \|_{\infty} \leq C \| \mathcal L_h[\bm U_h] - \mathcal L_h[\bm W_h] \|_{\infty} \end{align}

for all \bm U_h, \bm W_h.

We showed that the stability constant (C in the above defininition) for \mathcal L_h is (at most) C = \frac{1}{q}. Therefore, this finite difference approximation converges:

Theorem 14.1 Suppose that \text{(BVP)} and (\text{BVP}_h) have unique solutions u and \bm U_h. Then, if (\text{BVP}_h) is consistent (with order p) and stable then the method is convergent (with order at least p).

Proof.

Let \bm u be the exact solution to the continuous problem restricted to the mesh and suppose that (\text{BVP}_h) is order p accurate. Then, by stability, we have

\begin{align} \| \bm U_h - \bm u \|_\infty &\leq C \| \mathcal L_h[\bm U_h] - \mathcal L_h[\bm u] \|_\infty \nonumber\\ % &= C \| \bm f - \mathcal L_h[\bm u] \|_\infty \nonumber\\ % &= O(h^p) \end{align}

as h\to0. (The final equality is the consistency estimate).

To prove the stability estimate, we showed proved the following result:

Lemma 14.1 Suppose A is an SDD matrix. Then,

\begin{align} \|x\|_\infty \leq % \max_i \left[ |A_{ii}| - \sum_{j\not=i} |A_{ij}| \right]^{-1} \| Ax\|_{\infty}. \end{align}

Exercise 14.1 Show that, for p \in\mathbb R and q > 0, the approximation scheme (\text{BVP}_h) is stable for all h < \frac{2}{|p|}. What is the corresponding stability constant?

It turns out that we can extend these stability estimates to the case where q=0:

14.2 M Matrices

The aim of this section is to understand and prove the following result:

Theorem 14.2 Suppose that A is an M-matrix with \Phi \succ 0 and A\Phi \succ 0.

Then,

\begin{align} &\|x\|_{\infty} \leq C_\Phi \|A x\|_{\infty}, \nonumber\\ % &\text{with} \quad C_\Phi := \frac{\max_i \Phi_i}{\min_i (A\Phi)_i} \end{align}

Here, we are using the notation

  • \Phi \succ 0 if and only if \Phi_i > 0 for all i,
  • Similarly, we use x \preccurlyeq y or y \succcurlyeq x to mean x_i \leq y_i for all i,

and the following definition:

Definition 14.3 A matrix A is an M matrix if

(i) A_{ii} > 0 for all i,
(ii) A_{ij} \leq 0 for all i \not= j,
(iii) (Monotonicity) Ax \succcurlyeq 0 \implies x \succcurlyeq 0.

It is quite difficult to check if a matrix is an M matrix directly ((iii) is hard to check). It turns out that all Irreducibly Diagonally Dominant (IDD) matrices that satisfy (i) and (ii) are M matrices:

Definition 14.4 A matrix A is Irreducibly Diagonally Dominant (IDD) if

  • |A_{ii}| \geq \sum_{j\not=i} |A_{ij}| for all i,
  • There exists i^\star such that |A_{i^\star i^\star}| > \sum_{j\not=i^\star} |A_{i^\star j}|,
  • For every i and j there exists k_1,\dots,k_n such that A_{ik_1}, A_{k_1,k_2}, ..., A_{k_n,j} \not= 0.

It turns out that IDD matrices are invertible. Moreover, we have:

Lemma 14.2 Suppose that A is IDD and A_{ii} > 0 for all i and A_{ij} \leq 0 for all i\not= j. Then, A is a M matrix

Proof. We suppose that b := Ax \succcurlyeq 0 (we want to show that x \succcurlyeq 0).

Take i such that x_i = \min_j x_j < 0. We have,

\begin{align} 0 \leq b_i &= A_{ii} x_i + \sum_{j\not= i} A_{ij} x_j \nonumber\\ % &= |A_{ii}| x_i - \sum_{j\not= i} |A_{ij}| x_j \nonumber\\ % &\leq \Big( |A_{ii}| - \sum_{j\not= i} |A_{ij}|\Big) x_i % \leq 0 \end{align}

In the last line, we use the fact that -x_j \leq -x_i (since x_i \leq x_j) and the fact that A is diagonally dominant. That is, all of the above inequalities are actually equalities and thus b_i = 0 and, if A_{ij}\not= 0, then x_i = x_j.

Repeating the exact same argment with j instead of i, we find that b_j = 0 and x_{j} = x_k for all k with A_{jk} \not= 0. Since A is irreducible, we find that this argument “spreads” to all indices and b = 0. However, because there exists a row i^\star for which we have strict diagonal dominance (|A_{i^\star i^\star}| - \sum_{j\not= i^\star} |A_{i^\star j}| >0), we have b_{i^\star} < 0, a contradiction.

As a result, we have that x_{i} \geq 0 and thus x \succcurlyeq 0, as required. \square

Exercise 14.2 For what choice of p,q is \mathcal L_h an M matrix?

Exercise 14.3 Suppose that A is an M matrix. Show that [A^{-1}]_{ij} \geq 0 for all i,j.

We now prove Theorem Theorem thm-M:

Proof. We let A be an M matrix so that A_{ii} > 0 for all i, A_{ij} \leq 0 for all i\not= j and A is monotone: Ax \succcurlyeq 0 implies x \succcurlyeq 0. We also suppose there exists \Phi \succ 0 such that A\Phi \succ 0.

We let c := \min_i (A\Phi)_i so that A\Phi \succcurlyeq c \begin{pmatrix} 1 & 1 & \dots & 1 \end{pmatrix}^\top. Define \Psi := c A^{-1} \begin{pmatrix} 1 & 1 & \dots & 1 \end{pmatrix}^\top and note that A(\Phi - \Psi) \succcurlyeq 0. As a result we have \Phi - \Psi \succcurlyeq 0 and thus \Phi \succcurlyeq \Psi. That is, \sum_j [A^{-1}]_{ij} \leq c^{-1} \Phi_i. Therefore,

\begin{align} \pm x_i % &= \sum_{j} [A^{-1}]_{ij} (\pm A x)_j \nonumber\\ % &\leq \sum_{j} [A^{-1}]_{ij} \|A x\|_\infty \nonumber\\ % &\leq \frac{\Phi_i}{c} \|A x\|_\infty % \leq \frac{\max \Phi_i}{\min_i (A\Phi)_i } \|A x\|_\infty, \end{align}

as required. \square

Exercise 14.4 (i) Notice that \Phi_j := \frac12 jh (1- jh) satisfies \Phi_j > 0 for all j=1,\dots,n-1,
(ii) Use the fact that -L is second order accurate to show that -L \Phi = 1,
(iii) Apply the theorem to with choice of \Phi. What is C_{\Phi}?
(iv) Conclude that approximating -u''(x) = f with u(0) = u(1) = 0 with the discrete problem -L \bm u = \bm f with u_0 = u_n = 0 is 2^\text{nd} order convergent.

TipTO-DO
  • Take a look at the suggestions for Presentations/Posters/Papers and sign up to present,
  • Read through the proofs from this lecture - come with questions on Wednesday,
  • (Sign-up to office hours if you would like)

Previous reading:

  • Chapter 1 reading:
  • Chapter 2 reading:
    • Recap matrices: sections 7.1 - 7.2 of Burden, Faires, and Burden (2015)
    • Jacobi & Gauss-Seidel: section 7.3 Burden, Faires, and Burden (2015)
    • CG: section 7.6 Burden, Faires, and Burden (2015)
  • Chapter 3 reading:
    • Sections 11.1 - 11.2 of Burden, Faires, and Burden (2015): shooting methods
Burden, Richard L., Douglas J. Faires, and Annette M. Burden. 2015. Numerical Analysis. 10th ed. CENGAGE Learning.
Driscoll, Tobin A, and Richard J Braun. 2018. Fundamentals of Numerical Computation. https://fncbook.com/.