01.01.14 · foundations / linear-algebra

Rayleigh quotient and the Courant-Fischer min-max characterisation of eigenvalues

shipped3 tiersLean: none

Anchor (Master): Shilov *Linear Algebra* Ch. 10 §10.2–§10.3; Horn-Johnson *Matrix Analysis* §4.2–§4.3 (Courant-Fischer, interlacing, Weyl); Parlett *The Symmetric Eigenvalue Problem* Ch. 10–11; Reed-Simon *Methods of Modern Mathematical Physics* Vol. IV §XIII.1 (min-max for self-adjoint operators); Kato *Perturbation Theory for Linear Operators* Ch. I §6.10

Intuition Beginner

A symmetric matrix stretches space along a set of perpendicular axes, with a stretch factor for each axis. The spectral theorem 01.01.13 guarantees those axes and factors exist; the factors are the eigenvalues. A natural question is whether you can find the biggest and smallest stretch factors without first solving for every eigenvalue. The Rayleigh quotient answers yes, and it does so by a simple measurement you can take on any vector at all.

The Rayleigh quotient of a vector is the amount the matrix stretches that vector along its own direction, measured as a ratio. Push the vector through the matrix, see how much of the result points back along the original vector, and divide by the vector's own length-squared. For an eigenvector the answer is exactly its eigenvalue. For any other vector the answer is a blend of eigenvalues, and a blend can never exceed the largest ingredient or fall below the smallest.

So the largest eigenvalue is the highest value the Rayleigh quotient ever reaches, and the smallest eigenvalue is its lowest value. The middle eigenvalues are trickier: each one is the best you can do on some restricted family of directions, and the Courant-Fischer theorem says exactly which family. This turns every eigenvalue into the answer to an optimisation problem.

Visual Beginner

The picture shows a symmetric two-by-two matrix and its Rayleigh quotient evaluated all the way around the unit circle. As a unit vector sweeps from angle zero to a full turn, the Rayleigh quotient rises and falls between two values: it peaks at the direction of the top eigenvector, where it equals the larger eigenvalue, and it bottoms out at the perpendicular direction, where it equals the smaller eigenvalue.

Two facts are visible. The curve never goes above the larger eigenvalue or below the smaller one, so the eigenvalues are the ceiling and floor of the Rayleigh quotient. The peak and the valley occur at perpendicular directions, which are the two eigendirections of the symmetric matrix.

Worked example Beginner

Take the symmetric matrix

A = (3113) .

Step 1. Write the Rayleigh quotient for a unit vector $v = (cos θ, sin θ)$ . Since $v$ has length $1$ , the denominator is $1$ and the quotient is just the top part, $R (v) = (A v) \cdot v$ .

Step 2. Compute $A v$ . Multiplying, $A v = (3 cos θ + sin θ, cos θ + 3 sin θ)$ .

Step 3. Dot $A v$ with $v$ . We get $R (v) = (3 cos θ + sin θ) cos θ + (cos θ + 3 sin θ) sin θ = 3 cos^{2} θ + 2 sin θ cos θ + 3 sin^{2} θ$ .

Step 4. Simplify. Using $cos^{2} θ + sin^{2} θ = 1$ and $2 sin θ cos θ = sin (2 θ)$ , this becomes $R (v) = 3 + sin (2 θ)$ .

Step 5. Read off the high and low values. The term $sin (2 θ)$ ranges from $- 1$ to $+ 1$ , so $R (v)$ ranges from $2$ to $4$ . The maximum $4$ happens when $sin (2 θ) = 1$ , that is $θ = 45$ degrees, the direction $(1, 1) / 2$ . The minimum $2$ happens at $θ = 135$ degrees, the direction $(1, - 1) / 2$ .

Step 6. Check against the eigenvalues. The characteristic polynomial is $(t - 3)^{2} - 1 = (t - 2) (t - 4)$ , so the eigenvalues are $2$ and $4$ . These match the floor and ceiling found above.

What this tells us: the largest eigenvalue $4$ is the maximum of the Rayleigh quotient and the smallest eigenvalue $2$ is its minimum, with both extremes reached along the eigendirections. You found the eigenvalues by maximising and minimising a single ratio, not by any extra factoring work.

Check your understanding Beginner

Formal definition Intermediate+

Let $(V, ⟨ \cdot, \cdot ⟩)$ be a finite-dimensional inner-product space over $F \in {R, C}$ in the sense of 01.01.09, with norm $∥ v ∥ = ⟨ v, v ⟩$ and the inner product linear in the first slot and conjugate-linear in the second. Throughout this unit $A : V \to V$ is self-adjoint, $A = A^{*}$ , with adjoint as in 01.01.10. By the spectral theorem 01.01.13 the eigenvalues of $A$ are real; list them with multiplicity in decreasing order

λ_{1} \geq λ_{2} \geq \dots \geq λ_{n}, n = dim V,

and fix an orthonormal eigenbasis $u_{1}, \dots, u_{n}$ with $A u_{i} = λ_{i} u_{i}$ . The largest eigenvalue $λ_{1} = λ_{m a x}$ and the smallest $λ_{n} = λ_{m i n}$ .

Definition (Rayleigh quotient). For $v \in V$ with $v \neq = 0$ , the Rayleigh quotient of $A$ at $v$ is

R (v) = R_{A} (v) = \frac{⟨ A v , v ⟩}{⟨ v , v ⟩} .

Self-adjointness makes the numerator real: $⟨ A v, v ⟩ = ⟨ v, A v ⟩ = \overline{⟨ A v, v ⟩}$ , so $R (v) \in R$ for all $v \neq = 0$ . The quotient is scale-invariant, $R (c v) = R (v)$ for every nonzero scalar $c$ , so it is a function on the projective space of directions, equivalently on the unit sphere ${v : ∥ v ∥ = 1}$ , where it reduces to $R (v) = ⟨ A v, v ⟩$ .

Definition (Rayleigh-Ritz subspace value). For a subspace $S \subseteq V$ with $S \neq = {0}$ , write

\overline{R} (S) = 0 \neq = v \in S max R (v), \underline{R} (S) = 0 \neq = v \in S min R (v)

for the largest and smallest Rayleigh values over the directions inside $S$ . Both extrema are attained because $R$ is continuous on the compact unit sphere of $S$ . The eigenvalues will be obtained by optimising $\overline{R}$ and $\underline{R}$ over all subspaces of a fixed dimension.

Notation: $Gr (k, V)$ denotes the set of $k$ -dimensional subspaces of $V$ ; $S^{⊥}$ is the orthogonal complement of $S$ ; $dim S$ is the dimension of $S$ . The Kronecker delta $δ_{ij}$ is $1$ when $i = j$ and $0$ otherwise. The notation $A ⪰ 0$ means $A$ is positive semidefinite, that is $⟨ A v, v ⟩ \geq 0$ for all $v$ .

Counterexamples to common slips

The reality $R (v) \in R$ and the range $[λ_{n}, λ_{1}]$ both require self-adjointness. For the non-self-adjoint $A = (01 - 1 0)$ on $C^{2}$ , the numerator $⟨ A v, v ⟩$ is purely imaginary on most vectors, and the eigenvalues $\pm i$ are not real; the extremal characterisation simply does not apply.
"The maximum of $R$ is attained at a unique direction" fails when $λ_{1}$ is a repeated eigenvalue. If $λ_{1} = λ_{2}$ , every unit vector in the two-dimensional top eigenspace attains the maximum, so the maximiser is a whole sphere of directions, not a point.
Courant-Fischer requires the eigenvalues to be listed with multiplicity. Omitting repetitions shifts the index $k$ and breaks the formula; $λ_{k}$ in the min-max statement is the $k$ -th eigenvalue counted with multiplicity, not the $k$ -th distinct value.

Key theorem with proof Intermediate+

Theorem (Rayleigh-Ritz extremal characterisation; Shilov Ch. 10 §10.2 ^{[source pending]}; Horn-Johnson §4.2 ^{[source pending]}). Let $A = A^ $b ese l f - a d j o in t o na f ini t e - d im e n s i o na l inn er - p r o d u c t s p a ce$ V $w i t h e i g e n v a l u es$ \lambda_1 \geq \cdots \geq \lambda_n $an d or t h o n or ma l e i g e nba s i s$ u_1, \ldots, u_n $. T h e n f or e v er y$ v \neq 0$,*

λ_{n} \leq R (v) \leq λ_{1},

the range of $R$ is exactly the closed interval $[λ_{n}, λ_{1}]$ , and

λ_{1} = v \neq = 0 max R (v), λ_{n} = v \neq = 0 min R (v),

with the maximum attained precisely on the $λ_{1}$ -eigenspace and the minimum on the $λ_{n}$ -eigenspace.

Proof. Expand an arbitrary $v \neq = 0$ in the orthonormal eigenbasis: $v = \sum_{i = 1}^{n} c_{i} u_{i}$ with $c_{i} = ⟨ v, u_{i} ⟩ \in F$ . Orthonormality gives $⟨ v, v ⟩ = \sum_{i} ∣ c_{i} ∣^{2}$ , and since $A u_{i} = λ_{i} u_{i}$ ,

⟨ A v, v ⟩ = ⟨ i \sum λ_{i} c_{i} u_{i}, j \sum c_{j} u_{j} ⟩ = i, j \sum λ_{i} c_{i} \overline{c_{j}} ⟨ u_{i}, u_{j} ⟩ = i \sum λ_{i} ∣ c_{i} ∣^{2},

where orthonormality $⟨ u_{i}, u_{j} ⟩ = δ_{ij}$ collapsed the double sum. Therefore

R (v) = \frac{\sum _{i} λ _{i} ∣ c _{i} ∣ ^{2}}{\sum _{i} ∣ c _{i} ∣ ^{2}} .

The right-hand side is a convex combination of the eigenvalues $λ_{i}$ with weights $w_{i} = ∣ c_{i} ∣^{2} / \sum_{j} ∣ c_{j} ∣^{2} \geq 0$ summing to $1$ . A convex combination of real numbers lies between their minimum and maximum, so $λ_{n} \leq R (v) \leq λ_{1}$ .

For attainment and the full range: setting $v = u_{1}$ puts all weight on $λ_{1}$ , giving $R (u_{1}) = λ_{1}$ ; setting $v = u_{n}$ gives $R (u_{n}) = λ_{n}$ . For an intermediate value $μ \in [λ_{n}, λ_{1}]$ , choose $t \in [0, 1]$ with $μ = t λ_{1} + (1 - t) λ_{n}$ and take $v = t u_{1} + 1 - t u_{n}$ ; then $R (v) = t λ_{1} + (1 - t) λ_{n} = μ$ . So the range is all of $[λ_{n}, λ_{1}]$ .

Finally, suppose $R (v) = λ_{1}$ for some $v = \sum_{i} c_{i} u_{i} \neq = 0$ . Then $\sum_{i} λ_{i} ∣ c_{i} ∣^{2} = λ_{1} \sum_{i} ∣ c_{i} ∣^{2}$ , that is $\sum_{i} (λ_{1} - λ_{i}) ∣ c_{i} ∣^{2} = 0$ . Every term is non-negative because $λ_{1} \geq λ_{i}$ , so each term vanishes: $c_{i} = 0$ whenever $λ_{i} < λ_{1}$ . Hence $v$ lies in the $λ_{1}$ -eigenspace. The minimum case is symmetric, replacing $λ_{1} - λ_{i} \geq 0$ by $λ_{i} - λ_{n} \geq 0$ . $□$

Bridge. The Rayleigh-Ritz characterisation builds toward the full Courant-Fischer min-max theorem proved in the Advanced results, where the same convex-combination identity is restricted to subspaces to pin down every intermediate eigenvalue, and it appears again in 01.01.12 (singular value decomposition), where applying it to $A^{*} A$ identifies the largest singular value as $σ_{1} = max_{∥ v ∥ = 1} ∥ A v ∥$ . The mechanism is the diagonalisation $R (v) = \sum_{i} w_{i} λ_{i}$ supplied by the spectral theorem 01.01.13: once the self-adjoint operator is written in its orthonormal eigenbasis, the Rayleigh quotient becomes a weighted average of eigenvalues, and weighted averages are governed entirely by the extreme ingredients. The foundational identity is $⟨ A v, v ⟩ = \sum_{i} λ_{i} ∣ ⟨ v, u_{i} ⟩ ∣^{2}$ , the spectral expansion of the quadratic form attached to $A$ . Putting these together, the extremal characterisation removes the need to compute a characteristic polynomial to find the dominant eigenvalue, justifies the power-iteration and Rayleigh-quotient-iteration algorithms of numerical linear algebra, and is the finite-dimensional shadow of the variational principle that estimates ground-state energies in quantum mechanics, which appears again in 12.07.03 (the Rayleigh-Ritz method).

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Show that for any $v \neq = 0$ , the difference $R (v) - λ_{n}$ equals $\sum_{i} (λ_{i} - λ_{n}) w_{i}$ with weights $w_{i} = ∣ ⟨ v, u_{i} ⟩ ∣^{2} /∥ v ∥^{2}$ , and deduce $R (v) \geq λ_{n}$ with equality iff $v$ is a $λ_{n}$ -eigenvector.

Hint

Use the spectral expansion $R (v) = \sum_{i} λ_{i} w_{i}$ with $\sum_{i} w_{i} = 1$ , and subtract $λ_{n} = λ_{n} \sum_{i} w_{i}$ .

Answer

Write $R (v) = \sum_{i} λ_{i} w_{i}$ with $w_{i} \geq 0$ and $\sum_{i} w_{i} = 1$ . Since $λ_{n} = λ_{n} \sum_{i} w_{i}$ , $$ R(v) - \lambda_n = \sum_i (\lambda_i - \lambda_n) w_i. $$ Each $λ_{i} - λ_{n} \geq 0$ and each $w_{i} \geq 0$ , so the sum is $\geq 0$ , giving $R (v) \geq λ_{n}$ . Equality holds iff every term vanishes, that is $w_{i} = 0$ whenever $λ_{i} > λ_{n}$ . Then $v$ has nonzero component only along eigenvectors with eigenvalue $λ_{n}$ , so $v$ lies in the $λ_{n}$ -eigenspace. Rubric: full credit for the weighted-difference identity and the equality analysis.

Exercise 4 (medium, short-answer).

Prove that the unconstrained critical points of $R$ on $V ∖ {0}$ are exactly the eigenvectors of $A$ , and the critical value at an eigenvector is its eigenvalue.

Hint

Differentiate $R (v) = ⟨ A v, v ⟩ / ⟨ v, v ⟩$ . The numerator's gradient is $2 A v$ , the denominator's is $2 v$ (real case); set the quotient-rule gradient to zero.

Answer

Work over $R$ (the complex case follows by treating real and imaginary parts). With $N (v) = ⟨ A v, v ⟩$ and $D (v) = ⟨ v, v ⟩$ , the gradients are $\nabla N (v) = 2 A v$ (using $A = A^{*}$ ) and $\nabla D (v) = 2 v$ . By the quotient rule, $$ \nabla R(v) = \frac{D(v), \nabla N(v) - N(v), \nabla D(v)}{D(v)^2} = \frac{2}{|v|^2}\big(A v - R(v), v\big). $$ Setting $\nabla R (v) = 0$ gives $A v = R (v) v$ , so $v$ is an eigenvector with eigenvalue $R (v)$ . Conversely, if $A v = λ v$ then $R (v) = λ$ and $A v - R (v) v = 0$ , so $v$ is critical. Hence the critical points are precisely the eigenvectors and the critical value is the eigenvalue. Rubric: full credit for the quotient-rule gradient and both directions of the equivalence.

Exercise 7 (hard, short-answer).

Prove the max-min half of Courant-Fischer for $λ_{k}$ : show $λ_{k} = max_{d i m S = k} min_{0 \neq = v \in S} R (v)$ .

Hint

For the lower bound, test $S = span (u_{1}, \dots, u_{k})$ . For the upper bound, intersect an arbitrary $k$ -dimensional $S$ with $span (u_{k}, \dots, u_{n})$ and count dimensions.

Answer

Let $S_{k} = span (u_{1}, \dots, u_{k})$ . For $0 \neq = v \in S_{k}$ , $v = \sum_{i = 1}^{k} c_{i} u_{i}$ and $R (v) = \sum_{i = 1}^{k} λ_{i} ∣ c_{i} ∣^{2} / \sum_{i = 1}^{k} ∣ c_{i} ∣^{2} \geq λ_{k}$ since each $λ_{i} \geq λ_{k}$ for $i \leq k$ ; equality at $v = u_{k}$ . So $min_{0 \neq = v \in S_{k}} R (v) = λ_{k}$ , proving $max_{d i m S = k} min_{0 \neq = v \in S} R (v) \geq λ_{k}$ .

For the reverse, let $S$ be any $k$ -dimensional subspace and $W = span (u_{k}, u_{k + 1}, \dots, u_{n})$ , which has dimension $n - k + 1$ . By the dimension formula $dim (S \cap W) \geq dim S + dim W - n = k + (n - k + 1) - n = 1$ , so there is a nonzero $v \in S \cap W$ . Writing $v = \sum_{i \geq k} c_{i} u_{i}$ , $R (v) = \sum_{i \geq k} λ_{i} ∣ c_{i} ∣^{2} / \sum_{i \geq k} ∣ c_{i} ∣^{2} \leq λ_{k}$ since each $λ_{i} \leq λ_{k}$ for $i \geq k$ . Hence $min_{0 \neq = v \in S} R (v) \leq R (v) \leq λ_{k}$ for every $S$ , giving $max_{d i m S = k} min_{0 \neq = v \in S} R (v) \leq λ_{k}$ . The two inequalities force equality. Rubric: full credit for the $S_{k}$ test vector, the intersection dimension count, and both bounds.

Exercise 8 (hard, symbolic).

Use Courant-Fischer to prove the monotonicity statement: if $A ⪯ B$ (meaning $B - A ⪰ 0$ ) are self-adjoint, then $λ_{k} (A) \leq λ_{k} (B)$ for every $k$ .

Hint

$A ⪯ B$ means $R_{A} (v) \leq R_{B} (v)$ for all $v \neq = 0$ . Apply the max-min formula term by term.

Answer

The hypothesis $B - A ⪰ 0$ says $⟨(B - A) v, v ⟩ \geq 0$ , that is $⟨ A v, v ⟩ \leq ⟨ B v, v ⟩$ , hence $R_{A} (v) \leq R_{B} (v)$ for every $v \neq = 0$ . By the max-min formula (Exercise 7), $$ \lambda_k(A) = \max_{\dim S = k}\ \min_{0\neq v\in S} R_A(v) \leq \max_{\dim S = k}\ \min_{0\neq v\in S} R_B(v) = \lambda_k(B), $$ where the inequality holds because at each subspace $S$ the inner minimum for $A$ is $\leq$ the inner minimum for $B$ (the integrand $R_{A} \leq R_{B}$ pointwise), and the outer maximum preserves the inequality. So $λ_{k} (A) \leq λ_{k} (B)$ for all $k$ . Rubric: full credit for the pointwise Rayleigh inequality and its propagation through the nested optimisation.

Lean formalization Intermediate+

Mathlib carries the Rayleigh quotient and the supremum/infimum eigenvalue facts for self-adjoint operators, but the full Courant-Fischer min-max equality, Cauchy interlacing, and Weyl's inequalities are not present as single named theorems. The readable view of the intended statements:

import Mathlib.Analysis.InnerProductSpace.Rayleigh
import Mathlib.Analysis.InnerProductSpace.Spectrum

variable {𝕜 V : Type*} [RCLike 𝕜]
  [NormedAddCommGroup V] [InnerProductSpace 𝕜 V] [FiniteDimensional 𝕜 V]

/-- The supremum of the Rayleigh quotient over the unit sphere is an
eigenvalue (the top eigenvalue). Mathlib:
`LinearMap.IsSymmetric.hasEigenvalue_iSup`. -/
example (T : V →ₗ[𝕜] V) (hT : T.IsSymmetric) (hV : 0 < Module.rank 𝕜 V) :
    Module.End.HasEigenvalue T
      (⨆ x : {v : V // v ≠ 0}, RCLike.re (T.reApplyInnerSelf x / ‖(x : V)‖ ^ 2)) :=
  hT.hasEigenvalue_iSup hV

/-- Courant-Fischer max-min characterisation of the k-th eigenvalue.
NOT a single named Mathlib theorem. -/
theorem courant_fischer_maxmin
    (T : V →ₗ[𝕜] V) (hT : T.IsSymmetric) (k : ℕ)
    (lam : ℕ → ℝ) (hsort : Antitone lam) /- eigenvalues, decreasing -/ :
    lam k = sSup { r : ℝ | ∃ S : Submodule 𝕜 V, Module.finrank 𝕜 S = k + 1 ∧
        r = sInf { R : ℝ | ∃ v : V, v ∈ S ∧ v ≠ 0 ∧
          R = RCLike.re (inner (T v) v) / ‖v‖ ^ 2 } } :=
  sorry  -- S_k test subspace for ≥; dimension-count intersection for ≤

/-- Cauchy interlacing: eigenvalues of a compression to a codim-1
subspace interlace those of T. NOT a single named Mathlib theorem. -/
theorem cauchy_interlace
    (T : V →ₗ[𝕜] V) (hT : T.IsSymmetric) (W : Submodule 𝕜 V)
    (hW : Module.finrank 𝕜 W + 1 = Module.finrank 𝕜 V)
    (lamT lamW : ℕ → ℝ) (k : ℕ) :
    lamT (k + 1) ≤ lamW k ∧ lamW k ≤ lamT k :=
  sorry  -- apply courant_fischer to T and to the compression P_W T P_W

The proof gap to a clean contribution: (i) order the eigenvalues with multiplicity as an antitone Fin n → ℝ and connect to Module.End.eigenvalues; (ii) prove courant_fischer_maxmin from the spectral orthonormal basis using the S_k = span(u_0, ..., u_k) test subspace and the Submodule.finrank_sup_add_finrank_inf_le dimension-count for the intersection; (iii) derive cauchy_interlace and weyl_inequality as corollaries. Each is reachable from Mathlib.Analysis.InnerProductSpace.Rayleigh and the finite-dimensional spectral theorem, but no current named lemma presents the min-max double characterisation or the interlacing inequalities in packaged form.

Advanced results Master

Theorem (Courant-Fischer min-max; Fischer 1905 ^{[source pending]}; Courant 1920 ^{[source pending]}; Horn-Johnson §4.2 ^{[source pending]}). Let $A = A^ $o n$ V $w i t h e i g e n v a l u es$ \lambda_1 \geq \cdots \geq \lambda_n $l i s t e d w i t hm u l t i pl i c i t y . F or e a c h$ k \in {1, \ldots, n}$,*

λ_{k} = S \subseteq V d i m S = k max 0 \neq = v \in S min R (v) = S \subseteq V d i m S = n - k + 1 min 0 \neq = v \in S max R (v) .

The first equality recovers $λ_{1}$ as a plain maximum (the case $k = 1$ ) and $λ_{n}$ as a plain minimum (the case $k = n$ of the second form), so the Rayleigh-Ritz characterisation is the boundary case of the general formula. The decisive feature is that the characterisation of $λ_{k}$ refers to no eigenvector other than through the optimisation over $k$ -dimensional subspaces; this is what makes the formula usable for operators whose eigenvectors are unavailable, and it is the form that survives passage to the infinite-dimensional discrete spectrum. The full proof is in the proof set below.

Theorem (Cauchy interlacing; Cauchy 1829 ^{[source pending]}). Let $A = A^ $o nan$ n $- d im e n s i o na l s p a ce ha v ee i g e n v a l u es$ \lambda_1 \geq \cdots \geq \lambda_n $, an d l e t$ B $b e t h eco m p r ess i o n$ B = P A|W $o f$ A $t o an$ (n-1) $- d im e n s i o na l s u b s p a ce$ W $(e q u i v a l e n tl y, a p r in c i p a l$ (n-1) \times (n-1) $s u bma t r i x inan or t h o n or ma l ba s i s a d a pt e d t o$ W $), w i t h e i g e n v a l u es$ \mu_1 \geq \cdots \geq \mu{n-1}$. Then*

λ_{1} \geq μ_{1} \geq λ_{2} \geq μ_{2} \geq \dots \geq μ_{n - 1} \geq λ_{n} .

More generally, deleting $m$ rows and the corresponding columns produces eigenvalues $μ_{j}$ with $λ_{j + m} \leq μ_{j} \leq λ_{j}$ .

Interlacing is the workhorse behind the symmetric eigenvalue problem: the eigenvalues of a leading principal submatrix sandwich those of the full matrix, which justifies bisection methods using Sturm sequences and explains why adding a row and column to a symmetric matrix can shift each eigenvalue by at most one slot.

Theorem (Weyl's inequalities; Weyl 1912 ^{[source pending]}). Let $A, B$ be self-adjoint on $V$ with eigenvalues listed decreasingly. For indices $i, j$ with $i + j - 1 \leq n$ ,

λ_{i + j - 1} (A + B) \leq λ_{i} (A) + λ_{j} (B),

and dually $λ_{i + j - n} (A + B) \geq λ_{i} (A) + λ_{j} (B)$ when $i + j - n \geq 1$ . In particular $λ_{k} (A) + λ_{n} (B) \leq λ_{k} (A + B) \leq λ_{k} (A) + λ_{1} (B)$ , so each eigenvalue is Lipschitz in the operator: $∣ λ_{k} (A + B) - λ_{k} (A) ∣ \leq ∥ B ∥$ .

Weyl's inequalities are the eigenvalue-perturbation theory that controls how the spectrum moves when a self-adjoint operator is perturbed by another. The special case $B ⪰ 0$ recovers the monotonicity $λ_{k} (A) \leq λ_{k} (A + B)$ , and the Lipschitz bound is the spectral stability statement underpinning numerical eigenvalue computation and the continuity of the spectrum in perturbation theory.

Theorem (spectral gap and the second eigenvalue). With $λ_{1} \geq λ_{2} \geq \dots$ , the second eigenvalue admits the constrained form

λ_{2} = 0 \neq = v ⊥ u_{1} max R (v),

the maximum of the Rayleigh quotient over the orthogonal complement of the top eigenvector. The gap $λ_{1} - λ_{2}$ measures how sharply the maximiser is isolated: for a positive operator normalised to $λ_{1} = 1$ , the gap $1 - λ_{2}$ controls the geometric convergence rate of power iteration and, in the graph-Laplacian setting, the algebraic connectivity and mixing time.

The deflation form $λ_{k + 1} = max_{v ⊥ u_{1}, \dots, u_{k}} R (v)$ is the recursive version of the Rayleigh-Ritz characterisation: having found the top $k$ eigenvectors, the next eigenvalue is the maximum of $R$ on what remains. Courant-Fischer is the eigenvector-free upgrade that removes the dependence on $u_{1}, \dots, u_{k}$ .

Theorem (min-max for semibounded self-adjoint operators; Courant-Fischer-Weyl in the Hilbert-space form; Reed-Simon Vol. IV §XIII.1 ^{[source pending]}). Let $A$ be a self-adjoint operator on a Hilbert space $H$ , bounded below, with $Σ = in f σ_{ess} (A)$ the bottom of the essential spectrum. Define

μ_{k} (A) = S \subseteq D (A) d i m S = k min v \in S ∥ v ∥ = 1 max ⟨ A v, v ⟩ .

Then for each $k$ either $μ_{k} (A) < Σ$ and $μ_{k} (A)$ is the $k$ -th eigenvalue below the essential spectrum (counted with multiplicity), or $μ_{k} (A) = Σ$ and there are at most $k - 1$ eigenvalues below $Σ$ .

This is the theorem that powers the variational estimation of bound-state energies. The finite sum over eigenvalues becomes an optimisation over finite-dimensional trial subspaces of the domain, and the Rayleigh-Ritz method computes upper bounds for the discrete eigenvalues below the continuous spectrum. The Sturm-Liouville operator $- \frac{d}{d x} (p \frac{d}{d x}) + q$ on a bounded interval and the Dirichlet Laplacian on a bounded domain are the canonical instances, with the min-max principle giving Courant's nodal-domain bound and Weyl's eigenvalue-counting asymptotics.

Synthesis. The Rayleigh quotient converts the algebraic eigenvalue problem for a self-adjoint operator into a problem of optimisation, and the spectral theorem 01.01.13 is what makes the conversion exact: in the orthonormal eigenbasis $R (v)$ is a weighted average of eigenvalues, so its extremes are the extreme eigenvalues and its critical points are the eigenvectors. The Courant-Fischer min-max formula completes the picture by characterising every eigenvalue, not only the top and bottom, through optimisation over subspaces of fixed dimension, and it does so without naming a single eigenvector. From that one formula the interlacing of Cauchy and the perturbation inequalities of Weyl follow as corollaries by applying the same max-min to a compression or to a sum, and the monotonicity $λ_{k} (A) \leq λ_{k} (A + B)$ for $B ⪰ 0$ is immediate from the pointwise inequality of Rayleigh quotients. Putting these together, the gradient computation $\nabla R (v) \propto A v - R (v) v$ identifies critical points with eigenvectors and drives Rayleigh-quotient iteration, the second-eigenvalue gap controls the convergence of power iteration and the connectivity of graphs, and the eigenvector-free min-max form passes intact to the discrete spectrum of a semibounded self-adjoint operator, where it becomes the variational principle of 12.07.03 and the eigenvalue-asymptotics machinery of Weyl and Courant for the differential operators of mathematical physics. The Rayleigh quotient, the min-max theorem, interlacing, and the perturbation inequalities are four readings of the single fact that the quadratic form $⟨ A v, v ⟩$ on the unit sphere has its eigenvalues as critical values.

Full proof set Master

Proposition (Courant-Fischer min-max, both forms). Let $A = A^ $o n$ V $ha v ee i g e n v a l u es$ \lambda_1 \geq \cdots \geq \lambda_n $w i t h or t h o n or ma l e i g e nba s i s$ u_1, \ldots, u_n $. F or e a c h$ k$,* $$ \lambda_k = \max_{\dim S = k}\ \min_{0 \neq v \in S} R(v) = \min_{\dim S = n-k+1}\ \max_{0 \neq v \in S} R(v). $$

Proof. Max-min form, lower bound. Take $S_{k} = span (u_{1}, \dots, u_{k})$ , which has dimension $k$ . For $0 \neq = v = \sum_{i = 1}^{k} c_{i} u_{i} \in S_{k}$ , $$ R(v) = \frac{\sum_{i=1}^k \lambda_i |c_i|^2}{\sum_{i=1}^k |c_i|^2} \geq \lambda_k, $$ because each $λ_{i} \geq λ_{k}$ for $i \leq k$ , with equality at $v = u_{k}$ . Hence $min_{0 \neq = v \in S_{k}} R (v) = λ_{k}$ , so the outer maximum is at least $λ_{k}$ .

Max-min form, upper bound. Let $S$ be any $k$ -dimensional subspace and set $W = span (u_{k}, \dots, u_{n})$ , of dimension $n - k + 1$ . By the subspace dimension formula $dim (S \cap W) = dim S + dim W - dim (S + W) \geq k + (n - k + 1) - n = 1$ , there is a nonzero $v \in S \cap W$ . Writing $v = \sum_{i = k}^{n} c_{i} u_{i}$ , $$ R(v) = \frac{\sum_{i=k}^n \lambda_i |c_i|^2}{\sum_{i=k}^n |c_i|^2} \leq \lambda_k, $$ since each $λ_{i} \leq λ_{k}$ for $i \geq k$ . Thus $min_{0 \neq = v \in S} R (v) \leq R (v) \leq λ_{k}$ for every such $S$ , so the outer maximum is at most $λ_{k}$ . The two bounds give the max-min equality.

Min-max form. Apply the max-min form to $- A$ , whose eigenvalues are $- λ_{n} \geq \dots \geq - λ_{1}$ in decreasing order, so the $k$ -th eigenvalue of $- A$ is $- λ_{n - k + 1}$ . The Rayleigh quotient of $- A$ is $- R$ , and $min (- R) = - max (R)$ , $max (- R) = - min (R)$ . The max-min statement for $- A$ at index $k$ reads $- λ_{n - k + 1} = max_{d i m S = k} min_{v \in S} (- R (v)) = - min_{d i m S = k} max_{v \in S} R (v)$ . Re-indexing $k \mapsto n - k + 1$ (so $dim S = n - k + 1$ ) gives $λ_{k} = min_{d i m S = n - k + 1} max_{0 \neq = v \in S} R (v)$ . $□$

Proposition (Cauchy interlacing). Let $A = A^ $o nan$ n $- d im e n s i o na l$ V $w i t h e i g e n v a l u es$ \lambda_1 \geq \cdots \geq \lambda_n $, l e t$ W \subseteq V $ha v e d im e n s i o n$ n - 1 $, an d l e t$ B = P_W A|W $b e t h eco m p r ess i o n (w i t h$ P_W $t h eor t h o g o na l p r o j ec t i o n o n t o$ W $), a se l f - a d j o in t o p er a t or o n$ W $w i t h e i g e n v a l u es$ \mu_1 \geq \cdots \geq \mu{n-1} $. T h e n$ \lambda_{k+1} \leq \mu_k \leq \lambda_k $f or$ 1 \leq k \leq n-1$.*

Proof. For $v \in W$ , $⟨ B v, v ⟩ = ⟨ P_{W} A v, v ⟩ = ⟨ A v, P_{W} v ⟩ = ⟨ A v, v ⟩$ since $P_{W} v = v$ and $P_{W} = P_{W}^{*}$ . So the Rayleigh quotient of $B$ on $W$ agrees with that of $A$ restricted to $W$ : $R_{B} (v) = R_{A} (v)$ for $0 \neq = v \in W$ .

Upper bound $μ_{k} \leq λ_{k}$ . By Courant-Fischer for $B$ on the $(n - 1)$ -dimensional $W$ , $μ_{k} = max_{S \subseteq W d i m S = k} min_{0 \neq = v \in S} R_{A} (v)$ . Every $k$ -dimensional $S \subseteq W$ is also a $k$ -dimensional subspace of $V$ , so this maximum is over a subfamily of the subspaces appearing in the Courant-Fischer max-min for $λ_{k}$ . A maximum over a smaller family is no larger, so $μ_{k} \leq λ_{k}$ .

Lower bound $λ_{k + 1} \leq μ_{k}$ . Use the min-max form. For $A$ , $λ_{k + 1} = min_{d i m T = n - k} max_{0 \neq = v \in T} R_{A} (v)$ . For $B$ on $W$ , $μ_{k} = min_{T \subseteq W d i m T = (n - 1) - k + 1 = n - k} max_{0 \neq = v \in T} R_{A} (v)$ . The subspaces $T \subseteq W$ of dimension $n - k$ form a subfamily of all dimension- $(n - k)$ subspaces of $V$ , and a minimum over a smaller family is no smaller, so $λ_{k + 1} \leq μ_{k}$ . Combining, $λ_{k + 1} \leq μ_{k} \leq λ_{k}$ . The general $m$ -deletion bound $λ_{k + m} \leq μ_{k} \leq λ_{k}$ follows by iterating the codimension-one step $m$ times. $□$

Proposition (Weyl's inequality $λ_{i + j - 1} (A + B) \leq λ_{i} (A) + λ_{j} (B)$ ). For self-adjoint $A, B$ on $V$ and indices with $i + j - 1 \leq n$ , the stated inequality holds.

Proof. Let $u_{1}, \dots, u_{n}$ be an orthonormal eigenbasis for $A$ and $w_{1}, \dots, w_{n}$ one for $B$ . Set $$ U = \mathrm{span}(u_i, u_{i+1}, \ldots, u_n), \quad \dim U = n - i + 1, \qquad W = \mathrm{span}(w_j, \ldots, w_n), \quad \dim W = n - j + 1. $$ On $U$ , every unit vector has $R_{A} \leq λ_{i} (A)$ (it is a combination of eigenvectors with eigenvalues $\leq λ_{i}$ ); on $W$ , $R_{B} \leq λ_{j} (B)$ . The intersection $U \cap W$ has dimension at least $(n - i + 1) + (n - j + 1) - n = n - i - j + 2$ . For any unit $v \in U \cap W$ , $$ R_{A+B}(v) = \langle (A+B)v, v\rangle = R_A(v) + R_B(v) \leq \lambda_i(A) + \lambda_j(B). $$ Now apply the min-max form to $A + B$ . Set $m = i + j - 1$ , so a subspace of dimension $n - m + 1 = n - i - j + 2$ exists inside $U \cap W$ . Then $$ \lambda_m(A+B) = \min_{\dim S = n-m+1}\ \max_{0\neq v\in S} R_{A+B}(v) \leq \max_{0\neq v\in U\cap W} R_{A+B}(v) \leq \lambda_i(A) + \lambda_j(B), $$ where the first inequality chooses $S$ to be an $(n - m + 1)$ -dimensional subspace of $U \cap W$ . This is the claim $λ_{i + j - 1} (A + B) \leq λ_{i} (A) + λ_{j} (B)$ . The dual lower inequality follows by applying this to $- A, - B$ and re-indexing. $□$

Proposition (Rayleigh-quotient iteration converges cubically near a simple eigenvalue). Let $A = A^ $ha v e a s im pl ee i g e n v a l u e$ \lambda $w i t h u ni t e i g e n v ec t or$ u $. T h e i t er a t i o n$ v_{m+1} = (A - R(v_m) I)^{-1} v_m $f o l l o w e d b y n or ma l i s a t i o n, s t a r t in g f r o m$ v_0 $c l osee n o ug h t o$ u $, s a t i s f i es$ \sin\angle(v_{m+1}, u) = O\big(\sin^3\angle(v_m, u)\big)$.*

Proof. Write $v_{m} = (cos θ_{m}) u + (sin θ_{m}) z_{m}$ with $z_{m} ⊥ u$ , $∥ z_{m} ∥ = 1$ , where $θ_{m} = ∠ (v_{m}, u)$ . The Rayleigh quotient satisfies $R (v_{m}) = λ cos^{2} θ_{m} + ⟨ A z_{m}, z_{m} ⟩ sin^{2} θ_{m} = λ + (R_{z} - λ) sin^{2} θ_{m}$ , with $R_{z} = ⟨ A z_{m}, z_{m} ⟩$ bounded; so $R (v_{m}) - λ = O (sin^{2} θ_{m})$ . Let $A^{'} = A ∣_{u^{⊥}}$ act on the invariant complement $u^{⊥}$ , with spectral gap $γ = dist (λ, σ (A^{'})) > 0$ by simplicity. Applying $(A - R (v_{m}) I)^{- 1}$ scales the $u$ -component by $(λ - R (v_{m}))^{- 1}$ and the $z_{m}$ -component by a factor of size at most $(γ - ∣ R (v_{m}) - λ ∣)^{- 1} = O (1)$ . Hence $$ \tan\theta_{m+1} = \frac{|\text{component} \perp u|}{|\text{component along } u|} \leq \frac{O(1)\sin\theta_m}{|\lambda - R(v_m)|^{-1}\cos\theta_m} = O\big(|R(v_m) - \lambda|\big)\tan\theta_m = O(\sin^2\theta_m)\tan\theta_m. $$ Since $tan θ_{m} = O (sin θ_{m})$ for small angles, $tan θ_{m + 1} = O (sin^{3} θ_{m})$ , giving the cubic local convergence. The self-adjointness is used twice: to make $u^{⊥}$ an $A$ -invariant orthogonal complement, and to make $R (v_{m}) - λ$ second order in $θ_{m}$ rather than first order. $□$

Connections Master

The Rayleigh quotient is the variational face of the spectral theorem 01.01.13: where the spectral theorem produces the eigenvalues by diagonalisation, the extremal characterisation produces them by optimisation, and the equivalence of the two is the identity $⟨ A v, v ⟩ = \sum_{i} λ_{i} ∣ ⟨ v, u_{i} ⟩ ∣^{2}$ . The self-adjoint hypothesis is exactly the condition from 01.01.10 that makes $⟨ A v, v ⟩$ real and the eigenvalues real, without which the quotient has no order structure to extremise.

The min-max characterisation specialises to the singular value decomposition 01.01.12 through $A^{*} A$ : the singular values are $σ_{k} = λ_{k} (A^{*} A)$ , so Courant-Fischer applied to the positive self-adjoint $A^{*} A$ gives $σ_{k} = min_{d i m S = n - k + 1} max_{0 \neq = v \in S} ∥ A v ∥/∥ v ∥$ , the variational characterisation of singular values that underlies the Eckart-Young low-rank approximation theorem.

The interlacing and perturbation inequalities feed the quadratic-form theory of 01.01.15: Sylvester's law of inertia classifies symmetric matrices by the signs of their eigenvalues, and Weyl's monotonicity plus Cauchy interlacing govern how those signs persist under congruence, compression, and perturbation, which is the algebraic content of the inertia of a bordered or perturbed form.

The eigenvector-free min-max form is the bridge to the variational method in quantum mechanics 12.07.03: the Rayleigh-Ritz estimate of a ground-state energy is the statement $E_{0} \leq R_{H} (Ψ)$ for any trial state $Ψ$ , the $k = 1$ Rayleigh case for the Hamiltonian $H$ , and the min-max over trial subspaces produces upper bounds for the excited-state energies (MacDonald's theorem) exactly as the finite-dimensional Courant-Fischer formula predicts.

Historical & philosophical context Master

The Rayleigh quotient began in mechanics, not in linear algebra. In The Theory of Sound (1877) Lord Rayleigh studied the small oscillations of a system about equilibrium and observed that the ratio of the potential energy to the kinetic energy of a trial configuration is a stationary estimate of the squared natural frequencies, accurate to second order when the trial mode is close to a true normal mode ^{[Rayleigh 1877]}. The quotient $⟨ A v, v ⟩ / ⟨ v, v ⟩$ is the abstraction of that energy ratio, with $A$ the stiffness operator and the denominator the mass form. Ritz turned the stationarity observation into an algorithm in 1909, expanding the trial configuration in a finite basis and reducing the variational problem to a generalised matrix eigenvalue problem, which is why the finite-basis variational method carries his name ^{[Ritz 1909]}.

The min-max formula appeared first in Fischer's 1905 study of real quadratic forms, where the $k$ -th eigenvalue is given as a max-min over $k$ -dimensional subspaces ^{[Fischer 1905]}. Weyl, in his 1912 work on the asymptotic distribution of the eigenvalues of the Laplacian on a membrane, used the same variational characterisation to prove both the perturbation inequalities now bearing his name and the leading-order eigenvalue-counting law, establishing that the count of eigenvalues below a threshold grows like the volume of the domain times a universal constant ^{[Weyl 1912]}. Courant extended the principle to general self-adjoint differential operators in 1920 and connected it to the nodal domains of eigenfunctions, giving the geometric form of the theorem that propagated through Methoden der mathematischen Physik into the standard toolkit of mathematical physics ^{[Courant 1920]}. The finite-dimensional Cauchy interlacing relation, which the modern proof derives as a corollary of min-max, predates all of this: Cauchy established it in 1829 while analysing the secular equation of planetary perturbation theory ^{[Cauchy 1829]}.

Bibliography Master

@book{Rayleigh1877,
  author    = {Rayleigh, John William Strutt, Baron},
  title     = {The Theory of Sound, Volume I},
  publisher = {Macmillan},
  address   = {London},
  year      = {1877}
}

@article{Fischer1905,
  author  = {Fischer, Ernst},
  title   = {{\"U}ber quadratische Formen mit reellen Koeffizienten},
  journal = {Monatshefte f{\"u}r Mathematik und Physik},
  volume  = {16},
  year    = {1905},
  pages   = {234--249}
}

@article{Ritz1909,
  author  = {Ritz, Walther},
  title   = {{\"U}ber eine neue Methode zur L{\"o}sung gewisser Variationsprobleme der mathematischen Physik},
  journal = {Journal f{\"u}r die reine und angewandte Mathematik},
  volume  = {135},
  year    = {1909},
  pages   = {1--61}
}

@article{Weyl1912,
  author  = {Weyl, Hermann},
  title   = {Das asymptotische Verteilungsgesetz der Eigenwerte linearer partieller Differentialgleichungen (mit einer Anwendung auf die Theorie der Hohlraumstrahlung)},
  journal = {Mathematische Annalen},
  volume  = {71},
  year    = {1912},
  pages   = {441--479}
}

@article{Courant1920,
  author  = {Courant, Richard},
  title   = {{\"U}ber die Eigenwerte bei den Differentialgleichungen der mathematischen Physik},
  journal = {Mathematische Zeitschrift},
  volume  = {7},
  year    = {1920},
  pages   = {1--57}
}

@article{Cauchy1829,
  author  = {Cauchy, Augustin-Louis},
  title   = {Sur l'{\'e}quation {\`a} l'aide de laquelle on d{\'e}termine les in{\'e}galit{\'e}s s{\'e}culaires des mouvements des plan{\`e}tes},
  journal = {Exercices de Math{\'e}matiques},
  volume  = {4},
  year    = {1829},
  pages   = {140--160}
}

@book{HornJohnson2013,
  author    = {Horn, Roger A. and Johnson, Charles R.},
  title     = {Matrix Analysis},
  edition   = {2nd},
  publisher = {Cambridge University Press},
  year      = {2013}
}

@book{Parlett1998,
  author    = {Parlett, Beresford N.},
  title     = {The Symmetric Eigenvalue Problem},
  publisher = {SIAM},
  series    = {Classics in Applied Mathematics},
  year      = {1998}
}

@book{ReedSimon1978,
  author    = {Reed, Michael and Simon, Barry},
  title     = {Methods of Modern Mathematical Physics, Vol. IV: Analysis of Operators},
  publisher = {Academic Press},
  address   = {New York},
  year      = {1978}
}

@book{Shilov1977,
  author    = {Shilov, Georgi E.},
  title     = {Linear Algebra},
  publisher = {Dover Publications},
  address   = {New York},
  year      = {1977},
  note      = {Translation of the 1971 Russian edition, transl. R. A. Silverman}
}

Prerequisites

01.01.13
01.01.10
01.01.09

Tier anchors

beginner: The largest and smallest stretch factors of a symmetric matrix, read off the unit circle as the high and low points of the Rayleigh quotient — Strang *Introduction to Linear Algebra* Ch. 6; 3Blue1Brown *Essence of Linear Algebra* Ch. 14 (eigenvectors as the special directions)
intermediate: Shilov *Linear Algebra* Ch. 10 §10.2 (extremal properties of eigenvalues); Horn-Johnson *Matrix Analysis* §4.2 (Rayleigh-Ritz and Courant-Fischer); Strang *Introduction to Linear Algebra* Ch. 6
master: Shilov *Linear Algebra* Ch. 10 §10.2–§10.3; Horn-Johnson *Matrix Analysis* §4.2–§4.3 (Courant-Fischer, interlacing, Weyl); Parlett *The Symmetric Eigenvalue Problem* Ch. 10–11; Reed-Simon *Methods of Modern Mathematical Physics* Vol. IV §XIII.1 (min-max for self-adjoint operators); Kato *Perturbation Theory for Linear Operators* Ch. I §6.10

References

images/Shilov-Linear-Algebra__4cbdee00cc.jpg · Shilov *Linear Algebra* — Fast Track archive cover; Ch. 10 §10.2 extremal properties of the eigenvalues of a self-adjoint operator, the Rayleigh quotient and the recursive max-min characterisation
Calculus Vol.2 - Multi-Variable Calculus and Linear Algebra with Applications (Tom Apostol).pdf · Ch. 5 §5.5–§5.6 eigenvalues of symmetric matrices, extremal properties, the quadratic form on the unit sphere
Shilov, G. E. — Linear Algebra (Dover, 1977 transl. of the 1971 Russian ed.) · Ch. 10 §10.2–§10.3 — the Rayleigh quotient $R(v) = \langle A v, v \rangle / \langle v, v \rangle$, its range $[\lambda_{\min}, \lambda_{\max}]$, and the recursive min-max characterisation of the eigenvalues of a self-adjoint operator
Horn, R. A. & Johnson, C. R. — Matrix Analysis (2nd ed.) · §4.2 Rayleigh-Ritz theorem and the Courant-Fischer min-max theorem; §4.3 Cauchy interlacing, the eigenvalue inequalities for principal submatrices, Weyl's inequalities for sums of Hermitian matrices
Rayleigh, J. W. S. (Lord Rayleigh) — The Theory of Sound, Vol. I · §88 (Macmillan, 1877) — the quotient of potential to kinetic energy as a stationary estimate of the natural frequencies of a vibrating system, the mechanical origin of the Rayleigh quotient
Fischer, E. — Über quadratische Formen mit reellen Koeffizienten · Monatshefte für Mathematik und Physik 16 (1905), 234–249 — the min-max (max-min) formula for the eigenvalues of a real quadratic form
Courant, R. — Über die Eigenwerte bei den Differentialgleichungen der mathematischen Physik · Mathematische Zeitschrift 7 (1920), 1–57 — the min-max principle for the eigenvalues of self-adjoint differential operators, nodal-domain bounds, and the monotonicity of eigenvalues under domain change
Weyl, H. — Das asymptotische Verteilungsgesetz der Eigenwerte linearer partieller Differentialgleichungen · Mathematische Annalen 71 (1912), 441–479 — the asymptotic distribution of eigenvalues and the perturbation inequalities $\lambda_{i+j-1}(A+B) \le \lambda_i(A) + \lambda_j(B)$ for Hermitian operators
Ritz, W. — Über eine neue Methode zur Lösung gewisser Variationsprobleme der mathematischen Physik · Journal für die reine und angewandte Mathematik 135 (1909), 1–61 — the trial-subspace variational algorithm that turns the extremal characterisation into a computational eigenvalue method
Cauchy, A.-L. — Sur l'équation à l'aide de laquelle on détermine les inégalités séculaires des mouvements des planètes · Exercices de Mathématiques 4 (1829), 140–160 — the interlacing of the eigenvalues of a real symmetric matrix and those of a bordered (principal-submatrix) matrix, the original Cauchy interlacing statement

Estimated time

beginner: 18m
intermediate: 45m
master: 90m