01.01.13 · foundations / linear-algebra

Spectral theorem for normal operators on a finite-dim inner-product space (principal-axes theorem)

shipped3 tiersLean: none

Anchor (Master): Shilov Linear Algebra Ch. 9 §9.3 + Ch. 10 §10.1; Halmos Finite-Dimensional Vector Spaces §73–§80; Hoffman-Kunze Linear Algebra Ch. 8 §8.4–§8.5; Axler Linear Algebra Done Right §7.B–§7.C; Reed-Simon Methods of Modern Mathematical Physics Vol. I §VII for the Hilbert-space extensions; Kato Perturbation Theory for Linear Operators Ch. V

Intuition Beginner

A matrix tells you how a linear map stretches space. Most maps mix rotation and stretching together, and the rotation hides the simple part. The spectral theorem says that for a certain large class of maps — the normal ones — you can change to a special set of perpendicular axes in which the rotation disappears entirely, and the map becomes pure stretching along each axis.

A symmetric matrix on the plane is the cleanest example. Its level sets are ellipses, and the ellipses always have two perpendicular principal axes 01.01.15. Rotate to those axes and the matrix becomes diagonal, with the eigenvalues sitting on the diagonal as the stretch factors. The principal-axes theorem is the statement that this rotation-to-diagonal works in every dimension, not only on the plane.

The complex version covers more maps. A normal map is one that commutes with its mirror image (the adjoint). The class includes symmetric maps, rotations, and reflections. For every normal map there is an orthonormal basis of eigenvectors, and the map acts on each basis vector by a single scalar. The mixing of rotation and stretching always comes apart along perpendicular eigendirections.

Visual Beginner

The picture shows a real symmetric two-by-two matrix and the rotation that diagonalises it. On the left, a unit circle is mapped to an ellipse by the matrix; the principal axes of the ellipse point along the eigendirections, and the lengths of the semi-axes are the eigenvalues. On the right, an orthonormal basis rotated to match the eigendirections, with the matrix in the new basis written as a diagonal matrix carrying the two eigenvalues.

Two pieces of data are visible. The two eigendirections are perpendicular — that is what the spectral theorem guarantees for a symmetric matrix. The eigenvalues are the stretch factors along those directions, and the diagonal matrix in the rotated basis is the whole map.

Worked example Beginner

Take the two-by-two real symmetric matrix

S = (2112) .

Step 1. Compute the characteristic polynomial. Expand $det (t I - S) = (t - 2)^{2} - 1 = t^{2} - 4 t + 3 = (t - 1) (t - 3)$ . The two eigenvalues are $λ_{1} = 1$ and $λ_{2} = 3$ .

Step 2. Find an eigenvector for $λ_{1} = 1$ . Solve $(S - I) v = 0$ , where $S - I$ is the matrix with row entries $(1, 1)$ and $(1, 1)$ . A solution is $v_{1} = (1, - 1)$ . Normalise to $u_{1} = (1/ 2, - 1/ 2)$ .

Step 3. Find an eigenvector for $λ_{2} = 3$ . Solve $(S - 3 I) v = 0$ , where $S - 3 I$ has row entries $(- 1, 1)$ and $(1, - 1)$ . A solution is $v_{2} = (1, 1)$ . Normalise to $u_{2} = (1/ 2, 1/ 2)$ .

Step 4. Check that $u_{1}$ and $u_{2}$ are perpendicular. The dot product is $u_{1} \cdot u_{2} = (1/ 2) (1/ 2) + (- 1/ 2) (1/ 2) = 1/2 - 1/2 = 0$ . The eigenvectors are orthogonal as predicted by the spectral theorem.

Step 5. Assemble the diagonalisation. Form the orthogonal matrix $Q$ whose columns are $u_{1}$ and $u_{2}$ :

Q = \frac{1}{2} (1 - 1 11), Q^{⊤} S Q = (1003) .

You can verify the right-hand equality by direct matrix multiplication. The original symmetric matrix becomes a diagonal matrix in the rotated basis, with the two eigenvalues on the diagonal.

What this tells us: every real symmetric matrix factors as $S = Q Λ Q^{⊤}$ with $Q$ orthogonal and $Λ$ diagonal. The columns of $Q$ are an orthonormal eigenbasis, and the diagonal entries of $Λ$ are the eigenvalues. The factorisation is the principal-axes theorem in a single equation.

Check your understanding Beginner

Formal definition Intermediate+

Let $V$ be a finite-dimensional inner-product space over $F \in {R, C}$ with inner product $⟨ \cdot, \cdot ⟩$ — conjugate-linear in the second argument over $C$ , linear in both arguments over $R$ . Given a linear operator $T : V \to V$ , the adjoint $T^{*} : V \to V$ is the unique linear operator satisfying $⟨ T v, w ⟩ = ⟨ v, T^{*} w ⟩$ for all $v, w \in V$ . Existence and uniqueness follow from the Riesz representation theorem for linear functionals on $V$ together with finite-dimensionality. In any orthonormal basis the matrix of $T^{*}$ is the conjugate transpose of the matrix of $T$ ; over $R$ this is the ordinary transpose.

The operator $T$ is normal iff $T T^{*} = T^{*} T$ . Three load-bearing special cases:

Self-adjoint (or Hermitian over $C$ , symmetric over $R$ ): $T = T^{*}$ .
Unitary (over $C$ ) or orthogonal (over $R$ ): $T T^{*} = T^{*} T = I$ , equivalently $T$ preserves the inner product.
Anti-self-adjoint (or skew-Hermitian / skew-symmetric): $T = - T^{*}$ .

Each of the three classes is automatically normal because the defining identity implies commutation with $T^{*}$ . The converse fails: the matrix $T = diag (1, i)$ is normal but neither self-adjoint, unitary, nor skew-adjoint.

An orthogonal projection $P : V \to V$ is a linear operator satisfying $P^{2} = P$ and $P = P^{*}$ . Its range is a subspace $W \subseteq V$ , and $P$ is the orthogonal projection onto $W$ (along its orthogonal complement $W^{⊥}$ ). The spectral theorem produces a family of pairwise-orthogonal orthogonal projections summing to the identity.

The spectrum of $T$ , written $spec (T)$ , is the set of eigenvalues, equivalently the roots of the characteristic polynomial $χ_{T} (t) = det (t I - T)$ . Over $C$ the spectrum is non-empty (fundamental theorem of algebra); over $R$ the spectrum may be empty (e.g., the rotation matrix on $R^{2}$ ).

Counterexamples to common slips

The condition $T T^{*} = T^{*} T$ is genuinely weaker than self-adjointness. The diagonal matrix $diag (1, i, - 1, - i) \in M_{4} (C)$ is normal — every diagonal matrix is — but is not self-adjoint, since its eigenvalues are not real.
The real-spectral-theorem direction "orthogonally diagonalisable implies symmetric" needs care: if $S = Q Λ Q^{⊤}$ with $Q$ orthogonal and $Λ$ real diagonal, then $S^{⊤} = Q Λ^{⊤} Q^{⊤} = Q Λ Q^{⊤} = S$ . The converse direction "symmetric implies orthogonally diagonalisable" is the substantive content of the theorem and uses the complex case as a black box.
Real matrices satisfying $T T^{⊤} = T^{⊤} T$ (real-normal) are not in general orthogonally diagonalisable over $R$ . The rotation matrix $R = (01 - 1 0)$ is real-normal (in fact orthogonal) but has no real eigenvalues; the diagonalisation lives over $C$ . The real-normal canonical form replaces complex-conjugate eigenvalue pairs by real two-by-two rotation blocks.

Key theorem with proof Intermediate+

Theorem (complex spectral theorem; Shilov Ch. 10 §10.1 ^{[source pending]}; Halmos §79 ^{[source pending]}). Let $V$ be a finite-dimensional complex inner-product space and $T : V \to V$ a linear operator. The following are equivalent:

(i) $T$ is normal: $T T^ = T^* T$.*

(ii) $V$ has an orthonormal basis of eigenvectors of $T$ .

(iii) There is a spectral decomposition $$ T = \sum_{i=1}^k \lambda_i P_i, $$ where $λ_{1}, \dots, λ_{k} \in C$ are the distinct eigenvalues of $T$ and $P_{1}, \dots, P_{k}$ are pairwise-orthogonal orthogonal projections summing to the identity: $\sum_{i} P_{i} = I$ and $P_{i} P_{j} = δ_{ij} P_{i}$ .

Proof. The implications $(ii) \Rightarrow (iii) \Rightarrow (i)$ are direct computations; the content is $(i) \Rightarrow (ii)$ , which is what the five-step argument below establishes.

Step 1 (existence of an eigenvector). Over $C$ the characteristic polynomial $χ_{T} (t) \in C [t]$ has degree $dim V \geq 1$ and is non-constant, so by the fundamental theorem of algebra it has a root $λ \in C$ . The operator $T - λ I$ is then singular, hence has a non-zero element $v$ in its kernel: $T v = λ v$ . So $T$ has at least one eigenvector. This step is the only place complex coefficients enter; the real spectral theorem reduces to this step via complexification.

Step 2 (same eigenvector for $T$ and $T^{*}$ , with conjugate eigenvalue). Suppose $T v = λ v$ with $v \neq = 0$ and $T$ normal. Then $(T - λ I) (T^{*} - \overset{ˉ}{λ} I) = T T^{*} - λ T^{*} - \overset{ˉ}{λ} T + ∣ λ ∣^{2} I$ , and the same expression equals $(T^{*} - \overset{ˉ}{λ} I) (T - λ I)$ since $T T^{*} = T^{*} T$ . So $T - λ I$ is itself normal. Compute the norm: $$ |(T - \lambda I) v|^2 = \langle (T - \lambda I) v, (T - \lambda I) v \rangle = \langle v, (T - \lambda I)^* (T - \lambda I) v \rangle. $$ By normality $(T - λ I)^{*} (T - λ I) = (T - λ I) (T - λ I)^{*}$ , and the same calculation gives $∥ (T - λ I)^{*} v ∥^{2} = ∥ (T - λ I) v ∥^{2}$ . Setting $(T - λ I) v = 0$ then forces $(T^{*} - \overset{ˉ}{λ} I) v = 0$ , i.e., $T^{*} v = \overset{ˉ}{λ} v$ . The eigenvector of $T$ at $λ$ is also an eigenvector of $T^{*}$ at $\overset{ˉ}{λ}$ .

Step 3 (eigenspaces at distinct eigenvalues are orthogonal). Let $T v_{1} = λ_{1} v_{1}$ and $T v_{2} = λ_{2} v_{2}$ with $λ_{1} \neq = λ_{2}$ . Using Step 2, $T^{*} v_{2} = \overset{ˉ}{λ}_{2} v_{2}$ . Then $$ \lambda_1 \langle v_1, v_2 \rangle = \langle \lambda_1 v_1, v_2 \rangle = \langle T v_1, v_2 \rangle = \langle v_1, T^* v_2 \rangle = \langle v_1, \bar\lambda_2 v_2 \rangle = \lambda_2 \langle v_1, v_2 \rangle. $$ The conjugate-linearity of the inner product in the second slot turned $\overset{ˉ}{λ}_{2}$ into $λ_{2}$ on the right-hand side. Subtracting gives $(λ_{1} - λ_{2}) ⟨ v_{1}, v_{2} ⟩ = 0$ , and since $λ_{1} \neq = λ_{2}$ we conclude $⟨ v_{1}, v_{2} ⟩ = 0$ .

Step 4 (orthogonal complement of an eigenspace is invariant). Let $E_{λ} = ker (T - λ I)$ be an eigenspace and $W = E_{λ}^{⊥}$ its orthogonal complement. For $w \in W$ and $v \in E_{λ}$ , compute $$ \langle T w, v \rangle = \langle w, T^* v \rangle = \langle w, \bar\lambda v \rangle = \lambda \langle w, v \rangle = 0, $$ where Step 2 supplied $T^{*} v = \overset{ˉ}{λ} v$ , and conjugate-linearity in the second slot turned $\overset{ˉ}{λ}$ into $λ$ when pulled out. So $T w \in W$ , meaning $W$ is $T$ -invariant. By the same argument applied to $T^{*}$ , $W$ is also $T^{*}$ -invariant, so the restriction $T ∣_{W}$ is again normal as an operator on $W$ with its inherited inner product.

Step 5 (induction on dimension). Argue by induction on $n = dim V$ . The base case $n = 1$ is immediate: any unit vector is an orthonormal eigenbasis. For $n > 1$ , Step 1 supplies an eigenvalue $λ_{1}$ and eigenspace $E_{λ_{1}}$ of positive dimension. The orthogonal complement $W = E_{λ_{1}}^{⊥}$ has $dim W < n$ , the restriction $T ∣_{W}$ is normal by Step 4, and by inductive hypothesis $W$ admits an orthonormal eigenbasis for $T ∣_{W}$ . Concatenate an orthonormal basis of $E_{λ_{1}}$ (taking each eigenvector to be a unit vector and applying Gram-Schmidt within the eigenspace) with the orthonormal eigenbasis of $W$ . The result is an orthonormal eigenbasis of $V$ for $T$ . This completes the induction and proves (i) $\Rightarrow$ (ii).

For (ii) $\Rightarrow$ (iii): given the orthonormal eigenbasis, group eigenvectors by eigenvalue. Let $P_{i}$ be the orthogonal projection onto the $λ_{i}$ -eigenspace $E_{λ_{i}}$ . Since the eigenspaces are pairwise orthogonal (Step 3) and span $V$ (eigenbasis exhausts $V$ ), the projections satisfy $\sum_{i} P_{i} = I$ and $P_{i} P_{j} = δ_{ij} P_{i}$ . On any basis vector $v \in E_{λ_{j}}$ , $T v = λ_{j} v = \sum_{i} λ_{i} P_{i} v$ since $P_{i} v = δ_{ij} v$ . By linearity $T = \sum_{i} λ_{i} P_{i}$ .

For (iii) $\Rightarrow$ (i): given the spectral decomposition, compute $T^{*} = \sum_{i} \overset{ˉ}{λ}_{i} P_{i}$ using $P_{i}^{*} = P_{i}$ . Then $T T^{*} = \sum_{i, j} λ_{i} \overset{ˉ}{λ}_{j} P_{i} P_{j} = \sum_{i} ∣ λ_{i} ∣^{2} P_{i} = T^{*} T$ , where the cross-terms with $i \neq = j$ vanish by $P_{i} P_{j} = 0$ . So $T$ is normal. $□$

Bridge. The complex spectral theorem builds toward every operator-theoretic structural result on a finite-dimensional inner-product space, and it appears again in 01.01.12 (singular value decomposition), where the spectral theorem applied to $T^{*} T$ produces the singular values and right singular vectors of an arbitrary linear map. The foundational reason the theorem holds is exactly the normality identity $T T^{*} = T^{*} T$ : it forces $T$ and $T^{*}$ to share eigenvectors with conjugate eigenvalues (Step 2), and this is what makes the orthogonal complement of an eigenspace $T$ -invariant (Step 4) instead of merely $T$ -stable in some weaker sense. The central insight is that normality identifies the operator $T$ with a sum $\sum_{i} λ_{i} P_{i}$ over its spectrum, with the orthogonal projections $P_{i}$ resolving the identity. Putting these together, the spectral decomposition generalises to the real symmetric case (real eigenvalues + orthogonal eigenbasis), to the principal-axes theorem for quadratic forms, to the singular value decomposition for arbitrary operators, and is dual to the Schur decomposition that triangularises every operator over $C$ . The bridge is that the cap of the finite-dimensional spectral theorem — a finite sum $T = \sum_{i} λ_{i} P_{i}$ — generalises to an integral $T = \int_{spec (T)} λ d E (λ)$ against a projection-valued measure $E$ in the unbounded self-adjoint case treated by von Neumann.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Find an orthonormal eigenbasis of the unitary matrix $T = (01 - 1 0)$ acting on $C^{2}$ with the standard Hermitian inner product.

Hint

Solve $(T - i I) v = 0$ and $(T + i I) v = 0$ separately, then normalise each eigenvector. The eigenvalues $\pm i$ are distinct, so the two eigenvectors are automatically orthogonal.

Answer

For $λ_{1} = i$ : solve $(T - i I) v = 0$ , that is $(- i 1 - 1 - i) v = 0$ . The two rows are proportional ( $- i$ times the first equals the second), so the solution space is spanned by $v_{1} = (1, - i)$ . Normalise: $∥ v_{1} ∥^{2} = 1 + ∣ - i ∣^{2} = 2$ , so $u_{1} = \frac{1}{2} (1, - i)$ .

For $λ_{2} = - i$ : solve $(T + i I) v = 0$ , that is $(i 1 - 1 i) v = 0$ . The solution space is spanned by $v_{2} = (1, i)$ , normalised to $u_{2} = \frac{1}{2} (1, i)$ .

Verify orthogonality: $⟨ u_{1}, u_{2} ⟩ = \frac{1}{2} (1 \cdot \overset{ˉ}{1} + (- i) (\overset{ˉ}{i})) = \frac{1}{2} (1 + (- i) (- i)) = \frac{1}{2} (1 - 1) = 0$ . The two eigenvectors are unitary-orthogonal, the spectrum ${i, - i}$ sits on the unit circle (a unitary signature), and $T = i \cdot u_{1} u_{1}^{*} + (- i) \cdot u_{2} u_{2}^{*}$ is the spectral decomposition.

Rubric: full credit for both normalised eigenvectors and the verification of orthogonality.

Exercise 7 (hard, short-answer).

State and prove the real spectral theorem: a linear operator $T$ on a finite-dimensional real inner-product space is orthogonally diagonalisable iff $T = T^{*}$ (symmetric).

Hint

For ( $\Leftarrow$ ), complexify $V$ to $V_{C} = V \otimes_{R} C$ , apply the complex spectral theorem, and use Exercise 4 to argue the eigenvalues are real, hence the eigenvectors can be chosen real.

Answer

( $\Rightarrow$ ) If $T = Q Λ Q^{⊤}$ with $Q$ orthogonal and $Λ$ real diagonal, then $T^{⊤} = Q Λ^{⊤} Q^{⊤} = Q Λ Q^{⊤} = T$ , so $T$ is symmetric.

( $\Leftarrow$ ) Let $T : V \to V$ be symmetric. Complexify: form $V_{C} = V \otimes_{R} C$ with the Hermitian inner product extending the real inner product on $V$ , and extend $T$ to a $C$ -linear $T_{C} : V_{C} \to V_{C}$ . The complexified operator $T_{C}$ is self-adjoint (Hermitian) with respect to the Hermitian inner product because $T$ is symmetric. By Exercise 4 every eigenvalue of $T_{C}$ is real, so the spectrum of $T_{C}$ lies in $R$ . By the complex spectral theorem, $V_{C}$ has an orthonormal eigenbasis of $T_{C}$ with real eigenvalues $λ_{i} \in R$ .

The complexification map $V ↪ V_{C}$ has image the real subspace fixed by complex conjugation. For each real eigenvalue $λ$ , the $λ$ -eigenspace $E_{λ} (T_{C})$ is conjugation-invariant (since $T_{C}$ and $λ$ are real), so it splits as $E_{λ} (T_{C}) = E_{λ} (T) \otimes_{R} C$ where $E_{λ} (T) = ker (T - λ I) \subseteq V$ . Pick a real orthonormal basis of each real eigenspace $E_{λ} (T)$ (using Gram-Schmidt in $V$ ), and concatenate across eigenvalues. Distinct real eigenspaces are orthogonal in $V$ (the proof in Step 3 of the key theorem uses only real inner-product properties when the eigenvalues are real). The result is a real orthonormal eigenbasis of $V$ , and the matrix of $T$ in that basis is real diagonal.

Rubric: full credit for the complexification argument together with the reality-of-eigenvalues step and the descent to a real orthonormal eigenbasis.

Exercise 8 (hard, symbolic).

Use the spectral decomposition to compute $exp (S)$ for the matrix $S = (2112)$ .

Hint

Write $S = λ_{1} P_{1} + λ_{2} P_{2}$ where $λ_{1} = 1$ , $λ_{2} = 3$ , and $P_{1}, P_{2}$ are the orthogonal projections onto the two eigenspaces. Then $exp (S) = e^{λ_{1}} P_{1} + e^{λ_{2}} P_{2}$ by the functional calculus.

Answer

Eigenvectors: $u_{1} = \frac{1}{2} (1, - 1)$ at $λ_{1} = 1$ and $u_{2} = \frac{1}{2} (1, 1)$ at $λ_{2} = 3$ . Orthogonal projections: $$ P_1 = u_1 u_1^\top = \frac{1}{2}\begin{pmatrix} 1 & -1 \ -1 & 1 \end{pmatrix}, \qquad P_2 = u_2 u_2^\top = \frac{1}{2}\begin{pmatrix} 1 & 1 \ 1 & 1 \end{pmatrix}. $$

Functional calculus: $$ \exp(S) = e^1 P_1 + e^3 P_2 = \frac{e}{2}\begin{pmatrix} 1 & -1 \ -1 & 1 \end{pmatrix} + \frac{e^3}{2}\begin{pmatrix} 1 & 1 \ 1 & 1 \end{pmatrix} = \frac{1}{2}\begin{pmatrix} e + e^3 & e^3 - e \ e^3 - e & e + e^3 \end{pmatrix}. $$

Rubric: full credit for the matrix of $exp (S)$ with the spectral-decomposition derivation.

Lean formalization Intermediate+

Mathlib's Mathlib.Analysis.InnerProductSpace.Spectrum provides the finite-dimensional spectral theorem for self-adjoint operators via LinearMap.IsSymmetric.eigenvectorBasis, which constructs an orthonormal basis of eigenvectors and supplies the diagonalisation. The normal-operator generalisation is partially packaged through IsStarNormal together with the spectral decomposition for elements of a star algebra, but no single theorem unifies (i) the normality condition, (ii) the orthonormal eigenbasis, and (iii) the spectral decomposition $T = \sum_{i} λ_{i} P_{i}$ with the orthogonal-projection-family identities. The intended formalisation reads schematically:

import Mathlib.Analysis.InnerProductSpace.Adjoint
import Mathlib.Analysis.InnerProductSpace.Spectrum
import Mathlib.Analysis.NormedSpace.Star.Basic

variable {𝕜 V : Type*} [RCLike 𝕜]
  [NormedAddCommGroup V] [InnerProductSpace 𝕜 V] [FiniteDimensional 𝕜 V]
  (T : V →L[𝕜] V)

/-- Complex spectral theorem for normal operators on a finite-dim
inner-product space: a normal operator has an orthonormal basis of
eigenvectors. -/
theorem normal_spectralTheorem (hT : IsStarNormal T) :
    ∃ (n : ℕ) (b : Fin n → V) (λ : Fin n → 𝕜),
      Orthonormal 𝕜 b ∧ (∀ i, T (b i) = λ i • b i) ∧
      Submodule.span 𝕜 (Set.range b) = ⊤ :=
  sorry  -- Step 1 (FTA-driven eigenvector), Steps 2-4 (normality lemmas),
         -- Step 5 (induction on dim)

/-- Spectral decomposition: T = ∑ λᵢ • Pᵢ with orthogonal projections
Pᵢ summing to the identity and pairwise-orthogonal. -/
theorem normal_spectralDecomposition (hT : IsStarNormal T) :
    ∃ (k : ℕ) (λ : Fin k → 𝕜) (P : Fin k → V →L[𝕜] V),
      (∀ i, IsOrthogonalProjection (P i)) ∧
      (∀ i j, i ≠ j → P i * P j = 0) ∧
      (∑ i, P i = 1) ∧
      (T = ∑ i, (λ i : 𝕜) • P i) :=
  sorry  -- repackage normal_spectralTheorem by grouping eigenvectors

/-- Real spectral theorem: a symmetric operator on a finite-dim
real inner-product space is orthogonally diagonalisable. -/
theorem symmetric_spectralTheorem {V : Type*}
    [NormedAddCommGroup V] [InnerProductSpace ℝ V] [FiniteDimensional ℝ V]
    (T : V →L[ℝ] V) (hT : LinearMap.IsSymmetric T.toLinearMap) :
    ∃ (n : ℕ) (b : Fin n → V) (λ : Fin n → ℝ),
      Orthonormal ℝ b ∧ (∀ i, T (b i) = λ i • b i) :=
  sorry  -- complexify, apply normal_spectralTheorem, descend to ℝ

The proof gap to a clean Mathlib contribution: (i) define IsOrthogonalProjection if absent; (ii) prove the normality-implies-shared-eigenvector lemma T.IsStarNormal → T x = λ • x → T.adjoint x = (starRingEnd 𝕜 λ) • x; (iii) prove the orthogonal-complement-of-eigenspace-is-invariant lemma; (iv) package the induction-on-dimension step that combines these. Each piece is formalisable from existing Mathlib infrastructure, but no single named theorem currently presents the spectral decomposition in the textbook form $T = \sum_{i} λ_{i} P_{i}$ with the projection-family identities as a single result.

Advanced results Master

Theorem (real spectral theorem; Hoffman-Kunze §8.4 ^{[source pending]}). Let $V$ be a finite-dimensional real inner-product space and $T : V \to V$ a linear operator. Then $T$ is orthogonally diagonalisable iff $T = T^ $(sy mm e t r i c), in w hi c h c a se$ V $a d mi t s an or t h o n or ma l e i g e nba s i so f r e a l e i g e n v ec t or s an d t h es p ec t r u m l i es in$ \mathbb{R}$.*

The proof complexifies $V$ to $V_{C}$ , applies the complex spectral theorem to $T_{C}$ , uses the self-adjoint reality-of-eigenvalues argument to confine the spectrum to $R$ , and descends an orthonormal real eigenbasis by averaging each complex eigenvector with its conjugate. The full descent argument is given in the proof set below.

Theorem (principal-axes theorem for quadratic forms; Shilov Ch. 10 §10.1 ^{[source pending]}). Every quadratic form $Q : R^{n} \to R$ , $Q (x) = x^{⊤} S x$ with $S$ real symmetric, is congruent under orthogonal change of basis to a diagonal form $Q (y) = \sum_{i} λ_{i} y_{i}^{2}$ , where $λ_{i}$ are the eigenvalues of $S$ . The directions of the new basis are the principal axes of $Q$ .

Specifically, the orthogonal matrix $Q$ in the diagonalisation $S = Q Λ Q^{⊤}$ is the change-of-basis matrix whose columns are the orthonormal eigenvectors of $S$ . Setting $y = Q^{⊤} x$ rewrites $x^{⊤} S x = (Q y)^{⊤} S (Q y) = y^{⊤} Λ y = \sum_{i} λ_{i} y_{i}^{2}$ . Sylvester's law of inertia 01.01.15 further refines this: the signature $(p, q, z)$ — the number of positive, negative, and zero eigenvalues — is invariant under arbitrary congruence by $GL_{n} (R)$ , not only orthogonal congruence.

Theorem (Schur decomposition; Schur 1909 Math. Ann. 66, 488–510 ^{[source pending]}). Every linear operator $T : V \to V$ on a finite-dimensional complex inner-product space admits a unitary basis change taking $T$ to upper-triangular form: $T = U R U^ $w i t h$ U $u ni t a r y an d$ R $u pp er - t r ian g u l a r . T h e d ia g o na l e n t r i eso f$ R $a r e t h ee i g e n v a l u eso f$ T$, in any chosen order.*

The Schur form holds for every operator, not only normal ones. The spectral theorem is the special case where normality forces the strictly upper-triangular part of $R$ to vanish, so $R$ is diagonal. The Schur form is the strongest universal canonical form available in the unitary-similarity category, and it is the input to QR-style numerical eigenvalue algorithms.

Theorem (real-normal canonical form). A real linear operator $T : V \to V$ satisfying $T T^{⊤} = T^{⊤} T$ is orthogonally similar to a block-diagonal matrix with real one-by-one blocks $(λ_{i})$ for real eigenvalues and real two-by-two blocks $(a b - b a)$ for complex-conjugate eigenvalue pairs $a \pm bi$ ( $b > 0$ ).

Real-orthogonal matrices (e.g., rotation matrices) are real-normal but in general have no real eigenvalues; the two-by-two rotation blocks encode the complex-conjugate eigenvalue pairs on the unit circle.

Theorem (functional calculus for normal operators). Let $T$ be normal with spectral decomposition $T = \sum_{i} λ_{i} P_{i}$ . For any function $f : spec (T) \to C$ , define $$ f(T) = \sum_i f(\lambda_i) P_i. $$ The map $f \mapsto f (T)$ is a $ $- a l g e b r ah o m o m or p hi s m f r o m t h e a l g e b r a o f f u n c t i o n so n$ \mathrm{spec}(T) $t o t h eo p er a t or a l g e b r a$ \mathrm{End}(V) $. I tt ak es t h e i d e n t i t y f u n c t i o n$ f(\lambda) = \lambda $t o$ T $, t h eco n s t an t$ 1 $t o t h e i d e n t i t y o p er a t or,$ f g $t o$ f(T) g(T) $, an d co m pl e x co nj ug a t i o n$ \bar f $t o t h e a d j o in t$ f(T)^ $. F or p o l y n o mia l$ f \in \mathbb{C}[t] $,$ f(T) $a g r ees w i t h t h e p o l y n o mia l e v a l u a t i o n; f or$ f = \exp $,$ f(T) = \exp(T) = \sum_i e^{\lambda_i} P_i $; f or$ f(\lambda) = \sqrt{\lambda} $o na p os i t i v eo p er a t or,$ f(T)$ is the unique positive square root.

The finite-dimensional functional calculus is the model for the continuous functional calculus on a C*-algebra (Gelfand-Naimark) and for the Borel functional calculus on a von Neumann algebra; the spectral-measure formulation extends $f (T)$ from continuous to Borel functions and from $\sum_{i}$ to $\int_{spec (T)} f d E$ .

Theorem (compact self-adjoint operators on Hilbert space; Riesz-Schauder, Hilbert 1906 + Riesz 1907 ^{[source pending]}). Let $H$ be a separable Hilbert space and $T : H \to H$ a compact self-adjoint operator. The spectrum of $T$ is a countable subset of $R$ with $0$ as the only possible accumulation point. Each non-zero eigenvalue has finite-dimensional eigenspace, and $H$ admits an orthonormal basis of eigenvectors of $T$ . Equivalently, $$ T = \sum_{n=1}^\infty \lambda_n P_n, $$ with the sum converging in the operator norm and $λ_{n} \to 0$ .

This is the infinite-dimensional generalisation that motivated the development of functional analysis: Hilbert's 1906 study of symmetric integral kernels on $L^{2} [a, b]$ established the spectral theorem for the compact case, and Riesz 1907/1910 abstracted it to compact operators on an arbitrary Hilbert space, founding the Riesz-Schauder theory. The hypothesis "compact" replaces "finite-dimensional"; the conclusion is the same diagonalisation, now indexed by a countable spectrum accumulating only at $0$ .

Theorem (bounded self-adjoint operators on Hilbert space; multiplication-operator form, Hahn-Hellinger). Let $H$ be a separable Hilbert space and $T : H \to H$ a bounded self-adjoint operator. There exist a $σ$ -finite measure space $(X, μ)$ , a bounded measurable function $f : X \to R$ , and a unitary isomorphism $U : H \to L^{2} (X, μ)$ such that $U T U^{- 1}$ is multiplication by $f$ : $$ (U T U^{-1} \psi)(x) = f(x) \psi(x). $$ Equivalently (and more invariantly), there is a projection-valued measure $E$ on the Borel $σ$ -algebra of $spec (T) \subseteq R$ such that $T = \int_{spec (T)} λ d E (λ)$ .

The multiplication-operator form replaces the finite sum $\sum_{i} λ_{i} P_{i}$ by integration of the identity function against the spectral measure $E$ . Hahn-Hellinger established the multiplicity-theoretic refinement: the unitary equivalence class of $T$ is determined by a multiplicity function on $spec (T)$ .

Theorem (unbounded self-adjoint operators on Hilbert space; von Neumann 1929 Math. Ann. 102, 49–131 ^{[source pending]}). Let $T : D (T) \to H$ be an unbounded self-adjoint operator on a separable Hilbert space, with dense domain $D (T) \subseteq H$ and $T = T^ $a s u nb o u n d e d o p er a t or s . T h er ee x i s t s a u ni q u e p r o j ec t i o n - v a l u e d m e a s u r e$ E $o n t h e B or e l$ \sigma $- a l g e b r a o f$ \mathbb{R}$ such that* $$ T = \int_{\mathbb{R}} \lambda , dE(\lambda), \qquad D(T) = \left{ \psi \in H : \int_{\mathbb{R}} \lambda^2 , d|E_\lambda \psi|^2 < \infty \right}. $$

The unbounded case is the technically deepest version of the spectral theorem and the one that underwrites quantum mechanics: observables are unbounded self-adjoint operators on a Hilbert space of state vectors, and the projection-valued measure $E$ is the measurement postulate. Stone's theorem identifies the one-parameter unitary group $U_{t} = e^{i tT} = \int e^{i t λ} d E (λ)$ as the dynamical evolution generated by the observable $T$ .

Theorem (normal operators on Hilbert space). A bounded normal operator $T$ on a separable Hilbert space admits a projection-valued measure $E$ on the Borel $σ$ -algebra of $spec (T) \subseteq C$ such that $T = \int_{spec (T)} λ d E (λ)$ . Equivalently, $T$ is unitarily equivalent to multiplication by a bounded measurable complex-valued function on a $σ$ -finite measure space.

The complex spectrum supports the projection-valued measure; the proof passes through the C*-algebra generated by $T$ and applies Gelfand-Naimark to identify it with continuous functions on $spec (T)$ . The unbounded normal case requires an additional commutativity hypothesis between $T$ and $T^{*}$ on domains (Putnam-Fuglede).

Synthesis. The finite-dimensional spectral theorem for normal operators is the foundational reason the inner-product geometry of $V$ admits an operator-adapted orthonormal basis: the operator $T$ and its adjoint $T^{*}$ commute exactly when their joint eigenspaces resolve the identity, and this is exactly the content of the orthogonal-eigenbasis statement. The central insight is that normality identifies $T$ with a finite sum $\sum_{i} λ_{i} P_{i}$ of orthogonal projections weighted by complex scalars, and the projections form a partition of unity in the inner-product geometry of $V$ . Putting these together, one spectral framework produces the real symmetric case (eigenvalues confined to $R$ ), the unitary case (eigenvalues on the unit circle), the anti-self-adjoint case (eigenvalues imaginary), the principal-axes theorem for quadratic forms, the singular value decomposition via the spectral theorem applied to $T^{*} T$ , and the functional calculus $f (T) = \sum_{i} f (λ_{i}) P_{i}$ for arbitrary spectral functions. The bridge is that the finite sum generalises to an operator-valued integral $T = \int_{spec (T)} λ d E (λ)$ against a projection-valued measure, and this is exactly the form taken by the spectral theorem for unbounded self-adjoint operators on Hilbert space, builds toward 02.11.03 (unbounded self-adjoint operators) where the integral form supplies the dynamical-evolution generator of quantum mechanics, and appears again in 01.01.12 (singular value decomposition) as the diagonalisation of $T^{*} T$ that produces singular values for arbitrary linear maps.

The spectral framework identifies several pairings that look distinct at first inspection. The real-symmetric diagonalisation is exactly the orthogonal-congruence classification of quadratic forms via the principal-axes theorem. The unitary diagonalisation is exactly the Fourier-style decomposition of a unitary operator into eigenspaces on the unit circle, the finite-dimensional model for Stone's theorem on one-parameter unitary groups. The Schur decomposition supplies an upper-triangular form for every complex operator, and normality is the precise additional condition that collapses Schur to diagonal. This pattern recurs at every level of generality: in the Hilbert-space case the spectral theorem for compact self-adjoint operators (Hilbert-Riesz-Schauder) gives the same orthonormal-eigenbasis statement with a countable spectrum, the bounded self-adjoint case replaces the eigenbasis by a multiplication operator on an $L^{2}$ -space (Hahn-Hellinger), and the unbounded case packages everything in a projection-valued measure (von Neumann). At each level, the finite-dimensional formula $T = \sum_{i} λ_{i} P_{i}$ is the model, and the upgrade is to a measure-theoretic integral. The foundational identification — operator decomposes against its spectrum — is preserved across the whole hierarchy.

Full proof set Master

Theorem (complex spectral theorem), proof. Given in the Intermediate-tier section: existence of an eigenvalue from the fundamental theorem of algebra (Step 1), shared eigenvector for $T$ and $T^{*}$ with conjugate eigenvalue using normality of $T - λ I$ (Step 2), orthogonality of eigenspaces at distinct eigenvalues via the adjoint-pairing argument (Step 3), $T$ -invariance of the orthogonal complement of an eigenspace (Step 4), and induction on $dim V$ (Step 5). The implication (ii) $\Rightarrow$ (iii) groups eigenvectors by eigenvalue and reads off the projection-family decomposition; (iii) $\Rightarrow$ (i) computes $T T^{*}$ and $T^{*} T$ both as $\sum_{i} ∣ λ_{i} ∣^{2} P_{i}$ . $□$

Proposition (real spectral theorem), proof. Let $T : V \to V$ be a symmetric operator on a finite-dim real inner-product space. Complexify: $V_{C} = V \otimes_{R} C ≅ V \oplus iV$ , with the Hermitian inner product $⟨ v_{1} + i w_{1}, v_{2} + i w_{2} ⟩_{C} = ⟨ v_{1}, v_{2} ⟩ + ⟨ w_{1}, w_{2} ⟩ + i (⟨ w_{1}, v_{2} ⟩ - ⟨ v_{1}, w_{2} ⟩)$ restricting to the real inner product on $V$ . Extend $T$ to a $C$ -linear operator $T_{C} (v + i w) = T v + i T w$ . Verify that $T_{C}$ is Hermitian-self-adjoint: $$ \langle T_\mathbb{C}(v_1 + i w_1), v_2 + i w_2 \rangle_\mathbb{C} = \langle T v_1, v_2 \rangle + \langle T w_1, w_2 \rangle + i(\langle T w_1, v_2 \rangle - \langle T v_1, w_2 \rangle), $$ and on the other side $$ \langle v_1 + i w_1, T_\mathbb{C}(v_2 + i w_2) \rangle_\mathbb{C} = \langle v_1, T v_2 \rangle + \langle w_1, T w_2 \rangle + i(\langle w_1, T v_2 \rangle - \langle v_1, T w_2 \rangle). $$ By $T = T^{⊤}$ in the real inner product, the two sides match. So $T_{C}$ is self-adjoint, hence normal.

Apply the complex spectral theorem: $V_{C}$ admits an orthonormal eigenbasis ${e_{1}, \dots, e_{n}}$ of $T_{C}$ with eigenvalues $λ_{i}$ . Each $λ_{i}$ is real, by the self-adjoint reality argument: $λ_{i} ⟨ e_{i}, e_{i} ⟩_{C} = ⟨ T_{C} e_{i}, e_{i} ⟩_{C} = ⟨ e_{i}, T_{C} e_{i} ⟩_{C} = \overset{ˉ}{λ}_{i} ⟨ e_{i}, e_{i} ⟩_{C}$ , forcing $λ_{i} = \overset{ˉ}{λ}_{i} \in R$ .

Complex conjugation on $V_{C}$ (sending $v + i w \mapsto v - i w$ ) commutes with $T_{C}$ because $T$ is real. For each real $λ$ , the eigenspace $E_{λ} (T_{C})$ is conjugation-invariant, so it has a real basis: pick any $z \in E_{λ} (T_{C})$ , decompose $z = v + i w$ with $v, w \in V$ , then $T_{C} z = T v + i T w = λ (v + i w)$ gives $T v = λ v$ and $T w = λ w$ , so $v, w \in E_{λ} (T) := ker (T - λ I) ∣_{V}$ . Choosing $z, \overset{z}{ˉ}$ as a real-spanning pair and applying Gram-Schmidt in the real eigenspace gives a real orthonormal basis of $E_{λ} (T)$ .

Distinct real eigenspaces of $T$ are orthogonal in $V$ by the same calculation as Step 3 of the complex case (specialised to real $λ_{1} \neq = λ_{2}$ , where the conjugate-linear distinction vanishes). Concatenating the real orthonormal bases of the distinct real eigenspaces produces a real orthonormal eigenbasis of $V$ , and the matrix of $T$ in this basis is real diagonal. $□$

Proposition (principal-axes theorem), proof. Let $S$ be a real symmetric $n \times n$ matrix and $Q (x) = x^{⊤} S x$ the associated quadratic form on $R^{n}$ 01.01.15. By the real spectral theorem there is an orthogonal matrix $P$ and a real diagonal matrix $Λ$ with $S = P Λ P^{⊤}$ . Change variable: set $y = P^{⊤} x$ , so $x = P y$ . Substitute into $Q$ : $$ Q(x) = x^\top S x = (P y)^\top (P \Lambda P^\top)(P y) = y^\top P^\top P \Lambda P^\top P y = y^\top \Lambda y = \sum_{i=1}^n \lambda_i y_i^2, $$ using $P^{⊤} P = I$ . The new basis vectors (columns of $P$ ) are the principal axes of $Q$ , the eigenvalues $λ_{i}$ are the diagonal entries of $Λ$ , and $Q$ is diagonalised as a sum of weighted squares. $□$

Proposition (Schur decomposition), proof. Let $T : V \to V$ be a linear operator on a finite-dim complex inner-product space, $dim V = n$ . Argue by induction on $n$ . The base case $n = 1$ is immediate. For $n > 1$ , $T$ has an eigenvalue $λ_{1} \in C$ with eigenvector $v_{1}$ ; normalise to $u_{1} = v_{1} /∥ v_{1} ∥$ . Let $W = (span {u_{1}})^{⊥} \subseteq V$ , a subspace of dimension $n - 1$ . Note: $W$ is not in general $T$ -invariant (it is $T^{*}$ -invariant, by the same argument as Step 4 of the spectral theorem but with $T$ and $T^{*}$ swapped). However, the composite $π_{W} \circ T ∣_{W} : W \to W$ is a well-defined linear operator on $W$ , where $π_{W}$ is orthogonal projection onto $W$ .

By inductive hypothesis there is an orthonormal basis ${u_{2}, \dots, u_{n}}$ of $W$ in which $π_{W} \circ T ∣_{W}$ is upper triangular. Concatenate to form the orthonormal basis ${u_{1}, u_{2}, \dots, u_{n}}$ of $V$ . The matrix of $T$ in this basis has first column $(λ_{1}, 0, \dots, 0)^{⊤}$ (since $T u_{1} = λ_{1} u_{1}$ ) and the lower-right $(n - 1) \times (n - 1)$ block agrees with the matrix of $π_{W} \circ T ∣_{W}$ , which is upper-triangular by inductive hypothesis. Off-diagonal entries above the diagonal in the first row are the inner products $⟨ T u_{j}, u_{1} ⟩$ for $j > 1$ , which need not vanish — but they sit in the upper triangle, so the resulting matrix is upper triangular. Setting $U$ to be the unitary matrix whose columns are ${u_{1}, \dots, u_{n}}$ gives $T = U R U^{*}$ with $R$ upper triangular. $□$

Proposition (functional calculus is a $*$ -homomorphism), proof. Let $T$ be normal with spectral decomposition $T = \sum_{i} λ_{i} P_{i}$ , and for $f : spec (T) \to C$ define $f (T) = \sum_{i} f (λ_{i}) P_{i}$ . Use $P_{i} P_{j} = δ_{ij} P_{i}$ and $P_{i}^{*} = P_{i}$ :

(i) Linearity: $(α f + β g) (T) = \sum_{i} (α f (λ_{i}) + β g (λ_{i})) P_{i} = α f (T) + β g (T)$ .

(ii) Multiplicativity: $f (T) g (T) = (\sum_{i} f (λ_{i}) P_{i}) (\sum_{j} g (λ_{j}) P_{j}) = \sum_{i, j} f (λ_{i}) g (λ_{j}) P_{i} P_{j} = \sum_{i} f (λ_{i}) g (λ_{i}) P_{i} = (f g) (T)$ .

(iii) Identity: the constant function $1$ gives $1 (T) = \sum_{i} P_{i} = I$ .

(iv) Star: $\overline{f} (T) = \sum_{i} \overline{f (λ_{i})} P_{i} = (\sum_{i} f (λ_{i}) P_{i})^{*} = f (T)^{*}$ , since each $P_{i}$ is self-adjoint.

(v) Identity function: $id (T) = \sum_{i} λ_{i} P_{i} = T$ .

The map $f \mapsto f (T)$ is therefore a $*$ -algebra homomorphism from the algebra of functions on $spec (T)$ (with pointwise operations and complex conjugation) to $End (V)$ . For $f \in C [t]$ a polynomial, $f (T) = \sum_{k} a_{k} T^{k}$ by linearity and multiplicativity, agreeing with the standard polynomial evaluation. For $f = exp$ , $exp (T) = \sum_{i} e^{λ_{i}} P_{i}$ is a finite sum that equals the operator-norm exponential $\sum_{k \geq 0} T^{k} / k!$ by uniform-convergence comparison on the finite spectrum. $□$

Theorem (compact self-adjoint operators), stated without independent proof — see Hilbert 1906 Nachr. Gött. IV ^{[source pending]} and Riesz 1907 C. R. Acad. Sci. Paris 144, 615–619 ^{[source pending]}. The Riesz-Schauder argument adapts the five-step proof of the finite-dimensional case to compact operators: the spectrum is a countable set with $0$ as the only possible accumulation point (Riesz lemma), each non-zero eigenvalue has finite multiplicity (compactness of the unit ball in the eigenspace), and the eigenvectors span the closure of the range (by induction on the largest eigenvalue, using that the orthogonal complement of an eigenspace is invariant and that the restriction is still compact and self-adjoint). The full development is in Reed-Simon Vol. I §VI.

Theorem (unbounded self-adjoint operators, von Neumann's spectral theorem), stated without independent proof — see von Neumann 1929 Math. Ann. 102, 49–131 ^{[source pending]}. The unbounded case requires the Cayley transform $C = (T - i I) (T + i I)^{- 1}$ , which converts an unbounded self-adjoint $T$ into a bounded unitary $C$ , then applies the spectral theorem for unitary operators (a normal-operator special case with spectrum on the unit circle), and inverts the Cayley transform to recover the projection-valued measure of $T$ . The detailed argument is given in Reed-Simon Vol. I §VIII; the finite-dimensional theorem is the model that orients the entire construction.

Connections Master

Eigenvalue, eigenvector, characteristic polynomial 01.01.08. The spectral theorem is the structural refinement of the eigenvalue-eigenvector theory for normal operators on an inner-product space. The eigenvalues $λ_{i}$ of $T$ are the roots of $χ_{T} (t)$ , and normality is the precise condition that promotes "every operator has eigenvectors over $C$ " to "every normal operator has an orthonormal eigenbasis". Without the inner-product structure and the adjoint $T^{*}$ , the orthonormality clause is meaningless; without normality, eigenspaces at distinct eigenvalues need not be orthogonal and the orthogonal-complement-of-eigenspace need not be $T$ -invariant.
Bilinear / quadratic form 01.01.15. The principal-axes theorem is the orthogonal-congruence diagonalisation of a real quadratic form, and it is exactly the real spectral theorem applied to the symmetric matrix representing the form. Sylvester's law of inertia further classifies real quadratic forms up to arbitrary $GL_{n} (R)$ -congruence by the signature $(p, q, z)$ , recovering the eigenvalue-sign information from the spectral decomposition. The bridge between the inner-product-geometric statement (orthogonal congruence) and the bilinear-form classification (Sylvester's law) is the spectral theorem.
Singular value decomposition 01.01.12. For an arbitrary linear map $A : V \to W$ between finite-dim inner-product spaces, the operator $A^{*} A$ on $V$ is self-adjoint and positive semi-definite, so the spectral theorem gives an orthonormal eigenbasis ${v_{i}}$ with non-negative eigenvalues $σ_{i}^{2} \geq 0$ . The numbers $σ_{i}$ are the singular values of $A$ ; setting $u_{i} = A v_{i} / σ_{i}$ for $σ_{i} > 0$ produces the orthonormal basis on the $W$ -side, and $A = \sum_{i} σ_{i} u_{i} v_{i}^{*}$ is the singular value decomposition. The SVD is the spectral theorem on $A^{*} A$ paired with the recipe for transporting the eigenbasis to the codomain.
Jordan canonical form 01.01.11. Over $C$ every linear operator $T$ admits a Jordan canonical form, the strongest similarity-classification statement available without an inner-product structure. The Schur decomposition is the unitary-similarity refinement (every operator is unitarily upper-triangularisable), and the spectral theorem is the further refinement available when $T$ is normal (the upper-triangular part collapses to diagonal). Jordan and spectral are dual viewpoints: Jordan classifies up to $GL (V)$ -conjugacy without inner-product structure, spectral classifies up to $U (V)$ -conjugacy with inner-product structure, and normal operators are exactly the ones whose Jordan form is already diagonal.
Unbounded self-adjoint operators 02.11.03. The infinite-dimensional generalisation of the spectral theorem to unbounded self-adjoint operators on a separable Hilbert space (von Neumann 1929) is the foundation of quantum-mechanical observable theory. The finite-sum spectral decomposition $T = \sum_{i} λ_{i} P_{i}$ generalises to an operator-valued integral $T = \int λ d E (λ)$ against a projection-valued measure $E$ on the Borel $σ$ -algebra of $spec (T) \subseteq R$ , and Stone's theorem identifies the one-parameter unitary group $e^{i tT}$ as the dynamical evolution. The finite-dim theorem proved here is the model that orients the entire functional-analytic theory.

Historical & philosophical context Master

The principal-axes theorem in its real symmetric form is due to Cauchy, who established the orthogonal diagonalisation of real symmetric matrices in his 1829 Exercices de Mathématiques IV (140–160) ^{[source pending]} in the context of celestial mechanics: the secular perturbations of planetary orbits were governed by a real symmetric matrix arising from the gravitational potential, and Cauchy needed the principal axes of the corresponding quadratic form to decouple the perturbation equations. The real eigenvalues recorded the frequencies of the linearised motion; the orthogonal eigenvectors recorded the normal modes. The theorem was first applied as a problem-solving tool in mathematical astronomy before being recognised as a structural result in linear algebra. Hermite extended the result to complex Hermitian matrices in 1855 (Comptes Rendus 41, 181–183, and J. Reine Angew. Math. 50, 273–283) ^{[source pending]}, establishing the reality of eigenvalues for the Hermitian case and the corresponding orthogonal diagonalisation.

The normal-operator framing is due to Toeplitz, whose 1918 paper Das algebraische Analogon zu einem Satze von Fejér (Math. Z. 2, 187–197) ^{[source pending]} identified normality $T T^{*} = T^{*} T$ as the precise condition for unitary diagonalisability in the finite-dimensional complex setting. Schur's 1909 paper Über die charakteristischen Wurzeln einer linearen Substitution (Math. Ann. 66, 488–510) ^{[source pending]} had already established the unitary upper-triangularisation of arbitrary complex matrices, and Toeplitz's contribution was the recognition that the strictly upper-triangular part of the Schur form vanishes exactly when the operator is normal — collapsing the Schur decomposition to a diagonal one and identifying normality as the algebraic signature of the spectral theorem.

The infinite-dimensional generalisations developed in parallel with the finite case. Hilbert's Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen — published in six instalments in the Nachrichten der Gesellschaft der Wissenschaften zu Göttingen between 1904 and 1910 and collected as a monograph in 1912 — established the spectral theorem for symmetric integral operators on $L^{2}$ , the first infinite-dimensional spectral theorem ^{[source pending]}. Riesz's 1907 Comptes Rendus note (144, 615–619) and 1910 Mathematische Annalen paper (69, 449–497) abstracted Hilbert's argument to compact symmetric operators on a separable Hilbert space, founding what is now called Riesz-Schauder theory ^{[source pending]}. The bounded self-adjoint case received its multiplication-operator form through Hahn and Hellinger in the 1910s and 1920s, and the unbounded case was settled by von Neumann's 1929 Allgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren (Math. Ann. 102, 49–131) ^{[source pending]}, where the projection-valued measure formulation $T = \int λ d E (λ)$ first appeared. Von Neumann's 1932 monograph Mathematische Grundlagen der Quantenmechanik (Springer 1932) then identified self-adjoint operators with quantum-mechanical observables and the spectral measure with the measurement postulate, an identification that has structured both functional analysis and the foundations of physics ever since.

Bibliography Master

@article{Cauchy1829Exercices,
  author  = {Cauchy, Augustin-Louis},
  title   = {Sur l'{\'e}quation {\`a} l'aide de laquelle on d{\'e}termine les in{\'e}galit{\'e}s s{\'e}culaires des mouvements des plan{\`e}tes},
  journal = {Exercices de Math{\'e}matiques},
  volume  = {4},
  year    = {1829},
  pages   = {140--160}
}

@article{Hermite1855,
  author  = {Hermite, Charles},
  title   = {Remarques sur un th{\'e}or{\`e}me de M. Cauchy},
  journal = {Comptes Rendus de l'Acad{\'e}mie des Sciences},
  volume  = {41},
  year    = {1855},
  pages   = {181--183}
}

@article{Schur1909,
  author  = {Schur, Issai},
  title   = {{\"U}ber die charakteristischen Wurzeln einer linearen Substitution mit einer Anwendung auf die Theorie der Integralgleichungen},
  journal = {Mathematische Annalen},
  volume  = {66},
  year    = {1909},
  pages   = {488--510}
}

@article{Toeplitz1918,
  author  = {Toeplitz, Otto},
  title   = {Das algebraische Analogon zu einem Satze von Fej{\'e}r},
  journal = {Mathematische Zeitschrift},
  volume  = {2},
  year    = {1918},
  pages   = {187--197}
}

@article{Hilbert1906,
  author  = {Hilbert, David},
  title   = {Grundz{\"u}ge einer allgemeinen Theorie der linearen Integralgleichungen, Vierte Mitteilung},
  journal = {Nachrichten der K. Gesellschaft der Wissenschaften zu G{\"o}ttingen},
  year    = {1906},
  pages   = {157--227}
}

@book{Hilbert1912IntegralEquations,
  author    = {Hilbert, David},
  title     = {Grundz{\"u}ge einer allgemeinen Theorie der linearen Integralgleichungen},
  publisher = {Teubner},
  address   = {Leipzig},
  year      = {1912}
}

@article{Riesz1907,
  author  = {Riesz, Frigyes},
  title   = {Sur les syst{\`e}mes orthogonaux de fonctions},
  journal = {Comptes Rendus de l'Acad{\'e}mie des Sciences},
  volume  = {144},
  year    = {1907},
  pages   = {615--619}
}

@article{Riesz1910,
  author  = {Riesz, Frigyes},
  title   = {Untersuchungen {\"u}ber Systeme integrierbarer Funktionen},
  journal = {Mathematische Annalen},
  volume  = {69},
  year    = {1910},
  pages   = {449--497}
}

@article{vonNeumann1929,
  author  = {von Neumann, John},
  title   = {Allgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren},
  journal = {Mathematische Annalen},
  volume  = {102},
  year    = {1929},
  pages   = {49--131}
}

@book{vonNeumann1932,
  author    = {von Neumann, John},
  title     = {Mathematische Grundlagen der Quantenmechanik},
  publisher = {Springer},
  address   = {Berlin},
  year      = {1932}
}

@book{HalmosFDVS,
  author    = {Halmos, Paul R.},
  title     = {Finite-Dimensional Vector Spaces},
  publisher = {Springer-Verlag},
  series    = {Undergraduate Texts in Mathematics},
  edition   = {2},
  year      = {1974}
}

@book{ShilovLinearAlgebra,
  author    = {Shilov, Georgi E.},
  title     = {Linear Algebra},
  publisher = {Dover},
  year      = {1977},
  note      = {English translation; Russian original 1969}
}

@book{HoffmanKunze,
  author    = {Hoffman, Kenneth and Kunze, Ray},
  title     = {Linear Algebra},
  publisher = {Prentice-Hall},
  edition   = {2},
  year      = {1971}
}

@book{AxlerLADR,
  author    = {Axler, Sheldon},
  title     = {Linear Algebra Done Right},
  publisher = {Springer},
  series    = {Undergraduate Texts in Mathematics},
  edition   = {3},
  year      = {2015}
}

@book{ReedSimonI,
  author    = {Reed, Michael and Simon, Barry},
  title     = {Methods of Modern Mathematical Physics, Vol. I: Functional Analysis},
  publisher = {Academic Press},
  edition   = {Revised and Enlarged},
  year      = {1980}
}

@book{KatoPerturbation,
  author    = {Kato, Tosio},
  title     = {Perturbation Theory for Linear Operators},
  publisher = {Springer-Verlag},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {132},
  edition   = {2},
  year      = {1980}
}

Prerequisites

01.01.08
01.01.15

Tier anchors

beginner: Principal axes of an ellipse — Shilov Ch. 10 §10.1 informal opening; Strang *Introduction to Linear Algebra* Ch. 6 (eigenvalues and the symmetric case)
intermediate: Axler — Linear Algebra Done Right §7.B (complex spectral theorem) + §7.C (real spectral theorem); Apostol Calculus Vol. 2 §5.6; Hoffman-Kunze §8.4
master: Shilov Linear Algebra Ch. 9 §9.3 + Ch. 10 §10.1; Halmos Finite-Dimensional Vector Spaces §73–§80; Hoffman-Kunze Linear Algebra Ch. 8 §8.4–§8.5; Axler Linear Algebra Done Right §7.B–§7.C; Reed-Simon Methods of Modern Mathematical Physics Vol. I §VII for the Hilbert-space extensions; Kato Perturbation Theory for Linear Operators Ch. V

References

images/Shilov-Linear-Algebra__4cbdee00cc.jpg · Shilov Linear Algebra Ch. 9 §9.3 (normal operators) + Ch. 10 §10.1 (spectral / principal-axes theorem) anchor listed in Fast Track archive
Calculus Vol.2 - Multi-Variable Calculus and Linear Algebra with Applications (Tom Apostol).pdf · Ch. 5 §5.5–§5.6, Hermitian and symmetric matrices, orthogonal diagonalisation, the spectral theorem in the symmetric case
Shilov, G. E. — Linear Algebra · Ch. 9 §9.3 normal operators on a finite-dim complex inner-product space; Ch. 10 §10.1 principal-axes theorem and the spectral decomposition
Halmos, P. R. — Finite-Dimensional Vector Spaces (2nd ed.) · §73–§80 inner-product spaces, adjoint operators, self-adjoint and normal operators, the spectral theorem and orthogonal projections
Axler, S. — Linear Algebra Done Right (3rd ed.) · Ch. 7 §7.B complex spectral theorem; §7.C real spectral theorem; §7.A self-adjoint and normal operators
Hoffman, K. & Kunze, R. — Linear Algebra (2nd ed.) · Ch. 8 §8.4 normal operators; §8.5 spectral theorem; §9 the bilinear-form generalisation
Cauchy, A.-L. — Sur l'équation à l'aide de laquelle on détermine les inégalités séculaires des mouvements des planètes · Exercices de Mathématiques, 4 (1829), 140–160 — diagonalisation of real symmetric matrices and the principal-axes theorem for celestial-mechanics secular perturbations
Hermite, C. — Remarques sur un théorème de M. Cauchy · Comptes Rendus de l'Académie des Sciences 41 (1855), 181–183; expanded in J. Reine Angew. Math. 50 (1855), 273–283 — Hermitian matrices, reality of eigenvalues for the complex symmetric case
Schur, I. — Über die charakteristischen Wurzeln einer linearen Substitution · Math. Ann. 66 (1909), 488–510 — unitary upper-triangularisation, the Schur decomposition that powers the modern proof of the spectral theorem
Toeplitz, O. — Das algebraische Analogon zu einem Satze von Fejér · Math. Z. 2 (1918), 187–197 — normal-operator statement of the spectral theorem in the finite-dimensional complex case
Hilbert, D. — Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen, Vierte Mitteilung · Nachrichten der K. Gesellschaft der Wissenschaften zu Göttingen (1906), 157–227, and collected as Hilbert 1912 — spectral theorem for symmetric integral operators on $L^2$, the infinite-dim antecedent
Riesz, F. — Sur les systèmes orthogonaux de fonctions · Comptes Rendus 144 (1907), 615–619, and Riesz 1910, Math. Ann. 69, 449–497 — compact symmetric operators on Hilbert space, spectral decomposition and Riesz-Schauder theory
von Neumann, J. — Allgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren · Math. Ann. 102 (1929), 49–131 — spectral theorem for unbounded self-adjoint operators with the projection-valued measure formulation

Estimated time

beginner: 18m
intermediate: 45m
master: 90m