Spectral theorem for normal operators on a finite-dim inner-product space (principal-axes theorem)
Anchor (Master): Shilov Linear Algebra Ch. 9 §9.3 + Ch. 10 §10.1; Halmos Finite-Dimensional Vector Spaces §73–§80; Hoffman-Kunze Linear Algebra Ch. 8 §8.4–§8.5; Axler Linear Algebra Done Right §7.B–§7.C; Reed-Simon Methods of Modern Mathematical Physics Vol. I §VII for the Hilbert-space extensions; Kato Perturbation Theory for Linear Operators Ch. V
Intuition Beginner
A matrix tells you how a linear map stretches space. Most maps mix rotation and stretching together, and the rotation hides the simple part. The spectral theorem says that for a certain large class of maps — the normal ones — you can change to a special set of perpendicular axes in which the rotation disappears entirely, and the map becomes pure stretching along each axis.
A symmetric matrix on the plane is the cleanest example. Its level sets are ellipses, and the ellipses always have two perpendicular principal axes 01.01.15. Rotate to those axes and the matrix becomes diagonal, with the eigenvalues sitting on the diagonal as the stretch factors. The principal-axes theorem is the statement that this rotation-to-diagonal works in every dimension, not only on the plane.
The complex version covers more maps. A normal map is one that commutes with its mirror image (the adjoint). The class includes symmetric maps, rotations, and reflections. For every normal map there is an orthonormal basis of eigenvectors, and the map acts on each basis vector by a single scalar. The mixing of rotation and stretching always comes apart along perpendicular eigendirections.
Visual Beginner
The picture shows a real symmetric two-by-two matrix and the rotation that diagonalises it. On the left, a unit circle is mapped to an ellipse by the matrix; the principal axes of the ellipse point along the eigendirections, and the lengths of the semi-axes are the eigenvalues. On the right, an orthonormal basis rotated to match the eigendirections, with the matrix in the new basis written as a diagonal matrix carrying the two eigenvalues.
Two pieces of data are visible. The two eigendirections are perpendicular — that is what the spectral theorem guarantees for a symmetric matrix. The eigenvalues are the stretch factors along those directions, and the diagonal matrix in the rotated basis is the whole map.
Worked example Beginner
Take the two-by-two real symmetric matrix
Step 1. Compute the characteristic polynomial. Expand . The two eigenvalues are and .
Step 2. Find an eigenvector for . Solve , where is the matrix with row entries and . A solution is . Normalise to .
Step 3. Find an eigenvector for . Solve , where has row entries and . A solution is . Normalise to .
Step 4. Check that and are perpendicular. The dot product is . The eigenvectors are orthogonal as predicted by the spectral theorem.
Step 5. Assemble the diagonalisation. Form the orthogonal matrix whose columns are and :
You can verify the right-hand equality by direct matrix multiplication. The original symmetric matrix becomes a diagonal matrix in the rotated basis, with the two eigenvalues on the diagonal.
What this tells us: every real symmetric matrix factors as with orthogonal and diagonal. The columns of are an orthonormal eigenbasis, and the diagonal entries of are the eigenvalues. The factorisation is the principal-axes theorem in a single equation.
Check your understanding Beginner
Formal definition Intermediate+
Let be a finite-dimensional inner-product space over with inner product — conjugate-linear in the second argument over , linear in both arguments over . Given a linear operator , the adjoint is the unique linear operator satisfying for all . Existence and uniqueness follow from the Riesz representation theorem for linear functionals on together with finite-dimensionality. In any orthonormal basis the matrix of is the conjugate transpose of the matrix of ; over this is the ordinary transpose.
The operator is normal iff . Three load-bearing special cases:
- Self-adjoint (or Hermitian over , symmetric over ): .
- Unitary (over ) or orthogonal (over ): , equivalently preserves the inner product.
- Anti-self-adjoint (or skew-Hermitian / skew-symmetric): .
Each of the three classes is automatically normal because the defining identity implies commutation with . The converse fails: the matrix is normal but neither self-adjoint, unitary, nor skew-adjoint.
An orthogonal projection is a linear operator satisfying and . Its range is a subspace , and is the orthogonal projection onto (along its orthogonal complement ). The spectral theorem produces a family of pairwise-orthogonal orthogonal projections summing to the identity.
The spectrum of , written , is the set of eigenvalues, equivalently the roots of the characteristic polynomial . Over the spectrum is non-empty (fundamental theorem of algebra); over the spectrum may be empty (e.g., the rotation matrix on ).
Counterexamples to common slips
- The condition is genuinely weaker than self-adjointness. The diagonal matrix is normal — every diagonal matrix is — but is not self-adjoint, since its eigenvalues are not real.
- The real-spectral-theorem direction "orthogonally diagonalisable implies symmetric" needs care: if with orthogonal and real diagonal, then . The converse direction "symmetric implies orthogonally diagonalisable" is the substantive content of the theorem and uses the complex case as a black box.
- Real matrices satisfying (real-normal) are not in general orthogonally diagonalisable over . The rotation matrix is real-normal (in fact orthogonal) but has no real eigenvalues; the diagonalisation lives over . The real-normal canonical form replaces complex-conjugate eigenvalue pairs by real two-by-two rotation blocks.
Key theorem with proof Intermediate+
Theorem (complex spectral theorem; Shilov Ch. 10 §10.1 [source pending]; Halmos §79 [source pending]). Let be a finite-dimensional complex inner-product space and a linear operator. The following are equivalent:
(i) is normal: $T T^ = T^* T$.*
(ii) has an orthonormal basis of eigenvectors of .
(iii) There is a spectral decomposition $$ T = \sum_{i=1}^k \lambda_i P_i, $$ where are the distinct eigenvalues of and are pairwise-orthogonal orthogonal projections summing to the identity: and .
Proof. The implications are direct computations; the content is , which is what the five-step argument below establishes.
Step 1 (existence of an eigenvector). Over the characteristic polynomial has degree and is non-constant, so by the fundamental theorem of algebra it has a root . The operator is then singular, hence has a non-zero element in its kernel: . So has at least one eigenvector. This step is the only place complex coefficients enter; the real spectral theorem reduces to this step via complexification.
Step 2 (same eigenvector for and , with conjugate eigenvalue). Suppose with and normal. Then , and the same expression equals since . So is itself normal. Compute the norm: $$ |(T - \lambda I) v|^2 = \langle (T - \lambda I) v, (T - \lambda I) v \rangle = \langle v, (T - \lambda I)^* (T - \lambda I) v \rangle. $$ By normality , and the same calculation gives . Setting then forces , i.e., . The eigenvector of at is also an eigenvector of at .
Step 3 (eigenspaces at distinct eigenvalues are orthogonal). Let and with . Using Step 2, . Then $$ \lambda_1 \langle v_1, v_2 \rangle = \langle \lambda_1 v_1, v_2 \rangle = \langle T v_1, v_2 \rangle = \langle v_1, T^* v_2 \rangle = \langle v_1, \bar\lambda_2 v_2 \rangle = \lambda_2 \langle v_1, v_2 \rangle. $$ The conjugate-linearity of the inner product in the second slot turned into on the right-hand side. Subtracting gives , and since we conclude .
Step 4 (orthogonal complement of an eigenspace is invariant). Let be an eigenspace and its orthogonal complement. For and , compute $$ \langle T w, v \rangle = \langle w, T^* v \rangle = \langle w, \bar\lambda v \rangle = \lambda \langle w, v \rangle = 0, $$ where Step 2 supplied , and conjugate-linearity in the second slot turned into when pulled out. So , meaning is -invariant. By the same argument applied to , is also -invariant, so the restriction is again normal as an operator on with its inherited inner product.
Step 5 (induction on dimension). Argue by induction on . The base case is immediate: any unit vector is an orthonormal eigenbasis. For , Step 1 supplies an eigenvalue and eigenspace of positive dimension. The orthogonal complement has , the restriction is normal by Step 4, and by inductive hypothesis admits an orthonormal eigenbasis for . Concatenate an orthonormal basis of (taking each eigenvector to be a unit vector and applying Gram-Schmidt within the eigenspace) with the orthonormal eigenbasis of . The result is an orthonormal eigenbasis of for . This completes the induction and proves (i) (ii).
For (ii) (iii): given the orthonormal eigenbasis, group eigenvectors by eigenvalue. Let be the orthogonal projection onto the -eigenspace . Since the eigenspaces are pairwise orthogonal (Step 3) and span (eigenbasis exhausts ), the projections satisfy and . On any basis vector , since . By linearity .
For (iii) (i): given the spectral decomposition, compute using . Then , where the cross-terms with vanish by . So is normal.
Bridge. The complex spectral theorem builds toward every operator-theoretic structural result on a finite-dimensional inner-product space, and it appears again in 01.01.12 (singular value decomposition), where the spectral theorem applied to produces the singular values and right singular vectors of an arbitrary linear map. The foundational reason the theorem holds is exactly the normality identity : it forces and to share eigenvectors with conjugate eigenvalues (Step 2), and this is what makes the orthogonal complement of an eigenspace -invariant (Step 4) instead of merely -stable in some weaker sense. The central insight is that normality identifies the operator with a sum over its spectrum, with the orthogonal projections resolving the identity. Putting these together, the spectral decomposition generalises to the real symmetric case (real eigenvalues + orthogonal eigenbasis), to the principal-axes theorem for quadratic forms, to the singular value decomposition for arbitrary operators, and is dual to the Schur decomposition that triangularises every operator over . The bridge is that the cap of the finite-dimensional spectral theorem — a finite sum — generalises to an integral against a projection-valued measure in the unbounded self-adjoint case treated by von Neumann.
Exercises Intermediate+
Lean formalization Intermediate+
Mathlib's Mathlib.Analysis.InnerProductSpace.Spectrum provides the finite-dimensional spectral theorem for self-adjoint operators via LinearMap.IsSymmetric.eigenvectorBasis, which constructs an orthonormal basis of eigenvectors and supplies the diagonalisation. The normal-operator generalisation is partially packaged through IsStarNormal together with the spectral decomposition for elements of a star algebra, but no single theorem unifies (i) the normality condition, (ii) the orthonormal eigenbasis, and (iii) the spectral decomposition with the orthogonal-projection-family identities. The intended formalisation reads schematically:
import Mathlib.Analysis.InnerProductSpace.Adjoint
import Mathlib.Analysis.InnerProductSpace.Spectrum
import Mathlib.Analysis.NormedSpace.Star.Basic
variable {𝕜 V : Type*} [RCLike 𝕜]
[NormedAddCommGroup V] [InnerProductSpace 𝕜 V] [FiniteDimensional 𝕜 V]
(T : V →L[𝕜] V)
/-- Complex spectral theorem for normal operators on a finite-dim
inner-product space: a normal operator has an orthonormal basis of
eigenvectors. -/
theorem normal_spectralTheorem (hT : IsStarNormal T) :
∃ (n : ℕ) (b : Fin n → V) (λ : Fin n → 𝕜),
Orthonormal 𝕜 b ∧ (∀ i, T (b i) = λ i • b i) ∧
Submodule.span 𝕜 (Set.range b) = ⊤ :=
sorry -- Step 1 (FTA-driven eigenvector), Steps 2-4 (normality lemmas),
-- Step 5 (induction on dim)
/-- Spectral decomposition: T = ∑ λᵢ • Pᵢ with orthogonal projections
Pᵢ summing to the identity and pairwise-orthogonal. -/
theorem normal_spectralDecomposition (hT : IsStarNormal T) :
∃ (k : ℕ) (λ : Fin k → 𝕜) (P : Fin k → V →L[𝕜] V),
(∀ i, IsOrthogonalProjection (P i)) ∧
(∀ i j, i ≠ j → P i * P j = 0) ∧
(∑ i, P i = 1) ∧
(T = ∑ i, (λ i : 𝕜) • P i) :=
sorry -- repackage normal_spectralTheorem by grouping eigenvectors
/-- Real spectral theorem: a symmetric operator on a finite-dim
real inner-product space is orthogonally diagonalisable. -/
theorem symmetric_spectralTheorem {V : Type*}
[NormedAddCommGroup V] [InnerProductSpace ℝ V] [FiniteDimensional ℝ V]
(T : V →L[ℝ] V) (hT : LinearMap.IsSymmetric T.toLinearMap) :
∃ (n : ℕ) (b : Fin n → V) (λ : Fin n → ℝ),
Orthonormal ℝ b ∧ (∀ i, T (b i) = λ i • b i) :=
sorry -- complexify, apply normal_spectralTheorem, descend to ℝThe proof gap to a clean Mathlib contribution: (i) define IsOrthogonalProjection if absent; (ii) prove the normality-implies-shared-eigenvector lemma T.IsStarNormal → T x = λ • x → T.adjoint x = (starRingEnd 𝕜 λ) • x; (iii) prove the orthogonal-complement-of-eigenspace-is-invariant lemma; (iv) package the induction-on-dimension step that combines these. Each piece is formalisable from existing Mathlib infrastructure, but no single named theorem currently presents the spectral decomposition in the textbook form with the projection-family identities as a single result.
Advanced results Master
Theorem (real spectral theorem; Hoffman-Kunze §8.4 [source pending]). Let be a finite-dimensional real inner-product space and a linear operator. Then is orthogonally diagonalisable iff $T = T^V\mathbb{R}$.*
The proof complexifies to , applies the complex spectral theorem to , uses the self-adjoint reality-of-eigenvalues argument to confine the spectrum to , and descends an orthonormal real eigenbasis by averaging each complex eigenvector with its conjugate. The full descent argument is given in the proof set below.
Theorem (principal-axes theorem for quadratic forms; Shilov Ch. 10 §10.1 [source pending]). Every quadratic form , with real symmetric, is congruent under orthogonal change of basis to a diagonal form , where are the eigenvalues of . The directions of the new basis are the principal axes of .
Specifically, the orthogonal matrix in the diagonalisation is the change-of-basis matrix whose columns are the orthonormal eigenvectors of . Setting rewrites . Sylvester's law of inertia 01.01.15 further refines this: the signature — the number of positive, negative, and zero eigenvalues — is invariant under arbitrary congruence by , not only orthogonal congruence.
Theorem (Schur decomposition; Schur 1909 Math. Ann. 66, 488–510 [source pending]). Every linear operator on a finite-dimensional complex inner-product space admits a unitary basis change taking to upper-triangular form: $T = U R U^URRT$, in any chosen order.*
The Schur form holds for every operator, not only normal ones. The spectral theorem is the special case where normality forces the strictly upper-triangular part of to vanish, so is diagonal. The Schur form is the strongest universal canonical form available in the unitary-similarity category, and it is the input to QR-style numerical eigenvalue algorithms.
Theorem (real-normal canonical form). A real linear operator satisfying is orthogonally similar to a block-diagonal matrix with real one-by-one blocks for real eigenvalues and real two-by-two blocks for complex-conjugate eigenvalue pairs ().
Real-orthogonal matrices (e.g., rotation matrices) are real-normal but in general have no real eigenvalues; the two-by-two rotation blocks encode the complex-conjugate eigenvalue pairs on the unit circle.
Theorem (functional calculus for normal operators). Let be normal with spectral decomposition . For any function , define $$ f(T) = \sum_i f(\lambda_i) P_i. $$ The map is a $\mathrm{spec}(T)\mathrm{End}(V)f(\lambda) = \lambdaT1f gf(T) g(T)\bar ff(T)^f \in \mathbb{C}[t]f(T)f = \expf(T) = \exp(T) = \sum_i e^{\lambda_i} P_if(\lambda) = \sqrt{\lambda}f(T)$ is the unique positive square root.
The finite-dimensional functional calculus is the model for the continuous functional calculus on a C*-algebra (Gelfand-Naimark) and for the Borel functional calculus on a von Neumann algebra; the spectral-measure formulation extends from continuous to Borel functions and from to .
Theorem (compact self-adjoint operators on Hilbert space; Riesz-Schauder, Hilbert 1906 + Riesz 1907 [source pending]). Let be a separable Hilbert space and a compact self-adjoint operator. The spectrum of is a countable subset of with as the only possible accumulation point. Each non-zero eigenvalue has finite-dimensional eigenspace, and admits an orthonormal basis of eigenvectors of . Equivalently, $$ T = \sum_{n=1}^\infty \lambda_n P_n, $$ with the sum converging in the operator norm and .
This is the infinite-dimensional generalisation that motivated the development of functional analysis: Hilbert's 1906 study of symmetric integral kernels on established the spectral theorem for the compact case, and Riesz 1907/1910 abstracted it to compact operators on an arbitrary Hilbert space, founding the Riesz-Schauder theory. The hypothesis "compact" replaces "finite-dimensional"; the conclusion is the same diagonalisation, now indexed by a countable spectrum accumulating only at .
Theorem (bounded self-adjoint operators on Hilbert space; multiplication-operator form, Hahn-Hellinger). Let be a separable Hilbert space and a bounded self-adjoint operator. There exist a -finite measure space , a bounded measurable function , and a unitary isomorphism such that is multiplication by : $$ (U T U^{-1} \psi)(x) = f(x) \psi(x). $$ Equivalently (and more invariantly), there is a projection-valued measure on the Borel -algebra of such that .
The multiplication-operator form replaces the finite sum by integration of the identity function against the spectral measure . Hahn-Hellinger established the multiplicity-theoretic refinement: the unitary equivalence class of is determined by a multiplicity function on .
Theorem (unbounded self-adjoint operators on Hilbert space; von Neumann 1929 Math. Ann. 102, 49–131 [source pending]). Let be an unbounded self-adjoint operator on a separable Hilbert space, with dense domain and $T = T^E\sigma\mathbb{R}$ such that* $$ T = \int_{\mathbb{R}} \lambda , dE(\lambda), \qquad D(T) = \left{ \psi \in H : \int_{\mathbb{R}} \lambda^2 , d|E_\lambda \psi|^2 < \infty \right}. $$
The unbounded case is the technically deepest version of the spectral theorem and the one that underwrites quantum mechanics: observables are unbounded self-adjoint operators on a Hilbert space of state vectors, and the projection-valued measure is the measurement postulate. Stone's theorem identifies the one-parameter unitary group as the dynamical evolution generated by the observable .
Theorem (normal operators on Hilbert space). A bounded normal operator on a separable Hilbert space admits a projection-valued measure on the Borel -algebra of such that . Equivalently, is unitarily equivalent to multiplication by a bounded measurable complex-valued function on a -finite measure space.
The complex spectrum supports the projection-valued measure; the proof passes through the C*-algebra generated by and applies Gelfand-Naimark to identify it with continuous functions on . The unbounded normal case requires an additional commutativity hypothesis between and on domains (Putnam-Fuglede).
Synthesis. The finite-dimensional spectral theorem for normal operators is the foundational reason the inner-product geometry of admits an operator-adapted orthonormal basis: the operator and its adjoint commute exactly when their joint eigenspaces resolve the identity, and this is exactly the content of the orthogonal-eigenbasis statement. The central insight is that normality identifies with a finite sum of orthogonal projections weighted by complex scalars, and the projections form a partition of unity in the inner-product geometry of . Putting these together, one spectral framework produces the real symmetric case (eigenvalues confined to ), the unitary case (eigenvalues on the unit circle), the anti-self-adjoint case (eigenvalues imaginary), the principal-axes theorem for quadratic forms, the singular value decomposition via the spectral theorem applied to , and the functional calculus for arbitrary spectral functions. The bridge is that the finite sum generalises to an operator-valued integral against a projection-valued measure, and this is exactly the form taken by the spectral theorem for unbounded self-adjoint operators on Hilbert space, builds toward 02.11.03 (unbounded self-adjoint operators) where the integral form supplies the dynamical-evolution generator of quantum mechanics, and appears again in 01.01.12 (singular value decomposition) as the diagonalisation of that produces singular values for arbitrary linear maps.
The spectral framework identifies several pairings that look distinct at first inspection. The real-symmetric diagonalisation is exactly the orthogonal-congruence classification of quadratic forms via the principal-axes theorem. The unitary diagonalisation is exactly the Fourier-style decomposition of a unitary operator into eigenspaces on the unit circle, the finite-dimensional model for Stone's theorem on one-parameter unitary groups. The Schur decomposition supplies an upper-triangular form for every complex operator, and normality is the precise additional condition that collapses Schur to diagonal. This pattern recurs at every level of generality: in the Hilbert-space case the spectral theorem for compact self-adjoint operators (Hilbert-Riesz-Schauder) gives the same orthonormal-eigenbasis statement with a countable spectrum, the bounded self-adjoint case replaces the eigenbasis by a multiplication operator on an -space (Hahn-Hellinger), and the unbounded case packages everything in a projection-valued measure (von Neumann). At each level, the finite-dimensional formula is the model, and the upgrade is to a measure-theoretic integral. The foundational identification — operator decomposes against its spectrum — is preserved across the whole hierarchy.
Full proof set Master
Theorem (complex spectral theorem), proof. Given in the Intermediate-tier section: existence of an eigenvalue from the fundamental theorem of algebra (Step 1), shared eigenvector for and with conjugate eigenvalue using normality of (Step 2), orthogonality of eigenspaces at distinct eigenvalues via the adjoint-pairing argument (Step 3), -invariance of the orthogonal complement of an eigenspace (Step 4), and induction on (Step 5). The implication (ii) (iii) groups eigenvectors by eigenvalue and reads off the projection-family decomposition; (iii) (i) computes and both as .
Proposition (real spectral theorem), proof. Let be a symmetric operator on a finite-dim real inner-product space. Complexify: , with the Hermitian inner product restricting to the real inner product on . Extend to a -linear operator . Verify that is Hermitian-self-adjoint: $$ \langle T_\mathbb{C}(v_1 + i w_1), v_2 + i w_2 \rangle_\mathbb{C} = \langle T v_1, v_2 \rangle + \langle T w_1, w_2 \rangle + i(\langle T w_1, v_2 \rangle - \langle T v_1, w_2 \rangle), $$ and on the other side $$ \langle v_1 + i w_1, T_\mathbb{C}(v_2 + i w_2) \rangle_\mathbb{C} = \langle v_1, T v_2 \rangle + \langle w_1, T w_2 \rangle + i(\langle w_1, T v_2 \rangle - \langle v_1, T w_2 \rangle). $$ By in the real inner product, the two sides match. So is self-adjoint, hence normal.
Apply the complex spectral theorem: admits an orthonormal eigenbasis of with eigenvalues . Each is real, by the self-adjoint reality argument: , forcing .
Complex conjugation on (sending ) commutes with because is real. For each real , the eigenspace is conjugation-invariant, so it has a real basis: pick any , decompose with , then gives and , so . Choosing as a real-spanning pair and applying Gram-Schmidt in the real eigenspace gives a real orthonormal basis of .
Distinct real eigenspaces of are orthogonal in by the same calculation as Step 3 of the complex case (specialised to real , where the conjugate-linear distinction vanishes). Concatenating the real orthonormal bases of the distinct real eigenspaces produces a real orthonormal eigenbasis of , and the matrix of in this basis is real diagonal.
Proposition (principal-axes theorem), proof. Let be a real symmetric matrix and the associated quadratic form on 01.01.15. By the real spectral theorem there is an orthogonal matrix and a real diagonal matrix with . Change variable: set , so . Substitute into :
$$
Q(x) = x^\top S x = (P y)^\top (P \Lambda P^\top)(P y) = y^\top P^\top P \Lambda P^\top P y = y^\top \Lambda y = \sum_{i=1}^n \lambda_i y_i^2,
$$
using . The new basis vectors (columns of ) are the principal axes of , the eigenvalues are the diagonal entries of , and is diagonalised as a sum of weighted squares.
Proposition (Schur decomposition), proof. Let be a linear operator on a finite-dim complex inner-product space, . Argue by induction on . The base case is immediate. For , has an eigenvalue with eigenvector ; normalise to . Let , a subspace of dimension . Note: is not in general -invariant (it is -invariant, by the same argument as Step 4 of the spectral theorem but with and swapped). However, the composite is a well-defined linear operator on , where is orthogonal projection onto .
By inductive hypothesis there is an orthonormal basis of in which is upper triangular. Concatenate to form the orthonormal basis of . The matrix of in this basis has first column (since ) and the lower-right block agrees with the matrix of , which is upper-triangular by inductive hypothesis. Off-diagonal entries above the diagonal in the first row are the inner products for , which need not vanish — but they sit in the upper triangle, so the resulting matrix is upper triangular. Setting to be the unitary matrix whose columns are gives with upper triangular.
Proposition (functional calculus is a -homomorphism), proof. Let be normal with spectral decomposition , and for define . Use and :
(i) Linearity: .
(ii) Multiplicativity: .
(iii) Identity: the constant function gives .
(iv) Star: , since each is self-adjoint.
(v) Identity function: .
The map is therefore a -algebra homomorphism from the algebra of functions on (with pointwise operations and complex conjugation) to . For a polynomial, by linearity and multiplicativity, agreeing with the standard polynomial evaluation. For , is a finite sum that equals the operator-norm exponential by uniform-convergence comparison on the finite spectrum.
Theorem (compact self-adjoint operators), stated without independent proof — see Hilbert 1906 Nachr. Gött. IV [source pending] and Riesz 1907 C. R. Acad. Sci. Paris 144, 615–619 [source pending]. The Riesz-Schauder argument adapts the five-step proof of the finite-dimensional case to compact operators: the spectrum is a countable set with as the only possible accumulation point (Riesz lemma), each non-zero eigenvalue has finite multiplicity (compactness of the unit ball in the eigenspace), and the eigenvectors span the closure of the range (by induction on the largest eigenvalue, using that the orthogonal complement of an eigenspace is invariant and that the restriction is still compact and self-adjoint). The full development is in Reed-Simon Vol. I §VI.
Theorem (unbounded self-adjoint operators, von Neumann's spectral theorem), stated without independent proof — see von Neumann 1929 Math. Ann. 102, 49–131 [source pending]. The unbounded case requires the Cayley transform , which converts an unbounded self-adjoint into a bounded unitary , then applies the spectral theorem for unitary operators (a normal-operator special case with spectrum on the unit circle), and inverts the Cayley transform to recover the projection-valued measure of . The detailed argument is given in Reed-Simon Vol. I §VIII; the finite-dimensional theorem is the model that orients the entire construction.
Connections Master
Eigenvalue, eigenvector, characteristic polynomial
01.01.08. The spectral theorem is the structural refinement of the eigenvalue-eigenvector theory for normal operators on an inner-product space. The eigenvalues of are the roots of , and normality is the precise condition that promotes "every operator has eigenvectors over " to "every normal operator has an orthonormal eigenbasis". Without the inner-product structure and the adjoint , the orthonormality clause is meaningless; without normality, eigenspaces at distinct eigenvalues need not be orthogonal and the orthogonal-complement-of-eigenspace need not be -invariant.Bilinear / quadratic form
01.01.15. The principal-axes theorem is the orthogonal-congruence diagonalisation of a real quadratic form, and it is exactly the real spectral theorem applied to the symmetric matrix representing the form. Sylvester's law of inertia further classifies real quadratic forms up to arbitrary -congruence by the signature , recovering the eigenvalue-sign information from the spectral decomposition. The bridge between the inner-product-geometric statement (orthogonal congruence) and the bilinear-form classification (Sylvester's law) is the spectral theorem.Singular value decomposition
01.01.12. For an arbitrary linear map between finite-dim inner-product spaces, the operator on is self-adjoint and positive semi-definite, so the spectral theorem gives an orthonormal eigenbasis with non-negative eigenvalues . The numbers are the singular values of ; setting for produces the orthonormal basis on the -side, and is the singular value decomposition. The SVD is the spectral theorem on paired with the recipe for transporting the eigenbasis to the codomain.Jordan canonical form
01.01.11. Over every linear operator admits a Jordan canonical form, the strongest similarity-classification statement available without an inner-product structure. The Schur decomposition is the unitary-similarity refinement (every operator is unitarily upper-triangularisable), and the spectral theorem is the further refinement available when is normal (the upper-triangular part collapses to diagonal). Jordan and spectral are dual viewpoints: Jordan classifies up to -conjugacy without inner-product structure, spectral classifies up to -conjugacy with inner-product structure, and normal operators are exactly the ones whose Jordan form is already diagonal.Unbounded self-adjoint operators
02.11.03. The infinite-dimensional generalisation of the spectral theorem to unbounded self-adjoint operators on a separable Hilbert space (von Neumann 1929) is the foundation of quantum-mechanical observable theory. The finite-sum spectral decomposition generalises to an operator-valued integral against a projection-valued measure on the Borel -algebra of , and Stone's theorem identifies the one-parameter unitary group as the dynamical evolution. The finite-dim theorem proved here is the model that orients the entire functional-analytic theory.
Historical & philosophical context Master
The principal-axes theorem in its real symmetric form is due to Cauchy, who established the orthogonal diagonalisation of real symmetric matrices in his 1829 Exercices de Mathématiques IV (140–160) [source pending] in the context of celestial mechanics: the secular perturbations of planetary orbits were governed by a real symmetric matrix arising from the gravitational potential, and Cauchy needed the principal axes of the corresponding quadratic form to decouple the perturbation equations. The real eigenvalues recorded the frequencies of the linearised motion; the orthogonal eigenvectors recorded the normal modes. The theorem was first applied as a problem-solving tool in mathematical astronomy before being recognised as a structural result in linear algebra. Hermite extended the result to complex Hermitian matrices in 1855 (Comptes Rendus 41, 181–183, and J. Reine Angew. Math. 50, 273–283) [source pending], establishing the reality of eigenvalues for the Hermitian case and the corresponding orthogonal diagonalisation.
The normal-operator framing is due to Toeplitz, whose 1918 paper Das algebraische Analogon zu einem Satze von Fejér (Math. Z. 2, 187–197) [source pending] identified normality as the precise condition for unitary diagonalisability in the finite-dimensional complex setting. Schur's 1909 paper Über die charakteristischen Wurzeln einer linearen Substitution (Math. Ann. 66, 488–510) [source pending] had already established the unitary upper-triangularisation of arbitrary complex matrices, and Toeplitz's contribution was the recognition that the strictly upper-triangular part of the Schur form vanishes exactly when the operator is normal — collapsing the Schur decomposition to a diagonal one and identifying normality as the algebraic signature of the spectral theorem.
The infinite-dimensional generalisations developed in parallel with the finite case. Hilbert's Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen — published in six instalments in the Nachrichten der Gesellschaft der Wissenschaften zu Göttingen between 1904 and 1910 and collected as a monograph in 1912 — established the spectral theorem for symmetric integral operators on , the first infinite-dimensional spectral theorem [source pending]. Riesz's 1907 Comptes Rendus note (144, 615–619) and 1910 Mathematische Annalen paper (69, 449–497) abstracted Hilbert's argument to compact symmetric operators on a separable Hilbert space, founding what is now called Riesz-Schauder theory [source pending]. The bounded self-adjoint case received its multiplication-operator form through Hahn and Hellinger in the 1910s and 1920s, and the unbounded case was settled by von Neumann's 1929 Allgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren (Math. Ann. 102, 49–131) [source pending], where the projection-valued measure formulation first appeared. Von Neumann's 1932 monograph Mathematische Grundlagen der Quantenmechanik (Springer 1932) then identified self-adjoint operators with quantum-mechanical observables and the spectral measure with the measurement postulate, an identification that has structured both functional analysis and the foundations of physics ever since.
Bibliography Master
@article{Cauchy1829Exercices,
author = {Cauchy, Augustin-Louis},
title = {Sur l'{\'e}quation {\`a} l'aide de laquelle on d{\'e}termine les in{\'e}galit{\'e}s s{\'e}culaires des mouvements des plan{\`e}tes},
journal = {Exercices de Math{\'e}matiques},
volume = {4},
year = {1829},
pages = {140--160}
}
@article{Hermite1855,
author = {Hermite, Charles},
title = {Remarques sur un th{\'e}or{\`e}me de M. Cauchy},
journal = {Comptes Rendus de l'Acad{\'e}mie des Sciences},
volume = {41},
year = {1855},
pages = {181--183}
}
@article{Schur1909,
author = {Schur, Issai},
title = {{\"U}ber die charakteristischen Wurzeln einer linearen Substitution mit einer Anwendung auf die Theorie der Integralgleichungen},
journal = {Mathematische Annalen},
volume = {66},
year = {1909},
pages = {488--510}
}
@article{Toeplitz1918,
author = {Toeplitz, Otto},
title = {Das algebraische Analogon zu einem Satze von Fej{\'e}r},
journal = {Mathematische Zeitschrift},
volume = {2},
year = {1918},
pages = {187--197}
}
@article{Hilbert1906,
author = {Hilbert, David},
title = {Grundz{\"u}ge einer allgemeinen Theorie der linearen Integralgleichungen, Vierte Mitteilung},
journal = {Nachrichten der K. Gesellschaft der Wissenschaften zu G{\"o}ttingen},
year = {1906},
pages = {157--227}
}
@book{Hilbert1912IntegralEquations,
author = {Hilbert, David},
title = {Grundz{\"u}ge einer allgemeinen Theorie der linearen Integralgleichungen},
publisher = {Teubner},
address = {Leipzig},
year = {1912}
}
@article{Riesz1907,
author = {Riesz, Frigyes},
title = {Sur les syst{\`e}mes orthogonaux de fonctions},
journal = {Comptes Rendus de l'Acad{\'e}mie des Sciences},
volume = {144},
year = {1907},
pages = {615--619}
}
@article{Riesz1910,
author = {Riesz, Frigyes},
title = {Untersuchungen {\"u}ber Systeme integrierbarer Funktionen},
journal = {Mathematische Annalen},
volume = {69},
year = {1910},
pages = {449--497}
}
@article{vonNeumann1929,
author = {von Neumann, John},
title = {Allgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren},
journal = {Mathematische Annalen},
volume = {102},
year = {1929},
pages = {49--131}
}
@book{vonNeumann1932,
author = {von Neumann, John},
title = {Mathematische Grundlagen der Quantenmechanik},
publisher = {Springer},
address = {Berlin},
year = {1932}
}
@book{HalmosFDVS,
author = {Halmos, Paul R.},
title = {Finite-Dimensional Vector Spaces},
publisher = {Springer-Verlag},
series = {Undergraduate Texts in Mathematics},
edition = {2},
year = {1974}
}
@book{ShilovLinearAlgebra,
author = {Shilov, Georgi E.},
title = {Linear Algebra},
publisher = {Dover},
year = {1977},
note = {English translation; Russian original 1969}
}
@book{HoffmanKunze,
author = {Hoffman, Kenneth and Kunze, Ray},
title = {Linear Algebra},
publisher = {Prentice-Hall},
edition = {2},
year = {1971}
}
@book{AxlerLADR,
author = {Axler, Sheldon},
title = {Linear Algebra Done Right},
publisher = {Springer},
series = {Undergraduate Texts in Mathematics},
edition = {3},
year = {2015}
}
@book{ReedSimonI,
author = {Reed, Michael and Simon, Barry},
title = {Methods of Modern Mathematical Physics, Vol. I: Functional Analysis},
publisher = {Academic Press},
edition = {Revised and Enlarged},
year = {1980}
}
@book{KatoPerturbation,
author = {Kato, Tosio},
title = {Perturbation Theory for Linear Operators},
publisher = {Springer-Verlag},
series = {Grundlehren der mathematischen Wissenschaften},
volume = {132},
edition = {2},
year = {1980}
}