01.01.16 · foundations / linear-algebra

Invariant subspaces and the primary decomposition

shipped3 tiersLean: none

Anchor (Master): Shilov *Linear Algebra* Ch. 6–7; Hoffman-Kunze *Linear Algebra* Ch. 6–7 (annihilating polynomials, primary decomposition, cyclic decomposition); Hoffman-Kunze §7.5 (the Jordan and rational forms); Lang *Algebra* Ch. XIV (modules over a PID and canonical forms); Jacobson *Basic Algebra I* Ch. 3

Intuition Beginner

A linear map moves the whole space around at once, and that can be hard to picture all together. The trick is to look for pieces of the space that the map keeps to themselves. A direction, a plane, or a larger slab is kept to itself by the map if every vector inside it lands back inside it. Such a piece is called an invariant subspace. Inside one, the map is its own self-contained transformation, and you can study it without watching what happens everywhere else.

Eigenvectors are the smallest example. An eigenvector sits on a line that the map only stretches, so the line is kept to itself. The bigger idea is that you can often slice the whole space into several such pieces that the map never mixes, and then the complicated map becomes a stack of simpler maps, one per piece. The matrix in matching coordinates falls into separate blocks along the diagonal, with empty corners, because no piece feeds into another.

The primary decomposition is the cleanest way to do this slicing. It groups directions by which eigenvalue, or which repeated-root factor, controls them. Each group is a slab the map keeps to itself, and the groups together fill the whole space. You break one hard problem into a handful of easy ones, each tied to a single eigenvalue.

Visual Beginner

The picture shows a space split into two slabs that a linear map never mixes. On the left, a transformation acts on all of three-dimensional space at once, arrows pointing every which way. On the right, the same space is drawn as a horizontal plane stacked with a vertical line through it; the map sends plane-vectors back into the plane and line-vectors back into the line, so the two pieces stay separate forever.

Two facts are visible. Because each slab is kept to itself, the matrix in coordinates adapted to the slabs has its non-zero entries gathered into square blocks on the diagonal, with zeros in the corners. And because the slabs together fill the whole space, studying the map on each slab separately tells you everything about the map.

Worked example Beginner

Take the three-by-three matrix

A = 200040014 .

Step 1. Read off the eigenvalues from the diagonal of this upper-triangular matrix: $2$ and $4$ .

Step 2. Find the slab for the eigenvalue $2$ . Solve $A v = 2 v$ . The vector $(1, 0, 0)$ works, because $A (1, 0, 0) = (2, 0, 0) = 2 (1, 0, 0)$ . So the line through $(1, 0, 0)$ is kept to itself, and on it the map just multiplies by $2$ .

Step 3. Find the slab for the eigenvalue $4$ . Look at the second and third coordinates, where the active block is $(4014)$ . The vectors $(0, 1, 0)$ and $(0, 0, 1)$ span a plane. Check it is kept to itself: $A (0, 1, 0) = (0, 4, 0)$ stays in the plane, and $A (0, 0, 1) = (0, 1, 4)$ stays in the plane. So this plane is a second slab.

Step 4. Confirm the two slabs fill the space and do not overlap. The line and the plane together span all three coordinates, and they share only the zero vector, so every vector splits uniquely into a line-part plus a plane-part.

Step 5. See the block structure. In the basis $(1, 0, 0), (0, 1, 0), (0, 0, 1)$ , the matrix is already block-diagonal: a one-by-one block holding $2$ , and a two-by-two block holding $(4014)$ , with zeros in the corners.

What this tells us: the eigenvalue $2$ owns a one-dimensional slab and the eigenvalue $4$ owns a two-dimensional slab. The map never mixes them, so the single three-dimensional problem became one easy problem on a line plus one small problem on a plane.

Check your understanding Beginner

Formal definition Intermediate+

Let $K$ be a field, $V$ a finite-dimensional $K$ -vector space, and $T : V \to V$ a linear operator in the sense of 01.01.05. The role of the spectral theorem 01.01.13 in this unit is as the orthogonal model case: when $T$ is normal on an inner-product space, the invariant decomposition below is the orthogonal eigenspace decomposition, and the present unit drops both the inner product and the diagonalisability hypothesis.

Definition (invariant subspace). A subspace $W \subseteq V$ is $T$ -invariant if $T (W) \subseteq W$ , that is $T w \in W$ for every $w \in W$ . The zero subspace ${0}$ and the whole space $V$ are $T$ -invariant for every $T$ . The kernel $ker T$ , the image $im T$ , each eigenspace $E_{λ} = ker (T - λ I)$ , and each generalised eigenspace $ker (T - λ I)^{k}$ are $T$ -invariant; for the generalised eigenspace, $T$ commutes with $(T - λ I)^{k}$ , so $T$ carries its kernel into itself.

Induced operators. If $W$ is $T$ -invariant, the restriction $T ∣_{W} : W \to W$ , $w \mapsto T w$ , is a well-defined operator on $W$ . The operator $T$ also descends to the quotient $V / W$ : the induced operator $\overline{T} : V / W \to V / W$ , $v + W \mapsto T v + W$ , is well-defined precisely because $T (W) \subseteq W$ makes the rule independent of the coset representative.

Block-triangular and block-diagonal forms. Choose a basis $w_{1}, \dots, w_{m}$ of $W$ and extend to a basis $w_{1}, \dots, w_{m}, u_{1}, \dots, u_{n - m}$ of $V$ . Because $T (W) \subseteq W$ , the matrix of $T$ in this basis is block upper-triangular,

[T] = ([T ∣_{W}] 0 B [\overline{T}]),

with $[T ∣_{W}]$ the $m \times m$ matrix of the restriction, $[\overline{T}]$ the $(n - m) \times (n - m)$ matrix of the induced operator on the quotient, and $B$ recording how $T$ pushes the complementary directions back toward $W$ . If a complementary invariant subspace $U$ exists — a $T$ -invariant $U$ with $V = W \oplus U$ — then choosing the $u_{j}$ inside $U$ forces $B = 0$ , and the matrix is block-diagonal,

[T] = ([T ∣_{W}] 0 0 [T ∣_{U}]) .

The existence of a complementary invariant subspace is the difference between a block-triangular reduction (always available from a single invariant subspace) and a block-diagonal splitting (which decouples the operator into independent pieces).

Definition (annihilating and minimal polynomial). The polynomials $p \in K [t]$ with $p (T) = 0$ form a non-zero ideal of $K [t]$ ; its monic generator $m_{T} (t)$ is the minimal polynomial of $T$ (developed in 01.01.11). Over $K [t]$ , which is a principal ideal domain, $m_{T}$ factors uniquely as

m_{T} (t) = i = 1 \prod r p_{i} (t)^{e_{i}},

with $p_{1}, \dots, p_{r}$ distinct monic irreducible polynomials and $e_{i} \geq 1$ . The subspaces $ker p_{i} (T)^{e_{i}}$ are the primary components of $V$ under $T$ . When $K$ contains all eigenvalues, each $p_{i} (t) = t - λ_{i}$ and the primary components are the generalised eigenspaces $ker (T - λ_{i} I)^{e_{i}}$ .

Notation: $T ∣_{W}$ is the restriction of $T$ to an invariant subspace $W$ ; $\overline{T}$ or $T_{V / W}$ is the induced operator on $V / W$ ; $V = W \oplus U$ denotes an internal direct sum; $E_{λ} = ker (T - λ I)$ ; $G_{λ} = ker (T - λ I)^{e}$ for the stabilised index $e$ . The symbol $g cd$ denotes the monic greatest common divisor in $K [t]$ , and two polynomials are coprime when their $g cd$ is $1$ .

Counterexamples to common slips

An invariant subspace need not have an invariant complement. On $K^{2}$ the operator $T = (0010)$ keeps the horizontal line $W = span (e_{1})$ to itself, but no second $T$ -invariant line exists, since $E_{0} = ker T = W$ is the only invariant line. So $T$ admits a block-triangular form but no block-diagonal split.
The primary decomposition keys off the minimal polynomial, not the characteristic polynomial. Using $χ_{T}$ in place of $m_{T}$ gives the same subspaces here because $ker p_{i} (T)^{e_{i}} = ker p_{i} (T)^{a_{i}}$ once $e_{i}$ reaches the stabilisation index, but the exponents in $m_{T}$ are the smallest that produce the full primary component.
"Each primary component is an eigenspace" fails whenever $e_{i} > 1$ . The component $ker (T - λ I)^{e_{i}}$ is the generalised eigenspace; it equals the ordinary eigenspace $ker (T - λ I)$ only when $λ$ contributes a square-free factor to $m_{T}$ .

Key theorem with proof Intermediate+

Theorem (primary decomposition; Hoffman-Kunze §6.8 ^{[source pending]}; Shilov Ch. 7 ^{[source pending]}). Let $T : V \to V$ be a linear operator on a finite-dimensional $K$ -vector space with minimal polynomial $m_{T} (t) = \prod_{i = 1}^{r} p_{i} (t)^{e_{i}}$ , the $p_{i}$ distinct monic irreducibles. Set $W_{i} = ker p_{i} (T)^{e_{i}}$ . Then

V = W_{1} \oplus W_{2} \oplus \dots \oplus W_{r},

each $W_{i}$ is $T$ -invariant, the minimal polynomial of $T ∣_{W_{i}}$ is exactly $p_{i} (t)^{e_{i}}$ , and the projection $P_{i} : V \to W_{i}$ along the other summands is a polynomial in $T$ .

Proof. For each $i$ put $f_{i} (t) = \prod_{j \neq = i} p_{j} (t)^{e_{j}} = m_{T} (t) / p_{i} (t)^{e_{i}}$ . The $r$ polynomials $f_{1}, \dots, f_{r}$ have no common irreducible factor: any irreducible dividing all of them would divide $f_{i}$ while being coprime to $p_{i}^{e_{i}}$ , yet $p_{i}^{e_{i}}$ is the only factor of $m_{T}$ absent from $f_{i}$ , so such a common divisor would have to be a unit. Hence $g cd (f_{1}, \dots, f_{r}) = 1$ . Because $K [t]$ is a principal ideal domain, Bézout's identity supplies polynomials $g_{1}, \dots, g_{r} \in K [t]$ with

i = 1 \sum r g_{i} (t) f_{i} (t) = 1.

Evaluate at $T$ and define $P_{i} = g_{i} (T) f_{i} (T)$ . Then $\sum_{i} P_{i} = I$ , and each $P_{i}$ is a polynomial in $T$ , so the $P_{i}$ commute with $T$ and with one another.

The $P_{i}$ are projections onto the $W_{i}$ . For $i \neq = j$ the product $f_{i} (t) f_{j} (t)$ is divisible by $m_{T} (t)$ , because between them $f_{i}$ and $f_{j}$ already carry every factor $p_{k}^{e_{k}}$ of $m_{T}$ to at least its full multiplicity (the factor $p_{i}^{e_{i}}$ appears in $f_{j}$ , the factor $p_{j}^{e_{j}}$ appears in $f_{i}$ , and every other $p_{k}^{e_{k}}$ appears in both). Thus $f_{i} (T) f_{j} (T) = 0$ , whence $P_{i} P_{j} = g_{i} (T) f_{i} (T) g_{j} (T) f_{j} (T) = 0$ for $i \neq = j$ . Multiplying $\sum_{k} P_{k} = I$ by $P_{i}$ gives $P_{i}^{2} = P_{i}$ , so each $P_{i}$ is idempotent, and $V = ⨁_{i} im P_{i}$ is the internal direct sum attached to a complete system of orthogonal idempotents.

It remains to identify $im P_{i}$ with $W_{i} = ker p_{i} (T)^{e_{i}}$ . If $v \in W_{i}$ , then for $j \neq = i$ the polynomial $f_{j}$ is divisible by $p_{i}^{e_{i}}$ , so $f_{j} (T) v = 0$ and hence $P_{j} v = 0$ ; from $\sum_{k} P_{k} v = v$ this gives $P_{i} v = v$ , so $W_{i} \subseteq im P_{i}$ . Conversely, $p_{i} (T)^{e_{i}} P_{i} = p_{i} (T)^{e_{i}} g_{i} (T) f_{i} (T)$ , and $p_{i} (t)^{e_{i}} f_{i} (t) = m_{T} (t)$ , so $p_{i} (T)^{e_{i}} f_{i} (T) = m_{T} (T) = 0$ ; therefore $p_{i} (T)^{e_{i}} P_{i} = 0$ , which puts $im P_{i} \subseteq ker p_{i} (T)^{e_{i}} = W_{i}$ . The two inclusions give $im P_{i} = W_{i}$ , and the direct sum is $V = ⨁_{i} W_{i}$ .

Invariance and the induced minimal polynomial. Each $W_{i}$ is $T$ -invariant because $T$ commutes with $p_{i} (T)^{e_{i}}$ and so preserves its kernel. Let $q_{i} (t)$ be the minimal polynomial of $T ∣_{W_{i}}$ . Since $p_{i} (T)^{e_{i}}$ kills $W_{i}$ , we have $q_{i} ∣ p_{i}^{e_{i}}$ , so $q_{i} = p_{i}^{c_{i}}$ with $c_{i} \leq e_{i}$ . The polynomial $\prod_{i} q_{i}$ annihilates $T$ on every summand, hence on $V$ , so $m_{T} ∣ \prod_{i} q_{i} = \prod_{i} p_{i}^{c_{i}}$ . Comparing with $m_{T} = \prod_{i} p_{i}^{e_{i}}$ forces $c_{i} \geq e_{i}$ , so $c_{i} = e_{i}$ and $q_{i} = p_{i}^{e_{i}}$ . $□$

Bridge. The primary decomposition builds toward the canonical-form theory of 01.01.11: once $V$ splits into the invariant pieces $W_{i}$ on which $T$ acts with minimal polynomial $p_{i}^{e_{i}}$ , classifying $T$ reduces to classifying a single operator whose minimal polynomial is a prime power, which is exactly the local problem the Jordan and rational forms solve. This is exactly the linear-algebra face of the Chinese remainder theorem: the coprime factorisation $m_{T} = \prod_{i} p_{i}^{e_{i}}$ and the Bézout identity $\sum_{i} g_{i} f_{i} = 1$ are the ring-level statement $K [t] / (m_{T}) ≅ \prod_{i} K [t] / (p_{i}^{e_{i}})$ , read off on the $K [t]$ -module $V$ . The foundational reason the projections are polynomials in $T$ is that they are the images of the idempotents of that product ring, so they commute with everything $T$ commutes with and require no choice of basis or inner product. The construction generalises the orthogonal eigenspace splitting of the spectral theorem 01.01.13 by replacing orthogonal projection with algebraic projection, and it appears again in 01.02.05 (the Jordan-Hölder and nilpotent theory) where the same coprime-idempotent mechanism organises a solvable or nilpotent action into its primary blocks. Putting these together, the primary decomposition is the field-agnostic, inner-product-free engine underneath every later canonical form.

Exercises Intermediate+

Exercise 7 (hard, short-answer).

Prove that an operator $T$ on a finite-dimensional space over an algebraically closed field is diagonalisable if and only if its minimal polynomial is square-free, by reading the condition off the primary decomposition.

Hint

Square-free means every $e_{i} = 1$ . On the component $ker (T - λ_{i} I)^{e_{i}}$ , what does $e_{i} = 1$ say about whether the component is an ordinary eigenspace?

Answer

Over an algebraically closed field every $p_{i}$ is linear, $p_{i} = t - λ_{i}$ , so $m_{T} = \prod_{i} (t - λ_{i})^{e_{i}}$ and the primary components are $W_{i} = ker (T - λ_{i} I)^{e_{i}}$ , with $V = ⨁_{i} W_{i}$ .

If $m_{T}$ is square-free, every $e_{i} = 1$ , so $W_{i} = ker (T - λ_{i} I) = E_{λ_{i}}$ is an ordinary eigenspace. Then $V = ⨁_{i} E_{λ_{i}}$ is a direct sum of eigenspaces, and a basis assembled from bases of the $E_{λ_{i}}$ is an eigenbasis, so $T$ is diagonalisable.

Conversely, if $T$ is diagonalisable with distinct eigenvalues $λ_{1}, \dots, λ_{r}$ , then $\prod_{i} (t - λ_{i})$ annihilates every eigenvector hence all of $V$ , so $m_{T} ∣ \prod_{i} (t - λ_{i})$ ; since each $λ_{i}$ is a root of $m_{T}$ , equality holds and $m_{T} = \prod_{i} (t - λ_{i})$ is square-free. Rubric: full credit for both directions tied to the $e_{i} = 1$ reading of the components.

Exercise 8 (hard, symbolic).

Let $T$ have primary decomposition $V = ⨁_{i} W_{i}$ with projections $P_{i}$ that are polynomials in $T$ . Show that a subspace $U$ is $T$ -invariant if and only if $U = ⨁_{i} (U \cap W_{i})$ , i.e. invariant subspaces respect the primary splitting.

Hint

For the forward direction use that each $P_{i} = g_{i} (T) f_{i} (T)$ maps a $T$ -invariant $U$ into itself, so $P_{i} u \in U \cap W_{i}$ for $u \in U$ .

Answer

If $U = ⨁_{i} (U \cap W_{i})$ , then $U$ is $T$ -invariant because each $U \cap W_{i}$ is $T$ -invariant (an intersection of two $T$ -invariant subspaces) and a sum of $T$ -invariant subspaces is $T$ -invariant.

Conversely, suppose $U$ is $T$ -invariant. Each projection $P_{i}$ is a polynomial in $T$ , so $P_{i} (U) \subseteq U$ by Exercise 5; also $im P_{i} = W_{i}$ , so $P_{i} u \in U \cap W_{i}$ for every $u \in U$ . From $\sum_{i} P_{i} = I$ , any $u \in U$ satisfies $u = \sum_{i} P_{i} u$ with each $P_{i} u \in U \cap W_{i}$ , so $U \subseteq \sum_{i} (U \cap W_{i})$ . The reverse inclusion is immediate, and the sum is direct because the $W_{i}$ are independent. Hence $U = ⨁_{i} (U \cap W_{i})$ . Rubric: full credit for the polynomial-projection invariance and the $\sum_{i} P_{i} = I$ reconstruction.

Lean formalization Intermediate+

Mathlib carries invariant submodules, generalised eigenspaces, and the spanning of generalised eigenspaces over an algebraically closed field, but the field-agnostic primary decomposition with polynomial projectors is not packaged as one named theorem. The readable view of the intended statements:

import Mathlib.LinearAlgebra.Eigenspace.Triangularizable
import Mathlib.RingTheory.Coprime.Basic
import Mathlib.FieldTheory.Minpoly.Field

variable {K V : Type*} [Field K] [AddCommGroup V] [Module K V]
  [FiniteDimensional K V] (f : Module.End K V)

/-- A subspace is f-invariant iff it is carried into itself.
Mathlib spelling: `p ≤ p.comap f`. -/
def IsInvariant (p : Submodule K V) : Prop := p ≤ p.comap f

/-- Primary decomposition: if `minpoly K f = ∏ pᵢ ^ eᵢ` with the `pᵢ`
distinct monic irreducibles, then `V = ⨆ i, ker (pᵢ f) ^ eᵢ` as an
internal direct sum, each summand f-invariant, each projection a
polynomial in f. NOT a single named Mathlib theorem. -/
theorem primary_decomposition
    {ι : Type*} [Fintype ι] (p : ι → Polynomial K) (e : ι → ℕ)
    (hirr : ∀ i, Irreducible (p i))
    (hdist : Function.Injective p)
    (hmin : minpoly K f = ∏ i, (p i) ^ (e i)) :
    DirectSum.IsInternal
      (fun i => LinearMap.ker (((p i) ^ (e i)).aeval f)) :=
  sorry  -- Bézout: ∑ gᵢ fᵢ = 1 with fᵢ = m / pᵢ^eᵢ; projectors Pᵢ = gᵢ(f) fᵢ(f)

The proof gap to a clean contribution: (i) form $f_{i} = m_{T} / p_{i}^{e_{i}}$ and obtain the Bézout identity $\sum_{i} g_{i} f_{i} = 1$ from IsCoprime on the pairwise-coprime $p_{i}^{e_{i}}$ ; (ii) define $P_{i} = (g_{i} f_{i}) (T)$ and show $\sum_{i} P_{i} = 1$ , $P_{i} P_{j} = 0$ for $i \neq = j$ , $P_{i}^{2} = P_{i}$ ; (iii) identify $im P_{i}$ with LinearMap.ker ((p i ^ e i).aeval f) to package DirectSum.IsInternal. Each step is reachable from Mathlib's coprimeness and eigenspace machinery, but no current named lemma presents the primary decomposition with its polynomial projectors or derives the Jordan-Chevalley split as a corollary.

Advanced results Master

Theorem (generalised eigenspace decomposition over an algebraically closed field; Shilov Ch. 7 ^{[source pending]}). Let $K$ be algebraically closed and $T$ act on a finite-dimensional $V$ with distinct eigenvalues $λ_{1}, \dots, λ_{r}$ . Writing $m_{i}$ for the multiplicity of $λ_{i}$ in $m_{T}$ ,

V = i = 1 ⨁ r ker (T - λ_{i} I)^{m_{i}},

and $dim ker (T - λ_{i} I)^{m_{i}}$ equals the algebraic multiplicity of $λ_{i}$ in the characteristic polynomial. This is the primary decomposition specialised to linear irreducible factors. The chain $ker (T - λ_{i} I) \subseteq ker (T - λ_{i} I)^{2} \subseteq \dots$ stabilises at index $m_{i}$ — the exponent in the minimal polynomial — and the stable space is the full generalised eigenspace $G_{λ_{i}}$ . On each $G_{λ_{i}}$ the operator $N_{i} = (T - λ_{i} I) ∣_{G_{λ_{i}}}$ is nilpotent, which is the entry point to the Jordan refinement of 01.01.11.

Theorem (Jordan-Chevalley decomposition). Over a perfect field $K$ with $m_{T}$ splitting into linear factors over $K$ , there is a unique pair of operators $S, N$ on $V$ with

T = S + N, S N = N S, S semisimple (diagonalisable), N nilpotent,

and moreover $S$ and $N$ are polynomials in $T$ with zero constant term in the appropriate sense. Concretely, $S = \sum_{i} λ_{i} P_{i}$ acts as the scalar $λ_{i}$ on the generalised eigenspace $G_{λ_{i}}$ , with $P_{i}$ the primary projection, and $N = T - S$ acts as $T - λ_{i} I$ on $G_{λ_{i}}$ , hence nilpotent there. The semisimple part $S$ records the eigenvalue data and the nilpotent part $N$ records the failure of diagonalisability. Because $S = \sum_{i} λ_{i} P_{i}$ is a polynomial in $T$ (each $P_{i}$ is), the splitting is intrinsic: no basis, no inner product, no diagonalising change of coordinates is chosen.

Theorem (cyclic / rational canonical refinement; Frobenius 1878 ^{[source pending]}; Hoffman-Kunze Ch. 7 ^{[source pending]}). Each primary component $W_{i} = ker p_{i} (T)^{e_{i}}$ decomposes further as a direct sum of $T$ -cyclic subspaces $K [t] \cdot v$ , each isomorphic as a $K [t]$ -module to $K [t] / (p_{i}^{d})$ for some $d \leq e_{i}$ , and on a cyclic subspace the matrix of $T$ is the companion matrix of $p_{i}^{d}$ . This is the structure theorem for the finitely generated torsion $K [t]$ -module $V$ over the principal ideal domain $K [t]$ . The invariant-factor form collects the cyclic pieces into a chain $d_{1} (t) ∣ d_{2} (t) ∣ \dots ∣ d_{s} (t)$ with $d_{s} = m_{T}$ and $\prod_{j} d_{j} = χ_{T}$ ; the elementary-divisor form lists the prime powers $p_{i}^{d}$ directly. Over an algebraically closed field the elementary divisors are $(t - λ)^{d}$ and the companion matrices become Jordan blocks, recovering 01.01.11; over a general field the companion matrices stay, giving the rational canonical form valid without enlarging $K$ .

Theorem (module-theoretic reading). The data $(V, T)$ is the same as a finitely generated torsion module over $K [t]$ , with $t$ acting as $T$ ; the primary decomposition is the decomposition of this module into its $p_{i}$ -primary components, and the cyclic refinement is the structure theorem for modules over a PID. The annihilator of the module is $(m_{T})$ , its order ideal is $(χ_{T})$ , and the equivalence $K [t] / (m_{T}) ≅ \prod_{i} K [t] / (p_{i}^{e_{i}})$ is the Chinese remainder isomorphism whose idempotents are the projections $P_{i}$ . This identification turns every canonical-form theorem of linear algebra into a special case of the classification of finitely generated modules over a principal ideal domain.

Synthesis. The single operative fact is that $V$ carries a $K [t]$ -module structure with $t$ acting as $T$ , and the minimal polynomial $m_{T} = \prod_{i} p_{i}^{e_{i}}$ is the annihilator of that module. The coprimeness of the prime-power factors, expressed through the Bézout identity $\sum_{i} g_{i} f_{i} = 1$ , manufactures a complete system of commuting idempotents $P_{i} = (g_{i} f_{i}) (T)$ , and these idempotents are the primary projections; this is the Chinese remainder theorem realised on the module $V$ . From that one construction the entire canonical-form hierarchy unfolds: the primary decomposition $V = ⨁_{i} ker p_{i} (T)^{e_{i}}$ is immediate, the Jordan-Chevalley split $T = S + N$ with $S = \sum_{i} λ_{i} P_{i}$ a polynomial in $T$ is its scalar-versus-nilpotent reading, and the cyclic refinement of each primary component is the PID structure theorem that produces the rational canonical form over any field and the Jordan form over an algebraically closed one. Because the projections are polynomials in $T$ , every step is intrinsic and basis-free, which is why the decomposition survives the loss of an inner product that the spectral theorem 01.01.13 required: orthogonal projection is replaced by algebraic projection, diagonalisability is replaced by a nilpotent remainder, and the result holds over fields where eigenvectors may not even exist. The primary decomposition, the generalised eigenspace splitting, the Jordan-Chevalley decomposition, and the cyclic / rational canonical form are four readings of the single statement that a finitely generated torsion $K [t]$ -module is the direct sum of its primary components.

Full proof set Master

Proposition (uniqueness of the Jordan-Chevalley decomposition). Let $K$ be a field over which $m_{T}$ splits into linear factors, and suppose $T = S + N = S^{'} + N^{'}$ are two decompositions with $S, S^{'}$ semisimple, $N, N^{'}$ nilpotent, $S N = N S$ , and $S^{'} N^{'} = N^{'} S^{'}$ . Then $S = S^{'}$ and $N = N^{'}$ .

Proof. From $T = S + N$ with $S, N$ commuting, both $S$ and $N$ commute with $T$ ; in fact, by the primary decomposition $S = \sum_{i} λ_{i} P_{i}$ is a polynomial in $T$ , so $S$ and $N = T - S$ commute with every operator commuting with $T$ . In particular $S$ and $N$ commute with $S^{'}$ and $N^{'}$ , since $S^{'} = h (T)$ and $N^{'} = T - S^{'}$ are themselves polynomials in $T$ by the same construction applied to the second decomposition.

Consider $S - S^{'} = N^{'} - N$ . The left side is a difference of two commuting semisimple operators, hence semisimple: commuting semisimple (diagonalisable) operators are simultaneously diagonalisable, and a difference of simultaneously diagonalisable operators is diagonalisable. The right side is a difference of two commuting nilpotent operators, hence nilpotent: if $N^{a} = 0$ , $N^{' b} = 0$ , and $N N^{'} = N^{'} N$ , the binomial expansion of $(N^{'} - N)^{a + b}$ has every term containing $N^{a}$ or $N^{' b}$ , so $(N^{'} - N)^{a + b} = 0$ . An operator that is both semisimple and nilpotent has minimal polynomial both square-free and a power of $t$ , so its minimal polynomial is $t$ , forcing the operator to be $0$ . Therefore $S - S^{'} = 0$ and $N - N^{'} = 0$ . $□$

Proposition (the primary projections are the unique idempotents that are polynomials in $T$ and project onto the components). Let $V = ⨁_{i} W_{i}$ be the primary decomposition of $T$ . The projections $P_{i}$ onto $W_{i}$ along $⨁_{j \neq = i} W_{j}$ are the unique operators that are simultaneously (a) polynomials in $T$ , (b) idempotent, (c) satisfy $im P_{i} = W_{i}$ and $ker P_{i} \supseteq W_{j}$ for $j \neq = i$ .

Proof. Existence is the key theorem: $P_{i} = g_{i} (T) f_{i} (T)$ has all three properties. For uniqueness, suppose $Q_{i}$ also satisfies (a)–(c). Both $P_{i}$ and $Q_{i}$ act as the identity on $W_{i}$ and as $0$ on each $W_{j}$ with $j \neq = i$ , since (c) forces this on each summand. Because $V = ⨁_{j} W_{j}$ , an operator is determined by its action on each summand, so $P_{i} = Q_{i}$ as operators. The hypothesis that $Q_{i}$ is a polynomial in $T$ is not even needed for this uniqueness, but it is what makes $P_{i}$ canonical: the projections attached to any $T$ -invariant direct-sum decomposition need not be polynomials in $T$ , whereas the primary projections always are, because they descend from the idempotents of $K [t] / (m_{T}) ≅ \prod_{i} K [t] / (p_{i}^{e_{i}})$ . $□$

Proposition (the induced operator on a quotient has minimal polynomial dividing $m_{T}$ ). Let $W$ be $T$ -invariant and $\overline{T}$ the induced operator on $V / W$ . Then $m_{\overline{T}} ∣ m_{T}$ and $m_{T ∣_{W}} ∣ m_{T}$ , and $m_{T} = lcm (m_{T ∣_{W}}, m_{\overline{T}})$ when $V = W \oplus U$ with $U$ also invariant, taking $\overline{T} ≅ T ∣_{U}$ .

Proof. For any polynomial $p$ , $p (\overline{T}) (v + W) = p (T) v + W$ . Taking $p = m_{T}$ gives $p (T) v = 0$ for all $v$ , so $p (\overline{T}) = 0$ on $V / W$ , whence $m_{\overline{T}} ∣ m_{T}$ . Likewise $m_{T} (T ∣_{W}) = (m_{T} (T)) ∣_{W} = 0$ , so $m_{T ∣_{W}} ∣ m_{T}$ . When $V = W \oplus U$ with both summands invariant, a polynomial $p$ annihilates $T$ iff it annihilates both $T ∣_{W}$ and $T ∣_{U}$ , because $p (T) = p (T ∣_{W}) \oplus p (T ∣_{U})$ on the block-diagonal form; the smallest such monic $p$ is the least common multiple of $m_{T ∣_{W}}$ and $m_{T ∣_{U}}$ , and $\overline{T} ≅ T ∣_{U}$ via the isomorphism $V / W ≅ U$ . Hence $m_{T} = lcm (m_{T ∣_{W}}, m_{T ∣_{U}})$ . $□$

Proposition (Cayley-Hamilton from the primary decomposition). With $χ_{T} = \prod_{i} p_{i}^{a_{i}}$ and $m_{T} = \prod_{i} p_{i}^{e_{i}}$ over an algebraically closed field, $e_{i} \leq a_{i}$ for each $i$ , hence $m_{T} ∣ χ_{T}$ and $χ_{T} (T) = 0$ .

Proof. On the generalised eigenspace $G_{λ_{i}} = ker (T - λ_{i} I)^{e_{i}}$ the restriction $T ∣_{G_{λ_{i}}}$ has minimal polynomial $(t - λ_{i})^{e_{i}}$ by the key theorem, and its characteristic polynomial is $(t - λ_{i})^{d i m G_{λ_{i}}}$ because $λ_{i}$ is its only eigenvalue. The index of nilpotency $e_{i}$ of $(T - λ_{i} I) ∣_{G_{λ_{i}}}$ is at most $dim G_{λ_{i}} = a_{i}$ , since a nilpotent operator on a $d$ -dimensional space satisfies $N^{d} = 0$ . Thus $e_{i} \leq a_{i}$ . Multiplying over $i$ , $m_{T} = \prod_{i} (t - λ_{i})^{e_{i}}$ divides $χ_{T} = \prod_{i} (t - λ_{i})^{a_{i}}$ , and since $m_{T} (T) = 0$ on each component hence on $V$ , also $χ_{T} (T) = 0$ . The primary decomposition thereby recovers the Cayley-Hamilton identity of 01.01.08 as a divisibility consequence rather than an independent computation. $□$

Connections Master

The primary decomposition is the inner-product-free generalisation of the orthogonal eigenspace splitting of the spectral theorem 01.01.13. Where the spectral theorem requires a normal operator and produces orthogonal projections onto eigenspaces, the primary decomposition requires only a minimal polynomial and produces algebraic projections — polynomials in $T$ — onto generalised eigenspaces, dropping both normality and the inner product, and it remains valid over fields with no eigenvectors at all.

The cyclic refinement of each primary component is exactly the input to the Jordan and minimal-polynomial theory of 01.01.11: that unit takes a single operator with prime-power minimal polynomial $(t - λ)^{e}$ and builds its Jordan chains, which is the per-component problem the primary decomposition isolates. The rational canonical form named there over a general field is the companion-matrix assembly of the same cyclic pieces.

The coprime-idempotent mechanism reappears in the group setting of 01.02.05 (solvable, nilpotent, and Jordan-Hölder structure): the primary decomposition of a single operator and the decomposition of a nilpotent or solvable linear action into characteristic blocks are two instances of the same use of commuting idempotents and stable filtrations to break a hard action into local pieces.

Historical & philosophical context Master

The classification of a single linear substitution by its canonical block form is due to Camille Jordan, whose 1870 Traité des substitutions et des équations algébriques gave the canonical form of a linear substitution first over finite fields, in the service of Galois theory, and then over the complex numbers ^{[Jordan 1870]}. The decomposition that underlies the block form — the splitting of the space according to the prime-power factors of an annihilating polynomial — was placed on its modern algebraic footing by Frobenius, whose 1878 memoir on linear substitutions and bilinear forms introduced the minimal polynomial, the invariant factors, and the rational canonical form, valid over an arbitrary field and not requiring the eigenvalues to lie in it ^{[Frobenius 1878]}. The recognition that all of this is the structure theory of finitely generated modules over the principal ideal domain $K [t]$ , with the primary decomposition as the coprime-primary splitting and the canonical forms as the cyclic decomposition, is the ideal-theoretic viewpoint codified by Krull and the abstract-algebra tradition of the 1920s and 1930s ^{[Krull 1935]}. Shilov's Linear Algebra presents the invariant-subspace and canonical-form development in the concrete operator language used here, while Hoffman and Kunze give the primary decomposition theorem in the coprime-annihilator form whose proof this unit follows.

Bibliography Master

@book{Jordan1870,
  author    = {Jordan, Camille},
  title     = {Trait{\'e} des substitutions et des {\'e}quations alg{\'e}briques},
  publisher = {Gauthier-Villars},
  address   = {Paris},
  year      = {1870}
}

@article{Frobenius1878,
  author  = {Frobenius, Ferdinand Georg},
  title   = {{\"U}ber lineare Substitutionen und bilineare Formen},
  journal = {Journal f{\"u}r die reine und angewandte Mathematik},
  volume  = {84},
  year    = {1878},
  pages   = {1--63}
}

@book{Krull1935,
  author    = {Krull, Wolfgang},
  title     = {Idealtheorie},
  series    = {Ergebnisse der Mathematik und ihrer Grenzgebiete},
  publisher = {Springer},
  address   = {Berlin},
  year      = {1935}
}

@book{HoffmanKunze1971,
  author    = {Hoffman, Kenneth and Kunze, Ray},
  title     = {Linear Algebra},
  edition   = {2nd},
  publisher = {Prentice-Hall},
  address   = {Englewood Cliffs, NJ},
  year      = {1971}
}

@book{Shilov1977,
  author    = {Shilov, Georgi E.},
  title     = {Linear Algebra},
  publisher = {Dover Publications},
  address   = {New York},
  year      = {1977},
  note      = {Translation of the 1971 Russian edition, transl. R. A. Silverman}
}

@book{Jacobson1985,
  author    = {Jacobson, Nathan},
  title     = {Basic Algebra I},
  edition   = {2nd},
  publisher = {W. H. Freeman},
  address   = {New York},
  year      = {1985}
}

@book{Lang2002,
  author    = {Lang, Serge},
  title     = {Algebra},
  edition   = {3rd, revised},
  series    = {Graduate Texts in Mathematics},
  volume    = {211},
  publisher = {Springer},
  year      = {2002}
}

Prerequisites

01.01.05
01.01.08
01.01.13

Tier anchors

beginner: Breaking a transformation into independent directions it never mixes — Shilov *Linear Algebra* Ch. 6; 3Blue1Brown *Essence of Linear Algebra* Ch. 14 (the special directions an operator preserves)
intermediate: Shilov *Linear Algebra* Ch. 6 (invariant subspaces) and Ch. 7 (the canonical form); Hoffman-Kunze *Linear Algebra* §6.7–§6.8 (invariant direct sums and the primary decomposition theorem)
master: Shilov *Linear Algebra* Ch. 6–7; Hoffman-Kunze *Linear Algebra* Ch. 6–7 (annihilating polynomials, primary decomposition, cyclic decomposition); Hoffman-Kunze §7.5 (the Jordan and rational forms); Lang *Algebra* Ch. XIV (modules over a PID and canonical forms); Jacobson *Basic Algebra I* Ch. 3

References

images/Shilov-Linear-Algebra__4cbdee00cc.jpg · Shilov *Linear Algebra* — Fast Track archive cover; Ch. 6 invariant subspaces and the induced operator, Ch. 7 the canonical form of a linear operator and the decomposition into cyclic / primary pieces
Shilov, G. E. — Linear Algebra (Dover, 1977 transl. of the 1971 Russian ed.) · Ch. 6 §§6.1–6.4 (invariant subspaces, the matrix of an operator in an adapted basis, the induced operator on a quotient), Ch. 7 (the canonical form of a linear operator, invariant and cyclic subspaces)
Hoffman, K. & Kunze, R. — Linear Algebra (2nd ed., Prentice-Hall, 1971) · §6.4 invariant subspaces and the conductor / stuffer polynomial; §6.7 direct-sum decompositions and invariant direct sums; §6.8 the primary decomposition theorem via coprime annihilators; Ch. 7 the rational and Jordan forms
Jordan, C. — Traité des substitutions et des équations algébriques · Gauthier-Villars, Paris, 1870 — the canonical form of a linear substitution over a finite field and over $\mathbb{C}$, the historical origin of the block decomposition refining the primary decomposition
Frobenius, F. G. — Über lineare Substitutionen und bilineare Formen · Journal für die reine und angewandte Mathematik 84 (1878), 1–63 — the minimal polynomial of a linear substitution, invariant factors, and the rational canonical form refining the primary decomposition over an arbitrary field
Krull, W. — Idealtheorie · Ergebnisse der Mathematik, Springer, 1935 — the ideal-theoretic / Chinese-remainder viewpoint on coprime-factor decompositions of a module over a principal ideal domain, the abstract frame in which the primary decomposition sits

Estimated time

beginner: 18m
intermediate: 45m
master: 90m