02.06.03 · analysis / transcendental

Systems of linear ODEs and the matrix exponential

shipped3 tiersLean: none

Anchor (Master): Sylvester 1883 Comptes Rendus 94; Peano 1888 Integration par series; Coddington-Levinson Theory of Ordinary Differential Equations Ch. 3

Intuition [Beginner]

Imagine two water tanks connected by a pipe. Water flows from tank $1$ to tank $2$ at one rate, and from tank $2$ back to tank $1$ at a different rate. The water level in each tank changes over time, but the two levels are coupled: what happens in tank $1$ affects tank $2$ and vice versa.

A system of ODEs captures exactly this situation. Instead of one unknown function $y (t)$ , there are several — say $x_{1} (t)$ and $x_{2} (t)$ — and each derivative depends on both unknowns. Writing the unknowns as a column vector $x$ and the coupling coefficients as a matrix $A$ , the entire system collapses to one compact equation: $x^{'} = A x$ .

The solution is the matrix exponential $e^{A t}$ , which acts like the ordinary exponential $e^{a t}$ but works for matrices. It turns a coupled system into a single formula: $x (t) = e^{A t} x (0)$ . Why does this concept exist? Because most real-world systems have multiple interacting components, and the matrix exponential gives one unified way to solve them all.

Visual [Beginner]

The picture shows two curves in the $(x_{1}, x_{2})$ -plane, starting from different initial points. Both spiral inward toward the origin, because the matrix $A$ has eigenvalues with negative real parts. The spiral shape comes from the complex eigenvalues, and the inward drift comes from the negative real parts.

Each trajectory is the path traced by the tip of the vector $x (t) = e^{A t} x (0)$ as time $t$ increases from $0$ .

Worked example [Beginner]

Solve the system $x_{1}^{'} = - x_{1} + 2 x_{2}$ , $x_{2}^{'} = - 2 x_{1} - x_{2}$ with initial conditions $x_{1} (0) = 1$ , $x_{2} (0) = 0$ .

Step 1. Write as a matrix equation: $x^{'} = A x$ where $A = (- 1 - 2 2 - 1)$ .

Step 2. Find the eigenvalues of $A$ : $det (A - r I) = (r + 1)^{2} + 4 = 0$ , so $r = - 1 \pm 2 i$ .

Step 3. The real parts are $- 1$ (decay), and the imaginary parts are $\pm 2$ (oscillation). The solution is $x_{1} (t) = e^{- t} cos (2 t)$ , $x_{2} (t) = - e^{- t} sin (2 t)$ .

What this tells us: the two water tanks exchange fluid in an oscillating pattern that decays over time. The matrix exponential captures both the oscillation (from the imaginary part of the eigenvalues) and the decay (from the negative real part).

Check your understanding [Beginner]

Formal definition [Intermediate+]

A first-order linear homogeneous system of ODEs with constant coefficients is

$x^{'} (t) = A x (t),$

where $x : R \to R^{n}$ (or $C^{n}$ ) and $A$ is an $n \times n$ constant matrix.

The matrix exponential is defined by the convergent power series

$e^{A t} = k = 0 \sum \infty \frac{( A t ) ^{k}}{k !} = I + A t + \frac{A ^{2} t ^{2}}{2 !} + \frac{A ^{3} t ^{3}}{3 !} + \dots,$

which converges for every $n \times n$ matrix $A$ and every $t \in R$ (because $∥ A t ∥^{k} / k! \leq (∥ A ∥∣ t ∣)^{k} / k!$ and the scalar exponential series converges).

Fundamental matrix. An $n \times n$ matrix function $Φ (t)$ is a fundamental matrix for $x^{'} = A x$ if each column of $Φ$ is a solution and $Φ (t)$ is invertible for all $t$ . The matrix exponential $e^{A t}$ is the unique fundamental matrix satisfying $Φ (0) = I$ .

Counterexamples to common slips

$e^{A + B} \neq = e^{A} e^{B}$ in general. This identity holds only when $A B = B A$ . For non-commuting matrices, the correct formula is the Baker-Campbell-Hausdorff expansion.
$x^{'} = A (t) x$ with variable $A (t)$ does not have solution $e^{\int A (t) d t}$ . The matrix exponential formula applies only to constant-coefficient systems. Variable-coefficient systems require the Magnus expansion or other techniques.
$e^{A t}$ is not computed by exponentiating each entry of $A t$ . The matrix exponential is defined by the power series, not entrywise exponentiation. For diagonal matrices, $e^{A t}$ does equal the diagonal matrix of exponentials, but this fails for general matrices.

Key theorem with proof [Intermediate+]

Theorem (Matrix exponential solution). The unique solution of the initial value problem $x^{'} = A x$ , $x (0) = x_{0}$ is

$x (t) = e^{A t} x_{0} .$

Moreover, $e^{A t}$ satisfies:

$\frac{d}{d t} e^{A t} = A e^{A t} = e^{A t} A$ .
$e^{A \cdot 0} = I$ .
$e^{A (t + s)} = e^{A t} e^{A s}$ for all $t, s \in R$ .

Proof of (1). Differentiate the power series term by term:

$\frac{d}{d t} e^{A t} = \frac{d}{d t} k = 0 \sum \infty \frac{A ^{k} t ^{k}}{k !} = k = 1 \sum \infty \frac{A ^{k} \cdot k t ^{k - 1}}{k !} = k = 1 \sum \infty \frac{A ^{k} t ^{k - 1}}{( k - 1 )!} .$

Re-indexing $j = k - 1$ :

$\frac{d}{d t} e^{A t} = A j = 0 \sum \infty \frac{A ^{j} t ^{j}}{j !} = A e^{A t} .$

Since $A^{k} = A \cdot A^{k - 1} = A^{k - 1} \cdot A$ , the same computation gives $e^{A t} A$ . Term-by-term differentiation is justified because the series converges uniformly on every bounded interval (the derivative series has the same radius of convergence).

Proof of (2). At $t = 0$ : $e^{A \cdot 0} = \sum_{k = 0}^{\infty} A^{k} \cdot 0^{k} / k! = I + 0 + 0 + \dots = I$ .

Proof of (3). Fix $s$ and let $F (t) = e^{A (t + s)}$ . Then $F^{'} (t) = A e^{A (t + s)} = A F (t)$ by property (1), and $F (0) = e^{A s}$ . The function $G (t) = e^{A t} e^{A s}$ satisfies $G^{'} (t) = A e^{A t} e^{A s} = A G (t)$ and $G (0) = e^{A s}$ . By uniqueness of solutions to $y^{'} = A y$ with given initial condition, $F (t) = G (t)$ .

Proof of the solution formula. The function $x (t) = e^{A t} x_{0}$ satisfies $x^{'} (t) = A e^{A t} x_{0} = A x (t)$ by property (1), and $x (0) = e^{A \cdot 0} x_{0} = I x_{0} = x_{0}$ by property (2). Uniqueness follows from the existence-uniqueness theorem for linear ODE systems. $□$

Bridge. The foundational reason the matrix exponential solves the system is that the power series $\sum (A t)^{k} / k!$ differentiates to $A$ times itself, and this is exactly the matrix analogue of the scalar identity $(d / d t) e^{a t} = a e^{a t}$ . The central insight is that $e^{A t}$ plays the role of the scalar exponential $e^{a t}$ , but in $n$ dimensions simultaneously. This result builds toward Lyapunov stability theory where the eigenvalues of $A$ determine whether $e^{A t}$ decays, and appears again in 02.06.02 via the companion-matrix reduction where a scalar nth-order ODE becomes a system whose solution is $e^{A t}$ . Putting these together, the diagonalisation identity $e^{A t} = P e^{D t} P^{- 1}$ (where $D$ is diagonal with the eigenvalues) is the bridge that connects the abstract power series to concrete computations: the eigenvalues of $A$ control the behaviour of every solution.

Exercises [Intermediate+]

Advanced results [Master]

Theorem 1 (Diagonalisation formula). If $A = P D P^{- 1}$ with $D = diag (λ_{1}, \dots, λ_{n})$ , then $e^{A t} = P diag (e^{λ_{1} t}, \dots, e^{λ_{n} t}) P^{- 1}$ . The columns of $P$ are eigenvectors of $A$ , and each eigenvalue $λ_{j} = α_{j} + i β_{j}$ contributes a factor $e^{α_{j} t} (cos β_{j} t + i sin β_{j} t)$ to the solution.

Theorem 2 (Jordan canonical form and the matrix exponential). For any matrix $A$ , there exists an invertible $P$ such that $A = P J P^{- 1}$ where $J$ is the Jordan form: a block-diagonal matrix of Jordan blocks $J_{i} (λ) = λ I + N_{i}$ where $N_{i}$ is nilpotent. Then $e^{A t} = P e^{J t} P^{- 1}$ and each Jordan block contributes $e^{λ t} \sum_{k = 0}^{m - 1} N^{k} t^{k} / k!$ where $m$ is the block size. This handles non-diagonalisable matrices.

Theorem 3 (Lyapunov stability theorem). The equilibrium $x = 0$ of $x^{'} = A x$ is:

Asymptotically stable if and only if every eigenvalue of $A$ has negative real part.
Stable (but not asymptotically) if and only if every eigenvalue has non-positive real part and every eigenvalue with zero real part is semisimple (its algebraic and geometric multiplicities coincide).
Unstable otherwise.

The Lyapunov equation $A^{T} P + P A = - Q$ (for any positive-definite $Q$ ) has a unique positive-definite solution $P$ if and only if $A$ is asymptotically stable.

Theorem 4 (Controllability and the Kalman rank condition). The system $x^{'} = A x + B u$ is controllable (any initial state can be driven to any target state in finite time by a suitable input $u$ ) if and only if the controllability matrix $C = [B ∣ A B ∣ A^{2} B ∣ \dots ∣ A^{n - 1} B]$ has full row rank $n$ . This is the Kalman rank condition.

Theorem 5 (Putzer's algorithm). The matrix exponential can be computed without finding eigenvectors, using only the eigenvalues $λ_{1}, \dots, λ_{n}$ and the powers of $A$ . Define $P_{0} = I$ , $P_{k} = (A - λ_{k} I) P_{k - 1}$ , and solve the cascade of first-order linear ODEs $r_{1}^{'} = λ_{1} r_{1}$ , $r_{k}^{'} = λ_{k} r_{k} + r_{k - 1}$ for $k \geq 2$ , with $r_{1} (0) = 1$ , $r_{k} (0) = 0$ for $k \geq 2$ . Then $e^{A t} = \sum_{k = 1}^{n} r_{k} (t) P_{k - 1}$ .

Theorem 6 (Non-homogeneous systems and variation of constants). The solution of $x^{'} = A x + f (t)$ with $x (t_{0}) = x_{0}$ is

$x (t) = e^{A (t - t_{0})} x_{0} + \int_{t_{0}}^{t} e^{A (t - s)} f (s) d s .$

This is the variation of constants (or Duhamel) formula.

Synthesis. The foundational reason the matrix exponential works is that the power series $\sum (A t)^{k} / k!$ converges for every matrix and differentiates to $A$ times itself, and this is exactly the content that makes $x (t) = e^{A t} x (0)$ solve the system. The central insight is that the eigenvalues of $A$ control the qualitative behaviour: decay, growth, oscillation, or a mix. Putting these together with the Jordan form, even non-diagonalisable matrices are handled by the nilpotent correction $e^{λ t} \sum N^{k} t^{k} / k!$ . The bridge is between the abstract power series and the concrete eigenvalue decomposition $A = P D P^{- 1}$ that reduces $e^{A t}$ to scalar exponentials. This identification appears again in the companion matrix reduction of 02.06.02 where scalar nth-order equations become systems, the pattern generalises to the non-homogeneous Duhamel formula, and builds toward Lyapunov stability and controllability where the eigenvalue locations determine whether the system can be stabilised or controlled.

Full proof set [Master]

Proposition 1 (The power series for $e^{A t}$ converges). For any $n \times n$ matrix $A$ and any $t \in R$ , the series $\sum_{k = 0}^{\infty} (A t)^{k} / k!$ converges.

Proof. Let $∥ \cdot ∥$ be any submultiplicative matrix norm (e.g., the operator norm). Then $∥ (A t)^{k} / k! ∥ \leq (∥ A ∥ \cdot ∣ t ∣)^{k} / k!$ . The series $\sum (∥ A ∥∣ t ∣)^{k} / k! = e^{∥ A ∥∣ t ∣}$ converges. By the Weierstrass M-test, the matrix series converges absolutely, hence converges. $□$

Proposition 2 ( $e^{A + B} = e^{A} e^{B}$ when $A B = B A$ ). If $A B = B A$ , then $e^{A + B} = e^{A} e^{B}$ .

Proof. When $A B = B A$ , the binomial theorem applies to matrices: $(A + B)^{k} = \sum_{j = 0}^{k} (j k) A^{j} B^{k - j}$ . So

$e^{A + B} = k = 0 \sum \infty \frac{1}{k !} j = 0 \sum k (j k) A^{j} B^{k - j} = k = 0 \sum \infty j = 0 \sum k \frac{A ^{j}}{j !} \frac{B ^{k - j}}{( k - j )!} .$

This double sum equals $(\sum_{j} A^{j} / j!) (\sum_{m} B^{m} / m!) = e^{A} e^{B}$ by the Cauchy product formula, since both series converge absolutely. $□$

Connections [Master]

n-th-order linear ODE with constant coefficients 02.06.02. The companion matrix reduction transforms every scalar nth-order ODE into a first-order system $x^{'} = A x$ , and the eigenvalues of the companion matrix are exactly the roots of the characteristic polynomial. The matrix exponential $e^{A t}$ generalises the scalar exponential solutions $e^{r t}$ to the coupled setting, recovering the scalar theory when $n = 1$ .
First-order linear and separable ODEs 02.08.01. The existence-uniqueness theorem for first-order linear ODE systems provides the theoretical foundation for the solution formula $x (t) = e^{A t} x (0)$ . The scalar integrating-factor method from that unit is the $1 \times 1$ case of the matrix exponential formula.
Second-order linear ODE with constant coefficients 02.08.02. A second-order equation $y^{''} + b y^{'} + cy = 0$ becomes a $2 \times 2$ system via the companion matrix $A = (0 - c 1 - b)$ . The eigenvalues of $A$ are the roots of $r^{2} + b r + c = 0$ , and $e^{A t}$ produces the same oscillatory, exponential, or critically-damped solutions as the scalar characteristic-polynomial method.

Historical & philosophical context [Master]

Sylvester 1883, in On the Equation to the Secular Inequalities in the Planetary Theory ^{[Sylvester1883]}, introduced the matrix exponential in the context of solving systems of linear differential equations arising from celestial mechanics. His formulation recognised that the power series $\sum (A t)^{k} / k!$ generalises the scalar exponential to matrices and satisfies the key differential-equation property.

Peano 1888, in Integration par series des equations differentielles lineaires ^[Peano1888], gave the rigorous power-series definition of the matrix exponential and proved the fundamental properties: term-by-term differentiation yields $d / d t e^{A t} = A e^{A t}$ , and the composition $e^{A (t + s)} = e^{A t} e^{A s}$ . Peano's treatment was the first to establish the matrix exponential as the canonical solution operator for linear ODE systems with constant coefficients. The modern synthesis — diagonalisation, Jordan form, stability theory, and the Kalman controllability criterion — was developed through the mid-twentieth century, with the Jordan-form computation of $e^{A t}$ appearing in Gantmacher's Theory of Matrices (1959) and the controllability theory in Kalman's foundational papers of 1960-1963.

Bibliography [Master]

@article{Sylvester1883,
  author = {Sylvester, James Joseph},
  title = {Sur les quantites formant un groupe de nonnes analogues aux quaternions de Hamilton},
  journal = {Comptes Rendus de l'Academie des Sciences, Paris},
  volume = {94},
  year = {1883},
  pages = {1336--1340},
}

@article{Peano1888,
  author = {Peano, Giuseppe},
  title = {Integration par series des equations differentielles lineaires},
  journal = {Mathematische Annalen},
  volume = {32},
  year = {1888},
  pages = {450--456},
}

@book{BoyceDiPrima2012,
  author = {Boyce, William E. and DiPrima, Richard C.},
  title = {Elementary Differential Equations and Boundary Value Problems},
  publisher = {Wiley},
  year = {2012},
  edition = {10th},
}

@book{CoddingtonLevinson1955,
  author = {Coddington, Earl A. and Levinson, Norman},
  title = {Theory of Ordinary Differential Equations},
  publisher = {McGraw-Hill},
  year = {1955},
}

@book{Gantmacher1959,
  author = {Gantmacher, Felix R.},
  title = {The Theory of Matrices},
  publisher = {Chelsea},
  year = {1959},
}

Prerequisites

02.06.02

Tier anchors

beginner: 3Blue1Brown essence of linear algebra + ODE visualisation; coupled-tank analogy
intermediate: Boyce-Diprima Elementary Differential Equations Ch. 7-8; Apostol Calculus Vol. 2 Ch. 6
master: Sylvester 1883 Comptes Rendus 94; Peano 1888 Integration par series; Coddington-Levinson Theory of Ordinary Differential Equations Ch. 3

References

TODO_REF
Sylvester 1883 — On the Equation to the Secular Inequalities in the Planetary Theory · Comptes Rendus 94, originator of the matrix exponential concept
TODO_REF
Peano 1888 — Integration par series des equations differentielles lineaires · Math. Ann. 32, originator of the series definition of the matrix exponential for solving ODE systems
TODO_REF
Boyce and Diprima — Elementary Differential Equations and Boundary Value Problems · Ch. 7-8, systems of first-order equations and the matrix exponential
TODO_REF
Coddington and Levinson — Theory of Ordinary Differential Equations · Ch. 3, linear systems

Reviewer

TBD

Estimated time

beginner: 15m
intermediate: 35m
master: 70m