02.06.03 · analysis / transcendental

Systems of linear ODEs and the matrix exponential

shipped3 tiersLean: none

Anchor (Master): Sylvester 1883 Comptes Rendus 94; Peano 1888 Integration par series; Coddington-Levinson Theory of Ordinary Differential Equations Ch. 3

Intuition [Beginner]

Imagine two water tanks connected by a pipe. Water flows from tank to tank at one rate, and from tank back to tank at a different rate. The water level in each tank changes over time, but the two levels are coupled: what happens in tank affects tank and vice versa.

A system of ODEs captures exactly this situation. Instead of one unknown function , there are several — say and — and each derivative depends on both unknowns. Writing the unknowns as a column vector and the coupling coefficients as a matrix , the entire system collapses to one compact equation: .

The solution is the matrix exponential , which acts like the ordinary exponential but works for matrices. It turns a coupled system into a single formula: . Why does this concept exist? Because most real-world systems have multiple interacting components, and the matrix exponential gives one unified way to solve them all.

Visual [Beginner]

The picture shows two curves in the -plane, starting from different initial points. Both spiral inward toward the origin, because the matrix has eigenvalues with negative real parts. The spiral shape comes from the complex eigenvalues, and the inward drift comes from the negative real parts.

A phase portrait in the x1-x2 plane showing two solution trajectories spiralling inward toward the origin. The spirals come from complex eigenvalues of the matrix A, and the inward drift comes from eigenvalues with negative real parts.

Each trajectory is the path traced by the tip of the vector as time increases from .

Worked example [Beginner]

Solve the system , with initial conditions , .

Step 1. Write as a matrix equation: where .

Step 2. Find the eigenvalues of : , so .

Step 3. The real parts are (decay), and the imaginary parts are (oscillation). The solution is , .

What this tells us: the two water tanks exchange fluid in an oscillating pattern that decays over time. The matrix exponential captures both the oscillation (from the imaginary part of the eigenvalues) and the decay (from the negative real part).

Check your understanding [Beginner]

Formal definition [Intermediate+]

A first-order linear homogeneous system of ODEs with constant coefficients is

where (or ) and is an constant matrix.

The matrix exponential is defined by the convergent power series

which converges for every matrix and every (because and the scalar exponential series converges).

Fundamental matrix. An matrix function is a fundamental matrix for if each column of is a solution and is invertible for all . The matrix exponential is the unique fundamental matrix satisfying .

Counterexamples to common slips

  • in general. This identity holds only when . For non-commuting matrices, the correct formula is the Baker-Campbell-Hausdorff expansion.

  • with variable does not have solution . The matrix exponential formula applies only to constant-coefficient systems. Variable-coefficient systems require the Magnus expansion or other techniques.

  • is not computed by exponentiating each entry of . The matrix exponential is defined by the power series, not entrywise exponentiation. For diagonal matrices, does equal the diagonal matrix of exponentials, but this fails for general matrices.

Key theorem with proof [Intermediate+]

Theorem (Matrix exponential solution). The unique solution of the initial value problem , is

Moreover, satisfies:

  1. .
  2. .
  3. for all .

Proof of (1). Differentiate the power series term by term:

Re-indexing :

Since , the same computation gives . Term-by-term differentiation is justified because the series converges uniformly on every bounded interval (the derivative series has the same radius of convergence).

Proof of (2). At : .

Proof of (3). Fix and let . Then by property (1), and . The function satisfies and . By uniqueness of solutions to with given initial condition, .

Proof of the solution formula. The function satisfies by property (1), and by property (2). Uniqueness follows from the existence-uniqueness theorem for linear ODE systems.

Bridge. The foundational reason the matrix exponential solves the system is that the power series differentiates to times itself, and this is exactly the matrix analogue of the scalar identity . The central insight is that plays the role of the scalar exponential , but in dimensions simultaneously. This result builds toward Lyapunov stability theory where the eigenvalues of determine whether decays, and appears again in 02.06.02 via the companion-matrix reduction where a scalar nth-order ODE becomes a system whose solution is . Putting these together, the diagonalisation identity (where is diagonal with the eigenvalues) is the bridge that connects the abstract power series to concrete computations: the eigenvalues of control the behaviour of every solution.

Exercises [Intermediate+]

Advanced results [Master]

Theorem 1 (Diagonalisation formula). If with , then . The columns of are eigenvectors of , and each eigenvalue contributes a factor to the solution.

Theorem 2 (Jordan canonical form and the matrix exponential). For any matrix , there exists an invertible such that where is the Jordan form: a block-diagonal matrix of Jordan blocks where is nilpotent. Then and each Jordan block contributes where is the block size. This handles non-diagonalisable matrices.

Theorem 3 (Lyapunov stability theorem). The equilibrium of is:

  • Asymptotically stable if and only if every eigenvalue of has negative real part.
  • Stable (but not asymptotically) if and only if every eigenvalue has non-positive real part and every eigenvalue with zero real part is semisimple (its algebraic and geometric multiplicities coincide).
  • Unstable otherwise.

The Lyapunov equation (for any positive-definite ) has a unique positive-definite solution if and only if is asymptotically stable.

Theorem 4 (Controllability and the Kalman rank condition). The system is controllable (any initial state can be driven to any target state in finite time by a suitable input ) if and only if the controllability matrix has full row rank . This is the Kalman rank condition.

Theorem 5 (Putzer's algorithm). The matrix exponential can be computed without finding eigenvectors, using only the eigenvalues and the powers of . Define , , and solve the cascade of first-order linear ODEs , for , with , for . Then .

Theorem 6 (Non-homogeneous systems and variation of constants). The solution of with is

This is the variation of constants (or Duhamel) formula.

Synthesis. The foundational reason the matrix exponential works is that the power series converges for every matrix and differentiates to times itself, and this is exactly the content that makes solve the system. The central insight is that the eigenvalues of control the qualitative behaviour: decay, growth, oscillation, or a mix. Putting these together with the Jordan form, even non-diagonalisable matrices are handled by the nilpotent correction . The bridge is between the abstract power series and the concrete eigenvalue decomposition that reduces to scalar exponentials. This identification appears again in the companion matrix reduction of 02.06.02 where scalar nth-order equations become systems, the pattern generalises to the non-homogeneous Duhamel formula, and builds toward Lyapunov stability and controllability where the eigenvalue locations determine whether the system can be stabilised or controlled.

Full proof set [Master]

Proposition 1 (The power series for converges). For any matrix and any , the series converges.

Proof. Let be any submultiplicative matrix norm (e.g., the operator norm). Then . The series converges. By the Weierstrass M-test, the matrix series converges absolutely, hence converges.

Proposition 2 ( when ). If , then .

Proof. When , the binomial theorem applies to matrices: . So

This double sum equals by the Cauchy product formula, since both series converge absolutely.

Connections [Master]

  • n-th-order linear ODE with constant coefficients 02.06.02. The companion matrix reduction transforms every scalar nth-order ODE into a first-order system , and the eigenvalues of the companion matrix are exactly the roots of the characteristic polynomial. The matrix exponential generalises the scalar exponential solutions to the coupled setting, recovering the scalar theory when .

  • First-order linear and separable ODEs 02.08.01. The existence-uniqueness theorem for first-order linear ODE systems provides the theoretical foundation for the solution formula . The scalar integrating-factor method from that unit is the case of the matrix exponential formula.

  • Second-order linear ODE with constant coefficients 02.08.02. A second-order equation becomes a system via the companion matrix . The eigenvalues of are the roots of , and produces the same oscillatory, exponential, or critically-damped solutions as the scalar characteristic-polynomial method.

Historical & philosophical context [Master]

Sylvester 1883, in On the Equation to the Secular Inequalities in the Planetary Theory [Sylvester1883], introduced the matrix exponential in the context of solving systems of linear differential equations arising from celestial mechanics. His formulation recognised that the power series generalises the scalar exponential to matrices and satisfies the key differential-equation property.

Peano 1888, in Integration par series des equations differentielles lineaires [Peano1888], gave the rigorous power-series definition of the matrix exponential and proved the fundamental properties: term-by-term differentiation yields , and the composition . Peano's treatment was the first to establish the matrix exponential as the canonical solution operator for linear ODE systems with constant coefficients. The modern synthesis — diagonalisation, Jordan form, stability theory, and the Kalman controllability criterion — was developed through the mid-twentieth century, with the Jordan-form computation of appearing in Gantmacher's Theory of Matrices (1959) and the controllability theory in Kalman's foundational papers of 1960-1963.

Bibliography [Master]

@article{Sylvester1883,
  author = {Sylvester, James Joseph},
  title = {Sur les quantites formant un groupe de nonnes analogues aux quaternions de Hamilton},
  journal = {Comptes Rendus de l'Academie des Sciences, Paris},
  volume = {94},
  year = {1883},
  pages = {1336--1340},
}

@article{Peano1888,
  author = {Peano, Giuseppe},
  title = {Integration par series des equations differentielles lineaires},
  journal = {Mathematische Annalen},
  volume = {32},
  year = {1888},
  pages = {450--456},
}

@book{BoyceDiPrima2012,
  author = {Boyce, William E. and DiPrima, Richard C.},
  title = {Elementary Differential Equations and Boundary Value Problems},
  publisher = {Wiley},
  year = {2012},
  edition = {10th},
}

@book{CoddingtonLevinson1955,
  author = {Coddington, Earl A. and Levinson, Norman},
  title = {Theory of Ordinary Differential Equations},
  publisher = {McGraw-Hill},
  year = {1955},
}

@book{Gantmacher1959,
  author = {Gantmacher, Felix R.},
  title = {The Theory of Matrices},
  publisher = {Chelsea},
  year = {1959},
}