43.10.03 · numerical-analysis / numerical-odes

Zero-stability, the root condition, and the Dahlquist equivalence theorem

shipped3 tiersLean: none

Anchor (Master): Hairer-Nørsett-Wanner 1993 *Solving Ordinary Differential Equations I: Nonstiff Problems* 2e (Springer) §III.3-III.4 (the root/Wurzelbedingung condition, the stability theorem, and the first Dahlquist barrier with its order-star/order-bound proof); Henrici 1962 *Discrete Variable Methods in Ordinary Differential Equations* (Wiley) Ch. 5 §5.2-5.3 (the companion-matrix stability analysis and the equivalence theorem); Dahlquist 1956 *Convergence and stability in the numerical integration of ordinary differential equations* (Mathematica Scandinavica 4) (the original equivalence proof)

Intuition Beginner

A multistep method blends several past values to predict the next one. That blend has a hidden personality: even before you ask it to solve a real problem, it has opinions about how a sequence of numbers should grow or shrink on its own. Zero-stability is the question of whether those opinions are calm or violent. A calm method lets small mistakes stay small as you march forward; a violent one lets a tiny early slip double, then double again, until the computed curve flies off the screen.

Picture the most boring problem there is: the slope is always zero, so the true answer is a flat line. A good method handed a flat line should keep returning that same flat value forever. The blend, though, also has spare ways to behave that have nothing to do with the real answer. These extra behaviors are called parasitic modes: ghost solutions the recurrence can support on its own. If a ghost mode stays bounded, it is harmless background noise. If a ghost mode grows, it eventually swamps the real answer no matter how accurate the method looked on paper.

The size of a ghost mode is decided by a short list of numbers called the roots of the method's first characteristic polynomial. Each root is a growth factor: a root of size less than one fades away, a root of size exactly one holds steady, and a root of size bigger than one explodes. The healthy situation is the root condition: every growth factor has size at most one, and any factor sitting right at size one is not repeated. One root, the principal root, always equals one and carries the real answer; the rest are ghosts that must be kept on a leash.

Here is the surprising part, and the heart of the unit. For a one-step method, being accurate was almost enough to be trustworthy. For a multistep method, accuracy and trustworthiness split apart. A method can match the true curve beautifully over a single step and still diverge, because a ghost root crept outside the safe zone. The Dahlquist equivalence theorem is the clean verdict: a method that is both accurate in the small (consistent) and calm in its ghosts (zero-stable) is exactly a method whose computed answer actually converges to the truth. Neither half alone will do.

Visual Beginner

Picture two number lines stacked. The top line tracks the real answer of a flat problem: it should sit unchanged at the same height step after step. The bottom line tracks a ghost mode the method secretly carries. Whether the bottom line stays flat, holds level, or rockets upward is decided entirely by the size of that ghost's growth factor.

The table below sorts the possible growth factors by size and says what each one does to a small starting error. Reading down, the only safe rows are the ones where every factor has size at most one and no size-one factor is repeated. The last column names the verdict on the whole method.

growth factor size what a small error does safe?
less than one fades toward zero yes
exactly one, not repeated holds at constant size yes
exactly one, repeated grows step by step in a ramp no
greater than one doubles and explodes no

Each row is a fate for a ghost mode, and the whole method is trustworthy only when every one of its ghost modes lands in a safe row.

Worked example Beginner

Let us watch a healthy method and a sick method behave on the flattest problem possible. The slope rule is "the slope is always zero," so the true answer starts at height and stays at forever. We feed each method two starting values that are exact, and , and let it march.

The healthy method is the two-step Adams-Bashforth blend, which on a zero slope reduces to . Step by step: , then , and so on. Every value stays exactly . Its growth factors are and , both inside or on the unit circle with the boundary one not repeated, so it is calm.

The sick method is a made-up two-step blend , which on a zero slope is purely this recurrence. Its growth factors turn out to be and . The factor is the troublemaker. Watch what one tiny error does.

Step 1. Suppose rounding nudges the second start to instead of , an error of just one thousandth. The first start is still .

Step 2. March the sick recurrence: . The error has grown from to .

Step 3. Keep going: ; then . The errors run , each roughly double the last.

Step 4. See the pattern. The error is being multiplied by about every step. After thirty steps the original thousandth-of-a-unit slip has grown past a million times its size, burying the true answer of .

What this tells us: both methods can look reasonable, but the sick one carries a ghost factor of size , and that single bad number turns a rounding whisper into an avalanche. The root condition is the rule that forbids exactly this.

Check your understanding Beginner

Formal definition Intermediate+

Consider the initial-value problem (IVP) , , with continuous and globally Lipschitz in , so the unique solution exists by the Picard-Lindelöf theorem 02.12.01. Fix a step , set , , and let an -step linear multistep method (LMM) 43.10.02 $$ \sum_{j=0}^{r} \alpha_j, u^{n+j} = h\sum_{j=0}^{r} \beta_j, f^{n+j}, \qquad \alpha_r = 1, $$ have first and second characteristic polynomials and . The method requires starting values , supplied by a one-step method 43.10.01 with as .

Definition (root condition). The method satisfies the root condition if every root of obeys , and every root on the unit circle is simple (multiplicity one). Because consistency forces 43.10.02, is always a root; it is the principal root, and the remaining roots are the spurious or parasitic roots. The root condition therefore demands that be simple and that no other root reach or exceed modulus one.

Definition (zero-stability). The method is zero-stable if there is a constant , independent of , such that for any two numerical solutions and generated by the same method with the same right-hand side but possibly different starting values and per-step perturbations , $$ \max_{0 \le n \le N}|u^n - \tilde u^n| ;\le; S\left(\max_{0\le k \le r-1}|u^k - \tilde u^k| + \sum_{n} |\delta^n|\right) $$ uniformly as with . Equivalently, the discrete solution operator that advances the perturbed homogeneous recurrence is bounded uniformly in and in the step count. Zero-stability is stability of the discretisation at : it concerns only the homogeneous recurrence governed by , not the right-hand side carried by .

Definition (convergence). The method is convergent if, whenever the starting values satisfy for as , the global error satisfies $$ \max_{0 \le n \le N}|u^n - y(t_n)| ;\longrightarrow; 0 \qquad (h \to 0) $$ for every IVP with globally Lipschitz . It is convergent of order if the global error is whenever the starting errors are .

The companion-matrix representation makes the link to eigenvalues 01.01.08 explicit. Stacking , the homogeneous recurrence becomes with the companion matrix of , whose eigenvalues are exactly the roots of . The roots , the companion matrix , the principal/spurious-root terminology, the discrete solution operator, and the Landau symbol are recorded in _meta/NOTATION.md.

Counterexamples to common slips Intermediate+

  • "Consistency implies convergence, as for one-step methods." For it does not. The two-step method is consistent of order , yet has the spurious root of modulus , so it violates the root condition and diverges. Consistency controls accuracy at ; zero-stability controls the other roots. Both are needed, which is the content of the equivalence theorem below.

  • "Any root inside the unit disc is fine, so a root at of multiplicity two is fine." A boundary root must be simple. A double root at makes the homogeneous recurrence support the growing solution , an unbounded parasitic mode, so the root condition fails even though no root leaves the closed disc.

  • "Zero-stability and absolute stability are the same." Zero-stability is the property of alone, asking only that parasitic modes not grow as the grid refines. Absolute stability 43.10.04 is a fixed- property of the combined polynomial on the test equation , asking that all roots stay in the disc for a given . A method can be zero-stable yet have a small absolute-stability region.

Key theorem with proof Intermediate+

The signature result is the Dahlquist equivalence theorem: for a consistent linear multistep method, convergence is the same condition as zero-stability, and zero-stability is the same condition as the root condition on . This collapses an analytic question about the limit of computed trajectories into a finite algebraic check on the roots of a single polynomial, and it is the multistep analogue of the one-step convergence theorem 43.10.01 in which the Lipschitz/Gronwall stability bound is replaced by power-boundedness of the companion matrix of .

Theorem (Dahlquist equivalence). Let an -step LMM with polynomials be consistent (so and ). Then the following are equivalent:

  1. the method is convergent;
  2. the method is zero-stable;
  3. satisfies the root condition.

Moreover, when these hold and the method has consistency order with starting errors , the global error is : the order of convergence equals the order of consistency. [LeVeque §6.3-6.4; Süli-Mayers §12.9-12.10]

Proof. We prove and then , which closes the cycle.

: the root condition gives zero-stability. Pass to the companion form. The homogeneous recurrence is with and the companion matrix of , so . The eigenvalues of are the roots of 01.01.08. A power is bounded uniformly in exactly when every eigenvalue has modulus at most one and every eigenvalue of modulus one is non-defective, that is, lies in a Jordan block of size one. Simplicity of the boundary roots is precisely non-defectiveness there, while roots strictly inside the disc contribute powers . Hence the root condition gives a bound for all , with independent of .

Now compare two perturbed solutions. Writing and stacking , the inhomogeneous recurrence with the Lipschitz right-hand side and per-step perturbations reads , where collects the -weighted -differences (with depending only on the coefficients) and carries the . By discrete variation of constants, , so $$ |E^n| \le M|E^0| + M\sum_{m=0}^{n-1}\big(h\Gamma L \max_{k}|e^k| + |D^m|\big). $$ The factor on the sum over terms keeps that contribution times with a coefficient ; absorbing it by a discrete Gronwall argument 43.10.01 gives with , independent of . This is zero-stability.

: zero-stability plus consistency gives convergence. Treat the exact solution as a perturbed numerical solution. Substituting into the scheme leaves the residual with the local truncation error 43.10.02, so satisfies the same recurrence as up to per-step perturbations . The starting errors are by hypothesis. Apply the zero-stability estimate with : $$ \max_n|u^n - y(t_n)| \le S\Big(\max_{k\le r-1}|u^k - y(t_k)| + \sum_n h|\tau^n|\Big) \le S\big(O(h^p) + (T-t_0),O(h^p)\big) = O(h^p), $$ the sum having terms each . In particular the error tends to zero, so the method converges, with order equal to the consistency order .

: convergence forces the root condition. Argue by contraposition: a method violating the root condition cannot converge. Suppose has a root with , or a repeated root on the unit circle. Apply the method to the IVP , , whose exact solution is identically zero, with starting values (when ) or (for a repeated boundary root). These starting values tend to zero with , so a convergent method would force . But on the recurrence is the homogeneous , whose solution with these data is (respectively ). At a fixed time with , (respectively ), which grows without bound as because faster than . The computed solution diverges while the true solution is zero, contradicting convergence. Hence convergence forces the root condition.

The estimate manufactured in is a discrete variation-of-constants formula: the companion matrix plays the role the scalar amplification factor played for one-step methods 43.10.01, and power-boundedness is the multistep replacement for the single-step Lipschitz contraction. The whole content of zero-stability is that this exists uniformly in .

Bridge. The equivalence theorem is the foundational reason a multistep method's reliability is read off the roots of and nothing else: consistency pins the principal root at with the correct first derivative, while zero-stability is the demand that the remaining companion-matrix spectrum stay power-bounded, and this is exactly the point at which the one-step story of 43.10.01 generalises — there the scalar factor was automatically power-bounded, here the companion matrix of is power-bounded only under the root condition. This builds toward the absolute-stability theory of 43.10.04, where the same companion-matrix idea is applied to the combined polynomial at fixed ; putting these together, zero-stability is the edge of absolute stability, and the central insight is that "consistency plus stability equals convergence" is one template realised twice, with stability meaning a Gronwall bound for one-step methods and power-boundedness of for multistep methods. The theorem appears again in 43.11.05 as the Lax-Richtmyer equivalence for finite-difference PDE schemes, where the companion matrix is replaced by the evolution operator and power-boundedness becomes uniform ; the bridge is the recognition that all three theorems are the statement that a consistent scheme converges precisely when its discrete solution operator does not amplify.

Exercises Intermediate+

Advanced results Master

The equivalence theorem is the convergence backbone; its sharp consequences are the barriers that bound how much order a usable method can carry, the refined stability notions that separate methods sharing the root condition, and the analytic machinery that proves the order bound.

Theorem 1 (first Dahlquist barrier). A zero-stable -step linear multistep method has order if is even and if is odd. The maximal order (even ) is attained only by methods all of whose -roots lie on the unit circle — the optimal methods — which have poor damping of the parasitic modes and are rarely used. The proof studies the analytic function , whose vanishing to order at is the order condition 43.10.02; the root condition constrains the zeros of to the closed disc with simple boundary zeros, and a contour/argument-principle count of how many such zeros can carry while vanishes to high order yields the bound. The order-star reformulation of Wanner-Hairer-Nørsett gives the cleanest modern proof [Hairer-Nørsett-Wanner §III.3-III.4].

Theorem 2 (strong stability, relative stability, and weak instability). Among zero-stable methods, the location of the spurious roots on the circle refines the classification. A method is strongly stable if is the only root on the unit circle, so all parasitic modes decay — the Adams and BDF families are strongly stable. It is weakly stable if other roots lie on the circle: the leapfrog/midpoint method, with spurious root , is zero-stable but its parasitic mode neither grows nor decays, producing the oscillatory error contamination that makes leapfrog unsuitable for dissipative problems without filtering. Relative stability compares the growth of spurious roots to the principal root on the test equation and is the bridge from this theory to the fixed- absolute stability of 43.10.04.

Theorem 3 (convergence order under root condition, quantitative). If a consistent zero-stable -step method has order , starting errors , and is applied to an IVP with and globally Lipschitz, then $$ \max_{0\le n\le N}|u^n - y(t_n)| \le S\Big(c_0 h^p + (T - t_0),\overline{C},h^p\Big) = O(h^p), $$ with the zero-stability constant (the power bound inflated by a Gronwall factor) and a bound on the principal-error-constant times the -st derivative of . The estimate is the multistep counterpart of the one-step bound of 43.10.01; the only structural change is the replacement of the scalar amplification by the companion-matrix power bound [Hairer-Nørsett-Wanner §III.4].

Theorem 4 (the spurious roots carry no accuracy but all the danger). Write the numerical solution near a smooth exact solution as a superposition over the roots: , where the principal root tracks the true solution and each spurious root contributes a parasitic component damped by . The order conditions are statements about the principal branch alone; the spurious branches contribute nothing to the order but become the dominant error if any approaches or exceeds one. This is the structural reason high-order methods built by pushing all coefficients into accuracy fail: extra order is bought by relocating the spurious roots toward and across the unit circle, exactly where the root condition forbids them [Henrici 1962].

Theorem 5 (necessity of one-step starters and their order). A convergent -step method requires starting values; for order- convergence the starting values must themselves be accurate to , which is why a one-step Runge-Kutta starter 43.10.01 of order at least is used to bootstrap the recurrence. Under zero-stability the starting errors propagate without amplification — the same power-bound controls their contribution — so a starter of matching order suffices and no order is lost in the transient. Lower-order starters degrade the global order to that of the starter, a failure mode invisible in the asymptotic theory until the starting errors dominate [Süli-Mayers §12.10].

Synthesis. The root condition on is the foundational reason the entire convergence theory of multistep methods reduces to a finite eigenvalue check: zero-stability is power-boundedness of the companion matrix of , convergence is zero-stability together with consistency, and the equivalence theorem makes these three the same condition, so the analytic limit of computed trajectories is decided by the moduli and multiplicities of algebraic numbers.

This is exactly the structure met for one-step methods in 43.10.01, where the scalar amplification factor was automatically power-bounded and only the Lipschitz/Gronwall ingredient mattered; the central insight is that the multistep recurrence introduces spurious roots that carry no accuracy and all the instability, and it generalises the one-step convergence theorem by promoting the scalar factor to the companion matrix whose spectrum is the roots of . Putting these together, the first Dahlquist barrier is the exact accounting of the tension the order theory of 43.10.02 left open — order conditions can always be met, but only by relocating -roots where the root condition forbids — and the optimal even- methods that touch the barrier pay for their order with all roots on the circle and no parasitic damping. The bridge to the rest of the chapter is the recognition that this stability is dual to the fixed- absolute stability of 43.10.04, which studies the same companion-matrix idea applied to at , with zero-stability the boundary case; the same template appears again as the Lax-Richtmyer equivalence theorem 43.11.05, completing the consistency-plus-stability story across ODE and PDE discretisations.

Full proof set Master

Proposition 1 (root condition bounded homogeneous solutions). The homogeneous recurrence (with ) has all solution sequences bounded in if and only if satisfies the root condition.

Proof. The general solution of the recurrence is a superposition of over the distinct roots of with , where is the multiplicity of — this is the standard solution basis of a linear constant-coefficient recurrence, and the are linearly independent. A term is bounded in iff (any , since the exponential decay beats the polynomial), or with . It is unbounded iff , or with , the latter requiring . Every solution is bounded iff every basis term is bounded, iff every root has and every root with has multiplicity one. That is the root condition.

Proposition 2 (the consistent unstable method diverges on a smooth problem). The two-step method is consistent of order but diverges on the IVP , , with vanishing starting errors.

Proof. Consistency and order are verified through the error constants 43.10.02: , , and one computes , , and by direct substitution, with . Now apply the method to . The recurrence is , a linear constant-coefficient recurrence with characteristic equation . At the roots are those of , namely and . For small they perturb to (the principal root tracking ) and . The general solution is . Even with starting values chosen so that for any fixed , at a fixed time the parasitic term is , and as faster than any power . Thus at fixed , while the exact solution is , finite. The method diverges.

Proposition 3 (Simpson's rule is order and zero-stable, attaining the barrier). The method has satisfying the root condition, and has order , realising the first Dahlquist barrier for .

Proof. The roots of are and , both of modulus and both simple, so the root condition holds and the method is zero-stable (Proposition 1). For the order, index the window with and , and compute the error constants . Then ; ; ; ; ; and . Over the common denominator , and , so . Hence and : the order is with even, the barrier value, attained with both roots on the unit circle.

Proposition 4 (starting-error propagation under zero-stability). For a zero-stable -step method with power bound , perturbing only the starting values by () changes the solution at every later step by at most , with independent of .

Proof. With identical right-hand sides and no per-step perturbations, the difference satisfies the inhomogeneous companion recurrence with collecting the -weighted Lipschitz -differences. Discrete variation of constants gives , so . The stacked initial vector has . Setting , the bound reads , a discrete Gronwall inequality 43.10.01 whose solution is . Take , independent of since .

Connections Master

  • The linear multistep order theory of 43.10.02 supplies the polynomials , the consistency conditions , , and the error constants that this unit takes as input; that unit constructs the Adams, BDF, and predictor-corrector families and certifies their order but leaves convergence open, flagging precisely that a consistent method can diverge. This unit closes that gap by proving convergence equivalent to the root condition, and the first Dahlquist barrier stated there is proved here as the constraint zero-stability places on how high the order of a stable can reach.

  • The one-step convergence theorem of 43.10.01 is the structural template this unit generalises: there the global bound came from consistency plus a scalar Lipschitz/discrete-Gronwall stability factor; here the scalar amplification is replaced by the companion matrix of , and power-boundedness — the root condition — is the multistep stability ingredient. The discrete Gronwall inequality of that unit is reused verbatim in the variation-of-constants estimate, and the Runge-Kutta starters that bootstrap a multistep recurrence are its instances.

  • The absolute-stability theory of 43.10.04 studies the same companion-matrix mechanism applied to the combined polynomial on the test equation at fixed : zero-stability is exactly the boundary case , asking only that the roots of alone satisfy the root condition, while absolute stability asks the same of the perturbed roots for . The classification of spurious roots here (strong versus weak stability) is the input to the boundary-locus construction there.

  • The eigenvalue and characteristic-polynomial theory of 01.01.08 is the engine of the companion-matrix argument: the roots of are the eigenvalues of , the Jordan structure controls whether is power-bounded, and the non-derogatory property of companion matrices ties Jordan-block size to root multiplicity, which is why simplicity of boundary roots is exactly non-defectiveness there.

  • The Cauchy/Bolzano-Weierstrass boundedness machinery of 02.03.02 underlies the definition of zero-stability as uniform boundedness of the discrete solution operator as : the existence of a single constant controlling all refinements is a boundedness criterion on the family of solution operators, and the limiting arguments in the contrapositive use the divergence of against a vanishing sequence.

Historical & philosophical context Master

The decisive synthesis is due to Germund Dahlquist, whose 1956 paper in Mathematica Scandinavica [Dahlquist 1956] introduced zero-stability through the root condition on the first characteristic polynomial and proved the equivalence of convergence with consistency-plus-stability for the general linear multistep method, establishing that accuracy and stability are independent requirements that together are necessary and sufficient. The same paper organised the order theory through the polynomials and ; Dahlquist's 1963 paper [Dahlquist 1963] then proved the order barriers, the first bounding the order of a zero-stable -step method and the second bounding the order of an A-stable one. The root condition itself had a precursor in the stability analyses of Richardson and the difference-equation tradition, but Dahlquist gave it the clean polynomial form and the equivalence theorem.

The companion-matrix and difference-operator organisation of the proof was systematised by Peter Henrici in his 1962 monograph Discrete Variable Methods in Ordinary Differential Equations, which cast the stability analysis as power-boundedness of the companion matrix and is the source of the modern textbook treatment. The structural parallel with the Lax-Richtmyer equivalence theorem for finite-difference approximations of partial differential equations, published by Peter Lax and Robert Richtmyer in the same year 1956, was recognised early: both theorems assert that for a consistent scheme approximating a well-posed linear problem, stability and convergence are equivalent. The order-star technique that gives the sharpest modern proof of the Dahlquist barriers was introduced by Gerhard Wanner, Ernst Hairer, and Syvert Nørsett in 1978.

Bibliography Master

@article{dahlquist1956,
  author  = {Dahlquist, Germund},
  title   = {Convergence and stability in the numerical integration of ordinary differential equations},
  journal = {Mathematica Scandinavica},
  volume  = {4},
  year    = {1956},
  pages   = {33--53}
}

@article{dahlquist1963,
  author  = {Dahlquist, Germund},
  title   = {A special stability problem for linear multistep methods},
  journal = {BIT Numerical Mathematics},
  volume  = {3},
  number  = {1},
  year    = {1963},
  pages   = {27--43}
}

@book{henrici1962,
  author    = {Henrici, Peter},
  title     = {Discrete Variable Methods in Ordinary Differential Equations},
  publisher = {John Wiley \& Sons},
  year      = {1962}
}

@book{hnw1993,
  author    = {Hairer, Ernst and N\o{}rsett, Syvert P. and Wanner, Gerhard},
  title     = {Solving Ordinary Differential Equations I: Nonstiff Problems},
  edition   = {2},
  series    = {Springer Series in Computational Mathematics},
  volume    = {8},
  publisher = {Springer-Verlag},
  year      = {1993}
}

@book{sulimayers2003,
  author    = {S\"{u}li, Endre and Mayers, David F.},
  title     = {An Introduction to Numerical Analysis},
  publisher = {Cambridge University Press},
  year      = {2003}
}

@book{leveque2007fdm,
  author    = {LeVeque, Randall J.},
  title     = {Finite Difference Methods for Ordinary and Partial Differential Equations: Steady-State and Time-Dependent Problems},
  publisher = {Society for Industrial and Applied Mathematics (SIAM)},
  year      = {2007}
}

@article{laxrichtmyer1956,
  author  = {Lax, Peter D. and Richtmyer, Robert D.},
  title   = {Survey of the stability of linear finite difference equations},
  journal = {Communications on Pure and Applied Mathematics},
  volume  = {9},
  number  = {2},
  year    = {1956},
  pages   = {267--293}
}

@article{wannerhairernorsett1978,
  author  = {Wanner, Gerhard and Hairer, Ernst and N\o{}rsett, Syvert P.},
  title   = {Order stars and stability theorems},
  journal = {BIT Numerical Mathematics},
  volume  = {18},
  number  = {4},
  year    = {1978},
  pages   = {475--489}
}