Linear multistep methods: Adams and BDF families, order via the characteristic polynomials
Anchor (Master): Hairer-Nørsett-Wanner 1993 *Solving Ordinary Differential Equations I: Nonstiff Problems* 2e (Springer) §III.1-III.2 (the general LMM, the error constants $C_p$, Adams and Nyström families, the order via the residual operator); Hairer-Wanner 1996 *Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems* 2e (Springer) §V.1 (BDF methods and their construction); Henrici 1962 *Discrete Variable Methods in Ordinary Differential Equations* (Wiley) Ch. 5 (the difference-operator calculus and the order conditions)
Intuition Beginner
A one-step method draws the next point of a solution curve using only the point you are standing on right now. Every slope you computed at the previous points is thrown away the moment you move. That feels wasteful: you paid to learn the slope at the last few stops, and those slopes still carry real information about how the curve is bending. A linear multistep method is the idea of keeping a short memory and reusing it.
Imagine walking a trail and predicting where the path goes next. A forgetful walker looks only at the ground underfoot. A walker with memory recalls the last few steps, notices the trail has been curving gently to the left, and leans into the turn before reaching it. By blending several recent slope readings instead of one, the method anticipates the bend and lands closer to the true path for the same amount of work each step.
There are two natural memories to keep. One is a memory of past slopes, used to extrapolate forward; this gives the Adams-Bashforth family, which predicts the next point from slopes already computed. The other also leans on the slope at the point you are about to reach, which must be solved for; this gives the Adams-Moulton family, which corrects a prediction and tracks curvature even better. A third design keeps a memory of past positions and fits the new slope to them; this gives the backward-differentiation formulas, the workhorses for stiff problems where explicit methods fail.
Two practical wrinkles appear the moment you keep a memory. First, a method needing the last three points cannot start from a single initial point, so the first few values must be supplied some other way, usually by a one-step method. Second, more memory is not automatically better: a badly chosen blend can let small rounding errors feed on themselves and grow. Sorting out which blends are safe is the work of the next unit; this one builds the blends and measures how accurate they are.
Visual Beginner
Picture the solution curve and a row of equally spaced time stations along the bottom. At each past station you have already marked a slope dash. A one-step method uses only the dash at the current station. A multistep method reaches back and draws on the dashes at several recent stations, fitting a small curve through them and following it forward to the next station.
The table below lines up the three families this unit builds. Reading across, each row says what the method remembers, whether it must solve for the new point, and the highest power of the step size whose error it cancels — the order.
| family | what it remembers | solve for the new point? | typical order for steps |
|---|---|---|---|
| Adams-Bashforth | the last slopes | no (explicit) | |
| Adams-Moulton | the last slopes plus the new slope | yes (implicit) | |
| backward differentiation | the last positions plus the new slope | yes (implicit) |
Each row is a recipe with fixed numerical weights, and the rest of the unit explains where those weights come from and why the order column reads as it does.
Worked example Beginner
Let us build and run the simplest memory method, two-step Adams-Bashforth, on the slope rule "the slope equals the current height," starting from height at time , whose true curve is . The recipe uses the two most recent slopes with weights and : $$ u^{n+2} = u^{n+1} + h\left(\tfrac{3}{2} f^{n+1} - \tfrac{1}{2} f^{n}\right), $$ where is the slope at the -th point, here equal to . Take a step of .
Step 1. The method needs two starting points, but we are handed only one. Use one step of forward Euler to manufacture the second: . Now we have at time and at time .
Step 2. Apply the two-step recipe to reach time . The slopes are and , so $$ u^2 = 1.5 + 0.5\left(\tfrac{3}{2}\times 1.5 - \tfrac{1}{2}\times 1\right) = 1.5 + 0.5(2.25 - 0.5) = 1.5 + 0.875 = 2.375. $$
Step 3. Compare with the truth. The exact height at time is . Plain forward Euler reached only here, an undershoot of about . The two-step method reached , an undershoot of about — closer, using the same number of slope evaluations per step, because it remembered the previous slope and used the curve's bend.
Step 4. See the order. The weights and are chosen so the method is exact whenever the true solution is a parabola. That makes it second order: halving the step shrinks the error by about a factor of four, not just two as for Euler.
What this tells us: keeping a one-step memory and blending two slopes with the right weights buys an extra power of accuracy for free, at the cost of needing a helper method to supply the first extra starting value.
Check your understanding Beginner
Formal definition Intermediate+
Consider the initial-value problem (IVP) , , with continuous and globally Lipschitz in , so the exact solution exists and is unique by the Picard-Lindelöf theorem 02.12.01. Fix a step , set , write , and abbreviate .
Definition (linear multistep method). An -step linear multistep method (LMM) is the difference equation $$ \sum_{j=0}^{r} \alpha_j, u^{n+j} ;=; h \sum_{j=0}^{r} \beta_j, f^{n+j}, \qquad n = 0, 1, 2, \dots, $$ with real coefficients , normalised by (so the leading term solves for the newest value) and (so the method genuinely reaches back steps). The method is explicit when , since then is given outright by past data; it is implicit when , since then appears inside and each step solves an algebraic equation for it.
Definition (characteristic polynomials). The first and second characteristic polynomials of the method are $$ \rho(\zeta) = \sum_{j=0}^{r} \alpha_j \zeta^j, \qquad \sigma(\zeta) = \sum_{j=0}^{r} \beta_j \zeta^j. $$ The polynomial records how past values are combined; records how past slopes are combined. The pair determines the method and is the object through which both accuracy and stability are read.
Definition (linear difference operator, truncation error, order). Associate to the method the linear difference operator acting on a smooth function : $$ \mathcal{L}z; h ;=; \sum_{j=0}^{r} \big(\alpha_j, z(t + jh) - h,\beta_j, z'(t + jh)\big). $$ The local truncation error (LTE) is , the residual when the exact solution is substituted into the formula and divided by . The method has order if for every function , equivalently ; it is consistent if . Expanding each and in a Taylor series collects with error constants $$ C_0 = \sum_j \alpha_j, \qquad C_q = \frac{1}{q!}\sum_j \alpha_j j^q - \frac{1}{(q-1)!}\sum_j \beta_j j^{q-1}\ \ (q \ge 1), $$ and order is exactly , , with the principal error constant.
The symbols , the operator , the error constants , and the Landau symbol are recorded in _meta/NOTATION.md.
Counterexamples to common slips Intermediate+
"Consistency of a multistep method is just , same as one-step." The consistency conditions read identically as , but for consistency no longer implies convergence: a consistent LMM can diverge if its has a spurious root outside the unit disc. The missing ingredient, zero-stability, is the root condition on developed in
43.10.03; this unit constructs the methods and certifies their order, and treats convergence as the companion question."The order is whatever says." Both polynomials enter every order condition. Consistency is and together: the first says constants are reproduced, the second says linear functions are reproduced. Ignoring produces a method that is exact on constants but already wrong on straight lines.
"More steps always raises the order." The number of order conditions an -step method must satisfy grows with , and for zero-stable methods the attainable order is capped — the first Dahlquist barrier limits a stable -step method to order (odd ) or (even ). One can push the order higher by spending all coefficients on accuracy, but the resulting then violates the root condition and the method is useless. This tension is the subject of
43.10.03.
Key theorem with proof Intermediate+
The signature result is the characterisation of order through and : a single algebraic statement that turns "match the Taylor expansion to order " into explicit conditions on the two characteristic polynomials, and reads consistency off as two equations on alone together with . Every coefficient choice in the Adams and BDF constructions below is dictated by it.
Theorem (order conditions for linear multistep methods). Let an -step LMM have characteristic polynomials and error constants as above. Then:
The method has order if and only if , equivalently
Equivalently, in terms of the polynomials,
so the method has order exactly when the analytic function vanishes to order at .
In particular the method is consistent (order ) if and only if
Proof. Take smooth and expand each term of in a Taylor series about . With and , substitute into the operator and collect powers of : $$ \mathcal{L}z;h = \sum_{j} \alpha_j \sum_{q\ge 0}\frac{(jh)^q}{q!} z^{(q)}
- h\sum_j \beta_j \sum_{q \ge 1}\frac{(jh)^{q-1}}{(q-1)!} z^{(q)} = \sum_{q\ge 0} C_q h^q z^{(q)}(t), $$ with and, for , , reading off the coefficient of from both sums (the second sum contributes to order via its factor). The operator therefore annihilates every function to order precisely when . Clearing the factorials in gives , which is part 1.
For part 2, apply the operator to the exponential probe . Then and , so $$ \mathcal{L}e^{s\cdot}; h = e^{st}\Big(\sum_j \alpha_j e^{sjh} - h s \sum_j \beta_j e^{sjh}\Big) = e^{st}\big(\rho(e^{sh}) - hs,\sigma(e^{sh})\big). $$ Setting (a unit-rate probe, with ) gives the function . On the other hand from the expansion above with ; comparing, at . Hence order (i.e. , ) is the same as .
Part 3 is the case written through the polynomials. The condition is . The condition is , and since gives while , this is .
The analytic form in part 2 is often written as the log condition: dividing by and setting , order is the statement that as , since near . This is the lens through which the Adams and BDF families are derived: choose first by a structural principle, then take to be the Taylor truncation of to the required degree.
Bridge. The order conditions are the foundational reason a multistep method's accuracy is a property of the pair and not of any individual coefficient: the linear difference operator filters out exactly the Taylor data that the polynomials fail to reproduce, and the probe collapses that filtering into the single analytic identity . This argument builds toward the convergence theory of 43.10.03, where the same first polynomial that here governs accuracy through , reappears as the carrier of stability through the location of its roots; that is exactly the point at which consistency and convergence come apart, because a consistent may still have a root outside the unit disc. The structure generalises the one-step convergence theorem of 43.10.01: the scalar amplification factor that there propagated the local error is here replaced by the companion matrix of , and putting these together, the central insight of the whole chapter — local accuracy from Taylor matching, propagated under a stability condition — is what the bridge from this unit to the next makes precise, with the root condition on playing the role the Lipschitz bound played for one-step methods. The error-constant calculus that drives the Adams and BDF construction below appears again in the Dahlquist-barrier count of how much order a stable can support.
Exercises Intermediate+
Advanced results Master
The order calculus and the construction extend into the full structural theory of multistep methods: the systematic generation of the Adams, Nyström, and BDF families, the error-constant comparison that selects methods in practice, the variable-step and variable-order reformulations of production codes, and the obstruction that caps the order of any method one would actually use.
Theorem 1 (Adams family and its error constants). The -step Adams methods take and the interpolatory quadrature weight polynomial for the integral . The explicit Adams-Bashforth method interpolates at the nodes and has order ; the implicit Adams-Moulton method interpolates additionally at and has order . Their principal error constants are and , the Adams coefficients generated by and ; for each step count the Moulton constant is markedly smaller in magnitude than the Bashforth constant, which is why the implicit corrector is paired with the explicit predictor [Hairer-Nørsett-Wanner §III.1].
Theorem 2 (BDF methods and their construction). The -step backward-differentiation formula takes (a single slope term, at the newest node) and , obtained by differentiating the interpolant of past values and matching its derivative at . Equivalently -coefficients and . BDF has order . The defining feature is that concentrates all slope information at the new point, which makes BDF methods strongly damping and the standard choice for stiff systems; the construction loses zero-stability for , so only BDF1 through BDF6 are usable, with BDF1 being backward Euler [Hairer-Wanner §V.1].
Theorem 3 (predictor-corrector schemes and local error estimation). Pairing an explicit predictor of order with an implicit corrector of order in the modes PEC, PECE, or P(EC)E avoids solving the implicit equation exactly: one or a fixed few corrector iterations suffice to recover the corrector's order provided is small enough that the corrector iteration contracts. The difference between predicted and corrected values, , is a computable estimate of the local error proportional to — Milne's device — and is the basis of step-size control in production codes [LeVeque §5.9.5].
Theorem 4 (first Dahlquist barrier — stated, proof deferred). A zero-stable -step linear multistep method has order at most if is even, if is odd, and order forces all roots of on the unit circle (the optimal methods, with poor damping). The maximal useful order for an explicit zero-stable method is , and for an A-stable method is (the second Dahlquist barrier). The barrier is the precise expression of the tension flagged above: order conditions can always be satisfied, but only at the cost of pushing the roots of where the root condition forbids. The root condition itself, zero-stability, and the equivalence theorem that converts it into convergence are developed in 43.10.03, where this barrier is proved [Hairer-Nørsett-Wanner §III.3].
Theorem 5 (variable-step and Nordsieck representation). Production multistep codes vary both step size and order. Re-deriving the coefficients at each step for a new step-size ratio is captured compactly by the Nordsieck vector, which carries scaled derivatives rather than past values, so a step-size change is a diagonal rescaling and an order change is a truncation or extension of the vector. The Adams-Moulton corrector in Nordsieck form is the algorithm underlying the classical Gear and Adams codes (LSODE, VODE, DASSL), making variable-order Adams and BDF the default stiff and nonstiff integrators in scientific computing [Hairer-Nørsett-Wanner §III.5].
Synthesis. The order conditions on are the foundational reason the entire zoo of multistep methods is one family with one accuracy calculus: every Adams, Nyström, Milne, and BDF scheme is a choice of by a structural principle — reuse the last value difference, or differentiate past values — followed by the unique the log condition forces for the target order, with the principal error constant ranking methods of equal order. This is exactly the situation met for one-step methods in 43.10.01, where order came from Taylor matching of the increment function; the central insight is that consistency is a statement about and near , and it generalises into the chapter's organising dichotomy: accuracy lives at while stability lives on the rest of the unit circle. Putting these together, the Adams and BDF constructions are dual responses to one constraint — Adams puts all freedom into for nonstiff accuracy, BDF collapses to a single term for stiff damping — and the bridge to the next unit is the recognition that the freedom the order conditions leave in is precisely what the root condition will spend, so the first Dahlquist barrier is the exact accounting of how much order a stable can carry. The companion-matrix propagation that converts these polynomials into a convergence statement is what 43.10.03 supplies, completing the consistency-plus-stability template this unit's order theory is one half of.
Full proof set Master
Proposition 1 (consistency conditions via ). An -step LMM is consistent (order ) if and only if and .
Proof. By the Taylor expansion of the difference operator (Key theorem), order is . The constant , so . The constant . Since , evaluation at gives , while . Thus , and .
Proposition 2 (the error constant is independent of the trajectory). If a method has order with principal error constant , then for any exact solution the local truncation error is .
Proof. The difference operator expands as . Order means , so the leading surviving term is . By definition , the higher terms collecting and beyond. The coefficient depends only on , not on .
Proposition 3 (Adams-Moulton two-step is order three with ). The method has order , and its principal error constant is .
Proof. Index the window (nodes ) with and . Compute the error constants from : ; ; ; . So order . Then . Hence the order is exactly with . The magnitude is smaller than the two-step Adams-Bashforth constant at the same step count, the quantitative form of the predictor-corrector advantage.
Proposition 4 (BDF2 is zero-stable; the construction degrades at six steps). The two-step BDF method has with roots and , both in the closed unit disc with the boundary root simple; it is order and zero-stable.
Proof. Factor , exhibiting the roots and . The principal root is simple and on the unit circle; the second root has modulus . The root condition — all roots of satisfy , those on simple — therefore holds, so the method is zero-stable 43.10.03. For order: , so ; gives and , consistency. Matching : with on and , and , so order . The general BDF first polynomial is , and a root-locus computation shows a root crosses outside the unit circle once , so the root condition fails and BDF is not zero-stable beyond six steps; the proof of that crossing is the barrier analysis of 43.10.03.
Connections Master
The one-step methods of
43.10.01supply the local-truncation-error and order framework this unit specialises: the difference operator and the order constants are the multistep generalisation of the increment-function Taylor matching used there, and the forward-Euler starter that bootstraps a multistep method's missing initial values is one of its instances. The clean discrete-Gronwall convergence bound of that unit, however, no longer holds automatically here — that is the gap the next unit closes.The zero-stability and Dahlquist equivalence theorem of
43.10.03is the direct continuation: it converts the same first characteristic polynomial used here for accuracy into the carrier of stability through the root condition on its zeros, and proves that for a consistent LMM convergence is equivalent to zero-stability, with the first Dahlquist barrier (stated here) limiting the order a zero-stable can attain. The order constants this unit computes are the accuracy half of the consistency-plus-stability template that unit completes.The absolute-stability theory of
43.10.04studies the combined polynomial on the linear test equation , : the region where all its roots lie in the unit disc is the multistep region of absolute stability, the direct analogue of the one-step stability function . The Adams and BDF pairs built here are exactly the inputs to that boundary-locus construction.The stiffness and A-stability theory of
43.10.05explains why the BDF family constructed here, with collapsed to a single newest-node term, is the standard stiff integrator: its strong damping comes from that structural choice, and the second Dahlquist barrier limits A-stable LMMs to order , with the trapezoidal rule — the one-step Adams-Moulton member — as the optimal case. BDF1 through BDF6 are the practically usable stiff multistep methods.The interpolatory-quadrature derivation of the Adams coefficients rests on the polynomial-interpolation and Newton-Gregory machinery whose root unit is the corpus's interpolation theory; the multivariable Taylor expansion of
02.05.05is the engine of every order condition, and the continuous IVP existence guaranteed by Picard-Lindelöf02.12.01is the exact solution the local truncation error is measured against.
Historical & philosophical context Master
The Adams methods originate with John Couch Adams, who devised the backward-difference quadrature scheme around 1855 to compute capillary-action profiles for Francis Bashforth's study of the theory of capillary action; the work was published jointly as Bashforth and Adams in 1883 [Bashforth-Adams 1883], giving the explicit family the name Adams-Bashforth. The implicit companion was introduced by Forest Ray Moulton (1926) [Moulton 1926] in his treatise on the numerical computation of ballistic trajectories, who paired the explicit predictor with the implicit corrector — the predictor-corrector idea in its modern form.
The unifying treatment through the characteristic polynomials and and the systematic order theory is due to Germund Dahlquist, whose 1956 paper [Dahlquist 1956] established consistency, the root condition, and the equivalence of convergence with zero-stability for the general linear multistep method, and whose 1963 paper proved the order barriers that bear his name. The difference-operator calculus that makes the order conditions a routine computation was organised by Peter Henrici in his 1962 monograph. The backward-differentiation formulas were brought to prominence by C. William Gear (1971) [Gear 1971], whose stiff-equation solver made BDF the default method for the stiff systems pervasive in chemical kinetics and circuit simulation; the variable-order Nordsieck implementation underlying the LSODE and VODE codes descends from this line.
Bibliography Master
@book{leveque2007fdm,
author = {LeVeque, Randall J.},
title = {Finite Difference Methods for Ordinary and Partial Differential Equations: Steady-State and Time-Dependent Problems},
publisher = {Society for Industrial and Applied Mathematics (SIAM)},
year = {2007}
}
@book{sulimayers2003,
author = {S\"{u}li, Endre and Mayers, David F.},
title = {An Introduction to Numerical Analysis},
publisher = {Cambridge University Press},
year = {2003}
}
@book{hnw1993,
author = {Hairer, Ernst and N\o{}rsett, Syvert P. and Wanner, Gerhard},
title = {Solving Ordinary Differential Equations I: Nonstiff Problems},
edition = {2},
series = {Springer Series in Computational Mathematics},
volume = {8},
publisher = {Springer-Verlag},
year = {1993}
}
@book{hairerwanner1996,
author = {Hairer, Ernst and Wanner, Gerhard},
title = {Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems},
edition = {2},
series = {Springer Series in Computational Mathematics},
volume = {14},
publisher = {Springer-Verlag},
year = {1996}
}
@book{henrici1962,
author = {Henrici, Peter},
title = {Discrete Variable Methods in Ordinary Differential Equations},
publisher = {John Wiley \& Sons},
year = {1962}
}
@book{bashforth1883,
author = {Bashforth, Francis and Adams, John Couch},
title = {An Attempt to Test the Theories of Capillary Action by Comparing the Theoretical and Measured Forms of Drops of Fluid},
publisher = {Cambridge University Press},
year = {1883}
}
@book{moulton1926,
author = {Moulton, Forest Ray},
title = {New Methods in Exterior Ballistics},
publisher = {University of Chicago Press},
year = {1926}
}
@article{dahlquist1956,
author = {Dahlquist, Germund},
title = {Convergence and stability in the numerical integration of ordinary differential equations},
journal = {Mathematica Scandinavica},
volume = {4},
year = {1956},
pages = {33--53}
}
@article{dahlquist1963,
author = {Dahlquist, Germund},
title = {A special stability problem for linear multistep methods},
journal = {BIT Numerical Mathematics},
volume = {3},
number = {1},
year = {1963},
pages = {27--43}
}
@book{gear1971,
author = {Gear, C. William},
title = {Numerical Initial Value Problems in Ordinary Differential Equations},
publisher = {Prentice-Hall},
year = {1971}
}