One-step methods: Euler, trapezoidal, Runge-Kutta; consistency and order
Anchor (Master): Hairer-Nørsett-Wanner 1993 *Solving Ordinary Differential Equations I: Nonstiff Problems* 2e (Springer) §II.1-II.3 (the general one-step method, the order conditions, Butcher's rooted-tree theory at a high level); Butcher 2016 *Numerical Methods for Ordinary Differential Equations* 3e (Wiley) Ch. 2-3 (the algebraic theory of order); LeVeque 2007 (SIAM) Ch. 5-6 (consistency, zero-stability, and convergence for one-step and linear multistep methods)
Intuition Beginner
A differential equation tells you the slope of a curve at every point but does not hand you the curve itself. The starting equation here is the simplest kind: you know where the curve begins, and you have a rule that, given the time and the current height, returns the slope right now. The question is how to draw the whole curve forward in time when all you ever get to ask for is the slope at the spot where you currently stand.
The honest answer is to take small steps. Stand at the starting point, ask for the slope, and walk a short distance in that straight-line direction. You will not land exactly on the true curve, because the real curve bends while you walked straight, but if the step was short you land close. Now ask for the slope again at the new spot and take another short straight step. Repeating this is the whole idea of a one-step method: each new point is computed from the one before it using a single evaluation of the slope rule.
The plainest version, taking the slope at the point you are leaving, is Euler's method. It is easy to picture and easy to compute, but it leans on one slope for the entire step, so it drifts. Two fixes appear at once. You can use the slope at where you are about to arrive instead, which needs a small bit of solving but behaves better on stiff problems. Or you can average the slope at the start and the slope at the end, which is the trapezoidal rule and tracks curvature far more faithfully.
The deepest fix is to peek. Before committing to a full step, sample the slope at a few cleverly chosen interior points and combine those samples with carefully tuned weights. That is a Runge-Kutta method, and the famous four-sample recipe matches the true curve so closely that halving the step shrinks the error by a factor of sixteen. This unit is the root of the chapter: every later method for marching a differential equation forward is measured against the standards set here.
Visual Beginner
Picture a slope field: at a grid of points in the time-height plane, a short dash shows the slope the rule would report there. The true solution is the curve that stays tangent to every dash it passes through. A one-step method is a polygonal path that tries to shadow that curve, choosing each new segment from sampled dashes.
The table below contrasts the four methods of this unit by what slope information each step uses. Reading down the table, the methods spend more slope evaluations per step and, in return, hug the true curve more tightly. The rightmost column is the order: the power of the step size that controls how fast the error shrinks as the steps get shorter.
| method | slope information used per step | cost per step | order |
|---|---|---|---|
| forward Euler | slope at the point you leave | one slope | |
| backward Euler | slope at the point you arrive (solve for it) | one implicit slope | |
| trapezoidal rule | average of the leaving and arriving slopes | one implicit slope | |
| classical RK4 | four sampled slopes, weighted over | four slopes |
Each row is a recipe you can run by hand on a small example, and the rest of the unit explains why the order column reads exactly as it does and why a higher order is worth the extra slope samples.
Worked example Beginner
Let us march the equation whose slope rule is "the slope equals the current height," starting from height at time . The true solution is the growing curve , so we have an exact answer to compare against. Take a step size of and advance with forward Euler to time .
Step 1. The rule is: new height equals old height plus times the slope, and the slope is the current height. Starting at height , the first step gives . We have arrived at time with height .
Step 2. Apply the same rule again from the new point: . We have reached time with height .
Step 3. Compare with the truth. The exact height at time is . The Euler march gives , an undershoot of about . Euler used the slope from the start of each step, and because this curve bends upward, the start-of-step slope is always a little too small, so the polygon falls below the curve.
Step 4. Halve the step and watch the error shrink. With the four-step march gives , landing at , an undershoot of about . Cutting the step roughly halved the final error, which is the signature of a first-order method: error falls in proportion to the step size.
What this tells us: a one-step method turns a differential equation into ordinary arithmetic you repeat, and the size of the leftover error is governed by the step size raised to the method's order. Euler is order one, so smaller steps help only in proportion. A higher-order method, with the same effort spent more cleverly, would have landed far closer with the very same step count.
Check your understanding Beginner
Formal definition Intermediate+
Consider the initial-value problem (IVP) for a system of first-order ordinary differential equations
$$
y'(t) = f(t, y(t)), \qquad y(t_0) = y_0, \qquad y(t) \in \mathbb{R}^d,
$$
with continuous and globally Lipschitz in : there is with for all and . Under this hypothesis the IVP has a unique solution on by the Picard-Lindelöf theorem 02.12.01. Fix a step , set , and write for the numerical approximation; the exact solution value is .
Definition (one-step method). A one-step method for the IVP is a recurrence $$ u^{n+1} = u^n + h,\Phi(t_n, u^n, h), $$ where the increment function encodes the method. The method is explicit when is evaluated from data already known at step , and implicit when the formula for contains inside an -evaluation, so each step solves an algebraic equation for .
The four basic instances are: forward (explicit) Euler, , giving ; backward (implicit) Euler, ; the trapezoidal rule, ; and the implicit midpoint rule, .
Definition (local truncation error and consistency). The local truncation error (LTE) is the residual obtained by substituting the exact solution into the difference formula: $$ \tau^n ;=; \frac{y(t_{n+1}) - y(t_n)}{h} ;-; \Phi\big(t_n, y(t_n), h\big). $$ The method is consistent of order if for some constant independent of and , as ; equivalently . Consistency (order ) is the requirement , i.e. the increment function reduces to the right-hand side in the zero-step limit. The integer is the order of the method.
Definition (explicit Runge-Kutta method). An -stage explicit Runge-Kutta (RK) method computes intermediate stages and advances by
$$
k_i = f\Big(t_n + c_i h,; u^n + h!\sum_{j=1}^{i-1} a_{ij} k_j\Big), \qquad
u^{n+1} = u^n + h \sum_{i=1}^{s} b_i, k_i,
$$
so . The coefficients are collected in the Butcher tableau with strictly lower triangular (the explicit condition), the weights , and the nodes . The classical RK4 is
$$
\begin{array}{c|cccc}
0 & & & & \
\tfrac12 & \tfrac12 & & & \
\tfrac12 & 0 & \tfrac12 & & \
1 & 0 & 0 & 1 & \ \hline
& \tfrac16 & \tfrac13 & \tfrac13 & \tfrac16
\end{array}
\qquad
u^{n+1} = u^n + \tfrac{h}{6}\big(k_1 + 2k_2 + 2k_3 + k_4\big).
$$
The notation , , the Butcher triple , and the Landau symbol are recorded in _meta/NOTATION.md.
Counterexamples to common slips Intermediate+
"Consistency alone gives convergence." For one-step methods consistency plus the Lipschitz bound on does give convergence (the theorem below), but the implication is not automatic from in isolation: the stability ingredient, here the Lipschitz constant of controlling error propagation, is essential. For multistep methods consistency without zero-stability fails outright
43.10.03."The local truncation error is the error after one step." The LTE is the residual divided by ; the actual one-step error from exact data is . The convergence theorem shows these per-step errors accumulate over steps to a global error — one power of is lost to accumulation.
"Backward Euler and the trapezoidal rule are the same order because both are implicit." Implicitness is orthogonal to order. Backward Euler is order ; the trapezoidal rule is order . The order is fixed by Taylor matching of , not by whether the step is implicit.
"More stages always means higher order." The number of order conditions grows faster than the number of free tableau coefficients: an -stage explicit RK method attains order at most , and for no explicit -stage method reaches order (the Butcher barriers). Stages buy order only up to these algebraic limits.
Key theorem with proof Intermediate+
The signature theorem is the convergence of one-step methods: a consistent method whose increment function is Lipschitz converges, with global error of the same order as the consistency order. It is the statement that makes "order " a promise about the answer and not merely about a single step, and every order claim in the chapter is read through it.
Theorem (convergence of one-step methods). Let be globally Lipschitz in with constant on , and let the one-step method have an increment function that is Lipschitz in its second argument, for , . If the method is consistent of order , so that the local truncation error satisfies , then the global error satisfies $$ \max_{0 \le n \le N} |e^n| ;\le; e^{\Lambda (T - t_0)},|e^0| ;+; \frac{e^{\Lambda (T - t_0)} - 1}{\Lambda}, C h^p, \qquad Nh = T - t_0. $$ In particular, if then with ; the method converges and the global error is . [LeVeque §5.3-5.4; Süli-Mayers §12.2-12.3]
Proof. Subtract the numerical recurrence from the exact-solution identity. By the definition of the LTE, the exact solution satisfies $$ y(t_{n+1}) = y(t_n) + h,\Phi\big(t_n, y(t_n), h\big) + h,\tau^n . $$ The numerical solution satisfies . Subtracting gives the error recurrence $$ e^{n+1} = e^n + h\big[\Phi(t_n, y(t_n), h) - \Phi(t_n, u^n, h)\big] + h,\tau^n . $$ Take norms and apply the Lipschitz bound on and the LTE bound: $$ |e^{n+1}| \le |e^n| + h\Lambda |e^n| + h|\tau^n| \le (1 + h\Lambda)|e^n| + C h^{p+1}. $$ This is a scalar linear recurrence in of the form with . Unrolling it, $$ \varepsilon_n \le (1 + h\Lambda)^n \varepsilon_0 + B \sum_{m=0}^{n-1} (1 + h\Lambda)^m = (1 + h\Lambda)^n \varepsilon_0 + B,\frac{(1 + h\Lambda)^n - 1}{h\Lambda}. $$ Use the elementary bound , hence for . Substituting and dividing the geometric factor by , $$ \varepsilon_n \le e^{\Lambda(T - t_0)},\varepsilon_0 + \frac{e^{\Lambda(T - t_0)} - 1}{\Lambda},C h^p . $$ Taking the maximum over yields the stated bound. With the first term vanishes and with as stated.
The recurrence is the discrete Gronwall inequality, the difference-equation shadow of the continuous Gronwall lemma used to prove uniqueness for the IVP 02.12.01. For forward Euler, is Lipschitz with , and a Taylor expansion gives , so and the global error is , the rate observed numerically in the Beginner worked example.
Bridge. This theorem is the foundational reason "order" is a property of the computed trajectory and not of a single step: the per-step residuals accumulate across steps, and the discrete Gronwall factor is exactly the amplification budget that keeps the sum at rather than letting it blow up. The argument builds toward the multistep convergence theory of 43.10.02, where the scalar amplification factor is replaced by a companion matrix and the clean Gronwall bound must be supplemented by the root condition — this is exactly the point at which consistency stops implying convergence on its own and zero-stability 43.10.03 becomes a separate hypothesis. The structure generalises: "consistency plus stability implies convergence" is the central insight, and it appears again in the Lax-Richtmyer equivalence theorem for finite-difference PDE schemes, where stability becomes uniform power-boundedness of the evolution operator. Putting these together, the one-step theorem is the simplest member of a family in which the bridge is always the same — local accuracy controlled by Taylor matching, propagated under a stability bound that prevents error amplification — and the continuous Gronwall lemma that gives IVP uniqueness is dual to the discrete Gronwall lemma that gives numerical convergence.
Exercises Intermediate+
Advanced results Master
The one-step convergence theorem is the elementary stratum. Its reach extends through the algebraic theory of order conditions, the structure of implicit and collocation methods, the asymptotic expansion of the global error that underlies extrapolation, and the distinction between the convergence proved above and the fixed- stability that governs practical computation.
Theorem 1 (Butcher order conditions and the order barriers). An explicit RK method with tableau has order if and only if the order conditions hold for every rooted tree with at most vertices, where is the tree's density and the elementary-weight monomial in the tableau coefficients indexed by . The count of conditions is the number of rooted trees: through orders . Because an -stage explicit method has free coefficients while the conditions proliferate, the attainable order is bounded: an explicit -stage RK has order , with equality impossible for (no -stage explicit method has order ; order needs stages), and order needs at least stages — the Butcher barriers [Hairer-Nørsett-Wanner §II.3].
Theorem 2 (implicit RK, collocation, and the Gauss methods). A fully implicit -stage RK method (general, not strictly lower-triangular ) can reach order . The -stage Gauss-Legendre collocation method, with nodes the roots of the shifted Legendre polynomial of degree , attains the maximal order and is symplectic; the trapezoidal and implicit-midpoint rules are the Gauss () and Lobatto/Gauss instances. Collocation reframes the method: is the value at of the degree- polynomial with and at the collocation nodes, so the order is the quadrature order of the underlying node set [Hairer-Nørsett-Wanner §II.7].
Theorem 3 (global error expansion and Richardson extrapolation). For a method of order applied to a smooth IVP, the global error admits an asymptotic expansion , where solves the linear variational equation with built from the principal LTE coefficient. The leading being computable in principle is what makes Richardson extrapolation work: combining the numerical solutions at steps and as cancels the term and raises the order to , iterated in the Gragg-Bulirsch-Stoer scheme [LeVeque §5.5].
Theorem 4 (A-stability obstruction for one-step vs. multistep). The stability function of an explicit RK method is a polynomial, so as : no explicit method can be A-stable, and explicit one-step methods inherit a bounded absolute-stability region. Implicit RK methods have rational , and A-stability is the condition on the left half-plane , characterised for the diagonal Padé approximants of realised by the Gauss methods. This is the one-step face of the stiffness theory whose multistep form is the Dahlquist second barrier 43.10.05.
Theorem 5 (B-convergence and the one-sided Lipschitz condition). The convergence theorem's constant degrades catastrophically when the Lipschitz constant is large, as for stiff systems. Replacing the Lipschitz bound by the one-sided Lipschitz condition with moderate (possibly ) gives B-convergence: for algebraically stable implicit RK methods the global error bound depends on rather than , so the estimate stays meaningful in the stiff limit . This is the rigorous form of the observation that implicit methods, not explicit ones, are the right tool when has widely separated time scales [Hairer-Nørsett-Wanner §II.1].
Synthesis. The one-step convergence theorem is the foundational reason numerical integration of an IVP is a controlled approximation rather than an open-ended drift: local accuracy is fixed by Taylor matching of the increment function against , measured by the order of the local truncation error, and that local accuracy is propagated to a global bound by the discrete Gronwall stability estimate. The central insight is that this is one mechanism with two halves — consistency and stability — and the entire chapter is the systematic refinement of each half: the order conditions of Butcher's rooted-tree algebra make consistency an exact combinatorial calculus, and the absolute-stability and zero-stability theories make stability a precise spectral condition. This generalises directly: the same consistency-plus-stability template, with the scalar amplification factor promoted to the companion matrix of a multistep method 43.10.02, produces the Dahlquist equivalence theorem 43.10.03, and promoted to the evolution operator of a discretised partial differential equation produces the Lax-Richtmyer equivalence theorem.
Putting these together, the convergence theorem proved here is dual to the Picard-Lindelöf uniqueness theorem 02.12.01 — the continuous Gronwall lemma giving uniqueness of the exact flow is exactly the discrete Gronwall lemma giving convergence of the numerical flow — and the bridge from the elementary Euler bound to the full stiffness theory is the recognition that for large Lipschitz constants the convergence constant must be re-derived from a one-sided bound, which is precisely why the implicit methods of this unit, not the explicit ones, survive the passage to stiff problems 43.10.05. The global error expansion completes the picture: the leading term is itself the solution of a variational equation, so the error is not merely bounded but asymptotically structured, and that structure is what extrapolation exploits to manufacture higher order from the same family of methods.
Full proof set Master
Proposition 1 (discrete Gronwall inequality). Let be non-negative reals satisfying for constants , . Then for all .
Proof. Unrolling the recurrence, , the geometric sum being valid since . The inequality (from the convexity of , or the series ) gives . Substituting into both terms, and using that the map is increasing so , yields .
Proposition 2 (order of forward Euler). For an IVP with , forward Euler has local truncation error for some , hence order ; if is globally Lipschitz the global error is .
Proof. By Taylor's theorem with Lagrange remainder, for some . The forward-Euler increment is . Hence $$ \tau^n = \frac{y(t_{n+1}) - y(t_n)}{h} - y'(t_n) = \frac{1}{h}\Big(h y'(t_n) + \tfrac{h^2}{2}y''(\xi_n)\Big) - y'(t_n) = \tfrac{h}{2}y''(\xi_n). $$ With bounded on the compact trajectory, , so . Forward Euler has with Lipschitz constant , so the one-step convergence theorem gives .
Proposition 3 (existence and Lipschitz continuity of the implicit-Euler increment). If is globally Lipschitz with constant and , the backward-Euler equation has a unique solution , and the increment function it defines is Lipschitz in with constant .
Proof. Fix . The map satisfies , a contraction since . By the Banach fixed-point theorem has a unique fixed point , which is the backward-Euler step. For Lipschitz continuity of : let solve the equation for data . Subtracting, , so , giving . Since , .
Proposition 4 (RK4 has order four on the linear test equation). The classical RK4 stability function is , matching through ; hence on the method is order .
Proof. For with , compute the stages: ; ; ; . Then $$ u^{n+1} = u^n + \tfrac{h}{6}(k_1 + 2k_2 + 2k_3 + k_4) = u^n\Big[1 + \tfrac{z}{6}\big(1 + 2(1 + \tfrac{z}{2}) + 2(1 + \tfrac{z}{2} + \tfrac{z^2}{4}) + (1 + z + \tfrac{z^2}{2} + \tfrac{z^3}{4})\big)\Big]. $$ The bracket inside is , so . Thus , which agrees with through order ; the one-step error , giving consistency order and, by the convergence theorem, global error .
Connections Master
The linear multistep methods of
43.10.02retain the local-truncation-error and order framework defined here but replace the single-step recurrence by an -step recurrence; the consistency conditions become conditions on the first and second characteristic polynomials , and the clean discrete-Gronwall stability bound of the one-step theorem no longer holds automatically. That unit is where consistency and convergence first come apart, motivating the separate zero-stability hypothesis; this unit supplies the LTE/order definitions it specialises.The zero-stability and Dahlquist equivalence theorem of
43.10.03is the multistep generalisation of the one-step convergence theorem proved here: "consistency plus stability implies convergence" reappears with the scalar amplification factor promoted to a companion matrix whose powers must stay bounded (the root condition). The order- global-error conclusion is identical in form; only the stability ingredient changes from a Lipschitz bound to a spectral condition.The absolute-stability theory of
43.10.04takes the stability function derived here for the linear test equation and studies the region , which governs the maximum stable step for a fixed problem rather than the convergence. The explicit-RK stability polynomials computed in this unit's proof set are exactly the objects whose sublevel sets that unit maps.The stiffness and A-stability theory of
43.10.05explains why the implicit methods introduced here — backward Euler and the trapezoidal rule — are indispensable: their rational stability functions can satisfy on the entire left half-plane, which no explicit RK polynomial can. The Dahlquist second barrier there limits the order of A-stable linear multistep methods, with the trapezoidal rule of this unit as the optimal second-order A-stable member.The continuous existence-uniqueness theory of
02.12.01is the input the convergence theorem assumes: the Picard-Lindelöf theorem guarantees the exact solution that the local truncation error is defined against, and the continuous Gronwall lemma used in that uniqueness proof is the exact analogue of the discrete Gronwall lemma driving the convergence proof here. The matrix-exponential solution of linear systems in02.06.03is the closed form of the linear test problem used throughout to read off stability functions.
Historical & philosophical context Master
Euler's method appears in Leonhard Euler's Institutiones calculi integralis (1768-1770) [Euler 1768], where the polygonal advance is introduced to approximate solutions of differential equations that resist closed-form integration; the method and its first-order error were understood by Euler as a limiting process recovering the exact solution as . Augustin-Louis Cauchy turned the polygonal construction into the first rigorous existence proof for the IVP in his 1820s-1830s lectures, the Cauchy-Euler method later completed by Lipschitz (1876) and given its fixed-point form by Picard (1890) and Lindelöf (1894), so the numerical scheme and the existence theorem share a single origin.
The higher-order one-step methods are due to Carl Runge (1895) [Runge 1895], who introduced the idea of sampling the slope at interior points to cancel low-order error terms, and Wilhelm Kutta (1901) [Kutta 1901], who systematised the order conditions and exhibited the classical fourth-order tableau now bearing both names; Karl Heun (1900) gave the second-order improved-Euler method in the same period. The algebraic theory of order — the rooted-tree calculus that turns the Taylor-matching conditions into a combinatorial enumeration and proves the order barriers — is the work of John Butcher, beginning with his 1963 paper on the coefficients of RK processes and developed into the B-series framework now standard. The unifying convergence-equivalence viewpoint, that consistency plus stability is equivalent to convergence, was crystallised for ordinary differential equations by Germund Dahlquist (1956) and for partial differential equations by Peter Lax and Robert Richtmyer (1956) in the same year, the structural parallel that organises the rest of this chapter.
Bibliography Master
@book{leveque2007fdm,
author = {LeVeque, Randall J.},
title = {Finite Difference Methods for Ordinary and Partial Differential Equations: Steady-State and Time-Dependent Problems},
publisher = {Society for Industrial and Applied Mathematics (SIAM)},
year = {2007}
}
@book{sulimayers2003,
author = {S\"{u}li, Endre and Mayers, David F.},
title = {An Introduction to Numerical Analysis},
publisher = {Cambridge University Press},
year = {2003}
}
@book{hnw1993,
author = {Hairer, Ernst and N\o{}rsett, Syvert P. and Wanner, Gerhard},
title = {Solving Ordinary Differential Equations I: Nonstiff Problems},
edition = {2},
series = {Springer Series in Computational Mathematics},
volume = {8},
publisher = {Springer-Verlag},
year = {1993}
}
@book{butcher2016,
author = {Butcher, John C.},
title = {Numerical Methods for Ordinary Differential Equations},
edition = {3},
publisher = {John Wiley \& Sons},
year = {2016}
}
@book{euler1768institutiones,
author = {Euler, Leonhard},
title = {Institutiones calculi integralis, Volumen Primum},
publisher = {Impensis Academiae Imperialis Scientiarum},
address = {Petropoli (St. Petersburg)},
year = {1768}
}
@article{runge1895,
author = {Runge, Carl},
title = {\"{U}ber die numerische Aufl\"{o}sung von Differentialgleichungen},
journal = {Mathematische Annalen},
volume = {46},
number = {2},
year = {1895},
pages = {167--178}
}
@article{kutta1901,
author = {Kutta, Wilhelm},
title = {Beitrag zur n\"{a}herungsweisen Integration totaler Differentialgleichungen},
journal = {Zeitschrift f\"{u}r Mathematik und Physik},
volume = {46},
year = {1901},
pages = {435--453}
}
@article{butcher1963,
author = {Butcher, John C.},
title = {Coefficients for the study of Runge-Kutta integration processes},
journal = {Journal of the Australian Mathematical Society},
volume = {3},
number = {2},
year = {1963},
pages = {185--201}
}
@article{dahlquist1956,
author = {Dahlquist, Germund},
title = {Convergence and stability in the numerical integration of ordinary differential equations},
journal = {Mathematica Scandinavica},
volume = {4},
year = {1956},
pages = {33--53}
}