Interpolation error and the Runge phenomenon
Anchor (Master): Trefethen 2019 *Approximation Theory and Approximation Practice* extended ed. (SIAM) Ch. 11-15 (the Lebesgue constant, the Runge phenomenon, potential theory and the equilibrium measure, exponential vs divergent convergence); Davis 1975 *Interpolation and Approximation* (Dover) Ch. 3-4 (the remainder, the contour-integral Hermite formula, and the Lebesgue function); Runge 1901 *Zeitschrift für Mathematik und Physik* 46 (the original divergence example)
Intuition Beginner
In the previous unit you learned that through any handful of points with different horizontal positions there passes exactly one polynomial of a matching degree. That settles which curve you get. This unit asks the next question: if the points were sampled from some true function, how far does the interpolating curve stray from the true function in the gaps between the points?
The answer has a pleasing shape. The mismatch at a position is a product of two things. One is a measure of how curvy the true function is — captured by a high derivative of the function, the part that a polynomial of limited degree simply cannot imitate. The other is a single polynomial you can read straight off the node positions: it is the product of the distances from to each node. Call it the node product. It is zero exactly at the nodes, which matches the fact that the curve hits every data point dead on, and it grows as you move away from the cluster of nodes.
You only get to choose one of these two factors. The curviness of the function is fixed once the function is given. But the node product depends entirely on where you place the sample points, and that is yours to decide. So the whole craft of accurate interpolation comes down to placing nodes so that the node product stays small across the interval.
Here is the surprise that makes this unit worth a chapter of its own. The most natural choice — spread the nodes out evenly — is a bad one. As you add more evenly spaced points, the curve can start to swing wildly near the ends of the interval, overshooting the true function by more and more, even when the true function is perfectly smooth. This runaway wobble is called the Runge phenomenon, and the cure is to bunch the nodes toward the ends rather than spacing them evenly.
Visual Beginner
Picture the true function as a calm curve and the interpolating polynomial as a curve forced to pass through the chosen sample dots. Between dots the two curves can part ways. The table below tracks the node product for five evenly spaced nodes on the interval from to : it is zero at each node and largest in the gaps, and the gaps near the two ends are where it swells the most.
| position | nearest behaviour | size of the node product |
|---|---|---|
| at a node | curve meets the true function | |
| middle gap, near centre | small mismatch | small |
| outer gap, near an end | large mismatch | large |
Each row is the same message read off the node product: the error is pinned to zero at every node and is free to grow in between, and with evenly spaced nodes it grows fastest near the ends. Moving the nodes closer together near the ends flattens those tall outer humps, which is the whole idea behind the cure.
Worked example Beginner
Let us measure the worst-case error of fitting a straight line through two points of the curve on the interval from to . Take the nodes and .
Step 1. Build the line through the two points. At , ; at , . The line through and is .
Step 2. Write down the exact mismatch. The error is . Notice this is exactly the node product for these two nodes, and it is zero at both nodes as expected.
Step 3. Find where the mismatch is biggest. The product is a downward parabola with its low point halfway between the nodes, at . There the value is , so the line sits unit above the curve at the midpoint.
Step 4. Compare with the error formula. The formula says the error equals a second derivative of , divided by , times the node product. For the second derivative is the constant , so the formula gives . This matches the exact mismatch from Step 2 perfectly.
What this tells us: even a function as tame as cannot be matched by a line except at the two nodes, and the formula predicts the gap exactly — a fixed curviness factor times the node product, biggest right in the middle of the gap.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, and are pairwise distinct nodes; is the unique interpolating polynomial of at these nodes (existence and uniqueness are the unisolvence theorem of 43.08.01). The notation means has continuous derivatives through order on ; denotes the -th derivative.
Definition (node polynomial and interpolation error). The node polynomial (or nodal polynomial) for the nodes is the monic degree- polynomial $$ \pi_{n+1}(x) = \prod_{k=0}^{n} (x - x_k) . $$ It is the unique monic element of vanishing at all nodes. The interpolation error (or remainder) is the function . It vanishes at every node, so is divisible by as a relation on the node set.
Definition (Lebesgue function and Lebesgue constant). With the Lagrange cardinal polynomials of 43.08.01, the Lebesgue function for the node set is
$$
\lambda_n(x) = \sum_{k=0}^{n} \lvert L_k(x) \rvert ,
$$
and the Lebesgue constant is its maximum,
$$
\Lambda_n = \max_{x \in [a,b]} \lambda_n(x) = \max_{x \in [a,b]} \sum_{k=0}^{n} \lvert L_k(x) \rvert .
$$
The interpolation operator , , is linear and idempotent (a projection); is precisely its operator norm in the supremum norm . The best approximation error of in is , the quantity owned by 43.08.04.
The symbols , , the supremum norm , the divided difference , and are recorded in _meta/NOTATION.md; the divided-difference recurrence and the node polynomial are inherited from 43.08.01.
Counterexamples to common slips Intermediate+
"The interpolation error is bounded by best approximation error." It is not bounded by it — it is bounded by times it. The Lebesgue constant is the amplification factor between the best a polynomial of degree can do and what interpolation at a fixed node set actually does. For equispaced nodes grows faster than any power of , so a small best-approximation error can coexist with a large interpolation error.
"A smooth (even analytic) function is always interpolated convergently." Runge's example is real-analytic on yet its equispaced interpolants diverge in the sup norm. Convergence depends on the interplay between the function's complex-analytic singularities and the node distribution, not on real-line smoothness alone.
"The intermediate point in the error formula is the same for every ." The point depends on the evaluation point and lies in the smallest interval containing and all the nodes. The formula is pointwise; only when one then bounds by its maximum does the -dependence of drop out.
"Equispaced nodes minimise the node polynomial." They do the opposite near the endpoints: for equispaced nodes is exponentially larger than for the Chebyshev nodes that minimise it. The node polynomial is the controllable factor, and equispacing is a poor control choice (
43.08.04).
Key theorem with proof Intermediate+
The signature result is the interpolation error theorem: it expresses the remainder as a product of an -st derivative of and the node polynomial. The derivative factor measures the part of that no degree- polynomial can capture; the node polynomial is the factor the analyst controls through node placement. The proof is a single application of Rolle's theorem 02.05.02, iterated times against an auxiliary function rigged to have one extra zero. This follows Süli-Mayers [Süli-Mayers §6.4].
Theorem (interpolation error / Cauchy remainder). Let and let interpolate at the distinct nodes . For every there exists a point in the open interval — the smallest open interval containing and all the nodes — such that $$ f(x) - p_n(x) = \frac{f^{(n+1)}(\xi)}{(n+1)!},\pi_{n+1}(x), \qquad \pi_{n+1}(x) = \prod_{k=0}^{n}(x - x_k). $$
Proof. If coincides with a node, both sides vanish and there is nothing to prove; fix not equal to any node. Define the constant $$ G = \frac{f(x) - p_n(x)}{\pi_{n+1}(x)}, $$ which is well-defined because for off the node set, and form the auxiliary function $$ \phi(t) = f(t) - p_n(t) - G,\pi_{n+1}(t), \qquad t \in [a,b]. $$ Then , since is and are polynomials. At each node , both and , so . At , the definition of gives . Thus vanishes at the distinct points .
Apply Rolle's theorem on each of the consecutive subintervals determined by these zeros: between any two adjacent zeros of there is a zero of , giving at least distinct zeros in . Repeating on yields with at least zeros, and after such steps has at least one zero . Differentiating exactly times: has , while is monic of degree , so . Hence $$ 0 = \phi^{(n+1)}(\xi) = f^{(n+1)}(\xi) - G,(n+1)!, $$ so . Substituting back into the definition of gives .
Corollary (uniform error bound). If , then
$$
\lVert f - p_n \rVert_\infty \le \frac{M_{n+1}}{(n+1)!},\lVert \pi_{n+1} \rVert_\infty .
$$
The bound factorises into a function-dependent part and a node-dependent part , the latter the sole quantity under the analyst's control — the entry point for node optimisation in 43.08.04.
Bridge. This theorem is the foundational reason node placement is the whole game: the error splits into a derivative factor fixed by and the node polynomial fixed by the analyst, and the corollary makes minimising the one lever available. This is exactly the optimisation that the Chebyshev theory solves, so the result builds toward the minimax node placement of 43.08.04, and the node polynomial appears again in the quadrature error of 43.09.01, where the same product governs how well approximates . The Rolle-iteration mechanism generalises the single mean value theorem of 02.05.02 from one secant to nested differences, and the leading-coefficient identity is dual to the divided-difference reading of the remainder: putting these together, , so the error constant is itself a confluent divided difference, the central insight tying this unit to the divided-difference calculus of 43.08.01.
Exercises Intermediate+
Advanced results Master
The pointwise remainder fixes the local error; the master layer concerns the global behaviour as the degree grows — when interpolation converges, when it diverges, and the single quantity, the Lebesgue constant, that adjudicates.
Theorem 1 (Lebesgue inequality and the role of node distribution). For any node set, , with the Lebesgue constant of the nodes and the best uniform approximation error. By the Weierstrass theorem for every , so interpolation converges uniformly whenever . The Lebesgue constant depends only on the nodes, not on : for equispaced nodes on it grows as , faster than any polynomial, while for Chebyshev nodes grows only logarithmically [Trefethen Ch. 15]. The contrast is the precise content of the slogan that node placement, not node count, governs convergence.
Theorem 2 (the Runge phenomenon). Equispaced polynomial interpolation of on diverges: as , with the error growing geometrically and concentrated near the endpoints , even though is real-analytic on [Runge 1901]. The mechanism is twofold. The function-side factor in the remainder grows like governed by the distance from to the poles of in the complex plane. The node-side factor for equispaced nodes grows like relative to its value near the centre but is exponentially larger near the endpoints; the product diverges precisely where the equispaced node polynomial swells. Potential theory makes this exact: the limiting node density of equispaced points is uniform, whose logarithmic potential is non-constant on , so the node polynomial cannot be uniformly small.
Theorem 3 (potential-theoretic convergence criterion). Let the nodes be drawn with limiting density (a probability measure on ), and let be its logarithmic potential. Equispaced-node interpolation converges geometrically for functions analytic in the region and diverges outside it; the convergence region is bounded by the level curve of through the endpoints. For the equilibrium (arcsine) measure — the limiting density of Chebyshev nodes — is constant on , the level curves are confocal ellipses (Bernstein ellipses), and interpolation converges geometrically for any function analytic in a neighbourhood of the interval [Trefethen Ch. 12]. The uniform density of equispaced nodes is not the equilibrium measure, which is the root cause of Runge divergence; the arcsine clustering of Chebyshev nodes toward the endpoints is exactly the correction.
Theorem 4 (divided-difference remainder and confluence). The remainder admits the divided-difference form , valid for off the node set and extending by continuity to all when [Stoer-Bulirsch §2.1.4]. The divided difference on points equals by the mean-value property, and as the evaluation point migrates to a node the formula passes continuously to the Hermite (repeated-node) remainder of 43.08.03, where derivative data is matched and the node polynomial acquires repeated factors . The divided difference is thus the analytic object unifying the Lagrange remainder, the Hermite remainder, and the coalescent limit established in 43.08.01.
Synthesis. The error theorem is the foundational reason interpolation accuracy is a question of geometry rather than of smoothness alone: the remainder factorises into a derivative of and the node polynomial , and the Lebesgue constant packages the same node-dependence into the operator norm of the interpolation projection, so the convergence theory is the study of one node-determined quantity. The central insight is that node placement controls and at once: equispaced nodes make the node polynomial swell exponentially and leave super-polynomial, while Chebyshev nodes make minimal and merely logarithmic — this is exactly the extremal problem solved in 43.08.04, and it is dual to the potential-theoretic statement that the equilibrium measure of the interval is the arcsine law, not the uniform law. Putting these together, the Runge phenomenon is not a failure of polynomials but a failure of equispaced sampling: the factor is governed by the complex singularities of through the Bernstein-ellipse picture, the node-side factor by the limiting node density, and divergence occurs when the analyticity region of omits the convergence region the node distribution allows. The bridge is that the arcsine node density at once minimises the node polynomial, tames the Lebesgue constant, and places the Gauss quadrature nodes of 43.09.03 — one geometric principle, clustering toward the endpoints, which generalises the local Rolle-based remainder proved here into the section's global approximation theory.
Full proof set Master
Proposition 1 (interpolation error theorem, Rolle form). For and the interpolant at distinct nodes , every admits with .
Proof. For equal to a node both sides vanish; fix off the node set and set . The function lies in and vanishes at the distinct points (at the nodes because and both vanish there, at by the choice of ). Ordering these zeros and applying Rolle's theorem to on each of the adjacent gaps produces zeros of in ; iterating, has zeros, so has at least one zero . Since and (monic of degree ), , whence and the claim follows.
Proposition 2 (divided-difference remainder). For distinct nodes and , for off the node set, and for some .
Proof. Let interpolate at . Its Newton form on these ordered points is . The first sum is the Newton form of the interpolant on , namely , and the last product is , so . Evaluating at , where , gives . Comparing with Proposition 1 and cancelling yields .
Proposition 3 (Lebesgue constant as operator norm and the near-best bound). The Lebesgue constant satisfies , and consequently .
Proof. For , gives , so . At a point with , pick , , with ; then , so , giving equality. For the near-best bound, let attain ; since , , and the triangle inequality with gives .
Proposition 4 (equispaced node polynomial swells near the endpoints). For the equispaced nodes on , the ratio of near an endpoint to its value near the centre grows exponentially in .
Proof. Compare the centre with a point near an endpoint, . Estimate by recognising it as a Riemann sum for up to a correction bounded in , where is the logarithmic potential of the uniform density on . A direct integration gives . Evaluating at the two points, , and . Their difference is , so is strictly larger near the endpoint than at the centre. Hence , and the ratio grows like . The endpoint swelling of the equispaced node polynomial is therefore exponential in the degree, the node-side driver of the Runge divergence; the arcsine (Chebyshev) density replaces the non-constant by a potential constant on , removing the swelling.
Connections Master
The node polynomial whose sup norm the error corollary isolates is minimised over all monic degree- polynomials by the scaled Chebyshev polynomial , whose roots are the optimal interpolation nodes; this extremal problem, the equioscillation characterisation, and the resulting logarithmic Lebesgue constant are the content of
43.08.04, which closes the Runge loop opened here by replacing equispaced nodes with the arcsine-clustered Chebyshev nodes.The divided-difference remainder proved here passes continuously, as the evaluation point migrates to a node, into the Hermite (repeated-node) remainder with node polynomial ; the confluent divided differences and the piecewise-cubic construction that defeats Runge by lowering the local degree are developed in
43.08.03, which inherits both the error mechanism and the node polynomial of this unit.Integrating the interpolant in place of turns the interpolation error into the quadrature error: , so the same node polynomial governs how accurately a Newton-Cotes rule integrates, and the Peano-kernel representation of the quadrature error in
43.09.01is the integrated form of the pointwise remainder established here; the instability of high-order equispaced Newton-Cotes mirrors the Runge divergence through the identical node-polynomial swelling.The error theorem rests entirely on iterated Rolle applied to an auxiliary function, the same mechanism by which the Taylor remainder is derived in
02.05.02; the Lagrange remainder is the interpolation generalisation of the Taylor remainder , recovered in the confluent limit where all nodes coalesce to , tying numerical interpolation to the differential calculus of single-variable analysis.
Historical & philosophical context Master
The pointwise interpolation remainder in the form was established by Augustin-Louis Cauchy in the 1840s, building on the Lagrange-remainder technique Cauchy had himself made rigorous for Taylor's theorem; the auxiliary-function-and-Rolle argument is the natural extension of the mean-value reasoning to interpolation conditions. The divided-difference form of the remainder and the identification of divided differences with derivatives via the mean-value property are due to the nineteenth-century difference calculus, with the integral representation supplied by Charles Hermite and Angelo Genocchi in the 1870s.
The divergence example is Carl Runge's 1901 paper in the Zeitschrift für Mathematik und Physik, where he showed that equispaced interpolation of on a sufficiently wide symmetric interval (equivalently on ) diverges as the degree increases, and traced the cause to the location of the function's complex poles relative to the interval [Runge 1901]. The quantitative theory matured through the twentieth century: Henri Lebesgue introduced the function and constant now bearing his name as the operator norm of the interpolation projection; Sergei Bernstein and the potential-theory school connected convergence to the equilibrium measure of the interval and the Bernstein-ellipse analyticity regions; and the modern synthesis, including the sharp asymptotics for equispaced nodes against for Chebyshev nodes, is presented by Lloyd N. Trefethen [Trefethen Ch. 15]. Runge's example is the standard cautionary instance that smoothness on the real line does not guarantee convergence of high-degree equispaced interpolation.
Bibliography Master
@book{sulimayers2003,
author = {S\"{u}li, Endre and Mayers, David F.},
title = {An Introduction to Numerical Analysis},
publisher = {Cambridge University Press},
year = {2003}
}
@book{stoerbulirsch2002,
author = {Stoer, Josef and Bulirsch, Roland},
title = {Introduction to Numerical Analysis},
edition = {3},
publisher = {Springer},
year = {2002}
}
@book{trefethen2019,
author = {Trefethen, Lloyd N.},
title = {Approximation Theory and Approximation Practice, Extended Edition},
publisher = {SIAM},
year = {2019}
}
@book{davis1975,
author = {Davis, Philip J.},
title = {Interpolation and Approximation},
publisher = {Dover Publications},
year = {1975}
}
@article{runge1901,
author = {Runge, Carl},
title = {\"{U}ber empirische Funktionen und die Interpolation zwischen \"{a}quidistanten Ordinaten},
journal = {Zeitschrift f\"{u}r Mathematik und Physik},
volume = {46},
year = {1901},
pages = {224--243}
}
@article{cauchy1840interpolation,
author = {Cauchy, Augustin-Louis},
title = {Sur les fonctions interpolaires},
journal = {Comptes Rendus de l'Acad\'{e}mie des Sciences},
volume = {11},
year = {1840},
pages = {775--789}
}