Polynomial interpolation: existence, uniqueness, and the Lagrange form
Anchor (Master): Davis 1975 *Interpolation and Approximation* (Dover) Ch. 2-3 (the interpolation problem as a moment problem, Vandermonde theory, the remainder); Trefethen 2019 *Approximation Theory and Approximation Practice* extended ed. (SIAM) Ch. 5 (the barycentric formula and its numerical stability); Berrut-Trefethen 2004 *SIAM Review* 46(3) (barycentric Lagrange interpolation as the form to use in practice); Higham 2004 *IMA J. Numer. Anal.* 24(4) (numerical stability of barycentric interpolation)
Intuition Beginner
Suppose you have measured a quantity at a handful of separate inputs — the temperature at five times of day, say — and you want a smooth formula that hits every reading exactly and lets you guess the value in between. The oldest answer is to fit a single polynomial through the points. A polynomial is built from powers of , it is easy to evaluate, easy to differentiate, and easy to integrate, so once you have one passing through your data you can compute with it freely. This unit is about the one polynomial that threads your points, why it exists, why there is only one, and three different ways to write it down.
The first surprise is a counting fact. Two points determine a unique straight line. Three points determine a unique parabola. In general, if you have a certain number of points with all-different inputs, there is exactly one polynomial of a matching modest degree that passes through all of them. Not zero, not many — exactly one. The degree you need grows with the number of points: one point needs a constant, two need a line, and adding a point bumps the degree up by one.
The second idea is a clever way to build that polynomial by hand. For each data point you cook up a helper polynomial that equals at that point's input and at every other input. Multiply each helper by the reading you want there and add them up. Because each helper switches on at exactly one input and switches off at all the others, the sum automatically lands on the right value at every input. These helpers are the Lagrange building blocks, and the recipe is the Lagrange formula.
There is a price to the most direct version of the recipe: if you later collect one more reading, you seem to have to rebuild everything from scratch. A second way of writing the same polynomial, due to Newton, fixes this by adding one correction term per new point, reusing all the earlier work. The two recipes always produce the identical curve; they are two notations for one object.
Visual Beginner
Picture five dots on graph paper, each at a different horizontal position. The interpolating polynomial is the single curve of the right degree that passes through all five dots at once. The table below shows the heart of the Lagrange construction for three nodes at horizontal positions : one helper curve per node, each rigged to read at its own node and at the others.
| node | helper value at | helper value at | helper value at |
|---|---|---|---|
| helper for | |||
| helper for | |||
| helper for |
Each row is a helper polynomial; the pattern of ones on the diagonal and zeros off it is the whole trick. When you scale row by the data value and add the rows, the diagonal of ones guarantees the total reads at node , so the combined curve threads every data dot.
Worked example Beginner
Let us fit a polynomial through the three points , , and . Three points call for a polynomial of degree at most two — a parabola or simpler.
Step 1. Build the helper for the node . It must read at and at the other inputs and . Start with , which is zero at and at . At this equals , so divide by to make it read there. The helper is .
Step 2. Build the helper for . It must be zero at and , so start with . At this is , so divide by . The helper is .
Step 3. Build the helper for . It must vanish at and , so start with . At this is , so divide by . The helper is .
Step 4. Scale each helper by its data value and add: . Multiplying out, , , and . Adding gives .
Step 5. Check. At , ; at , ; at , . All three match.
What this tells us: the helper recipe turns "pass through these points" into a short, mechanical computation, and the answer is one definite polynomial — here the parabola — that you can now evaluate anywhere you like.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, denotes or , and denotes the vector space of polynomials of degree at most with coefficients in ; , with the monomial basis . Write for the coefficient of in a polynomial (the leading-coefficient operator). The symbol denotes a finite product over the indicated index set.
Definition (the interpolation problem). Let be distinct points — the interpolation nodes — and let be prescribed values. A polynomial interpolates the data if for every . When the values arise from a function , so that , is the interpolating polynomial of at the nodes and is written .
The conditions are linear equations in the unknown coefficients of . In the monomial basis , the system is , where is the Vandermonde matrix (), , and . Solvability and uniqueness are therefore governed by .
Definition (Lagrange cardinal polynomials). For nodes , the Lagrange cardinal polynomials (or cardinal functions) are $$ L_k(x) = \prod_{\substack{j = 0 \ j \ne k}}^{n} \frac{x - x_j}{x_k - x_j}, \qquad k = 0, \dots, n. $$ Each , and by construction (the Kronecker delta: if , else ), because the numerator kills at every node but , and the denominator normalises its value at to . The Lagrange form of the interpolant is $$ p_n(x) = \sum_{k=0}^{n} f_k, L_k(x). $$
Definition (divided differences and the Newton form). The divided differences of on the nodes are defined recursively. The zeroth order is , and for , $$ f[x_i, x_{i+1}, \dots, x_{i+k}] = \frac{f[x_{i+1}, \dots, x_{i+k}] - f[x_i, \dots, x_{i+k-1}]}{x_{i+k} - x_i}. $$ With the Newton basis and for , the Newton form of the interpolant is $$ p_n(x) = \sum_{k=0}^{n} f[x_0, \dots, x_k], \pi_k(x) = f[x_0] + f[x_0,x_1](x - x_0) + \cdots + f[x_0,\dots,x_n]\prod_{j=0}^{n-1}(x - x_j). $$ The leading coefficient is the top divided difference. Adding a new node appends a single term without disturbing the earlier coefficients — the incremental property that the Lagrange form lacks.
Definition (barycentric weights and form). The barycentric weights are . The second (true) barycentric form of the interpolant, valid at any that is not a node, is $$ p_n(x) = \left. \sum_{k=0}^{n} \frac{w_k}{x - x_k}, f_k ;\middle/; \sum_{k=0}^{n} \frac{w_k}{x - x_k} \right. . $$
Counterexamples to common slips Intermediate+
"Repeated nodes are fine if the values agree." The existence-uniqueness theory requires the nodes to be pairwise distinct; with the Vandermonde matrix has two equal rows and is singular, and the cardinal denominators contain a zero factor. Matching a value and a derivative at a coincident node is Hermite interpolation, a separate construction (forward-pointed to
43.08.03)."A higher-degree polynomial also works, so the interpolant is not unique." Uniqueness is asserted within — degree at most . There are infinitely many higher-degree polynomials through the same points; the theorem pins down the unique one of degree , equivalently the unique representative modulo the node polynomial .
"The Lagrange and Newton forms give different polynomials." They are two coordinate systems for the same unique interpolant. The Lagrange form expands it in the cardinal basis ; the Newton form expands it in the basis . Both equal as functions on all of .
"The naive Lagrange formula is unstable, so avoid Lagrange interpolation." The representation is sound; only the textbook evaluation , which recomputes each from scratch, is wasteful and can lose accuracy. The barycentric rewrite of the identical polynomial restores -per-point evaluation and numerical stability (master tier).
Key theorem with proof Intermediate+
The signature result is the unisolvence theorem: through distinct nodes there passes one and only one polynomial of degree at most . Existence is delivered constructively by the Lagrange form; uniqueness rests on the fact that a nonzero polynomial of degree at most cannot have roots. The two facts together say the evaluation map is a linear isomorphism, which is why every later representation describes the same object.
Theorem (unisolvence of polynomial interpolation). Let be pairwise distinct and let be arbitrary. There exists exactly one polynomial with for all . Equivalently, the evaluation map , , is a linear isomorphism, and the Vandermonde determinant is $$ \det V = \prod_{0 \le i < j \le n} (x_j - x_i) \ne 0. $$
Proof. Existence. The Lagrange combination lies in , being a sum of degree- cardinal polynomials. Evaluating at a node and using gives . So an interpolant exists.
Uniqueness. Suppose both interpolate the data. Their difference satisfies for all distinct nodes. A nonzero polynomial of degree at most has at most roots, so a polynomial of degree vanishing at distinct points is the zero polynomial. Hence and .
The Vandermonde determinant. The map is linear between spaces of equal dimension , so it is an isomorphism precisely when it is injective; injectivity is the uniqueness just shown (the kernel is the set of degree- polynomials vanishing at all nodes, namely ). In the monomial basis is the matrix with , so . The explicit value follows by induction on : subtract times column from column (top-down), factor from each row , and reduce to the Vandermonde on ; the row factors accumulate to , and the induction yields . Distinctness of the nodes makes every factor nonzero.
Corollary (the Newton leading coefficient). The coefficient of in the interpolant of at equals the top divided difference: . Consequently the divided difference is a symmetric function of its arguments, since the interpolant — and hence its leading coefficient — does not depend on the order in which the nodes are listed. The recurrence then computes the whole Newton form in arithmetic from a triangular table, and adding a node costs one further pass.
Bridge. The unisolvence theorem is the foundational reason every representation in this chapter describes one object: existence-as-construction (the Lagrange cardinal basis) and uniqueness-as-rank (no degree- polynomial has roots) together say the evaluation map is an isomorphism, so the Lagrange, Newton, and barycentric formulas are change-of-basis images of a single interpolant — this is exactly the statement that and are two bases for adapted to the same nodes. The result builds toward the interpolation error theorem of 43.08.02, where the node polynomial — visible here as the unique element of vanishing at all nodes — controls , and it appears again in the quadrature rules of 43.09.01, which integrate in place of . The central insight is that interpolation is a linear-isomorphism statement about dressed in three notations, which generalises the Vandermonde nonsingularity into the broader theory of unisolvent (Haar) systems and is dual to the moment problem of reconstructing a measure from its action on . Putting these together, the choice among the three forms is governed entirely by the computational task — cheap incremental updates favour Newton, stable repeated evaluation favours barycentric — never by which "polynomial" results.
Exercises Intermediate+
Advanced results Master
The elementary theory fixes the interpolant; the master layer concerns how the three representations differ in cost and numerical behaviour, how the divided difference encodes analytic data, and the structural reason the barycentric form is the one to compute with.
Theorem 1 (equivalence and cost of the three representations). The Lagrange, Newton, and second barycentric formulas all evaluate to the unique interpolant . Their construction-and-evaluation costs differ: naive Lagrange recomputes products of factors at each evaluation point, costing per point; the Newton form costs once to build the divided-difference table and per point by Horner's rule on the nested products, with incremental cost to add a node; the barycentric form costs once to precompute the weights and per point thereafter, and a new node updates all weights in by dividing each existing by and forming directly [Berrut-Trefethen 2004].
Theorem 2 (numerical stability of the barycentric form). The second barycentric formula is backward stable for evaluating : the computed value is the exact interpolant of slightly perturbed data with bounded by a small multiple of unit roundoff times the Lebesgue-type amplification, for any node distribution; for nodes whose Lebesgue constant grows slowly (Chebyshev points), the formula is forward accurate as well. The naive Lagrange and the Newton forms can suffer cancellation that the barycentric quotient structure avoids, because the same potentially large factor appears in numerator and denominator and cancels [Higham 2004]. This overturns the classical verdict that Lagrange interpolation is numerically inferior: the representation is excellent once written barycentrically; only the textbook evaluation rule was at fault.
Theorem 3 (divided differences as derivatives and the Hermite-Genocchi formula). Divided differences are the discrete analogue of derivatives, and they limit to them: if all nodes coalesce, . The exact statement is the Hermite-Genocchi integral
$$
f[x_0, \dots, x_n] = \int_{\Sigma_n} f^{(n)}!\Big(x_0 + \sum_{i=1}^n t_i (x_i - x_0)\Big), dt,
$$
where is the standard simplex (volume ). The formula holds for , makes the symmetry of the divided difference in its arguments manifest, and shows divided differences depend continuously on the nodes — so the Newton form extends smoothly to the confluent (Hermite) case where nodes repeat and derivative data is matched, the construction owned by 43.08.03.
Theorem 4 (conditioning and the basis question). Solving the interpolation problem through the Vandermonde system in the monomial basis is ill-conditioned: the condition number of grows exponentially in for equispaced nodes on a real interval, so recovering the monomial coefficients is numerically hopeless even though the interpolant itself is well-determined. The remedy is never to form the monomial coefficients: the Newton basis triangularises the system (its matrix is lower-triangular, solved by the divided-difference recurrence) and the cardinal/barycentric basis diagonalises it (the coefficients are the data values themselves). The instability is an artifact of the monomial coordinate choice, not of the interpolation problem; the conditioning of the map data value is governed instead by the Lebesgue constant (forward cross-ref to the conditioning theory of 43.01.01 and the node-placement cure of 43.08.04) [Higham 2004].
Synthesis. The unisolvence theorem is the foundational reason interpolation, linear algebra, and approximation are one subject seen from three angles: the evaluation map is an isomorphism, and the Lagrange, Newton, and barycentric formulas are three bases of adapted to the nodes — cardinal (diagonalising), Newton (triangularising), monomial (the ill-conditioned ambient frame one must avoid). The central insight is that the interpolant is uniquely and stably determined while its monomial coordinates are not, so the entire practical theory is the discipline of computing with through a well-conditioned basis; this is exactly the lesson the SVD teaches for least-squares in 01.01.12, where the right orthonormal basis tames a problem the naive normal-equation basis renders ill-conditioned, and it is dual to the moment problem, where one reconstructs the action of a functional on from finitely many samples. Putting these together, the node polynomial threads the chapter: it is the kernel of the evaluation map on , supplies the barycentric weights through , is the controllable factor in the interpolation error of 43.08.02, and its optimal root placement is the Chebyshev story of 43.08.04. The bridge is that choosing nodes to minimise at once cures the Runge divergence, minimises the Lebesgue constant, and supplies the Gauss quadrature points of 43.09.03 — one extremal problem behind the whole edifice, which generalises the finite-dimensional isomorphism proved here into the asymptotic approximation theory the section develops.
Full proof set Master
Proposition 1 (Vandermonde determinant). For pairwise distinct nodes , the Vandermonde matrix with has .
Proof. Induct on . For , (empty product). For the step, perform the column operations for (descending, so each uses the un-modified previous column); these leave the determinant unchanged. Row becomes , and row has entries in column . Expanding along row reduces to the determinant of the minor with entries for , . Factoring from each row gives , where is the Vandermonde matrix on . By the inductive hypothesis , and combining the two products gives . Pairwise distinctness makes every factor nonzero, so and the interpolation system is uniquely solvable.
Proposition 2 (cardinal partition of unity and the second barycentric identity). The cardinal polynomials satisfy , and consequently for not a node, yielding the second barycentric form.
Proof. The polynomial takes the value at every node , so it interpolates the constant data ; the constant polynomial does too, and by Proposition-level uniqueness (the unisolvence theorem) the two agree identically, giving . Writing each (cancellation of the common factor, as and ), the identity becomes , i.e. . The first barycentric form is ; dividing it by cancels and gives .
Proposition 3 (divided-difference recurrence and Newton form). The recurrence holds, and the Newton form is the interpolant on .
Proof. Define , the leading coefficient of the degree- interpolant on . Let interpolate on and on , both in . The polynomial lies in and interpolates all of : at interior the bracket is ; at only the -term survives and equals ; at only the -term survives and equals . By unisolvence . The coefficient of in is , which is the recurrence. For the Newton form, observe vanishes at (where both interpolate ), so it is a scalar multiple of ; matching the leading coefficient, . Telescoping from to gives .
Proposition 4 (Hermite-Genocchi representation). For on an interval containing the distinct nodes, , with the standard simplex.
Proof. Induct on . For , the fundamental theorem of calculus gives . For the step, write the inner integration over on the simplex: holding fixed, integrating in from to produces, by the fundamental theorem, a difference of evaluated at the two endpoints, which are the simplex-convex combinations on the node sets and . The remaining -fold simplex integral of each is, by the inductive hypothesis, and respectively; their difference divided by is the divided-difference recurrence (Proposition 3, using symmetry to order the arguments), giving . Symmetry in the arguments is visible directly: the integrand depends on the nodes only through the convex combination, which is symmetric under relabelling, and the limit recovers since .
Connections Master
The node polynomial introduced here is the controllable factor in the interpolation error theorem developed at
43.08.02; that unit proves the remainder by applying Rolle's theorem to an auxiliary function and identifies the Runge phenomenon as the failure of to shrink for equispaced nodes. The divided-difference-as-derivative limit proved here (Proposition 4) is exactly the mechanism by which the error constant equals a confluent divided difference.The confluent (repeated-node) limit of the Newton form, where the divided difference matches derivative data via the Hermite-Genocchi continuity of Proposition 4, is the construction of Hermite and piecewise/spline interpolation in
43.08.03; the generalized divided differences with repeated arguments are the coefficients of the Hermite-Newton form, and the cubic spline assembles local interpolants of this kind into a global tridiagonal system.The optimal placement of the roots of to minimise — the extremal problem flagged in the synthesis — is the Chebyshev-node theory of
43.08.04, where the monic Chebyshev polynomial is shown to be the minimal-sup-norm monic degree- polynomial, so its roots are the interpolation nodes that defeat the Runge divergence and minimise the Lebesgue constant governing the conditioning of the interpolation map.Integrating the interpolant in place of produces the interpolatory quadrature rules of
43.09.01: the Lagrange form gives , so the quadrature weights are the integrated cardinal polynomials, and the Newton-Cotes family is precisely this construction at equispaced nodes; the Gauss rules of43.09.03choose the nodes as roots of orthogonal polynomials, the same node-placement freedom that the barycentric weight identity exploits.
Historical & philosophical context Master
Polynomial interpolation predates the calculus of finite differences that systematised it. The cardinal-function formula is named for Joseph-Louis Lagrange, who published it in his 1795 Leçons élémentaires sur les mathématiques lectures at the École Normale, though Edward Waring had given the same construction in 1779 in the Philosophical Transactions and Leonhard Euler had used an equivalent device earlier; the attribution is one of the standard misnamings in the subject [Lagrange 1795]. The divided-difference calculus and the incremental form belong to Isaac Newton, whose Principia (1687, Book III, Lemma V) and the earlier Methodus differentialis (circulated 1676, published 1711) constructed interpolating polynomials through forward and divided differences precisely to interpolate astronomical and tabular data, decades before Lagrange [Newton 1711].
The Vandermonde determinant carries the name of Alexandre-Théophile Vandermonde, whose 1772 Mémoire sur l'élimination treated symmetric functions and determinants, although the explicit product formula for what is now called his matrix does not appear in his work in that form — another instance of attributive drift. The integral representation of divided differences is due to Charles Hermite and Angelo Genocchi in the 1870s, tying the discrete difference calculus to the integral remainder. The barycentric formula, latent in the nineteenth-century literature, was isolated as the numerically preferred representation by Carl Runge and made the centrepiece of modern practice by Jean-Paul Berrut and Lloyd N. Trefethen in their 2004 SIAM Review survey, with the backward-stability analysis supplied by Nicholas Higham the same year [Berrut-Trefethen 2004]; their rehabilitation of the Lagrange form reversed a century of textbook preference for the Newton scheme on stability grounds.
Bibliography Master
@book{sulimayers2003,
author = {S\"{u}li, Endre and Mayers, David F.},
title = {An Introduction to Numerical Analysis},
publisher = {Cambridge University Press},
year = {2003}
}
@book{stoerbulirsch2002,
author = {Stoer, Josef and Bulirsch, Roland},
title = {Introduction to Numerical Analysis},
edition = {3},
publisher = {Springer},
year = {2002}
}
@article{berruttrefethen2004,
author = {Berrut, Jean-Paul and Trefethen, Lloyd N.},
title = {Barycentric {L}agrange Interpolation},
journal = {SIAM Review},
volume = {46},
number = {3},
year = {2004},
pages = {501--517}
}
@article{higham2004,
author = {Higham, Nicholas J.},
title = {The numerical stability of barycentric {L}agrange interpolation},
journal = {IMA Journal of Numerical Analysis},
volume = {24},
number = {4},
year = {2004},
pages = {547--556}
}
@book{davis1975,
author = {Davis, Philip J.},
title = {Interpolation and Approximation},
publisher = {Dover Publications},
year = {1975}
}
@book{trefethen2019,
author = {Trefethen, Lloyd N.},
title = {Approximation Theory and Approximation Practice, Extended Edition},
publisher = {SIAM},
year = {2019}
}
@book{lagrange1795,
author = {Lagrange, Joseph-Louis},
title = {Le\c{c}ons \'{e}l\'{e}mentaires sur les math\'{e}matiques, donn\'{e}es \`{a} l'\'{E}cole Normale},
publisher = {S\'{e}ances des \'{E}coles Normales},
year = {1795}
}
@book{newton1711,
author = {Newton, Isaac},
title = {Methodus differentialis},
publisher = {London},
year = {1711}
}