43.09.03 · numerical-analysis / 09-numerical-quadrature

Gauss quadrature via orthogonal polynomials

shipped3 tiersLean: none

Anchor (Master): Davis-Rabinowitz 1984 *Methods of Numerical Integration* 2e (Academic Press) Ch. 2-3 (Gauss rules, the orthogonal-polynomial connection, the Radau and Lobatto variants, and the Stieltjes error theory); Gautschi 2004 *Orthogonal Polynomials: Computation and Approximation* (Oxford) Ch. 1-3 (the Jacobi matrix, the Golub-Welsch eigenvalue algorithm, and the classical Gauss families); Gautschi 1981 'A survey of Gauss-Christoffel quadrature formulae' in *E. B. Christoffel* (Birkhäuser) (the comprehensive survey of Gauss-Christoffel rules, the Radau/Lobatto modifications, and the weight-positivity and convergence theory)

Intuition Beginner

When you estimate the area under a curve from a handful of sample heights, you face two separate choices: where to put the sample points, and how much weight to give each one. The classical rules from the previous unit fix the points in advance — equally spaced — and only get to choose the weights. That throws away half of your freedom. The Gauss idea is to choose both at once: pick the cleverest possible sample locations and the cleverest possible weights together, so the rule squeezes out as much accuracy as a fixed number of samples can possibly buy.

How much does that extra freedom win you? With samples you have locations and weights to set, so knobs in total. Each knob can be tuned to make the rule exact on one more polynomial. A rule with equally spaced points integrates polynomials up to degree about exactly; a Gauss rule with the same points reaches degree . You roughly double the accuracy for the same number of function evaluations, which is why Gauss rules are the workhorse behind most automatic integration software.

The surprise is where the best sample points turn out to sit. They are not spread evenly, and they are not chosen by trial and error. They are the roots of a special polynomial — the one built to be perpendicular, in a weighted-average sense, to all lower-degree polynomials. Those orthogonal polynomials came up when fitting curves by least squares, and the same family reappears here as the secret map of where to sample. One construction, two uses.

A second piece of good news is that the weights always come out positive. Positive weights mean the rule averages your sample heights rather than amplifying their errors, so unlike the high-point evenly-spaced rules, Gauss rules stay stable no matter how many points you use. More points always means more accuracy, never a blow-up.

Visual Beginner

Picture the same interval sampled two ways. The evenly spaced rule drops its points at fixed, regular positions. The Gauss rule pulls its points inward, away from the endpoints, clustering them where they do the most good — exactly at the roots of the special perpendicular polynomial. With the same count of points, the Gauss arrangement integrates far higher-degree curves with no error.

The table contrasts the two families by what they fix and what they win.

rule family points weights highest degree exact ( points)
evenly spaced (Newton-Cotes) fixed, equal spacing chosen about
Gauss chosen (orthogonal-polynomial roots) chosen

Reading across: the Gauss rule spends its extra freedom — the choice of where to sample — and is repaid with roughly twice the exact degree. The points land at the orthogonal-polynomial roots, which always sit strictly inside the interval.

Worked example Beginner

Let us build the two-point Gauss rule on the interval from to with the plain weight (every part of the interval counts equally), and check it integrates a cubic perfectly.

Step 1. Find the sample points. For two points the special perpendicular polynomial is . Its roots are where , namely and , about and . These are the two Gauss points — pulled inward from the endpoints.

Step 2. Find the weights. By symmetry the two weights are equal, and they must add up to the length of the interval, which is . So each weight is . The rule is: add the function value at and the function value at .

Step 3. Test it on . The true area under from to is , because the curve is odd and the two halves cancel. The rule gives , since the two cubes cancel. Match.

Step 4. Test it on . The true area is the value of at the endpoints: . The rule gives . Match again.

Step 5. Confirm the badge. Two points buy exactness up to degree . A check on and both passed, and by the same cancellation the rule is exact for every cubic.

What this tells us: with only two well-placed samples and two equal weights, the Gauss rule integrates every cubic exactly — the same feat Simpson's rule needs three points for. Choosing the points, not just the weights, is what pays for the extra degree.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is a fixed weight function on an interval (possibly infinite) with almost everywhere and all moments finite, exactly the setting of 43.08.05. The weighted inner product is , is the space of real polynomials of degree at most , and with are the orthogonal polynomials for of 43.08.05, with leading coefficient in . The error functional of a rule is , as in 43.09.01. Notation used below — , , , , the orthogonal polynomial , the leading coefficient , and the Jacobi matrix — is recorded in _meta/NOTATION.md.

Definition (general quadrature rule and the optimal-exactness problem). An -point quadrature rule for the weight is with nodes in and weights , approximating . The rule has free parameters. The optimal-exactness problem is to choose the nodes and weights so that for every with as large as possible. Since parameters can match at most conditions, the ceiling is .

Definition (interpolatory weights). Given fixed nodes , the interpolatory weights are , where are the Lagrange cardinal polynomials on the nodes from 43.08.01; this is the weight choice of 43.09.01 that makes the rule exact on for any node set.

Definition (Gauss quadrature rule). The Gauss (Gauss-Christoffel) quadrature rule of order for takes its nodes to be the roots of the orthogonal polynomial — real, simple, and inside by 43.08.05 — and its weights to be the interpolatory weights on those nodes. The weights are also called Christoffel numbers.

Definition (Gauss families). Specialising the weight gives the classical families: on gives Gauss-Legendre; on gives Gauss-Chebyshev; on gives Gauss-Hermite; on gives Gauss-Laguerre. The Gauss-Radau rule fixes one endpoint as a node and is exact on ; the Gauss-Lobatto rule fixes both endpoints and is exact on .

Counterexamples to common slips Intermediate+

  • "More nodes can always raise the exactness past ." No node-and-weight choice with nodes beats : the polynomial is strictly positive away from the nodes, so its weighted integral is positive while every rule evaluates it to . The ceiling is structural, not a matter of cleverness.

  • "The Gauss nodes are equally spaced, just rescaled." They are the roots of , which are not equispaced — for Gauss-Legendre they cluster toward the endpoints like of equispaced angles. Equispacing the nodes throws away the orthogonality that buys the extra degrees.

  • "Positivity of the weights needs the weight to be constant." The argument uses only and exactness on , so it holds for every admissible weight, including the singular Chebyshev weight and the infinite-interval Hermite and Laguerre weights.

  • "The error term uses , like interpolatory quadrature." The Gauss error involves , the doubled order, because the rule is exact through degree . Using the interpolation-error derivative would understate the order by .

Key theorem with proof Intermediate+

The signature result is the maximal-exactness theorem: among all -point rules, the Gauss rule alone reaches degree of exactness , and it does so precisely by placing the nodes at the roots of . The proof rests on one algebraic move — dividing a high-degree polynomial by — and one analytic fact — that is orthogonal to everything of lower degree. This follows the development in Süli-Mayers [Süli-Mayers 2003 §10.2].

Theorem (Gauss quadrature: maximal degree of exactness). Let the nodes be the roots of and the weights be the interpolatory weights . Then the rule satisfies for every . Conversely, no -point rule is exact on , and a rule is exact on only if its nodes are the roots of .

Proof. Take any . Divide by the degree- polynomial : there are unique with $$ p = q,\phi_n + r, \qquad \deg q \le n-1, \quad \deg r \le n-1, $$ since and is monic-up-to-scale of degree . Integrate against : $$ \int_a^b w,p,dx = \int_a^b w,q,\phi_n,dx + \int_a^b w,r,dx = (\phi_n, q)w + \int_a^b w,r,dx . $$ Because $q \in \Pi{n-1}\phi_n\Pi_{n-1}(\phi_n, q)w = 0\int_a^b w,p,dx = \int_a^b w,r,dxx_i\phi_n\phi_n(x_i) = 0p(x_i) = q(x_i)\phi_n(x_i) + r(x_i) = r(x_i)$, and $$ Q(p) = \sum_i w_i p(x_i) = \sum_i w_i r(x_i) = Q(r). $$ The rule is interpolatory on nodes, so it is exact on $\Pi{n-1}r \in \Pi_{n-1}Q(r) = \int_a^b w,r,dxQ(p) = \int_a^b w,r,dx = \int_a^b w,p,dxE(p) = 0p \in \Pi_{2n-1}\Pi_{2n-1}$.

For the ceiling, the polynomial satisfies for every node, so ; but except at the nodes, so by positivity of . Thus and no -point rule is exact on , whatever its nodes. Finally, suppose an -point rule with nodes is exact on ; let . For any the product vanishes at every node, so . Hence is orthogonal to all of ; a monic degree- polynomial with that property is up to scale (uniqueness of the monic orthogonal polynomial, 43.08.05), so the nodes are the roots of .

Corollary (positivity of the Gauss weights). The Gauss weights are strictly positive, for every , and they sum to . Fix and let be the cardinal polynomial; then , so the rule integrates it exactly: $$ 0 < \int_a^b w,L_i^2,dx = Q(L_i^2) = \sum_j w_j L_i(x_j)^2 = w_i , $$ the last step because . Exactness on the constant gives . Positive weights summing to the total mass are what make Gauss rules stable: , the minimal operator norm, in sharp contrast to the unbounded of high-order Newton-Cotes in 43.09.01.

Bridge. The division is the foundational reason Gauss quadrature works: it splits any polynomial of degree up to into a part that the orthogonality of annihilates under the integral and a part of degree that the interpolatory rule handles exactly. This is exactly the payoff of the orthogonal-polynomial root structure of 43.08.05: the roots are not merely real and simple, they are the unique node set that makes the integral of vanish, which is what doubles the exactness from to . The result builds toward the error term and the Golub-Welsch algorithm of the Advanced section, and it appears again in the spectral and Gaussian-process methods of 37.08.01 and the Clenshaw-Curtis comparison of 43.09.02. The central insight is that exactness on is equivalent to the node polynomial being orthogonal to , which generalises the interpolatory exactness of 43.09.01 from degree to degree and is dual to the least-squares projection of 43.08.05, where the same is the residual direction. Putting these together, choosing the nodes — not just the weights — is precisely the freedom that orthogonality converts into extra exact degrees.

Exercises Intermediate+

Advanced results Master

The elementary theory fixes the nodes and weights and proves the exactness ceiling; the master layer turns the construction into a spectral algorithm, controls the error and convergence precisely, and places the Radau and Lobatto variants inside the same orthogonal-polynomial frame.

Theorem 1 (the error term and its sign). For the Gauss rule has error $$ E(f) = \frac{(\phi_n, \phi_n)_w}{k_n^2,(2n)!},f^{(2n)}(\eta), \qquad \eta \in (a,b), $$ where is the leading coefficient of , obtained by integrating the Hermite-interpolation remainder against and collapsing the one-signed factor by the weighted mean value theorem [Süli-Mayers 2003 §10.4]. The error is governed by the -th derivative — twice the order an -point interpolatory rule reaches — and the constant is the squared norm of the monic orthogonal polynomial. For Gauss-Legendre this evaluates to , decaying super-geometrically in for analytic .

Theorem 2 (convergence for every continuous integrand). For every the Gauss rules converge, as . The positivity corollary gives for all , a uniform bound on the absolute-weight sums; since the rules are exact on a polynomial of each degree and polynomials are dense in by Weierstrass, the Pólya-Steklov criterion of 43.09.01 applies and forces convergence [Davis-Rabinowitz Ch. 3]. This is the structural reward of positivity: Gauss rules converge for all continuous functions, where the equispaced Newton-Cotes family diverges, and the divergence-versus-convergence dichotomy is decided entirely by the boundedness of .

Theorem 3 (Golub-Welsch: nodes and weights as a symmetric eigenproblem). In the orthonormal scaling , the three-term recurrence of 43.08.05 reads , so multiplication by on is the symmetric tridiagonal Jacobi matrix $$ J_n = \begin{pmatrix} \alpha_0 & \sqrt{\beta_1} & & \ \sqrt{\beta_1} & \alpha_1 & \sqrt{\beta_2} & \ & \ddots & \ddots & \ddots \ & & \sqrt{\beta_{n-1}} & \alpha_{n-1} \end{pmatrix}. $$ Its eigenvalues are exactly the Gauss nodes (the roots of ), and if is the normalised eigenvector for node , the Gauss weight is with — the squared first component scaled by the total mass. Thus computing an -point Gauss rule reduces to one symmetric tridiagonal eigendecomposition, solved in by the symmetric QR/QL iteration, which is the Golub-Welsch algorithm [Gautschi 2004]. This routes the entire construction through the symmetric eigenproblem of 43.06.01: real eigenvalues and an orthonormal eigenbasis are guaranteed by symmetry, and the simplicity of the nodes is the simplicity of a tridiagonal matrix with nonzero off-diagonals.

Theorem 4 (Radau and Lobatto from modified weights). Prescribing nodes trades exactness for endpoint control, and each variant is itself a Gauss rule for a modified weight. The Gauss-Radau rule fixes one endpoint, say : its free nodes are the roots of the degree- orthogonal polynomial for the modified weight , and it is exact on . The Gauss-Lobatto rule fixes both endpoints: its free nodes are the roots of the orthogonal polynomial for , and it is exact on [Gautschi 2004]. The weights remain positive by the same argument applied to the reduced free-node set together with explicit positive endpoint weights, and at the matrix level Radau and Lobatto are rank-one and rank-two modifications of that plant the prescribed eigenvalues. Lobatto rules underlie the spectral-element and collocation methods that need function values at element boundaries.

Synthesis. The division identity is the foundational reason Gauss quadrature, orthogonal polynomials, and the symmetric eigenproblem are one subject: exactness on is equivalent to the node polynomial being orthogonal to , which forces the nodes to be the roots of , which are the eigenvalues of the Jacobi matrix. The central insight is that the freedom to choose nodes — the freedom the equispaced rules discard — is exactly the orthogonality that doubles the exactness, and that the same orthogonality forces the weights positive, so the very property that maximises accuracy also guarantees stability and convergence; this is exactly the dichotomy of 43.09.01, where sign-definiteness and bounded absolute-weight sums separate the usable rules from the divergent ones, now resolved in Gauss's favour by construction. Putting these together, the Gauss rule is dual to the least-squares projection of 43.08.05: there is the residual direction perpendicular to under the integral, here its roots are the optimal sampling points, and the bridge between them is the single Gram-Schmidt-against-a-weight construction that simultaneously diagonalises the least-squares normal equations, generates the recurrence coefficients of the Jacobi matrix, and hands the Golub-Welsch algorithm its eigenproblem. This generalises the interpolatory quadrature of 43.09.01 from degree to and is the integration-theoretic shadow of the spectral theorem of 43.06.01, one tridiagonal matrix governing both where to sample and how much each sample counts.

Full proof set Master

Proposition 1 (maximal exactness of the Gauss rule). The interpolatory rule on the roots of is exact on , and no -point rule is exact on .

Proof. For , polynomial division by (degree ) gives unique with . Then , since and by 43.08.05. At each node , , so and ; the interpolatory rule on nodes is exact on , so . Hence . For the ceiling, has (vanishes at all nodes) while ( off the finite node set, ), so for any -point rule.

Proposition 2 (the orthogonal-polynomial nodes are forced). If an -point rule is exact on , its nodes are the roots of .

Proof. Let be the nodes and , monic of degree . For arbitrary , vanishes at every node, so exactness gives . Thus . A monic degree- polynomial orthogonal to equals the monic : their difference lies in and is orthogonal to , hence to itself, hence zero. So (monic scaling) and the are its roots.

Proposition 3 (positivity and total mass of the weights). The Gauss weights satisfy and .

Proof. , so exactness gives , using . Since and is a nonzero polynomial, , so . Applying exactness to gives .

Proposition 4 (the error term). For , for some , with the leading coefficient of .

Proof. Let be the Hermite interpolant of at the doubled nodes, matching and at each . The Hermite remainder is with . Exactness on gives , since depends only on the values . Hence . The factor and is continuous, so the weighted mean value theorem for integrals yields with .

Proposition 5 (Golub-Welsch eigenvalue characterisation). The Gauss nodes are the eigenvalues of the symmetric tridiagonal Jacobi matrix , and where is the normalised eigenvector at node and .

Proof. Write . The orthonormal recurrence gives . At a root of (equivalently of ), the trailing term drops and , so is an eigenvalue of with eigenvector (nonzero since ). The distinct roots give distinct eigenvalues, exhausting the spectrum of the symmetric , whose eigenvalues are real and whose eigenvectors are orthogonal by the spectral theorem of 43.06.01. The normalised eigenvector is , with first component . The Christoffel-Darboux reproducing-kernel weight formula of 43.08.05 then gives .

Proposition 6 (Radau exactness from the modified weight). The Gauss-Radau rule with fixed node — free nodes the roots of the degree- orthogonal polynomial for — is exact on .

Proof. Let on , and let be its degree- orthogonal polynomial with roots ; the Radau nodes are together with these. Take and divide by , degree : with , . Integrating against : . Since and under , the first term vanishes, leaving . The rule has nodes and interpolatory weights, exact on ; and at each node the factor vanishes (at by , at the others by ), so and .

Connections Master

  • The Gauss nodes are the roots of the orthogonal polynomial constructed in 43.08.05 — real, simple, interlacing, and inside — and the weight-positivity, error constant , and Golub-Welsch eigenvector formula all descend from the three-term recurrence and Christoffel-Darboux kernel developed there; this unit is the quadrature payoff of that unit's root-structure theorem, the forward reference its own Synthesis anticipated.

  • The maximal-exactness theorem is the resolution of the degree-of-exactness ceiling exposed in 43.09.01: where the interpolatory Newton-Cotes rules on fixed equispaced nodes reach only degree (or by symmetry) and acquire fatal negative weights at high order, the Gauss rule on the orthogonal-polynomial nodes reaches with strictly positive weights summing to , so the Pólya-Steklov convergence criterion proved there now certifies convergence for every continuous integrand rather than divergence.

  • The Golub-Welsch algorithm routes node-and-weight computation through the symmetric tridiagonal eigenproblem of 43.06.01: the Jacobi matrix of recurrence coefficients is real symmetric, so its eigenvalues (the nodes) are real and its eigenvectors orthogonal by the spectral theorem, and the symmetric QR/QL iteration computes the full rule in ; the simplicity of the Gauss nodes is the simplicity of a tridiagonal matrix with nonzero subdiagonal.

  • Gauss quadrature is the integration tool behind spectral and pseudospectral discretisations and the Gaussian-process / random-matrix computations of 37.08.01, where weighted integrals against (Gauss-Hermite) and against equilibrium measures are evaluated to machine precision; the Lobatto variant of Theorem 4, fixing both endpoints, supplies the boundary collocation nodes used in the spectral-element methods that border the finite-element theory of 24.04.07.

  • The error term's reliance on the Hermite interpolant at doubled nodes ties this unit to the Hermite and osculatory interpolation of 43.08.03, and the weighted mean value theorem that collapses the one-signed factor is the same integral mean value theorem underlying the Lagrange remainder of 02.05.02, so the Gauss error is the -th-order member of the single family of derivative-times-fixed-kernel error formulas that also produced the Peano representation of 43.09.01.

Historical & philosophical context Master

Carl Friedrich Gauss introduced the method in Methodus nova integralium valores per approximationem inveniendi, read to the Göttingen Society in 1814 and published in its Commentationes the same year. Gauss derived the -point rule on exact to degree through the continued-fraction expansion of a logarithmic generating function — equivalently, through what are now recognised as the Legendre polynomials — and tabulated nodes and weights for up to , doubling the attainable exactness of the equispaced interpolatory rules of Newton and Cotes for the same number of evaluations [Gauss 1814].

The orthogonal-polynomial reading that this unit follows is due to Carl Gustav Jacobi, who in an 1826 paper recast Gauss's nodes as the roots of the Legendre polynomials and supplied the modern proof via the division argument, freeing the construction from continued fractions. The generalisation to an arbitrary positive weight is the work of Pafnuty Chebyshev and Elwin Bruno Christoffel — the weights carry Christoffel's name as the Christoffel numbers from his 1858 study — and of Thomas Joannes Stieltjes, whose 1884 work tied the weights and nodes to the moment problem and continued fractions. The Radau and Lobatto endpoint variants are due to Rodolphe Radau (1880) and Rehuel Lobatto (1852). The reduction of the entire computation to a symmetric tridiagonal eigenproblem was published by Gene Golub and John Welsch in 1969, and the recurrence-coefficient viewpoint that makes it practical was developed by Walter Gautschi [Gautschi 1981].

Bibliography Master

@book{sulimayers2003,
  author    = {S\"{u}li, Endre and Mayers, David F.},
  title     = {An Introduction to Numerical Analysis},
  publisher = {Cambridge University Press},
  year      = {2003}
}

@book{davisrabinowitz1984,
  author    = {Davis, Philip J. and Rabinowitz, Philip},
  title     = {Methods of Numerical Integration},
  edition   = {2},
  publisher = {Academic Press},
  year      = {1984}
}

@book{gautschi2004,
  author    = {Gautschi, Walter},
  title     = {Orthogonal Polynomials: Computation and Approximation},
  publisher = {Oxford University Press},
  year      = {2004}
}

@article{golubwelsch1969,
  author  = {Golub, Gene H. and Welsch, John H.},
  title   = {Calculation of {G}auss Quadrature Rules},
  journal = {Mathematics of Computation},
  volume  = {23},
  number  = {106},
  year    = {1969},
  pages   = {221--230}
}

@article{gauss1814,
  author  = {Gauss, Carl Friedrich},
  title   = {Methodus nova integralium valores per approximationem inveniendi},
  journal = {Commentationes Societatis Regiae Scientiarum Gottingensis Recentiores},
  volume  = {3},
  year    = {1814},
  pages   = {39--76}
}

@incollection{gautschi1981,
  author    = {Gautschi, Walter},
  title     = {A Survey of {G}auss-{C}hristoffel Quadrature Formulae},
  booktitle = {E. B. Christoffel: The Influence of His Work on Mathematics and the Physical Sciences},
  editor    = {Butzer, P. L. and Feh\'{e}r, F.},
  publisher = {Birkh\"{a}user},
  year      = {1981},
  pages     = {72--147}
}

@article{jacobi1826,
  author  = {Jacobi, Carl Gustav Jacob},
  title   = {Ueber {G}au{\ss}' neue Methode, die Werthe der Integrale n\"{a}herungsweise zu finden},
  journal = {Journal f\"{u}r die reine und angewandte Mathematik},
  volume  = {1},
  year    = {1826},
  pages   = {301--308}
}