21.15.03 · number-theory / exponential-sums

van der Corput's Method for Exponential Sums

shipped3 tiersLean: none

Anchor (Master): van der Corput 1921 *Math. Ann.* 84, 53-79 and 1922 *Math. Ann.* 87, 39-65 (Zahlentheoretische Abschätzungen and Verschärfung der Abschätzung beim Teilerproblem); Graham-Kolesnik 1991 *Van der Corput's Method of Exponential Sums* (LMS LNS 126) Ch. 1-5 (the $A$- and $B$-processes, exponent pairs, the $(\kappa,\lambda)$ region); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8; Titchmarsh 1986 *The Theory of the Riemann Zeta-Function* (Oxford) Ch. 5; Huxley 1996 *Area, Lattice Points, and Exponential Sums* (Oxford) Ch. 1-3, 17 (the discrete Hardy-Littlewood method and the divisor / circle problems)

Intuition Beginner

Suppose you want to add up a long list of unit arrows, one for each whole number from to , where the direction of the -th arrow is set by a smoothly bending rule — not a straight line, not a polynomial, but some gently curving function like or . If the arrows all pointed the same way the total would be huge, about . The whole game is to show that because the directions keep turning, the arrows largely cancel and the total is far smaller. This is the same cancellation you met for straight-line and polynomial rules; the new difficulty is that a curving rule does not telescope and offers no obvious foothold.

Van der Corput found two moves that, in alternation, grind almost any smooth rule down to size. The first, the -process, is a shifting trick: compare each arrow with the one a few steps ahead, and the difference of the directions is a flatter, slower-turning rule that is easier to handle. It is the square-and-shift idea behind Weyl differencing, tuned so the shift length is a free dial you optimise. The second, the -process, is a change of viewpoint: a sum of a fast-turning wave can be rewritten as a new sum indexed by the speeds that appear, and the new sum is often much shorter. You trade a sum over positions for a sum over slopes.

Why care? Because two of the oldest questions in number theory — how the Riemann zeta function grows up the critical line, and how accurately you can count lattice points inside a circle or under a hyperbola — both come down to bounding exactly these sums. Each clever alternation of the two moves shaves the exponent in the answer, and chasing the best alternation is a whole industry.

Visual Beginner

Think of the sum as a walk in the plane: start at the origin, and for each take a unit step in the direction set by . The end point of the walk is the value of the sum. A straight-line rule sends you around a tight circle, returning near the start. A curving rule sends you on a more complicated path, and the question is how far from the origin you end up.

   A-process: shift and compare        B-process: change index

   step n   step n+h                   sum over positions n
     \        /                              |  |  |  |  |   (fast wiggle)
      \      /     difference of                 ||
       \    /      directions turns        becomes
        \  /       more slowly                sum over slopes m
         \/        (flatter rule)             |    |    |     (fewer terms)
       compare                                  short

The left panel is the shifting move: differencing replaces a turning rule with a flatter one. The right panel is the viewpoint move: a sum that wiggles fast over many positions becomes a sum over the handful of slopes that actually occur. Alternating the two is van der Corput's method, and each alternation is a fresh chance for cancellation.

Worked example Beginner

We estimate a model sum and watch the -process shrink it. Take where the rule is gently curving, and look at how the sum over becomes a sum over slopes. To keep arithmetic concrete, take the linear-slope model for from to , with , and compare the number of terms before and after.

Step 1. Count the terms in the original sum. There are arrows, one per integer from to .

Step 2. Find the range of slopes. The slope of the rule at position is the rate of change of , which for is . As runs from to , the slope runs from up to .

Step 3. Count the terms in the new sum. The -process re-indexes by whole-number slopes in that range. The slopes covered run from about to , so there is roughly one whole-number slope, (together with the endpoint behaviour). The dual sum has on the order of term instead of .

Step 4. Read off the saving. Each dual term carries a weight of about , and here , so . The dual sum is therefore of size about , against the plain size of the original. The square root of has appeared: .

What this tells us: the -process turned a sum of unit arrows into a sum of size about . The cancellation is real and quantitative — the total is the square root of the length, not the length. This saving is the simplest face of van der Corput's method, and iterating the two processes pushes the saving further.

Check your understanding Beginner

Formal definition Intermediate+

Throughout write and for the distance from to the nearest integer; the additive character is the one used on in 21.15.01 and 21.15.02. Let be smooth (typically real-analytic on the relevant range), and consider the exponential sum $$ S = \sum_{a < n \le b} e\big(f(n)\big). $$ Unlike the polynomial phases of 21.15.02, is now a general smooth phase, for example , with , or . The elementary bound is ; the object is power-saving cancellation.

Definition (-process — Weyl-van der Corput inequality). For an integer the -process is the differencing inequality $$ \Big| \sum_{a < n \le b} e(f(n)) \Big|^2 \le \frac{N + H}{H}\sum_{|h| < H}\Big(1 - \frac{|h|}{H}\Big)\sum_{n \in I_h} e\big(f(n+h) - f(n)\big), \qquad N = b - a, $$ where is the range of with both and in . It reduces the sum of to averaged sums of the differenced phases , whose first derivatives are smaller by a factor . The shift bound is a free parameter, optimised at the end.

Definition (-process — Poisson summation plus stationary phase). Suppose is monotone on with of constant sign and . Poisson summation 21.15.01 writes , and evaluating each integral by stationary phase 02.20.05 — the critical point solves — gives the -process transformation $$ \sum_{a < n \le b} e(f(n)) = \sum_{f'(a) \le m \le f'(b)} \frac{e\big(f(x_m) - m x_m + \tfrac18 \operatorname{sgn} f''\big)}{\sqrt{|f''(x_m)|}} + O\big(\log(2 + |f'(b) - f'(a)|)\big) + O\big(\text{endpoint terms}\big). $$ The new sum is the dual sum: its phase is the Legendre transform of , and its length is , the range of slopes.

Definition (exponent pair). A pair of real numbers with is an exponent pair if, for every smooth phase on obeying the standard derivative bounds (the model being with ), one has $$ \sum_{N < n \le 2N} e(f(n)) \ll \Big(\frac{F}{N}\Big)^{\kappa} N^{\lambda}, $$ with the implied constant uniform over the class of phases. The amplitude measures how fast turns; is the exponent of that amplitude and the exponent of the length.

Definition ( and operators on pairs). The -process acts on exponent pairs by $$ A(\kappa, \lambda) = \Big( \frac{\kappa}{2\kappa + 2},\ \frac{\kappa + \lambda + 1}{2\kappa + 2} \Big), \qquad B(\kappa, \lambda) = \Big( \lambda - \tfrac12,\ \kappa + \tfrac12 \Big). $$ Starting from the base pair — the elementary bound — and from , repeated application of and generates the classical exponent pairs. The operator is an involution ( on pairs), corresponding to applying the -process twice and returning to the original sum.

Counterexamples to common slips

  • "The -process always shortens the sum." It shortens the sum precisely when the slope range is smaller than the length , i.e. when . When the dual sum is longer, and one applies first to reduce . The art of the method is sequencing and so each is applied in its favourable regime.

  • "An exponent pair must have ." Only the base pair and the first -image satisfy . General pairs satisfy varying bounds; the quantity minimised for is over the convex hull of attainable pairs, but it is not constant.

  • "Stationary phase needs a polynomial phase." The -process applies to any smooth with of one sign and bounded away from and on the range; the stationary-phase lemma of 02.20.05 requires only a non-degenerate critical point, not a polynomial. The phases of interest (, fractional powers) are emphatically non-polynomial.

Key theorem with proof Intermediate+

The signature result is the -process exponent pair: a single application of Poisson summation and stationary phase converts the base pair into a genuine power-saving bound [Graham Kolesnik 1991].

Theorem (the -process pair; van der Corput). Let be smooth on with of constant sign, and controlled by and respectively. Then $$ \sum_{N < n \le 2N} e(f(n)) \ll N, \lambda^{1/2} + \lambda^{-1/2}. $$ Equivalently, in exponent-pair language with this is the pair : .

Proof. By Poisson summation 21.15.01, for a smooth cutoff adapted to , $$ \sum_{n} w(n), e(f(n)) = \sum_{m \in \mathbb{Z}} J_m, \qquad J_m = \int_{\mathbb{R}} w(x), e\big(f(x) - m x\big), dx. $$ The phase of is with . On the derivative ranges over an interval of length ; call its image .

For outside a neighbourhood of this image, is bounded below, and is monotone since has constant sign. The first van der Corput lemma for integrals (the case of 02.20.05) and repeated integration by parts give rapidly decaying, so .

For each with the phase has a unique stationary point solving , non-degenerate because . The stationary-phase formula of 02.20.05 gives $$ J_m = \frac{e\big(f(x_m) - m x_m + \tfrac18\big)}{\sqrt{f''(x_m)}}, w(x_m) + O\big(\lambda^{-1/2}\cdot (N^2\lambda)^{-1}\big), $$ the error being the second-order stationary-phase term, controlled by and . Each main term has size , and there are values of in the range. Summing the main terms with the triangle inequality bounds them by ; the accumulated stationary-phase errors are ; and the off-range tail contributes . Collecting, $$ \Big| \sum_{N < n \le 2N} e(f(n)) \Big| \ll N\lambda^{1/2} + \lambda^{-1/2} + \log(2 + N\lambda) \ll N\lambda^{1/2} + \lambda^{-1/2}, $$ the logarithm absorbed when is in the nondegenerate range . Writing gives , i.e. , the pair .

Bridge. The -process pair builds toward the entire exponent-pair calculus, and it appears again in the application to below, where already yields the convexity bound. The foundational reason the method works is that Poisson summation is dual to stationary phase: summation over positions becomes summation over slopes, and the weight is exactly the stationary-phase normalisation of 02.20.05. This is exactly the sum-side incarnation of the integral van der Corput lemma — where the integral lemma gives decay for one oscillatory integral, the -process gives a dual sum of terms each of that size. The -process generalises Weyl differencing from 21.15.02 to non-polynomial phases, and the central insight is that the two operators and on the region are dual to each other: trades length for amplitude by differencing, swaps amplitude and length by the Legendre transform. Putting these together, every word in the language of exponent pairs is a sequence of 's and 's applied to , and the best bound for any particular phase is the optimal such word.

Exercises Intermediate+

Advanced results Master

The exponent-pair region and optimisation

The set of attainable exponent pairs is the closure under and of , together with the convexity property that the attainable region is convex [Graham Kolesnik 1991]. The operators generate a countable dense family of vertices; for the divisor and zeta problems one minimises a linear functional of over this region. For with over attainable pairs, van der Corput's processes alone give from ; the longer words and their conjugates yield the van der Corput exponent pairs , whose infimum of still does not beat for the zeta function but does improve the divisor exponent. Rankin computed the optimal van der Corput exponent for the divisor problem from this region. The barrier for zeta is broken only by stepping outside the -process: the Bombieri-Iwaniec method (1986) and Huxley's discrete Hardy-Littlewood refinements reach .

The Dirichlet divisor and Gauss circle problems

Let and [Huxley 1996]. Both error terms reduce, via the hyperbola method and Voronoi summation 21.15.01, to exponential sums and with phase , . Applying an exponent pair yields up to -powers; the van der Corput pair gives the classical , sharpening Dirichlet's elementary and Voronoi's obtained by his summation formula. The conjectured exponent is , equivalent to the optimal cancellation ; the -results of Hardy show cannot be improved.

Iterated processes and the -process gain

The general bound from a word in applied to is sharp for the phase class it was tuned to, but the same phase may admit a better word. The -process improves a pair when differencing the phase produces a flatter sum: applied to it returns , which decreases (good for small-amplitude phases) at the cost of raising toward . Because and trade off, the optimal word depends on the regime : for the zeta-type phase the optimum among -words is , while for the divisor phase the optimum is a longer alternation. The exponent-pair conjecture asserts that is attainable for every , which would give for both the zeta and divisor problems; it remains open and is known to require methods beyond and .

Synthesis. Van der Corput's method, exponent pairs, the zeta growth problem, and the divisor and circle problems are one circle of ideas, and the bridge is the duality between a sum and its Legendre-transformed dual. The foundational reason the processes terminate in a power saving is that the -process is exactly Poisson summation from 21.15.01 composed with the stationary-phase lemma of 02.20.05: summation over positions becomes summation over slopes, the weight is the stationary-phase normalisation, and this is dual to the -process, which generalises the Weyl differencing of 21.15.02 from polynomial to smooth phases. Putting these together, every classical bound is a word in and applied to the base pair , the central insight being that trades length for amplitude and swaps them by the Legendre transform; the convexity bound is exactly , and the divisor and circle exponents are the same calculus read on the phase . The exponent-pair conjecture generalises the whole method to its conjectural limit, and the modern Bombieri-Iwaniec and decoupling methods are the central insight pushed past the -closure into the geometry of the dual sum's spacing.

Full proof set Master

Proposition 1 (first-derivative test; van der Corput's lemma for sums). Let be real on with monotone and for all (the distance to the nearest integer of stays ). Then $$ \Big| \sum_{a < n \le b} e(f(n)) \Big| \ll \frac{1}{\delta}. $$

Proof. By Euler-Maclaurin / partial summation against the integral, or directly by the truncated Poisson summation of 21.15.01, , where only with contribute a main term. The hypothesis means avoids a -neighbourhood of every integer, so for each integer the integrand phase has derivative of absolute value and monotone. The first van der Corput integral lemma (02.20.05, case ) gives -type control; summing the geometric-like decay over produces truncated, which is once the monotonicity of bounds the number of relevant by and the tails decay. Hence .

Proposition 2 (second-derivative test). Let be real on with for constants and of constant sign. Then $$ \Big| \sum_{a < n \le b} e(f(n)) \Big| \ll (b - a),\lambda^{1/2} + \lambda^{-1/2}. $$

Proof. This is the conclusion of the -process key theorem, here stated for a general interval. Since has constant sign, is strictly monotone, so the -process applies: Poisson summation 21.15.01 reduces to the dual sum over stationary points solving , plus . The number of integers in is , since ranges in . Each dual term has modulus . Bounding the dual sum directly by its length times the term size, $$ |S| \ll \big(1 + (b-a)\lambda\big)\lambda^{-1/2} + (\text{errors}) \ll (b-a)\lambda^{1/2} + \lambda^{-1/2}, $$ the stationary-phase and endpoint errors being lower order as in the key theorem.

Proposition 3 (the basic exponent pair from the second-derivative test). The pair is an exponent pair: for on with , .

Proof. Here , of constant sign on the dyadic block (shrinking the block if necessary so does not vanish). Proposition 2 with gives ; explicitly , and . In the standard range where the phase genuinely oscillates and is the regime exponent pairs are defined for, the first term dominates and . This is the assertion that is an exponent pair, matching .

Proposition 4 ( and the bound are equivalent inputs). Applying the -operator to gives , and this pair yields .

Proof. The first claim is the computation of Exercise 3. For the second, the exponent-pair bound on the zeta sum at the critical length , summed dyadically against by partial summation as in Exercise 8, gives . Substituting makes the exponent , so .

Connections Master

The -process is the smooth-phase generalisation of the Weyl differencing inequality of 21.15.02: where Weyl differencing reduces a degree- polynomial sum to degree- sums, the -process reduces a smooth sum to averaged difference sums with a free shift length, and both are Cauchy-Schwarz applied to shifted copies of the same character sum.

The -process is Poisson summation from 21.15.01 composed with the method of stationary phase from 02.20.05; the dual sum it produces is indexed by the slopes , exactly the transform side of the Poisson formula, and each term carries the stationary-phase weight that the oscillatory-integral lemma of 02.20.05 supplies.

The bound for proved here is the first convexity-breaking estimate feeding the growth and zero-density theory of the Riemann zeta function in 21.03.01; subconvexity for and for general -functions continues this line, with the complete exponential sums of 21.15.04 supplying the arithmetic input on the major-arc side that complements the smooth-phase analysis here.

Historical & philosophical context Master

Johannes Gualtherus van der Corput introduced his method in two papers in the Mathematische Annalen: the 1921 Zahlentheoretische Abschätzungen [van der Corput 1921], which set out the derivative tests and the differencing and Poisson-summation passages, and the 1922 Verschärfung der Abschätzung beim Teilerproblem [van der Corput 1922], which iterated the processes to improve the Dirichlet divisor exponent below Voronoi's . The two operations were later christened the - and -processes, and the bookkeeping of their effect on the bound was systematised into the theory of exponent pairs by Phillips, Rankin, and others in the 1930s-1950s; Rankin found the optimal van der Corput exponent for the divisor problem by linear programming over the attainable region.

The method's first triumph was the bound , breaking the convexity exponent inherited from the functional equation and the elementary estimate. It stood essentially unchallenged in its conceptual form until the Bombieri-Iwaniec method of 1986 introduced the spacing of the dual sum's terms as a new resource, and Huxley's discrete Hardy-Littlewood method refined this to the current [Huxley 1996]. Graham and Kolesnik's 1991 monograph remains the systematic account of the classical -calculus. The method is the discrete shadow of the stationary-phase analysis of oscillatory integrals: van der Corput himself stated the integral lemmas alongside the sum estimates, and the modern view that exponent pairs are words in two dual operations on the region dates to this systematisation.

Bibliography Master

@article{vandercorput1921,
  author  = {van der Corput, J. G.},
  title   = {Zahlentheoretische Absch{\"a}tzungen},
  journal = {Mathematische Annalen},
  volume  = {84},
  pages   = {53--79},
  year    = {1921}
}

@article{vandercorput1922,
  author  = {van der Corput, J. G.},
  title   = {Versch{\"a}rfung der Absch{\"a}tzung beim Teilerproblem},
  journal = {Mathematische Annalen},
  volume  = {87},
  pages   = {39--65},
  year    = {1922}
}

@book{grahamkolesnik1991,
  author    = {Graham, S. W. and Kolesnik, G.},
  title     = {Van der Corput's Method of Exponential Sums},
  series    = {London Mathematical Society Lecture Note Series},
  volume    = {126},
  publisher = {Cambridge University Press},
  year      = {1991}
}

@book{iwaniec-kowalski2004,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004}
}

@book{titchmarsh1986,
  author    = {Titchmarsh, E. C.},
  title     = {The Theory of the Riemann Zeta-Function},
  edition   = {2},
  note      = {Revised by D. R. Heath-Brown},
  publisher = {Oxford University Press},
  year      = {1986}
}

@book{huxley1996,
  author    = {Huxley, M. N.},
  title     = {Area, Lattice Points, and Exponential Sums},
  series    = {London Mathematical Society Monographs},
  volume    = {13},
  publisher = {Oxford University Press},
  year      = {1996}
}