21.15.03 · number-theory / exponential-sums

van der Corput's Method for Exponential Sums

shipped3 tiersLean: none

Anchor (Master): van der Corput 1921 *Math. Ann.* 84, 53-79 and 1922 *Math. Ann.* 87, 39-65 (Zahlentheoretische Abschätzungen and Verschärfung der Abschätzung beim Teilerproblem); Graham-Kolesnik 1991 *Van der Corput's Method of Exponential Sums* (LMS LNS 126) Ch. 1-5 (the $A$- and $B$-processes, exponent pairs, the $(\kappa,\lambda)$ region); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8; Titchmarsh 1986 *The Theory of the Riemann Zeta-Function* (Oxford) Ch. 5; Huxley 1996 *Area, Lattice Points, and Exponential Sums* (Oxford) Ch. 1-3, 17 (the discrete Hardy-Littlewood method and the divisor / circle problems)

Intuition Beginner

Suppose you want to add up a long list of unit arrows, one for each whole number $n$ from $1$ to $N$ , where the direction of the $n$ -th arrow is set by a smoothly bending rule $f (n)$ — not a straight line, not a polynomial, but some gently curving function like $n$ or $n lo g n$ . If the arrows all pointed the same way the total would be huge, about $N$ . The whole game is to show that because the directions keep turning, the arrows largely cancel and the total is far smaller. This is the same cancellation you met for straight-line and polynomial rules; the new difficulty is that a curving rule does not telescope and offers no obvious foothold.

Van der Corput found two moves that, in alternation, grind almost any smooth rule down to size. The first, the $A$ -process, is a shifting trick: compare each arrow with the one a few steps ahead, and the difference of the directions is a flatter, slower-turning rule that is easier to handle. It is the square-and-shift idea behind Weyl differencing, tuned so the shift length is a free dial you optimise. The second, the $B$ -process, is a change of viewpoint: a sum of a fast-turning wave can be rewritten as a new sum indexed by the speeds that appear, and the new sum is often much shorter. You trade a sum over positions for a sum over slopes.

Why care? Because two of the oldest questions in number theory — how the Riemann zeta function grows up the critical line, and how accurately you can count lattice points inside a circle or under a hyperbola — both come down to bounding exactly these sums. Each clever alternation of the two moves shaves the exponent in the answer, and chasing the best alternation is a whole industry.

Visual Beginner

Think of the sum as a walk in the plane: start at the origin, and for each $n$ take a unit step in the direction set by $f (n)$ . The end point of the walk is the value of the sum. A straight-line rule sends you around a tight circle, returning near the start. A curving rule sends you on a more complicated path, and the question is how far from the origin you end up.

   A-process: shift and compare        B-process: change index

   step n   step n+h                   sum over positions n
     \        /                              |  |  |  |  |   (fast wiggle)
      \      /     difference of                 ||
       \    /      directions turns        becomes
        \  /       more slowly                sum over slopes m
         \/        (flatter rule)             |    |    |     (fewer terms)
       compare                                  short

The left panel is the shifting move: differencing replaces a turning rule with a flatter one. The right panel is the viewpoint move: a sum that wiggles fast over many positions becomes a sum over the handful of slopes that actually occur. Alternating the two is van der Corput's method, and each alternation is a fresh chance for cancellation.

Worked example Beginner

We estimate a model sum and watch the $B$ -process shrink it. Take $f (n) = T g (n / N)$ where the rule is gently curving, and look at how the sum over $n$ becomes a sum over slopes. To keep arithmetic concrete, take the linear-slope model $f (n) = n^{2} / (2 N)$ for $n$ from $1$ to $N$ , with $N = 100$ , and compare the number of terms before and after.

Step 1. Count the terms in the original sum. There are $N = 100$ arrows, one per integer $n$ from $1$ to $100$ .

Step 2. Find the range of slopes. The slope of the rule at position $n$ is the rate of change of $f$ , which for $f (n) = n^{2} / (2 N)$ is $n / N$ . As $n$ runs from $1$ to $100$ , the slope $n / N$ runs from $0.01$ up to $1.0$ .

Step 3. Count the terms in the new sum. The $B$ -process re-indexes by whole-number slopes $m$ in that range. The slopes covered run from about $0$ to $1$ , so there is roughly one whole-number slope, $m = 1$ (together with the endpoint behaviour). The dual sum has on the order of $1$ term instead of $100$ .

Step 4. Read off the saving. Each dual term carries a weight of about $1/ ∣ f^{''} ∣$ , and here $f^{''} = 1/ N = 0.01$ , so $1/ 0.01 = 10$ . The dual sum is therefore of size about $10$ , against the plain size $100$ of the original. The square root of $N$ has appeared: $100 = 10$ .

What this tells us: the $B$ -process turned a sum of $100$ unit arrows into a sum of size about $10$ . The cancellation is real and quantitative — the total is the square root of the length, not the length. This $N$ saving is the simplest face of van der Corput's method, and iterating the two processes pushes the saving further.

Check your understanding Beginner

Exercise (easy, multiple choice).

In van der Corput's method, what does the $A$ -process do to a sum of the wave terms $e (f (n))$ ?

A. It replaces the sum by a new sum indexed by the slopes of $f$ .
B. It compares each term with a shifted copy and reduces to a flatter difference rule, with the shift length as a free parameter.
C. It assumes $f$ is a polynomial and reads off the leading coefficient.
D. It evaluates the sum exactly using a closed-form formula.

Hint

One of the two processes is the shifting / differencing move; the other is the change-of-index move. The $A$ -process is the shifting one.

Answer

B. The $A$ -process squares the sum and compares each term with a copy shifted by $h$ ; the difference $f (n + h) - f (n)$ is a flatter, slower-turning rule, and the shift length $H$ (the range of $h$ ) is a dial you optimise. Option A describes the $B$ -process, the change of index to a sum over slopes. Option C describes the polynomial Weyl method, a special case but not the general $A$ -process. Option D is wrong: the method bounds the sum, it does not evaluate it.

Formal definition Intermediate+

Throughout write $e (t) = e^{2 π i t}$ and $∥ t ∥$ for the distance from $t$ to the nearest integer; the additive character $e (\cdot)$ is the one used on $R / Z$ in 21.15.01 and 21.15.02. Let $f : [a, b] \to R$ be smooth (typically real-analytic on the relevant range), and consider the exponential sum $$ S = \sum_{a < n \le b} e\big(f(n)\big). $$ Unlike the polynomial phases of 21.15.02, $f$ is now a general smooth phase, for example $f (n) = (t /2 π) lo g n$ , $f (n) = α n^{c}$ with $c \in / Z$ , or $f (n) = T n$ . The elementary bound is $∣ S ∣ \leq b - a$ ; the object is power-saving cancellation.

Definition ( $A$ -process — Weyl-van der Corput inequality). For an integer $H \geq 1$ the $A$ -process is the differencing inequality $$ \Big| \sum_{a < n \le b} e(f(n)) \Big|^2 \le \frac{N + H}{H}\sum_{|h| < H}\Big(1 - \frac{|h|}{H}\Big)\sum_{n \in I_h} e\big(f(n+h) - f(n)\big), \qquad N = b - a, $$ where $I_{h}$ is the range of $n$ with both $n$ and $n + h$ in $(a, b]$ . It reduces the sum of $f$ to averaged sums of the differenced phases $f (\cdot + h) - f (\cdot)$ , whose first derivatives are smaller by a factor $\approx h f^{''}$ . The shift bound $H$ is a free parameter, optimised at the end.

Definition ( $B$ -process — Poisson summation plus stationary phase). Suppose $f^{'}$ is monotone on $[a, b]$ with $f^{''}$ of constant sign and $∣ f^{''} ∣ ≍ λ$ . Poisson summation 21.15.01 writes $\sum_{a < n \leq b} e (f (n)) = \sum_{m \in Z} \int_{a}^{b} e (f (x) - m x) d x$ , and evaluating each integral by stationary phase 02.20.05 — the critical point $x_{m}$ solves $f^{'} (x_{m}) = m$ — gives the $B$ -process transformation $$ \sum_{a < n \le b} e(f(n)) = \sum_{f'(a) \le m \le f'(b)} \frac{e\big(f(x_m) - m x_m + \tfrac18 \operatorname{sgn} f''\big)}{\sqrt{|f''(x_m)|}} + O\big(\log(2 + |f'(b) - f'(a)|)\big) + O\big(\text{endpoint terms}\big). $$ The new sum is the dual sum: its phase is the Legendre transform $g (m) = f (x_{m}) - m x_{m}$ of $f$ , and its length is $∣ f^{'} (b) - f^{'} (a) ∣$ , the range of slopes.

Definition (exponent pair). A pair of real numbers $(κ, λ)$ with $0 \leq κ \leq \frac{1}{2} \leq λ \leq 1$ is an exponent pair if, for every smooth phase $f$ on $[N, 2 N]$ obeying the standard derivative bounds $f^{(j)} (x) ≍ F / N^{j}$ (the model being $f (x) = T g (x / N)$ with $F = T$ ), one has $$ \sum_{N < n \le 2N} e(f(n)) \ll \Big(\frac{F}{N}\Big)^{\kappa} N^{\lambda}, $$ with the implied constant uniform over the class of phases. The amplitude $F / N ≍ f^{'}$ measures how fast $f$ turns; $κ$ is the exponent of that amplitude and $λ$ the exponent of the length.

Definition ( $A$ and $B$ operators on pairs). The $A B$ -process acts on exponent pairs by $$ A(\kappa, \lambda) = \Big( \frac{\kappa}{2\kappa + 2},\ \frac{\kappa + \lambda + 1}{2\kappa + 2} \Big), \qquad B(\kappa, \lambda) = \Big( \lambda - \tfrac12,\ \kappa + \tfrac12 \Big). $$ Starting from the base pair $(0, 1)$ — the elementary bound $∣ S ∣ \leq N$ — and from $B (0, 1) = (\frac{1}{2}, \frac{1}{2})$ , repeated application of $A$ and $B$ generates the classical exponent pairs. The operator $B$ is an involution ( $B^{2} = id$ on pairs), corresponding to applying the $B$ -process twice and returning to the original sum.

Counterexamples to common slips

"The $B$ -process always shortens the sum." It shortens the sum precisely when the slope range $∣ f^{'} (b) - f^{'} (a) ∣ = F$ is smaller than the length $N$ , i.e. when $F < N$ . When $F > N$ the dual sum is longer, and one applies $A$ first to reduce $F$ . The art of the method is sequencing $A$ and $B$ so each is applied in its favourable regime.
"An exponent pair must have $κ + λ = 1$ ." Only the base pair $(0, 1)$ and the first $B$ -image $(1/2, 1/2)$ satisfy $κ + λ = 1$ . General pairs satisfy $κ + λ \leq$ varying bounds; the quantity minimised for $ζ (1/2 + i t)$ is $κ + λ$ over the convex hull of attainable pairs, but it is not constant.
"Stationary phase needs a polynomial phase." The $B$ -process applies to any smooth $f$ with $f^{''}$ of one sign and bounded away from $0$ and $\infty$ on the range; the stationary-phase lemma of 02.20.05 requires only a non-degenerate critical point, not a polynomial. The phases of interest ( $lo g$ , fractional powers) are emphatically non-polynomial.

Key theorem with proof Intermediate+

The signature result is the $B$ -process exponent pair: a single application of Poisson summation and stationary phase converts the base pair into a genuine power-saving bound ^{[Graham Kolesnik 1991]}.

Theorem (the $B$ -process pair; van der Corput). Let $f$ be smooth on $[N, 2 N]$ with $f^{''} (x) ≍ λ > 0$ of constant sign, $f^{'''}$ and $f^{(4)}$ controlled by $f^{''} / N$ and $f^{''} / N^{2}$ respectively. Then $$ \sum_{N < n \le 2N} e(f(n)) \ll N, \lambda^{1/2} + \lambda^{-1/2}. $$ Equivalently, in exponent-pair language with $λ ≍ F / N^{2}$ this is the pair $(\frac{1}{2}, \frac{1}{2})$ : $\sum_{N < n \leq 2 N} e (f (n)) ≪ (F / N)^{1/2} N^{1/2} = F^{1/2}$ .

Proof. By Poisson summation 21.15.01, for a smooth cutoff $w$ adapted to $[N, 2 N]$ , $$ \sum_{n} w(n), e(f(n)) = \sum_{m \in \mathbb{Z}} J_m, \qquad J_m = \int_{\mathbb{R}} w(x), e\big(f(x) - m x\big), dx. $$ The phase of $J_{m}$ is $ϕ_{m} (x) = f (x) - m x$ with $ϕ_{m}^{'} (x) = f^{'} (x) - m$ . On $[N, 2 N]$ the derivative $f^{'}$ ranges over an interval of length $≍ N λ$ ; call its image $[f^{'} (N), f^{'} (2 N)]$ .

For $m$ outside a neighbourhood of this image, $∣ ϕ_{m}^{'} ∣ = ∣ f^{'} - m ∣ ≳ ∣ f^{'} - m ∣$ is bounded below, and $ϕ_{m}^{'}$ is monotone since $ϕ_{m}^{''} = f^{''} ≍ λ$ has constant sign. The first van der Corput lemma for integrals (the $k = 1$ case of 02.20.05) and repeated integration by parts give $∣ J_{m} ∣ ≪ (1 + dist (m, [f^{'} (N), f^{'} (2 N)]))^{- 1} \cdot$ rapidly decaying, so $\sum_{m \in / [f^{'} (N), f^{'} (2 N)]} ∣ J_{m} ∣ ≪ lo g (2 + N λ) + 1$ .

For each $m$ with $f^{'} (N) \leq m \leq f^{'} (2 N)$ the phase $ϕ_{m}$ has a unique stationary point $x_{m}$ solving $f^{'} (x_{m}) = m$ , non-degenerate because $ϕ_{m}^{''} (x_{m}) = f^{''} (x_{m}) ≍ λ \neq = 0$ . The stationary-phase formula of 02.20.05 gives $$ J_m = \frac{e\big(f(x_m) - m x_m + \tfrac18\big)}{\sqrt{f''(x_m)}}, w(x_m) + O\big(\lambda^{-1/2}\cdot (N^2\lambda)^{-1}\big), $$ the error being the second-order stationary-phase term, controlled by $f^{'''}$ and $f^{(4)}$ . Each main term has size $≍ λ^{- 1/2}$ , and there are $≍ N λ$ values of $m$ in the range. Summing the main terms with the triangle inequality bounds them by $(N λ) \cdot λ^{- 1/2} = N λ^{1/2}$ ; the accumulated stationary-phase errors are $≪ (N λ) \cdot λ^{- 1/2} (N^{2} λ)^{- 1} = λ^{- 1/2} / N \leq λ^{- 1/2}$ ; and the off-range tail contributes $O (lo g)$ . Collecting, $$ \Big| \sum_{N < n \le 2N} e(f(n)) \Big| \ll N\lambda^{1/2} + \lambda^{-1/2} + \log(2 + N\lambda) \ll N\lambda^{1/2} + \lambda^{-1/2}, $$ the logarithm absorbed when $λ$ is in the nondegenerate range $N^{- 2} ≪ λ ≪ 1$ . Writing $λ ≍ F / N^{2}$ gives $N λ^{1/2} ≍ F^{1/2}$ , i.e. $≪ (F / N)^{1/2} N^{1/2}$ , the pair $(\frac{1}{2}, \frac{1}{2})$ . $□$

Bridge. The $B$ -process pair builds toward the entire exponent-pair calculus, and it appears again in the application to $ζ (1/2 + i t)$ below, where $(\frac{1}{2}, \frac{1}{2})$ already yields the convexity bound. The foundational reason the method works is that Poisson summation is dual to stationary phase: summation over positions becomes summation over slopes, and the weight $λ^{- 1/2}$ is exactly the stationary-phase normalisation $∣ f^{''} ∣^{- 1/2}$ of 02.20.05. This is exactly the sum-side incarnation of the integral van der Corput lemma — where the integral lemma gives $λ^{- 1/2}$ decay for one oscillatory integral, the $B$ -process gives a dual sum of terms each of that size. The $A$ -process generalises Weyl differencing from 21.15.02 to non-polynomial phases, and the central insight is that the two operators $A$ and $B$ on the $(κ, λ)$ region are dual to each other: $A$ trades length for amplitude by differencing, $B$ swaps amplitude and length by the Legendre transform. Putting these together, every word in the language of exponent pairs is a sequence of $A$ 's and $B$ 's applied to $(0, 1)$ , and the best bound for any particular phase is the optimal such word.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Compute $A (0, 1)$ and then $B (A (0, 1))$ using $A (κ, λ) = (\frac{κ}{2 κ + 2}, \frac{κ + λ + 1}{2 κ + 2})$ and $B (κ, λ) = (λ - \frac{1}{2}, κ + \frac{1}{2})$ . Identify the resulting pair.

Hint

Apply $A$ to $(0, 1)$ first; the result has $κ = 0$ . Then apply $B$ .

Answer

First $A (0, 1) = (\frac{0}{2}, \frac{0 + 1 + 1}{2}) = (0, 1)$ — the $A$ -operator fixes the base pair, because differencing a sum that is already at the elementary bound gains nothing. So instead start from the basic pair $(\frac{1}{2}, \frac{1}{2}) = B (0, 1)$ and apply $A$ : $$ A(\tfrac12, \tfrac12) = \Big( \frac{1/2}{2\cdot\frac12 + 2},\ \frac{\frac12 + \frac12 + 1}{2\cdot\frac12 + 2} \Big) = \Big( \frac{1/2}{3},\ \frac{2}{3} \Big) = \Big( \tfrac16, \tfrac23 \Big). $$ Thus $A B (0, 1) = (\frac{1}{6}, \frac{2}{3})$ , the pair behind the $t^{1/6 + ε}$ zeta bound. The lesson is that $A$ must be applied after a $B$ has created a power-saving pair; on the base pair $A$ is idle.

Exercise 5 (medium, numeric).

The Gauss circle problem error term $P (x) = # {(m, n) : m^{2} + n^{2} \leq x} - π x$ is bounded by exponent pairs via $P (x) ≪ x^{κ / (2 κ - 2 λ + 2)}$ in the simplest reading; van der Corput's $(\frac{1}{6}, \frac{2}{3})$ gives the classical exponent $1/3$ . State the resulting bound $P (x) ≪ x^{θ}$ with $θ = 1/3$ as a decimal exponent, and compare with the elementary exponent $θ = 1/2$ .

Hint

$1/3$ as a decimal is $0.333 \dots$ ; $1/2$ is $0.5$ .

Answer

$θ = 1/3 \approx 0.333$ . Van der Corput's method gives $P (x) ≪ x^{1/3 + ε}$ for the circle-problem error, improving the elementary $P (x) ≪ x^{1/2}$ (which follows from counting boundary lattice points in an annulus of width $1$ ). The exponent $1/3$ came from van der Corput's 1922 treatment of the divisor problem and transfers to the circle problem; the conjectured truth is $θ = 1/4 + ε$ (Hardy), and the current record is near $θ = 131/416 \approx 0.3149$ (Huxley). The decimal comparison $0.333 < 0.5$ quantifies the saving the method delivers.

Exercise 6 (medium, symbolic).

For the zeta phase $f (x) = - \frac{t}{2 π} lo g x$ on $[N, 2 N]$ with $N ≍ t$ , compute $f^{''}$ and the amplitude $F = N ∣ f^{''} ∣ \cdot N$ , and explain why $N ≍ t$ is the critical length.

Hint

Differentiate $f$ twice. Then $f^{''} ≍ t / N^{2}$ , so the amplitude $F ≍ N^{2} ∣ f^{''} ∣$ . Set $F ≍ N$ to find the balance.

Answer

Differentiating, $f^{'} (x) = - \frac{t}{2 π x}$ and $f^{''} (x) = \frac{t}{2 π x ^{2}} ≍ t / N^{2}$ on $[N, 2 N]$ . The amplitude is $F = N^{2} ∣ f^{''} ∣ ≍ t$ , so $F / N ≍ t / N$ . The $B$ -process is favourable when the slope range $F = N^{2} ∣ f^{''} ∣ ≍ t$ relates to the length $N$ : the dual sum has length $∣ f^{'} (2 N) - f^{'} (N) ∣ ≍ t / N$ . Balancing the original length $N$ against the dual length $t / N$ gives $N ≍ t / N$ , i.e. $N ≍ t$ . At this critical length the sum and its dual have comparable size, and the approximate functional equation for $ζ (1/2 + i t)$ indeed has main sum of length $≍ t$ . This is why the zeta sum is studied at $N ≍ t$ : it is the self-dual length where the $B$ -process is exactly balanced.

Exercise 7 (hard, symbolic).

Prove the $A$ -process inequality in the clean form $∣ S ∣^{2} \leq \frac{N + H}{H} \sum_{∣ h ∣ < H} (1 - \frac{∣ h ∣}{H}) S_{h}$ , where $S = \sum_{a < n \leq b} e (f (n))$ , $N = b - a$ , and $S_{h} = \sum_{n \in I_{h}} e (f (n + h) - f (n))$ , starting from the identity $H S = \sum_{0 \leq r < H} \sum_{n} e (f (n + r))$ (a sum over shifted copies).

Hint

Write $H S = \sum_{n} \sum_{0 \leq r < H} e (f (n + r))$ after re-indexing, apply Cauchy-Schwarz in $n$ over a range of length $N + H$ , then expand the square and substitute $h = r - r^{'}$ .

Answer

Extend $f$ and pad with zero terms so that $a (n) = e (f (n))$ for $a < n \leq b$ and $a (n) = 0$ otherwise. For each integer $n$ define $b (n) = \sum_{0 \leq r < H} a (n + r)$ . Summing over $n$ in a window of length $N + H$ that contains all nonzero $a (n + r)$ , each $a (k)$ is counted $H$ times, so $\sum_{n} b (n) = H \sum_{k} a (k) = H S$ . By Cauchy-Schwarz over the $\leq N + H$ values of $n$ for which $b (n) \neq = 0$ , $$ |H S|^2 = \Big| \sum_n b(n) \Big|^2 \le (N + H) \sum_n |b(n)|^2. $$ Expand $∣ b (n) ∣^{2} = \sum_{0 \leq r, r^{'} < H} a (n + r) \overline{a (n + r^{'})}$ and sum over $n$ . Setting $h = r - r^{'}$ and summing over $n$ first, the diagonal-shifted sum is $\sum_{n} a (n + r) \overline{a (n + r^{'})} = \sum_{k} a (k + h) \overline{a (k)} = S_{h}$ , depending only on $h$ . The number of pairs $(r, r^{'})$ with $r - r^{'} = h$ and $0 \leq r, r^{'} < H$ is $H - ∣ h ∣$ . Hence $$ \sum_n |b(n)|^2 = \sum_{|h| < H} (H - |h|), S_h = H \sum_{|h|<H}\Big(1 - \tfrac{|h|}{H}\Big) S_h. $$ Combining, $H^{2} ∣ S ∣^{2} \leq (N + H) H \sum_{∣ h ∣ < H} (1 - \frac{∣ h ∣}{H}) S_{h}$ , and dividing by $H^{2}$ gives the stated inequality. The triangular weight $1 - ∣ h ∣/ H$ is the Fejér kernel coming from the overlap count of the shifts, and $S_{h} = \sum_{n \in I_{h}} e (f (n + h) - f (n))$ has the differenced phase whose derivative is smaller by $≍ h f^{''}$ . $□$

Exercise 8 (hard, symbolic).

Derive the bound $ζ (\frac{1}{2} + i t) ≪ t^{1/6 + ε}$ from the exponent pair $(\frac{1}{6}, \frac{2}{3})$ , using the approximate functional equation $ζ (\frac{1}{2} + i t) = \sum_{n \leq t /2 π} n^{- 1/2 - i t} + O (t^{- 1/4})$ and dyadic decomposition.

Hint

Split the sum into dyadic blocks $N < n \leq 2 N$ with $N \leq t$ . On each block write $n^{- 1/2 - i t} = n^{- 1/2} e (f (n))$ with $f (n) = - \frac{t}{2 π} lo g n$ , apply the exponent pair to $\sum e (f (n))$ , then sum the blocks by partial summation against $n^{- 1/2}$ .

Answer

By the approximate functional equation, $∣ ζ (\frac{1}{2} + i t) ∣ \leq \sum_{n \leq X} n^{- 1/2 - i t} + O (t^{- 1/4})$ with $X = t /2 π$ . Decompose the range $[1, X]$ into $O (lo g t)$ dyadic blocks $(N, 2 N]$ with $N \leq X$ . On each block, $n^{- 1/2 - i t} = n^{- 1/2} e (f (n))$ , $f (n) = - \frac{t}{2 π} lo g n$ . By partial (Abel) summation against the smooth factor $n^{- 1/2}$ , $$ \sum_{N < n \le 2N} n^{-1/2} e(f(n)) \ll N^{-1/2} \max_{N < M \le 2N}\Big| \sum_{N < n \le M} e(f(n)) \Big|. $$ The phase obeys $f^{(j)} (x) ≍ F / N^{j}$ with $F = t / N$ (since $f^{'} ≍ t / N$ ), so the exponent pair $(κ, λ) = (\frac{1}{6}, \frac{2}{3})$ gives $\sum_{N < n \leq M} e (f (n)) ≪ (F / N)^{κ} N^{λ} = (t / N^{2})^{1/6} N^{2/3}$ . Therefore $$ \sum_{N<n\le 2N} n^{-1/2} e(f(n)) \ll N^{-1/2} (t/N^2)^{1/6} N^{2/3} = t^{1/6} N^{-1/2 - 1/3 + 2/3} = t^{1/6} N^{-1/6}. $$ Summing over the $O (lo g t)$ dyadic blocks with $1 \leq N \leq X = t^{1/2}$ , the largest contribution is at the smallest $N ≍ 1$ , giving $≪ t^{1/6} lo g t$ . (More carefully, $\sum_{N dyadic} t^{1/6} N^{- 1/6}$ converges and is dominated by its first term up to the $lo g$ .) Hence $ζ (\frac{1}{2} + i t) ≪ t^{1/6} lo g t ≪ t^{1/6 + ε}$ , the convexity-breaking bound. The exponent $1/6$ is precisely $(κ + λ) /2 - 1/4$ for $(\frac{1}{6}, \frac{2}{3})$ , confirming Exercise 2.

Advanced results Master

The exponent-pair region and optimisation

The set of attainable exponent pairs is the closure under $A$ and $B$ of ${(0, 1)}$ , together with the convexity property that the attainable region is convex ^{[Graham Kolesnik 1991]}. The operators generate a countable dense family of vertices; for the divisor and zeta problems one minimises a linear functional of $(κ, λ)$ over this region. For $ζ (\frac{1}{2} + i t) ≪ t^{θ + ε}$ with $θ = \frac{1}{2} in f (κ + λ)$ over attainable pairs, van der Corput's processes alone give $θ = \frac{1}{6}$ from $(\frac{1}{6}, \frac{2}{3}) = A B (0, 1)$ ; the longer words $A^{k} B$ and their conjugates yield the van der Corput exponent pairs $(\frac{1}{2 ^{k + 1} - 2}, 1 - \frac{k}{2 ^{k + 1} - 2})$ , whose infimum of $κ + λ$ still does not beat $\frac{1}{6}$ for the zeta function but does improve the divisor exponent. Rankin computed the optimal van der Corput exponent $θ = 0.3290 \dots$ for the divisor problem from this region. The barrier $\frac{1}{6}$ for zeta is broken only by stepping outside the $A B$ -process: the Bombieri-Iwaniec method (1986) and Huxley's discrete Hardy-Littlewood refinements reach $θ \leq 131/416$ .

The Dirichlet divisor and Gauss circle problems

Let $D (x) = \sum_{n \leq x} d (n) = x lo g x + (2 γ - 1) x + Δ (x)$ and $# {(m, n) : m^{2} + n^{2} \leq x} = π x + P (x)$ ^{[Huxley 1996]}. Both error terms reduce, via the hyperbola method and Voronoi summation 21.15.01, to exponential sums $\sum_{n \leq M} ψ (x / n)$ and $\sum e (2 x n)$ with phase $f (n) = 2 x n$ , $f^{''} (n) ≍ x n^{- 3/2}$ . Applying an exponent pair $(κ, λ)$ yields $Δ (x), P (x) ≪ x^{\frac{κ + λ}{2 ( 1 + κ )}}$ up to $ε$ -powers; the van der Corput pair $(\frac{1}{6}, \frac{2}{3})$ gives the classical $x^{1/3}$ , sharpening Dirichlet's elementary $x^{1/2}$ and Voronoi's $x^{1/3}$ obtained by his summation formula. The conjectured exponent is $\frac{1}{4}$ , equivalent to the optimal cancellation $Δ (x) ≪ x^{1/4 + ε}$ ; the $Ω$ -results of Hardy show $\frac{1}{4}$ cannot be improved.

Iterated processes and the $A$ -process gain

The general bound from a word $w = w_{1} \dots w_{r}$ in ${A, B}$ applied to $(0, 1)$ is sharp for the phase class it was tuned to, but the same phase may admit a better word. The $A$ -process improves a pair when differencing the phase produces a flatter sum: applied to $(κ, λ)$ it returns $(\frac{κ}{2 κ + 2}, \frac{κ + λ + 1}{2 κ + 2})$ , which decreases $κ$ (good for small-amplitude phases) at the cost of raising $λ$ toward $1$ . Because $κ$ and $λ$ trade off, the optimal word depends on the regime $F = N^{c}$ : for the zeta-type phase $c = 2$ the optimum among $A B$ -words is $A B$ , while for the divisor phase $c = 1/2$ the optimum is a longer alternation. The exponent-pair conjecture asserts that $(ε, \frac{1}{2} + ε)$ is attainable for every $ε > 0$ , which would give $θ = \frac{1}{4} + ε$ for both the zeta and divisor problems; it remains open and is known to require methods beyond $A$ and $B$ .

Synthesis. Van der Corput's method, exponent pairs, the zeta growth problem, and the divisor and circle problems are one circle of ideas, and the bridge is the duality between a sum and its Legendre-transformed dual. The foundational reason the processes terminate in a power saving is that the $B$ -process is exactly Poisson summation from 21.15.01 composed with the stationary-phase lemma of 02.20.05: summation over positions becomes summation over slopes, the weight $∣ f^{''} ∣^{- 1/2}$ is the stationary-phase normalisation, and this is dual to the $A$ -process, which generalises the Weyl differencing of 21.15.02 from polynomial to smooth phases. Putting these together, every classical bound is a word in $A$ and $B$ applied to the base pair $(0, 1)$ , the central insight being that $A$ trades length for amplitude and $B$ swaps them by the Legendre transform; the convexity bound $ζ (\frac{1}{2} + i t) ≪ t^{1/6 + ε}$ is exactly $A B (0, 1)$ , and the $x^{1/3}$ divisor and circle exponents are the same calculus read on the phase $2 x n$ . The exponent-pair conjecture $θ = \frac{1}{4}$ generalises the whole method to its conjectural limit, and the modern Bombieri-Iwaniec and decoupling methods are the central insight pushed past the $A, B$ -closure into the geometry of the dual sum's spacing.

Full proof set Master

Proposition 1 (first-derivative test; van der Corput's lemma for sums). Let $f$ be real on $[a, b]$ with $f^{'}$ monotone and $∥ f^{'} (x) ∥ \geq δ > 0$ for all $x \in [a, b]$ (the distance to the nearest integer of $f^{'}$ stays $\geq δ$ ). Then $$ \Big| \sum_{a < n \le b} e(f(n)) \Big| \ll \frac{1}{\delta}. $$

Proof. By Euler-Maclaurin / partial summation against the integral, or directly by the truncated Poisson summation of 21.15.01, $\sum_{a < n \leq b} e (f (n)) = \sum_{m} \int_{a}^{b} e (f (x) - m x) d x + O (1)$ , where only $m$ with $m \in [f^{'} (a), f^{'} (b)]$ contribute a main term. The hypothesis $∥ f^{'} ∥ \geq δ$ means $f^{'}$ avoids a $δ$ -neighbourhood of every integer, so for each integer $m$ the integrand phase $f (x) - m x$ has derivative $f^{'} (x) - m$ of absolute value $\geq δ$ and monotone. The first van der Corput integral lemma (02.20.05, case $k = 1$ ) gives $∣ \int_{a}^{b} e (f (x) - m x) d x ∣ ≪ 1/∥ f^{'} - m ∥$ -type control; summing the geometric-like decay over $m$ produces $≪ \sum_{j \geq 0} (j + δ)^{- 1}$ truncated, which is $≪ δ^{- 1}$ once the monotonicity of $f^{'}$ bounds the number of relevant $m$ by $O (1 + ∣ f^{'} (b) - f^{'} (a) ∣)$ and the tails decay. Hence $∣ S ∣ ≪ δ^{- 1}$ . $□$

Proposition 2 (second-derivative test). Let $f$ be real on $[a, b]$ with $λ \leq ∣ f^{''} (x) ∣ \leq α λ$ for constants and $f^{''}$ of constant sign. Then $$ \Big| \sum_{a < n \le b} e(f(n)) \Big| \ll (b - a),\lambda^{1/2} + \lambda^{-1/2}. $$

Proof. This is the conclusion of the $B$ -process key theorem, here stated for a general interval. Since $f^{''}$ has constant sign, $f^{'}$ is strictly monotone, so the $B$ -process applies: Poisson summation 21.15.01 reduces $S$ to the dual sum $\sum_{f^{'} (a) \leq m \leq f^{'} (b)} ∣ f^{''} (x_{m}) ∣^{- 1/2} e (g (m))$ over stationary points $x_{m}$ solving $f^{'} (x_{m}) = m$ , plus $O (lo g)$ . The number of integers $m$ in $[f^{'} (a), f^{'} (b)]$ is $≪ 1 + ∣ f^{'} (b) - f^{'} (a) ∣ ≪ 1 + (b - a) λ$ , since $f^{''}$ ranges in $[λ, α λ]$ . Each dual term has modulus $∣ f^{''} (x_{m}) ∣^{- 1/2} ≍ λ^{- 1/2}$ . Bounding the dual sum directly by its length times the term size, $$ |S| \ll \big(1 + (b-a)\lambda\big)\lambda^{-1/2} + (\text{errors}) \ll (b-a)\lambda^{1/2} + \lambda^{-1/2}, $$ the stationary-phase and endpoint errors being lower order as in the key theorem. $□$

Proposition 3 (the basic exponent pair from the second-derivative test). The pair $(\frac{1}{2}, \frac{1}{2})$ is an exponent pair: for $f$ on $[N, 2 N]$ with $f^{(j)} ≍ F / N^{j}$ , $\sum_{N < n \leq 2 N} e (f (n)) ≪ (F / N)^{1/2} N^{1/2}$ .

Proof. Here $f^{''} ≍ F / N^{2} =: λ$ , of constant sign on the dyadic block (shrinking the block if necessary so $f^{''}$ does not vanish). Proposition 2 with $b - a = N$ gives $∣ S ∣ ≪ N λ^{1/2} + λ^{- 1/2} = N (F / N^{2})^{1/2} + (F / N^{2})^{- 1/2} = (F N)^{1/2} / N^{1/2} \cdot N^{1/2} \dots$ ; explicitly $N λ^{1/2} = N (F / N^{2})^{1/2} = F^{1/2} = (F / N)^{1/2} N^{1/2}$ , and $λ^{- 1/2} = (N^{2} / F)^{1/2} = N / F^{1/2} \leq N^{1/2} (N / F)^{1/2}$ . In the standard range $1 \leq F \leq N^{2}$ where the phase genuinely oscillates and is the regime exponent pairs are defined for, the first term dominates and $∣ S ∣ ≪ (F / N)^{1/2} N^{1/2}$ . This is the assertion that $(κ, λ) = (\frac{1}{2}, \frac{1}{2})$ is an exponent pair, matching $B (0, 1)$ . $□$

Proposition 4 ( $A B (0, 1) = (\frac{1}{6}, \frac{2}{3})$ and the $t^{1/6}$ bound are equivalent inputs). Applying the $A$ -operator to $(\frac{1}{2}, \frac{1}{2})$ gives $(\frac{1}{6}, \frac{2}{3})$ , and this pair yields $ζ (\frac{1}{2} + i t) ≪ t^{1/6 + ε}$ .

Proof. The first claim is the computation $A (\frac{1}{2}, \frac{1}{2}) = (\frac{1/2}{3}, \frac{2}{3}) = (\frac{1}{6}, \frac{2}{3})$ of Exercise 3. For the second, the exponent-pair bound on the zeta sum $\sum_{N < n \leq 2 N} n^{- 1/2 - i t}$ at the critical length $N ≍ t$ , summed dyadically against $n^{- 1/2}$ by partial summation as in Exercise 8, gives $ζ (\frac{1}{2} + i t) ≪ t^{(κ + λ) /2 - 1/4} lo g t$ . Substituting $(κ, λ) = (\frac{1}{6}, \frac{2}{3})$ makes the exponent $(\frac{1}{6} + \frac{2}{3}) /2 - \frac{1}{4} = \frac{5}{12} - \frac{1}{4} = \frac{1}{6}$ , so $ζ (\frac{1}{2} + i t) ≪ t^{1/6} lo g t ≪ t^{1/6 + ε}$ . $□$

Connections Master

The $A$ -process is the smooth-phase generalisation of the Weyl differencing inequality of 21.15.02: where Weyl differencing reduces a degree- $k$ polynomial sum to degree- $(k - 1)$ sums, the $A$ -process reduces a smooth sum to averaged difference sums with a free shift length, and both are Cauchy-Schwarz applied to shifted copies of the same character sum.

The $B$ -process is Poisson summation from 21.15.01 composed with the method of stationary phase from 02.20.05; the dual sum it produces is indexed by the slopes $m = f^{'} (x_{m})$ , exactly the transform side of the Poisson formula, and each term carries the stationary-phase weight $∣ f^{''} (x_{m}) ∣^{- 1/2}$ that the oscillatory-integral lemma of 02.20.05 supplies.

The $t^{1/6 + ε}$ bound for $ζ (1/2 + i t)$ proved here is the first convexity-breaking estimate feeding the growth and zero-density theory of the Riemann zeta function in 21.03.01; subconvexity for $ζ$ and for general $L$ -functions continues this line, with the complete exponential sums of 21.15.04 supplying the arithmetic input on the major-arc side that complements the smooth-phase analysis here.

Historical & philosophical context Master

Johannes Gualtherus van der Corput introduced his method in two papers in the Mathematische Annalen: the 1921 Zahlentheoretische Abschätzungen ^{[van der Corput 1921]}, which set out the derivative tests and the differencing and Poisson-summation passages, and the 1922 Verschärfung der Abschätzung beim Teilerproblem ^{[van der Corput 1922]}, which iterated the processes to improve the Dirichlet divisor exponent below Voronoi's $1/3$ . The two operations were later christened the $A$ - and $B$ -processes, and the bookkeeping of their effect on the bound was systematised into the theory of exponent pairs by Phillips, Rankin, and others in the 1930s-1950s; Rankin found the optimal van der Corput exponent for the divisor problem by linear programming over the attainable region.

The method's first triumph was the bound $ζ (\frac{1}{2} + i t) ≪ t^{1/6} lo g t$ , breaking the convexity exponent $\frac{1}{4}$ inherited from the functional equation and the elementary estimate. It stood essentially unchallenged in its conceptual form until the Bombieri-Iwaniec method of 1986 introduced the spacing of the dual sum's terms as a new resource, and Huxley's discrete Hardy-Littlewood method refined this to the current $θ \leq 131/416$ ^{[Huxley 1996]}. Graham and Kolesnik's 1991 monograph remains the systematic account of the classical $A, B$ -calculus. The method is the discrete shadow of the stationary-phase analysis of oscillatory integrals: van der Corput himself stated the integral lemmas alongside the sum estimates, and the modern view that exponent pairs are words in two dual operations on the $(κ, λ)$ region dates to this systematisation.

Bibliography Master

@article{vandercorput1921,
  author  = {van der Corput, J. G.},
  title   = {Zahlentheoretische Absch{\"a}tzungen},
  journal = {Mathematische Annalen},
  volume  = {84},
  pages   = {53--79},
  year    = {1921}
}

@article{vandercorput1922,
  author  = {van der Corput, J. G.},
  title   = {Versch{\"a}rfung der Absch{\"a}tzung beim Teilerproblem},
  journal = {Mathematische Annalen},
  volume  = {87},
  pages   = {39--65},
  year    = {1922}
}

@book{grahamkolesnik1991,
  author    = {Graham, S. W. and Kolesnik, G.},
  title     = {Van der Corput's Method of Exponential Sums},
  series    = {London Mathematical Society Lecture Note Series},
  volume    = {126},
  publisher = {Cambridge University Press},
  year      = {1991}
}

@book{iwaniec-kowalski2004,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004}
}

@book{titchmarsh1986,
  author    = {Titchmarsh, E. C.},
  title     = {The Theory of the Riemann Zeta-Function},
  edition   = {2},
  note      = {Revised by D. R. Heath-Brown},
  publisher = {Oxford University Press},
  year      = {1986}
}

@book{huxley1996,
  author    = {Huxley, M. N.},
  title     = {Area, Lattice Points, and Exponential Sums},
  series    = {London Mathematical Society Monographs},
  volume    = {13},
  publisher = {Oxford University Press},
  year      = {1996}
}

Prerequisites

21.15.02
21.15.01
02.20.05

Tier anchors

beginner: Stein-Shakarchi 2003 *Fourier Analysis: An Introduction* (Princeton Lectures in Analysis I) Ch. 4-5 (summation formulae and exponential sums informally); Steuding 2005 *An Introduction to the Theory of L-functions* lecture notes Ch. on exponential sums (the van der Corput estimates motivated through the zeta function)
intermediate: Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8 §8.2-8.4 (van der Corput's $A$- and $B$-processes and exponent pairs); Graham-Kolesnik 1991 *Van der Corput's Method of Exponential Sums* (LMS Lecture Note Series 126) Ch. 2-3; Titchmarsh 1986 *The Theory of the Riemann Zeta-Function* (Oxford, 2nd ed., rev. Heath-Brown) Ch. 5 (the van der Corput estimates for $\zeta(1/2+it)$)
master: van der Corput 1921 *Math. Ann.* 84, 53-79 and 1922 *Math. Ann.* 87, 39-65 (Zahlentheoretische Abschätzungen and Verschärfung der Abschätzung beim Teilerproblem); Graham-Kolesnik 1991 *Van der Corput's Method of Exponential Sums* (LMS LNS 126) Ch. 1-5 (the $A$- and $B$-processes, exponent pairs, the $(\kappa,\lambda)$ region); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8; Titchmarsh 1986 *The Theory of the Riemann Zeta-Function* (Oxford) Ch. 5; Huxley 1996 *Area, Lattice Points, and Exponential Sums* (Oxford) Ch. 1-3, 17 (the discrete Hardy-Littlewood method and the divisor / circle problems)

References

van der Corput, J. G. — Zahlentheoretische Abschätzungen · *Mathematische Annalen* 84 (1921), 53-79. The derivative-test estimates for exponential sums, the differencing process ($A$-process), and the Poisson-summation passage to a dual sum ($B$-process); the first systematic method for $\sum_{n} e(f(n))$ with a smooth phase $f$.
van der Corput, J. G. — Verschärfung der Abschätzung beim Teilerproblem · *Mathematische Annalen* 87 (1922), 39-65. The improved exponent in the Dirichlet divisor problem via iterated $A$- and $B$-processes; the prototype of the exponent-pair recursion.
Graham, S. W. & Kolesnik, G. — Van der Corput's Method of Exponential Sums · London Mathematical Society Lecture Note Series 126, Cambridge University Press (1991), Ch. 1-5. The $A$-process (Weyl-van der Corput inequality), the $B$-process (Poisson summation plus stationary phase), the formalism of exponent pairs $(\kappa,\lambda)$, the $AB$-recursion $A(\kappa,\lambda)=(\tfrac{\kappa}{2\kappa+2},\tfrac{\kappa+\lambda+1}{2\kappa+2})$ and $B(\kappa,\lambda)=(\lambda-\tfrac12,\kappa+\tfrac12)$, and applications to $\zeta(1/2+it)$ and the divisor problem.
Iwaniec, H. & Kowalski, E. — Analytic Number Theory · American Mathematical Society Colloquium Publications 53 (2004), Ch. 8 §8.2-8.4. Exponential sums with smooth phases, the $A$- and $B$-processes, exponent pairs, and the bound $\zeta(1/2+it)\ll t^{1/6+\varepsilon}$.
Titchmarsh, E. C. — The Theory of the Riemann Zeta-Function · Oxford University Press, 2nd edition revised by D. R. Heath-Brown (1986), Ch. 5. The approximate functional equation, van der Corput's estimates, and the convexity-breaking bound $\zeta(1/2+it)\ll t^{1/6}\log t$.
Huxley, M. N. — Area, Lattice Points, and Exponential Sums · London Mathematical Society Monographs 13, Oxford University Press (1996), Ch. 1-3, 17. The lattice-point / circle and divisor problems, the discrete Hardy-Littlewood method, and the modern $\theta\le 131/416$ exponent for $\zeta(1/2+it)$.

Estimated time

beginner: 20m
intermediate: 55m
master: 90m

Intuition Beginner

Visual Beginner

Worked example Beginner

Check your understanding Beginner

Formal definition Intermediate+

Counterexamples to common slips

Key theorem with proof Intermediate+

Exercises Intermediate+

Advanced results Master

The exponent-pair region and optimisation

The Dirichlet divisor and Gauss circle problems

Iterated processes and the A-process gain

Full proof set Master

Connections Master

Historical & philosophical context Master

Bibliography Master

Iterated processes and the $A$ -process gain