21.15.05 · number-theory / exponential-sums

The Vinogradov Mean Value Theorem

shipped3 tiersLean: none

Anchor (Master): Bourgain-Demeter-Guth 2016 *Annals of Mathematics* 184, 633-682 (Proof of the main conjecture in Vinogradov's mean value theorem for degrees higher than three, via $\ell^2$ decoupling for the moment curve); Wooley 2016 *Proc. London Math. Soc.* 118, 942-1016 (Nested efficient congruencing and the main conjecture); Demeter 2020 *Fourier Restriction, Decoupling, and Applications* (Cambridge Studies in Advanced Mathematics 184) Ch. 11-13 (decoupling for the moment curve and the Vinogradov system); Vinogradov 1935 *Trav. Inst. Math. Stekloff* 10 (the original mean value method); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8

Intuition Beginner

Pick a length $N$ and write down every whole number from $1$ to $N$ . Now choose $s$ of them (repeats allowed) for a left team and another $s$ for a right team. Ask the two teams to tie on a whole list of scores at once: the sum of the numbers must match, and the sum of their squares must match, and the sum of their cubes, all the way up to the sum of their $k$ -th powers. How many ways can the two teams be picked so that every one of these $k$ tallies comes out equal? That count is the Vinogradov mean value $J_{s, k} (N)$ , and the whole subject is the size of this number.

There is an easy way to force a tie: make the right team a rearrangement of the left team. Then every tally matches automatically, because you are adding the same numbers in a different order. These "diagonal" ties are cheap and plentiful, and they set a floor on $J_{s, k} (N)$ . The deep question is whether there are many more ties than the diagonal ones — whether the simultaneous equations have lots of genuinely different solutions, or whether the diagonal solutions are essentially all of them.

Why care about a counting puzzle? Because this single count controls how much a high-degree wave-sum can fail to cancel. The wave-sums built from $e (α_{1} n + \dots + α_{k} n^{k})$ that resisted van der Corput's method for large $k$ are governed exactly by this team-tying count, and knowing the count is small unlocks the sharpest bounds on those sums and on where the Riemann zeta function can grow.

Visual Beginner

Think of each number you pick as a point, and the matching conditions as a stack of balance scales — one scale for the plain sums, one for the squares, one for the cubes, and so on up to the $k$ -th powers. A choice of two teams is a solution only when every scale balances at once.

   the system to satisfy (k balance conditions)

   sum of values     [ left ]======[ right ]   balanced?
   sum of squares     [ left ]======[ right ]   balanced?
   sum of cubes       [ left ]======[ right ]   balanced?
        ...                  ...        ...
   sum of k-th powers [ left ]======[ right ]   balanced?

   diagonal tie: right = a shuffle of left  -->  ALL scales balance for free
   extra ties:  genuinely different teams that still balance every scale

The picture makes the dichotomy visible: the diagonal ties cost nothing and there are about $N^{s}$ of them; the theorem asks whether the genuinely different ties are also held down near that floor (plus a second floor coming from the larger pool of numbers when $s$ is small). The main conjecture says yes — there are essentially no surprises beyond the two obvious floors.

Worked example Beginner

We count solutions of the simplest interesting Vinogradov system by hand: $k = 2$ (match sums and sums of squares), $s = 2$ (two on each team), drawn from ${1, 2, 3}$ , so $N = 3$ . We want pairs $(x_{1}, x_{2})$ and $(y_{1}, y_{2})$ with $x_{1} + x_{2} = y_{1} + y_{2}$ and $x_{1}^{2} + x_{2}^{2} = y_{1}^{2} + y_{2}^{2}$ .

Step 1. Fix the left team and compute its two tallies. For $(x_{1}, x_{2})$ the pair of tallies is (sum, sum of squares). For example $(1, 2)$ gives $(3, 5)$ , and $(1, 3)$ gives $(4, 10)$ .

Step 2. List every left team and its tally pair. The ordered pairs from ${1, 2, 3}$ and their (sum, sum-of-squares): $(1, 1) \to (2, 2)$ ; $(1, 2) \to (3, 5)$ ; $(1, 3) \to (4, 10)$ ; $(2, 1) \to (3, 5)$ ; $(2, 2) \to (4, 8)$ ; $(2, 3) \to (5, 13)$ ; $(3, 1) \to (4, 10)$ ; $(3, 2) \to (5, 13)$ ; $(3, 3) \to (6, 18)$ .

Step 3. Group teams by their tally pair, since a solution pairs any two teams with the same pair. The groups are: $(2, 2)$ : one team; $(3, 5)$ : two teams ${(1, 2), (2, 1)}$ ; $(4, 10)$ : two teams ${(1, 3), (3, 1)}$ ; $(4, 8)$ : one team; $(5, 13)$ : two teams; $(6, 18)$ : one team.

Step 4. Count ordered solution pairs. A group of size $m$ contributes $m^{2}$ solutions (pick any left team, any right team from the group). So the total is $1^{2} + 2^{2} + 2^{2} + 1^{2} + 2^{2} + 1^{2} = 1 + 4 + 4 + 1 + 4 + 1 = 15$ .

What this tells us: $J_{2, 2} (3) = 15$ . The diagonal solutions (right team a shuffle of left) number $s! \cdot$ choices in the generic case, and here they account for the bulk; the only "extra" structure is that $(1, 2)$ and $(2, 1)$ share a tally pair, which is just the shuffle again. For $k = 2$ this matching is rigid: fixing the sum and the sum of squares essentially fixes the unordered pair. That rigidity, made quantitative, is the content of the theorem.

Check your understanding Beginner

Exercise (easy, multiple choice).

The Vinogradov mean value $J_{s, k} (N)$ counts the number of ways to choose two teams of $s$ numbers each from ${1, \dots, N}$ so that:

A. The two teams have the same largest number.
B. The sum of $j$ -th powers of one team equals that of the other, for every $j$ from $1$ up to $k$ at the same time.
C. The two teams are disjoint.
D. The product of one team equals the product of the other.

Hint

The conditions are about matching sums of powers — a whole stack of them, one for each power up to $k$ .

Answer

B. $J_{s, k} (N)$ counts pairs of teams $(x_{1}, \dots, x_{s})$ and $(y_{1}, \dots, y_{s})$ whose total of $j$ -th powers agree, $x_{1}^{j} + \dots + x_{s}^{j} = y_{1}^{j} + \dots + y_{s}^{j}$ , for every $j = 1, 2, \dots, k$ at the same time. Option A fixes only the maximum; option C forbids overlap, which the definition allows; option D matches products, a different (multiplicative) condition. The defining feature is the stack of $k$ additive power-sum equations holding all at once.

Formal definition Intermediate+

Throughout write $e (t) = e^{2 π i t}$ ; the additive character $e (\cdot)$ is the one used on $R / Z$ in 21.15.02 and 21.15.03, and integrals $\int = \int_{[0, 1)^{k}}$ are over the torus $T^{k}$ with $α = (α_{1}, \dots, α_{k})$ .

Definition (the Vinogradov system and its count). Fix integers $k \geq 2$ , $s \geq 1$ , $N \geq 1$ . The Vinogradov mean value $J_{s, k} (N)$ is the number of integer solutions of the simultaneous system $$ x_1^j + \cdots + x_s^j = y_1^j + \cdots + y_s^j \qquad (1 \le j \le k), \qquad 1 \le x_i, y_i \le N. $$ Equivalently, setting $f_{k} (α; N) = \sum_{n = 1}^{N} e (α_{1} n + α_{2} n^{2} + \dots + α_{k} n^{k})$ , $$ J_{s,k}(N) = \int_{\mathbb{T}^k} \big| f_k(\boldsymbol\alpha; N) \big|^{2s}, d\boldsymbol\alpha, $$ since expanding $∣ f_{k} ∣^{2 s}$ and integrating term by term, the orthogonality $\int_{0}^{1} e (m α_{j}) d α_{j} = 1_{m = 0}$ in each variable picks out exactly the tuples satisfying all $k$ power-sum equations.

Definition (the two floors). Two families of solutions are forced. The diagonal solutions take $(y_{1}, \dots, y_{s})$ to be a permutation of $(x_{1}, \dots, x_{s})$ ; there are $≫ N^{s}$ of these (choose the $x_{i}$ freely; generic tuples have $s!$ permutations). Independently, counting freely on the remaining degrees of freedom after the $k$ constraints forces $≫ N^{2 s - k (k + 1) /2}$ solutions when $2 s \geq k (k + 1) /2$ , since the system cuts out a variety of codimension $k (k + 1) /2$ inside the $2 s$ -dimensional box. Hence the unconditional lower bound $$ J_{s,k}(N) \gg N^{s} + N^{2s - \frac{k(k+1)}{2}}. $$

Definition (the main conjecture). The main conjecture for Vinogradov's mean value theorem asserts that this lower bound is sharp up to $ε$ -powers: for every $ε > 0$ , $$ J_{s,k}(N) \ll_{\varepsilon} N^{s + \varepsilon} + N^{2s - \frac{k(k+1)}{2} + \varepsilon}, $$ uniformly in $N$ . The critical exponent is $s = s_{k} := k (k + 1) /2$ , where the two terms coincide at $N^{s + ε}$ ; the difficulty of the conjecture is concentrated at $s = s_{k}$ , from which all other $s$ follow by Hölder's inequality and direct summation of the surplus variables.

Counterexamples to common slips

"The main conjecture is a single inequality at one value of $s$ ." It is a family indexed by $s$ , but the substance lives at the critical $s_{k} = k (k + 1) /2$ . For $s \leq s_{k}$ the first term $N^{s}$ dominates and the statement is $J_{s, k} ≪ N^{s + ε}$ ; for $s \geq s_{k}$ the second term dominates. Proving the critical case implies all others, so "the main conjecture" usually means the critical bound.
"Vinogradov's method always beats van der Corput." It beats van der Corput's exponent-pair bounds for high degree $k$ , where the iterated $A, B$ -processes lose too much. For small $k$ (notably $k \leq 3$ for the zeta function on the critical line) the van der Corput and exponent-pair bounds of 21.15.03 remain competitive or superior; Vinogradov's strength is the uniformity of the saving as $k \to \infty$ .
" $J_{s, k} (N) ≪ N^{s + ε}$ holds for all $s$ ." Only for $s \leq s_{k}$ . Once $2 s$ exceeds $k (k + 1)$ , the second floor $N^{2 s - k (k + 1) /2}$ overtakes $N^{s}$ and is genuinely larger; claiming $N^{s + ε}$ there contradicts the lower bound. The two-term shape of the conjecture is essential, not cosmetic.

Key theorem with proof Intermediate+

The signature result is the main conjecture itself, now a theorem; below is the statement together with the structure of the proof, and a complete proof of the orthogonality identity and the lower bound that frame it ^{[Bourgain Demeter Guth 2016]}.

Theorem (Vinogradov main conjecture; Bourgain-Demeter-Guth 2016, Wooley 2016). For all integers $k \geq 2$ , $s \geq 1$ and every $ε > 0$ , $$ J_{s,k}(N) \ll_{k, s, \varepsilon} N^{s + \varepsilon} + N^{2s - \frac{k(k+1)}{2} + \varepsilon}. $$ In particular at the critical exponent $s = s_{k} = k (k + 1) /2$ one has $J_{s_{k}, k} (N) ≪_{k, ε} N^{s_{k} + ε}$ .

Proof. The lower bound matching this is elementary and is proved below as Proposition 2; the content is the upper bound, established by two independent routes.

The orthogonality framing. Expand $∣ f_{k} (α; N) ∣^{2 s} = \sum_{x, y} e (\sum_{j = 1}^{k} α_{j} (\sum_{i} x_{i}^{j} - \sum_{i} y_{i}^{j}))$ over $x, y \in {1, \dots, N}^{s}$ . Integrating over $T^{k}$ and using $\int_{0}^{1} e (m α_{j}) d α_{j} = 1_{m = 0}$ in each coordinate kills every term except those with $\sum_{i} x_{i}^{j} = \sum_{i} y_{i}^{j}$ for all $j$ , i.e. counts exactly the solutions of the system. So $\int_{T^{k}} ∣ f_{k} ∣^{2 s} = J_{s, k} (N)$ , reducing the arithmetic count to a $2 s$ -th moment of a Weyl sum of the type bounded in 21.15.02.

Decoupling route (Bourgain-Demeter-Guth). Partition $[0, 1]$ into $N^{- 1}$ -arcs and consider the moment curve $Γ_{k} = {(t, t^{2}, \dots, t^{k}) : t \in [0, 1]}$ . The $ℓ^{2}$ decoupling inequality for $Γ_{k}$ states that for any function $g$ with Fourier support in an $N^{- 1}$ -neighbourhood of $Γ_{k}$ , decomposed as $g = \sum_{θ} g_{θ}$ over $N^{- 1/ k}$ -caps $θ$ , $$ | g |{L^p(\mathbb{R}^k)} \ll\varepsilon N^\varepsilon \Big( \sum_\theta |g_\theta|{L^p}^2 \Big)^{1/2}, \qquad p = k(k+1). $$ Specialising $g$ to an exponential sum over the caps and using $|g\theta|{L^p} \asymp $a s in g l e - c a p co n t r ib u t i o n co n v er t s t hi s ana l y t i c in e q u a l i t y, a t$ p = 2s_k = k(k+1) $, in t o t h ecr i t i c a l m e an v a l u e b o u n d$ J{s_k,k}(N) \ll N^{s_k + \varepsilon} $. T h e d eco u pl in g in e q u a l i t y i t se l f i s p r o v e d b y in d u c t i o n o n t h esc a l e$ N $: a * * m u l t i l in e a r K ak ey a / B e nn e tt - C a r b er y - T a o * * t r an s v er s a l i t y es t ima t e han d l es t h e b r o a d (w e l l - s p r e a d) c a se, t h e * * B o u r g ain - G u t h * * b r o a d - na r r o w d eco m p os i t i o n r e d u ces t h e g e n er a l c a se t o t h e m u l t i l in e a r o n e, an d * * p a r ab o l i cr esc a l in g * * o f t h e m o m e n t c u r v ec l oses t h e in d u c t i o n . T h e l o w er - d e g r ee d eco u pl in g f or$ \Gamma_{k-1}$ feeds the induction at each scale.

Efficient congruencing route (Wooley). Argue $p$ -adically. Choosing a prime $p ≍ N^{1/ k}$ , classify solutions by the congruence classes of the variables modulo powers of $p$ ; a solution of the full system forces strong congruence conditions, because a Vandermonde determinant in the variables (the same determinant that makes the system rigid) is divisible by a high power of $p$ . This lets one bound $J_{s, k} (N)$ by a version of itself at a smaller scale with extra congruence weight — an efficient congruencing iteration $J (N) ≲ N^{δ} J (N^{1 - 1/ k})^{1 - η}$ in schematic form. Iterating drives the exponent to the conjectured value; the nested efficient congruencing of Wooley's 2019 paper organises the bookkeeping to reach the sharp bound for all $k$ . $□$

Bridge. The mean value theorem builds toward the sharpest Weyl-sum bounds and the Vinogradov-Korobov zero-free region, and it appears again in the application to Waring's problem below, where $J_{s, k}$ is exactly the minor-arc moment fed into the circle method of 21.15.02. The foundational reason the diagonal floor $N^{s}$ is so close to the truth is that the simultaneous power-sum equations are governed by a Vandermonde determinant: fixing the first $k$ power sums of a tuple of size $\leq k$ fixes the tuple up to permutation, so genuinely non-diagonal solutions are forced to be sparse. This is exactly the rigidity that both proofs exploit — decoupling reads it as curvature of the moment curve $Γ_{k}$ , efficient congruencing reads it as $p$ -adic Vandermonde divisibility, and the two are dual descriptions of one geometric fact. The central insight is that a high-degree exponential sum's size is controlled by a counting problem, so the analytic question of cancellation generalises into the arithmetic question of how many solutions the system has; putting these together, the resolution of the main conjecture is simultaneously a statement in incidence geometry, in $p$ -adic analysis, and in the theory of $ζ$ .

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Prove the orthogonality identity $J_{s, k} (N) = \int_{T^{k}} ∣ f_{k} (α; N) ∣^{2 s} d α$ where $f_{k} (α; N) = \sum_{n \leq N} e (α_{1} n + \dots + α_{k} n^{k})$ .

Hint

Expand the $2 s$ -th power as a sum over $s$ -tuples $x$ (from $f_{k}$ ) and $s$ -tuples $y$ (from $\overline{f_{k}}$ ), then integrate using $\int_{0}^{1} e (m α) d α = 1_{m = 0}$ in each of the $k$ variables.

Answer

Write $f_{k} = \sum_{n \leq N} e (\sum_{j = 1}^{k} α_{j} n^{j})$ . Then $$ |f_k|^{2s} = f_k^s, \overline{f_k}^{,s} = \sum_{\mathbf x, \mathbf y \in {1,\ldots,N}^s} e\Big( \sum_{j=1}^k \alpha_j \Big( \sum_{i=1}^s x_i^j - \sum_{i=1}^s y_i^j \Big) \Big). $$ Integrate over $α \in [0, 1)^{k}$ . The integral factors across the $k$ coordinates, and in the $j$ -th coordinate $\int_{0}^{1} e (α_{j} m_{j}) d α_{j} = 1_{m_{j} = 0}$ with $m_{j} = \sum_{i} x_{i}^{j} - \sum_{i} y_{i}^{j}$ . The product of these indicators is $1$ precisely when $\sum_{i} x_{i}^{j} = \sum_{i} y_{i}^{j}$ for all $j = 1, \dots, k$ simultaneously, and $0$ otherwise. Hence $$ \int_{\mathbb{T}^k} |f_k|^{2s},d\boldsymbol\alpha = #{ (\mathbf x, \mathbf y) : \textstyle\sum_i x_i^j = \sum_i y_i^j,\ 1 \le j \le k } = J_{s,k}(N). \qquad \square $$ This is the analytic-arithmetic bridge: a moment of a Weyl sum is a solution count, and it is why bounding $J_{s, k}$ is equivalent to bounding the sum on average.

Exercise 4 (medium, symbolic).

Prove the diagonal lower bound $J_{s, k} (N) ≫ N^{s}$ by exhibiting that many solutions.

Hint

Take $y$ to be a permutation of $x$ . How many tuples $x$ have $s$ distinct entries, and how many permutations does each admit?

Answer

For any $x = (x_{1}, \dots, x_{s}) \in {1, \dots, N}^{s}$ and any permutation $σ$ of ${1, \dots, s}$ , set $y = (x_{σ (1)}, \dots, x_{σ (s)})$ . Then $\sum_{i} y_{i}^{j} = \sum_{i} x_{σ (i)}^{j} = \sum_{i} x_{i}^{j}$ for every $j$ , so $(x, y)$ is a solution. The number of tuples $x$ with $s$ pairwise distinct entries is $N (N - 1) \dots (N - s + 1) ≫ N^{s}$ once $N \geq 2 s$ , and each such $x$ yields $s!$ distinct $y$ . Hence $$ J_{s,k}(N) \ge s! \cdot N(N-1)\cdots(N-s+1) \gg_s N^s. $$ These diagonal solutions use only the permutation symmetry of the equations and are present for every $k$ ; they are the first of the two floors. $□$

Exercise 5 (medium, numeric).

The main conjecture gives the Weyl-sum bound: for $k \geq 2$ and $∣ α_{k} - a / q ∣ \leq q^{- 2}$ with $g cd (a, q) = 1$ , Vinogradov's method yields $\sum_{n \leq N} e (α_{k} n^{k}) ≪ N^{1 - ρ (k) + ε}$ with the saving $ρ (k) ≍ 1/ (k^{2} lo g k)$ on a minor arc. Compare the order of magnitude of this saving with Weyl's classical saving $2^{1 - k}$ for $k = 10$ : which is larger, $1/ (k^{2} lo g k)$ or $2^{1 - k}$ ?

Hint

Estimate $2^{1 - 10} = 2^{- 9} \approx 0.002$ against $1/ (100 \cdot lo g 10) \approx 1/ (100 \cdot 2.3)$ .

Answer

The Vinogradov saving $1/ (k^{2} lo g k)$ is larger. For $k = 10$ : Weyl gives $2^{1 - k} = 2^{- 9} \approx 0.00195$ , while Vinogradov gives $≍ 1/ (k^{2} lo g k) = 1/ (100 \cdot ln 10) \approx 1/230 \approx 0.0043$ , roughly twice as large, and the gap widens fast: Weyl's $2^{1 - k}$ decays exponentially in $k$ , whereas Vinogradov's saving decays only polynomially (times a log). This is precisely why Vinogradov's method is the high-degree tool — the saving from Weyl differencing of 21.15.02 becomes negligible as $k$ grows, while the mean value theorem keeps a usable power saving uniformly in $k$ .

Exercise 6 (medium, symbolic).

Show that the critical case $s = s_{k}$ implies the case $s > s_{k}$ : assuming $J_{s_{k}, k} (N) ≪ N^{s_{k} + ε}$ , deduce $J_{s, k} (N) ≪ N^{2 s - k (k + 1) /2 + ε}$ for $s \geq s_{k}$ .

Hint

Each of the extra $2 (s - s_{k})$ variables ranges over ${1, \dots, N}$ . Bound the larger sum by directly summing the extra variables: $∣ f_{k} ∣^{2 s} = ∣ f_{k} ∣^{2 s_{k}} \cdot ∣ f_{k} ∣^{2 (s - s_{k})}$ and $∣ f_{k} ∣ \leq N$ .

Answer

Write $2 s = 2 s_{k} + 2 (s - s_{k})$ . Pointwise $∣ f_{k} (α; N) ∣ \leq N$ , so $$ J_{s,k}(N) = \int_{\mathbb{T}^k} |f_k|^{2s},d\boldsymbol\alpha = \int_{\mathbb{T}^k} |f_k|^{2s_k}, |f_k|^{2(s-s_k)},d\boldsymbol\alpha \le N^{2(s - s_k)} \int_{\mathbb{T}^k} |f_k|^{2s_k},d\boldsymbol\alpha = N^{2(s-s_k)} J_{s_k,k}(N). $$ By the critical bound this is $≪ N^{2 (s - s_{k})} N^{s_{k} + ε} = N^{2 s - 2 s_{k} + s_{k} + ε} = N^{2 s - s_{k} + ε}$ . Since $s_{k} = k (k + 1) /2$ , the exponent is $2 s - k (k + 1) /2 + ε$ , the second floor. Thus the critical case controls all larger $s$ by direct summation of the surplus variables; a Hölder interpolation handles $s < s_{k}$ similarly, so the whole conjecture reduces to its value at $s = s_{k}$ . $□$

Exercise 7 (hard, symbolic).

Prove the Vandermonde rigidity behind the floor: if two tuples $(x_{1}, \dots, x_{k})$ and $(y_{1}, \dots, y_{k})$ of length exactly $k$ satisfy $\sum_{i} x_{i}^{j} = \sum_{i} y_{i}^{j}$ for all $j = 1, \dots, k$ , then $(y_{1}, \dots, y_{k})$ is a permutation of $(x_{1}, \dots, x_{k})$ .

Hint

Equal power sums $p_{1}, \dots, p_{k}$ force equal elementary symmetric polynomials $e_{1}, \dots, e_{k}$ via Newton's identities, hence equal characteristic polynomials $\prod (T - x_{i}) = \prod (T - y_{i})$ .

Answer

Let $p_{j} = \sum_{i} x_{i}^{j}$ and $p_{j}^{'} = \sum_{i} y_{i}^{j}$ be the power sums, and $e_{m}, e_{m}^{'}$ the elementary symmetric polynomials of the two tuples. Newton's identities express each $e_{m}$ as a polynomial in $p_{1}, \dots, p_{m}$ (for $m \leq k$ ): $$ e_1 = p_1, \quad 2e_2 = e_1 p_1 - p_2, \quad \ldots, \quad m,e_m = \sum_{i=1}^{m} (-1)^{i-1} e_{m-i}, p_i. $$ Given $p_{j} = p_{j}^{'}$ for $1 \leq j \leq k$ , induction on $m$ gives $e_{m} = e_{m}^{'}$ for all $1 \leq m \leq k$ (the recursion determines $e_{m}$ from $e_{1}, \dots, e_{m - 1}$ and $p_{1}, \dots, p_{m}$ , all equal by hypothesis and the inductive step). Therefore the monic polynomials $$ P(T) = \prod_{i=1}^k (T - x_i) = T^k - e_1 T^{k-1} + \cdots + (-1)^k e_k $$ and $Q (T) = \prod_{i} (T - y_{i})$ have identical coefficients, so $P = Q$ . A monic polynomial determines its multiset of roots, so ${y_{i}}$ equals ${x_{i}}$ as multisets, i.e. $y$ is a permutation of $x$ . Hence at length $s = k$ the only solutions are diagonal — the rigidity that propagates, with losses, to the floor $J_{s, k} (N) \approx N^{s}$ for $s$ near $s_{k}$ . $□$

Exercise 8 (hard, symbolic).

Deduce the Weyl-sum bound from the mean value theorem in skeleton form: assuming $J_{s, k} (N) ≪ N^{2 s - k (k + 1) /2 + ε}$ at a suitable $s$ , show how Hölder's inequality and a major/minor-arc passage bound an individual sum $\sum_{n \leq N} e (α_{k} n^{k})$ on a minor arc by $N^{1 - ρ + ε}$ for some $ρ = ρ (k) > 0$ .

Hint

The mean value bounds the average of $∣ f_{k} ∣^{2 s}$ over all $α$ . To bound a single sum, combine the mean value with a major/minor-arc dissection in the lower coefficients $α_{1}, \dots, α_{k - 1}$ , so that a large value of the pure sum $\sum e (α_{k} n^{k})$ would force the full $f_{k}$ to be large on a set too big for the mean value to allow.

Answer

Suppose $∣ \sum_{n \leq N} e (α_{k} n^{k}) ∣ \geq N^{1 - ρ}$ on a minor arc (where $α_{k}$ has a denominator $q$ in the range $N^{δ} \leq q \leq N^{k - δ}$ ). One relates this single sum to the full Weyl sum $f_{k} (α; N)$ as follows. By translating $n \mapsto n + m$ and averaging over shifts $m$ , a lower bound on the pure degree- $k$ sum produces, for many choices of the lower coefficients $α_{1}, \dots, α_{k - 1}$ (a set of measure $≫ N^{- O (1)}$ in $T^{k - 1}$ ), a lower bound $∣ f_{k} (α; N) ∣ ≫ N^{1 - ρ - o (1)}$ . Integrating $∣ f_{k} ∣^{2 s}$ over just that set of $α$ gives $$ J_{s,k}(N) = \int_{\mathbb{T}^k} |f_k|^{2s} \ge (\text{measure of the set}) \cdot (N^{1 - \rho})^{2s} \gg N^{-O(1)} N^{2s(1 - \rho)}. $$ Comparing with the mean value upper bound $J_{s, k} (N) ≪ N^{2 s - k (k + 1) /2 + ε}$ forces $$ 2s(1 - \rho) - O(1) \le 2s - \tfrac{k(k+1)}{2} + \varepsilon, \qquad\text{i.e.}\qquad 2s,\rho \ge \tfrac{k(k+1)}{2} - O(1) - \varepsilon. $$ Choosing $s ≍ k^{2}$ (the critical range) gives $ρ ≫ \frac{k ( k + 1 ) /2}{2 s} ≍ \frac{k ^{2}}{k ^{2}} \cdot \frac{1}{( log factors )}$ ; the careful bookkeeping in ^{[Iwaniec Kowalski 2004]} yields $ρ (k) ≍ 1/ (k^{2} lo g k)$ . Hence the assumption $∣ \sum e (α_{k} n^{k}) ∣ \geq N^{1 - ρ}$ is untenable for $ρ$ smaller than this, giving $\sum_{n \leq N} e (α_{k} n^{k}) ≪ N^{1 - ρ (k) + ε}$ on the minor arc. The pure power sum's cancellation is thus extracted from the on-average control of the full sum. $□$

Advanced results Master

The critical exponent and the shape of the bound

The entire difficulty of the main conjecture concentrates at $s = s_{k} = k (k + 1) /2$ ^{[Demeter 2020]}. For $s \leq s_{k}$ the conjecture reads $J_{s, k} (N) ≪ N^{s + ε}$ (the diagonal floor), and for $s \geq s_{k}$ it reads $J_{s, k} (N) ≪ N^{2 s - s_{k} + ε}$ (the dimension floor); the critical case interpolates between them at $J_{s_{k}, k} (N) ≪ N^{s_{k} + ε}$ , and Exercise 6 shows it implies the rest. Before the resolution, Vinogradov's original 1935 method and its refinements by Linnik, Karatsuba, and Stechkin reached bounds of the shape $J_{s, k} (N) ≪ N^{2 s - k (k + 1) /2 + η_{k}}$ with $η_{k} \to 0$ but $η_{k} > 0$ for fixed $s$ near critical — strong enough for the zero-free region but not the exact main conjecture. The gap $η_{k}$ was closed unconditionally only in 2015-2016, first for $k = 3$ by Wooley's efficient congruencing (2014) and then for all $k$ by Bourgain-Demeter-Guth's decoupling (2016) and Wooley's nested congruencing (2019).

Decoupling for the moment curve

The Bourgain-Demeter-Guth proof factors through a statement in Euclidean harmonic analysis with no number theory in it ^{[Bourgain Demeter Guth 2016]}. The $ℓ^{2}$ decoupling inequality for the moment curve $Γ_{k} = {(t, t^{2}, \dots, t^{k}) : t \in [0, 1]} \subset R^{k}$ asserts that for $g$ Fourier-supported in the $δ$ -neighbourhood of $Γ_{k}$ and decomposed into $δ^{1/ k}$ -caps $θ$ , $$ | g |{L^p(\mathbb{R}^k)} \le C\varepsilon \delta^{-\varepsilon} \Big( \sum_\theta | g_\theta |{L^p(\mathbb{R}^k)}^2 \Big)^{1/2} $$ for all $2 \leq p \leq k (k + 1)$ , sharp at the endpoint $p = k (k + 1) = 2 s_{k}$ . Taking $g\theta $t o b e (s m oo t h e d) c ha r a c t er sco n ce n t r a t e d o n e a c h c a p an d$ \delta = N^{-1} $r eco v er s t h ecr i t i c a l m e an v a l u e b o u n d . T h e p r oo f i s a B o u r g ain - G u t hin d u c t i o n o n sc a l es : a * * m u l t i l in e a r K ak ey a * * es t ima t e (B e nn e tt - C a r b er y - T a o, w i t h t h e G u t h p o l y n o mia l - m e t h o d p r oo f) co n t r o l s t h e t r an s v er s a l / b r o a d c a se, t h e b r o a d - na r r o w d i c h o t o m y r e d u ces t h e g e n er a l s u m t o t h e m u l t i l in e a r c a se pl u s a l o w er - d im e n s i o na l co n t r ib u t i o nhan d l e d b y d eco u pl in g f or$ \Gamma_{k-1} $, an d p a r ab o l i cr esc a l in g r es t or es t h esc a l e . C u r v a t u r eo f$ \Gamma_k$ — the non-vanishing of the torsion, equivalently the Vandermonde determinant of the tangent, osculating, and higher flags — is the geometric input that makes decoupling hold, and it is the exact analytic avatar of the arithmetic rigidity of Exercise 7.

Efficient congruencing and the $p$ -adic route

Wooley's method bypasses harmonic analysis entirely, working $p$ -adically ^{[Wooley 2012]}. The idea is to count solutions by conditioning on the $p$ -adic distances between the variables: solutions in which many variables are congruent modulo high powers of a prime $p ≍ N^{1/ k}$ are forced by a Vandermonde divisibility — the same determinant — and this lets the count at scale $N$ be bounded by a weighted count at scale $N^{1 - 1/ k}$ , giving a self-improving recursion. Each iteration "efficiently congruences" a block of variables, trading scale for congruence concentration; the nested version tracks two interleaved blocks and reaches the sharp exponent for every $k$ . Efficient congruencing also extends to systems over number fields and function fields where the decoupling machinery is harder to deploy, and it gives explicit (non- $ε$ -power) bounds in some ranges. The two proofs are now understood as parallel: decoupling is the archimedean and efficient congruencing the non-archimedean reading of the curvature/Vandermonde rigidity of the moment curve.

Consequences: Weyl sums, the zero-free region, and Waring's problem

The mean value theorem is a tool, and its three classical payoffs are sharp ^{[Iwaniec Kowalski 2004]}. First, the Weyl-sum bound: for the high-degree minor-arc sum $\sum_{n \leq N} e (α_{k} n^{k} + \dots)$ one obtains a saving $N^{1 - ρ (k) + ε}$ with $ρ (k) ≍ 1/ (k^{2} lo g k)$ , beating the van der Corput exponent-pair and Weyl-differencing bounds of 21.15.03 and 21.15.02 for all large $k$ . Second, the Vinogradov-Korobov zero-free region: feeding these bounds into the explicit formula gives that $ζ (σ + i t) \neq = 0$ for $σ \geq 1 - c / (lo g ∣ t ∣)^{2/3} (lo g lo g ∣ t ∣)^{1/3}$ , the widest known zero-free region and the source of the best unconditional error term in the prime number theorem. Third, Waring's problem: the mean value bound controls the minor arcs in the circle method for sums of $k$ -th powers, yielding $G (k) \leq k (lo g k + lo g lo g k + O (1))$ for the least $s$ such that every large integer is a sum of $s$ positive $k$ -th powers — a bound that the resolution of the main conjecture brought to within a constant of the conjectured truth.

Synthesis. Vinogradov's mean value theorem, the decoupling inequality for the moment curve, efficient congruencing, and the Vinogradov-Korobov zero-free region are one circle of ideas, and the bridge is the rigidity of the moment curve $Γ_{k}$ — its curvature, equivalently the Vandermonde determinant of its flags. The foundational reason the count is pinned to its two floors is that fixing the first $k$ power sums of a short tuple fixes the tuple up to permutation (Exercise 7), and this is exactly the curvature that makes $ℓ^{2}$ decoupling sharp and the $p$ -adic Vandermonde divisibility that drives efficient congruencing; the two resolutions are dual — archimedean and non-archimedean — readings of one geometric fact. The central insight is that a high-degree exponential sum's cancellation is a counting problem, so the analytic Weyl-sum bound of 21.15.02 generalises into the arithmetic mean value, and putting these together, the same bound that resolves the count delivers the sharpest Weyl bounds, the Vinogradov-Korobov region for $ζ$ , and the near-optimal $G (k)$ in Waring's problem. This is exactly where the smooth-phase exponent-pair calculus of 21.15.03 runs out of strength and the polynomial-system viewpoint takes over: for high degree the central insight is that one should count solutions, not estimate sums directly, and the moment curve's geometry then does the rest.

Full proof set Master

Proposition 1 (orthogonality and the mean value identity). With $f_{k} (α; N) = \sum_{n \leq N} e (α_{1} n + \dots + α_{k} n^{k})$ , one has $J_{s, k} (N) = \int_{T^{k}} ∣ f_{k} (α; N) ∣^{2 s} d α$ .

Proof. Expanding $∣ f_{k} ∣^{2 s} = f_{k}^{s} \overline{f_{k}}^{s}$ as a sum over $(x, y) \in {1, \dots, N}^{2 s}$ gives the integrand $e (\sum_{j = 1}^{k} α_{j} (\sum_{i} x_{i}^{j} - \sum_{i} y_{i}^{j}))$ . Integration over $T^{k}$ factors over the $k$ variables, and $\int_{0}^{1} e (α_{j} m_{j}) d α_{j} = 1_{m_{j} = 0}$ with $m_{j} = \sum_{i} x_{i}^{j} - \sum_{i} y_{i}^{j}$ . The product of indicators equals $1$ exactly for the solutions of the system, so the integral counts them. $□$

Proposition 2 (the two-term lower bound). For $N \geq 2 s$ , $J_{s, k} (N) ≫_{s} N^{s} + N^{2 s - k (k + 1) /2}$ .

Proof. The diagonal contribution $≫ N^{s}$ is Exercise 4: permutations of distinct tuples give $s! \cdot N (N - 1) \dots (N - s + 1) ≫ N^{s}$ solutions. For the second term, suppose $2 s \geq k (k + 1) /2$ . The map $(x, y) \mapsto (\sum_{i} x_{i}^{j} - \sum_{i} y_{i}^{j})_{1 \leq j \leq k}$ from ${1, \dots, N}^{2 s}$ to $Z^{k}$ takes values $m_{j}$ with $∣ m_{j} ∣ \leq 2 s N^{j}$ , so the number of possible target vectors is $≪ \prod_{j = 1}^{k} N^{j} = N^{1 + 2 + \dots + k} = N^{k (k + 1) /2}$ . By pigeonhole, some fibre (in fact the fibre over $0$ , by a standard averaging since $0$ is the most populous value) has at least $$ \frac{N^{2s}}{C,N^{k(k+1)/2}} \gg N^{2s - k(k+1)/2} $$ preimages, and the fibre over $0$ is exactly the solution set counted by $J_{s, k} (N)$ . (That $0$ is the densest fibre follows from $J_{s, k} (N) = \int ∣ f_{k} ∣^{2 s} \geq ∣ \int ∣ f_{k} ∣^{2 s} e (\cdot) ∣$ -type positivity, or directly from the Cauchy-Schwarz / convexity bound $\sum_{m} r (m)^{2} \leq r (0) \sum_{m} r (m)$ where $r (m)$ counts representations.) Adding the two contributions gives the stated floor. $□$

Proposition 3 (the $k = 1$ and small- $s$ cases are exact). For $k = 1$ , $J_{s, 1} (N) = \int_{0}^{1} ∣ \sum_{n \leq N} e (α n) ∣^{2 s} d α$ counts solutions of the single equation $\sum x_{i} = \sum y_{i}$ , and the main conjecture $J_{s, 1} (N) ≪ N^{s + ε} + N^{2 s - 1 + ε}$ holds with $J_{s, 1} (N) ≍ N^{2 s - 1}$ for $s \geq 1$ . For $s \leq k$ , $J_{s, k} (N) = s! N^{s} + O (N^{s - 1})$ is purely diagonal to leading order.

Proof. For $k = 1$ the count is the number of $(x, y)$ with equal sums; the number of ways to write a value $v \in [s, s N]$ as $x_{1} + \dots + x_{s}$ is a bounded-degree quasi-polynomial peaking at $≍ N^{s - 1}$ near $v ≍ s N /2$ , and $J_{s, 1} (N) = \sum_{v} r (v)^{2} ≍ N \cdot (N^{s - 1})^{2} / N = N^{2 s - 1}$ by Cauchy-Schwarz being sharp for the near-uniform $r$ , consistent with the second floor (here $k (k + 1) /2 = 1$ ). For $s \leq k$ : by the Vandermonde rigidity of Exercise 7 (Proposition 4 below), every solution with all $x_{i}$ distinct is diagonal, and tuples with a repeated coordinate number $O (N^{s - 1})$ ; so $J_{s, k} (N) = (distinct diagonal) + O (N^{s - 1}) = s! (s N) s! / s! \dots = s! N^{s} (1 + O (1/ N))$ , the diagonal floor with no excess. $□$

Proposition 4 (Vandermonde rigidity at length $k$ ). If $x, y \in Z^{k}$ satisfy $\sum_{i} x_{i}^{j} = \sum_{i} y_{i}^{j}$ for $1 \leq j \leq k$ , then $y$ is a permutation of $x$ .

Proof. Equal power sums $p_{1}, \dots, p_{k}$ force, via Newton's identities $m e_{m} = \sum_{i = 1}^{m} (- 1)^{i - 1} e_{m - i} p_{i}$ , equal elementary symmetric functions $e_{1}, \dots, e_{k}$ (induction on $m \leq k$ ). Hence the monic degree- $k$ polynomials $\prod_{i} (T - x_{i})$ and $\prod_{i} (T - y_{i})$ have equal coefficients, so they are equal, and a monic polynomial determines its root multiset. Thus ${x_{i}} = {y_{i}}$ as multisets. This is the determinantal heart of the subject: the Jacobian of $(\sum x_{i}, \dots, \sum x_{i}^{k})$ in the $x_{i}$ is a Vandermonde determinant $\prod_{i < i^{'}} (x_{i} - x_{i^{'}})$ times $k!$ , non-vanishing off the diagonal, which is the local form of the rigidity. $□$

Connections Master

The mean value $J_{s, k} (N)$ is a high-moment refinement of the Weyl-sum estimates of 21.15.02: where Weyl differencing bounds a single sum $\sum e (α n^{k})$ with the lossy saving $2^{1 - k}$ , the mean value controls the $2 s$ -th moment of the full polynomial Weyl sum sharply, and the resulting minor-arc bound feeds the same Hardy-Littlewood circle method that Weyl's inequality serves.

The method supersedes the van der Corput exponent-pair calculus of 21.15.03 in the high-degree regime: the iterated $A, B$ -processes lose a constant factor of cancellation at each step and become inert as $k$ grows, whereas the mean value keeps a saving $ρ (k) ≍ 1/ (k^{2} lo g k)$ that decays only polynomially, so for large $k$ one counts solutions of the system rather than estimating the sum through stationary phase.

The sharp Weyl bounds produced here yield the Vinogradov-Korobov zero-free region, the widest known for $ζ$ , which is the deepest input to the growth, zero-density, and prime-counting theory of the Riemann zeta function in 21.03.01; the same minor-arc control underlies the circle-method treatment of Waring's problem and links back to the complete exponential sums of 21.15.04 that govern the major arcs.

Historical & philosophical context Master

Ivan Matveevich Vinogradov introduced the mean value method in 1935 ^{[Vinogradov 1935]} as the engine of his "method of trigonometric sums," using it to bound Weyl sums of high degree far beyond what Weyl's differencing or van der Corput's processes could reach, and thereby to establish the zero-free region for $ζ$ that Korobov and Vinogradov independently sharpened to the $(lo g t)^{2/3}$ form around 1958. The original bounds carried an unavoidable loss $η_{k} > 0$ from the exponent $k (k + 1) /2$ ; closing that loss to the conjectured $ε$ was the central open problem of the area for eighty years. Linnik's 1943 $p$ -adic reformulation anticipated the congruencing idea, and Karatsuba and Stechkin pushed the classical method to its limit.

The resolution came from two directions almost simultaneously. Trevor Wooley's efficient congruencing, beginning in 2012 ^{[Wooley 2012]}, proved the main conjecture for $k = 3$ and then, in nested form, for all $k$ ; it is a $p$ -adic descent that conditions on congruences between the variables, made precise by Vandermonde divisibility. Jean Bourgain, Ciprian Demeter, and Larry Guth proved the full conjecture for all $k$ in 2016 ^{[Bourgain Demeter Guth 2016]} via $ℓ^{2}$ decoupling for the moment curve, an inequality in Euclidean harmonic analysis with no arithmetic content, resting on the multilinear Kakeya estimate and the Bourgain-Guth induction on scales. Demeter's 2020 book ^{[Demeter 2020]} gives the systematic decoupling account. The two proofs are now read as archimedean and non-archimedean expressions of the same curvature of the moment curve, and the episode is a case where a number-theoretic conjecture was settled by, and in turn reshaped, the geometry of the Fourier restriction problem.

Bibliography Master

@article{vinogradov1935,
  author  = {Vinogradov, I. M.},
  title   = {The method of trigonometrical sums in the theory of numbers},
  journal = {Trudy Mat. Inst. Steklov},
  volume  = {10},
  year    = {1935},
  note    = {English translation, Interscience, 1954}
}

@article{wooley2012,
  author  = {Wooley, Trevor D.},
  title   = {Vinogradov's mean value theorem via efficient congruencing},
  journal = {Annals of Mathematics},
  volume  = {175},
  number  = {3},
  pages   = {1575--1627},
  year    = {2012}
}

@article{wooley2019,
  author  = {Wooley, Trevor D.},
  title   = {Nested efficient congruencing and relatives of Vinogradov's mean value theorem},
  journal = {Proceedings of the London Mathematical Society},
  volume  = {118},
  number  = {4},
  pages   = {942--1016},
  year    = {2019}
}

@article{bourgaindemeterguth2016,
  author  = {Bourgain, Jean and Demeter, Ciprian and Guth, Larry},
  title   = {Proof of the main conjecture in Vinogradov's mean value theorem for degrees higher than three},
  journal = {Annals of Mathematics},
  volume  = {184},
  number  = {2},
  pages   = {633--682},
  year    = {2016}
}

@book{demeter2020,
  author    = {Demeter, Ciprian},
  title     = {Fourier Restriction, Decoupling, and Applications},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {184},
  publisher = {Cambridge University Press},
  year      = {2020}
}

@book{iwaniec-kowalski2004,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004}
}

@book{vaughan1997,
  author    = {Vaughan, Robert C.},
  title     = {The Hardy-Littlewood Method},
  series    = {Cambridge Tracts in Mathematics},
  volume    = {125},
  edition   = {2},
  publisher = {Cambridge University Press},
  year      = {1997}
}

Prerequisites

21.15.03
21.15.02

Tier anchors

beginner: Stein-Shakarchi 2003 *Fourier Analysis: An Introduction* (Princeton Lectures in Analysis I) Ch. 4 (exponential sums and counting solutions informally); Vaughan 1997 *The Hardy-Littlewood Method* (Cambridge Tracts 125, 2nd ed.) Ch. 5 (Vinogradov's mean value theorem motivated through Waring's problem)
intermediate: Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8 §8.5 (the mean value theorem and Vinogradov's method); Vaughan 1997 *The Hardy-Littlewood Method* (Cambridge Tracts 125) Ch. 5; Wooley 2012 *Annals of Mathematics* 175, 1575-1627 (Vinogradov's mean value theorem via efficient congruencing — the nested-efficient-congruencing argument)
master: Bourgain-Demeter-Guth 2016 *Annals of Mathematics* 184, 633-682 (Proof of the main conjecture in Vinogradov's mean value theorem for degrees higher than three, via $\ell^2$ decoupling for the moment curve); Wooley 2016 *Proc. London Math. Soc.* 118, 942-1016 (Nested efficient congruencing and the main conjecture); Demeter 2020 *Fourier Restriction, Decoupling, and Applications* (Cambridge Studies in Advanced Mathematics 184) Ch. 11-13 (decoupling for the moment curve and the Vinogradov system); Vinogradov 1935 *Trav. Inst. Math. Stekloff* 10 (the original mean value method); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8

References

Vinogradov, I. M. — The method of trigonometrical sums in the theory of numbers · *Trudy Mat. Inst. Steklov* 10 (1935) and the monograph (English translation, Interscience 1954). The original mean value theorem: a bound on the number of solutions of the simultaneous system $\sum x_i^j = \sum y_i^j$ ($1 \le j \le k$), and its application to Weyl-sum bounds for high-degree phases and the Vinogradov-Korobov zero-free region.
Wooley, T. D. — Vinogradov's mean value theorem via efficient congruencing · *Annals of Mathematics* 175 (2012), 1575-1627; and *Proc. London Math. Soc.* 118 (2019), 942-1016 (nested efficient congruencing). The $p$-adic / congruencing resolution of the main conjecture, sharp for $k=3$ in the 2012 paper and for all $k$ in the later nested form.
Bourgain, J., Demeter, C. & Guth, L. — Proof of the main conjecture in Vinogradov's mean value theorem for degrees higher than three · *Annals of Mathematics* 184 (2016), 633-682. The $\ell^2$-decoupling resolution: $J_{s,k}(N) \ll_\varepsilon N^{s+\varepsilon} + N^{2s - k(k+1)/2 + \varepsilon}$ for all $s$, deduced from the $\ell^2$ decoupling inequality for the moment curve $(t, t^2, \ldots, t^k)$.
Demeter, C. — Fourier Restriction, Decoupling, and Applications · Cambridge Studies in Advanced Mathematics 184, Cambridge University Press (2020), Ch. 11-13. The decoupling inequality for the moment curve, the multilinear Kakeya input, and the deduction of the Vinogradov main conjecture from decoupling.
Iwaniec, H. & Kowalski, E. — Analytic Number Theory · American Mathematical Society Colloquium Publications 53 (2004), Ch. 8 §8.5. The mean value theorem, Vinogradov's method, and the consequences for Weyl sums, Waring's problem, and the zero-free region of $\zeta$.
Vaughan, R. C. — The Hardy-Littlewood Method · Cambridge Tracts in Mathematics 125, 2nd edition, Cambridge University Press (1997), Ch. 5. Vinogradov's mean value theorem, the classical (pre-resolution) bounds, and the route to Waring's problem $G(k)$ estimates.

Estimated time

beginner: 20m
intermediate: 55m
master: 95m

Intuition Beginner

Visual Beginner

Worked example Beginner

Check your understanding Beginner

Formal definition Intermediate+

Counterexamples to common slips

Key theorem with proof Intermediate+

Exercises Intermediate+

Advanced results Master

The critical exponent and the shape of the bound

Decoupling for the moment curve

Efficient congruencing and the p-adic route

Consequences: Weyl sums, the zero-free region, and Waring's problem

Full proof set Master

Connections Master

Historical & philosophical context Master

Bibliography Master

Efficient congruencing and the $p$ -adic route