21.15.02 · number-theory / exponential-sums

Weyl Sums, Weyl Differencing, and Equidistribution

shipped3 tiersLean: none

Anchor (Master): Weyl 1916 *Math. Ann.* 77, 313-352 (Über die Gleichverteilung von Zahlen mod. Eins — the differencing method, the bound for polynomial Weyl sums, and the equidistribution of $(\alpha n^k)$); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) Ch. 8 (Weyl sums, van der Corput's method, the major/minor-arc dichotomy); Kuipers-Niederreiter 1974 *Uniform Distribution of Sequences* (Wiley) Ch. 1-2 (Weyl's criterion, discrepancy, the Erdős-Turán inequality, van der Corput's difference theorem); Montgomery 1994 *Ten Lectures* (CBMS 84) Ch. 1 (Weyl sums and the distribution of $\{n^2\alpha\}$); Vaughan 1997 *The Hardy-Littlewood Method* (Cambridge Tracts 125, 2nd ed.) Ch. 2 (Weyl's inequality and Hua's lemma in the circle method)

Intuition Beginner

Take an irrational number, say , and look at its multiples: . Throw away the whole-number part of each, keeping only what is left after the decimal point. You get a sequence of numbers between and . Weyl's equidistribution theorem says these leftover parts spread out perfectly evenly: in the long run, the fraction that land in any sub-interval is exactly the length of that interval, with no clumping and no gaps. The same holds for the squares , for the cubes, and for any polynomial in as long as one coefficient is irrational.

How do you prove a sequence spreads out evenly? Weyl's trick is to stop counting points in boxes and instead measure waves. For each whole number of cycles, wrap the unit interval into a circle, place a point for each term of the sequence, and add up the resulting arrows. If the points are spread evenly, the arrows point in all directions and cancel; their sum is tiny compared to the number of points. If the points clump, the arrows reinforce and the sum stays large. So even spreading becomes the single clean statement: every wave-sum is small.

For polynomial sequences the wave-sums are hard to add directly, so Weyl found a way to lower the difficulty. Squaring a sum lets you compare each term with a shifted copy of itself; the difference of two values of a degree- polynomial is a polynomial of degree one less. Repeating this peels the polynomial down to a straight line, which anyone can sum. This step-down is Weyl differencing, and it is the engine behind almost every bound on these sums.

Visual Beginner

Imagine the unit interval bent into a circle of circumference one. For a chosen whole number , you walk around the circle, and at step you mark the point reached by going times the fractional part of the -th term of your sequence. Each mark is an arrow of length one pointing from the center to that spot. The wave-sum is the single arrow you get by laying all these arrows tip to tail.

   evenly spread points          clumped points
        (arrows cancel)          (arrows reinforce)

         . | .                        . . .
       .   |   .                     . . . .
      .----+----.                    . . . .         -->  long
       .   |   .                       (all near                  total
         . | .                          one spot)                 arrow
     short total arrow

The picture shows the whole logic in one image: even spreading means short wave-sums, clumping means long ones. Weyl's criterion turns "the points are evenly spread" into "every wave-sum is short," and that is a thing you can actually compute.

Worked example Beginner

We watch the fractional parts of begin to even out for , and we check one wave-sum by hand.

Step 1. List the first six fractional parts (the part after the decimal point) of . We have so the fractional part is ; gives ; gives ; gives ; gives ; gives .

Step 2. Sort them to see the spread: . Even with only six points they already sample the whole interval from near to near without bunching. As you take more terms the gaps fill in more and more uniformly.

Step 3. Now test the wave-sum for one full cycle (). For each fractional part , the arrow is the pair of numbers . Adding the six arrows, the horizontal parts are summing to , and the vertical parts are , summing to .

Step 4. The total arrow has length about . Six arrows of length one have collapsed to a total of length , far shorter than six. The cancellation is already visible.

What this tells us: the fractional parts of are spreading evenly, and the matching fact is that the wave-sum is small relative to the number of terms. With more points the ratio of wave-sum length to point count keeps shrinking toward zero, which is precisely Weyl's criterion in action.

Check your understanding Beginner

Formal definition Intermediate+

Throughout write and let denote the fractional part of . The additive character is the same one used on in 02.10.04 and 21.15.01.

Definition (uniform distribution mod ). A sequence of real numbers is uniformly distributed modulo (equidistributed) if for every subinterval , $$ \lim_{N\to\infty} \frac{1}{N},#{, 1 \le n \le N : {x_n} \in [a,b) ,} = b - a. $$ Equivalently, by approximating indicator functions, for every Riemann-integrable on .

Definition (Weyl sum). For a real-valued phase the associated Weyl sum of length is $$ S_N(f) = \sum_{n=1}^{N} e\big(f(n)\big). $$ When is a polynomial of degree this is a polynomial Weyl sum; the case is the geometric (linear) sum, and is the prototypical quadratic Weyl sum.

Definition (discrepancy). The discrepancy of the first terms is $$ D_N = \sup_{0 \le a < b \le 1}\left| \frac{1}{N}#{, n \le N : {x_n} \in [a,b) ,} - (b-a)\right|. $$ A sequence is equidistributed iff . The discrepancy is the quantitative refinement of equidistribution: it measures the worst-case deviation from the fair share across all subintervals.

Definition (forward difference). For a function and a shift , the forward difference is . Iterating, . If is a polynomial of degree with leading coefficient , then has degree with leading coefficient ; applying differences leaves a linear polynomial, and applying differences leaves the constant . This degree-lowering is the structural fact Weyl differencing exploits.

Counterexamples to common slips

  • "Equidistribution mod is the same as density mod ." Density (the orbit closure being all of ) is strictly weaker. The sequence built by listing the fractional parts of is dense but not equidistributed, because grows too slowly and the counts are weighted toward small values. Equidistribution demands the correct proportion in every interval, not merely that every interval is visited.

  • "Weyl's criterion needs all real frequencies ." Only integer frequencies are tested. Testing all real would be both unnecessary and false in general; the integer characters already separate points on by Weyl's theorem on trigonometric polynomial approximation, so they suffice.

  • " and equidistribute for the same elementary reason." The linear case follows from a one-line geometric-series bound. The quadratic case has no such direct bound — the partial sums of do not telescope — and genuinely requires differencing to reduce the square to a linear sum. Treating the quadratic case as a corollary of the linear one is the most common error.

Key theorem with proof Intermediate+

The signature result is Weyl's criterion together with its consequence for polynomial sequences; the differencing inequality is the tool that makes the polynomial case effective [Weyl 1916].

Theorem (Weyl's criterion; Weyl 1916). A sequence of reals is uniformly distributed mod if and only if $$ \frac{1}{N}\sum_{n=1}^{N} e(h, x_n) \longrightarrow 0 \qquad (N\to\infty) $$ for every integer .

Proof. Equidistribution is equivalent, by the Riemann-integrability reformulation, to for every Riemann-integrable . Taking for gives , so equidistribution forces the stated limit; this is the easy direction.

Conversely, suppose for all . For the average is identically . Hence for every trigonometric polynomial , by linearity. Let be Riemann-integrable and . By the Weierstrass approximation theorem on the circle there are trigonometric polynomials with and . Then $$ \frac1N\sum_{n\le N} P_-({x_n}) \le \frac1N\sum_{n\le N} g({x_n}) \le \frac1N\sum_{n\le N} P_+({x_n}), $$ and letting the outer averages converge to , which sandwich within . As is arbitrary, . Taking the indicator of (Riemann-integrable, its boundary having measure zero) yields uniform distribution.

Lemma (Weyl differencing inequality). For any complex numbers and any , the Weyl sum satisfies $$ |S|^{2} \le N + 2,\bigg|\sum_{h=1}^{N-1}\ \sum_{n=1}^{N-h} e\big(f(n+h) - f(n)\big)\bigg| \le N \sum_{|h| < N}\ \bigg|\sum_{n \in I_h} e\big(\Delta_h f(n)\big)\bigg|, $$ where is the range of with .

Proof. Expand the square: . Substitute with ranging over ; for fixed , runs over those indices with and , an interval of length . Thus . The term contributes . Pairing with (conjugate inner sums) and applying the triangle inequality gives the middle bound; bounding and counting the values of gives the final form .

Corollary (equidistribution of ). For irrational and any integer , the sequence is uniformly distributed mod .

Proof. By Weyl's criterion it suffices to show for each fixed integer ; write , still irrational. Induct on . For , the geometric series gives since . For , apply the differencing lemma to . The inner difference is a polynomial in of degree with leading coefficient , which is irrational for . By the inductive hypothesis its Weyl sum is for each fixed , uniformly enough that $$ |S_N|^2 \le N \sum_{|h|<N} \Big|\sum_{n\in I_h} e(\Delta_h f(n))\Big| = N\Big(N + \sum_{0<|h|<N} o(N)\Big) = o(N^2), $$ where the term gives and a dominated-convergence / averaging argument over (splitting into and , then sending after ) controls the remaining sum. Hence , completing the induction.

Bridge. Weyl differencing builds toward the entire circle method, and it appears again in van der Corput's method below, where the same square-and-shift step is iterated with care for the lengths of the shift ranges. The foundational reason the polynomial case reduces to the linear case is that the forward difference lowers the degree of a polynomial by exactly one while keeping the leading coefficient irrational, so the induction this is exactly the differencing lemma supplies bottoms out at the elementary geometric sum. The criterion and the inequality are dual faces of one idea: the criterion says equidistribution is cancellation in , and the differencing inequality is how one produces that cancellation for polynomial phases. Putting these together, the quantitative version — the Erdős-Turán inequality of the Advanced results — turns each differencing bound into an explicit discrepancy estimate, and the same major/minor-arc split that organises the circle method is the central insight that the differencing bound is strong precisely on the minor arcs where is poorly approximated by rationals.

Exercises Intermediate+

Advanced results Master

Weyl's inequality and the saving exponent

The qualitative corollary equidistributes becomes quantitative through Weyl's inequality [Vaughan 1997]. Let with and . Then $$ \sum_{n\le N} e(f(n)) \ll N^{1+\varepsilon}\Big(\frac1q + \frac1N + \frac{q}{N^k}\Big)^{1/K}, \qquad K = 2^{k-1}. $$ The proof is iterations of the differencing inequality, each squaring the sum and lowering the degree, terminating at a linear sum estimated by . The exponent is the cumulative cost of the squarings: differencing is lossy, and the loss compounds geometrically in the degree. On a genuine minor arc, where , the bound reads , a power saving. The companion estimate, Hua's lemma, bounds the -th moment and is what feeds Weyl's inequality into the circle method for Waring's problem.

The major/minor-arc dichotomy

The phrase " well or poorly approximated by rationals" is made precise by the Farey dissection of into major and minor arcs [Iwaniec-Kowalski 2004]. Fix a parameter . The major arcs are short intervals around each rational with ; the minor arcs are the rest. On the major arcs the Weyl sum is large but explicitly approximated by a main term (a product of a singular series and a singular integral); on the minor arcs Weyl's inequality forces , so their contribution to a circle-method integral is an error term. The entire Hardy-Littlewood method is the statement that , with the major arcs giving the asymptotic main term and the minor arcs — controlled by Weyl differencing — giving a smaller error. Equidistribution of is the qualitative shadow: every fixed nonzero eventually lands in a minor-arc regime relative to , where the sum is .

Van der Corput's method and discrepancy

Van der Corput's method refines differencing into two named operations on exponential sums with smooth (not necessarily polynomial) phases [Kuipers-Niederreiter 1974]. The -process (Weyl-van der Corput) is exactly the differencing inequality, trading a sum of length for an average of difference sums. The -process is Poisson summation 21.15.01 followed by stationary phase, replacing by a dual sum over the critical points of . Alternating and processes produces the exponent pairs with , the backbone of bounds for and the Dirichlet divisor problem. Quantitatively, the Erdős-Turán inequality [Erdős Turán 1948] converts any such bound into a discrepancy estimate: , so cancellation in finitely many Weyl sums caps the worst-case deviation from uniformity across all intervals at once.

Synthesis. Weyl's criterion, the differencing inequality, van der Corput's method, and the major/minor-arc dichotomy are one circle of ideas, and the bridge is the additive character : equidistribution is the vanishing of every character average, and every tool here is a way to produce that vanishing for an increasingly wide class of phases. The foundational reason the polynomial case works is that forward differencing lowers degree by one while keeping leading coefficients irrational, so the induction bottoms out at the geometric sum; this is exactly the -process, and the central insight is that it is dual to van der Corput's -process, which is Poisson summation from 21.15.01. Putting these together, the saving exponent generalises the linear bound up the degree ladder, the Erdős-Turán inequality generalises the qualitative criterion into a quantitative discrepancy bound, and the major/minor-arc split is dual to the partition of frequencies by Diophantine quality — the same partition that organises the Hardy-Littlewood circle method and, through the -process, ties Weyl sums to the Voronoi and Poisson summation formulae of 21.15.01.

Full proof set Master

Proposition 1 (degree reduction under forward differencing). If is a real polynomial of degree with leading coefficient , then for each the difference is a polynomial of degree exactly with leading coefficient ; consequently is constant.

Proof. Write . Then $$ \Delta_h f(x) = \alpha_k\big((x+h)^k - x^k\big) + \Delta_h(\text{lower}) = \alpha_k\big(k h, x^{k-1} + \tbinom{k}{2}h^2 x^{k-2} + \cdots\big) + \Delta_h(\text{lower}). $$ The top term cancels, leaving as the highest surviving power; the differenced lower-order terms have degree , so they do not interfere with the degree- leading coefficient. Thus with leading coefficient . Iterating, each application lowers the degree by one and multiplies the leading coefficient by the next factor , so after steps the leading coefficient is and the degree is .

Proposition 2 (irrationality propagates through differencing). If is irrational and , then for every choice of nonzero shifts the iterated difference is a linear polynomial with slope , which is irrational.

Proof. By Proposition 1 applied times, has degree and leading (slope) coefficient $$ c = k\cdot(k-1)\cdots 2 \cdot \alpha, h_1 h_2 \cdots h_{k-1} = k!,\alpha, h_1\cdots h_{k-1}. $$ Each is a nonzero integer and the integer factor is nonzero, so is a nonzero rational multiple of the irrational , hence irrational. The constant term is real but irrelevant: and the unimodular factor does not affect . Therefore the terminal linear sum is a geometric sum with irrational ratio , bounded by uniformly in .

Proposition 3 (Weyl's criterion via unique ergodicity for ). For irrational , the rotation on is uniquely ergodic, and consequently is uniformly distributed mod .

Proof. Let be any -invariant Borel probability measure on . For its Fourier coefficients , invariance under gives , so . For , irrationality of makes , forcing ; thus has the same Fourier coefficients as Lebesgue measure and equals it. Unique ergodicity gives, for every continuous , the uniform convergence for every starting point , in particular : . Specialising recovers for , which is Weyl's criterion; equidistribution follows. This is the dynamical proof, complementary to the geometric-series proof, and it is the route Mathlib's uniform-distribution development takes.

Connections Master

Weyl differencing is the qualitative ancestor of the van der Corput -process, which is Poisson summation followed by stationary phase; the dual sum it produces is exactly the transform side of the Poisson and Voronoi formulae developed in 21.15.01, so the two units are the -process and -process halves of one method for estimating .

The Weyl sum machinery rests on the additive character and the Fourier-analytic duality between and formalised in 02.10.04: Weyl's criterion is the statement that the integer characters separate invariant measures, and the differencing inequality is a Cauchy-Schwarz manipulation of these characters.

The minor-arc bounds delivered by Weyl's inequality are the input to the Hardy-Littlewood circle method, where they appear again alongside the complete exponential sums of 21.15.04; the Gauss and Kloosterman sums there govern the major-arc main terms while Weyl differencing governs the minor-arc error, so the two exponential-sum units partition the circle method between them.

Historical & philosophical context Master

Hermann Weyl's 1916 paper in the Mathematische Annalen [Weyl 1916] introduced all three pillars of this unit at once: the equidistribution criterion, the differencing method for polynomial phases, and the resulting theorem that is uniformly distributed for irrational . The paper grew out of work on the foundations of analysis and on almost-periodic phenomena; the equidistribution of had been observed independently by Bohl, Sierpiński, and Weyl around 1909-1910, but the polynomial case and the criterion were Weyl's. The differencing idea — squaring a sum to compare a term with its shifts — became one of the most-reused tools in analytic number theory.

Johannes van der Corput systematised the method in the 1920s into the - and -processes and the theory of exponent pairs, and his difference theorem (a sequence is equidistributed if all its difference sequences are) gave the qualitative form. The quantitative bridge from exponential-sum cancellation to discrepancy is the inequality of Paul Erdős and Pál Turán [Erdős Turán 1948], published in two parts in the Indagationes Mathematicae of 1948. Weyl's inequality entered the Hardy-Littlewood circle method as the standard minor-arc estimate, codified in Vaughan's tract [Vaughan 1997]; the modern refinements — Vinogradov's mean value theorem and its resolution by Bourgain-Demeter-Guth via decoupling in 2016 — sharpen the saving exponent far beyond Weyl's for large , but the differencing reduction remains the conceptual starting point.

Bibliography Master

@article{weyl1916,
  author  = {Weyl, Hermann},
  title   = {{\"U}ber die Gleichverteilung von Zahlen mod. Eins},
  journal = {Mathematische Annalen},
  volume  = {77},
  number  = {3},
  pages   = {313--352},
  year    = {1916}
}

@article{erdosturan1948,
  author  = {Erd{\H o}s, Paul and Tur{\'a}n, P{\'a}l},
  title   = {On a problem in the theory of uniform distribution},
  journal = {Indagationes Mathematicae},
  volume  = {10},
  pages   = {370--378, 406--413},
  year    = {1948}
}

@book{kuipersniederreiter1974,
  author    = {Kuipers, Lauwerens and Niederreiter, Harald},
  title     = {Uniform Distribution of Sequences},
  series    = {Pure and Applied Mathematics},
  publisher = {Wiley-Interscience},
  year      = {1974}
}

@book{iwaniec-kowalski2004,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004}
}

@book{vaughan1997,
  author    = {Vaughan, Robert C.},
  title     = {The Hardy-Littlewood Method},
  series    = {Cambridge Tracts in Mathematics},
  volume    = {125},
  edition   = {2},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@book{montgomery1994,
  author    = {Montgomery, Hugh L.},
  title     = {Ten Lectures on the Interface Between Analytic Number Theory and Harmonic Analysis},
  series    = {CBMS Regional Conference Series in Mathematics},
  volume    = {84},
  publisher = {American Mathematical Society},
  year      = {1994}
}