21.13.04 · number-theory / dirichlet-l-functions-characters

The Polya-Vinogradov Inequality

shipped3 tiersLean: none

Anchor (Master): Polya 1918 *Göttinger Nachrichten* 21-29 (originator: Über die Verteilung der quadratischen Reste und Nichtreste); Vinogradov 1918 *Perm. Univ. Fiz.-Mat. Obshch.* (independent originator); Davenport 2000 *Multiplicative Number Theory* 3rd ed. (Springer GTM 74) §23; Montgomery-Vaughan 2007 *Multiplicative Number Theory I* (Cambridge) §9.4; Burgess 1957 *Mathematika* 4, 106-112 and 1962 *Proc. London Math. Soc.* (3) 12, 193-206 (the Burgess bound, the only known improvement in the short-sum range); Montgomery-Vaughan 1977 *Invent. Math.* 43, 69-82 (GRH-conditional refinement); Granville-Soundararajan 2007 *J. Amer. Math. Soc.* 20, 357-384 (the structure of large character sums); Paley 1932 *J. London Math. Soc.* 7, 28-32 (the $\Omega$-result: the inequality is sharp up to the constant for infinitely many $q$); Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium Publications 53) §12.4

Intuition Beginner

Take a non-principal character — for the cleanest case, the pattern that labels each odd number as a square or a non-square modulo a fixed odd prime. Walk along the integers and keep a running tally: add when the label is , subtract when the label is . The question of this unit is simple to state. As you walk further and further, how big can the running tally get?

If the labels behaved like genuine coin flips, the tally would drift like a random walk, wandering away from zero only as fast as the square root of the number of steps. The labels are not random — they are a fixed arithmetic pattern — yet the Polya-Vinogradov inequality says the tally stays almost as well-behaved as a random walk: no matter where you start the walk and no matter how long you walk, the tally never strays farther from zero than roughly (with a small extra logarithm), where is the modulus. The pattern cannot conspire to build up a long one-sided run.

Why does this matter? Because a bounded tally means the and labels stay nearly balanced over every stretch of integers. That balance is the quantitative engine behind statements like "the squares and non-squares modulo are evenly mixed," and it forces the first non-square to appear early — you cannot have a long opening run of squares, because that run would push the tally up past what the inequality allows.

Visual Beginner

The picture is the running tally of the pattern modulo . Using the Legendre symbol from 21.01.06, the squares modulo are and the non-squares are , so the labels on are and the pattern then repeats with period .

The table shows the running tally , the sum of the first labels:

label
tally

The tally never climbs above and never falls below over a full period. The label at is because shares a factor with the modulus, so multiples of contribute nothing. Over the whole period the labels sum to zero — three 's and three 's — which is why the walk returns to where it began. Polya-Vinogradov is the statement that this narrow band persists for every modulus, with width controlled by .

Worked example Beginner

Track the running tally for the squares-versus-non-squares pattern modulo and find the largest tally over one period.

Step 1. List the squares modulo . Squaring gives , so the squares modulo are and the non-squares are .

Step 2. Write the labels on through : the label is on a square and on a non-square, giving $$ +1, -1, +1, +1, +1, -1, -1, -1, +1, -1. $$

Step 3. Form the running tally by adding the labels one at a time: $$ 1,\ 0,\ 1,\ 2,\ 3,\ 2,\ 1,\ 0,\ 1,\ 0. $$

Step 4. Read off the largest value: the tally peaks at , reached at after the opening run . Over the full period the tally returns to , as it must, since the five 's and five 's cancel.

What this tells us: the peak tally is comfortably below . The largest tally measures the worst imbalance between squares and non-squares over any opening stretch, and here that imbalance never exceeds . Polya-Vinogradov guarantees a bound of this size — about — for every modulus and for every non-principal character, not just this small example.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is an integer, is a Dirichlet character modulo in the sense of 21.03.02, and denotes the standard additive character of . For a real number , write for the distance from to the nearest integer.

Definition (character sum). For integers and the character sum over the interval is $$ S_\chi(M, N) := \sum_{M < n \leq M + N} \chi(n). $$ The sum is complete when (it runs over a full period) and incomplete otherwise. The triangle inequality gives the elementary bound , and a second elementary bound holds for any length, since the complete sum vanishes for non-principal and any incomplete sum differs from a sub-interval of a single period.

Definition (Gauss sum). For a character modulo , the Gauss sum is $$ \tau(\chi) := \sum_{a \bmod q} \chi(a) e(a/q). $$ A character modulo is primitive if it is not induced by any character of strictly smaller modulus dividing ; equivalently, is not constant on the cosets of for any proper divisor . For primitive the separability identity holds: for every integer , and the Gauss sum has absolute value .

Definition (finite Fourier expansion of ). Rearranging the separability identity, a primitive character admits the finite Fourier expansion $$ \chi(n) = \frac{1}{\tau(\overline\chi)} \sum_{a \bmod q} \overline\chi(a), e!\left(\frac{an}{q}\right), $$ expressing as a linear combination of the additive characters with coefficients of constant modulus .

The expansion is the analytic heart of the unit: it converts a sum of the multiplicative quantity over an interval into a weighted sum of linear exponential sums , each of which is a geometric series and therefore explicitly summable.

Counterexamples to common slips

  • "The Gauss-sum expansion holds for any character." The clean identity holds for all only when is primitive. For imprimitive it fails when , and one must first reduce to the primitive character modulo its conductor . The Polya-Vinogradov bound for imprimitive then follows from the primitive case with , so primitivity is no loss of generality.

  • " is a bound on the complete sum." The complete sum of a non-principal character is exactly . Polya-Vinogradov is a bound on the incomplete sums over arbitrary sub-intervals, uniformly in and ; the content is that no partial run can build up an imbalance larger than .

  • "The logarithm comes from the Gauss sum." The factor comes from , but the comes from summing the geometric-series bounds over : the tail behaves like before division by . The two factors have distinct origins.

Key theorem with proof Intermediate+

The signature theorem is the Polya-Vinogradov inequality, proved by completing the incomplete character sum through the Gauss-sum Fourier expansion and bounding the resulting linear exponential sums.

Theorem (Polya-Vinogradov; Polya 1918, Vinogradov 1918). Let be a non-principal Dirichlet character modulo . Then for all integers and all , $$ \left| \sum_{M < n \leq M + N} \chi(n) \right| \leq \sqrt{q}, \log q. $$

Proof. By the reduction noted in the formal-definition section it suffices to treat primitive modulo (an imprimitive non-principal is induced by a primitive of conductor , and the interval sum changes only by terms where , controlled by the same bound with ). Assume henceforth primitive, so and the finite Fourier expansion holds for every .

Insert the expansion into the character sum and exchange the two finite sums: $$ S_\chi(M, N) = \sum_{M < n \leq M + N} \frac{1}{\tau(\overline\chi)} \sum_{a \bmod q} \overline\chi(a), e!\left(\frac{an}{q}\right) = \frac{1}{\tau(\overline\chi)} \sum_{a \bmod q} \overline\chi(a) \sum_{M < n \leq M + N} e!\left(\frac{an}{q}\right). $$

The term contributes since is non-principal (indeed by the convention that characters vanish off the units), so the outer sum runs over . Take absolute values, using and : $$ |S_\chi(M, N)| \leq \frac{1}{\sqrt{q}} \sum_{a = 1}^{q - 1} \left| \sum_{M < n \leq M + N} e!\left(\frac{an}{q}\right) \right|. $$

The inner sum is a geometric series with ratio . For , $$ \left| \sum_{M < n \leq M + N} e!\left(\frac{an}{q}\right) \right| = \left| \frac{e(a(M+N)/q) - e(aM/q)}{e(a/q) - 1} \right| \leq \frac{2}{|e(a/q) - 1|} = \frac{1}{|\sin(\pi a/q)|}, $$ using and the numerator bound . With the elementary inequality for the distance to the nearest integer, this gives $$ \left| \sum_{M < n \leq M + N} e!\left(\frac{an}{q}\right) \right| \leq \frac{1}{2 |a/q|}. $$

For the distance equals when and when ; the values for are therefore taken symmetrically about the midpoint, so $$ \sum_{a = 1}^{q - 1} \frac{1}{2|a/q|} = \frac{q}{2} \sum_{a = 1}^{q - 1} \frac{1}{\min(a, q - a)} \leq \frac{q}{2} \cdot 2 \sum_{b = 1}^{\lfloor q/2 \rfloor} \frac{1}{b} = q \sum_{b = 1}^{\lfloor q/2 \rfloor} \frac{1}{b}. $$

The harmonic sum satisfies for (and the bound is checked directly for the finitely many smaller ). Hence $$ |S_\chi(M, N)| \leq \frac{1}{\sqrt{q}} \cdot q \log q = \sqrt{q}, \log q, $$ which is the claimed inequality.

Bridge. The Polya-Vinogradov inequality builds toward the Burgess bound and the analytic theory of -functions, and appears again in 21.12.02 (the prime-number theorem in arithmetic progressions), where uniform control of character sums underlies the error term. The central insight is that completion — replacing an incomplete multiplicative sum by a weighted complete one through the finite Fourier expansion — trades the hard problem of cancellation in for the easy problem of cancellation in geometric series ; this is exactly the mechanism that converts the Gauss-sum bound into a bound on every interval sum. The same expansion generalises: replacing the linear phase by higher-degree phases gives Weil's bounds for complete sums, and the bridge is that all of analytic number theory's character-sum estimates begin by writing a multiplicative weight as a Fourier series in additive characters. Putting these together, the foundational reason the squares and non-squares stay mixed is the unit-modulus Gauss sum, and this is dual to the statement that proved in 21.03.02.

Exercises Intermediate+

Advanced results Master

The unconditional inequality is sharp in its dependence on up to the secondary logarithm. Paley's -result (Paley 1932 [Paley 1932]) produces, for infinitely many real primitive characters modulo , partial sums as large as ; the gap between and is the entire remaining uncertainty in the unconditional theory. Under the Generalised Riemann Hypothesis for , Montgomery and Vaughan (1977) [Montgomery-Vaughan 1977] closed most of that gap from above, proving , which matches Paley's lower bound up to the implied constant. The conditional truth is therefore for the extremal characters.

The mechanism behind the extremal characters is pretentiousness, isolated by Granville and Soundararajan (2007) [Granville-Soundararajan 2007]. A character has an abnormally large partial sum exactly when it correlates with — "pretends to be" — a character of small conductor and a fixed parity; the pretentious distance $$ \mathbb{D}(\chi, \xi; x)^2 = \sum_{p \leq x} \frac{1 - \mathrm{Re},\chi(p)\overline{\xi(p)}}{p} $$ between and a short test character measures the deficit. For characters of odd order , Granville and Soundararajan improve the unconditional bound to with , beating the classical . This is the first unconditional improvement to the constant exponent of the logarithm and locates the obstruction precisely in low-order, low-conductor mimicry.

In the short range the Burgess bound (Burgess 1957, 1962) [Burgess 1957] remains the only unconditional cancellation below , a genuine saving for , and its consequence for the least quadratic non-residue is still, after seven decades, the best unconditional exponent. Breaking the barrier in the Burgess range is equivalent to subconvexity bounds for and remains open.

Synthesis. The Polya-Vinogradov inequality is the foundational reason character sums cancel, and the entire tower above is a refinement of its single mechanism: completion through the Gauss-sum Fourier expansion. The central insight is that the bound factors as a Gauss-sum amplitude times a harmonic tail , and each later improvement attacks one factor — Burgess shortens the range by iterating the completion with multiplicative shifts, GRH sharpens the tail from to , and the pretentious method explains the extremal characters as those that mimic short conductors. This is exactly the pattern that organises analytic number theory: a clean unconditional bound, an -result showing it is nearly sharp, a conditional bound matching the -result, and a structural theory explaining the extremisers. Putting these together, the least-non-residue problem, the subconvexity problem for , and the distribution of character sums are dual faces of one question, and the bridge between them is the Gauss sum that first appeared in 21.03.02. The non-vanishing and the boundedness of generalise the same arithmetic harmony, and the whole structure builds toward the subconvexity programme and the Langlands-theoretic -function estimates that appears again in 21.10.01.

Full proof set Master

Proposition 1 (Gauss-sum separability for primitive characters). Let be a primitive Dirichlet character modulo . Then for every integer , $$ \chi(n), \tau(\overline\chi) = \sum_{a \bmod q} \overline\chi(a), e!\left(\frac{an}{q}\right). $$

Proof. When , substitute (a bijection of the units modulo ) in : with , $$ \tau(\overline\chi) = \sum_{a} \overline\chi(an) e(an/q) = \overline\chi(n) \sum_a \overline\chi(a) e(an/q), $$ and multiplying by (a value of modulus ) gives the claim. When , the left side is since ; the right side is , an exponential sum over a subgroup that vanishes precisely because is primitive (an imprimitive character would leave a nonzero residual sum over the cosets of the inducing modulus). The primitivity hypothesis is exactly what forces both sides to vanish together.

Proposition 2 (Gauss-sum modulus). For primitive modulo , .

Proof. Compute . By Proposition 1 applied with the substitution over units (and the vanishing off the units), the double sum collapses to . The inner sum over a complete residue system is when and otherwise, leaving . Hence .

Proposition 3 (imprimitive reduction). Let be a non-principal character modulo induced by the primitive character of conductor . If the Polya-Vinogradov bound holds for , then for an absolute treatment, where the divisor factor accounts for removing the primes dividing .

Proof. Write . Möbius-expanding the coprimality indicator over divisors that capture the extra primes, , gives $$ S_\chi(M, N) = \sum_{e \mid q/q^\star} \mu(e), \chi^\star(e) \sum_{M/e < m \leq (M+N)/e} \chi^\star(m). $$ Each inner sum is a character sum for the primitive over an interval, bounded by . Summing over the squarefree divisors of and using with the standard divisor estimate yields the stated bound; the leading absorbs the divisor factor for the inequality as stated.

Connections Master

This unit specialises the Gauss-sum and orthogonality theory of 21.03.02 (Dirichlet -functions and characters): the Fourier expansion and the evaluation are the only inputs beyond the geometric series, so Polya-Vinogradov is the simplest quantitative payoff of the character theory developed there.

The least-non-residue corollary refines the qualitative residue theory of 21.01.06 (quadratic residues and the Legendre symbol): where that unit establishes which residues are squares, this one bounds how soon the first non-square must appear, turning a structural fact into an effective estimate.

The inequality is a workhorse in 21.12.02 (the prime-number theorem in arithmetic progressions), where uniform control of across moduli feeds the explicit-formula error term, and in 21.14.01 (the large sieve), whose mean-value bounds for character sums generalise the single-character Polya-Vinogradov estimate to averages over all characters of a given modulus.

Historical & philosophical context Master

George Polya and Ivan Vinogradov proved the inequality independently in 1918 [Polya 1918], Polya in the Göttinger Nachrichten and Vinogradov in the journal of the Perm physico-mathematical society, each as a tool for studying the distribution of quadratic residues. Polya's motivation was the spacing of residues and non-residues; Vinogradov's was the least non-residue, a problem he would return to repeatedly. The two derivations are essentially the same completion argument through the Gauss sum, and the joint name reflects the genuine independence of the discoveries in the disrupted scientific communication of 1918.

Raymond Paley showed in 1932 [Paley 1932] that the bound cannot be unconditionally improved beyond , using a clever averaging over real characters; this fixed the target for all later work. David Burgess's 1957 thesis result [Burgess 1957] broke the range barrier for the first time, and his bound for the least non-residue has stood unimproved since. The pretentious reformulation of Granville and Soundararajan (2007) recast the extremal-character question in the language of multiplicative-function distance, connecting Polya-Vinogradov to the Halász-Montgomery theory and to subconvexity for .

Bibliography Master

@article{polya1918verteilung,
  author  = {P{\'o}lya, George},
  title   = {{\"U}ber die Verteilung der quadratischen Reste und Nichtreste},
  journal = {Nachrichten von der K{\"o}niglichen Gesellschaft der Wissenschaften zu G{\"o}ttingen, Mathematisch-Physikalische Klasse},
  year    = {1918},
  pages   = {21--29}
}

@article{vinogradov1918distribution,
  author  = {Vinogradov, Ivan M.},
  title   = {On the distribution of residues and non-residues of powers},
  journal = {Journal of the Physico-Mathematical Society of Perm University},
  volume  = {1},
  year    = {1918},
  pages   = {94--98},
  note    = {In Russian}
}

@article{paley1932theorem,
  author  = {Paley, Raymond E. A. C.},
  title   = {A theorem on characters},
  journal = {Journal of the London Mathematical Society},
  volume  = {7},
  year    = {1932},
  pages   = {28--32}
}

@article{burgess1957character,
  author  = {Burgess, David A.},
  title   = {On character sums and {$L$}-series},
  journal = {Mathematika},
  volume  = {4},
  year    = {1957},
  pages   = {106--112}
}

@article{burgess1962character2,
  author  = {Burgess, David A.},
  title   = {On character sums and {$L$}-series. {II}},
  journal = {Proceedings of the London Mathematical Society (3)},
  volume  = {12},
  year    = {1962},
  pages   = {193--206}
}

@article{montgomeryvaughan1977exponential,
  author  = {Montgomery, Hugh L. and Vaughan, Robert C.},
  title   = {Exponential sums with multiplicative coefficients},
  journal = {Inventiones Mathematicae},
  volume  = {43},
  year    = {1977},
  pages   = {69--82}
}

@article{granvillesound2007large,
  author  = {Granville, Andrew and Soundararajan, Kannan},
  title   = {Large character sums: pretentious characters and the {P}{\'o}lya-{V}inogradov theorem},
  journal = {Journal of the American Mathematical Society},
  volume  = {20},
  year    = {2007},
  pages   = {357--384}
}

@book{montgomeryvaughan2007mult,
  author    = {Montgomery, Hugh L. and Vaughan, Robert C.},
  title     = {Multiplicative Number Theory I: Classical Theory},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {97},
  publisher = {Cambridge University Press},
  year      = {2007}
}

@book{davenport2000mult,
  author    = {Davenport, Harold},
  title     = {Multiplicative Number Theory},
  edition   = {3},
  series    = {Graduate Texts in Mathematics},
  volume    = {74},
  publisher = {Springer},
  year      = {2000},
  note      = {Revised by H. L. Montgomery}
}