The Bombieri-Vinogradov Theorem
Anchor (Master): Bombieri 1965 *Mathematika* 12, 201-225 (*On the large sieve* — the original theorem); A. I. Vinogradov 1965 *Izv. Akad. Nauk SSSR* 29, 903-934 (independent proof); Montgomery-Vaughan 2007 *Multiplicative Number Theory I* (Cambridge SMM 97) §9; Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium 53) §17; Vaughan 1980 *Mathematika* 27, 153-163 (the simplified Vaughan-identity proof); Elliott-Halberstam 1970 *Symposia Mathematica* 4, 59-72 (the conjecture); Zhang 2014 *Annals of Math.* 179, 1121-1174 and Maynard 2015 *Annals of Math.* 181, 383-413 (bounded gaps)
Intuition Beginner
The primes thin out as you go up, but among the numbers with a fixed last-digit pattern they should thin out at the same rate. Count primes up to a million that leave remainder when divided by , and remainder , and remainder , and so on through the six allowed remainders: each box should hold roughly the same share of the primes. This is the prime number theorem for arithmetic progressions, and it says the primes do not play favourites among the remainder classes a divisor allows.
The catch is the size of the divisor. For small divisors this even spread is a proven fact. For divisors as large as the square root of where you are counting, the even spread is only a conjecture — the Generalized Riemann Hypothesis — and nobody can prove it for any single large divisor. The Bombieri-Vinogradov theorem makes a clever trade: instead of demanding the even spread for every large divisor, it asks only that the spread be even on average as the divisor ranges over all values up to the square root. That averaged statement, remarkably, can be proven outright.
So you give up control of each individual divisor and gain control of the whole pile. A few divisors might misbehave, but they cannot all misbehave at once, because the total error stays small. This is why people call the theorem "the Riemann Hypothesis on average": it delivers, unconditionally, the same strength the famous conjecture would give for each divisor separately, paid out as a sum rather than term by term.
The engine underneath is the large sieve. It is the tool that says well-separated test points cannot all light up at once, and summing the misbehaviour of many divisors is exactly a sum over well-separated points.
Visual Beginner
Picture a long counter with a box for each divisor from up to the square root of . Into box you drop the worst error you can find across all the remainder classes that allows — how far the prime count in the most lopsided class strays from its fair share .
divisor q: 2 3 4 5 ... ~sqrt(x)
┌──┐ ┌──┐ ┌──┐ ┌──┐ ┌──┐
worst error: │▁ │ │▃ │ │▁ │ │▂ │ ... │▁ │
└──┘ └──┘ └──┘ └──┘ └──┘
\________ total height capped ________/
sum of all box heights << x / (log x)^A
Any one box may spike, but the whole row's
summed height stays tiny -- "GRH on average".
A single box might rise — one stubborn divisor whose primes lean one way. But the bracket under the whole row stays low: add up every box and the total is a vanishing fraction of . That summed cap, not any single box, is what the theorem guarantees.
Worked example Beginner
We check the "fair share" idea by hand on a small divisor, to see what number the theorem says the error should hug.
Step 1. Take the divisor . The remainders coprime to are and ; there are of them. The prime number theorem for progressions predicts that primes split evenly between the class "" and the class "".
Step 2. Count primes up to . There are primes up to . Removing (the only prime not coprime to ), primes fall into the two allowed classes.
Step 3. The fair share for each class is primes apiece.
Step 4. Count the actual classes. Primes : — that is . Primes : — that is .
Step 5. The errors from the fair share of are and . The worst error for is , small against the count .
What this tells us: even at this tiny scale the lopsidedness is a single prime out of two dozen. Bombieri-Vinogradov says that when you add up the worst such error over every divisor up to , the grand total still stays below for any power you like — the small wobbles never conspire into a large sum.
Check your understanding Beginner
Formal definition Intermediate+
Write for the von Mangoldt function, for the weighted prime count in the progression , and for Euler's totient. The expected main term is , since the primes coprime to should distribute evenly over the admissible classes.
Definition (error term and its maximum over residues). For set $$ E(x;q,a) = \psi(x;q,a) - \frac{x}{\varphi(q)}, \qquad E^(x;q) = \max_{\gcd(a,q)=1}\big|E(x;q,a)\big|. $$ $E^(x;q)q$.
Definition (level of distribution). A real is a level of distribution for the primes if for every there is a such that $$ \sum_{q \le Q} E^*(x;q) \ll_A \frac{x}{(\log x)^A}, \qquad Q = \frac{x^{\vartheta}}{(\log x)^B}. $$ The larger , the longer the range of moduli over which equidistribution holds on average.
Definition (the Bombieri-Vinogradov theorem). The primes have level of distribution : for every there is with $$ \sum_{q \le Q} \max_{\gcd(a,q)=1}\Big|\psi(x;q,a) - \frac{x}{\varphi(q)}\Big| \ll_A \frac{x}{(\log x)^A}, \qquad Q = x^{1/2}(\log x)^{-B}. $$ The Generalized Riemann Hypothesis for Dirichlet -functions would give for each individual , hence the same averaged bound up to ; Bombieri-Vinogradov delivers the averaged conclusion unconditionally. In this sense is "GRH on average".
Definition (the Elliott-Halberstam conjecture). The conjecture asserts that every is a level of distribution: for all and all , $$ \sum_{q \le x^{1-\varepsilon}} E^*(x;q) \ll_{A,\varepsilon} \frac{x}{(\log x)^A}. $$ It extends the averaged equidistribution from moduli up to to moduli up to almost , and is far stronger than GRH (which controls each only to the scale).
Counterexamples to common slips
- "Bombieri-Vinogradov gives equidistribution for each ." It does not. It bounds the sum of the errors; an individual can have as large as in principle, and only finitely many such bad are excluded by the averaging. Pointwise equidistribution at this scale is GRH.
- "The maximum over is harmless and can be dropped." The inside the sum is essential and far from automatic to install: a bound for a single fixed is weaker, and the proof must produce uniformity in before summing. Replacing the max by an average over gives a strictly easier statement (the Barban-Davenport-Halberstam theorem).
- "Level and GRH are the same." GRH gives on average and pointwise control for each ; Bombieri-Vinogradov gives only the averaged half. Elliott-Halberstam () is strictly stronger than GRH on the averaged side, yet says nothing pointwise.
Key theorem with proof Intermediate+
The signature result is the level- bound; its proof routes the error through Dirichlet characters, applies the multiplicative large sieve of 21.14.01 to a bilinear decomposition of , and disposes of small moduli by Siegel-Walfisz [Montgomery-Vaughan 2007].
Theorem (Bombieri-Vinogradov; Bombieri 1965, A. I. Vinogradov 1965). For every there is such that, with , $$ \sum_{q \le Q} \max_{\gcd(a,q)=1}\Big|\psi(x;q,a) - \frac{x}{\varphi(q)}\Big| \ll_A \frac{x}{(\log x)^A}. $$
Proof. Detect the residue class with Dirichlet characters: for , $$ \psi(x;q,a) - \frac{x}{\varphi(q)}\cdot[\text{principal part}] = \frac{1}{\varphi(q)}\sum_{\chi \ne \chi_0}\bar\chi(a),\psi(x,\chi), \qquad \psi(x,\chi) = \sum_{n\le x}\chi(n)\Lambda(n), $$ so , the error absorbing the prime powers dividing . Each non-principal mod is induced by a unique primitive mod , and . Reordering the sum over by the primitive character of conductor , $$ \sum_{q\le Q}E^(x;q) \ll (\log x)\sum_{q^\le Q}\frac{1}{\varphi(q^)}\ \sideset{}{^}\sum_{\chi^\bmod q^}\max_{y\le x}|\psi(y,\chi^)| + \frac{x}{(\log x)^A}, $$ where the weight $\sum_{q\le Q,\ q^\mid q}1/\varphi(q) \ll (\log x)/\varphi(q^)q^$.
Two ranges of conductor are handled separately. For small conductor , the Siegel-Walfisz theorem — itself resting on the zero-free region and the Siegel-zero analysis of 21.13.01 — gives uniformly, and there are such characters, so their total contribution is .
For large conductor the saving comes from the large sieve. Decompose by Vaughan's identity [Vaughan 1980] into Type I sums (one smooth variable) and Type II bilinear sums (two rough variables), with . Apply the multiplicative large sieve of 21.14.01,
$$
\sum_{q^\le Q}\frac{q^}{\varphi(q^)}\ \sideset{}{^}\sum_{\chi^}\Big|\sum_{n}\xi_n\chi^(n)\Big|^2 \ll (M + Q^2)\sum_n|\xi_n|^2,
$$
to each bilinear piece after Cauchy-Schwarz in the two variables. With and the coefficient lengths balanced near , the resulting bound is once is taken large enough. Summing the small- and large-conductor contributions and passing back from to via Perron inversion 21.11.04 yields the stated bound.
Bridge. The proof builds toward the bounded-gaps results of the Advanced results, where level is fed into a Selberg sieve, and the same character-sum machinery appears again in Linnik's theorem on the least prime in a progression. The foundational reason the threshold sits at is exactly the large-sieve constant of 21.14.01: balancing against the coefficient length forces , so this is exactly the same tradeoff that capped Brun-Titchmarsh. The split into small and large conductor is dual to the split in the prime number theorem itself: small moduli are governed by the zero-free region of 21.13.01, large moduli by the sieve, and putting these together gives the central insight that an averaged GRH needs no Riemann Hypothesis — the average over converts a statement about individual -function zeros into a statement about the -energy of character sums, which the large sieve controls outright.
Exercises Intermediate+
Advanced results Master
The density-estimate proof and the comparison ladder
The cleanest modern proof routes through a zero-density estimate for Dirichlet -functions, summed over moduli by the large sieve [Montgomery-Vaughan 2007]. Writing for the number of zeros of in , , the large sieve yields $$ \sum_{q\le Q}\frac{q}{\varphi(q)}\ \sideset{}{^*}\sum_{\chi\bmod q}N(\alpha,T,\chi) \ll \big(Q^2 T\big)^{\frac{c(1-\alpha)}{1}}(\log QT)^{c'}, $$ a bound exhibiting that zeros far from the critical line are rare on average over the modulus. Fed through the explicit formula for , this density estimate converts directly into the averaged error bound, with the level emerging from the balance . The comparison ladder is then sharp: Siegel-Walfisz controls individually; Bombieri-Vinogradov controls on average; GRH would control every individually; Elliott-Halberstam would extend the average to .
The Vaughan-identity bilinear decomposition
Bombieri's original 1965 argument and Vinogradov's independent one used the large sieve in a more intricate form; Vaughan's 1980 identity streamlined the proof to its present shape [Vaughan 1980]. The identity writes as a sum of a Type I part (a single smooth divisor variable) and a Type II part (a genuine bilinear form in two rough variables). The Type I sums are disposed of by the Pólya-Vinogradov bound and partial summation; the Type II sums, after Cauchy-Schwarz, are exactly the character bilinear forms the multiplicative large sieve of 21.14.01 bounds. The threshold is the precise point where the large-sieve constant stops beating the diagonal bound when , so the bilinear method and the density-estimate method agree on where the wall stands.
Elliott-Halberstam, bounded gaps, and the parity barrier
The Elliott-Halberstam conjecture asks for level [Elliott-Halberstam 1970]; it is open and strictly stronger than GRH on the averaged side. Its arithmetic payoff is bounded gaps between primes. Goldston-Pintz-Yıldırım showed that any level forces ; Zhang broke the barrier for the restricted class of smooth, well-factorable moduli in a fixed residue class [Zhang 2014], obtaining a finite gap, and Maynard's multidimensional sieve achieved bounded gaps from Bombieri-Vinogradov's level alone [Maynard 2015], with Elliott-Halberstam sharpening the gap to . The parity barrier of sieve theory caps these methods: they cannot reach the twin-prime gap without an input that distinguishes numbers by the parity of their prime-factor count, which neither the large sieve nor Elliott-Halberstam supplies.
Synthesis. Bombieri-Vinogradov is the capstone of primes-in-progressions on average, and the bridge is the multiplicative large sieve of 21.14.01 read as an density statement: the inequality says character sums of length cannot all be large across the moduli , and this is exactly the orthogonality of primitive characters dualized into an operator-norm bound, the same duality that fixed the constant . The foundational reason the level is generalises the Brun-Titchmarsh threshold: balancing against the coefficient length caps at , so this is exactly the tradeoff seen there, now summed against the explicit formula of 21.11.04 instead of against the prime indicator. Putting these together, the small-conductor range is dual to the zero-free-region analysis of 21.13.01 — Siegel-Walfisz handling through the very same exceptional-zero control — while the large-conductor range is the large sieve's; the central insight is that averaging over replaces the Generalized Riemann Hypothesis with a zero-density estimate, and the parity barrier of 21.14.04 marks exactly the line where this circle of ideas, Elliott-Halberstam included, cannot reach the twin primes without new arithmetic.
Full proof set Master
Proposition 1 (character detection of a residue class). For , , where .
Proof. Orthogonality of Dirichlet characters gives if and , and otherwise (when every ; when but , the orthogonality relation vanishes). Since already forces the surviving to be coprime to , multiplying by and summing over interchanges the finite character sum with the sum over : $$ \psi(x;q,a) = \sum_{n\le x}\Lambda(n)\frac{1}{\varphi(q)}\sum_\chi\bar\chi(a)\chi(n) = \frac{1}{\varphi(q)}\sum_\chi\bar\chi(a)\sum_{n\le x}\chi(n)\Lambda(n), $$ which is the claim; the with are prime powers and contribute .
Proposition 2 (main term from the principal character). The principal-character term equals , identifying .
Proof. For mod , . The prime number theorem gives , so . Subtracting this from Proposition 1 isolates as the stated non-principal sum.
Proposition 3 (reduction to primitive characters). If mod is induced by the primitive $\chi^q^q^\mid q\psi(x,\chi) = \psi(x,\chi^) + O((\log qx)^2)$.
Proof. For one has , while for . Hence . The summand is non-zero only for prime powers with , . There are such primes, each yielding prime powers, each weighted by . The total is .
Proposition 4 (Siegel-Walfisz disposal of small conductors). For $\chi^q^\le(\log x)^B\max_{y\le x}|\psi(y,\chi^)| \ll_C x\exp(-c\sqrt{\log x})Cc=c(C)$, the implied constant ineffective when an exceptional (Siegel) zero is present.*
Proof. The Siegel-Walfisz theorem follows from the zero-free region of 21.13.01: has no zero with save a possible exceptional real zero (Siegel's ineffective bound). The truncated explicit formula then bounds the zero sum by once and , the exceptional zero (if any) contributing , still admissible for chosen against the target. The ineffectivity of is inherited from Siegel's bound.
Proposition 5 (the level- threshold from the large-sieve constant). In the Type II bilinear sums, the multiplicative large sieve over conductors $q^\le QQ\le x^{1/2+o(1)}$.*
Proof. The Type II sum, after Cauchy-Schwarz, reduces to bounding with the product of the two bilinear lengths. The multiplicative large sieve of 21.14.01 bounds this by . The diagonal bound, applying classwise, gives at the top of the range. The large sieve improves on this precisely when fails outright, i.e. when , that is . For the constant matches the diagonal bound and no saving remains.
Connections Master
The entire proof is downstream of the multiplicative large sieve of 21.14.01: the Type II bilinear sums are bounded by its constant after Cauchy-Schwarz, and the level of distribution is the same balance that caps the Brun-Titchmarsh range there, so Bombieri-Vinogradov is the large sieve's deepest arithmetic consequence.
The small-conductor range is handled by Siegel-Walfisz, which rests entirely on the zero-free regions and the exceptional-zero analysis of 21.13.01: the ineffective constant in Bombieri-Vinogradov is inherited from Siegel's ineffective lower bound for , and the dichotomy between the zero-free region and the possible Siegel zero is what makes the small- estimate uniform.
The passage from a bound on the character sum to a bound on the residue-class count uses the truncated Perron formula and Mellin inversion of 21.11.04: the explicit formula that turns -function zeros into a sum over is exactly the contour-integral machinery developed there, specialized to .
The parity barrier that prevents Bombieri-Vinogradov and even Elliott-Halberstam from delivering the twin-prime gap is the combinatorial-sieve obstruction of 21.14.04: the large sieve and the Selberg sieve both fail to separate the two parities of , which is why bounded gaps stop short of .
Historical & philosophical context Master
The theorem was proved independently in 1965 by Enrico Bombieri, in his Mathematika paper on the large sieve [Bombieri 1965], and by Askold Ivanovich Vinogradov in the Izvestiya [Vinogradov 1965], the two arriving at the same level- averaged bound by closely related large-sieve routes. Bombieri's formulation, with the maximum over residue classes inside the sum, became the standard one; the result is consistently named for both, with Bombieri's name usually first by the depth of the surrounding paper. The averaged-GRH reading was understood at once: the theorem makes unconditional what GRH would give pointwise, and it has since substituted for GRH in a long list of applications where only the average over is needed.
Robert Vaughan's 1980 identity [Vaughan 1980] reorganized the proof into the Type I / Type II bilinear shape now taught universally, replacing the more delicate original arguments. Peter Elliott and Heini Halberstam stated their conjecture in 1970 [Elliott-Halberstam 1970], extending the level of distribution toward . Its dramatic application came in 2013-2014: Yitang Zhang proved a Bombieri-Vinogradov-type estimate past level for smooth moduli and deduced bounded gaps between primes [Zhang 2014], and James Maynard, with a multidimensional sieve, obtained bounded gaps from the unconditional level alone and the gap bound under Elliott-Halberstam [Maynard 2015].
Bibliography Master
@article{bombieri1965largesieve,
author = {Bombieri, Enrico},
title = {On the large sieve},
journal = {Mathematika},
volume = {12},
pages = {201--225},
year = {1965}
}
@article{vinogradov1965density,
author = {Vinogradov, Askold I.},
title = {The density hypothesis for {Dirichlet} {L}-series},
journal = {Izvestiya Akademii Nauk SSSR, Seriya Matematicheskaya},
volume = {29},
pages = {903--934},
year = {1965},
note = {Corrigendum, ibid. 30 (1966), 719--720}
}
@article{vaughan1980elementary,
author = {Vaughan, Robert C.},
title = {An elementary method in prime number theory},
journal = {Mathematika},
volume = {27},
pages = {153--163},
year = {1980}
}
@article{elliotthalberstam1970conjecture,
author = {Elliott, Peter D. T. A. and Halberstam, Heini},
title = {A conjecture in prime number theory},
journal = {Symposia Mathematica},
volume = {4},
pages = {59--72},
year = {1970}
}
@article{zhang2014boundedgaps,
author = {Zhang, Yitang},
title = {Bounded gaps between primes},
journal = {Annals of Mathematics},
volume = {179},
number = {3},
pages = {1121--1174},
year = {2014}
}
@article{maynard2015smallgaps,
author = {Maynard, James},
title = {Small gaps between primes},
journal = {Annals of Mathematics},
volume = {181},
number = {1},
pages = {383--413},
year = {2015}
}
@book{davenport2000multiplicative,
author = {Davenport, Harold},
title = {Multiplicative Number Theory},
edition = {3},
series = {Graduate Texts in Mathematics},
volume = {74},
publisher = {Springer-Verlag},
year = {2000},
note = {Revised by H. L. Montgomery}
}
@book{montgomeryvaughan2007,
author = {Montgomery, Hugh L. and Vaughan, Robert C.},
title = {Multiplicative Number Theory I: Classical Theory},
series = {Cambridge Studies in Advanced Mathematics},
number = {97},
publisher = {Cambridge University Press},
year = {2007}
}
@book{iwaniec-kowalski2004,
author = {Iwaniec, Henryk and Kowalski, Emmanuel},
title = {Analytic Number Theory},
series = {American Mathematical Society Colloquium Publications},
volume = {53},
publisher = {American Mathematical Society},
year = {2004}
}