21.14.04 · number-theory / sieve-methods-large-sieve

Combinatorial Sieve Methods: Brun and Selberg

shipped3 tiersLean: none

Anchor (Master): Iwaniec-Kowalski 2004 *Analytic Number Theory* Ch. 6 (combinatorial and Selberg sieves, sieve dimension, the parity problem); Halberstam-Richert 1974 *Sieve Methods* (the canonical monograph); Friedlander-Iwaniec 2010 *Opera de Cribro* (AMS Colloquium 57) Ch. 1-6 (modern combinatorial sieve, the parity barrier and its evasion); Brun 1919/1920 (the original pure sieve and twin-prime convergence); Selberg 1947 *Norsk Vid. Selsk. Forh.* 19 (the quadratic upper-bound sieve)

Intuition Beginner

Suppose you want to count the prime numbers up to a hundred. The oldest method, the sieve of Eratosthenes, is to list every number, cross out the multiples of , then the multiples of , then of , and so on; whatever survives is prime. The crossing-out can be turned into an exact count using a bookkeeping trick: add back what you removed twice, subtract what you added back too often, and keep correcting. This back-and-forth correction is called inclusion-exclusion, and for counting primes it goes by the name of Legendre.

The trouble is that the corrections pile up. Each one carries a small rounding error, and there are so many corrections that the errors swamp the answer. Legendre's exact count is useless for large numbers because of this error flood. Viggo Brun's breakthrough, around , was simple to state: stop correcting early. If you cut off the inclusion-exclusion after a controlled number of steps, the leftover is provably an over-count or an under-count, never a wild error. You trade an exact-but-useless formula for an approximate-but-trustworthy bound.

Brun used his sieve to prove something striking about twin primes — pairs like or . Nobody knows whether there are infinitely many, but Brun showed that if you add up one-over-each-twin-prime, the total settles down to a finite number. Twin primes are rare enough that their reciprocals converge, even though ordinary primes' reciprocals do not.

Visual Beginner

A row of the numbers through . First the multiples of are shaded, then a second pass shades the multiples of , then a third pass the multiples of . Numbers shaded by two different passes (like , shaded by both and ) are double-counted if you simply add the shaded counts, which is why a correction is needed. The unshaded survivors past the first few are exactly the primes between and together with .

sifting primes used numbers removed over/under after adding correction
just exact none
over-removed add back
over- and under-counts add/subtract overlaps

The table shows the seesaw: each new prime you sift by removes some numbers but creates new double-counts among the multiples that share two primes. The full correction is an alternating sum, and Brun's idea is that chopping that alternating sum off at the right place still pens the true count between known limits.

Worked example Beginner

Count, by inclusion-exclusion, how many numbers from to have no prime factor among .

Step 1. Start with all numbers. Remove multiples of (there are ), of (there are ), of (there are ). Naive total removed: , already more than , because of overlaps.

Step 2. Add back the numbers removed twice: multiples of (five of them), of (three), of (two). Add back .

Step 3. Subtract the numbers removed three times: multiples of (just one, namely ). Subtract .

Step 4. The count of survivors is . Check: the numbers from to with no factor of are — exactly eight.

What this tells us: the exact answer came from a four-term alternating sum that matched the structure . The product of the "survival fractions" predicts the count. Brun's sieve keeps this product as the main term and bounds the error from stopping the alternating sum early.

Check your understanding Beginner

Formal definition Intermediate+

We follow Iwaniec-Kowalski [Iwaniec-Kowalski Ch. 6]. Throughout, denotes a prime and , (the number of distinct prime factors), and squarefreeness are as in 21.11.01.

Definition (the sieve problem). Let be a finite sequence of nonnegative reals, its total mass, and a set of primes. For a divisor supported on put , the mass of the subsequence with index divisible by . Fix a sifting level and write $$ P(z) = \prod_{\substack{p < z \ p \in \mathcal{P}}} p . $$ The sifting function is the surviving mass $$ S(\mathcal{A}, \mathcal{P}, z) = \sum_{\substack{n \le N \ (n, P(z)) = 1}} a_n . $$ To make the problem tractable one posits a density: a multiplicative function with and an approximation $$ A_d = g(d), X + r_d, \qquad g(d) = \prod_{p \mid d} g(p), $$ where is the remainder and is the expected proportion of killed by . Write for the sifting density, the heuristic survival probability.

Definition (Legendre sieve). Since by Möbius inversion of 21.11.01, $$ S(\mathcal{A}, \mathcal{P}, z) = \sum_{d \mid P(z)} \mu(d), A_d = X \sum_{d \mid P(z)} \mu(d), g(d) + \sum_{d \mid P(z)} \mu(d), r_d = X,V(z) + \sum_{d \mid P(z)} \mu(d), r_d . $$ The main term is exactly what the survival-fraction product predicts. The defect of the Legendre identity is the remainder sum: has prime factors, so it has squarefree divisors, and even if the error can reach , which is far larger than unless is tiny. The exact identity is therefore inert.

Definition (sieve dimension). The density has dimension if $$ \sum_{w \le p < z} g(p) \log p = \kappa \log\frac{z}{w} + O(1) \qquad (2 \le w \le z), $$ equivalently . The case (the linear sieve) governs sequences like used for twin primes, where for odd .

A truncated inclusion-exclusion replaces by a sieve weight supported on squarefree with , chosen so the surviving sum is one-sided. An upper-bound sieve requires ; a lower-bound sieve requires . Then or , and the art is choosing with main term close to and few nonzero entries.

Counterexamples to common slips

  • "The Legendre identity is wrong." It is exactly correct; it is merely useless, because the controlled main term is accompanied by a remainder sum over divisors whose errors need not cancel. Truncation does not fix an error in the identity — it limits how many remainder terms appear.
  • "A sieve detects primes." A sieve bounds the count of integers with no small prime factor — almost-primes — not primes themselves. The parity problem shows a sieve of this combinatorial type cannot distinguish integers with an even number of prime factors from those with an odd number, so it cannot isolate the primes (one prime factor) on its own.
  • "Selberg's weights are an approximation to ." The Selberg weights minimise a quadratic form and are generally not truncations of ; they are real numbers in chosen by optimisation, and only the constraint ties them to the Möbius identity.

Key theorem with proof Intermediate+

The signature theorem is Selberg's upper-bound sieve: a single nonnegativity trick produces, after diagonalising a quadratic form, an upper bound for the sifting function with main term .

Theorem (Selberg's upper bound). Let be a multiplicative density with for , and define the multiplicative function on squarefree by , together with . Then $$ S(\mathcal{A}, \mathcal{P}, z) ;\le; \frac{X}{G(z)} ;+; \sum_{\substack{d_1, d_2 < z \ d_i \mid P(z)}} |\lambda_{d_1} \lambda_{d_2}|; \big|r_{[d_1, d_2]}\big|, $$ where the optimal weights are with , and .

Proof. For any reals with , the square

because when the only divisor is and the left side is , while otherwise the left side is a square, hence . Summing against this inequality,

since and together mean . Inserting and using that is multiplicative with on squarefree arguments, the main term is the quadratic form

Diagonalise . Write and use the identity , valid because is the multiplicative function with on squarefree (locally , and the telescoping product gives only after the substitution below; concretely set ). Substituting and swapping the order of summation,

The change of variables is invertible by Möbius inversion, and the constraint becomes the linear constraint (taking ). Minimising the positive-definite form subject to is a Lagrange-multiplier problem; the minimum is attained at with . Since , a routine re-indexing identifies with for the reciprocal density, and the minimal value of is . Thus the main term is , and tracking the remainder through gives the stated error sum. The bound follows from the explicit formula for in terms of the partial sums .

Bridge. Selberg's sieve builds toward 21.16.04 pending the large sieve and Bombieri-Vinogradov, where the same quadratic-form positivity reappears as the dual large-sieve inequality and the remainder sum is controlled on average over moduli; it appears again in 21.14.03 the linear sieve, where the Selberg upper bound is paired with a matching lower bound to pin almost-primes. The foundational reason the method works is that is a nonnegative majorant of for any weights with , so optimisation over a finite-dimensional space of weights replaces the divergent Legendre alternating sum; this is exactly the move from an exact-but-inert identity to an inequality with a controllable error. The diagonalisation generalises the orthogonality that the large sieve will exploit, and the main term as shows the optimised bound recovers the heuristic survival density. Putting these together, the Selberg main term is dual to Brun's truncated inclusion-exclusion: both approximate , one by an optimised quadratic majorant and one by a Bonferroni-truncated alternating sum, and the central insight is that controlling the number of remainder terms — not their individual size — is what rescues the sieve.

Exercises Intermediate+

Advanced results Master

Theorem 1 (Brun's pure sieve and the fundamental lemma). For a -dimensional density and , the truncated Möbius sum over squarefree with satisfies [Iwaniec-Kowalski Ch. 6] $$ S(\mathcal{A}, \mathcal{P}, z) = X V(z)\Big(1 + O\big(e^{-s(\log s - \log\log 3s - 2)}\big)\Big) + O\Big(\sum_{\substack{d < z^{2k} \ d \mid P(z)}} |r_d|\Big), $$ with both an upper bound (even truncation ) and a lower bound (odd truncation ) by the Bonferroni inequalities (Exercise 3). The error is the fundamental lemma: once the sifting level is a small power of ( large), the sifting function is asymptotic to the heuristic with super-polynomially small relative error. Brun's original version [Brun 1920], with cruder truncation, already sufficed for the twin-prime bound and the convergence of to Brun's constant.

Theorem 2 (Selberg's sieve and its self-duality). The upper bound of the Key Theorem [Selberg 1947] gives with , and is optimal among all upper-bound sieves of the form . The Selberg weights, unlike Brun's, are not but real numbers obtained by minimising a positive-definite quadratic form, and the diagonalisation is the discrete analogue of completing the square. As , , so the optimised constant matches the heuristic; for the linear sieve the Selberg upper bound is within a factor of the truth, the deficit being precisely the parity barrier.

Theorem 3 (the sieve dimension and the linear sieve). The hypothesis defines the dimension [Halberstam-Richert]. For the Jurkat-Richert theorem furnishes the optimal continuous sieve functions solving the delay-differential system , with , for , yielding both upper and lower bounds $$ X V(z)\big(f(s) + o(1)\big) \le S(\mathcal{A}, \mathcal{P}, z) \le X V(z)\big(F(s) + o(1)\big), \qquad s = \frac{\log X}{\log z}. $$ The lower bound becomes positive only for , which is the linear-sieve form of the parity barrier: one cannot take as large as and still guarantee survivors are prime.

Theorem 4 (Chen's theorem). Chen 1973 [Halberstam-Richert]: every sufficiently large even is , a prime plus a number with at most two prime factors; equivalently there are infinitely many primes with . The proof is a weighted lower-bound sieve (the Selberg sieve combined with a switching trick and the Bombieri-Vinogradov theorem on primes in arithmetic progressions) that subtracts off the over-counted configurations, pushing the linear sieve to the very edge of the parity barrier. Chen's theorem is the closest unconditional approach to the twin-prime and binary Goldbach conjectures by sieve methods alone.

Theorem 5 (the parity problem). Selberg's parity principle [Friedlander-Iwaniec]: a sieve that uses only the sums (and bounds them by ) cannot distinguish integers with an even number of prime factors from those with an odd number, because the Liouville function of 21.11.01 is, to the sieve, indistinguishable from a constant of the same average. Concretely, there exist sequences with the same supported entirely on -even integers and others on -odd integers, so no combinatorial sieve can prove either set infinite. Detecting primes (, odd) therefore requires extra arithmetic input — bilinear (type II) sums, as in the Friedlander-Iwaniec theorem on primes , or the dispersion method — that lies outside the sieve axioms. The parity barrier is why the sieve gives (Chen) but not .

Synthesis. The combinatorial sieve is the foundational reason that questions about primes admit quantitative attack even when exact answers are out of reach: Brun's truncation converts the inert Legendre identity into a two-sided bound, and this is exactly the bridge from the elementary prime-product estimates of 21.11.03 — where first appears — to the analytic large sieve of 21.16.04 pending. The central insight is that one need not control the size of each remainder , only the number of surviving terms; Brun's Bonferroni truncation and Selberg's quadratic optimisation are two solutions to that single problem, both producing the main term and differing only in how they tame the divisor sum, which is dual to the way partial summation tamed the Mertens products of 21.11.03. Putting these together, the sieve dimension generalises the single Mertens product into a family indexed by how many residue classes each prime removes, the linear case governing the prime-counting band of 21.11.03 and governing twin primes and Goldbach. The parity problem then marks the method's limit: the sieve sees but is blind to , so Chen's is the boundary, and crossing it — to or to infinitely many twin primes — requires the bilinear input of the circle method in 21.16.05 pending, parity rather than error accumulation being the true obstruction once Brun has disposed of the latter.

Full proof set Master

Proposition 1 (the Legendre identity and its remainder). With notation as above, , and the remainder has at most terms.

Proof. By Möbius inversion of 21.11.01, . Hence $$ S = \sum_n a_n \sum_{\substack{d \mid P(z) \ d \mid n}} \mu(d) = \sum_{d \mid P(z)} \mu(d) \sum_{d \mid n} a_n = \sum_{d \mid P(z)} \mu(d) A_d . $$ Substituting and using multiplicativity, , which is the main term; the rest is . Since is squarefree with prime factors (restricting to all primes), it has divisors, bounding the term count.

Proposition 2 (partial-binomial truncation identity). For integers , .

Proof. Induct on . At both sides are . Assume the identity at . Then $$ \sum_{j=0}^{m}\binom{r}{j}(-1)^j = (-1)^{m-1}\binom{r-1}{m-1} + (-1)^m \binom{r}{m} = (-1)^m\Big(\binom{r}{m} - \binom{r-1}{m-1}\Big) = (-1)^m \binom{r-1}{m}, $$ by Pascal's rule . This is the claim. In particular the sign of the truncation error alternates with , which is the content of the Bonferroni inequalities and hence of Brun's two-sided sieve.

Proposition 3 (Selberg diagonalisation). The quadratic form equals with and .

Proof. On squarefree arguments . Define multiplicatively by ; then . The reciprocal-density identity holds for the conjugate function; carrying the standard normalisation (absorbing into the -variables) yields . Substitute and exchange summation: $$ Q(\lambda) = \sum_{e} h(e) \sum_{\substack{d_1, d_2 \ e \mid d_1, e \mid d_2}} \lambda_{d_1} g(d_1), \lambda_{d_2} g(d_2) = \sum_e h(e)\Big(\sum_{e \mid d} \lambda_d g(d)\Big)^2 = \sum_e h(e) y_e^2 . $$ The inner factorisation is exactly the square of , proving the diagonalisation.

Proposition 4 (optimal Selberg constant). Minimising subject to gives minimum value with , attained at .

Proof. By Cauchy-Schwarz with weights , $$ 1 = \Big(\sum_e \mu(e) y_e\Big)^2 = \Big(\sum_e \frac{\mu(e)}{\sqrt{h(e)}},\sqrt{h(e)},y_e\Big)^2 \le \Big(\sum_e \frac{\mu(e)^2}{h(e)}\Big)\Big(\sum_e h(e) y_e^2\Big), $$ so . Equality in Cauchy-Schwarz requires , i.e. ; the constraint fixes . The minimum value is therefore .

Connections Master

  • The entire method rests on the Möbius inversion and the convolution identity of 21.11.01: the Legendre sieve is this identity summed against , and the multiplicativity of the density that makes a product is the multiplicativity of arithmetic functions developed there.

  • The main term and its asymptotic are exactly the Mertens products of 21.11.03; the sieve dimension measures the rate of this product, and Mertens' third theorem is the prototype underlying the fundamental lemma.

  • The remainder sum of the Selberg sieve is controlled on average over moduli by 21.16.04 pending the large sieve and the Bombieri-Vinogradov theorem, which is what powers Chen's theorem; the quadratic-form positivity of Selberg's method is the same structure as the dual large-sieve inequality.

  • The parity barrier is evaded in 21.16.05 pending the circle method and the asymptotic sieve, where bilinear (type II) sums supply the arithmetic information about that the combinatorial sieve cannot access, yielding the Hardy-Littlewood asymptotics for Goldbach and Waring that the sieve bounds only from above.

  • The almost-prime outputs of Chen's theorem feed the elementary prime-distribution framework of 21.11.03 and the prime-counting heuristics there; conversely the Chebyshev bounds supply the crude prime-counting input that calibrates the sieve's main term against reality.

Historical & philosophical context Master

Viggo Brun announced his sieve in a Comptes Rendus note and developed it fully in a memoir [Brun 1920], the first genuine advance on the sieve of Eratosthenes since antiquity. Brun's idea — truncate the inclusion-exclusion to control the error — broke the deadlock created by the useless exactness of Legendre's identity, and his application to twin primes (the convergence of over twin primes, now Brun's constant ) gave the first nontrivial unconditional theorem about a sparse prime configuration. Atle Selberg, in a short note [Selberg 1947], replaced Brun's combinatorial truncation with the quadratic majorant , an optimisation that produced cleaner and often sharper upper bounds and became the standard upper-bound sieve.

The systematic theory was consolidated by Halberstam and Richert in their monograph [Halberstam-Richert], which fixed the modern notation for the dimension , the sieve functions and , and the Jurkat-Richert theorem for the linear sieve. Chen Jingrun's theorem that every large even number is stands as the high-water mark of the combinatorial sieve, reaching the parity barrier that Selberg had identified as the intrinsic limit of the method. Friedlander and Iwaniec's Opera de Cribro [Friedlander-Iwaniec] gave the contemporary account, including the bilinear-form methods that evade parity, exemplified by their theorem that infinitely many primes have the form .

Bibliography Master

@book{iwaniec_kowalski_2004,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004},
  note      = {Chapter 6: the sieve, Brun, Selberg, parity}
}

@book{halberstam_richert_1974,
  author    = {Halberstam, Heini and Richert, Hans-Egon},
  title     = {Sieve Methods},
  series    = {London Mathematical Society Monographs},
  volume    = {4},
  publisher = {Academic Press},
  year      = {1974}
}

@article{brun_1920,
  author  = {Brun, Viggo},
  title   = {Le crible d'\'{E}ratosth\`{e}ne et le th\'{e}or\`{e}me de Goldbach},
  journal = {Videnskapsselskapets Skrifter, I. Mat.-Naturv. Klasse, Kristiania},
  volume  = {3},
  pages   = {1--36},
  year    = {1920}
}

@article{selberg_1947,
  author  = {Selberg, Atle},
  title   = {On an elementary method in the theory of primes},
  journal = {Norske Videnskabers Selskabs Forhandlinger (Trondheim)},
  volume  = {19},
  number  = {18},
  pages   = {64--67},
  year    = {1947}
}

@article{chen_1973,
  author  = {Chen, Jing Run},
  title   = {On the representation of a larger even integer as the sum of a prime and the product of at most two primes},
  journal = {Scientia Sinica},
  volume  = {16},
  pages   = {157--176},
  year    = {1973}
}

@book{friedlander_iwaniec_2010,
  author    = {Friedlander, John and Iwaniec, Henryk},
  title     = {Opera de Cribro},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {57},
  publisher = {American Mathematical Society},
  year      = {2010}
}