21.13.03 · number-theory / dirichlet-l-functions-characters

The Prime Number Theorem in Arithmetic Progressions and Siegel-Walfisz

shipped3 tiersLean: none

Anchor (Master): Davenport 2000 *Multiplicative Number Theory* 3e §20, §22 (the full PNT-in-AP and Siegel-Walfisz development, the exceptional-zero term, the $q \le (\log x)^A$ uniformity); Montgomery-Vaughan 2007 *Multiplicative Number Theory I* §11; Iwaniec-Kowalski 2004 *Analytic Number Theory* (AMS Colloquium Publications 53) §5.9, §17 (Siegel-Walfisz and its role in the large sieve and Bombieri-Vinogradov); Walfisz 1936 *Math. Z.* 40; Siegel 1935 *Acta Arithmetica* 1 (the ineffective lower bound feeding the uniformity)

Intuition Beginner

Split the whole numbers by the remainder they leave when divided by a fixed number . Among numbers that leave remainder on division by — that is — how many are prime? Among those leaving remainder — that is — how many are prime? Dirichlet proved long ago that each such list with remainder sharing no common factor with holds infinitely many primes. The question now is sharper: do the primes spread evenly across these lists?

The answer is yes, and the statement is the prime number theorem for arithmetic progressions. There are exactly remainders that share no factor with , where counts them, and the primes split among those lists in equal shares. If you weight each prime by the logarithm of its base and add the weights up to inside one list, the running total is about divided by . Every allowed remainder gets the same slice of the primes; none is favoured.

Proving this reuses the contour machinery that proved the ordinary prime number theorem, but run once for each "character", a gadget that detects remainders. The hard part is honesty about how big may grow with before the proof stops working. One single bad zero of one single -function — the exceptional zero — could throw one remainder class off balance, and ruling it out is the most delicate step. The honest theorem, Siegel-Walfisz, lets grow only as fast as a fixed power of the logarithm of .

Visual Beginner

Picture bins, one for each remainder that shares no factor with . As you walk along the number line up to , each prime drops into the bin matching its remainder. The prime number theorem for progressions says all bins fill at the same rate: each holds about of the logarithm-weighted total. The bins stay level with one another, like water finding one height across connected tanks.

modulus allowed remainders count share per class

The middle column lists the remainders coprime to ; the last column is the equal slice each receives. The one tilted bin in the picture is the danger the exceptional zero poses, and the dashed guarantee line is exactly what Siegel-Walfisz secures while stays small.

Worked example Beginner

Count the logarithm-weighted primes up to in the two allowed classes modulo , and check they come out roughly equal.

Step 1. List the primes up to : they are . Drop the prime , since it shares the factor with and lands in no allowed class. The allowed remainders modulo are and .

Step 2. Sort the rest by remainder. Remainder : the primes (since , , ). Remainder : the primes (since ).

Step 3. Weight each prime by the natural logarithm of itself and add. Remainder : . Remainder : .

What this tells us: the two totals, and , are already close, and the theorem says their ratio approaches as grows. The predicted common share is ; both totals sit a little below , which is the expected undercount at this tiny scale. The primes are not crowding into one class — they are splitting evenly, exactly as the equal-bins picture promised.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is the modulus, is an integer with , ranges over Dirichlet characters modulo as in 21.03.02, is the principal character, is the -function with Euler product for , is the von Mangoldt function, and is Euler's totient. We follow Davenport [Davenport §19].

Definition (Chebyshev function in a progression). For real set $$ \psi(x; q, a) = \sum_{\substack{n \le x \ n \equiv a ,(\mathrm{mod}, q)}} \Lambda(n), $$ the von Mangoldt weight summed over the prime powers congruent to modulo . The twisted Chebyshev function is $$ \psi(x, \chi) = \sum_{n \le x} \chi(n)\Lambda(n), $$ the character-weighted analogue of the of 21.12.02.

Definition (orthogonality decomposition). The orthogonality of Dirichlet characters, when and otherwise for , gives the exact identity $$ \psi(x; q, a) = \frac{1}{\varphi(q)} \sum_{\chi \bmod q} \overline\chi(a), \psi(x, \chi). $$ The principal term contributes , the expected main term ; every non-principal contributes an error term controlled by the zeros of .

Definition (the PNT in arithmetic progressions). The prime number theorem in arithmetic progressions is the asymptotic $$ \psi(x; q, a) \sim \frac{x}{\varphi(q)} \qquad (x \to \infty) $$ for each fixed and each with ; equivalently for the count of primes in the class. The content is the equidistribution: the leading constant is the same for every allowed .

The symbols , , , , , , the modulus , the exceptional zero , and for real part are recorded in _meta/NOTATION.md. The exceptional (Siegel) zero is the possible real zero of for real described in 21.13.01.

Counterexamples to common slips

  • "Equidistribution is obvious once Dirichlet's theorem gives infinitely many primes per class." Infinitude is far weaker than equal proportion: Dirichlet's theorem allows wildly unequal densities a priori. The equal constant is exactly the non-vanishing of 21.03.02 sharpened to a zero-free region.
  • "The error term for each is uniform up to near ." It is not. Siegel-Walfisz secures uniformity only for . The barrier is the exceptional zero, whose contribution is controlled only by Siegel's ineffective bound, which degrades as grows.
  • "The decomposition has an error because orthogonality is approximate." The orthogonality identity is exact for ; the decomposition is an equality, not an estimate. All error enters through the analytic behaviour of each , not through the splitting.

Key theorem with proof Intermediate+

The signature result is the Siegel-Walfisz theorem: equidistribution with a de la Vallée Poussin error term, uniform in the modulus up to a fixed power of . The proof runs the contour argument of 21.12.02 once per character, then reassembles by orthogonality, with Siegel's bound taming the one exceptional zero. [Davenport §22]

Theorem (Siegel-Walfisz). Fix . There is a constant such that for all and all with , $$ \psi(x; q, a) = \frac{x}{\varphi(q)} + O!\big(x\exp(-c\sqrt{\log x})\big), $$ the implied constant depending only on . The constant is ineffective.

Proof. Start from the exact decomposition . The principal character gives by the prime number theorem of 21.12.02, so its share is up to the stated error. It remains to bound for each non-principal .

For non-principal the twisted explicit formula reads, for , $$ \psi(x, \chi) = -\sum_{|\Im\rho| \le T}\frac{x^\rho}{\rho} + O!\Big(\frac{x\log^2(qx)}{T}\Big), $$ the sum over the non-elementary zeros of inside the critical strip. Insert the zero-free region of 21.13.01: every such zero satisfies , with the single possible exception, for real , of one real zero . Bounding each term by and summing against the zero-density estimate yields, for the non-exceptional zeros, $$ \Big|\sum_{\rho \ne \beta}\frac{x^\rho}{\rho}\Big| \ll x\exp!\Big(-\frac{c_2\log x}{\log(qT)}\Big)\log^2(qT) + \frac{x\log^2(qx)}{T}. $$ Choosing and using , so , both pieces are .

The one term left is the exceptional zero, present only for at most one real modulo (Landau's repulsion, 21.13.01). It contributes to . Its size is . Siegel's theorem [Siegel 1935], referenced through 21.13.01, gives . Take : since , one has , so , and $$ x^\beta \ll x\exp!\big(-c_4 (\log x)^{1/2}\big) = x\exp(-c_4\sqrt{\log x}). $$ The constant is ineffective because Siegel's is. Collecting the principal main term, the non-exceptional zero sum, and the exceptional term, every error is with , completing the proof.

Bridge. This uniformity builds toward the analytic theory of primes in progressions and appears again in 21.14.02, where the large sieve replaces the per-modulus exceptional-zero control by an average over . The foundational reason the proof works is the exact orthogonality split: it converts a question about one residue class into independent contour problems, and this is exactly the device by which the single-modulus PNT of 21.12.02 generalises to every progression at once. The central insight is that the only obstruction to clean uniformity is one real zero of one real-character -function, so the entire strength of the theorem is dual to the strength of the lower bound on : putting these together, Siegel's ineffective is exactly what forces the range and no larger, and the bridge is the trade of an effective but feeble range for an ineffective but genuine one.

Exercises Intermediate+

Advanced results Master

Theorem 1 (twisted explicit formula). For non-principal modulo and [Davenport §19], $$ \psi(x, \chi) = -\sum_{|\Im\rho| \le T}\frac{x^\rho}{\rho} - (1 - E_0(\chi))\log x + O!\Big(\frac{x\log^2(qx)}{T}\Big), $$ the sum over non-elementary zeros of , with if and otherwise. The absence of a main term for non-principal — there is no pole of at — is the analytic statement of equidistribution: only produces , and orthogonality then averages the rest to zero.

Theorem 2 (PNT in progressions with exceptional term). Assembling the twisted explicit formulae through orthogonality [Davenport §20], $$ \psi(x; q, a) = \frac{x}{\varphi(q)} - \frac{\overline\chi_1(a)}{\varphi(q)}\frac{x^{\beta}}{\beta} + O!\big(x\exp(-c\sqrt{\log x})\big), $$ where the middle term is present only if some real modulo has an exceptional zero ; otherwise it is absent and the error stands alone. The exceptional term is the unique obstruction to clean equidistribution and is the reason the theorem is conditional in quality on the Landau-Siegel zero.

Theorem 3 (Siegel-Walfisz, uniform form). For every there is with [Walfisz 1936] $$ \psi(x; q, a) = \frac{x}{\varphi(q)} + O!\big(x\exp(-c\sqrt{\log x})\big) \qquad (q \le (\log x)^A,\ (a, q) = 1), $$ and correspondingly by partial summation. The constant is ineffective, inherited from Siegel's bound on ; this is the sole source of ineffectivity, since the non-exceptional zeros are handled by the effective de la Vallée Poussin region of 21.13.01.

Theorem 4 (the barrier). The range is sharp for the method [Iwaniec-Kowalski §5.9]. For the exceptional term to be subsumed in the error one needs , which via forces , i.e. . No choice of pushes the range to a power of ; the exceptional zero is the impassable wall for any single modulus, and crossing it for individual would require eliminating the Landau-Siegel zero outright.

Theorem 5 (recovery on average: Bombieri-Vinogradov). The large sieve of 21.14.02 yields, for every and [Iwaniec-Kowalski §17], $$ \sum_{q \le Q}\max_{(a, q) = 1}\Big|\psi(x; q, a) - \frac{x}{\varphi(q)}\Big| \ll_A \frac{x}{(\log x)^A}. $$ This is the average-over-moduli statement that compensates for the per-modulus failure: it reaches at the cost of a maximum over residues and a sum over moduli, and it is unconditional and effective, because the rarity of exceptional zeros (Landau's repulsion) makes their pooled contribution negligible.

Synthesis. The prime number theorem in arithmetic progressions is the foundational reason equidistribution of primes in residue classes is a theorem with an error term, and the exceptional zero is the single structural flaw running through it, exactly as in 21.13.01. The central insight is that orthogonality splits one residue class into independent -function contour problems, so the single-modulus PNT of 21.12.02 generalises verbatim except for the one real zero that the zero-free region cannot exclude: putting these together, the principal character supplies the main term while every other character contributes only zero-driven error, and this is exactly the analytic content of "no remainder is favoured". The bridge is everywhere the exceptional zero — its contribution is dual to Siegel's lower bound on , and the range is precisely the regime where that ineffective bound still beats . This is dual to the large-sieve story: where Siegel-Walfisz controls each modulus up to a power of at the cost of ineffectivity, the Bombieri-Vinogradov theorem of 21.14.02 controls all moduli up to on average and effectively, trading per-modulus precision for an average that the rarity of exceptional zeros makes clean. The central insight recurs across analytic number theory: a single conjectural zero conditions the entire quantitative theory, and every uniform statement about primes in progressions is written, like the zero-free region itself, in the conditional mood the Siegel zero imposes.

Full proof set Master

Proposition 1 (orthogonality decomposition is exact). For and , .

Proof. The character orthogonality relation gives if with , and otherwise; for the condition already forces . Multiply by , sum over , and exchange the finite sums: $$ \frac{1}{\varphi(q)}\sum_{\chi}\overline\chi(a)\sum_{n \le x}\chi(n)\Lambda(n) = \sum_{n \le x}\Lambda(n)\frac{1}{\varphi(q)}\sum_\chi \overline\chi(a)\chi(n) = \sum_{\substack{n \le x \ n \equiv a}}\Lambda(n) = \psi(x; q, a). $$ The inner sum on the left is , giving the claim. No error term arises; the identity is exact.

Proposition 2 (no main term for non-principal ). For non-principal , is holomorphic and non-zero at , so .

Proof. For non-principal the -function converges for by partial summation (the partial sums of are bounded by ), hence is holomorphic across , and by 21.03.02. Thus has no pole at , and the Perron contour for sweeps past no residue at . The leading contribution that the pole of supplied for in 21.12.02 is therefore absent, leaving only the zero-driven sum, which is by the zero-free region.

Proposition 3 (exceptional-term bound under Siegel). If has an exceptional zero and , then with ineffective.

Proof. Write . By Siegel's theorem [Siegel 1935], for each there is with . Take ; then gives , so and $$ x^\beta \le x\exp!\big(-C(\varepsilon)(\log x)^{-1/2}\cdot\log x\big) = x\exp!\big(-C(\varepsilon)\sqrt{\log x}\big). $$ Set . Siegel's is ineffective because its proof selects an auxiliary real character with the most extreme exceptional zero, whose existence cannot be decided, so is positive but not computable.

Proposition 4 (non-exceptional zero sum). With and , the contribution of the non-exceptional zeros of to is .

Proof. Each zero other than obeys by 21.13.01, so for . The number of zeros with is , and each contributes with or ; partial summation against the density bound gives . Adding the truncation error , and using since and , both terms are .

Connections Master

  • The contour proof, truncated Perron formula, and the optimisation are imported wholesale from 21.12.02; this unit runs that single-modulus argument once per character and reassembles by orthogonality, so the analytic engine is identical and only the -fold bookkeeping and the exceptional-zero term are new.

  • The zero-free region for , Landau's repulsion, and the isolation of the exceptional zero are exactly the apparatus of 21.13.01; the present unit consumes that infrastructure — the de la Vallée Poussin region for the non-exceptional zeros and the at-most-one-exceptional-zero bound for the real-character term — to turn the per-character explicit formula into a uniform error.

  • The orthogonality of Dirichlet characters and the non-vanishing that gives every non-principal no main term are the facts established in 21.03.02; the decomposition is the exact identity through which those facts become equidistribution.

  • Siegel's ineffective lower bound , the single source of ineffectivity here and the controller of the exceptional term, is developed in detail in the sibling unit 21.13.02; the range is precisely the regime where that bound dominates .

  • The large sieve and the Bombieri-Vinogradov theorem of 21.14.02 are the average-over-moduli sequel: where Siegel-Walfisz reaches only per modulus, the large sieve recovers on average, unconditionally and effectively, by exploiting the rarity of exceptional zeros that this unit's per-modulus argument cannot exploit.

Historical & philosophical context Master

The prime number theorem in arithmetic progressions descends directly from Dirichlet's 1837 proof that each coprime residue class modulo contains infinitely many primes, the first appearance of -functions and of the non-vanishing . The quantitative equidistribution , with the de la Vallée Poussin error term, followed the 1896 prime number theorem for the same character-twisted contour reasons, but only for fixed or slowly growing ; the obstruction to uniformity was the exceptional zero isolated by Edmund Landau in 1918. The decisive uniform statement is due to Arnold Walfisz, who in 1936 [Walfisz 1936] combined Carl Ludwig Siegel's ineffective lower bound on [Siegel 1935], published the year before, with the contour method to prove equidistribution uniformly for . The theorem bears both names because Siegel supplied the analytic input and Walfisz the synthesis.

The ineffectivity Siegel introduced is permanent under this method: the constant in the error is known to be positive but cannot be computed, because Siegel's argument bounds all in terms of a hypothetical extremal exceptional zero whose existence is undecided. The limitation stood as the working range for individual moduli until the large-sieve methods of the 1960s. Klaus Roth and, decisively, Enrico Bombieri and A. I. Vinogradov in 1965 proved that on average over the equidistribution holds with the strength one would expect from the Generalized Riemann Hypothesis for each modulus — the Bombieri-Vinogradov theorem — which is the form actually used in modern prime-gap and additive-prime results. Davenport's Multiplicative Number Theory §20-22 and Iwaniec-Kowalski's Analytic Number Theory are the canonical treatments.

Bibliography Master

@article{walfisz1936,
  author  = {Walfisz, Arnold},
  title   = {Zur additiven Zahlentheorie II},
  journal = {Mathematische Zeitschrift},
  volume  = {40},
  pages   = {592--607},
  year    = {1936}
}

@article{siegel1935ap,
  author  = {Siegel, Carl Ludwig},
  title   = {\"{U}ber die Classenzahl quadratischer Zahlk\"{o}rper},
  journal = {Acta Arithmetica},
  volume  = {1},
  pages   = {83--86},
  year    = {1935}
}

@article{bombieri1965,
  author  = {Bombieri, Enrico},
  title   = {On the large sieve},
  journal = {Mathematika},
  volume  = {12},
  pages   = {201--225},
  year    = {1965}
}

@book{davenport2000ap,
  author    = {Davenport, Harold},
  title     = {Multiplicative Number Theory},
  edition   = {3},
  series    = {Graduate Texts in Mathematics},
  volume    = {74},
  publisher = {Springer-Verlag},
  year      = {2000},
  note      = {Revised by H. L. Montgomery; \S19--22: PNT in arithmetic progressions and Siegel-Walfisz}
}

@book{iwaniec-kowalski2004ap,
  author    = {Iwaniec, Henryk and Kowalski, Emmanuel},
  title     = {Analytic Number Theory},
  series    = {American Mathematical Society Colloquium Publications},
  volume    = {53},
  publisher = {American Mathematical Society},
  year      = {2004},
  note      = {\S5.9 and Ch. 17: Siegel-Walfisz, the large sieve, and Bombieri-Vinogradov}
}

@book{montgomery-vaughan2007ap,
  author    = {Montgomery, Hugh L. and Vaughan, Robert C.},
  title     = {Multiplicative Number Theory I: Classical Theory},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {97},
  publisher = {Cambridge University Press},
  year      = {2007},
  note      = {\S11.3: the Siegel-Walfisz theorem and its ineffective constant}
}