Primes, the fundamental theorem of arithmetic, and the infinitude of primes
Anchor (Master): Euclid c. 300 BCE *Elements* IX Prop. 20 (originator of the infinitude proof); Hardy-Wright 2008 *An Introduction to the Theory of Numbers* 6e (Oxford, revised Heath-Brown-Silverman-Wiles) §§I-II, XXII; Apostol 1976 *Introduction to Analytic Number Theory* §§1-2; Lang 2002 *Algebra* (Springer GTM 211, 3e) Ch. II §5; Tenenbaum 2015 *Introduction to Analytic and Probabilistic Number Theory* 3e (Cambridge UP) §I; Euler 1737 *Comm. Acad. Sci. Petropolitanae* 9 (analytic proof via Euler product); Furstenberg 1955 *Amer. Math. Monthly* 62 (topological proof)
Intuition Beginner
A prime number is a whole number larger than that cannot be split into two smaller whole-number factors. The number is prime: the only way to write it as a product of two positive whole numbers is . The number is not prime: it splits as , or as , or as . Numbers that are not prime and larger than are called composite. The number is treated as neither prime nor composite, a convention chosen so that prime factorisations are unique.
The first few primes are . They get sparser as you walk up the number line, but they never stop. Between and there are primes; between and there are ; between and there are . The thinning is gradual, not sudden.
Every whole number larger than can be built by multiplying primes together, and the recipe is essentially unique. The number breaks up as , or compactly . No other multiset of primes multiplies to . This pattern — every whole number is a product of primes, in essentially one way — is the fundamental theorem of arithmetic.
There are infinitely many primes. Euclid proved this around BCE with a short and beautiful argument. Suppose you had a complete finite list of all the primes. Form the number . Dividing by any prime in your list leaves a remainder of , so none of those primes divides . Either is itself a prime not in your list, or has a prime factor not in your list. Either way, your list was incomplete. The list of primes cannot be finite.
Visual Beginner
A picture of a number-line grid from to , with primes highlighted in one colour and composites in another. The composite numbers form a regular thicket — multiples of on every other square, multiples of on every third, multiples of on every fifth — and what remains is the sieve of primes. Below the grid, an arrow points to the number with its prime tree branching out: .
The grid is the sieve of Eratosthenes, the oldest method for finding primes: cross out every multiple of except itself, then every multiple of except , and so on. What survives is the list of primes up to your chosen bound. The factor tree is the algorithm for unique factorisation: split repeatedly until every leaf is prime, and the multiset of leaves is the factorisation.
Worked example Beginner
Factor the number into primes.
Step 1. Check divisibility by the smallest primes. The number is odd, so does not divide it. The digit sum is , which is not a multiple of , so does not divide it. The last digit is , not or , so does not divide it.
Step 2. Try . Compute . The division comes out exactly: . So .
Step 3. Factor . Check : , not exact. Try : , exact. So .
Step 4. Both and are prime (no smaller prime divides them). The full factorisation is .
What this tells us: even a small-looking number can have a memorable prime structure. The product is the reason that multiplying any three-digit number by produces a six-digit number with the original three digits repeated.
Check your understanding Beginner
Formal definition Intermediate+
The notion of a prime, the multiplicative structure of , and the infinitude of primes admit clean definitions on the ring of integers, building on the divisibility and Bézout apparatus of 21.01.01.
Definition (prime). A positive integer is prime if its only positive divisors are and . A positive integer that is not prime is composite.
Definition (irreducible vs. prime, ring-theoretic). In an arbitrary integral domain , a nonzero non-unit is irreducible if forces or to be a unit; it is prime if forces or . In the two notions coincide, because is a unique-factorisation domain; in general rings they can differ.
Definition (prime factorisation). A prime factorisation of an integer is an expression with primes and integers.
Definition (prime-counting function). The prime-counting function is defined by $$ \pi(x) = #{ p \leq x : p \text{ prime} }. $$ Thus (primes ); ; ; .
Definition (sieve of Eratosthenes). Given a bound , the sieve of Eratosthenes computes as follows: initialise the list ; for each from up to , if , remove every multiple of from ; the remaining elements of are the primes up to .
Counterexamples to common slips
" is prime." The number is excluded from the primes by convention, because including it would break the uniqueness clause of the fundamental theorem: would give infinitely many "factorisations". The same convention extends to general unique-factorisation domains: units are factored out, leaving only the prime (irreducible) factors.
"Every irreducible element is prime." True in and in every unique-factorisation domain, but false in general. In the element is irreducible (no smaller-norm factor) but not prime: , yet divides neither factor. This is exactly the failure that motivated Dedekind's theory of ideals.
"Composites have more than one prime factorisation." The decomposition is the same factorisation written two ways. Uniqueness in the fundamental theorem means the multiset of prime factors (equivalently, the function recording the exponent of each prime) is unique.
Key theorem with proof Intermediate+
The signature theorem of this unit is the fundamental theorem of arithmetic. We prove both halves — existence via strong induction, uniqueness via Euclid's Lemma — and along the way state Euclid's original infinitude proof.
Theorem (fundamental theorem of arithmetic). Every integer admits a prime factorisation with primes and . The factorisation is unique: the multiset is determined by .
Proof. Existence. By strong induction on (the induction principle of 00.12.01). For , the factorisation is . For the inductive step, assume every integer with admits a prime factorisation, and consider . If is prime, is the factorisation. Otherwise is composite, so for some integers with . By the inductive hypothesis, and as products of primes; concatenating, is a product of primes. Re-grouping equal primes and sorting yields the canonical form.
Uniqueness. Suppose has two prime factorisations with all prime (writing the factorisations as flat products without exponent compression). We show by induction on that the two multisets coincide.
The base case gives ; since is prime its only positive divisors are and , forcing and .
For the inductive step, . By Euclid's Lemma (a prime dividing a product divides one of the factors — proved as Exercise 4 of 21.01.01 via Bézout), for some . Since is prime and , . Cancelling, , a product of primes on the left. By the inductive hypothesis on , the multisets and coincide. Adjoining to both sides gives the full multiset equality.
Corollary (Euclid c. 300 BCE). There are infinitely many primes.
Proof. Suppose for contradiction that is a complete enumeration of the primes. Form . By the fundamental theorem, has at least one prime divisor . If for some , then and , so — impossible for a prime. Hence is a prime not in the list, contradicting completeness.
Bridge. The fundamental theorem builds toward 01.02.07 polynomial rings + PIDs + UFDs, where the same statement reappears as " is a unique-factorisation domain", and the proof generalises directly to any Euclidean domain. The central insight is that Euclid's Lemma — proved via Bézout in 21.01.01 — is what converts existence-of-factorisation into uniqueness-of-factorisation. This is exactly the bridge from the additive structure of (Bézout's identity, as a PID) to the multiplicative structure (unique factorisation). The infinitude corollary appears again in 21.03.01 the Riemann zeta function, where Euler's product is the analytic encoding of unique factorisation; the divergence of then forces the prime set to be infinite by a route independent of Euclid's combinatorial construction. The pattern generalises through Dirichlet 1837 on primes in arithmetic progressions and the Hadamard-de la Vallée Poussin 1896 prime number theorem to the entire modern apparatus of analytic number theory.
Exercises Intermediate+
Lean formalization Intermediate+
Mathlib already supplies the primary objects of this unit: the predicate Nat.Prime, the multiset of prime factors Nat.factors, the finitely-supported function Nat.factorization, the fundamental-theorem statements Nat.factorization_prod_pow_eq_self and UniqueFactorizationMonoid.factors_prod, and the infinitude of primes via Nat.exists_infinite_primes.
-- Operative imports: Mathlib.Data.Nat.Prime, Mathlib.NumberTheory.Primes,
-- Mathlib.RingTheory.UniqueFactorizationDomain, Mathlib.Data.Nat.Factorization
#check @Nat.Prime
-- Nat.Prime : ℕ → Prop, defined as 2 ≤ p ∧ ∀ m, m ∣ p → m = 1 ∨ m = p
#check @Nat.exists_infinite_primes
-- Nat.exists_infinite_primes : ∀ (N : ℕ), ∃ p, N ≤ p ∧ Nat.Prime p
#check @Nat.factorization
-- Nat.factorization : ℕ →* (ℕ →₀ ℕ); the multiplicative homomorphism
-- assigning to each n its prime-exponent function
#check @Nat.factorization_prod_pow_eq_self
-- Nat.factorization_prod_pow_eq_self : ∀ {n : ℕ}, n ≠ 0 →
-- (n.factorization.prod fun p k => p ^ k) = n
#check @UniqueFactorizationMonoid.factors_prod
-- For any UFM, factoring then multiplying recovers the original (up to units)The Lean module Codex.NumberTheory.Elementary.PrimesFTA will export Euclid's original proof (a finite-cardinality contradiction on ) alongside the Mathlib Nat.exists_infinite_primes, and will record the fundamental theorem in the symmetric form n = p_1^{a_1} \cdots p_k^{a_k} with explicit decidability of the exponent function. The Mathlib gap discussed in the unit metadata (Chebyshev bounds, Mertens theorems, Furstenberg topology, Brun sieve, PNT) is the formalisation roadmap.
Advanced results Master
Theorem 1 (multiple proofs of the infinitude of primes). The proof attributed to Euclid c. 300 BCE Elements IX Prop. 20 [Euclid] is the canonical combinatorial argument: if exhausted the primes, then would have a prime factor outside the list. The first analytic proof is due to Euler 1737 [Euler 1737]: the Euler product $$ \sum_{n=1}^\infty \frac{1}{n^s} ;=; \prod_p \frac{1}{1 - p^{-s}} \quad (s > 1) $$ encodes unique factorisation; letting the left side diverges as the harmonic series, so the right side — a product over primes — must have infinitely many factors. A topological proof was given by Furstenberg 1955 Amer. Math. Monthly 62 [Furstenberg 1955]: equip with the topology generated by the arithmetic progressions for ; each is both open (by definition) and closed (as the complement of the union of the other residue classes); since and a finite union of closed sets is closed but is not open (every basic open set is infinite), the union must be infinite, hence the primes are infinite.
Theorem 2 (Chebyshev 1852 bounds). *There exist explicit positive constants with such that for all sufficiently large ,* $$ c_1 \cdot \frac{x}{\ln x} ;\leq; \pi(x) ;\leq; c_2 \cdot \frac{x}{\ln x}. $$ *Chebyshev's original values are and * [Chebyshev 1852]. The proof uses elementary estimates on the central binomial coefficient : the factorisation $$ \binom{2n}{n} ;=; \prod_{p \leq 2n} p^{v_p(\binom{2n}{n})} $$ combined with Kummer's theorem on of binomial coefficients (the -adic valuation equals the number of carries in adding to itself in base ) gives both upper and lower bounds on that scale as . Bertrand's postulate — for every there is a prime in — is an immediate consequence, since the Chebyshev lower bound forces once is large, and small are verified by hand. Erdős 1932 Acta Litt. Sci. Szeged 5 gave a particularly clean elementary proof using the same Kummer-valuation machinery.
Theorem 3 (prime number theorem; Hadamard 1896, de la Vallée Poussin 1896). As , $$ \pi(x) ;\sim; \frac{x}{\ln x}, \quad \text{equivalently} \quad \pi(x) \sim \mathrm{Li}(x) := \int_2^x \frac{dt}{\ln t}. $$ [Hadamard 1896] [de la Vallée Poussin 1896]. Both proofs proceed by showing that has no zeros on the line and then using a Tauberian argument to extract the asymptotic for . de la Vallée Poussin obtained the quantitative form . The logarithmic-integral version is a closer approximation to than : at the error while . The elementary proof of the PNT (avoiding complex analysis) was achieved by Erdős and Selberg in 1948-49, settling a long-open question.
Theorem 4 (Mertens 1874). As , $$ \sum_{p \leq x} \frac{\ln p}{p} ;=; \ln x + O(1), \qquad \sum_{p \leq x} \frac{1}{p} ;=; \ln \ln x + M + O!\left(\frac{1}{\ln x}\right), \qquad \prod_{p \leq x}\left(1 - \frac{1}{p}\right) \sim \frac{e^{-\gamma}}{\ln x}, $$ where is the Meissel-Mertens constant and is the Euler-Mascheroni constant [Mertens 1874]. The three statements are equivalent given the PNT-grade asymptotics, but Mertens proved them by elementary techniques (Abel summation against Chebyshev-grade estimates) without needing the full PNT. The second statement implies in particular that diverges — a stronger statement than infinitude — at the rate of an iterated logarithm. The third statement is the prime-product form, central to sieve theory.
Theorem 5 (Dirichlet 1837 on primes in arithmetic progressions). For every pair of coprime integers with , the arithmetic progression contains infinitely many primes. Moreover, in a precise asymptotic sense the primes are equidistributed across the admissible residue classes mod : for each such class $$ \pi(x; q, a) := #{ p \leq x : p \equiv a \pmod q } ;\sim; \frac{1}{\varphi(q)} \cdot \pi(x), $$ [Dirichlet 1837]. Dirichlet's proof introduces the Dirichlet characters and the associated -functions ; the central step is to show that for every non-principal character . This was the first proof in number theory to introduce systematic use of complex analytic functions and is the historical seed of modern analytic number theory. The quantitative equidistribution is the prime number theorem for arithmetic progressions, made effective by Siegel-Walfisz and culminating in the Bombieri-Vinogradov theorem of 1965.
Theorem 6 (Riemann hypothesis and its arithmetic content). Riemann's 1859 memoir Über die Anzahl der Primzahlen unter einer gegebenen Größe conjectured that all the non-real zeros of lie on the line (the critical line). The arithmetic equivalent is the error-term estimate $$ \big|\pi(x) - \mathrm{Li}(x)\big| ;=; O!\left(\sqrt{x} \ln x\right) \quad \text{as } x \to \infty. $$ The current state: Hardy 1914 proved infinitely many zeros lie on the critical line; Selberg 1942 showed a positive proportion do; Conrey 1989 raised this to ; Bui-Conrey-Young 2011 to . The hypothesis is one of the seven Clay Millennium Prize Problems and remains open as of writing. Numerical verification has confirmed all zeros below imaginary part lie on the critical line (Gourdon 2004). Conditional on RH, the PNT acquires the strongest possible error term and a wealth of arithmetic statements (e.g., explicit prime-gap bounds, the Lindelöf hypothesis on ) become accessible.
Theorem 7 (twin primes; Polignac 1849, Zhang 2014, Maynard 2015). A twin prime is a prime with also prime: . de Polignac 1849 conjectured infinitely many such pairs, and more generally infinitely many prime pairs for every positive integer . The Brun sieve (Brun 1919 [Brun 1919]) established the convergence , defining Brun's constant — an indirect signature that twin primes thin out faster than primes themselves. The first unconditional bounded-gap result was Zhang 2014 Annals of Math. 179 [Zhang 2014], who proved . Maynard 2015 Annals of Math. 181 [Maynard 2015] reduced the bound to via a modified Selberg sieve, and the subsequent Polymath8b collaboration drove it to . The twin prime conjecture itself () and the de Polignac conjecture remain open.
Theorem 8 (RSA cryptosystem; Rivest-Shamir-Adleman 1978). The RSA public-key cryptosystem [Rivest-Shamir-Adleman 1978] is a direct application of the fundamental theorem of arithmetic combined with the asymptotic asymmetry between multiplication and factorisation. Setup: choose two large primes (typically or bits each); publish and a public exponent coprime to ; keep secret the private exponent , computed via the extended Euclidean algorithm of 21.01.01. Encryption: ; decryption: . Security: recovering from without knowing is conjecturally as hard as factoring . The best classical algorithm (general number field sieve, Pollard et al. 1993) factors -bit integers in time — sub-exponential but super-polynomial. Shor 1994 showed factoring is polynomial-time on a quantum computer, prompting the ongoing transition to post-quantum cryptosystems (NIST PQC standards 2024).
Synthesis. The infinitude of primes is the foundational reason that the multiplicative structure of is genuinely rich, and this is exactly the bridge from elementary arithmetic to analytic number theory. The central insight, present already in Euclid c. 300 BCE and made quantitative by Euler 1737, is that the Euler product encodes the fundamental theorem of arithmetic into a single analytic identity; the divergence of then forces infinitude by a route entirely different from Euclid's combinatorial construction. The Chebyshev bounds 1852 quantify infinitude to , and the Hadamard-de la Vallée Poussin theorem 1896 sharpens to . Putting these together with Mertens 1874 and Dirichlet 1837 identifies the apparent randomness of primes with a precise asymptotic structure controlled by the analytic behaviour of and the Dirichlet -functions.
The pattern generalises in multiple directions. The Chebotarev density theorem extends Dirichlet's equidistribution from to Galois groups of number-field extensions, identifying the primes with the conjugacy classes of . The Sato-Tate conjecture (Clozel-Harris-Shepherd-Barron-Taylor 2008-11) extends Dirichlet-style equidistribution to the eigenvalues of Frobenius on -adic representations of elliptic curves. The Riemann hypothesis stretches across analytic number theory, finite-field zeta functions (Weil 1948-49, Deligne 1974), automorphic -functions (Langlands programme), and conjecturally beyond. The bridge is everywhere the same: the multiplicative structure of , encoded in the fundamental theorem and the Euler product, is the analytic shadow of the additive structure encoded by Bézout's identity in 21.01.01.
The applied consequence is the RSA cryptosystem and its descendants: the same multiplicative structure that organises the analytic theory provides the computational asymmetry — easy multiplication, hard factorisation — that secures most modern digital communication. Modern post-quantum cryptography (lattice-based, code-based, isogeny-based) replaces this hardness with alternatives, but the elementary fact remains that the fundamental theorem of arithmetic underwrote half a century of cryptographic infrastructure.
Full proof set Master
Proposition 1 (Euler 1737 analytic proof of infinitude). The sum over the primes diverges; in particular, the set of primes is infinite.
Proof. Consider the partial Euler product, for a real parameter: $$ \prod_{p \leq x}\left(1 - p^{-s}\right)^{-1} ;=; \prod_{p \leq x} \sum_{k=0}^\infty p^{-ks} ;=; \sum_{n \in N_x} n^{-s}, $$ where is the set of positive integers whose prime factors are all at most (an application of the fundamental theorem: each arises in exactly one way as a product ). As , exhausts , so $$ \prod_{p}\left(1 - p^{-s}\right)^{-1} ;=; \sum_{n=1}^\infty n^{-s} ;=; \zeta(s). $$
Taking logarithms and using : $$ \ln \zeta(s) ;=; \sum_p p^{-s} + R(s), \qquad R(s) = \sum_p \sum_{k \geq 2} p^{-ks}/k. $$ The remainder stays bounded as : comparing with gives . On the other hand, as (the harmonic series diverges), so . Subtracting the bounded leaves as . By Abel's theorem on monotone convergence of Dirichlet series, this forces .
Since a convergent series of positive terms cannot have , the set of primes contributing to the sum must be infinite.
Proposition 2 (Furstenberg 1955 topological proof of infinitude). The primes are infinite.
Proof. Topologise by declaring the basic open sets to be the arithmetic progressions $$ S(a, q) = { a + n q : n \in \mathbb{Z} } = a + q \mathbb{Z}, \qquad a \in \mathbb{Z}, \ q \in \mathbb{Z}_{> 0}. $$ Verify: every nonempty basic open set is infinite (since ); the intersection of two basic open sets is again a union of basic open sets (a Chinese-remainder argument); the topology is Hausdorff (given , choose and observe ).
Each basic open is also closed: its complement is the union of the other residue classes mod , each of which is itself a basic open set: $$ \mathbb{Z} \setminus S(a, q) ;=; \bigcup_{b = 0, \ b \neq a}^{q - 1} S(b, q). $$ A union of open sets is open, so the complement is open, hence is closed.
Now observe $$ \mathbb{Z} \setminus {-1, 1} ;=; \bigcup_p S(0, p), $$ the union taken over all primes : every integer with has at least one prime divisor , and conversely each is contained in provided .
If the primes were finite, the right side would be a finite union of closed sets, hence closed. Equivalently, would be open. But the only open sets containing are unions of basic open sets with , and every such basic open set is infinite. So , being finite and nonempty, cannot be open. Contradiction.
Proposition 3 (Chebyshev 1852 upper bound). There exists a constant such that for all .
Proof. For an integer consider the central binomial coefficient $$ \binom{2n}{n} ;=; \frac{(2n)!}{(n!)^2}. $$ On one hand, (since and is the largest of these terms).
On the other hand, Kummer's theorem (or direct factorial counting via Legendre's formula ) gives $$ v_p\left(\binom{2n}{n}\right) ;=; \sum_{k \geq 1}\left( \left\lfloor \frac{2n}{p^k}\right\rfloor - 2 \left\lfloor \frac{n}{p^k}\right\rfloor \right) ;\leq; \log_p(2n). $$ The terms in the sum are each or , and they vanish for ; so the total contribution is at most .
Combining: every prime with divides exactly once (since in this range from the formula). Hence $$ \prod_{n < p \leq 2n} p ;\leq; \binom{2n}{n} ;<; 4^n. $$ Taking logarithms, . Summing this estimate over the dyadic ranges for : $$ \sum_{p \leq x} \ln p ;\leq; (\ln 4) \cdot x \cdot (1 + 1/2 + 1/4 + \cdots) ;=; 2 (\ln 4) \cdot x. $$ Equivalently .
Now extract via Abel summation: . Rearranging, .
Proposition 4 (Bertrand's postulate, Erdős 1932 elementary form). For every integer , the interval contains a prime.
Proof. Assume for contradiction that contains no prime. We derive an upper bound on that contradicts the lower bound for large.
By Proposition 3's Kummer estimate, for every prime , with equality on at most one factor of . So $$ \binom{2n}{n} ;=; \prod_{p \leq 2n} p^{v_p(\binom{2n}{n})} ;\leq; \prod_{p \leq \sqrt{2n}} (2n) \cdot \prod_{\sqrt{2n} < p \leq 2n/3} p \cdot \prod_{n < p \leq 2n} p. $$ The middle range follows from a refined Kummer estimate showing for when . The third range is assumed empty by the contradiction hypothesis. Using (a separate elementary lemma, also proved by induction on via the central-binomial trick), the middle factor is at most .
Bounding: . Comparing with the lower bound : $$ \frac{4^n}{2n+1} ;\leq; (2n)^{\sqrt{2n}} \cdot 4^{2n/3} \quad \Longleftrightarrow \quad 4^{n/3} \leq (2n+1)(2n)^{\sqrt{2n}}. $$ The left side grows exponentially in ; the right side grows like , which is sub-exponential. The inequality fails for all (a direct numerical check), giving a contradiction.
Bertrand for is verified by exhibiting an explicit prime in each . Erdős noticed that the chain of primes — each less than twice the previous — covers all at once.
Connections Master
Divisibility, GCD, Bézout's identity, and the Euclidean algorithm
21.01.01. Just shipped. Supplies Bézout and Euclid's Lemma — the latter is the load-bearing step in the uniqueness half of the fundamental theorem proved above (Key Theorem). The implication chain " Euclidean PID UFD" of21.01.01Master tier is exactly the additive-to-multiplicative bridge: Bézout gives Euclid's Lemma gives unique factorisation. Without21.01.01the present unit's central theorem cannot be proved.Polynomial rings, PIDs, and UFDs
01.02.07. Pending. The fundamental theorem of arithmetic generalises directly to the statement " is a unique-factorisation domain". The same proof (existence via well-ordered descent, uniqueness via Euclid's Lemma) applies in for a field, in , in , and in every Euclidean domain. The chapter-closing synthesis appears in01.02.07(when shipped), where the abstract UFD framework will replace the ad-hoc -specific arguments with the categorical statement.Riemann zeta function and the Euler product
21.03.01. Shipped. The Euler product is the analytic encoding of the fundamental theorem of arithmetic established in this unit. Euler's 1737 analytic proof of infinitude (Master Proposition 1) is the entry point. The full development of 's functional equation, its non-real zeros, and the Riemann hypothesis lives in21.03.01; the present unit's Master Theorem 6 only states RH and its arithmetic-equivalent error bound, deferring the analytic apparatus.Prime number theorem and Dirichlet density
21.03.04. Shipped (in the staging). The qualitative infinitude of primes proved here, quantified to by Chebyshev (Master Theorem 2), and sharpened to by Hadamard-de la Vallée Poussin (Master Theorem 3). The proof of the PNT — via analytic continuation of and the non-vanishing of — lives in21.03.04. The Dirichlet generalisation to arithmetic progressions (Master Theorem 5) is the bridge to L-functions developed in the same chapter.Mathematical induction
00.12.01. Prerequisite. The existence half of the fundamental theorem is a strong-induction argument on ; the well-ordering principle equivalent to strong induction is also the operational content of "every nonempty subset of has a least element", invoked in Euclid's argument and in the lower-bound proof of the Chebyshev estimates. The foundational reason that elementary number theory works as a self-contained subject is that is well-founded.
Historical & philosophical context Master
The infinitude of primes is the oldest result in number theory still in current use. Euclid c. 300 BCE Elements Book IX, Proposition 20 [Euclid] gives the argument in essentially its modern form: "prime numbers are more than any assigned multitude of prime numbers". The Greek formulation considers three primes and constructs a fourth; Euclid does not state the argument for an arbitrary finite list, but the structure is identical, and later commentators (notably Theon of Alexandria, fourth century CE) generalised it. The fundamental theorem of arithmetic — every positive integer is a unique product of primes — is implicit in Euclid VII.30 (Euclid's Lemma in the prime case) and VII.31 (existence of a prime divisor), but the explicit uniqueness statement seems to first appear in Gauss 1801 Disquisitiones Arithmeticae §16, Theorem 8: Compositum quemvis numerum modis unico in factores primos resolvi posse ("Every composite number can be resolved into prime factors in a unique way").
Euler 1737 Comm. Acad. Sci. Petropolitanae 9 [Euler 1737] initiated the analytic study of primes with the Euler product identity. The proof that diverges is given in the same paper and constitutes the first proof of infinitude not modelled on Euclid's combinatorial construction. Dirichlet 1837 [Dirichlet 1837] extended Euler's analytic technique from the identity character on to the full character group, proving the equidistribution of primes across coprime residue classes. The bridge from Euler-Dirichlet to the modern theory is the systematic identification of with a complex-analytic object satisfying functional equations and admitting an analytic continuation; Riemann 1859 Monatsber. Berlin. Akad. identified the analytic apparatus needed and made the central conjecture that all non-real zeros of have real part .
The prime number theorem was conjectured by Gauss as a teenager (in correspondence with Encke, dated 1849 but referring to observations from 1792-1793) and independently by Legendre 1798 Essai sur la théorie des nombres. Chebyshev 1852 J. Math. Pures Appl. 17 [Chebyshev 1852] proved the first effective bounds via elementary combinatorial-binomial techniques and used them to confirm Bertrand's postulate. Hadamard 1896 Bull. Soc. Math. France 24 [Hadamard 1896] and de la Vallée Poussin 1896 Ann. Soc. Sci. Bruxelles 20 [de la Vallée Poussin 1896] independently proved the asymptotic, using complex-analytic techniques on near the line . Mertens 1874 J. reine angew. Math. 78 [Mertens 1874] had earlier proved the three constants-asymptotic statements that bear his name, via elementary Abel-summation arguments against the Chebyshev bounds.
The elementary proof of the prime number theorem (avoiding complex analysis) was achieved by Erdős and Selberg in 1948-49; both worked from a "fundamental inequality" of Selberg, , and reached the asymptotic via Tauberian-style manipulation in real-variable terms. The priority dispute between Erdős and Selberg was substantial — Selberg was awarded the 1950 Fields Medal in part for this work; Erdős received the 1951 Cole Prize. Furstenberg 1955 Amer. Math. Monthly 62 [Furstenberg 1955] gave the topological proof of infinitude as a one-page note; it has become a standard exhibit in introductory topology courses.
The twin prime conjecture (Polignac 1849) and the de Polignac generalisation remain open. Brun 1919 Bull. Sci. Math. 43 [Brun 1919] introduced the combinatorial sieve named for him and established the convergence of over twin primes, defining Brun's constant. Modern sieve theory descends from this work through Selberg 1950s, Bombieri-Vinogradov 1965, and the recent bounded-gap revolution: Zhang 2014 Annals 179 [Zhang 2014] established ; Maynard 2015 Annals 181 [Maynard 2015] reduced the bound to via a modified Selberg-sieve weight; the Polymath8b collaboration drove the bound to . The Goldbach conjecture (Goldbach 1742, in correspondence with Euler) that every even integer is a sum of two primes remains open in its binary form; Helfgott 2013 proved the weak (ternary) Goldbach conjecture — every odd integer is a sum of three primes — unconditionally.
The applied legacy is dominated by Rivest-Shamir-Adleman 1978 CACM 21 [Rivest-Shamir-Adleman 1978]. RSA encrypted the bulk of internet commerce from the 1990s through the 2010s and remains in widespread use, with the security premise — factoring is computationally hard — resting on the fundamental theorem of arithmetic combined with the absence of efficient classical factoring algorithms. Shor 1994 FOCS 35 showed quantum computers can factor in polynomial time; the NIST post-quantum cryptography standardisation (initiated 2016, finalised 2024) replaces RSA-style cryptosystems with lattice-based and code-based alternatives whose security rests on different hardness assumptions, but the elementary number-theoretic groundwork laid by Euclid, Euler, Gauss, and Dirichlet remains the substrate for the entire field.
Bibliography Master
@book{EuclidElementsIX,
author = {Euclid},
title = {The Thirteen Books of Euclid's Elements},
editor = {Heath, Thomas L.},
publisher = {Cambridge University Press},
year = {1908},
note = {Three volumes; Dover reprint 1956. Book IX Proposition 20 contains the infinitude-of-primes proof, originally c. 300 BCE.}
}
@article{Euler1737,
author = {Euler, Leonhard},
title = {Variae observationes circa series infinitas},
journal = {Commentarii Academiae Scientiarum Imperialis Petropolitanae},
volume = {9},
pages = {160--188},
year = {1737}
}
@article{Bertrand1845,
author = {Bertrand, Joseph},
title = {M\'{e}moire sur le nombre de valeurs que peut prendre une fonction quand on y permute les lettres qu'elle renferme},
journal = {Journal de l'\'{E}cole Royale Polytechnique},
volume = {30},
pages = {123--140},
year = {1845}
}
@article{Chebyshev1852,
author = {Chebyshev, Pafnuty L.},
title = {M\'{e}moire sur les nombres premiers},
journal = {Journal de Math\'{e}matiques Pures et Appliqu\'{e}es},
volume = {17},
pages = {366--390},
year = {1852}
}
@article{Dirichlet1837,
author = {Dirichlet, Peter Gustav Lejeune},
title = {Beweis des Satzes, da{\ss} jede unbegrenzte arithmetische Progression, deren erstes Glied und Differenz ganze Zahlen ohne gemeinschaftlichen Factor sind, unendlich viele Primzahlen enth\"{a}lt},
journal = {Abhandlungen der K\"{o}niglichen Preussischen Akademie der Wissenschaften zu Berlin},
pages = {45--81},
year = {1837}
}
@article{Mertens1874,
author = {Mertens, Franz},
title = {Ein Beitrag zur analytischen Zahlentheorie},
journal = {Journal f\"{u}r die reine und angewandte Mathematik},
volume = {78},
pages = {46--62},
year = {1874}
}
@article{Hadamard1896,
author = {Hadamard, Jacques},
title = {Sur la distribution des z\'{e}ros de la fonction $\zeta(s)$ et ses cons\'{e}quences arithm\'{e}tiques},
journal = {Bulletin de la Soci\'{e}t\'{e} Math\'{e}matique de France},
volume = {24},
pages = {199--220},
year = {1896}
}
@article{ValleePoussin1896,
author = {de la Vall\'{e}e Poussin, Charles},
title = {Recherches analytiques sur la th\'{e}orie des nombres premiers},
journal = {Annales de la Soci\'{e}t\'{e} Scientifique de Bruxelles},
volume = {20},
pages = {183--256},
year = {1896}
}
@article{Brun1919,
author = {Brun, Viggo},
title = {La s\'{e}rie $1/5 + 1/7 + 1/11 + \ldots$ o\`{u} les d\'{e}nominateurs sont nombres premiers jumeaux est convergente ou finie},
journal = {Bulletin des Sciences Math\'{e}matiques},
volume = {43},
pages = {100--128},
year = {1919}
}
@article{Furstenberg1955,
author = {Furstenberg, Hillel},
title = {On the infinitude of primes},
journal = {American Mathematical Monthly},
volume = {62},
pages = {353},
year = {1955}
}
@article{RivestShamirAdleman1978,
author = {Rivest, Ronald L. and Shamir, Adi and Adleman, Leonard},
title = {A method for obtaining digital signatures and public-key cryptosystems},
journal = {Communications of the ACM},
volume = {21},
pages = {120--126},
year = {1978}
}
@article{Zhang2014,
author = {Zhang, Yitang},
title = {Bounded gaps between primes},
journal = {Annals of Mathematics},
volume = {179},
pages = {1121--1174},
year = {2014}
}
@article{Maynard2015,
author = {Maynard, James},
title = {Small gaps between primes},
journal = {Annals of Mathematics},
volume = {181},
pages = {383--413},
year = {2015}
}
@book{HardyWright2008,
author = {Hardy, G. H. and Wright, E. M.},
title = {An Introduction to the Theory of Numbers},
edition = {6},
publisher = {Oxford University Press},
year = {2008},
note = {Revised by D. R. Heath-Brown and J. H. Silverman, foreword by A. Wiles. Chapters I-II and XXII contain the elementary and analytic material respectively.}
}
@book{Apostol1976,
author = {Apostol, Tom M.},
title = {Introduction to Analytic Number Theory},
series = {Undergraduate Texts in Mathematics},
publisher = {Springer},
year = {1976}
}
@book{Tenenbaum2015,
author = {Tenenbaum, G\'{e}rald},
title = {Introduction to Analytic and Probabilistic Number Theory},
edition = {3},
publisher = {Cambridge University Press},
year = {2015}
}