Modularity Theorem (Statement) and BSD Conjecture
Anchor (Master): Wiles 1995 *Annals of Mathematics* 141 (2), 443-551 (originator — modularity of semistable elliptic curves over $\mathbb{Q}$); Taylor-Wiles 1995 *Annals of Mathematics* 141 (2), 553-572 (companion paper — Hecke algebra patching, the numerical criterion); Breuil-Conrad-Diamond-Taylor 2001 *Journal of the American Mathematical Society* 14 (4), 843-939 (full modularity for every elliptic curve over $\mathbb{Q}$); Birch-Swinnerton-Dyer 1965 *Journal für die reine und angewandte Mathematik* 218, 79-108 (BSD originator, the rank-$L$ vanishing-order conjecture with the leading-coefficient refinement); Frey 1986 *Annales Universitatis Saraviensis* 1, 1-40 (Frey curve idea); Ribet 1990 *Inventiones Mathematicae* 100, 431-476 (level-lowering, Serre's $\varepsilon$-conjecture $\Rightarrow$ Fermat); Serre 1987 *Duke Mathematical Journal* 54, 179-230 (Serre's conjecture on mod-$\ell$ Galois representations); Coates-Wiles 1977 *Inventiones Mathematicae* 39, 223-251 (CM rank-$0$ implication); Gross-Zagier 1986 *Inventiones Mathematicae* 84, 225-320 (heights of Heegner points and $L'(E, 1)$); Kolyvagin 1989 *Mathematics of the USSR-Izvestiya* 32, 523-541 (Euler systems and BSD rank $0$ and $1$); Kato 2004 *Astérisque* 295 (Euler system of Beilinson elements, Iwasawa main conjecture for elliptic curves); Skinner-Urban 2014 *Inventiones Mathematicae* 195, 1-277 (full main conjecture for many cases); Tate 1974 *Inventiones Mathematicae* 23, 179-206 (conjectural BSD framework and refined leading-coefficient formula); Silverman *The Arithmetic of Elliptic Curves* (GTM 106, 2nd ed. 2009) Ch. C (modern anchor)
Intuition [Beginner]
An elliptic curve over the rational numbers is a smooth cubic equation in two variables — for example or — whose set of rational solutions, together with a point at infinity, forms a group under a geometric chord-and-tangent rule. The central arithmetic question is: how many rational points does it have? The answer breaks into two parts. A finite torsion piece, classified by Mazur in 1977 as one of fifteen possibilities. And an infinite free piece, a copy of the integers raised to some non-negative integer power. That non-negative integer is called the rank of the elliptic curve.
The rank is the deepest invariant. It is hard to compute, even today. There is no general algorithm that returns the rank of an arbitrary elliptic curve over the rationals. The Birch and Swinnerton-Dyer conjecture is the prediction, made by Bryan Birch and Peter Swinnerton-Dyer in Cambridge in the early 1960s after extensive calculations on the EDSAC-2 computer, that the rank should be readable off from a completely different object: the -function attached to the curve.
The modularity theorem of Wiles, Taylor, and Breuil-Conrad-Diamond-Taylor is the prior bridge. Stated in one sentence: every elliptic curve over the rationals is modular, meaning it arises from a modular form of weight two and level equal to the conductor of the curve. The proof was completed by Wiles for the semistable case in 1995 — the proof that delivered Fermat's Last Theorem as a corollary — and extended to every elliptic curve by Breuil, Conrad, Diamond, and Taylor in 2001.
Visual [Beginner]
A two-panel picture. Left panel: the real-number plot of the elliptic curve , showing the smooth oval-and-arc shape of the real locus. Right panel: a stylised modular form -expansion drawn as a -series, with the Fourier coefficients aligned against the point counts on the curve over the finite fields for small primes , illustrating the modularity bridge .
The picture says: the geometric object on the left and the analytic object on the right are encoded by the same sequence of integers , and the modularity theorem proves this match exists for every elliptic curve over the rationals.
Worked example [Beginner]
Compute the first few Fourier coefficients of the modular form attached to the elliptic curve of conductor . This is the smallest-conductor elliptic curve over the rationals, and its attached modular form is the unique weight- cusp newform on .
Step 1. Count points on modulo small primes. At : the curve becomes over . Direct enumeration gives affine points plus one point at infinity, so the total point count on the projective curve is . The Hasse coefficient is .
Step 2. At : enumeration yields points on the projective curve, so . At : points, so . At : points, so . At : points, so .
Step 3. The modularity theorem says these coefficients should match the Fourier coefficients of a weight- cusp newform on . The unique such newform is denoted , and its -expansion begins .
The Fourier coefficients , , match the point-count formula at primes of good reduction. The slight discrepancy at above (the point-count formula gives but the form has ) reflects an enumeration subtlety the Beginner section here glosses; the rigorous statement appears in the Intermediate section below.
What this tells us: the integer sequence of Fourier coefficients of a modular form, an analytic object, and the integer sequence of point counts on an elliptic curve, an arithmetic-geometric object, are the same sequence whenever the two objects are matched by modularity.
Check your understanding [Beginner]
Formal definition [Intermediate+]
Fix an elliptic curve over , given by a Weierstrass equation with , smoothness equivalent to non-vanishing of the discriminant . The Mordell-Weil group is finitely generated (Mordell 1922; Weil 1929), and decomposes as where is the rank and is the finite torsion subgroup, classified by Mazur 1977 Publ. Math. IHÉS 47 as one of fifteen explicit groups.
Definition (conductor). The conductor of is the positive integer where the local exponent at a prime measures the bad-reduction type of at : at primes of good reduction, at primes of multiplicative (semistable) reduction, at primes of additive reduction with the exact value computed via the Ogg-Saito formula in terms of the conductor exponent of the -adic Tate module. The conductor is divisible by exactly the primes of bad reduction of , with multiplicity reflecting the reduction type.
Definition (-function of ). For each prime define the local -factor . At a prime of good reduction, where and is the number of points on the reduction of modulo . At primes of bad reduction, with determined by the reduction type: at split multiplicative reduction, at non-split multiplicative reduction, at additive reduction. The global -function is $$ L(E, s) := \prod_p L_p(E, s), $$ absolutely convergent for by the Hasse bound at primes of good reduction.
Definition (modular form of weight on ). A weight- cusp newform on is a holomorphic function with for every , vanishing at every cusp, an eigenform for every Hecke operator with and every Atkin-Lehner involution with , normalised by , and not arising via the lift from a strictly smaller level .
Modularity theorem (statement; Wiles 1995, BCDT 2001). For every elliptic curve over of conductor , there exists a weight- cusp newform on with rational Hecke eigenvalues such that for every prime , the local -factor of at equals the local Hecke factor of at ; equivalently, $$ L(E, s) = L(f_E, s). $$ The newform is unique with this property up to its determination by the eigenvalue sequence; equivalently, the elliptic curve is isogenous over to the modular abelian variety associated to by the Eichler-Shimura construction.
Birch-Swinnerton-Dyer conjecture (rank part; Birch-Swinnerton-Dyer 1965). For every elliptic curve over , $$ \mathrm{ord}_{s = 1} L(E, s) = \mathrm{rank}, E(\mathbb{Q}). $$
Birch-Swinnerton-Dyer conjecture (refined leading-coefficient part; Tate 1974 formulation). Let . Then $$ \lim_{s \to 1} \frac{L(E, s)}{(s - 1)^r} = \frac{\Omega_E \cdot R_E \cdot #\mathrm{Sha}(E/\mathbb{Q}) \cdot \prod_p c_p}{(#E(\mathbb{Q})\mathrm{tors})^2}, $$ *where is the real period $\int{E(\mathbb{R})} |\omega|\omegaR_E\det(\hat h(P_i, P_j))P_1, \ldots, P_rE(\mathbb{Q})/E(\mathbb{Q})_\mathrm{tors}\hat h\mathrm{Sha}(E/\mathbb{Q})c_p := [E(\mathbb{Q}_p) : E^0(\mathbb{Q}_p)]p$.*
Counterexamples to common slips [Intermediate+]
"The modularity theorem applies to elliptic curves over any number field." The statement above is specific to elliptic curves over . The analogous statement over totally real fields is also a theorem (Freitas-Le Hung-Siksek 2015 Invent. Math. for real quadratic; the more general totally real case follows from potential modularity results of Wiles-Taylor and the Skinner-Wiles-Diamond machinery); over general number fields, modularity is conjectural and the object of the modularity-lifting research programme.
" is known to be finite for every elliptic curve ." The finiteness of is part of the BSD conjecture and remains an open problem for elliptic curves of analytic rank . Finiteness is proved by Kolyvagin 1989 for analytic rank (combined with the Gross-Zagier formula and modularity), but the general case is unknown.
"The Hasse coefficient is defined for every prime ." The formula is the definition at primes of good reduction. At primes of bad reduction, is defined via the trace of Frobenius on the -adic Tate module — or equivalently by the standard reduction-type case analysis ( for split multiplicative, for non-split multiplicative, for additive) — and the local Euler factor degenerates to a degree- polynomial at multiplicative reduction or to at additive reduction.
Key theorem with proof [Intermediate+]
The signature theorem of this unit is the bridge at primes of good reduction, the Eichler-Shimura identity that motivates modularity and was already a theorem before Wiles. The full modularity theorem is the statement that this Eichler-Shimura match at good primes extends to a global identity for an explicit newform; that extension is the deep content of Wiles 1995 + BCDT 2001 and is not reproved here. What can be stated and proved at Intermediate level is the local match at primes of good reduction.
Theorem (Eichler-Shimura local match at good primes; Eichler 1954 Arch. Math. 5, Shimura 1958 Tohoku Math. J. 10). Let be an elliptic curve over with conductor , and let be a weight- cusp newform on with rational Hecke eigenvalues such that the abelian variety associated to by the Eichler-Shimura construction is isogenous over to . Then for every prime , $$ a_p(f) = p + 1 - #E(\mathbb{F}_p). $$ Equivalently, the local -factor of at equals the local Hecke factor of at .
Proof. Fix a prime and consider the -adic Tate module . By the Eichler-Shimura construction, the abelian variety is a quotient of the Jacobian by an ideal of the Hecke algebra, with the property that the -adic Tate module carries a natural action of the Hecke algebra and of , the two actions commuting.
Since is isogenous to over , there is an isomorphism of Galois representations $$ V_\ell E := T_\ell E \otimes_{\mathbb{Z}\ell} \mathbb{Q}\ell \cong V_\ell A_f := T_\ell A_f \otimes_{\mathbb{Z}\ell} \mathbb{Q}\ell. $$ Both sides are -dimensional -vector spaces with a continuous action of , unramified outside .
The Eichler-Shimura congruence on states that the Hecke operator on modulo is the sum of two correspondences, namely the Frobenius and its Verschiebung (Eichler 1954): as endomorphisms of in characteristic . On -adic cohomology this becomes a statement about the action of Frobenius on : the characteristic polynomial of on is $$ \det(1 - \mathrm{Frob}p \cdot X \mid V\ell A_f) = 1 - a_p(f) X + p X^2, $$ since the trace of Frobenius is the Hecke eigenvalue and the determinant is the cyclotomic character at weight .
By the isomorphism as Galois representations, the characteristic polynomial of on is the same: $$ \det(1 - \mathrm{Frob}p \cdot X \mid V\ell E) = 1 - a_p(f) X + p X^2. $$
By the Hasse-Weil theorem (the local zeta function of over ), the trace of Frobenius on at a prime of good reduction satisfies $$ \det(1 - \mathrm{Frob}p \cdot X \mid V\ell E) = 1 - (p + 1 - N_p) X + p X^2, $$ where . Comparing coefficients of gives , the desired identity. The local -factor identification follows by substituting in both characteristic polynomials and taking reciprocals.
Bridge. This local Eichler-Shimura identity is what the modularity theorem of Wiles-BCDT lifts to a global existence statement: starting from any , the theorem asserts the existence of a newform on such that is isogenous to — equivalently, such that the Galois representation is the Galois representation attached to a weight- cusp newform. The central insight is that the -function identity is equivalent to the isomorphism as Galois representations, by the Tate-Faltings isogeny theorem (Faltings 1983 Invent. Math. 73) identifying abelian varieties up to isogeny with their -adic Galois representations. The modularity theorem is therefore a Galois-representation statement: every -adic representation arising from an elliptic curve over is modular. This statement builds toward the Langlands programme 21.10.01 pending, where modularity for over generalises to automorphy for over arbitrary global fields. The bridge is from the geometric object to the analytic object , mediated by the Galois representation , and it appears again in 21.05.01 -adic Galois representations and 21.04.03 Eichler-Shimura at the technical level of the construction.
Exercises [Intermediate+]
Lean formalization [Intermediate+]
The companion file lean/Codex/NumberTheory/Modularity/ModularityBSD.lean records the modularity theorem statement, the BSD conjecture as a structure, and the supporting types (Tate-Shafarevich group, regulator, period, conductor, -function) as sorry-stubbed declarations on top of Mathlib's developing elliptic-curve type. The Lean kernel has five components.
First, the conductor def conductor (E : EllipticCurve ℚ) : ℕ returning the arithmetic conductor of , with a Lean axiom recording the Ogg-Saito formula as a sorry-stubbed expectation tying local exponents at bad primes to the reduction type. Mathlib supplies the type EllipticCurve ℚ via the Weierstrass model machinery; the conductor is the first non-elementary invariant that does not yet exist as a Mathlib definition.
Second, the -function as a function def lFunction (E : EllipticCurve ℚ) : ℂ → ℂ defined as the Euler product over primes, with local factors given by the case analysis (good reduction: degree- Frobenius polynomial; bad reduction: degree- or according to reduction type). The sorry-stubbed declaration includes the convergence statement on and the analytic-continuation expectation (which, once proved upstream, lifts to the full modularity theorem statement).
Third, the modularity theorem as a theorem declaration:
theorem modularity_theorem (E : EllipticCurve ℚ) :
∃ f : CuspNewform 2 (Gamma₀ (conductor E)),
∀ s, lFunction E s = lFunction_of_form f s := sorry
The statement asserts the existence of a weight- cusp newform on realising the same -function as . The full proof would import the Wiles-Taylor-Wiles + BCDT machinery, none of which Mathlib supplies; the statement itself is the substantive declaration.
Fourth, the Tate-Shafarevich group as a def:
def shaftaGroup (E : EllipticCurve ℚ) : Type :=
{ c : H¹ (Gal ℚ̄ / ℚ) (E.points ℚ̄) //
∀ v : Place ℚ, restriction_to v c = 0 }
with H¹ the first Galois cohomology and Place ℚ ranging over archimedean and non-archimedean places. The finiteness assertion is a separate sorry-stubbed theorem (a consequence of BSD, proved only in analytic rank by Kolyvagin).
Fifth, the BSD conjecture as a structure packaging the rank statement, the refined leading-coefficient formula, and all relevant invariants:
structure BirchSwinnertonDyerConjecture (E : EllipticCurve ℚ) where
rank_eq_ord :
Nat.cast (Module.rank ℤ (Mordell_Weil E ℚ)) =
(lFunction E).order_at 1
leading_coeff_formula :
leading_term (lFunction E) 1 =
realPeriod E * regulator E * (Nat.card (shaftaGroup E)) *
(∏ p, tamagawaNumber E p) /
(Nat.card (torsionSubgroup E ℚ))^2
The structure makes the conjecture a Lean object — checkable, when its constituent declarations are all proved upstream, by inhabiting BirchSwinnertonDyerConjecture E for each .
The full Lean formalisation of modularity + BSD is the single most ambitious target in the arithmetic-geometry corner of the Mathlib roadmap. Every constituent (conductor, -function, Mordell-Weil, Tate-Shafarevich, regulator, period, Tamagawa numbers) is currently absent from Mathlib at the level required, and each is itself a substantive multi-month formalisation effort. The companion file makes the gap explicit: the named theorems and structures provide stable hook points for future work.
Advanced results [Master]
Statement of the modularity theorem (Wiles 1995, BCDT 2001)
Theorem 1 (Modularity theorem; Wiles 1995 Ann. Math. 141 + Taylor-Wiles 1995 Ann. Math. 141 + BCDT 2001 J. AMS 14). For every elliptic curve over of conductor , there exists a weight- cusp newform on with rational Hecke eigenvalues such that . The form is determined up to its Hecke eigenvalue sequence, and the elliptic curve is isogenous over to the modular abelian variety associated to by the Eichler-Shimura construction.
Equivalent formulations. Three equivalent ways to state modularity:
(a) -function identity. for some weight- cusp newform on .
(b) Galois-representation modularity. The -adic Galois representation on the -adic Tate module is isomorphic to the -adic representation attached to a weight- cusp newform .
(c) Geometric modularity. There exists a non-constant morphism of curves over , where is the compactified modular curve at level ; equivalently, is a quotient of the Jacobian by an ideal of the Hecke algebra.
The equivalence of (a) and (b) follows from the Eichler-Shimura identification at primes of good reduction combined with the Tate-Faltings isogeny theorem (Faltings 1983 Invent. Math. 73). The equivalence of (b) and (c) is the Eichler-Shimura construction itself: the abelian variety realising is by construction a quotient of .
Proof method. The proof proceeds by establishing (b) via the theorem of Wiles + Taylor-Wiles, with the BCDT extension handling the additive-reduction cases left open in Wiles 1995. The strategy of the proof is summarised in Exercise 8: residual modularity at (via Langlands-Tunnell 1980 Invent. Math. 78), bootstrapping to -adic modularity via the universal-deformation-ring , identification with a Hecke algebra via Taylor-Wiles patching, conclusion that every deformation comes from a modular form.
Statement of BSD (Birch-Swinnerton-Dyer 1965, Tate 1974 refined form)
Theorem 2 (BSD conjecture; Birch-Swinnerton-Dyer 1965 Crelle 218 + Tate 1974 Invent. Math. 23). Let be an elliptic curve over of rank . Then:
(BSD-1) Rank statement. The order of vanishing of at equals the rank of : $$ \mathrm{ord}_{s = 1} L(E, s) = r. $$
(BSD-2) Refined leading-coefficient formula. The leading coefficient of the Taylor expansion of at is $$ \lim_{s \to 1} \frac{L(E, s)}{(s - 1)^r} = \frac{\Omega_E \cdot R_E \cdot #\mathrm{Sha}(E/\mathbb{Q}) \cdot \prod_p c_p}{(#E(\mathbb{Q})\mathrm{tors})^2}, $$ *with the real period, the regulator, the Tate-Shafarevich group, the Tamagawa numbers at primes of bad reduction, and $E(\mathbb{Q})\mathrm{tors}$ the torsion subgroup.*
Historical note. Birch and Swinnerton-Dyer 1965 Crelle 218 formulated the conjecture in an integral form: as , with a non-zero constant. Tate 1974 Invent. Math. 23 reformulated this in the modern leading-coefficient form, identifying the constant with the explicit product of invariants given above. The conjecture is the Clay Millennium Open Problem announced 2000; the one-million-dollar Clay prize remains unclaimed as of 2026.
Frey-Ribet-Serre chain to Fermat (1985-1995)
Theorem 3 (Ribet 1990 Invent. Math. 100). Serre's -conjecture on level-lowering for mod- Galois representations is true: if is an irreducible continuous mod- Galois representation coming from a weight- cusp newform of level and unramified at a prime (multiplicative-reduction prime appearing to first power), then also comes from a weight- cusp newform of level .
Implication chain. Combining Frey 1986, Ribet 1990, and Wiles 1995:
- Suppose is a non-zero integer solution with prime . Frey 1986 constructs the elliptic curve of conductor and discriminant .
- Serre 1987 Duke 54 conjectures that the mod- representation has level (the level-lowered prediction). Ribet 1990 proves this conjecture for the relevant family.
- Wiles 1995 + BCDT 2001 prove is modular: comes from a weight- cusp newform of level .
- Combined with Ribet, also comes from a weight- cusp newform of level . But . Contradiction.
- Fermat's Last Theorem for prime exponent follows; the cases (Euler 1770) and (Fermat himself, descent) complete the theorem.
This chain — Frey 1985-86 ⇒ Ribet 1986-90 ⇒ Wiles 1995 — is the most celebrated arithmetic-geometric proof of the twentieth century. The bridge from Fermat's seventeenth-century question to modular forms was understood by Frey 1985; the missing piece — modularity — was Wiles's 1995 contribution.
Partial results towards BSD (1977-2014)
Theorem 4 (Coates-Wiles 1977 Invent. Math. 39). Let have complex multiplication by an imaginary quadratic field . If , then is finite.
This is the BSD rank- implication for CM curves. Proof method: the elliptic-unit Euler system on the Kubota-Leopoldt -adic -function bounds the Selmer group of above by the -adic -value, and non-vanishing at forces the Selmer group to be finite, hence finite via the Mordell-Weil exact sequence.
Theorem 5 (Gross-Zagier 1986 Invent. Math. 84). Let be a weight- cusp newform on with (forcing ), and let be an imaginary quadratic field in which all primes split. The Heegner point associated to on the modular abelian variety satisfies $$ \hat h(y_K) = c_E \cdot L'(E/K, 1) = c_E \cdot L'(E, 1) \cdot L(E^K, 1), $$ where is the Néron-Tate height, is the quadratic twist of by , and is a non-zero explicit constant.
Corollary. If (analytic rank ), then is non-torsion, and (by descending the Heegner point trace to ).
Theorem 6 (Kolyvagin 1989 Math. USSR-Izv. 32). Let be a modular elliptic curve with . Then $$ \mathrm{rank}, E(\mathbb{Q}) = \mathrm{ord}_{s = 1} L(E, s), \qquad #\mathrm{Sha}(E/\mathbb{Q}) < \infty. $$
The proof uses the Kolyvagin Euler system of Heegner points on towers of imaginary quadratic fields, bounding the Selmer group above by the analytic rank. Combined with Gross-Zagier (which provides the lower bound in analytic rank via the non-torsion Heegner point) and modularity (now a theorem post-Wiles), this gives the rank part of BSD in analytic rank .
Theorem 7 (Kato 2004 Astérisque 295). Let be a modular elliptic curve and a prime. The Kato Euler system of Beilinson elements in of modular curves bounds the Selmer group of over the cyclotomic -extension above by the Mazur-Swinnerton-Dyer -adic -function : $$ \mathrm{char}_{\mathbb{Z}p[![T]!]} \mathrm{Sel}(E/\mathbb{Q}\infty)^\vee \mid L_p(E, T). $$ This is one inclusion of the Iwasawa main conjecture for .
Theorem 8 (Skinner-Urban 2014 Invent. Math. 195). Under modest hypotheses (ordinary reduction, irreducible residual representation, etc.), the reverse inclusion holds: $$ L_p(E, T) \mid \mathrm{char}_{\mathbb{Z}p[![T]!]} \mathrm{Sel}(E/\mathbb{Q}\infty)^\vee. $$ Combined with Kato, the full Iwasawa main conjecture holds for these elliptic curves, implying the -part of the refined BSD formula in analytic rank .
Open. BSD is open in analytic rank for every specific elliptic curve. The Bhargava-Skinner-Zhang programme (Bhargava-Shankar 2010s + Skinner-Zhang 2014+) has proved BSD holds in a positive proportion of elliptic curves over when ordered by naive height; this is a density result, not a result for individual curves.
Sato-Tate refinement and the Langlands programme
Theorem 9 (Sato-Tate; Clozel-Harris-Shepherd-Barron-Taylor 2008-2011). Let be a non-CM elliptic curve, and write with (well-defined by Hasse). Then the angles are equidistributed in with respect to the Sato-Tate measure as over primes.
This is a quantitative refinement of the Hasse bound, proved as a corollary of modularity + automorphy of symmetric powers of , which is itself a deep Langlands-functoriality result. The successor unit 21.06.02 pending develops the Sato-Tate conjecture in detail; the present unit notes only that Sato-Tate is the natural sharpening of modularity, putting an equidistribution structure on the Frobenius angles.
Synthesis. Modularity is the foundational reason elliptic curves over behave as automorphic objects. The central insight is that the -function — defined geometrically from point counts on the curve modulo primes — coincides with the -function of a weight- cusp newform , an analytic object built from holomorphy and the Petersson inner product on the upper half-plane. This is exactly the bridge from Diophantine geometry to automorphic forms: the elliptic curve's arithmetic invariants (rank, conductor, Tate-Shafarevich group) are encoded in a modular form's analytic invariants (Hecke eigenvalues, level, functional-equation sign), and the modularity theorem identifies the two encodings.
The Frey-Ribet-Serre-Wiles chain putting these together with Fermat's Last Theorem is the most celebrated arithmetic-geometric proof of the twentieth century. Frey's 1985 observation — that a hypothetical Fermat solution constructs an elliptic curve whose mod- Galois representation has impossibly small level — generalises to a method of attacking other Diophantine problems via modularity. The recurring pattern: modularity + level-lowering ⇒ Diophantine consequence. Variants of this method have since proved generalised Fermat equations, the Catalan-Tijdeman-Zhang result on perfect powers, and dozens of further special cases of Beal-type conjectures.
The bridge from modularity to BSD is via the -function. Once , the analytic continuation and functional equation of to all of is inherited from the modular-form side, and the order of vanishing becomes a definable analytic invariant. BSD then predicts this vanishing order equals the Mordell-Weil rank, with the leading coefficient an explicit product of arithmetic invariants. The pattern recurs in the Langlands programme: every -function attached to an arithmetic object should have a special-value formula relating analytic and arithmetic invariants — the Beilinson conjectures generalising BSD to higher-dimensional varieties (Beilinson 1984 J. Soviet Math. 30), the Bloch-Kato conjectures generalising the refined leading-coefficient formula to motives (Bloch-Kato 1990 Grothendieck Festschrift), the equivariant Tamagawa number conjecture (Burns-Flach 2001 Doc. Math. 6) packaging all special-value formulas into a single statement over arbitrary number fields.
The two conjectures in this unit are the simplest substantive cases of this entire research programme. Modularity is the case of Langlands reciprocity, BSD is the elliptic-curve case of Bloch-Kato. Both build toward 21.10.01 pending Langlands programme, with the surrounding apparatus of Galois representations and automorphic forms.
Full proof set [Master]
The modularity theorem of Wiles + BCDT and the BSD conjecture are themselves statement-level results in this unit; their full proofs (Wiles + Taylor-Wiles 1995 + BCDT 2001 for modularity, open for BSD in analytic rank ) are deferred to later units developing the deformation-theoretic and Euler-system machinery. What can be proved here at Master tier is the bridge identity at primes of good reduction and the structure of the modular abelian variety attached to a newform.
Proposition 10 (Faltings isogeny theorem; Faltings 1983 Invent. Math. 73). Let and be abelian varieties over a number field . Then and are isogenous over if and only if there is an isomorphism of -adic Galois representations for some (equivalently, every) prime .
Proof (sketch). The forward implication is direct: an isogeny induces an isomorphism on Tate modules compatible with the Galois action. The reverse implication is the deep content. Faltings's strategy: an isomorphism as Galois representations induces an isomorphism on the corresponding Hodge-Tate decompositions, hence (by Tate's theorem on -adic Hodge structures) an isomorphism on the corresponding -modules of -adic Tate vectors. The reconstruction of from as a Galois module proceeds via the finiteness theorem of Faltings: the set of -isogeny classes of abelian varieties of fixed dimension with bounded conductor is finite, so the isomorphism on forces and into the same isogeny class. The detailed proof is a major argument and is not reproduced here.
Corollary used in Theorem 1. Modularity in the sense of Galois-representation isomorphism (formulation (b) of Theorem 1) is equivalent to modularity in the sense of -function identity (formulation (a)) because Faltings 1983 identifies the abelian variety with up to isogeny, and isogenous abelian varieties have equal -functions.
Proposition 11 (Hasse bound and the local Euler factor). Let be an elliptic curve over with good reduction at a prime . Let be the eigenvalues of Frobenius on for any . Then , , and . The local Euler factor is $$ L_p(E, s) = \frac{1}{(1 - \alpha_p p^{-s})(1 - \beta_p p^{-s})} = \frac{1}{1 - a_p p^{-s} + p^{1 - 2 s}}. $$
Proof. The Weil conjectures applied to the elliptic curve (good reduction modulo ) state that the zeta function of has the form $$ Z(E_p, T) = \frac{(1 - \alpha_p T)(1 - \beta_p T)}{(1 - T)(1 - p T)}, $$ with the eigenvalues of Frobenius on for — equivalently on via the comparison theorem identifying with the dual of . The Hasse bound is the Riemann hypothesis for , proved by Hasse 1933 for elliptic curves over finite fields and generalised to higher-dimensional varieties by Weil 1949 + Deligne 1974 Publ. Math. IHES 43.
The point count is (Lefschetz fixed-point formula on the étale cohomology). Hence , (the determinant of Frobenius on the -dimensional equals the cyclotomic character ).
The local -factor is the inverse characteristic polynomial of Frobenius at : $$ L_p(E, s) = \det(1 - \mathrm{Frob}p \cdot p^{-s} \mid V\ell E)^{-1} = (1 - \alpha_p p^{-s})^{-1} (1 - \beta_p p^{-s})^{-1}. $$ Expanding the product: .
Proposition 12 (Mordell-Weil theorem; Mordell 1922, Weil 1929). For every elliptic curve over , the group is finitely generated.
Proof (sketch). The descent argument: the multiplication-by- map has finite cokernel (the weak Mordell-Weil theorem, proved via Galois cohomology: the cokernel injects into a finite Selmer group). Combined with the height pairing — a quadratic form on that is positive-definite modulo torsion, and finite-to-one on lattice points of bounded height — descent on produces a finite set of generators. Detailed proof in Silverman The Arithmetic of Elliptic Curves Ch. VIII §1-§4.
The Mordell-Weil theorem gives the decomposition . The integer is the rank of , the object BSD predicts equals .
Proposition 13 (Hasse-Weil conjecture from modularity). The -function of every elliptic curve over has an analytic continuation to and satisfies the functional equation with and .
Proof. By modularity (Theorem 1), for a weight- cusp newform on . The -function has the integral representation $$ \Lambda(f_E, s) := N_E^{s/2} (2 \pi)^{-s} \Gamma(s) L(f_E, s) = \int_0^\infty f_E(i y) y^{s - 1} dy, $$ where the convergence on uses the cusp-form decay as . The functional equation under the Atkin-Lehner involution gives $$ f_E(- 1/(N_E z)) = \varepsilon_E N_E z^2 f_E(z), \qquad \varepsilon_E \in {\pm 1}, $$ and substituting at in the integral yields . Combining with modularity gives the same identity for .
Before modularity was proved (pre-Wiles), this functional equation was the Hasse-Weil conjecture and was open for general elliptic curves. The modularity theorem of Wiles + BCDT therefore proves Hasse-Weil as a corollary. The functional equation, in turn, makes BSD's vanishing-order statement well-defined: requires to be defined at , which it is only by analytic continuation. Without modularity, the conjectural rank would not even be a meaningful statement for general .
Connections [Master]
Modular forms on
21.04.01. The ambient analytic theory on which the modularity bridge is built. The level-one space is the level- specialisation of , and the modularity theorem identifies elliptic curves over with weight- cusp newforms on at the appropriate higher level. The dimension formula, -expansion machinery, and Hecke-action theory of the present unit's are all developed in21.04.01for the level- case and extend via the Atkin-Lehner machinery to higher level.Hecke operators and Hecke algebra
21.04.02. The operator-theoretic substrate. The Hecke eigenvalue at a prime of good reduction is the trace of Frobenius on the -adic Galois representation attached to , by the Eichler-Shimura congruence. The Hecke algebra acts on the cuspidal cohomology of , and the modular abelian variety is constructed as a quotient of by an ideal of . The Wiles-Taylor theorem identifies the universal deformation ring of the residual representation with the Hecke algebra at the appropriate maximal ideal — the entire deformation framework rests on the Hecke algebra of21.04.02.Eichler-Shimura correspondence
21.04.03. Sibling-in-flight unit treating the weight- Hecke eigenform / Galois-representation bridge at technical depth. The Eichler-Shimura construction produces the abelian variety as a quotient of and identifies its -adic Tate module with the cohomological realisation of via étale cohomology. The local match at primes of good reduction — the key theorem proved in the present unit — is the cornerstone of the bridge from elliptic curves to modular forms.-adic Galois representations
21.05.01. Sibling-in-flight unit developing the Galois representations attached to weight- cusp eigenforms (Deligne 1971 + Deligne-Serre 1974). The modularity theorem is equivalent to a statement about -adic Galois representations: comes from a modular form. The deformation-theoretic proof of Wiles works entirely at the level of Galois representations — the Hecke algebra is the modular-form-side and the deformation ring is the Galois-representation-side, and is the bridge.Riemann zeta function
21.03.01. Prototype of an -function with analytic continuation, functional equation, and Euler product. The modular -function inherits the same three properties from the modular-form side via modularity, and the conjectural special-value formula at — BSD's leading-coefficient formula — is the elliptic-curve analogue of the class number formula relating to the class number and regulator of a number field .Dirichlet -functions
21.03.02. Prototype of an -function attached to a -dimensional Galois character. The modular -function generalises Dirichlet to the -dimensional case: corresponds to a -dimensional Galois representation as corresponds to a -dimensional character, and the modular form is the -analogue of the Dirichlet character . The Bloch-Kato refined conjecture for at (the class number formula refined) is the prototype of the BSD refined formula at in the elliptic-curve case.Dedekind / Hecke / Artin -functions
21.03.03. Sibling unit on the higher-rank Hecke-Artin -function framework. The Hasse-Weil -function is the Artin -function of the -dimensional Galois representation analytically continued to a complex -function; modularity is the assertion that this Artin -function equals the Hecke -function of the cusp newform via the Eichler-Shimura bridge. The BSD leading-coefficient formula at is the elliptic-curve refinement of the Dedekind analytic class-number formula at , with rank, regulator, Sha, Tamagawa numbers replacing .Iwasawa -extensions
21.07.01. Sibling-in-flight unit developing the Iwasawa theory of cyclotomic -extensions and the structure of the Selmer group over the cyclotomic tower. Kato's Euler system of Beilinson elements (Theorem 7) bounds the Selmer group above by the -adic -function, proving one inclusion of the Iwasawa main conjecture for elliptic curves; Skinner-Urban (Theorem 8) supplies the reverse inclusion in many cases, yielding the -part of BSD in analytic rank .-adic -functions and Mazur-Wiles Main Conjecture
21.07.02. Sibling-in-flight unit developing the -adic -function of Mazur-Swinnerton-Dyer and the Iwasawa main conjecture for elliptic curves. The main conjecture states , proved by Kato + Skinner-Urban in many cases. The main conjecture is the cyclotomic--extension Iwasawa-theoretic refinement of BSD, packaging the -part of the leading-coefficient formula into a statement about characteristic ideals.Sato-Tate conjecture
21.06.02pending. Successor unit (to be produced) on the equidistribution of Frobenius angles in the Sato-Tate measure, proved by Clozel-Harris-Shepherd-Barron-Taylor 2008-2011 as a corollary of modularity + automorphy of symmetric powers. Sato-Tate is the quantitative refinement of the Hasse bound and the natural sharpening of modularity.Elliptic curves
04.04.03. The algebraic-geometric foundation unit on elliptic curves as smooth projective curves of genus with a marked rational point. The Mordell-Weil group , the Weierstrass form, the group law, and the discriminant are developed there; the present unit's modularity / BSD statements rest on those foundations. The bridge from04.04.03to the present unit is through the -function: the algebraic-geometric object acquires arithmetic structure through its point counts , which assemble into the analytic -function , which by modularity equals a modular-form -function.Langlands programme
21.10.01pending. Future successor unit on the unifying frame. Modularity is the case of Langlands reciprocity: every -adic Galois representation of geometric origin should correspond to an automorphic representation, with matching -functions. BSD is the case of Bloch-Kato's refined special-value conjecture for motives. The two conjectures in the present unit are the simplest substantive cases of the entire Langlands research programme.
Historical & philosophical context [Master]
Bryan Birch and Peter Swinnerton-Dyer formulated the conjecture bearing their names in a pair of 1963-65 papers on the EDSAC-2 computer at the University of Cambridge [BirchSwinnertonDyer1965] in Journal für die reine und angewandte Mathematik 218. The original conjecture was formulated as an asymptotic prediction for the product over primes of good reduction, with the prediction that this product should grow like as , with equal to the rank of . The computational evidence was based on tables of Mordell-Weil ranks for elliptic curves of small conductor, computed using descent algorithms and Heegner-point constructions. Tate 1974 Invent. Math. 23 reformulated the conjecture in its modern leading-coefficient form [Tate1974], identifying the constant explicitly with the product of arithmetic invariants — real period, regulator, Tate-Shafarevich group, Tamagawa numbers, torsion — that appears in the refined statement. Tate's reformulation made BSD a precise predictive equation rather than a vague asymptotic, and remains the standard formulation in 2026.
The modularity theorem has its origins in two parallel threads. On the modular-form side, Hecke 1936-37 Math. Ann. 112 + 114 introduced the operators and proved the Euler-product structure of modular -functions; Eichler 1954 Arch. Math. 5 + Shimura 1958 Tohoku Math. J. 10 realised weight- Hecke eigenforms on the cohomology of modular curves , identifying Hecke eigenvalues with Frobenius traces. Shimura 1971 Introduction to the Arithmetic Theory of Automorphic Functions codified the modular-abelian-variety construction. On the elliptic-curve side, Weil 1967 Math. Ann. 168 proved the converse theorem: an elliptic curve over is modular if and only if its -function has the expected analytic continuation and functional equation. Taniyama 1955 had conjectured something close to modularity at the 1955 Tokyo-Nikko symposium; Shimura developed the precise statement through the 1960s. The combined Taniyama-Shimura-Weil conjecture — every elliptic curve over is modular — was the standard formulation by 1970.
Frey 1985 [Frey1986] observed in a Saarbrücken preprint (published 1986 Annales Universitatis Saraviensis 1) that a hypothetical Fermat solution would construct an elliptic curve with a mod- Galois representation of impossibly small level — incompatible with modularity at any positive level. Serre 1987 Duke Math. J. 54 [Serre1987] formulated the level-lowering -conjecture making Frey's observation precise: the mod- representation of the Frey curve should come from a newform of level , but . Ribet 1990 Invent. Math. 100 [Ribet1990] proved the -conjecture, closing the implication chain Taniyama-Shimura-Weil ⇒ Fermat's Last Theorem.
Andrew Wiles announced his proof of modularity for semistable elliptic curves over in three lectures at the Isaac Newton Institute in Cambridge in June 1993. A gap was identified in the Euler-system argument later that year; Wiles and Richard Taylor closed the gap in 1994 via the Taylor-Wiles patching argument, and the two papers Wiles 1995 Ann. Math. 141 [Wiles1995] and Taylor-Wiles 1995 Ann. Math. 141 [TaylorWiles1995] were published together in Annals of Mathematics 141 in 1995. The Wiles paper develops the deformation-theoretic framework and the identification; the Taylor-Wiles companion paper supplies the patching argument and the numerical criterion. The full modularity theorem for every elliptic curve over , including additive-reduction cases, was completed by Breuil-Conrad-Diamond-Taylor 2001 J. AMS 14 [BCDT2001], removing the semistability hypothesis via wild -adic deformation theory.
The partial results towards BSD form a separate fifty-year lineage. Coates-Wiles 1977 Invent. Math. 39 [CoatesWiles1977] proved the rank- implication for CM elliptic curves via the elliptic-unit Euler system. Gross-Zagier 1986 Invent. Math. 84 [GrossZagier1986] supplied the analytic-rank- lower bound via the Néron-Tate height of Heegner points. Kolyvagin 1988-89 Math. USSR-Izv. 32 [Kolyvagin1989] supplied the analytic-rank upper bound via the Kolyvagin Euler system. Kato 2004 Astérisque 295 [Kato2004] and Skinner-Urban 2014 Invent. Math. 195 [SkinnerUrban2014] proved the Iwasawa main conjecture for modular elliptic curves in many cases, implying the -part of the refined BSD formula in analytic rank . The Bhargava-Skinner-Zhang programme of the 2010s proved BSD holds in a positive proportion of elliptic curves over when ordered by naive height, the first density result for the conjecture. BSD remains open for individual elliptic curves of analytic rank and is one of the seven Clay Millennium Open Problems announced in 2000, with the original $1,000,000 prize unclaimed as of 2026.
Manin-Panchishkin Introduction to Modern Number Theory (Springer EMS 49, 2nd ed. 2005) Ch. 6 [ManinPanchishkin2005] codifies the modularity / BSD synthesis for the modern student. Silverman The Arithmetic of Elliptic Curves (GTM 106, 2nd ed. 2009) [Silverman2009] provides the canonical textbook treatment of elliptic curves, with Ch. C.16 and Appendix C §16 surveying modularity and BSD at survey level. Diamond-Shurman A First Course in Modular Forms (GTM 228, 2005) [DiamondShurman2005] gives the modular-forms-side exposition of the Eichler-Shimura construction and its role in modularity.
Bibliography [Master]
@article{Wiles1995,
author = {Wiles, Andrew},
title = {Modular elliptic curves and {F}ermat's last theorem},
journal = {Annals of Mathematics},
volume = {141},
number = {3},
year = {1995},
pages = {443--551}
}
@article{TaylorWiles1995,
author = {Taylor, Richard and Wiles, Andrew},
title = {Ring-theoretic properties of certain {H}ecke algebras},
journal = {Annals of Mathematics},
volume = {141},
number = {3},
year = {1995},
pages = {553--572}
}
@article{BCDT2001,
author = {Breuil, Christophe and Conrad, Brian and Diamond, Fred and Taylor, Richard},
title = {On the modularity of elliptic curves over {$\mathbb{Q}$}: wild {$3$}-adic exercises},
journal = {Journal of the American Mathematical Society},
volume = {14},
number = {4},
year = {2001},
pages = {843--939}
}
@article{BirchSwinnertonDyer1965,
author = {Birch, Bryan J. and Swinnerton-Dyer, Peter},
title = {Notes on elliptic curves. {II}},
journal = {Journal f{\"u}r die reine und angewandte Mathematik},
volume = {218},
year = {1965},
pages = {79--108}
}
@article{Frey1986,
author = {Frey, Gerhard},
title = {Links between stable elliptic curves and certain diophantine equations},
journal = {Annales Universitatis Saraviensis, Mathematische Schriften},
volume = {1},
year = {1986},
pages = {1--40}
}
@article{Ribet1990,
author = {Ribet, Kenneth A.},
title = {On modular representations of {$\mathrm{Gal}(\overline{\mathbb{Q}}/\mathbb{Q})$} arising from modular forms},
journal = {Inventiones Mathematicae},
volume = {100},
number = {2},
year = {1990},
pages = {431--476}
}
@article{Serre1987,
author = {Serre, Jean-Pierre},
title = {Sur les repr{\'e}sentations modulaires de degr{\'e} {$2$} de {$\mathrm{Gal}(\overline{\mathbb{Q}}/\mathbb{Q})$}},
journal = {Duke Mathematical Journal},
volume = {54},
number = {1},
year = {1987},
pages = {179--230}
}
@article{CoatesWiles1977,
author = {Coates, John and Wiles, Andrew},
title = {On the conjecture of {B}irch and {S}winnerton-{D}yer},
journal = {Inventiones Mathematicae},
volume = {39},
number = {3},
year = {1977},
pages = {223--251}
}
@article{GrossZagier1986,
author = {Gross, Benedict H. and Zagier, Don B.},
title = {{H}eegner points and derivatives of {$L$}-series},
journal = {Inventiones Mathematicae},
volume = {84},
number = {2},
year = {1986},
pages = {225--320}
}
@article{Kolyvagin1989,
author = {Kolyvagin, Victor A.},
title = {Finiteness of {$E(\mathbb{Q})$} and {$\mathrm{Sha}(E, \mathbb{Q})$} for a subclass of {W}eil curves},
journal = {Mathematics of the USSR-Izvestiya},
volume = {32},
number = {3},
year = {1989},
pages = {523--541}
}
@article{Kato2004,
author = {Kato, Kazuya},
title = {{$p$}-adic {H}odge theory and values of zeta functions of modular forms},
journal = {Ast{\'e}risque},
volume = {295},
year = {2004},
pages = {117--290}
}
@article{SkinnerUrban2014,
author = {Skinner, Christopher and Urban, Eric},
title = {The {I}wasawa main conjectures for {$\mathrm{GL}_2$}},
journal = {Inventiones Mathematicae},
volume = {195},
number = {1},
year = {2014},
pages = {1--277}
}
@article{Tate1974,
author = {Tate, John},
title = {The arithmetic of elliptic curves},
journal = {Inventiones Mathematicae},
volume = {23},
number = {3-4},
year = {1974},
pages = {179--206}
}
@book{Silverman2009,
author = {Silverman, Joseph H.},
title = {The Arithmetic of Elliptic Curves},
series = {Graduate Texts in Mathematics},
volume = {106},
edition = {2nd},
publisher = {Springer},
year = {2009}
}
@book{ManinPanchishkin2005,
author = {Manin, Yuri I. and Panchishkin, Alexei A.},
title = {Introduction to Modern Number Theory},
series = {Encyclopaedia of Mathematical Sciences},
volume = {49},
edition = {2nd},
publisher = {Springer},
year = {2005}
}
@book{DiamondShurman2005,
author = {Diamond, Fred and Shurman, Jerry},
title = {A First Course in Modular Forms},
series = {Graduate Texts in Mathematics},
volume = {228},
publisher = {Springer},
year = {2005}
}
@article{Mazur1977,
author = {Mazur, Barry},
title = {Modular curves and the {E}isenstein ideal},
journal = {Publications Math{\'e}matiques de l'IH{\'E}S},
volume = {47},
year = {1977},
pages = {33--186}
}
@article{Faltings1983,
author = {Faltings, Gerd},
title = {Endlichkeitss{\"a}tze f{\"u}r abelsche {V}ariet{\"a}ten {\"u}ber {Z}ahlk{\"o}rpern},
journal = {Inventiones Mathematicae},
volume = {73},
number = {3},
year = {1983},
pages = {349--366}
}
@article{LanglandsTunnell1980,
author = {Tunnell, Jerrold},
title = {Artin's conjecture for representations of octahedral type},
journal = {Bulletin of the American Mathematical Society},
volume = {5},
number = {2},
year = {1981},
pages = {173--175}
}