21.09.03 · number-theory / arakelov-faltings

Heights and the Néron–Tate canonical height

shipped3 tiersLean: none

Anchor (Master): Néron 1965 *Annals of Mathematics* 82, 249-331 (originator paper — *Quasi-fonctions et hauteurs sur les variétés abéliennes*: the local decomposition of the canonical height into Néron local height functions, one per place); Tate (unpublished, ~1962, exposed in Lang *Elliptic Curves: Diophantine Analysis* and Manin-Panchishkin) — the limit construction $\hat h(P) = \lim_n h(nP)/n^2$ producing the canonical height by telescoping the parallelogram defect; Weil 1929 *Acta Mathematica* 52, 281-315 (the Mordell-Weil theorem over number fields, the descent in which heights first appear) and Weil 1951 *J. Math. Pures Appl.* 29 (the height machine attaching functorial heights to divisor classes); Northcott 1949 *Proc. Cambridge Phil. Soc.* 45 (the finiteness property); Silverman 2009 *The Arithmetic of Elliptic Curves* GTM 106 Ch. VIII; Bombieri-Gubler 2006 *Heights in Diophantine Geometry* Ch. 1-9; Lang 1983 *Fundamentals of Diophantine Geometry* (Springer) Ch. 4-5; Manin-Panchishkin 2005 *Introduction to Modern Number Theory* (Springer EMS 49, 2nd ed.) Ch. 4 §3, Ch. 6 §3

Intuition Beginner

A rational number like feels small, while feels large and complicated, even though both are close to the same real value. A height is a way to make this feeling precise: a number that measures the arithmetic complexity of a point, roughly the number of digits needed to write it down. The fraction in lowest terms has height about , so has small height and has large height.

The single most useful fact about heights is a finiteness statement. There are only finitely many rational numbers whose height is below any fixed bound, because there are only finitely many fractions with and both small. This is the engine behind many theorems: if you can show the rational points you care about all have bounded height, then there can be only finitely many of them.

On an elliptic curve, points can be added together by the group law. One wants a height that interacts well with this addition, so that doubling a point multiplies its height in a predictable way. The naive height almost does this, but with an annoying bounded error. The canonical height is a polished version that removes the error completely: it behaves like a perfect squared-length, a precise gauge in which adding points and measuring their size fit together exactly.

Visual Beginner

Picture the rational points of an elliptic curve as dots, each tagged with a number measuring its complexity. Small dots (the simple points like a torsion point) sit near the centre; large dots (points reached by adding a generator to itself many times) march outward, their tags growing like the square of how many steps they took. A dashed circle of fixed radius encloses only finitely many dots: this is the Northcott finiteness property drawn as a picture.

The key visual idea: the canonical height turns the group into something like a lattice of dots in space, where the height is the squared distance from the origin, and only finitely many lattice points fit inside any fixed ball.

Worked example Beginner

Compute heights of a few rational points and watch the canonical height emerge.

Step 1. The logarithmic height of a rational number in lowest terms is . So , and a torsion-like simple value such as has height . Bigger fractions have bigger heights: .

Step 2. On a curve, doubling a point roughly squares the size of its coordinates. So the naive height of is about four times the naive height of : the doubling formula on an elliptic curve is a degree-four map on coordinates, and a degree-four map multiplies logarithmic height by about . In symbols, , but with a bounded error that depends on the point.

Step 3. Tate's idea removes the error. Form the sequence , dividing the height of by . Each step corrects a little of the bounded error, and the numbers settle down to a limit. That limit is the canonical height .

Step 4. The canonical height squares under doubling with no error at all: , and more generally . The "squared-length" behaviour is now exact.

Step 5. The finiteness pay-off. Since only finitely many points have height below any bound, and since a point of height zero turns out to be a torsion point (one of finite order), the canonical height sorts the points of the curve cleanly: a finite torsion core at height zero, and everything else spreading outward with quadratically growing height.

What this tells us: the naive height already measures complexity and already gives finiteness, but it wobbles under the group law. The canonical height is the wobble-free version, a perfect quadratic gauge in which "small means few" and "doubling means times four" both hold on the nose.

Check your understanding Beginner

Formal definition Intermediate+

Let be a number field with set of places , each place normalised so that the product formula holds: for every , where the normalisation absorbs the local degrees into .

Multiplicative Weil height. For a point , the multiplicative height is $$ H(P) = \prod_{v \in M_K} \max_{0 \le i \le n} |x_i|_v. $$ The product formula makes independent of the choice of homogeneous coordinates: scaling all by a common multiplies the product by . The logarithmic (Weil) height is $$ h(P) = \log H(P) = \sum_{v \in M_K} \log \max_i |x_i|_v \ge 0. $$ The absolute height is obtained by normalising for the field of definition, , so that is well-defined on independently of the number field through which is presented. The additive form is ; the multiplicative form is .

Northcott finiteness. For every pair of bounds , the set $$ { P \in \mathbb{P}^n(\overline{\mathbb{Q}}) : h(P) \le B,\ [\mathbb{Q}(P) : \mathbb{Q}] \le d } $$ is finite. Kronecker's theorem is the boundary case: an algebraic number has if and only if or is a root of unity.

Weil height machine. For a smooth projective variety and a divisor class , there is a height function , well-defined up to a bounded function , characterised by: (i) normalisation ; (ii) additivity ; (iii) functoriality for a morphism . The machine is a homomorphism .

Néron–Tate canonical height. Let be an abelian variety (the elliptic-curve case is the main one) and a symmetric ample divisor class, so in . The canonical height is the limit $$ \hat h(P) = \hat h_D(P) = \lim_{n \to \infty} \frac{h_{A,D}(nP)}{n^2}, $$ a genuine real-valued function (no ambiguity) satisfying . On an elliptic curve with doubled to a degree-two class one usually writes using the -coordinate height. The canonical height is a positive-definite quadratic form on : it satisfies the parallelogram law $$ \hat h(P+Q) + \hat h(P-Q) = 2\hat h(P) + 2\hat h(Q), $$ the scaling , and .

Height pairing and regulator. The associated bilinear form is $$ \langle P, Q \rangle = \tfrac{1}{2}\big(\hat h(P+Q) - \hat h(P) - \hat h(Q)\big), $$ positive-definite on . If generate the free part of of rank , the regulator is $$ R_E = \det\big( \langle P_i, P_j \rangle \big)_{1 \le i, j \le r} > 0, $$ the covolume of the Mordell-Weil lattice, which enters the Birch-Swinnerton-Dyer formula.

Counterexamples to common slips

  • The naive height is not a quadratic form: , with the error genuinely present. Only the canonical limit kills the error. Treating itself as quadratic produces false exact identities.
  • characterises torsion, not the identity point alone. A nonzero torsion point of order also has canonical height zero. The vanishing locus of is the whole torsion subgroup.
  • The canonical height depends on the chosen symmetric divisor class through a positive scalar: and differ by the ratio of their classes when both are symmetric ample. Positive-definiteness and the torsion characterisation are independent of this normalisation, but the numerical regulator is normalised by a fixed choice (the class of the origin for an elliptic curve).

Key theorem with proof Intermediate+

Theorem (existence and quadraticity of the canonical height; Néron 1965, Tate c. 1962 [source pending]). Let be an elliptic curve and the logarithmic Weil height attached to the degree-two divisor class of the origin. For every the limit $$ \hat h(P) = \lim_{n \to \infty} \frac{h(2^n P)}{4^n} $$ exists, satisfies , and defines a quadratic form on : the parallelogram law holds, , and with if and only if is a torsion point.

Proof. The duplication formula on realises as a morphism of degree four on the divisor class of the origin, so by the functoriality of the height machine there is a constant with $$ |h(2Q) - 4 h(Q)| \le C \qquad \text{for all } Q \in E(\overline{K}). $$ Apply this with and divide by : $$ \left| \frac{h(2^{n+1}P)}{4^{n+1}} - \frac{h(2^n P)}{4^n} \right| \le \frac{C}{4^{n+1}}. $$ The partial sums of converge, so the sequence is Cauchy and the limit exists. Telescoping the same bound from gives , so .

For the scaling relation, the height machine gives , hence ; passing to the limit yields , and the analogous degree- bound for gives .

For the parallelogram law, the theorem of the square on supplies the divisor-class identity on the abelian variety, which through the height machine reads . Replacing each point by times itself, dividing by , and passing to the limit kills the and yields the exact parallelogram law for . A function satisfying the parallelogram law is a quadratic form, so is quadratic.

Non-negativity follows because and on a set bounded below after the scaling normalisation; more precisely, if then , contradicting bounded below, so . If then all multiples have bounded, so by Northcott finiteness the set is finite, forcing to have finite order. Conversely a torsion point has a finite orbit of bounded height, so its canonical height vanishes.

Bridge. The canonical height builds toward the entire arithmetic of the Mordell-Weil group: it is exactly the positive-definite quadratic form that turns into a lattice, and the finiteness it encodes is the foundational reason the descent argument terminates. This is exactly the input that powers the Birch-Swinnerton-Dyer regulator , where the height pairing measures the covolume of that lattice; the canonical height generalises the naive Weil height by removing its bounded defect, and putting these together one sees that the central insight — that arithmetic complexity is a squared-length — is dual to the geometric fact that on an abelian variety. The bridge is that this same quadratic form reappears as an arithmetic self-intersection number: the canonical height is the Arakelov height developed in 21.09.01, and it appears again in 21.09.02 where heights drive Faltings's proof of the Mordell conjecture, and in 21.06.01 where the regulator enters BSD.

Exercises Intermediate+

Advanced results Master

Theorem (Néron's local decomposition; Néron 1965 [source pending]). Let be an abelian variety with symmetric ample and canonical height . There exist Néron local height functions , one for each place of , each a quasi-function bounded by an intersection-theoretic local term and continuous in the -adic topology, such that for every $$ \hat h_D(P) = \sum_{v \in M_K} \lambda_v(P). $$ The local heights are determined up to normalisation by their transformation under the group law and their logarithmic singularity along ; at archimedean places is given by a Green's function on the complex torus , and at non-archimedean places by an intersection number on the Néron model.

This decomposition is the precise bridge to Arakelov geometry. The archimedean local heights are Green's-function integrals on , and the non-archimedean local heights are local intersection multiplicities on the Néron model over ; their sum reconstitutes the global canonical height. In the language of 21.09.01, is the arithmetic intersection number of the section against the hermitian line bundle attached to , so the canonical height is an Arakelov self-intersection.

Theorem (positive-definiteness over ; Néron-Tate). The canonical height extends by the quadratic-form property to a positive-definite real quadratic form on the finite-dimensional real vector space . Consequently is a lattice of full rank in with respect to the inner product , and the regulator is the squared covolume of this lattice, independent of the choice of basis up to sign.

The positive-definiteness is the analytic heart of the theory. It implies the finiteness of points of bounded canonical height directly (a ball in a Euclidean lattice contains finitely many lattice points), recovering Northcott in the geometric language, and it makes the regulator a well-defined real invariant of the curve. The regulator is one of the arithmetic factors in the conjectural Birch-Swinnerton-Dyer formula for the leading Taylor coefficient of at .

Theorem (lower bounds and Lehmer-type problems; Bombieri-Gubler 2006 [source pending]). For a fixed elliptic curve , the canonical heights of non-torsion points are bounded below by a positive constant depending on (a result of the Northcott property), and the elliptic Lehmer conjecture predicts a uniform lower bound scaling with the degree of the field of definition. The classical Lehmer problem is the analogous statement for the Weil height of algebraic numbers (the multiplicative-group case), and the two are unified by the height machine viewed across distinct group-variety structures: each is a statement that the height cannot be too small unless the point is special (torsion, or a root of unity).

Synthesis. The foundational reason heights exist is that arithmetic complexity must be a single real number that grows predictably under the operations of the geometry, and this is exactly what the Weil height machine delivers: a homomorphism from divisor classes to functions-up-to-. Putting these together, the naive Weil height already encodes Northcott finiteness and Kronecker's height-zero characterisation, the canonical height generalises it by removing the defect through Tate's telescoping limit, and the resulting quadratic form is dual to the geometric relation on an abelian variety — the central insight that turns the group law into a squared-length. The bridge to the rest of the arc is Néron's local decomposition: the global canonical height splits as a sum of local heights, archimedean Green's functions plus non-archimedean intersection numbers, and this is exactly the Arakelov self-intersection of 21.09.01, so the canonical height is simultaneously a Diophantine measuring stick, a positive-definite lattice inner product, and an arithmetic-intersection number. This same object appears again in 21.06.01 as the BSD regulator and in 21.09.02 as the height driving Faltings's finiteness, and putting these together one sees a single quadratic form organising the descent that proves Mordell-Weil, the lattice geometry that gives the regulator, and the arithmetic intersection theory that proves Mordell.

Full proof set Master

Proposition (well-definedness of the Weil height under change of coordinates). The multiplicative height on is independent of the choice of homogeneous coordinates representing .

Proof. Two coordinate vectors for the same projective point differ by a common scalar : if , then at each place , $$ \max_i |\lambda x_i|_v = |\lambda|_v \max_i |x_i|_v. $$ Taking the product over all places, $$ \prod_v \max_i |\lambda x_i|_v = \Big(\prod_v |\lambda|_v\Big) \prod_v \max_i |x_i|_v = \prod_v \max_i |x_i|_v, $$ where by the product formula for the number field . Hence is unchanged, and so is .

Proposition (the canonical height is well-defined and independent of the limit base). The limit exists for all , and equals along the full sequence of positive integers; in particular the value does not depend on using base .

Proof. Existence and the Cauchy estimate were established in the Key Theorem: the bound gives for , a convergent geometric tail. For independence of base, set and use together with the relation , which was proved from the degree- height-machine bound . Then for any , $$ \frac{h(nP)}{n^2} = \frac{\hat h(nP) + O(1)}{n^2} = \frac{n^2 \hat h(P) + O(1)}{n^2} = \hat h(P) + O(n^{-2}) \to \hat h(P), $$ so the limit along all integers exists and equals the base- limit. The canonical height is therefore intrinsic.

Proposition (the parallelogram law characterises as quadratic). A function on an abelian group satisfying for all and is a quadratic form: is bi-additive and .

Proof. Setting in the parallelogram law gives , consistent with . Setting gives , and inductively . Symmetry of is immediate. For additivity in the first slot, expand using the definition and the parallelogram law repeatedly; the standard polarisation identity shows this difference is zero, so is bi-additive. Finally . Applying this with , whose parallelogram law was proved via the theorem of the square, shows is a quadratic form with associated pairing .

Proposition (positivity of the regulator). If are -independent in , the Gram matrix is positive-definite, so .

Proof. The pairing is symmetric and bilinear on , and with equality only for torsion points. For independent and any nonzero real vector , the combination in is nonzero (by independence), so . Thus is positive-definite, and a positive-definite symmetric matrix has strictly positive determinant, giving . The determinant is the squared covolume of the lattice in the inner-product space .

Connections Master

  • Elliptic curves and Mordell-Weil 04.04.03. The canonical height is the quadratic form that makes the descent proof of the Mordell-Weil theorem work: the height-decrease under multiplication by combined with Northcott finiteness turns the finiteness of into finite generation of . The height pairing then organises the free part into a Euclidean lattice, and torsion pins down exactly which points collapse to the origin of that lattice.

  • Abelian varieties 04.10.16. On a general abelian variety with symmetric ample , the relation is the geometric source of the quadratic scaling . The theorem of the square supplies the parallelogram law, so the canonical height is a positive-semidefinite quadratic form on , positive-definite modulo torsion. The elliptic-curve case is the dimension-one specialisation, and the Néron-Tate height on a Jacobian is the same construction applied to the theta divisor.

  • Arakelov geometry and arithmetic surfaces 21.09.01. Néron's local decomposition identifies the canonical height with an Arakelov arithmetic intersection number: archimedean local heights are Green's-function integrals on and non-archimedean local heights are intersection multiplicities on the Néron model. The canonical height is therefore the arithmetic self-intersection in the framework of that unit, making heights a special case of arithmetic intersection theory.

  • Modularity and the BSD conjecture 21.06.01. The regulator built from the height pairing is one of the arithmetic factors in the Birch-Swinnerton-Dyer formula for the leading coefficient of at . The Gross-Zagier theorem computes the canonical height of a Heegner point in terms of the derivative , directly linking the height pairing developed here to the analytic side of BSD.

  • Faltings / Mordell theorem 21.09.02. Heights are the bookkeeping that drives Faltings's and Vojta's proofs of the Mordell conjecture: the Faltings height of an abelian variety and the canonical height of points on a curve in its Jacobian are the invariants bounded in the finiteness arguments. The Northcott property of the height — finitely many points of bounded height and degree — is the abstract finiteness principle that those proofs ultimately exploit.

Historical & philosophical context Master

The height function entered number theory as the bookkeeping device in André Weil's 1929 thesis, where he proved the Mordell-Weil theorem over arbitrary number fields by an infinite descent that needed a numerical measure of the "size" of a rational point to guarantee termination [Weil 1929]. Weil's height measured the number of digits of a point's coordinates, and the decisive property was finiteness: only finitely many points have bounded height. Mordell had proved the genus-one case over in 1922, and the descent already implicitly used heights; Weil made the height a systematic tool and later, around 1950, abstracted it into the height machine that attaches a functorial height to every divisor class on a projective variety, well-defined up to a bounded error.

The finiteness half was crystallised by D. G. Northcott in 1949 [Northcott 1949], whose inequality showed that algebraic points of bounded height and bounded degree are finite in number — the pigeonhole principle on which the whole subject rests. Kronecker's much older theorem, that an algebraic number of height zero is a root of unity, is the boundary case marking the height-zero locus. The qualitative naive height, however, wobbled under the group law on an abelian variety: doubling a point only multiplied the height by four up to a bounded error, so the height was not a clean quadratic form.

The decisive refinement came around 1962 from John Tate, who observed that the bounded defect could be telescoped away by averaging: the limit converges and is an exact quadratic form. Independently and more structurally, André Néron's 1965 Annals paper Quasi-fonctions et hauteurs sur les variétés abéliennes [Néron 1965] constructed the canonical height through a decomposition into local height functions, one per place, each a quasi-function controlled by local intersection theory. The two constructions agree: Tate's global limit and Néron's local sum are two faces of one object. Néron's local viewpoint proved prophetic, for it is precisely the decomposition that, two decades later, would be recognised as an Arakelov arithmetic intersection number, fusing the Diophantine theory of heights with the arithmetic intersection theory of the present chapter. The canonical height now sits at the centre of the subject: it is the positive-definite quadratic form whose determinant is the Birch-Swinnerton-Dyer regulator, the invariant Faltings bounded to prove Mordell, and the arithmetic self-intersection that closes the Arakelov circle.

Bibliography Master

@article{Neron1965,
  author  = {N{\'e}ron, Andr{\'e}},
  title   = {Quasi-fonctions et hauteurs sur les vari{\'e}t{\'e}s ab{\'e}liennes},
  journal = {Annals of Mathematics},
  volume  = {82},
  year    = {1965},
  pages   = {249--331}
}

@article{Weil1929,
  author  = {Weil, Andr{\'e}},
  title   = {L'arithm{\'e}tique sur les courbes alg{\'e}briques},
  journal = {Acta Mathematica},
  volume  = {52},
  year    = {1929},
  pages   = {281--315}
}

@article{Northcott1949,
  author  = {Northcott, D. G.},
  title   = {An inequality in the theory of arithmetic on algebraic varieties},
  journal = {Proceedings of the Cambridge Philosophical Society},
  volume  = {45},
  year    = {1949},
  pages   = {502--509}
}

@book{Silverman2009,
  author    = {Silverman, Joseph H.},
  title     = {The Arithmetic of Elliptic Curves},
  publisher = {Springer-Verlag},
  series    = {Graduate Texts in Mathematics},
  volume    = {106},
  edition   = {2},
  year      = {2009}
}

@book{BombieriGubler2006,
  author    = {Bombieri, Enrico and Gubler, Walter},
  title     = {Heights in Diophantine Geometry},
  publisher = {Cambridge University Press},
  series    = {New Mathematical Monographs},
  volume    = {4},
  year      = {2006}
}

@book{HindrySilverman2000,
  author    = {Hindry, Marc and Silverman, Joseph H.},
  title     = {Diophantine Geometry: An Introduction},
  publisher = {Springer-Verlag},
  series    = {Graduate Texts in Mathematics},
  volume    = {201},
  year      = {2000}
}

@book{Lang1983,
  author    = {Lang, Serge},
  title     = {Fundamentals of Diophantine Geometry},
  publisher = {Springer-Verlag},
  year      = {1983}
}

@book{ManinPanchishkin2005,
  author    = {Manin, Yuri I. and Panchishkin, Alexei A.},
  title     = {Introduction to Modern Number Theory},
  publisher = {Springer-Verlag},
  series    = {Encyclopaedia of Mathematical Sciences},
  volume    = {49},
  edition   = {2},
  year      = {2005}
}