08.14.04 · stat-mech / statistical-field-theory

Lattice fermions and the doubling problem

shipped3 tiersLean: none

Anchor (Master): Itzykson & Drouffe, *Statistical Field Theory*, Vol. 1 (Cambridge, 1989), Ch. 6 (lattice fermions, doublers, Wilson and staggered fermions); Nielsen & Ninomiya, *Nucl. Phys. B* 185, 20 (1981) and 193, 173 (1981) (no-go theorem); Montvay & Münster, *Quantum Fields on a Lattice* (Cambridge, 1994), Ch. 4

Intuition Beginner

Suppose you want to simulate a single quantum particle — an electron, say — on a computer. You cannot store space as a smooth continuum, so you replace it with a grid of points, a lattice, and you approximate the rate of change of a field by comparing its values at neighbouring grid points. For most equations this works fine. For the equation that governs electrons, the Dirac equation, the naive grid version misbehaves badly: instead of one electron, the lattice describes a whole flock of them.

Where do the extra particles come from? The Dirac equation involves a first derivative, and the most symmetric way to approximate a first derivative on a grid is to subtract the value two steps to the left from the value two steps to the right. That subtraction is blind to a fast zig-zag pattern that flips sign at every site: such a pattern looks motionless to the grid even though it is wildly varying. Each independent zig-zag direction adds one phantom particle, so a -dimensional grid hides of them. This is the doubling problem.

These phantoms — called doublers — are not a bug you can quietly delete. A deep result says that any honest, local recipe with the right symmetry will always grow them. So lattice physicists do something cleverer: they add a small correction that makes the phantoms very heavy, so heavy they drop out of the low-energy physics, leaving the one electron you wanted.

Visual Beginner

The picture shows a one-dimensional grid of sites. A smooth wave (the physical electron) varies slowly from site to site. Beneath it, a zig-zag wave flips sign at every site. The symmetric two-step difference used to approximate the derivative assigns both waves the same slope, so the grid cannot tell them apart — the zig-zag is the doubler.

The takeaway in one image: the symmetric difference has a blind spot at the fastest wiggle the grid supports, and that blind spot is exactly where each doubler lives.

Worked example Beginner

Work on a one-dimensional grid with spacing , and approximate the derivative of a field at site by the symmetric difference .

Feed it the slow wave for all (a constant). Then . A constant has zero slope: correct.

Now feed it the zig-zag , which reads . At an even site, and , so . The fastest-varying field the grid can hold is assigned slope zero, the same answer as the constant.

So two completely different fields — one flat, one flipping at every step — both look like a zero-momentum, slowly-drifting particle to this difference rule. The grid sees two particles where the continuum has one.

What this tells us: the symmetric difference vanishes at both ends of the grid's momentum range, not just at zero. Every extra vanishing point is a doubler, and counting them in dimensions gives the factor .

Check your understanding Beginner

Formal definition Intermediate+

Work on a -dimensional cubic lattice with spacing , sites , and unit lattice vectors . A Dirac field assigns to each site an anticommuting (Grassmann) spinor together with , living in the fermionic algebra of 08.14.02. Let be the Euclidean gamma matrices, .

The naive lattice Dirac action replaces the continuum derivative by the symmetric difference: $$ S_{\text{naive}} = a^d \sum_x \left[ \sum_\mu \bar\psi_x, \gamma_\mu, \frac{\psi_{x+\hat\mu} - \psi_{x-\hat\mu}}{2a} + m, \bar\psi_x \psi_x \right]. $$ Passing to momentum space with over the Brillouin zone , the action becomes diagonal with the lattice Dirac operator $$ D(p) = \frac{i}{a}\sum_\mu \gamma_\mu \sin(p_\mu a) + m . $$ The inverse is the lattice propagator. The doublers are the poles of the massless propagator: the points where for every , namely . There are such corners of the Brillouin zone, and near each the operator linearises to a continuum Dirac operator, so the naive action describes degenerate Dirac species rather than one.

The Wilson action adds a second-difference term with parameter (usually ): $$ S_W = S_{\text{naive}} - \frac{r a^{d-1}}{2}\sum_{x,\mu} \bar\psi_x, (\psi_{x+\hat\mu} - 2\psi_x + \psi_{x-\hat\mu}), $$ whose momentum-space effect is to shift the mass to the momentum-dependent $$ M(p) = m + \frac{2r}{a}\sum_\mu \sin^2!\frac{p_\mu a}{2}. $$ A doubler at a corner with components equal to acquires mass , so all doublers become heavy as while the physical mode at keeps mass .

Counterexamples to common slips

  • Setting the Wilson parameter returns the naive action and the doublers reappear; is essential. The Wilson term is dimension-five (an irrelevant operator), so it does not alter the continuum limit of the physical mode, yet it is decisive for the doublers because it scales as at the zone corners.
  • The Wilson term anticommutes with no and so commutes with : it breaks chiral symmetry explicitly, at . Forgetting this and expecting a chiral limit at fixed bare is the standard error; the chiral point must be tuned (the additive mass renormalisation).
  • Staggered fermions do not remove doublers; they redistribute them. The doublers become "tastes," and treating those tastes as physically distinct flavours (rather than a discretisation artefact requiring a rooting prescription) is a frequent misreading.

Key theorem with proof Intermediate+

Theorem (Naive lattice fermions describe species). The massless naive lattice Dirac operator on the Brillouin zone has exactly zeros, located at , and near each zero the operator reduces to a continuum massless Dirac operator. Hence the naive action propagates Dirac fermions.

Proof. The operator is a sum of gamma matrices with real coefficients . Because , one has . Therefore is singular (has a zero eigenvalue) precisely where for every simultaneously, i.e. for all .

On the half-open interval the equation has the two solutions and . Independently across the components, the joint zero set is the product , which contains points: the corners of the Brillouin zone.

Fix one such corner and write with small. For each component, , where according to whether is or . Substituting, $$ D(p^{(\alpha)} + k) = i\sum_\mu \big(s_\mu^{(\alpha)}\gamma_\mu\big), k_\mu + O(k^3). $$ Define . Since each , the satisfy the identical Clifford relations , so is a continuum massless Dirac operator in the variable . Each of the corners thus supplies one continuum Dirac species, and the species are degenerate because the leading operator is the same up to the field redefinition .

Bridge. This counting is the foundational reason a chiral lattice theory is so hard to build, and it builds toward the Nielsen-Ninomiya no-go theorem below, which promotes the accident into a topological obligation. The sign flips are not cosmetic: half of the corners carry one handedness and half the other, so summed over the Brillouin zone the net chirality vanishes — this is exactly the cancellation the no-go theorem forces. The bridge is that the analytic statement "the symbol has zeros" generalises to the topological statement "a smooth chiral charge on the Brillouin torus has vanishing total degree," and putting these together identifies the doubling count with an index. The same Clifford-algebra bookkeeping that fixes spinor components in 03.09.02 is dual to the bookkeeping that fixes the doublers here, and the central insight — that locality plus chirality plus the lattice forces extra fermions — appears again in the staggered and overlap constructions of the Master tier.

Exercises Intermediate+

Advanced results Master

The Nielsen-Ninomiya theorem elevates the doubling count to a topological obstruction. Consider a lattice Hamiltonian for a single Weyl fermion, defined on the Brillouin torus . Locality of the action forces to be a smooth (indeed analytic) periodic function of ; Hermiticity makes its eigenvalues real; and a Weyl fermion is a band touching where two bands cross with a definite chirality , the sign of the Jacobian of the map from a small sphere around the crossing to the sphere of Hamiltonian directions. The theorem states that for any such lattice realisation the chiralities of all band-touching points sum to zero, [Nielsen-Ninomiya 1981]. Consequently a single Weyl fermion cannot live on the lattice in isolation: it is always accompanied by a partner of opposite chirality.

The proof is a degree argument. Near a generic crossing in dimensions the Hamiltonian linearises to (for the Weyl case, with Pauli matrices ), and the chirality is the degree of the Gauss map , . The total chirality is the sum of these local degrees. Because the Brillouin zone is a torus — a closed manifold without boundary — the relevant differential form (a Berry curvature in , a more general index density in higher ) integrates to zero by Stokes, exactly as a vector field on a torus has zero total index by Poincaré-Hopf. The zero net degree forces the crossings to pair up. The original 1981 papers gave first a homotopy-theoretic proof (part I) and then an intuitive topological proof in terms of the conservation of a fermionic charge under spectral flow (part II).

Wilson's resolution sacrifices chiral symmetry. The Wilson operator , with the lattice Laplacian, satisfies at , so the no-go hypothesis of exact chiral symmetry fails and the doublers may be lifted [Wilson 1977]. The cost is real: an additive mass renormalisation (the bare mass must be tuned to a critical that is no longer protected by symmetry), an explicit discretisation error, and the loss of a clean lattice chirality. Kogut and Susskind instead diagonalised the spin structure, distributing the four Dirac components over the corners of an elementary hypercube; this keeps a continuous chiral symmetry that protects the mass, at the price of four degenerate "tastes" and the rooting controversy [Kogut-Susskind 1975].

The modern resolution deforms chirality rather than abandoning it. Ginsparg and Wilson observed that a lattice Dirac operator obeying realises a lattice-deformed chiral symmetry under , [Ginsparg-Wilson 1982]. An operator satisfying this relation can have a single light fermion with an exact lattice chiral symmetry and a correct continuum index — the lattice index theorem reproduces the topological charge. The no-go theorem is evaded because the symmetry algebra is the deformed one, not the undeformed . Neuberger's overlap operator and the equivalent domain-wall construction give explicit solutions.

Synthesis. The doubling problem is the foundational reason that putting chiral matter on a lattice is a topological, not merely numerical, problem, and the no-go theorem identifies the count with the vanishing of a net chiral charge on the Brillouin torus. This is exactly the same Poincaré-Hopf zero-total-index phenomenon that governs vector fields on closed manifolds, and it generalises the naive zero-counting of the Key theorem into a statement about degrees of Gauss maps. The central insight is that locality, Hermiticity, translation invariance, and exact chiral symmetry are jointly inconsistent on the lattice, so every practical scheme — Wilson, staggered, overlap — relaxes exactly one hypothesis: Wilson drops chiral symmetry outright, staggered keeps only a remnant at the cost of tastes, and the Ginsparg-Wilson route deforms the symmetry algebra. Putting these together, the bridge to continuum physics is that all three schemes flow to the same chiral gauge theory as , and the lattice index theorem identifies the deformed-chiral anomaly with the continuum axial anomaly, so the no-go obstruction is dual to the very anomaly it was forced by.

Full proof set Master

Proposition (Net lattice chirality vanishes for the naive operator). Let the naive massless lattice Dirac operator have its zeros at the corners , and assign each the chirality . Then for every .

Proof. The corner set factorises as a -fold Cartesian product, and the chirality is a product over components, so the total sum factorises: $$ \sum_{\alpha\in{0,\pi/a}^d} \prod_{\mu=1}^d \cos(p^{(\alpha)}\mu a) = \prod{\mu=1}^d \Big( \sum_{p_\mu \in {0,\pi/a}} \cos(p_\mu a) \Big) = \prod_{\mu=1}^d \big( \cos 0 + \cos\pi \big). $$ Each factor equals , so the whole product is . The cancellation is exact and dimension-independent, and it pairs each chirality corner with a corner.

Proposition (Wilson term lifts all doublers and leaves one light mode). With Wilson parameter and bare mass , the pole mass of the mode at a corner with components equal to is . For of order unity and , only the mode remains light; all modes have mass diverging as .

Proof. The Wilson momentum-space mass is . At a corner with for values of and for the rest, each slot contributes and each slot contributes . Hence and . The corner () keeps , finite. For each , as , so those doublers decouple from the continuum spectrum. The physical mode near has propagator , the continuum Dirac propagator.

Proposition (Ginsparg-Wilson operators carry a lattice chiral symmetry). If satisfies , then the action is invariant to first order under the deformed transformation , .

Proof. Compute the variation : $$ \delta S = \bar\psi(1 - \tfrac{a}{2}D)\gamma_5 D\psi + \bar\psi D\gamma_5(1 - \tfrac{a}{2}D)\psi = \bar\psi\Big[ \gamma_5 D + D\gamma_5 - \tfrac{a}{2}\big(D\gamma_5 D + D\gamma_5 D\big)\Big]\psi. $$ The bracket is , which vanishes precisely by the Ginsparg-Wilson relation. Hence and the deformed transformation is an exact symmetry of the lattice action. The undeformed chiral algebra is recovered as , where the relation degenerates to .

Connections Master

  • The link variables and gauge-invariant plaquette action of Wilson lattice gauge theory 08.08.01 supply the gauge field; coupling it to the lattice fermions defined here — by inserting the link between and in the hopping term — assembles the full lattice QCD action. That unit owns the pure-gauge sector; this unit owns the matter sector that the no-go theorem complicates.

  • The Grassmann integration of 08.14.02 is the measure in which these lattice fermions are defined: the fermionic partition function is exactly the determinant identity proved there, now with the lattice Dirac operator rather than the Ising transfer matrix. The doubler count is the multiplicity of zeros of this in the massless limit.

  • The fermionic Fock space and anticommutation relations of 08.10.09 are the operator content on which the Hamiltonian (Kogut-Susskind) lattice formulation acts; the spin-diagonalisation that produces staggered "tastes" is a reorganisation of those creation and annihilation operators across a unit hypercube.

  • The order-disorder and duality structure that organises the two-dimensional fermionic critical point appears again in 08.14.05, where the same lattice fermion bilinears carry the Kramers-Wannier and continuum-limit data; the chiral symmetry whose lattice fate is decided here is the symmetry whose breaking pattern that unit tracks.

Historical & philosophical context Master

Wilson introduced the lattice as a non-perturbative regulator of gauge theory in 1974, and in the same circle of work added the second-difference term — now the Wilson term — to remove the fermion doublers, presenting the lattice-fermion construction explicitly in the 1975 Erice lectures published in 1977 [Wilson 1977]. Kogut and Susskind gave the Hamiltonian lattice formulation in 1975 and the spin-diagonalised (staggered) treatment of fermions that trades doublers for tastes [Kogut-Susskind 1975]. The structural obstruction was made precise by Nielsen and Ninomiya in two 1981 papers in Nuclear Physics B: a homotopy-theoretic proof followed by an intuitive topological argument that the net chirality of any local, Hermitian, translation-invariant lattice fermion must vanish, so a single Weyl species — and with it a naive lattice formulation of the electroweak chiral gauge theory — is forbidden [Nielsen-Ninomiya 1981]. The deformation that ultimately reconciled chirality with the lattice came from Ginsparg and Wilson in 1982, whose relation lay largely dormant for fifteen years until Neuberger's overlap operator and the domain-wall construction realised it concretely [Ginsparg-Wilson 1982].

Bibliography Master

@article{NielsenNinomiya1981a,
  author  = {Nielsen, H. B. and Ninomiya, M.},
  title   = {Absence of Neutrinos on a Lattice: (I). Proof by Homotopy Theory},
  journal = {Nuclear Physics B},
  volume  = {185},
  year    = {1981},
  pages   = {20--40}
}

@article{NielsenNinomiya1981b,
  author  = {Nielsen, H. B. and Ninomiya, M.},
  title   = {Absence of Neutrinos on a Lattice: (II). Intuitive Topological Proof},
  journal = {Nuclear Physics B},
  volume  = {193},
  year    = {1981},
  pages   = {173--194}
}

@article{Wilson1974Confinement,
  author  = {Wilson, Kenneth G.},
  title   = {Confinement of Quarks},
  journal = {Physical Review D},
  volume  = {10},
  year    = {1974},
  pages   = {2445--2459}
}

@incollection{Wilson1977Erice,
  author    = {Wilson, Kenneth G.},
  title     = {Quarks and Strings on a Lattice},
  booktitle = {New Phenomena in Subnuclear Physics (Erice 1975)},
  editor    = {Zichichi, A.},
  publisher = {Plenum Press},
  address   = {New York},
  year      = {1977},
  pages     = {69--142}
}

@article{KogutSusskind1975,
  author  = {Kogut, John and Susskind, Leonard},
  title   = {Hamiltonian Formulation of Wilson's Lattice Gauge Theories},
  journal = {Physical Review D},
  volume  = {11},
  year    = {1975},
  pages   = {395--408}
}

@article{GinspargWilson1982,
  author  = {Ginsparg, Paul H. and Wilson, Kenneth G.},
  title   = {A Remnant of Chiral Symmetry on the Lattice},
  journal = {Physical Review D},
  volume  = {25},
  year    = {1982},
  pages   = {2649--2657}
}

@book{ItzyksonDrouffe1989Vol1,
  author    = {Itzykson, Claude and Drouffe, Jean-Michel},
  title     = {Statistical Field Theory, Volume 1},
  publisher = {Cambridge University Press},
  year      = {1989}
}

@book{MontvayMunster1994,
  author    = {Montvay, Istvan and M\"unster, Gernot},
  title     = {Quantum Fields on a Lattice},
  publisher = {Cambridge University Press},
  year      = {1994}
}

@article{Neuberger1998Overlap,
  author  = {Neuberger, Herbert},
  title   = {Exactly Massless Quarks on the Lattice},
  journal = {Physics Letters B},
  volume  = {417},
  year    = {1998},
  pages   = {141--144}
}