12.12.02 · quantum / canonical-qft

Coulomb gauge vs Lorenz gauge in QED

shipped3 tiersLean: nonepending prereqs

Anchor (Master): Berestetskii, Lifshitz, Pitaevskii, Quantum Electrodynamics, 2e (Pergamon 1982), §§4, 76–77, 102; Weinberg, The Quantum Theory of Fields, Vol. I (Cambridge 1995), §§8.5, 9.5; Itzykson & Zuber, Quantum Field Theory (McGraw-Hill 1980), §3-2

Intuition Beginner

Maxwell's equations have a redundancy built into them. The electric and magnetic fields are the directly measurable quantities, but the equations of quantum-electrodynamics are most cleanly written in terms of a four-component potential — the scalar potential and the vector potential together. The trouble is that two different potentials can give the same electric and magnetic fields: there is a one-function-per-spacetime-point shift you can perform on the potential (adding the gradient of an arbitrary function) that leaves the measurable fields invariant. This is a gauge symmetry, and two potentials related by such a shift describe the same physical situation.

For a classical theory this redundancy is a minor annoyance. For a quantum theory it is a structural problem. The standard quantisation recipe wants to identify canonical pairs of fields and momenta and impose commutation relations between them; a redundant degree of freedom has no genuine conjugate momentum, so the recipe fails. Gauge fixing is the procedure of cutting through the redundancy by imposing a constraint that selects one representative from each gauge orbit. Two such constraints dominate the textbook treatment of quantum-electrodynamics.

The first is the Coulomb gauge, which sets the spatial divergence of the vector potential to zero. It kills the longitudinal piece of the vector potential and leaves only the two transverse polarisations as propagating photons, with the instantaneous Coulomb interaction between charges appearing explicitly. It is the natural choice for atomic physics: when you compute the hydrogen spectrum or the Lamb shift, the dominant electron-proton binding is just the Coulomb attraction, and the Coulomb gauge puts it front and centre.

The price for choosing Coulomb gauge is that the gauge condition picks out the time direction. A Lorentz boost mixes time and space, so a frame's Coulomb gauge becomes a different frame's complicated gauge. Lorentz covariance holds in the theory but is no longer manifest in the formulas.

The second main choice is the Lorenz gauge (named after L. V. Lorenz 1867, not H. A. Lorentz of the transformation group), which sets a Lorentz-scalar combination of derivatives of the potential to zero. It is the natural choice for high-energy scattering: cross sections for electron-electron, electron-photon, and electron-positron collisions are Lorentz-invariant numbers, and a Lorentz-invariant gauge condition keeps the computations tidy.

The price for Lorenz gauge is that you have to quantise four photon polarisations — two transverse, one longitudinal, one time-like — three of which are unphysical. The unphysical pieces must cancel out of any observable quantity. Gupta and Bleuler showed in 1950 how to do this with an "indefinite metric" Hilbert space and a selection rule that defines physical states; Faddeev and Popov gave the modern path-integral version in 1967 with ghost fields.

Both choices describe the same physics. The differential cross section for Compton scattering, the energy levels of hydrogen, the anomalous magnetic moment of the electron — every measurable prediction of quantum-electrodynamics is the same in Coulomb gauge, in Lorenz gauge with any value of the gauge parameter, and in any other consistent gauge. This is the gauge equivalence theorem, and it is enforced by the Ward identity: a symmetry of the bare action that makes gauge-dependent pieces cancel between Feynman diagrams when external particles are placed on-shell. The choice of gauge is therefore a choice of formalism, not of physics. Choose the gauge that makes the computation you want to do as simple as possible.

Visual Beginner

Property Coulomb gauge Lorenz / Feynman gauge
Gauge condition div(A) = 0 covariant:
Propagating photon modes 2 transverse 4 covariant (2 transverse + 2 unphysical)
Scalar potential Instantaneous Coulomb potential Dynamical, time-like polarisation
Lorentz covariance True but hidden Manifest
Unitarity Manifest Restored by Gupta-Bleuler / BRST
Photon propagator (sketch) transverse projector + instantaneous -squared covariant tensor with gauge parameter
Preferred use Atomic physics, bound states, Lamb shift Scattering amplitudes, loop calculations, renormalisation
Generalises to non-abelian No (no covariant analogue) Yes (Faddeev-Popov ghosts essential)
Originator Coulomb 1773 (implicit); Dirac 1955 (canonical) Lorenz 1867; Fermi 1932; Gupta-Bleuler 1950

Worked example Beginner

Pick a concrete plane-wave photon and watch how the gauge choice constrains its polarisation.

Imagine a single plane-wave photon travelling in the direction with frequency — wavevector along the -axis. The vector potential can be written as a sum of three independent polarisation directions: (transverse), (transverse), and (longitudinal). The scalar potential adds a fourth time-like direction. Before any gauge fixing, the photon carries four polarisation amplitudes, and the gauge freedom shifts among them.

In Coulomb gauge the requirement that the vector potential be divergence-free, combined with a plane-wave dependence in the -direction, forces the -component of the polarisation to vanish. The longitudinal piece is gone; only the - and -polarisation amplitudes survive as propagating degrees of freedom. The scalar potential is determined by the charge density via the instantaneous Coulomb law — it is not a free choice. The plane-wave photon has exactly two physical polarisation states, matching what an experimentalist measures when a polarising filter is rotated in front of a laser beam.

In Lorenz gauge all four polarisation amplitudes remain in the formalism. Two of them (the transverse - and -polarisations) are physical; the other two (a combination of the time-like and longitudinal directions) are unphysical. The Gupta-Bleuler subsidiary condition is a rule on physical states that forces the two unphysical amplitudes into a combination that gives zero norm — a state that has zero probability of appearing in any measurement. After projecting onto physical states, the photon again has two physical polarisations, just as in Coulomb gauge.

Sanity check: count photons. In a beam of photons per second, both gauges predict events per second in a detector, with two independent polarisation channels per photon. The two gauges agree on what an experiment sees. They disagree only on the intermediate book-keeping — which is purely a matter of how the formalism is organised.

What this tells us: the gauge choice does not change the photon. It changes how many bookkeeping variables you carry around to describe the photon. Coulomb gauge carries the minimum number (two transverse modes plus an instantaneous Coulomb potential). Lorenz gauge carries the full four-vector and then projects out the unphysical pieces at the end. Same physics, different formalism.

Check your understanding Beginner

Formal definition Intermediate+

Work in natural units with the mostly-minus metric . The free electromagnetic Lagrangian is

The Euler-Lagrange equations give the source-free Maxwell equations . Coupling to a Dirac field 12.11.01 via the minimal-coupling prescription produces the full QED Lagrangian

with the electron-photon vertex as the unique gauge-invariant interaction at lowest order. The local gauge symmetry is the simultaneous transformation

under which is invariant, the kinetic term shifts by , and the interaction term shifts by exactly the opposite amount, so the full Lagrangian is invariant. The corresponding global symmetry ( constant) is the electron-number symmetry whose Noether current is the electromagnetic current sourcing .

The gauge-fixing problem

Canonical quantisation requires identifying the canonical momentum conjugate to each field. For the spatial components,

so the spatial electric field is conjugate to the spatial vector potential. But for ,

since identically. This is the constraint that prevents naive canonical quantisation: has no conjugate momentum, hence no canonical commutation relation, hence is not a dynamical degree of freedom. The Euler-Lagrange equation for gives the Gauss-law constraint (in the presence of charge density ), which similarly involves no time derivatives and so is a constraint rather than an equation of motion. The Dirac-Bergmann analysis of constrained Hamiltonian systems classifies these as first-class constraints whose generators are the components of the gauge group's Lie algebra acting on the field configuration. Quantisation requires either reducing to a non-redundant set of physical variables (the Coulomb-gauge route) or extending the field content with auxiliary modes whose unphysical contributions cancel out of physical states (the Lorenz / covariant-gauge route).

Coulomb gauge

Impose . Helmholtz-decompose any vector field into transverse and longitudinal parts with and ; the Coulomb gauge sets . The Gauss-law constraint together with and becomes

the instantaneous Coulomb potential of the charge distribution. is determined non-dynamically by the charge density at the same time; it is not an independent degree of freedom and does not need a canonical commutation relation. The dynamical content of the photon field is carried entirely by the two transverse components of , mode-expanded as

with (transversality) and (orthonormality). The mode operators satisfy the standard bosonic commutation relations on the bosonic Fock space 12.13.01, and the resulting photon Hamiltonian is positive on a positive-definite Hilbert space — the Coulomb-gauge quantisation is manifestly unitary.

The Coulomb-gauge photon propagator is most efficiently written in mixed coordinate-momentum form. The transverse projector is , and

The piece is the instantaneous Coulomb interaction in momentum space: it propagates without a pole, reflecting that is non-dynamical. The full structure is awkward to manipulate in covariant calculations because the transverse projector explicitly breaks Lorentz covariance, but it is precisely the structure that makes the Coulomb interaction explicit in atomic physics.

Lorenz gauge and the -family

Impose . The cleanest implementation is to modify the Lagrangian by adding a gauge-fixing term

with a real positive parameter. The full Lagrangian has no gauge invariance, but the variational equation now produces , which together with the gauge condition reduces to the four decoupled wave equations . Now every component of has a non-zero canonical momentum, canonical quantisation is straightforward, and the photon propagator follows by inverting the kinetic operator:

The parameter labels a one-parameter family of gauges within the Lorenz class:

  • is Feynman gauge: , the simplest possible form, with no term.
  • is Landau gauge (the strict Lorenz limit): , manifestly transverse in spacetime indices.
  • is unitary gauge (formal limit, useful for spontaneously broken gauge theories).

The gauge-fixed quantisation makes all four components of dynamical, with mode expansion

over four covariant polarisation vectors : two transverse (), one longitudinal (, ), and one time-like (, ). The commutation relations now read

with the metric tensor appearing explicitly. The minus sign in produces a sign-indefinite inner product on Fock space — the indefinite-metric problem.

The Gupta-Bleuler subsidiary condition

The resolution proposed independently by Gupta 1950 and Bleuler 1950 [Gupta 1950; Bleuler 1950] is to define physical states as the subspace satisfying

where is the positive-frequency (annihilation-operator) part of . The condition is weaker than the operator equation (which would be inconsistent with the canonical commutation relations); it picks out the subspace on which physical observables matter. On this subspace the longitudinal and timelike one-photon states pair into a null combination whose expectation values vanish, and the inner product is positive-semidefinite. Quotienting by the zero-norm states gives a positive-definite physical Hilbert space carrying only the two transverse photon modes — matching the Coulomb-gauge count.

Faddeev-Popov and the path integral

The modern derivation of the gauge-fixed Lagrangian, via the path integral, is due to Faddeev and Popov 1967 [Faddeev-Popov 1967]. The naive QED path integral overcounts by a factor of the volume of the gauge group, since gauge-equivalent configurations contribute identically. The Faddeev-Popov trick inserts the identity

for a gauge-fixing functional (such as for a fixed function ), where denotes the gauge-transform of and is the Faddeev-Popov determinant. For in the Abelian case, , which is field-independent, so is a constant absorbed into the path-integral normalisation. Averaging over the auxiliary function with Gaussian weight produces the gauge-fixed action with the -family propagator above. For QED the ghost fields decouple: no diagrams with ghost internal lines contribute to physical amplitudes. The procedure is the same in non-Abelian gauge theories — but there the gauge-fixing functional depends in an essential way on the gauge-transformed field, the Faddeev-Popov determinant becomes a functional of the gauge potential, and exponentiating it via Grassmann path integration over ghost fields produces a new term that interacts with the gauge field at loop level. Without these ghost contributions, non-Abelian gauge theory at one loop violates unitarity (the optical theorem); the ghosts are essential. The QED case is a degenerate limit in which the ghost machinery still applies but the ghosts happen not to contribute.

Key derivation Intermediate+

Theorem (Ward identity for QED). Let be a connected Feynman amplitude with one external photon of momentum and polarisation index , plus an arbitrary number of other on-shell external particles. Then

identically in , at any order in perturbation theory.

Proof. The Ward identity is the perturbative consequence of gauge invariance of the bare QED action. The functional derivation proceeds through the generating functional. Let be the generating functional of connected Green's functions, defined by

with the gauge-fixing term included to make the photon kinetic operator invertible. Under an infinitesimal local gauge transformation , , , the QED action is invariant by construction, the path-integral measure is invariant (no chiral anomalies arise in vector QED), the gauge-fixing term shifts by , and the source terms shift by . Demanding the invariance of under the change of integration variables produces the master Ward-Takahashi identity [Ward 1950; Takahashi 1957]:

Functional differentiation with respect to source fields and amputation of the external propagators converts this into identities among connected Green's functions. The on-shell projection — setting external particles on their mass shells and contracting with physical polarisation vectors — gives the Ward identity for S-matrix elements: for any amplitude with an external photon .

Corollary (gauge equivalence of physical amplitudes). Physical S-matrix elements are independent of the gauge parameter and identical in Coulomb and Lorenz gauges.

Proof of the -independence. The photon propagator's -dependent piece is . When contracted into an internal-photon-line insertion in a Feynman diagram, the factor multiplies the amplitude with one external photon at that vertex (after the internal line is "cut" by the standard Cutkosky-like manipulation); the Ward identity then forces that contracted amplitude to vanish on-shell. Therefore the -dependent piece contributes zero to any on-shell S-matrix element, and the amplitude is -independent. The argument extends to the Coulomb-to-Feynman gauge transformation by an analogous Ward-identity argument applied to the difference of propagators.

The Ward identity is also the technical reason why the photon-polarisation-completeness sum that appears in tree-level cross-section calculations can be replaced by :

The longitudinal and time-like terms in the full covariant completeness sum produce contributions proportional to on the amplitude, which the Ward identity kills. This substitution underlies every spin-and-polarisation sum in tree-level QED (Compton 12.12.03, Møller, Bhabha, bremsstrahlung) and is the cleanest worked example of the gauge-equivalence theorem in action.

Exercises Intermediate+

Lean formalization Intermediate+

Mathlib's coverage of the prerequisites is partial: Hilbert spaces, finite-dimensional inner-product spaces, the Lorentz-Minkowski quadratic form, and the abstract distributional framework on Schwartz space all exist in some form. What is missing is the physics-layer apparatus required to formalise covariant gauge fixing in QED:

  • An operator-valued-distribution model of the free Maxwell quantum field on a Krein space (indefinite-metric Hilbert space) carrying the four covariant photon modes, with commutation relations where is the Pauli-Jordan function.
  • The Coulomb-gauge transverse projector as an idempotent symmetric operator and the corresponding Helmholtz decomposition of vector fields, combined with the Fock-space construction over the resulting two-dimensional transverse fibre.
  • The Gupta-Bleuler subsidiary condition as a closed subspace projector, plus the proof that the inner product on the quotient is positive-definite.
  • The Faddeev-Popov determinant as a functional Jacobian for the gauge-group action, requiring infinite-dimensional Lie-group integration and (for the non-Abelian generalisation) Grassmann path integrals.
  • The Ward-Takahashi identity as a consequence of gauge invariance of the bare QED action, derivable from a Slavnov-Taylor functional differential equation.
  • The BRST operator as a nilpotent fermionic differential on the extended algebra of fields and ghosts, with the BRST cohomology at ghost number zero giving the physical Hilbert space; the Kugo-Ojima quartet mechanism showing that unphysical photon and ghost modes pair into BRST quartets that decouple from physical observables.

Each is a substantial Mathlib contribution. The unit ships lean_status: none. The aggregated upstream-Mathlib roadmap for QED quantum-field formalisation has this unit's Gupta-Bleuler and BRST infrastructure as one of its load-bearing sub-goals; without it, the gauge-equivalence theorem cannot be stated formally, and downstream units on Compton, Møller, Bhabha, the one-loop self-energy, the vertex correction, and the Lamb shift all rest on the same gauge-equivalence claim. The human-review surface is described in the unit metadata Mathlib gap analysis.

Advanced results Master

BRST cohomology as the modern foundation

The Becchi-Rouet-Stora 1974-1975 / Tyutin 1975 reformulation [Becchi-Rouet-Stora 1974; Tyutin 1975] of gauge-fixed gauge theory is the modern foundation of covariant-gauge quantisation. After Faddeev-Popov gauge fixing the action is no longer gauge-invariant, but it carries a global anticommuting symmetry: the BRST symmetry. On the extended field content — where are the Faddeev-Popov ghost and antighost, is the Nakanishi-Lautrup auxiliary field — the BRST operator acts as

and is nilpotent: . The Lagrangian can be written as for a gauge-fixing fermion , demonstrating that the gauge-fixing dependence is BRST-exact and therefore does not contribute to BRST-cohomology classes. The space of physical states is defined as the cohomology

restricted to the BRST-closed states modulo BRST-exact ones. For QED this construction reproduces the Gupta-Bleuler condition: BRST-closed states with ghost number zero are exactly the Gupta-Bleuler physical states. For non-Abelian gauge theories, BRST cohomology is the only known way to define the physical Hilbert space at the level of the operator algebra and forms the algebraic foundation of the Kugo-Ojima 1979 unitarity proof [Kugo-Ojima 1979] via the quartet mechanism: unphysical timelike and longitudinal photon modes pair with Faddeev-Popov ghost and antighost modes into quartets where is a longitudinal photon, and each quartet decouples from physical S-matrix elements by a BRST-exact-zero argument. The cohomological reformulation is the technical machinery behind the modern proofs of renormalisability and unitarity of the Standard Model.

The Slavnov-Taylor identity and renormalisation

The Ward-Takahashi identity has a non-Abelian generalisation: the Slavnov-Taylor identity [Slavnov 1972; Taylor 1971], which states that BRST invariance of the renormalised effective action gives functional relations among the Green's functions of the gauge, matter, and ghost fields. In QED the Slavnov-Taylor identity collapses to the Ward-Takahashi identity (because ghosts decouple). In Yang-Mills theory it gives substantive constraints among the three-gluon, four-gluon, and gluon-ghost vertices that protect the structure of the renormalised theory and reduce the apparent number of independent counterterms. 't Hooft 1971 [Faddeev-Popov 1967 cited above] used the Slavnov-Taylor identity to prove the renormalisability of non-Abelian gauge theories — the result that opened the door to the Standard Model and won 't Hooft and Veltman the 1999 Nobel Prize. The Slavnov-Taylor identity is the modern face of the gauge-symmetry constraint on the structure of perturbative quantum field theory.

Beyond perturbation theory: the Gribov ambiguity

Faddeev-Popov gauge fixing assumes that the gauge-fixing condition picks out exactly one representative from each gauge orbit. For Abelian gauge theories this is true (one exists for any starting potential, as shown in the Beginner Worked Example). For non-Abelian gauge theories this is false at the non-perturbative level: a given orbit may intersect the gauge-fixing surface multiple times, and the Faddeev-Popov determinant may vanish at some configurations — these are the Gribov copies discovered by Gribov 1978. The Gribov ambiguity is invisible in perturbation theory (it shows up only at field configurations with non-vanishing topology) but is essential to the non-perturbative quantisation of Yang-Mills theory and to the confinement problem. For QED, no Gribov ambiguity arises in any standard gauge; the Faddeev-Popov procedure is exact at all energies. This is one of the substantive differences between Abelian and non-Abelian gauge theories that the BLP treatment emphasises only briefly.

Pinch technique and gauge-invariant Green's functions

The off-shell Green's functions of gauge theories are gauge-dependent — the photon propagator depends on , the electron self-energy depends on , and so on. For some applications (effective-field-theory matching, the BFM background-field method, the running coupling defined off-shell) it is useful to construct gauge-invariant off-shell Green's functions. The pinch technique of Cornwall-Papavassiliou-Sirlin 1981-2009 reorganises the diagrammatic expansion by identifying and removing the gauge-dependent pieces of individual diagrams via a systematic algebraic procedure, producing gauge-invariant effective Green's functions term by term in perturbation theory. The pinch technique is equivalent to the background-field method in a particular gauge (Feynman background gauge), and the resulting gauge-invariant Green's functions satisfy ordinary Ward identities of the unbroken QED Ward-Takahashi form. The technique is used in precision electroweak phenomenology (e.g., the gauge-invariant definition of the -boson mass beyond tree level) and in non-perturbative QCD studies. For QED it is a luxury rather than a necessity; for the electroweak Standard Model it is essential.

Full proof set Master

The Ward identity stated in the Key Derivation has both a perturbative derivation (via the functional generating equation) and a non-perturbative derivation (via Noether's theorem for the global subgroup of the gauge group). The non-perturbative version is the cleanest. Let be the electric current; classical invariance of the QED action gives the local conservation law off-shell. Take an arbitrary connected Green's function

with external fields. The classical conservation law lifts to an operator identity inside the time-ordered product up to Schwinger contact terms that arise from the time-ordering being non-smooth across coincident points. Schematically, in the sense of distributions. Fourier-transforming and amputating the external propagators converts this into the Ward-Takahashi identity for vertex functions; setting the external momenta on-shell and contracting with physical polarisation vectors gives the Ward identity for S-matrix elements: .

The completeness of this derivation rests on three technical points. First, the path-integral measure must be invariant under the local gauge transformation. For vector QED in this holds; for chiral QED (involving couplings) the chiral anomaly of Adler-Bell-Jackiw 1969 violates the naive Ward identity, with consequences for decay, the cancellation of gauge anomalies in the Standard Model, and the Witten global anomaly in . Second, the renormalisation procedure must preserve the Ward identity — equivalent to the requirement that the regulariser be gauge-invariant. Dimensional regularisation in does this manifestly; Pauli-Villars regularisation does too (with careful sign assignments for the regulator fields); lattice regularisations can but require care (Wilson fermions break chiral symmetry; staggered fermions break flavour). Third, the on-shell projection must be consistent with the regulariser; for QED in dimensional regularisation this is automatic.

The Gupta-Bleuler subsidiary condition is a consistent restriction on physical states because commutes with itself at spacelike-separated points (a consequence of microcausality of the free Maxwell field) and with the interaction Hamiltonian (a consequence of current conservation ). The first commutation ensures that the subsidiary condition is preserved by the free time evolution; the second ensures that it is preserved by the interaction. Therefore the physical-state subspace is invariant under the full Heisenberg evolution, and the time-evolution of physical states stays physical. The same statement, in modern BRST language, is the assertion that BRST charge commutes with the Hamiltonian and that physical states are the cohomology classes on the appropriate ghost-number-zero subspace.

The gauge-equivalence theorem at the operator level (rather than the path-integral level used in the Key Derivation) is the assertion that the S-matrix in any consistent gauge is unitarily equivalent to the S-matrix in any other consistent gauge, when restricted to the physical-state subspace. This was proved for the QED Coulomb-vs-Lorenz comparison in Dirac 1955 [Dirac 1955] and Schwinger 1953; for the general non-Abelian comparison via BRST cohomology in Kugo-Ojima 1979 [Kugo-Ojima 1979]. The technical machinery is gauge-by-gauge LSZ reduction with appropriate canonical-transformation maps relating the in/out states in different gauges; the punchline is that physical observables (cross sections, decay rates, bound-state energies, anomalous magnetic moments) are gauge-independent.

Connections Master

  • Canonical quantum field theory 12.12.01 is the framework: the QED Lagrangian, the Dyson-series expansion, and the LSZ reduction. The gauge-fixing procedure of this unit is the step that turns the formal Lagrangian into a quantisable theory with well-defined photon propagator.

  • Dirac equation and relativistic spin 12.11.01 supplies the matter sector: four-component spinors, gamma-matrix algebra, the spin-completeness relation . The QED minimal coupling unifies the Dirac field with the gauge potential of this unit.

  • Free Dirac quantum field 12.05.05 supplies the operator-valued Dirac field with canonical anticommutation relations and the LSZ reduction formula. The Dirac field is the matter side; the gauge field of this unit is the radiation side; quantum-electrodynamics is the coupled system.

  • Bosonic Fock space 12.13.01 is the construction of the photon Fock space, with creation and annihilation operators satisfying on positive-definite metric, or on indefinite metric. The choice of metric distinguishes Coulomb-gauge unitarity from Lorenz-gauge covariant quantisation.

  • Compton scattering and the Klein-Nishina formula 12.12.03 uses the Lorenz-gauge photon-polarisation-completeness sum to convert the spin-and-polarisation sum into a manageable trace; the substitution is valid by the Ward identity derived here.

  • One-loop electron self-energy and mass renormalization 12.16.01 is the canonical worked example of the gauge equivalence theorem: the Coulomb-gauge and Feynman-gauge computations produce different intermediate Feynman-parameter integrals but the same physical mass shift, wave-function renormalisation, and running coupling.

  • One-loop QED vertex and anomalous magnetic moment 12.16.02 is most cleanly computed in Feynman gauge, where the photon propagator is ; the gauge equivalence theorem ensures the Schwinger result is gauge-independent.

  • Vacuum polarization at one loop and the Uehling potential 12.16.03 depends crucially on the Ward identity to enforce the transverse decomposition ; the photon stays massless to all orders in perturbation theory because a mass term is not gauge-invariant.

  • Non-Abelian gauge theory and Yang-Mills is the natural successor: the Faddeev-Popov machinery developed here for the Abelian case becomes essential rather than decorative for gauge theories, where interacting ghosts contribute to loop diagrams and are required for unitarity. The 't Hooft-Veltman 1971-1972 proof of renormalisability of non-Abelian gauge theories rests directly on the BRST machinery of this unit, generalised via the Slavnov-Taylor identity.

  • BRST cohomology in topological field theory is a downstream mathematical development. The nilpotent BRST operator and its cohomology classes were the inspiration for Witten's 1988 cohomological reformulation of topological quantum field theories; Donaldson and Seiberg-Witten invariants of four-manifolds are cohomology classes of BRST-like differentials on infinite-dimensional configuration spaces.

Historical & philosophical context Master

The gauge condition was written down by Ludvig Lorenz in 1867 [Lorenz 1867], a Danish physicist whose name is regularly mis-attributed to the better-known Hendrik Antoon Lorentz of Lorentz transformations and Lorentz force. Lorenz's 1867 paper "On the identity of the vibrations of light with electrical currents" predates Maxwell's 1873 Treatise on Electricity and Magnetism by six years and contains the covariant gauge condition as the natural choice for a relativistic wave equation for the electromagnetic potential. The historical confusion between Lorenz and Lorentz is documented in Jackson and Okun 2001 (Rev. Mod. Phys. 73, 663); the correct attribution is L. V. Lorenz 1867 for the gauge condition, and H. A. Lorentz for the transformation group.

Canonical quantisation of the electromagnetic field in Lorenz gauge was first attempted by Fermi 1932 [Fermi 1932] in his Reviews of Modern Physics survey of quantum electrodynamics. Fermi's treatment had several technical gaps: the gauge condition was imposed as a constraint operator equation, which is inconsistent with the canonical commutation relations, and the indefinite-metric problem was not addressed cleanly. The full resolution waited for the independent papers of Suraj Gupta in 1950 [Gupta 1950] and Konrad Bleuler in 1950 [Bleuler 1950], who introduced the indefinite-metric Hilbert space and the subsidiary condition that defines physical states. The Gupta-Bleuler quantisation is the historical foundation of all modern covariant treatments of gauge theories.

The Coulomb-gauge canonical quantisation was developed by Dirac 1955 [Dirac 1955] in his "Gauge-invariant formulation of quantum electrodynamics" paper, which gave the first manifestly unitary canonical treatment with explicit elimination of the gauge redundancy. Dirac's approach used the constraint analysis he had developed in 1949-1950 for general constrained Hamiltonian systems (the Dirac-Bergmann formalism), specialised to the QED case. Schwinger 1953 had given an essentially equivalent treatment in a slightly different formalism; the two approaches are now standard.

The path-integral derivation of gauge-fixed gauge theory was given by Ludvig Faddeev and Victor Popov in a 1967 Physics Letters B paper [Faddeev-Popov 1967] originally devoted to the quantisation of non-Abelian Yang-Mills theory. Their two-page paper introduced the Faddeev-Popov determinant, the ghost fields, and the modern covariant gauge-fixing prescription that became the standard technique in particle physics. For QED the Faddeev-Popov procedure reduces to the Gupta-Bleuler quantisation, with the ghost fields decoupling from physical amplitudes; for Yang-Mills theory the ghosts are essential. Faddeev's institute in Leningrad was the leading centre for mathematical aspects of gauge theory in the 1960s and 1970s; the 1967 Faddeev-Popov paper is one of the most cited in twentieth-century theoretical physics.

The BRST symmetry was discovered independently by Carlo Becchi, Alain Rouet, and Raymond Stora at Marseille in 1974-1976 [Becchi-Rouet-Stora 1974] and by Igor Tyutin at the Lebedev Institute in Moscow in 1975 [Tyutin 1975] (in a preprint that remained unpublished in Western journals for many years). The cohomological reformulation became the standard framework for unitarity proofs of gauge theories after the Kugo-Ojima 1979 paper [Kugo-Ojima 1979] established the quartet mechanism and the BRST-cohomological definition of physical states. The Slavnov-Taylor identity [Ward 1950; Takahashi 1957] and its non-Abelian generalisation by Slavnov 1972 and Taylor 1971 provided the technical machinery for the renormalisability proofs of 't Hooft-Veltman 1971-1972, which won the 1999 Nobel Prize.

The choice between Coulomb and Lorenz gauges reflects a structural feature of physics that is rarely articulated: the same physical theory can have multiple inequivalent canonical formulations, and the best formulation depends on the problem at hand. Atomic physics, where Coulomb binding is the dominant interaction and transverse photon emission is the perturbation, naturally uses Coulomb gauge; high-energy scattering, where Lorentz invariance is essential and the Feynman-rule machinery is the workhorse, naturally uses Lorenz / Feynman gauge. The gauge equivalence theorem guarantees that physical predictions are independent of this choice, but the computational cost of getting to those predictions can differ by orders of magnitude. The pedagogical literature reflects this division: textbooks aimed at atomic physicists (Bethe-Salpeter, Sakurai's Advanced Quantum Mechanics) emphasise Coulomb gauge; textbooks aimed at high-energy physicists (Peskin-Schroeder, Schwartz, Weinberg) emphasise Lorenz/Feynman gauge. Berestetskii-Lifshitz-Pitaevskii Volume 4 uniquely uses both fluently throughout the book, requiring the reader to translate between them — one reason the book is famously difficult to read cold but uniquely valuable as a unified treatment.

Bibliography Master

Primary literature:

  • Lorenz, L. V., "On the identity of the vibrations of light with electrical currents", Phil. Mag. 34 (1867), 287–301. [Originator: covariant gauge condition , predating Maxwell's preferred formulation.]

  • Fermi, E., "Quantum Theory of Radiation", Rev. Mod. Phys. 4 (1932), 87–132. [First canonical quantisation of QED in Lorenz gauge with a constraint approach.]

  • Gupta, S. N., "Theory of longitudinal photons in quantum electrodynamics", Proc. Phys. Soc. A 63 (1950), 681–691. [Originator: indefinite-metric quantisation in Lorenz gauge.]

  • Bleuler, K., "Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen", Helv. Phys. Acta 23 (1950), 567–586. [Originator: independent indefinite-metric quantisation.]

  • Dirac, P. A. M., "Gauge-invariant formulation of quantum electrodynamics", Can. J. Phys. 33 (1955), 650–660. [Originator: canonical Coulomb-gauge quantisation, manifestly unitary treatment of QED.]

  • Faddeev, L. D. & Popov, V. N., "Feynman diagrams for the Yang-Mills field", Phys. Lett. B 25 (1967), 29–30. [Originator: path-integral gauge fixing and ghost fields.]

  • Becchi, C., Rouet, A. & Stora, R., "The Abelian Higgs-Kibble model: unitarity of the S-operator", Phys. Lett. B 52 (1974), 344–346; Commun. Math. Phys. 42 (1975), 127–162; "Renormalization of gauge theories", Ann. Phys. 98 (1976), 287–321. [Originator: BRST cohomological reformulation.]

  • Tyutin, I. V., "Gauge invariance in field theory and statistical physics in operator formalism", Lebedev Institute preprint FIAN No. 39 (1975). [Originator: independent BRST construction.]

  • Ward, J. C., "An identity in quantum electrodynamics", Phys. Rev. 78 (1950), 182. [Originator: Ward identity for QED.]

  • Takahashi, Y., "On the generalised Ward identity", Nuovo Cim. 6 (1957), 371–375. [Originator: Ward-Takahashi identity.]

  • Kugo, T. & Ojima, I., "Local covariant operator formalism of non-Abelian gauge theories and quark confinement problem", Prog. Theor. Phys. Suppl. 66 (1979), 1–130. [BRST unitarity proof, quartet mechanism, BRST-cohomological definition of physical Hilbert space.]

  • Slavnov, A. A., "Ward identities in gauge theories", Theor. Math. Phys. 10 (1972), 99–104; Taylor, J. C., "Ward identities and charge renormalisation of the Yang-Mills field", Nucl. Phys. B 33 (1971), 436–444. [Slavnov-Taylor identity for non-Abelian gauge theories.]

  • Jackson, J. D. & Okun, L. B., "Historical roots of gauge invariance", Rev. Mod. Phys. 73 (2001), 663–680. [Historical clarification of Lorenz vs Lorentz attribution.]

Textbooks and monographs:

  • Berestetskii, V. B., Lifshitz, E. M., Pitaevskii, L. P., Quantum Electrodynamics, 2nd ed. (Pergamon, 1982), §§4, 76–77, 102. [Both Coulomb and Lorenz gauges used fluently throughout; explicit translation between them.]

  • Weinberg, S., The Quantum Theory of Fields, Vol. I: Foundations (Cambridge, 1995), §§8.5 (Coulomb gauge), 9.5 (covariant-gauge quantisation), 15.7 (BRST symmetry). [The modern axiomatic treatment.]

  • Peskin, M. E. & Schroeder, D. V., An Introduction to Quantum Field Theory (Westview / Addison-Wesley, 1995), §9.4 (functional quantisation of the EM field, Faddeev-Popov for Abelian and non-Abelian). [Modern graduate textbook treatment.]

  • Schwartz, M. D., Quantum Field Theory and the Standard Model (Cambridge, 2014), §§8, 14 (path-integral quantisation of gauge theories, gauge invariance of physical amplitudes). [Modern pedagogical synthesis.]

  • Itzykson, C. & Zuber, J.-B., Quantum Field Theory (McGraw-Hill, 1980), §3-2. [Franco-Russian-school treatment with explicit comparison of canonical and path-integral methods.]

  • Bjorken, J. D. & Drell, S. D., Relativistic Quantum Fields (McGraw-Hill, 1965), Ch. 14 (Maxwell-field quantisation in covariant gauge). [Pre-Faddeev-Popov canonical treatment.]

  • Henneaux, M. & Teitelboim, C., Quantisation of Gauge Systems (Princeton UP, 1992). [Comprehensive monograph on Dirac-Bergmann constraint analysis and BRST quantisation.]

  • Pokorski, S., Gauge Field Theories, 2nd ed. (Cambridge UP, 2000), Chs. 1–3. [Pedagogical treatment of gauge-fixing, BRST, and Slavnov-Taylor for the Standard Model.]