12.12.02 · quantum / canonical-qft

Coulomb gauge vs Lorenz gauge in QED

shipped3 tiersLean: nonepending prereqs

Anchor (Master): Berestetskii, Lifshitz, Pitaevskii, Quantum Electrodynamics, 2e (Pergamon 1982), §§4, 76–77, 102; Weinberg, The Quantum Theory of Fields, Vol. I (Cambridge 1995), §§8.5, 9.5; Itzykson & Zuber, Quantum Field Theory (McGraw-Hill 1980), §3-2

Intuition Beginner

Maxwell's equations have a redundancy built into them. The electric and magnetic fields are the directly measurable quantities, but the equations of quantum-electrodynamics are most cleanly written in terms of a four-component potential — the scalar potential and the vector potential together. The trouble is that two different potentials can give the same electric and magnetic fields: there is a one-function-per-spacetime-point shift you can perform on the potential (adding the gradient of an arbitrary function) that leaves the measurable fields invariant. This is a gauge symmetry, and two potentials related by such a shift describe the same physical situation.

For a classical theory this redundancy is a minor annoyance. For a quantum theory it is a structural problem. The standard quantisation recipe wants to identify canonical pairs of fields and momenta and impose commutation relations between them; a redundant degree of freedom has no genuine conjugate momentum, so the recipe fails. Gauge fixing is the procedure of cutting through the redundancy by imposing a constraint that selects one representative from each gauge orbit. Two such constraints dominate the textbook treatment of quantum-electrodynamics.

The first is the Coulomb gauge, which sets the spatial divergence of the vector potential to zero. It kills the longitudinal piece of the vector potential and leaves only the two transverse polarisations as propagating photons, with the instantaneous Coulomb interaction between charges appearing explicitly. It is the natural choice for atomic physics: when you compute the hydrogen spectrum or the Lamb shift, the dominant electron-proton binding is just the Coulomb attraction, and the Coulomb gauge puts it front and centre.

The price for choosing Coulomb gauge is that the gauge condition picks out the time direction. A Lorentz boost mixes time and space, so a frame's Coulomb gauge becomes a different frame's complicated gauge. Lorentz covariance holds in the theory but is no longer manifest in the formulas.

The second main choice is the Lorenz gauge (named after L. V. Lorenz 1867, not H. A. Lorentz of the transformation group), which sets a Lorentz-scalar combination of derivatives of the potential to zero. It is the natural choice for high-energy scattering: cross sections for electron-electron, electron-photon, and electron-positron collisions are Lorentz-invariant numbers, and a Lorentz-invariant gauge condition keeps the computations tidy.

The price for Lorenz gauge is that you have to quantise four photon polarisations — two transverse, one longitudinal, one time-like — three of which are unphysical. The unphysical pieces must cancel out of any observable quantity. Gupta and Bleuler showed in 1950 how to do this with an "indefinite metric" Hilbert space and a selection rule that defines physical states; Faddeev and Popov gave the modern path-integral version in 1967 with ghost fields.

Both choices describe the same physics. The differential cross section for Compton scattering, the energy levels of hydrogen, the anomalous magnetic moment of the electron — every measurable prediction of quantum-electrodynamics is the same in Coulomb gauge, in Lorenz gauge with any value of the gauge parameter, and in any other consistent gauge. This is the gauge equivalence theorem, and it is enforced by the Ward identity: a symmetry of the bare action that makes gauge-dependent pieces cancel between Feynman diagrams when external particles are placed on-shell. The choice of gauge is therefore a choice of formalism, not of physics. Choose the gauge that makes the computation you want to do as simple as possible.

Visual Beginner

Property	Coulomb gauge	Lorenz / Feynman gauge
Gauge condition	div(A) = 0	covariant: $d_{μ} A^{μ} = 0$
Propagating photon modes	2 transverse	4 covariant (2 transverse + 2 unphysical)
Scalar potential	Instantaneous Coulomb potential	Dynamical, time-like polarisation
Lorentz covariance	True but hidden	Manifest
Unitarity	Manifest	Restored by Gupta-Bleuler / BRST
Photon propagator (sketch)	transverse projector + instantaneous $1/ k$ -squared	covariant tensor with gauge parameter $ξ$
Preferred use	Atomic physics, bound states, Lamb shift	Scattering amplitudes, loop calculations, renormalisation
Generalises to non-abelian	No (no covariant analogue)	Yes (Faddeev-Popov ghosts essential)
Originator	Coulomb 1773 (implicit); Dirac 1955 (canonical)	Lorenz 1867; Fermi 1932; Gupta-Bleuler 1950

Worked example Beginner

Pick a concrete plane-wave photon and watch how the gauge choice constrains its polarisation.

Imagine a single plane-wave photon travelling in the $+ z$ direction with frequency $ω$ — wavevector along the $z$ -axis. The vector potential can be written as a sum of three independent polarisation directions: $x$ (transverse), $y$ (transverse), and $z$ (longitudinal). The scalar potential adds a fourth time-like direction. Before any gauge fixing, the photon carries four polarisation amplitudes, and the gauge freedom shifts among them.

In Coulomb gauge the requirement that the vector potential be divergence-free, combined with a plane-wave dependence in the $z$ -direction, forces the $z$ -component of the polarisation to vanish. The longitudinal piece is gone; only the $x$ - and $y$ -polarisation amplitudes survive as propagating degrees of freedom. The scalar potential is determined by the charge density via the instantaneous Coulomb law — it is not a free choice. The plane-wave photon has exactly two physical polarisation states, matching what an experimentalist measures when a polarising filter is rotated in front of a laser beam.

In Lorenz gauge all four polarisation amplitudes remain in the formalism. Two of them (the transverse $x$ - and $y$ -polarisations) are physical; the other two (a combination of the time-like and longitudinal directions) are unphysical. The Gupta-Bleuler subsidiary condition is a rule on physical states that forces the two unphysical amplitudes into a combination that gives zero norm — a state that has zero probability of appearing in any measurement. After projecting onto physical states, the photon again has two physical polarisations, just as in Coulomb gauge.

Sanity check: count photons. In a beam of $N$ photons per second, both gauges predict $N$ events per second in a detector, with two independent polarisation channels per photon. The two gauges agree on what an experiment sees. They disagree only on the intermediate book-keeping — which is purely a matter of how the formalism is organised.

What this tells us: the gauge choice does not change the photon. It changes how many bookkeeping variables you carry around to describe the photon. Coulomb gauge carries the minimum number (two transverse modes plus an instantaneous Coulomb potential). Lorenz gauge carries the full four-vector and then projects out the unphysical pieces at the end. Same physics, different formalism.

Check your understanding Beginner

Exercise (easy, multiple choice).

Which of the following is the gauge condition that defines the Coulomb gauge?

A. covariant Lorenz condition: $d_{μ} A^{μ} = 0$ B. the scalar potential is set to zero C. spatial divergence of the vector potential equals zero D. the entire vector potential equals zero

Hint

The Coulomb gauge is named after the gauge condition that makes the instantaneous Coulomb interaction explicit in atomic-physics calculations.

Answer

C. Setting the spatial divergence of the vector potential to zero is the Coulomb-gauge condition, also called the radiation gauge or transverse gauge. Option A is the Lorenz gauge (covariant). Option B is the temporal or Weyl gauge (rarely used in QED). Option D would kill all photon dynamics — it is not a gauge condition but a physical restriction.

Exercise (medium, multiple choice).

In Lorenz-gauge quantum-electrodynamics, the photon has four polarisation modes, but only two are physical. Which two are unphysical, and how are they removed?

A. Two transverse modes; removed by parity B. Longitudinal and time-like modes; removed by the Gupta-Bleuler condition or by Faddeev-Popov ghosts C. Helicity $\pm 1$ modes; removed by charge conjugation D. All four are physical; no removal needed

Hint

The two physical photon polarisations are transverse to the momentum. The other two are scalar combinations of the time-like and longitudinal directions.

Answer

B. The two transverse modes are the physical photons (helicity $\pm 1$ ). The longitudinal polarisation (along the propagation direction) and the time-like polarisation (along the time axis) are unphysical: they contribute to internal lines of Feynman diagrams in covariant gauge, but cancel from physical S-matrix elements. The cancellation is implemented in operator quantisation by the Gupta-Bleuler subsidiary condition (a rule selecting physical states), and in path-integral quantisation by the Faddeev-Popov ghost fields that contribute opposite-sign loops cancelling unphysical photon-loop contributions. In quantum-electrodynamics (Abelian) the ghosts decouple from physical amplitudes; for non-Abelian gauge theories they are interacting and essential to unitarity.

Formal definition Intermediate+

Work in natural units $ℏ = c = 1$ with the mostly-minus metric $g_{μν} = diag (+ 1, - 1, - 1, - 1)$ . The free electromagnetic Lagrangian is

L_{EM} = - \frac{1}{4} F_{μν} F^{μν}, F_{μν} = \partial_{μ} A_{ν} - \partial_{ν} A_{μ} .

The Euler-Lagrange equations give the source-free Maxwell equations $\partial_{μ} F^{μν} = 0$ . Coupling to a Dirac field 12.11.01 via the minimal-coupling prescription $\partial_{μ} \to D_{μ} = \partial_{μ} + i e A_{μ}$ produces the full QED Lagrangian

L_{QED} = - \frac{1}{4} F_{μν} F^{μν} + \overset{ˉ}{ψ} (i γ^{μ} D_{μ} - m) ψ = - \frac{1}{4} F_{μν} F^{μν} + \overset{ˉ}{ψ} (i \slashed \partial - m) ψ - e \overset{ˉ}{ψ} γ^{μ} ψ A_{μ},

with the electron-photon vertex $- e \overset{ˉ}{ψ} γ^{μ} ψ A_{μ}$ as the unique gauge-invariant interaction at lowest order. The local gauge symmetry is the simultaneous transformation

A_{μ} (x) \to A_{μ} (x) + \partial_{μ} Λ (x), ψ (x) \to e^{- i e Λ (x)} ψ (x), \overset{ˉ}{ψ} (x) \to e^{+ i e Λ (x)} \overset{ˉ}{ψ} (x),

under which $F_{μν}$ is invariant, the kinetic term $\overset{ˉ}{ψ} i \slashed \partial ψ$ shifts by $- e \overset{ˉ}{ψ} (\slashed \partial Λ) ψ$ , and the interaction term $- e \overset{ˉ}{ψ} \slashed A ψ$ shifts by exactly the opposite amount, so the full Lagrangian is invariant. The corresponding global symmetry ( $Λ =$ constant) is the $U (1)$ electron-number symmetry whose Noether current $j^{μ} = e \overset{ˉ}{ψ} γ^{μ} ψ$ is the electromagnetic current sourcing $A_{μ}$ .

The gauge-fixing problem

Canonical quantisation requires identifying the canonical momentum conjugate to each field. For the spatial components,

Π^{i} = \frac{\partial L _{EM}}{\partial ( \partial _{t} A _{i} )} = - F^{0 i} = E^{i},

so the spatial electric field is conjugate to the spatial vector potential. But for $A^{0}$ ,

Π^{0} = \frac{\partial L _{EM}}{\partial ( \partial _{t} A _{0} )} = 0,

since $F^{00} = 0$ identically. This is the constraint that prevents naive canonical quantisation: $A^{0}$ has no conjugate momentum, hence no canonical commutation relation, hence is not a dynamical degree of freedom. The Euler-Lagrange equation for $A^{0}$ gives the Gauss-law constraint $\nabla \cdot E = ρ$ (in the presence of charge density $ρ$ ), which similarly involves no time derivatives and so is a constraint rather than an equation of motion. The Dirac-Bergmann analysis of constrained Hamiltonian systems classifies these as first-class constraints whose generators are the components of the gauge group's Lie algebra acting on the field configuration. Quantisation requires either reducing to a non-redundant set of physical variables (the Coulomb-gauge route) or extending the field content with auxiliary modes whose unphysical contributions cancel out of physical states (the Lorenz / covariant-gauge route).

Coulomb gauge

Impose $\nabla \cdot A = 0$ . Helmholtz-decompose any vector field $V$ into transverse and longitudinal parts $V = V_{⊥} + V_{∥}$ with $\nabla \cdot V_{⊥} = 0$ and $\nabla \times V_{∥} = 0$ ; the Coulomb gauge sets $A_{∥} = 0$ . The Gauss-law constraint $\nabla \cdot E = ρ$ together with $E = - \nabla A^{0} - \partial_{t} A$ and $\nabla \cdot A = 0$ becomes

\nabla^{2} A^{0} = - ρ ⟹ A^{0} (x, t) = \int d^{3} x^{'} \frac{ρ ( x ^{'} , t )}{4 π ∣ x - x ^{'} ∣},

the instantaneous Coulomb potential of the charge distribution. $A^{0}$ is determined non-dynamically by the charge density at the same time; it is not an independent degree of freedom and does not need a canonical commutation relation. The dynamical content of the photon field is carried entirely by the two transverse components of $A$ , mode-expanded as

\hat{A}_{⊥} (x) = \int \frac{d ^{3} k}{( 2 π ) ^{3}} \frac{1}{2∣ k ∣} λ = 1, 2 \sum [a_{k}^{λ} ϵ^{λ} (k) e^{- ik \cdot x} + a_{k}^{λ †} ϵ^{λ *} (k) e^{+ ik \cdot x}],

with $k \cdot ϵ^{λ} (k) = 0$ (transversality) and $ϵ^{λ} (k) \cdot ϵ^{λ^{'}} (k) = δ^{λ λ^{'}}$ (orthonormality). The mode operators satisfy the standard bosonic commutation relations $[a_{k}^{λ}, a_{k^{'}}^{λ^{'} †}] = (2 π)^{3} δ^{(3)} (k - k^{'}) δ^{λ λ^{'}}$ on the bosonic Fock space 12.13.01, and the resulting photon Hamiltonian $\hat{H} = \int d^{3} k / (2 π)^{3} ∣ k ∣ a_{k}^{λ †} a_{k}^{λ}$ is positive on a positive-definite Hilbert space — the Coulomb-gauge quantisation is manifestly unitary.

The Coulomb-gauge photon propagator is most efficiently written in mixed coordinate-momentum form. The transverse projector is $P_{⊥}^{ij} (k) = δ^{ij} - k^{i} k^{j} /∣ k ∣^{2}$ , and

D_{Coul}^{ij} (k) = \frac{- i}{k ^{2} + i ϵ} (δ^{ij} - \frac{k ^{i} k ^{j}}{∣ k ∣ ^{2}}), D_{Coul}^{00} (k) = \frac{i}{∣ k ∣ ^{2}}, D_{Coul}^{0 i} (k) = 0.

The $D^{00}$ piece is the instantaneous Coulomb interaction in momentum space: it propagates without a $k^{0}$ pole, reflecting that $A^{0}$ is non-dynamical. The full structure is awkward to manipulate in covariant calculations because the transverse projector explicitly breaks Lorentz covariance, but it is precisely the structure that makes the Coulomb interaction explicit in atomic physics.

Lorenz gauge and the $ξ$ -family

Impose $\partial_{μ} A^{μ} = 0$ . The cleanest implementation is to modify the Lagrangian by adding a gauge-fixing term

L_{GF} = - \frac{1}{2 ξ} (\partial_{μ} A^{μ})^{2},

with $ξ$ a real positive parameter. The full Lagrangian $L_{EM} + L_{GF}$ has no gauge invariance, but the variational equation now produces $\partial^{2} A_{μ} - (1 - 1/ ξ) \partial_{μ} \partial_{ν} A^{ν} = 0$ , which together with the gauge condition $\partial_{μ} A^{μ} = 0$ reduces to the four decoupled wave equations $\partial^{2} A_{μ} = 0$ . Now every component of $A_{μ}$ has a non-zero canonical momentum, canonical quantisation is straightforward, and the photon propagator follows by inverting the kinetic operator:

D_{μν}^{ξ} (k) = \frac{- i}{k ^{2} + i ϵ} [g_{μν} - (1 - ξ) \frac{k _{μ} k _{ν}}{k ^{2}}] .

The parameter $ξ$ labels a one-parameter family of gauges within the Lorenz class:

$ξ = 1$ is Feynman gauge: $D_{μν} (k) = - i g_{μν} / k^{2}$ , the simplest possible form, with no $k_{μ} k_{ν}$ term.
$ξ = 0$ is Landau gauge (the strict Lorenz limit): $D_{μν} (k) = - i (g_{μν} - k_{μ} k_{ν} / k^{2}) / k^{2}$ , manifestly transverse in spacetime indices.
$ξ \to \infty$ is unitary gauge (formal limit, useful for spontaneously broken gauge theories).

The gauge-fixed quantisation makes all four components of $A_{μ}$ dynamical, with mode expansion

\hat{A}_{μ} (x) = \int \frac{d ^{3} k}{( 2 π ) ^{3}} \frac{1}{2∣ k ∣} λ = 0 \sum 3 [a_{k}^{λ} ϵ_{μ}^{λ} (k) e^{- ik \cdot x} + a_{k}^{λ †} ϵ_{μ}^{λ *} (k) e^{+ ik \cdot x}]

over four covariant polarisation vectors $ϵ_{μ}^{λ}$ : two transverse ( $λ = 1, 2$ ), one longitudinal ( $λ = 3$ , $ϵ^{3} \propto k$ ), and one time-like ( $λ = 0$ , $ϵ^{0} \propto (1, 0)$ ). The commutation relations now read

[\hat{A}_{μ} (x), \hat{A}_{ν} (y)] = - i g_{μν} D (x - y),

with the metric tensor appearing explicitly. The minus sign in $g_{00} = + 1, g_{ii} = - 1$ produces a sign-indefinite inner product on Fock space — the indefinite-metric problem.

The Gupta-Bleuler subsidiary condition

The resolution proposed independently by Gupta 1950 and Bleuler 1950 ^{[Gupta 1950; Bleuler 1950]} is to define physical states as the subspace satisfying

\partial_{μ} \hat{A}^{(+) μ} (x) ∣ ψ_{phys} ⟩ = 0,

where $\hat{A}^{(+) μ}$ is the positive-frequency (annihilation-operator) part of $\hat{A}^{μ}$ . The condition is weaker than the operator equation $\partial_{μ} \hat{A}^{μ} = 0$ (which would be inconsistent with the canonical commutation relations); it picks out the subspace on which physical observables matter. On this subspace the longitudinal and timelike one-photon states pair into a null combination whose expectation values vanish, and the inner product is positive-semidefinite. Quotienting by the zero-norm states gives a positive-definite physical Hilbert space carrying only the two transverse photon modes — matching the Coulomb-gauge count.

Faddeev-Popov and the path integral

The modern derivation of the gauge-fixed Lagrangian, via the path integral, is due to Faddeev and Popov 1967 ^{[Faddeev-Popov 1967]}. The naive QED path integral $Z = \int D A D \overset{ˉ}{ψ} D ψ e^{i S [A, ψ]}$ overcounts by a factor of the volume of the gauge group, since gauge-equivalent configurations contribute identically. The Faddeev-Popov trick inserts the identity

1 = \int D Λ δ [G (A^{Λ})] Δ_{FP} [A],

for a gauge-fixing functional $G$ (such as $G (A) = \partial_{μ} A^{μ} - ω$ for a fixed function $ω$ ), where $A^{Λ}$ denotes the gauge-transform of $A$ and $Δ_{FP} [A] = det (δ G (A^{Λ}) / δ Λ) ∣_{Λ = 0}$ is the Faddeev-Popov determinant. For $G = \partial_{μ} A^{μ}$ in the Abelian case, $δ G (A^{Λ}) / δ Λ = \partial_{μ} \partial^{μ} = \partial^{2}$ , which is field-independent, so $Δ_{FP} = det (\partial^{2})$ is a constant absorbed into the path-integral normalisation. Averaging over the auxiliary function $ω$ with Gaussian weight $exp [- i \int d^{4} x ω^{2} / (2 ξ)]$ produces the gauge-fixed action with the $ξ$ -family propagator above. For QED the ghost fields decouple: no diagrams with ghost internal lines contribute to physical amplitudes. The procedure is the same in non-Abelian gauge theories — but there the gauge-fixing functional depends in an essential way on the gauge-transformed field, the Faddeev-Popov determinant becomes a functional of the gauge potential, and exponentiating it via Grassmann path integration over ghost fields $\overset{c}{ˉ}, c$ produces a new term $L_{ghost} = - \overset{c}{ˉ} \partial_{μ} D^{μ} c$ that interacts with the gauge field at loop level. Without these ghost contributions, non-Abelian gauge theory at one loop violates unitarity (the optical theorem); the ghosts are essential. The QED case is a degenerate limit in which the ghost machinery still applies but the ghosts happen not to contribute.

Key derivation Intermediate+

Theorem (Ward identity for QED). Let $M^{μ} (k, \dots)$ be a connected Feynman amplitude with one external photon of momentum $k$ and polarisation index $μ$ , plus an arbitrary number of other on-shell external particles. Then

k_{μ} M^{μ} (k, \dots) = 0

identically in $k$ , at any order in perturbation theory.

Proof. The Ward identity is the perturbative consequence of $U (1)$ gauge invariance of the bare QED action. The functional derivation proceeds through the generating functional. Let $Z [J, \overset{η}{ˉ}, η]$ be the generating functional of connected Green's functions, defined by

Z [J, \overset{η}{ˉ}, η] = \int D A D \overset{ˉ}{ψ} D ψ exp {i \int d^{4} x [L_{QED} + L_{GF} + J_{μ} A^{μ} + \overset{η}{ˉ} ψ + \overset{ˉ}{ψ} η]},

with the gauge-fixing term included to make the photon kinetic operator invertible. Under an infinitesimal local $U (1)$ gauge transformation $A_{μ} \to A_{μ} + \partial_{μ} Λ$ , $ψ \to (1 - i e Λ) ψ$ , $\overset{ˉ}{ψ} \to (1 + i e Λ) \overset{ˉ}{ψ}$ , the QED action $\int L_{QED}$ is invariant by construction, the path-integral measure is invariant (no chiral anomalies arise in vector QED), the gauge-fixing term shifts by $- ξ^{- 1} (\partial^{μ} A_{μ}) (\partial^{ν} \partial_{ν} Λ)$ , and the source terms shift by $J^{μ} \partial_{μ} Λ + \overset{η}{ˉ} (- i e Λ) ψ + \overset{ˉ}{ψ} (i e Λ) η$ . Demanding the invariance of $Z [J, \overset{η}{ˉ}, η]$ under the change of integration variables produces the master Ward-Takahashi identity ^{[Ward 1950; Takahashi 1957]}:

\partial_{μ} J^{μ} (x) + \frac{1}{ξ} \partial^{2} \partial^{μ} \frac{δ Z}{i δ J ^{μ} ( x )} - i e \overset{η}{ˉ} (x) \frac{δ Z}{i δ η ˉ ( x )} + i e \frac{δ Z}{- i δ η ( x )} η (x) = 0.

Functional differentiation with respect to source fields and amputation of the external propagators converts this into identities among connected Green's functions. The on-shell projection — setting external particles on their mass shells and contracting with physical polarisation vectors — gives the Ward identity for S-matrix elements: $k_{μ} M^{μ} = 0$ for any amplitude with an external photon $k$ .

Corollary (gauge equivalence of physical amplitudes). Physical S-matrix elements are independent of the gauge parameter $ξ$ and identical in Coulomb and Lorenz gauges.

Proof of the $ξ$ -independence. The photon propagator's $ξ$ -dependent piece is $- i (1 - ξ) k_{μ} k_{ν} / (k^{2})^{2}$ . When contracted into an internal-photon-line insertion in a Feynman diagram, the $k_{μ}$ factor multiplies the amplitude with one external photon at that vertex (after the internal line is "cut" by the standard Cutkosky-like manipulation); the Ward identity then forces that contracted amplitude to vanish on-shell. Therefore the $ξ$ -dependent piece contributes zero to any on-shell S-matrix element, and the amplitude is $ξ$ -independent. The argument extends to the Coulomb-to-Feynman gauge transformation by an analogous Ward-identity argument applied to the difference of propagators. □

The Ward identity is also the technical reason why the photon-polarisation-completeness sum that appears in tree-level cross-section calculations can be replaced by $- g_{μν}$ :

λ = 1, 2 \sum ϵ_{λ}^{μ} (k) ϵ_{λ}^{* ν} (k) \to - g^{μν} (on-shell, on amplitudes satisfying k_{μ} M^{μ} = 0) .

The longitudinal and time-like terms in the full covariant completeness sum produce contributions proportional to $k^{μ}$ on the amplitude, which the Ward identity kills. This substitution underlies every spin-and-polarisation sum in tree-level QED (Compton 12.12.03, Møller, Bhabha, bremsstrahlung) and is the cleanest worked example of the gauge-equivalence theorem in action.

Exercises Intermediate+

Exercise 1 (easy, symbolic).

Verify that the Feynman gauge ( $ξ = 1$ ) photon propagator $D_{μν} (k) = - i g_{μν} / k^{2}$ is the inverse of the kinetic operator obtained from $L_{EM} + L_{GF}^{ξ = 1}$ .

Hint

Integrate by parts to write $L_{EM}$ as $A_{μ} K^{μν} A_{ν} /2$ , add $- (\partial_{μ} A^{μ})^{2} /2$ from the gauge-fixing term, and identify $K^{μν}$ .

Answer

$L_{EM} = - F_{μν} F^{μν} /4 = - (\partial_{μ} A_{ν} - \partial_{ν} A_{μ}) (\partial^{μ} A^{ν} - \partial^{ν} A^{μ}) /4 = - [\partial_{μ} A_{ν} \partial^{μ} A^{ν} - \partial_{μ} A_{ν} \partial^{ν} A^{μ}] /2$ . Integrating by parts (boundary terms drop), this becomes $A^{μ} [g_{μν} \partial^{2} - \partial_{μ} \partial_{ν}] A^{ν} /2$ . Adding the $ξ = 1$ gauge-fixing term $- (\partial^{μ} A_{μ})^{2} /2 = + A^{μ} \partial_{μ} \partial_{ν} A^{ν} /2$ cancels the cross term, leaving $L_{tot} = A^{μ} g_{μν} \partial^{2} A^{ν} /2$ . The kinetic operator is $K_{μν} = g_{μν} \partial^{2}$ ; its momentum-space inverse satisfies $K_{μν} (k) D^{ν ρ} (k) = i δ_{μ}^{ρ}$ , giving $D^{ν ρ} (k) = - i g^{ν ρ} / k^{2}$ , which is the Feynman-gauge propagator. The $ξ = 1$ choice is uniquely simple because the gauge-fixing exactly cancels the non-invertible piece $\partial_{μ} \partial_{ν}$ of the unfixed kinetic operator.

Exercise 2 (easy, symbolic).

Show that the general $ξ$ -gauge photon propagator $D_{μν}^{ξ} (k) = - i [g_{μν} - (1 - ξ) k_{μ} k_{ν} / k^{2}] / k^{2}$ reduces to the Feynman gauge at $ξ = 1$ and to the Landau gauge $D_{μν}^{0} (k) = - i (g_{μν} - k_{μ} k_{ν} / k^{2}) / k^{2}$ at $ξ = 0$ .

Hint

Substitute $ξ = 1$ and $ξ = 0$ into the bracket.

Answer

At $ξ = 1$ : $g_{μν} - 0 \cdot k_{μ} k_{ν} / k^{2} = g_{μν}$ , giving $D_{μν}^{1} = - i g_{μν} / k^{2}$ . At $ξ = 0$ : $g_{μν} - 1 \cdot k_{μ} k_{ν} / k^{2}$ , giving $D_{μν}^{0} = - i (g_{μν} - k_{μ} k_{ν} / k^{2}) / k^{2}$ . The Landau-gauge propagator is transverse in the spacetime sense: $k^{μ} D_{μν}^{0} (k) = - i [k_{ν} - (k^{2} / k^{2}) k_{ν}] / k^{2} = 0$ , which makes the gauge condition $\partial_{μ} A^{μ} = 0$ exact at the level of the propagator. The Feynman gauge has the simplest tensor structure but a non-transverse propagator; the gauge equivalence theorem ensures both give the same physical amplitudes.

Exercise 3 (medium, symbolic).

Show that the difference between the Feynman-gauge and Landau-gauge photon propagators, $Δ D_{μν} (k) = D_{μν}^{1} (k) - D_{μν}^{0} (k) = - i k_{μ} k_{ν} / (k^{2})^{2}$ , contributes zero to any on-shell QED amplitude with external photons, by the Ward identity.

Hint

When $Δ D_{μν}$ is contracted with the QED vertices at the endpoints of an internal photon line, the $k_{μ} k_{ν}$ structure factors out. Cut the diagram at the internal photon line and apply the Ward identity to each fragment.

Answer

An internal photon line connecting two vertices in a Feynman diagram contributes the factor $D_{μν} (k) \cdot J^{μ} (k) J^{ν} (- k)$ schematically, where $J^{μ}, J^{ν}$ are the QED currents at the two vertices and $k$ is the internal photon momentum. The $Δ D_{μν}$ piece gives $Δ D \cdot J J \to - i (k_{μ} J^{μ}) (k_{ν} J^{ν}) / (k^{2})^{2}$ . By the Ward identity at off-shell internal lines — which follows from the Ward-Takahashi identity for the QED current $\partial_{μ} J^{μ} = 0$ classically and is enforced perturbatively by gauge invariance of the bare action — $k_{μ} J^{μ} = 0$ for currents attached to physical (on-shell) external particles. Therefore $Δ D \cdot J J = 0$ on-shell, and the Feynman-gauge and Landau-gauge amplitudes are equal. This is the simplest case of the gauge equivalence theorem: pairwise comparison of two gauges in the $ξ$ -family.

Exercise 4 (medium, symbolic).

In Coulomb gauge, write the QED Hamiltonian as the sum of a free-photon transverse-mode Hamiltonian and an instantaneous Coulomb interaction between charges. Show that the instantaneous Coulomb term is

\hat{H}_{Coul} = \frac{1}{2} \int d^{3} x d^{3} x^{'} \frac{ρ ^ ( x ) ρ ^ ( x ^{'} )}{4 π ∣ x - x ^{'} ∣},

with $\overset{ρ}{^} = e \hat{ψ}^{†} \hat{ψ}$ the electric charge density.

Hint

Eliminate $A^{0}$ via the Gauss-law constraint $\nabla^{2} A^{0} = - ρ$ (so $A^{0} (x) = \int d^{3} x^{'} ρ (x^{'}) / (4 π ∣ x - x^{'} ∣)$ ) and substitute into $L$ .

Answer

The QED Hamiltonian density in Coulomb gauge contains the term $ρ A^{0} /2$ (factor of $1/2$ because the $A^{0}$ field is itself proportional to $ρ$ , so the interaction is quadratic in the charge). Substituting the Gauss-law solution gives $ρ (x) A^{0} (x) /2 = ρ (x) \int d^{3} x^{'} ρ (x^{'}) / (4 π ∣ x - x^{'} ∣) /2$ , and integrating over $x$ gives the symmetric two-body Coulomb interaction. The transverse photons couple separately to the transverse current via the standard $- e \overset{ˉ}{ψ} γ^{i} ψ A_{⊥}^{i}$ vertex. The Hamiltonian splits cleanly: $\hat{H} = \hat{H}_{Dirac, free} + \hat{H}_{photon, transverse} + \hat{H}_{Coul} + \hat{H}_{transverse, coupling}$ . In atomic physics the Coulomb piece is the dominant electron-proton binding, and the transverse photons mediate emission and absorption; the Coulomb-gauge Hamiltonian makes this structure manifest. The same physics in Feynman gauge would emerge after substantial cancellations between time-like and longitudinal photon-exchange contributions; the Coulomb-gauge formulation is the reason atomic-physics textbooks use it.

Exercise 5 (medium, multiple choice).

Which of the following is not a consequence of the Ward identity in QED?

A. Physical S-matrix elements are independent of the gauge parameter $ξ$ B. The photon remains massless to all orders in perturbation theory C. The renormalisation constants satisfy $Z_{1} = Z_{2}$ D. The electron magnetic moment is exactly $g = 2$ to all orders

Hint

The first three are direct consequences of gauge invariance of the QED action. The fourth is a tree-level prediction that receives loop corrections.

Answer

D. The Ward identity does not fix the electron magnetic moment; it fixes only the charge $F_{1} (0) = 1$ exactly via $Z_{1} = Z_{2}$ (statement C). The magnetic-moment form factor $F_{2} (0)$ is not constrained by the Ward identity and receives the famous Schwinger correction $F_{2} (0) = α / (2 π)$ at one loop 12.16.02. Statements A, B, and C are all consequences of gauge invariance: A by the argument given in the Key Derivation; B because a photon-mass term $\frac{1}{2} m_{γ}^{2} A_{μ} A^{μ}$ is not gauge-invariant (under $A_{μ} \to A_{μ} + \partial_{μ} Λ$ it shifts by $\partial_{μ} Λ (\dots) \neq = 0$ ), and the Ward identity protects masslessness against quantum corrections — the Schwinger model in $d = 2$ is an exception where the gauge field acquires a mass via spontaneous symmetry breaking; C because the Ward identity relates the vertex and self-energy renormalisations exactly: $Z_{1} = Z_{2}$ .

Exercise 6 (hard, symbolic).

The one-loop electron self-energy $- i Σ_{2} (\slashed p)$ in QED can be computed in Coulomb gauge or in Feynman gauge. In Feynman gauge the result is $Σ_{2}^{Feyn} (\slashed p) = (α /4 π) [A m + B (\slashed p - m)]$ with $A, B$ UV-divergent constants set by the on-shell renormalisation conditions. Sketch why the Coulomb-gauge computation produces a different split between mass renormalisation and wave-function renormalisation but the same values of $δ m = m - m_{0}$ and the same renormalised running coupling.

Hint

The mass shift $δ m$ and the renormalised running coupling are physical observables (the electron's pole mass and the QED beta function); the off-shell wave-function residue $Z_{2}$ is not directly physical. The Ward identity forces $Z_{1} = Z_{2}$ in any gauge, and the physical mass shift $δ m = Σ (\slashed p = m)$ is the value of the self-energy on the mass shell.

Answer

In Feynman gauge the photon propagator is $- i g_{μν} / k^{2}$ ; the one-loop self-energy integral is the standard Feynman-parameter calculation yielding $Σ_{2}^{Feyn} (\slashed p)$ . In Coulomb gauge the propagator splits into a transverse spatial piece $D_{⊥}^{ij} = - i (δ^{ij} - k^{i} k^{j} /∣ k ∣^{2}) / k^{2}$ and an instantaneous time-like piece $D^{00} = i /∣ k ∣^{2}$ , plus zero cross-terms. The vertex factors $γ^{μ}$ split correspondingly into $γ^{0}$ and $γ^{i}$ .

The two pieces of the propagator generate distinct contributions to the self-energy: the time-like piece $γ^{0} (\slashed p - \slashed k + m) γ^{0} / [(p - k)^{2} - m^{2}]$ integrated against the instantaneous $1/∣ k ∣^{2}$ and the spatial piece $γ^{i} (\slashed p - \slashed k + m) γ^{j} (δ^{ij} - k^{i} k^{j} /∣ k ∣^{2}) / [(p - k)^{2} - m^{2} \cdot k^{2}]$ . Individually these integrals are not Lorentz-covariant (the $∣ k ∣^{2}$ in the denominators breaks manifest covariance) and have different UV-divergent structure than the Feynman-gauge counterpart.

Their sum, after expansion in $\slashed p - m$ and projection onto the on-shell renormalisation condition $Σ (\slashed p = m) = δ m$ and $\partial Σ/ \partial \slashed p ∣_{\slashed p = m} = - δ_{2}$ , gives the same value of $δ m$ and of the running coupling $α (μ^{2})$ . The off-shell wave-function residues $Z_{2}^{Coul}$ and $Z_{2}^{Feyn}$ are themselves gauge-dependent — but the Ward identity $Z_{1} = Z_{2}$ holds in every gauge, so the renormalised charge $e_{R} = e_{0} Z_{3} \cdot Z_{2} / Z_{1}$ depends only on photon-side physics ( $Z_{3}$ , the photon-wave-function renormalisation from vacuum polarisation) and is gauge-independent. The detailed gauge-by-gauge breakdown is in Bjorken-Drell §8.5 and in Itzykson-Zuber §10-2; the bottom line is that the physical observables agree exactly while the intermediate counterterm structures differ.

Exercise 7 (hard, symbolic).

State and prove the Gupta-Bleuler condition $\partial_{μ} \hat{A}^{(+) μ} ∣ ψ_{phys} ⟩ = 0$ as a consistent subsidiary condition on covariantly quantised QED. Show that the resulting physical Hilbert space carries a positive-semidefinite inner product whose quotient by zero-norm states is positive-definite.

Hint

The longitudinal and time-like one-photon states have indefinite norm in covariant quantisation, but the Gupta-Bleuler condition projects onto a subspace where their contributions cancel pairwise. Compute the norm of a generic one-photon state $\sum_{λ} c_{λ} ∣ k, λ ⟩$ and check that the Gupta-Bleuler condition forces the time-like and longitudinal amplitudes to be equal up to a sign.

Answer

In covariant quantisation the four photon polarisation modes have commutators $[a_{k}^{λ}, a_{k^{'}}^{λ^{'} †}] = - g^{λ λ^{'}} (2 π)^{3} δ^{(3)} (k - k^{'})$ , so the time-like mode ( $λ = 0$ ) has positive sign while the three spatial modes have negative sign — the indefinite-metric structure. The Gupta-Bleuler condition $\partial_{μ} \hat{A}^{(+) μ} ∣ ψ ⟩ = 0$ contracts the four-momentum $k^{μ}$ with the polarisation vector at the level of states: $k^{μ} ϵ_{μ}^{λ} (k) a_{k}^{λ} ∣ ψ ⟩ = 0$ .

Using the standard basis $ϵ_{μ}^{0, 3}$ for time-like and longitudinal, $k_{μ} = (ω, k)$ for a photon on the light cone ( $ω = ∣ k ∣$ ), one finds $k^{μ} ϵ_{μ}^{0} = ω = ∣ k ∣$ and $k^{μ} ϵ_{μ}^{3} = - ∣ k ∣$ (with conventions chosen so $ϵ^{3} = \hat{k}$ ); the Gupta-Bleuler condition becomes $(a_{k}^{0} - a_{k}^{3}) ∣ ψ ⟩ = 0$ for each $k$ . This means only the symmetric combination $b_{k}^{+} = (a_{k}^{0} + a_{k}^{3}) / 2$ acts non-vacuously on $∣ ψ ⟩$ , while the antisymmetric combination $b_{k}^{-} = (a_{k}^{0} - a_{k}^{3}) / 2$ annihilates it.

The norm of a generic state $∣ ψ ⟩ = b_{k}^{+†} ∣0 ⟩$ vanishes: $⟨ ψ ∣ ψ ⟩ = [b^{+}, b^{+†}] = ([a^{0}, a^{0 †}] - [a^{3}, a^{3 †}]) /2 = ((+ 1) - (- 1) \cdot (- 1)) /2 \cdot (2 π)^{3} δ^{(3)} (0) = 0 \cdot \dots = 0$ (with appropriate normal-ordering). Therefore the time-like + longitudinal sector contains only zero-norm states; the physical inner product is positive-semidefinite; the quotient $H_{phys} / H_{null}$ is positive-definite and carries only the two transverse photon modes, exactly matching the Coulomb-gauge count. The Gupta-Bleuler construction is the operator-language ancestor of the modern BRST cohomology, which generalises it to non-Abelian gauge theories via the Kugo-Ojima quartet mechanism ^{[Kugo-Ojima 1979]}.

Exercise 8 (hard, open-ended).

Compare the Coulomb-gauge and Lorenz-gauge treatments of the Lamb shift calculation. Why is the Lamb shift conventionally computed in Coulomb gauge in atomic-physics textbooks (e.g., Bethe 1947) but in Lorenz / Feynman gauge in modern QED textbooks (e.g., Peskin-Schroeder)? What does the gauge equivalence theorem guarantee about the final answer?

Hint

Atomic physics uses non-relativistic perturbation theory around hydrogenic states; high-energy QED uses covariant Feynman-rule scattering theory.

Answer

Bethe's 1947 Lamb-shift calculation used non-relativistic perturbation theory: hydrogenic eigenstates as the unperturbed basis, the dipole electron-photon interaction $\hat{H}_{int} = - e A_{⊥} \cdot v$ in Coulomb gauge as the perturbation, second-order perturbation theory summed over intermediate photon-photon states. The Coulomb gauge is natural because the Coulomb attraction (nucleus-electron) is already in the unperturbed Hamiltonian, the transverse photon emission and reabsorption is treated perturbatively, and only physical photons appear. The result is a logarithmically divergent integral that Bethe cut off at the electron mass to obtain the famous non-relativistic estimate $Δ E_{2 S - 2 P} \sim α^{5} m c^{2} / (6 π) \cdot lo g (m c^{2} / ⟨ E_{atomic} ⟩)$ , agreeing with the Lamb-Retherford 1947 measurement to ~10%.

The modern textbook treatment (Peskin-Schroeder, Schwartz) computes the same shift in Feynman gauge using fully covariant Feynman rules: the bound-state interaction with the external Coulomb field is treated via the Furry picture (the electron is dressed by the static Coulomb field exactly), and the radiative corrections — vertex, vacuum polarisation, self-energy — are computed using Feynman-gauge propagators and dimensional regularisation. The Feynman gauge is natural because the photon propagator is the simplest possible ( $- i g_{μν} / k^{2}$ ), the integrals are Lorentz-invariant from the start, and dimensional regularisation gives manifestly gauge-invariant counterterms.

Both approaches give $Δ E_{2 S - 2 P}^{theory} = 1057.84 (2)$ MHz, in agreement with the modern experimental value $1057.85 (1)$ MHz to the part-per-thousand level. The gauge equivalence theorem guarantees this: the physical level shift is gauge-independent, so the Coulomb-gauge atomic-physics calculation and the Feynman-gauge covariant calculation must agree. The choice between them is purely a matter of computational convenience and pedagogical clarity.

Lean formalization Intermediate+

Mathlib's coverage of the prerequisites is partial: Hilbert spaces, finite-dimensional inner-product spaces, the Lorentz-Minkowski quadratic form, and the abstract distributional framework on Schwartz space all exist in some form. What is missing is the physics-layer apparatus required to formalise covariant gauge fixing in QED:

An operator-valued-distribution model of the free Maxwell quantum field on a Krein space (indefinite-metric Hilbert space) carrying the four covariant photon modes, with commutation relations $[A^{μ} (x), A^{ν} (y)] = - i g^{μν} D (x - y)$ where $D$ is the Pauli-Jordan function.
The Coulomb-gauge transverse projector $Π_{⊥}^{ij} (k) = δ^{ij} - k^{i} k^{j} /∣ k ∣^{2}$ as an idempotent symmetric operator and the corresponding Helmholtz decomposition of vector fields, combined with the Fock-space construction over the resulting two-dimensional transverse fibre.
The Gupta-Bleuler subsidiary condition $\partial_{μ} A^{(+) μ} ∣ ψ ⟩ = 0$ as a closed subspace projector, plus the proof that the inner product on the quotient $H_{phys} / H_{null}$ is positive-definite.
The Faddeev-Popov determinant as a functional Jacobian for the gauge-group action, requiring infinite-dimensional Lie-group integration and (for the non-Abelian generalisation) Grassmann path integrals.
The Ward-Takahashi identity as a consequence of $U (1)$ gauge invariance of the bare QED action, derivable from a Slavnov-Taylor functional differential equation.
The BRST operator $s$ as a nilpotent fermionic differential on the extended algebra $Ω^{∙} (A, ψ, c, \overset{c}{ˉ}, b)$ of fields and ghosts, with the BRST cohomology $H^{0} (s) = ker s_{0} / im s_{0}$ at ghost number zero giving the physical Hilbert space; the Kugo-Ojima quartet mechanism showing that unphysical photon and ghost modes pair into BRST quartets that decouple from physical observables.

Each is a substantial Mathlib contribution. The unit ships lean_status: none. The aggregated upstream-Mathlib roadmap for QED quantum-field formalisation has this unit's Gupta-Bleuler and BRST infrastructure as one of its load-bearing sub-goals; without it, the gauge-equivalence theorem cannot be stated formally, and downstream units on Compton, Møller, Bhabha, the one-loop self-energy, the vertex correction, and the Lamb shift all rest on the same gauge-equivalence claim. The human-review surface is described in the unit metadata Mathlib gap analysis.

Advanced results Master

BRST cohomology as the modern foundation

The Becchi-Rouet-Stora 1974-1975 / Tyutin 1975 reformulation ^{[Becchi-Rouet-Stora 1974; Tyutin 1975]} of gauge-fixed gauge theory is the modern foundation of covariant-gauge quantisation. After Faddeev-Popov gauge fixing the action is no longer gauge-invariant, but it carries a global anticommuting symmetry: the BRST symmetry. On the extended field content $(A_{μ}, ψ, c, \overset{c}{ˉ}, b)$ — where $c, \overset{c}{ˉ}$ are the Faddeev-Popov ghost and antighost, $b$ is the Nakanishi-Lautrup auxiliary field — the BRST operator $s$ acts as

s A_{μ} = \partial_{μ} c + i ec A_{μ} (non-Abelian; Abelian: s A_{μ} = \partial_{μ} c), s ψ = - i ec ψ, sc = - \frac{i e}{2} {c, c} (Abelian: sc = 0), s \overset{c}{ˉ} = ib, s b = 0,

and is nilpotent: $s^{2} = 0$ . The Lagrangian $L_{QED} + L_{GF} + L_{ghost}$ can be written as $L_{gauge-inv} + s (\overset{c}{ˉ} \cdot F)$ for a gauge-fixing fermion $F$ , demonstrating that the gauge-fixing dependence is BRST-exact and therefore does not contribute to BRST-cohomology classes. The space of physical states is defined as the cohomology

H_{phys} = \frac{ker s \cap { ghost number 0 }}{im s \cap { ghost number 0 }},

restricted to the BRST-closed states modulo BRST-exact ones. For QED this construction reproduces the Gupta-Bleuler condition: BRST-closed states with ghost number zero are exactly the Gupta-Bleuler physical states. For non-Abelian gauge theories, BRST cohomology is the only known way to define the physical Hilbert space at the level of the operator algebra and forms the algebraic foundation of the Kugo-Ojima 1979 unitarity proof ^{[Kugo-Ojima 1979]} via the quartet mechanism: unphysical timelike and longitudinal photon modes pair with Faddeev-Popov ghost and antighost modes into quartets $(b, c, \overset{c}{ˉ}, π)$ where $π$ is a longitudinal photon, and each quartet decouples from physical S-matrix elements by a BRST-exact-zero argument. The cohomological reformulation is the technical machinery behind the modern proofs of renormalisability and unitarity of the Standard Model.

The Slavnov-Taylor identity and renormalisation

The Ward-Takahashi identity has a non-Abelian generalisation: the Slavnov-Taylor identity ^{[Slavnov 1972; Taylor 1971]}, which states that BRST invariance of the renormalised effective action gives functional relations among the Green's functions of the gauge, matter, and ghost fields. In QED the Slavnov-Taylor identity collapses to the Ward-Takahashi identity (because ghosts decouple). In Yang-Mills theory it gives substantive constraints among the three-gluon, four-gluon, and gluon-ghost vertices that protect the structure of the renormalised theory and reduce the apparent number of independent counterterms. 't Hooft 1971 ^{[Faddeev-Popov 1967 cited above]} used the Slavnov-Taylor identity to prove the renormalisability of non-Abelian gauge theories — the result that opened the door to the Standard Model and won 't Hooft and Veltman the 1999 Nobel Prize. The Slavnov-Taylor identity is the modern face of the gauge-symmetry constraint on the structure of perturbative quantum field theory.

Beyond perturbation theory: the Gribov ambiguity

Faddeev-Popov gauge fixing assumes that the gauge-fixing condition $G (A) = 0$ picks out exactly one representative from each gauge orbit. For Abelian gauge theories this is true (one $Λ$ exists for any starting potential, as shown in the Beginner Worked Example). For non-Abelian gauge theories this is false at the non-perturbative level: a given orbit may intersect the gauge-fixing surface multiple times, and the Faddeev-Popov determinant may vanish at some configurations — these are the Gribov copies discovered by Gribov 1978. The Gribov ambiguity is invisible in perturbation theory (it shows up only at field configurations with non-vanishing topology) but is essential to the non-perturbative quantisation of Yang-Mills theory and to the confinement problem. For QED, no Gribov ambiguity arises in any standard gauge; the Faddeev-Popov procedure is exact at all energies. This is one of the substantive differences between Abelian and non-Abelian gauge theories that the BLP treatment emphasises only briefly.

Pinch technique and gauge-invariant Green's functions

The off-shell Green's functions of gauge theories are gauge-dependent — the photon propagator depends on $ξ$ , the electron self-energy depends on $ξ$ , and so on. For some applications (effective-field-theory matching, the BFM background-field method, the running coupling defined off-shell) it is useful to construct gauge-invariant off-shell Green's functions. The pinch technique of Cornwall-Papavassiliou-Sirlin 1981-2009 reorganises the diagrammatic expansion by identifying and removing the gauge-dependent pieces of individual diagrams via a systematic algebraic procedure, producing gauge-invariant effective Green's functions term by term in perturbation theory. The pinch technique is equivalent to the background-field method in a particular gauge (Feynman background gauge), and the resulting gauge-invariant Green's functions satisfy ordinary Ward identities of the unbroken QED Ward-Takahashi form. The technique is used in precision electroweak phenomenology (e.g., the gauge-invariant definition of the $W$ -boson mass beyond tree level) and in non-perturbative QCD studies. For QED it is a luxury rather than a necessity; for the electroweak Standard Model it is essential.

Full proof set Master

The Ward identity stated in the Key Derivation has both a perturbative derivation (via the functional generating equation) and a non-perturbative derivation (via Noether's theorem for the global $U (1)$ subgroup of the gauge group). The non-perturbative version is the cleanest. Let $j^{μ} = e \overset{ˉ}{ψ} γ^{μ} ψ$ be the electric current; classical $U (1)$ invariance of the QED action gives the local conservation law $\partial_{μ} j^{μ} = 0$ off-shell. Take an arbitrary connected Green's function

G^{μ} (x; {x_{i}}) = ⟨ 0∣ T j^{μ} (x) ϕ_{1} (x_{1}) \dots ϕ_{n} (x_{n}) ∣0 ⟩,

with $ϕ_{i}$ external fields. The classical conservation law lifts to an operator identity inside the time-ordered product up to Schwinger contact terms that arise from the time-ordering being non-smooth across coincident points. Schematically, $\partial_{μ}^{x} G^{μ} (x; {x_{i}}) = \sum_{i} δ^{4} (x - x_{i}) \cdot [gauge variation of ϕ_{i}]$ in the sense of distributions. Fourier-transforming and amputating the external propagators converts this into the Ward-Takahashi identity for vertex functions; setting the external momenta on-shell and contracting with physical polarisation vectors gives the Ward identity for S-matrix elements: $k_{μ} M^{μ} = 0$ .

The completeness of this derivation rests on three technical points. First, the path-integral measure $D A D \overset{ˉ}{ψ} D ψ$ must be invariant under the local $U (1)$ gauge transformation. For vector QED in $d = 4$ this holds; for chiral QED (involving $γ^{5}$ couplings) the chiral anomaly of Adler-Bell-Jackiw 1969 violates the naive Ward identity, with consequences for $π^{0} \to γ γ$ decay, the cancellation of gauge anomalies in the Standard Model, and the Witten global anomaly in $S U (2)$ . Second, the renormalisation procedure must preserve the Ward identity — equivalent to the requirement that the regulariser be gauge-invariant. Dimensional regularisation in $d = 4 - 2 ϵ$ does this manifestly; Pauli-Villars regularisation does too (with careful sign assignments for the regulator fields); lattice regularisations can but require care (Wilson fermions break chiral symmetry; staggered fermions break flavour). Third, the on-shell projection must be consistent with the regulariser; for QED in dimensional regularisation this is automatic.

The Gupta-Bleuler subsidiary condition is a consistent restriction on physical states because $\partial_{μ} \hat{A}^{(+) μ}$ commutes with itself at spacelike-separated points (a consequence of microcausality of the free Maxwell field) and with the interaction Hamiltonian (a consequence of current conservation $\partial_{μ} \hat{j}^{μ} = 0$ ). The first commutation ensures that the subsidiary condition is preserved by the free time evolution; the second ensures that it is preserved by the interaction. Therefore the physical-state subspace is invariant under the full Heisenberg evolution, and the time-evolution of physical states stays physical. The same statement, in modern BRST language, is the assertion that BRST charge $Q_{BRST}$ commutes with the Hamiltonian and that physical states are the cohomology classes $H^{0} (Q_{BRST})$ on the appropriate ghost-number-zero subspace.

The gauge-equivalence theorem at the operator level (rather than the path-integral level used in the Key Derivation) is the assertion that the S-matrix in any consistent gauge is unitarily equivalent to the S-matrix in any other consistent gauge, when restricted to the physical-state subspace. This was proved for the QED Coulomb-vs-Lorenz comparison in Dirac 1955 ^{[Dirac 1955]} and Schwinger 1953; for the general non-Abelian comparison via BRST cohomology in Kugo-Ojima 1979 ^{[Kugo-Ojima 1979]}. The technical machinery is gauge-by-gauge LSZ reduction with appropriate canonical-transformation maps relating the in/out states in different gauges; the punchline is that physical observables (cross sections, decay rates, bound-state energies, anomalous magnetic moments) are gauge-independent.

Connections Master

Canonical quantum field theory 12.12.01 is the framework: the QED Lagrangian, the Dyson-series expansion, and the LSZ reduction. The gauge-fixing procedure of this unit is the step that turns the formal Lagrangian into a quantisable theory with well-defined photon propagator.
Dirac equation and relativistic spin 12.11.01 supplies the matter sector: four-component spinors, gamma-matrix algebra, the spin-completeness relation $\sum_{s} u^{s} (p) \overset{u}{ˉ}^{s} (p) = \slashed p + m$ . The QED minimal coupling $\overset{ˉ}{ψ} i γ^{μ} D_{μ} ψ$ unifies the Dirac field with the gauge potential of this unit.
Free Dirac quantum field 12.05.05 supplies the operator-valued Dirac field with canonical anticommutation relations and the LSZ reduction formula. The Dirac field is the matter side; the gauge field of this unit is the radiation side; quantum-electrodynamics is the coupled system.
Bosonic Fock space 12.13.01 is the construction of the photon Fock space, with creation and annihilation operators satisfying $[a_{k}^{λ}, a_{k^{'}}^{λ^{'} †}] = (2 π)^{3} δ^{(3)} (k - k^{'}) δ^{λ λ^{'}}$ on positive-definite metric, or $- g^{λ λ^{'}}$ on indefinite metric. The choice of metric distinguishes Coulomb-gauge unitarity from Lorenz-gauge covariant quantisation.
Compton scattering and the Klein-Nishina formula 12.12.03 uses the Lorenz-gauge photon-polarisation-completeness sum $\sum_{λ} ϵ_{λ}^{μ} ϵ_{λ}^{* ν} \to - g^{μν}$ to convert the spin-and-polarisation sum into a manageable trace; the substitution is valid by the Ward identity derived here.
One-loop electron self-energy and mass renormalization 12.16.01 is the canonical worked example of the gauge equivalence theorem: the Coulomb-gauge and Feynman-gauge computations produce different intermediate Feynman-parameter integrals but the same physical mass shift, wave-function renormalisation, and running coupling.
One-loop QED vertex and anomalous magnetic moment 12.16.02 is most cleanly computed in Feynman gauge, where the photon propagator is $- i g_{μν} / k^{2}$ ; the gauge equivalence theorem ensures the Schwinger result $a_{e} = α / (2 π)$ is gauge-independent.
Vacuum polarization at one loop and the Uehling potential 12.16.03 depends crucially on the Ward identity to enforce the transverse decomposition $Π^{μν} (q) = (q^{2} g^{μν} - q^{μ} q^{ν}) Π (q^{2})$ ; the photon stays massless to all orders in perturbation theory because a mass term is not gauge-invariant.
Non-Abelian gauge theory and Yang-Mills is the natural successor: the Faddeev-Popov machinery developed here for the Abelian case becomes essential rather than decorative for $S U (N)$ gauge theories, where interacting ghosts contribute to loop diagrams and are required for unitarity. The 't Hooft-Veltman 1971-1972 proof of renormalisability of non-Abelian gauge theories rests directly on the BRST machinery of this unit, generalised via the Slavnov-Taylor identity.
BRST cohomology in topological field theory is a downstream mathematical development. The nilpotent BRST operator and its cohomology classes were the inspiration for Witten's 1988 cohomological reformulation of topological quantum field theories; Donaldson and Seiberg-Witten invariants of four-manifolds are cohomology classes of BRST-like differentials on infinite-dimensional configuration spaces.

Historical & philosophical context Master

The gauge condition $\partial_{μ} A^{μ} = 0$ was written down by Ludvig Lorenz in 1867 ^{[Lorenz 1867]}, a Danish physicist whose name is regularly mis-attributed to the better-known Hendrik Antoon Lorentz of Lorentz transformations and Lorentz force. Lorenz's 1867 paper "On the identity of the vibrations of light with electrical currents" predates Maxwell's 1873 Treatise on Electricity and Magnetism by six years and contains the covariant gauge condition as the natural choice for a relativistic wave equation for the electromagnetic potential. The historical confusion between Lorenz and Lorentz is documented in Jackson and Okun 2001 (Rev. Mod. Phys. 73, 663); the correct attribution is L. V. Lorenz 1867 for the gauge condition, and H. A. Lorentz for the transformation group.

Canonical quantisation of the electromagnetic field in Lorenz gauge was first attempted by Fermi 1932 ^{[Fermi 1932]} in his Reviews of Modern Physics survey of quantum electrodynamics. Fermi's treatment had several technical gaps: the gauge condition was imposed as a constraint operator equation, which is inconsistent with the canonical commutation relations, and the indefinite-metric problem was not addressed cleanly. The full resolution waited for the independent papers of Suraj Gupta in 1950 ^{[Gupta 1950]} and Konrad Bleuler in 1950 ^{[Bleuler 1950]}, who introduced the indefinite-metric Hilbert space and the subsidiary condition $\partial_{μ} A^{(+) μ} ∣ ψ_{phys} ⟩ = 0$ that defines physical states. The Gupta-Bleuler quantisation is the historical foundation of all modern covariant treatments of gauge theories.

The Coulomb-gauge canonical quantisation was developed by Dirac 1955 ^{[Dirac 1955]} in his "Gauge-invariant formulation of quantum electrodynamics" paper, which gave the first manifestly unitary canonical treatment with explicit elimination of the gauge redundancy. Dirac's approach used the constraint analysis he had developed in 1949-1950 for general constrained Hamiltonian systems (the Dirac-Bergmann formalism), specialised to the QED case. Schwinger 1953 had given an essentially equivalent treatment in a slightly different formalism; the two approaches are now standard.

The path-integral derivation of gauge-fixed gauge theory was given by Ludvig Faddeev and Victor Popov in a 1967 Physics Letters B paper ^{[Faddeev-Popov 1967]} originally devoted to the quantisation of non-Abelian Yang-Mills theory. Their two-page paper introduced the Faddeev-Popov determinant, the ghost fields, and the modern covariant gauge-fixing prescription that became the standard technique in particle physics. For QED the Faddeev-Popov procedure reduces to the Gupta-Bleuler quantisation, with the ghost fields decoupling from physical amplitudes; for Yang-Mills theory the ghosts are essential. Faddeev's institute in Leningrad was the leading centre for mathematical aspects of gauge theory in the 1960s and 1970s; the 1967 Faddeev-Popov paper is one of the most cited in twentieth-century theoretical physics.

The BRST symmetry was discovered independently by Carlo Becchi, Alain Rouet, and Raymond Stora at Marseille in 1974-1976 ^{[Becchi-Rouet-Stora 1974]} and by Igor Tyutin at the Lebedev Institute in Moscow in 1975 ^{[Tyutin 1975]} (in a preprint that remained unpublished in Western journals for many years). The cohomological reformulation became the standard framework for unitarity proofs of gauge theories after the Kugo-Ojima 1979 paper ^{[Kugo-Ojima 1979]} established the quartet mechanism and the BRST-cohomological definition of physical states. The Slavnov-Taylor identity ^{[Ward 1950; Takahashi 1957]} and its non-Abelian generalisation by Slavnov 1972 and Taylor 1971 provided the technical machinery for the renormalisability proofs of 't Hooft-Veltman 1971-1972, which won the 1999 Nobel Prize.

The choice between Coulomb and Lorenz gauges reflects a structural feature of physics that is rarely articulated: the same physical theory can have multiple inequivalent canonical formulations, and the best formulation depends on the problem at hand. Atomic physics, where Coulomb binding is the dominant interaction and transverse photon emission is the perturbation, naturally uses Coulomb gauge; high-energy scattering, where Lorentz invariance is essential and the Feynman-rule machinery is the workhorse, naturally uses Lorenz / Feynman gauge. The gauge equivalence theorem guarantees that physical predictions are independent of this choice, but the computational cost of getting to those predictions can differ by orders of magnitude. The pedagogical literature reflects this division: textbooks aimed at atomic physicists (Bethe-Salpeter, Sakurai's Advanced Quantum Mechanics) emphasise Coulomb gauge; textbooks aimed at high-energy physicists (Peskin-Schroeder, Schwartz, Weinberg) emphasise Lorenz/Feynman gauge. Berestetskii-Lifshitz-Pitaevskii Volume 4 uniquely uses both fluently throughout the book, requiring the reader to translate between them — one reason the book is famously difficult to read cold but uniquely valuable as a unified treatment.

Bibliography Master

Primary literature:

Lorenz, L. V., "On the identity of the vibrations of light with electrical currents", Phil. Mag. 34 (1867), 287–301. [Originator: covariant gauge condition $\partial_{μ} A^{μ} = 0$ , predating Maxwell's preferred formulation.]
Fermi, E., "Quantum Theory of Radiation", Rev. Mod. Phys. 4 (1932), 87–132. [First canonical quantisation of QED in Lorenz gauge with a constraint approach.]
Gupta, S. N., "Theory of longitudinal photons in quantum electrodynamics", Proc. Phys. Soc. A 63 (1950), 681–691. [Originator: indefinite-metric quantisation in Lorenz gauge.]
Bleuler, K., "Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen", Helv. Phys. Acta 23 (1950), 567–586. [Originator: independent indefinite-metric quantisation.]
Dirac, P. A. M., "Gauge-invariant formulation of quantum electrodynamics", Can. J. Phys. 33 (1955), 650–660. [Originator: canonical Coulomb-gauge quantisation, manifestly unitary treatment of QED.]
Faddeev, L. D. & Popov, V. N., "Feynman diagrams for the Yang-Mills field", Phys. Lett. B 25 (1967), 29–30. [Originator: path-integral gauge fixing and ghost fields.]
Becchi, C., Rouet, A. & Stora, R., "The Abelian Higgs-Kibble model: unitarity of the S-operator", Phys. Lett. B 52 (1974), 344–346; Commun. Math. Phys. 42 (1975), 127–162; "Renormalization of gauge theories", Ann. Phys. 98 (1976), 287–321. [Originator: BRST cohomological reformulation.]
Tyutin, I. V., "Gauge invariance in field theory and statistical physics in operator formalism", Lebedev Institute preprint FIAN No. 39 (1975). [Originator: independent BRST construction.]
Ward, J. C., "An identity in quantum electrodynamics", Phys. Rev. 78 (1950), 182. [Originator: Ward identity for QED.]
Takahashi, Y., "On the generalised Ward identity", Nuovo Cim. 6 (1957), 371–375. [Originator: Ward-Takahashi identity.]
Kugo, T. & Ojima, I., "Local covariant operator formalism of non-Abelian gauge theories and quark confinement problem", Prog. Theor. Phys. Suppl. 66 (1979), 1–130. [BRST unitarity proof, quartet mechanism, BRST-cohomological definition of physical Hilbert space.]
Slavnov, A. A., "Ward identities in gauge theories", Theor. Math. Phys. 10 (1972), 99–104; Taylor, J. C., "Ward identities and charge renormalisation of the Yang-Mills field", Nucl. Phys. B 33 (1971), 436–444. [Slavnov-Taylor identity for non-Abelian gauge theories.]
Jackson, J. D. & Okun, L. B., "Historical roots of gauge invariance", Rev. Mod. Phys. 73 (2001), 663–680. [Historical clarification of Lorenz vs Lorentz attribution.]

Textbooks and monographs:

Berestetskii, V. B., Lifshitz, E. M., Pitaevskii, L. P., Quantum Electrodynamics, 2nd ed. (Pergamon, 1982), §§4, 76–77, 102. [Both Coulomb and Lorenz gauges used fluently throughout; explicit translation between them.]
Weinberg, S., The Quantum Theory of Fields, Vol. I: Foundations (Cambridge, 1995), §§8.5 (Coulomb gauge), 9.5 (covariant-gauge quantisation), 15.7 (BRST symmetry). [The modern axiomatic treatment.]
Peskin, M. E. & Schroeder, D. V., An Introduction to Quantum Field Theory (Westview / Addison-Wesley, 1995), §9.4 (functional quantisation of the EM field, Faddeev-Popov for Abelian and non-Abelian). [Modern graduate textbook treatment.]
Schwartz, M. D., Quantum Field Theory and the Standard Model (Cambridge, 2014), §§8, 14 (path-integral quantisation of gauge theories, gauge invariance of physical amplitudes). [Modern pedagogical synthesis.]
Itzykson, C. & Zuber, J.-B., Quantum Field Theory (McGraw-Hill, 1980), §3-2. [Franco-Russian-school treatment with explicit comparison of canonical and path-integral methods.]
Bjorken, J. D. & Drell, S. D., Relativistic Quantum Fields (McGraw-Hill, 1965), Ch. 14 (Maxwell-field quantisation in covariant gauge). [Pre-Faddeev-Popov canonical treatment.]
Henneaux, M. & Teitelboim, C., Quantisation of Gauge Systems (Princeton UP, 1992). [Comprehensive monograph on Dirac-Bergmann constraint analysis and BRST quantisation.]
Pokorski, S., Gauge Field Theories, 2nd ed. (Cambridge UP, 2000), Chs. 1–3. [Pedagogical treatment of gauge-fixing, BRST, and Slavnov-Taylor for the Standard Model.]

Prerequisites

12.12.01 pending
12.11.01 pending
12.05.05 pending

Tier anchors

beginner: Zee, Quantum Field Theory in a Nutshell, 2e (Princeton 2010), Ch. II.7; Tong, Lectures on Quantum Field Theory, §6
intermediate: Peskin & Schroeder, An Introduction to Quantum Field Theory (Westview 1995), §9.4; Schwartz, Quantum Field Theory and the Standard Model (Cambridge 2014), §14
master: Berestetskii, Lifshitz, Pitaevskii, Quantum Electrodynamics, 2e (Pergamon 1982), §§4, 76–77, 102; Weinberg, The Quantum Theory of Fields, Vol. I (Cambridge 1995), §§8.5, 9.5; Itzykson & Zuber, Quantum Field Theory (McGraw-Hill 1980), §3-2

References

Lorenz, L. V. — On the identity of the vibrations of light with electrical currents · Phil. Mag. 34, 287–301 (1867) — covariant gauge condition predating Maxwell's preferred formulation
Fermi, E. — Quantum Theory of Radiation · Rev. Mod. Phys. 4, 87–132 (1932) — first canonical quantisation of QED in Lorenz gauge with a constraint approach
Gupta, S. N. — Theory of longitudinal photons in quantum electrodynamics · Proc. Phys. Soc. A 63, 681–691 (1950) — indefinite-metric quantisation in Lorenz gauge
Bleuler, K. — Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen · Helv. Phys. Acta 23, 567–586 (1950) — independent covariant-gauge quantisation
Dirac, P. A. M. — Gauge-invariant formulation of quantum electrodynamics · Can. J. Phys. 33, 650–660 (1955) — canonical Coulomb-gauge quantisation
Faddeev, L. D. & Popov, V. N. — Feynman diagrams for the Yang-Mills field · Phys. Lett. B 25, 29–30 (1967) — path-integral gauge fixing and ghost fields
Becchi, C., Rouet, A. & Stora, R. — The Abelian Higgs-Kibble model: unitarity of the S-operator · Phys. Lett. B 52, 344–346 (1974); Commun. Math. Phys. 42, 127–162 (1975); Ann. Phys. 98, 287–321 (1976) — BRST cohomological reformulation
Tyutin, I. V. — Gauge invariance in field theory and statistical physics in operator formalism · Lebedev Institute preprint FIAN No. 39 (1975) — independent BRST construction
Ward, J. C. — An identity in quantum electrodynamics · Phys. Rev. 78, 182 (1950) — Ward identity for QED
Takahashi, Y. — On the generalised Ward identity · Nuovo Cim. 6, 371–375 (1957) — Ward-Takahashi identity
Kugo, T. & Ojima, I. — Local covariant operator formalism of non-Abelian gauge theories and quark confinement problem · Prog. Theor. Phys. Suppl. 66, 1–130 (1979) — BRST unitarity proof and quartet mechanism
Berestetskii, V. B., Lifshitz, E. M. & Pitaevskii, L. P. — Quantum Electrodynamics, 2e · Pergamon (1982), §§4, 76–77, 102 (gauge fixing in QED across the book)
Peskin, M. E. & Schroeder, D. V. — An Introduction to Quantum Field Theory · Westview (1995), §9.4 (functional quantisation of the EM field, Faddeev-Popov for Abelian and non-Abelian)
Weinberg, S. — The Quantum Theory of Fields, Vol. I: Foundations · Cambridge (1995), §§8.5 (Coulomb gauge), 9.5 (covariant-gauge quantisation), 15.7 (BRST symmetry)
Itzykson, C. & Zuber, J.-B. — Quantum Field Theory · McGraw-Hill (1980), §3-2 (gauge invariance and quantisation of the photon field)
Schwartz, M. D. — Quantum Field Theory and the Standard Model · Cambridge (2014), §§8, 14 (path-integral quantisation of gauge theories, gauge invariance of physical amplitudes)

Estimated time

beginner: 15m
intermediate: 35m
master: 60m

Intuition Beginner

Visual Beginner

Worked example Beginner

Check your understanding Beginner

Formal definition Intermediate+

The gauge-fixing problem

Coulomb gauge

Lorenz gauge and the ξ-family

The Gupta-Bleuler subsidiary condition

Faddeev-Popov and the path integral

Key derivation Intermediate+

Exercises Intermediate+

Lean formalization Intermediate+

Advanced results Master

BRST cohomology as the modern foundation

The Slavnov-Taylor identity and renormalisation

Beyond perturbation theory: the Gribov ambiguity

Pinch technique and gauge-invariant Green's functions

Full proof set Master

Connections Master

Historical & philosophical context Master

Bibliography Master

Lorenz gauge and the $ξ$ -family