Free Maxwell / massive vector fields; photon and Proca
Anchor (Master): Weinberg, S., *The Quantum Theory of Fields, Vol. 1: Foundations* (Cambridge, 1995), §5.3 (massive vector fields), §5.9 (causal vector fields; photon and Proca); Itzykson, C. & Zuber, J.-B., *Quantum Field Theory* (McGraw-Hill, 1980), Ch. 3 §3-2 (canonical quantisation of the electromagnetic field in Gupta-Bleuler and Coulomb gauges); Woit, P., *Quantum Theory, Groups and Representations* (Springer, 2017), Ch. 46 (massless spin-1 representations and the photon); Stueckelberg, E. C. G., *Helv. Phys. Acta* 11, 225 (1938); Proca, A., *J. Phys. Radium* 7, 347 (1936)
Intuition Beginner
Every classical wave has polarisation — a direction in which the wave wiggles. A light beam from the sun, looked at carefully, is a superposition of two orthogonal transverse polarisations: the electric field can swing left-right or up-down, but never along the direction of travel. Polaroid sunglasses pass one polarisation and absorb the other. That two-state polarisation freedom is the classical face of what quantum field theory calls the photon's two helicity modes.
Now imagine running the same picture for a hypothetical massive cousin of the photon — a particle that looks like light but moves slower than light and carries rest mass. A massive spin-1 particle has three polarisations, not two: the two transverse modes plus a longitudinal one. The longitudinal mode wiggles along the direction of travel, the mode that the massless photon does not have. The W and Z bosons of the electroweak theory are real-world examples of this massive cousin; the Proca field is the textbook model.
Why does the photon have only two polarisations while a massive vector has three? The answer is that the photon is constrained to travel at the speed of light, and at the speed of light the longitudinal mode is gauge — a redundancy that can be removed by a U(1) rotation of the phase, with no physical content. The massive vector cannot use this escape route, so its longitudinal mode is real and physical.
Visual Beginner
A side-by-side schematic of polarisation vectors for a photon and for a massive vector. On the left, a photon travelling along the -axis with two transverse polarisation arrows in the -plane and a third dashed arrow along marked as "gauge — not physical". On the right, a massive vector travelling along the -axis with the same two transverse arrows plus a solid third arrow along marked as "longitudinal — physical".
The picture captures the central count: 2 polarisations for the photon, 3 for the massive vector. The smooth limit as the mass goes to zero is subtle: the longitudinal mode of the massive vector does not disappear by going soft, but rather decouples by becoming pure gauge. That decoupling is the content of the Stueckelberg reformulation discussed in the higher tiers.
Worked example Beginner
Count the polarisations of a photon of momentum along the -axis directly. Use units in which the speed of light is one.
Step 1. The photon momentum four-vector is , with energy matching the spatial momentum magnitude because the photon has zero rest mass. A polarisation vector is a four-vector that tells us in which spacetime direction the electromagnetic field wiggles.
Step 2. There are four candidate independent four-vectors at each momentum: one along time , one along , one along , and one along .
Step 3. The Lorenz condition keeps only three of these four. With the condition reads , which forces on every surviving polarisation. The two purely transverse choices pass with ; the time-like and the longitudinal ones merge into one combination .
Step 4. The remaining gauge freedom for any number removes the surviving time-like or longitudinal mode. Setting kills the time-component and the longitudinal component simultaneously. Two physical polarisations remain: the two transverse ones .
Step 5. Repeat the count for a massive vector of mass and momentum . The Lorenz condition is now imposed as a dynamical equation of motion rather than a gauge choice — it kills one of the four candidates, leaving three. There is no further gauge freedom because the mass term in the Lagrangian breaks the U(1) symmetry that would have done the killing. Three physical polarisations remain: the two transverse ones plus a longitudinal one.
What this tells us: the count of polarisations follows from Lorentz invariance plus one of two possibilities — gauge invariance for the massless photon (which removes one mode), or a mass term for the massive vector (which adds the longitudinal mode that the photon does not have). The number of physical polarisations is 2 for the photon and 3 for the massive vector, and the count is locked by representation theory of the Poincare group: the photon is a representation of the massless little group ISO(2), and the massive vector is a representation of the massive little group SO(3).
Check your understanding Beginner
Formal definition Intermediate+
Conventions. Spacetime is four-dimensional Minkowski with metric . Greek indices run over ; Latin indices over . Units have . We follow the sign convention of Weinberg [Weinberg 1995 §5.3].
The free Maxwell field is the operator-valued tempered distribution on Minkowski spacetime built from the classical action $$ S_{\text{Maxwell}}[A] = -\tfrac{1}{4} \int d^4 x ; F_{\mu\nu} F^{\mu\nu}, \qquad F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. $$ The action is invariant under the U(1) gauge transformation for any smooth scalar function . The Euler-Lagrange equation is , i.e. the source-free Maxwell equations.
The free Proca field of mass is the operator-valued tempered distribution built from the action $$ S_{\text{Proca}}[B] = \int d^4 x ; \big( -\tfrac{1}{4} G_{\mu\nu} G^{\mu\nu} + \tfrac{1}{2} m^2 B_\mu B^\mu \big), \qquad G_{\mu\nu} = \partial_\mu B_\nu - \partial_\nu B_\mu. $$ The mass term breaks the U(1) gauge invariance present in the Maxwell case. The Euler-Lagrange equation is . Taking the divergence yields , so follows as a constraint for , and the equation reduces to with the additional constraint .
A polarisation four-vector at on-shell momentum is a complex four-vector that satisfies (i) (the Lorenz / transversality condition) and (ii) (normalisation). The polarisation index runs over for the photon (helicity) and for the Proca field (-component of spin in the rest frame, Wigner-rotated to the chosen frame).
The mode expansion of the photon field in Lorenz gauge reads $$ A_\mu(x) = \int \frac{d^3 k}{(2\pi)^3 \sqrt{2 \omega_k}} \sum_{\lambda = \pm 1} \big( a(k, \lambda) \epsilon_\mu(k, \lambda) e^{-i k \cdot x} + a^\dagger(k, \lambda) \epsilon^*_\mu(k, \lambda) e^{i k \cdot x} \big), $$ where on the photon mass shell and are the bosonic creation and annihilation operators on the photon Fock space, satisfying . The Proca mode expansion is identical in form, but the polarisation sum runs over the three labels and the energy is .
The polarisation-sum identity records the projector onto physical polarisations at fixed on-shell momentum. For the Proca field of mass , $$ \sum_{\lambda = -1}^{+1} \epsilon^{\mu}(k, \lambda) \epsilon^\nu(k, \lambda) = -\eta^{\mu\nu} + \frac{k^\mu k^\nu}{m^2}, \qquad k^2 = m^2. $$ For the photon, the analogous identity depends on the gauge: in Lorenz gauge with an auxiliary lightlike four-vector , $$ \sum_{\lambda = \pm 1} \epsilon^{\mu}(k, \lambda) \epsilon^\nu(k, \lambda) = -\eta^{\mu\nu} + \frac{k^\mu n^\nu + k^\nu n^\mu}{k \cdot n} - \frac{n^2 k^\mu k^\nu}{(k \cdot n)^2}, \qquad k^2 = 0. $$ The -dependent terms are gauge-dependent, but contract to zero against any conserved current (Ward identity), so they do not affect physical amplitudes.
Counterexamples to common slips
- The condition is a gauge choice for the photon (one can change gauge to violate it) but a constraint equation of motion for the Proca field (it follows from the action and cannot be relaxed without changing the theory). Confusing the two is the cleanest version of misunderstanding the photon-Proca distinction.
- The polarisation-sum identity (without correction terms) holds for off-shell virtual photons inside Feynman propagators in Feynman gauge . On-shell physical photons require the corrected formula with the auxiliary vector; using the naive formula to a physical external line introduces spurious contributions from unphysical polarisations.
- Dirac's constraint analysis classifies as a Lagrange multiplier (its time derivative does not appear in the action) and Gauss's law as a first-class constraint that generates the residual U(1) gauge transformations. In the Proca case, is not a Lagrange multiplier: appears in , and Gauss-like equation is second-class. The constraint structure differs sharply between the two cases.
Key theorem with proof Intermediate+
Theorem (polarisation count from the Wigner little group; Weinberg §5.3, §5.9). Let be the unitary irreducible representation of the Poincare group corresponding to a one-particle state of mass and integer spin . Then
(i) For , the little group is and the spin- representation has dimension . The one-particle Hilbert space at fixed on-shell momentum carries three physical polarisations, labelled by in the rest frame.
(ii) For , the little group is (the Euclidean group of the plane). Finite-dimensional unitary irreducibles of in which the translation generators act as the identity are labelled by an integer helicity , each one-dimensional. The photon assembles the helicities and into a CPT-invariant two-state representation. The one-particle Hilbert space at fixed null momentum carries two physical polarisations.
Proof. Wigner's classification (Wigner Ann. Math. 40, 149 (1939) [Wigner 1939]) builds unitary irreducibles of the Poincare group from a momentum orbit plus a little-group representation. Fix a reference momentum on the relevant mass-shell orbit, the little group is the subgroup of the Lorentz group fixing , and the irreducible is induced from a unitary irreducible of .
Step 1: massive case. For , choose , the rest-frame momentum. The little group fixing is the rotation subgroup . Unitary irreducibles of are labelled by half-integer spin and have dimension . For , the dimension is . The induced representation on a general momentum is obtained by acting with a standard boost that takes to ; the little-group dimension at propagates to dimension at every on the mass-shell. This gives the three physical polarisations of the Proca field, labelled by .
Step 2: massless case. For , choose , a reference null momentum along the positive -axis. The little group is the subgroup of fixing this null vector. Direct calculation: a Lorentz transformation fixes iff its matrix has the form $$ \Lambda = \begin{pmatrix} 1 + \tfrac{1}{2}|\alpha|^2 & \alpha^1 & \alpha^2 & -\tfrac{1}{2}|\alpha|^2 \ \alpha^1 & 1 & 0 & -\alpha^1 \ \alpha^2 & 0 & 1 & -\alpha^2 \ \tfrac{1}{2}|\alpha|^2 & \alpha^1 & \alpha^2 & 1 - \tfrac{1}{2}|\alpha|^2 \end{pmatrix} R(\theta), $$ where and is a rotation by angle in the -plane. The matrix on the left is a parabolic Lorentz transformation parametrised by , and the rotation commutes with it modulo the -action. The group law is the semidirect product of rotations of the plane with translations of the plane. So the massless little group is .
Step 3: representations of . Unitary irreducibles of are classified by the action of the translation generators . There are two families:
(a) Translations act non-vanishingly. The irreducible is induced from a character of the translation subgroup with continuous parameter , and is infinite-dimensional. These representations correspond to particles with a continuous internal degree of freedom (a "continuous spin") that has never been observed in nature; massless one-particle states of finite-dimensional internal structure exclude them.
(b) Translations act as the identity. The representation factors through , and unitary irreducibles of are one-dimensional, labelled by helicity (or half-integer for projective representations, accessible after passing to the double cover).
The photon is the case , with and joined by CPT into a single two-state representation. The two states are exactly the left- and right-circularly polarised photons of classical optics. There is no state because would be a scalar, not a vector; and there is no for the photon because the photon transforms as a four-vector , and four-vectors carry only helicities at the massless level.
Step 4: from little-group representation to polarisation vectors. The polarisation vectors are the matrix elements of the embedding as the four-vector index runs over and labels the little-group representation. For the massive case at , the standard choice is , , . For the massless case at , the two helicities are . Boosting to general gives the polarisation four-vectors at every on-shell momentum, related by Wigner rotations to the reference choice.
Bridge. The polarisation count builds toward the entire gauge-theoretic structure of quantum field theory and appears again in 03.07.05 (Yang-Mills action) as the non-abelian generalisation of the same construction. The foundational reason the count is fixed is exactly the Wigner little group: the photon is locked to two helicities by the massless little group ISO(2), and the massive vector is locked to three polarisations by the massive little group SO(3). The central insight is that putting these together produces the gauge-invariance principle as a consequence of representation theory rather than as an additional postulate: a massless spin-1 field that transforms as a Lorentz four-vector necessarily carries a U(1) gauge redundancy, because the four-vector representation of the Lorentz group restricts to a reducible representation of the massless little group, and the unphysical components must be removed by gauge. This is exactly the bridge between Lorentz covariance and gauge invariance, and it identifies gauge symmetry with the kinematic requirement that a Lorentz-covariant massless spin-1 field describe only the physical helicity- modes. The Stueckelberg trick generalises the same picture to the massive case, embedding the longitudinal mode into a U(1) gauge orbit at the cost of one auxiliary scalar field.
Exercises Intermediate+
Lean formalization Intermediate+
Mathlib has no named QFT objects for either the photon or the Proca field. A schematic of the intended formalisation:
import Mathlib.AlgebraicTopology.SingularSet
import Mathlib.Analysis.InnerProductSpace.l2Space
import Mathlib.LinearAlgebra.TensorAlgebra.Basic
/-- The one-particle relativistic Hilbert space for a massless spin-1
particle (the photon): positive-energy null-mass mode functions
fibered over the lightcone, with helicity index running over {-1, +1}. -/
structure PhotonOnePArticleSpace : Type where
carrier : Type
inner_product : carrier → carrier → ℂ
-- positive-energy null mass-shell constraint
-- helicity ±1 index assignment
sorry
/-- The free Maxwell field as an operator-valued tempered distribution
on the photon Fock space (bosonic Fock over the one-particle space). -/
noncomputable def MaxwellField :
Π (x : SpaceTimePoint),
BosonicFock PhotonOnePArticleSpace →ₗ[ℂ]
BosonicFock PhotonOnePArticleSpace :=
sorry -- mode expansion in polarisation index λ ∈ {-1, +1}
/-- The free Proca field of mass m > 0 as an operator-valued
distribution on the Proca Fock space (massive spin-1 one-particle space). -/
noncomputable def ProcaField (m : ℝ) (hm : 0 < m) :
Π (x : SpaceTimePoint),
BosonicFock (ProcaOnePArticleSpace m hm) →ₗ[ℂ]
BosonicFock (ProcaOnePArticleSpace m hm) :=
sorry -- polarisation index λ ∈ {-1, 0, +1}The proof gap is substantive. Mathlib needs: the Wigner classification of unitary irreducibles of the Poincare group (no current artifact); operator-valued tempered distributions on bosonic Fock space (no current artifact); the constraint analysis of singular Lagrangians via the Dirac algorithm (no current artifact); the polarisation-sum identity for the two relevant little groups; the Faddeev-Popov determinant for -gauge fixing; and the Stueckelberg reformulation as a change of dynamical variables. Each piece is its own contribution. The unit ships with lean_status: none per the gap field above.
Advanced results Master
Theorem (Gupta-Bleuler quantisation; Itzykson-Zuber §3-2-1). Quantise the free Maxwell field in Lorenz gauge by promoting all four components to operators on an indefinite-metric Fock space built from a four-state little-group representation. The physical subspace is defined by the Bleuler condition , where is the positive-frequency part. The quotient of physical states by null states is a positive-definite Hilbert space carrying the two physical polarisations.
The Gupta-Bleuler construction is the price one pays for manifest Lorentz covariance at the operator level. The indefinite-metric quantisation has a four-state space at each momentum (the four components of ), but the physical subspace selection plus quotient by null states reduces this to two physical states per momentum, matching the helicity count. The construction is the canonical-side complement to the Faddeev-Popov path-integral approach (Faddeev-Popov, Phys. Lett. B 25, 29 (1967) [Faddeev-Popov 1967]), which handles the same gauge-fixing problem at the level of the generating functional.
Theorem (Coulomb-gauge canonical quantisation; Peskin-Schroeder §4.8). In Coulomb gauge the canonical phase space reduces directly to the two transverse polarisations with no indefinite-metric machinery. The equal-time commutator is $$ [A_i(x), \dot{A}_j(y)] = i \delta^{ij}_T(x - y), \qquad \delta^{ij}_T(x - y) = \delta^{ij} \delta^{(3)}(x - y) - \frac{\partial^i \partial^j}{\nabla^2} \delta^{(3)}(x - y), $$ where is the transverse projector. The Hamiltonian is , manifestly positive-definite.
Coulomb gauge breaks manifest Lorentz covariance but produces a manifestly unitary, positive-norm Hilbert space with only the two physical polarisations. The trade-off between manifest covariance (Lorenz gauge with Gupta-Bleuler) and manifest unitarity (Coulomb gauge) is generic in gauge theory. For computation of S-matrix elements between physical states, both approaches give identical answers, by the Faddeev-Popov gauge-independence theorem.
Theorem (Proca-field canonical quantisation; Weinberg §5.3). The massive vector field of mass is canonically quantised on a positive-definite Fock space carrying three physical polarisations per momentum, with mode expansion $$ B_\mu(x) = \int \frac{d^3 k}{(2\pi)^3 \sqrt{2 \omega_k}} \sum_{\lambda = -1}^{+1} \big( a(k, \lambda) \epsilon_\mu(k, \lambda) e^{-i k \cdot x} + a^\dagger(k, \lambda) \epsilon^*_\mu(k, \lambda) e^{i k \cdot x} \big), \quad \omega_k = \sqrt{k^2 + m^2}. $$ The propagator is $$ D^{F,\text{Proca}}{\mu\nu}(k) = \frac{-i}{k^2 - m^2 + i\varepsilon} \big( \eta{\mu\nu} - \frac{k_\mu k_\nu}{m^2} \big). $$ The term diverges as , signalling the obstruction to a smooth massless limit at the level of the propagator.
The Proca propagator's pole is the diagnostic that distinguishes a massive vector from a massless one. In Feynman diagrams for theories with massive vectors (the W and Z bosons of the Standard Model, the rho meson of strong interactions), the term contributes to scattering amplitudes and must be tracked carefully. The unitary gauge in the Stueckelberg-deformed propagator recovers exactly this form for the gauge-boson sector of broken gauge theories, where the Higgs mechanism plays the role of the Stueckelberg scalar at the non-linear level.
Theorem (Stueckelberg reformulation and the smooth massless limit; Stueckelberg 1938). The Proca Lagrangian augmented with an auxiliary scalar via is equivalent to the original Proca theory in unitary gauge . The propagator in 't Hooft gauge for the Stueckelberg-deformed field is $$ D^F_{\mu\nu}(k) = \frac{-i}{k^2 - m^2 + i\varepsilon} \big( \eta_{\mu\nu} - (1 - \xi) \frac{k_\mu k_\nu}{k^2 - \xi m^2} \big), $$ which has a smooth limit because the dangerous term is replaced by a finite expression. The Stueckelberg scalar becomes a free massless ghost as and decouples from physical observables.
Stueckelberg's 1938 trick was a decade ahead of its time: he wrote down the massive U(1) gauge theory with a Higgs-like auxiliary scalar before the Higgs mechanism was discovered. In modern terms, the Stueckelberg formulation is the special case of the Higgs mechanism with a complex scalar of vanishing radial mass — the longitudinal mode of the massive vector is eaten from the scalar's phase, and the smooth limit corresponds to ungluing this mode and returning it to the scalar sector as a free Goldstone boson. For the full Standard Model story, this is the linearised version of the Brout-Englert-Higgs mechanism.
Theorem (Faddeev-Popov determinant and the path-integral derivation; Faddeev-Popov 1967). The free-Maxwell path-integral over field configurations modulo gauge, $$ Z = \int \mathcal{D} A_\mu ; \delta(\partial^\mu A_\mu) ; \Delta_{FP}[A] ; e^{i S_{\text{Maxwell}}[A]}, $$ equals (after exponentiation of the delta function and the determinant) the gauge-fixed generating functional $$ Z_{\xi} = \int \mathcal{D} A_\mu ; e^{i \int (\mathcal{L}{\text{Maxwell}} - (1/2\xi) (\partial^\mu A\mu)^2) d^4 x}, $$ with no Faddeev-Popov ghost contribution because the gauge group U(1) is abelian (the determinant is field-independent and reabsorbed into the normalisation). The resulting propagator is the propagator computed in Exercise 8.
For non-abelian generalisations (Yang-Mills theory), the Faddeev-Popov determinant is no longer field-independent and contributes ghost fields to the gauge-fixed action. The free Maxwell case is the abelian baseline where the construction simplifies. The Faddeev-Popov logic propagates to non-abelian gauge theory in 03.07.05 (Yang-Mills action) and is the modern path-integral counterpart of the canonical Gupta-Bleuler construction at the operator level.
Theorem (helicity conservation and Wigner rotations; Weinberg §5.9). Under a Lorentz transformation that takes momentum to , the photon helicity states transform as $$ U(\Lambda) |k, \lambda\rangle = e^{i \lambda \theta(\Lambda, k)} |\Lambda k, \lambda\rangle, \qquad \lambda = \pm 1, $$ where is the Wigner rotation angle (the SO(2) element of the little-group action ISO(2) at that the Lorentz transformation contributes after extraction of the standard boost). Helicity is conserved under Lorentz boosts, but the polarisation four-vector picks up a momentum-dependent phase.
This is the operational content of the statement that helicity is a Lorentz-invariant for massless particles. For a massive particle, by contrast, the spin projection is not Lorentz-invariant — the Wigner rotation for the massive case is a genuine rotation, not a phase, and it mixes the three polarisation states. The asymmetry between conserved-helicity (massless) and Wigner-mixed-spin (massive) is the kinematic root of the distinction between photon and Proca scattering amplitudes.
Theorem (positivity and cluster decomposition; Weinberg §5.9). Causal commutators of the free-Maxwell and Proca fields in their physical-Hilbert-space quantisations vanish at spacelike separation, $$ [A_\mu(x), A_\nu(y)] = 0, \qquad (x - y)^2 < 0, $$ and the two-point Wightman function satisfies the Wightman positivity condition. The cluster decomposition property of the vacuum is automatic.
Causality and cluster decomposition are the two structural features that single out the canonical-quantisation Fock-space construction (with whatever gauge-fixing machinery is needed) from naive operator-valued solutions of the wave equation. The constructions above are not arbitrary mathematical exercises; they are the unique (up to choice of gauge) free-field quantisations of a Lorentz-covariant local theory that respects causality, cluster decomposition, and positivity of the spectrum of the Hamiltonian.
Synthesis. The construction of the free Maxwell and Proca quantum fields is the foundational reason every gauge-theoretic quantum field theory has the structure it has. The central insight is that the polarisation count for a Lorentz-covariant spin-1 field is locked by Wigner's little group: two helicities for the massless photon (little group ISO(2)) and three polarisations for the massive vector (little group SO(3)). The bridge is that this count cannot be put on a manifestly Lorentz-covariant footing without either (a) introducing gauge invariance and removing the unphysical components by hand, or (b) accepting an indefinite-metric Fock space and projecting onto a physical subspace. Putting these together, the photon's gauge invariance is not an additional postulate but rather a kinematic consequence of the embedding of the two-state ISO(2) representation into the four-component four-vector , and the Stueckelberg trick generalises the same logic to the massive case at the cost of one auxiliary scalar. This is exactly the bridge between Lorentz covariance and gauge invariance, and it identifies the U(1) of electromagnetism with the kinematic redundancy demanded by the embedding rather than with a global symmetry of matter.
The propagator structure follows from the same analysis. For the photon in -gauge, the propagator has a one-parameter family parametrising the gauge choice, and physical S-matrix elements are -independent by the Ward identity. For the Proca field, the propagator has no gauge-parameter freedom but acquires a singularity in the massless limit. The Stueckelberg reformulation makes the limit smooth by introducing an auxiliary scalar that absorbs the longitudinal mode into a U(1) gauge orbit. Putting these together, the photon and Proca cases form one duality framework, with gauge invariance and mass term as the two extremes: every massive vector field is, in the right variables, a Stueckelberg-formulated gauge field with the gauge parameter controlling the smoothness of the massless limit. The bridge appears again in 03.07.05 (Yang-Mills action) for the non-abelian generalisation, where the Higgs mechanism plays the role of the Stueckelberg scalar at the level of an interacting theory, and the smooth-massless-limit structure of the Standard Model traces back to exactly the same kinematic reasoning developed for the abelian case here.
Full proof set Master
Theorem (polarisation count from Wigner little group), proof. Given in the Intermediate-tier section. The argument has four steps: classify the orbits in momentum space by mass (massive timelike vs. massless null), identify the little group at a reference point on each orbit ( for massive, for massless), classify unitary irreducibles of the little group (spin- representations of dimension for ; helicity- representations of dimension for with identity translation action), and induce up from the reference momentum to a general momentum on the orbit. For spin this yields three polarisations in the massive case and two helicity states in the massless case.
Theorem (Proca-field Lorenz constraint), proof. Given in Exercise 3. Apply the Euler-Lagrange equation to the Proca Lagrangian, then take the four-divergence of the resulting equation of motion. The Maxwell-tensor part vanishes by antisymmetry; the mass term yields , which for forces .
Theorem (polarisation-sum identity for the Proca field), proof. Given in Exercise 4. Compute in the rest frame, then propagate by Lorentz covariance.
Proposition (smooth massless limit fails for the Proca propagator). The propagator does not have a finite limit as at fixed off-shell when contracted with a non-conserved external current.
Proof. Fix an off-shell momentum with and a non-conserved external four-vector with . The contraction has two pieces: $$ J^{\mu} \eta_{\mu\nu} J^\nu \cdot \frac{-i}{k^2 - m^2 + i\varepsilon} - J^{\mu} k_\mu k_\nu J^\nu \cdot \frac{-i}{m^2 (k^2 - m^2 + i\varepsilon)}. $$ The first piece has a finite limit, equal to . The second piece has the form , which diverges as unless , i.e., unless the external current is conserved. Contraction with a conserved current (satisfying on-shell) kills the dangerous piece and the limit is finite. The Stueckelberg reformulation, by contrast, has a smooth limit even for non-conserved external currents, because the auxiliary scalar carries away the would-be-divergent contribution.
Theorem (Faddeev-Popov determinant for abelian gauge theory), proof. Insert the identity into the gauge-invariant path integral, where is the gauge transform and for the gauge-fixing function . The functional determinant evaluates as $$ \Delta_{FP}[A] = \det\big(\partial^\mu \partial_\mu \big) = \det \Box, $$ which is independent of the field . The constant determinant factors out of the path integral and is absorbed into the normalisation, leaving the gauge-fixed action after exponentiating the delta function with parameter . Conclusion: for abelian gauge theory, no ghost fields appear, and the gauge-fixed propagator is the propagator.
Theorem (-gauge photon propagator), proof. Given in Exercise 8. Invert the kinetic operator via the ansatz . Matching coefficients yields and , giving .
Theorem (Gupta-Bleuler subspace), stated without proof — see Itzykson-Zuber §3-2-1 [Itzykson-Zuber 1980]. The full Gupta-Bleuler construction requires the indefinite-metric Fock space, the positive-frequency decomposition of the field, the Bleuler condition on physical states, and the proof that the physical/null quotient is positive-definite with two degrees of freedom per momentum. The construction is canonical-side complement to the Faddeev-Popov path-integral construction at the operator level and is documented in detail in Itzykson-Zuber Ch. 3 [Itzykson-Zuber 1980].
Theorem (Stueckelberg gauge invariance), proof. Given in Exercise 5. The Stueckelberg Lagrangian is invariant under the joint transformation , because the combination is itself invariant, and is built from gauge-invariant antisymmetric derivatives of . The unitary gauge recovers the Proca Lagrangian, and the 't Hooft gauge recovers the Stueckelberg propagator with a smooth limit.
Connections Master
Bosonic Fock space and second quantisation
12.13.01. The one-particle Hilbert spaces for the photon and the Proca field are both built from the bosonic Fock-space construction of unit 12.13.01, with one-particle space taken to be the appropriate Wigner irreducible (massless helicity- for the photon, massive spin-1 for Proca). The free Maxwell and Proca fields are operator-valued tempered distributions on these Fock spaces, with their canonical commutation relations inherited from the underlying CCR algebra of unit 12.13.01.Maxwell's equations in differential form
10.04.01. The classical Lagrangian of the free Maxwell field is exactly the classical Maxwell theory in covariant form, whose differential-form repackaging is the subject of unit 10.04.01. The quantum theory developed here promotes the classical to an operator-valued tempered distribution, but preserves the gauge structure and the equations of motion at the level of operator equations. The classical-to-quantum bridge is canonical quantisation plus gauge-fixing, exactly as developed above.Covariant electrodynamics with the Faraday tensor
10.06.01. The covariant tensor formalism for the classical Maxwell theory developed in unit 10.06.01 is the algebraic framework within which the quantum theory is constructed here. The polarisation vectors are the operator-side incarnation of the polarisation of classical electromagnetic waves; the Faraday tensor promotes to an operator-valued antisymmetric tensor field; and the Lorenz / Coulomb gauge choices have classical-side counterparts in the standard gauge-fixing procedures of classical electrodynamics.Yang-Mills action
03.07.05. The non-abelian generalisation of the free Maxwell theory replaces U(1) by a non-abelian compact Lie group and introduces a genuine Faddeev-Popov ghost contribution because the determinant is no longer field-independent. The polarisation count and the gauge-removal logic developed here for the abelian case generalise directly: gluons and electroweak gauge bosons have exactly two physical helicities each (in the massless limit), and the Higgs mechanism plays the Stueckelberg role at the non-abelian level. The free abelian construction here is the pedagogical baseline; the Yang-Mills unit is the genuine application.Dirac equation and relativistic spin
12.11.01. The Dirac equation provides the spin- companion to the spin- analysis presented here. The Wigner classification used to fix the polarisation count of the photon and the Proca field applies equally to the Dirac field, where the massive little group representation of spin has dimension , and the massless limit has helicities (the Weyl spinors). The QED Lagrangian couples the Dirac field to the Maxwell field via the gauge-covariant derivative, and the constraint analysis of the resulting theory builds on both unit 12.11.01 and the present unit.
Historical & philosophical context Master
The construction of a quantum theory of the electromagnetic field began with Dirac's 1927 paper The quantum theory of the emission and absorption of radiation (Proc. Roy. Soc. A 114, 243 (1927)) [Stueckelberg 1938], which introduced the creation and annihilation operators of the radiation field and quantised the Maxwell theory at the operator level. The classical Lagrangian formulation as traces to Pauli and Weisskopf's 1934 reformulation. The Lorentz-covariant indefinite-metric quantisation was assembled in two parallel papers by Suraj Gupta (Proc. Phys. Soc. A 63, 681 (1950)) [Gupta 1950] and Konrad Bleuler (Helv. Phys. Acta 23, 567 (1950)) [Bleuler 1950], yielding what is now called Gupta-Bleuler quantisation. The Coulomb-gauge alternative was developed by Fermi as early as 1932 and refined by Schwinger and others through the 1940s; the trade-off between manifest covariance (Gupta-Bleuler) and manifest unitarity (Coulomb) became a recurring theme of gauge-theory quantisation.
The massive vector field has a distinct genealogy. Alexandru Proca introduced the equation together with the Lorenz constraint in three 1936 Comptes Rendus notes and a Journal de Physique article (J. Phys. Radium 7, 347 (1936)) [Proca 1936], originally intended as a relativistic spin-1 candidate for the electron (which the Dirac equation already covered). The Proca equation was reinterpreted in the late 1930s as the relativistic equation for a massive spin-1 particle and applied to nuclear physics by Yukawa (1935) and Kemmer (1938). Ernst Stueckelberg's 1938 reformulation introducing an auxiliary scalar to restore U(1) gauge invariance (Helv. Phys. Acta 11, 225 (1938)) [Stueckelberg 1938] anticipated by two decades the Brout-Englert-Higgs mechanism (1964) and the Glashow-Weinberg-Salam unification of the electroweak interactions (1967-68). The modern path-integral quantisation via Faddeev-Popov determinants (Faddeev-Popov, Phys. Lett. B 25, 29 (1967) [Faddeev-Popov 1967]) reunified the Maxwell and Yang-Mills cases under one technique. The Wigner-classification underpinning of the photon and the Proca field was established in Wigner's 1939 Ann. Math. paper [Wigner 1939] and absorbed into the standard QFT presentations of Weinberg (Quantum Theory of Fields, Vol. 1, Ch. 5 (1995)) [Weinberg 1995] and Woit (Quantum Theory, Groups and Representations, Ch. 46 (2017)).
Bibliography Master
@article{Dirac1927Radiation,
author = {Dirac, P. A. M.},
title = {The Quantum Theory of the Emission and Absorption of Radiation},
journal = {Proc. Roy. Soc. A},
volume = {114},
year = {1927},
pages = {243--265}
}
@article{Proca1936,
author = {Proca, Alexandru},
title = {Sur la th{\'e}orie ondulatoire des {\'e}lectrons positifs et n{\'e}gatifs},
journal = {J. Phys. Radium},
volume = {7},
year = {1936},
pages = {347--353}
}
@article{Stueckelberg1938,
author = {Stueckelberg, E. C. G.},
title = {Die Wechselwirkungskr{\"a}fte in der Elektrodynamik und in der Feldtheorie der Kernkr{\"a}fte},
journal = {Helv. Phys. Acta},
volume = {11},
year = {1938},
pages = {225--244, 299--328, 312--328}
}
@article{Wigner1939,
author = {Wigner, Eugene P.},
title = {On Unitary Representations of the Inhomogeneous Lorentz Group},
journal = {Ann. Math.},
volume = {40},
year = {1939},
pages = {149--204}
}
@article{Gupta1950,
author = {Gupta, Suraj N.},
title = {Theory of Longitudinal Photons in Quantum Electrodynamics},
journal = {Proc. Phys. Soc. A},
volume = {63},
year = {1950},
pages = {681--691}
}
@article{Bleuler1950,
author = {Bleuler, Konrad},
title = {Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen},
journal = {Helv. Phys. Acta},
volume = {23},
year = {1950},
pages = {567--586}
}
@article{FaddeevPopov1967,
author = {Faddeev, L. D. and Popov, V. N.},
title = {Feynman Diagrams for the {Y}ang-{M}ills Field},
journal = {Phys. Lett. B},
volume = {25},
year = {1967},
pages = {29--30}
}
@book{Weinberg1995QTFV1,
author = {Weinberg, Steven},
title = {The Quantum Theory of Fields, Vol. 1: Foundations},
publisher = {Cambridge University Press},
year = {1995}
}
@book{ItzyksonZuber1980,
author = {Itzykson, Claude and Zuber, Jean-Bernard},
title = {Quantum Field Theory},
publisher = {McGraw-Hill},
year = {1980}
}
@book{PeskinSchroeder1995,
author = {Peskin, Michael E. and Schroeder, Daniel V.},
title = {An Introduction to Quantum Field Theory},
publisher = {Addison-Wesley},
year = {1995}
}
@book{Srednicki2007,
author = {Srednicki, Mark},
title = {Quantum Field Theory},
publisher = {Cambridge University Press},
year = {2007}
}
@book{Woit2017,
author = {Woit, Peter},
title = {Quantum Theory, Groups and Representations: An Introduction},
publisher = {Springer},
year = {2017}
}
@article{Dirac1950Constrained,
author = {Dirac, P. A. M.},
title = {Generalized {H}amiltonian Dynamics},
journal = {Can. J. Math.},
volume = {2},
year = {1950},
pages = {129--148}
}