12.05.06 · quantum / angular-momentum

Free Maxwell / massive vector fields; photon and Proca

shipped3 tiersLean: nonepending prereqs

Anchor (Master): Weinberg, S., *The Quantum Theory of Fields, Vol. 1: Foundations* (Cambridge, 1995), §5.3 (massive vector fields), §5.9 (causal vector fields; photon and Proca); Itzykson, C. & Zuber, J.-B., *Quantum Field Theory* (McGraw-Hill, 1980), Ch. 3 §3-2 (canonical quantisation of the electromagnetic field in Gupta-Bleuler and Coulomb gauges); Woit, P., *Quantum Theory, Groups and Representations* (Springer, 2017), Ch. 46 (massless spin-1 representations and the photon); Stueckelberg, E. C. G., *Helv. Phys. Acta* 11, 225 (1938); Proca, A., *J. Phys. Radium* 7, 347 (1936)

Intuition Beginner

Every classical wave has polarisation — a direction in which the wave wiggles. A light beam from the sun, looked at carefully, is a superposition of two orthogonal transverse polarisations: the electric field can swing left-right or up-down, but never along the direction of travel. Polaroid sunglasses pass one polarisation and absorb the other. That two-state polarisation freedom is the classical face of what quantum field theory calls the photon's two helicity modes.

Now imagine running the same picture for a hypothetical massive cousin of the photon — a particle that looks like light but moves slower than light and carries rest mass. A massive spin-1 particle has three polarisations, not two: the two transverse modes plus a longitudinal one. The longitudinal mode wiggles along the direction of travel, the mode that the massless photon does not have. The W and Z bosons of the electroweak theory are real-world examples of this massive cousin; the Proca field is the textbook model.

Why does the photon have only two polarisations while a massive vector has three? The answer is that the photon is constrained to travel at the speed of light, and at the speed of light the longitudinal mode is gauge — a redundancy that can be removed by a U(1) rotation of the phase, with no physical content. The massive vector cannot use this escape route, so its longitudinal mode is real and physical.

Visual Beginner

A side-by-side schematic of polarisation vectors for a photon and for a massive vector. On the left, a photon travelling along the $z$ -axis with two transverse polarisation arrows in the $x y$ -plane and a third dashed arrow along $z$ marked as "gauge — not physical". On the right, a massive vector travelling along the $z$ -axis with the same two transverse arrows plus a solid third arrow along $z$ marked as "longitudinal — physical".

The picture captures the central count: 2 polarisations for the photon, 3 for the massive vector. The smooth limit as the mass goes to zero is subtle: the longitudinal mode of the massive vector does not disappear by going soft, but rather decouples by becoming pure gauge. That decoupling is the content of the Stueckelberg reformulation discussed in the higher tiers.

Worked example Beginner

Count the polarisations of a photon of momentum along the $z$ -axis directly. Use units in which the speed of light is one.

Step 1. The photon momentum four-vector is $k = (k, 0, 0, k)$ , with energy $k$ matching the spatial momentum magnitude $k$ because the photon has zero rest mass. A polarisation vector is a four-vector $ϵ^{μ}$ that tells us in which spacetime direction the electromagnetic field wiggles.

Step 2. There are four candidate independent four-vectors at each momentum: one along time $ϵ_{(0)} = (1, 0, 0, 0)$ , one along $x$ $ϵ_{(1)} = (0, 1, 0, 0)$ , one along $y$ $ϵ_{(2)} = (0, 0, 1, 0)$ , and one along $z$ $ϵ_{(3)} = (0, 0, 0, 1)$ .

Step 3. The Lorenz condition $k_{μ} ϵ^{μ} = 0$ keeps only three of these four. With $k = (k, 0, 0, k)$ the condition reads $k ϵ^{0} - k ϵ^{3} = 0$ , which forces $ϵ^{0} = ϵ^{3}$ on every surviving polarisation. The two purely transverse choices $ϵ_{(1)}, ϵ_{(2)}$ pass with $ϵ^{0} = ϵ^{3} = 0$ ; the time-like and the longitudinal ones merge into one combination $ϵ^{0} = ϵ^{3}$ .

Step 4. The remaining gauge freedom $ϵ^{μ} \to ϵ^{μ} + λ k^{μ}$ for any number $λ$ removes the surviving time-like or longitudinal mode. Setting $λ = - ϵ^{0} / k$ kills the time-component and the longitudinal component simultaneously. Two physical polarisations remain: the two transverse ones $ϵ_{(1)}, ϵ_{(2)}$ .

Step 5. Repeat the count for a massive vector of mass $m$ and momentum $k = (k^{2} + m^{2}, 0, 0, k)$ . The Lorenz condition is now imposed as a dynamical equation of motion rather than a gauge choice — it kills one of the four candidates, leaving three. There is no further gauge freedom because the mass term $m^{2} A^{μ} A_{μ}$ in the Lagrangian breaks the U(1) symmetry that would have done the killing. Three physical polarisations remain: the two transverse ones plus a longitudinal one.

What this tells us: the count of polarisations follows from Lorentz invariance plus one of two possibilities — gauge invariance for the massless photon (which removes one mode), or a mass term for the massive vector (which adds the longitudinal mode that the photon does not have). The number of physical polarisations is 2 for the photon and 3 for the massive vector, and the count is locked by representation theory of the Poincare group: the photon is a representation of the massless little group ISO(2), and the massive vector is a representation of the massive little group SO(3).

Check your understanding Beginner

Exercise (easy, true-false).

True or false: in the limit $m \to 0$ , the longitudinal mode of the Proca field simply vanishes and the massive vector smoothly becomes the photon with two polarisations.

Hint

The longitudinal mode of the massive vector does not vanish smoothly; it decouples by becoming a pure-gauge degree of freedom. The Stueckelberg reformulation makes this decoupling explicit.

Answer

False.

The longitudinal mode of the Proca field does not vanish in the massless limit. Instead, it becomes a pure-gauge mode (in the Stueckelberg reformulation, it is absorbed into an auxiliary scalar field), so it decouples from physical observables while not literally disappearing from the action. The smooth $m \to 0$ limit recovers the photon's two transverse polarisations only after the longitudinal mode is removed by the restored U(1) gauge invariance.

Formal definition Intermediate+

Conventions. Spacetime is four-dimensional Minkowski with metric $η_{μν} = diag (+ 1, - 1, - 1, - 1)$ . Greek indices $μ, ν, \dots$ run over ${0, 1, 2, 3}$ ; Latin indices $i, j, \dots$ over ${1, 2, 3}$ . Units have $ℏ = c = 1$ . We follow the sign convention of Weinberg ^{[Weinberg 1995 §5.3]}.

The free Maxwell field is the operator-valued tempered distribution $A_{μ} (x)$ on Minkowski spacetime built from the classical action $$ S_{\text{Maxwell}}[A] = -\tfrac{1}{4} \int d^4 x ; F_{\mu\nu} F^{\mu\nu}, \qquad F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. $$ The action is invariant under the U(1) gauge transformation $A_{μ} \to A_{μ} + \partial_{μ} χ$ for any smooth scalar function $χ$ . The Euler-Lagrange equation is $\partial^{μ} F_{μν} = 0$ , i.e. the source-free Maxwell equations.

The free Proca field of mass $m > 0$ is the operator-valued tempered distribution $B_{μ} (x)$ built from the action $$ S_{\text{Proca}}[B] = \int d^4 x ; \big( -\tfrac{1}{4} G_{\mu\nu} G^{\mu\nu} + \tfrac{1}{2} m^2 B_\mu B^\mu \big), \qquad G_{\mu\nu} = \partial_\mu B_\nu - \partial_\nu B_\mu. $$ The mass term $m^{2} B_{μ} B^{μ}$ breaks the U(1) gauge invariance present in the Maxwell case. The Euler-Lagrange equation is $\partial^{μ} G_{μν} + m^{2} B_{ν} = 0$ . Taking the divergence yields $m^{2} \partial^{ν} B_{ν} = 0$ , so $\partial^{ν} B_{ν} = 0$ follows as a constraint for $m \neq = 0$ , and the equation reduces to $(□ + m^{2}) B_{ν} = 0$ with the additional constraint $\partial^{ν} B_{ν} = 0$ .

A polarisation four-vector at on-shell momentum $k^{μ}$ is a complex four-vector $ϵ^{μ} (k, λ)$ that satisfies (i) $k_{μ} ϵ^{μ} (k, λ) = 0$ (the Lorenz / transversality condition) and (ii) $ϵ_{μ}^{*} (k, λ) ϵ^{μ} (k, λ^{'}) = - δ_{λ λ^{'}}$ (normalisation). The polarisation index $λ$ runs over ${+ 1, - 1}$ for the photon (helicity) and ${+ 1, 0, - 1}$ for the Proca field ( $z$ -component of spin in the rest frame, Wigner-rotated to the chosen frame).

The mode expansion of the photon field in Lorenz gauge reads $$ A_\mu(x) = \int \frac{d^3 k}{(2\pi)^3 \sqrt{2 \omega_k}} \sum_{\lambda = \pm 1} \big( a(k, \lambda) \epsilon_\mu(k, \lambda) e^{-i k \cdot x} + a^\dagger(k, \lambda) \epsilon^*_\mu(k, \lambda) e^{i k \cdot x} \big), $$ where $ω_{k} = ∣ k ∣$ on the photon mass shell and $a, a^{†}$ are the bosonic creation and annihilation operators on the photon Fock space, satisfying $[a (k, λ), a^{†} (k^{'}, λ^{'})] = (2 π)^{3} δ^{(3)} (k - k^{'}) δ_{λ λ^{'}}$ . The Proca mode expansion is identical in form, but the polarisation sum runs over the three labels $λ \in {+ 1, 0, - 1}$ and the energy is $ω_{k} = k^{2} + m^{2}$ .

The polarisation-sum identity records the projector onto physical polarisations at fixed on-shell momentum. For the Proca field of mass $m$ , $$ \sum_{\lambda = -1}^{+1} \epsilon^{\mu}(k, \lambda) \epsilon^\nu(k, \lambda) = -\eta^{\mu\nu} + \frac{k^\mu k^\nu}{m^2}, \qquad k^2 = m^2. $$ For the photon, the analogous identity depends on the gauge: in Lorenz gauge with an auxiliary lightlike four-vector $n^{μ}$ , $$ \sum_{\lambda = \pm 1} \epsilon^{\mu}(k, \lambda) \epsilon^\nu(k, \lambda) = -\eta^{\mu\nu} + \frac{k^\mu n^\nu + k^\nu n^\mu}{k \cdot n} - \frac{n^2 k^\mu k^\nu}{(k \cdot n)^2}, \qquad k^2 = 0. $$ The $n$ -dependent terms are gauge-dependent, but contract to zero against any conserved current (Ward identity), so they do not affect physical amplitudes.

Counterexamples to common slips

The condition $\partial^{μ} A_{μ} = 0$ is a gauge choice for the photon (one can change gauge to violate it) but a constraint equation of motion for the Proca field (it follows from the action and cannot be relaxed without changing the theory). Confusing the two is the cleanest version of misunderstanding the photon-Proca distinction.
The polarisation-sum identity $\sum_{λ} ϵ^{* μ} ϵ^{ν} = - η^{μν}$ (without correction terms) holds for off-shell virtual photons inside Feynman propagators in Feynman gauge $ξ = 1$ . On-shell physical photons require the corrected formula with the $n^{μ}$ auxiliary vector; using the naive formula to a physical external line introduces spurious contributions from unphysical polarisations.
Dirac's constraint analysis classifies $A_{0}$ as a Lagrange multiplier (its time derivative does not appear in the action) and Gauss's law $\nabla \cdot E = 0$ as a first-class constraint that generates the residual U(1) gauge transformations. In the Proca case, $B_{0}$ is not a Lagrange multiplier: $\partial_{0} B_{0}$ appears in $G_{0 i}$ , and Gauss-like equation is second-class. The constraint structure differs sharply between the two cases.

Key theorem with proof Intermediate+

Theorem (polarisation count from the Wigner little group; Weinberg §5.3, §5.9). Let $V_{m}$ be the unitary irreducible representation of the Poincare group corresponding to a one-particle state of mass $m \geq 0$ and integer spin $j = 1$ . Then

(i) For $m > 0$ , the little group is $S O (3)$ and the spin- $1$ representation has dimension $2 j + 1 = 3$ . The one-particle Hilbert space at fixed on-shell momentum carries three physical polarisations, labelled by $S_{z} \in {+ 1, 0, - 1}$ in the rest frame.

(ii) For $m = 0$ , the little group is $I S O (2)$ (the Euclidean group of the plane). Finite-dimensional unitary irreducibles of $I S O (2)$ in which the translation generators act as the identity are labelled by an integer helicity $h \in Z$ , each one-dimensional. The photon assembles the helicities $h = + 1$ and $h = - 1$ into a CPT-invariant two-state representation. The one-particle Hilbert space at fixed null momentum carries two physical polarisations.

Proof. Wigner's classification (Wigner Ann. Math. 40, 149 (1939) ^{[Wigner 1939]}) builds unitary irreducibles of the Poincare group from a momentum orbit plus a little-group representation. Fix a reference momentum $k_{0}^{μ}$ on the relevant mass-shell orbit, the little group $L_{k_{0}}$ is the subgroup of the Lorentz group fixing $k_{0}^{μ}$ , and the irreducible is induced from a unitary irreducible of $L_{k_{0}}$ .

Step 1: massive case. For $m > 0$ , choose $k_{0}^{μ} = (m, 0, 0, 0)$ , the rest-frame momentum. The little group fixing $k_{0}$ is the rotation subgroup $S O (3) \subset S O (1, 3)$ . Unitary irreducibles of $S O (3)$ are labelled by half-integer spin $j \in \frac{1}{2} Z_{\geq 0}$ and have dimension $2 j + 1$ . For $j = 1$ , the dimension is $3$ . The induced representation $V_{m, 1}$ on a general momentum $k$ is obtained by acting with a standard boost $L (k)$ that takes $k_{0}$ to $k$ ; the little-group dimension at $k_{0}$ propagates to dimension $3$ at every $k$ on the mass-shell. This gives the three physical polarisations of the Proca field, labelled by $S_{z} \in {+ 1, 0, - 1}$ .

Step 2: massless case. For $m = 0$ , choose $k_{0}^{μ} = (ω, 0, 0, ω)$ , a reference null momentum along the positive $z$ -axis. The little group is the subgroup of $S O (1, 3)$ fixing this null vector. Direct calculation: a Lorentz transformation $Λ$ fixes $k_{0}$ iff its $4 \times 4$ matrix has the form $$ \Lambda = \begin{pmatrix} 1 + \tfrac{1}{2}|\alpha|^2 & \alpha^1 & \alpha^2 & -\tfrac{1}{2}|\alpha|^2 \ \alpha^1 & 1 & 0 & -\alpha^1 \ \alpha^2 & 0 & 1 & -\alpha^2 \ \tfrac{1}{2}|\alpha|^2 & \alpha^1 & \alpha^2 & 1 - \tfrac{1}{2}|\alpha|^2 \end{pmatrix} R(\theta), $$ where $α = (α^{1}, α^{2}) \in R^{2}$ and $R (θ)$ is a rotation by angle $θ$ in the $x y$ -plane. The matrix on the left is a parabolic Lorentz transformation parametrised by $α$ , and the rotation $R (θ)$ commutes with it modulo the $α$ -action. The group law is the semidirect product $I S O (2) = S O (2) ⋉ R^{2}$ of rotations of the plane with translations of the plane. So the massless little group is $I S O (2)$ .

Step 3: representations of $I S O (2)$ . Unitary irreducibles of $I S O (2)$ are classified by the action of the translation generators $T_{1}, T_{2} \in R^{2}$ . There are two families:

(a) Translations act non-vanishingly. The irreducible is induced from a character of the translation subgroup with continuous parameter $∣ t ∣ > 0$ , and is infinite-dimensional. These representations correspond to particles with a continuous internal degree of freedom (a "continuous spin") that has never been observed in nature; massless one-particle states of finite-dimensional internal structure exclude them.

(b) Translations act as the identity. The representation factors through $I S O (2) / R^{2} = S O (2)$ , and unitary irreducibles of $S O (2)$ are one-dimensional, labelled by helicity $h \in Z$ (or half-integer $h \in \frac{1}{2} Z$ for projective representations, accessible after passing to the double cover).

The photon is the case $∣ h ∣ = 1$ , with $h = + 1$ and $h = - 1$ joined by CPT into a single two-state representation. The two states are exactly the left- and right-circularly polarised photons of classical optics. There is no $h = 0$ state because $h = 0$ would be a scalar, not a vector; and there is no $∣ h ∣ = 2, 3, \dots$ for the photon because the photon transforms as a four-vector $A_{μ}$ , and four-vectors carry only $h = \pm 1$ helicities at the massless level.

Step 4: from little-group representation to polarisation vectors. The polarisation vectors $ϵ^{μ} (k, λ)$ are the matrix elements of the embedding $V_{m, j} ↪ C^{4}$ as the four-vector index $μ$ runs over ${0, 1, 2, 3}$ and $λ$ labels the little-group representation. For the massive case at $k_{0} = (m, 0, 0, 0)$ , the standard choice is $ϵ_{(1)}^{μ} = (0, 1, 0, 0)$ , $ϵ_{(2)}^{μ} = (0, 0, 1, 0)$ , $ϵ_{(3)}^{μ} = (0, 0, 0, 1)$ . For the massless case at $k_{0} = (ω, 0, 0, ω)$ , the two helicities are $ϵ_{(\pm)}^{μ} = (0, 1, \pm i, 0) / 2$ . Boosting to general $k$ gives the polarisation four-vectors at every on-shell momentum, related by Wigner rotations to the reference choice. $□$

Bridge. The polarisation count builds toward the entire gauge-theoretic structure of quantum field theory and appears again in 03.07.05 (Yang-Mills action) as the non-abelian generalisation of the same construction. The foundational reason the count is fixed is exactly the Wigner little group: the photon is locked to two helicities by the massless little group ISO(2), and the massive vector is locked to three polarisations by the massive little group SO(3). The central insight is that putting these together produces the gauge-invariance principle as a consequence of representation theory rather than as an additional postulate: a massless spin-1 field that transforms as a Lorentz four-vector necessarily carries a U(1) gauge redundancy, because the four-vector representation of the Lorentz group restricts to a reducible representation of the massless little group, and the unphysical components must be removed by gauge. This is exactly the bridge between Lorentz covariance and gauge invariance, and it identifies gauge symmetry with the kinematic requirement that a Lorentz-covariant massless spin-1 field describe only the physical helicity- $\pm 1$ modes. The Stueckelberg trick generalises the same picture to the massive case, embedding the longitudinal mode into a U(1) gauge orbit at the cost of one auxiliary scalar field.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Derive the Proca equation $\partial^{μ} G_{μν} + m^{2} B_{ν} = 0$ from the Proca Lagrangian $L = - \frac{1}{4} G_{μν} G^{μν} + \frac{1}{2} m^{2} B_{μ} B^{μ}$ and show that it implies the Lorenz constraint $\partial^{ν} B_{ν} = 0$ for $m \neq = 0$ .

Hint

Apply the Euler-Lagrange equations for the field $B_{ν}$ , treating $G_{μν} = \partial_{μ} B_{ν} - \partial_{ν} B_{μ}$ as the field strength. Then take the four-divergence of the resulting equation and use the antisymmetry of $G_{μν}$ .

Answer

The Euler-Lagrange equation for $B_{ν}$ is $$ \partial_\mu \frac{\partial \mathcal{L}}{\partial (\partial_\mu B_\nu)} - \frac{\partial \mathcal{L}}{\partial B_\nu} = 0. $$ Compute $\partial L / \partial (\partial_{μ} B_{ν}) = - G^{μν}$ (from the $- \frac{1}{4} G \cdot G$ term, using $G^{μν} = \partial^{μ} B^{ν} - \partial^{ν} B^{μ}$ antisymmetry) and $\partial L / \partial B_{ν} = m^{2} B^{ν}$ . So $$ -\partial_\mu G^{\mu\nu} - m^2 B^\nu = 0, \quad \text{i.e.} \quad \partial^\mu G_{\mu\nu} + m^2 B_\nu = 0. $$ Now take the four-divergence $\partial^{ν}$ of both sides. The first term is $\partial^{ν} \partial^{μ} G_{μν} = 0$ because $G_{μν}$ is antisymmetric and partials commute (contracting a symmetric tensor with an antisymmetric one gives zero). The second term is $m^{2} \partial^{ν} B_{ν}$ . So $m^{2} \partial^{ν} B_{ν} = 0$ , and for $m \neq = 0$ the Lorenz constraint $\partial^{ν} B_{ν} = 0$ follows automatically. Substituting back into the equation of motion yields $(□ + m^{2}) B_{ν} = 0$ with the constraint $\partial^{ν} B_{ν} = 0$ . Rubric: full credit for the EL derivation, the four-divergence, and the deduction of the constraint.

Exercise 4 (medium, short-answer).

Show that the polarisation-sum identity $\sum_{λ = - 1}^{+ 1} ϵ^{* μ} (k, λ) ϵ^{ν} (k, λ) = - η^{μν} + k^{μ} k^{ν} / m^{2}$ holds for a massive vector of mass $m > 0$ at on-shell momentum $k$ .

Hint

Work in the rest frame $k = (m, 0, 0, 0)$ and use the standard polarisation vectors $ϵ_{(1)}^{μ} = (0, 1, 0, 0)$ , $ϵ_{(2)}^{μ} = (0, 0, 1, 0)$ , $ϵ_{(3)}^{μ} = (0, 0, 0, 1)$ . The general-frame result follows by Lorentz covariance.

Answer

In the rest frame $k = (m, 0, 0, 0)$ , the polarisation sum is $$ \sum_{\lambda = 1}^{3} \epsilon^{*\mu}{(\lambda)} \epsilon^\nu{(\lambda)} = \delta_{\mu 1} \delta_{\nu 1} + \delta_{\mu 2} \delta_{\nu 2} + \delta_{\mu 3} \delta_{\nu 3} = \begin{pmatrix} 0 & 0 & 0 & 0 \ 0 & 1 & 0 & 0 \ 0 & 0 & 1 & 0 \ 0 & 0 & 0 & 1 \end{pmatrix}. $$ Compare with $- η^{μν} + k^{μ} k^{ν} / m^{2}$ in the rest frame: $- η^{μν} = diag (- 1, 1, 1, 1)$ and $k^{μ} k^{ν} / m^{2} = diag (1, 0, 0, 0)$ , so the right-hand side equals $diag (0, 1, 1, 1)$ . The two sides agree in the rest frame. Both sides are Lorentz-covariant (the left because the polarisation vectors transform covariantly under Wigner rotations, the right because $η^{μν}$ and $k^{μ} k^{ν}$ are manifestly so), and they agree in one frame, so they agree in every frame. Rubric: full credit for the rest-frame computation plus the Lorentz-covariance argument.

Exercise 5 (medium, short-answer).

State the Stueckelberg reformulation of the Proca field and show that it restores a U(1) gauge invariance.

Hint

Introduce an auxiliary scalar field $ϕ$ and replace $B_{μ} \to B_{μ} + (1/ m) \partial_{μ} ϕ$ in the Proca Lagrangian. The new Lagrangian should be invariant under $B_{μ} \to B_{μ} + \partial_{μ} χ$ , $ϕ \to ϕ - m χ$ .

Answer

The Stueckelberg Lagrangian augments the Proca field with an auxiliary scalar $ϕ$ : $$ \mathcal{L}{\text{Stueck}} = -\tfrac{1}{4} G{\mu\nu} G^{\mu\nu} + \tfrac{1}{2} m^2 \big(B_\mu + (1/m) \partial_\mu \phi\big)\big(B^\mu + (1/m) \partial^\mu \phi\big). $$ Expanding: $L_{Stueck} = - \frac{1}{4} G \cdot G + \frac{1}{2} m^{2} B \cdot B + m B^{μ} \partial_{μ} ϕ + \frac{1}{2} (\partial ϕ)^{2}$ . Check the gauge transformation $B_{μ} \to B_{μ} + \partial_{μ} χ$ , $ϕ \to ϕ - m χ$ . The kinetic $G \cdot G$ is invariant because $G_{μν}$ depends only on $\partial_{μ} B_{ν} - \partial_{ν} B_{μ}$ , which is U(1)-gauge invariant. The combination $m B_{μ} + \partial_{μ} ϕ$ transforms as $m B_{μ} + m \partial_{μ} χ + \partial_{μ} ϕ - m \partial_{μ} χ = m B_{μ} + \partial_{μ} ϕ$ , invariant. So $L_{Stueck}$ is invariant under the joint transformation. The Stueckelberg gauge $ϕ = 0$ recovers the Proca Lagrangian. The unitary gauge eliminates $ϕ$ from the spectrum at the cost of breaking the gauge invariance, and shows that the Stueckelberg formulation and the original Proca formulation describe the same physics. Rubric: full credit for the Lagrangian, the gauge transformation, and the demonstration of invariance.

Exercise 7 (hard, short-answer).

Perform the Dirac constraint analysis of the free Maxwell theory and identify Gauss's law as a first-class constraint.

Hint

The canonical momenta are $π^{μ} = \partial L / \partial (\partial_{0} A_{μ})$ . The momentum conjugate to $A_{0}$ vanishes identically, signalling a primary constraint. Consistency of the primary constraint under time evolution generates a secondary constraint, which is Gauss's law. Classify the constraints by Poisson-bracket structure.

Answer

Start from $L = - \frac{1}{4} F_{μν} F^{μν}$ and compute the canonical momenta: $$ \pi^\mu = \frac{\partial \mathcal{L}}{\partial (\partial_0 A_\mu)} = -F^{0\mu}. $$ For $μ = 0$ : $π^{0} = - F^{00} = 0$ identically — a primary constraint $Φ_{1} \equiv π^{0} \approx 0$ . For $μ = i$ : $π^{i} = - F^{0 i} = E^{i}$ , the electric field.

The Hamiltonian density is $H = π^{μ} \partial_{0} A_{μ} - L = \frac{1}{2} (E^{2} + B^{2}) + E^{i} \partial_{i} A_{0} - A_{0} (\nabla \cdot E)$ , after integration by parts on the constraint surface. Total Hamiltonian with Lagrange multiplier $u$ for the primary constraint: $$ H_T = \int d^3 x , \big[ \tfrac{1}{2}(E^2 + B^2) - A_0 (\nabla \cdot E) + u \pi^0 \big]. $$ Consistency of $Φ_{1} = π^{0} \approx 0$ under time evolution: $\dot{Φ}_{1} = {π^{0}, H_{T}} = \nabla \cdot E$ . Demanding $\dot{Φ}_{1} \approx 0$ produces the secondary constraint $Φ_{2} \equiv \nabla \cdot E \approx 0$ , which is Gauss's law. Checking consistency of $Φ_{2}$ : $\dot{Φ}_{2} = {\nabla \cdot E, H_{T}}$ vanishes identically on the constraint surface, so the cascade terminates.

Compute the Poisson brackets: ${Φ_{1} (x), Φ_{2} (y)} = {π^{0} (x), \nabla \cdot E (y)} = 0$ (since $π^{0}$ is the momentum conjugate to $A_{0}$ , not to any spatial component, and $\nabla \cdot E$ involves only spatial fields and their momenta). Both constraints have vanishing brackets with each other and with themselves, so both are first-class. First-class constraints generate gauge transformations: $Φ_{1}$ generates shifts of $A_{0}$ , and $Φ_{2}$ generates the standard U(1) gauge transformation of the spatial components. The physical phase space is reduced by $2 \times 2 = 4$ degrees of freedom (each first-class constraint kills two phase-space directions), leaving $8 - 4 = 4$ physical degrees of freedom per spacetime point, i.e., two physical polarisations $\times$ (position, momentum). Rubric: full credit for the primary constraint, the Gauss-law secondary, the first-class classification, and the count of physical degrees of freedom.

Exercise 8 (hard, short-answer).

Derive the photon propagator in $R_{ξ}$ -gauge from the gauge-fixed Lagrangian $L_{R_{ξ}} = - \frac{1}{4} F_{μν} F^{μν} - (1/2 ξ) (\partial^{μ} A_{μ})^{2}$ .

Hint

Combine the two terms, integrate by parts, and invert the resulting kinetic operator in momentum space. Use the identity $- \frac{1}{4} F_{μν} F^{μν} = - \frac{1}{2} A^{μ} (η_{μν} □ - \partial_{μ} \partial_{ν}) A^{ν}$ after integration by parts.

Answer

After integration by parts, the gauge-fixed Lagrangian becomes $$ \mathcal{L}{R\xi} = -\tfrac{1}{2} A^\mu \big[ \eta_{\mu\nu} \Box - \partial_\mu \partial_\nu + (1/\xi) \partial_\mu \partial_\nu \big] A^\nu = -\tfrac{1}{2} A^\mu \big[ \eta_{\mu\nu} \Box - (1 - 1/\xi) \partial_\mu \partial_\nu \big] A^\nu. $$ In momentum space, the kinetic operator is $K_{μν} (k) = - k^{2} η_{μν} + (1 - 1/ ξ) k_{μ} k_{ν}$ . The propagator is the inverse: solve $K_{μν} D^{ν ρ} = - i δ_{μ}^{ρ}$ (with the standard sign convention). Try the ansatz $D^{ν ρ} (k) = - i (a η^{ν ρ} + b k^{ν} k^{ρ}) / (k^{2} + i ε)$ . Substituting and matching:

Coefficient of $η_{μ}^{ρ}$ : $- k^{2} \cdot a / (- k^{2}) = a = 1$ , so $a = 1$ .
Coefficient of $k_{μ} k^{ρ}$ : $- k^{2} \cdot b + a (1 - 1/ ξ) k_{μ} k^{ρ}$ projected onto $k_{μ} k^{ρ}$ at fixed $k^{2}$ . Working this out: the longitudinal part has eigenvalue $- k^{2} / ξ$ for the operator $K$ , so its inverse has eigenvalue $- ξ / k^{2}$ . Matching gives $b = (ξ - 1) / k^{2}$ . Wait — to keep the expression manifestly free of double poles, rewrite as $D^{ν ρ} (k) = - i [η^{ν ρ} - (1 - ξ) k^{ν} k^{ρ} / k^{2}] / (k^{2} + i ε)$ .

So the $R_{ξ}$ -gauge propagator is $$ D^F_{\mu\nu}(k) = \frac{-i}{k^2 + i\varepsilon} \big[ \eta_{\mu\nu} - (1 - \xi) \frac{k_\mu k_\nu}{k^2} \big]. $$ Special cases: Feynman gauge $ξ = 1$ gives $D_{μν}^{F} = - i η_{μν} / (k^{2} + i ε)$ , the simplest. Landau gauge $ξ = 0$ gives $D_{μν}^{F} = - i (η_{μν} - k_{μ} k_{ν} / k^{2}) / (k^{2} + i ε)$ , manifestly transverse. Unitary gauge $ξ \to \infty$ does not exist for the photon (the longitudinal mode is unbounded); it does exist for the Stueckelberg-deformed Proca field and decouples the auxiliary scalar there. The gauge-parameter dependence cancels in physical S-matrix elements between conserved currents (Ward identity). Rubric: full credit for the inversion procedure plus the special-gauge limits.

Lean formalization Intermediate+

Mathlib has no named QFT objects for either the photon or the Proca field. A schematic of the intended formalisation:

import Mathlib.AlgebraicTopology.SingularSet
import Mathlib.Analysis.InnerProductSpace.l2Space
import Mathlib.LinearAlgebra.TensorAlgebra.Basic

/-- The one-particle relativistic Hilbert space for a massless spin-1
particle (the photon): positive-energy null-mass mode functions
fibered over the lightcone, with helicity index running over {-1, +1}. -/
structure PhotonOnePArticleSpace : Type where
  carrier : Type
  inner_product : carrier → carrier → ℂ
  -- positive-energy null mass-shell constraint
  -- helicity ±1 index assignment
  sorry

/-- The free Maxwell field as an operator-valued tempered distribution
on the photon Fock space (bosonic Fock over the one-particle space). -/
noncomputable def MaxwellField :
    Π (x : SpaceTimePoint),
    BosonicFock PhotonOnePArticleSpace →ₗ[ℂ]
      BosonicFock PhotonOnePArticleSpace :=
  sorry  -- mode expansion in polarisation index λ ∈ {-1, +1}

/-- The free Proca field of mass m > 0 as an operator-valued
distribution on the Proca Fock space (massive spin-1 one-particle space). -/
noncomputable def ProcaField (m : ℝ) (hm : 0 < m) :
    Π (x : SpaceTimePoint),
    BosonicFock (ProcaOnePArticleSpace m hm) →ₗ[ℂ]
      BosonicFock (ProcaOnePArticleSpace m hm) :=
  sorry  -- polarisation index λ ∈ {-1, 0, +1}

The proof gap is substantive. Mathlib needs: the Wigner classification of unitary irreducibles of the Poincare group (no current artifact); operator-valued tempered distributions on bosonic Fock space (no current artifact); the constraint analysis of singular Lagrangians via the Dirac algorithm (no current artifact); the polarisation-sum identity for the two relevant little groups; the Faddeev-Popov determinant for $R_{ξ}$ -gauge fixing; and the Stueckelberg reformulation as a change of dynamical variables. Each piece is its own contribution. The unit ships with lean_status: none per the gap field above.

Advanced results Master

Theorem (Gupta-Bleuler quantisation; Itzykson-Zuber §3-2-1). Quantise the free Maxwell field in Lorenz gauge $\partial^{μ} A_{μ} = 0$ by promoting all four components $A_{μ}$ to operators on an indefinite-metric Fock space $F_{GB}$ built from a four-state little-group representation. The physical subspace is defined by the Bleuler condition $\partial^{μ} A_{μ}^{(+)} ∣ phys ⟩ = 0$ , where $A_{μ}^{(+)}$ is the positive-frequency part. The quotient of physical states by null states is a positive-definite Hilbert space $H_{phys} = F_{GB, phys} / N$ carrying the two physical polarisations.

The Gupta-Bleuler construction is the price one pays for manifest Lorentz covariance at the operator level. The indefinite-metric quantisation has a four-state space at each momentum (the four components of $A_{μ}$ ), but the physical subspace selection plus quotient by null states reduces this to two physical states per momentum, matching the helicity count. The construction is the canonical-side complement to the Faddeev-Popov path-integral approach (Faddeev-Popov, Phys. Lett. B 25, 29 (1967) ^{[Faddeev-Popov 1967]}), which handles the same gauge-fixing problem at the level of the generating functional.

Theorem (Coulomb-gauge canonical quantisation; Peskin-Schroeder §4.8). In Coulomb gauge $\nabla \cdot A = 0$ the canonical phase space reduces directly to the two transverse polarisations with no indefinite-metric machinery. The equal-time commutator is $$ [A_i(x), \dot{A}_j(y)] = i \delta^{ij}_T(x - y), \qquad \delta^{ij}_T(x - y) = \delta^{ij} \delta^{(3)}(x - y) - \frac{\partial^i \partial^j}{\nabla^2} \delta^{(3)}(x - y), $$ where $δ_{T}^{ij}$ is the transverse projector. The Hamiltonian is $H = \frac{1}{2} \int d^{3} x (E_{T}^{2} + B^{2})$ , manifestly positive-definite.

Coulomb gauge breaks manifest Lorentz covariance but produces a manifestly unitary, positive-norm Hilbert space with only the two physical polarisations. The trade-off between manifest covariance (Lorenz gauge with Gupta-Bleuler) and manifest unitarity (Coulomb gauge) is generic in gauge theory. For computation of S-matrix elements between physical states, both approaches give identical answers, by the Faddeev-Popov gauge-independence theorem.

Theorem (Proca-field canonical quantisation; Weinberg §5.3). The massive vector field $B_{μ}$ of mass $m > 0$ is canonically quantised on a positive-definite Fock space carrying three physical polarisations per momentum, with mode expansion $$ B_\mu(x) = \int \frac{d^3 k}{(2\pi)^3 \sqrt{2 \omega_k}} \sum_{\lambda = -1}^{+1} \big( a(k, \lambda) \epsilon_\mu(k, \lambda) e^{-i k \cdot x} + a^\dagger(k, \lambda) \epsilon^*_\mu(k, \lambda) e^{i k \cdot x} \big), \quad \omega_k = \sqrt{k^2 + m^2}. $$ The propagator is $$ D^{F,\text{Proca}}{\mu\nu}(k) = \frac{-i}{k^2 - m^2 + i\varepsilon} \big( \eta{\mu\nu} - \frac{k_\mu k_\nu}{m^2} \big). $$ The $k_{μ} k_{ν} / m^{2}$ term diverges as $m \to 0$ , signalling the obstruction to a smooth massless limit at the level of the propagator.

The Proca propagator's $k_{μ} k_{ν} / m^{2}$ pole is the diagnostic that distinguishes a massive vector from a massless one. In Feynman diagrams for theories with massive vectors (the W and Z bosons of the Standard Model, the rho meson of strong interactions), the $k_{μ} k_{ν} / m^{2}$ term contributes to scattering amplitudes and must be tracked carefully. The unitary gauge $ξ \to \infty$ in the Stueckelberg-deformed propagator recovers exactly this form for the gauge-boson sector of broken gauge theories, where the Higgs mechanism plays the role of the Stueckelberg scalar at the non-linear level.

Theorem (Stueckelberg reformulation and the smooth massless limit; Stueckelberg 1938). The Proca Lagrangian augmented with an auxiliary scalar $ϕ$ via $L_{Stueck} = - \frac{1}{4} G_{μν} G^{μν} + \frac{1}{2} m^{2} (B_{μ} + (1/ m) \partial_{μ} ϕ)^{2} - (1/2 ξ) (\partial^{μ} B_{μ} + (m / ξ) ϕ)^{2}$ is equivalent to the original Proca theory in unitary gauge $ϕ = 0$ . The propagator in 't Hooft gauge $ξ$ for the Stueckelberg-deformed field is $$ D^F_{\mu\nu}(k) = \frac{-i}{k^2 - m^2 + i\varepsilon} \big( \eta_{\mu\nu} - (1 - \xi) \frac{k_\mu k_\nu}{k^2 - \xi m^2} \big), $$ which has a smooth $m \to 0$ limit because the dangerous $k_{μ} k_{ν} / m^{2}$ term is replaced by a finite expression. The Stueckelberg scalar becomes a free massless ghost as $m \to 0$ and decouples from physical observables.

Stueckelberg's 1938 trick was a decade ahead of its time: he wrote down the massive U(1) gauge theory with a Higgs-like auxiliary scalar before the Higgs mechanism was discovered. In modern terms, the Stueckelberg formulation is the special case of the Higgs mechanism with a complex scalar of vanishing radial mass — the longitudinal mode of the massive vector is eaten from the scalar's phase, and the smooth $m \to 0$ limit corresponds to ungluing this mode and returning it to the scalar sector as a free Goldstone boson. For the full Standard Model story, this is the linearised version of the Brout-Englert-Higgs mechanism.

Theorem (Faddeev-Popov determinant and the path-integral derivation; Faddeev-Popov 1967). The free-Maxwell path-integral over field configurations modulo gauge, $$ Z = \int \mathcal{D} A_\mu ; \delta(\partial^\mu A_\mu) ; \Delta_{FP}[A] ; e^{i S_{\text{Maxwell}}[A]}, $$ equals (after exponentiation of the delta function and the determinant) the gauge-fixed generating functional $$ Z_{\xi} = \int \mathcal{D} A_\mu ; e^{i \int (\mathcal{L}{\text{Maxwell}} - (1/2\xi) (\partial^\mu A\mu)^2) d^4 x}, $$ with no Faddeev-Popov ghost contribution because the gauge group U(1) is abelian (the determinant is field-independent and reabsorbed into the normalisation). The resulting propagator is the $R_{ξ}$ propagator computed in Exercise 8.

For non-abelian generalisations (Yang-Mills theory), the Faddeev-Popov determinant is no longer field-independent and contributes ghost fields to the gauge-fixed action. The free Maxwell case is the abelian baseline where the construction simplifies. The Faddeev-Popov logic propagates to non-abelian gauge theory in 03.07.05 (Yang-Mills action) and is the modern path-integral counterpart of the canonical Gupta-Bleuler construction at the operator level.

Theorem (helicity conservation and Wigner rotations; Weinberg §5.9). Under a Lorentz transformation $Λ \in S O^{+} (1, 3)$ that takes momentum $k$ to $Λ k$ , the photon helicity states transform as $$ U(\Lambda) |k, \lambda\rangle = e^{i \lambda \theta(\Lambda, k)} |\Lambda k, \lambda\rangle, \qquad \lambda = \pm 1, $$ where $θ (Λ, k)$ is the Wigner rotation angle (the SO(2) element of the little-group action ISO(2) at $Λ k$ that the Lorentz transformation $Λ$ contributes after extraction of the standard boost). Helicity is conserved under Lorentz boosts, but the polarisation four-vector picks up a momentum-dependent phase.

This is the operational content of the statement that helicity is a Lorentz-invariant for massless particles. For a massive particle, by contrast, the spin projection $S_{z}$ is not Lorentz-invariant — the Wigner rotation for the massive case is a genuine $S O (3)$ rotation, not a phase, and it mixes the three polarisation states. The asymmetry between conserved-helicity (massless) and Wigner-mixed-spin (massive) is the kinematic root of the distinction between photon and Proca scattering amplitudes.

Theorem (positivity and cluster decomposition; Weinberg §5.9). Causal commutators of the free-Maxwell and Proca fields in their physical-Hilbert-space quantisations vanish at spacelike separation, $$ [A_\mu(x), A_\nu(y)] = 0, \qquad (x - y)^2 < 0, $$ and the two-point Wightman function satisfies the Wightman positivity condition. The cluster decomposition property of the vacuum is automatic.

Causality and cluster decomposition are the two structural features that single out the canonical-quantisation Fock-space construction (with whatever gauge-fixing machinery is needed) from naive operator-valued solutions of the wave equation. The constructions above are not arbitrary mathematical exercises; they are the unique (up to choice of gauge) free-field quantisations of a Lorentz-covariant local theory that respects causality, cluster decomposition, and positivity of the spectrum of the Hamiltonian.

Synthesis. The construction of the free Maxwell and Proca quantum fields is the foundational reason every gauge-theoretic quantum field theory has the structure it has. The central insight is that the polarisation count for a Lorentz-covariant spin-1 field is locked by Wigner's little group: two helicities for the massless photon (little group ISO(2)) and three polarisations for the massive vector (little group SO(3)). The bridge is that this count cannot be put on a manifestly Lorentz-covariant footing without either (a) introducing gauge invariance and removing the unphysical components by hand, or (b) accepting an indefinite-metric Fock space and projecting onto a physical subspace. Putting these together, the photon's gauge invariance is not an additional postulate but rather a kinematic consequence of the embedding of the two-state ISO(2) representation into the four-component four-vector $A_{μ}$ , and the Stueckelberg trick generalises the same logic to the massive case at the cost of one auxiliary scalar. This is exactly the bridge between Lorentz covariance and gauge invariance, and it identifies the U(1) of electromagnetism with the kinematic redundancy demanded by the embedding rather than with a global symmetry of matter.

The propagator structure follows from the same analysis. For the photon in $R_{ξ}$ -gauge, the propagator $- i [η_{μν} - (1 - ξ) k_{μ} k_{ν} / k^{2}] / (k^{2} + i ε)$ has a one-parameter family parametrising the gauge choice, and physical S-matrix elements are $ξ$ -independent by the Ward identity. For the Proca field, the propagator $- i (η_{μν} - k_{μ} k_{ν} / m^{2}) / (k^{2} - m^{2} + i ε)$ has no gauge-parameter freedom but acquires a $k_{μ} k_{ν} / m^{2}$ singularity in the massless limit. The Stueckelberg reformulation makes the limit smooth by introducing an auxiliary scalar that absorbs the longitudinal mode into a U(1) gauge orbit. Putting these together, the photon and Proca cases form one duality framework, with gauge invariance and mass term as the two extremes: every massive vector field is, in the right variables, a Stueckelberg-formulated gauge field with the gauge parameter $ξ$ controlling the smoothness of the massless limit. The bridge appears again in 03.07.05 (Yang-Mills action) for the non-abelian generalisation, where the Higgs mechanism plays the role of the Stueckelberg scalar at the level of an interacting theory, and the smooth-massless-limit structure of the Standard Model traces back to exactly the same kinematic reasoning developed for the abelian case here.

Full proof set Master

Theorem (polarisation count from Wigner little group), proof. Given in the Intermediate-tier section. The argument has four steps: classify the orbits in momentum space by mass (massive timelike vs. massless null), identify the little group at a reference point on each orbit ( $S O (3)$ for massive, $I S O (2)$ for massless), classify unitary irreducibles of the little group (spin- $j$ representations of dimension $2 j + 1$ for $S O (3)$ ; helicity- $h$ representations of dimension $1$ for $I S O (2)$ with identity translation action), and induce up from the reference momentum to a general momentum on the orbit. For spin $j = 1$ this yields three polarisations in the massive case and two helicity states in the massless case. $□$

Theorem (Proca-field Lorenz constraint), proof. Given in Exercise 3. Apply the Euler-Lagrange equation to the Proca Lagrangian, then take the four-divergence of the resulting equation of motion. The Maxwell-tensor part vanishes by antisymmetry; the mass term yields $m^{2} \partial^{ν} B_{ν} = 0$ , which for $m \neq = 0$ forces $\partial^{ν} B_{ν} = 0$ . $□$

Theorem (polarisation-sum identity for the Proca field), proof. Given in Exercise 4. Compute in the rest frame, then propagate by Lorentz covariance. $□$

Proposition (smooth massless limit fails for the Proca propagator). The propagator $D_{μν}^{F, Proca} (k) = - i (η_{μν} - k_{μ} k_{ν} / m^{2}) / (k^{2} - m^{2} + i ε)$ does not have a finite limit as $m \to 0$ at fixed off-shell $k$ when contracted with a non-conserved external current.

Proof. Fix an off-shell momentum $k$ with $k^{2} \neq = 0$ and a non-conserved external four-vector $J^{μ}$ with $k_{μ} J^{μ} \neq = 0$ . The contraction $J^{* μ} D_{μν}^{F, Proca} J^{ν}$ has two pieces: $$ J^{\mu} \eta_{\mu\nu} J^\nu \cdot \frac{-i}{k^2 - m^2 + i\varepsilon} - J^{\mu} k_\mu k_\nu J^\nu \cdot \frac{-i}{m^2 (k^2 - m^2 + i\varepsilon)}. $$ The first piece has a finite $m \to 0$ limit, equal to $- i (J^{*} \cdot J) / (k^{2} + i ε)$ . The second piece has the form $∣ k \cdot J ∣^{2} \cdot i / [m^{2} (k^{2} - m^{2})]$ , which diverges as $1/ m^{2}$ unless $k \cdot J = 0$ , i.e., unless the external current is conserved. Contraction with a conserved current (satisfying $k_{μ} J^{μ} = 0$ on-shell) kills the dangerous $k_{μ} k_{ν} / m^{2}$ piece and the limit is finite. The Stueckelberg reformulation, by contrast, has a smooth $m \to 0$ limit even for non-conserved external currents, because the auxiliary scalar carries away the would-be-divergent contribution. $□$

Theorem (Faddeev-Popov determinant for abelian gauge theory), proof. Insert the identity $1 = \int D χ δ (\partial^{μ} A_{μ}^{χ}) Δ_{F P} [A]$ into the gauge-invariant path integral, where $A_{μ}^{χ} = A_{μ} + \partial_{μ} χ$ is the gauge transform and $Δ_{F P} [A] = det (δ f / δ χ)$ for the gauge-fixing function $f [A^{χ}] = \partial^{μ} A_{μ}^{χ}$ . The functional determinant evaluates as $$ \Delta_{FP}[A] = \det\big(\partial^\mu \partial_\mu \big) = \det \Box, $$ which is independent of the field $A$ . The constant determinant factors out of the path integral and is absorbed into the normalisation, leaving the gauge-fixed action $S_{Maxwell} - (1/2 ξ) (\partial^{μ} A_{μ})^{2}$ after exponentiating the delta function with parameter $ξ$ . Conclusion: for abelian gauge theory, no ghost fields appear, and the gauge-fixed propagator is the $R_{ξ}$ propagator. $□$

Theorem ( $R_{ξ}$ -gauge photon propagator), proof. Given in Exercise 8. Invert the kinetic operator $K_{μν} (k) = - k^{2} η_{μν} + (1 - 1/ ξ) k_{μ} k_{ν}$ via the ansatz $D^{F} = - i (a η + bk k / k^{2}) / (k^{2} + i ε)$ . Matching coefficients yields $a = 1$ and $b = ξ - 1$ , giving $D_{μν}^{F} (k) = - i (η_{μν} - (1 - ξ) k_{μ} k_{ν} / k^{2}) / (k^{2} + i ε)$ . $□$

Theorem (Gupta-Bleuler subspace), stated without proof — see Itzykson-Zuber §3-2-1 ^{[Itzykson-Zuber 1980]}. The full Gupta-Bleuler construction requires the indefinite-metric Fock space, the positive-frequency decomposition of the field, the Bleuler condition on physical states, and the proof that the physical/null quotient is positive-definite with two degrees of freedom per momentum. The construction is canonical-side complement to the Faddeev-Popov path-integral construction at the operator level and is documented in detail in Itzykson-Zuber Ch. 3 ^{[Itzykson-Zuber 1980]}. $□$

Theorem (Stueckelberg gauge invariance), proof. Given in Exercise 5. The Stueckelberg Lagrangian $L_{Stueck} = - \frac{1}{4} G \cdot G + \frac{1}{2} (m B_{μ} + \partial_{μ} ϕ)^{2}$ is invariant under the joint transformation $B_{μ} \to B_{μ} + \partial_{μ} χ$ , $ϕ \to ϕ - m χ$ because the combination $m B_{μ} + \partial_{μ} ϕ$ is itself invariant, and $G_{μν}$ is built from gauge-invariant antisymmetric derivatives of $B_{μ}$ . The unitary gauge $ϕ = 0$ recovers the Proca Lagrangian, and the 't Hooft gauge $ξ$ recovers the Stueckelberg propagator with a smooth $m \to 0$ limit. $□$

Connections Master

Bosonic Fock space and second quantisation 12.13.01. The one-particle Hilbert spaces for the photon and the Proca field are both built from the bosonic Fock-space construction of unit 12.13.01, with one-particle space taken to be the appropriate Wigner irreducible (massless helicity- $\pm 1$ for the photon, massive spin-1 for Proca). The free Maxwell and Proca fields are operator-valued tempered distributions on these Fock spaces, with their canonical commutation relations inherited from the underlying CCR algebra of unit 12.13.01.
Maxwell's equations in differential form 10.04.01. The classical Lagrangian $- \frac{1}{4} F_{μν} F^{μν}$ of the free Maxwell field is exactly the classical Maxwell theory in covariant form, whose differential-form repackaging $\int F \land ⋆ F$ is the subject of unit 10.04.01. The quantum theory developed here promotes the classical $A_{μ}$ to an operator-valued tempered distribution, but preserves the gauge structure and the equations of motion at the level of operator equations. The classical-to-quantum bridge is canonical quantisation plus gauge-fixing, exactly as developed above.
Covariant electrodynamics with the Faraday tensor 10.06.01. The covariant tensor formalism for the classical Maxwell theory developed in unit 10.06.01 is the algebraic framework within which the quantum theory is constructed here. The polarisation vectors $ϵ^{μ} (k, λ)$ are the operator-side incarnation of the polarisation of classical electromagnetic waves; the Faraday tensor $F_{μν}$ promotes to an operator-valued antisymmetric tensor field; and the Lorenz / Coulomb gauge choices have classical-side counterparts in the standard gauge-fixing procedures of classical electrodynamics.
Yang-Mills action 03.07.05. The non-abelian generalisation of the free Maxwell theory replaces U(1) by a non-abelian compact Lie group and introduces a genuine Faddeev-Popov ghost contribution because the determinant is no longer field-independent. The polarisation count and the gauge-removal logic developed here for the abelian case generalise directly: gluons and electroweak gauge bosons have exactly two physical helicities each (in the massless limit), and the Higgs mechanism plays the Stueckelberg role at the non-abelian level. The free abelian construction here is the pedagogical baseline; the Yang-Mills unit is the genuine application.
Dirac equation and relativistic spin 12.11.01. The Dirac equation provides the spin- $\frac{1}{2}$ companion to the spin- $1$ analysis presented here. The Wigner classification used to fix the polarisation count of the photon and the Proca field applies equally to the Dirac field, where the massive little group $S O (3)$ representation of spin $\frac{1}{2}$ has dimension $2$ , and the massless limit has helicities $\pm \frac{1}{2}$ (the Weyl spinors). The QED Lagrangian couples the Dirac field to the Maxwell field via the gauge-covariant derivative, and the constraint analysis of the resulting theory builds on both unit 12.11.01 and the present unit.

Historical & philosophical context Master

The construction of a quantum theory of the electromagnetic field began with Dirac's 1927 paper The quantum theory of the emission and absorption of radiation (Proc. Roy. Soc. A 114, 243 (1927)) ^{[Stueckelberg 1938]}, which introduced the creation and annihilation operators of the radiation field and quantised the Maxwell theory at the operator level. The classical Lagrangian formulation as $- \frac{1}{4} F_{μν} F^{μν}$ traces to Pauli and Weisskopf's 1934 reformulation. The Lorentz-covariant indefinite-metric quantisation was assembled in two parallel papers by Suraj Gupta (Proc. Phys. Soc. A 63, 681 (1950)) ^{[Gupta 1950]} and Konrad Bleuler (Helv. Phys. Acta 23, 567 (1950)) ^{[Bleuler 1950]}, yielding what is now called Gupta-Bleuler quantisation. The Coulomb-gauge alternative was developed by Fermi as early as 1932 and refined by Schwinger and others through the 1940s; the trade-off between manifest covariance (Gupta-Bleuler) and manifest unitarity (Coulomb) became a recurring theme of gauge-theory quantisation.

The massive vector field has a distinct genealogy. Alexandru Proca introduced the equation $(□ + m^{2}) A_{μ} = 0$ together with the Lorenz constraint in three 1936 Comptes Rendus notes and a Journal de Physique article (J. Phys. Radium 7, 347 (1936)) ^{[Proca 1936]}, originally intended as a relativistic spin-1 candidate for the electron (which the Dirac equation already covered). The Proca equation was reinterpreted in the late 1930s as the relativistic equation for a massive spin-1 particle and applied to nuclear physics by Yukawa (1935) and Kemmer (1938). Ernst Stueckelberg's 1938 reformulation introducing an auxiliary scalar to restore U(1) gauge invariance (Helv. Phys. Acta 11, 225 (1938)) ^{[Stueckelberg 1938]} anticipated by two decades the Brout-Englert-Higgs mechanism (1964) and the Glashow-Weinberg-Salam unification of the electroweak interactions (1967-68). The modern path-integral quantisation via Faddeev-Popov determinants (Faddeev-Popov, Phys. Lett. B 25, 29 (1967) ^{[Faddeev-Popov 1967]}) reunified the Maxwell and Yang-Mills cases under one technique. The Wigner-classification underpinning of the photon and the Proca field was established in Wigner's 1939 Ann. Math. paper ^{[Wigner 1939]} and absorbed into the standard QFT presentations of Weinberg (Quantum Theory of Fields, Vol. 1, Ch. 5 (1995)) ^{[Weinberg 1995]} and Woit (Quantum Theory, Groups and Representations, Ch. 46 (2017)).

Bibliography Master

@article{Dirac1927Radiation,
  author  = {Dirac, P. A. M.},
  title   = {The Quantum Theory of the Emission and Absorption of Radiation},
  journal = {Proc. Roy. Soc. A},
  volume  = {114},
  year    = {1927},
  pages   = {243--265}
}

@article{Proca1936,
  author  = {Proca, Alexandru},
  title   = {Sur la th{\'e}orie ondulatoire des {\'e}lectrons positifs et n{\'e}gatifs},
  journal = {J. Phys. Radium},
  volume  = {7},
  year    = {1936},
  pages   = {347--353}
}

@article{Stueckelberg1938,
  author  = {Stueckelberg, E. C. G.},
  title   = {Die Wechselwirkungskr{\"a}fte in der Elektrodynamik und in der Feldtheorie der Kernkr{\"a}fte},
  journal = {Helv. Phys. Acta},
  volume  = {11},
  year    = {1938},
  pages   = {225--244, 299--328, 312--328}
}

@article{Wigner1939,
  author  = {Wigner, Eugene P.},
  title   = {On Unitary Representations of the Inhomogeneous Lorentz Group},
  journal = {Ann. Math.},
  volume  = {40},
  year    = {1939},
  pages   = {149--204}
}

@article{Gupta1950,
  author  = {Gupta, Suraj N.},
  title   = {Theory of Longitudinal Photons in Quantum Electrodynamics},
  journal = {Proc. Phys. Soc. A},
  volume  = {63},
  year    = {1950},
  pages   = {681--691}
}

@article{Bleuler1950,
  author  = {Bleuler, Konrad},
  title   = {Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen},
  journal = {Helv. Phys. Acta},
  volume  = {23},
  year    = {1950},
  pages   = {567--586}
}

@article{FaddeevPopov1967,
  author  = {Faddeev, L. D. and Popov, V. N.},
  title   = {Feynman Diagrams for the {Y}ang-{M}ills Field},
  journal = {Phys. Lett. B},
  volume  = {25},
  year    = {1967},
  pages   = {29--30}
}

@book{Weinberg1995QTFV1,
  author    = {Weinberg, Steven},
  title     = {The Quantum Theory of Fields, Vol. 1: Foundations},
  publisher = {Cambridge University Press},
  year      = {1995}
}

@book{ItzyksonZuber1980,
  author    = {Itzykson, Claude and Zuber, Jean-Bernard},
  title     = {Quantum Field Theory},
  publisher = {McGraw-Hill},
  year      = {1980}
}

@book{PeskinSchroeder1995,
  author    = {Peskin, Michael E. and Schroeder, Daniel V.},
  title     = {An Introduction to Quantum Field Theory},
  publisher = {Addison-Wesley},
  year      = {1995}
}

@book{Srednicki2007,
  author    = {Srednicki, Mark},
  title     = {Quantum Field Theory},
  publisher = {Cambridge University Press},
  year      = {2007}
}

@book{Woit2017,
  author    = {Woit, Peter},
  title     = {Quantum Theory, Groups and Representations: An Introduction},
  publisher = {Springer},
  year      = {2017}
}

@article{Dirac1950Constrained,
  author  = {Dirac, P. A. M.},
  title   = {Generalized {H}amiltonian Dynamics},
  journal = {Can. J. Math.},
  volume  = {2},
  year    = {1950},
  pages   = {129--148}
}

Prerequisites

12.05.01 pending
12.13.01 pending
12.13.02 pending
12.04.01 pending
12.01.01 pending

Tier anchors

beginner: Feynman, Leighton & Sands, *The Feynman Lectures on Physics*, Vol. III (Addison-Wesley, 1965), Ch. 17–18 (polarisation and photon counting); Griffiths, *Introduction to Elementary Particles*, 2nd ed. (Wiley-VCH, 2008), Ch. 8 §1 (the photon as the quantum of the Maxwell field)
intermediate: Peskin, M. E. & Schroeder, D. V., *An Introduction to Quantum Field Theory* (Addison-Wesley, 1995), Ch. 4 §4.8 + Ch. 5 §5.1 (free electromagnetic field, polarisation sums, gauge choices); Srednicki, M., *Quantum Field Theory* (Cambridge, 2007), Ch. 54–56 (Lorenz, Coulomb, axial gauges; Proca field)
master: Weinberg, S., *The Quantum Theory of Fields, Vol. 1: Foundations* (Cambridge, 1995), §5.3 (massive vector fields), §5.9 (causal vector fields; photon and Proca); Itzykson, C. & Zuber, J.-B., *Quantum Field Theory* (McGraw-Hill, 1980), Ch. 3 §3-2 (canonical quantisation of the electromagnetic field in Gupta-Bleuler and Coulomb gauges); Woit, P., *Quantum Theory, Groups and Representations* (Springer, 2017), Ch. 46 (massless spin-1 representations and the photon); Stueckelberg, E. C. G., *Helv. Phys. Acta* 11, 225 (1938); Proca, A., *J. Phys. Radium* 7, 347 (1936)

References

Weinberg, S. — The Quantum Theory of Fields, Vol. 1: Foundations (Cambridge, 1995) · §5.3 (Massive vector fields and the spin-1 little group SO(3)); §5.9 (Causal vector fields; the photon as a massless helicity-pm-1 field; gauge invariance from the embedding of the little group ISO(2) into the polarisation index); Appendix to Ch. 5 (counting polarisation states from the little-group representation)
Itzykson, C. & Zuber, J.-B. — Quantum Field Theory (McGraw-Hill, 1980; Dover reprint, 2005) · Ch. 3 §3-1 (free electromagnetic field, mode expansion, polarisation vectors); §3-2 (canonical quantisation in Gupta-Bleuler with indefinite-metric Fock space; Coulomb-gauge canonical quantisation with only two transverse polarisations); §3-2-3 (covariant propagator $D_{F,mu nu}(x-y)$ and gauge-parameter dependence)
Peskin, M. E. & Schroeder, D. V. — An Introduction to Quantum Field Theory (Addison-Wesley, 1995) · §4.8 (Coulomb-gauge quantisation of the photon, two physical polarisations); §5.1 (transition to covariant gauges, polarisation-sum identity $sum_{lambda} epsilon^*_mu epsilon_nu = -eta_{mu nu} + text{gauge terms}$); §9.4 (path-integral quantisation with the Faddeev-Popov determinant)
Srednicki, M. — Quantum Field Theory (Cambridge, 2007) · Ch. 54–55 (free Maxwell action; constraint analysis; Coulomb-gauge, Lorenz-gauge, and axial-gauge quantisations side by side); Ch. 56 (massive vector / Proca field; canonical formulation without gauge redundancy); Ch. 57 (Stueckelberg trick and the smooth m to 0 limit)
Woit, P. — Quantum Theory, Groups and Representations: An Introduction (Springer, 2017) · Ch. 46 (massless spin-1 unitary irreducibles of the Poincare group; the photon as an ISO(2)-helicity representation); Ch. 42 (relativistic vector fields, mode expansion, the embedding of polarisation vectors into the four-vector index)
Stueckelberg, E. C. G. — Die Wechselwirkungskrafte in der Elektrodynamik und in der Feldtheorie der Kernkrafte · Helv. Phys. Acta 11, 225, 299, 312 (1938) — the auxiliary-scalar reformulation of the massive vector field, restoring a U(1) gauge invariance at the cost of one extra scalar degree of freedom; the originator paper for what we now call the Stueckelberg trick
Proca, A. — Sur la theorie ondulatoire des electrons positifs et negatifs · J. Phys. Radium 7, 347 (1936); see also Compt. Rend. Acad. Sci. Paris 202, 1366 and 202, 1490 (1936) — the originator equations $(square + m^2) A^mu = 0$ with the Lorenz-constraint imposed dynamically rather than by gauge fixing, intended originally as a relativistic-spin-1 candidate for the electron
Wigner, E. P. — On unitary representations of the inhomogeneous Lorentz group · Ann. Math. 40, 149 (1939) — the classification of one-particle Hilbert spaces by mass and little-group representation; the input that fixes the photon to be a massless helicity-pm-1 ISO(2) representation and the massive vector to be a mass-$m$ spin-1 SO(3) representation
Gupta, S. N. — Theory of longitudinal photons in quantum electrodynamics · Proc. Phys. Soc. A 63, 681 (1950) — the indefinite-metric Fock-space quantisation of the photon in Lorenz gauge; the Gupta side of Gupta-Bleuler
Bleuler, K. — Eine neue Methode zur Behandlung der longitudinalen und skalaren Photonen · Helv. Phys. Acta 23, 567 (1950) — the physical-state subspace selection criterion $partial_mu A^{mu (+)} |text{phys}rangle = 0$; the Bleuler half of the Gupta-Bleuler scheme
Faddeev, L. D. & Popov, V. N. — Feynman diagrams for the Yang-Mills field · Phys. Lett. B 25, 29 (1967) — the determinant gauge-fixing technique that retrofits the same logic to the path-integral quantisation of the photon and underlies the modern derivation of the propagator $D^{F}_{mu nu}(k) = -i (eta_{mu nu} - (1 - xi) k_mu k_nu / k^2) / (k^2 + iepsilon)$ in $R_xi$-gauge
Dirac, P. A. M. — Generalized Hamiltonian dynamics · Can. J. Math. 2, 129 (1950); Lectures on Quantum Mechanics (Yeshiva University, 1964) — the constraint-Hamiltonian framework that classifies $A_0$ as a Lagrange multiplier and Gauss's law as a first-class constraint, the canonical-side bookkeeping behind every gauge-theoretic quantisation

Estimated time

beginner: 22m
intermediate: 50m
master: 90m