Four-velocity, four-momentum, and the relativistic energy-momentum identity
Anchor (Master): Misner-Thorne-Wheeler, *Gravitation* (Freeman, 1973), §2; Weinberg, *Gravitation and Cosmology* (Wiley, 1972), §2; Landau-Lifshitz Vol. 2, *The Classical Theory of Fields*, 4e (Pergamon, 1980), §6-9
Intuition Beginner
In Newtonian physics, velocity is a three-vector with three independent components. When you change frames, the velocity changes by Galilean addition: walk forward at 5 km/h on a train moving at 100 km/h, and the ground sees you at 105 km/h. The three-vector is the natural object for ordinary speeds.
Special relativity replaces this picture. Space and time mix under a boost, so velocity must become a four-component object that includes the time direction. The new object is called the four-velocity , with one time component and three space components . The Greek index runs over 0, 1, 2, 3 — the four directions of spacetime.
The four-velocity has a remarkable property. Its Minkowski "length" is fixed at a universal value: it always squares to in the signature where time carries a minus sign. Every massive particle's four-velocity has the same intrinsic magnitude. The only thing that changes between particles is the direction the four-velocity points in spacetime.
Multiplying the four-velocity by the rest mass gives the four-momentum . The time component is the total energy divided by . The space components are the relativistic momentum . Energy and momentum are not separate quantities in relativity — they are different components of one four-vector.
The fixed magnitude of the four-velocity translates into a fixed magnitude for the four-momentum:
This is the famous energy-momentum identity. The total energy squared equals the momentum-energy squared plus the rest-energy squared. It works like a Pythagorean theorem in energy-momentum space.
Three special cases illustrate the identity. A particle at rest has , so — the rest energy. A photon has , so — energy and momentum are proportional. A relativistic electron at has Lorentz factor about 7, so its total energy is roughly seven times its rest energy of 0.511 MeV.
The identity controls every relativistic collision and decay. To find what comes out of a particle interaction, you add up the four-momenta of the incoming particles, then split the result among the outgoing particles in a way that conserves all four components. The "Pythagorean" identity then tells you the rest mass of any sub-system formed by combining particle four-momenta.
The picture above shows the geometric content of the energy-momentum identity. The hypotenuse is the total energy. The horizontal leg is the momentum contribution . The vertical leg is the rest energy . The triangle deforms as the particle's state changes, but the identity holds in every frame.
Worked example Beginner
A relativistic electron is accelerated to . Compute its Lorentz factor, its total energy, its kinetic energy, and its momentum. The electron rest energy is MeV.
Step 1. Compute the Lorentz factor. With , the ratio :
Step 2. Total energy from :
The total energy is roughly seven times the rest energy because the Lorentz factor is roughly seven.
Step 3. Kinetic energy is total minus rest:
Step 4. Momentum from the energy-momentum identity. Rearranging :
So the electron carries momentum MeV/. The ratio — nearly all the energy is in the momentum-energy contribution because the rest energy is small compared with the kinetic energy.
What this tells us: at a relativistic electron is roughly seven times more energetic than at rest, and its energy is almost entirely momentum-energy. The photon limit is the case where the rest term goes to zero and exactly.
Second worked example: pion decay at rest
A neutral pion ( MeV) decays at rest into two photons: . Find the energy and momentum of each photon.
Step 1. Before the decay, the total energy is the pion rest energy: MeV. The total momentum is zero (the pion is at rest).
Step 2. Conservation of four-momentum requires the two photons to share the total energy and to have equal and opposite momenta (so they sum to zero). By symmetry each photon takes half the energy:
Step 3. Each photon has (massless), so its momentum is:
The two photons fly apart in opposite directions, each carrying 67.5 MeV of energy and 67.5 MeV/ of momentum.
What this tells us: when a particle decays at rest, the daughter four-momenta are fixed by conservation alone. The decay products' kinematics are completely determined by the parent's rest mass and the daughters' masses — a fact used routinely in particle-physics experiments to identify parent particles from the kinematics of their decay products.
Check your understanding Beginner
Formal definition Intermediate+
Let a massive particle of rest mass trace out a timelike worldline in Minkowski spacetime, parameterised by proper time . Proper time is the time measured by a clock comoving with the particle, related to coordinate time by where .
Definition (four-velocity). The four-velocity is $$ u^\mu := \frac{dx^\mu}{d\tau} = \gamma,(c, \mathbf{v}), $$ where the four spacetime components are , for .
Throughout this unit we adopt the signature for the Minkowski metric , following Misner-Thorne-Wheeler. Other texts (Landau-Lifshitz, Weinberg) use the opposite signature , which flips signs in the algebra below. The substantive content is signature-independent.
Proposition (norm of the four-velocity). The Minkowski norm of is constant along any timelike worldline: $$ \eta_{\mu\nu},u^\mu u^\nu = -\gamma^2 c^2 + \gamma^2 \mathbf{v}\cdot\mathbf{v} = \gamma^2(-c^2 + v^2) = -c^2. $$ The final equality uses , so .
The four-velocity is timelike and future-directed for any massive particle. Its constancy is a kinematic constraint, not a physical assumption: the constraint follows from the definition together with the relation between proper time and the spacetime interval.
Definition (four-momentum). The four-momentum is $$ \boxed{;;p^\mu := m,u^\mu = (\gamma m c, \gamma m \mathbf{v}) = (E/c, \mathbf{p}),;;} $$ where the energy is and the relativistic three-momentum is . The rest mass is a Lorentz invariant — the same in every inertial frame.
From the constancy of and the definition : $$ \eta_{\mu\nu},p^\mu p^\nu = m^2,\eta_{\mu\nu},u^\mu u^\nu = -m^2 c^2. $$
Writing this out using : $$ -(E/c)^2 + |\mathbf{p}|^2 = -m^2 c^2, $$ which rearranges to the relativistic energy-momentum identity: $$ \boxed{;;E^2 = |\mathbf{p}|^2 c^2 + m^2 c^4.;;} $$
The quantity is the invariant mass-squared of the particle. It has the same numerical value in every inertial frame, even though and separately transform under boosts.
Special cases. A particle at rest has and — the rest energy. A massless particle (photon, gluon, graviton) has and — energy proportional to momentum. The four-momentum of a massless particle is null: , and the worldline lies along the light cone.
Low-speed limit. For , expand : $$ E = mc^2 + \tfrac{1}{2} m v^2 + \tfrac{3}{8} m v^4/c^2 + \cdots. $$ The rest energy is a Lorentz scalar (frame-independent). The next term is the Newtonian kinetic energy . The subsequent terms are relativistic corrections suppressed by powers of .
Four-acceleration
Definition (four-acceleration). The four-acceleration is $$ \alpha^\mu := \frac{du^\mu}{d\tau}. $$ Differentiating the constancy of gives $$ \frac{d}{d\tau}!\left(\eta_{\mu\nu} u^\mu u^\nu\right) = 2\eta_{\mu\nu} u^\mu \alpha^\nu = 0, $$ so the four-acceleration is orthogonal (in the Minkowski sense) to the four-velocity: . The four-acceleration is spacelike for any timelike worldline.
Four-force
The relativistic generalisation of Newton's second law is the four-force $$ f^\mu := \frac{dp^\mu}{d\tau} = m,\alpha^\mu. $$ The spatial part is where is the lab-frame three-force; the time part is . In the low-speed limit and the spatial part of the four-force reduces to Newton's second law .
Counterexamples to common slips
"The rest mass changes with speed." The rest mass is a Lorentz invariant: has the same value in every frame. What grows with speed is , sometimes historically called the "relativistic mass." Modern usage reserves "mass" for the rest mass and lets carry the speed dependence.
" for any particle." The relation applies only to a particle at rest. The general expression is , or equivalently from the energy-momentum identity.
"Photons have no momentum because they have no mass." Photons have from the energy-momentum identity with . Photon momentum is responsible for radiation pressure, Compton scattering, and photon-rocket propulsion.
"The four-acceleration of an inertial observer is non-zero because they are moving." An inertial observer has constant in any inertial frame, so . Four-acceleration measures deviation from geodesic motion, not coordinate velocity.
Key theorem with proof Intermediate+
Theorem (invariance of the energy-momentum norm). For any massive particle of rest mass and four-momentum in some inertial frame, the quantity $$ M^2 c^4 := E^2 - |\mathbf{p}|^2 c^2 $$ is invariant under proper orthochronous Lorentz transformations and equals .
Proof. Let be a proper orthochronous Lorentz transformation, and let be the transformed four-momentum. Compute the transformed norm using the Lorentz invariance of the Minkowski metric: $$ \eta_{\mu\nu},p'^\mu p'^\nu = \eta_{\mu\nu},\Lambda^\mu{}\alpha,\Lambda^\nu{}\beta,p^\alpha p^\beta = \eta_{\alpha\beta},p^\alpha p^\beta, $$ where the last step uses the defining property of Lorentz transformations: . The Minkowski norm of is therefore frame-independent.
The norm has the explicit form $$ \eta_{\mu\nu},p^\mu p^\nu = -(p^0)^2 + |\mathbf{p}|^2 = -E^2/c^2 + |\mathbf{p}|^2 = -M^2 c^2, $$ so is the invariant.
To identify with the rest mass , evaluate in the particle's rest frame, where and : $$ M^2 c^4 = (m c^2)^2 - 0 = m^2 c^4. $$ Hence for any massive particle.
Bridge. The invariant-mass theorem is the foundational reason that high-energy physics computes invariant masses to identify particles. Putting these together with the conservation of four-momentum, an unstable resonance decaying into daughters has , and this combination is Lorentz invariant. The bridge is to detector analyses: measure the four-momenta of the daughters, reconstruct the parent invariant mass, and identify the resonance by its peak in the invariant-mass distribution. This is exactly how the J/ was discovered in 1974 and how the Higgs boson was identified in 2012. The pattern generalises to multi-body decays via Dalitz plots in 10.06.01 and builds toward the Mandelstam-variable formalism for 2-to-2 scattering covered in the Master tier below. Appears again in 12.11.01 when the Dirac equation factorises the energy-momentum identity at the operator level into a first-order spinor equation.
Worked example at intermediate level: Compton scattering
A photon of energy scatters off an electron at rest. The photon is detected at angle with energy . Derive the wavelength shift .
Conservation of four-momentum: . Rearrange and contract with : $$ (p_e'^\mu)(p_{e'\mu}) = (p_\gamma^\mu + p_e^\mu - p_\gamma'^\mu)(p_{\gamma\mu} + p_{e\mu} - p_{\gamma'\mu}). $$
The left side is . Expand the right side using (photon is null), , : $$ -m_e^2 c^2 = -m_e^2 c^2 + 2 p_\gamma \cdot p_e - 2 p_\gamma \cdot p_\gamma' - 2 p_e \cdot p_\gamma'. $$
The terms cancel, leaving . In the lab frame: , , and with . Computing the dot products: $$ -E_\gamma m_e = -E_\gamma E_\gamma'/c^2 + E_\gamma E_\gamma' \cos\theta/c^2 - E_\gamma' m_e. $$
Divide by : $$ -\frac{1}{E_\gamma'} = -\frac{1}{m_e c^2}(1 - \cos\theta) - \frac{1}{E_\gamma}. $$
Multiply by : $$ \frac{1}{E_\gamma'} - \frac{1}{E_\gamma} = \frac{1}{m_e c^2}(1 - \cos\theta). $$
Using and , this becomes the Compton formula: $$ \Delta\lambda = \lambda' - \lambda = \frac{h}{m_e c}(1 - \cos\theta). $$
The constant m is the Compton wavelength of the electron. The wavelength shift depends only on the scattering angle, not on the incident wavelength — a directly verifiable prediction that won Compton the 1927 Nobel Prize.
Exercises Intermediate+
Advanced results Master
The four-momentum as a Noether charge
By Noether's first theorem applied to the spacetime translation symmetry of a Lorentz-invariant action , the conserved current associated with translations in the -direction is the stress-energy tensor $$ T^{\mu\nu} = \frac{\partial \mathcal{L}}{\partial(\partial_\mu \phi)},\partial^\nu \phi - \eta^{\mu\nu}\mathcal{L}, $$ where stands for the dynamical fields. The integrated charges $$ P^\nu = \frac{1}{c}\int T^{0\nu},d^3 x $$ form a four-vector and are conserved under translations: in flat spacetime. For a single point particle, this construction recovers . The Noether-charge interpretation makes the four-momentum a fundamental object of Lorentz-invariant field theory, not merely a kinematic auxiliary.
Theorem 1 (Noether, 1918). For a translation-invariant Lagrangian field theory in flat spacetime, the integrated stress-energy tensor is a conserved four-vector, and for a point particle it equals . (Noether 1918 Nachr. Königl. Ges. Wiss. Göttingen.)
Photons and the de Broglie-Einstein relation
For a plane electromagnetic wave with angular frequency and wave vector satisfying , the photon four-momentum is $$ p^\mu = \hbar,k^\mu, \qquad k^\mu := (\omega/c, \mathbf{k}). $$ The wave four-vector is null: , consistent with for a massless particle. Einstein's 1905 photoelectric paper introduced ; de Broglie's 1924 thesis introduced . The four-vector relation unifies them.
The Pauli-Lubanski four-vector and Wigner classification
For a particle of mass , define the Pauli-Lubanski four-vector $$ W^\mu := -\tfrac{1}{2},\varepsilon^{\mu\nu\rho\sigma},p_\nu,J_{\rho\sigma}, $$ where are the generators of Lorentz rotations and is the totally antisymmetric Levi-Civita tensor. commutes with , and the two Casimir invariants of the Poincaré algebra are and .
Theorem 2 (Wigner, 1939). The unitary irreducible representations of the proper orthochronous Poincaré group are classified by the pair , where is the mass-squared and is the spin. For : with . For : is null and proportional to , with helicity as the substantive label. (Wigner 1939 Ann. Math. 40, 149.)
This classification underpins the entire Standard Model: every elementary particle is identified by a pair labelling an irreducible representation of the Poincaré group.
Mandelstam variables for scattering
For a process with four-momenta , define the Mandelstam invariants: $$ s := -(p_1 + p_2)^\mu(p_1 + p_2)\mu c^2, $$ $$ t := -(p_1 - p_3)^\mu(p_1 - p_3)\mu c^2, $$ $$ u := -(p_1 - p_4)^\mu(p_1 - p_4)_\mu c^2. $$
These are Lorentz invariants: is the square of the centre-of-momentum energy, is the momentum-transfer squared (relevant to forward scattering), and is the cross-channel variable. They satisfy the constraint $$ s + t + u = \sum_{i=1}^{4} m_i^2 c^4. $$
Theorem 3 (Mandelstam, 1958). For a scattering process, the on-shell amplitude is a function of two of the three Mandelstam variables, with the third determined by the constraint. Crossing symmetry relates in the -, -, and -channels via analytic continuation in the complex -, -, -plane. (Mandelstam 1958 Phys. Rev. 112, 1344.)
Mandelstam-variable analysis is the workhorse of perturbative quantum field theory: scattering amplitudes are expressed as functions of , and the analytic structure encodes crossing symmetry, unitarity, and dispersion relations.
Thomas precession
A non-collinear sequence of Lorentz boosts does not return the same frame: it composes to a boost followed by a rotation. The angle of the residual rotation is the Thomas rotation, and the rate at which the spin axis of an accelerated electron precesses relative to the lab frame is the Thomas precession rate.
Theorem 4 (Thomas, 1926). An electron in a circular orbit of angular frequency at speed experiences a Thomas precession of its spin axis with angular frequency $$ \omega_T = (\gamma - 1),\omega_{\rm orb}. $$ For non-relativistic speeds, , which produces the factor-of-1/2 correction (the "Thomas factor") in atomic fine structure. (Thomas 1926 Nature 117, 514.)
The Thomas factor reconciles the spin-orbit coupling computed in the electron rest frame with the experimentally observed atomic fine-structure splittings. Before Thomas's 1926 paper, atomic spectroscopy disagreed with the naive spin-orbit Hamiltonian by a factor of 2; Thomas's relativistic kinematics resolved the discrepancy.
Relativistic action and the principle of extremal proper time
The Lorentz-invariant action for a free relativistic point particle is $$ S = -mc^2\int d\tau = -mc\int\sqrt{-\eta_{\mu\nu},dx^\mu,dx^\nu}. $$
The Euler-Lagrange equations from this action give the geodesic equation , corresponding to inertial (straight) worldlines. The principle of extremal action becomes the principle of extremal proper time: the classical worldline of a free particle is the one that maximises proper time between two events. For a charged particle in an electromagnetic field with four-potential , the action acquires a coupling term: $$ S = -mc\int\sqrt{-\eta_{\mu\nu},dx^\mu,dx^\nu} + \frac{q}{c}\int A_\mu,dx^\mu. $$
The Euler-Lagrange equations now give the covariant Lorentz force $$ \frac{dp^\mu}{d\tau} = \frac{q}{c},F^{\mu\nu},u_\nu, \qquad F^{\mu\nu} := \partial^\mu A^\nu - \partial^\nu A^\mu. $$
The spatial part recovers the familiar Lorentz force with relativistic momentum, while the time part gives the power equation .
The stress-energy tensor of a continuous medium
For a perfect fluid with energy density (measured in its rest frame), pressure , and four-velocity , the stress-energy tensor is $$ T^{\mu\nu} = (\rho c^2 + p),u^\mu u^\nu/c^2 + p,\eta^{\mu\nu}. $$
Conservation reduces in the non-relativistic limit to the continuity equation and the Euler equation . The integrated four-momentum is conserved.
Theorem 5. For a closed system in flat Minkowski spacetime, the integrated four-momentum is conserved and transforms as a four-vector. In general relativity, the conservation law generalises to with the covariant derivative replacing the partial derivative, and becomes the source of the Einstein field equations .
Cosmological redshift of photon four-momentum
In a Friedmann-Lemaître-Robertson-Walker spacetime with scale factor , the four-momentum of a photon is parallel-transported along the null geodesic, and its frequency redshifts as . The observed redshift for light emitted at scale factor and observed at is $$ 1 + z = \frac{a_{\rm obs}}{a_{\rm emit}} = \frac{\omega_{\rm emit}}{\omega_{\rm obs}}. $$
The four-momentum machinery developed here is the kinematic foundation of observational cosmology: cosmic-microwave-background photons emitted at have , corresponding to a temperature drop from approximately 3000 K at the surface of last scattering to 2.7 K today. The Sunyaev-Zel'dovich effect (1972) — a Compton-scattering shift in the CMB spectrum through hot galaxy clusters — is computed directly from the four-momentum kinematics of the Compton-scattering formula derived in the Intermediate tier above.
Centre-of-momentum frame and the Lorentz boost to it
For a system of particles with four-momenta , the total four-momentum is conserved and transforms as a four-vector under Lorentz transformations. The centre-of-momentum (CM) frame is the inertial frame in which the spatial total momentum vanishes: . The Lorentz boost from the lab to the CM frame has velocity $$ \mathbf{v}{\rm CM} = \frac{c^2,\mathbf{P}{\rm tot}}{E_{\rm tot}}, $$ which is always subluminal because for any system with at least one massive particle (or any non-collinear collection of photons). In the CM frame where is the invariant centre-of-momentum energy.
For a fixed-target experiment in which a beam of energy and momentum strikes a target of mass at rest, the invariant CM energy grows only as the square root of the beam energy: $$ \sqrt{s} = \sqrt{m_b^2 c^4 + m_t^2 c^4 + 2 m_t c^2 E_b} \approx \sqrt{2 m_t c^2 E_b}\quad (E_b \gg m_b c^2, m_t c^2). $$
For colliding beams of equal energy and mass, — linear in the beam energy. This factor-of- disadvantage of fixed-target experiments is why high-energy physics has used colliders since the 1970s.
Geodesics, the equivalence principle, and the lift to general relativity
The free-particle action generalises to a Lorentzian manifold by replacing the flat-spacetime proper time with the curved-spacetime arc length: . The Euler-Lagrange equations of this action give the geodesic equation $$ \frac{d^2 x^\mu}{d\tau^2} + \Gamma^\mu{}{\nu\rho},\frac{dx^\nu}{d\tau}\frac{dx^\rho}{d\tau} = 0, $$ where $\Gamma^\mu{}{\nu\rho}p^\mu = m,dx^\mu/d\tau\nabla_u p^\mu = 0\nabla_uu^\mudp^\mu/d\tau = 0p^\mu$ has zero covariant derivative along the worldline.
The equivalence principle — that gravitational and inertial forces are locally indistinguishable — translates kinematically into the statement that, in any sufficiently small spacetime region around a point, one can choose coordinates in which the metric reduces to and the Christoffel symbols vanish to first order. In such local inertial coordinates the special-relativistic four-momentum formalism applies exactly to leading order, with curvature effects appearing at second order through the Riemann tensor.
Synthesis. The four-velocity, four-momentum, and energy-momentum identity build toward the entire edifice of relativistic dynamics. The central insight is that energy and momentum are not independent quantities but different projections of a single Lorentz-covariant object, and this is exactly the structure that identifies kinematic quantities with Noether charges of spacetime symmetries. Putting these together with Wigner's classification, every elementary particle is labelled by an irreducible representation of the Poincaré group, and the energy-momentum identity is the Casimir invariant in disguise.
The Mandelstam-variable structure generalises the energy-momentum identity from a single-particle invariant to a 2-to-2-scattering invariant set , and the bridge is between high-energy collider phenomenology and the abstract analytic structure of scattering amplitudes. The action principle identifies the classical worldline with the proper-time-extremising path; the foundational reason this works is that proper time is the only Lorentz-invariant scalar one can construct from a single worldline, and the principle of extremal action then forces inertial motion. The pattern recurs in general relativity: the geodesic equation in curved spacetime is the Euler-Lagrange equation of with proper time replaced by the arc length of the curved-spacetime metric.
The Thomas precession identifies a non-Abelian feature of the Lorentz group hidden in non-collinear boosts: the composition of two boosts in different directions is a boost plus a rotation, and this rotation produces measurable atomic fine-structure splittings. The foundational reason fine structure is reduced by a factor of 2 compared with the naive computation is precisely this kinematic compositional twist. Appears again in 13.02.01 when the tensor formalism on smooth manifolds generalises the flat-spacetime four-momentum apparatus to general spacetimes, where the conservation law becomes with covariant derivatives.
Full proof set Master
Proposition 1 (Källén triangle function and two-body decay phase space). For a particle of mass at rest decaying into two particles of masses and , the magnitude of the daughter three-momentum in the parent rest frame is $$ |\mathbf{p}|c = \frac{\sqrt{\lambda(M^2 c^4, m_1^2 c^4, m_2^2 c^4)}}{2 M c^2}, \qquad \lambda(a,b,c) := a^2 + b^2 + c^2 - 2ab - 2ac - 2bc. $$ The decay is kinematically allowed if and only if .
Proof. By conservation with . Define . From the energy-momentum identity in the parent rest frame: , . Conservation of energy: . So $$ \sqrt{p^2 c^2 + m_1^2 c^4} + \sqrt{p^2 c^2 + m_2^2 c^4} = M c^2. $$
Isolate one radical and square: $$ p^2 c^2 + m_1^2 c^4 = (M c^2 - \sqrt{p^2 c^2 + m_2^2 c^4})^2 = M^2 c^4 - 2 M c^2 \sqrt{p^2 c^2 + m_2^2 c^4} + p^2 c^2 + m_2^2 c^4. $$ The terms cancel: $$ m_1^2 c^4 - M^2 c^4 - m_2^2 c^4 = -2 M c^2 \sqrt{p^2 c^2 + m_2^2 c^4}. $$
Divide and square: $$ 4 M^2 c^4 (p^2 c^2 + m_2^2 c^4) = (M^2 c^4 + m_2^2 c^4 - m_1^2 c^4)^2. $$ Expand and rearrange: $$ 4 M^2 c^4 \cdot p^2 c^2 = (M^2 c^4 + m_2^2 c^4 - m_1^2 c^4)^2 - 4 M^2 c^4 \cdot m_2^2 c^4. $$
The right side simplifies to the Källén function , so $$ p^2 c^2 = \frac{\lambda(M^2 c^4, m_1^2 c^4, m_2^2 c^4)}{4 M^2 c^4}. $$ Taking the square root gives the claimed formula. The Källén function factorises as for positive arguments; positivity requires , i.e., . The decay is kinematically allowed precisely in this regime.
Proposition 2 (orthogonality of four-acceleration and four-velocity). Along any timelike worldline, .
Proof. The four-velocity satisfies , a constant. Differentiating with respect to proper time: $$ \frac{d}{d\tau}(\eta_{\mu\nu} u^\mu u^\nu) = \eta_{\mu\nu},\frac{du^\mu}{d\tau},u^\nu + \eta_{\mu\nu},u^\mu,\frac{du^\nu}{d\tau} = 2\eta_{\mu\nu},\alpha^\mu u^\nu = 2,u_\mu \alpha^\mu. $$ The total derivative vanishes because the magnitude is constant, so .
Proposition 3 (rapidity additivity for collinear boosts). For two boosts of velocities along the same spatial axis, the composite boost has velocity , equivalently rapidity where .
Proof. The Lorentz boost matrix along the -axis with velocity has the form $$ \Lambda(\phi) = \begin{pmatrix} \cosh\phi & -\sinh\phi & 0 & 0 \ -\sinh\phi & \cosh\phi & 0 & 0 \ 0 & 0 & 1 & 0 \ 0 & 0 & 0 & 1 \end{pmatrix}, $$ with . The product of two such boosts is $$ \Lambda(\phi_1)\Lambda(\phi_2) = \begin{pmatrix} \cosh\phi_1\cosh\phi_2 + \sinh\phi_1\sinh\phi_2 & -(\cosh\phi_1\sinh\phi_2 + \sinh\phi_1\cosh\phi_2) & 0 & 0 \ -(\sinh\phi_1\cosh\phi_2 + \cosh\phi_1\sinh\phi_2) & \cosh\phi_1\cosh\phi_2 + \sinh\phi_1\sinh\phi_2 & 0 & 0 \ 0 & 0 & 1 & 0 \ 0 & 0 & 0 & 1 \end{pmatrix}. $$ By the hyperbolic addition formulas and , this product equals . The corresponding velocity is .
Connections Master
SR postulates and Lorentz transformations
10.05.01. Supplies the framework of Lorentz transformations acting on Minkowski spacetime, on which the four-velocity and four-momentum live as covariant tensors. The transformation law is a direct consequence of the postulates, and the invariance of is the algebraic statement that Lorentz transformations preserve the Minkowski inner product.Relativistic kinematics and dynamics
10.05.02. Direct prerequisite. The relativistic kinematics unit develops , , and the energy-momentum relation in three-vector language. The present unit lifts these to the covariant four-vector formalism and derives the energy-momentum identity as the Minkowski-norm constancy of . The Compton-scattering and threshold-energy computations there are recovered here as applications of four-momentum conservation and Mandelstam-variable analysis.Covariant electrodynamics
10.06.01. The covariant Lorentz-force law uses the four-momentum and four-velocity defined here as the dynamical variables. The Faraday tensor acts on to produce , and the spatial components recover the three-vector Lorentz force . The minimal-coupling action unifies free-particle motion with electromagnetic interaction.Dirac equation
12.11.01. The quantum-mechanical analog of the energy-momentum identity is the Dirac equation , obtained by "taking the square root" of at the operator level. The requirement that the square root yield a first-order linear differential equation forces the introduction of Dirac matrices and four-component spinor wavefunctions, predicting antimatter and spin-1/2 as kinematic consequences of relativistic invariance. The energy-momentum identity is the classical foundation on which relativistic quantum mechanics rests.Tensors on smooth manifolds
13.02.01. The general-relativistic generalisation replaces flat-spacetime four-vectors with tensors on a Lorentzian manifold and the conservation law with the covariant version . The four-momentum is parallel-transported along geodesics, generalising free-particle inertial motion to curved spacetime. The four-momentum framework lifts essentially unchanged from special to general relativity, with only the geometric setting modified.
Historical and philosophical context Master
Einstein 1905 "Zur Elektrodynamik bewegter Körper" [Einstein1905a] derived the Lorentz transformation from the postulates of relativity and the constancy of the speed of light, then deduced relativistic momentum addition and the velocity-addition law. The companion paper [Einstein1905b] "Ist die Trägheit eines Körpers von seinem Energieinhalt abhängig?" derived as the inertia of energy: a body emitting electromagnetic radiation of energy loses mass . The 1907 review article [Einstein1907] "Über das Relativitätsprinzip und die aus demselben gezogenen Folgerungen" (Jahrbuch der Radioaktivität und Elektronik 4, 411-462) consolidated the rest-energy formulation. The four-momentum as a unified geometric object appeared with Minkowski's 1908 Cologne lecture [Minkowski1908] "Raum und Zeit": "Von Stund' an sollen Raum für sich und Zeit für sich völlig zu Schatten herabsinken, und nur noch eine Art Union der beiden soll Selbständigkeit bewahren."
Minkowski's reformulation cast energy and momentum as a single four-vector , and the invariant unified the separate Newtonian conservation laws into a single covariant identity. The geometric viewpoint was initially resisted (Einstein himself was skeptical) but became essential when general relativity required curved-spacetime tensor analysis.
The Noether-charge interpretation [Noether1918] of four-momentum awaited Emmy Noether's 1918 theorem, which identified the integrated stress-energy tensor as the conserved current of spacetime translation symmetry. This connection made four-momentum a fundamental object of Lorentz-invariant field theory rather than a kinematic auxiliary, and it is the modern starting point for relativistic quantum field theory.
Thomas's 1926 paper [Thomas1926] on the spinning electron solved a long-standing puzzle in atomic spectroscopy: the spin-orbit interaction computed naively (in the electron rest frame, then transformed back to the lab) gave a fine-structure splitting twice as large as observed. Thomas showed that the non-collinear boost composition contributes a kinematic precession at half the rate, exactly cancelling the discrepancy. The "Thomas factor" of 1/2 in atomic fine structure is a relativistic kinematic effect with no Newtonian analog.
The Wigner 1939 classification [Wigner1939] of unitary irreducible representations of the Poincaré group made the universal labels of elementary particles. The Pauli-Lubanski four-vector introduced by Pauli 1939 and Lubanski 1942 [Lubanski1942] is the second Casimir of the Poincaré algebra, and its squared norm for massive particles equals . Every elementary particle in the Standard Model is identified by its Wigner pair: photon with helicity , electron , Higgs , graviton (postulated) .
The Mandelstam 1958 paper [Mandelstam1958] introduced the invariants for scattering and the corresponding analytic continuation, which became foundational for -matrix theory and dispersion relations. Mandelstam's framework underlies the analytic-bootstrap programme of the 1960s and the modern amplitude-based approach to quantum field theory.
The interplay between and nuclear physics emerged in the 1930s with the discovery of nuclear binding-energy defects. A deuteron H has rest energy approximately MeV less than the sum of its constituent proton and neutron rest energies, and this mass defect equals the deuteron binding energy. Uranium-235 fission releases roughly MeV per nucleus, corresponding to a mass defect of about 0.1% of the total nuclear mass. The Bethe-Weizsäcker semi-empirical mass formula (Bethe and Bacher 1936 Rev. Mod. Phys. 8, 82; Weizsäcker 1935 Z. Phys. 96, 431) systematised the dependence of binding energy on nuclear mass number, identifying iron-56 as the most tightly bound nucleus per nucleon. The applications to nuclear reactors and weapons followed during the 1940s as direct macroscopic realisations of .
Antimatter prediction and discovery provided dramatic confirmation of the energy-momentum identity in quantum field theory. Dirac's 1928 equation, derived by demanding a first-order linear wave equation consistent with , predicted negative-energy solutions that Dirac interpreted as a sea of filled states with vacancies behaving as positively-charged particles of electron mass. Anderson's 1932 Phys. Rev. 41 discovery of the positron in cosmic-ray cloud-chamber tracks confirmed this prediction, and the subsequent observation of pair production (with the nucleus absorbing recoil momentum) demonstrated direct conversion of electromagnetic energy into rest mass. The threshold MeV follows immediately from four-momentum conservation and the energy-momentum identity, with the nucleus required to satisfy the kinematic constraint that single-photon pair production in vacuum is forbidden because two massive particles cannot have total four-momentum equal to that of a single photon.
The lineage Einstein-Minkowski-Noether-Thomas-Wigner-Mandelstam traces a single thematic arc: the progressive identification of relativistic kinematic quantities with the representation theory of the Poincaré group. The energy-momentum identity is the Casimir constraint of the simplest non-vanishing unitary representation of the spacetime symmetry group, and the entire subsequent development of relativistic quantum field theory rests on this identification.
Bibliography Master
@article{Einstein1905a,
author = {Einstein, Albert},
title = {Zur Elektrodynamik bewegter K\"orper},
journal = {Annalen der Physik},
volume = {17},
pages = {891--921},
year = {1905},
}
@article{Einstein1905b,
author = {Einstein, Albert},
title = {Ist die Tr\"agheit eines K\"orpers von seinem Energieinhalt abh\"angig?},
journal = {Annalen der Physik},
volume = {18},
pages = {639--641},
year = {1905},
}
@misc{Minkowski1908,
author = {Minkowski, Hermann},
title = {Raum und Zeit},
note = {Lecture at the 80th meeting of German Natural Scientists, Cologne, September 21},
year = {1908},
}
@article{Einstein1907,
author = {Einstein, Albert},
title = {\"Uber das Relativit\"atsprinzip und die aus demselben gezogenen Folgerungen},
journal = {Jahrbuch der Radioaktivit\"at und Elektronik},
volume = {4},
pages = {411--462},
year = {1907},
}
@article{Thomas1926,
author = {Thomas, L. H.},
title = {The motion of the spinning electron},
journal = {Nature},
volume = {117},
pages = {514},
year = {1926},
}
@article{Wigner1939,
author = {Wigner, Eugene P.},
title = {On unitary representations of the inhomogeneous {L}orentz group},
journal = {Annals of Mathematics},
volume = {40},
pages = {149--204},
year = {1939},
}
@article{Mandelstam1958,
author = {Mandelstam, S.},
title = {Determination of the pion-nucleon scattering amplitude from dispersion relations and unitarity},
journal = {Physical Review},
volume = {112},
pages = {1344--1360},
year = {1958},
}
@article{Lubanski1942,
author = {Lubanski, J. K.},
title = {Sur la th\'eorie des particules \'el\'ementaires de spin quelconque. I},
journal = {Physica},
volume = {9},
pages = {310--324},
year = {1942},
}
@article{Noether1918,
author = {Noether, Emmy},
title = {Invariante Variationsprobleme},
journal = {Nachrichten der K\"oniglichen Gesellschaft der Wissenschaften zu G\"ottingen, Mathematisch-physikalische Klasse},
pages = {235--257},
year = {1918},
}
@book{MisnerThorneWheeler1973,
author = {Misner, Charles W. and Thorne, Kip S. and Wheeler, John Archibald},
title = {Gravitation},
publisher = {W. H. Freeman},
year = {1973},
}
@book{Weinberg1972,
author = {Weinberg, Steven},
title = {Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity},
publisher = {Wiley},
year = {1972},
}
@book{Rindler1991,
author = {Rindler, Wolfgang},
title = {Introduction to Special Relativity},
edition = {2},
publisher = {Oxford University Press},
year = {1991},
}
@book{LandauLifshitz1980,
author = {Landau, L. D. and Lifshitz, E. M.},
title = {The Classical Theory of Fields},
edition = {4},
publisher = {Pergamon},
year = {1980},
}
@book{Hartle2003,
author = {Hartle, James B.},
title = {Gravity: An Introduction to {E}instein's General Relativity},
publisher = {Pearson},
year = {2003},
}