08.10.08 · stat-mech / qft

Langevin updates and lattice numerics

shipped3 tiersLean: none

Anchor (Master): Parisi & Wu, *Sci. Sin.* 24, 483 (1981) (stochastic quantisation); Damgaard & Hüffel, *Phys. Rep.* 152 (1987) (the canonical review); Zwanziger, *Nucl. Phys. B* 192, 259 (1981) (stochastic gauge fixing); Drummond, Horgan & Pendleton, *Nucl. Phys. B* 220, 119 (1983) (Langevin on lattice gauge theory); Parisi, *Phys. Lett. B* 131, 393 (1983) (complex Langevin); Duane, Kennedy, Pendleton & Roweth, *Phys. Lett. B* 195, 216 (1987) (hybrid Monte Carlo); Kennedy & Bowler, *Nucl. Phys. B* 245, 217 (1985)

Intuition Beginner

A Langevin update is a recipe for sampling a probability distribution by simulating a noisy ball rolling downhill on a potential landscape. The ball's position represents a field configuration. The landscape is the action, and lower-action configurations are deeper valleys. You let the ball roll a small step downhill, then give it a small random kick, then repeat. After a long time, the position of the ball is a sample from the Gibbs distribution that weights configurations by .

This is the whole game of stochastic field theory on a lattice. You replace the continuous space of field configurations with a finite grid, you write down the action as a function of the values on grid points, and you simulate the noisy-ball recipe on the resulting finite-dimensional landscape. The random kicks are calibrated against the step size so that the equilibrium is the Euclidean path-integral measure of the field theory.

Why does this matter? The path-integral measure on field configurations is hard to make sense of directly; the noisy-ball construction gives you a concrete computer-implementable way to draw samples from it. Stochastic quantisation, due to Parisi and Wu in 1981, takes this seriously as a quantisation procedure: the fictitious time of the noisy-ball simulation is an extra dimension, and the original physics lives at the equilibrium of the noisy dynamics. Gauge theories benefit because the equilibrium measure is gauge-invariant; the noisy ball does not need to be told which gauge to pick.

Visual Beginner

A schematic showing a small two-dimensional lattice on the left with field values at each site, and on the right a noisy trajectory in stochastic time that visits many configurations and relaxes toward the equilibrium distribution. Arrows on the trajectory indicate that each step is a small downhill move on the action landscape plus a small random kick.

The picture captures the essence: a lattice fixes the space of field configurations, the action assigns a number to each configuration, and the noisy ball samples that number via as the long-time equilibrium of a simulated stochastic flow.

Worked example Beginner

Sample the harmonic oscillator on a one-site lattice using the Langevin recipe and check the variance of the samples.

Step 1. The configuration is a single number , the action is with , and the temperature is . The Gibbs distribution is , a standard normal. We expect the equilibrium variance of to equal .

Step 2. Pick a step size . Start from . At each step, compute the downhill direction as the negative gradient of the action: . Draw a random kick from a standard normal distribution.

Step 3. The Euler-Maruyama update reads . With this becomes . Each step moves slightly toward the origin and adds a small random kick.

Step 4. Run the update for steps, starting from . The first few hundred samples are biased by the initial condition. Discard the first steps as burn-in. Collect the remaining samples .

Step 5. Compute the sample mean by adding up all values of and dividing by ; the answer is close to . Compute the sample variance by adding up all values of and dividing by ; the answer is close to . The variance is close to the theoretical value . The Gibbs distribution has been recovered as the equilibrium of the noisy dynamics.

What this tells us: the Langevin recipe samples the Gibbs distribution as the long-time equilibrium of a noisy dynamics. The simulation step is short and explicit; the only inputs are the gradient of the action and a random-number generator. The same recipe extends to a lattice with many sites, where the gradient at each site is the local-action derivative and the random kicks are independent across sites.

Check your understanding Beginner

Formal definition Intermediate+

Fix a finite lattice in spatial dimensions with lattice spacing and side length , so is the number of sites. A real scalar field configuration is a function , which we identify with a vector in .

The lattice action is the function $$ S[\phi] = \sum_{x \in \Lambda} \left[ \tfrac{1}{2} |\nabla \phi(x)|^2 + V(\phi(x)) \right], $$ where the lattice gradient is and is a fixed scalar potential — say for a massive theory. The action defines the Euclidean Gibbs measure $$ d\mu_S(\phi) = \frac{1}{Z} e^{-S[\phi]} \prod_{x \in \Lambda} d\phi(x), \qquad Z = \int_{\mathbb{R}^{|\Lambda|}} e^{-S[\phi]} \prod_{x \in \Lambda} d\phi(x). $$ This is a probability measure on , and the expectation recovers the lattice path-integral expectation value of an observable .

The Langevin equation with stochastic time is the system of stochastic differential equations $$ \frac{d \phi(x, \tau)}{d \tau} = -\frac{\delta S}{\delta \phi(x)}[\phi(\cdot, \tau)] + \eta(x, \tau), \qquad x \in \Lambda, $$ where is a Gaussian white-noise field on with mean zero and covariance $$ \mathbb{E}[\eta(x, \tau), \eta(y, \tau')] = 2, \delta_{xy}, \delta(\tau - \tau'). $$ The drift is the negative gradient of the action; the noise is independent across sites and white in stochastic time. The variance factor is the conventional normalisation, chosen so that the equilibrium temperature equals in our units.

The Euler-Maruyama discretisation with step size replaces the continuous SDE by the explicit update rule $$ \phi_{n+1}(x) = \phi_n(x) - \Delta\tau \cdot \frac{\partial S}{\partial \phi(x)}[\phi_n] + \sqrt{2 \Delta\tau} \cdot \xi_n(x), $$ where are i.i.d. standard normal across sites and stochastic-time steps . After steps the discrete trajectory is a random variable whose distribution converges, in a sense made precise in the next section, to the Gibbs measure .

A lattice observable is a measurable function — typically a polynomial in the field values such as for the two-point function. The Langevin estimator of the expectation is the trajectory average $$ \hat{f}N = \frac{1}{N - N_0} \sum{n = N_0 + 1}^{N} f(\phi_n), $$ where is a burn-in cutoff chosen to discard the transient relaxation of from its initial condition. Under coercivity hypotheses on (e.g. as ), the ergodic theorem for Langevin SDEs guarantees almost surely as followed by .

Counterexamples to common slips

  • The noise normalisation matters. The variance factor in is set by the choice of "temperature" in the Gibbs measure ; rescaling the noise to shifts the equilibrium to with a different temperature. Mixing conventions across units is a common source of factor-of-two errors.
  • The drift sign matters. With instead of , the flow goes uphill on the action landscape; the equilibrium does not exist (or, with periodic boundary conditions in , would be , which is non-normalisable for actions unbounded above).
  • The Euler-Maruyama scheme has order- bias in the stationary distribution. The samples come from a perturbed measure with a correction depending on the second derivatives of . The bias vanishes as , or one switches to an exact-sampling variant (hybrid Monte Carlo) that adds a Metropolis accept/reject.

Key theorem with proof Intermediate+

Theorem (Langevin equilibrium identification; Chatterjee Lecture 11 [Chatterjee], Parisi-Wu 1981 [Parisi-Wu]). Let be a finite lattice, a action satisfying the coercivity condition as and . Let solve the Langevin SDE* $$ d\phi(\tau) = -\nabla S(\phi(\tau)), d\tau + \sqrt{2}, dW(\tau), $$ *with a standard -dimensional Brownian motion. Then the unique stationary distribution of is the Gibbs measure* $$ d\mu_S(\phi) = \frac{1}{Z} e^{-S(\phi)} d\phi, \qquad Z = \int_{\mathbb{R}^{|\Lambda|}} e^{-S(\phi)} d\phi. $$ *For every continuous bounded observable and every initial condition , as , with the convergence exponentially fast in when for some (Bakry-Émery curvature-dimension criterion CD()).

Proof. The Langevin SDE is a special case of the Itô SDE with drift and constant diffusion matrix . The Itô formula gives, for any test function , $$ df(\phi(\tau)) = \nabla f(\phi(\tau)) \cdot d\phi(\tau) + \tfrac{1}{2} \sum_{x, y} (\sigma \sigma^T){xy} , \tfrac{\partial^2 f}{\partial \phi(x) \partial \phi(y)} d\tau. $$ Taking the expectation, the stochastic-integral term drops, and substituting and gives the generator of the process: $$ (Lf)(\phi) = -\nabla S(\phi) \cdot \nabla f(\phi) + \Delta f(\phi), $$ where is the Laplacian on . The Fokker-Planck equation for the time-dependent density of is $$ \partial\tau p = L^* p = \nabla \cdot (p \nabla S) + \Delta p, $$ where is the formal -adjoint of .

A stationary density satisfies . Substitute : $$ L^* p_{\mathrm{eq}} = \nabla \cdot (e^{-S}/Z \cdot \nabla S) + \Delta(e^{-S}/Z). $$ Compute the second term using and . So $$ \Delta(e^{-S}/Z) = (1/Z)(- e^{-S} \Delta S + e^{-S} |\nabla S|^2), $$ and $$ \nabla \cdot (e^{-S}/Z \cdot \nabla S) = (1/Z)(- e^{-S} |\nabla S|^2 + e^{-S} \Delta S). $$ The two terms cancel: . So the Gibbs density is a stationary solution of the Fokker-Planck equation.

For uniqueness, recall from 08.10.02 that the Fokker-Planck operator on has a one-dimensional kernel spanned by whenever is coercive, by the spectral gap of the symmetric form on weighted . The Bakry-Émery criterion for identifies as a lower bound for the spectral gap, giving exponential convergence of the relative entropy.

To conclude: for every continuous bounded , write . The Csiszár-Kullback-Pinsker inequality bounds the distance by the relative entropy, which decays exponentially. So at the rate .

Bridge. The Langevin equilibrium identification builds toward every numerical implementation of Euclidean field theory: the same identification on a finite lattice produces the path-integral measure of a regulated quantum field theory as the long-time limit of a stochastic dynamics, and this is exactly the foundational reason stochastic quantisation is a viable construction route. The central insight is exactly that the Fokker-Planck generator of the Langevin SDE is symmetric with respect to the inner product on weighted , putting these together with the spectral-gap argument from 08.10.02 to give a unique exponentially-attracting stationary measure. This is exactly the same detailed-balance / reversibility identification that appears again in 08.10.07 when Wightman functions of a Euclidean field theory are recovered from Osterwalder-Schrader reflection positivity. The foundational reason the cancellation works is the algebraic identity , which generalises to the statement that the Fokker-Planck generator is exactly in the gradient-flow form — the central insight that identifies Langevin dynamics with gradient descent in the Wasserstein metric, the bridge between stochastic processes and optimisation theory. Putting these together with the Euler-Maruyama discretisation gives a computer-implementable sampler whose bias is order , and the bridge to exact sampling is the hybrid Monte Carlo correction discussed below.

Exercises Intermediate+

Advanced results Master

Theorem (Parisi-Wu stochastic quantisation; Parisi-Wu 1981 [Parisi-Wu], Damgaard-Hüffel 1987 [Damgaard-Huffel]). Let be a lattice action on . The Langevin SDE in fictitious stochastic time has the Euclidean Gibbs measure as its unique stationary distribution, and the long-time average of any observable along a Langevin trajectory equals the path-integral expectation value of the lattice quantum field theory. The construction extends to lattice gauge theory: on the configuration manifold of group-valued link variables (with a compact Lie group), the Lie-algebra-valued Langevin SDE with the appropriate Haar-measure drift correction has the Wilson measure as its equilibrium, without conventional gauge fixing.

The proof is the finite-dimensional Fokker-Planck argument of the Intermediate-tier theorem, applied first to for scalar fields and then to the compact group manifold for gauge fields. On the group manifold, the Fokker-Planck operator picks up curvature-of-Haar-measure corrections, and the Stratonovich-to-Itô conversion gives a drift that respects the Haar volume; the Wilson measure then satisfies by the same algebraic-identity computation, with the group Laplacian replacing the Euclidean Laplacian. Damgaard-Hüffel's 1987 review consolidates the framework and the perturbative agreement with the conventional Faddeev-Popov gauge-fixed path integral order-by-order in coupling.

Theorem (Zwanziger stochastic gauge fixing; Zwanziger 1981 [Zwanziger]). Augment the gauge-theory Langevin equation with an additional drift along gauge orbits, parametrised by an arbitrary gauge-fixing function . The augmented SDE $$ \frac{dU(x, \mu, \tau)}{d\tau} = \big( -i \nabla_a S_W - i \nabla_a G[U] \big) T^a U + i\eta_a T^a U $$ has the same gauge-invariant Wilson equilibrium measure as the unmodified SDE. The extra drift selects a representative on each gauge orbit without changing the expectation value of any gauge-invariant observable, providing a Lorentz-covariant computational alternative that does not require a Faddeev-Popov ghost determinant and does not suffer the Gribov ambiguity.

The argument: the Wilson action is gauge-invariant, so the gauge-orbit drift has no projection onto the gauge-invariant equilibrium density. The Fokker-Planck operator splits as , and the gauge-invariant part is what was identified in the unmodified theorem; the gauge-orbit part annihilates any function depending only on gauge-invariant observables. So expectation values of Wilson loops and other gauge-invariant operators are unaffected by . The Lorentz-covariant choice for in the small- continuum limit recovers the conventional covariant gauge with no ghost insertion.

Theorem (Drummond-Horgan-Pendleton lattice-Langevin algorithm; 1983 [Drummond-Horgan-Pendleton]). The discrete-time stochastic Langevin update for lattice gauge theory, with a single-step generator that preserves the Haar measure to leading order in , samples the Wilson measure with -bias of order and converges in distribution to the continuous-time Langevin SDE as . Numerical simulations on and recover the conventional Monte Carlo plaquette expectation values within statistical errors at small enough for the bias to be subleading.

This was the first systematic Langevin implementation of lattice gauge theory and remains a reference benchmark. The update is the group-manifold analogue of Euler-Maruyama: starting from , propose with standard Gaussian. The exponential map keeps the trajectory on the group, and the Stratonovich-style discretisation produces the correct Itô correction.

Theorem (Duane-Kennedy-Pendleton-Roweth hybrid Monte Carlo; 1987 [DKPR]). Augment the configuration space with a fictitious momentum and the augmented Hamiltonian . The HMC update consists of (i) Gibbs sampling of , (ii) leapfrog integration of Hamilton's equations for over steps with size , (iii) a Metropolis accept/reject step with acceptance probability . The augmented Markov chain has as its exact stationary distribution; marginalising over recovers exactly, with no order- bias.

HMC has become the dominant sampling algorithm for lattice QCD because it bypasses the order- bias of plain Langevin updates while maintaining the favourable scaling of stochastic methods over local Metropolis updates. The combination — exact target, no rejection-of-individual-coordinates inefficiency, long-trajectory exploration — is the central reason essentially all production lattice-QCD calculations since the early 1990s use HMC or its descendants (Rational HMC, mass preconditioning, multi-time-scale integration).

Theorem (Kennedy-Bowler pseudofermion Langevin; 1985 [Kennedy-Bowler]). Lattice gauge theory with dynamical fermions has an action , with the lattice Dirac operator. The fermion determinant is replaced by a Gaussian pseudofermion integral . The Langevin update on with the joint action samples the dynamical-fermion measure, with computed by iterative inversion at each step.

This was the workhorse algorithm for lattice QCD with dynamical fermions in the second half of the 1980s, before HMC overtook it. The Langevin-style updates allowed the first systematic studies of unquenched lattice QCD; the pseudofermion trick is what made it computationally feasible to handle the non-local fermion determinant.

Theorem (complex Langevin; Parisi 1983 [Parisi-1983], Aarts-Seiler-Stamatescu 2010 [ASS-2010]). For an action with a complex or signed contribution (e.g., finite chemical potential, real-time path integral), naively the integrand is not a probability density, and conventional Langevin sampling does not apply. Complex Langevin extends the framework by complexifying the configuration space to and integrating the Langevin SDE with complex drift. When the drift on the complexified manifold is holomorphic with sufficient decay at infinity, the resulting expectation values agree with the analytic continuation of the original path integral.

The convergence of complex Langevin in the general case is controversial. Aarts-Seiler-Stamatescu 2010 [ASS-2010] identifies the holomorphic-drift criterion and the requirement of exponentially-decaying measures on the complexified configuration space; specific failure modes include drift singularities, slow decay at infinity, and "incorrect convergence" to wrong-but-stable distributions. The framework remains an active research topic in lattice field theory at finite chemical potential and real-time.

Theorem (connection to Markov-chain Monte Carlo theory). The discretised Langevin update is a Markov chain on with transition kernel $$ P_{\Delta\tau}(\phi, d\phi') = \mathcal{N}(\phi - \Delta\tau \nabla S(\phi),, 2 \Delta\tau \cdot I)(d\phi'). $$ The chain does not satisfy detailed balance with respect to the Gibbs measure at finite — the bias is order — but it does in the limit. The Metropolis-Hastings correction (Roberts-Tweedie 1996; the "MALA" algorithm: Metropolis-adjusted Langevin algorithm) accepts proposals with probability , restoring exact detailed balance. The HMC scheme of Duane-Kennedy-Pendleton-Roweth 1987 is the higher-order symplectic generalisation.

The connection identifies Langevin sampling as a particular case of MCMC theory, with the additional structure of continuous-time relaxation. The unifying perspective gives the modern toolkit of Wasserstein-gradient-flow methods (Jordan-Kinderlehrer-Otto 1998), the entropy-method convergence analysis (Bakry-Émery, Otto-Villani), and the high-dimensional concentration estimates used in machine-learning Langevin samplers (Langevin dynamics MCMC, score-based generative models, diffusion models).

Synthesis. Langevin updates and lattice numerics put together the foundational reason stochastic quantisation is a viable construction route for Euclidean field theory: a finite-dimensional Itô SDE on with drift the negative gradient of the action and white noise has the Euclidean Gibbs measure as its unique exponentially-attracting stationary distribution, and this is exactly the algebraic content of the Fokker-Planck identity derived in 08.10.02. The central insight is exactly that the Langevin-Fokker-Planck pair encodes a reversible Markov dynamics whose detailed-balance distribution is the path-integral measure, putting these together with Euler-Maruyama discretisation and Metropolis accept/reject corrections to produce a computer-implementable sampler with controlled bias. The bridge is the recognition that this framework generalises in three directions: to gauge theories on compact group manifolds (Drummond-Horgan-Pendleton 1983), where the Haar measure replaces Lebesgue measure and the gauge-invariant Wilson measure emerges as the equilibrium without conventional gauge fixing; to fermionic theories (Kennedy-Bowler 1985), where pseudofermion auxiliary fields convert the determinant into an integral that the Langevin sampler can handle; and to signed-measure regimes (Parisi 1983), where complexification of the configuration space gives a partial — and controversial — extension to actions with imaginary parts.

The bridge between Langevin sampling and the Wightman framework of 08.10.07 is Osterwalder-Schrader: the Schwinger functions sampled by Langevin updates on the Euclidean side reconstruct the Wightman functions on the Minkowski side by analytic continuation, identifying stochastic-quantisation expectation values with relativistic vacuum expectations of operator products. This is exactly the same reflection-positivity / detailed-balance identification that appears again in the Glimm-Jaffe constructive program, where the Langevin / Markov-chain framework supplies the existence of the measures that the Osterwalder-Schrader axioms require.

The synthesis runs in the opposite direction as well. Hybrid Monte Carlo (Duane-Kennedy-Pendleton-Roweth 1987) trades the order- bias of plain Langevin for exact sampling at the cost of a Metropolis accept/reject; the central insight is that augmenting the configuration space with a fictitious momentum produces a Hamiltonian flow whose volume-preserving symplectic integrator gives unbiased proposals. This is dual to the Langevin gradient-flow picture in the same way Hamiltonian dynamics is dual to gradient descent: the leapfrog integrator generalises Euler-Maruyama by adding ballistic exploration along the level sets of the augmented Hamiltonian. Putting these together with mass preconditioning (Hasenbusch 2001), rational HMC (Clark-Kennedy 2007), and multi-time-scale integration produces the modern production toolkit for lattice QCD, which has computed the proton mass, the QCD spectrum, hadronic structure functions, and electroweak matrix elements to percent-level precision from first principles. The foundational reason the toolkit works is that Langevin-style stochastic dynamics on the lattice path integral converges to the right equilibrium; the central insight is that the equilibrium is gauge-invariant from the outset, so no Faddeev-Popov machinery is needed to extract physics. The bridge between lattice numerics and machine learning is the recognition that score-based generative models, denoising diffusion models, and Langevin-dynamics MCMC samplers in modern probabilistic ML are all instances of the Parisi-Wu construction with a learned drift; the central insight that identifies stochastic quantisation with score-based generation is that both produce samples from a target measure as the long-time equilibrium of a noisy gradient flow.

Full proof set Master

Proposition (detailed balance for the Langevin SDE). The continuous-time Langevin SDE on satisfies detailed balance with respect to the Gibbs measure . Equivalently, the generator is symmetric on .

Proof. For , compute the bilinear form . Substituting and integrating the Laplacian by parts in : $$ \int (\Delta f)(\phi), g(\phi), e^{-S(\phi)}, d\phi = -\int \nabla f \cdot \nabla(g, e^{-S}), d\phi = -\int \nabla f \cdot \nabla g, e^{-S}, d\phi + \int \nabla f \cdot \nabla S, g, e^{-S}, d\phi. $$ The boundary term at infinity vanishes by the compact support of . Adding the explicit term from : $$ \langle Lf, g \rangle_{\mu_S} = -\frac{1}{Z} \int \nabla f \cdot \nabla g, e^{-S}, d\phi. $$ The right-hand side is symmetric under , so . Symmetry of on is equivalent to detailed balance of the SDE: the transition density satisfies for every and almost every .

Proposition (exponential ergodicity under Bakry-Émery). If the action satisfies pointwise for some (in the sense of symmetric matrices), the Langevin SDE converges exponentially to the Gibbs measure in relative entropy: $$ H(p(\cdot, \tau), |, \mu_S) \leq e^{-2 \rho \tau}, H(p(\cdot, 0), |, \mu_S), $$ where is the Kullback-Leibler divergence.

Proof. Differentiate the relative entropy along the Fokker-Planck flow : $$ \frac{d}{d\tau} H(p, |, \mu_S) = \int \partial_\tau p \cdot \log(p \cdot e^S \cdot Z), d\phi = -\int \frac{|\nabla(p, e^S)|^2}{p, e^S}, e^{-S}, d\phi = -I(p, |, \mu_S), $$ where is the Fisher information of relative to . The Bakry-Émery curvature-dimension criterion implies the log-Sobolev inequality for all probability densities . So $$ \frac{d}{d\tau} H(p | \mu_S) \leq -2\rho, H(p | \mu_S), $$ and Gronwall's inequality gives . The Csiszár-Kullback-Pinsker inequality bounds , giving convergence in total variation.

Proposition (Euler-Maruyama weak convergence). Let be globally Lipschitz and bounded, constant. For every and every bounded , $$ \big| \mathbb{E}[f(\phi_N^{\Delta\tau})] - \mathbb{E}[f(\phi(T))] \big| \leq C(f, T, b), \Delta\tau, $$ where solves and is the Euler-Maruyama discretisation with .

Proof. Define , which satisfies the backward Kolmogorov equation with and terminal condition . Standard regularity for parabolic PDEs gives uniformly bounded on with bounds depending on .

The error decomposes telescopically: $$ \mathbb{E}[f(\phi_N^{\Delta\tau})] - u(\phi_0, T) = \sum_{n=0}^{N-1} \big( \mathbb{E}[u(\phi_{n+1}^{\Delta\tau}, T - (n+1)\Delta\tau)] - \mathbb{E}[u(\phi_n^{\Delta\tau}, T - n \Delta\tau)] \big). $$ Each summand is the single-step error at step . Taylor-expand around to fourth order. With , : $$ u(\phi_{n+1}^{\Delta\tau}, s - \Delta\tau) = u(\phi_n^{\Delta\tau}, s) + (\partial_s u + b \cdot \nabla u + \tfrac{\sigma^2}{2} \Delta u)(-\Delta\tau) + O(\Delta\tau^2) \cdot \xi_n^{\otimes \geq 3}. $$ Take the expectation: the linear-in- term has mean zero, the quadratic-in- term gives the contribution that combines with and to recover at leading order, and the remainder is from the cubic and quartic Taylor terms. Summing steps gives total error .

Proposition (HMC detailed balance). The hybrid Monte Carlo update (Gaussian momentum refresh, leapfrog integration of Hamiltonian for steps with step size , Metropolis accept/reject with probability ) satisfies detailed balance with respect to the augmented measure , hence samples its marginal exactly.

Proof. The augmented update consists of three steps: (a) sample — this is a Gibbs step on conditional on , which preserves by direct verification: the conditional law of given under is exactly . (b) Integrate Hamilton's equations , with the leapfrog scheme: $$ \pi_{1/2} = \pi - \tfrac{\Delta\tau}{2} \nabla S(\phi),\quad \phi' = \phi + \Delta\tau, \pi_{1/2},\quad \pi' = \pi_{1/2} - \tfrac{\Delta\tau}{2} \nabla S(\phi'), $$ iterated times. The leapfrog map is symplectic (preserves the canonical symplectic form ) and time-reversible ( under ). (c) Accept the proposal with probability where is the integrator energy drift.

Detailed balance: for any test sets , $$ \int_A \mu_H(d\phi, d\pi) \cdot P((\phi, \pi), B) = \int_B \mu_H(d\phi', d\pi') \cdot P((\phi', \pi'), A), $$ where is the HMC kernel. Symplecticity of leapfrog gives unit Jacobian, time-reversibility gives the matching backward proposal, and the Metropolis accept/reject ratio is calibrated to make the two sides equal. The technical content is the standard Metropolis-Hastings construction with the proposal density determined by the leapfrog map. Marginalising over leaves , the target measure.

Proposition (lattice harmonic-oscillator equilibrium recovered). Let be a 1D lattice of sites with periodic boundary conditions, , and the action . The Langevin SDE has equilibrium variance at every site, recovering the exact path-integral result (with ).

Proof. The action is a sum of independent harmonic oscillators at each site, so the equilibrium measure factors as . Each factor is a Gaussian with variance . So at every site, independent of . The lattice Langevin samples reproduce this value to within Monte Carlo statistical error.

Proposition (Gaussian-free-field two-point function from Langevin samples). On the 2D torus lattice with action , the Langevin equilibrium two-point function is the inverse lattice Laplacian: $$ \langle \phi(0), \phi(x) \rangle_{\mu_S} = (-\Delta_\Lambda + m^2)^{-1}(0, x), $$ recovering the Gaussian-free-field propagator of 08.06.01.

Proof. The action is a positive-definite quadratic form with the lattice Helmholtz operator. The Gibbs measure is the multivariate Gaussian on , so the equilibrium covariance is . In Fourier transform on the dual lattice with momenta , : $$ \langle \phi(0) \phi(x) \rangle = \frac{1}{L^2} \sum_{k \in \hat{\Lambda}} \frac{e^{i k \cdot x}}{\hat{k}^2 + m^2}, \quad \hat{k}^2 = 4 \sum_\mu \sin^2(k_\mu/2). $$ Langevin samples give the empirical estimator by ergodicity, with statistical error scaling as corrected by the autocorrelation time of the Langevin chain.

Proposition (HMC dominance over plain Langevin in lattice QCD). Hybrid Monte Carlo achieves acceptance with chosen so that the integrator energy error per trajectory is , requiring in degrees of freedom. Plain Langevin requires to keep the bias subleading. So HMC has favourable computational complexity per independent sample versus for plain Langevin, leading to a speedup factor of that grows with system size.

Proof sketch. Acceptance of HMC requires over a trajectory. The leapfrog integrator has local energy error per step (symplectic order-2 integrator), and steps in a trajectory give total energy error . For degrees of freedom the energy fluctuates as , so acceptance requires . Plain Langevin has order- stationary-distribution bias, with the constant scaling as , so the bias requires for accuracy on global observables (or when restricted to local averages). The arithmetic cost per step is in both schemes, giving total cost where is the autocorrelation time. The favourable HMC scaling in dominates the autocorrelation overhead in practice. Creutz 1988 [Damgaard-Huffel] and Kennedy et al. 1990s subsequent work confirm this scaling in lattice QCD simulations.

Connections Master

  • Fokker-Planck equation and equilibrium distribution 08.10.02. The Langevin SDE is the Itô-form prerequisite for which 08.10.02 supplies the equivalent Fokker-Planck description: for , with the spectral-gap argument identifying as the unique stationary solution. The present unit specialises that infinite-dimensional Fokker-Planck framework to a finite lattice and discretises the SDE for computational implementation.

  • Path integral formulation of statistical mechanics 08.07.01. The path-integral measure on lattice field configurations is what the Langevin updates sample. The present unit gives a concrete computer-implementable algorithm for drawing samples from the measure that 08.07.01 introduces as the formal foundation of Euclidean field theory.

  • Gaussian field theory and free boson 08.06.01. The Gaussian-free-field measure with covariance is the Langevin equilibrium for the quadratic action . Langevin samples produce the propagator numerically, with the worked-example on the 2D torus serving as the calibration benchmark.

  • Wick rotation 08.09.01. The fictitious stochastic time of the Langevin SDE is unrelated to the physical Wick-rotated time of the Euclidean field theory: is an extra dimension over which the dynamics relaxes to equilibrium, while is one of the lattice spatial dimensions in the Euclidean signature obtained from 08.09.01. The Wick-rotated action is what enters the Langevin drift; the stochastic time is the extra parameter Parisi-Wu introduce.

  • Wilson's lattice gauge theory 08.08.01. The Drummond-Horgan-Pendleton extension of Langevin updates to -valued link variables is the gauge-theoretic application of the present framework. The Wilson plaquette action supplies the drift; the gauge-invariant equilibrium measure is the Wilson measure of 08.08.01, sampled without conventional Faddeev-Popov gauge fixing.

  • Wightman axioms 08.10.07. Schwinger functions sampled by lattice Langevin updates reconstruct Wightman functions on the Minkowski side by Osterwalder-Schrader analytic continuation. The present stochastic-sampling framework is the constructive-QFT input that Glimm-Jaffe and successors use to verify the Wightman axioms in two and three spacetime dimensions; the Langevin equilibrium is precisely the measure whose Schwinger functions Osterwalder-Schrader reconstruction turns into a Wightman theory.

Historical & philosophical context Master

The Langevin equation originated in Paul Langevin's 1908 note Sur la théorie du mouvement brownien (C. R. Acad. Sci. Paris 146, 530) as a phenomenological model for Brownian motion, with deterministic friction and a stochastic force. The mathematical foundations were assembled in the following decades: Norbert Wiener's 1923 construction of the Wiener measure on continuous paths, Kiyosi Itô's 1944 stochastic integral [Itô-1944] and 1951 chain rule [Itô-1951] (Proc. Imp. Acad. Tokyo 20, 519 and Nagoya Math. J. 3, 55), and Andrey Kolmogorov's 1931 axiomatisation of continuous Markov processes via the forward (Fokker-Planck) and backward equations. The discretised Euler-Maruyama scheme is due to Gisiro Maruyama's 1955 paper Continuous Markov processes and stochastic equations (Rend. Circ. Mat. Palermo 4, 48) [Maruyama-1955], establishing weak convergence of the discrete scheme to the continuous SDE.

The application to quantum field theory is the work of Giorgio Parisi and Yong-Shi Wu, Perturbation theory without gauge fixing (Sci. Sin. 24, 483 (1981)) [Parisi-Wu]. Their proposal: the Euclidean path-integral measure on field configurations can be obtained as the equilibrium distribution of a Langevin process in a fictitious "fifth time" , with the configuration field evolving according to . The proposal had three immediate appeals: it gave a mathematically clean construction of the Euclidean measure as the long-time limit of a stochastic process (sidestepping the measure-theoretic complications of defining an "infinite-dimensional Gaussian times Boltzmann factor"); it bypassed conventional gauge fixing for gauge theories, because the equilibrium measure is automatically gauge-invariant; and it suggested a computational route based on time-stepping the SDE on a lattice. Daniel Zwanziger's 1981 paper Covariant quantization of gauge fields without Gribov ambiguity (Nucl. Phys. B 192, 259) [Zwanziger] introduced stochastic gauge fixing as a Lorentz-covariant alternative that does not require Faddeev-Popov ghosts and does not suffer the Gribov copies of conventional gauge fixing.

The numerical lattice implementation was carried out by I. T. Drummond, R. R. Horgan, and B. J. Pendleton in Improved gauge fixing and Langevin algorithms (Nucl. Phys. B 220, 119 (1983)) [Drummond-Horgan-Pendleton], who implemented discretised Langevin updates on and link variables and compared against conventional Metropolis Monte Carlo. A. D. Kennedy and K. C. Bowler's 1985 paper Updating fermions with Langevin algorithm (Nucl. Phys. B 245, 217) [Kennedy-Bowler] extended the framework to dynamical fermions via pseudofermion auxiliary fields, making lattice QCD with sea quarks computationally feasible for the first time. The 1987 hybrid Monte Carlo paper by Simon Duane, Anthony Kennedy, Brian Pendleton, and Duncan Roweth (Phys. Lett. B 195, 216) [DKPR] combined the Langevin-style stochastic exploration with a Metropolis accept/reject to eliminate the order- bias, and HMC has been the standard sampling algorithm for lattice QCD ever since. The canonical review of the stochastic-quantisation framework is Damgaard and Hüffel's 1987 Phys. Rep. 152, 227 article [Damgaard-Huffel].

Parisi's 1983 paper On complex probabilities (Phys. Lett. B 131, 393) [Parisi-1983] proposed complex Langevin as an extension to actions with imaginary parts — the sign-problem regime at finite chemical potential or in real-time path integrals. Convergence in the general case proved controversial; Gert Aarts, Erhard Seiler, and Ion-Olimpiu Stamatescu's 2010 paper The complex Langevin method: when can it be trusted? (Phys. Rev. D 81, 054508) [ASS-2010] identified the holomorphic-drift criterion and exhibited specific failure modes. The framework remains active in finite-density lattice QCD and real-time field theory research.

Bibliography Master

@article{ParisiWu1981,
  author  = {Parisi, Giorgio and Wu, Yong-Shi},
  title   = {Perturbation theory without gauge fixing},
  journal = {Scientia Sinica},
  volume  = {24},
  year    = {1981},
  pages   = {483--496}
}

@article{DamgaardHuffel1987,
  author  = {Damgaard, Poul H. and H{\"u}ffel, Helmuth},
  title   = {Stochastic quantization},
  journal = {Physics Reports},
  volume  = {152},
  number  = {5-6},
  year    = {1987},
  pages   = {227--398}
}

@article{Zwanziger1981,
  author  = {Zwanziger, Daniel},
  title   = {Covariant quantization of gauge fields without {G}ribov ambiguity},
  journal = {Nuclear Physics B},
  volume  = {192},
  year    = {1981},
  pages   = {259--269}
}

@article{DrummondHorganPendleton1983,
  author  = {Drummond, I. T. and Horgan, R. R. and Pendleton, B. J.},
  title   = {Improved gauge fixing and {L}angevin algorithms},
  journal = {Nuclear Physics B},
  volume  = {220},
  year    = {1983},
  pages   = {119--136}
}

@article{KennedyBowler1985,
  author  = {Kennedy, A. D. and Bowler, K. C.},
  title   = {Updating fermions with {L}angevin algorithm},
  journal = {Nuclear Physics B},
  volume  = {245},
  year    = {1985},
  pages   = {217--232}
}

@article{Parisi1983Complex,
  author  = {Parisi, Giorgio},
  title   = {On complex probabilities},
  journal = {Physics Letters B},
  volume  = {131},
  year    = {1983},
  pages   = {393--395}
}

@article{DKPR1987,
  author  = {Duane, Simon and Kennedy, A. D. and Pendleton, Brian J. and Roweth, Duncan},
  title   = {Hybrid {M}onte {C}arlo},
  journal = {Physics Letters B},
  volume  = {195},
  year    = {1987},
  pages   = {216--222}
}

@article{Ito1944,
  author  = {It{\^o}, Kiyosi},
  title   = {Stochastic integral},
  journal = {Proceedings of the Imperial Academy of Tokyo},
  volume  = {20},
  year    = {1944},
  pages   = {519--524}
}

@article{Ito1951,
  author  = {It{\^o}, Kiyosi},
  title   = {On a formula concerning stochastic differentials},
  journal = {Nagoya Mathematical Journal},
  volume  = {3},
  year    = {1951},
  pages   = {55--65}
}

@article{Maruyama1955,
  author  = {Maruyama, Gisiro},
  title   = {Continuous {M}arkov processes and stochastic equations},
  journal = {Rendiconti del Circolo Matematico di Palermo},
  volume  = {4},
  year    = {1955},
  pages   = {48--90}
}

@book{KloedenPlaten1992,
  author    = {Kloeden, Peter E. and Platen, Eckhard},
  title     = {Numerical Solution of Stochastic Differential Equations},
  publisher = {Springer-Verlag},
  series    = {Stochastic Modelling and Applied Probability},
  volume    = {23},
  year      = {1992}
}

@article{AartsSeilerStamatescu2010,
  author  = {Aarts, Gert and Seiler, Erhard and Stamatescu, Ion-Olimpiu},
  title   = {The complex {L}angevin method: when can it be trusted?},
  journal = {Physical Review D},
  volume  = {81},
  year    = {2010},
  pages   = {054508}
}

@article{Langevin1908,
  author  = {Langevin, Paul},
  title   = {Sur la th{\'e}orie du mouvement brownien},
  journal = {Comptes Rendus de l'Acad{\'e}mie des Sciences Paris},
  volume  = {146},
  year    = {1908},
  pages   = {530--533}
}

@article{Kolmogorov1931,
  author  = {Kolmogorov, Andrey N.},
  title   = {{\"U}ber die analytischen {M}ethoden in der {W}ahrscheinlichkeitsrechnung},
  journal = {Mathematische Annalen},
  volume  = {104},
  year    = {1931},
  pages   = {415--458}
}

@article{JordanKinderlehrerOtto1998,
  author  = {Jordan, Richard and Kinderlehrer, David and Otto, Felix},
  title   = {The variational formulation of the {F}okker-{P}lanck equation},
  journal = {SIAM Journal on Mathematical Analysis},
  volume  = {29},
  year    = {1998},
  pages   = {1--17}
}

@article{BakryEmery1985,
  author  = {Bakry, Dominique and {\'E}mery, Michel},
  title   = {Diffusions hypercontractives},
  journal = {S{\'e}minaire de Probabilit{\'e}s XIX, Lecture Notes in Math.},
  volume  = {1123},
  year    = {1985},
  pages   = {177--206}
}