Fokker-Planck equation and equilibrium distribution
Anchor (Master): Kolmogorov, *Math. Ann.* 104, 415 (1931) (forward/backward equations); Itô, *Proc. Imp. Acad. Tokyo* 20, 519 (1944) (stochastic integral); Parisi-Wu, *Sci. Sin.* 24, 483 (1981) (stochastic quantisation as Langevin route to Euclidean field theory); Bakry-Émery, *Séminaire de Probabilités XIX*, Springer LNM 1123, 177 (1985) (log-Sobolev inequalities via curvature-dimension); Markowich-Villani, *Mat. Contemp.* 19, 1 (2000); Villani, *Mem. AMS* 202 (2009), Ch. 1 (hypocoercivity)
Intuition Beginner
A speck of pollen suspended in water never sits still. Water molecules bump it from all sides, and the speck performs the jittery random walk Robert Brown noticed in 1827. If you also tilt the water container, gravity adds a downward drift on top of the jitter, and the speck still wanders but with a preference for moving downward. After a long time the speck settles into a typical height distribution: more likely near the bottom of the container, less likely high up, with the exact shape of the distribution set by how strong the tilt is compared with the thermal jitter.
The Fokker-Planck equation is the rule that says how the probability of finding the speck at a given height changes with time. It has two ingredients. The drift part pushes the probability in the direction the tilt prefers, like a wind blowing a cloud of dust. The diffusion part spreads the probability out, like a drop of ink spreading in water. The two ingredients fight each other: drift wants to pile probability up at the bottom of the well, diffusion wants to flatten everything out evenly. They balance at a unique equilibrium shape called the Gibbs distribution or the Boltzmann distribution.
The remarkable fact is that the equilibrium shape is the same no matter how you started. Drop a speck at the top of the container, at the bottom, anywhere — after waiting long enough you will see the same probability cloud. The Fokker-Planck equation also tells you how fast the speck forgets its starting position: the answer is set by the curvature of the well, and the deeper the well the faster the equilibration. This balance between drift and diffusion is the central principle of equilibrium statistical mechanics: thermal motion fluctuates, restoring forces relax, and the joint outcome is a probability cloud whose log is the energy divided by the temperature.
Visual Beginner
Picture a deep bowl, and inside the bowl a cloud of dust particles being shaken by random thermal kicks. The walls of the bowl push the particles back toward the centre. The thermal kicks spread them outward. The two forces balance, and the typical view of the cloud is a Gaussian blob centred at the bottom of the bowl, wider when the kicks are stronger or the bowl is shallower, narrower when the kicks are weaker or the bowl is steeper.
The shape on the right of the picture is the equilibrium distribution. Its formula is , where is the height of the bowl at position , is the temperature, and is the number that normalises the probability to one. The shape on the left is the initial cloud at time zero. The arrows show the equilibration: the cloud drifts and spreads and settles. The same picture in many dimensions replaces the bowl with a multidimensional energy landscape and the cloud with a many-variable joint probability cloud.
Worked example Beginner
Compute the equilibrium distribution of a particle in a one-dimensional quadratic well at temperature , with the well shape .
Step 1. Identify the ingredients. The well height is . The drift force pushing the particle back toward the centre is the negative slope of the well, . The temperature is . The equilibrium formula is , and we need to find the normalisation .
Step 2. Plug in the well shape. The unnormalised distribution is . This is a Gaussian centred at with variance . The wider the bowl is at the bottom (small ), the wider the equilibrium cloud; the steeper the bowl (large ), the narrower the cloud.
Step 3. Normalise. The integral of over all real is by the standard Gaussian formula. So , and the normalised equilibrium is .
Step 4. Plug in numbers. Take and . The variance is . The probability of finding the particle within one standard deviation of the centre is about , the same as any standard Gaussian. The typical kinetic-like fluctuation equals , the equipartition result: each quadratic mode in the energy carries thermal energy of order .
Step 5. Check the limits. At low temperature () the variance shrinks to zero and the equilibrium concentrates at the minimum of the well — the classical zero-temperature ground state. At high temperature () the variance grows, the cloud spreads out, and in the strict limit the equilibrium would become uniform — but only if the well is finite. For an unbounded quadratic well, the cloud just gets wider and wider with .
What this tells us: the Fokker-Planck equation's equilibrium is a Gaussian whose width records the competition between the steepness of the energy landscape and the thermal energy. This is the simplest informative case and the prototype of every equilibrium computation in statistical mechanics. The same recipe — write the energy, divide by temperature, exponentiate, normalise — gives the Boltzmann distribution for every classical equilibrium ensemble.
Check your understanding Beginner
Formal definition Intermediate+
Fix the dimension and the temperature . Let be a smooth potential with fast enough as that (coercivity). The Langevin stochastic differential equation for is $$ dx_t = - \nabla V(x_t), dt + \sqrt{2 T}, dW_t, $$ where is a standard -dimensional Brownian motion (a continuous Gaussian process with and ) and the stochastic integral is interpreted in the sense of Itô.
The transition density is the probability density of given ; the forward density is the unconditional density of given a chosen initial law . The Fokker-Planck equation (also called the Kolmogorov forward equation) is the parabolic partial differential equation $$ \partial_t p = \nabla \cdot (p, \nabla V) + T, \Delta p, \qquad p(x, 0) = p_0(x), $$ with the Laplacian and the divergence. The right-hand side is the formal adjoint of the infinitesimal generator $$ L = - \nabla V \cdot \nabla + T, \Delta = T, \Delta - \nabla V \cdot \nabla $$ acting on test functions: for any , $$ \mathbb{E}[\varphi(x_t) \mid x_0] = \varphi(x_0) + \mathbb{E}\Bigl[\int_0^t (L \varphi)(x_s), ds \mid x_0\Bigr]. $$ The generator acts on observables; its formal adjoint acts on densities. The Fokker-Planck equation reads .
A stationary solution is a time-independent solution of the Fokker-Planck equation: . The probability current is the vector field $$ J(x, t) := - p(x, t), \nabla V(x) - T, \nabla p(x, t), $$ so the Fokker-Planck equation is the conservation law . A stationary solution has ; a stationary solution with pointwise is called a detailed-balance or reversible stationary distribution.
The Gibbs / Boltzmann distribution is the candidate equilibrium $$ p_{eq}(x) := \frac{1}{Z}, e^{-V(x)/T}, \qquad Z := \int_{\mathbb{R}^n} e^{-V(x)/T}, dx. $$ A direct computation gives , hence the equilibrium current . The Gibbs distribution is a detailed-balance stationary solution.
The weighted space has inner product . The generator extends to a densely defined self-adjoint operator on with (non-positive spectrum), and the Fokker-Planck semigroup on densities is dual to the observable semigroup on .
Counterexamples to common slips
The drift in the SDE is (force = minus gradient of potential), not . The sign is fixed by the requirement that the well attracts the particle inward; a sign error puts the particle on the wrong side of an instability.
The noise coefficient encodes the Einstein relation between fluctuation and dissipation: the diffusion coefficient in the Fokker-Planck equation is exactly half the square of the SDE noise coefficient. Setting it to instead of in the SDE doubles the equilibrium temperature.
Coercivity of is essential. For on (free Brownian motion) the equation has no normalisable stationary solution: any initial density spreads out to zero pointwise, and the heat-equation Green function shows the variance growing linearly in time without bound.
The Fokker-Planck equation is the forward equation in time. The backward equation, for with , runs backward from a terminal condition . The two equations are formally adjoint and govern different quantities (density vs. observable expectation).
The Itô interpretation is built into the formula . The Stratonovich interpretation of the same SDE — — gives a different Fokker-Planck equation when the noise coefficient is state-dependent; here the noise is additive (constant coefficient), so Itô and Stratonovich coincide.
Key theorem with proof Intermediate+
Theorem (Fokker-Planck from the Langevin SDE; Kolmogorov 1931 [Kolmogorov 1931], Itô 1944 [Itô 1944]). Let satisfy with and suitable growth conditions ensuring a unique strong solution. For any initial density with and , the law of admits a density in satisfying $$ \partial_t p = \nabla \cdot (p, \nabla V) + T, \Delta p, \qquad p(\cdot, 0) = p_0. $$ The Gibbs density is a stationary solution.
Proof. Apply Itô's formula to a test function . For each smooth path of the SDE, $$ d\varphi(x_t) = \nabla \varphi(x_t) \cdot dx_t + \tfrac{1}{2} \sum_{i, j} \partial_i \partial_j \varphi(x_t), d[x_i, x_j]t, $$ where is the quadratic variation of the Itô SDE. Substituting , $$ d\varphi(x_t) = \bigl( - \nabla V(x_t) \cdot \nabla \varphi(x_t) + T, \Delta \varphi(x_t) \bigr) dt + \sqrt{2 T}, \nabla \varphi(x_t) \cdot dW_t. $$ The bracketed quantity is exactly with . Taking expectations against the law of kills the Itô-martingale increment and gives $$ \frac{d}{dt} \mathbb{E}[\varphi(x_t)] = \mathbb{E}[(L \varphi)(x_t)] = \int{\mathbb{R}^n} (L \varphi)(x), p(x, t), dx. $$
Write the left-hand side as . Equating, $$ \int \varphi(x) \partial_t p(x, t), dx = \int (L \varphi)(x), p(x, t), dx = \int \varphi(x), (L^* p)(x, t), dx, $$ where the second equality is integration by parts: for of compact support, $$ \int (T \Delta \varphi), p, dx = T \int \varphi, \Delta p, dx, \qquad - \int (\nabla V \cdot \nabla \varphi), p, dx = \int \varphi, \nabla \cdot (p \nabla V), dx. $$ The boundary terms vanish because . Hence for every , so in the weak sense. Parabolic regularity upgrades on and the equation holds classically.
The Gibbs density is stationary: compute $$ L^* p_{eq} = T \Delta p_{eq} + \nabla \cdot (p_{eq} \nabla V). $$ From obtain and . Substitute: $$ T \Delta p_{eq} = (1/T) p_{eq} |\nabla V|^2 - p_{eq} \Delta V, $$ $$ \nabla \cdot (p_{eq} \nabla V) = \nabla p_{eq} \cdot \nabla V + p_{eq} \Delta V = -(1/T) p_{eq} |\nabla V|^2 + p_{eq} \Delta V. $$ Add the two: . The Gibbs density satisfies the stationarity equation pointwise.
Bridge. The derivation of the Fokker-Planck equation from the Itô SDE builds toward every equilibrium statistical-mechanics computation, and the same drift-plus-diffusion pattern appears again in 08.07.01 in the path-integral / functional-measure language. The foundational reason the Gibbs density is the unique reversible stationary solution is exactly the detailed-balance condition: the probability current vanishes pointwise on , and a coercive potential forces uniqueness via the spectral-gap argument of 08.06.01. This is exactly the equilibrium condition the Boltzmann distribution of 08.01.03 specifies — the equilibrium density's logarithm is the negative energy divided by the temperature. The central insight is that the Langevin dynamics samples the Gibbs measure: in the long-time limit, time averages along a single trajectory equal ensemble averages against the equilibrium density, and this identifies dynamical sampling with thermodynamic ensembles. The bridge is the recognition that Parisi-Wu stochastic quantisation generalises the Fokker-Planck route to field theory: replacing by a field configuration and the potential by the Euclidean action , the Langevin equation in the fictitious time has the Euclidean Gibbs measure as its Fokker-Planck equilibrium. Putting these together, one Fokker-Planck framework produces every equilibrium law of equilibrium statistical mechanics, every Euclidean field theory measure of stochastic quantisation, and every quantitative rate of convergence to equilibrium controlled by the spectral gap or log-Sobolev constant of the generator.
Exercises Intermediate+
Advanced results Master
Theorem (spectral gap of the Ornstein-Uhlenbeck generator; Risken §5.4 [Risken]). Let with symmetric positive-definite on with eigenvalues . The generator is self-adjoint on with . Its spectrum is $$ \sigma(L) = \Bigl{ -\sum_{i=1}^{n} k_i, \omega_i^2, : (k_1, \ldots, k_n) \in \mathbb{N}0^n \Bigr}, $$ *with the spectral gap (smallest non-zero eigenvalue in absolute value) equal to . The eigenfunctions are multivariate Hermite polynomials in the variables , and the Fokker-Planck semigroup decays as $|p(\cdot, t) - p{eq}|{L^2(p{eq}^{-1})} \leq e^{-\omega_1^2 t} |p_0 - p_{eq}|{L^2(p{eq}^{-1})}$.*
The Ornstein-Uhlenbeck spectral gap is the smallest eigenvalue of the Hessian of , and the Hermite polynomial diagonalisation makes the spectral resolution of completely explicit. For non-quadratic with , the gap is bounded below by (Bakry-Émery); for non-convex the gap may be much smaller than — the equilibrium can be multimodal, and the rate of inter-mode tunnelling is exponentially small in (Eyring-Kramers / Freidlin-Wentzell large deviations).
Theorem (H-theorem; Markowich-Villani 2000 [Markowich-Villani]). Let be the Gibbs density on and let solve the Fokker-Planck equation with . The relative entropy is finite for and satisfies $$ \frac{d}{dt} H(p(\cdot, t) | p_{eq}) = -T \int p, |\nabla \log(p/p_{eq})|^2, dx = -T \cdot I(p | p_{eq}), $$ where is the relative Fisher information. The entropy is non-increasing in time, and identically zero only at .
This is the dissipative structure of the Fokker-Planck flow: the relative entropy is a Lyapunov functional decreasing along the dynamics, with dissipation rate equal to . The interpretation is statistical-mechanical: measures the "distance from equilibrium" in an information-theoretic sense (Kullback-Leibler divergence), and the second law of thermodynamics manifests as the monotonic decrease of toward zero. The dissipation is the entropy production rate, and the bridge to the Boltzmann H-theorem is that the same identity holds in any drift-diffusion process with a reversible stationary distribution.
Theorem (logarithmic Sobolev inequality; Bakry-Émery 1985 [Bakry-Emery 1985]). Suppose satisfies for all and some (strict log-concavity). The Gibbs measure satisfies the logarithmic Sobolev inequality (LSI) $$ H(p | p_{eq}) \leq \frac{T}{2 \rho}, I(p | p_{eq}) $$ for every probability density with finite relative entropy. Combined with the H-theorem, , exponential convergence at rate .
The Bakry-Émery proof uses the -calculus: define and . A bilinear-form computation gives . Bakry-Émery's interpolation along the semigroup integrates this pointwise inequality to the LSI. The constant is sharp on the Ornstein-Uhlenbeck process (Gross 1975's original Gaussian LSI). The criterion generalises to weighted Riemannian manifolds via , the curvature-dimension condition CD() that opened the synthetic theory of Ricci curvature (Lott-Sturm-Villani 2009 [Villani 2009]).
Theorem (path-integral / Onsager-Machlup; Damgaard-Hüffel 1987 [Damgaard-Huffel 1987]). Let satisfy the Langevin SDE on with initial law . The probability functional density on continuous paths , defined by Girsanov-Cameron-Martin against the Wiener measure, has the formal weight $$ \mathcal{P}[x] = \mathcal{N}, \exp\Bigl( -\frac{1}{4 T} \int_0^t \bigl( \dot x_s + \nabla V(x_s) \bigr)^2 ds - \frac{1}{2} \int_0^t \Delta V(x_s), ds \Bigr), $$ with a path-space normalisation. The first term is the Onsager-Machlup action, a quadratic dissipative action; the second term is the Itô-Stratonovich correction. The Euclidean path integral of stochastic quantisation is the long-time limit of this expression, with the equilibrium measure on configurations recovered by integrating out the path histories.
The path-integral expression identifies the Fokker-Planck transition density with a path integral over continuous trajectories weighted by the Onsager-Machlup action . This is the dissipative analogue of the Lagrangian path integral of quantum mechanics, with Wick rotation turning the quadratic-in- kinetic term into a positive Euclidean weight. Parisi-Wu stochastic quantisation [Parisi-Wu 1981] takes the path-integral form as a derivation route to Euclidean field theory: a Langevin SDE in a fictitious fifth time , with the Euclidean action as the potential, equilibrates to the Euclidean Gibbs measure as , and the Onsager-Machlup action becomes the dissipative super-action of the stochastic-quantisation construction.
Theorem (hypocoercivity; Villani 2009 [Villani 2009]). Let on . Suppose is coercive () and satisfies a Poincaré inequality with constant : for . Then the Fokker-Planck semigroup decays exponentially in at rate , and in entropy at rate under the stronger LSI. For degenerate diffusions (kinetic Fokker-Planck with momentum-only diffusion), exponential decay still holds under quantitatively explicit conditions, by Villani's hypocoercivity machinery using twisted-norm Lyapunov functionals.
The hypocoercivity result is the modern quantitative theory: for the elliptic Fokker-Planck of the present unit, Poincaré-inequality / LSI / Bakry-Émery give exponential rates; for the kinetic Fokker-Planck where only the momentum variable diffuses (Boltzmann-type collision operators, Kramers' equation), the diffusion is degenerate in position and the Poincaré argument fails directly. Villani's twisted-norm functionals restore exponential convergence, identifying the rate with a combination of the position-space gap and the momentum-space dissipation. The same hypocoercivity machinery applies to dissipative QFT models with field-only or momentum-only noise.
Theorem (uniqueness of the stationary distribution under coercivity). Let be coercive ( as with ). The Gibbs density is the unique probability density in satisfying $L^ p_{eq} = 0$.*
The proof uses the H-theorem: every stationary density has , hence , hence is constant on connected components of the support of . Coercivity forces this constant to be globally one (only one connected support consistent with normalisation), so . The uniqueness is what makes the Langevin SDE an ergodic sampler of the Gibbs measure: every initial condition relaxes to the same equilibrium, justifying the Monte-Carlo / Markov-chain methods of computational statistical mechanics.
Theorem (stochastic-quantisation of Euclidean field theory; Parisi-Wu 1981 [Parisi-Wu 1981]). Let be a Euclidean action on field configurations such that is a well-defined probability measure. The functional Langevin equation $$ \frac{\partial \phi(x, \tau)}{\partial \tau} = - \frac{\delta S[\phi]}{\delta \phi(x, \tau)} + \sqrt{2}, \eta(x, \tau) $$ with a Gaussian white noise in the auxiliary time has, as , the Euclidean Gibbs measure as its stationary distribution under the Fokker-Planck flow. Time-averages of observables along Langevin trajectories converge to Euclidean path-integral expectations.
This is the field-theoretic generalisation of the finite-dimensional Fokker-Planck framework, and the central content of stochastic quantisation. The procedure provides an alternative to gauge fixing in non-abelian gauge theories — the Langevin flow lives on the orbit space of gauge-equivalent configurations, and the equilibrium measure is gauge-invariant. The associated Onsager-Machlup action gives the path-integral weight of stochastic processes in field space, which the modern programme of stochastic regularisation uses as a UV-cutoff scheme (Hüffel-Rumpf, Floratos-Iliopoulos). The connection to the Fock-space construction of 08.10.01 is the Gaussian-Fock correspondence: the Parisi-Wu equilibrium of the free Euclidean action is the Gaussian free-field measure of 08.06.01, which Wick-rotates to the Wightman vacuum expectations of the free Klein-Gordon field on .
Synthesis. The Fokker-Planck equation is the foundational reason every classical equilibrium statistical-mechanics distribution can be realised as the long-time limit of a continuous-time stochastic process. The central insight is that the Langevin SDE has the Gibbs distribution as its unique reversible stationary density under the Fokker-Planck flow , and this is exactly the equilibrium condition of 08.01.03 expressed as a differential equation rather than as a microcanonical / canonical / grand-canonical formula. Putting these together, the drift-diffusion balance, the H-theorem decay of relative entropy, the spectral-gap rate of convergence, and the Bakry-Émery log-Sobolev refinement form one dissipative framework that handles every coercive potential on . The bridge between the SDE and the PDE is the Itô formula: applying it to a smooth observable produces the generator acting on the observable, integration by parts converts into its adjoint acting on the density, and the resulting parabolic equation is the Fokker-Planck equation. This is exactly the bridge that appears again in 08.07.01 (path integral formulation of statistical mechanics) under Wick rotation, where the dissipative Langevin process becomes the imaginary-time path integral of 08.09.01 (Wick rotation).
The Fokker-Planck framework identifies several constructions that look distinct at first inspection. Equilibrium statistical mechanics (the Boltzmann distribution of 08.01.03) is the stationary law of a dissipative SDE. The path-integral formulation (the Onsager-Machlup weight on trajectories) is the path-integral expression of the same stationary law on configuration space. The Parisi-Wu stochastic-quantisation framework generalises the construction to Euclidean field theory: a Langevin equation in fictitious time , with the Euclidean action playing the role of the potential, has as its Fokker-Planck equilibrium, and bypasses gauge fixing because the equilibrium measure is automatically gauge-invariant. The bridge between these is that all are different presentations of the same diffusion-with-drift dynamics: the configuration-space Fokker-Planck flow, the path-space Onsager-Machlup action, and the field-theory functional Fokker-Planck flow are one mathematical object viewed from three angles. The quantitative rates — spectral gap, log-Sobolev constant, Poincaré constant — control how fast a Markov-chain Monte Carlo of the Langevin SDE equilibrates, and the Bakry-Émery -calculus gives sharp rates whenever the potential is strongly convex. The recursion stabilises after the equilibrium is reached: at equilibrium, time-averages along a Langevin trajectory equal ensemble-averages against the Gibbs measure, and this identifies dynamical sampling with thermodynamic ensembles in the precise sense made operational by ergodic theorems for the Markov semigroup.
Full proof set Master
Proposition (Itô formula derivation of the Fokker-Planck equation). Given the Langevin SDE on with and sufficient growth conditions, the density of satisfies in the weak sense, and classically on by parabolic regularity.
Proof. For , Itô's formula applied to reads $$ d\varphi(x_t) = (\nabla \varphi(x_t)) \cdot dx_t + \tfrac{1}{2} \sum_{i,j} (\partial_i \partial_j \varphi(x_t)), d[x_i, x_j]_t. $$ The quadratic variation of the additive-noise Itô SDE is (the deterministic drift contributes zero to the bracket). Substituting and grouping, $$ d\varphi(x_t) = \bigl[ -\nabla V(x_t) \cdot \nabla \varphi(x_t) + T \Delta \varphi(x_t) \bigr], dt + \sqrt{2 T} \nabla \varphi(x_t) \cdot dW_t. $$ Take expectation with respect to the law of . The Itô integral is a martingale with mean zero (under suitable integrability hypotheses, which guarantees through the boundedness of ). Hence $$ \frac{d}{dt} \mathbb{E}[\varphi(x_t)] = \mathbb{E}[L \varphi (x_t)], \qquad L = T \Delta - \nabla V \cdot \nabla. $$
Express both sides as integrals against the density: . Integration by parts gives, with of compact support, $$ \int \varphi \partial_t p, dx = \int \varphi (L^* p), dx, \qquad L^* p = T \Delta p + \nabla \cdot (p \nabla V), $$ where is the formal adjoint computed via and . The weak equation holds against every , and parabolic regularity (interior Schauder estimates for the parabolic operator with coefficients) gives on and the equation classically.
Proposition (Gibbs density is reversible stationary). The Gibbs density satisfies $L^ p_{eq} = 0J_{eq} \equiv 0$ pointwise.*
Proof. The probability current associated with has, on the Gibbs density, $$ J_{eq}(x) = - p_{eq}(x) \nabla V(x) - T \nabla p_{eq}(x). $$ Differentiate : . Substitute: pointwise. So pointwise, hence as an identity, and (with the parabolic-conservation sign convention , this gives ). The detailed-balance condition is the stronger pointwise vanishing of the current; ordinary stationarity is the divergence-free condition.
Proposition (self-adjointness of on and Dirichlet form). On the weighted space, the generator is densely defined, symmetric, and non-positive with Dirichlet form $$ \mathcal{E}(f, g) := -\langle f, L g \rangle_{eq} = T \int (\nabla f \cdot \nabla g), p_{eq}, dx, \qquad f, g \in H^1(p_{eq}). $$ extends to a self-adjoint operator on , the Friedrichs extension of the Dirichlet form, with and spectrum .
Proof. For compute . The drift term integrates by parts using , hence : $$ \int f (-\nabla V \cdot \nabla g) p_{eq}, dx = T \int f (\nabla \log p_{eq}) \cdot \nabla g, p_{eq}, dx = T \int (\nabla g) \cdot (f \nabla p_{eq}), dx. $$ Integration by parts on the diffusion term yields . Adding the two contributions, the terms cancel, leaving $$ \langle f, L g \rangle_{eq} = -T \int (\nabla f) \cdot (\nabla g) p_{eq}, dx = -\mathcal{E}(f, g). $$ Symmetry in gives on , and the Dirichlet form is closable on , so extends to a self-adjoint operator. Setting , , hence . The spectral theorem packages as with a projection-valued measure on .
Proposition (Ornstein-Uhlenbeck explicit solution). For on , the Fokker-Planck equation $$ \partial_t p = T \partial_x^2 p + \partial_x(\omega^2 x p) $$ has fundamental solution (transition density) $$ p(x, t \mid x_0, 0) = \Bigl( \frac{\omega^2}{2 \pi T (1 - e^{-2\omega^2 t})} \Bigr)^{1/2} \exp\Bigl( -\frac{\omega^2 (x - x_0 e^{-\omega^2 t})^2}{2 T (1 - e^{-2\omega^2 t})} \Bigr), $$ the Gaussian centred at the deterministic drift trajectory with variance . As , the variance tends to and the centre tends to , recovering the Gibbs equilibrium .
Proof. The SDE has explicit solution (variation of parameters): . The Itô integral is Gaussian with mean zero and variance . So is Gaussian with mean and variance . Write the Gaussian density to obtain the stated transition formula. Verification that this density satisfies the Fokker-Planck equation is a direct (somewhat tedious) computation in derivatives; alternatively, the explicit Gaussian solution is the heat-kernel of the operator via Mehler's formula. As the variance saturates at and the mean dies to zero, recovering .
Proposition (H-theorem dissipation identity). For solving the Fokker-Planck equation with sufficient regularity and decay, the relative entropy satisfies , with equality iff .
Proof. Set , so and . Compute $$ \frac{dH}{dt} = \int p_{eq}, (\partial_t h) (1 + \log h), dx = \int (\partial_t p) (1 + \log h), dx. $$ Using with (the negative of the probability current; both sign conventions appear in the literature), $$ \frac{dH}{dt} = \int (\nabla \cdot J^) (1 + \log h), dx = -\int J^ \cdot \nabla \log h, dx, $$ where the constant- term gives a divergence-of-current integral that vanishes by mass conservation and the integration by parts has no boundary contribution under decay hypotheses. Substitute : $$ \frac{dH}{dt} = -\int T p_{eq} (\nabla h) \cdot (\nabla \log h), dx = -T \int p_{eq} h |\nabla \log h|^2, dx = -T \int p |\nabla \log h|^2, dx, $$ using . The right-hand side is , the Fisher information of relative to . The integrand is non-negative, and zero iff almost everywhere on , i.e. is constant on its support. Normalisation forces this constant to be , hence .
Proposition (Bakry-Émery identity for the Fokker-Planck generator). For on smooth , $$ \Gamma_2(f, f) = T^2 |\mathrm{Hess}(f)|^2 + T (\nabla f) \cdot \mathrm{Hess}(V) (\nabla f), $$ where and . The inequality implies , the CD() curvature-dimension condition.
Proof. Compute . Expanding : $$ \frac{1}{T} L|\nabla f|^2 = \Delta |\nabla f|^2 - \frac{1}{T}(\nabla V) \cdot \nabla |\nabla f|^2. $$ The Bochner formula gives in Euclidean space. Substitute and use : $$ \Gamma(f, L f) = T \nabla f \cdot \nabla(T \Delta f - \nabla V \cdot \nabla f) = T^2 \nabla f \cdot \nabla \Delta f - T \nabla f \cdot \nabla(\nabla V \cdot \nabla f). $$ Putting these together (after a Bochner-style bookkeeping computation that organises the terms), $$ \Gamma_2(f, f) = T^2 |\mathrm{Hess}(f)|^2 + T \nabla f \cdot \mathrm{Hess}(V) \nabla f. $$ The first term is non-negative; the second is bounded below by when . Hence under the curvature hypothesis. Integrating the pointwise inequality along the semigroup (Bakry-Émery's interpolation argument) yields the LSI .
Proposition (uniqueness of the stationary distribution under coercivity). Let be coercive in the sense . The only probability density satisfying $L^ \tilde p = 0\tilde p = p_{eq} = Z^{-1} e^{-V/T}$.*
Proof. Suppose , , . By the H-theorem applied to the constant function (a stationary solution): , hence . The integrand is non-negative, so almost everywhere on , i.e. is constant on connected components of its support. Connectedness of and the strict positivity of make this constant a single global value, normalised to by . Hence .
Connections Master
Boltzmann distribution
08.01.03. The Gibbs / Boltzmann distribution is the unique reversible stationary density of the Fokker-Planck equation under coercive . The present unit derives the Boltzmann distribution from a dynamical principle — the long-time limit of the Langevin SDE — rather than from a microcanonical / canonical / grand-canonical bookkeeping. The two derivations agree at the level of the density and identify dynamical sampling with thermodynamic ensembles.Path integral formulation of statistical mechanics
08.07.01. The path-integral / Onsager-Machlup form of the Langevin SDE expresses the Fokker-Planck transition density as a path integral over continuous trajectories weighted by the dissipative action . The connection to the Euclidean / imaginary-time path integral of quantum mechanics is Wick rotation: the kinetic action becomes the Brownian-motion Gaussian weight after temperature absorption, and the potential becomes the drift. Both frameworks compute equilibrium correlations of the same Gibbs measure.Wick rotation
08.09.01. The relation between the Fokker-Planck framework and quantum mechanics is Wick rotation: the imaginary-time Schrödinger equation for a quantum-mechanical Hamiltonian has the same form as a Fokker-Planck-like dissipative flow, and the operator is conjugate to a Schrödinger Hamiltonian via , where is the supersymmetric Witten partner potential. This is the Schrödinger-equation route to Fokker-Planck spectral theory.Gaussian field theory and free boson
08.06.01. The Parisi-Wu stochastic-quantisation generalisation of the Fokker-Planck framework to field theory has the free-boson Gaussian measure with covariance as its equilibrium under the functional Langevin equation . The Fokker-Planck flow on field configurations equilibrates to the Gaussian free-field measure, which is the Euclidean Wick rotation of the Fock-space construction of08.10.01.Bosonic Fock space and second quantisation
08.10.01. The Gaussian-Fock correspondence identifies the operator-side vacuum expectations on with the measure-side Gaussian moments of via Isserlis-Wick. Stochastic quantisation generates as a Fokker-Planck equilibrium, providing a third equivalent route to the same free-field correlation functions: canonical Fock-space quantisation (operator algebra), Euclidean path-integral (measure theory), and stochastic quantisation (SDE). The three frameworks meet at the Gibbs equilibrium of a Langevin equation in a fictitious fifth time.Free energy
08.01.04. The negative log of the partition function — the free energy — is the constant that normalises the Gibbs density and the leading-large-time term in the path-integral expression of the transition density. The Fokker-Planck framework recovers as the limit of corrections along the dissipative flow.
Historical & philosophical context Master
Adriaan Fokker derived the diffusion equation that bears his name in his 1914 paper Die mittlere Energie rotierender elektrischer Dipole im Strahlungsfeld (Ann. Phys. (Leipzig) 348, 810) [Fokker 1914], working on the angular distribution of radiating electric dipoles in thermal radiation. The equation appeared as a drift-diffusion equation for the angular probability density, with a drift term proportional to the torque and a diffusion term proportional to the radiation field. Max Planck, in his 1917 paper Über einen Satz der statistischen Dynamik und seine Erweiterung in der Quantentheorie (Sitzungsber. Preuss. Akad. Wiss. Berlin 24, 324) [Planck 1917], extended Fokker's derivation to a general continuous Markov process by truncating the Kramers-Moyal expansion of the master equation at second order. The combined equation has carried the joint Fokker-Planck name since.
The mathematical foundation was laid by Andrey Kolmogorov in his 1931 paper Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung (Math. Ann. 104, 415) [Kolmogorov 1931]. Kolmogorov derived the Fokker-Planck equation (which he called the forward equation) and its dual the backward equation from the Chapman-Kolmogorov consistency requirement on the transition probabilities of a continuous-time Markov process, identifying the conditions on the moments of the increments that make the truncation at second order exact. Kolmogorov's framework treats the Fokker-Planck equation as a derived consequence of probability axioms; the heuristic Fokker-Planck construction of 1914-1917 became a theorem about Markov diffusion semigroups. Kiyosi Itô's 1944 paper Stochastic integral (Proc. Imp. Acad. Tokyo 20, 519) [Itô 1944] introduced the stochastic integral against Brownian motion and the Itô formula, providing the modern derivation route: write an SDE for the trajectory , apply Itô's formula to an observable, take expectations, and the Fokker-Planck equation appears as the equation for the density via integration by parts.
The stochastic-mechanics applications were classical by the 1940s. Subrahmanyan Chandrasekhar's monumental 1943 Stochastic problems in physics and astronomy (Rev. Mod. Phys. 15, 1) [Chandrasekhar 1943] surveyed Brownian motion, the Langevin equation, the Ornstein-Uhlenbeck process, and the Fokker-Planck equation as the unified language of nonequilibrium statistical mechanics. Hans Risken's The Fokker-Planck Equation (1st ed. 1984, 2nd ed. 1989) [Risken] and Crispin Gardiner's Handbook of Stochastic Methods (1st ed. 1983) [Gardiner] became the standard physicists' references. The quantitative theory of convergence to equilibrium developed in parallel: Leonard Gross's 1975 paper introduced the logarithmic Sobolev inequality for the Ornstein-Uhlenbeck process; Dominique Bakry and Michel Émery's 1985 Diffusions hypercontractives (Séminaire de Probabilités XIX) [Bakry-Emery 1985] gave the curvature-dimension criterion CD() under which the LSI holds with sharp constant, and Cédric Villani's 2009 Hypocoercivity (Memoirs AMS 202) [Villani 2009] extended the entropy-method machinery to degenerate Fokker-Planck operators of kinetic-theory type.
The field-theoretic chapter opened with Giorgio Parisi and Yong-Shi Wu's 1981 paper Perturbation theory without gauge fixing (Sci. Sin. 24, 483) [Parisi-Wu 1981]. Parisi-Wu's construction promotes the configuration variable to a field configuration and the potential to the Euclidean action , replacing the finite-dimensional Langevin SDE with a functional Langevin equation in a fictitious fifth time. The Fokker-Planck flow on field configurations equilibrates to the Euclidean Gibbs measure , the path-integral measure of 08.07.01. The construction bypasses the Faddeev-Popov gauge-fixing procedure in non-abelian gauge theories — the Langevin flow lives on the orbit space — and provides a stochastic regularisation scheme alongside lattice and dimensional regularisation. Damgaard-Hüffel's 1987 Stochastic quantization (Phys. Rep. 152, 227) [Damgaard-Huffel 1987] reviewed the programme. The Fokker-Planck equation has thus been, since 1914, a constant presence at the interface between probability, statistical mechanics, and quantum field theory.
Bibliography Master
@article{Fokker1914,
author = {Fokker, A. D.},
title = {Die mittlere {E}nergie rotierender elektrischer {D}ipole im {S}trahlungsfeld},
journal = {Annalen der Physik (Leipzig)},
volume = {348},
year = {1914},
pages = {810--820}
}
@article{Planck1917,
author = {Planck, M.},
title = {{\"U}ber einen {S}atz der statistischen {D}ynamik und seine {E}rweiterung in der {Q}uantentheorie},
journal = {Sitzungsberichte der Preussischen {A}kademie der Wissenschaften, Berlin},
volume = {24},
year = {1917},
pages = {324--341}
}
@article{Kolmogorov1931,
author = {Kolmogorov, A. N.},
title = {{\"U}ber die analytischen {M}ethoden in der {W}ahrscheinlichkeitsrechnung},
journal = {Mathematische Annalen},
volume = {104},
year = {1931},
pages = {415--458}
}
@article{Ito1944,
author = {It{\^o}, Kiyosi},
title = {Stochastic integral},
journal = {Proceedings of the Imperial {A}cademy of {T}okyo},
volume = {20},
year = {1944},
pages = {519--524}
}
@article{ParisiWu1981Stoch,
author = {Parisi, Giorgio and Wu, Yong-Shi},
title = {Perturbation theory without gauge fixing},
journal = {Scientia Sinica},
volume = {24},
year = {1981},
pages = {483--496}
}
@incollection{BakryEmery1985,
author = {Bakry, Dominique and {\'E}mery, Michel},
title = {Diffusions hypercontractives},
booktitle = {S{\'e}minaire de Probabilit{\'e}s XIX 1983/84},
series = {Lecture Notes in Mathematics},
volume = {1123},
publisher = {Springer},
year = {1985},
pages = {177--206}
}
@article{Chandrasekhar1943,
author = {Chandrasekhar, Subrahmanyan},
title = {Stochastic problems in physics and astronomy},
journal = {Reviews of Modern Physics},
volume = {15},
year = {1943},
pages = {1--89}
}
@book{Risken1989,
author = {Risken, Hans},
title = {The {F}okker-{P}lanck Equation: Methods of Solution and Applications},
edition = {2},
publisher = {Springer},
series = {Springer Series in Synergetics},
volume = {18},
year = {1989}
}
@book{Gardiner2004,
author = {Gardiner, Crispin W.},
title = {Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences},
edition = {3},
publisher = {Springer},
year = {2004}
}
@book{vanKampen2007,
author = {van Kampen, N. G.},
title = {Stochastic Processes in Physics and Chemistry},
edition = {3},
publisher = {North-Holland},
year = {2007}
}
@book{Pavliotis2014,
author = {Pavliotis, Grigorios A.},
title = {Stochastic Processes and Applications: Diffusion Processes, the {F}okker-{P}lanck and {L}angevin Equations},
publisher = {Springer},
series = {Texts in Applied Mathematics},
volume = {60},
year = {2014}
}
@book{Villani2009Hypocoercivity,
author = {Villani, C{\'e}dric},
title = {Hypocoercivity},
publisher = {American Mathematical Society},
series = {Memoirs of the AMS},
volume = {202},
year = {2009}
}
@article{MarkowichVillani2000,
author = {Markowich, Peter A. and Villani, C{\'e}dric},
title = {On the trend to equilibrium for the {F}okker-{P}lanck equation: an interplay between physics and functional analysis},
journal = {Matem{\'a}tica Contempor{\^a}nea},
volume = {19},
year = {2000},
pages = {1--29}
}
@article{DamgaardHuffel1987,
author = {Damgaard, Poul H. and H{\"u}ffel, Helmuth},
title = {Stochastic quantization},
journal = {Physics Reports},
volume = {152},
year = {1987},
pages = {227--398}
}
@article{Gross1975LSI,
author = {Gross, Leonard},
title = {Logarithmic {S}obolev inequalities},
journal = {American Journal of Mathematics},
volume = {97},
year = {1975},
pages = {1061--1083}
}