08.10.02 · stat-mech / qft

Fokker-Planck equation and equilibrium distribution

shipped3 tiersLean: none

Anchor (Master): Kolmogorov, *Math. Ann.* 104, 415 (1931) (forward/backward equations); Itô, *Proc. Imp. Acad. Tokyo* 20, 519 (1944) (stochastic integral); Parisi-Wu, *Sci. Sin.* 24, 483 (1981) (stochastic quantisation as Langevin route to Euclidean field theory); Bakry-Émery, *Séminaire de Probabilités XIX*, Springer LNM 1123, 177 (1985) (log-Sobolev inequalities via curvature-dimension); Markowich-Villani, *Mat. Contemp.* 19, 1 (2000); Villani, *Mem. AMS* 202 (2009), Ch. 1 (hypocoercivity)

Intuition Beginner

A speck of pollen suspended in water never sits still. Water molecules bump it from all sides, and the speck performs the jittery random walk Robert Brown noticed in 1827. If you also tilt the water container, gravity adds a downward drift on top of the jitter, and the speck still wanders but with a preference for moving downward. After a long time the speck settles into a typical height distribution: more likely near the bottom of the container, less likely high up, with the exact shape of the distribution set by how strong the tilt is compared with the thermal jitter.

The Fokker-Planck equation is the rule that says how the probability of finding the speck at a given height changes with time. It has two ingredients. The drift part pushes the probability in the direction the tilt prefers, like a wind blowing a cloud of dust. The diffusion part spreads the probability out, like a drop of ink spreading in water. The two ingredients fight each other: drift wants to pile probability up at the bottom of the well, diffusion wants to flatten everything out evenly. They balance at a unique equilibrium shape called the Gibbs distribution or the Boltzmann distribution.

The remarkable fact is that the equilibrium shape is the same no matter how you started. Drop a speck at the top of the container, at the bottom, anywhere — after waiting long enough you will see the same probability cloud. The Fokker-Planck equation also tells you how fast the speck forgets its starting position: the answer is set by the curvature of the well, and the deeper the well the faster the equilibration. This balance between drift and diffusion is the central principle of equilibrium statistical mechanics: thermal motion fluctuates, restoring forces relax, and the joint outcome is a probability cloud whose log is the energy divided by the temperature.

Visual Beginner

Picture a deep bowl, and inside the bowl a cloud of dust particles being shaken by random thermal kicks. The walls of the bowl push the particles back toward the centre. The thermal kicks spread them outward. The two forces balance, and the typical view of the cloud is a Gaussian blob centred at the bottom of the bowl, wider when the kicks are stronger or the bowl is shallower, narrower when the kicks are weaker or the bowl is steeper.

The shape on the right of the picture is the equilibrium distribution. Its formula is $p_{e q} (x) = e^{- V (x) / T} / Z$ , where $V (x)$ is the height of the bowl at position $x$ , $T$ is the temperature, and $Z$ is the number that normalises the probability to one. The shape on the left is the initial cloud at time zero. The arrows show the equilibration: the cloud drifts and spreads and settles. The same picture in many dimensions replaces the bowl with a multidimensional energy landscape and the cloud with a many-variable joint probability cloud.

Worked example Beginner

Compute the equilibrium distribution of a particle in a one-dimensional quadratic well at temperature $T$ , with the well shape $V (x) = (1/2) ω^{2} x^{2}$ .

Step 1. Identify the ingredients. The well height is $V (x) = (1/2) ω^{2} x^{2}$ . The drift force pushing the particle back toward the centre is the negative slope of the well, $- ω^{2} x$ . The temperature is $T$ . The equilibrium formula is $p_{e q} (x) = e^{- V (x) / T} / Z$ , and we need to find the normalisation $Z$ .

Step 2. Plug in the well shape. The unnormalised distribution is $e^{- ω^{2} x^{2} / (2 T)}$ . This is a Gaussian centred at $x = 0$ with variance $σ^{2} = T / ω^{2}$ . The wider the bowl is at the bottom (small $ω$ ), the wider the equilibrium cloud; the steeper the bowl (large $ω$ ), the narrower the cloud.

Step 3. Normalise. The integral of $e^{- ω^{2} x^{2} / (2 T)}$ over all real $x$ is $2 π T / ω^{2}$ by the standard Gaussian formula. So $Z = 2 π T / ω^{2}$ , and the normalised equilibrium is $p_{e q} (x) = (ω^{2} / (2 π T))^{1/2} e^{- ω^{2} x^{2} / (2 T)}$ .

Step 4. Plug in numbers. Take $ω = 1$ and $T = 1$ . The variance is $σ^{2} = 1$ . The probability of finding the particle within one standard deviation of the centre is about $68%$ , the same as any standard Gaussian. The typical kinetic-like fluctuation $⟨ x^{2} ⟩$ equals $T / ω^{2}$ , the equipartition result: each quadratic mode in the energy carries thermal energy of order $T$ .

Step 5. Check the limits. At low temperature ( $T \to 0$ ) the variance shrinks to zero and the equilibrium concentrates at the minimum of the well — the classical zero-temperature ground state. At high temperature ( $T \to \infty$ ) the variance grows, the cloud spreads out, and in the strict limit the equilibrium would become uniform — but only if the well is finite. For an unbounded quadratic well, the cloud just gets wider and wider with $T$ .

What this tells us: the Fokker-Planck equation's equilibrium is a Gaussian whose width records the competition between the steepness of the energy landscape and the thermal energy. This is the simplest informative case and the prototype of every equilibrium computation in statistical mechanics. The same recipe — write the energy, divide by temperature, exponentiate, normalise — gives the Boltzmann distribution for every classical equilibrium ensemble.

Check your understanding Beginner

Formal definition Intermediate+

Fix the dimension $n \geq 1$ and the temperature $T > 0$ . Let $V : R^{n} \to R$ be a smooth potential with $V (x) \to + \infty$ fast enough as $∣ x ∣ \to \infty$ that $\int_{R^{n}} e^{- V (x) / T} d x < \infty$ (coercivity). The Langevin stochastic differential equation for $x_{t} \in R^{n}$ is $$ dx_t = - \nabla V(x_t), dt + \sqrt{2 T}, dW_t, $$ where $W_{t}$ is a standard $n$ -dimensional Brownian motion (a continuous Gaussian process with $E [W_{t}] = 0$ and $E [W_{t} W_{s}^{⊤}] = min (t, s) I_{n}$ ) and the stochastic integral is interpreted in the sense of Itô.

The transition density $p (x, t ∣ x_{0}, 0)$ is the probability density of $x_{t}$ given $x_{0} = x_{0}$ ; the forward density $p (x, t)$ is the unconditional density of $x_{t}$ given a chosen initial law $p (x, 0) = p_{0} (x)$ . The Fokker-Planck equation (also called the Kolmogorov forward equation) is the parabolic partial differential equation $$ \partial_t p = \nabla \cdot (p, \nabla V) + T, \Delta p, \qquad p(x, 0) = p_0(x), $$ with $Δ = \sum_{i = 1}^{n} \partial_{i}^{2}$ the Laplacian and $\nabla \cdot$ the divergence. The right-hand side is the formal adjoint $L^{*}$ of the infinitesimal generator $$ L = - \nabla V \cdot \nabla + T, \Delta = T, \Delta - \nabla V \cdot \nabla $$ acting on test functions: for any $φ \in C_{c}^{2} (R^{n})$ , $$ \mathbb{E}[\varphi(x_t) \mid x_0] = \varphi(x_0) + \mathbb{E}\Bigl[\int_0^t (L \varphi)(x_s), ds \mid x_0\Bigr]. $$ The generator $L$ acts on observables; its formal adjoint $L^{*} := T Δ + \nabla \cdot (\cdot \nabla V)$ acts on densities. The Fokker-Planck equation reads $\partial_{t} p = L^{*} p$ .

A stationary solution is a time-independent solution $p_{e q}$ of the Fokker-Planck equation: $L^{*} p_{e q} = 0$ . The probability current is the vector field $$ J(x, t) := - p(x, t), \nabla V(x) - T, \nabla p(x, t), $$ so the Fokker-Planck equation is the conservation law $\partial_{t} p + \nabla \cdot J = 0$ . A stationary solution has $\nabla \cdot J_{e q} = 0$ ; a stationary solution with $J_{e q} \equiv 0$ pointwise is called a detailed-balance or reversible stationary distribution.

The Gibbs / Boltzmann distribution is the candidate equilibrium $$ p_{eq}(x) := \frac{1}{Z}, e^{-V(x)/T}, \qquad Z := \int_{\mathbb{R}^n} e^{-V(x)/T}, dx. $$ A direct computation gives $\nabla p_{e q} = - (1/ T) p_{e q} \nabla V$ , hence the equilibrium current $J_{e q} = - p_{e q} \nabla V - T \nabla p_{e q} = - p_{e q} \nabla V + p_{e q} \nabla V = 0$ . The Gibbs distribution is a detailed-balance stationary solution.

The weighted $L^{2}$ space $L^{2} (p_{e q} d x)$ has inner product $⟨ f, g ⟩_{e q} := \int f (x) g (x) p_{e q} (x) d x$ . The generator $L$ extends to a densely defined self-adjoint operator on $L^{2} (p_{e q} d x)$ with $L \leq 0$ (non-positive spectrum), and the Fokker-Planck semigroup $e^{t L^{*}}$ on densities is dual to the observable semigroup $e^{t L}$ on $L^{2} (p_{e q} d x)$ .

Counterexamples to common slips

The drift in the SDE is $- \nabla V$ (force = minus gradient of potential), not $+ \nabla V$ . The sign is fixed by the requirement that the well attracts the particle inward; a sign error puts the particle on the wrong side of an instability.
The noise coefficient $2 T$ encodes the Einstein relation between fluctuation and dissipation: the diffusion coefficient $T$ in the Fokker-Planck equation is exactly half the square of the SDE noise coefficient. Setting it to $T$ instead of $2 T$ in the SDE doubles the equilibrium temperature.
Coercivity of $V$ is essential. For $V \equiv 0$ on $R^{n}$ (free Brownian motion) the equation $\partial_{t} p = T Δ p$ has no normalisable stationary solution: any initial density spreads out to zero pointwise, and the heat-equation Green function $p (x, t) = (4 π T t)^{- n /2} e^{- ∣ x ∣^{2} / (4 T t)}$ shows the variance growing linearly in time without bound.
The Fokker-Planck equation is the forward equation in time. The backward equation, $\partial_{t} u = Lu$ for $u (x, t) = E [φ (x_{T}) ∣ x_{t} = x]$ with $T > t$ , runs backward from a terminal condition $u (x, T) = φ (x)$ . The two equations are formally adjoint and govern different quantities (density vs. observable expectation).
The Itô interpretation is built into the formula $\partial_{t} p = \nabla \cdot (p \nabla V) + T Δ p$ . The Stratonovich interpretation of the same SDE — $d x_{t} = - \nabla V (x_{t}) d t + 2 T \circ d W_{t}$ — gives a different Fokker-Planck equation when the noise coefficient is state-dependent; here the noise is additive (constant coefficient), so Itô and Stratonovich coincide.

Key theorem with proof Intermediate+

Theorem (Fokker-Planck from the Langevin SDE; Kolmogorov 1931 ^{[Kolmogorov 1931]}, Itô 1944 ^{[Itô 1944]}). Let $x_{t}$ satisfy $d x_{t} = - \nabla V (x_{t}) d t + 2 T d W_{t}$ with $V \in C^{2} (R^{n})$ and suitable growth conditions ensuring a unique strong solution. For any initial density $p_{0} \in L^{1} (R^{n})$ with $p_{0} \geq 0$ and $\int p_{0} = 1$ , the law of $x_{t}$ admits a density $p (x, t)$ in $C^{2, 1} (R^{n} \times (0, \infty))$ satisfying $$ \partial_t p = \nabla \cdot (p, \nabla V) + T, \Delta p, \qquad p(\cdot, 0) = p_0. $$ The Gibbs density $p_{e q} (x) = Z^{- 1} e^{- V (x) / T}$ is a stationary solution.

Proof. Apply Itô's formula to a test function $φ \in C_{c}^{2} (R^{n})$ . For each smooth path of the SDE, $$ d\varphi(x_t) = \nabla \varphi(x_t) \cdot dx_t + \tfrac{1}{2} \sum_{i, j} \partial_i \partial_j \varphi(x_t), d[x_i, x_j]t, $$ where $d [x_{i}, x_{j}]_{t} = 2 T δ_{ij} d t$ is the quadratic variation of the Itô SDE. Substituting $d x_{t} = - \nabla V (x_{t}) d t + 2 T d W_{t}$ , $$ d\varphi(x_t) = \bigl( - \nabla V(x_t) \cdot \nabla \varphi(x_t) + T, \Delta \varphi(x_t) \bigr) dt + \sqrt{2 T}, \nabla \varphi(x_t) \cdot dW_t. $$ The bracketed quantity is exactly $L φ$ with $L = T Δ - \nabla V \cdot \nabla$ . Taking expectations against the law of $x_{t}$ kills the Itô-martingale increment $\nabla φ \cdot d W_{t}$ and gives $$ \frac{d}{dt} \mathbb{E}[\varphi(x_t)] = \mathbb{E}[(L \varphi)(x_t)] = \int{\mathbb{R}^n} (L \varphi)(x), p(x, t), dx. $$

Write the left-hand side as $\frac{d}{d t} \int φ (x) p (x, t) d x = \int φ (x) \partial_{t} p (x, t) d x$ . Equating, $$ \int \varphi(x) \partial_t p(x, t), dx = \int (L \varphi)(x), p(x, t), dx = \int \varphi(x), (L^* p)(x, t), dx, $$ where the second equality is integration by parts: for $φ$ of compact support, $$ \int (T \Delta \varphi), p, dx = T \int \varphi, \Delta p, dx, \qquad - \int (\nabla V \cdot \nabla \varphi), p, dx = \int \varphi, \nabla \cdot (p \nabla V), dx. $$ The boundary terms vanish because $φ \in C_{c}^{2}$ . Hence $\int φ (x) [\partial_{t} p - L^{*} p] (x, t) d x = 0$ for every $φ \in C_{c}^{2}$ , so $\partial_{t} p = L^{*} p = T Δ p + \nabla \cdot (p \nabla V)$ in the weak sense. Parabolic regularity upgrades $p \in C^{2, 1}$ on $R^{n} \times (0, \infty)$ and the equation holds classically.

The Gibbs density is stationary: compute $$ L^* p_{eq} = T \Delta p_{eq} + \nabla \cdot (p_{eq} \nabla V). $$ From $p_{e q} = Z^{- 1} e^{- V / T}$ obtain $\nabla p_{e q} = - (1/ T) p_{e q} \nabla V$ and $Δ p_{e q} = - (1/ T) \nabla \cdot (p_{e q} \nabla V) = - (1/ T) [\nabla p_{e q} \cdot \nabla V + p_{e q} Δ V] = (1/ T^{2}) p_{e q} ∣\nabla V ∣^{2} - (1/ T) p_{e q} Δ V$ . Substitute: $$ T \Delta p_{eq} = (1/T) p_{eq} |\nabla V|^2 - p_{eq} \Delta V, $$ $$ \nabla \cdot (p_{eq} \nabla V) = \nabla p_{eq} \cdot \nabla V + p_{eq} \Delta V = -(1/T) p_{eq} |\nabla V|^2 + p_{eq} \Delta V. $$ Add the two: $L^{*} p_{e q} = 0$ . The Gibbs density satisfies the stationarity equation pointwise. $□$

Bridge. The derivation of the Fokker-Planck equation from the Itô SDE builds toward every equilibrium statistical-mechanics computation, and the same drift-plus-diffusion pattern appears again in 08.07.01 in the path-integral / functional-measure language. The foundational reason the Gibbs density is the unique reversible stationary solution is exactly the detailed-balance condition: the probability current $J_{e q} = - p_{e q} \nabla V - T \nabla p_{e q}$ vanishes pointwise on $p_{e q} = Z^{- 1} e^{- V / T}$ , and a coercive potential forces uniqueness via the spectral-gap argument of 08.06.01. This is exactly the equilibrium condition the Boltzmann distribution of 08.01.03 specifies — the equilibrium density's logarithm is the negative energy divided by the temperature. The central insight is that the Langevin dynamics samples the Gibbs measure: in the long-time limit, time averages along a single trajectory equal ensemble averages against the equilibrium density, and this identifies dynamical sampling with thermodynamic ensembles. The bridge is the recognition that Parisi-Wu stochastic quantisation generalises the Fokker-Planck route to field theory: replacing $x_{t} \in R^{n}$ by a field configuration $ϕ_{τ} \in S^{'} (R^{d})$ and the potential $V$ by the Euclidean action $S [ϕ]$ , the Langevin equation in the fictitious time $τ$ has the Euclidean Gibbs measure $e^{- S [ϕ]} / Z$ as its Fokker-Planck equilibrium. Putting these together, one Fokker-Planck framework produces every equilibrium law of equilibrium statistical mechanics, every Euclidean field theory measure of stochastic quantisation, and every quantitative rate of convergence to equilibrium controlled by the spectral gap or log-Sobolev constant of the generator.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

For the one-dimensional Ornstein-Uhlenbeck process with $V (x) = (1/2) ω^{2} x^{2}$ , compute $⟨ x_{t} ⟩$ and $⟨ x_{t}^{2} ⟩$ given a deterministic start at $x_{0}$ . Show that $⟨ x_{t} ⟩ = x_{0} e^{- ω^{2} t}$ and $⟨ x_{t}^{2} ⟩ = x_{0}^{2} e^{- 2 ω^{2} t} + (T / ω^{2}) (1 - e^{- 2 ω^{2} t})$ .

Hint

Take expectations of the SDE $d x_{t} = - ω^{2} x_{t} d t + 2 T d W_{t}$ . For the second moment, apply Itô's formula to $x_{t}^{2}$ and take expectations.

Answer

Take expectations of the SDE: $d ⟨ x_{t} ⟩ = - ω^{2} ⟨ x_{t} ⟩ d t$ , since the Brownian increment averages to zero. Solving the ODE with $⟨ x_{0} ⟩ = x_{0}$ gives $⟨ x_{t} ⟩ = x_{0} e^{- ω^{2} t}$ .

For the second moment, apply Itô's formula to $φ (x) = x^{2}$ : $d (x_{t}^{2}) = 2 x_{t} d x_{t} + 2 T d t = (- 2 ω^{2} x_{t}^{2} + 2 T) d t + 2 x_{t} 2 T d W_{t}$ . Take expectations: $d ⟨ x_{t}^{2} ⟩ / d t = - 2 ω^{2} ⟨ x_{t}^{2} ⟩ + 2 T$ . The linear ODE with initial condition $⟨ x_{0}^{2} ⟩ = x_{0}^{2}$ has solution $⟨ x_{t}^{2} ⟩ = x_{0}^{2} e^{- 2 ω^{2} t} + (T / ω^{2}) (1 - e^{- 2 ω^{2} t})$ . As $t \to \infty$ the second moment tends to $T / ω^{2}$ , the equilibrium variance. Rubric: full credit for both moments with the Itô-formula step explained.

Exercise 4 (medium, short-answer).

Show that the generator $L = T Δ - \nabla V \cdot \nabla$ is self-adjoint and non-positive on the weighted $L^{2} (p_{e q})$ space with inner product $⟨ f, g ⟩_{e q} = \int f g p_{e q} d x$ .

Hint

Compute $⟨ f, Lg ⟩_{e q}$ by integration by parts. Use the identity $\nabla p_{e q} = - (1/ T) p_{e q} \nabla V$ to absorb the drift term into a divergence.

Answer

Write $Lg = T Δ g - \nabla V \cdot \nabla g$ . The drift term satisfies $$ \int f (-\nabla V \cdot \nabla g) p_{eq}, dx = -\int (\nabla V \cdot \nabla g) f p_{eq}, dx, $$ and integration by parts on the diffusion term (with $\nabla p_{e q} = - p_{e q} \nabla V / T$ ) gives $$ \int f, T \Delta g, p_{eq}, dx = -T \int \nabla(f p_{eq}) \cdot \nabla g, dx = -T \int (\nabla f \cdot \nabla g) p_{eq}, dx + \int f (\nabla V \cdot \nabla g) p_{eq}, dx. $$ Add: $⟨ f, Lg ⟩_{e q} = - T \int (\nabla f \cdot \nabla g) p_{e q} d x$ . By symmetry in $f, g$ the form is symmetric: $⟨ f, Lg ⟩_{e q} = ⟨ L f, g ⟩_{e q}$ . Setting $f = g$ gives $⟨ g, Lg ⟩_{e q} = - T \int ∣\nabla g ∣^{2} p_{e q} d x \leq 0$ , so $L$ is non-positive. Rubric: full credit for the integration-by-parts identity plus the symmetry argument.

Exercise 5 (medium, symbolic).

For the multi-dimensional Ornstein-Uhlenbeck process $d x_{t} = - A x_{t} d t + 2 T d W_{t}$ with $A$ a symmetric positive-definite matrix, find the equilibrium covariance $Σ$ and verify $A Σ + Σ A = 2 T I$ .

Hint

The equilibrium is Gaussian: $V (x) = (1/2) x^{⊤} A x$ gives $p_{e q} (x) \propto e^{- x^{⊤} A x / (2 T)}$ . Read off the covariance.

Answer

The Gibbs density for $V (x) = (1/2) x^{⊤} A x$ is $p_{e q} (x) = (det (A) / (2 π T)^{n})^{1/2} e^{- x^{⊤} A x / (2 T)}$ , a Gaussian with covariance $Σ = T A^{- 1}$ . Verify the Lyapunov equation: $$ A \Sigma + \Sigma A = A (T A^{-1}) + (T A^{-1}) A = T I + T I = 2 T I. $$ The Lyapunov equation is the matrix-valued stationarity condition for the covariance: $d ⟨ x_{t} x_{t}^{⊤} ⟩ / d t = - A ⟨ x_{t} x_{t}^{⊤} ⟩ - ⟨ x_{t} x_{t}^{⊤} ⟩ A + 2 T I$ from Itô's formula, and the stationary value is exactly $Σ = T A^{- 1}$ . Rubric: full credit for $Σ = T A^{- 1}$ plus the Lyapunov verification.

Exercise 6 (medium, symbolic).

Show that the relative entropy $H (p ∥ p_{e q}) := \int p lo g (p / p_{e q}) d x$ is non-increasing along solutions $p (\cdot, t)$ of the Fokker-Planck equation, with $\frac{d}{d t} H = - T \int p ∣\nabla lo g (p / p_{e q}) ∣^{2} d x$ .

Hint

Differentiate under the integral and substitute $\partial_{t} p = L^{*} p$ . Use $L^{*} p = \nabla \cdot (T \nabla p + p \nabla V)$ and integrate by parts.

Answer

Let $h := lo g (p / p_{e q})$ . Compute $$ \frac{d}{dt} H = \int \partial_t p \cdot (1 + h), dx = \int (\nabla \cdot J^) (1 + h), dx, $$ where $J^ := -T \nabla p - p \nabla V = -p_{eq} \cdot T \nabla(p/p_{eq}) $i s t h e min u s - c u r r e n t . T h eco n s t an t$ 1 $co n t r ib u t i o n v ani s h es (co n ser v a t i o n o f ma ss) :$ \int \nabla \cdot J^, dx = 0$. The remaining term, after integration by parts, $$ \int (\nabla \cdot J^), h, dx = - \int J^* \cdot \nabla h, dx = \int p_{eq} T \nabla(p/p_{eq}) \cdot \nabla h, dx. $$ With $h = lo g (p / p_{e q})$ and $\nabla h = \nabla (p / p_{e q}) / (p / p_{e q})$ , write $p_{e q} \nabla (p / p_{e q}) = p_{e q} (p / p_{e q}) \nabla h = p \nabla h$ . So the expression equals $T \int p ∣\nabla h ∣^{2} d x = T \int p ∣\nabla lo g (p / p_{e q}) ∣^{2} d x$ , and hence $$ \frac{d}{dt} H = - T \int p |\nabla \log(p/p_{eq})|^2 dx \leq 0, $$ the Fokker-Planck H-theorem. Rubric: full credit for the integration-by-parts identification of the entropy production with the Fisher information.

Exercise 7 (hard, short-answer).

State and outline the proof of the Bakry-Émery criterion: if $V \in C^{2} (R^{n})$ satisfies the curvature-dimension condition $Hess (V) \geq ρ I$ pointwise for some $ρ > 0$ , then $H (p (\cdot, t) ∥ p_{e q}) \leq e^{- 2 ρt / T} H (p_{0} ∥ p_{e q})$ for every initial density $p_{0}$ .

Hint

The Bakry-Émery $Γ_{2}$ -calculus identity reads $Γ_{2} (f, f) = ∣ Hess (f) ∣^{2} + \nabla f \cdot Hess (V) \nabla f / T$ for the generator $L$ . Combine with the log-Sobolev inequality $H \leq (T /2 ρ) I$ , where $I = \int p ∣\nabla lo g (p / p_{e q}) ∣^{2} d x$ is the Fisher information.

Answer

The Bakry-Émery criterion is the curvature-dimension condition CD( $ρ, \infty$ ): $Hess (V) \geq ρ I$ pointwise on $R^{n}$ for some $ρ > 0$ . Under this condition the equilibrium measure $p_{e q} \propto e^{- V / T}$ satisfies a logarithmic Sobolev inequality (LSI) $$ H(p | p_{eq}) \leq \frac{T}{2 \rho}, I(p | p_{eq}), \qquad I(p | p_{eq}) := \int p, |\nabla \log(p/p_{eq})|^2, dx. $$ Combine the LSI with the entropy-production identity $\frac{d}{d t} H = - T \cdot I$ from Exercise 6: $$ \frac{d}{dt} H \leq -T \cdot \frac{2 \rho}{T} H = -2 \rho H. $$ Gronwall's inequality yields $H (p (\cdot, t) ∥ p_{e q}) \leq e^{- 2 ρt} H (p_{0} ∥ p_{e q})$ .

(Note the $1/ T$ inside the LSI: the temperature drops out of the final rate when written in the temperature-normalised time variable; the formula in the brief assumes the $T$ -normalised generator. The clean rate is $e^{- 2 ρt}$ .)

The proof of the LSI uses Bakry-Émery's $Γ_{2}$ -calculus: write $Γ (f, f) := \frac{1}{2} (L (f^{2}) - 2 f L f) = T ∣\nabla f ∣^{2}$ and $Γ_{2} (f, f) := \frac{1}{2} (L Γ (f, f) - 2Γ (f, L f))$ . A direct calculation gives $Γ_{2} (f, f) = T^{2} ∣ Hess (f) ∣^{2} + T \nabla f \cdot Hess (V) \nabla f \geq T ρ \cdot T ∣\nabla f ∣^{2} = T ρ Γ (f, f)$ . This $Γ_{2} \geq ρ Γ$ inequality is the $T$ -rescaled CD criterion; it integrates to the LSI by Bakry-Émery's interpolation argument along the semigroup $e^{t L}$ . Rubric: full credit for the LSI statement plus the Gronwall step plus the $Γ_{2}$ identification.

Exercise 8 (hard, short-answer).

Connect Parisi-Wu stochastic quantisation to the Fokker-Planck framework. Show that the field-theoretic Langevin equation $\partial_{τ} ϕ (x, τ) = - (δ S / δ ϕ) (x, τ) + 2 η (x, τ)$ with $⟨ η (x, τ) η (y, τ^{'})⟩ = δ (x - y) δ (τ - τ^{'})$ has the Euclidean Gibbs measure $e^{- S [ϕ]} / Z$ as its $τ \to \infty$ equilibrium.

Hint

The argument is the field-theory analogue of the finite-dimensional one. Identify $ϕ$ with $x \in R^{n}$ , $S$ with $V$ , $τ$ with $t$ , and the noise $η$ with $2 d W / d t$ . The temperature $T = 1$ in the Parisi-Wu normalisation.

Answer

The Parisi-Wu Langevin equation in fictitious time $τ$ is $$ \frac{\partial \phi(x, \tau)}{\partial \tau} = - \frac{\delta S[\phi]}{\delta \phi(x, \tau)} + \sqrt{2}, \eta(x, \tau), \qquad \langle \eta(x, \tau) \eta(y, \tau') \rangle = \delta^d(x - y) \delta(\tau - \tau'), $$ the infinite-dimensional analogue of $d x_{t} = - \nabla V (x_{t}) d t + 2 d W_{t}$ with $x \to ϕ (\cdot, τ)$ , $V \to S [ϕ]$ , $t \to τ$ , temperature $T = 1$ .

The associated Fokker-Planck equation governs the probability functional $P [ϕ, τ]$ on field configurations: $$ \frac{\partial P}{\partial \tau} = \int d^d x, \frac{\delta}{\delta \phi(x)} \Bigl( P, \frac{\delta S}{\delta \phi(x)} + \frac{\delta P}{\delta \phi(x)} \Bigr). $$ Stationarity requires the functional current to vanish: $P_{e q} δ S / δ ϕ + δ P_{e q} / δ ϕ = 0$ at every $x$ . The solution is $P_{e q} [ϕ] = e^{- S [ϕ]} / Z$ , the Euclidean Gibbs measure on field configurations, since $δ (lo g P_{e q}) / δ ϕ (x) = - δ S / δ ϕ (x)$ exactly cancels the drift.

In the long- $τ$ limit, $τ$ -averages of observables $F [ϕ]$ along Langevin trajectories converge to ensemble averages $⟨ F ⟩_{e q} = \int D ϕ F [ϕ] e^{- S [ϕ]} / Z$ . This is the operational content of stochastic quantisation: the Euclidean path-integral measure is the equilibrium of a fictitious-time Langevin process, and observables are computed by Monte Carlo simulation of the SDE rather than by direct sampling of the path integral. The procedure bypasses gauge fixing in non-abelian gauge theories — the gauge orbit of the Langevin trajectory is irrelevant in the long- $τ$ limit because the gauge-invariant equilibrium is the same regardless. Rubric: full credit for the field-theoretic Fokker-Planck plus the equilibrium identification plus the gauge-bypass observation.

Advanced results Master

Theorem (spectral gap of the Ornstein-Uhlenbeck generator; Risken §5.4 ^[Risken]). Let $V (x) = (1/2) x^{⊤} A x$ with $A$ symmetric positive-definite on $R^{n}$ with eigenvalues $ω_{1}^{2} \leq \dots \leq ω_{n}^{2}$ . The generator $L = T Δ - A x \cdot \nabla$ is self-adjoint on $L^{2} (p_{e q})$ with $p_{e q} (x) = (det A / (2 π T)^{n})^{1/2} e^{- x^{⊤} A x / (2 T)}$ . Its spectrum is $$ \sigma(L) = \Bigl{ -\sum_{i=1}^{n} k_i, \omega_i^2, : (k_1, \ldots, k_n) \in \mathbb{N}0^n \Bigr}, $$ *with the spectral gap (smallest non-zero eigenvalue in absolute value) equal to $λ_{1} = ω_{1}^{2}$ . The eigenfunctions are multivariate Hermite polynomials in the variables $A^{1/2} x / T$ , and the Fokker-Planck semigroup decays as $|p(\cdot, t) - p{eq}|{L^2(p{eq}^{-1})} \leq e^{-\omega_1^2 t} |p_0 - p_{eq}|{L^2(p{eq}^{-1})}$.*

The Ornstein-Uhlenbeck spectral gap $ω_{1}^{2}$ is the smallest eigenvalue of the Hessian of $V$ , and the Hermite polynomial diagonalisation makes the spectral resolution of $L$ completely explicit. For non-quadratic $V$ with $Hess (V) \geq ρ I$ , the gap is bounded below by $ρ$ (Bakry-Émery); for non-convex $V$ the gap may be much smaller than $in f_{x} λ_{m i n} (Hess (V))$ — the equilibrium can be multimodal, and the rate of inter-mode tunnelling is exponentially small in $1/ T$ (Eyring-Kramers / Freidlin-Wentzell large deviations).

Theorem (H-theorem; Markowich-Villani 2000 ^{[Markowich-Villani]}). Let $p_{e q}$ be the Gibbs density on $R^{n}$ and let $p (\cdot, t)$ solve the Fokker-Planck equation with $p (\cdot, 0) = p_{0}$ . The relative entropy $H (p ∥ p_{e q}) = \int p lo g (p / p_{e q}) d x$ is finite for $p_{0} lo g (p_{0} / p_{e q}) \in L^{1}$ and satisfies $$ \frac{d}{dt} H(p(\cdot, t) | p_{eq}) = -T \int p, |\nabla \log(p/p_{eq})|^2, dx = -T \cdot I(p | p_{eq}), $$ where $I$ is the relative Fisher information. The entropy is non-increasing in time, and identically zero only at $p = p_{e q}$ .

This is the dissipative structure of the Fokker-Planck flow: the relative entropy is a Lyapunov functional decreasing along the dynamics, with dissipation rate equal to $T \cdot I$ . The interpretation is statistical-mechanical: $H$ measures the "distance from equilibrium" in an information-theoretic sense (Kullback-Leibler divergence), and the second law of thermodynamics manifests as the monotonic decrease of $H$ toward zero. The dissipation $T \cdot I$ is the entropy production rate, and the bridge to the Boltzmann H-theorem is that the same identity holds in any drift-diffusion process with a reversible stationary distribution.

Theorem (logarithmic Sobolev inequality; Bakry-Émery 1985 ^{[Bakry-Emery 1985]}). Suppose $V \in C^{2} (R^{n})$ satisfies $Hess (V) (x) \geq ρ I$ for all $x \in R^{n}$ and some $ρ > 0$ (strict log-concavity). The Gibbs measure $p_{e q} = Z^{- 1} e^{- V / T} d x$ satisfies the logarithmic Sobolev inequality (LSI) $$ H(p | p_{eq}) \leq \frac{T}{2 \rho}, I(p | p_{eq}) $$ for every probability density $p$ with finite relative entropy. Combined with the H-theorem, $H (p (\cdot, t) ∥ p_{e q}) \leq e^{- 2 ρt} H (p_{0} ∥ p_{e q})$ , exponential convergence at rate $2 ρ$ .

The Bakry-Émery proof uses the $Γ_{2}$ -calculus: define $Γ (f, f) = T ∣\nabla f ∣^{2}$ and $Γ_{2} (f, f) = \frac{1}{2} (L Γ (f, f) - 2Γ (f, L f))$ . A bilinear-form computation gives $Γ_{2} (f, f) = T^{2} ∣ Hess (f) ∣^{2} + T \nabla f \cdot Hess (V) \nabla f \geq T ρ \cdot Γ (f, f)$ . Bakry-Émery's interpolation along the semigroup $e^{t L}$ integrates this pointwise inequality to the LSI. The constant $T / (2 ρ)$ is sharp on the Ornstein-Uhlenbeck process (Gross 1975's original Gaussian LSI). The criterion generalises to weighted Riemannian manifolds via $Ric_{M} + Hess (V) \geq ρ g$ , the curvature-dimension condition CD( $ρ, \infty$ ) that opened the synthetic theory of Ricci curvature (Lott-Sturm-Villani 2009 ^{[Villani 2009]}).

Theorem (path-integral / Onsager-Machlup; Damgaard-Hüffel 1987 ^{[Damgaard-Huffel 1987]}). Let $x_{t}$ satisfy the Langevin SDE on $[0, t]$ with initial law $δ (x - x_{0})$ . The probability functional density on continuous paths ${x_{s}}_{0 \leq s \leq t}$ , defined by Girsanov-Cameron-Martin against the Wiener measure, has the formal weight $$ \mathcal{P}[x] = \mathcal{N}, \exp\Bigl( -\frac{1}{4 T} \int_0^t \bigl( \dot x_s + \nabla V(x_s) \bigr)^2 ds - \frac{1}{2} \int_0^t \Delta V(x_s), ds \Bigr), $$ with $N$ a path-space normalisation. The first term is the Onsager-Machlup action, a quadratic dissipative action; the second term is the Itô-Stratonovich correction. The Euclidean path integral of stochastic quantisation is the long-time limit of this expression, with the equilibrium measure on configurations recovered by integrating out the path histories.

The path-integral expression identifies the Fokker-Planck transition density with a path integral over continuous trajectories weighted by the Onsager-Machlup action $\int (\overset{x}{˙} + \nabla V)^{2} / (4 T) d s$ . This is the dissipative analogue of the Lagrangian path integral of quantum mechanics, with Wick rotation turning the quadratic-in- $\overset{x}{˙}$ kinetic term into a positive Euclidean weight. Parisi-Wu stochastic quantisation ^{[Parisi-Wu 1981]} takes the path-integral form as a derivation route to Euclidean field theory: a Langevin SDE in a fictitious fifth time $τ$ , with the Euclidean action $S [ϕ]$ as the potential, equilibrates to the Euclidean Gibbs measure $e^{- S} / Z$ as $τ \to \infty$ , and the Onsager-Machlup action becomes the dissipative super-action of the stochastic-quantisation construction.

Theorem (hypocoercivity; Villani 2009 ^{[Villani 2009]}). Let $L = T Δ - \nabla V \cdot \nabla$ on $L^{2} (p_{e q})$ . Suppose $V \in C^{2} (R^{n})$ is coercive ( $e^{- V / T} \in L^{1}$ ) and satisfies a Poincaré inequality with constant $ρ$ : $Var_{p_{e q}} (f) \leq ρ^{- 1} \int ∣\nabla f ∣^{2} p_{e q} d x$ for $f \in H^{1} (p_{e q})$ . Then the Fokker-Planck semigroup decays exponentially in $L^{2} (p_{e q})$ at rate $ρT$ , and in entropy at rate $2 ρ$ under the stronger LSI. For degenerate diffusions (kinetic Fokker-Planck with momentum-only diffusion), exponential decay still holds under quantitatively explicit conditions, by Villani's hypocoercivity machinery using twisted-norm Lyapunov functionals.

The hypocoercivity result is the modern quantitative theory: for the elliptic Fokker-Planck of the present unit, Poincaré-inequality / LSI / Bakry-Émery give exponential rates; for the kinetic Fokker-Planck where only the momentum variable diffuses (Boltzmann-type collision operators, Kramers' equation), the diffusion is degenerate in position and the Poincaré argument fails directly. Villani's twisted-norm functionals restore exponential convergence, identifying the rate with a combination of the position-space gap and the momentum-space dissipation. The same hypocoercivity machinery applies to dissipative QFT models with field-only or momentum-only noise.

Theorem (uniqueness of the stationary distribution under coercivity). Let $V \in C^{2} (R^{n})$ be coercive ( $V (x) \to \infty$ as $∣ x ∣ \to \infty$ with $e^{- V / T} \in L^{1}$ ). The Gibbs density $p_{e q} = Z^{- 1} e^{- V / T}$ is the unique probability density in $L^{1} (R^{n})$ satisfying $L^ p_{eq} = 0$.*

The proof uses the H-theorem: every stationary density $\tilde{p}$ has $\frac{d}{d t} H (\tilde{p} ∥ p_{e q}) = 0$ , hence $\int \tilde{p} ∣\nabla lo g (\tilde{p} / p_{e q}) ∣^{2} d x = 0$ , hence $\tilde{p} / p_{e q}$ is constant on connected components of the support of $\tilde{p}$ . Coercivity forces this constant to be globally one (only one connected support consistent with normalisation), so $\tilde{p} = p_{e q}$ . The uniqueness is what makes the Langevin SDE an ergodic sampler of the Gibbs measure: every initial condition relaxes to the same equilibrium, justifying the Monte-Carlo / Markov-chain methods of computational statistical mechanics.

Theorem (stochastic-quantisation of Euclidean field theory; Parisi-Wu 1981 ^{[Parisi-Wu 1981]}). Let $S [ϕ]$ be a Euclidean action on field configurations $ϕ \in S^{'} (R^{d})$ such that $e^{- S [ϕ]} D ϕ / Z$ is a well-defined probability measure. The functional Langevin equation $$ \frac{\partial \phi(x, \tau)}{\partial \tau} = - \frac{\delta S[\phi]}{\delta \phi(x, \tau)} + \sqrt{2}, \eta(x, \tau) $$ with $η$ a Gaussian white noise in the auxiliary time $τ$ has, as $τ \to \infty$ , the Euclidean Gibbs measure $e^{- S [ϕ]} D ϕ / Z$ as its stationary distribution under the Fokker-Planck flow. Time-averages of observables along Langevin trajectories converge to Euclidean path-integral expectations.

This is the field-theoretic generalisation of the finite-dimensional Fokker-Planck framework, and the central content of stochastic quantisation. The procedure provides an alternative to gauge fixing in non-abelian gauge theories — the Langevin flow lives on the orbit space of gauge-equivalent configurations, and the equilibrium measure is gauge-invariant. The associated Onsager-Machlup action gives the path-integral weight of stochastic processes in field space, which the modern programme of stochastic regularisation uses as a UV-cutoff scheme (Hüffel-Rumpf, Floratos-Iliopoulos). The connection to the Fock-space construction of 08.10.01 is the Gaussian-Fock correspondence: the Parisi-Wu equilibrium of the free Euclidean action $S [ϕ] = \frac{1}{2} \int (\nabla ϕ)^{2} + m^{2} ϕ^{2}$ is the Gaussian free-field measure of 08.06.01, which Wick-rotates to the Wightman vacuum expectations of the free Klein-Gordon field on $F_{s} (H_{m})$ .

Synthesis. The Fokker-Planck equation is the foundational reason every classical equilibrium statistical-mechanics distribution can be realised as the long-time limit of a continuous-time stochastic process. The central insight is that the Langevin SDE $d x_{t} = - \nabla V d t + 2 T d W_{t}$ has the Gibbs distribution $p_{e q} = Z^{- 1} e^{- V / T}$ as its unique reversible stationary density under the Fokker-Planck flow $\partial_{t} p = \nabla \cdot (p \nabla V) + T Δ p$ , and this is exactly the equilibrium condition of 08.01.03 expressed as a differential equation rather than as a microcanonical / canonical / grand-canonical formula. Putting these together, the drift-diffusion balance, the H-theorem decay of relative entropy, the spectral-gap rate of convergence, and the Bakry-Émery log-Sobolev refinement form one dissipative framework that handles every coercive potential on $R^{n}$ . The bridge between the SDE and the PDE is the Itô formula: applying it to a smooth observable produces the generator $L$ acting on the observable, integration by parts converts $L$ into its adjoint $L^{*}$ acting on the density, and the resulting parabolic equation is the Fokker-Planck equation. This is exactly the bridge that appears again in 08.07.01 (path integral formulation of statistical mechanics) under Wick rotation, where the dissipative Langevin process becomes the imaginary-time path integral of 08.09.01 (Wick rotation).

The Fokker-Planck framework identifies several constructions that look distinct at first inspection. Equilibrium statistical mechanics (the Boltzmann distribution of 08.01.03) is the stationary law of a dissipative SDE. The path-integral formulation (the Onsager-Machlup weight on trajectories) is the path-integral expression of the same stationary law on configuration space. The Parisi-Wu stochastic-quantisation framework generalises the construction to Euclidean field theory: a Langevin equation in fictitious time $τ$ , with the Euclidean action $S [ϕ]$ playing the role of the potential, has $e^{- S} / Z$ as its $τ \to \infty$ Fokker-Planck equilibrium, and bypasses gauge fixing because the equilibrium measure is automatically gauge-invariant. The bridge between these is that all are different presentations of the same diffusion-with-drift dynamics: the configuration-space Fokker-Planck flow, the path-space Onsager-Machlup action, and the field-theory functional Fokker-Planck flow are one mathematical object viewed from three angles. The quantitative rates — spectral gap, log-Sobolev constant, Poincaré constant — control how fast a Markov-chain Monte Carlo of the Langevin SDE equilibrates, and the Bakry-Émery $Γ_{2}$ -calculus gives sharp rates whenever the potential is strongly convex. The recursion stabilises after the equilibrium is reached: at equilibrium, time-averages along a Langevin trajectory equal ensemble-averages against the Gibbs measure, and this identifies dynamical sampling with thermodynamic ensembles in the precise sense made operational by ergodic theorems for the Markov semigroup.

Full proof set Master

Proposition (Itô formula derivation of the Fokker-Planck equation). Given the Langevin SDE $d x_{t} = - \nabla V (x_{t}) d t + 2 T d W_{t}$ on $R^{n}$ with $V \in C^{2}$ and sufficient growth conditions, the density $p (x, t)$ of $x_{t}$ satisfies $\partial_{t} p = T Δ p + \nabla \cdot (p \nabla V)$ in the weak sense, and classically on $R^{n} \times (0, \infty)$ by parabolic regularity.

Proof. For $φ \in C_{c}^{2} (R^{n})$ , Itô's formula applied to $φ (x_{t})$ reads $$ d\varphi(x_t) = (\nabla \varphi(x_t)) \cdot dx_t + \tfrac{1}{2} \sum_{i,j} (\partial_i \partial_j \varphi(x_t)), d[x_i, x_j]_t. $$ The quadratic variation of the additive-noise Itô SDE is $d [x_{i}, x_{j}]_{t} = 2 T δ_{ij} d t$ (the deterministic drift contributes zero to the bracket). Substituting and grouping, $$ d\varphi(x_t) = \bigl[ -\nabla V(x_t) \cdot \nabla \varphi(x_t) + T \Delta \varphi(x_t) \bigr], dt + \sqrt{2 T} \nabla \varphi(x_t) \cdot dW_t. $$ Take expectation with respect to the law of $x_{t}$ . The Itô integral $\int_{0}^{t} \nabla φ (x_{s}) \cdot d W_{s}$ is a martingale with mean zero (under suitable integrability hypotheses, which $φ \in C_{c}^{2}$ guarantees through the boundedness of $\nabla φ$ ). Hence $$ \frac{d}{dt} \mathbb{E}[\varphi(x_t)] = \mathbb{E}[L \varphi (x_t)], \qquad L = T \Delta - \nabla V \cdot \nabla. $$

Express both sides as integrals against the density: $\frac{d}{d t} \int φp d x = \int (L φ) p d x$ . Integration by parts gives, with $φ$ of compact support, $$ \int \varphi \partial_t p, dx = \int \varphi (L^* p), dx, \qquad L^* p = T \Delta p + \nabla \cdot (p \nabla V), $$ where $L^{*}$ is the formal adjoint computed via $\int (T Δ φ) p = \int φ (T Δ p)$ and $\int (- \nabla V \cdot \nabla φ) p = \int φ \nabla \cdot (p \nabla V)$ . The weak equation $\partial_{t} p = L^{*} p$ holds against every $φ \in C_{c}^{2}$ , and parabolic regularity (interior Schauder estimates for the parabolic operator with $C^{1}$ coefficients) gives $p \in C^{2, 1}$ on $R^{n} \times (0, \infty)$ and the equation classically. $□$

Proposition (Gibbs density is reversible stationary). The Gibbs density $p_{e q} (x) = Z^{- 1} e^{- V (x) / T}$ satisfies $L^ p_{eq} = 0 $an d t h e d e t ai l e d - ba l an ceco n d i t i o n$ J_{eq} \equiv 0$ pointwise.*

Proof. The probability current $J = - p \nabla V - T \nabla p$ associated with $\partial_{t} p + \nabla \cdot J = 0$ has, on the Gibbs density, $$ J_{eq}(x) = - p_{eq}(x) \nabla V(x) - T \nabla p_{eq}(x). $$ Differentiate $p_{e q} = Z^{- 1} e^{- V / T}$ : $\nabla p_{e q} = - (1/ T) p_{e q} \nabla V$ . Substitute: $J_{e q} = - p_{e q} \nabla V - T \cdot (- (1/ T) p_{e q} \nabla V) = - p_{e q} \nabla V + p_{e q} \nabla V = 0$ pointwise. So $J_{e q} = 0$ pointwise, hence $\nabla \cdot J_{e q} = 0$ as an identity, and $L^{*} p_{e q} = - \nabla \cdot J_{e q} = 0$ (with the parabolic-conservation sign convention $\partial_{t} p + \nabla \cdot J = 0$ , this gives $L^{*} p_{e q} = 0$ ). The detailed-balance condition is the stronger pointwise vanishing of the current; ordinary stationarity is the divergence-free condition. $□$

Proposition (self-adjointness of $L$ on $L^{2} (p_{e q})$ and Dirichlet form). On the weighted $L^{2} (p_{e q})$ space, the generator $L$ is densely defined, symmetric, and non-positive with Dirichlet form $$ \mathcal{E}(f, g) := -\langle f, L g \rangle_{eq} = T \int (\nabla f \cdot \nabla g), p_{eq}, dx, \qquad f, g \in H^1(p_{eq}). $$ $L$ extends to a self-adjoint operator on $L^{2} (p_{e q})$ , the Friedrichs extension of the Dirichlet form, with $- L \geq 0$ and spectrum $σ (- L) \subseteq [0, \infty)$ .

Proof. For $f, g \in C_{c}^{\infty} (R^{n})$ compute $⟨ f, Lg ⟩_{e q} = \int f (T Δ g - \nabla V \cdot \nabla g) p_{e q} d x$ . The drift term integrates by parts using $\nabla p_{e q} = - (1/ T) p_{e q} \nabla V$ , hence $p_{e q} \nabla V = - T \nabla p_{e q}$ : $$ \int f (-\nabla V \cdot \nabla g) p_{eq}, dx = T \int f (\nabla \log p_{eq}) \cdot \nabla g, p_{eq}, dx = T \int (\nabla g) \cdot (f \nabla p_{eq}), dx. $$ Integration by parts on the diffusion term yields $\int f T Δ g p_{e q} d x = - T \int \nabla (f p_{e q}) \cdot \nabla g d x = - T \int (\nabla f) \cdot (\nabla g) p_{e q} d x - T \int f (\nabla g) \cdot \nabla p_{e q} d x$ . Adding the two contributions, the $f \nabla p_{e q} \cdot \nabla g$ terms cancel, leaving $$ \langle f, L g \rangle_{eq} = -T \int (\nabla f) \cdot (\nabla g) p_{eq}, dx = -\mathcal{E}(f, g). $$ Symmetry in $f, g$ gives $⟨ f, Lg ⟩_{e q} = ⟨ L f, g ⟩_{e q}$ on $C_{c}^{\infty}$ , and the Dirichlet form is closable on $L^{2} (p_{e q})$ , so $L$ extends to a self-adjoint operator. Setting $f = g$ , $E (g, g) = T \int ∣\nabla g ∣^{2} p_{e q} d x \geq 0$ , hence $- L \geq 0$ . The spectral theorem packages $L$ as $- \int_{0}^{\infty} λ d E_{λ}$ with $E_{λ}$ a projection-valued measure on $[0, \infty)$ . $□$

Proposition (Ornstein-Uhlenbeck explicit solution). For $V (x) = (1/2) ω^{2} x^{2}$ on $R$ , the Fokker-Planck equation $$ \partial_t p = T \partial_x^2 p + \partial_x(\omega^2 x p) $$ has fundamental solution (transition density) $$ p(x, t \mid x_0, 0) = \Bigl( \frac{\omega^2}{2 \pi T (1 - e^{-2\omega^2 t})} \Bigr)^{1/2} \exp\Bigl( -\frac{\omega^2 (x - x_0 e^{-\omega^2 t})^2}{2 T (1 - e^{-2\omega^2 t})} \Bigr), $$ the Gaussian centred at the deterministic drift trajectory $x_{0} e^{- ω^{2} t}$ with variance $(T / ω^{2}) (1 - e^{- 2 ω^{2} t})$ . As $t \to \infty$ , the variance tends to $T / ω^{2}$ and the centre tends to $0$ , recovering the Gibbs equilibrium $p_{e q} (x) = (ω^{2} / (2 π T))^{1/2} e^{- ω^{2} x^{2} / (2 T)}$ .

Proof. The SDE $d x_{t} = - ω^{2} x_{t} d t + 2 T d W_{t}$ has explicit solution (variation of parameters): $x_{t} = x_{0} e^{- ω^{2} t} + 2 T \int_{0}^{t} e^{- ω^{2} (t - s)} d W_{s}$ . The Itô integral $\int_{0}^{t} e^{- ω^{2} (t - s)} d W_{s}$ is Gaussian with mean zero and variance $\int_{0}^{t} e^{- 2 ω^{2} (t - s)} d s = (1 - e^{- 2 ω^{2} t}) / (2 ω^{2})$ . So $x_{t}$ is Gaussian with mean $x_{0} e^{- ω^{2} t}$ and variance $2 T \cdot (1 - e^{- 2 ω^{2} t}) / (2 ω^{2}) = (T / ω^{2}) (1 - e^{- 2 ω^{2} t})$ . Write the Gaussian density to obtain the stated transition formula. Verification that this density satisfies the Fokker-Planck equation is a direct (somewhat tedious) computation in derivatives; alternatively, the explicit Gaussian solution is the heat-kernel of the operator $T \partial_{x}^{2} + ω^{2} \partial_{x} (x \cdot)$ via Mehler's formula. As $t \to \infty$ the variance saturates at $T / ω^{2}$ and the mean dies to zero, recovering $p_{e q} (x) = (ω^{2} / (2 π T))^{1/2} e^{- ω^{2} x^{2} / (2 T)}$ . $□$

Proposition (H-theorem dissipation identity). For $p (\cdot, t)$ solving the Fokker-Planck equation with sufficient regularity and decay, the relative entropy $H (t) := \int p (\cdot, t) lo g (p (\cdot, t) / p_{e q}) d x$ satisfies $\frac{d H}{d t} = - T \int p ∣\nabla lo g (p / p_{e q}) ∣^{2} d x \leq 0$ , with equality iff $p = p_{e q}$ .

Proof. Set $h := p / p_{e q}$ , so $p = h p_{e q}$ and $H = \int p_{e q} h lo g h d x$ . Compute $$ \frac{dH}{dt} = \int p_{eq}, (\partial_t h) (1 + \log h), dx = \int (\partial_t p) (1 + \log h), dx. $$ Using $\partial_{t} p = \nabla \cdot J^{*}$ with $J^{*} := T \nabla p + p \nabla V$ (the negative of the probability current; both sign conventions appear in the literature), $$ \frac{dH}{dt} = \int (\nabla \cdot J^) (1 + \log h), dx = -\int J^ \cdot \nabla \log h, dx, $$ where the constant- $1$ term gives a divergence-of-current integral that vanishes by mass conservation and the integration by parts has no boundary contribution under decay hypotheses. Substitute $J^{*} = T \nabla p + p \nabla V = T p_{e q} \nabla (p / p_{e q}) = T p_{e q} \nabla h$ : $$ \frac{dH}{dt} = -\int T p_{eq} (\nabla h) \cdot (\nabla \log h), dx = -T \int p_{eq} h |\nabla \log h|^2, dx = -T \int p |\nabla \log h|^2, dx, $$ using $\nabla lo g h = \nabla h / h$ . The right-hand side is $- T \cdot I (p ∥ p_{e q})$ , the Fisher information of $p$ relative to $p_{e q}$ . The integrand is non-negative, and zero iff $\nabla lo g h = 0$ almost everywhere on ${p > 0}$ , i.e. $h$ is constant on its support. Normalisation forces this constant to be $1$ , hence $p = p_{e q}$ . $□$

Proposition (Bakry-Émery $Γ_{2}$ identity for the Fokker-Planck generator). For $L = T Δ - \nabla V \cdot \nabla$ on smooth $f : R^{n} \to R$ , $$ \Gamma_2(f, f) = T^2 |\mathrm{Hess}(f)|^2 + T (\nabla f) \cdot \mathrm{Hess}(V) (\nabla f), $$ where $Γ (f, f) := T ∣\nabla f ∣^{2}$ and $Γ_{2} (f, f) := \frac{1}{2} (L Γ (f, f) - 2Γ (f, L f))$ . The inequality $Hess (V) \geq ρ I$ implies $Γ_{2} \geq ρ Γ$ , the CD( $ρ, \infty$ ) curvature-dimension condition.

Proof. Compute $L Γ (f, f) = L (T ∣\nabla f ∣^{2})$ . Expanding $L = T Δ - \nabla V \cdot \nabla$ : $$ \frac{1}{T} L|\nabla f|^2 = \Delta |\nabla f|^2 - \frac{1}{T}(\nabla V) \cdot \nabla |\nabla f|^2. $$ The Bochner formula gives $Δ∣\nabla f ∣^{2} = 2∣ Hess (f) ∣^{2} + 2\nabla f \cdot \nabla (Δ f)$ in Euclidean space. Substitute and use $L f = T Δ f - \nabla V \cdot \nabla f$ : $$ \Gamma(f, L f) = T \nabla f \cdot \nabla(T \Delta f - \nabla V \cdot \nabla f) = T^2 \nabla f \cdot \nabla \Delta f - T \nabla f \cdot \nabla(\nabla V \cdot \nabla f). $$ Putting these together (after a Bochner-style bookkeeping computation that organises the terms), $$ \Gamma_2(f, f) = T^2 |\mathrm{Hess}(f)|^2 + T \nabla f \cdot \mathrm{Hess}(V) \nabla f. $$ The first term is non-negative; the second is bounded below by $T ρ ∣\nabla f ∣^{2} = ρ Γ (f, f)$ when $Hess (V) \geq ρ I$ . Hence $Γ_{2} \geq ρ Γ$ under the curvature hypothesis. Integrating the pointwise inequality along the semigroup $e^{t L}$ (Bakry-Émery's interpolation argument) yields the LSI $H \leq (T /2 ρ) I$ . $□$

Proposition (uniqueness of the stationary distribution under coercivity). Let $V \in C^{2} (R^{n})$ be coercive in the sense $e^{- V / T} \in L^{1}$ . The only probability density $\tilde{p} \in L^{1}$ satisfying $L^ \tilde p = 0 $i s$ \tilde p = p_{eq} = Z^{-1} e^{-V/T}$.*

Proof. Suppose $\tilde{p} \geq 0$ , $\int \tilde{p} = 1$ , $L^{*} \tilde{p} = 0$ . By the H-theorem applied to the constant function $p (\cdot, t) \equiv \tilde{p}$ (a stationary solution): $\frac{d H ( p ~ ∥ p _{e q} )}{d t} = 0$ , hence $T \int \tilde{p} ∣\nabla lo g (\tilde{p} / p_{e q}) ∣^{2} d x = 0$ . The integrand is non-negative, so $\nabla lo g (\tilde{p} / p_{e q}) = 0$ almost everywhere on ${\tilde{p} > 0}$ , i.e. $\tilde{p} / p_{e q}$ is constant on connected components of its support. Connectedness of $R^{n}$ and the strict positivity of $p_{e q}$ make this constant a single global value, normalised to $1$ by $\int \tilde{p} = \int p_{e q} = 1$ . Hence $\tilde{p} = p_{e q}$ . $□$

Connections Master

Boltzmann distribution 08.01.03. The Gibbs / Boltzmann distribution $p_{e q} (x) = Z^{- 1} e^{- V (x) / T}$ is the unique reversible stationary density of the Fokker-Planck equation under coercive $V$ . The present unit derives the Boltzmann distribution from a dynamical principle — the long-time limit of the Langevin SDE — rather than from a microcanonical / canonical / grand-canonical bookkeeping. The two derivations agree at the level of the density and identify dynamical sampling with thermodynamic ensembles.
Path integral formulation of statistical mechanics 08.07.01. The path-integral / Onsager-Machlup form of the Langevin SDE expresses the Fokker-Planck transition density as a path integral over continuous trajectories weighted by the dissipative action $\int (\overset{x}{˙} + \nabla V)^{2} / (4 T) d t$ . The connection to the Euclidean / imaginary-time path integral of quantum mechanics is Wick rotation: the kinetic action $\int (\overset{x}{˙})^{2} d t /2$ becomes the Brownian-motion Gaussian weight $\int (\overset{x}{˙})^{2} / (4 T) d t$ after temperature absorption, and the potential becomes the drift. Both frameworks compute equilibrium correlations of the same Gibbs measure.
Wick rotation 08.09.01. The relation between the Fokker-Planck framework and quantum mechanics is Wick rotation: the imaginary-time Schrödinger equation $\partial_{τ} ψ = - H ψ$ for a quantum-mechanical Hamiltonian $H$ has the same form as a Fokker-Planck-like dissipative flow, and the operator $L^{*} = T Δ + \nabla \cdot (\cdot \nabla V)$ is conjugate to a Schrödinger Hamiltonian $- H_{Schr} = T Δ - V_{eff}$ via $p = p_{e q}^{1/2} ψ$ , where $V_{eff} = (∣\nabla V ∣^{2} /4 T) - (Δ V) /2$ is the supersymmetric Witten partner potential. This is the Schrödinger-equation route to Fokker-Planck spectral theory.
Gaussian field theory and free boson 08.06.01. The Parisi-Wu stochastic-quantisation generalisation of the Fokker-Planck framework to field theory has the free-boson Gaussian measure $d μ_{C}$ with covariance $C = (- Δ + m^{2})^{- 1}$ as its equilibrium under the functional Langevin equation $\partial_{τ} ϕ = (Δ - m^{2}) ϕ + 2 η$ . The Fokker-Planck flow on field configurations equilibrates to the Gaussian free-field measure, which is the Euclidean Wick rotation of the Fock-space construction of 08.10.01.
Bosonic Fock space and second quantisation 08.10.01. The Gaussian-Fock correspondence identifies the operator-side vacuum expectations on $F_{s} (H_{m})$ with the measure-side Gaussian moments of $d μ_{C}$ via Isserlis-Wick. Stochastic quantisation generates $d μ_{C}$ as a Fokker-Planck equilibrium, providing a third equivalent route to the same free-field correlation functions: canonical Fock-space quantisation (operator algebra), Euclidean path-integral (measure theory), and stochastic quantisation (SDE). The three frameworks meet at the Gibbs equilibrium of a Langevin equation in a fictitious fifth time.
Free energy 08.01.04. The negative log of the partition function $Z = \int e^{- V / T} d x$ — the free energy $F = - T lo g Z$ — is the constant that normalises the Gibbs density and the leading-large-time term in the path-integral expression of the transition density. The Fokker-Planck framework recovers $F$ as the limit of $- T lo g \int p (\cdot, t) d x$ corrections along the dissipative flow.

Historical & philosophical context Master

Adriaan Fokker derived the diffusion equation that bears his name in his 1914 paper Die mittlere Energie rotierender elektrischer Dipole im Strahlungsfeld (Ann. Phys. (Leipzig) 348, 810) ^{[Fokker 1914]}, working on the angular distribution of radiating electric dipoles in thermal radiation. The equation appeared as a drift-diffusion equation for the angular probability density, with a drift term proportional to the torque and a diffusion term proportional to the radiation field. Max Planck, in his 1917 paper Über einen Satz der statistischen Dynamik und seine Erweiterung in der Quantentheorie (Sitzungsber. Preuss. Akad. Wiss. Berlin 24, 324) ^{[Planck 1917]}, extended Fokker's derivation to a general continuous Markov process by truncating the Kramers-Moyal expansion of the master equation at second order. The combined equation $\partial_{t} p = - \partial_{x} (D_{1} p) + \frac{1}{2} \partial_{x}^{2} (D_{2} p)$ has carried the joint Fokker-Planck name since.

The mathematical foundation was laid by Andrey Kolmogorov in his 1931 paper Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung (Math. Ann. 104, 415) ^{[Kolmogorov 1931]}. Kolmogorov derived the Fokker-Planck equation (which he called the forward equation) and its dual the backward equation from the Chapman-Kolmogorov consistency requirement on the transition probabilities of a continuous-time Markov process, identifying the conditions on the moments of the increments that make the truncation at second order exact. Kolmogorov's framework treats the Fokker-Planck equation as a derived consequence of probability axioms; the heuristic Fokker-Planck construction of 1914-1917 became a theorem about Markov diffusion semigroups. Kiyosi Itô's 1944 paper Stochastic integral (Proc. Imp. Acad. Tokyo 20, 519) ^{[Itô 1944]} introduced the stochastic integral against Brownian motion and the Itô formula, providing the modern derivation route: write an SDE for the trajectory $x_{t}$ , apply Itô's formula to an observable, take expectations, and the Fokker-Planck equation appears as the equation for the density via integration by parts.

The stochastic-mechanics applications were classical by the 1940s. Subrahmanyan Chandrasekhar's monumental 1943 Stochastic problems in physics and astronomy (Rev. Mod. Phys. 15, 1) ^{[Chandrasekhar 1943]} surveyed Brownian motion, the Langevin equation, the Ornstein-Uhlenbeck process, and the Fokker-Planck equation as the unified language of nonequilibrium statistical mechanics. Hans Risken's The Fokker-Planck Equation (1st ed. 1984, 2nd ed. 1989) ^[Risken] and Crispin Gardiner's Handbook of Stochastic Methods (1st ed. 1983) ^[Gardiner] became the standard physicists' references. The quantitative theory of convergence to equilibrium developed in parallel: Leonard Gross's 1975 paper introduced the logarithmic Sobolev inequality for the Ornstein-Uhlenbeck process; Dominique Bakry and Michel Émery's 1985 Diffusions hypercontractives (Séminaire de Probabilités XIX) ^{[Bakry-Emery 1985]} gave the curvature-dimension criterion CD( $ρ, \infty$ ) under which the LSI holds with sharp constant, and Cédric Villani's 2009 Hypocoercivity (Memoirs AMS 202) ^{[Villani 2009]} extended the entropy-method machinery to degenerate Fokker-Planck operators of kinetic-theory type.

The field-theoretic chapter opened with Giorgio Parisi and Yong-Shi Wu's 1981 paper Perturbation theory without gauge fixing (Sci. Sin. 24, 483) ^{[Parisi-Wu 1981]}. Parisi-Wu's construction promotes the configuration variable $x \in R^{n}$ to a field configuration $ϕ (x) \in S^{'} (R^{d})$ and the potential $V$ to the Euclidean action $S [ϕ]$ , replacing the finite-dimensional Langevin SDE with a functional Langevin equation in a fictitious fifth time. The Fokker-Planck flow on field configurations equilibrates to the Euclidean Gibbs measure $e^{- S} / Z$ , the path-integral measure of 08.07.01. The construction bypasses the Faddeev-Popov gauge-fixing procedure in non-abelian gauge theories — the Langevin flow lives on the orbit space — and provides a stochastic regularisation scheme alongside lattice and dimensional regularisation. Damgaard-Hüffel's 1987 Stochastic quantization (Phys. Rep. 152, 227) ^{[Damgaard-Huffel 1987]} reviewed the programme. The Fokker-Planck equation has thus been, since 1914, a constant presence at the interface between probability, statistical mechanics, and quantum field theory.

Bibliography Master

@article{Fokker1914,
  author  = {Fokker, A. D.},
  title   = {Die mittlere {E}nergie rotierender elektrischer {D}ipole im {S}trahlungsfeld},
  journal = {Annalen der Physik (Leipzig)},
  volume  = {348},
  year    = {1914},
  pages   = {810--820}
}

@article{Planck1917,
  author  = {Planck, M.},
  title   = {{\"U}ber einen {S}atz der statistischen {D}ynamik und seine {E}rweiterung in der {Q}uantentheorie},
  journal = {Sitzungsberichte der Preussischen {A}kademie der Wissenschaften, Berlin},
  volume  = {24},
  year    = {1917},
  pages   = {324--341}
}

@article{Kolmogorov1931,
  author  = {Kolmogorov, A. N.},
  title   = {{\"U}ber die analytischen {M}ethoden in der {W}ahrscheinlichkeitsrechnung},
  journal = {Mathematische Annalen},
  volume  = {104},
  year    = {1931},
  pages   = {415--458}
}

@article{Ito1944,
  author  = {It{\^o}, Kiyosi},
  title   = {Stochastic integral},
  journal = {Proceedings of the Imperial {A}cademy of {T}okyo},
  volume  = {20},
  year    = {1944},
  pages   = {519--524}
}

@article{ParisiWu1981Stoch,
  author  = {Parisi, Giorgio and Wu, Yong-Shi},
  title   = {Perturbation theory without gauge fixing},
  journal = {Scientia Sinica},
  volume  = {24},
  year    = {1981},
  pages   = {483--496}
}

@incollection{BakryEmery1985,
  author    = {Bakry, Dominique and {\'E}mery, Michel},
  title     = {Diffusions hypercontractives},
  booktitle = {S{\'e}minaire de Probabilit{\'e}s XIX 1983/84},
  series    = {Lecture Notes in Mathematics},
  volume    = {1123},
  publisher = {Springer},
  year      = {1985},
  pages     = {177--206}
}

@article{Chandrasekhar1943,
  author  = {Chandrasekhar, Subrahmanyan},
  title   = {Stochastic problems in physics and astronomy},
  journal = {Reviews of Modern Physics},
  volume  = {15},
  year    = {1943},
  pages   = {1--89}
}

@book{Risken1989,
  author    = {Risken, Hans},
  title     = {The {F}okker-{P}lanck Equation: Methods of Solution and Applications},
  edition   = {2},
  publisher = {Springer},
  series    = {Springer Series in Synergetics},
  volume    = {18},
  year      = {1989}
}

@book{Gardiner2004,
  author    = {Gardiner, Crispin W.},
  title     = {Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences},
  edition   = {3},
  publisher = {Springer},
  year      = {2004}
}

@book{vanKampen2007,
  author    = {van Kampen, N. G.},
  title     = {Stochastic Processes in Physics and Chemistry},
  edition   = {3},
  publisher = {North-Holland},
  year      = {2007}
}

@book{Pavliotis2014,
  author    = {Pavliotis, Grigorios A.},
  title     = {Stochastic Processes and Applications: Diffusion Processes, the {F}okker-{P}lanck and {L}angevin Equations},
  publisher = {Springer},
  series    = {Texts in Applied Mathematics},
  volume    = {60},
  year      = {2014}
}

@book{Villani2009Hypocoercivity,
  author    = {Villani, C{\'e}dric},
  title     = {Hypocoercivity},
  publisher = {American Mathematical Society},
  series    = {Memoirs of the AMS},
  volume    = {202},
  year      = {2009}
}

@article{MarkowichVillani2000,
  author  = {Markowich, Peter A. and Villani, C{\'e}dric},
  title   = {On the trend to equilibrium for the {F}okker-{P}lanck equation: an interplay between physics and functional analysis},
  journal = {Matem{\'a}tica Contempor{\^a}nea},
  volume  = {19},
  year    = {2000},
  pages   = {1--29}
}

@article{DamgaardHuffel1987,
  author  = {Damgaard, Poul H. and H{\"u}ffel, Helmuth},
  title   = {Stochastic quantization},
  journal = {Physics Reports},
  volume  = {152},
  year    = {1987},
  pages   = {227--398}
}

@article{Gross1975LSI,
  author  = {Gross, Leonard},
  title   = {Logarithmic {S}obolev inequalities},
  journal = {American Journal of Mathematics},
  volume  = {97},
  year    = {1975},
  pages   = {1061--1083}
}

Prerequisites

08.01.01
08.01.03
08.01.04
08.06.01
08.07.01
08.09.01
02.11.08

Tier anchors

beginner: Risken, *The Fokker-Planck Equation: Methods of Solution and Applications*, 2nd ed. (Springer, 1989), Ch. 1 informal opening; Gardiner, *Handbook of Stochastic Methods*, 3rd ed. (Springer, 2004), Ch. 1; van Kampen, *Stochastic Processes in Physics and Chemistry*, 3rd ed. (North-Holland, 2007), Chs. I-III
intermediate: Risken, Chs. 3-6 (drift-diffusion derivation, stationary solutions, Ornstein-Uhlenbeck); Gardiner Chs. 4-5; Pavliotis, *Stochastic Processes and Applications: Diffusion Processes, the Fokker-Planck and Langevin Equations* (Springer, 2014), Chs. 3-4
master: Kolmogorov, *Math. Ann.* 104, 415 (1931) (forward/backward equations); Itô, *Proc. Imp. Acad. Tokyo* 20, 519 (1944) (stochastic integral); Parisi-Wu, *Sci. Sin.* 24, 483 (1981) (stochastic quantisation as Langevin route to Euclidean field theory); Bakry-Émery, *Séminaire de Probabilités XIX*, Springer LNM 1123, 177 (1985) (log-Sobolev inequalities via curvature-dimension); Markowich-Villani, *Mat. Contemp.* 19, 1 (2000); Villani, *Mem. AMS* 202 (2009), Ch. 1 (hypocoercivity)

References

Fokker, A. D. — Die mittlere Energie rotierender elektrischer Dipole im Strahlungsfeld · Ann. Phys. (Leipzig) 348, 810 (1914) — drift-diffusion equation for the angular distribution of radiating dipoles, the originator paper of the equation now bearing his name
Planck, M. — Über einen Satz der statistischen Dynamik und seine Erweiterung in der Quantentheorie · Sitzungsber. Preuss. Akad. Wiss. Berlin 24, 324 (1917) — general derivation of the diffusion equation for the probability density of a stochastic process from a master equation, second originator of the Fokker-Planck equation
Kolmogorov, A. N. — Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung · Math. Ann. 104, 415 (1931) — the mathematical foundation: Chapman-Kolmogorov equation, forward (Fokker-Planck) and backward equations, transition probabilities of a continuous Markov process
Itô, K. — Stochastic integral · Proc. Imp. Acad. Tokyo 20, 519 (1944) — the Itô integral and the chain rule (Itô formula) that powers the modern stochastic-calculus derivation of the Fokker-Planck equation from the Langevin SDE
Parisi, G. & Wu, Y.-S. — Perturbation theory without gauge fixing · Sci. Sin. 24, 483 (1981) — stochastic quantisation: a Langevin equation in a fictitious fifth time has the Euclidean Gibbs measure $e^{-S[\phi]}/Z$ as its equilibrium under the Fokker-Planck flow, providing an alternative quantisation route bypassing gauge fixing
Bakry, D. & Émery, M. — Diffusions hypercontractives · Séminaire de Probabilités XIX 1983/84, Springer Lecture Notes in Mathematics 1123, 177 (1985) — the curvature-dimension criterion CD($\rho, \infty$) for log-Sobolev inequalities, giving sharp exponential convergence rates for the Fokker-Planck flow
Risken, H. — The Fokker-Planck Equation: Methods of Solution and Applications, 2nd ed. · Springer, 1989 — Ch. 1 motivation, Ch. 3 derivation from master equation, Ch. 5 stationary solutions and detailed balance, Ch. 6 Ornstein-Uhlenbeck and harmonic-oscillator example, Ch. 5 §5.4 eigenfunction expansion and spectral gap
Gardiner, C. W. — Handbook of Stochastic Methods, 3rd ed. · Springer, 2004 — Ch. 4 (Itô calculus and the SDE/FPE correspondence); Ch. 5 (Fokker-Planck equation, stationary solutions, detailed balance)
van Kampen, N. G. — Stochastic Processes in Physics and Chemistry, 3rd ed. · North-Holland, 2007 — Chs. VIII-IX (master equation, Fokker-Planck equation from Kramers-Moyal expansion, mean first-passage times)
Pavliotis, G. A. — Stochastic Processes and Applications · Texts in Applied Mathematics 60 (Springer, 2014) — Chs. 3-4 (Itô SDE to Fokker-Planck equation, stationary solutions, generator and adjoint); Ch. 4 §4.4 (spectral gap and exponential convergence); Ch. 4 §4.6 (log-Sobolev and hypercontractivity)
Villani, C. — Hypocoercivity · Memoirs of the American Mathematical Society 202 (2009) — Ch. 1 (entropy production, log-Sobolev, exponential convergence for Fokker-Planck in the elliptic case); Ch. 3 (the kinetic case where the diffusion is degenerate)
Chandrasekhar, S. — Stochastic problems in physics and astronomy · Rev. Mod. Phys. 15, 1 (1943) — classical exposition of Brownian motion, Langevin equation, Fokker-Planck for the Ornstein-Uhlenbeck process, harmonic-oscillator solution
Damgaard, P. H. & Hüffel, H. — Stochastic quantization · Phys. Rep. 152, 227 (1987) — the comprehensive review of Parisi-Wu stochastic quantisation, including the Fokker-Planck approach to the Euclidean path-integral measure and the connection to Langevin field equations
Markowich, P. A. & Villani, C. — On the trend to equilibrium for the Fokker-Planck equation: an interplay between physics and functional analysis · Matemática Contemporânea 19, 1 (2000) — review of entropy methods, log-Sobolev, and rate-of-convergence theorems for the Fokker-Planck equation

Estimated time

beginner: 22m
intermediate: 50m
master: 90m