02.15.03 · analysis / stochastic-analysis

Stochastic differential equations, diffusions, and the infinitesimal generator

shipped3 tiersLean: none

Anchor (Master): Karatzas-Shreve Brownian Motion and Stochastic Calculus Ch. 5 (strong/weak solutions, martingale problem); Stroock-Varadhan Multidimensional Diffusion Processes (1979); Friedman Stochastic Differential Equations and Applications Vol. 1

Intuition Beginner

Picture a particle pushed along by a steady wind and, at the same time, kicked around by random jitter. The wind is predictable: at each location it has a definite direction and strength. The jitter is not: it comes from countless tiny unseen collisions, the same buffeting that makes a speck of pollen tremble in still water. A stochastic differential equation is the rule that blends these two forces into a single recipe for how the particle moves moment to moment.

Over a short interval the particle drifts a little in the wind's direction and also takes a small random step whose size depends on where it is. Add up all these short moves and you get a jagged, never-smooth path. Run the same recipe twice from the same start and you get two different paths, because the random kicks differ each time. What does not differ is the statistical pattern: the cloud of possible paths spreads in a way fixed entirely by the wind field and the jitter field.

The remarkable fact this unit builds is that this cloud of random paths is governed by an ordinary, deterministic equation for its probabilities. The randomness lives in the individual path; the law of the cloud is as orderly as a heat-spreading rule.

Visual Beginner

A two-dimensional plane carries a faint field of arrows showing the wind that pushes the particle. Several jagged trajectories all start from one dot and wander outward, each taking a different jittery route, fanning into a spreading cloud. Beside the cloud a smooth grey shading shows where the particle is likely to be found at a fixed later time: dark where many paths pile up, pale at the fringes.

The picture holds the whole story in one frame. On the left, the individual jagged paths show the random motion of one particle: rough, unrepeatable, pushed by the arrows and shaken by noise. On the right, the smooth shading shows the law of the whole ensemble: a density that flows and spreads in a perfectly definite way. The bridge between the rough left and the smooth right is the object this unit is about.

Worked example Beginner

Take the simplest jitter rule with a pull toward the origin: a particle on a line that is tugged back to zero in proportion to how far it has strayed, while being kicked by noise of constant strength. The wind at position points toward zero with strength proportional to ; the jitter has the same size everywhere.

Start the particle at . The pull tries to bring it home to zero, so on average the particle drifts back toward the origin and settles into hovering near it. But the kicks never stop, so it never sits exactly at zero — it rattles around in a small region whose width is set by the contest between how hard the wind pulls and how hard the noise shakes.

Now run a thousand copies of this particle from . Early on they all lean back toward zero together. After a while the thousand positions form a bell-shaped cloud centred at zero, neither growing nor shrinking: a balance has been struck. That settled bell shape is the long-run law of the motion. We did not have to follow any single rattling path to find it — the balance of pull against shake fixed the shape directly, which is the first hint that an orderly equation controls the cloud.

Check your understanding Beginner

Formal definition Intermediate+

Fix a filtered probability space carrying an -dimensional -Brownian motion , with the Itô integral and Itô's formula as developed in 02.15.02. Let $$ b : \mathbb{R}^n \to \mathbb{R}^n, \qquad \sigma : \mathbb{R}^n \to \mathbb{R}^{n \times m} $$ be the drift and dispersion coefficients. A stochastic differential equation (SDE) is the integral relation written informally as $$ dX_t = b(X_t), dt + \sigma(X_t), dB_t, $$ whose rigorous meaning is the requirement that an -adapted, continuous process satisfy, -almost surely and for every , $$ X_t = X_0 + \int_0^t b(X_s), ds + \int_0^t \sigma(X_s), dB_s, $$ the last term an Itô integral.

Definition (strong solution). Given the Brownian motion and an -measurable initial condition , a strong solution is a process as above, adapted to the augmented filtration generated by and , satisfying the integral equation with on every . Pathwise uniqueness holds when any two strong solutions on the same space with satisfy .

Definition (weak solution and the martingale problem). A weak solution is a pair on some filtered space solving the integral equation; here the Brownian motion is part of the output, not the input. Equivalently (Stroock-Varadhan), a probability measure on path space with solves the martingale problem for the operator below if, for every , $$ M_t^f := f(X_t) - f(x) - \int_0^t (Lf)(X_s), ds $$ is an -martingale under .

Definition (infinitesimal generator). With the (symmetric, positive-semidefinite) diffusion matrix, the generator of the SDE is the second-order operator $$ L = \tfrac{1}{2} \sum_{i,j=1}^n a^{ij}(x), \frac{\partial^2}{\partial x_i \partial x_j} + \sum_{i=1}^n b^i(x), \frac{\partial}{\partial x_i}, $$ acting on functions. When is positive-definite at every , is a (non-divergence-form) elliptic operator of the kind studied through the maximum principle in 02.13.01; this is the diffusion-to-elliptic-operator correspondence the unit develops.

Counterexamples to common slips

  • Strong existence is strictly stronger than weak existence. Tanaka's equation has a weak solution (a Brownian motion, since ) but no strong solution and pathwise uniqueness fails — Lipschitz continuity of is what this example lacks.
  • The generator depends on only through . Two different dispersion matrices with the same produce diffusions with identical laws; the SDE carries more information than its generator, but the law of the process does not.
  • Without the linear-growth bound, solutions can explode in finite time: (no noise) already blows up. Linear growth is what confines the solution to all of .

Key theorem with proof Intermediate+

Theorem (strong existence and pathwise uniqueness; Itô). Suppose and are globally Lipschitz and of linear growth: there is with $$ |b(x) - b(y)| + |\sigma(x) - \sigma(y)| \le K|x - y|, \qquad |b(x)| + |\sigma(x)| \le K(1 + |x|), $$ for all . Let be -measurable with . Then the SDE has a strong solution , unique up to indistinguishability, with for every .

Proof. Fix a horizon and work in the Banach space of continuous adapted processes with norm . Define the Picard map $$ (\Phi Y)t = X_0 + \int_0^t b(Y_s), ds + \int_0^t \sigma(Y_s), dB_s, $$ the exact stochastic analogue of the deterministic iteration of 02.08.01. We estimate . For the drift term, the Cauchy-Schwarz inequality and Lipschitz continuity give $$ \mathbb{E}\sup{s \le t}\Big| \int_0^s (b(Y_r) - b(Z_r)), dr \Big|^2 \le t, K^2 \int_0^t \mathbb{E}\sup_{r \le u}|Y_r - Z_r|^2, du . $$ For the dispersion term, Doob's maximal inequality followed by the Itô isometry 02.15.02 converts the supremum of the stochastic integral into a time integral of the integrand's square: $$ \mathbb{E}\sup_{s \le t}\Big| \int_0^s (\sigma(Y_r) - \sigma(Z_r)), dB_r \Big|^2 \le 4, K^2 \int_0^t \mathbb{E}\sup_{r \le u}|Y_r - Z_r|^2, du . $$ Writing and , the two bounds combine to with . Iterating this inequality times along produces the factorially small bound , so for fixed is a contraction once is large. The Banach fixed-point theorem then yields a unique fixed point in , the strong solution; the linear-growth bound, fed through the same isometry and Gronwall's lemma, gives the moment estimate , ruling out explosion.

Pathwise uniqueness is the , case of the same estimate: forces , and Gronwall gives . Patching the unique solutions across , extends to all of .

Bridge. This existence theorem builds toward the entire diffusion theory of the Master tier and appears again in 02.15.04, where the same solution flow is run against a potential to produce the Feynman-Kac representation. The foundational reason the proof works is that the Itô isometry turns the unfamiliar stochastic integral into a familiar time-integral, so the whole argument becomes the deterministic Picard-Lindelöf contraction of 02.08.01 with one extra term controlled by Doob's inequality — this is exactly the deterministic existence proof with a noise correction. Pathwise uniqueness generalises ODE uniqueness, and the Markov property we extract next is dual to the elliptic operator of 02.13.01: the central insight, putting these together, is that the random flow and the deterministic operator are two faces of one object, the bridge being Dynkin's formula, which equates an expectation over paths with a time-integral of .

Exercises Intermediate+

Advanced results Master

The strong-existence theorem makes the solution flow into a Markov family: for Lipschitz coefficients the solution started at satisfies , where is the transition semigroup. Pathwise uniqueness is what makes the flow consistent — restarting at reproduces the same law — and the moment bound makes a Feller semigroup on . The infinitesimal generator is then literally the derivative of this semigroup, on a suitable domain, and Dynkin's formula is the integrated form of that derivative. This is Oksendal Ch. 7 and Dynkin's Markov Processes (1965); it is the precise sense in which a diffusion is a second-order elliptic operator wearing probabilistic clothes.

The martingale problem of Stroock and Varadhan reorganises the entire theory around the generator alone. Rather than build from a Brownian motion, one seeks a measure on path space under which is a martingale for all test . Existence of a solution to the martingale problem is equivalent to weak existence for the SDE; well-posedness (existence and uniqueness) of the martingale problem is equivalent to the diffusion being a strong Markov Feller process. Stroock and Varadhan proved well-posedness under merely continuous, non-degenerate and bounded measurable — far weaker than Lipschitz — by an elliptic-regularity argument on itself. This is the deepest payoff of the operator viewpoint: uniqueness of the process is reduced to uniqueness for a partial differential equation, namely the resolvent equation , whose solvability is governed by the maximum principle of 02.13.01.

The two Kolmogorov equations are the analytic shadow of the probabilistic semigroup. Writing , differentiating the semigroup gives the backward equation , — an evolution equation in the starting point, with the generator . Dually, the forward (Fokker-Planck) equation governs the density of , with the formal adjoint in divergence-correcting form. Kolmogorov derived both in 1931 by purely analytic means, before Itô built the paths; the SDE is the path-space realisation of Kolmogorov's analytic diffusion. When the diffusion is Brownian motion, and both equations collapse to the heat equation of 02.13.03 — self-adjointness erases the distinction between forward and backward, and the transition density is the Gaussian heat kernel. Every diffusion is thus a variable-coefficient, possibly non-self-adjoint, generalisation of heat flow.

A final structural fact: positivity of (non-degeneracy) is exactly uniform ellipticity of , and it buys smoothing. Hörmander's theorem extends this to degenerate provided the drift and dispersion vector fields, together with their iterated Lie brackets, span the tangent space — a bracket-generating condition tying diffusion smoothing to the geometry of 02.12.01's vector fields. Where ellipticity holds, the transition density exists, is smooth, and the Dirichlet problem is solved probabilistically by the exit-distribution average of Exercise 5.

Synthesis. The infinitesimal generator is the foundational reason a jagged, unrepeatable random path is governed by smooth deterministic equations: packages the drift and diffusion into a single second-order elliptic operator, and the entire analytic theory of the diffusion is read off from it. This is exactly the duality that converts the probabilistic object — the Markov family of solution flows — into the analytic object — the semigroup and its two Kolmogorov equations; the central insight is that strong existence (Picard plus the Itô isometry) and elliptic regularity (the maximum principle on ) are the same uniqueness theorem viewed from the path side and the operator side. Putting these together, Dynkin's formula is the bridge: it equates a path-average with a time-integral of , and from it flow the Feynman-Kac formula of 02.15.04, the probabilistic solution of the Dirichlet problem, and the collapse to the heat equation of 02.13.03 when . The martingale problem generalises strong existence to merely continuous coefficients, and the bracket-generating condition generalises ellipticity to the degenerate case; what is invariant across all of it is that a diffusion is an elliptic operator made dynamical, the probabilistic face of the analysis built in 02.13.01 and 02.13.03.

Full proof set Master

The strong existence and pathwise uniqueness theorem is proved in full in the Key theorem section. The remaining Master claims are recorded here.

Proposition (Dynkin's formula). Let solve the SDE with Lipschitz, linear-growth coefficients and generator . For and a stopping time with , $$ \mathbb{E}x[f(X\tau)] - f(x) = \mathbb{E}_x\Big[ \int_0^\tau (Lf)(X_s), ds \Big]. $$

Proof. Apply Itô's formula 02.15.02 to . Using and the quadratic covariation with , $$ f(X_t) = f(x) + \int_0^t (Lf)(X_s), ds + \int_0^t \sum_{i,k} \partial_i f(X_s), \sigma^{ik}(X_s), dB_s^k , $$ the terms assembling precisely into by the definition of . The stochastic integral is a martingale with , because has compact support and has linear growth, so the integrand is bounded and the Itô isometry gives . Since makes and bounded, is a uniformly integrable martingale on once (the dominating bound is ). Optional stopping yields , and taking expectations in the Itô identity at time gives the claim.

Proposition (semigroup property and the backward equation). Let for . Then is a contraction semigroup, , and for the function solves the Kolmogorov backward equation with .

Proof. The Markov property of the unique solution flow gives, for , $$ P_{t+s}f(x) = \mathbb{E}x[f(X{t+s})] = \mathbb{E}_x\big[\mathbb{E}x[f(X{t+s}) \mid \mathcal{F}_t]\big] = \mathbb{E}_x[(P_s f)(X_t)] = P_t(P_s f)(x), $$ the inner equality by the Markov property and pathwise uniqueness (restarting at gives a solution with the law of ). Contraction is . For the generator, apply Dynkin's formula on the deterministic time : , using Fubini and the semigroup. Differentiating at , ; and by the semigroup commuting with on , . Hence satisfies , .

Proposition (forward equation as the adjoint evolution). If has a density for each and $L^{}L^2Lp\partial_t p = L^{}p$ weakly.

Proof. For , the backward equation gives , i.e. . Integrating by parts to move the derivatives off onto — the boundary terms vanish since has compact support — the right side equals , where . Thus for all test , which is the weak form of .

Connections Master

First-order ODE existence and uniqueness via Picard iteration 02.08.01 is the deterministic skeleton this unit dresses in noise. The Picard map , the contraction estimate, and the Banach fixed-point argument are imported verbatim; the only new ingredient is the stochastic integral term, controlled by the Itô isometry and Doob's inequality. Setting collapses the strong-existence theorem back to Picard-Lindelöf, so the SDE theory is a strict extension of the ODE theory, and the linear-growth no-explosion bound is the stochastic counterpart of the ODE global-existence criterion.

Phase space, vector fields, and integral curves 02.12.01 supply the geometric meaning of the drift. The drift is a vector field on and the zero-noise solution is exactly its integral curve; the diffusion is the integral curve perturbed by Brownian forcing. The bracket-generating (Hörmander) condition for hypoellipticity is a statement about the Lie brackets of the drift and dispersion vector fields, tying the smoothing of the transition density directly to the vector-field geometry developed there.

The Laplace equation, harmonic functions, and the maximum principle 02.13.01 are the elliptic half of the diffusion-operator correspondence. The generator with positive-definite is a non-divergence-form elliptic operator, the Dirichlet problem is solved probabilistically by the exit-distribution average of Exercise 5, and the maximum principle that governs harmonic functions is what powers Stroock-Varadhan uniqueness for the martingale problem. When and , and harmonic functions are exactly the -harmonic functions, with the mean-value property becoming the martingale property of .

The heat equation, heat kernel, and Duhamel's principle 02.13.03 are the special case of the backward and forward equations. There both Kolmogorov equations collapse to , the transition density is the Gaussian heat kernel, and the diffusion is Brownian motion. Every diffusion in this unit is therefore a variable-coefficient generalisation of the heat flow studied there, with the generator playing the role plays for the heat kernel.

The Itô integral and Itô's formula 02.15.02, co-produced in this wave, are the machinery the whole unit runs on. The Itô isometry is the single estimate that makes the Picard contraction close; Itô's formula is what turns into , the identity behind both Dynkin's formula and the martingale-problem reformulation. This unit is the natural sequel: it takes the calculus of 02.15.02 and uses it to solve equations rather than merely differentiate along paths.

The Feynman-Kac formula 02.15.04 is the next step. It runs the very solution flow constructed here against a potential , so that solves — Dynkin's formula with a killing term. The existence and Markov property established in this unit are exactly the hypotheses that make that representation well-defined.

Historical & philosophical context Master

The diffusion equations came before the diffusion paths. Andrei Kolmogorov, in his 1931 Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung (Mathematische Annalen 104, 415–458) [Kolmogorov 1931], derived the forward and backward parabolic equations for a Markov transition probability by purely analytic means, characterising a diffusion through its infinitesimal mean (drift) and infinitesimal variance (the matrix ) without ever constructing a single trajectory. Kiyosi Itô's 1951 memoir On stochastic differential equations (Memoirs of the AMS 4, 1–51) [Ito 1951] supplied the missing object: a pathwise integral against Brownian motion and a fixed-point proof building the trajectories Kolmogorov's equations only described statistically. The generator and the expectation formula equating with a time-integral of were systematised by Eugene Dynkin in Markov Processes (1965) [Dynkin 1965], completing the dictionary between the analytic and probabilistic descriptions.

The philosophical content is a duality between two ways of knowing the same process. Kolmogorov's analytic picture treats a diffusion as an evolution of densities, governed by a deterministic PDE; Itô's pathwise picture treats it as a random trajectory, built by integrating noise. Neither is prior: Dynkin's formula is the precise hinge on which one rotates into the other, and the Stroock-Varadhan martingale problem (1979) [Stroock-Varadhan 1979] shows that the generator alone — the analytic datum — often determines the process uniquely, with no reference to any particular Brownian driver. This is why a diffusion can be characterised three equivalent ways: as a strong solution of an SDE, as a weak solution, or as a solution of a martingale problem. The lesson recurring through the lineage is that the right invariant of a diffusion is not its sample paths, nor its driving noise, but the second-order elliptic operator that generates it; everything else is a representation. That operator-centred view is what later carried diffusion theory into the geometry of manifolds, where becomes the Laplace-Beltrami operator and Brownian motion becomes intrinsic to the metric.

Bibliography Master

@book{oksendal2003sde,
  author    = {{\O}ksendal, Bernt},
  title     = {Stochastic Differential Equations: An Introduction with Applications},
  edition   = {6th},
  publisher = {Springer-Verlag, Berlin},
  year      = {2003}
}

@book{karatzasshreve1991,
  author    = {Karatzas, Ioannis and Shreve, Steven E.},
  title     = {Brownian Motion and Stochastic Calculus},
  edition   = {2nd},
  series    = {Graduate Texts in Mathematics},
  volume    = {113},
  publisher = {Springer-Verlag, New York},
  year      = {1991}
}

@book{stroockvaradhan1979,
  author    = {Stroock, Daniel W. and Varadhan, S. R. Srinivasa},
  title     = {Multidimensional Diffusion Processes},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {233},
  publisher = {Springer-Verlag, Berlin},
  year      = {1979}
}

@article{ito1951sde,
  author  = {It\^o, Kiyosi},
  title   = {On stochastic differential equations},
  journal = {Memoirs of the American Mathematical Society},
  volume  = {4},
  pages   = {1--51},
  year    = {1951}
}

@article{kolmogorov1931,
  author  = {Kolmogorov, Andrei N.},
  title   = {\"Uber die analytischen Methoden in der Wahrscheinlichkeitsrechnung},
  journal = {Mathematische Annalen},
  volume  = {104},
  pages   = {415--458},
  year    = {1931}
}

@book{dynkin1965,
  author    = {Dynkin, Eugene B.},
  title     = {Markov Processes (Volume I)},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {121},
  publisher = {Springer-Verlag, Berlin},
  year      = {1965}
}