The Stratonovich Integral and Stratonovich Calculus
Anchor (Master): Hsu 2002 *Stochastic Analysis on Manifolds* (AMS, GSM 38) Ch. 1-2; Ikeda-Watanabe 1989 *Stochastic Differential Equations and Diffusion Processes* (North-Holland, 2nd ed.) Ch. III-V
Intuition Beginner
Imagine you are adding up tiny pushes from a jittery, unpredictable wind to figure out how far a leaf drifts. To turn the pushes into a total you have to decide when to read the wind's strength during each tiny step: at the start of the step, at the end, or in the middle. For an ordinary smooth breeze the choice would not matter, because over a tiny interval the wind barely changes. Random noise is rougher than that, so the choice genuinely changes the answer.
The Stratonovich integral makes one specific choice: read the strength at the middle of each step and split the difference. That single decision has a beautiful payoff. The familiar rules of calculus you already trust — the chain rule, the product rule — keep working with no extra correction terms. You compute as if the noise were a smooth wind, and the bookkeeping comes out right.
There is a competing choice, the Itô integral, which reads the wind at the start of each step. Itô is the natural choice for gambling and finance, where you must bet before you see the outcome. Stratonovich is the natural choice for physics and geometry, where you want your equations to look the same no matter what coordinates you use to describe space.
Visual Beginner
The picture shows a rough random path and zooms in on one short step. Three dots mark where you could read the random signal's strength: the left end, the middle, and the right end. Itô uses the left dot. Stratonovich uses the middle dot — really the average of the two endpoints. For a smooth curve all three dots sit at almost the same height, so the three rules agree. For the rough random path the dots sit at visibly different heights, and that gap is the reason the two calculi give different answers.
Worked example Beginner
Take the simplest interesting case: a random signal that starts at , and we want to total up the quantity "current height times the next push", written informally as the running sum of times its own increments. Suppose over a unit of time the signal happens to move from height up to height , and we slice the time into many equal steps.
The Itô total (read height at the start of each step) and the Stratonovich total (read height at the middle) differ by a fixed amount. For this self-referential case the Stratonovich answer minus the Itô answer equals one half of the total "wiggle" the path accumulated. Over one unit of time a standard random signal accumulates exactly unit of wiggle on average, so the gap is .
Concretely, the Itô rule gives . The Stratonovich rule gives the clean , exactly what the ordinary calculus formula "integral of is " would predict.
What this tells us: the Stratonovich answer is the one that obeys the schoolbook rule with no correction. That convenience is the whole point — the price is paid elsewhere, in how you set up the problem.
Check your understanding Beginner
Formal definition Intermediate+
Fix a filtered probability space satisfying the usual conditions and a one-dimensional Brownian motion adapted to it. Let be a continuous -adapted process, and more generally let be continuous semimartingales. We use the convention that denotes the quadratic covariation, the limit in probability of along refining partitions of ; in particular .
The Stratonovich integral of against is defined as the limit in probability of the symmetric (midpoint) Riemann sums
where ranges over partitions with mesh . The averaging of the integrand at the two endpoints is the defining feature; the ring symbol distinguishes it from the Itô integral , whose sums use the left endpoint .
When for a function — the case that arises in solving SDEs — the symmetric sum can be rewritten, and the limit relates the Stratonovich integral to the Itô integral by the Itô-Stratonovich conversion formula
The extra term is the correction term: it is the difference between reading the integrand at the midpoint and reading it at the left endpoint, and it is exactly half the covariation of the integrand process with the integrator. For and one has , so the correction is , an ordinary Lebesgue integral. The Stratonovich integral is not a martingale in general — the correction term has nonzero drift — which is the price paid for the chain rule below.
A Stratonovich stochastic differential equation in is written $$ dX_t ;=; V_0(X_t),dt ;+; \sum_{i=1}^d V_i(X_t) \circ dB^i_t, $$ with smooth vector fields on and a -dimensional Brownian motion. The fields are the diffusion vector fields and is the drift vector field.
Key theorem with proof Intermediate+
Theorem (Stratonovich chain rule). Let be a continuous semimartingale and . Then is a continuous semimartingale and, in Stratonovich form, $$ df(X_t) ;=; f'(X_t)\circ dX_t . $$ There is no second-derivative term: the rule is the ordinary Newton-Leibniz chain rule, in contrast to Itô's formula .
Proof. Start from Itô's formula, which holds for any continuous semimartingale [oksendal Ch. 3]: $$ df(X_t) = f'(X_t),dX_t + \tfrac{1}{2}f''(X_t),d[X,X]_t . $$ Apply the conversion formula to the right-hand Stratonovich object we wish to evaluate. With integrand and integrator , $$ \int_0^t f'(X_s)\circ dX_s = \int_0^t f'(X_s),dX_s + \tfrac{1}{2},[f'(X), X]_t . $$ The covariation is computed by Itô's formula applied to the integrand process: when , and only the part contributes to a bracket with , since the finite-variation piece has zero covariation with the continuous-martingale part. Therefore $$ [f'(X), X]_t = \int_0^t f''(X_s),d[X,X]_s . $$ For the same identity holds by a standard mollification argument, approximating by functions and passing to the limit, since both sides depend continuously on in on compacts. Substituting, $$ \int_0^t f'(X_s)\circ dX_s = \int_0^t f'(X_s),dX_s + \tfrac{1}{2}\int_0^t f''(X_s),d[X,X]_s . $$ The right-hand side is precisely the Itô-formula expansion of . Hence , which in differential form reads . The Itô second-derivative term has been absorbed into the symmetric integral's correction, with the two contributions equal and combining to cancel.
Bridge. This cancellation builds toward the entire reason Stratonovich calculus is the right language for stochastic differential equations on manifolds, a thread that appears again in 03.02.45. The foundational reason is functorial: a chain rule with no second-derivative term is a chain rule that commutes with composition by diffeomorphisms exactly as ordinary calculus does, so a Stratonovich SDE pushed forward through a change of coordinates becomes another Stratonovich SDE whose vector fields are the pushed-forward fields . This is exactly the transformation law of a vector field, the same law that governs the Levi-Civita data of 03.02.27; the Itô form fails it because the correction does not transform tensorially. The conversion formula is dual to Itô's formula in the precise sense that each is obtained from the other by adding or subtracting one half the covariation bracket, and putting these together gives the central insight of the subject: the bridge from analysis to geometry is the symmetric integral, because only it lets "solve an SDE" mean the same thing in every chart. The construction generalises verbatim from to any smooth manifold, which is the program the next tier carries out.
Exercises Intermediate+
Advanced results Master
The conversion formula and the chain rule together make the Stratonovich integral the calculus in which stochastic flows are coordinate-free. On a smooth manifold with smooth vector fields , the Stratonovich SDE has an intrinsic meaning: a continuous -valued semimartingale solves it when, for every , the real semimartingale satisfies . The chain-rule absence of a second-derivative term is what makes this test consistent across overlapping charts: composing with a transition map is composing with a diffeomorphism, and the symmetric integral commutes with that composition.
Generator and Hörmander form. The conversion of the Stratonovich SDE to Itô form adds the drift (or in coordinates), and the diffusion process is Markov with infinitesimal generator $$ L ;=; V_0 ;+; \tfrac{1}{2}\sum_{i=1}^d V_i^2 , $$ where each is read as a first-order operator and . This is the Hörmander sum-of-squares operator. Its principal symbol is , degenerate wherever the fields fail to span; the operator is nonetheless hypoelliptic when the Lie algebra generated by spans at every (Hörmander's bracket-generating condition). The Stratonovich form is what exhibits in this manifestly geometric square; the Itô form would present the same with its drift and second-order parts artificially separated and chart-dependent.
Wong-Zakai approximation. Let be smooth approximations to — piecewise-linear interpolation on a mesh of width , or mollification . Let solve the ordinary (random) ODE , path by path. The Wong-Zakai theorem asserts in probability, uniformly on compact time intervals, where solves the Stratonovich SDE with the same coefficients [Wong-Zakai 1965]. The limit is Stratonovich, not Itô, precisely because each obeys the ordinary chain rule, and that property survives the limit only for the symmetric integral. In higher dimensions with non-commuting fields () the smoothing scheme can contribute an additional Lévy-area drift term; the canonical piecewise-linear scheme yields the pure Stratonovich limit, and rough-path theory later organises these correction terms systematically.
Synthesis. The symmetric integral is the foundational reason the three faces of a stochastic flow — its defining SDE, its generator, and its smooth-noise approximation — line up in a single coordinate-free package. The Stratonovich chain rule is exactly the ordinary chain rule, and that is the central insight from which everything else follows: it is what makes the generator appear in the Hörmander square , what makes the Wong-Zakai limit land on the Stratonovich equation rather than an Itô one, and what makes the diffusion fields transform as honest vector fields under change of chart. This is dual to the Itô picture, where the same generator is recovered but its second-order part is glued to a chart-dependent drift; the bridge between the two is the conversion formula, one half a covariation bracket. Putting these together, Stratonovich calculus generalises ordinary calculus to rough integrators while preserving its functoriality, and it is this functoriality — not mere notational convenience — that makes it the calculus of Brownian motion on manifolds, the construction carried out in 03.02.45.
Full proof set Master
Proposition (Stratonovich-to-Itô conversion is well defined and unique). Let be a continuous semimartingale and a continuous semimartingale on a common filtered space. The midpoint Riemann sums defining converge in probability, and the limit equals .
Proof. Write a partition of and decompose the symmetric sum: $$ \sum_k \frac{H_{t_k} + H_{t_{k+1}}}{2},\Delta_k X = \sum_k H_{t_k},\Delta_k X + \frac{1}{2}\sum_k (H_{t_{k+1}} - H_{t_k}),\Delta_k X, $$ where . The first sum is the left-endpoint (Itô) Riemann sum, which converges in probability to the Itô integral as , by the construction of the Itô integral for continuous semimartingales [oksendal Ch. 3]. The second sum is $$ \frac{1}{2}\sum_k (\Delta_k H)(\Delta_k X), $$ which is one half the cross-variation Riemann sum and converges in probability to by the definition of the quadratic covariation as the limit of such products along refining partitions. Both limits exist independently of the choice of refining sequence because each is itself an in-probability limit of an established object, so their sum exists and equals . Uniqueness of the limit follows from uniqueness of limits in probability.
Proposition (coordinate invariance of the Stratonovich SDE). Let be a diffeomorphism between open subsets of , and let solve with values in . Then solves , where is the pushforward field.
Proof. Apply the Stratonovich chain rule componentwise to for each coordinate . By the chain rule with no second-order term, $$ dY^a = \sum_j \partial_j \varphi^a(X)\circ dX^j = \sum_j \partial_j\varphi^a(X)\Bigl(V_0^j(X),dt + \sum_i V_i^j(X)\circ dB^i\Bigr). $$ Grouping the and terms, the coefficient is , using . Identically, the coefficient is . The validity of moving the smooth coefficient through the symmetric integral is precisely the content of the chain rule, which carries no Itô correction. Hence solves the stated Stratonovich SDE with pushed-forward fields. The same computation with Itô integrals would leave a residual term from Itô's formula, which is the obstruction to tensoriality of the Itô form.
Connections Master
The multivariable chain rule 02.05.03 is the deterministic statement that this unit restores in the stochastic setting. The ordinary chain rule is exactly the form the Stratonovich chain rule recovers, with the random integrator in place of the smooth . The entire payoff of the symmetric integral is that it returns calculus to this familiar law, so the second proposition's pushforward computation reads line for line like the deterministic change-of-variables in 02.05.03.
The Itô integral and Itô's formula 02.15.02 is the co-equal counterpart and the route through which the Stratonovich integral is even defined. The conversion formula expresses one in terms of the other, and the proof of the Stratonovich chain rule runs entirely through Itô's formula. Itô is the martingale-preserving calculus of choice for filtration-respecting models such as option pricing, where the integrand must be evaluated before the increment is revealed; Stratonovich trades the martingale property for the ordinary chain rule. Each is the natural setting for problems the other handles awkwardly, and the bracket term is the exact dictionary between them.
The Levi-Civita connection and vector fields on manifolds 03.02.27 supply the geometric scaffolding the Master tier stands on. The pushforward law established here is the same transformation law that governs the connection coefficients there, and the Stratonovich-to-Itô drift correction is written with that connection's covariant derivative. Coordinate invariance of the SDE is what lets the flow be assembled from local charts into a process on the manifold, exactly as a vector field is assembled from local trivialisations.
Brownian motion on a manifold 03.02.45 is the immediate downstream consumer. Constructing -valued Brownian motion means solving a Stratonovich SDE whose diffusion fields are an orthonormal frame for the metric, lifted to the orthonormal frame bundle (the Eells-Elworthy-Malliavin construction); only the symmetric integral makes the chart-by-chart solutions agree on overlaps, so this unit is the analytic prerequisite for that geometric object.
The heat equation and heat kernel 02.13.03 is the analytic shadow of the generator constructed here. When the Hörmander operator is the Laplace-Beltrami operator, the transition density of the Stratonovich diffusion is the heat kernel, and the Feynman-Kac representation expresses heat-equation solutions as expectations along the diffusion. The hypoellipticity of bracket-generating is the probabilistic source of smoothing for the associated parabolic equation, whose Gaussian transition density extends the normal increments of Brownian motion [quantum-well Continuous probability distributions].
Historical & philosophical context Master
The symmetric integral was introduced by Ruslan Stratonovich in 1966 in the context of nonlinear filtering and optimal control, where preserving the ordinary rules of calculus simplified the manipulation of stochastic systems arising in radio engineering [Stratonovich 1966]. Donald Fisk independently developed an equivalent symmetric stochastic integral in the same period, so the construction is sometimes called the Fisk-Stratonovich integral. The choice between Stratonovich's symmetric convention and Itô's non-anticipating convention — Kiyosi Itô's integral dating to 1944 — is not a matter of correctness but of which structural property one wishes to keep: Itô keeps the martingale property and adaptedness, Stratonovich keeps the chain rule.
The approximation theorem that fixes the Stratonovich convention as the physically natural one is due to Eugene Wong and Moshe Zakai in 1965, who showed that ODE solutions driven by smoothed noise converge to the Stratonovich and not the Itô solution [Wong-Zakai 1965]. This resolved a genuine ambiguity in modelling: a physicist writing down an equation with white-noise forcing, understood as the limit of band-limited noise, means the Stratonovich equation. The geometric consequence — that Stratonovich SDEs are the coordinate-invariant ones — was developed into a full theory of stochastic differential geometry by Itô himself, Paul Malliavin, and David Elworthy through the 1970s and 1980s, and the connection to Lars Hörmander's 1967 hypoellipticity theorem made the sum-of-squares generator the meeting point of probability and partial differential equations.
Bibliography Master
@article{stratonovich1966,
author = {Stratonovich, R. L.},
title = {A New Representation for Stochastic Integrals and Equations},
journal = {SIAM Journal on Control},
volume = {4},
number = {2},
pages = {362--371},
year = {1966},
doi = {10.1137/0304028}
}
@article{wongzakai1965,
author = {Wong, Eugene and Zakai, Moshe},
title = {On the Convergence of Ordinary Integrals to Stochastic Integrals},
journal = {The Annals of Mathematical Statistics},
volume = {36},
number = {5},
pages = {1560--1564},
year = {1965},
doi = {10.1214/aoms/1177699916}
}
@article{ito1944,
author = {It\^o, Kiyosi},
title = {Stochastic Integral},
journal = {Proceedings of the Imperial Academy (Tokyo)},
volume = {20},
number = {8},
pages = {519--524},
year = {1944},
doi = {10.3792/pia/1195572786}
}
@article{hormander1967,
author = {H\"ormander, Lars},
title = {Hypoelliptic Second Order Differential Equations},
journal = {Acta Mathematica},
volume = {119},
pages = {147--171},
year = {1967},
doi = {10.1007/BF02392081}
}
@book{ikedawatanabe1989,
author = {Ikeda, Nobuyuki and Watanabe, Shinzo},
title = {Stochastic Differential Equations and Diffusion Processes},
edition = {2nd},
publisher = {North-Holland},
year = {1989}
}
@book{hsu2002,
author = {Hsu, Elton P.},
title = {Stochastic Analysis on Manifolds},
series = {Graduate Studies in Mathematics},
volume = {38},
publisher = {American Mathematical Society},
year = {2002}
}