02.15.02 · analysis / stochastic-analysis

The Itô integral and Itô's formula

shipped3 tiersLean: none

Anchor (Master): Karatzas–Shreve — Brownian Motion and Stochastic Calculus Ch. 3; Revuz–Yor — Continuous Martingales and Brownian Motion Ch. IV; Protter — Stochastic Integration and Differential Equations Ch. II

Intuition Beginner

Ordinary integration adds up a quantity that changes along a smooth path: how far you travel is your speed summed over time. Stochastic integration asks the same kind of question, but the path you are summing against is a jittery random walk — the trajectory of a particle bumped around by countless tiny collisions. You hold some amount of a risky asset, the asset price jiggles randomly, and you want the total gain. The amount you hold can itself change as you watch the price move.

The catch is that the random path is far too rough to handle the usual way. Over any time interval, however short, the path wiggles up and down without settling, and its total back-and-forth length is infinite. So we cannot treat the random increments as ordinary small steps and add them with the familiar rules.

The fix is a rule about timing: you must decide how much to hold before you see the next random jump, never after. That single honesty condition — no peeking ahead — is what makes the whole construction work and what separates this integral from every integral that came before it.

Visual Beginner

Picture a jagged random path climbing and falling across the page from left to right: the running position of a particle in random motion. Below it, a second staircase shows how much of the asset you are holding, flat over each short time block and then jumping to a new flat level at the start of the next block.

Each block contributes one term: your fixed holding over that block, multiplied by the net change in the random path across that block. Because your holding for a block is locked in at its left edge — before the path moves through the block — you never get to use future information. Summing these block contributions gives the total gain. As the blocks shrink toward zero width, the staircase sums settle onto a single number: the stochastic integral.

Worked example Beginner

Take the simplest holding: hold exactly one unit at all times, from time $0$ to time $T$ . The random path starts at $0$ . Split $[0, T]$ into blocks at times $0 = t_{0} < t_{1} < \dots < t_{n} = T$ . Over block $k$ the path changes by $B (t_{k + 1}) - B (t_{k})$ , and your holding is $1$ , so block $k$ contributes that change. Adding all blocks gives $B (t_{1}) - B (t_{0}) + B (t_{2}) - B (t_{1}) + \dots = B (T) - B (0) = B (T)$ . The total gain from holding one unit is just the net displacement $B (T)$ , exactly as you would hope.

Now hold the current path value instead: at the left edge of block $k$ you hold $B (t_{k})$ . Block $k$ then contributes $B (t_{k}) (B (t_{k + 1}) - B (t_{k}))$ . A short algebra rearrangement of the telescoping total gives one half of $B (T)^{2}$ minus one half of the added-up squared changes across all the blocks. The squared changes do not vanish as blocks shrink: their running total settles to $T$ . So the total gain is $\frac{1}{2} B (T)^{2} - \frac{1}{2} T$ , not the $\frac{1}{2} B (T)^{2}$ that ordinary calculus would predict. That stubborn extra $- \frac{1}{2} T$ is the signature of the whole subject: the random path is so rough that its squared wiggles add up to real time.

Check your understanding Beginner

Formal definition Intermediate+

Fix a probability space $(Ω, F, P)$ with a filtration $(F_{t})_{t \geq 0}$ satisfying the usual conditions, and let $(B_{t})_{t \geq 0}$ be a one-dimensional Brownian motion adapted to this filtration, with increments $B_{t} - B_{s}$ independent of $F_{s}$ for $s < t$ (the process constructed in 02.15.01). An integrand $(H_{t})$ is simple predictable when there exist deterministic times $0 = t_{0} < t_{1} < \dots < t_{n} = T$ and bounded $F_{t_{k}}$ -measurable random variables $ξ_{k}$ with $$ H_t(\omega) = \sum_{k=0}^{n-1} \xi_k(\omega), \mathbf{1}{(t_k, t{k+1}]}(t). $$ The measurability of $ξ_{k}$ with respect to $F_{t_{k}}$ is the non-anticipation requirement. For such $H$ the Itô integral is defined pathwise by the finite sum $$ \int_0^T H_s, dB_s := \sum_{k=0}^{n-1} \xi_k,(B_{t_{k+1}} - B_{t_k}), $$ evaluating each integrand at the left endpoint of its block. The left-endpoint choice is the convention that makes the construction a martingale; the right-endpoint or midpoint choices give different integrals, the latter being the Stratonovich integral of 02.15.05.

The defining estimate is the Itô isometry: for simple predictable $H$ with $E \int_{0}^{T} H_{s}^{2} d s < \infty$ , $$ \mathbb{E}!\left[\left(\int_0^T H_s, dB_s\right)^{!2}\right] = \mathbb{E}!\int_0^T H_s^2, ds. $$ This identity says the map $H \mapsto \int_{0}^{T} H d B$ sends the inner-product space of simple integrands isometrically into $L^{2} (Ω)$ . The class $L_{T}^{2}$ of progressively measurable integrands with $E \int_{0}^{T} H_{s}^{2} d s < \infty$ is the completion of the simple integrands under the norm $∥ H ∥ = (E \int_{0}^{T} H_{s}^{2} d s)^{1/2}$ ; the isometry extends the integral uniquely and continuously to all of $L_{T}^{2}$ , since a complete target space ( $L^{2} (Ω)$ , by completeness from 02.07.06) receives the limit of any Cauchy sequence of integrals.

The quadratic variation of a continuous process $(X_{t})$ is the limit in probability $$ [X]t = \lim{|\Pi| \to 0} \sum_{k} (X_{t_{k+1}} - X_{t_k})^2 $$ over partitions $Π$ of $[0, t]$ with mesh $∥Π∥ \to 0$ ; for Brownian motion $[B]_{t} = t$ . The covariation of two processes is the polarisation $[X, Y]_{t} = \frac{1}{2} ([X + Y]_{t} - [X]_{t} - [Y]_{t})$ , so $[B]_{t} = [B, B]_{t}$ . The heuristic $d B_{t} d B_{t} = d t$ , $d B_{t} d t = 0$ , $d t d t = 0$ encodes these facts and drives every Itô computation below.

Counterexamples to common slips

Evaluating the integrand at the right endpoint $ξ_{k} = B_{t_{k + 1}}$ instead of the left destroys the martingale property and shifts the answer by the quadratic variation; this is not the Itô integral.
The quadratic variation $[B]_{t} = t$ is not zero. For an ordinary differentiable path the analogous limit is zero, so importing the smooth-path intuition that squared increments are negligible is the standard error.
The Itô isometry requires $H$ to be non-anticipating. For an integrand that peeks ahead, $E [ξ_{k} (B_{t_{k + 1}} - B_{t_{k}})]$ need not vanish and the isometry fails.

Key theorem with proof Intermediate+

Theorem (Itô's formula, one-dimensional). Let $f \in C^{2} (R)$ and let $B$ be Brownian motion. Then for every $t \geq 0$ , almost surely, $$ f(B_t) = f(B_0) + \int_0^t f'(B_s), dB_s + \frac{1}{2}\int_0^t f''(B_s), ds. $$

Proof. Fix $t$ and a partition $0 = t_{0} < \dots < t_{n} = t$ with mesh tending to zero. A second-order Taylor expansion of $f$ between consecutive points gives, for each $k$ , $$ f(B_{t_{k+1}}) - f(B_{t_k}) = f'(B_{t_k}),\Delta_k B + \tfrac12 f''(B_{t_k}),(\Delta_k B)^2 + R_k, $$ where $Δ_{k} B = B_{t_{k + 1}} - B_{t_{k}}$ and the remainder satisfies $∣ R_{k} ∣ \leq \frac{1}{2} ω (∣ Δ_{k} B ∣) (Δ_{k} B)^{2}$ with $ω$ the modulus of continuity of $f^{''}$ on the (almost surely bounded) range of $B$ over $[0, t]$ . Summing over $k$ telescopes the left side to $f (B_{t}) - f (B_{0})$ .

The first sum $\sum_{k} f^{'} (B_{t_{k}}) Δ_{k} B$ is the Itô-integral approximation of $\int_{0}^{t} f^{'} (B_{s}) d B_{s}$ ; since $f^{'} (B)$ is continuous and adapted, hence in $L^{2}$ on $[0, t]$ , this converges in $L^{2} (P)$ to that integral as the mesh shrinks.

For the second sum, replace $(Δ_{k} B)^{2}$ by $Δ_{k} t = t_{k + 1} - t_{k}$ and control the difference. Write $$ \sum_k f''(B_{t_k})(\Delta_k B)^2 = \sum_k f''(B_{t_k}),\Delta_k t + \sum_k f''(B_{t_k})\big((\Delta_k B)^2 - \Delta_k t\big). $$ The first piece is a Riemann sum converging almost surely to $\int_{0}^{t} f^{''} (B_{s}) d s$ by continuity of $s \mapsto f^{''} (B_{s})$ . For the second piece, the terms $(Δ_{k} B)^{2} - Δ_{k} t$ are conditionally centred — $E [(Δ_{k} B)^{2} - Δ_{k} t ∣ F_{t_{k}}] = 0$ — and conditionally uncorrelated across blocks, so the second moment of the sum is $\sum_{k} E [f^{''} (B_{t_{k}})^{2}] E [((Δ_{k} B)^{2} - Δ_{k} t)^{2}] = \sum_{k} E [f^{''} (B_{t_{k}})^{2}] \cdot 2 (Δ_{k} t)^{2}$ , using $E [(Δ_{k} B)^{2} - Δ_{k} t)^{2}] = 2 (Δ_{k} t)^{2}$ for a Gaussian increment. This is bounded by $2 ∥ f^{''} ∥_{\infty, [0, t]}^{2} \cdot ∥Π∥ \cdot t \to 0$ . Hence the second piece vanishes in $L^{2}$ .

Finally $\sum_{k} R_{k} \to 0$ because $ω (∣ Δ_{k} B ∣) \to 0$ uniformly (the path is uniformly continuous on $[0, t]$ ) while $\sum_{k} (Δ_{k} B)^{2} \to t$ stays bounded. Collecting the limits gives the stated identity, with the factor $\frac{1}{2}$ on the second integral coming directly from the second-order Taylor coefficient. $□$

Bridge. Itô's formula builds toward the entire theory of stochastic differential equations and their links to partial differential equations, and the same correction term appears again in the multidimensional and time-dependent versions below. The foundational reason for the extra $\frac{1}{2} \int f^{''} d s$ is that quadratic variation does not vanish: $[B]_{t} = t$ forces the second-order Taylor term to survive in the limit, which is exactly the heuristic $d B^{2} = d t$ made into a theorem. This is exactly the mechanism by which the worked example produced $\int_{0}^{t} B d B = \frac{1}{2} B_{t}^{2} - \frac{1}{2} t$ rather than $\frac{1}{2} B_{t}^{2}$ , and it generalises the ordinary chain rule, to which it reduces whenever the integrating path has zero quadratic variation. Putting these together, the bridge is that Itô's formula converts the analytic operator $\frac{1}{2} f^{''}$ into the generator of Brownian motion, so that harmonic and heat-equation theory transfer wholesale to the probabilistic setting — the central insight that organises stochastic analysis and reappears in 02.15.03 as the link between SDEs and second-order PDEs.

Exercises Intermediate+

Exercise 6 (hard, short-answer).

Prove the integration-by-parts (product) rule $d (X_{t} Y_{t}) = X_{t} d Y_{t} + Y_{t} d X_{t} + d [X, Y]_{t}$ for two Itô processes, and specialise to $X = Y = B$ to recover $\int_{0}^{t} B d B = \frac{1}{2} (B_{t}^{2} - t)$ .

Hint

Apply the two-dimensional Itô formula to $f (x, y) = x y$ , whose only nonzero second derivative is $\partial_{x y} f = 1$ .

Answer

For $f (x, y) = x y$ the multidimensional Itô formula gives $d (X Y) = \partial_{x} f d X + \partial_{y} f d Y + \frac{1}{2} (\partial_{xx} f d [X] + 2 \partial_{x y} f d [X, Y] + \partial_{y y} f d [Y])$ . Since $\partial_{x} f = y$ , $\partial_{y} f = x$ , $\partial_{xx} f = \partial_{y y} f = 0$ , $\partial_{x y} f = 1$ , this is $d (X Y) = Y d X + X d Y + d [X, Y]$ . With $X = Y = B$ , $[B, B]_{t} = t$ , so $d (B^{2}) = 2 B d B + d t$ ; integrating, $B_{t}^{2} = 2 \int_{0}^{t} B d B + t$ , giving $\int_{0}^{t} B d B = \frac{1}{2} (B_{t}^{2} - t)$ . Rubric: full credit for the product-rule derivation via $f = x y$ and the specialisation reproducing the worked example.

Exercise 7 (hard, short-answer).

Let $u (t, x)$ solve the backward heat equation $\partial_{t} u + \frac{1}{2} \partial_{xx} u = 0$ on $[0, T] \times R$ . Show that $u (t, B_{t})$ is a martingale, and deduce the Feynman-Kac representation $u (0, x) = E [u (T, B_{T}) ∣ B_{0} = x]$ .

Hint

Apply the time-dependent Itô formula to $u (t, B_{t})$ and use the PDE to cancel all $d t$ terms.

Answer

By the time-dependent Itô formula, $d u (t, B_{t}) = (\partial_{t} u + \frac{1}{2} \partial_{xx} u) d t + \partial_{x} u d B_{t}$ . The PDE makes the $d t$ coefficient vanish, leaving $d u (t, B_{t}) = \partial_{x} u (t, B_{t}) d B_{t}$ , a stochastic integral and hence (under integrability of $\partial_{x} u (t, B_{t})$ ) a martingale. The martingale property between times $0$ and $T$ gives $u (0, x) = E [u (0, B_{0})] = E [u (T, B_{T}) ∣ B_{0} = x]$ . Rubric: full credit for the drift cancellation via the PDE and the martingale-to-expectation step. This is the prototype of the SDE-PDE correspondence.

Advanced results Master

The Itô integral against Brownian motion is a continuous $L^{2}$ -bounded martingale: for $H \in L_{T}^{2}$ the process $I_{t} (H) = \int_{0}^{t} H_{s} d B_{s}$ , $0 \leq t \leq T$ , admits a continuous modification, is a martingale with $E [I_{t} (H)] = 0$ , and has quadratic variation $[I (H)]_{t} = \int_{0}^{t} H_{s}^{2} d s$ . The isometry is the case $t = T$ of $E [I_{t} (H)^{2}] = E \int_{0}^{t} H_{s}^{2} d s$ , itself the statement that $I_{t} (H)^{2} - \int_{0}^{t} H_{s}^{2} d s$ is a martingale. This identification of the integral's quadratic variation with the time-integral of the squared integrand is the foundational reason the whole calculus closes on itself: differentiating an Itô integral returns its integrand, and squaring returns the $d s$ -clock that drives every correction term.

Itô's formula extends to continuous semimartingales. For an Itô process $d X_{t} = b_{t} d t + σ_{t} d B_{t}$ with $b, σ$ progressively measurable and locally square-integrable, and $f \in C^{1, 2} ([0, \infty) \times R)$ , $$ df(t, X_t) = \partial_t f, dt + \partial_x f, dX_t + \tfrac12 \partial_{xx} f, d[X]_t, \qquad d[X]_t = \sigma_t^2, dt. $$ The multidimensional version, for $X_{t} \in R^{n}$ driven by an $m$ -dimensional Brownian motion through $d X_{t} = b_{t} d t + σ_{t} d B_{t}$ with $σ_{t} \in R^{n \times m}$ , reads $$ df(t, X_t) = \partial_t f, dt + \nabla f \cdot dX_t + \tfrac12 \operatorname{tr}!\big(\sigma_t \sigma_t^{\mathsf T} D^2 f\big), dt, $$ with $D^{2} f$ the Hessian. The trace term is the contraction of the Hessian against the diffusion matrix $a_{t} = σ_{t} σ_{t}^{T}$ ; it is the multidimensional avatar of the $\frac{1}{2} f^{''}$ correction and identifies $\frac{1}{2} tr (a D^{2} \cdot) + b \cdot \nabla$ as the generator of the diffusion $X$ .

The exponential martingale is the universal example. For an adapted $θ$ with $E \int_{0}^{T} θ_{s}^{2} d s < \infty$ , the Doléans-Dade exponential $$ \mathcal{E}(\theta)_t = \exp!\Big(\int_0^t \theta_s, dB_s - \tfrac12 \int_0^t \theta_s^2, ds\Big) $$ satisfies $d E = θ E d B$ , so it is a local martingale; under Novikov's condition $E [exp (\frac{1}{2} \int_{0}^{T} θ_{s}^{2} d s)] < \infty$ it is a true martingale, and it is the density process of the Girsanov change of measure that removes drift. The $- \frac{1}{2} \int θ^{2} d s$ in the exponent is precisely the Itô correction, the same term that appeared in geometric Brownian motion.

The Burkholder-Davis-Gundy inequalities control the running maximum of a continuous local martingale $M$ with $M_{0} = 0$ by its quadratic variation: for every $p > 0$ there are universal constants $0 < c_{p} \leq C_{p} < \infty$ with $$ c_p, \mathbb{E}\big[[M]T^{,p/2}\big] \le \mathbb{E}\Big[\sup{0 \le t \le T} |M_t|^p\Big] \le C_p, \mathbb{E}\big[[M]_T^{,p/2}\big]. $$ For $p = 2$ the upper bound is Doob's $L^{2}$ inequality combined with the isometry; the general case is the quantitative backbone of $L^{p}$ estimates for stochastic integrals and of existence-uniqueness theory for SDEs.

Synthesis. The foundational reason Itô calculus exists as a closed system is the identity $[I (H)]_{t} = \int_{0}^{t} H_{s}^{2} d s$ : it says the quadratic variation of a stochastic integral is the time-integral of its squared integrand, and this is exactly the bookkeeping that turns the heuristic $d B^{2} = d t$ into operative calculus. Putting these together, the one-dimensional correction $\frac{1}{2} f^{''}$ , the multidimensional trace term $\frac{1}{2} tr (a D^{2} f)$ , and the exponential-martingale drift $- \frac{1}{2} \int θ^{2}$ are one phenomenon wearing three costumes — each is the second-order Taylor remainder that survives because quadratic variation does not vanish. This is exactly the central insight that the generator of a diffusion is a second-order operator, which generalises the ordinary chain rule and is dual to the forward Kolmogorov (Fokker-Planck) evolution of the law of $X$ . The Burkholder-Davis-Gundy inequalities then make the quadratic variation the universal yardstick: control of $[M]$ controls every $L^{p}$ norm of the path, and this is the bridge from the algebra of the differential rules to the analysis of existence, uniqueness, and convergence for stochastic differential equations in 02.15.03.

Full proof set Master

The one-dimensional Itô formula and its partition argument are proved in full in the Key theorem section. The remaining Master claims are recorded here.

Proposition (Itô isometry and the $L^{2}$ extension). For simple predictable $H$ , $E [(\int_{0}^{T} H d B)^{2}] = E \int_{0}^{T} H^{2} d s$ . Consequently the integral extends uniquely to a linear isometry $L_{T}^{2} \to L^{2} (Ω)$ .

Proof. Write $\int_{0}^{T} H d B = \sum_{k} ξ_{k} Δ_{k} B$ with $Δ_{k} B = B_{t_{k + 1}} - B_{t_{k}}$ . Expanding the square, $$ \mathbb{E}\Big[\Big(\sum_k \xi_k \Delta_k B\Big)^2\Big] = \sum_{j,k} \mathbb{E}[\xi_j \xi_k, \Delta_j B, \Delta_k B]. $$ For $j < k$ , condition on $F_{t_{k}}$ : the factor $ξ_{j} ξ_{k} Δ_{j} B$ is $F_{t_{k}}$ -measurable and $Δ_{k} B$ is independent of $F_{t_{k}}$ with mean zero, so $E [ξ_{j} ξ_{k} Δ_{j} B Δ_{k} B] = E [ξ_{j} ξ_{k} Δ_{j} B] E [Δ_{k} B] = 0$ ; the same holds for $j > k$ by symmetry. The diagonal terms give $E [ξ_{k}^{2} (Δ_{k} B)^{2}] = E [ξ_{k}^{2}] E [(Δ_{k} B)^{2}] = E [ξ_{k}^{2}] Δ_{k} t$ , again by independence of the increment from $F_{t_{k}}$ . Summing, $\sum_{k} E [ξ_{k}^{2}] Δ_{k} t = E \int_{0}^{T} H_{s}^{2} d s$ . The isometry holds. Since the simple integrands are dense in $L_{T}^{2}$ and $L^{2} (Ω)$ is complete by 02.07.06, the isometric map sends Cauchy sequences to Cauchy sequences and extends uniquely and continuously to the completion. $□$

Proposition (the integral is a continuous martingale). For $H \in L_{T}^{2}$ the process $I_{t} (H) = \int_{0}^{t} H d B$ is a martingale with respect to $(F_{t})$ and admits a continuous modification.

Proof. For simple $H$ and $s < t$ , the increment $I_{t} (H) - I_{s} (H)$ is a sum of terms $ξ_{k} Δ_{k} B$ over blocks past $s$ ; each has $E [ξ_{k} Δ_{k} B ∣ F_{s}] = E [ξ_{k} E [Δ_{k} B ∣ F_{t_{k}}] ∣ F_{s}] = 0$ by the tower property and mean-zero independent increments, so $E [I_{t} (H) ∣ F_{s}] = I_{s} (H)$ . The martingale property passes to the $L^{2}$ limit because conditional expectation is an $L^{2}$ -contraction. Continuity of the simple-integrand process is plain (it is piecewise a multiple of continuous $B$ ); for general $H$ , take simple $H^{(n)} \to H$ in $L^{2}$ , and Doob's maximal inequality gives $E [sup_{t \leq T} ∣ I_{t} (H^{(n)}) - I_{t} (H^{(m)}) ∣^{2}] \leq 4 E \int_{0}^{T} (H^{(n)} - H^{(m)})^{2} d s \to 0$ , so a subsequence converges uniformly almost surely, and the uniform limit of continuous paths is continuous. $□$

Proposition (quadratic variation of the integral). For $H \in L_{T}^{2}$ , $I_{t} (H)^{2} - \int_{0}^{t} H_{s}^{2} d s$ is a martingale; equivalently $[I (H)]_{t} = \int_{0}^{t} H_{s}^{2} d s$ .

Proof. It suffices to show $E [(I_{t} - I_{s})^{2} ∣ F_{s}] = E [\int_{s}^{t} H^{2} d u ∣ F_{s}]$ for $s < t$ , since then $I_{t}^{2} - \int_{0}^{t} H^{2} = I_{s}^{2} - \int_{0}^{s} H^{2} + (I_{t} - I_{s})^{2} - \int_{s}^{t} H^{2} + 2 I_{s} (I_{t} - I_{s})$ has $F_{s}$ -conditional expectation $I_{s}^{2} - \int_{0}^{s} H^{2}$ , the martingale identity (the cross term vanishes by the martingale property of $I$ ). The displayed equality is the Itô isometry applied to the integral of $H 1_{(s, t]}$ against $B$ conditioned on $F_{s}$ , which holds blockwise for simple integrands by the diagonal computation of the first proposition and extends by the $L^{2}$ -limit. By the characterisation of quadratic variation as the unique continuous increasing process $A$ with $I^{2} - A$ a martingale, $[I (H)]_{t} = \int_{0}^{t} H_{s}^{2} d s$ . $□$

Proposition (multidimensional Itô formula). For $X_{t} \in R^{n}$ with $d X_{t} = b_{t} d t + σ_{t} d B_{t}$ , $B$ an $m$ -dimensional Brownian motion, and $f \in C^{1, 2}$ , $df (t, X_{t}) = \partial_{t} f d t + \nabla f \cdot d X_{t} + \frac{1}{2} tr (σ_{t} σ_{t}^{T} D^{2} f) d t$ .

Proof. The one-dimensional partition argument generalises componentwise. The second-order Taylor expansion of $f$ now carries cross terms $\frac{1}{2} \partial_{x_{i} x_{j}} f Δ X^{i} Δ X^{j}$ . The covariation of the components is $d [X^{i}, X^{j}]_{t} = (σ_{t} σ_{t}^{T})_{ij} d t$ , because $d [B^{a}, B^{b}] = δ_{ab} d t$ for independent Brownian coordinates and $d X^{i} = b^{i} d t + \sum_{a} σ^{ia} d B^{a}$ , so $d [X^{i}, X^{j}] = \sum_{a, b} σ^{ia} σ^{j b} d [B^{a}, B^{b}] = \sum_{a} σ^{ia} σ^{j a} d t = (σ σ^{T})_{ij} d t$ ; products involving $d t$ contribute nothing. Summing the Taylor terms over a shrinking partition, the first-order part assembles to $\nabla f \cdot d X$ , the diagonal-and-cross second-order part to $\frac{1}{2} \sum_{i, j} \partial_{x_{i} x_{j}} f (σ σ^{T})_{ij} d t = \frac{1}{2} tr (σ σ^{T} D^{2} f) d t$ , and the explicit time-dependence to $\partial_{t} f d t$ . The remainder estimate from the one-dimensional case applies coordinatewise. $□$

Connections Master

Brownian motion 02.15.01 is the integrator on which this entire construction rests. The independence of increments and the Gaussian scaling $E [(Δ B)^{2}] = Δ t$ established there are exactly what make the Itô isometry hold and what give $[B]_{t} = t$ ; without the non-anticipating filtration and the mean-zero independent increments of that unit, neither the martingale property of the integral nor the surviving $\frac{1}{2} f^{''}$ correction term would be available.

The Lebesgue integral and monotone convergence 02.07.04 supply the integration theory used pathwise and in expectation throughout: the $d s$ -integrals $\int_{0}^{t} f^{''} (B_{s}) d s$ are ordinary Lebesgue integrals along each path, and the interchange of limit and expectation in the isometry extension and in the martingale-convergence steps rests on the convergence theorems proved there.

$L^{p}$ spaces and completeness 02.07.06 provide the target space for the central extension. The Itô integral is defined as the unique continuous extension of an isometry from simple integrands into $L^{2} (Ω)$ , and that extension exists precisely because $L^{2} (Ω)$ is complete; the Burkholder-Davis-Gundy inequalities then live in the $L^{p} (Ω)$ scale built in that unit.

Stochastic differential equations 02.15.03 are the immediate downstream consumer. Itô's formula is the change-of-variables rule that lets one solve SDEs in closed form (geometric Brownian motion, the exponential martingale) and is the analytic engine behind the Feynman-Kac and Kolmogorov correspondences that tie diffusion processes to second-order parabolic PDEs; the existence-uniqueness theory there runs on the isometry and the Burkholder-Davis-Gundy bounds proved here.

The Stratonovich integral 02.15.05 is the alternative construction that this unit pointedly does not use. Evaluating the integrand at the midpoint rather than the left endpoint restores the ordinary chain rule at the cost of the martingale property; the difference between the two integrals is exactly $\frac{1}{2} \int d [H, B]$ , the same quadratic-covariation term that produces the Itô correction, so the contrast between the two conventions is a direct corollary of the calculus developed here.

Historical & philosophical context Master

The stochastic integral was introduced by Kiyosi Itô in 1944 (Stochastic Integral, Proc. Imperial Academy Tokyo 20, 519–524) ^{[Itô 1944]}, building on Norbert Wiener's 1923 rigorous construction of Brownian motion and on Paul Lévy's structural study of its paths. Itô's decisive move was to define the integral for non-anticipating integrands and to prove the change-of-variables formula with its second-order correction term, the result now universally called Itô's formula; Wolfgang Doeblin had independently arrived at closely related ideas in a sealed note deposited with the Académie des Sciences in 1940 and opened only in 2000, so the lemma is sometimes called the Itô-Doeblin formula. The modern measure-theoretic treatment via the $L^{2}$ isometry and the martingale property is the synthesis of Joseph Doob's martingale theory with Itô's construction; the textbook accounts of Karatzas and Shreve and of Revuz and Yor ^{[Revuz 1999]} codify the semimartingale generality, and Philip Protter's functional-analytic development takes the integral itself as the primitive object.

The conceptual content is that a calculus can be built on a path of infinite total variation provided one fixes the order in which information is revealed. The quadratic variation, which vanishes for every classically differentiable path, becomes the carrier of the new structure: it is the clock against which the second-order term is measured, and the choice of evaluation point (left endpoint for Itô, midpoint for Stratonovich) is the choice of which symmetry to preserve — the martingale property or the ordinary chain rule. Itô's framework gave probability theory its own differential calculus, and through the Feynman-Kac correspondence it supplied a probabilistic representation for solutions of second-order parabolic partial differential equations, a bridge that has organised diffusion theory ever since (Itô 1951, On Stochastic Differential Equations, Memoirs Amer. Math. Soc. 4).

Bibliography Master

@article{ito1944,
  author  = {It\^o, Kiyosi},
  title   = {Stochastic Integral},
  journal = {Proceedings of the Imperial Academy (Tokyo)},
  volume  = {20},
  number  = {8},
  pages   = {519--524},
  year    = {1944}
}

@book{karatzas1991,
  author    = {Karatzas, Ioannis and Shreve, Steven E.},
  title     = {Brownian Motion and Stochastic Calculus},
  series    = {Graduate Texts in Mathematics},
  volume    = {113},
  edition   = {2nd},
  publisher = {Springer-Verlag, New York},
  year      = {1991}
}

@book{revuzyor1999,
  author    = {Revuz, Daniel and Yor, Marc},
  title     = {Continuous Martingales and Brownian Motion},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {293},
  edition   = {3rd},
  publisher = {Springer-Verlag, Berlin},
  year      = {1999}
}

@book{oksendal2003,
  author    = {{\O}ksendal, Bernt},
  title     = {Stochastic Differential Equations: An Introduction with Applications},
  edition   = {6th},
  publisher = {Springer-Verlag, Berlin},
  year      = {2003}
}

@book{protter2005,
  author    = {Protter, Philip E.},
  title     = {Stochastic Integration and Differential Equations},
  series    = {Stochastic Modelling and Applied Probability},
  volume    = {21},
  edition   = {2nd},
  publisher = {Springer-Verlag, Berlin},
  year      = {2005}
}

@article{ito1951sde,
  author  = {It\^o, Kiyosi},
  title   = {On Stochastic Differential Equations},
  journal = {Memoirs of the American Mathematical Society},
  volume  = {4},
  pages   = {1--51},
  year    = {1951}
}

Prerequisites

02.07.04
02.07.06

Tier anchors

beginner: Øksendal — Stochastic Differential Equations Ch. 3 (informal Itô integral); Evans — An Introduction to Stochastic Differential Equations Ch. 4
intermediate: Øksendal — Stochastic Differential Equations (6th ed.) Ch. 3-4; Evans — SDE Ch. 4-5 (Itô isometry, Itô's formula)
master: Karatzas–Shreve — Brownian Motion and Stochastic Calculus Ch. 3; Revuz–Yor — Continuous Martingales and Brownian Motion Ch. IV; Protter — Stochastic Integration and Differential Equations Ch. II

References

Karatzas, Shreve — Brownian Motion and Stochastic Calculus (Springer GTM 113, 2nd ed., 1991) · Ch. 3 §3.1-3.3 (construction of the Itô integral, Itô isometry), §3.3 (Itô's formula), §3.3.D (multidimensional rule)
Øksendal — Stochastic Differential Equations: An Introduction with Applications (Springer, 6th ed., 2003) · Ch. 3 (Itô integral, isometry), Ch. 4 (Itô formula, geometric Brownian motion, exponential martingale)
Revuz, Yor — Continuous Martingales and Brownian Motion (Springer Grundlehren 293, 3rd ed., 1999) · Ch. IV (stochastic integration, quadratic variation, Itô's formula), §IV.4 (Burkholder-Davis-Gundy inequalities)
Itô — Stochastic Integral (Proc. Imperial Academy Tokyo 20, 1944) · pp. 519-524; the original definition of the stochastic integral and the change-of-variables correction term

Estimated time

beginner: 18m
intermediate: 45m
master: 85m