The probabilistic heat kernel and Bismut's formula
Anchor (Master): Bismut 1984 *The Atiyah-Singer theorems: a probabilistic approach I, II* (J. Funct. Anal. 57); Berline-Getzler-Vergne *Heat Kernels and Dirac Operators* (Grundlehren 298) Ch. 11; Hsu *Stochastic Analysis on Manifolds* (GSM 38); Varadhan 1967 *On the behavior of the fundamental solution of the heat equation with variable coefficients* (Comm. Pure Appl. Math. 20); Ikeda-Watanabe *Stochastic Differential Equations and Diffusion Processes* (North-Holland)
Intuition Beginner
Heat spreads. Drop a hot spot at one point of a curved surface and watch the temperature even out. There is a second way to picture the same process that turns out to be far more powerful: instead of tracking temperature, release a tiny particle that jitters around at random, a drunken walker with no memory. After a short time the walker is somewhere nearby, with a bell-shaped spread of likely positions. The probability of finding it at a given spot is exactly the temperature that the heat process would have deposited there. Heat flow and random walking are two faces of one coin.
This is the bridge between analysis and probability. The heat kernel, the function that says how much heat reaches in time from a unit hot spot at , is the same thing as the probability density for the random walker to travel from to in time . To compute heat, you can instead average over all the wandering paths the particle might take.
A remarkable fact follows. For very short times, the most likely paths are the straight ones, the shortest routes across the surface. So watching where the walker lands after a brief moment secretly measures geodesic distance. The random walker is a surveyor.
Now give the walker something to carry: a little arrow, or a spinning gyroscope, that it must keep pointing as straight as it can while the ground curves underneath. The arrow gets twisted around by the curvature of the surface. This is the picture Bismut built in 1984 for spinors: a random path that drags a spinor along, accumulating curvature, and the average over all such paths reproduces the heat flow of the Dirac operator, the square-root of the Laplacian that governs spin.
Visual Beginner
Picture a sphere with a starting dot at the north and a target dot lower down. From the start, draw a fuzzy cloud of jagged random paths, all wandering but on average drifting toward the target. Overlay one smooth geodesic arc, the great-circle route, in bold. The short-time story is that the cloud of paths presses in tightly around that bold arc: the random walker, to reach a far point quickly, has almost no choice but to follow the shortest road.
Along the bold path, draw a small frame of two perpendicular arrows at several points, each rotated a little from the last. That rotation is parallel transport: the arrows stay as straight as the curved surface permits, yet curvature still turns them. The total turning recorded by the frame, when the carried object is a spinor rather than a plain arrow, is the curvature term in Bismut's formula.
The second panel shows the bell curve. Above the target, plot the height for shrinking : the bump sharpens and its logarithm, scaled by , settles onto the squared distance. The shape of heat at short times is the shape of distance.
Worked example Beginner
Take the simplest curved-free case first, the flat line, so the geometry does not get in the way. A random walker starts at and takes independent random steps. After time its position is spread out in a bell curve centred at with width growing like the square root of . The probability density of landing at is
This is the heat kernel of the flat line, and it is also the law of the random walker. Two readings, one formula.
Now read off the short-time fact by hand. Take the logarithm and multiply by :
As shrinks to zero the first term fades next to the second, and what survives is . On the flat line the distance from to is just , so . The scaled logarithm of the heat kernel converges to minus the squared distance. That is Varadhan's law in its baby form, and it holds on any curved surface once is read as geodesic distance.
To average a quantity over the walker, you weight by this density and add up. In symbols the heat-flowed value of at the start point is the expected value of at the walker's random endpoint, . Computing heat becomes computing an average.
Check your understanding Beginner
Formal definition Intermediate+
Let be a complete Riemannian manifold with Laplace-Beltrami operator (the geometer's non-positive Laplacian, so ). Brownian motion on is the diffusion process whose generator is ; equivalently, is the Eells-Elworthy-Malliavin solution of a stochastic differential equation built by horizontally lifting a Euclidean Brownian motion to the orthonormal frame bundle and projecting down (this construction is developed in 03.02.45). Its transition density with respect to the Riemannian volume is exactly the heat kernel , the fundamental solution of .
The basic identity is the probabilistic representation of the heat semigroup,
valid for bounded measurable , with the expectation over Brownian paths started at .
Varadhan's short-time asymptotic sharpens the small- behaviour:
where is the Riemannian distance. The heat kernel sees the metric.
For a Schrodinger operator with potential , the Feynman-Kac formula inserts an exponential weight along the path:
For a connection Laplacian on a vector bundle with curvature, the scalar weight is replaced by an -valued multiplicative functional, carried by stochastic parallel transport; this is the Feynman-Kac-Ito formula, the subject of the key theorem below.
Key theorem with proof Intermediate+
Theorem (Feynman-Kac-Ito for the Bochner Laplacian). Let be a Hermitian vector bundle with metric connection , and let be the connection (Bochner) Laplacian. Let be a self-adjoint bundle endomorphism. Then the heat semigroup of admits the representation
where is stochastic parallel transport along the Brownian path, and is the multiplicative functional solving the pathwise covariant ordinary differential equation , .
Proof. Fix a smooth section and set , the solution of the bundle heat equation with . Pull back along the Brownian path into the fixed fibre by forming evaluated at time ; concretely consider for . Apply the Ito formula for the manifold diffusion. The second-order Ito term produces precisely acting through the parallel transport (this is the defining feature of the horizontal lift: the generator of is the connection Laplacian), the drift term contributes , and the multiplicative functional contributes through its defining equation. The deterministic parts cancel by the heat equation, leaving . So is a local martingale; under the standard completeness and growth hypotheses it is a genuine martingale. Equating expectations at and gives , which is the claim.
Bridge. This representation builds toward Bismut's probabilistic index theorem and is the foundational reason the curvature of a Clifford module enters the heat kernel as a parallel-transported, path-ordered exponential rather than as an explicit Green's function. This is exactly the bundle-valued upgrade of the scalar Feynman-Kac formula 02.15.04, and it generalises the Lichnerowicz decomposition of 03.09.08 into stochastic form: the connection Laplacian becomes Brownian motion with parallel transport, while the zeroth-order curvature term becomes the multiplicative functional . The central insight is that putting these together turns the small-time limit of the supertrace into a Brownian-bridge integral that localises on constant loops, which appears again in the Master tier as Bismut's alternative to the Getzler rescaling of 03.09.20 — the same index density, reached by conditioning paths instead of rescaling coordinates.
Exercises Intermediate+
Advanced results Master
Bismut's Brownian-bridge construction (1984). Bismut's probabilistic proof of the index theorem starts from the McKean-Singer identity , valid for every , and computes the right side as by probability. By the Feynman-Kac-Ito formula the supertrace is an integral over the diagonal of the heat kernel,
and the on-diagonal kernel is a Brownian-bridge expectation: condition the Brownian motion to return, , and average the multiplicative functional against the bridge measure scaled by the scalar heat kernel . Concretely , where the endpoint conditioning is the probabilistic analogue of evaluating on the diagonal.
As the bridge collapses onto the constant loop at , and the rescaled bridge converges to a Gaussian (Ornstein-Uhlenbeck) process governed by the Riemann curvature at — this is the probabilistic mechanism behind Mehler's formula. The Clifford-module curvature and the scalar curvature term enter linearly in the limit, the prefactor supplies the correct power of , and the supertrace's Clifford-algebra bookkeeping (the Berezin/Patodi cancellation) extracts exactly the top-degree component. The surviving small- density is
the Atiyah-Singer integrand, recovered without any Getzler coordinate rescaling.
Hypoellipticity and smoothness (Malliavin). That is smooth even when the diffusion is degenerate is a theorem of Malliavin calculus: Hormander's bracket (bracket-generating) condition on the generating vector fields forces the Malliavin covariance matrix to be non-degenerate, hence the law of has a smooth density. For Brownian motion on a Riemannian manifold the horizontal vector fields together with one bracket span the tangent space, so the condition holds and is smooth off .
Synthesis. Bismut's Brownian-bridge proof is the foundational reason the index theorem can be read as a statement about random paths: the supertrace of the heat operator localises because conditioned Brownian loops shrink to points, and the curvature they accumulate in the limit is exactly the -genus density. This is exactly the probabilistic mirror of the Getzler rescaling of 03.09.20, where coordinate dilation plays the role that path conditioning plays here; putting these together, the two proofs compute the same local index density by dual mechanisms, and the bridge is the recognition that Mehler's formula for the harmonic oscillator is simultaneously the small-time limit of the rescaled Getzler symbol and the Gaussian limit of the rescaled Brownian bridge. The central insight is that the Lichnerowicz curvature term of 03.09.08, the scalar Feynman-Kac weight 02.15.04, and the manifold Brownian motion 03.02.45 are not three separate ingredients but one multiplicative functional carried along a random path, and this same functional generalises to the family setting where the Bismut superconnection of 03.09.23 organises the parametric heat kernel. The whole construction is dual to the analytic localisation: where Getzler rescales the symbol, Bismut conditions the measure, and the agreement of their limits is the central insight that probability and analysis are computing one geometric object.
Full proof set Master
Proposition (McKean-Singer constancy of the supertrace). Let be a self-adjoint Dirac operator on a -graded Clifford module, with having discrete spectrum. Then is independent of and equals .
Proof. Write with , so preserves the grading and , . For each eigenvalue of , the operator restricts to an isomorphism between the -eigenspaces and : if with then , and has inverse on these spaces. Hence for every . Now expand the supertrace over eigenspaces: $$ \mathrm{Str}(e^{-tD^2}) = \sum_{\lambda \geq 0} e^{-t\lambda}\bigl(\dim V_\lambda^+ - \dim V_\lambda^-\bigr). $$ Every term vanishes by the dimension equality just shown, leaving only : , which is and carries no . So the supertrace is constant in and equal to the index.
Corollary (the short-time route is legitimate). Because does not depend on , its value may be computed in the limit . The probabilistic content of Bismut's theorem is the evaluation of that limit as the Brownian-bridge integral above; the corollary is what licenses replacing the global spectral quantity (an integer) by a purely local small-time density. The same constancy underwrites the Getzler proof of 03.09.20, which is why the two methods, analytic and probabilistic, must agree.
Connections Master
The analytic counterpart of this unit is the heat-kernel proof of the index theorem in
03.09.20. There the small- limit is reached by Getzler's rescaling of the Clifford symbol; here it is reached by conditioning Brownian paths into bridges. Both lean on the McKean-Singer constancy proved above, and both produce the identical integrand — the agreement of a coordinate rescaling with a measure conditioning is the bridge between analysis and probability.The family-index refinement lives in
03.09.23, the Bismut superconnection. The multiplicative functional carried along a single Brownian path generalises to a fibrewise heat kernel organised by the superconnection over a parameter space; Bismut's 1986 family-index proof is the parametric heat-equation cousin of his 1984 probabilistic proof, and the curvature term appearing in here is the fibre-direction piece of the superconnection curvature there.The Lichnerowicz decomposition from
03.09.08is what the multiplicative functional encodes pathwise: the connection Laplacian becomes Brownian motion with stochastic parallel transport, and the zeroth-order curvature endomorphism becomes the path-ordered exponential . The probabilistic heat kernel is the stochastic reading of Lichnerowicz.The diffusion itself, Brownian motion on a manifold via the Eells-Elworthy-Malliavin horizontal lift, is constructed in the co-produced
03.02.45; the scalar weighting mechanism, the Feynman-Kac formula for a Schrodinger semigroup, is developed in the co-produced02.15.04. This unit fuses the two into the bundle-valued Feynman-Kac-Ito formula that Bismut needed for spinors.
Historical & philosophical context Master
The idea that heat flow and random motion are the same dates to Einstein's 1905 account of Brownian motion and to Wiener's 1923 rigorous construction of the path measure, but its decisive turn for geometry came with Kac's 1949 reformulation of the Feynman path integral as an honest probabilistic average, the Feynman-Kac formula [Kac 1949]. Varadhan's 1967 theorem that revealed that the heat kernel encodes the Riemannian distance, fusing analysis and geometry through probability [Varadhan 1967]. The frame-bundle construction of manifold Brownian motion is due to Eells-Elworthy and Malliavin, the latter inventing in 1976 a stochastic calculus of variations precisely to give a probabilistic proof of Hormander's hypoellipticity theorem [Malliavin 1976].
The synthesis was Bismut's. In two 1984 papers he gave a fully probabilistic proof of the Atiyah-Singer index theorem and of the Lefschetz fixed-point formulas, representing the heat kernel of a Dirac operator by the Brownian bridge together with stochastic parallel transport and the Ito-Feynman-Kac functional carrying the Clifford-module curvature [Bismut 1984]. Philosophically the achievement is a statement about where mathematical content lives: an integer invariant of an elliptic operator, defined by global spectral data, is recovered as the short-time limit of an average over random loops that shrink to points. The localisation that Getzler engineered by rescaling coordinates, Bismut obtained by letting probability concentrate measure on constant paths — two faces of the principle that the index is local. That a careful averaging over jagged, nowhere-differentiable Brownian paths reproduces the smooth characteristic-class integrand of Atiyah-Singer is among the more striking demonstrations that probability is a legitimate organ of geometry, not merely a computational convenience.
Bibliography Master
@article{Bismut1984I,
author = {Bismut, Jean-Michel},
title = {The {Atiyah-Singer} theorems: a probabilistic approach. {I}. The index theorem},
journal = {Journal of Functional Analysis},
volume = {57},
number = {1},
pages = {56--99},
year = {1984}
}
@article{Bismut1984II,
author = {Bismut, Jean-Michel},
title = {The {Atiyah-Singer} theorems: a probabilistic approach. {II}. The {Lefschetz} fixed point formulas},
journal = {Journal of Functional Analysis},
volume = {57},
number = {3},
pages = {329--348},
year = {1984}
}
@book{BGV1992,
author = {Berline, Nicole and Getzler, Ezra and Vergne, Mich\`ele},
title = {Heat Kernels and Dirac Operators},
series = {Grundlehren der mathematischen Wissenschaften},
volume = {298},
publisher = {Springer},
year = {1992},
note = {Ch. 11, the probabilistic / Mehler-formula approach}
}
@book{Hsu2002,
author = {Hsu, Elton P.},
title = {Stochastic Analysis on Manifolds},
series = {Graduate Studies in Mathematics},
volume = {38},
publisher = {American Mathematical Society},
year = {2002}
}
@article{Varadhan1967,
author = {Varadhan, S. R. S.},
title = {On the behavior of the fundamental solution of the heat equation with variable coefficients},
journal = {Communications on Pure and Applied Mathematics},
volume = {20},
pages = {431--455},
year = {1967}
}
@incollection{Malliavin1976,
author = {Malliavin, Paul},
title = {Stochastic calculus of variation and hypoelliptic operators},
booktitle = {Proceedings of the International Symposium on Stochastic Differential Equations (Kyoto 1976)},
pages = {195--263},
publisher = {Wiley},
year = {1978}
}
@article{Kac1949,
author = {Kac, Mark},
title = {On distributions of certain {Wiener} functionals},
journal = {Transactions of the American Mathematical Society},
volume = {65},
pages = {1--13},
year = {1949}
}