37.07.07 · probability / 07-large-deviations

Varadhan's Integral Lemma and the Laplace Principle


Anchor (Master): Dembo & Zeitouni 1998 *Large Deviations Techniques and Applications* 2nd ed. (Springer) §4.3-§4.4 (Varadhan, the Laplace principle, Bryc's theorem, the moment condition 4.3.2); Dupuis & Ellis 1997 *A Weak Convergence Approach to the Theory of Large Deviations* (Wiley) Ch. 1, §1.2 (the Laplace principle as primitive); Varadhan 1966 *Asymptotic probabilities and differential equations* (CPAM 19); Varadhan 1984 *Large Deviations and Applications* (SIAM CBMS-NSF 46) §2-§3

Intuition Beginner

Suppose you want to add up an enormous number of contributions, where each contribution is an exponential $e^{n \times (\text{gain at that point})}$ and $n$ is huge. The exponential is so steep that the single largest term swamps all the others put together. So instead of doing the whole sum, you can just hunt for the one point where the gain is largest and read off its value. This shortcut, replacing a giant exponential sum or integral by its peak, is the Laplace method, and it is centuries old.

Now layer in randomness. The "points" are the possible values of a random average, and they are not all equally available: a value far from typical is itself exponentially rare, carrying a cost. From the previous units we already know that cost — it is the rate function. So when we weight an exponential reward $e^{n F (x)}$ by how often the random average actually visits $x$ , two exponential forces compete: the reward $F$ pulling toward high-payoff points, and the cost (the rate function $I$ ) pulling back toward typical ones.

Varadhan's lemma is the clean statement of who wins. The total exponential integral grows at the rate set by the single best compromise point — the place where reward minus cost, $F (x) - I (x)$ , is largest. Everything else is exponentially negligible. In one line: averaging an exponential reward over a large-deviation system is the same, on the exponential scale, as solving a tug-of-war between payoff and rarity.

A picture helps. Think of the reward as money offered at each location and the rate function as the admission price to stand there. You will not necessarily go where the money is highest, nor where it is cheapest to stand, but where your net take — money minus admission — is best. The growth rate of your total expected winnings is exactly that best net take.

This pairing of a reward against a cost, picking the best difference, is the same Legendre-Fenchel move met in the convex-duality unit. Varadhan's lemma is that move made into a limit theorem, and run backwards it lets you recover the cost function from the rewards — which is the Laplace principle and its inverse.

Visual Beginner

Figure: two curves over a horizontal axis of values $x$ . The first curve is the reward $F (x)$ , a gently varying bump. The second is the cost $I (x)$ , the familiar valley dipping to zero at the typical value. Below them a third curve plots the difference $F (x) - I (x)$ . A vertical marker sits at the peak of this difference curve — the winning compromise point $x_{⋆}$ — and a caption notes that the exponential integral grows at the rate equal to the height of that peak. An inset shows that if the reward $F$ is flat (constant), the peak of $F - I$ sits at the bottom of the valley, recovering the typical value.

 value
   F(x)   ___                         reward: a gentle bump
        _/   \_

   I(x)  \                       /     cost: the rate-function valley
          \                     /      (zero at the typical value)
           \___       _________/

 F(x)-I(x)      __                     net = reward minus cost
              _/  \_
             /  ^   \                  peak at x_star  =  best compromise
            /   |    \
 ----------+---x_star-+------------- x
   growth rate of the integral  =  height of this peak  =  max (F - I)

Worked example Beginner

Return to the fair-coin average of 37.07.01, whose cost is $I (x) = x \log (2 x) + (1 - x) \log (2 (1 - x))$ for a fraction $x$ between $0$ and $1$ , with $I (\frac{1}{2}) = 0$ . Offer a reward $F (x) = x$ : you are paid the fraction of heads itself. We ask at what rate the average payoff $E e^{n F (\bar{X}_{n})} = E e^{n \bar{X}_{n}}$ grows.

Step 1. Set up the tug-of-war. Varadhan's lemma says the growth rate is the largest value of $F (x) - I (x) = x - I (x)$ over $x$ in $[0, 1]$ . Reward rises to the right; cost rises as we leave $x = \frac{1}{2}$ . The best compromise is somewhere past one half.

Step 2. Tabulate the net $x - I (x)$ . Using the values of $I$ computed in the prerequisite unit:

$x$	$I (x)$	net $x - I (x)$
$0.50$	$0.000$	$0.500$
$0.60$	$0.020$	$0.580$
$0.70$	$0.082$	$0.618$
$0.73$	$0.110$	$0.620$
$0.80$	$0.193$	$0.607$

Step 3. Read off the winner. The net take peaks near $x_{⋆} \approx 0.73$ at about $0.620$ . So $\frac{1}{n} \log E e^{n \bar{X}_{n}} \to 0.620$ as $n$ grows; the average exponential payoff grows like $e^{0.620 n}$ .

Step 4. Sanity check the two pulls. A pure greed strategy would sit at $x = 1$ (all heads), but $I (1) = \log 2 \approx 0.693$ , giving net $1 - 0.693 = 0.307$ , which is worse because all-heads is far too rare. A pure caution strategy sits at $x = \frac{1}{2}$ , net $0.5$ , also worse because it leaves reward on the table. The optimum balances them.

What this tells us. The growth rate of an exponential average is neither the maximum reward nor the typical value, but the best net of reward minus rarity-cost. That single number, $\max_{x} (F (x) - I (x))$ , is the whole content of Varadhan's lemma in this example.
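The table can be checked numerically. The sketch below is illustrative only: the helper names `rate`, `net_peak` and `exact_growth` are not from the source. It maximises $x - I (x)$ on a grid and, separately, evaluates $\frac{1}{n} \log E e^{n \bar{X}_{n}}$ directly from the binomial weights. For the fair coin the binomial theorem gives $E e^{n \bar{X}_{n}} = ((1 + e) / 2)^{n}$ , so both numbers should sit at $\log ((1 + e) / 2) \approx 0.6201$ .

```python
import math

def rate(x: float) -> float:
    """Fair-coin rate function I(x) = x log(2x) + (1-x) log(2(1-x)), with 0 log 0 = 0."""
    def term(t: float) -> float:
        return 0.0 if t == 0.0 else t * math.log(2.0 * t)
    return term(x) + term(1.0 - x)

def net_peak(grid_size: int = 100_001) -> tuple[float, float]:
    """Maximise x - I(x) over a uniform grid on [0, 1]; returns (argmax, max)."""
    best_x, best_val = 0.0, -math.inf
    for k in range(grid_size):
        x = k / (grid_size - 1)
        val = x - rate(x)
        if val > best_val:
            best_x, best_val = x, val
    return best_x, best_val

def exact_growth(n: int) -> float:
    """(1/n) log E exp(n * Xbar_n) for n fair flips, via log-sum-exp over head counts."""
    # log of C(n, k) * 2^{-n} * e^{k} for each head count k
    terms = [
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        - n * math.log(2.0) + k
        for k in range(n + 1)
    ]
    m = max(terms)
    return (m + math.log(sum(math.exp(t - m) for t in terms))) / n

x_star, peak = net_peak()
print(f"grid argmax x* = {x_star:.4f}, peak of x - I(x) = {peak:.4f}")
print(f"(1/n) log E e^(n Xbar_n) at n = 1000: {exact_growth(1000):.4f}")
```

The grid search is the variational side of the lemma and the binomial sum is the integral side; agreement between them is the content of Step 3.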


Formal definition Intermediate+

Throughout, $X$ is a topological space with Borel $σ$ -algebra, and $\{μ_{ε}\}_{ε > 0}$ is a family of Borel probability measures on $X$ satisfying the large deviation principle 37.07.01 at speed $a_{ε} \to 0$ with a good rate function $I : X \to [0, \infty]$ . For a measurable $F : X \to R$ write the scaled cumulant of $F$ $$ \Lambda_\varepsilon(F) := a_\varepsilon \log \int_{\mathcal{X}} e^{F(x)/a_\varepsilon}\,\mu_\varepsilon(dx), $$ the object whose $ε \to 0$ behaviour the theory describes.

Definition (the moment / tail condition). A continuous $F : X \to R$ satisfies the Varadhan moment condition if for some $γ > 1$ $$ \limsup_{\varepsilon\to0} a_\varepsilon \log \int_{\mathcal{X}} e^{\gamma F(x)/a_\varepsilon}\,\mu_\varepsilon(dx) \;<\; +\infty. $$ This is the large-deviation surrogate for uniform integrability: it forbids the integral from being dominated by mass on regions where $F$ is large but the LDP control is weak ^{[Dembo & Zeitouni §4.3]}. When $F$ is bounded above the condition holds automatically, because then $\int e^{γ F / a_{ε}} d μ_{ε} \leq e^{γ (\sup F) / a_{ε}}$ .

Definition (the Laplace functional and the variational value). For continuous $F$ define the Laplace functional $L_{ε} (F) := Λ_{ε} (F)$ and the variational value $$ \Lambda(F) := \sup_{x\in\mathcal{X}}\big(F(x) - I(x)\big), $$ the Legendre-type pairing of the gain $F$ against the cost $I$ already met in 37.07.03. Goodness of $I$ guarantees that, for $F$ bounded above, the supremum is attained on the compact sublevel sets of $I$ .

Definition (the Laplace principle). The family $\{μ_{ε}\}$ satisfies the Laplace principle at speed $a_{ε}$ with rate function $I$ if for every bounded continuous $F \in C_{b} (X)$ $$ \lim_{\varepsilon\to0} a_\varepsilon\log\int_{\mathcal{X}} e^{F(x)/a_\varepsilon}\,\mu_\varepsilon(dx) \;=\; \sup_{x\in\mathcal{X}}\big(F(x) - I(x)\big). $$ Equivalently, the Laplace principle is the conjunction of a Laplace upper bound ( $\limsup_{ε} Λ_{ε} (F) \leq Λ (F)$ for all $F \in C_{b}$ ) and a Laplace lower bound ( $\liminf_{ε} Λ_{ε} (F) \geq Λ (F)$ for all $F \in C_{b}$ ). The two bounds mirror, on the integral side, the closed-set upper bound and open-set lower bound of the LDP.

Varadhan's integral lemma is the assertion that the LDP implies the Laplace limit for every continuous $F$ meeting the moment condition; the converse implication, that the Laplace principle implies the LDP (under exponential tightness), is Bryc's inverse lemma. The two together say the LDP and the Laplace principle are interchangeable descriptions of the same asymptotic data.

Counterexamples to common slips

The moment condition is not removable for unbounded $F$ . On $X = R$ let $μ_{ε} = (1 - p_{ε}) δ_{0} + p_{ε} δ_{1/ ε}$ with $p_{ε} = e^{- 1/ ε}$ and $a_{ε} = ε$ ; this family satisfies the weak LDP (upper bound on compact sets) with rate $I (0) = 0$ , $I (x) = \infty$ for $x \neq 0$ , and choosing $p_{ε} = e^{- 1/ ε^{2}}$ instead gives the full LDP with the same good rate function and the same blow-up. For $F (x) = 2 x$ the atom at $1/ ε$ contributes $p_{ε} e^{F (1/ ε) / ε} = e^{- 1/ ε} e^{(2/ ε) / ε}$ , which blows the integral up far past $\sup_{x} (F - I) = 0$ . The moment condition fails here (no $γ > 1$ controls the $γ F$ integral), and so does Varadhan's conclusion.
Continuity of $F$ is used, not just measurability. Both LDP bounds are stated through interiors and closures; the lower bound needs $\{F > c\}$ open and the upper bound approximates $F$ by simple functions on closed level sets. A discontinuous $F$ with a jump across the minimiser of $I$ can make the $\liminf$ and $\limsup$ disagree, so the variational identity can fail at exactly the optimising point.
$\sup (F - I)$ is a supremum, not the reward at the cost-minimiser. Evaluating $F$ at the typical point $\arg\min I$ gives only a lower bound $F (\arg\min I) - 0$ ; the true value can be strictly larger because a costlier point may carry a much larger reward. Confusing "where the system usually is" with "where the integral concentrates" is the central error the lemma corrects.
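The blow-up in the first item can be made concrete. The following is a minimal sketch, assuming the two-atom family exactly as written there ( $p_{ε} = e^{- 1/ ε}$ , $a_{ε} = ε$ , $F (x) = 2 x$ ); the function name `scaled_log_integral` is hypothetical. It evaluates $ε \log \int e^{F / ε} d μ_{ε}$ in log space, since $e^{2/ ε^{2}}$ overflows a float almost immediately.

```python
import math

def scaled_log_integral(eps: float, slope: float = 2.0) -> float:
    """eps * log of the integral of exp(F/eps) against mu_eps, for F(x) = slope * x.

    mu_eps = (1 - p) * delta_0 + p * delta_{1/eps} with p = exp(-1/eps).
    """
    log_p = -1.0 / eps
    log_origin = math.log1p(-math.exp(log_p))     # log(1 - p): atom at 0, where F = 0
    log_far = log_p + slope * (1.0 / eps) / eps   # log(p) + F(1/eps)/eps: atom at 1/eps
    m = max(log_origin, log_far)
    # log-sum-exp of the two atoms; the smaller term may underflow harmlessly to 0
    return eps * (m + math.log(math.exp(log_origin - m) + math.exp(log_far - m)))

for eps in (1.0, 0.5, 0.1, 0.01):
    print(f"eps = {eps:<5} eps * log(integral) = {scaled_log_integral(eps):.3f}")
```

The far atom dominates, so the printed values track $- 1 + 2/ ε$ and diverge as $ε \to 0$ , whereas $\sup_{x} (F - I) = 0$ .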

Key theorem with proof Intermediate+

We prove Varadhan's integral lemma in the form most used in practice and isolate the two halves, since they have different hypotheses: the lower bound needs only the LDP lower bound and continuity, while the upper bound needs goodness and the moment condition.

Theorem (Varadhan's integral lemma). Let $\{μ_{ε}\}$ satisfy the LDP at speed $a_{ε}$ with good rate function $I$ , and let $F : X \to R$ be continuous and satisfy the moment condition $\limsup_{ε} a_{ε} \log \int e^{γ F / a_{ε}} d μ_{ε} < \infty$ for some $γ > 1$ . Then $$ \lim_{\varepsilon\to0} a_\varepsilon\log\int_{\mathcal{X}} e^{F(x)/a_\varepsilon}\,\mu_\varepsilon(dx) \;=\; \sup_{x\in\mathcal{X}}\big(F(x) - I(x)\big). $$

Proof of the lower bound $\liminf_{ε} Λ_{ε} (F) \geq \sup_{x} (F (x) - I (x))$ . Fix $x_{0} \in X$ with $I (x_{0}) < \infty$ and $F (x_{0}) > - \infty$ ; it suffices to show $\liminf_{ε} Λ_{ε} (F) \geq F (x_{0}) - I (x_{0})$ , then take the supremum over $x_{0}$ . By continuity of $F$ , for any $δ > 0$ there is an open neighbourhood $G ∋ x_{0}$ with $F (x) > F (x_{0}) - δ$ on $G$ . Restricting the integral to $G$ and using positivity of the integrand, $$ \int e^{F/a_\varepsilon}\,d\mu_\varepsilon \;\ge\; \int_G e^{F/a_\varepsilon}\,d\mu_\varepsilon \;\ge\; e^{(F(x_0)-\delta)/a_\varepsilon}\,\mu_\varepsilon(G). $$ Taking $a_{ε} \log$ and using the LDP open-set lower bound $\liminf_{ε} a_{ε} \log μ_{ε} (G) \geq - \inf_{G} I \geq - I (x_{0})$ , $$ \liminf_\varepsilon \Lambda_\varepsilon(F) \;\ge\; (F(x_0)-\delta) - I(x_0). $$ Letting $δ ↓ 0$ gives $\liminf_{ε} Λ_{ε} (F) \geq F (x_{0}) - I (x_{0})$ , and the supremum over $x_{0}$ completes the lower bound. (No moment condition or goodness was used.)

Proof of the upper bound $\limsup_{ε} Λ_{ε} (F) \leq \sup_{x} (F (x) - I (x))$ . Write $V := \sup_{x} (F (x) - I (x))$ , finite because the lower bound already gives $V \leq \liminf Λ_{ε} (F)$ and the moment condition bounds the $\limsup$ . We first treat $F$ bounded above, then remove the bound by the moment condition.

Step 1 (bounded above). Suppose $F \leq M < \infty$ . Fix $η > 0$ . For each $x$ , $F (x) - I (x) \leq V$ , so $I (x) \geq F (x) - V$ . By continuity of $F$ and lower semicontinuity of $I$ , each point $x$ with $I (x) < \infty$ has an open neighbourhood $G_{x}$ on which $F < F (x) + η$ and $I > I (x) - η$ ; by regularity of the space, $G_{x}$ may be shrunk so that its closure satisfies $\sup_{\overline{G_{x}}} F \leq F (x) + η$ and $\inf_{\overline{G_{x}}} I \geq I (x) - η$ . The sublevel set $Ψ_{I} (M - V + 1) = \{I \leq M - V + 1\}$ is compact (goodness); cover it by finitely many such $G_{x_{1}}, \dots, G_{x_{k}}$ . The complement of $⋃_{j} G_{x_{j}}$ is a closed set on which the rate function exceeds $M - V + 1$ , so by the closed-set upper bound it contributes at most $e^{M / a_{ε}} μ_{ε} (\text{rest})$ , with $\limsup_{ε} a_{ε} \log$ value $\leq M - (M - V + 1) = V - 1 < V$ . On each $\overline{G_{x_{j}}}$ , $$ \limsup_\varepsilon a_\varepsilon\log\int_{\overline{G_{x_j}}}e^{F/a_\varepsilon}\,d\mu_\varepsilon \le \sup_{\overline{G_{x_j}}}F + \limsup_\varepsilon a_\varepsilon\log\mu_\varepsilon(\overline{G_{x_j}}) \le (F(x_j)+\eta) - \inf_{\overline{G_{x_j}}}I \le (F(x_j)+\eta)-(I(x_j)-\eta)\le V+2\eta. $$ Combining the finitely many pieces by the largest-term rule (the $a_{ε} \log$ of a finite sum is asymptotically the max of the pieces, since $a_{ε} \log (k + 1) \to 0$ ), $\limsup_{ε} Λ_{ε} (F) \leq \max \{V + 2 η, V - 1\} = V + 2 η$ . Let $η ↓ 0$ .

Step 2 (remove the bound, using the moment condition). For general $F$ apply Step 1 to $F \land M$ , which is bounded above and still continuous, giving $\limsup_{ε} Λ_{ε} (F \land M) \leq \sup_{x} ((F \land M) (x) - I (x)) \leq V$ . On the set $\{F \geq M\}$ Hölder/Chebyshev with the moment exponent controls the tail: writing $C := \limsup_{ε} a_{ε} \log \int e^{γ F / a_{ε}} d μ_{ε} < \infty$ , the inequality $\int_{\{F \geq M\}} e^{F / a_{ε}} d μ_{ε} \leq e^{- (γ - 1) M / a_{ε}} \int e^{γ F / a_{ε}} d μ_{ε}$ (since on $\{F \geq M\}$ , $e^{F / a_{ε}} = e^{γ F / a_{ε}} e^{- (γ - 1) F / a_{ε}} \leq e^{γ F / a_{ε}} e^{- (γ - 1) M / a_{ε}}$ ) gives $\limsup_{ε} a_{ε} \log \int_{\{F \geq M\}} e^{F / a_{ε}} d μ_{ε} \leq - (γ - 1) M + C$ . Splitting $\int e^{F / a_{ε}} d μ_{ε} = \int e^{(F \land M) / a_{ε}} d μ_{ε} + \int_{\{F > M\}} (e^{F / a_{ε}} - e^{M / a_{ε}}) d μ_{ε}$ and applying the largest-term rule, $\limsup_{ε} Λ_{ε} (F) \leq \max \{V, - (γ - 1) M + C\}$ . Taking $M \to \infty$ drives the tail term to $- \infty$ , leaving $\limsup_{ε} Λ_{ε} (F) \leq V$ . With the lower bound, the limit equals $V$ . $□$
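A numerical illustration of the theorem, as a sketch under stated assumptions rather than part of the proof: take the Gaussian family $μ_{ε} = N (0, ε)$ on $R$ , which satisfies the LDP at speed $a_{ε} = ε$ with good rate function $I (x) = x^{2} / 2$ , and the bounded-above continuous reward $F (x) = - |x - 1|$ , for which $\sup_{x} (F (x) - I (x)) = - \frac{1}{2}$ , attained at $x = 1$ . The names `scaled_cumulant` and `reward`, the truncation to $[- 8, 8]$ and the midpoint rule are all choices made for this example.

```python
import math

def scaled_cumulant(F, eps: float, lo: float = -8.0, hi: float = 8.0,
                    steps: int = 160_000) -> float:
    """eps * log of the integral of exp(F(x)/eps) against N(0, eps).

    Midpoint rule on [lo, hi], carried out in log space to avoid underflow.
    """
    dx = (hi - lo) / steps
    log_norm = -0.5 * math.log(2.0 * math.pi * eps)   # log of the Gaussian normaliser
    logs = []
    for k in range(steps):
        x = lo + (k + 0.5) * dx
        logs.append((F(x) - 0.5 * x * x) / eps + log_norm)
    m = max(logs)
    return eps * (m + math.log(sum(math.exp(v - m) for v in logs)) + math.log(dx))

def reward(x: float) -> float:
    """Continuous, bounded above; sup_x (reward(x) - x^2/2) = -1/2 at x = 1."""
    return -abs(x - 1.0)

for eps in (0.5, 0.1, 0.02, 0.005):
    print(f"eps = {eps:<6} Lambda_eps(F) = {scaled_cumulant(reward, eps):+.4f}")
```

The printed values should approach $- 0.5$ as $ε$ shrinks; the residual gap is the subexponential prefactor, which $a_{ε} \log$ suppresses only at rate $O (ε)$ .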

Bridge. This theorem builds toward every concrete evaluation of exponential asymptotics in the chapter and appears again in Bryc's inverse, Sanov-type integral computations, and the Freidlin-Wentzell exit-cost formulas. This is exactly the rigorous Laplace method on the LDP scale: the integral concentrates at the point realising $sup_{x} (F - I)$ , the central insight being that the rate function plays the role of the phase in the classical Laplace/saddle-point integral $\int e^{n f}$ , while the LDP lower and upper bounds supply the two-sided pinch. The split into a continuity-only lower bound and a goodness-plus-moment upper bound is exactly the open/closed asymmetry of 37.07.01 transported to integrals, and putting these together the variational value $sup_{x} (F - I)$ is dual to the Legendre-Fenchel pairing of 37.07.03: with $F$ linear, $sup_{x} (⟨ λ, x ⟩ - I (x)) = I^{*} (λ)$ , so Varadhan's lemma generalises the cumulant-conjugate identity from linear tilts to arbitrary continuous gains.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Prove the Laplace lower bound is equivalent to the LDP open-set lower bound by deriving the latter from the former. (Assume the Laplace principle holds for all bounded continuous $F$ .)

Hint

For an open $G$ and $x_{0} \in G$ , approximate the indicator of $G$ from below by continuous bounded functions $F_{n}$ that are $0$ on $G^{c}$ and large near $x_{0}$ .

Answer

Fix open $G$ and $x_{0} \in G$ ; if $I (x_{0}) = \infty$ there is nothing to prove, so assume $I (x_{0}) < \infty$ and take $M > I (x_{0})$ . Choose bounded continuous $F_{M}$ with $F_{M} (x_{0}) = M$ , $0 \leq F_{M} \leq M$ , and $F_{M} = 0$ off $G$ (Urysohn). Then $\int e^{F_{M} / a_{ε}} d μ_{ε} \leq μ_{ε} (G^{c}) \cdot 1 + e^{M / a_{ε}} μ_{ε} (G) \leq 1 + e^{M / a_{ε}} μ_{ε} (G)$ , so $Λ_{ε} (F_{M}) \leq a_{ε} \log (1 + e^{M / a_{ε}} μ_{ε} (G))$ . As $ε \to 0$ the Laplace lower bound gives $\liminf_{ε} Λ_{ε} (F_{M}) \geq \sup_{x} (F_{M} - I) \geq M - I (x_{0}) > 0$ . The right side is $\leq \max \{0, M + a_{ε} \log μ_{ε} (G)\} + a_{ε} \log 2$ , so $M - I (x_{0}) \leq \max \{0, M + \liminf_{ε} a_{ε} \log μ_{ε} (G)\}$ ; because the left side is strictly positive, the maximum must be its second entry, i.e. $\liminf_{ε} a_{ε} \log μ_{ε} (G) \geq - I (x_{0})$ . Optimising over $x_{0} \in G$ yields $\geq - \inf_{G} I$ , the LDP lower bound.

Exercise 4 (medium, symbolic).

Let $\{μ_{ε}\}$ satisfy the LDP with good rate $I$ . Show that for bounded continuous $F$ and $G$ , the variational value commutes with maxima: $\sup_{x} ((F \lor G) (x) - I (x)) = \max \{\sup_{x} (F - I), \sup_{x} (G - I)\}$ , and interpret it through the integrals.

Hint

$(F \lor G) - I = max (F - I, G - I)$ pointwise; the supremum of a pointwise max is the max of the suprema.

Answer

Pointwise $(F \lor G) (x) - I (x) = \max (F (x) - I (x), G (x) - I (x))$ , and $\sup_{x} \max (h_{1} (x), h_{2} (x)) = \max (\sup_{x} h_{1}, \sup_{x} h_{2})$ for any functions. Hence the variational value of $F \lor G$ is the max of the two values. On the integral side $\int e^{(F \lor G) / a_{ε}} d μ_{ε}$ lies between $\max \{\int e^{F / a_{ε}} d μ_{ε}, \int e^{G / a_{ε}} d μ_{ε}\}$ and their sum, so by the largest-term rule $Λ_{ε} (F \lor G) \to \max \{Λ (F), Λ (G)\}$ , consistent with Varadhan applied to $F \lor G$ (continuous). This is the integral-side shadow of the LDP largest-term/finite-union rule.

Exercise 6 (hard, symbolic).

Prove the Laplace upper bound $\limsup_{ε} Λ_{ε} (F) \leq \sup_{x} (F - I)$ for $F$ bounded continuous directly from the LDP closed-set upper bound, without the moment condition, using only that $F$ is bounded above.

Hint

Discretise the range of $F$ into finitely many bands $\{c_{j} \leq F < c_{j + 1}\}$ whose closures are controlled by the upper bound.

Answer

Let $|F| \leq M$ and fix $δ > 0$ . Cover the range $[- M, M]$ by the bands $B_{j} = \{x : (j - 1) δ < F (x) \leq j δ\}$ for the finitely many integers $j$ with $- M / δ \leq j < M / δ + 1$ . Each $\overline{B_{j}} \subseteq \{F \geq (j - 1) δ\}$ , a closed set since $F$ is continuous. On $B_{j}$ , $e^{F / a_{ε}} \leq e^{j δ / a_{ε}}$ , so $$ \int e^{F/a_\varepsilon}\,d\mu_\varepsilon \le \sum_j e^{j\delta/a_\varepsilon}\mu_\varepsilon(\overline{B_j}). $$ By the LDP closed-set upper bound, $\limsup_{ε} a_{ε} \log μ_{ε} (\overline{B_{j}}) \leq - \inf_{\overline{B_{j}}} I \leq - \inf_{\{F \geq (j - 1) δ\}} I$ . By the largest-term rule (finitely many bands), $\limsup_{ε} Λ_{ε} (F) \leq \max_{j} (j δ - \inf_{\{F \geq (j - 1) δ\}} I)$ . For each $j$ with finite infimum, goodness gives $x_{j}$ with $F (x_{j}) \geq (j - 1) δ$ attaining it, so $j δ - \inf_{\{F \geq (j - 1) δ\}} I \leq F (x_{j}) + δ - I (x_{j}) \leq \sup_{x} (F - I) + δ$ . Hence $\limsup_{ε} Λ_{ε} (F) \leq \sup_{x} (F - I) + δ$ ; let $δ ↓ 0$ . Boundedness of $F$ kept the number of bands finite. If $F$ is only bounded above, lump $\{F \leq - L\}$ into one extra band, which contributes at most $- L$ on the $a_{ε} \log$ scale, and let $L \to \infty$ at the end; this plays the role that the moment condition plays in Step 2 of the Key theorem for unbounded $F$ .

Exercise 7 (hard, symbolic).

State and prove the tilted-measure corollary: if $F$ is bounded continuous and $x_{⋆}$ is the unique maximiser of $F - I$ , then the tilted measures $d μ_{ε}^{F} \propto e^{F / a_{ε}} d μ_{ε}$ concentrate at $x_{⋆}$ , i.e. $μ_{ε}^{F} (G) \to 1$ for every open $G ∋ x_{⋆}$ .

Hint

Compute $a_{ε} \log μ_{ε}^{F} (G^{c})$ by applying Varadhan separately to numerator and denominator, restricting the numerator to the closed set $G^{c}$ .

Answer

Write $μ_{ε}^{F} (G^{c}) = \frac{\int_{G^{c}} e^{F / a_{ε}} d μ_{ε}}{\int e^{F / a_{ε}} d μ_{ε}}$ . The denominator has $a_{ε} \log \to V := \sup_{x} (F - I) = F (x_{⋆}) - I (x_{⋆})$ by Varadhan. For the numerator, $G^{c}$ is closed and $x_{⋆} \notin G^{c}$ ; the upper-bound argument restricted to $G^{c}$ (Exercise 6 with each band intersected with $G^{c}$ , using the closed-set bound on closed subsets of $G^{c}$ ) gives $\limsup_{ε} a_{ε} \log \int_{G^{c}} e^{F / a_{ε}} d μ_{ε} \leq \sup_{x \in G^{c}} (F (x) - I (x)) =: V' < V$ . The inequality is strict because goodness makes the supremum over the closed set $G^{c}$ attained, and a point of $G^{c}$ attaining the value $V$ would be a second maximiser, contradicting uniqueness of $x_{⋆}$ . Therefore $\limsup_{ε} a_{ε} \log μ_{ε}^{F} (G^{c}) \leq V' - V < 0$ , so $μ_{ε}^{F} (G^{c}) \to 0$ exponentially and $μ_{ε}^{F} (G) \to 1$ . The tilt by $e^{F / a_{ε}}$ relocates the concentration point from $\arg\min I$ to $\arg\max (F - I)$ .
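The concentration can be watched on the fair-coin example from the Beginner section, as an illustrative sketch (the function name `tilted_mass_near` and the window radius $0.05$ are choices made here). With $F (x) = x$ and speed $1/ n$ , the tilted law of the head fraction has weights proportional to $\binom{n}{k} e^{k}$ , and the maximiser of $x - I (x)$ is $x_{⋆} = e / (1 + e) \approx 0.731$ , obtained by setting the derivative $1 - \log (x / (1 - x))$ to zero.

```python
import math

def tilted_mass_near(n: int, centre: float, radius: float) -> float:
    """Mass the tilted law of Xbar_n puts on |x - centre| < radius.

    Base law: n fair coin flips. Tilt: weight exp(n * F(x)) with F(x) = x,
    i.e. exp(k) for k heads.
    """
    # log of C(n, k) * e^k; the common factor 2^{-n} cancels on normalising
    log_w = [
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1) + k
        for k in range(n + 1)
    ]
    m = max(log_w)
    w = [math.exp(v - m) for v in log_w]
    total = sum(w)
    near = sum(wk for k, wk in enumerate(w) if abs(k / n - centre) < radius)
    return near / total

x_star = math.e / (1.0 + math.e)   # maximiser of x - I(x) for the fair coin
for n in (10, 100, 1000, 10000):
    mass = tilted_mass_near(n, x_star, 0.05)
    print(f"n = {n:<6} tilted mass within 0.05 of x*: {mass:.6f}")
```

The mass inside the window should climb towards $1$ as $n$ grows, while the untilted law concentrates at $\frac{1}{2}$ , outside the window.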

Exercise 8 (hard, symbolic).

Deduce the LDP closed-set upper bound from the Laplace upper bound, completing (with Exercise 3) the proof that the Laplace principle implies the LDP for bounded continuous test functions. (You may assume exponential tightness so closed sets reduce to compact ones.)

Hint

For compact $K$ and $α < \inf_{K} I$ , build a bounded continuous $F$ that is large on $K$ and $0$ far from $K$ , then read the Laplace limit.

Answer

By exponential tightness it suffices to bound compact $K$ . Let $m = \inf_{K} I$ , assumed finite (if $m = \infty$ , run the argument with an arbitrary finite level $α$ in place of $m - η$ and let $α \to \infty$ ), and fix $η > 0$ and $N > 0$ . Choose bounded continuous $F$ with $F = N$ on $K$ , $0 \leq F \leq N$ , and $F$ supported in a neighbourhood $U \supseteq K$ with $\inf_{U} I \geq m - η$ (possible by lsc of $I$ and compactness of $K$ ). Then $\int e^{F / a_{ε}} d μ_{ε} \geq e^{N / a_{ε}} μ_{ε} (K)$ , so $N + \limsup_{ε} a_{ε} \log μ_{ε} (K) \leq \limsup_{ε} Λ_{ε} (F)$ . The Laplace upper bound gives $\limsup_{ε} Λ_{ε} (F) \leq \sup_{x} (F - I) \leq \max \{N - \inf_{U} I, 0 - \inf I\} \leq \max \{N - (m - η), 0\} = N - m + η$ for $N$ large. Hence $\limsup_{ε} a_{ε} \log μ_{ε} (K) \leq - m + η$ ; let $η ↓ 0$ to get $\leq - \inf_{K} I$ . Exponential tightness upgrades compact to closed, recovering the full LDP upper bound. With Exercise 3, the Laplace principle is equivalent to the LDP.

Advanced results Master

Bryc's inverse lemma: from the Laplace principle to the LDP

The implication of the Key theorem reverses. Bryc's lemma ^{[Bryc 1990]} states: if $\{μ_{ε}\}$ is exponentially tight and for every $F \in C_{b} (X)$ the limit $$ \Lambda(F) := \lim_{\varepsilon\to0} a_\varepsilon\log\int e^{F/a_\varepsilon}\,d\mu_\varepsilon $$ exists, then $\{μ_{ε}\}$ satisfies the LDP with the good rate function $$ I(x) = \sup_{F\in C_b(\mathcal{X})}\big(F(x) - \Lambda(F)\big), $$ the Legendre-Fenchel transform of the functional $Λ$ over the Banach space $C_{b} (X)$ . The proof mirrors Exercises 3 and 8: the lower bound follows by feeding indicator-approximating test functions into the assumed Laplace limit, and the upper bound by feeding compact-supported bumps; exponential tightness reduces closed to compact. Thus the LDP, the Laplace principle, and the existence of all bounded-continuous exponential-integral limits are three packagings of one datum, and the rate function is recovered as a conjugate: the abstract, function-space form of the cumulant-conjugate identity $I = Λ^{*}$ of 37.07.03.

The Laplace principle as a primitive: the Dupuis-Ellis programme

Dupuis and Ellis ^{[Dupuis & Ellis 1997]} invert the logical order, defining large deviations through the Laplace principle and deriving the LDP as a consequence. The payoff is a variational representation of the prelimit functional itself: for many models one has the exact identity $$ -a_\varepsilon\log\int e^{-F/a_\varepsilon}\,d\mu_\varepsilon = \inf_{\nu}\Big(\mathbb{E}_\nu[F] + a_\varepsilon\, H(\nu\,\|\,\mu_\varepsilon)\Big), $$ the Donsker-Varadhan/Gibbs variational formula with relative entropy $H$ 37.07.06. Passing to the limit, the entropy term becomes the rate function and the infimum becomes $\inf_{x} (F (x) + I (x))$ , so the weak-convergence analysis of the controlled representation yields the Laplace limit and hence the LDP. This route makes Varadhan's lemma not a corollary but the organising definition, and turns large-deviation proofs into stochastic-control problems.
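On a finite state space the variational formula can be verified by hand, and the sketch below does so numerically. Everything specific in it is hypothetical: the four-point cost vector `F`, the reference measure `mu`, the scale `a`, and the helper names. The infimum is attained at the Gibbs measure $ν_{i} \propto μ_{i} e^{- F_{i} / a}$ , and any other probability vector gives a larger penalised cost.

```python
import math
import random

def free_energy(F, mu, a):
    """-a * log sum_i mu_i exp(-F_i / a): the left side on a finite space."""
    logs = [math.log(m) - f / a for f, m in zip(F, mu)]
    top = max(logs)
    return -a * (top + math.log(sum(math.exp(v - top) for v in logs)))

def penalised_cost(nu, F, mu, a):
    """E_nu[F] + a * H(nu | mu), with relative entropy H and 0 log 0 = 0."""
    mean = sum(n * f for n, f in zip(nu, F))
    entropy = sum(n * math.log(n / m) for n, m in zip(nu, mu) if n > 0.0)
    return mean + a * entropy

def gibbs_measure(F, mu, a):
    """nu_i proportional to mu_i exp(-F_i / a), the minimiser in the formula."""
    logs = [math.log(m) - f / a for f, m in zip(F, mu)]
    top = max(logs)
    w = [math.exp(v - top) for v in logs]
    z = sum(w)
    return [x / z for x in w]

# Hypothetical four-point example.
F = [0.3, 1.2, -0.4, 2.0]
mu = [0.1, 0.2, 0.3, 0.4]
a = 0.5

lhs = free_energy(F, mu, a)
rhs_at_gibbs = penalised_cost(gibbs_measure(F, mu, a), F, mu, a)

rng = random.Random(0)
trial_costs = []
for _ in range(1000):
    raw = [rng.random() for _ in F]
    s = sum(raw)
    trial_costs.append(penalised_cost([r / s for r in raw], F, mu, a))

print(f"-a log E exp(-F/a)        = {lhs:.6f}")
print(f"cost at the Gibbs measure = {rhs_at_gibbs:.6f}")
print(f"best of 1000 random nu    = {min(trial_costs):.6f}")
```

The first two numbers should coincide up to rounding, and the random trials should never fall below them; shrinking `a` makes the entropy penalty cheaper and pushes the minimiser towards the points where `F` is smallest.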

The classical Laplace and saddle-point methods recovered

Varadhan's lemma is the probabilistic lift of Laplace's 1782 asymptotic method ^{[Laplace 1782]}. For a deterministic integral $\int_{R^{d}} e^{n g (x)} d x$ with $g$ attaining a unique non-degenerate maximum at $x_{⋆}$ , Laplace's method gives $\frac{1}{n} \log \int e^{n g} d x \to g (x_{⋆}) = \sup_{x} g$ . Identifying the reference family $μ_{ε} =$ normalised Lebesgue (or any family whose LDP rate function is $I \equiv 0$ on the integration domain), the variational value $\sup_{x} (g - I) = \sup_{x} g$ reproduces the leading-order Laplace exponent; the subexponential Gaussian prefactor $\sqrt{(2 π / n)^{d} / \det (- \nabla^{2} g (x_{⋆}))}$ lives below the LDP scale and is invisible to $a_{ε} \log$ . The complex saddle-point/steepest-descent method is the analytic continuation of the same principle. Varadhan's lemma is thus the statement that the Laplace exponent survives the introduction of a genuine cost: the phase $g$ is replaced by the net $F - I$ .
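A one-dimensional check of both the exponent and the prefactor, offered as a sketch with a test function chosen for this purpose: for $g (x) = x - e^{x}$ the substitution $t = e^{x}$ gives $\int_{R} e^{n g (x)} d x = Γ (n) / n^{n}$ exactly, so no quadrature is needed. Here $x_{⋆} = 0$ , $g (x_{⋆}) = - 1$ and $g'' (x_{⋆}) = - 1$ , and the Laplace approximation $e^{n g (x_{⋆})} \sqrt{2 π / (n |g'' (x_{⋆})|)}$ is Stirling's formula. The function names are illustrative.

```python
import math

def log_integral(n: int) -> float:
    """log of the integral over R of exp(n * g(x)) dx for g(x) = x - exp(x).

    The substitution t = exp(x) turns the integral into Gamma(n) / n^n.
    """
    return math.lgamma(n) - n * math.log(n)

def log_laplace(n: int) -> float:
    """Laplace approximation in d = 1: n*g(x*) + 0.5*log(2*pi / (n*|g''(x*)|)).

    Here x* = 0, g(x*) = -1 and g''(x*) = -1.
    """
    return -n + 0.5 * math.log(2.0 * math.pi / n)

for n in (1, 10, 100, 1000):
    exact, approx = log_integral(n), log_laplace(n)
    print(f"n = {n:<5} (1/n) log integral = {exact / n:+.5f}   "
          f"log(exact / Laplace) = {exact - approx:+.6f}")
```

The first column should tend to $g (x_{⋆}) = - 1$ , which is all the LDP scale sees; the second column shows the prefactor is already accurate up to a correction of order $1/ (12 n)$ .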

The dominated-convergence analogy

Varadhan's lemma is the large-deviation analogue of the dominated convergence theorem, with $a_{ε} \log \int$ replacing $\int$ , $\sup$ replacing the limit, and the moment condition replacing the dominating integrable envelope. The lower bound is a Fatou-type estimate (it survives without domination, paralleling the moment-condition-free lower bound of the Key theorem), while the upper bound needs the moment condition exactly as the dominated convergence upper passage needs an integrable dominator. The semiring $(R \cup \{- \infty\}, \max, +)$ , the "max-plus" or tropical algebra, is the limiting arithmetic: $a_{ε} \log$ sends $(+, \times)$ to $(\max, +)$ , sums become maxima, products become sums, and integration becomes supremum. In this idempotent measure theory the rate function is a "tropical density" and Varadhan's lemma is the change-of-variables/integration identity.
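The degeneration of sums to maxima is easy to see numerically. In this small sketch the list `values` is a hypothetical set of exponents (think of $F - I$ sampled at four points) and `soft_max` is an illustrative name for $a \log \sum_{i} e^{v_{i} / a}$ , which always lies between $\max_{i} v_{i}$ and $\max_{i} v_{i} + a \log k$ for $k$ terms.

```python
import math

def soft_max(values, a: float) -> float:
    """a * log(sum_i exp(v_i / a)): ordinary addition seen through a * log."""
    top = max(values)
    # factor out the largest term so every remaining exponent is <= 0
    return top + a * math.log(sum(math.exp((v - top) / a) for v in values))

values = [0.3, -1.0, 0.62, 0.5]   # hypothetical exponents
for a in (1.0, 0.1, 0.01, 0.001):
    print(f"a = {a:<6} a*log(sum exp(v/a)) = {soft_max(values, a):.5f}   "
          f"max = {max(values):.5f}")
```

This is the largest-term rule used throughout the proofs: the correction $a \log k$ vanishes as $a \to 0$ , leaving only the maximum.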

Synthesis. The central insight is that the LDP and the Laplace principle are equivalent, with Varadhan's lemma supplying one direction and Bryc's inverse the other, and this is exactly the max-plus shadow of the ordinary integral: $a_{ε} \log \int e^{F / a_{ε}} d μ_{ε}$ degenerates to $\sup_{x} (F (x) - I (x))$ , so summation becomes maximisation and the rate function becomes a tropical density. The foundational reason the variational value pairs $F$ against $I$ by subtraction is the Legendre-Fenchel duality of 37.07.03: for linear $F = ⟨ λ, \cdot ⟩$ the value is $I^{*} (λ)$ , and Varadhan's lemma generalises that cumulant-conjugate identity from linear tilts to all continuous gains, while Bryc's inverse is dual to it by recovering $I$ as the conjugate of $Λ$ over $C_{b}$ . Putting these together with the tilted-measure corollary (Exercise 7), the lemma both evaluates exponential integrals and relocates concentration to $\arg\max (F - I)$ , appears again in the Gibbs/Donsker-Varadhan entropy representation 37.07.06 and the Freidlin-Wentzell exit theory, and builds toward the weak-convergence (Dupuis-Ellis) reformulation in which the Laplace principle is the definition. The bridge is the equivalence itself: the rate function of 37.07.01, the conjugate of 37.07.03, and the limiting Laplace functional are one object viewed three ways.

Full proof set Master

Proposition 1 (the moment condition is automatic for $F$ bounded above). If $F : X \to R$ is continuous and bounded above by $M$ , then the Varadhan moment condition holds for every $γ > 1$ , and consequently Varadhan's limit holds for $F$ .

Proof. For any $γ > 0$ , $\int e^{γ F / a_{ε}} d μ_{ε} \leq e^{γ M / a_{ε}} μ_{ε} (X) = e^{γ M / a_{ε}}$ , so $a_{ε} \log \int e^{γ F / a_{ε}} d μ_{ε} \leq γ M < \infty$ for every $ε$ , and the $\limsup$ is $\leq γ M < \infty$ . Choosing any $γ > 1$ satisfies the moment condition, and the Key theorem applies. (Step 2 of the Key theorem is then vacuous: Step 1 already concludes for bounded-above $F$ .) $□$

Proposition 2 (Varadhan's lemma extends the cumulant-conjugate identity). Let $\{μ_{ε}\}$ on $R^{d}$ satisfy the LDP with good rate $I$ . For every $λ \in R^{d}$ such that $F = ⟨ λ, \cdot ⟩$ meets the moment condition, $$ \lim_{\varepsilon\to0} a_\varepsilon\log\int e^{\langle\lambda,x\rangle/a_\varepsilon}\,\mu_\varepsilon(dx) = I^{*}(\lambda), $$ the Legendre-Fenchel conjugate of $I$ . In particular if $I = Λ^{*}$ for a closed convex $Λ$ then the limit is $Λ^{**} = Λ$ .

Proof. Apply the Key theorem to the continuous $F (x) = ⟨ λ, x ⟩$ : the limit equals $sup_{x} (⟨ λ, x ⟩ - I (x))$ , which is by definition $I^{*} (λ)$ 37.07.03. If $I = Λ^{*}$ with $Λ$ closed proper convex, then $I^{*} = (Λ^{*})^{*} = Λ^{**} = Λ$ by the Fenchel-Moreau biconjugation theorem of 37.07.03. Thus the scaled cumulant generating function of $μ_{ε}$ converges to $Λ$ , the converse direction of the Gärtner-Ellis hypothesis. $□$
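For the fair coin of the Beginner example the identity can be checked directly, as a hedged numerical sketch: the rate function there is the conjugate of the single-flip cumulant $Λ (λ) = \log ((1 + e^{λ}) / 2)$ , so a grid evaluation of $\sup_{x} (λ x - I (x))$ should reproduce $Λ (λ)$ . The helper names and the grid size are choices made here.

```python
import math

def rate(x: float) -> float:
    """Fair-coin rate function I(x) = x log(2x) + (1-x) log(2(1-x)), 0 log 0 = 0."""
    def term(t: float) -> float:
        return 0.0 if t == 0.0 else t * math.log(2.0 * t)
    return term(x) + term(1.0 - x)

def conjugate_of_rate(lam: float, grid_size: int = 20_001) -> float:
    """Grid approximation of I*(lam) = sup over x in [0, 1] of (lam * x - I(x))."""
    return max(
        lam * (k / (grid_size - 1)) - rate(k / (grid_size - 1))
        for k in range(grid_size)
    )

def cumulant(lam: float) -> float:
    """Lambda(lam) = log E exp(lam * X) for one fair coin flip X in {0, 1}."""
    return math.log(0.5 * (1.0 + math.exp(lam)))

for lam in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(f"lambda = {lam:+.1f}   I*(lambda) = {conjugate_of_rate(lam):.6f}   "
          f"Lambda(lambda) = {cumulant(lam):.6f}")
```

The row $λ = 1$ is the worked example again: both columns give about $0.6201$ .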

Proposition 3 (uniqueness of the rate function via the Laplace functional). Suppose ${μ_{ε}}$ satisfies the Laplace principle with two good rate functions $I$ and $J$ . Then $I = J$ .

Proof. For every $F \in C_{b} (X)$ both rate functions give the same Laplace limit, so $\sup_{x} (F (x) - I (x)) = \sup_{x} (F (x) - J (x))$ . Fix $x_{0}$ and $δ > 0$ , and write $I_{δ} := \min \{I (x_{0}) - δ, 1/ δ\}$ and $J_{δ} := \min \{J (x_{0}) - δ, 1/ δ\}$ . By lower semicontinuity there is an open neighbourhood $G ∋ x_{0}$ with $\inf_{G} I \geq I_{δ}$ and $\inf_{G} J \geq J_{δ}$ . For each $n$ choose a continuous $F_{n}$ with $- n \leq F_{n} \leq 0$ , $F_{n} (x_{0}) = 0$ and $F_{n} = - n$ off $G$ . Then $- I (x_{0}) \leq \sup_{x} (F_{n} - I) \leq \max \{- I_{δ}, - n\}$ , which equals $- I_{δ}$ once $n \geq 1/ δ$ (the far region is suppressed below the value near $x_{0}$ ), and likewise $- J (x_{0}) \leq \sup_{x} (F_{n} - J) \leq - J_{δ}$ . Equality of the two suprema forces $- I (x_{0}) \leq - J_{δ}$ and $- J (x_{0}) \leq - I_{δ}$ , that is, $J_{δ} \leq I (x_{0})$ and $I_{δ} \leq J (x_{0})$ . Letting $δ ↓ 0$ , $I_{δ} \to I (x_{0})$ and $J_{δ} \to J (x_{0})$ (also when these are infinite), so $J (x_{0}) \leq I (x_{0})$ and $I (x_{0}) \leq J (x_{0})$ . As $x_{0}$ was arbitrary, $I = J$ . This is the integral-side counterpart of the LDP uniqueness theorem of 37.07.01, now driven by separating points with bounded continuous functions. $□$

Connections Master

Varadhan's integral lemma promotes the large deviation principle of 37.07.01 from a statement about probabilities of sets to a statement about exponential integrals: under that unit's good-rate-function LDP and a moment condition, the asymptotics of $\int e^{F / a_{ε}} d μ_{ε}$ are governed by $sup_{x} (F - I)$ , with the continuity-only lower bound and the goodness-plus-moment upper bound transporting that unit's open/closed asymmetry to integrals.
The variational value $sup_{x} (F (x) - I (x))$ is the Legendre-Fenchel pairing of 37.07.03: for a linear gain $F = ⟨ λ, \cdot ⟩$ it is exactly the conjugate $I^{*} (λ)$ , so Varadhan's lemma generalises the cumulant-generating-function-to-rate-function duality from linear exponential tilts to arbitrary continuous test functions, and Bryc's inverse recovers $I$ as the conjugate of the limiting Laplace functional over $C_{b}$ .
The Dupuis-Ellis variational representation expresses the prelimit Laplace functional through relative entropy, linking this unit to the Donsker-Varadhan formula and entropic rate function of 37.07.06: the limit of $\inf_{ν} (E_{ν} F + a_{ε} H (ν \,\|\, μ_{ε}))$ is $\inf_{x} (F + I)$ , the Laplace principle read through stochastic control.

Historical & philosophical context Master

The asymptotic evaluation of integrals dominated by their largest integrand value is due to Pierre-Simon Laplace, who in 1782 ^{[Laplace 1782]} developed the method of approximating integrals of the form $\int e^{n f (x)} d x$ for large $n$ by expansion about the maximum of $f$ , in the course of his work on probability and celestial mechanics; the complex-variable refinement is the method of steepest descent associated with Riemann and Debye. The probabilistic generalisation — that the same concentration governs exponential integrals against a family of measures obeying a large deviation principle, with the rate function entering as a competing cost — was proved by S. R. S. Varadhan in 1966 ^{[Varadhan 1966]} alongside his abstract formulation of the LDP, and systematised in his 1984 lectures ^{[Varadhan 1984]}.

The inverse direction, recovering the LDP from the convergence of exponential integrals of bounded continuous functions, was established by Włodzimierz Bryc ^{[Bryc 1990]} using exponential tightness, and the equivalence was elevated to a definitional standpoint by Paul Dupuis and Richard Ellis ^{[Dupuis & Ellis 1997]}, who built large-deviation theory on the Laplace principle and a weak-convergence analysis of entropy-penalised control representations. The standard reference treatment is Dembo and Zeitouni ^{[Dembo & Zeitouni §4.3]}. The max-plus reading, in which $a_{ε} \log$ degenerates $(+, \times)$ integration to $(\max, +)$ optimisation, connects the lemma to idempotent analysis as developed by Maslov and collaborators, where rate functions are densities for an idempotent measure theory.

Bibliography Master

@article{varadhan1966asymptotic,
  author  = {Varadhan, S. R. S.},
  title   = {Asymptotic probabilities and differential equations},
  journal = {Communications on Pure and Applied Mathematics},
  volume  = {19},
  pages   = {261--286},
  year    = {1966}
}

@book{varadhan1984large,
  author    = {Varadhan, S. R. S.},
  title     = {Large Deviations and Applications},
  series    = {CBMS-NSF Regional Conference Series in Applied Mathematics},
  number    = {46},
  publisher = {SIAM},
  year      = {1984}
}

@incollection{bryc1990large,
  author    = {Bryc, W{\l}odzimierz},
  title     = {On the large deviation principle by the asymptotic value method},
  booktitle = {Diffusion Processes and Related Problems in Analysis, Volume I},
  series    = {Progress in Probability},
  number    = {22},
  publisher = {Birkh\"auser},
  pages     = {447--472},
  year      = {1990}
}

@book{dupuisellis1997weak,
  author    = {Dupuis, Paul and Ellis, Richard S.},
  title     = {A Weak Convergence Approach to the Theory of Large Deviations},
  series    = {Wiley Series in Probability and Statistics},
  publisher = {Wiley},
  year      = {1997}
}

@book{dembozeitouni1998ldp,
  author    = {Dembo, Amir and Zeitouni, Ofer},
  title     = {Large Deviations Techniques and Applications},
  edition   = {2nd},
  series    = {Applications of Mathematics},
  number    = {38},
  publisher = {Springer},
  year      = {1998}
}

@book{denhollander2000large,
  author    = {den Hollander, Frank},
  title     = {Large Deviations},
  series    = {Fields Institute Monographs},
  number    = {14},
  publisher = {American Mathematical Society},
  year      = {2000}
}

@incollection{laplace1782memoire,
  author    = {Laplace, Pierre-Simon},
  title     = {M\'emoire sur les approximations des formules qui sont fonctions de tr\`es-grands nombres},
  booktitle = {M\'emoires de l'Acad\'emie Royale des Sciences de Paris},
  year      = {1782}
}

Prerequisites

37.07.01
37.07.03

Tier anchors

beginner: Touchette 2009 *The large deviation approach to statistical mechanics* (Physics Reports 478) §3.3 (the Laplace/saddle-point heuristic); Dembo & Zeitouni 1998 *Large Deviations Techniques and Applications* 2nd ed. (Springer) §4.3 (informal statement of Varadhan's lemma)
intermediate: Dembo & Zeitouni 1998 *Large Deviations Techniques and Applications* 2nd ed. (Springer) §4.3 (Theorem 4.3.1, Varadhan's lemma; Lemmas 4.3.4 and 4.3.6) and §4.4 (Theorem 4.4.2, Bryc's inverse); den Hollander 2000 *Large Deviations* (AMS Fields Institute Monographs) §III.3
master: Dembo & Zeitouni 1998 *Large Deviations Techniques and Applications* 2nd ed. (Springer) §4.3-§4.4 (Varadhan, the Laplace principle, Bryc's theorem, the moment condition 4.3.2); Dupuis & Ellis 1997 *A Weak Convergence Approach to the Theory of Large Deviations* (Wiley) Ch. 1, §1.2 (the Laplace principle as primitive); Varadhan 1966 *Asymptotic probabilities and differential equations* (CPAM 19); Varadhan 1984 *Large Deviations and Applications* (SIAM CBMS-NSF 46) §2-§3

References

Dembo, A. & Zeitouni, O. — Large Deviations Techniques and Applications, 2nd ed. (Springer, 1998) · §4.3 (Theorem 4.3.1 Varadhan's lemma; moment condition (4.3.2); Lemma 4.3.4, Lemma 4.3.6); §4.4 (Theorem 4.4.2 Bryc's inverse; Laplace principle); §1.2
Varadhan, S. R. S. — Asymptotic probabilities and differential equations · Communications on Pure and Applied Mathematics 19 (1966), 261-286; the integral lemma in its original form
Varadhan, S. R. S. — Large Deviations and Applications (SIAM CBMS-NSF Regional Conference Series 46, 1984) · §2-§3; the abstract LDP and the integral lemma
Dupuis, P. & Ellis, R. S. — A Weak Convergence Approach to the Theory of Large Deviations (Wiley, 1997) · Ch. 1, §1.2 (the Laplace principle taken as the primitive notion; equivalence with the LDP)
Bryc, W. — On the large deviation principle by the asymptotic value method · in Diffusion Processes and Related Problems in Analysis, vol. I (Birkhäuser, 1990), 447-472; the inverse Varadhan lemma
den Hollander, F. — Large Deviations (AMS Fields Institute Monographs 14, 2000) · §III.3 (Varadhan's lemma and the tilted-measure interpretation)
Laplace, P. S. — Mémoire sur les approximations des formules qui sont fonctions de très-grands nombres · Mémoires de l'Académie Royale des Sciences de Paris (1782); the asymptotic method for integrals e^{n f}

Estimated time

beginner: 17m
intermediate: 42m
master: 76m