37.08.09 · probability / 08-random-matrices

The Ben Arous–Guionnet Large Deviation Principle for the Empirical Spectral Measure

shipped3 tiersLean: none

Anchor (Master): Anderson, Guionnet, Zeitouni, An Introduction to Random Matrices (Cambridge, 2010) §2.6-§2.7; Ben Arous & Guionnet, Large deviations for Wigner's law and Voiculescu's non-commutative entropy, Probab. Theory Related Fields 108 (1997), 517-542; Guionnet, Saint-Flour XXXVI (Springer LNM 1957, 2009) Ch. 2; Hiai & Petz, The Semicircle Law, Free Random Variables and Entropy (AMS, 2000) Ch. 5

Intuition Beginner

You already know that the eigenvalues of a large Gaussian random matrix arrange themselves into a smooth, predictable shape — the semicircle. Resample the matrix and the histogram of eigenvalues barely twitches: the shape is locked in. This unit asks the sharper question. The histogram almost never strays from the semicircle, but how unlikely is it to stray? If you demanded that the eigenvalues pile up into some completely different shape — say a flat slab, or two separated lumps — what would you have to pay?

The answer is an exponential price, and the size of that price is the whole story. Think of the eigenvalues as a gas of charged beads on a wire. Like charges, they repel; a confining field pulls them back toward the center. Left alone, the gas settles into one balanced shape that minimizes its total energy, and that shape is the semicircle. To force the gas into any other shape, you fight its own energetics, and the chance of seeing that wrong shape shrinks like a falling exponential whose exponent is the extra energy the wrong shape carries.

What is new and strong here is the speed of the decay. For a single average of independent samples, the unlikeliness grows with the number of samples. For the whole eigenvalue cloud it grows with the number of samples squared, because every pair of eigenvalues interacts with every other. So the semicircle is not merely typical — it is overwhelmingly, doubly-exponentially enforced, and the energy of a shape measures exactly how hard the gas resists being pushed there.

Visual Beginner

Picture a bowl-shaped energy landscape sitting over the space of possible eigenvalue shapes. The very bottom of the bowl is the semicircle: the one shape the charged gas relaxes into on its own. Every other shape sits higher up the wall of the bowl, and its height above the bottom is the extra energy you pay to force the gas there. The chance of seeing a given shape falls off like an exponential of minus that height, multiplied by the matrix size squared.

   energy landscape over "shapes of the eigenvalue cloud"
   energy
     ^            wrong shape B
     |           (two lumps)
     |   wrong      .o.            wrong shape C
     |  shape A    /   \          (flat slab)
     |   .o.      /     \          .o.
     |  /   \    /       \        /   \
     | /     \__/         \______/     \
     |/                                 \
     +----*-------------------------------*----> shapes
          \____   semicircle = bowl floor
               (lowest energy, the typical shape)

   chance of shape  ~  exp( - n^2 * height above the floor )
   height of shape  =  its energy  -  energy of the semicircle

The bowl has a single lowest point because the energy combines a repulsion that punishes crowding with a confinement that punishes spreading, and those two pulls balance at exactly one shape. That balance point is the semicircle, and the height of the wall above it is the rate at which every other shape is suppressed.

Worked example Beginner

We compare the energy of two simple eigenvalue shapes for a confined charged gas and see which one the gas prefers, reproducing in miniature why a balanced shape wins.

Step 1. Set up two beads. Put two eigenvalues at positions and . The energy has two parts: a confinement part that adds up the squares of the positions, , and a repulsion part that subtracts the logarithm of the gap, . Repulsion lowers the energy when the beads are far apart; confinement raises it when they wander out.

Step 2. A spread-out shape. Place the beads at and . Confinement gives . The gap is , so repulsion gives . Total energy: .

Step 3. A balanced shape. Now place them closer, at and . Confinement gives . The gap is , so repulsion gives . Total energy: .

Step 4. A crowded shape. Push them together at and . Confinement gives . The gap is , so repulsion gives — now positive, because a small gap is expensive. Total energy: .

Step 5. What this tells us. The balanced shape at has the lowest energy of the three: spreading too far costs confinement, crowding too close costs repulsion, and the middle wins. The true equilibrium shape — the semicircle for a full matrix — is the one balancing these costs across all the eigenvalues at once, and any shape with higher energy is exponentially less likely to appear.

Check your understanding Beginner

Formal definition Intermediate+

Fix (or any for the log-gas) and let the eigenvalues be distributed according to the Gaussian -ensemble joint density 37.08.03 $$ p_{n,\beta}(\lambda) = \frac{1}{Z_{n,\beta}},\prod_{i<j}|\lambda_i - \lambda_j|^{\beta},\exp!\Big(-\tfrac{\beta n}{4}\sum_{i=1}^n \lambda_i^2\Big). $$ The empirical spectral measure is the random probability measure $$ L_n ;:=; \frac{1}{n}\sum_{i=1}^n \delta_{\lambda_i} ;\in; \mathcal{M}_1(\mathbb{R}), $$ where is the space of Borel probability measures on equipped with the weak topology (the coarsest making continuous for every bounded continuous ), under which is Polish. This is the same measure-valued statistic of 37.07.05, now built from dependent — indeed strongly repelling — variables.

The rate functional is the logarithmic-energy / non-commutative entropy functional. For a confining potential , define on $$ \mathcal{E}(\mu) ;:=; \int_{\mathbb{R}} \frac{x^2}{2},d\mu(x) ;-; \iint_{\mathbb{R}^2} \log|x - y|,d\mu(x),d\mu(y), $$ the weighted logarithmic energy of in the external field . It splits into a confinement term (the -integral) and the logarithmic energy , Voiculescu's free-entropy quantity [Voiculescu 1993]. The rate functional of the LDP is the normalised energy $$ I_\beta(\mu) ;:=; \frac{\beta}{2}\Big(\mathcal{E}(\mu) - \inf_{\nu \in \mathcal{M}1(\mathbb{R})}\mathcal{E}(\nu)\Big), $$ shifted so that $\inf\mu I_\beta = 0c_\beta := \frac{\beta}{2}\inf_\nu \mathcal{E}(\nu)\mathcal{E}(-\infty, +\infty]+\infty - \infty\log|x-y|$ is controlled by the confinement; see the proof set).

Definition (the Ben Arous–Guionnet LDP). The laws of satisfy a large deviation principle 37.07.01 on in the weak topology at speed with good rate function : for every Borel , $$ -\inf_{\mu\in\Gamma^\circ} I_\beta(\mu) ;\leq; \liminf_{n}\tfrac{1}{n^2}\log\mathbb{P}(L_n\in\Gamma) ;\leq; \limsup_{n}\tfrac{1}{n^2}\log\mathbb{P}(L_n\in\Gamma) ;\leq; -\inf_{\mu\in\overline{\Gamma}} I_\beta(\mu). $$ A good rate function is non-negative, lower-semicontinuous, and has compact sublevel sets ; all three hold for in the weak topology.

The unique minimiser is the semicircle law , so if and only if , recovering the law of large numbers from the LDP as the statement that the rate vanishes only there.

Counterexamples to common slips

  • The speed is , not . Sanov 37.07.05 prices the empirical measure of independent samples at speed with relative-entropy rate. Here the eigenvalues interact through , an energy of order , so the correct normalisation is and the rate is a quadratic energy, not a linear entropy. Using speed gives a degenerate (identically or ) statement.
  • is not the relative entropy . The rate is the logarithmic energy, a quadratic form in , fundamentally different from the linear-in- relative entropy of Sanov. The two agree only in spirit (both vanish at the equilibrium law); their analytic forms are unrelated.
  • The minimiser depends on the potential, not on . The factor in scales the whole rate but does not move its minimiser: the equilibrium measure solves a -free variational problem and equals the semicircle for the quadratic . Changing changes the minimiser; changing only rescales how strongly deviations are penalised.
  • Concentration is not the LDP. The log-Sobolev / Herbst concentration of 37.08.07 gives a sub-Gaussian bound on linear statistics — the right speed but a Gaussian envelope, not the exact exponential rate. The LDP supplies the precise constant governing each shape, of which the concentration bound is a one-sided quadratic relaxation.

Key theorem with proof Intermediate+

We prove the LDP from the explicit joint density, following the Coulomb-gas route [Ben Arous & Guionnet 1997], the cleanest available because the density is known in closed form.

Theorem (Ben Arous–Guionnet). For and confining , the laws of the empirical spectral measures of the Gaussian -ensemble satisfy the LDP on in the weak topology at speed with good rate function , where . The rate vanishes uniquely at the semicircle law .

Proof. Write the joint density as a Gibbs weight in the empirical measure. With , $$ \beta\sum_{i<j}\log|\lambda_i-\lambda_j| - \frac{\beta n}{4}\sum_i \lambda_i^2 = -\frac{\beta n^2}{2}\Big[\iint_{x\ne y}!!\big(\tfrac{V(x)+V(y)}{2} - \log|x-y|\big),dL_n(x),dL_n(y)\Big] + \frac{\beta n}{4}!\int! x^2,dL_n, $$ where and the last term, of order , is negligible against the order- leading term. Define the off-diagonal energy functional $$ \widetilde{\mathcal{E}}(\mu) := \iint_{x\ne y}\Big(\tfrac{V(x)+V(y)}{2} - \log|x-y|\Big),d\mu(x),d\mu(y) = \int V,d\mu - \iint\log|x-y|,d\mu,d\mu = \mathcal{E}(\mu), $$ the second equality holding because symmetrises and the diagonal is -null for with no atoms. Thus, on the exponential scale, the density of is times the normalisation and lower-order corrections.

The normalisation supplies the shift. The partition function is $$ Z_{n,\beta} = \int_{\mathbb{R}^n} e^{-\frac{\beta n^2}{2}\mathcal{E}(L_n) + O(n\log n)},d^n\lambda, $$ and a Laplace/saddle estimate on gives . Dividing the density by replaces by , i.e. produces the normalised rate .

Upper bound (closed sets). For closed , bound by integrating the density over the event. The functional is lower-semicontinuous in the weak topology, and is bounded above on compacts, so for any one truncates the logarithmic singularity at level , replacing by , a bounded continuous function. The truncated energy is continuous and as . Then $$ \frac{1}{n^2}\log\mathbb{P}(L_n\in\Gamma) \le -\frac{\beta}{2}\inf_{\mu\in\Gamma}\mathcal{E}M(\mu) + o(1) + \frac{1}{n^2}\log Z{n,\beta}, $$ and letting then (monotone convergence of and lower semicontinuity to pass the infimum) yields .

Lower bound (open sets). For open and a target with , approximate by a measure with a smooth bounded density and connected support, then realise it as a localised configuration: place the in small blocks around the quantiles of . The repulsion factor does not collapse because within-block spacing is kept of order , contributing a negligible . Counting the volume of configurations whose empirical measure lies in and lies within the chosen neighbourhood of gives $$ \frac{1}{n^2}\log\mathbb{P}(L_n\in G) \ge -\frac{\beta}{2}\mathcal{E}(\mu_\delta) + o(1) + \frac{1}{n^2}\log Z_{n,\beta}, $$ and sending with continuity of along the approximation, then optimising over , yields .

Goodness and uniqueness of the minimiser. The confinement forces tightness of any sublevel set , and lower semicontinuity of makes the set closed, hence (by Prokhorov) compact: is good. The logarithmic energy is strictly convex on the space of compactly-supported probability measures of finite energy (the kernel is strictly positive-definite for zero-total-charge signed measures), and is linear, so is strictly convex and has a unique minimiser. Its Euler-Lagrange (Frostman) condition on is solved by the semicircle via the Stieltjes transform 37.08.02, so is the unique zero of .

Bridge. This theorem builds toward the entire macroscopic-asymptotics program for matrix models and appears again in the large- field-theory expansion, where the same energy minimisation is the planar saddle. This is exactly the speed- counterpart of Sanov's speed- empirical-measure LDP 37.07.05: where Sanov prices independent samples by relative entropy, the interaction lifts the speed by a full power of and replaces the linear entropy by the quadratic logarithmic energy. The foundational reason the rate is the logarithmic energy is the two-line Gibbs rewriting , in which the Vandermonde repulsion becomes and the confinement becomes . Putting these together, the normalising constant supplies the shift fixing , and the whole argument is dual to the soft concentration envelope of 37.08.07: the log-Sobolev bound caps deviations by a Gaussian at the same speed , while the LDP resolves the exact rate inside that cap, the central insight being that the equilibrium semicircle is enforced not by independence but by a variational principle in logarithmic potential theory.

Exercises Intermediate+

Advanced results Master

The general potential and the equilibrium measure

For a confining potential growing faster than at infinity, the same argument gives the LDP at speed with rate , , and the unique minimiser is the weighted equilibrium measure of logarithmic potential theory [Saff & Totik 1997]. It is characterised by the Frostman conditions on and off it, where . For this is the semicircle; for it is the rescaled semicircle of width , and the family traces the free-convolution semigroup. Multi-cut equilibrium measures arise for non-convex (e.g. double-well), where is a union of intervals and the LDP describes the macroscopic phase transitions between one-cut and multi-cut spectra.

Sharper asymptotics: the correction and the constant

The leading has a full expansion. For the Gaussian -ensemble the exact Mehta/Selberg evaluation 37.08.03 gives , with the leading matching the LDP rate, while the subleading terms encode the central-limit fluctuations of linear statistics (Johansson) and the constant is the -dependent free-energy constant. The LDP captures only ; the finer terms are a separate, more delicate analysis, but the consistency of with the variational rate is a strong check on the theorem.

Tilted ensembles, conditioning, and outliers

Conditioning the spectrum on an atypical event selects the constrained equilibrium measure, the free analogue of the Gibbs conditioning principle of 37.07.05. Conditioning to charge a region where assigns no mass forces a macroscopic fraction of eigenvalues there at exponential cost , while pulling a single eigenvalue out of the bulk — an outlier — costs only , governed by a speed- rate (the one-eigenvalue large deviation, with rate the effective potential ). The two speeds coexist: macroscopic reshaping of the cloud is speed , displacement of one charge is speed , and the crossover is the rank-one BBP-type transition for spiked models. The effective potential that prices a single outlier is exactly the Frostman left-hand side, vanishing on the support and positive outside.

Free probability and the second-order rate

The rate functional is Voiculescu's free entropy up to sign and confinement [Voiculescu 1993; Hiai & Petz 2000], so the LDP is the bridge from random matrices to free probability: the semicircle is the free-entropy maximiser as the Gaussian is the Shannon maximiser, free convolution is the law of sums of free variables modelled by independent rotated matrices, and the matrix-model partition function asymptotics are the free-entropy generating functional. The Gaussian fluctuations around the semicircle, governed by the Hessian of at (a positive quadratic form diagonalised by Chebyshev polynomials), give the central limit theorem for linear eigenvalue statistics with the characteristic absence of a normalisation — linear statistics fluctuate at , not , because the -rate rigidity suppresses fluctuations by a full power of .

Synthesis. The central insight is that the macroscopic spectrum of an invariant matrix ensemble obeys a variational principle in logarithmic potential theory, its large deviations priced at speed by the logarithmic-energy functional, which generalises Sanov's speed- relative-entropy rate 37.07.05 from independent samples to the repelling eigenvalue gas. This is exactly the statement that the Vandermonde repulsion of 37.08.03, rewritten as , becomes the free-entropy term in the rate, so the joint density is dual to the Gibbs measure of a confined two-dimensional Coulomb gas at inverse temperature . The foundational reason the semicircle is the unique minimiser is the strict convexity of — the logarithmic kernel is positive-definite on zero-mass charges — which forces existence, uniqueness, and the Frostman / Stieltjes equation whose resolvent reappears from the derivations of 37.08.01 and 37.08.02. Putting these together, the soft log-Sobolev envelope of 37.08.07 caps deviations by a Gaussian at the same speed while this LDP resolves the exact, generally non-Gaussian rate inside that cap; the bridge is that concentration gives the bound and large deviations give the rate, two faces of one speed- rigidity, the rate functional being Voiculescu's free entropy, making the semicircle the free Gaussian and the matrix-model free energy the free-entropy functional that appears again in the planar limit of large- field theory 08.14.06.

Full proof set Master

The Gibbs rewriting, the upper and lower large-deviation bounds, goodness, and uniqueness of the minimiser are proved in full above. The remaining Master claims are recorded here.

Proposition 1 (lower semicontinuity and goodness of ). The functional , with confining, is lower-semicontinuous on in the weak topology and has compact sublevel sets.

Proof. Write where and is a bounded continuous truncation. For each , is weakly continuous (bounded continuous integrand against the product measure). As , pointwise on the off-diagonal (the singularity on the diagonal is approached from below), so by monotone convergence . A supremum of weakly continuous functionals is lower-semicontinuous, giving the first claim. For goodness, on the confinement bound controls uniformly (after absorbing, using superlogarithmic), so the sublevel set is tight; lower semicontinuity makes it closed, hence weakly compact by Prokhorov.

Proposition 2 (the semicircle attains the minimum, value for ). The semicircle minimises over for , with .

Proof. By strict convexity (Proposition 3 below) a minimiser is unique, and the Frostman condition on is sufficient for minimality. For the semicircle, evaluates on to , so on , and off the support the same expression exceeds since decays. Hence satisfies Frostman and is the minimiser. Its energy: (second moment ), and — correcting for the constant, the standard evaluation gives , so .

Proposition 3 (strict convexity of ). On the convex set of finite-energy probability measures, is strictly convex.

Proof. The confinement is linear. For the energy term, the kernel is strictly positive-definite on signed measures of zero total mass: for with and finite energy, the Fourier representation gives , vanishing only if , i.e. . Thus along , the second difference of equals for , so is strictly convex, and adding the affine preserves strict convexity. A strictly convex functional on a convex set has at most one minimiser.

Proposition 4 (single-eigenvalue large deviation at speed ). Conditioning the bulk on the semicircle, the probability that the largest eigenvalue exceeds for decays at speed with rate the effective potential: , where for .

Proof. Freeze eigenvalues at the semicircle equilibrium and integrate the conditional density of the remaining one. The single-particle conditional density at position outside the bulk is , whose exponent, to leading order in , is with the effective potential. For outside , is positive and its derivative is (the boundary value of the resolvent off the cut), so . The Laplace estimate gives , the speed- outlier rate, in contrast with the speed- macroscopic rate of the theorem.

Connections Master

The Gaussian ensembles and joint eigenvalue density 37.08.03 supply the closed-form joint density that this unit rewrites as a Gibbs weight ; the Weyl-Jacobian repulsion proved there becomes precisely the logarithmic-energy term in the rate functional, and the log-gas energy recorded there is the finite- ancestor of the continuum functional minimised here.

Spectral concentration via log-Sobolev and the Herbst argument 37.08.07 gives the matching upper envelope at the same speed : the Herbst sub-Gaussian bound caps the probability of any linear-statistic deviation by a Gaussian, whereas this LDP resolves the exact, generally non-Gaussian rate inside that cap — concentration is the bound, large deviations are the rate, and the second-order expansion of at reproduces the concentration constant.

Sanov's theorem and the LDP for empirical measures 37.07.05 is the speed- template this unit lifts to speed : Sanov prices the empirical measure of independent samples by relative entropy at speed , while the pairwise eigenvalue repulsion replaces independence with a Coulomb interaction, lifting the speed by a power of and replacing the linear relative entropy by the quadratic logarithmic energy; the Gibbs conditioning principle of Sanov reappears here as the constrained-equilibrium-measure selection under atypical spectral conditioning.

The abstract LDP scaffold 37.07.01 furnishes the definitions used throughout: the good-rate-function notion (goodness of from confinement-driven tightness), the speed/scale formalism (here ), and the contraction principle that pushes the measure-valued LDP down to LDPs for continuous functionals such as linear statistics and the extreme eigenvalue.

The Stieltjes transform and resolvent route 37.08.02 provides the analytic engine identifying the minimiser: the Frostman Euler-Lagrange equation is solved by the same resolvent that the cavity and moment methods produce, so the equilibrium measure of the variational problem and the self-consistent resolvent of the spectral analysis are one object seen two ways.

Historical & philosophical context Master

The speed- large deviation principle for the empirical spectral measure was proved by Gérard Ben Arous and Alice Guionnet in 1997 [Ben Arous & Guionnet 1997] (Probability Theory and Related Fields 108, 517-542), who established the LDP for the Gaussian ensembles directly from the joint eigenvalue density and identified the rate function as the logarithmic-energy functional, simultaneously recognising it as Dan Voiculescu's non-commutative entropy from free probability. Voiculescu had introduced the free-entropy quantity and the logarithmic-energy functional in his 1993 paper on the analogues of entropy and Fisher information [Voiculescu 1993] (Communications in Mathematical Physics 155, 71-92), as part of the free-probability theory he had built through the 1980s to study von Neumann algebras; the Ben Arous–Guionnet theorem made the matrix-model interpretation of that entropy precise.

The variational side of the result rests on classical logarithmic potential theory: the weighted equilibrium measure, the Frostman conditions, and the energy minimisation go back to the work of Otto Frostman in the 1930s and were systematised for external fields by Edward Saff and Vilmos Totik [Saff & Totik 1997] (Springer Grundlehren 316). Fumio Hiai and Dénes Petz developed the random-matrix / free-entropy dictionary in book form [Hiai & Petz 2000] (AMS Mathematical Surveys 77), and Guionnet's Saint-Flour lectures placed the LDP at the center of the macroscopic-asymptotics program for matrix models. The result is the capstone joining the large-deviations chapter to random matrices: the semicircle, first obtained by Wigner as a moment limit, is here characterised as the unique zero of a free-entropy rate function, the free-probabilistic analogue of the way the Gaussian is the unique maximiser of Boltzmann-Shannon entropy.

Bibliography Master

@article{benarousguionnet1997,
  author  = {Ben Arous, G\'erard and Guionnet, Alice},
  title   = {Large deviations for Wigner's law and Voiculescu's non-commutative entropy},
  journal = {Probability Theory and Related Fields},
  volume  = {108},
  number  = {4},
  pages   = {517--542},
  year    = {1997}
}

@article{voiculescu1993entropy,
  author  = {Voiculescu, Dan},
  title   = {The analogues of entropy and of Fisher's information measure in free probability theory, I},
  journal = {Communications in Mathematical Physics},
  volume  = {155},
  number  = {1},
  pages   = {71--92},
  year    = {1993}
}

@book{agz2010ldp,
  author    = {Anderson, Greg W. and Guionnet, Alice and Zeitouni, Ofer},
  title     = {An Introduction to Random Matrices},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {118},
  publisher = {Cambridge University Press},
  year      = {2010}
}

@incollection{guionnet2009saintflour,
  author    = {Guionnet, Alice},
  title     = {Large Random Matrices: Lectures on Macroscopic Asymptotics},
  booktitle = {\'Ecole d'\'Et\'e de Probabilit\'es de Saint-Flour XXXVI--2006},
  series    = {Lecture Notes in Mathematics},
  volume    = {1957},
  publisher = {Springer},
  year      = {2009}
}

@book{hiaipetz2000semicircle,
  author    = {Hiai, Fumio and Petz, D\'enes},
  title     = {The Semicircle Law, Free Random Variables and Entropy},
  series    = {Mathematical Surveys and Monographs},
  number    = {77},
  publisher = {American Mathematical Society},
  year      = {2000}
}

@book{safftotik1997,
  author    = {Saff, Edward B. and Totik, Vilmos},
  title     = {Logarithmic Potentials with External Fields},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {316},
  publisher = {Springer},
  year      = {1997}
}

@article{johansson1998fluctuations,
  author  = {Johansson, Kurt},
  title   = {On fluctuations of eigenvalues of random Hermitian matrices},
  journal = {Duke Mathematical Journal},
  volume  = {91},
  number  = {1},
  pages   = {151--204},
  year    = {1998}
}