37.08.04 · probability / 08-random-matrices

Determinantal Point Processes and Sine-Kernel Bulk Universality

shipped3 tiersLean: none

Anchor (Master): Anderson-Guionnet-Zeitouni, An Introduction to Random Matrices (Cambridge, 2010) Ch. 3-4; Mehta, Random Matrices 3e (Elsevier, 2004) Ch. 5-7; Deift, Orthogonal Polynomials and Random Matrices: A Riemann-Hilbert Approach (AMS, 1999); Tracy-Widom, Commun. Math. Phys. 159 (1994); Soshnikov, Russian Math. Surveys 55 (2000) (determinantal random point fields)

Intuition Beginner

The eigenvalues of a random matrix are not scattered like raindrops. Raindrops land independently: one drop tells you nothing about where the next will fall, and two can land arbitrarily close together. Eigenvalues behave oppositely. They push apart. Find one eigenvalue and you have learned something — the spots right next to it are nearly empty, because a second eigenvalue strongly avoids sitting on top of the first. This mutual avoidance is called level repulsion, and it is the single most important fact about how eigenvalues are spaced.

There is a clean bookkeeping device that captures all of this avoidance at once. To each pair of locations we attach one number, and the chance of finding eigenvalues at any list of spots is computed as a single determinant of the little table of those paired numbers. Determinants are built to vanish when two rows agree, so the moment two requested spots coincide the answer drops to zero — the repulsion is wired directly into the arithmetic. A cloud of points whose spacing statistics all come from one such table is called a determinantal process.

The surprise is that when you zoom far into the dense middle of the spectrum, rescaling so neighbouring eigenvalues sit about one unit apart, every reasonable random matrix produces the same paired-number table in the limit. That universal limiting table is the sine pattern, and it governs the fine-grained spacing of eigenvalues no matter what random rule built the matrix.

Visual Beginner

Picture two rows of dots on a line. The top row is a batch of independent raindrop points: they clump, leave big gaps, and sometimes two nearly touch. The bottom row is a batch of eigenvalues: they are eerily evenly spread, each keeping a polite distance from its neighbours, with no near-collisions and no large empty stretches. The eye can tell them apart instantly — the eigenvalue row looks "combed."

The little curve on the right is the punchline. It records the chance of finding a second eigenvalue a distance away from a chosen one. At zero distance the chance is zero — that is the repulsion. As the distance grows the chance climbs, gently overshoots, and levels off at the average rate. For the raindrops this curve would be flat at one everywhere, because independent points never notice each other.

Worked example Beginner

We compute the repulsion curve from the universal sine table in the simplest way, to see the zero-at-coincidence built in.

Step 1. The universal table assigns to a pair of spots separated by a distance the number . When the two spots are the same, , this expression is one (the value of as shrinks to zero). When the spots differ, it is the familiar wavy function that crosses zero at every whole-number distance.

Step 2. The chance of finding eigenvalues at two spots, divided by the chance for two independent points, is one minus the square of that paired number. Call the paired number ; the two-point answer is .

Step 3. Put the two spots on top of each other, . Then , and the answer is . The chance of two eigenvalues at the very same place is zero. That is level repulsion, stated as a single subtraction.

Step 4. Now separate the spots by one unit, . Then , so and the answer is . At distance one — the average spacing — the points behave as if independent again.

Step 5. What this tells us: the one paired number controls the whole spacing story. It equals one at zero distance, forcing the two-point chance down to zero (points cannot collide); and it dies away at larger distances, letting far-apart eigenvalues ignore each other. The single formula is the repulsion curve drawn in the visual.

Check your understanding Beginner

Formal definition Intermediate+

Let be a locally compact Polish space with a reference (Lebesgue-type) measure . A simple point process on is a random locally finite configuration of points; its -point correlation function is defined by requiring, for disjoint Borel sets , $$ \mathbb{E}\Big[\prod_{i=1}^k N(A_i)\Big] = \int_{A_1\times\cdots\times A_k} \rho_k(x_1,\dots,x_k), d\lambda(x_1)\cdots d\lambda(x_k), $$ where counts points in . Informally is the probability of finding a point in each infinitesimal neighbourhood of .

A simple point process is determinantal with correlation kernel if for every and all , $$ \rho_k(x_1,\dots,x_k) = \det\big[K(x_i, x_j)\big]_{i,j=1}^{k}. $$ The kernel is assumed to define a locally trace-class, locally self-adjoint integral operator (so the determinants are nonnegative and the are genuine correlation functions). The defining identity makes repulsion automatic: \rho_2(x,x) = \det\begin{psmallmatrix} K(x,x) & K(x,x)\\ K(x,x) & K(x,x)\end{psmallmatrix} = 0, so no two points coincide.

The GUE eigenvalue process is the central example. The Gaussian Unitary Ensemble of Hermitian matrices has joint eigenvalue density $$ p_n(\lambda_1,\dots,\lambda_n) = \frac{1}{Z_n}\prod_{i<j}(\lambda_i - \lambda_j)^2 \exp\Big(-\tfrac12\sum_{i=1}^n \lambda_i^2\Big), $$ the log-gas developed in 37.08.03. Let be the Hermite polynomials orthonormal for the weight , and set the Hermite functions , so . The Hermite kernel is $$ K_n(x,y) = \sum_{k=0}^{n-1} \phi_k(x),\phi_k(y), $$ the kernel of the orthogonal projection onto . The eigenvalue process of the GUE is determinantal on with kernel .

The Christoffel-Darboux formula collapses the sum into two terms: $$ K_n(x,y) = \sum_{k=0}^{n-1}\phi_k(x)\phi_k(y) = \sqrt{n},\frac{\phi_n(x)\phi_{n-1}(y) - \phi_{n-1}(x)\phi_n(y)}{x - y}, $$ with the diagonal value by l'Hôpital. The constant is the ratio of leading coefficients of the orthonormal Hermite polynomials in this normalisation.

The sine kernel is the bulk scaling limit. Fix a point in the open interval where the semicircle density 37.08.01 is positive, let be that density, and rescale so the mean spacing is one: , . Then $$ \frac{1}{n\rho_{\mathrm{sc}}(E)}, K_n!\Big(E + \tfrac{u}{n\rho_{\mathrm{sc}}(E)},, E + \tfrac{v}{n\rho_{\mathrm{sc}}(E)}\Big) \longrightarrow K_{\sin}(u,v) = \frac{\sin\pi(u - v)}{\pi(u - v)}. $$ The gap probability — the chance that a Borel set contains no points — is the Fredholm determinant of the kernel restricted to : $$ \mathbb{P}\big(N(B) = 0\big) = \det\big(I - K|B\big) = \sum{k=0}^{\infty}\frac{(-1)^k}{k!}\int_{B^k}\det\big[K(x_i,x_j)\big]_{i,j=1}^k, dx_1\cdots dx_k. $$

Counterexamples to common slips Intermediate+

  • Determinantal Gaussian-distributed. The kernel entries are not random; is a fixed function and the randomness is entirely in which points appear. A determinantal process is a recipe for the correlation functions, not a normal distribution.
  • The kernel is not unique. Conjugating by any nonvanishing leaves every determinant unchanged, so it leaves the process unchanged. Two different-looking kernels can describe the same process.
  • The diagonal carries the density. is the one-point intensity. For the Hermite kernel is the GUE spectral density; it is not constant, and forgetting this conflates the local sine limit (constant intensity one after rescaling) with the global semicircle profile.
  • Bulk and edge limits differ. Rescaling at a fixed bulk point gives the sine kernel; rescaling at the spectral edge on the finer scale gives the Airy kernel, not the sine kernel. The scaling exponent and the limit object both change at the edge.

Key theorem with proof Intermediate+

Theorem (sine-kernel bulk limit of the GUE). Let be the Hermite kernel of the GUE, and fix a bulk point with , so the rescaled semicircle density at is in the units where the spectrum fills . Then, with the mean-spacing rescaling , , $$ \lim_{n\to\infty}\frac{1}{n\varrho},K_n!\big(x, y\big) = \frac{\sin\pi(u - v)}{\pi(u - v)} = K_{\sin}(u, v), $$ uniformly for in compact sets. Consequently every rescaled -point correlation function converges to .

Proof. By Christoffel-Darboux the kernel is controlled by the two top Hermite functions , so the limit reduces to their Plancherel-Rotach asymptotics in the oscillatory (bulk) regime. For with in a compact subset of , $$ \phi_n(x) = \Big(\frac{2}{\pi}\Big)^{1/2}(2n)^{-1/4}(\sin\theta)^{-1/2}\Big[\cos\big(n\psi(\theta) - \tfrac{\theta}{2} + \tfrac{\pi}{4}\big) + O(n^{-1})\Big], $$ where has derivative , and the phase advances by per unit length: differentiating the local phase in gives exactly per unit at the centre point . Write the phase of at the point as , where is the phase at ; the shift by multiplies the slowly varying amplitude by and advances the fast phase by .

Abbreviate the common amplitude , evaluated at up to . Then $$ \phi_n(x) = a_n\cos(\Theta_n + \pi u) + o(a_n),\qquad \phi_{n-1}(x) = a_n\cos(\Theta_n + \pi u - \chi_n) + o(a_n), $$ where is the phase lag of relative to ; the recurrence relation for Hermite functions forces in the bulk (the two top functions are asymptotically in quadrature). Insert these into the Christoffel-Darboux numerator. With , , the denominator is , and $$ \phi_n(x)\phi_{n-1}(y) - \phi_{n-1}(x)\phi_n(y) = a_n^2\big[\cos(\alpha + \pi u)\sin(\alpha + \pi v) - \sin(\alpha + \pi u)\cos(\alpha + \pi v)\big] + o(a_n^2), $$ using with . The bracket is , an exact angle-subtraction identity that erases the unknown phase .

Collect constants. Christoffel-Darboux carries the factor , the amplitude squared is , and the diagonal density satisfies , which fixes after matching the one-point intensity to . Putting the pieces together, $$ \frac{1}{n\varrho}K_n(x,y) = \frac{1}{n\varrho}\cdot\sqrt n,\frac{a_n^2,(-\sin\pi(u-v))}{(u - v)/(n\varrho)} + o(1) = \frac{\sin\pi(u-v)}{\pi(u-v)} + o(1), $$ the normalisation absorbing every constant because the diagonal limit pins it. Uniformity on compacts follows from the uniform Plancherel-Rotach error . Since each and the determinant is continuous in its entries, the rescaled -point functions converge to .

Bridge. This computation builds toward the universal local statistics of all Hermitian random matrices, and it appears again in the gap-probability and level-spacing analysis where the Fredholm determinant is evaluated. The foundational reason a single trigonometric kernel survives is that Christoffel-Darboux reduces the entire -term sum to the top two oscillating Hermite functions, and an angle-subtraction identity annihilates their common unknown phase — this is exactly why the limit is phase-independent and hence universal across bulk locations. The determinantal structure generalises the resolvent self-consistency of 37.08.02 from one-point density to all-order correlations: there one analytic equation pins the global density, here one kernel pins every local correlation. Putting these together, the bridge is that the projection kernel onto the lowest Hermite modes, rescaled at the mean spacing, converges to the projection onto the half-line of frequencies — the sine kernel is the Fourier projection — and this is exactly the central insight that the same limit appears for every ensemble and, by the universality theorems, for every Wigner matrix with finite moments.

Exercises Intermediate+

Advanced results Master

The determinantal structure of the GUE is exact at every finite , not merely asymptotic. The squared Vandermonde factors as , and row-reducing the monomials into the orthonormal Hermite functions converts the joint density into . Integrating out variables, using the reproducing identity (because is a projection) and , the Mehta-Gaudin-Dyson integration lemma collapses each integral and yields for all . This is the finite- determinantal identity from which the sine-kernel limit is extracted.

The gap probability and level spacing are governed by the Fredholm determinant of the sine kernel on an interval. Writing for the probability that an interval of length (in mean-spacing units) is empty, the nearest-neighbour spacing density is . Gaudin and Mehta computed through the eigenvalues of the sine-kernel integral operator (prolate spheroidal functions); Jimbo-Miwa-Môri-Sato later identified with a Painlevé V transcendent, the bulk analogue of the Painlevé II governing the Tracy-Widom edge law. The small- behaviour encodes linear level repulsion at ; this linear vanishing (rather than the of Poisson or the of general ) is the determinantal fingerprint.

Soshnikov's theorem characterises which point processes are determinantal: a simple point process whose correlation functions are nonnegative and satisfy the determinantal identity for some Hermitian locally-trace-class kernel exists if and only if the operator satisfies . Projection kernels () such as the Hermite and sine kernels are the extreme case, and they describe processes with exactly the rigidity seen in eigenvalues. The general condition is the determinantal analogue of a probability lying in , and it makes the determinantal class closed under the operations — restriction, conditioning, complementation — that the eigenvalue applications require.

Bulk universality is the statement that the sine kernel is the entry-distribution-independent local limit. For Wigner matrices the proof proceeds by the three-step strategy: (i) a local semicircle law 37.08.02 controlling the density down to scale ; (ii) the relaxation of Dyson Brownian motion 37.08.03 to local equilibrium in time , which produces sine-kernel statistics for the Gaussian-divisible ensemble ; and (iii) a Green's-function comparison (the four-moment theorem of Tao-Vu, or the Erdős-Yau continuity estimate) removing the added Gaussian component because local statistics depend only on the first four moments. The same architecture, with the Airy kernel replacing the sine kernel, gives edge universality and the Tracy-Widom law 37.08.01. The sine kernel is therefore not a feature of the Gaussian weight but a universal attractor for the local correlations of Hermitian spectra.

Synthesis. The foundational reason a single kernel organises all the local statistics is that the GUE eigenvalue process is the determinantal projection onto the lowest Hermite modes, and Christoffel-Darboux compresses this projection into the top two oscillating functions whose common phase the bulk scaling annihilates — this is exactly why the limit is the Fourier projection onto the band and why it is universal. Putting these together, the moment method 37.08.01, the resolvent self-consistent equation 37.08.02, and the log-gas correlation functions 37.08.03 are three views of one spectral object: the first fixes the global semicircle, the second pins it analytically and reaches the edge, and the determinantal kernel of this unit refines both to all-order local correlations, so the global density is the diagonal of the very kernel whose bulk limit is the sine process. This generalises the central-limit principle of classical probability 37.03.01 — there a universal Gaussian governs sums, here a universal sine kernel governs local eigenvalue correlations — and it is dual to the free-probability reading in which the semicircle is the free Gaussian. The central insight is that universality is a statement about which projection survives a scaling limit, and the bridge to the frontier is that the three-step Dyson-Brownian-motion strategy turns this exact-GUE computation into a theorem for every Wigner matrix, making the sine kernel the genuine local limit of Hermitian spectra.

Full proof set Master

The Christoffel-Darboux reduction and the sine-kernel limit are proved above. The remaining Master claims are recorded here.

Proposition (the GUE process is determinantal with the Hermite kernel). The eigenvalue density has -point correlation functions , where .

Proof. The Vandermonde determinant is unchanged, up to the constant of the leading coefficients, by replacing the monomials with the orthonormal Hermite polynomials (each is a triangular linear combination of monomials). Hence . Folding into the rows gives , and the Cauchy-Binet / Andréief identity rewrites the product of the two determinants as . So . The integration lemma — using and the reproducing property valid because projects onto the span of — integrates out the last variables one at a time, each integration reducing a size- determinant to a size- determinant times the scalar , and the factorials cancel to leave .

Proposition (reproducing / projection property of ). is the kernel of the orthogonal projection of onto ; in particular and .

Proof. By orthonormality , so , the idempotence . Symmetry and reality of give self-adjointness, so is an orthogonal projection; its range is because for and for . The trace is .

Proposition (sine kernel is a projection; level repulsion). is the kernel of the orthogonal projection of onto the band-limited subspace of functions with Fourier support in , and the two-point function of the sine process is , vanishing quadratically as .

Proof. From Exercise 4, is the Fourier conjugate of multiplication by , an orthogonal projection since the indicator is real and idempotent; hence as an operator. The determinantal two-point function is , using and symmetry. Near coincidence, from the Taylor expansion of , so , the quadratic vanishing that is linear level repulsion at the level of spacings.

Proposition (gap probability is a Fredholm determinant). For a determinantal process with locally-trace-class kernel and bounded Borel , .

Proof. The void probability equals the generating functional , whose expansion against the correlation functions is (Exercise 6). Substituting gives the series , which is by definition the Fredholm determinant of the trace-class operator . Trace-class-ness on the bounded set and Hadamard's inequality make the series absolutely convergent, so the interchange of sum and expectation is justified.

Connections Master

The Wigner semicircle law and the moment method 37.08.01 supplies the global density that the determinantal kernel refines. The diagonal of the Hermite kernel, , is exactly the GUE spectral density whose large- limit is the semicircle; the local rescaling at a bulk point divides this density out to expose the sine kernel, and the square-root vanishing of the semicircle at the edge is what changes the local scaling from to and the sine kernel to the Airy kernel. This unit is the all-order local-correlation companion of that unit's one-point global law.

The Stieltjes transform and the resolvent 37.08.02 is the analytic route to the same density and the technical input to universality. The local semicircle law proved there — control of down to imaginary part — is the first of the three steps that upgrade the exact-GUE sine-kernel computation of this unit to bulk universality for all Wigner matrices, since it places eigenvalues precisely enough that the Dyson-Brownian-motion relaxation can act on the correct local scale.

The GUE joint density and the log-gas 37.08.03 is the source of the determinantal structure used throughout this unit. The squared Vandermonde of the log-gas is what factors into a single determinant of the Hermite kernel; the and log-gases are only Pfaffian, not determinantal, which is why the clean sine-kernel determinant is special to the unitary symmetry class, and the Dyson-Brownian-motion dynamics of that unit drives the relaxation step of the universality proof.

The QFT large- matrix model and topological expansion 08.14.06 meets this unit at the level of correlations: the connected two-point eigenvalue correlation of a one-matrix model has a universal bulk form — the sine kernel after local rescaling — independent of the matrix potential, the same universality the orthogonal-polynomial method makes exact here, and the loop-equation / Riemann-Hilbert analysis of that model is the field-theoretic counterpart of the Plancherel-Rotach asymptotics used in the proof.

Historical & philosophical context Master

The correlation-function structure of the Gaussian ensembles was worked out by Freeman Dyson and Madan Lal Mehta around 1960-1963. Mehta and Gaudin computed the level-spacing distribution as a Fredholm determinant of an integral operator in 1960 [Gaudin 1960], reducing it to the spectrum of the sine-kernel operator via prolate spheroidal functions; Dyson's 1962 series on the statistical theory of energy levels [Dyson 1962] introduced the correlation functions of the circular ensembles and identified the sine kernel as their bulk limit, alongside his threefold () classification by time-reversal and spin. The determinantal identity for the unitary ensemble and the orthogonal-polynomial method are systematised in Mehta's monograph [Mehta 2004], long the standard reference.

The abstraction to general determinantal point processes was pursued by Odile Macchi, who introduced them in 1975 to model fermionic systems, and the systematic theory — including the existence criterion — was given by Alexander Soshnikov in 2000 [Soshnikov 2000]. The connection to the moment problem and to integrable systems deepened when the gap probabilities were tied to Painlevé transcendents: Jimbo, Miwa, Môri and Sato linked the sine-kernel Fredholm determinant to Painlevé V in 1980, and Tracy and Widom recast both the sine (bulk) and Airy (edge) level-spacing laws in their 1994 Fredholm-determinant framework [Tracy 1994]. The proof that the sine kernel is universal — the local limit for all Wigner matrices, not merely the Gaussian ones — was completed in the late 2000s by the Erdős-Schlein-Yau program and independently by Tao and Vu, vindicating Wigner's hypothesis at the level of fine spacing statistics, and the Riemann-Hilbert asymptotic analysis of Deift and collaborators supplied the orthogonal-polynomial route to the same universality.

Bibliography Master

@article{gaudinmehta1960,
  author  = {Mehta, Madan Lal and Gaudin, Michel},
  title   = {On the density of eigenvalues of a random matrix},
  journal = {Nuclear Physics},
  volume  = {18},
  pages   = {420--427},
  year    = {1960}
}

@article{dyson1962,
  author  = {Dyson, Freeman J.},
  title   = {Statistical theory of the energy levels of complex systems. III},
  journal = {Journal of Mathematical Physics},
  volume  = {3},
  number  = {1},
  pages   = {166--175},
  year    = {1962}
}

@book{mehta2004,
  author    = {Mehta, Madan Lal},
  title     = {Random Matrices},
  edition   = {3rd},
  publisher = {Elsevier/Academic Press, Amsterdam},
  year      = {2004}
}

@article{soshnikov2000,
  author  = {Soshnikov, Alexander},
  title   = {Determinantal random point fields},
  journal = {Russian Mathematical Surveys},
  volume  = {55},
  number  = {5},
  pages   = {923--975},
  year    = {2000}
}

@article{tracywidom1994,
  author  = {Tracy, Craig A. and Widom, Harold},
  title   = {Level-spacing distributions and the Airy kernel},
  journal = {Communications in Mathematical Physics},
  volume  = {159},
  number  = {1},
  pages   = {151--174},
  year    = {1994}
}

@book{deift1999,
  author    = {Deift, Percy},
  title     = {Orthogonal Polynomials and Random Matrices: A Riemann-Hilbert Approach},
  series    = {Courant Lecture Notes},
  volume    = {3},
  publisher = {American Mathematical Society},
  year      = {1999}
}

@book{agz2010,
  author    = {Anderson, Greg W. and Guionnet, Alice and Zeitouni, Ofer},
  title     = {An Introduction to Random Matrices},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {118},
  publisher = {Cambridge University Press},
  year      = {2010}
}