40.08.07 · combinatorics / analytic-combinatorics-asymptotics

Limit Laws and the Quasi-Powers Theorem

shipped3 tiersLean: none

Anchor (Master): Flajolet-Sedgewick 2009 *Analytic Combinatorics* (Cambridge) Ch. IX in full — the bivariate-GF framework, the perturbed meromorphic schema (movable simple pole $\rho(u)$, §IX.6) and the perturbed singularity-analysis / algebraic-singularity schema (movable singularity, §IX.7), Hwang's quasi-powers theorem (§IX.5) and its Berry-Esseen and local-limit refinements, and the canonical worked laws: cycles in permutations (Goncharov, Erdős-Turán), components, descents (Eulerian), patterns and the Poisson regime; Hwang 1994 *Théorèmes limites pour les structures combinatoires et les fonctions arithmétiques* (Thèse, École Polytechnique); Bender 1973 *Central and local limit theorems applied to asymptotic enumeration* (J. Combin. Theory A 15)

Intuition Beginner

The chapter so far has counted objects and measured how fast the counts grow. This unit asks a finer question about a single random object. Pick a permutation of items at random; how many cycles does it have? Pick a set partition at random; how many blocks does it have? These numbers are not fixed — they change from one object to the next. So each one is a quantity that fluctuates, and the question is what its spread looks like across all objects of a given size.

You already know one shape that fluctuating quantities love to take. Add up many small, roughly independent contributions — many coin flips, many dice — and the total piles up into a bell curve, high in the middle and thin at the edges. That is the bell-shaped law that governs averages everywhere. The surprise of this unit is that a counted feature of a random combinatorial object very often follows the same bell-shaped law, even though the object was built by a counting recipe and not by tossing coins.

Why should counting and bell curves meet? Because many combinatorial parameters secretly are sums of many small almost-independent pieces. The number of cycles of a permutation is built up cycle by cycle; the number of blocks of a partition, block by block. Each small piece nudges the total a little, the pieces barely interfere, and the pile-up rule takes over. The bell curve appears for the same underlying reason it appears for coin flips.

The everyday picture is a gumball machine. Each gumball is one object; the number printed on it is the parameter. Most printed numbers cluster near a central value, fewer stray far, and a histogram of the numbers is a smooth hump. This unit explains when that hump is guaranteed to be the bell curve, how wide it is, and how fast it settles into shape.

Visual Beginner

Picture three histograms, one for each size , of the number of cycles across all permutations of that size. As grows the histogram slides right (more cycles on average) and widens a little, but its outline keeps converging to the same smooth bell hump once you re-centre and re-scale it.

The table records what the picture promises and what the machinery of this unit delivers. Each row is a fact you can use before seeing why it holds.

idea in words what it gives you
a parameter of a random object is a fluctuating quantity a spread to describe, not a single number
a marking variable bookkeeps the parameter inside the generating function one function holding every spread at once
dividing two coefficients gives the spread of the size- object the probabilities of each parameter value
many small almost-independent pieces pile into a bell curve a guaranteed bell-shaped limit with a known width

Read top to bottom: a parameter fluctuates, a marking variable tracks it, a ratio of coefficients reads off its spread, and the pile-up rule makes the spread a bell curve. The worked example carries this through for the simplest case and shows the bell curve emerging by hand.

Worked example Beginner

Let us watch the bell curve appear for a quantity you can compute by hand: the number of heads in repeated tosses, which is the model every combinatorial bell-curve law imitates. Take tosses of a fair coin and let the parameter be the number of heads.

Step 1. List the spread. Across all equally likely outcomes, the number of heads is once, four times, six times, four times, once. So the counts are .

Step 2. Centre it. The average number of heads is (half of ). The counts are symmetric around : the same on each side. The middle value is the tallest, and the bars fall away evenly.

Step 3. Read the width. The spread is measured by a width that for fair tosses is the square root of , here the square root of , which is . So the bulk of the outcomes sits within about one step of the centre , that is between and .

Step 4. Match the hump. Plot the bars : a clean hump, tallest in the middle, symmetric, thinning at both ends. This is the bell shape in miniature.

Step 5. Grow . For tosses the centre moves to and the width to the square root of , which is ; the bars from to trace a far smoother bell. The shape sharpens into the ideal curve as grows.

Step 6. Transfer the lesson. A combinatorial parameter that is a pile-up of many small almost-independent contributions copies this behaviour: a centre that grows with size, a width that grows more slowly, and a re-scaled histogram converging to the same bell. The number of cycles of a random permutation does exactly this, with centre and width both growing like the logarithm of .

What this tells us: the bell curve is the universal shape of a pile-up of many small contributions, and a counted feature of a random object inherits it whenever the feature is built that way — centre and width are all you need to locate and size the hump.

Check your understanding Beginner

Formal definition Intermediate+

Let be a combinatorial class with size function and a parameter (a marker such as the number of cycles, blocks, descents, or parts). The bivariate generating function marking is $$ F(z, u) = \sum_{\omega \in \mathcal{F}} z^{|\omega|}, u^{\chi(\omega)} = \sum_{n, k} f_{n,k}, z^n u^k, $$ where (for labelled classes the OGF in is replaced by the EGF , with ). Setting recovers the univariate counting series , whose asymptotics are read by the methods of 40.08.03 and 40.08.06.

Definition (parameter as a random variable). Under the uniform distribution on the objects of size , the parameter becomes a random variable with $$ \mathbb{P}(X_n = k) = \frac{f_{n,k}}{f_n} = \frac{[z^n u^k]F(z,u)}{[z^n]F(z,1)}. $$ Its probability generating function (PGF) is the ratio of coefficient extractions in , $$ p_n(u) = \mathbb{E}\big[u^{X_n}\big] = \sum_k \mathbb{P}(X_n = k), u^k = \frac{[z^n]F(z,u)}{[z^n]F(z,1)}, $$ a polynomial in with . The mean and variance are read from by differentiation at : $$ \mathbb{E}[X_n] = p_n'(1), \qquad \mathrm{Var}(X_n) = p_n''(1) + p_n'(1) - p_n'(1)^2. $$ The notation , , , , and the variability functional below are recorded in _meta/NOTATION.md.

Definition (quasi-powers form). A sequence of PGFs has a quasi-powers expansion if there are functions analytic at with , a sequence , and a sequence such that $$ p_n(u) = A(u), B(u)^{\lambda_n}\big(1 + O(\kappa_n^{-1})\big) $$ holds uniformly for in a fixed complex neighbourhood of . The name records that, to leading order, is a large power — a "quasi-power" — modulated by the fixed factor . The variability constant of is $$ \mathfrak{v}(B) = \frac{B''(1)}{B(1)} + \frac{B'(1)}{B(1)} - \left(\frac{B'(1)}{B(1)}\right)^2, $$ the second derivative of at ; the mean constant is .

A pure power is the PGF of a sum of independent identically distributed variables each with PGF (when is an integer); the quasi-powers hypothesis says that behaves, in the bulk, like such a sum of small pieces, which is exactly the pile-up that produces a Gaussian.

Counterexamples to common slips Intermediate+

  • "Knowing the mean and variance is the limit law." The mean and variance fix the centre and width of the hump but not its shape; a sequence can have linear-in- mean and variance and still converge to a non-Gaussian law. The Gaussian conclusion needs the full quasi-power form uniformly on a complex neighbourhood of , not just the first two moments at .

  • " always." The growth rate is the number of small pieces, and it depends on the schema. For a movable pole (meromorphic schema) , giving mean and variance linear in ; for the number of cycles of a permutation the relevant rate is , giving mean and variance of order . A bounded does not give a Gaussian at all but a discrete (Poisson) limit.

  • "The bivariate singularity is just the univariate one with along for the ride." The whole mechanism is that the dominant singularity in moves with . If were constant in , then , the variability would vanish, and there would be no spreading; the Gaussian comes precisely from the curvature of at .

  • "A central limit theorem gives the individual probabilities." The quasi-powers theorem in its basic form is a central limit theorem: it controls the distribution function . The local limit theorem — convergence of the single probabilities to the Gaussian density — needs an extra condition ruling out oscillation of on away from ; a parameter supported on an arithmetic progression (e.g. always even) satisfies the central but not the local theorem without rescaling.

Key theorem with proof Intermediate+

The signature theorem is Hwang's quasi-powers theorem: a uniform quasi-power expansion of the PGFs forces a Gaussian limit with explicit mean, variance, and convergence speed. Every Gaussian law in this unit — cycles, descents, components — is one instance, obtained by exhibiting the quasi-power form for that parameter's bivariate generating function [Flajolet-Sedgewick §IX.5]. The proof runs the parameter's PGF through the characteristic-function machinery of the central limit theorem of 37.03.01, so this is the point where enumeration borrows the probabilist's continuity theorem.

Theorem (quasi-powers; Hwang). Suppose the PGFs admit a quasi-power expansion $$ p_n(u) = A(u), B(u)^{\lambda_n}\big(1 + O(\kappa_n^{-1})\big), \qquad \kappa_n \to \infty, $$ uniformly for in a fixed complex neighbourhood of , with analytic there, , , and variability . Then , , and the standardised variable converges to the standard normal: $$ \frac{X_n - \lambda_n\mathfrak{m}(B)}{\sqrt{\lambda_n,\mathfrak{v}(B)}} \xrightarrow{d} \mathcal{N}(0,1), $$ with Berry-Esseen speed , where is the standard normal distribution function.

Proof. Write and ; the moment claims follow by differentiating at and using , , the cumulant reading of the PGF.

For the limit law, evaluate the PGF on the unit circle at , which produces the characteristic function of the standardised variable: with ,

Fix . As , , so lies in the neighbourhood where the expansion holds for all large , and

The first term is since and is analytic. Expand at : with one has , , , so . Substituting ,

using and , and . Hence

the linear terms cancelling by the choice of centring. Thus pointwise in , the characteristic function of ; by Lévy's continuity theorem 37.03.01, . The rate follows from the Berry-Esseen inequality: the explicit error in , uniform for , bounds the characteristic-function difference , and Esseen's smoothing inequality converts this integrated difference into the stated sup-norm bound on the distribution functions.

Corollary (mean and variance schema). When comes from a movable singularity and , the mean is and the variance is , both linear in , recovering Bender's central limit theorem [Bender 1973].

Bridge. This theorem builds toward every named limit law in the chapter and appears again in 40.08.04, where the perturbed singularity analysis supplies the uniform expansion of that the hypothesis demands. The foundational reason a counting problem produces a probabilist's bell curve is that the quasi-power is exactly the probability generating function of a sum of independent small pieces, so this is exactly the central limit theorem of 37.03.02 read through a generating function: the marking variable turns the PGF into a characteristic function, and Lévy continuity does the rest. The mechanism generalises the saddle-point Gaussian of 40.08.06 — there the integrand collapsed to a Gaussian in the angle ; here a second Gaussian, in the marking direction , rides on top — and putting these together, the central insight is that the same quadratic expansion of a logarithm that gave Stirling's now gives the limiting , one collapse counting size and the other measuring a parameter.

Exercises Intermediate+

Advanced results Master

The quasi-powers theorem is the analytic kernel; its deployment rests on the two structural mechanisms that generate the quasi-power form from a bivariate generating function — the perturbed meromorphic schema and the perturbed singularity-analysis schema — together with the local-limit refinement, the discrete (Poisson) regime where the hypothesis fails, and the canonical permutation-cycle law as the worked frontier case.

Theorem 1 (meromorphic schema: movable pole gives a Gaussian). Let be analytic in a bidisc around such that, for each in a neighbourhood of , the section is meromorphic with a unique dominant simple pole at , with analytic, the dominant pole of , and the residue analytic and nonzero. Then $$ p_n(u) \sim A(u),\Big(\frac{\rho(1)}{\rho(u)}\Big)^{n}, \qquad A(u) = \frac{\operatorname{Res}{z=\rho(u)}F(z,u)/\rho(u)}{\operatorname{Res}{z=\rho(1)}F(z,1)/\rho(1)}, $$ a quasi-power with , , . Hence is asymptotically Gaussian with mean and variance , both linear in , whenever the variability is nonzero [Flajolet-Sedgewick §IX.6]. This is the law of the number of components in a supercritical sequence, the number of parts in a composition, and the number of -patterns in random words; the movable pole of 40.08.03 perturbed by the marking variable is the whole content.

Theorem 2 (singularity-analysis schema: movable algebraic singularity gives a Gaussian). Let near a movable algebraic singularity , with analytic near and non-constant. By the singularity-analysis transfer theorems (co-produced 40.08.04), uniformly for near , so $$ p_n(u) \sim \frac{c(u),\Gamma(\alpha(1))}{c(1),\Gamma(\alpha(u))},n^{\alpha(u)-\alpha(1)}\Big(\frac{\rho(1)}{\rho(u)}\Big)^{n}. $$ When is non-constant the dominant quasi-power has , ; when but varies, is a quasi-power with , . Either way is asymptotically Gaussian [Flajolet-Sedgewick §IX.7]. The non-constant- case governs the number of leaves and the number of nodes of given degree in simple varieties of random trees and the number of components of random mappings; the constant-, varying- case governs the number of cycles of a permutation (, , ) and the number of distinct prime factors of a random integer (Erdős-Kac).

Theorem 3 (central versus local limit law). Under the quasi-powers hypothesis, satisfies the central limit theorem (convergence of the distribution function to ). If in addition is uniformly small on the arc for some — equivalently is not concentrated on a proper arithmetic progression — then the local limit law holds: $$ \sup_k\left|\sqrt{\mathrm{Var}(X_n)};\mathbb{P}(X_n = k) - \frac{1}{\sqrt{2\pi}}\exp!\left(-\frac{(k - \mathbb{E}[X_n])^2}{2,\mathrm{Var}(X_n)}\right)\right| \to 0. $$ The local theorem is strictly stronger: it gives the individual probabilities, not merely the cumulative ones, and fails for a parameter supported on, say, the even integers until rescaled by the period [Bender 1973]. The arc condition is the marking-variable analogue of the tail-negligibility hypothesis of the saddle-point method of 40.08.06.

Theorem 4 (discrete/Poisson regime: when the hypothesis fails). If stays bounded — the parameter is a count of rare events, each occurring with vanishing probability — the quasi-powers hypothesis fails and there is no Gaussian. Instead, when pointwise, converges to a Poisson law of mean : . The number of fixed points of a random permutation (mean , Poisson in the limit), the number of singleton blocks of a random set partition, and the number of occurrences of a fixed pattern that appears times are governed by this regime; the boundary between Poisson and Gaussian is exactly bounded versus [Flajolet-Sedgewick §IX.11]. The Poisson convergence is the method-of-moments / Janson-inequality regime of the probabilistic combinatorics chapter 40.07.07, reached here analytically through the bivariate generating function.

Theorem 5 (permutation cycles: the Goncharov / Erdős-Turán law). Let be the number of cycles of a uniformly random permutation of , with PGF . Then , , and $$ \frac{X_n - \ln n}{\sqrt{\ln n}} \xrightarrow{d} \mathcal{N}(0,1), $$ with the local limit law for [Flajolet-Sedgewick §IX.9]. The quasi-power is , with , , , ; the rate rather than is the signature of the logarithmic-structure (constant , varying singular exponent) schema, and the same analysis with number of distinct primes of () is the Erdős-Kac theorem.

Synthesis. The quasi-powers theorem is the foundational reason enumeration meets probability: a counted parameter of a random combinatorial object becomes a probabilist's Gaussian precisely when its probability generating function is, to leading order, a large power of a fixed analytic function — the generating-function avatar of a sum of independent small pieces. The central insight is that the marking substitution turns this PGF into the characteristic function of the standardised variable, the quadratic expansion of produces , and Lévy continuity 37.03.01 delivers the normal law with a Berry-Esseen rate; this is exactly the central limit theorem of 37.03.02 read through the bivariate generating function, and it is dual to the univariate saddle-point Gaussian of 40.08.06 — there a Gaussian in the contour angle counts size, here a second Gaussian in the marking direction measures a parameter, and the same logarithm-and-quadratic collapse drives both.

The two mechanisms producing the quasi-power are the two singularity readings of the chapter perturbed by : the movable simple pole of the meromorphic schema 40.08.03, giving and laws linear in (components, descents), and the movable algebraic singularity or movable exponent of the singularity-analysis schema 40.08.04, giving or (trees, mappings, cycles). Putting these together, the whole asymptotic landscape of the chapter — meromorphic 40.08.03, singularity analysis 40.08.04, saddle point 40.08.06 — reappears here perturbed by a marking variable, and the same local analysis that read the size asymptotics now reads the distribution of every parameter the symbolic method can mark; this generalises the chapter's univariate growth analysis into a parametric limit-law machine, and the boundary where stays bounded crosses out of the Gaussian world into the Poisson regime of 40.07.07, so the quasi-powers theorem is the capstone that classifies, by the growth of , when a combinatorial parameter is Gaussian, when it is Poisson, and how fast it converges.

Full proof set Master

Proposition 1 (PGF extraction and moments). For a parameter with bivariate GF and , the PGF of is , a polynomial with , , and .

Proof. By definition , so . At , . Differentiating, , so . Next , so .

Proposition 2 (cumulants of a quasi-power). If uniformly near with , then and .

Proof. Let , the cumulant generating function of (the -term analytic in near , so differentiable term by term by uniform convergence on a complex neighbourhood and Cauchy's estimates). The first cumulant is and the second is . With , : , , by the definitions, and likewise for . Differentiating at gives and . Since is fixed, .

Proposition 3 (quasi-powers central limit theorem). Under the hypotheses of Proposition 2 with and , .

Proof. Set . The characteristic function is . Fix . For large, lies in the complex neighbourhood of where is analytic, so $$ \log p_n(e^{it/\sigma_n}) = g_A(it/\sigma_n) + \lambda_n g_B(it/\sigma_n) + O(\kappa_n^{-1}). $$ Taylor-expanding at : $$ \lambda_n g_B(it/\sigma_n) = \lambda_n\mathfrak{m}(B)\frac{it}{\sigma_n} - \lambda_n\mathfrak{v}(B)\frac{t^2}{2\sigma_n^2} + O!\Big(\frac{\lambda_n}{\sigma_n^3}\Big). $$ Now , , and . The term combines with to leave . Hence , so for every . By Lévy's continuity theorem 37.03.01, pointwise convergence of characteristic functions to the continuous limit implies .

Proposition 4 (Berry-Esseen rate). Under the hypotheses of Proposition 3, .

Proof. By Esseen's smoothing inequality, for any , $$ \sup_x|\mathbb{P}(Y_n\le x) - \Phi(x)| \le \frac{1}{\pi}\int_{-T}^{T}\frac{|\phi_n(t) - e^{-t^2/2}|}{|t|},dt + \frac{C}{T}, $$ with absolute. From Proposition 3, on the expansion gives with , so for up to a fixed multiple of . Integrating against over converges and is ; choosing makes the tail subdominant. Summing the two contributions gives the stated rate.

Proposition 5 (permutation-cycle Gaussian law). The number of cycles of a uniform random permutation satisfies .

Proof. The cycle-marked EGF is (a permutation is a set of cycles, each cycle marked by , and the EGF of cycles is , so the SET gives ). Thus and, since , . The Gamma ratio asymptotic (uniform for in a neighbourhood of , by Stirling for ) gives , a quasi-power with (), (), , (the relative error in the Gamma ratio is , hence ). Then , so , . By Proposition 3, , with the exact mean and variance from Proposition 1 differing from by , absorbed in the standardisation.

Connections Master

  • This unit is the bivariate completion of the meromorphic coefficient asymptotics of 40.08.03: there a single dominant pole fixed the growth of the counts; here the pole moves with the marking variable, , and the curvature of at is exactly the variance of the Gaussian limit law. The simple-pole coefficient asymptotic of that unit, applied uniformly in , is the engine of the meromorphic schema (Theorem 1 and Exercise 8), so the supercritical-sequence component count promised at the end of 40.08.03 is delivered here as a central limit theorem.

  • The companion unit 40.08.04 on singularity analysis supplies the other mechanism: when the dominant singularity is algebraic, , the transfer theorems of 40.08.04 give the uniform-in- coefficient asymptotic that the singularity-analysis schema (Theorem 2) feeds to the quasi-powers theorem; the movable exponent with constant produces the logarithmic rate of the permutation-cycle law, and the movable produces the linear-in- laws of random trees and mappings. This unit and 40.08.04 are co-produced; 40.08.04 is the univariate transfer theory and this unit is its parametric distribution-valued reading.

  • The Gaussian conclusion is imported wholesale from the probability chapter: Lévy's continuity theorem 37.03.01 converts pointwise convergence of the PGF-derived characteristic functions into convergence in distribution, and the Lindeberg-Feller central limit theorem 37.03.02 is the prototype that the quasi-power imitates — a sum of small almost-independent pieces. The Berry-Esseen rate of Proposition 4 is the combinatorial instance of the classical Berry-Esseen bound. The discrete regime of Theorem 4 connects forward to the Poisson-approximation and Janson-inequality methods of the probabilistic combinatorics chapter 40.07.07, reached here analytically through the bivariate generating function.

  • The Eulerian-polynomial descent law (Exercise 7) ties this unit to the permutation-statistics unit 40.01.05, whose real-rooted Eulerian polynomials furnish the cleanest quasi-power certificate (a product of independent-bit PGFs); the saddle-point Gaussian of the co-prerequisite 40.08.06 is the univariate shadow of the quasi-powers Gaussian, the angle-direction collapse there mirrored by the marking-direction collapse here, and the Stirling/Laplace quadratic of 06.01.15 underlies both. The chapter's whole asymptotic toolkit — meromorphic, singularity-analysis, saddle-point — reappears here perturbed by a marking variable to yield the limit law of every markable parameter.

Historical & philosophical context Master

The recognition that combinatorial parameters obey limit laws begins with Goncharov's 1944 determination that the number of cycles of a random permutation is asymptotically normal with mean and variance , and with the parallel number-theoretic discovery by Erdős and Kac (1940) that the number of distinct prime factors of a random integer below is asymptotically normal with mean and variance — the Erdős-Kac theorem, the founding instance of "probabilistic number theory" and a structural twin of the cycle law (both are sums of nearly independent indicators with logarithmic rate). Erdős and Turán's 1965-67 papers on the statistical theory of the symmetric group consolidated the cycle and order statistics of random permutations [Flajolet-Sedgewick §IX.9].

The analytic derivation of such laws directly from generating functions, rather than by the method of moments, was inaugurated by Edward Bender's 1973 paper Central and local limit theorems applied to asymptotic enumeration [Bender 1973], which showed that a bivariate generating function with a smoothly moving dominant singularity forces a Gaussian limit, and gave the local-limit refinement. Bender and Richmond, and later Michael Drmota and Hsien-Kuei Hwang, extended and quantified the framework. Hwang's 1994 thesis and his 1998 paper [Hwang 1998] abstracted the common hypothesis into the quasi-powers theorem: a uniform expansion on a complex neighbourhood of yields the Gaussian law together with a Berry-Esseen convergence rate , unifying the meromorphic and singularity-analysis schemata under one analytic criterion. The codification within analytic combinatorics, organising the limit laws by the two singularity mechanisms and the growth of , is due to Philippe Flajolet and Robert Sedgewick, whose Analytic Combinatorics (2009) Chapter IX is the reference treatment [Flajolet-Sedgewick Ch. IX].

Bibliography Master

@book{flajoletsedgewick2009,
  author    = {Flajolet, Philippe and Sedgewick, Robert},
  title     = {Analytic Combinatorics},
  publisher = {Cambridge University Press},
  year      = {2009}
}

@article{bender1973central,
  author  = {Bender, Edward A.},
  title   = {Central and local limit theorems applied to asymptotic enumeration},
  journal = {Journal of Combinatorial Theory, Series A},
  volume  = {15},
  year    = {1973},
  pages   = {91--111}
}

@article{hwang1998convergence,
  author  = {Hwang, Hsien-Kuei},
  title   = {On convergence rates in the central limit theorems for combinatorial structures},
  journal = {European Journal of Combinatorics},
  volume  = {19},
  year    = {1998},
  pages   = {329--343}
}

@phdthesis{hwang1994thesis,
  author  = {Hwang, Hsien-Kuei},
  title   = {Th\'{e}or\`{e}mes limites pour les structures combinatoires et les fonctions arithm\'{e}tiques},
  school  = {\'{E}cole Polytechnique},
  year    = {1994}
}

@article{erdoskac1940,
  author  = {Erd\H{o}s, Paul and Kac, Mark},
  title   = {The Gaussian law of errors in the theory of additive number theoretic functions},
  journal = {American Journal of Mathematics},
  volume  = {62},
  year    = {1940},
  pages   = {738--742}
}

@book{drmota2009random,
  author    = {Drmota, Michael},
  title     = {Random Trees: An Interplay between Combinatorics and Probability},
  publisher = {Springer},
  year      = {2009}
}