The Saddle-Point Method for Asymptotic Enumeration
Anchor (Master): Flajolet-Sedgewick 2009 *Analytic Combinatorics* (Cambridge) Ch. VIII in full — the Cauchy-coefficient saddle-point analysis of entire and fast-growing functions, the saddle-point bound and method, Hayman's admissibility theorem and the closure properties of H-admissible functions, the Moser-Wyman analysis of Bell numbers, the involution and central-binomial coefficient examples, and the connection to the circle method (Theorems VIII.1-VIII.8); Hayman 1956 *A generalisation of Stirling's formula* (J. Reine Angew. Math. 196) for the admissibility theorem; de Bruijn 1981 *Asymptotic Methods in Analysis* Ch. 5-6; Moser-Wyman 1955 *An asymptotic formula for the Bell numbers* (Trans. Roy. Soc. Canada 49)
Intuition Beginner
Some counting sequences grow so fast that the usual asymptotic tools fail. The number of ways to pair people up, the number of ways to partition a set into groups, the factorial itself — these all outrun any fixed exponential rate. The generating functions that produce them, like the exponential of or the exponential of "the exponential of minus one", have no nearest bad point on the boundary to read the growth from. The previous chapters extracted growth rates from a singularity sitting at a fixed distance from the origin. Here there is no such fixed landmark, so a different idea is needed.
That idea starts from a way of pulling a single count back out of a generating function: a coefficient equals an average of the function around a circle, divided by the radius raised to a power. You get to choose the radius of that circle. Pick it too small and the radius-power in the denominator stays large, which inflates your estimate; pick it too large and the function itself blows up in the numerator. Somewhere in between is a best radius that balances these two opposing effects, and at that radius the estimate is sharpest.
The recipe is therefore: find the radius that balances the growing function against the shrinking radius-power, and read the count off there. The balance point is where a certain "tilt" of the function matches the size you are asking about. As the size grows, the best radius moves, tracking it. This balanced-radius method is the saddle-point method, and it is exactly what is needed for fast-growing, landmark-free sequences.
The everyday picture is a mountain pass. To cross a ridge you do not climb to a peak; you walk through the lowest gap, the pass, where the ground curves up on two sides and down on the other two. The best circle threads the function's integral through such a pass, and almost the entire value of the integral comes from the small stretch right at the gap.
Visual Beginner
Think of extracting a count as walking a loop around the origin and averaging the function as you go, then dividing by the radius raised to the size you want. The one free choice is how big to make the loop. The picture below shows the trade-off that fixes the best size.
The table records the balance and what the method delivers. Each row is a fact you can use without re-deriving it, and the rest of the unit explains why each holds.
| idea in words | what it gives you |
|---|---|
| a count is a loop average of the function divided by the radius-power | an exact formula you may evaluate at any radius |
| the function grows with radius, the radius-power shrinks the estimate | two opposing effects to balance |
| the best radius is where the function's "tilt" equals the size | one equation that fixes the circle |
| at the best radius almost all the loop value sits in a small arc | the estimate becomes a single clean term |
Read top to bottom: a count is a loop average, the two effects fight, the best radius settles the fight, and the answer reads off in one term. The worked example below carries this through for the simplest fast-growing function and recovers a famous formula for the factorial.
Worked example Beginner
Let us estimate the number that the function hangs on its size- peg, using the balance idea, and watch Stirling's formula for fall out. We work the case to keep numbers concrete, then state what the balance gives in general.
Step 1. The growing part is the function value on a circle of radius ; the shrinking part is divided by (the radius-power for size ). The estimate to balance is divided by .
Step 2. Find the best radius. The function's "tilt" for is just itself (the rate at which climbs, times , is ). Setting this tilt equal to the size gives . So the best circle for the size- peg has radius .
Step 3. Read off the estimate. Plug into "function over radius-power": divided by . That is about .
Step 4. Sharpen with the pass width. The gap at the saddle has a spread measured by the square root of times the size, here . Dividing the Step 3 number by this width gives .
Step 5. Compare. The true value is . The estimate is within about two percent.
Step 6. Read it as Stirling. Undoing the division, , which for gives about against the true . This is Stirling's formula, and the balance produced it.
What this tells us: choosing the radius to balance growth against the radius-power, then dividing by the pass width, turns one loop average into a sharp count — and on the simplest fast-growing function it reproduces Stirling.
Check your understanding Beginner
Formal definition Intermediate+
Let be a power series with non-negative coefficients and radius of convergence , analytic in the open disc . The starting point is Cauchy's coefficient formula: for any with , $$ a_n = [z^n] f(z) = \frac{1}{2\pi i} \oint_{|z| = r} \frac{f(z)}{z^{n+1}}, dz = \frac{1}{2\pi}\int_{-\pi}^{\pi} \frac{f(re^{i\theta})}{r^n e^{in\theta}}, d\theta, $$ the contour being the circle of radius , traversed once counterclockwise [Flajolet-Sedgewick §VIII.1]. The integrand is , whose logarithm is . The saddle-point method chooses so that the circle passes through a saddle point of on the positive real axis.
Definition (saddle point / saddle-point equation). A saddle point of is a zero of . On the positive real axis the saddle-point equation is $$ \frac{d}{dr}\Big(\ln f(r) - (n+1)\ln r\Big) = 0 \quad\Longleftrightarrow\quad \frac{r f'(r)}{f(r)} = n+1, $$ and for asymptotic purposes one uses the simpler , defining the saddle radius . The function $$ \alpha(r) := \frac{r f'(r)}{f(r)} = r,\frac{d}{dr}\ln f(r) $$ is the first log-derivative on the circle (a tilt or mean-value parameter); for with positive coefficients increases from and the equation has a unique positive root for each large .
Definition (saddle-point variance). The second log-derivative on the circle is
$$
b(r) := r,\frac{d}{dr}\Big(r,\frac{f'(r)}{f(r)}\Big) = r,\alpha'(r) = \frac{d^2}{d\theta^2}\Big[\ln f(re^{i\theta})\Big]{\theta = 0},
$$
the curvature of along the circle at . It is non-negative for the functions treated here and plays the role of a variance: it is the width of the Gaussian into which the integrand collapses near the saddle. The notation , , , , and the contour integral $\oint{|z|=r}$ are recorded in _meta/NOTATION.md.
Definition (saddle-point estimate). When the integrand concentrates at , the saddle-point estimate is
$$
a_n = [z^n] f(z) \sim \frac{f(r_n)}{r_n^{,n},\sqrt{2\pi, b(r_n)}}, \qquad n \to \infty.
$$
The numerator is the value of the integrand at the saddle; the factor is the Gaussian width of the pass. This is the same Laplace-method reduction that produces Stirling's formula in 06.01.15, here run on the circle rather than the real line.
Counterexamples to common slips Intermediate+
"Singularity analysis and the saddle-point method are interchangeable." They are complementary. Singularity analysis reads the growth of off a dominant singularity at a fixed finite radius , giving . The saddle-point method applies precisely when there is no usable finite singularity: entire (, as , , ) or with a singularity so violent that the transfer theorems fail. The saddle radius then moves with instead of sitting at a fixed .
"The saddle radius is a constant determined by ." It is a function of : solves , so as grows the best circle expands. For , ; for , ; for , gives . A fixed-radius reading would miss the growth entirely.
"The saddle-point bound already gives the asymptotic." The bound holds for every and is one-sided; minimising it over (which lands at the saddle radius) gives the correct exponential order but overestimates by the Gaussian-width factor . The bound captures ; the method supplies the missing and turns an inequality into an asymptotic equality.
"Any function admits the saddle-point estimate." The Gaussian localisation requires that the contribution away from be negligible and that the local expansion be genuinely quadratic. Hayman's admissibility conditions (capture, divergence, width) are a clean sufficient package; a function like with competing saddles on the circle, or one whose coefficients oscillate in sign, can violate the concentration hypothesis and fail the estimate.
Key theorem with proof Intermediate+
The signature theorem is the basic saddle-point estimate for the Cauchy coefficient integral: under concentration at the real saddle, the coefficient is the saddle value of the integrand divided by the Gaussian width. Every worked enumeration in this unit — Stirling, involutions, Bell numbers — is one instance, and the admissibility theory of the next section is the certification that the concentration hypothesis holds [Flajolet-Sedgewick §VIII.2].
Theorem (basic saddle-point estimate). Let have non-negative coefficients and radius of convergence . Let solve the saddle-point equation , with . Suppose the integrand concentrates: for a suitable angular range with and ,
- (central regime) uniformly for , and
- (tail negligibility) ,
then $$ a_n \sim \frac{f(r_n)}{r_n^{,n},\sqrt{2\pi, b_n}}, \qquad n \to \infty. $$
Proof. Apply Cauchy's formula on the saddle circle : $$ a_n = \frac{1}{2\pi}\int_{-\pi}^{\pi} \frac{f(r_n e^{i\theta})}{r_n^{,n}, e^{i n \theta}}, d\theta = \frac{f(r_n)}{2\pi, r_n^{,n}}\int_{-\pi}^{\pi} \exp!\Big(\ln f(r_n e^{i\theta}) - \ln f(r_n) - i n\theta\Big), d\theta. $$ Split the integral at . On the central range , hypothesis (1) gives the exponent , the linear term having cancelled against the saddle choice (which is exactly the vanishing of the first-order term in : ). Hence the central contribution is $$ \frac{f(r_n)}{2\pi, r_n^{,n}}\int_{-\theta_0}^{\theta_0} e^{-\frac{1}{2}b_n\theta^2},(1 + o(1)), d\theta. $$ Substitute . Because , the limits tend to , so , the standard Gaussian integral [de Bruijn Ch. 5]. The central contribution is therefore $$ \frac{f(r_n)}{2\pi, r_n^{,n}}\cdot \frac{\sqrt{2\pi}}{\sqrt{b_n}}(1 + o(1)) = \frac{f(r_n)}{r_n^{,n}\sqrt{2\pi, b_n}}(1 + o(1)). $$ By hypothesis (2) the tail contributes , which is since forces , making the tail subdominant to the central term. Summing the two ranges gives .
Corollary (Stirling from ). For one has , so , and , so . The estimate reads , i.e. , Stirling's formula. This is the same conclusion derived by the Laplace method on the real Gamma integral in 06.01.15; here it is the circle-method shadow of that real-line computation.
Bridge. This estimate builds toward every fast-growth enumeration in the chapter and appears again in 40.08.07, where the same Gaussian localisation, now carried out with a secondary variable marking a parameter, produces quasi-power limit laws. The foundational reason the method works is that taking logarithms turns the product into a sum whose real part has a strict maximum on the saddle circle exactly at , so the integral is a Laplace integral on the circle; this is exactly the steepest-descent picture of 06.01.15 transported from the real axis to the contour, and it generalises Stirling's single computation into a uniform machine. The saddle radius moving with is dual to the fixed dominant singularity of singularity analysis: putting these together, a fixed singularity gives growth while a moving saddle gives the faster-than-exponential growth of entire generating functions, and the central insight is that both are one Cauchy integral read at the location where the integrand peaks.
Exercises Intermediate+
Advanced results Master
The basic estimate is the analytic kernel; its systematic deployment rests on the saddle-point bound as a one-sided estimate, Hayman's admissibility theorem certifying the concentration hypothesis once and for all, the closure calculus of H-admissible functions, and the canonical Moser-Wyman analysis of the Bell numbers as the worked frontier case.
Theorem 1 (saddle-point bound). For with non-negative coefficients and radius of convergence , and every , $$ a_n \le \frac{f(r)}{r^{n}}, \qquad\text{minimised at } r = r_n \text{ where } \alpha(r_n) = n, $$ giving . The bound is immediate from (all terms non-negative), and it is sharp on the exponential scale: the true coefficient is smaller only by the polynomial Gaussian-width factor [Flajolet-Sedgewick §VIII.1]. The bound alone fixes the exponential growth rate and is the workhorse for upper estimates where the full asymptotic is not needed.
Theorem 2 (Hayman's admissibility theorem). Call analytic in with positive Taylor coefficients H-admissible if, writing and , there is a function such that, as :
- (capture) ;
- (central / divergence) uniformly for ;
- (width / negligibility) uniformly for .
Then for the saddle-point estimate holds in its sharp form: $$ a_n = [z^n]f(z) \sim \frac{f(r_n)}{r_n^{,n}\sqrt{2\pi, b(r_n)}}, \qquad \alpha(r_n) = n. $$ The three conditions are exactly the hypotheses of the Key theorem made into properties of alone, so that admissibility is checked once and the asymptotic follows for every [Hayman 1956]. Hayman proved this as a generalisation of Stirling's formula, the case being the prototype.
Theorem 3 (H-admissible function calculus). The class of H-admissible functions is closed under the following operations [Flajolet-Sedgewick §VIII.4]: if are H-admissible and is a polynomial with real coefficients and positive leading coefficient, then (i) is H-admissible; (ii) is H-admissible; (iii) and are H-admissible; (iv) is H-admissible whenever ensures positive coefficients, in particular for with non-negative coefficients and ; (v) if is H-admissible and has positive coefficients with in the relevant sense, then is H-admissible. The rules certify , (involutions), (Bell numbers), (fragmented permutations from 40.08.02), and more, by composition rather than by re-checking the contour integral. The closure under is the engine: each application of to an admissible exponent stays admissible, which is why "exponential of a generating function" — the SET construction of 40.08.02 — is the natural saddle-point input.
Theorem 4 (Moser-Wyman asymptotics for Bell numbers). The Bell numbers satisfy $$ B_n \sim \frac{n!; e^{e^{r_n} - 1}}{r_n^{,n}\sqrt{2\pi, b_n}}, \qquad r_n e^{r_n} = n, \quad b_n = r_n(1 + r_n)e^{r_n} = n(1 + r_n), $$ with the Lambert-W value; equivalently in the Moser-Wyman variable [Moser-Wyman 1955]. The function is entire, H-admissible by Theorem 3 (it is with admissible), and the saddle is the doubly-exponential balance point rather than the linear of ; the Lambert-W saddle is the signature of a doubly-exponential generating function. The same analysis with replaced by handles the -associated Bell and Stirling families.
Theorem 5 (saddle-point asymptotics versus singularity analysis: the boundary). Let be entire of finite order and lower order equal to its order (regular growth). Then admits a saddle-point asymptotic, with saddle radius determined by , and . By contrast, if has a finite dominant singularity at of an integrable-singularity type, is governed by singularity analysis with fixed. The two regimes meet at functions with a finite singularity of exponential type (an essential singularity on the boundary, as or the partition product near ): there the relevant saddle approaches the boundary singularity, , and the analysis is a boundary saddle-point method, the analytic precursor of the circle method of 21.16.02 [Flajolet-Sedgewick §VIII.5]. The classification by where goes — to , to a fixed , or to a boundary singularity — organises the whole asymptotic landscape of the chapter.
Synthesis. The saddle-point method is the foundational reason that entire and fast-growing generating functions are not beyond asymptotic reach: where singularity analysis reads growth off a fixed singularity, the saddle-point method reads it off a moving circle whose radius is pinned by the balance between the function and the radius-power, and at that radius the Cauchy integral collapses to a Gaussian of width . The central insight is that taking the logarithm of the integrand converts a product into a sum with a strict real-part maximum on the saddle circle, so the contour integral is a Laplace integral on the circle — exactly the steepest-descent computation that gives Stirling in 06.01.15, now run as a uniform machine. This is dual to singularity analysis construction by construction: a fixed singularity gives , a moving saddle gives faster-than-exponential growth, and putting these together with Theorem 5 shows the two are one Cauchy integral classified by where its saddle goes — to infinity (entire ), to a fixed (polar/algebraic ), or to a boundary essential singularity (the circle-method regime of 21.16.02).
The bridge to enumeration is that Hayman's admissibility makes the concentration hypothesis a property of closed under , sums, and products, so the SET-construction outputs of 40.08.02 — for Bell numbers, for involutions, for fragmented permutations — are admissible by the calculus and surrender their asymptotics to one formula. This appears again in the quasi-power limit laws of 40.08.07, where the same saddle, perturbed by a marking variable, yields Gaussian fluctuations of combinatorial parameters; the construction calculus, the singularity calculus, and the saddle-point calculus are one analytic combinatorics read at three resolutions.
Full proof set Master
Proposition 1 (Cauchy coefficient formula and saddle-point bound). For analytic in with , and any : , and .
Proof. Analyticity in gives the uniformly convergent series on the circle. Multiply by and integrate over ; by orthogonality , term-by-term integration (justified by uniform convergence) leaves only , yielding . For the bound, since all ; divide by .
Proposition 2 (uniqueness and monotonicity of the saddle radius). If has positive coefficients for all in an unbounded set and , then is strictly increasing on with and ; hence for each sufficiently large the equation has a unique root , and increases to as .
Proof. Write , the mean of under the probability weights . Its derivative is , the variance of under divided by ; since has at least two distinct exponents with positive coefficients, the variance is strictly positive, so and is strictly increasing. As the weights shift mass to arbitrarily large (the radius of convergence being ), so . As , , the least exponent. By the intermediate value theorem and strict monotonicity, has the unique root for , and monotonicity of gives .
Proposition 3 (Gaussian localisation; the basic estimate). Under the hypotheses of the Key theorem (central quadratic expansion and tail negligibility at the saddle radius with ), .
Proof. This is the Key theorem; we record the localisation cleanly. By Proposition 1, with . At the saddle, and , while (the second log-derivative on the circle, real and negative). Thus near . Over with and , , and by the substitution and dominated convergence against . Over , tail negligibility gives a contribution . Hence and .
Proposition 4 (Stirling, involutions, Bell numbers from the estimate). The estimate yields ; for involution numbers; and with for Bell numbers.
Proof. For : , ; , . The estimate gives , i.e. , agreeing with the Laplace-method derivation of 06.01.15. For : , so ; , . Then and ; substituting and collects the exponent to , and reduces, via Stirling for , to . For : , ; ; the estimate is the displayed asymptotic. Each is admissible by Theorem 3, so Proposition 3 applies.
Proposition 5 (admissibility of ). The function is H-admissible, with the saddle-point estimate giving Stirling's formula.
Proof. Here , as , so capture holds. For the central condition take : then , and for , with , and with , so , matching . For width: on , , which is since . All three conditions hold, so is H-admissible; the estimate then reads , Stirling.
Connections Master
The generating functions this unit extracts are exactly the labelled SET-construction outputs of
40.08.02: the Bell-number EGF (sets of nonempty blocks), the involution EGF (sets of - and -cycles), and the fragmented-permutation EGF (sets of nonempty sequences). That unit produces these functions exactly by the symbolic method; this unit reads their large- coefficient asymptotics off a moving saddle, and the H-admissibility closure under is precisely why "exponential of an admissible exponent" — the SET construction — is the canonical saddle-point input.The Laplace-method derivation of Stirling's formula in
06.01.15is the real-line prototype of this unit's circle computation: there the integral is localised at its real maximum and reduced to a Gaussian; here the Cauchy integral on the saddle circle is localised at and reduced to the same Gaussian. The base case of the saddle-point method is Stirling, and that unit's Stirling asymptotic and Gamma function supply the normalisation converting EGF coefficients to counts throughout this unit.The partition-function heuristic of Exercise 8 — a real saddle giving the exponential rate — is made rigorous by the Hardy-Ramanujan-Rademacher circle method of
21.16.02: where the saddle-point method uses the single dominant saddle near , the circle method partitions the unit circle into Farey arcs around every root of unity and sums the modular-transformed contributions, yielding the full convergent Rademacher series. The boundary saddle-point regime of Theorem 5 (where , an essential singularity on the circle of convergence) is the analytic bridge between the two methods.The perturbed saddle, where a secondary variable marks a combinatorial parameter and the saddle-point analysis is carried out for , produces the quasi-power limit laws and Gaussian fluctuations developed in the co-produced
40.08.07; this unit fixes the one-variable saddle machinery and the H-admissibility calculus that40.08.07perturbs to obtain central limit theorems for parameters such as the number of blocks of a random set partition or the number of fixed points of a random involution.
Historical & philosophical context Master
The saddle-point method descends from the method of steepest descent, introduced by Riemann in unpublished work on hypergeometric functions and developed in print by Debye (1909) for the asymptotics of Bessel functions of large order [Flajolet-Sedgewick §VIII]. The idea of deforming a contour through a zero of the derivative of the exponent, along the path where the imaginary part of the exponent is constant so that the modulus decreases as fast as possible, gives the real Laplace integral its complex-analytic generalisation; Laplace's own method for real integrals (eighteenth century, in the asymptotics of factorials and probability integrals) is the real-axis special case. The application to coefficient extraction — combining steepest descent with Cauchy's formula on a circle whose radius is tuned to the coefficient index — became the standard tool for entire and fast-growing generating functions.
The decisive enumerative results are Moser and Wyman's 1955 asymptotic formula for the Bell numbers [Moser-Wyman 1955], which fixed the saddle at the Lambert-W point and computed the full asymptotic of set-partition counts, and Walter Hayman's 1956 A generalisation of Stirling's formula [Hayman 1956], which isolated the three conditions — capture, central convergence, and width — under which the saddle-point estimate holds, and proved the closure of the admissible class under exponentiation, sums, and products. Hayman's admissibility turned the method from a case-by-case contour computation into a calculus: a function built from by , polynomial multiplication, and addition is admissible without re-examination of the integral, and its coefficients obey at once. The codification within analytic combinatorics, with the saddle-point bound, the basic method, the admissibility theory, and the boundary-regime classification linking to the circle method, is due to Flajolet and Sedgewick, whose Analytic Combinatorics (2009) Chapter VIII is the reference treatment on which the asymptotic readings of 40.08.02 rest; de Bruijn's Asymptotic Methods in Analysis (1958) remains the canonical exposition of the underlying steepest-descent technique.
Bibliography Master
@book{flajoletsedgewick2009,
author = {Flajolet, Philippe and Sedgewick, Robert},
title = {Analytic Combinatorics},
publisher = {Cambridge University Press},
year = {2009}
}
@book{debruijn1981asymptotic,
author = {de Bruijn, N. G.},
title = {Asymptotic Methods in Analysis},
publisher = {Dover Publications},
year = {1981},
note = {Corrected reprint of the 1958 North-Holland edition}
}
@article{hayman1956generalisation,
author = {Hayman, W. K.},
title = {A generalisation of {S}tirling's formula},
journal = {Journal f\"{u}r die reine und angewandte Mathematik},
volume = {196},
year = {1956},
pages = {67--95}
}
@article{moserwyman1955bell,
author = {Moser, Leo and Wyman, Max},
title = {An asymptotic formula for the {B}ell numbers},
journal = {Transactions of the Royal Society of Canada, Section III},
volume = {49},
year = {1955},
pages = {49--54}
}
@article{debye1909asymptotische,
author = {Debye, Peter},
title = {N\"{a}herungsformeln f\"{u}r die Zylinderfunktionen f\"{u}r gro\ss{}e Werte des Arguments und unbeschr\"{a}nkt ver\"{a}nderliche Werte des Index},
journal = {Mathematische Annalen},
volume = {67},
year = {1909},
pages = {535--558}
}
@book{wilf2006gfology,
author = {Wilf, Herbert S.},
title = {generatingfunctionology},
edition = {3},
publisher = {A K Peters},
year = {2006}
}