The Symbolic Method for Unlabelled Structures
Anchor (Master): Flajolet-Sedgewick 2009 *Analytic Combinatorics* (Cambridge) Ch. I-II and Ch. IV-VI (the symbolic method feeding singularity analysis); Bergeron-Labelle-Leroux 1998 *Combinatorial Species and Tree-like Structures* (Cambridge) Ch. 1-3 for the species-theoretic foundation of the unlabelled/labelled dictionary; Pólya 1937 *Acta Math.* 68 (the cycle-index origin of the multiset and cycle translations); Stanley 1999 *Enumerative Combinatorics, Volume 2* (Cambridge) Ch. 5 for the symbolic algebra of structures
Intuition Beginner
The previous unit taught a trick one problem at a time: write a recurrence, turn it into one equation in a series, divide, and read off the answer. The symbolic method removes the guessing entirely. You describe what your objects are made of — "a binary tree is either empty or a root with a left tree and a right tree", "a word is a row of letters", "a partition is a heap of parts" — and a fixed dictionary hands you the generating function with no algebra invented along the way.
The idea is that a small number of ways of building structures cover almost everything you want to count. You can take two kinds of object and allow either one (a choice). You can glue one object beside another (a pair). You can lay down a row of any length (a sequence). You can throw objects into an unordered heap (a multiset). Each of these building moves has one matching move on the generating function, and the matches never change. So counting becomes translation: you write the recipe for your objects in a tiny structural language, and the dictionary spells out the series.
Why does this matter beyond saving effort? Because once the generating function is written down by recipe, its shape tells you how fast the counts grow. A series that comes from a sequence-of-pieces recipe behaves one way; a series from a tree recipe behaves another. The later units in this chapter read that growth off the generating function directly. This unit builds the bridge: recipe to series. The next stage, singularity analysis, reads series to growth rate.
Visual Beginner
A generating function is still the clothesline of the previous unit: peg carries the count of objects of size . The new content is that each way of assembling objects is one operation on the line, fixed once and used forever. The table below is the start of the dictionary — the unlabelled version, where objects are plain (no name-tags on their pieces).
| recipe (how you build the object) | operation on the generating function |
|---|---|
| use an A-object or a B-object (a choice) | add the two lines: |
| an A-object beside a B-object (a pair) | multiply the two lines: |
| a row of any number of A-objects (a sequence) | divide: over |
| an unordered heap of A-objects (a multiset) | the Euler product over sizes |
Read a row left to right and you have translated a piece of a recipe into a piece of a formula. A full recipe is several rows chained together, and chaining the operations gives one equation for the whole generating function. The rest of the unit explains why each row is the exact bookkeeping for its building move, and works the dictionary on trees, compositions, partitions, and necklaces.
Worked example Beginner
Let us count compositions of a positive integer — the ways to write it as an ordered sum of positive parts. For the compositions are , , , and , so there are of them. We will get the count for every size at once from a one-line recipe.
Step 1. Name the pieces. A single part is just a positive integer , drawn as a row of dots, so it has size . The class of single parts has one object of each size , so its generating function is , which equals divided by .
Step 2. Write the recipe. A composition is a sequence of parts: lay down one part after another, in order, any number of times (but at least one part for a positive total). The sequence row of the dictionary says: divide by minus the part-line.
Step 3. Apply the dictionary. The part-line is . The sequence operation gives . Clear the inner fraction: , so the whole thing is .
Step 4. Read off counts. Expand . Since , multiplying by gives . The peg carries , matching our hand count, and the peg for carries .
What this tells us: a one-clause recipe ("a sequence of parts") produced the closed count with no recurrence guessed and no induction run. The dictionary did the counting.
Check your understanding Beginner
Formal definition Intermediate+
A combinatorial class is a finite or countable set equipped with a size function such that the number of objects of each size is finite. The sequence is the counting sequence, and the ordinary generating function (OGF) of is
$$
A(z) = \sum_{n \ge 0} A_n z^n = \sum_{\alpha \in \mathcal{A}} z^{|\alpha|} \in \mathbb{Z}[[z]],
$$
the second form being the combinatorial sum over objects, which collects once per object. The formal-power-series algebra (Cauchy product, the unit criterion, formal and , formal convergence as coefficientwise stabilisation) is taken from 40.01.03; nothing analytic is assumed of here.
A class operation is admissible when the counting sequence of the result depends only on the counting sequences of the operands, so that the operation descends to a fixed operation on OGFs. The basic admissible constructions on unlabelled classes, with assumed wherever an unbounded construction is formed (no neutral object of size inside a sequence or multiset, so that formal convergence holds), translate as follows.
- Disjoint union (objects of or of , kept disjoint, size preserved): .
- Cartesian product (ordered pairs with ): , the convolution.
- Sequence (ordered tuples of any length): .
- Multiset (finite unordered collections, repetition allowed): , the Euler transform.
- Powerset (finite unordered collections, no repetition): .
- Cycle (sequences up to cyclic rotation): , with Euler's totient.
Two further operators complete the calculus. Pointing distinguishes one of the atoms of an object, with OGF . Substitution (composition) replaces each atom of an -object by a -object, with OGF when . The symbols , , , , , , are recorded in _meta/NOTATION.md.
A specification is a (possibly recursive) system of equations writing classes in terms of atoms and these constructions; applying the dictionary term by term turns a specification into a system of equations for the OGFs, which is the engine the rest of the chapter runs.
Counterexamples to common slips Intermediate+
" always has OGF ." It requires . If has an object of size then , the geometric series fails to converge formally (infinitely many empty parts can pad any object), and the count at each size is infinite. The same proviso governs , , and .
"Multiset and sequence give the same generating function." The sequence is ordered and gives ; the multiset is unordered and gives . They agree only in degenerate cases. For a single atom (), but coincide because one atom cannot be permuted; with two atom-types the unordered count is strictly smaller.
"The multiset OGF is and the exponential form is a different rule." They are equal. Taking of the product, , so exponentiating recovers the product. The two forms are one translation read two ways.
"Pointing changes the class being counted." Pointing counts -objects each carrying a marked atom, so : the total count is reweighted by size, not altered in kind. It is the combinatorial face of the operator and is the standard device for turning a recursive specification into one solvable by the pointing-and-substitution method.
Key theorem with proof Intermediate+
The signature theorem is the admissibility and OGF translation of the multiset construction, the Euler transform, because it is the first construction whose translation is not a one-line algebraic rule but a genuine combinatorial identity — the passage from an unordered infinite product to an exponential of size-substituted copies — and it is the unlabelled prototype of every later "set of components" translation [Flajolet-Sedgewick Ch. I.2].
Theorem (multiset construction). Let be a combinatorial class with and OGF . The multiset class of finite unordered collections of -objects, with repetition allowed and the size of a collection equal to the sum of its members' sizes, is admissible, and its OGF is $$ M(z) = \prod_{\alpha \in \mathcal{A}}\frac{1}{1 - z^{|\alpha|}} = \prod_{n \ge 1}\frac{1}{(1 - z^n)^{A_n}} = \exp!\Big(\sum_{k \ge 1}\frac{1}{k}A(z^k)\Big). $$
Proof. Choose any enumeration of the objects of . A multiset over is determined by, for each object , a multiplicity recording how many copies of it contains, with only finitely many nonzero. Its size is , so the contribution of across all multiplicities is . Because each object of size contributes a factor of order , every coefficient of the formal product is determined by finitely many factors, so the product converges formally and $$ M(z) = \sum_{\text{multisets}} z^{\text{size}} = \prod_{i}\frac{1}{1 - z^{|\alpha_i|}} = \prod_{n \ge 1}\Big(\frac{1}{1 - z^n}\Big)^{A_n}, $$ collecting the objects of each size into a common exponent. This already shows admissibility: depends only on the counts . For the exponential form, take the formal logarithm and expand each factor by the series with : $$ \log M(z) = \sum_{n \ge 1} A_n \log\frac{1}{1 - z^n} = \sum_{n \ge 1} A_n \sum_{k \ge 1}\frac{z^{nk}}{k} = \sum_{k \ge 1}\frac{1}{k}\sum_{n \ge 1}A_n (z^k)^n = \sum_{k \ge 1}\frac{1}{k}A(z^k), $$ where interchanging the two sums is valid coefficientwise because for fixed total degree only finitely many pairs contribute. Exponentiating gives .
Corollary (integer partitions). Take , one object of each size , so and . A multiset of positive integers is exactly an integer partition, so the partition OGF is , the Euler product foreshadowed at 40.01.06.
Bridge. This theorem is the foundational reason the unlabelled dictionary closes: the multiset translation shows that an unordered construction, where the naive product would overcount by the symmetries of repeated parts, is still admissible, and the device that absorbs those symmetries — the substitution summed with weight — is exactly the Pólya cycle-index of the symmetric group acting on the copies, which builds toward the cycle construction's totient sum where the cyclic group replaces the symmetric one. This is exactly the unlabelled counterpart of the labelled exponential formula of 40.01.03: there a set of labelled components gives with no symmetry correction because labels break all symmetry, and the central insight is that the -substitutions here are precisely the price of having no labels. The multiset translation appears again in 40.06.10 as Pólya-Redfield enumeration and generalises to the species-level statement that the unlabelled OGF is the cycle-index series evaluated on the diagonal; putting these together, the symbolic method is one calculus with two specialisations, unlabelled OGF and labelled EGF, and the bridge is the cycle index that interpolates between them.
Exercises Intermediate+
Advanced results Master
The dictionary is closed under the constructions and the two operators, and its reach extends through recursive specifications, the supercritical schema, the pointing-and-substitution solution of tree recursions, and the cycle-index unification that places the unlabelled OGF and the labelled EGF inside one species-theoretic framework.
Theorem 1 (closure and the symbolic transfer). The class of OGFs obtainable from finite sets of atoms by finitely many applications of , , , , , , pointing , and substitution is exactly the family solving a finite constructible system. Each construction is admissible — its result-counting sequence depends only on the operand sequences — so a specification in these constructions translates termwise to a fixed-point system for the OGFs, where is built from , and composition. When the system is proper (no -cycle), it has a unique formal-power-series solution computable coefficient by coefficient [Flajolet-Sedgewick Ch. I].
Theorem 2 (iterated multiset and the Euler transform). For a class with counting sequence , the multiset has counting sequence given by the Euler transform: , equivalently . The unlabelled enumeration of rooted unordered trees (), of forests, of multigraphs by their connected components, and of integer partitions are all instances; the recursion has no closed form but is computable to any order and has a determinable exponential growth rate (Otter's constant) by singularity analysis [Flajolet-Sedgewick Ch. I.5, VII].
Theorem 3 (cycle construction and the totient sum). The cycle class of nonempty sequences modulo cyclic rotation has OGF
$$
\mathrm{CYC}(A)(z) = \sum_{k \ge 1}\frac{\varphi(k)}{k}\log\frac{1}{1 - A(z^k)},
$$
obtained from the Pólya cycle index of the cyclic group , , substituted with the figure-counting series . For a finite alphabet of letters () this gives the necklace OGF , the Moreau/Pólya necklace count; the full Pólya-Redfield theory of counting under a group action is developed at 40.06.10 [Pólya 1937].
Theorem 4 (pointing-substitution and the tree dictionary). Pointing and substitution interact through the chain rule: at the level of OGFs reflects marking an atom inside a substituted structure. For a simple variety of trees specified by with a construction (sequence, multiset, set of children), the OGF satisfies , and the smooth implicit-function / inversion schema applies: has a square-root singularity at the value where with the corresponding critical value, yielding the universal tree asymptotic. This is the analytic destination of the Lagrange-inversion algebra of 40.01.07 [Flajolet-Sedgewick Ch. VI-VII].
Theorem 5 (supercritical sequence schema). If with having radius of convergence and (the supercritical condition), then has a simple pole at the unique root of , and . The mean number of -components in a random size- object is asymptotically linear in , with the Gaussian limit law of the supercritical-composition scheme. The schema covers compositions, surjections (in the labelled mirror), and walks returning to a finite state space, and is the cleanest meeting point of the symbolic method and singularity analysis [Flajolet-Sedgewick Ch. V].
Synthesis. The symbolic method is the foundational reason combinatorial enumeration is a calculus rather than a catalogue of tricks: a class becomes a generating function, a way of building objects becomes a fixed operation on generating functions, and a structural recipe becomes a system of equations solved once. The central insight is that one dictionary governs everything — disjoint union to , product to , sequence to , multiset to the Euler transform , powerset to its alternating-sign cousin, and cycle to the totient sum — and the unordered constructions differ from the ordered ones by exactly the cycle-index symmetry corrections, so the multiset and cycle rows are the Pólya cycle indices of the symmetric and cyclic groups substituted into the figure series. This is exactly dual to the labelled EGF dictionary of 40.01.03, where labels destroy all symmetry and the set construction collapses to with no -substitution; putting these together, the unlabelled OGF and the labelled EGF are two evaluations of one species, the type series and the exponential series, and the bridge is the cycle index that interpolates between them. Every closed or recursive OGF the dictionary produces — for compositions, for plane trees, for partitions, for unordered trees — carries its growth rate in its dominant singularity, which generalises the partial-fraction asymptotics of 40.01.04 to the algebraic and transcendental singularities of this chapter. The formal recipe and the analytic growth rate are one object read at two resolutions.
Full proof set Master
Proposition 1 (admissibility of union, product, sequence). For unlabelled classes with in the sequence case: has OGF ; has OGF ; has OGF .
Proof. For the disjoint union, the objects of size are the objects from together with the from , with no overlap, so the count is and the OGF is . For the product, an object of size is a pair with ; the number of such pairs is , the Cauchy convolution, so the OGF is by the definition of the product in (from 40.01.03). For the sequence, with the empty tuple of size ; since , the -fold product has OGF with lowest term of order , so for fixed only contribute and converges formally to (the constant term of is , a unit).
Proposition 2 (multiset, exponential form). With , has OGF .
Proof. As established in the Key theorem, indexing a multiset by the multiplicity function (finitely supported) gives , the product converging formally because the factor for an object of size is . Apply formal (well-defined since the product has constant term ): . The double series is summable coefficientwise: the coefficient of receives contributions only from pairs with , finitely many. Reindexing by first, , and because has constant term .
Proposition 3 (cycle construction). With , , the nonempty sequences modulo cyclic rotation, has OGF .
Proof. A length- sequence of -objects (with multiplicities) has a cyclic symmetry group; by Burnside's lemma the number of cyclic classes weights each sequence by the reciprocal of its orbit size. The cyclic group acting on -tuples has cycle index , where is the power-sum placeholder; substituting the figure series so that and summing the generating function over all gives, after collecting by and using , the total . The interchange and the geometric resummation are valid coefficientwise because has order , so each coefficient is a finite sum.
Proposition 4 (substitution and pointing). If , the substituted class (each atom of an -object replaced by a -object) has OGF ; and the pointed class (one atom distinguished) has OGF .
Proof. For substitution, an -object of size has atoms; replacing each by a -object independently and summing sizes, the contribution of that object is , so has OGF , convergent formally because makes of order . For pointing, an object of size admits choices of distinguished atom, so the pointed class has objects of size , OGF . The chain rule then expresses through and , the recursion-unwinding identity used to solve pointed tree specifications.
Connections Master
The unlabelled OGF dictionary is the exact mirror of the labelled EGF dictionary built in
40.01.03: there the set construction of labelled components gives with no symmetry correction because labels break every symmetry, while here the multiset gives with the -substitutions supplying precisely the missing symmetry corrections. The labelled exponential formula and the unlabelled Euler transform are the two evaluations — exponential series and type series — of one combinatorial species, and the co-produced labelled-EGF symbolic-method unit40.08.02develops the labelled half of this dictionary in the analytic-combinatorics framing.The cycle and multiset constructions are the generating-function image of Pólya-Redfield enumeration: the totient sum is the cyclic-group cycle index substituted with the figure series, and the Euler transform is the symmetric-group cycle index. The full theory of counting orbits under a group action — the cycle-index polynomial, Burnside's lemma, and the pattern inventory — is owned by
40.06.10; this unit uses its two most common specialisations (necklaces and multisets) as dictionary entries, and the forward link is exact: every unlabelled construction is a cycle-index evaluation.The integer-partition generating function produced here as is the entry point to the -series and partition theory of
40.01.06, where the same product is read as a -analogue, refined by largest part and number of parts, and connected to Gaussian binomials and the Rogers-Ramanujan identities. The multiset-vs-powerset identity proved in the exercises is the symbolic-method shadow of Euler's "partitions into distinct parts equal partitions into odd parts".The whole point of writing generating functions by recipe is to feed them to singularity analysis: the location and nature of the dominant singularity of determine the asymptotic growth of . The co-produced unit
40.08.04develops the singularity-analysis transfer theorems (the -class functions, the standard-function scale ), and the supercritical-sequence and simple-variety-of-trees schemas previewed here in Theorems 4-5 are the cleanest applications; this unit supplies the generating functions whose singularities those units dissect.
Historical & philosophical context Master
The systematic translation of structural descriptions into generating functions has roots in Euler's 1748 Introductio, where the partition product first appears as the generating function of a multiset of integers, and in the nineteenth-century use of generating functions for trees and graphs by Cayley and Sylvester. The decisive structural step — recognising that a fixed dictionary of constructions translates automatically — came with George Pólya's 1937 cycle-index paper [Pólya 1937], which gave the substitution of a figure-counting series into the cycle index of a permutation group and thereby produced the multiset and cycle translations in their general form, applied to counting chemical isomers, graphs, and trees up to symmetry.
The recasting of this into a fully compositional symbolic method, with an explicit OGF/EGF dictionary closed under sum, product, sequence, set/multiset, and cycle, and with the constructions treated as admissible operators, is due to the school of Philippe Flajolet and his collaborators from the 1980s onward, consolidated in Flajolet and Sedgewick's Analytic Combinatorics (2009) [Flajolet-Sedgewick Ch. I]. The categorical foundation — a species as a functor from finite sets and bijections to finite sets, whose exponential and type generating series are the EGF and OGF — was supplied by André Joyal's 1981 theory of species and developed by Bergeron, Labelle, and Leroux [Bergeron-Labelle-Leroux 1998], who proved that the symbolic constructions are the generating-series images of categorical operations on species, explaining structurally why the dictionary closes. The unlabelled OGF is the species' type series, the diagonal evaluation of its cycle-index series.
Bibliography Master
@book{flajoletsedgewick2009,
author = {Flajolet, Philippe and Sedgewick, Robert},
title = {Analytic Combinatorics},
publisher = {Cambridge University Press},
year = {2009}
}
@book{stanley2012ec1,
author = {Stanley, Richard P.},
title = {Enumerative Combinatorics, Volume 1},
series = {Cambridge Studies in Advanced Mathematics},
volume = {49},
edition = {2},
publisher = {Cambridge University Press},
year = {2012}
}
@book{stanley1999ec2,
author = {Stanley, Richard P.},
title = {Enumerative Combinatorics, Volume 2},
series = {Cambridge Studies in Advanced Mathematics},
volume = {62},
publisher = {Cambridge University Press},
year = {1999}
}
@book{bll1998species,
author = {Bergeron, Fran\c{c}ois and Labelle, Gilbert and Leroux, Pierre},
title = {Combinatorial Species and Tree-like Structures},
series = {Encyclopedia of Mathematics and its Applications},
volume = {67},
publisher = {Cambridge University Press},
year = {1998}
}
@article{polya1937cycleindex,
author = {P\'{o}lya, George},
title = {Kombinatorische Anzahlbestimmungen f\"{u}r Gruppen, Graphen und chemische Verbindungen},
journal = {Acta Mathematica},
volume = {68},
year = {1937},
pages = {145--254}
}
@article{joyal1981species,
author = {Joyal, Andr\'{e}},
title = {Une th\'{e}orie combinatoire des s\'{e}ries formelles},
journal = {Advances in Mathematics},
volume = {42},
number = {1},
year = {1981},
pages = {1--82}
}
@book{euler1748introductio,
author = {Euler, Leonhard},
title = {Introductio in analysin infinitorum},
volume = {1},
publisher = {Marcum-Michaelem Bousquet},
address = {Lausanne},
year = {1748}
}