Topological Entropy
Anchor (Master): Walters 1982 *An Introduction to Ergodic Theory* (Springer GTM 79) Ch. 7; Katok-Hasselblatt 1995 *Introduction to the Modern Theory of Dynamical Systems* (Cambridge University Press) §3.1 and Suppl.; Bowen 1971 *Entropy for group endomorphisms and homogeneous spaces* (Trans. Amer. Math. Soc. 153); Yomdin 1987 / Newhouse 1989 (the volume-growth = entropy theorems)
Intuition Beginner
Picture a continuous machine that takes a space and folds it back onto itself once per tick. You cannot see individual points — your eyesight is finite, so you can only tell two starting points apart if, at some moment, the machine has driven them far enough away from each other to resolve. Topological entropy is one number answering a single question: as you watch for longer and longer, how fast does the count of startings-points-you-can-tell-apart grow?
A tame machine, like a slow rotation, never spreads nearby points; after a thousand ticks you still cannot distinguish many more starts than you could after one tick. The count grows slowly, and the entropy is zero. A wild machine, like a map that stretches the space by a factor of two every tick, pulls apart neighbours fast: each extra tick of watching roughly doubles how many starts you can separate. The count grows like a power of two, and the entropy is the logarithm of that base — here, the logarithm of .
So entropy is a growth rate of distinguishability. It does not care about any measure or probability — only about the topology, about which points the dynamics eventually pries apart. Two machines that are the same after a continuous relabelling have the same entropy, which makes it a fingerprint: a single number that survives every change of coordinates and tells a clockwork system from a genuinely complicated one.
Visual Beginner
Watch the doubling map on a circle: each tick replaces an angle by twice the angle, wrapped around. A tiny arc gets stretched to double its length each tick, and a viewer who can resolve features down to one fixed size sees the number of separately-visible pieces double every tick.
A short table contrasts how fast different machines proliferate distinguishable starts.
| machine | what it does each tick | distinguishable starts after ticks | entropy |
|---|---|---|---|
| rigid rotation | carries arcs rigidly, never spreads | bounded | |
| doubling map | stretches by , wraps around | about | |
| full shift on symbols | reveals one of labels | exactly | |
| stretch by , wrap | stretches by | about |
Worked example Beginner
We count distinguishable starts for the doubling map and read off its entropy. Each tick replaces by and drops the whole-number part. In binary, , and doubling deletes the leading digit: .
Step 1. Fix a viewing resolution. Say you can resolve points down to size — you can tell which half of the circle, left or right, a point sits in. One snapshot sorts every start into groups by its first binary digit.
Step 2. Watch for ticks. After each tick the machine has shifted the digits, so the digit you can currently read was originally the st, then the nd, then the rd, and so on. Watching ticks lets you read the first binary digits of the start. Two starts are distinguishable-by-watching exactly when their first digits differ.
Step 3. Count the groups. There are possible length- binary strings, so watching ticks sorts starts into distinguishable groups: as .
Step 4. Read the growth rate. The count is , so its logarithm is , and dividing by the number of ticks gives per tick. That per-tick growth rate of the logarithm of the count is the topological entropy: .
What this tells us: stretching by a factor of two each tick is exactly what makes the count of tellable-apart starts double each tick, and the entropy is the logarithm of the stretch factor. A map that stretched by instead would give groups and entropy .
Check your understanding Beginner
Formal definition Intermediate+
Let be a continuous map of a compact metric space, as in 38.01.03. There are two equivalent routes to the topological entropy : the open-cover route of Adler, Konheim, and McAndrew, and the metric route of Bowen and Dinaburg.
Open covers. For an open cover of , let be the minimal cardinality of a subcover; compactness makes this finite. The join of covers is , and is again an open cover. Write $$ \mathcal{U}0^{n-1} = \bigvee{k=0}^{n-1} f^{-k}\mathcal{U} = \mathcal{U} \vee f^{-1}\mathcal{U} \vee \cdots \vee f^{-(n-1)}\mathcal{U}. $$ An element of is a set of points whose first iterates follow a prescribed itinerary through . The quantity is subadditive, , so the limit below exists by Fekete's lemma.
Definition (entropy of a cover; topological entropy). The entropy of relative to is $$ h(f, \mathcal{U}) = \lim_{n \to \infty} \frac{1}{n} \log N!\Big( \bigvee_{k=0}^{n-1} f^{-k}\mathcal{U} \Big) = \inf_{n \geq 1} \frac{1}{n} \log N!\Big( \bigvee_{k=0}^{n-1} f^{-k}\mathcal{U} \Big), $$ and the topological entropy of is the supremum over all open covers, $$ h_{\mathrm{top}}(f) = \sup_{\mathcal{U}} h(f, \mathcal{U}). \tag{AKM} $$ [Adler 1965]
Bowen-Dinaburg metrics. For define the Bowen metric $$ d_n(x, y) = \max_{0 \leq k < n} d\big(f^k(x), f^k(y)\big), $$ the largest separation of the two orbits over the first steps. A set is *-separated* if for all distinct , and -spanning if every point of is within -distance of some point of . Let be the maximal cardinality of an -separated set and the minimal cardinality of an -spanning set; compactness makes both finite, and .
Definition (Bowen-Dinaburg entropy). Set $$ h_{\mathrm{sep}}(f) = \lim_{\varepsilon \to 0^+} \limsup_{n \to \infty} \frac{1}{n} \log s(n, \varepsilon), \qquad h_{\mathrm{span}}(f) = \lim_{\varepsilon \to 0^+} \limsup_{n \to \infty} \frac{1}{n} \log r(n, \varepsilon). $$ [Bowen 1971], [Dinaburg 1970] The two limits coincide and equal the open-cover entropy, ; this equivalence is proved in the Key theorem below. The metrics are the Bowen-Dinaburg metrics; counts the orbit segments that are -distinguishable over steps, the rigorous form of the Beginner-tier "distinguishable starts".
The entropy is unchanged under topological conjugacy: if with a homeomorphism of compact metric spaces, then , since carries covers to covers and, being uniformly continuous with uniformly continuous inverse, carries separated sets to separated sets up to a fixed change of scale.
Counterexamples to common slips
- Topological entropy is not a measure-theoretic quantity. It uses no invariant measure; it depends only on the topology and the map. The Kolmogorov-Sinai entropy of
38.06.02is attached to a chosen invariant measure , and in general , with equality for a measure of maximal entropy. - The cover entropy for one cover is only a lower bound. A coarse cover sees little; the one-set cover gives for every . The supremum over covers (or the limit over separated sets) is essential — the analogue of taking the finest practical partition in measure-theoretic entropy.
- Separated and spanning counts differ by an order of refinement, not in their growth rate. The inequalities show the two count functions are interleaved, so after the limit they produce the same number; conflating and at fixed is the slip, taking the limit removes it.
- Entropy is not the topological-mixing or transitivity of
38.01.03. A system can be minimal with zero entropy (irrational rotation) or have positive entropy without being minimal (doubling map). Positive entropy is a quantitative strengthening of "complicated", orthogonal to the connectivity hierarchy minimal/transitive/mixing.
Key theorem with proof Intermediate+
Theorem (Bowen-Dinaburg = Adler-Konheim-McAndrew). Let be continuous on a compact metric space. Then $$ h_{\mathrm{top}}(f) = \lim_{\varepsilon \to 0^+} \limsup_{n \to \infty} \frac{1}{n} \log s(n, \varepsilon) = \lim_{\varepsilon \to 0^+} \limsup_{n \to \infty} \frac{1}{n} \log r(n, \varepsilon). $$ The open-cover entropy and the separated- and spanning-set entropies all agree. (See [Walters Ch. 7], [Bowen 1971], [Katok-Hasselblatt 1995 §3.1].)
Proof. First, and produce the same limit. From — the left because a maximal -separated set is -spanning (else a point at -distance from all of could be added, contradicting maximality), the right because an -spanning set lets at most one separated point lie in each -ball of radius — both count functions have the same exponential growth rate after letting . Call the common value .
Now relate to covers. Fix and let be a finite open cover by sets of diameter , with Lebesgue number (every -ball of radius lies in some ). The atoms of have -diameter , since membership in a single atom forces to share a set of diameter for each . An -separated set meets each atom at most once, so ; dividing by and taking limits, .
Conversely, with the same of Lebesgue number , an -spanning set has the property that the -balls of radius centred at its points cover , and each such ball lies inside a single atom of (by the Lebesgue-number property applied coordinatewise). Hence atoms meeting these balls is a subcover of size , giving . Dividing by , taking then , for every , so . The two inequalities give .
Bridge. This equivalence builds toward every concrete computation and appears again in the model evaluations of the Advanced results, where the separated-set count is the workable side — for the full shift one simply counts cylinder words, for an expanding map one counts inverse branches. The foundational reason the two definitions agree is that an open cover of small diameter and a Bowen ball of small radius measure the same thing — orbit segments that stay close for steps — so the supremum over covers and the limit over resolutions are two readings of one exponential growth rate; this is exactly the topological shadow of the partition-refinement that defines Kolmogorov-Sinai entropy in 38.06.02, where the join of partitions plays the role of the join of covers. The metric form generalises the Beginner-tier counting of distinguishable starts into a definition valid on any compact metric space, and putting these together with the variational principle shows topological entropy to be the supremum over invariant measures of the metric entropies, so the central insight is that one number certified by counting separated orbit segments is simultaneously the maximal rate of measure-theoretic information production. The bridge is the recognition that distinguishability over steps, measured topologically by covers or metrically by Bowen balls, is the single invariant the whole entropy theory turns on.
Exercises Intermediate+
Advanced results Master
Theorem (basic functorial properties). Topological entropy is a topological-conjugacy invariant. It satisfies for , and when is a homeomorphism; for a product, ; and if is a factor map (a continuous surjection with ), then , with equality when is finite-to-one with uniformly bounded fibres. (See [Walters Ch. 7], [Bowen 1971].)
These are the structural laws that make entropy a usable invariant. The product law expresses that independent systems add their complexities; the power law that watching ticks at a time multiplies the per-tick rate; the factor inequality that a quotient cannot manufacture distinguishability the cover did not already see. Invariance under conjugacy is what licenses entropy as a coordinate-free fingerprint, and it is the property that originally let Adler, Konheim, and McAndrew separate maps that no prior topological invariant distinguished.
Theorem (model evaluations). The full shift on symbols has . A subshift of finite type with irreducible transition matrix has , the logarithm of the Perron eigenvalue 38.02.02. The expanding map has , and more generally a expanding map of a compact manifold has when the degree controls the inverse-branch count. A hyperbolic toral automorphism has , the sum over eigenvalues of modulus exceeding one. (See [Katok-Hasselblatt 1995 §3.1], [Walters Ch. 7].)
Each evaluation reads the entropy off a counting problem the separated-set definition makes transparent: words for the shift, admissible paths for the SFT, inverse branches for the expanding map, Bowen rectangles for the automorphism. The toral case is the linear, constant-exponent prototype of the Pesin picture, where entropy equals integrated positive Lyapunov exponents; here the exponents are the constants , and their sum over the expanding directions is the entire entropy.
Theorem (entropy and volume growth; Yomdin-Newhouse). For a self-map of a compact manifold, the topological entropy equals the exponential growth rate of the volume of iterated submanifolds: $$ h_{\mathrm{top}}(f) = \max_{0 \leq i \leq \dim M} \limsup_{n \to \infty} \frac{1}{n} \log \sup_{\sigma} \operatorname{vol}\big(f^n \circ \sigma\big), $$ the supremum over singular -cubes of bounded volume. Newhouse proved the volume-growth rate for maps, and Yomdin the reverse, so the two coincide. (See [Yomdin 1987].)
The Yomdin-Newhouse identity ties the combinatorial entropy to differential geometry: entropy is exactly how fast stretches the area of surfaces, the length of curves, or the volume of higher cells, whichever grows fastest. The hypothesis is necessary — Yomdin's argument controls the loss of smoothness through Newton-polytope estimates on the growth of derivatives, and the theorem fails for merely maps with a defect term of order . It is the bridge from the measure-free, topological counting of Bowen balls to the volume geometry that drives smooth ergodic theory.
Synthesis. Topological entropy is the foundational reason the topological and measurable theories of complexity meet: it is the measure-free growth rate of distinguishable orbit segments, computed equivalently by open covers or Bowen balls, and the central insight of this unit is that this single number governs both the combinatorics of the shift and the geometry of smooth maps. This is exactly the content of the variational principle, which states over invariant measures, so the topological invariant dominates every Kolmogorov-Sinai entropy of 38.06.02 and is attained by a measure of maximal entropy; putting these together with the Yomdin-Newhouse theorem, entropy is simultaneously a counting exponent, a supremum of information rates, and a volume-growth rate, the three faces of one invariant. The construction is dual to the measure-theoretic entropy of 38.06.02 exactly as the join of covers is dual to the join of partitions: where partitions carry the measure, covers carry only the topology, and the supremum over covers replaces the supremum over partitions. The bridge is the recognition that the doubling map's , the golden-mean shift's , and the cat map's are one quantity wearing three computational disguises — symbolic word-counting, Perron-Frobenius spectral radius, and Lyapunov-exponent sum — and this same hierarchy generalises into the thermodynamic formalism, where topological entropy is the pressure of the zero potential and the model evaluations become special cases of the variational principle for pressure.
Full proof set Master
Proposition 1 (entropy is a conjugacy invariant). If is a homeomorphism of compact metric spaces with , then .
Proof. If is an open cover of then is an open cover of , and by the intertwining, so . A homeomorphism carries subcovers to subcovers bijectively, so , whence . Since is a bijection between the open covers of and those of , taking suprema gives .
Proposition 2 (the full shift on symbols). The shift on has .
Proof. With the metric and , holds iff and agree on coordinates (each , , must leave the zeroth coordinates equal). The sequences supported on are pairwise -separated by more than , so ; and any two agreeing on are within -distance , so one representative per length- central word is -spanning, . Hence and .
Proposition 3 (subshift of finite type). For an irreducible matrix , the SFT has .
Proof. By the same computation, equals the number of admissible length- words, and an admissible word is one with for all , so . Irreducibility and non-negativity invoke Perron-Frobenius 38.02.02: has a simple dominant eigenvalue with strictly positive eigenvectors, and with , . Then , so and .
Proposition 4 (the times- map). The expanding map on the circle has .
Proof. The Bowen ball (circle distance) is an arc of length for , since multiplies short-arc lengths by until wrap-around and the tightest constraint is the last iterate . The circle of length is covered by such arcs, so ; a maximal separated set has the same order. Therefore . For this recovers the doubling map's .
Proposition 5 (the power law). For every , .
Proof. Let be an open cover and . Then , so . Taking the supremum over (covers of the form are cofinal, being finer than , and refining can only increase cover entropy up to the limit) gives ; the reverse follows because every cover for satisfies for a refining . Hence .
Connections Master
Topological transitivity, mixing, and Devaney chaos
38.01.03. Entropy quantifies the complexity that the connectivity hierarchy of that unit only classifies qualitatively. The doubling map and full shift are exhibited there as the canonical mixing, chaotic systems, and here they acquire their numerical fingerprints and ; positive topological entropy is a quantitative sufficient condition for the chaotic regime, sharper than transitivity, and the Bowen-metric distinguishability of this unit is the metric refinement of the sensitive dependence defined there.Shifts of finite type, transition matrices, and coding
38.02.02. The SFT entropy is computed from the Perron-Frobenius spectral radius of the transition matrix introduced there, so that unit supplies the linear algebra — irreducibility, the dominant positive eigenvalue, the count — on which this unit's symbolic entropy computations rest. Coding a smooth hyperbolic system to an SFT transfers its entropy to a Perron-root computation, the standard route to evaluating in practice.Kolmogorov-Sinai entropy and the generator theorem
38.06.02. Measure-theoretic entropy is the partition-based companion to the cover-based topological entropy here; the variational principle makes the topological invariant the supremum of the measure-theoretic ones over invariant measures, with the join of partitions playing exactly the role the join of covers plays in this unit. A measure of maximal entropy realises the supremum and ties the two definitions into one.The Shannon-McMillan-Breiman theorem
38.06.03. The asymptotic equipartition property there counts typical measure-theoretic names at rate , the measure-side analogue of this unit's counting of -separated orbit segments at rate ; through the variational principle the topological separated-set count dominates every measure-theoretic typical-name count.Hyperbolic sets, Anosov and Axiom-A systems, the Smale decomposition
38.03.01. The Smale spectral decomposition splits an Axiom-A non-wandering set into transitive basic sets, each coded by a subshift of finite type; the topological entropy of the system is then the maximum of the SFT entropies of its pieces, and the hyperbolic toral automorphism's is the constant-exponent prototype of the Pesin entropy formula for the smooth hyperbolic theory.
Historical & philosophical context Master
Topological entropy was introduced by Roy Adler, Alan Konheim, and M. H. McAndrew in their 1965 Transactions of the American Mathematical Society paper [Adler 1965], who transported Kolmogorov and Sinai's measure-theoretic invariant of 1958-59 to the purely topological setting by replacing measurable partitions with open covers and the partition entropy with the cover quantity . Their definition was a conjugacy invariant computable without reference to any invariant measure, and it immediately distinguished maps that the existing topological theory could not.
The metric reformulation, more convenient for computation, was given independently by Efim Dinaburg in 1970 [Dinaburg 1970] and Rufus Bowen in 1971 [Bowen 1971], who replaced open covers by -separated and spanning sets in the Bowen metric and proved the equivalence with the Adler-Konheim-McAndrew definition; Bowen's formulation became standard and his name attaches to the dynamical metric . The link to measure-theoretic entropy was completed by the variational principle, established in stages by Dinaburg, Goodman, and Goodwyn around 1969-71, asserting that topological entropy is the supremum of Kolmogorov-Sinai entropies over invariant measures. The geometric meaning of entropy as a volume-growth rate was proved by Sheldon Newhouse in 1988 (one inequality) and Yosef Yomdin in 1987 (the reverse) [Yomdin 1987], whose semialgebraic estimates on the growth of derivatives identified the topological entropy of a map with the exponential growth rate of the volumes of iterated submanifolds.
Bibliography Master
@article{AdlerKonheimMcAndrew1965,
author = {Adler, Roy L. and Konheim, Alan G. and McAndrew, M. H.},
title = {Topological entropy},
journal = {Transactions of the American Mathematical Society},
volume = {114},
year = {1965},
pages = {309--319}
}
@article{Bowen1971,
author = {Bowen, Rufus},
title = {Entropy for group endomorphisms and homogeneous spaces},
journal = {Transactions of the American Mathematical Society},
volume = {153},
year = {1971},
pages = {401--414}
}
@article{Dinaburg1970,
author = {Dinaburg, Efim I.},
title = {The relation between topological entropy and metric entropy},
journal = {Doklady Akademii Nauk SSSR},
volume = {190},
year = {1970},
pages = {19--22}
}
@article{Yomdin1987,
author = {Yomdin, Yosef},
title = {Volume growth and entropy},
journal = {Israel Journal of Mathematics},
volume = {57},
number = {3},
year = {1987},
pages = {285--300}
}
@article{Newhouse1989,
author = {Newhouse, Sheldon E.},
title = {Continuity properties of entropy},
journal = {Annals of Mathematics},
volume = {129},
number = {2},
year = {1989},
pages = {215--235}
}
@book{Walters1982,
author = {Walters, Peter},
title = {An Introduction to Ergodic Theory},
publisher = {Springer},
series = {Graduate Texts in Mathematics},
volume = {79},
year = {1982}
}
@book{KatokHasselblatt1995,
author = {Katok, Anatole and Hasselblatt, Boris},
title = {Introduction to the Modern Theory of Dynamical Systems},
publisher = {Cambridge University Press},
series = {Encyclopedia of Mathematics and its Applications},
volume = {54},
year = {1995}
}