Topological Pressure, the Variational Principle, and Equilibrium States
Anchor (Master): Walters 1982 *An Introduction to Ergodic Theory* (Springer GTM 79) Ch. 9 (pressure, the variational principle, equilibrium states, the measure of maximal entropy); Bowen 1975 *Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms* (Springer LNM 470); Ruelle 1978 *Thermodynamic Formalism* (Addison-Wesley, Encyclopedia of Mathematics 5); Keller 1998 *Equilibrium States in Ergodic Theory* (Cambridge LMS Student Texts 42)
Intuition Beginner
Picture all the long histories a system can produce, the same growing tree of distinguishable orbits that measures entropy. Entropy counted those histories as if each mattered equally. Now suppose you care about some more than others: you attach a running score to every step, so a long history earns a total reward equal to the sum of its step scores. Pressure answers a weighted version of the counting question — not just how many histories there are, but how much total reward the whole crowd carries, again as an exponential growth rate. When the score is zero everywhere, every history earns the same reward and pressure collapses back to ordinary entropy.
The name comes from physics. A physicist studying a gas does not just count microscopic configurations; she weights each by its energy and asks for the system's free energy, the quantity that balances having many configurations against those configurations being cheap. Pressure is exactly this free energy for a dynamical system, with the step score playing the role of minus the energy. High pressure means either there are many histories or the favored ones are richly rewarded, and the single number trades these two effects against each other.
There is a beautiful accounting identity hiding here. You can compute pressure two ways. One way looks at the system from the outside, counting weighted histories directly. The other way looks from the inside, by choosing a long-run statistical habit for the system — a way it spreads its time across states — and adding that habit's own entropy to its average score. The variational principle says these always agree: the outside count equals the best inside choice. The habit that achieves the best balance is called an equilibrium state, the statistical law the system would settle into if it were trying to be as rich and rewarding as possible at once.
The takeaway: pressure generalizes entropy by attaching scores to histories, it is the dynamical version of free energy, and the variational principle equates the externally counted pressure with the best achievable sum of entropy plus average score. The optimizing habit is the equilibrium state, and when all scores are zero it is the most chaotic habit of all, the measure of maximal entropy.
Visual Beginner
Picture two columns of bars competing to set a single number, with a dial in the middle that always reads the same whichever column you trust.
The left column counts weighted histories from outside and reads off a growth rate. The right column tries every statistical habit and, for each, stacks its entropy on top of its average score; the best stack reaches exactly the same height as the outside count. The bottom table shows the zero-score case, where pressure is just entropy and the optimizing habit is the most chaotic one the system allows.
Worked example Beginner
We compute the pressure of a two-state system whose allowed moves are recorded by a simple rule, and we find its best habit.
Step 1. The system. There are two states, call them and . At each step the system may stay or switch freely, so every two-letter combination is allowed; this is a fair two-symbol shift. With score zero everywhere, the number of allowed length- histories is , so the pressure with zero score is , the topological entropy. This is the unweighted baseline.
Step 2. Attach a score. Now reward being in state : assign score each step the system reads , and score each step it reads . A length- history earns total reward equal to the number of 's it contains. Histories heavy in now count for more.
Step 3. Count with weights. Add up the weight of each of the histories, where a history's weight is raised to its total reward. Each step independently contributes a factor if it reads and if it reads , so the per-step factor totalled over the two choices is . Over steps the total weighted count is . The growth rate is .
Step 4. Read off the pressure and the habit. So the pressure is , larger than the entropy because the reward inflates the count. The best habit spends a fraction of its time in the rewarded state and in — tilted toward but not all the way, because spending all its time in would throw away the entropy of having choices.
What this tells us: pressure blends counting and reward into one growth rate, here . The equilibrium habit does not greedily chase the reward; it balances earning score against keeping its options open, settling at the tilt versus that maximizes entropy plus average score.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is a continuous map of a compact metric space , and is a potential (a continuous real function). Write for the Birkhoff sum of along the orbit, and for the Bowen metric, which measures how far two orbits diverge over the first steps. Let denote the set of -invariant Borel probability measures, a nonempty convex weak--compact set.
Definition (separated and spanning sets). A set is -separated if for distinct ; a set is -spanning if every has for some . These quantify how many orbit segments of length are distinguishable at resolution .
Definition (topological pressure). The partition function at scale is The topological pressure of is the doubly-asymptotic exponential growth rate of the weighted count of distinguishable orbit segments. The spanning-set version, with over -spanning of , yields the same value, as does the open-cover version over subcovers of .
Definition (topological entropy as a special pressure). Setting recovers the topological entropy , where is the maximal cardinality of an -separated set. The general topological entropy this builds on is developed in 38.06.01; pressure is its potential-weighted refinement.
Definition (equilibrium state). Given , a measure is an equilibrium state for if it attains the supremum in the variational principle below: An equilibrium state for is a measure of maximal entropy: .
Definition (Gibbs state). For an expansive and a potential , an invariant measure is a Gibbs state for if there are constants and a number such that for every and ,
where is the Bowen ball 38.06.03. The constant is then the pressure . On transitive Anosov / Axiom-A systems with Hölder , the Gibbs state and the equilibrium state coincide and are unique.
Counterexamples to common slips Intermediate+
Pressure is not the maximum of the potential. is a growth rate weighted by , not or . The latter ignores the entropy contribution; pressure adds the logarithm of how many comparably-rewarded orbits exist. For , but can be positive.
The variational supremum need not be attained. On a general continuous system may fail to be upper semicontinuous in , and the supremum can be approached but unattained, so equilibrium states may not exist. Expansiveness restores upper semicontinuity and hence existence; this is not automatic.
Equilibrium states need not be unique. Uniqueness is a regularity phenomenon: it holds for Hölder potentials on transitive Anosov / Axiom-A systems (and on subshifts of finite type with the Bowen condition) but fails for merely continuous potentials, where phase transitions — several distinct equilibrium states — occur exactly as in statistical mechanics.
Topological pressure is a topological quantity; equilibrium entropy is measure-theoretic. depends only on , , and the topology, not on any measure; the term is measure-dependent. The variational principle is the bridge identifying the topological supremum with the optimal measure-theoretic value, not an identity of like quantities.
The measure of maximal entropy is not always Lebesgue or "uniform". For an irreducible subshift of finite type it is the Parry measure, whose cylinder weights come from the Perron-Frobenius left/right eigenvectors of the transition matrix, generally not the uniform product measure. Only for the full shift does it reduce to the uniform Bernoulli measure.
Key theorem with proof Intermediate+
Theorem (the variational principle for pressure; Ruelle 1973, Walters 1975). Let be a continuous map of a compact metric space and . Then In particular (the entropy variational principle, the case).
Proof. We prove the two inequalities separately.
Lower bound for every . Fix and a finite Borel partition with and . For the join , the chain rule and concavity of give, for any function ,
the last step by the inequality for a probability vector (Gibbs' inequality / Jensen applied to ), with and . Picking one point per atom gives an -separated set (atoms of diameter in the Bowen metric are -separated up to a controlled correction), so where as by uniform continuity of . Dividing by , using and 38.06.02, and letting then :
Taking the supremum over partitions gives .
Upper bound . Fix and for each choose an -separated set with . Form the atomic measures the second being the empirical average that makes the weak- limit invariant. By weak- compactness pass to a subsequence with . A partition argument identical to the entropy case (Misiurewicz's proof) shows that for a partition of diameter with , The left side is, up to the and the vanishing, . Letting yields . Combining the two bounds proves the principle.
Bridge. The variational principle builds toward every existence-and-uniqueness statement in thermodynamic formalism and appears again in the equilibrium-state and Parry-measure results of the Advanced section, where the abstract supremum is shown to be attained and, for good potentials, uniquely. The foundational reason the two sides agree is that the same separated-set count controls both the topological growth rate from above (the lower-bound partition argument bounds by ) and the measure-theoretic optimum from below (the empirical measures built from near-maximal separated sets converge to an optimizer) — this is exactly the duality that makes pressure a Legendre-type transform of entropy. Putting these together with 38.06.02, the entropy variational principle is the shadow, and it is dual to the generator theorem: where the generator theorem collapses the partition supremum inside a single measure, the variational principle collapses the measure supremum into a single topological count. The central insight is that pressure is free energy and the equilibrium state is its Gibbs measure, the bridge from ergodic theory to statistical mechanics that this unit makes precise.
Exercises Intermediate+
Advanced results Master
Theorem 1 (variational principle; Ruelle 1973, Walters 1975). For continuous on a compact metric space and , . The pressure is a convex, -Lipschitz, monotone functional of , translation-covariant under additive constants, and (cohomology-invariance): pressure depends on only through its cohomology class plus the entropy term [Walters 1975].
Theorem 2 (existence of equilibrium states under upper semicontinuity). If is upper semicontinuous on the weak--compact convex set , then every has at least one equilibrium state, and the equilibrium states form a nonempty compact convex face of whose extreme points are ergodic. Expansive maps — in particular subshifts of finite type, Anosov diffeomorphisms, and Axiom-A systems on their basic sets — have upper semicontinuous entropy, hence equilibrium states for every continuous potential [Bowen 1975].
Theorem 3 (measure of maximal entropy; Parry measure). For an irreducible subshift of finite type with transition matrix , with the Perron-Frobenius eigenvalue, and the unique measure of maximal entropy is the Parry measure, the Markov measure with transition probabilities and stationary vector , where are the left/right Perron eigenvectors normalized by . Uniqueness holds because the SFT is expansive and mixing; for a general expansive system with specification the measure of maximal entropy is likewise unique [Bowen 1974].
Theorem 4 (uniqueness for Hölder potentials; Ruelle-Bowen). On a transitive subshift of finite type, or a transitive Anosov / Axiom-A system, a Hölder-continuous potential has a unique equilibrium state , which is also the unique Gibbs state: . It is constructed from the leading eigendata of the Ruelle-Perron-Frobenius transfer operator , is ergodic with exponential decay of correlations and a central limit theorem, and depends real-analytically on within the Hölder class away from phase transitions [Bowen 1975].
Theorem 5 (SRB measures as equilibrium states). For a transitive Anosov diffeomorphism, the SRB measure is the unique equilibrium state of the geometric potential , with , equivalently the unique invariant measure satisfying Pesin's entropy formula . Its conditionals on unstable manifolds are absolutely continuous, and it is the physical measure: time averages of continuous observables converge to for Lebesgue-a.e. initial point [Bowen 1975].
Synthesis. The five results are one architecture, and the foundational reason they cohere is that the variational principle is a Legendre duality between the convex pressure functional and the concave entropy functional : the equilibrium state is exactly a tangent functional to at , so existence of equilibrium states is the existence of supporting hyperplanes and uniqueness is the differentiability of pressure. This is dual to the generator theorem and the Shannon-McMillan-Breiman theorem of 38.06.02 and 38.06.03, which fix entropy inside a single measure: here the supremum is taken across measures, and the measure of maximal entropy is the tangent — the Parry measure on an SFT, putting the Perron-Frobenius eigenvector to work. Putting these together, the thermodynamic dictionary is exact: pressure is free energy, the equilibrium state is the Gibbs measure, the transfer operator is the transfer matrix of statistical mechanics, phase transitions are non-unique equilibrium states, and the SRB measure is the physically selected equilibrium state for the geometric potential — this is exactly the central insight that the long-run statistics of a chaotic system are governed by a variational principle of the same form that governs a lattice gas, the bridge from Boltzmann-Gibbs thermodynamics to smooth dynamics that Sinai, Ruelle, and Bowen built.
Full proof set Master
Proposition 1 (pressure is well-defined and independent of the separated/spanning choice). For each the spanning-set sum and the separated-set sum have the same exponential growth rate, and exists in .
Proof. A maximal -separated set is -spanning, so writing for the infimum spanning sum, . Conversely an -separated set and an -spanning set admit an injection (each has a distinct -spanning neighbor, distinct because is -separated), and changes by at most between matched points, so . Thus the two growth rates agree up to . The double limit exists because is monotone in (smaller allows more separated points), hence has a limit as ; finiteness above follows from with the spanning count, whose rate is for compact .
Proposition 2 (Gibbs' inequality, the engine of the lower bound). For a probability vector and reals , , with equality iff .
Proof. Let and , a probability vector. The relative entropy is non-negative: by Jensen applied to the convex , equivalently . Expanding , which rearranges to . Equality in Jensen for the strictly convex forces constant, i.e. .
Proposition 3 (the supremum is attained on ergodic measures; ergodic decomposition of pressure). , and if an equilibrium state exists, an ergodic one exists.
Proof. Entropy is affine on : , by the affineness of at each partition (a consequence of the concavity-and-convexity of conditional entropy in the measure) followed by the supremum over partitions. The integral is affine, so is affine. The ergodic decomposition writes any , and affineness gives . Hence the supremum over all invariant measures equals the supremum over ergodic ones. If is an equilibrium state, , forcing for -a.e. ergodic , each of which is then an ergodic equilibrium state.
Proposition 4 (equilibrium states are the tangent functionals to pressure). Let entropy be upper semicontinuous. A measure is an equilibrium state for if and only if is a tangent functional to at , meaning for all .
Proof. If is an equilibrium state for , then for any , , so is tangent. Conversely, suppose is tangent. Upper semicontinuity gives, by the duality between the convex and the concave entropy (the variational principle realizes as the convex conjugate of ), the formula for every invariant . Tangency at says for all , so the infimum defining is attained at , giving , i.e. is an equilibrium state. Uniqueness of the equilibrium state is therefore equivalent to differentiability (Gateaux) of at .
Connections Master
The general topological entropy
38.06.01is the special case of topological pressure: , and the -separated and open-cover constructions of pressure are the potential-weighted refinements of the entropy constructions in that unit. The entropy variational principle is the shadow of the pressure variational principle proved here, so this unit subsumes and quantifies the topological-entropy theory.The Kolmogorov-Sinai entropy and generator theorem
38.06.02supplies the measure-theoretic side of the variational principle: the term is the entropy built in that unit, the chain rule and join structure power the lower-bound partition estimate, and the affineness of that gives the ergodic decomposition of pressure rests on the conditional-entropy machinery there. Pressure takes the supremum of that invariant across the simplex of invariant measures.The Shannon-McMillan-Breiman theorem
38.06.03is the pointwise companion: for the equilibrium (Gibbs) state, SMB makes a.e., and combined with the Gibbs property it yields the a.e. statement , identifying the pressure orbit-by-orbit. The typical-set count becomes the weighted count .The hyperbolic-sets and Smale decomposition theory
38.03.01is where equilibrium states become geometric: Markov partitions code an Axiom-A basic set as a subshift of finite type, transporting the Parry and Gibbs measures of this unit onto the manifold, and the Bowen specification property that guarantees uniqueness is a dynamical consequence of transitive hyperbolicity.Pesin theory and the entropy formula
38.07.02closes the dictionary: the SRB measure is the equilibrium state of the geometric potential with pressure zero, the Ruelle inequality is the one-sided variational bound , and Pesin's formula is the equality case that selects the physical measure as a thermodynamic equilibrium state.
Historical & philosophical context Master
Topological pressure entered dynamics as a deliberate import from statistical mechanics. David Ruelle, working on one-dimensional lattice gases, defined a pressure for expansive systems with specification in a 1973 paper in the Transactions of the American Mathematical Society [Ruelle 1973] and proved a variational principle for that class, modeling the construction on the free energy of a spin system. Peter Walters removed the specification hypothesis in 1975 in the American Journal of Mathematics [Walters 1975], establishing the variational principle for every continuous map of a compact metric space and every continuous potential, the form used today. The case, the entropy variational principle , had been proved earlier by Dinaburg, Goodman, and Goodwyn around 1969-1971; Misiurewicz later gave the short proof of the hard inequality reproduced in the modern literature.
The theory of equilibrium states was built in parallel by Yakov Sinai, Ruelle, and Rufus Bowen. Sinai's 1972 survey [Sinai 1972] introduced Gibbs measures into ergodic theory and constructed them for Anosov systems via Markov partitions; Ruelle developed the transfer-operator (Ruelle-Perron-Frobenius) method, and Bowen's 1974 paper on unique equilibrium states [Bowen 1974] and his 1975 Springer Lecture Notes synthesized the subject, proving existence and uniqueness for Hölder potentials on Axiom-A basic sets and identifying the SRB measure as the equilibrium state of the geometric potential. The resulting dictionary — pressure as free energy, equilibrium states as Gibbs states, the transfer operator as the transfer matrix, non-uniqueness as phase transition — made the analogy between hyperbolic dynamics and one-dimensional statistical mechanics into a working mathematical tool, and the measures it produced became the standard notion of a physically observable invariant measure for chaotic systems.
Bibliography Master
@article{Ruelle1973,
author = {Ruelle, David},
title = {Statistical mechanics on a compact set with $\mathbb{Z}^\nu$ action satisfying expansiveness and specification},
journal = {Transactions of the American Mathematical Society},
volume = {187},
year = {1973},
pages = {237--251}
}
@article{Walters1975,
author = {Walters, Peter},
title = {A variational principle for the pressure of continuous transformations},
journal = {American Journal of Mathematics},
volume = {97},
number = {4},
year = {1975},
pages = {937--971}
}
@article{Bowen1974,
author = {Bowen, Rufus},
title = {Some systems with unique equilibrium states},
journal = {Mathematical Systems Theory},
volume = {8},
number = {3},
year = {1974},
pages = {193--202}
}
@article{Sinai1972,
author = {Sinai, Yakov G.},
title = {Gibbs measures in ergodic theory},
journal = {Russian Mathematical Surveys},
volume = {27},
number = {4},
year = {1972},
pages = {21--69}
}
@book{Bowen1975,
author = {Bowen, Rufus},
title = {Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms},
publisher = {Springer},
series = {Lecture Notes in Mathematics},
volume = {470},
year = {1975}
}
@book{Ruelle1978,
author = {Ruelle, David},
title = {Thermodynamic Formalism},
publisher = {Addison-Wesley},
series = {Encyclopedia of Mathematics and its Applications},
volume = {5},
year = {1978}
}
@book{Walters1982,
author = {Walters, Peter},
title = {An Introduction to Ergodic Theory},
publisher = {Springer},
series = {Graduate Texts in Mathematics},
volume = {79},
year = {1982}
}
@book{Keller1998,
author = {Keller, Gerhard},
title = {Equilibrium States in Ergodic Theory},
publisher = {Cambridge University Press},
series = {London Mathematical Society Student Texts},
volume = {42},
year = {1998}
}