Perron-Frobenius Theory, SFT Growth Rate, and Subshift Entropy
Anchor (Master): Lind-Marcus 1995 *An Introduction to Symbolic Dynamics and Coding* (Cambridge University Press) Ch. 4 and §4.3-§4.5 (the Perron number characterisation of SFT entropies, the spectral radius as growth rate); Walters 1982 *An Introduction to Ergodic Theory* (Springer GTM 79) §7.2-§7.3 (topological entropy, the variational principle, entropy of subshifts); Seneta 1981 *Non-negative Matrices and Markov Chains* (Springer) Ch. 1-2 (Perron-Frobenius, the Birkhoff-Hilbert projective metric contraction proof)
Intuition Beginner
Take the little graph of allowed letter-pairs from the previous unit, the one whose square table of zeros and ones records which steps a walk may take. A natural question is how rich the system is: how fast does the number of allowed words of length grow as climbs? If the count roughly doubles each time you add a letter, the system is twice as rich per step as a system whose count merely grows by a fixed amount. That per-step multiplier is the single most important number attached to the graph, and the whole point of this unit is that one number can be read straight off the table.
The mechanism is the eigenvalue. When you multiply a vector by the table again and again, the vector swings around and then settles into pointing along one favoured direction, stretching by the same factor every step. That stretch factor is the largest eigenvalue, and for a table of non-negative numbers whose graph is all one connected piece it is a single positive number that beats every other in size. The Perron-Frobenius theorem is the guarantee that this favoured direction and this dominant multiplier always exist and are unique. Because the word counts are built by repeatedly applying the table, they end up growing at exactly that rate.
So the growth rate of allowed words is the dominant eigenvalue, and its logarithm is what we call the entropy of the system: the amount of new information, measured per symbol, that the system can carry. For the golden-mean rule with its one forbidden pair, that dominant eigenvalue turns out to be the golden ratio itself, so the entropy is the logarithm of the golden ratio.
Visual Beginner
Picture a vector as an arrow, and the transition table as a machine that takes an arrow and returns a new one. Feed any starting arrow in, take the output, feed it back, and repeat. After a few rounds the arrow stops wobbling and lines up along one fixed heading, and from then on every round just makes it longer by the same factor. That fixed heading is the Perron eigenvector; that fixed stretch factor is the Perron eigenvalue.
The table below contrasts the question we ask with the answer the eigenvalue gives.
| question about the system | answer from the dominant eigenvalue |
|---|---|
| how many allowed words of length ? | about for large |
| how many sequences repeat with period ? | about (the matrix trace) |
| how much information per symbol? | the entropy, |
| which direction does counting settle into? | the Perron eigenvector |
Worked example Beginner
Take the golden-mean rule on letters and , forbidding only the pair . Its table, with rows and columns ordered then , is $$ A = \begin{pmatrix} 1 & 1 \ 1 & 0 \end{pmatrix}. $$ We find its dominant eigenvalue and read off the growth rate of allowed words.
Step 1. Write the eigenvalue equation. An eigenvalue makes shrink to a number along its favoured direction; the condition is that the table collapses, which happens exactly when $$ (1 - \lambda)(0 - \lambda) - (1)(1) = 0. $$
Step 2. Expand. This is .
Step 3. Solve. The two roots are and . The first is about , the golden ratio ; the second is about , smaller in size. The dominant one is .
Step 4. Read the growth rate. The number of allowed words of length grows like . Indeed those counts are the Fibonacci numbers , and each is about times the one before. The entropy is .
What this tells us: a single banned pair pins the per-symbol richness of an infinite family of sequences to one specific number, the golden ratio, and you obtain it by solving one quadratic. The forbidden pair sets the rule; the dominant eigenvalue measures how much room is left.
Check your understanding Beginner
Formal definition Intermediate+
Fix a non-negative real matrix indexed by a finite set. Recall from 38.02.02 that the support graph has an edge iff , that is irreducible iff is strongly connected (for every some ), and that is primitive iff entrywise for some single (irreducible and aperiodic). Write for the spectral radius, the maximum modulus of an eigenvalue.
Definition (Perron root and Perron eigenvectors). For an irreducible non-negative matrix , the Perron root (or Perron-Frobenius eigenvalue) is the spectral radius . A right Perron eigenvector is a vector (entrywise positive) with ; a left Perron eigenvector is with . The theorem below asserts these exist and are unique up to scale.
Definition (period of an irreducible matrix). The period is , independent of by 38.02.02. is aperiodic (equivalently primitive) iff .
Definition (language growth and topological entropy of a subshift). For a subshift over a finite alphabet, let be its set of admissible words of length and the complexity function. The topological entropy is $$ h(\mathsf{X}) = \lim_{n \to \infty} \frac{1}{n} \log p_n = \inf_{n \ge 1} \frac{1}{n} \log p_n, $$ the limit existing because is subadditive (, Fekete's lemma). This agrees with the open-cover and separated-set definitions of topological entropy [Walters 1982 §7.1-§7.3] applied to the shift map on .
Definition (zeta function). With from 38.02.02, the Artin-Mazur zeta function is ; its smallest pole sits at , the reciprocal of the Perron root.
Counterexamples to common slips
The spectral radius need not be an eigenvalue with a positive eigenvector when is reducible. The matrix has , but is a defective eigenvalue and there is no strictly positive eigenvector; irreducibility is what forces a positive Perron eigenvector and simplicity of the root.
Primitive is strictly stronger than irreducible, and only primitivity makes the unique eigenvalue of maximum modulus. For the -cycle the eigenvalues are , both of modulus : the matrix is irreducible but period , so its dominant modulus is achieved twice. Aperiodicity is exactly what removes the rival peripheral eigenvalues.
Entropy is a coarse invariant. Two non-conjugate irreducible SFTs can share a Perron root, hence equal entropy; entropy distinguishes systems but does not classify them (
38.02.02, strong shift equivalence).Growth rate is not the alphabet size. The full -shift has and entropy , but a proper SFT on letters has ; the forbidden words strictly lower the Perron root.
Key theorem with proof Intermediate+
Theorem (Perron-Frobenius and the entropy of an SFT). Let be an irreducible non-negative matrix with spectral radius .
(a) Perron-Frobenius. is an eigenvalue of ; it is algebraically simple; and it has strictly positive right and left eigenvectors , unique up to positive scalar. No eigenvalue exceeds in modulus. If moreover is primitive, then is the unique eigenvalue of maximum modulus: every other eigenvalue satisfies .
(b) Growth rate. If is a transition matrix of an SFT , then satisfies for constants (with at most the size of ), so $$ h(\mathsf{X}A) = \lim{n \to \infty} \frac{1}{n} \log p_n = \log \lambda(A). $$
(See [Brin-Stuck 2002 §3.3-§3.5], [Lind-Marcus 1995 Ch. 4], [Seneta 1981 Ch. 1].)
Proof. Part (a). We give the cone-contraction argument. Let and consider the map on the simplex given by . Irreducibility ensures for , so is a continuous self-map of the compact convex set ; by the Brouwer fixed-point theorem has a fixed point , giving with and . Irreducibility upgrades to : from and , applying (which is entrywise positive on the support of a strongly connected graph) gives , so every coordinate of is positive.
That : any eigenvalue with eigenvector satisfies , so the vector obeys entrywise. Pairing with the positive left eigenvector (obtained by the same argument applied to ) gives , and , hence . So is the spectral radius.
Simplicity: suppose for some real not proportional to . For small the vector is still positive; increasing to the first value where a coordinate hits gives , , has a zero coordinate, and . But must be strictly positive by the irreducibility argument, contradicting the zero coordinate of . So the -eigenspace is one-dimensional; a short additional argument with the left eigenvector (the geometric and algebraic multiplicities coincide because shows is not a defective eigenvalue) gives algebraic simplicity.
Uniqueness of maximal modulus under primitivity: let be an eigenvalue with and eigenvector . Then as above, and pairing with forces equality , so is a positive Perron eigenvector, hence up to scale, so for all . Equality in the triangle inequality at each forces all (over with ) to share a common phase ; writing , primitivity (some ) propagates a single global phase relation with , forcing . (For merely irreducible of period , the same analysis yields exactly peripheral eigenvalues .)
Part (b). Order the alphabet and recall , the total number of admissible words of length (each word is a walk of steps). Let be the all-ones vector. Then . Using the positive Perron eigenvectors normalised so , write where is the action of on the complementary spectral subspace, with spectral radius when is primitive and in general. The lower bound: for large , where . The upper bound: each entry of is at most a polynomial-in- multiple of (the Jordan form has its largest block on the eigenvalue , which is simple, so in fact the -part contributes no polynomial factor and the subdominant Jordan blocks contribute at most ). Taking logarithms and dividing by , both bounds give . Hence .
Bridge. This theorem builds toward the whole entropy theory of dynamical systems and appears again in the variational principle and in thermodynamic formalism, because it pins the topological entropy of a finite-type system to one algebraic number read off a matrix. The foundational reason the spectral radius governs the count is that the word totals are , a fixed linear functional of the matrix powers, so their growth is set by the dominant eigenvalue exactly as the periodic-point count of 38.02.02 was; this is exactly the principle that the trace, the word count, and the zeta-function pole all locate the same number . The Perron-Frobenius half is dual to the stochastic Perron-Frobenius of 37.05.02: there the irreducible non-negative matrix is row-stochastic with Perron root and positive Perron eigenvector the stationary distribution, here the Perron root is and its logarithm is the entropy, but the existence-uniqueness-positivity package and the primitivity-gives-a-spectral-gap statement are the same theorem. Putting these together, entropy generalises from this combinatorial growth rate to the measure-theoretic and topological entropies that organise all of ergodic theory, and the bridge is the variational principle linking to the supremum of measure-theoretic entropies over invariant measures.
Exercises Intermediate+
Advanced results Master
Theorem (the Birkhoff-Hilbert metric contraction proof). Let be a primitive non-negative matrix and equip the open positive cone with the Hilbert projective metric . Then acts as a contraction of with Birkhoff contraction ratio , where is the projective diameter of the image. The unique fixed ray is the Perron eigenvector , and geometrically at rate . (See [Seneta 1981 Ch. 1-2].)
The Hilbert metric collapses the scaling freedom: two positive vectors are at distance iff proportional, and — by Birkhoff's theorem — strictly shrinks this metric whenever its image has finite projective diameter, which a primitive matrix achieves after finitely many steps. The Banach fixed-point theorem then delivers the Perron eigenvector as the unique attracting ray and quantifies the spectral gap as the contraction ratio , so the subdominant modulus obeys . This is the analytic engine behind the cleaner existence proof: Brouwer gives a fixed point, but Birkhoff-Hilbert gives uniqueness, simplicity, and an exponential convergence rate in one stroke, and it is the form that survives to infinite-dimensional Perron-Frobenius (Ruelle's transfer operators in thermodynamic formalism).
Theorem (Lind's characterisation of SFT entropies). A real number equals for some irreducible SFT iff is a Perron number: a real algebraic integer that strictly exceeds the modulus of each of its Galois conjugates. Equivalently, the set of SFT growth rates is exactly , the Perron numbers. (See [Lind-Marcus 1995 Ch. 4].)
That the spectral radius of an irreducible non-negative integer matrix is a Perron number is immediate from Perron-Frobenius: is a simple root of the integer characteristic polynomial, hence an algebraic integer strictly dominating its conjugates in modulus. The depth is the converse: every Perron number is realised as some , proved by Lind via an explicit companion-type construction that builds an integer matrix whose Perron root is the prescribed algebraic integer. The characterisation closes the realisation question and identifies the entropies of finite-type symbolic systems with a clean number-theoretic class, placing topological entropy at the interface of dynamics and algebraic number theory; it is the spectral-radius counterpart of the rationality of the zeta function, both turning combinatorial data into algebraic invariants.
Theorem (variational principle for SFTs and the measure of maximal entropy). For an irreducible SFT , the topological entropy equals over -invariant Borel probability measures , and the supremum is attained by a unique measure, the Parry measure, whose transition probabilities are with stationary distribution built from the left and right Perron eigenvectors. (See [Walters 1982 §7.2-§7.3].)
The variational principle identifies the topological entropy of an SFT with the largest measure-theoretic entropy any invariant measure can carry, and the maximiser is constructed directly from Perron-Frobenius data: rescaling the (or non-negative integer) matrix by the right Perron eigenvector and the eigenvalue produces a stochastic matrix — this is exactly the diagonal similarity with — whose stationary chain is the unique measure of maximal entropy. This is the precise junction with 37.05.02: the Parry measure is the Markov measure whose stochastic transition matrix is the Perron-rescaled topological matrix, so the topological Perron root and the stochastic Perron eigenvalue of 37.05.02 are two views of one diagonal change of variables.
Synthesis. Putting these together, the entire entropy theory of finite-type symbolic dynamics is the spectral theory of one non-negative matrix, and the central insight is that the Perron root is simultaneously the word-growth rate, the periodic-point growth rate, the smallest zeta-pole reciprocal, and the maximal measure-theoretic entropy. The foundational reason these coincide is that each is a linear functional of the matrix powers with a positive component along the simple dominant eigenspace, so the Perron-Frobenius theorem forces them all to grow at rate ; this is exactly the principle that the trace counts loops, the determinant packages them, and the dominant eigenvalue measures the richness. The construction is dual to the stochastic Perron-Frobenius of 37.05.02: the Parry measure is built by the diagonal similarity that turns the topological matrix with Perron root into a stochastic matrix with Perron root , so the stationary distribution there and the measure of maximal entropy here are the same object seen through one rescaling, and the central insight generalises: entropy is the logarithm of a spectral radius. The bridge onward is that the variational principle promotes this identity to all dynamical systems through thermodynamic formalism, where the Perron-Frobenius theorem becomes the Ruelle-Perron-Frobenius theorem for transfer operators and the Perron root becomes the pressure, so this unit's finite matrix is the first nontrivial instance of the operator that organises equilibrium statistical mechanics on shift spaces.
Full proof set Master
Proposition 1 (Perron root is the spectral radius and is positive). For an irreducible non-negative , is an eigenvalue with a strictly positive eigenvector.
Proof. Apply to the simplex ; irreducibility makes for , (some row reaches a nonzero coordinate), so is continuous on the compact convex and Brouwer gives a fixed point with , , . Positivity: is entrywise positive because for a strongly connected graph on vertices every ordered pair is joined by a walk of length , so is strictly positive, forcing . That : for any eigenpair , entrywise; pairing with the positive left eigenvector gives with , so .
Proposition 2 (simplicity of the Perron root). The eigenvalue of an irreducible non-negative matrix is algebraically simple.
Proof. Geometric simplicity first. Suppose with real and independent of . Choose over indices with (or use if needed) so that has at least one zero coordinate and (independence). Then , and is strictly positive by the walk argument of Proposition 1, contradicting the zero coordinate. So the -eigenspace is one-dimensional. Algebraic simplicity: let be the left Perron eigenvector. If were a defective eigenvalue there would be a generalized eigenvector with . Then , but the left side is , a contradiction. Hence the Jordan block at is and is algebraically simple.
Proposition 3 (primitive ⇔ unique eigenvalue of maximal modulus). An irreducible non-negative matrix is primitive iff strictly exceeds the modulus of every other eigenvalue.
Proof. () Let with eigenvector . From and equality after pairing with , we get , so is a positive Perron eigenvector, up to scale. Equality in forces, for each , all with to have a common argument; writing with unitary, the relation becomes on the support, and iterating, . Primitivity gives for some , and a strictly positive matrix equation with diagonal unitary forces and scalar, so and . () If is the strict-maximum-modulus simple eigenvalue, then (Proposition 5 below), so for large : is primitive.
Proposition 4 (entropy equals log Perron root). For an irreducible transition matrix , .
Proof. . Lower bound: with Perron eigenvectors normalised and spectral decomposition (, and the spectral projection), . The constant since , and the remainder is ; dividing by gives . Upper bound: every entry of is bounded by -type Jordan estimates by since and is the largest modulus, so and . The two bounds meet; the limit exists by subadditivity (Fekete) and equals .
Proposition 5 (rank-one power limit, primitive case). For primitive with , .
Proof. Simplicity of (Proposition 2) splits , , both -invariant. Let (idempotent, ) and , so and acts on with eigenvalues the non- eigenvalues of , all of modulus by Proposition 3. Hence and .
Proposition 6 (the Parry measure attains the entropy). For an irreducible matrix with Perron data , the stochastic matrix is well-defined and stochastic, its stationary distribution is (normalised), and the associated Markov measure has , the maximum over invariant measures.
Proof. Stochastic: , and . Stationarity: , so (after normalising ). Entropy of the Markov measure: . Substituting and using (so on the support), the and contributions telescope under stationarity ( cancels), leaving . By the variational principle for every invariant , and equality holds here, so is a measure of maximal entropy; uniqueness follows from strict concavity of entropy on the simplex of Markov measures with full support.
Connections Master
Shifts of finite type, transition matrices, and coding
38.02.02. This unit supplies the spectral half of that prerequisite's combinatorial theory: the periodic-point count and the zeta-pole at established there are governed by the Perron root analysed here, and the entropy becomes the conjugacy invariant that strong shift equivalence preserves. The upstream unit owns the matrix and its trace; this unit owns the dominant eigenvalue and the entropy it computes.Class structure, irreducibility, and periodicity of Markov chains
37.05.02. The Perron-Frobenius theorem proved here is the topological twin of the stochastic Perron-Frobenius there: an irreducible non-negative matrix has a simple positive dominant eigenvalue with positive eigenvectors, equal to for a row-stochastic chain (carrying the stationary distribution) and to for a counting matrix (carrying the entropy). The Parry measure of maximal entropy is exactly the Markov chain of37.05.02obtained by the diagonal Perron rescaling, so this unit'shooks_outrecords the link.Topological transitivity, topological mixing, and Devaney chaos
38.01.03. The irreducible/primitive dichotomy that makes the Perron root simple or strictly dominant is precisely the transitive/mixing dichotomy of that unit read on the transition matrix: irreducibility gives a simple Perron root and topological transitivity, while primitivity gives a strict spectral gap and topological mixing. The positive entropy for any non-degenerate SFT certifies the exponential orbit complexity that the chaos definitions there describe qualitatively.
Historical & philosophical context Master
The dominant-eigenvalue theory originates with Oskar Perron, who proved in his 1907 paper Zur Theorie der Matrices (Mathematische Annalen 64, 248-263) [Perron 1907] that a strictly positive square matrix has a positive eigenvalue of maximal modulus, simple and with a positive eigenvector. Georg Frobenius extended this to non-negative matrices in a sequence of Berlin Academy papers culminating in Über Matrizen aus nicht negativen Elementen (Sitzungsberichte der Preußischen Akademie, 1912), isolating irreducibility as the hypothesis that preserves simplicity and positivity, and developing the imprimitivity (period) theory that pins the peripheral eigenvalues to the roots of unity . The Birkhoff projective-metric proof, which recasts Perron-Frobenius as a contraction in Hilbert's metric on the positive cone, is due to Garrett Birkhoff (1957), building on Hilbert's 1895 projective metric; it is the form that generalises to the Ruelle-Perron-Frobenius transfer operators of thermodynamic formalism.
The application to symbolic dynamics — that the topological entropy of a subshift of finite type is — was crystallised in the ergodic-theory program of the 1960s and 1970s, with William Parry's 1964 construction of the measure of maximal entropy (the Parry measure) from the Perron eigenvectors and the variational principle for topological entropy established by Goodman, Goodwyn, and Dinaburg. The number-theoretic characterisation of the achievable growth rates was completed by Douglas Lind, The entropies of topological Markov shifts and a related class of algebraic integers (Ergodic Theory and Dynamical Systems 4, 1984, 283-300), which proved that the SFT entropies are exactly the logarithms of Perron numbers. The textbook synthesis is Lind and Marcus' 1995 An Introduction to Symbolic Dynamics and Coding (Cambridge) [Lind-Marcus 1995], organising entropy around the Perron eigenvalue, with Seneta's Non-negative Matrices and Markov Chains [Seneta 1981] the standard reference for the matrix theory.
Bibliography Master
@article{Perron1907,
author = {Perron, Oskar},
title = {Zur Theorie der Matrices},
journal = {Mathematische Annalen},
volume = {64},
year = {1907},
pages = {248--263}
}
@article{Frobenius1912,
author = {Frobenius, Georg},
title = {{\"U}ber Matrizen aus nicht negativen Elementen},
journal = {Sitzungsberichte der K{\"o}niglich Preu{\ss}ischen Akademie der Wissenschaften},
year = {1912},
pages = {456--477}
}
@article{Lind1984,
author = {Lind, Douglas A.},
title = {The entropies of topological Markov shifts and a related class of algebraic integers},
journal = {Ergodic Theory and Dynamical Systems},
volume = {4},
year = {1984},
pages = {283--300}
}
@article{Parry1964,
author = {Parry, William},
title = {Intrinsic Markov chains},
journal = {Transactions of the American Mathematical Society},
volume = {112},
year = {1964},
pages = {55--66}
}
@book{LindMarcus1995pf,
author = {Lind, Douglas and Marcus, Brian},
title = {An Introduction to Symbolic Dynamics and Coding},
publisher = {Cambridge University Press},
year = {1995}
}
@book{Seneta1981,
author = {Seneta, Eugene},
title = {Non-negative Matrices and Markov Chains},
publisher = {Springer-Verlag},
series = {Springer Series in Statistics},
year = {1981}
}
@book{Walters1982,
author = {Walters, Peter},
title = {An Introduction to Ergodic Theory},
publisher = {Springer-Verlag},
series = {Graduate Texts in Mathematics},
volume = {79},
year = {1982}
}
@article{Birkhoff1957,
author = {Birkhoff, Garrett},
title = {Extensions of Jentzsch's theorem},
journal = {Transactions of the American Mathematical Society},
volume = {85},
year = {1957},
pages = {219--227}
}