Shift Spaces and Subshifts
Anchor (Master): Lind-Marcus 1995 *An Introduction to Symbolic Dynamics and Coding* (Cambridge University Press) Ch. 1–6 (shift spaces, languages, sliding-block codes, sofic shifts, the conjugacy theory); Brin-Stuck 2002 *Introduction to Dynamical Systems* (Cambridge) Ch. 3; Kitchens 1998 *Symbolic Dynamics: One-sided, Two-sided and Countable State Markov Shifts* (Springer Universitext); Hedlund 1969 *Endomorphisms and automorphisms of the shift dynamical system* (Math. Systems Theory 3) — the originating paper for cellular automata as shift-commuting maps
Intuition Beginner
Imagine writing down what a moving system is doing by recording only a coarse summary at each tick of the clock. You partition the space into a few labelled regions, and at every step you write down the label of the region the system is sitting in. A trajectory becomes a string of symbols — a flight of letters drawn from a small alphabet — and the original complicated motion becomes the much simpler act of reading that string one letter at a time. Symbolic dynamics studies these strings as objects in their own right, and the surprising payoff is that the messy geometry of a chaotic system often becomes the tidy combinatorics of a list of letters.
The clock keeps ticking, so the natural motion on strings is to slide your reading window one place to the right: forget the first letter and start from the second. This sliding is the shift. Watching a string under repeated shifting is the symbolic version of watching a point move under the original rule. The whole space of all possible strings, with the shift acting on it, is the full shift — the universe of every conceivable record.
Not every string can actually occur. A real system may forbid certain short patterns: maybe a can never be immediately followed by another . The strings that avoid every forbidden pattern form a smaller world sitting inside the full shift, called a subshift. The single most useful fact of the subject is that a subshift is completely pinned down by its list of banned words — its grammar. To describe a symbolic system you do not draw a picture; you write down its forbidden words.
Visual Beginner
Picture a long strip of boxes stretching out to the right, each box holding a single or . That is one point of the full shift on two symbols: an endless binary record. The shift slides the whole strip one box to the left, so the second box becomes the first and the rest follow.
To the side sits a tiny rule-graph: two circles labelled and with arrows showing which letter may follow which. An arrow from to , from to , and from to , but no arrow from back to . Strings that only ever walk along arrows of this graph are exactly the strings with no two consecutive ones — the golden-mean subshift. The table below contrasts the full shift with this subshift.
| feature | full shift on two symbols | golden-mean subshift |
|---|---|---|
| alphabet | the letters and | the letters and |
| allowed strings | every string | strings with no next to a |
| forbidden words | none | the single word |
| the move | slide one box left | slide one box left |
| how many length- strings | to the power | the -th Fibonacci-type count |
Worked example Beginner
Let us count how many short strings the golden-mean subshift allows. The rule is simple: a string is allowed when it never contains two ones in a row. We count allowed strings of each length.
Step 1. Length . The allowed strings are and . That is strings.
Step 2. Length . Write all four binary pairs: . The pair has two ones in a row, so it is banned. The allowed pairs are . That is strings.
Step 3. Length . Extend each allowed length- string by one more letter, keeping only extensions that do not create a . From we may add or , giving and . From we may add only (adding would make ), giving . From we may add or , giving and . The allowed length- strings are . That is strings.
Step 4. Spot the pattern. The counts are . Each count is the sum of the two before it: indeed length gives . These are the Fibonacci numbers. The reason is that an allowed string either ends in (and anything allowed can come before it) or ends in (forced, because before a final must come a ).
What this tells us: a single banned word, , already produces a rich counting law. The number of allowed strings grows like the Fibonacci sequence, at a rate governed by the golden ratio — which is exactly why this little system is named the golden-mean subshift. Forbidding one short pattern shapes the entire long-run combinatorics.
Check your understanding Beginner
Formal definition Intermediate+
Fix a finite set with at least two elements, the alphabet. The full shift over is the sequence space (the two-sided shift) or (the one-sided shift). Either is given the product topology, the coarsest topology making every coordinate projection continuous when carries the discrete topology. This topology is induced by the metric
$$
d(x,y) = 2^{-\min{ |n| ,:, x_n \neq y_n }}, \qquad d(x,x) = 0,
$$
with the convention that the minimum over the empty set is . Two sequences are close exactly when they agree on a long central block; this is the shift metric of 38.01.01, here taken as the defining datum.
For a finite word (or block) and an index , the cylinder set $$ [w]i = { x \in A^{\mathbb{Z}} : x{i+j} = w_j \text{ for } 0 \le j < k } $$ is the set of sequences carrying the word starting at position . Cylinder sets are simultaneously open and closed (clopen), and they form a basis for the product topology.
Definition (shift map). The shift is . On the two-sided shift is a homeomorphism; on the one-sided shift is a continuous, surjective, -to-one map.
Definition (subshift). A subshift (or shift space) is a subset that is closed (in the product topology) and shift-invariant (). The pair is a topological dynamical system in the sense of 38.01.01, and the full shift is the case .
Definition (forbidden words and the language). For a set of finite words, let $$ \mathsf{X}{\mathcal{F}} = { x \in A^{\mathbb{Z}} : \text{no word of } \mathcal{F} \text{ occurs in } x }, $$ the sequences avoiding every word of . Conversely, the language of a subset is $$ \mathcal{L}(X) = { w \in A^* : w \text{ occurs in some } x \in X }, $$ the set of words that actually appear. A word occurs in if $x{i} \cdots x_{i+k-1} = wi$.
Definition (subshift of finite type). A subshift is of finite type (an SFT) if for some finite . A subshift is sofic if it is the image of an SFT under a sliding-block code (defined below). When consists of length- words, is a topological Markov chain: it is described by a transition matrix over , with iff the word is allowed, and . This matrix is the topological analogue of the stochastic transition matrix of a Markov chain 37.05.02: where the stochastic matrix records the probability of , the transition matrix records merely whether is permitted, and the irreducibility and period notions transfer verbatim to the support graph.
Definition (sliding-block code). Let and be subshifts. Fix integers (a window ) and a block map on the allowed words of length . The induced map , $$ (\phi x)n = \Phi(x{n-m} , x_{n-m+1} \cdots x_{n+m}), $$ is a sliding-block code (with memory and anticipation ). A sliding-block code commutes with the shift by construction: .
A sign / index convention. Coordinates run over with (the shift moves the window forward in time, equivalently translates the sequence to the left), and the metric weights disagreements by so that agreement on the central block forces . This matches Lind-Marcus and Brin-Stuck; the one-sided shift uses throughout.
Counterexamples to common slips
Forbidden-word sets are far from unique. The golden-mean subshift is but also — adding already-impossible words changes nothing. The canonical invariant is the language , not any particular .
Not every closed invariant set is of finite type. The even subshift (blocks of zeros between successive ones always have even length) needs the infinite forbidden list ; no finite list defines it. It is sofic but not an SFT.
Shift-invariance must be two-sided for the two-sided shift. A subshift requires , not merely . The forward orbit closure of a single non-periodic point need not be shift-invariant unless one takes the full bi-infinite orbit closure.
A subshift can be empty. If forbids every letter of as a length- word, . Non-emptiness is a real hypothesis, guaranteed when every long word extends to a longer allowed word (the extension property).
Key theorem with proof Intermediate+
Theorem (characterisation of subshifts; Curtis-Hedlund-Lyndon). Let be a finite alphabet.
(a) A subset is a subshift (closed and shift-invariant) if and only if for some set of forbidden words $\mathcal{F} \subseteq A^$.*
(b) A map between subshifts is continuous and shift-commuting () if and only if is a sliding-block code.
(See [Lind-Marcus 1995 Ch. 1–2], [Brin-Stuck 2002 §3.2], [Hedlund 1969].)
Proof. Write with the shift metric and product topology throughout; recall that cylinder sets are clopen and that is compact, being a product of finite discrete spaces (Tychonoff).
Part (a), "". Let . Shift-invariance is immediate: avoids every word of iff does, since the occurrences of a word in are exactly the occurrences in shifted by one index. For closedness, fix of length and let , a union of open cylinder sets, hence open. Then is the complement of an open set, hence closed.
Part (a), "". Let be closed and shift-invariant. Let be the set of words that occur in no point of . We show . The inclusion holds because every word occurring in a point of lies in , so no point of contains a word of . For the reverse, take ; every finite central block lies in , so there is a point in which this block occurs, and after applying a power of (using invariance) we may place the block at the centre, giving with for . Then , so ; since is closed, . Hence , and the two sets coincide.
Part (b), "". A sliding-block code with window commutes with the shift by construction, . Continuity: if agree on then agree on , since each output coordinate in that range depends only on input coordinates within of it; thus , which is (uniform) continuity.
Part (b), "". Let be continuous and shift-commuting. The coordinate map , , is continuous on the compact space , hence uniformly continuous: there is such that implies (the target is discrete, so "within " means "equal"). But means agree on the window , so depends only on . This defines a block map with . Shift-commutation transports this to every coordinate: . So is the sliding-block code of .
Bridge. The characterisation theorem builds toward the entire structure theory of symbolic systems and appears again in every classification result of the subject, because once a subshift is known to be a closed invariant set cut out by forbidden words, the conjugacy problem becomes a question about languages and codes rather than about points. The foundational reason both halves hold is the same compactness-plus-locality principle: a finite alphabet makes compact, and the discrete target turns continuity into a finite window of dependence, so global maps are local rules. This is exactly the bridge from 38.01.01: the doubling map seen through binary coding is the one-sided full shift, and conjugacies of the doubling map are sliding-block codes of the shift. The Curtis-Hedlund-Lyndon theorem generalises the elementary observation that the shift commutes with itself, and it is dual to the forbidden-word characterisation — one describes the objects (subshifts as languages), the other the morphisms (codes as local rules) of the category of symbolic systems. The central insight is that symbolic dynamics is the combinatorics of finite words made into a topological dynamical system, and putting these together gives the working definition of "the same symbolic system": a conjugacy is an invertible sliding-block code, and the entire conjugacy theory of the next units rests on this identification.
Exercises Intermediate+
Advanced results Master
Theorem (the full shift is a Cantor set; is the model expansive system). For a finite alphabet with , the full shift with the shift metric is a compact, perfect, totally disconnected metric space — a Cantor set — and the shift is a homeomorphism that is expansive with constant , has points of period dividing , dense periodic points, and a dense orbit (topological transitivity), hence is chaotic in the sense of Devaney. (See [Lind-Marcus 1995 Ch. 6], [Kitchens 1998 Ch. 1].)
Compactness is Tychonoff applied to the finite discrete factors; perfectness is the observation that every cylinder contains at least two points once , so no point is isolated; total disconnectedness follows from the clopen cylinder basis, since any two points are separated by a clopen set. The standard topological characterisation of the Cantor set as the unique non-empty compact, perfect, totally disconnected, metrizable space identifies with the middle-thirds Cantor set as a topological space. The dynamical content lies entirely in : expansiveness says the symbol sequence is a complete invariant of the orbit, dense periodic points come from periodic repetition of central blocks, and a transitive point is built by concatenating an enumeration of all finite words. Every expansive homeomorphism of a Cantor set is, by a theorem of Hedlund-type coding, a subshift, so the full shift and its subshifts exhaust the expansive Cantor systems up to conjugacy.
Theorem (subshifts of finite type and the topological Markov chain). A subshift is of finite type iff, after recoding to a higher block alphabet, is a vertex/edge shift of a finite directed graph; equivalently is described by a transition matrix . The topological entropy is , where is the Perron eigenvalue (spectral radius) of , and the number of period- points is . (Lind-Marcus 1995 [Lind-Marcus 1995 Ch. 4].)
The recoding to length- blocks turns any SFT with forbidden words of length into a Markov (-step) SFT on the alphabet of allowed -blocks, so finite type is exactly "memory of bounded length". The transition matrix is the shadow of the stochastic matrix in 37.05.02: irreducibility of (strong connectedness of its graph) makes the SFT topologically transitive, and the period of (the gcd of its cycle lengths) is the symbolic period; a primitive gives a mixing SFT. The Perron-Frobenius eigenvalue governs growth of the word count exactly as it governs the convergence rate of the stochastic chain, and counts period- points through the matrix's whole spectrum, the zeta function being rational.
Theorem (sofic shifts and the Krieger cover). The sofic shifts are exactly the factors of subshifts of finite type under sliding-block codes; equivalently they are the sets of label-sequences of finite labelled graphs. Every irreducible sofic shift has a unique minimal deterministic presentation, the Krieger (future) cover. Sofic but non-SFT shifts exist, the even shift being the canonical example. (Lind-Marcus 1995 [Lind-Marcus 1995 §3.2, Ch. 3].)
A sofic shift records a regular language of allowed words in the sense of automata theory — the connection that makes symbolic dynamics and finite-state machines two views of one subject — while an SFT records the special regular languages defined by a finite list of forbidden factors. The Krieger cover is the minimal automaton recognising the language read backwards-deterministically; its existence and uniqueness make the sofic class as tractable as the SFT class for entropy and zeta-function computations, even though the even shift shows the SFT-defining property of bounded memory genuinely fails.
Theorem (Sturmian subshifts and the Morse-Hedlund complexity bound). For a bi-infinite sequence over a finite alphabet, let be its word complexity. If for some , then is eventually periodic. The aperiodic sequences of minimal complexity satisfy for all ; these are the Sturmian sequences, realised as codings of irrational circle rotations by a two-interval partition, and their orbit closures are minimal subshifts. (Morse-Hedlund 1938/1940 [Morse-Hedlund 1938].)
The complexity function is non-decreasing, and the dichotomy " once" versus " always" is sharp: periodicity is exactly bounded complexity, while is the slimmest possible aperiodic growth. Sturmian sequences are simultaneously the mechanical words , the cutting sequences of lines of irrational slope across the integer grid, and the symbolic codings of rotations — three faces of one minimal system, the symbolic counterpart of the irrational rotation that served as the model minimal system in 38.01.01.
Synthesis. The four theorems are one architecture seen at four resolutions. The central insight is that a symbolic dynamical system is its language: the forbidden-word characterisation makes a subshift a closed invariant set, the Curtis-Hedlund-Lyndon theorem makes its morphisms local rules, and putting these together the entire category of symbolic systems is the combinatorics of finite words promoted to topological dynamics. This is exactly the bridge from 38.01.01: the doubling map, the horseshoe, and the hyperbolic toral automorphism all reduce to shifts by symbolic coding, so the Cantor-set full shift is the universal model of chaotic recurrence, and its subshifts catalogue the possible chaotic behaviours.
The finite-type and sofic theory is dual to the automata-theoretic hierarchy — SFTs are the bounded-memory languages, sofic shifts the regular languages — and the foundational reason the Perron-Frobenius eigenvalue governs both entropy here and mixing in 37.05.02 is that the transition matrix is the support shadow of the stochastic matrix, so topological and measured Markov dynamics share one combinatorial skeleton. The Sturmian shifts generalise the minimal irrational rotation into the symbolic world and sit at the opposite extreme from the full shift: minimal complexity against maximal, a single low-entropy minimal system against the universal positive-entropy model, both read off from the same word-complexity invariant. The bridge to the conjugacy theory of the next units is the recognition that a topological conjugacy of subshifts is an invertible sliding-block code, so the classification of symbolic systems is the classification of languages up to recoding — the central insight that organises everything downstream.
Full proof set Master
Proposition 1 (the shift metric induces the product topology). On with finite discrete, the metric induces the product topology, and the cylinder sets are a clopen basis.
Proof. A basic open set of the product topology fixes finitely many coordinates, hence contains a cylinder specifying for ; this cylinder equals the metric ball , which is therefore open in the metric. Conversely each metric ball is such a cylinder, so the two topologies have the same basis and coincide. Each cylinder is open (just shown) and closed (its complement is the finite union of the other cylinders specifying the same coordinate block, each open), hence clopen.
Proposition 2 (the shift is a homeomorphism; expansiveness). On the two-sided full shift, is a homeomorphism, and is expansive with constant .
Proof. The inverse is , manifestly two-sided inverse to . Continuity of and : if agree on then agree on (a single index of slack at each end suffices), so is -Lipschitz up to the factor and in particular continuous; the same for . Expansiveness: if choose with ; then gives .
Proposition 3 (a sliding-block code is exactly a continuous shift-commuting map — the local half). Every sliding-block code is continuous and satisfies .
Proof. Shift-commutation: with window and block map , . Continuity: each output coordinate depends only on , so if agree on then agree on , giving .
Proposition 4 (continuous shift-commuting maps are local — the global half of Curtis-Hedlund-Lyndon). If is continuous and , then is a sliding-block code.
Proof. The zero-coordinate evaluation is continuous into the discrete finite set , and is compact, so is uniformly continuous: there is with (agreement on ) forcing , since distinct symbols are at distance in the discrete target. Hence is a function of the window alone. By shift-commutation, , so is the sliding-block code of with window .
Proposition 5 (forbidden-word characterisation; closed invariant equals avoidance set). A subset is closed and shift-invariant iff for $\mathcal{F} = A^ \setminus \mathcal{L}(X)$.*
Proof. If then is closed (complement of the open set ) and invariant (avoidance is shift-stable). Conversely, for closed invariant put . Then since words occurring in lie in . For : given , each central block occurs in some point of ; shift-invariance recentres it to give agreeing with on , so and closedness yields .
Proposition 6 (period-point count of an SFT). For a subshift of finite type with transition matrix (after recoding to a -step presentation), the number of points with equals , and the topological entropy is with the Perron eigenvalue of .
Proof. A point of period is a periodic bi-infinite sequence with (indices mod ); these are exactly the closed walks of length in the graph of , and the number of closed walks of length at vertex is , so the total is . For entropy, the number of allowed length- words is , which grows like by Perron-Frobenius (for irreducible ), so .
Connections Master
Dynamical systems, orbits, and limit sets
38.01.01. Symbolic dynamics is the construction promised in the bridge of the foundational unit: the doubling map is conjugate (off the dyadic rationals) to the one-sided full shift, the Smale horseshoe is conjugate to the two-sided full shift on two symbols, and the cat map admits a Markov-partition coding into a subshift of finite type. The orbit, limit-set, and topological-conjugacy apparatus defined there specialises to the shift, and the present unit supplies the combinatorial models — full shift, SFT, sofic, Sturmian — that classify the recurrent behaviour catalogued abstractly there.Class structure, irreducibility, and periodicity of Markov chains
37.05.02. The transition matrix of a subshift of finite type is the topological shadow of the stochastic transition matrix studied there: communication classes, irreducibility, and the gcd-of-cycle-lengths period transfer verbatim from the stochastic matrix to its support graph, with topological transitivity replacing irreducibility and the mixing SFT corresponding to a primitive matrix. Where the probabilistic theory weights paths by probability and extracts a stationary distribution, the symbolic theory counts paths and extracts the Perron-eigenvalue entropy; both are read off from the same directed graph.Metric space
02.01.05. The entire setting is metric: the shift metric , the completeness and compactness that make subshifts well-behaved, expansiveness measured by a separation constant, and the uniform-continuity argument that powers the Curtis-Hedlund-Lyndon theorem are all metric-space facts imported from there, the compactness of being the single most-used hypothesis of the unit.Topological space
02.01.01. The product (cylinder) topology, the clopen cylinder basis, total disconnectedness, and the Cantor-set characterisation of the full shift are the topological substrate; the identification of as the unique compact perfect totally-disconnected metrizable space rests on the point-set topology developed there, and shift-invariant closed sets are the topological objects this unit studies dynamically.
Historical & philosophical context Master
Symbolic dynamics begins with Jacques Hadamard's 1898 coding of geodesics on negatively curved surfaces by their sequence of crossings, but the subject is named and founded in the 1938 paper Symbolic dynamics of Marston Morse and Gustav Hedlund (American Journal of Mathematics 60) [Morse-Hedlund 1938], which introduced the sequence space, the shift, and the systematic use of forbidden and admissible words; their 1940 sequel established the Sturmian sequences as the minimal-complexity aperiodic codings of rotations. The motivation was geodesic flows on surfaces of negative curvature, where a symbolic coding converts the flow into a shift and makes its recurrence properties combinatorial.
The structural theory of continuous shift-commuting maps is due to Gustav Hedlund, whose 1969 paper Endomorphisms and automorphisms of the shift dynamical system (Mathematical Systems Theory 3) [Hedlund 1969] proved that these maps are exactly the sliding-block codes — the theorem now bearing the names of Morton Curtis, Hedlund, and Roger Lyndon — and thereby identified the endomorphisms of the full shift with cellular automata, founding their algebraic theory. The subject was reorganised around coding theory and the conjugacy problem by Roy Adler, Brian Marcus, and others, and the canonical modern reference is the 1995 textbook of Douglas Lind and Brian Marcus, An Introduction to Symbolic Dynamics and Coding (Cambridge) [Lind-Marcus 1995], which organises the field around shift spaces, sliding-block codes, subshifts of finite type, sofic shifts, and the still-open conjugacy and isomorphism problems; Bruce Kitchens' 1998 Symbolic Dynamics [Kitchens 1998] extends the treatment to one-sided and countable-state shifts. Rufus Bowen's Markov partitions for Axiom A diffeomorphisms (1970s) made symbolic dynamics the standard tool for studying hyperbolic systems, completing the program Morse and Hedlund began.
Bibliography Master
@book{LindMarcus1995,
author = {Lind, Douglas and Marcus, Brian},
title = {An Introduction to Symbolic Dynamics and Coding},
publisher = {Cambridge University Press},
year = {1995}
}
@book{BrinStuck2002sym,
author = {Brin, Michael and Stuck, Garrett},
title = {Introduction to Dynamical Systems},
publisher = {Cambridge University Press},
year = {2002}
}
@article{MorseHedlund1938,
author = {Morse, Marston and Hedlund, Gustav A.},
title = {Symbolic dynamics},
journal = {American Journal of Mathematics},
volume = {60},
year = {1938},
pages = {815--866}
}
@article{Hedlund1969,
author = {Hedlund, Gustav A.},
title = {Endomorphisms and automorphisms of the shift dynamical system},
journal = {Mathematical Systems Theory},
volume = {3},
year = {1969},
pages = {320--375}
}
@book{Kitchens1998,
author = {Kitchens, Bruce P.},
title = {Symbolic Dynamics: One-sided, Two-sided and Countable State Markov Shifts},
publisher = {Springer-Verlag},
series = {Universitext},
year = {1998}
}
@article{MorseHedlund1940,
author = {Morse, Marston and Hedlund, Gustav A.},
title = {Symbolic dynamics II. {S}turmian trajectories},
journal = {American Journal of Mathematics},
volume = {62},
year = {1940},
pages = {1--42}
}