37.05.02 · probability / 05-markov-chains

Class Structure, Irreducibility, and Periodicity

shipped3 tiersLean: none

Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §1.2-1.3, §1.8; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.3 (irreducibility, period); Seneta 2006 *Non-negative Matrices and Markov Chains* 2e Ch. 1 (Perron-Frobenius and cyclic structure)

Intuition Beginner

A Markov chain moves among states one step at a time, and the very first thing to ask about it is a question of geography: starting from one state, which other states can the chain ever reach? Some states can be visited from the start; others are walled off and can never be entered once you leave; still others, once entered, can never be escaped. Sorting the states by who-can-reach-whom is the single most useful thing you can do before computing any probabilities.

Say a state is reachable from a state if there is some sequence of allowed one-step moves that carries the chain from to . If can reach and can reach , the two states communicate: each is accessible from the other. Communicating is an all-or-nothing partnership, and it splits the states into groups, called classes, where everyone inside a group communicates with everyone else in the group. A chain where the whole state space is one single class — everyone can get to everyone — is called irreducible, and irreducible chains are the clean, indecomposable building blocks of the theory.

There is a second, more delicate rhythm hidden inside a class: timing. Suppose that whenever the chain leaves a state it can only return after a number of steps that is always a multiple of three. Then the state has a built-in beat of three, and the chain visits it only on steps three, six, nine, and so on. That beat is the period of the state. When the only common rhythm is a beat of one — the chain can return after some run of consecutive step-counts with no forced gap — the state is called aperiodic, and the chain mixes freely instead of marching in lockstep.

These two features, who-reaches-whom and the hidden beat, are exactly what decide whether a chain settles into a single steady pattern or splits and cycles. The remarkable fact is that the beat is shared by everyone in a class: all states that communicate march to the same period.

The one-sentence takeaway: before computing anything, sort the states into communicating classes and find each class's period, because irreducibility (one class) and aperiodicity (beat one) are the two conditions that let a chain settle to a single long-run pattern.

Visual Beginner

Picture five states drawn as labeled circles with one-way arrows showing which one-step moves are allowed.

Left: the three dashed boxes are the communicating classes. The two-circle clusters and are classes where both members reach each other; the lone state is its own class because nothing returns to it. The box is closed: once you enter it you cannot leave. Right: two states with different beats — one revisited only on multiples of three, one revisited on a run of consecutive step counts — show what the period measures.

Worked example Beginner

Take a chain on four states with these allowed one-step moves: from you go to ; from you go to or to ; from you go to ; from you go to . We classify the states.

Step 1. Check whether and communicate. From you can reach in one step. From you can reach in one step. Each reaches the other, so and communicate and sit in the same class.

Step 2. Check whether and communicate. From you reach in one step, and from you reach in one step. So and communicate and form a class.

Step 3. Can the two pairs reach each other? From you can step to , so can reach . But from and the only moves stay inside — there is no arrow back to or . So cannot reach . The two pairs do not communicate, and there are two classes: and .

Step 4. Decide which classes are closed. From every move stays inside , so it is closed — the chain gets trapped there. From there is an escape (the move ), so is not closed.

Step 5. Find the beat of the closed class . Starting at , you return to after , which takes two steps, and any return uses an even number of steps because each visit to alternates with a visit to . So the period of (and of ) is two.

What this tells us: the chain is not irreducible — it has two classes — and it eventually falls into the closed class , where it oscillates with period two forever. We learned the whole long-run shape of the chain purely from the arrow pattern, before assigning a single probability.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is a time-homogeneous Markov chain on a countable state space with stochastic transition matrix and -step matrix , in the sense of 37.05.01. Write .

Definition (accessibility). State is accessible from state , written , if for some . Equivalently, iff there is a path with for each . Because , one always has .

Definition (communication). States and communicate, written , if and .

Definition (communicating class). The relation is an equivalence relation on (see the Key theorem). Its equivalence classes are the communicating classes of the chain. A class is closed if and together imply — once entered, the chain cannot leave . A singleton closed class , equivalently a state with , is absorbing. A class that is not closed is open (the chain can escape it).

Definition (irreducibility). The chain (or its matrix ) is irreducible if is a single communicating class, i.e. for all .

Definition (period). For a state with , the period of is A state is aperiodic if and periodic if . If (the chain started at never returns to with positive probability) the period is left undefined; this can only happen in an open class. A chain is called aperiodic when every state has period ; for an irreducible chain this is a property of the single class.

Definition (cyclic / periodic subclasses). Let be a communicating class of common period . Fix a reference state and, for each , set The Key theorem shows is well-defined. The periodic subclasses (or cyclic classes) are with ; they partition , and maps into .

Counterexamples to common slips Intermediate+

  • Accessibility is not symmetric; communication is. In the worked example but not , so holds while fails. The relation is a preorder (reflexive and transitive); only the symmetrized relation partitions .

  • A closed class need not be all of a finite chain, but a finite chain must contain at least one closed class. On a finite state space the chain cannot escape forever, so the accessibility preorder has a maximal class with no outward edge. An infinite chain can have every class open — the simple random walk on is irreducible (one open-by-default class, here the whole space) with no proper closed subclass.

  • Period is about the support of returns, not their probability. counts which return times are possible, ignoring how likely they are. A self-loop forces and hence ; a single self-loop anywhere in an irreducible chain makes the whole chain aperiodic.

  • Periodicity is not the same as never returning quickly. A state can have period yet typically take many steps to return; aperiodicity asserts only that the gcd of the possible return lengths is , e.g. because both a length- and a length- return exist, since .

Key theorem with proof Intermediate+

Theorem (class structure and period invariance). Let be a Markov chain on with matrix .

(a) The relation is an equivalence relation on , so partitions into communicating classes.

(b) The period is constant on each communicating class: if and both periods are defined, then .

Proof. The engine is the multiplicativity inequality for the -step probabilities. For all states and all , the Chapman-Kolmogorov identity of 37.05.01 gives , since all terms are nonnegative and the right-hand side is the single term . Thus

Part (a). Reflexivity: gives , hence . Symmetry: means and , which is exactly . Transitivity: suppose and . Then and give, by , ; symmetrically and give . Hence . So is an equivalence relation and its classes partition .

Part (b). Let with (the case is immediate). Since and , choose with and . By , , so and therefore .

Now take any , i.e. . Concatenating the three legs (length ), (length ), (length ) and applying twice, so and hence . Subtracting the relation gives . As was arbitrary, is a common divisor of , so . Interchanging the roles of and yields . Two positive integers dividing each other are equal, so .

Bridge. This theorem builds toward the entire equilibrium theory of 37.05.03 and appears again in every convergence statement for Markov chains, because the partition into classes and the class-period are precisely the invariants that decide whether a chain has a unique stationary distribution and whether its powers converge. The foundational reason the proof is so short is the path-concatenation inequality : accessibility is transitive because positive-probability paths compose, and that one fact gives both the equivalence relation and the divisibility bookkeeping behind period invariance. This is exactly the combinatorial shadow of the matrix identity from 37.05.01; the support pattern of the nonnegative matrix is all that the class structure sees. Restricting to a single closed communicating class produces an irreducible stochastic matrix, and the central insight is that an irreducible aperiodic matrix is exactly the class to which the Perron-Frobenius convergence theorem applies — putting these together, class decomposition reduces every finite chain to its irreducible closed blocks, and the bridge is that period one in each such block is the precise gateway to converging to a rank-one limit.

Exercises Intermediate+

Advanced results Master

The class structure refines into a triangular block form for the matrix, the cyclic decomposition rewrites an irreducible periodic chain as a permutation of aperiodic blocks, and the entire combinatorial picture is the nonnegative-matrix shadow of Perron-Frobenius theory.

Theorem 1 (canonical block decomposition). Order the states of a finite chain so that the open classes precede the closed classes and the accessibility preorder is respected. Then takes the block-upper-triangular form where each is the irreducible stochastic matrix of a closed class, collects the transient open classes, and records escapes from open into closed classes. The closed-class blocks are the indecomposable units: every long-run quantity factors through them, while entrywise because the chain leaves the transient part with probability one (on a finite space). Restricting attention to one closed class is thus exactly the passage to an irreducible chain.

Theorem 2 (cyclic decomposition of an irreducible class). Let be irreducible of period on , with periodic subclasses defined by the residue map. With states ordered by subclass, has the cyclic block form where is the (stochastic) block carrying to . Consequently is block-diagonal, and each diagonal block is an irreducible aperiodic stochastic matrix on its subclass. The period- chain is therefore aperiodic chains glued by a cyclic shift, and converges while itself rotates through the subclasses without settling.

Theorem 3 (Perron-Frobenius for the irreducible aperiodic case). Let be a finite irreducible stochastic matrix. Its spectral radius is , attained by the simple eigenvalue with strictly positive left eigenvector (the unique stationary distribution) and right eigenvector . The peripheral spectrum — eigenvalues of modulus — is exactly , the -th roots of unity, where is the period; these are simple, and apart from them every eigenvalue has modulus . When (aperiodic, equivalently primitive by Exercise 8), is the only peripheral eigenvalue and geometrically with rate the second-largest modulus . When , the unimodular eigenvalues encode the rotation of mass through the cyclic subclasses, and convergence holds only along the subsequence .

Theorem 4 (canonical form is invariant). The partition into communicating classes, the closed/open dichotomy, and each class period are intrinsic to — invariant under relabeling of states — because they are defined through the support pattern and its transitive closure alone. Two stochastic matrices with the same directed support graph have identical class structure and periods, regardless of the numerical values of the positive entries; the probabilities enter only at the level of stationary distributions and mixing rates, not at the level of the combinatorial skeleton.

Synthesis. The foundational reason the whole subject organizes itself is that the support graph of the nonnegative matrix carries all the class information, and putting these together, accessibility is the reachability preorder of that graph while communication is its symmetrization into the strongly-connected components — the communicating classes. This is exactly the combinatorial content of the semigroup law inherited from 37.05.01: path concatenation makes accessibility transitive, which gives both the equivalence relation and, through addition-closure of return times, the class-invariance of the period. The central insight is that an irreducible chain is dual to a single strongly connected component, and its period — the gcd of return lengths — is dual to the greatest common cyclic length of that component, so the cyclic decomposition of Theorem 2 is the graph-theoretic statement that a strongly connected digraph of index partitions into cyclically-ordered levels. The bridge to analysis is Perron-Frobenius (Theorem 3): irreducibility makes the eigenvalue simple, aperiodicity makes it the only unimodular eigenvalue, and these two combinatorial conditions are exactly what the convergence theorem of 37.05.03 requires. The block decomposition reduces every finite chain to its irreducible closed blocks, the cyclic decomposition reduces every periodic block to aperiodic sub-blocks on the slowed clock , and the foundational reason this two-step reduction terminates is that irreducible aperiodic — primitive — is the indecomposable atom of the theory, the precise object on which long-run behavior is a single stationary law.

Full proof set Master

Proposition 1 (accessibility is a preorder; communication its symmetrization). The relation on is reflexive and transitive, and iff and defines the associated equivalence relation.

Proof. Reflexivity: . Transitivity: if and , take with , ; then , so . The symmetrization of any preorder via " and " is reflexive (from reflexivity of ), symmetric (by construction), and transitive (compose in both directions), hence an equivalence relation; applying this to gives .

Proposition 2 (period is well-defined and class-invariant). If then exists, and implies .

Proof. Existence is the gcd of a nonempty set of positive integers. For invariance, choose with . Then , so . For any , concatenation gives , so ; subtracting, . Hence divides every element of , so , and symmetrically , giving equality.

Proposition 3 (return set is a numerical semigroup; eventual positivity). is closed under addition, and if then for some .

Proof. Addition-closure: for . For the eventual-positivity claim, rescale by so . By Bezout choose finitely many elements with integer combination equal to ; separating positive and negative coefficients exhibits (the additive span, contained in by closure) with . For , division () gives . Rescaling restores the multiples of .

Proposition 4 (well-definedness of periodic subclasses). For an irreducible class of period and fixed , the residue over admissible (those with ) is independent of , and .

Proof. If , pick with (since ). Then are both divisible by , so , giving . If and , then a length- path into with extends to a length- path into , so , i.e. .

Proposition 5 (cyclic blocks and aperiodicity of on a subclass). With the cyclic block form of Theorem 2, is block-diagonal and each block is irreducible and aperiodic on .

Proof. Since , iterating times returns to itself, so leaves each invariant and is block-diagonal. Irreducibility on : for , irreducibility of gives a path of some length ; since share residue , , so and , i.e. reaches under . Aperiodicity: the -return times of are , whose gcd is .

Proposition 6 (finite irreducible aperiodic implies primitive). A finite irreducible aperiodic stochastic satisfies for all once , for some finite .

Proof. By Proposition 3 each diagonal entry is eventually positive: for ; set . By irreducibility pick with and let . For , write with , so .

Connections Master

  • The Markov property, transition matrices, and Chapman-Kolmogorov 37.05.01 are the substrate: the inequality that powers every proof here is the single-term lower bound of the Chapman-Kolmogorov sum, and the -step matrix whose support defines accessibility is the matrix power established there. Class structure is exactly what the support pattern of those powers records, with the numerical probabilities suppressed.

  • The stationary distribution and convergence theory of irreducible aperiodic chains 37.05.03 consumes this unit's output directly: the existence and uniqueness of a stationary with requires a single closed communicating class, and the convergence requires that class to be aperiodic; the period- case is handled there by passing to on each cyclic subclass, exactly the decomposition of Theorem 2.

  • The operator-semigroup and discrete-generator viewpoint introduced via 37.05.01 reappears spectrally: the period of an irreducible chain is the number of unimodular eigenvalues of (the -th roots of unity), so periodicity is the eigenvalue-on-the-unit-circle obstruction to convergence, and aperiodicity is precisely the spectral-gap condition that makes the discrete generator have as a simple, isolated eigenvalue.

  • The elementary probability and named-distribution layer 26.02.01 supplies the concrete instances on which class structure is read off — the gambler's-ruin walk with two absorbing classes, the cyclic random walk on of period , and the two-state chain whose single self-loop makes it irreducible and aperiodic — each a worked classification of the kind this unit formalizes.

Historical & philosophical context Master

The decomposition of a chain into communicating classes and the notion of period were already implicit in Andrei A. Markov's founding 1906 paper [Markov 1906], where the finite chains he studied to extend the law of large numbers were tacitly irreducible, so that the limit theorems he sought would hold uniformly; the systematic separation of the combinatorial class skeleton from the analytic limit behavior came only with the later matrix formulation.

The decisive analytic input is the theory of nonnegative matrices begun by Oskar Perron and completed for the reducible and periodic cases by Georg Frobenius. Frobenius's 1912 memoir on matrices with nonnegative entries [Frobenius 1912] introduced irreducibility (in the matrix sense of no nontrivial invariant coordinate subspace), proved that an irreducible nonnegative matrix has a simple positive Perron eigenvalue with a positive eigenvector, and identified the index of imprimitivity — the number of eigenvalues of maximal modulus — as the integer that organizes the cyclic block structure. Translated into the language of Markov chains, Frobenius's index of imprimitivity is exactly the period , his irreducibility is the single-communicating-class condition, and his cyclic normal form is the periodic-subclass decomposition. The modern textbook treatment on countable state spaces, with accessibility and communication defined through the -step probabilities and the period as the gcd of return times, follows Norris [Norris 1997] and the matrix-theoretic account of Seneta [Seneta 2006], who makes the Perron-Frobenius-to-Markov-chain dictionary explicit.

Bibliography Master

@book{Norris1997,
  author    = {Norris, James R.},
  title     = {Markov Chains},
  series    = {Cambridge Series in Statistical and Probabilistic Mathematics},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@article{Markov1906,
  author  = {Markov, Andrei A.},
  title   = {Rasprostranenie zakona bol'shih chisel na velichiny, zavisyashchie drug ot druga},
  journal = {Izvestiya Fiziko-matematicheskogo obshchestva pri Kazanskom universitete (2)},
  volume  = {15},
  year    = {1906},
  pages   = {135--156}
}

@article{Frobenius1912,
  author  = {Frobenius, Georg},
  title   = {\"Uber {M}atrizen aus nicht negativen {E}lementen},
  journal = {Sitzungsberichte der K\"oniglich Preussischen Akademie der Wissenschaften zu Berlin},
  year    = {1912},
  pages   = {456--477}
}

@book{Durrett2019mc,
  author    = {Durrett, Rick},
  title     = {Probability: Theory and Examples},
  edition   = {5},
  publisher = {Cambridge University Press},
  year      = {2019}
}

@book{LevinPeres2017,
  author    = {Levin, David A. and Peres, Yuval},
  title     = {Markov Chains and Mixing Times},
  edition   = {2},
  publisher = {American Mathematical Society},
  year      = {2017}
}

@book{Seneta2006,
  author    = {Seneta, Eugene},
  title     = {Non-negative Matrices and Markov Chains},
  edition   = {2},
  series    = {Springer Series in Statistics},
  publisher = {Springer},
  year      = {2006}
}