Invariant Measures and Distributions; Positive and Null Recurrence
Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §1.7-1.8; Durrett 2019 *Probability: Theory and Examples* 5e §5.5-5.6; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5-1.7; Meyn-Tweedie 2009 *Markov Chains and Stochastic Stability* 2e Ch. 10 (invariant measures, positive recurrence)
Intuition Beginner
Imagine pouring sand onto the states of a Markov chain so that some states get a deep pile and others a shallow one. Now run the chain for one step: each grain at a state gets redistributed to its neighbors according to the transition probabilities. For most ways of piling the sand, the heights change. But there is a special way to pile it so that after one step every pile has exactly the height it started with. The chain stirs the sand around, yet the overall profile stays put. That special profile is called an invariant measure. If the total amount of sand is one unit, so the heights are genuine probabilities, we call it an invariant distribution.
Why care? Because the invariant profile is the chain's long-run habit. If you let the chain wander for a very long time and ask what fraction of its visits landed on each state, the answer is the invariant distribution. It is the equilibrium the chain settles into and then keeps.
There is a beautiful way to build this profile by hand. Fix one home state. Start the chain at home, walk until you come back home, and during that single loop count how many times you stepped on each state. The expected count for each state is the invariant profile (up to an overall scale). The chain restarting fresh at each homecoming is what makes these counts fit together into something the chain preserves.
This construction also reveals a split among chains that always come home. Some come home quickly on average: the expected time for one loop is a finite number. For these the visit-counts add up to a finite total, so you can rescale them into honest probabilities — there is an invariant distribution, and we call the chain positive recurrent. Others always come home but take forever on average: the mean loop time is infinite. Then the counts add up to infinity and no finite rescaling makes them into probabilities. The chain still has an invariant profile of relative heights, but no equilibrium distribution. We call this null recurrent.
The takeaway: an invariant measure is the sand profile a chain leaves unchanged, you can build it from the expected visit-counts in one loop from home, and whether the loop has finite or infinite average length splits "always comes home" into positive recurrent (a real equilibrium exists) and null recurrent (only relative heights exist).
Visual Beginner
Picture one excursion from a home state, with the chain's visits to each other state tallied.
Top: one loop from home state back to . Counting how many times the path lands on each state during the loop, then averaging over many loops, gives the expected visit-count profile . This profile is invariant: feeding it through one step of the chain returns the same profile. Bottom: when the average loop is short (finite mean return time) the counts have a finite total and rescale into an equilibrium distribution — positive recurrence. When the average loop is infinitely long the counts still form a valid relative profile but their total is infinite, so no equilibrium distribution exists — null recurrence.
Worked example Beginner
Take a chain on three states arranged in a cycle that can also pause. From state the walk goes to . From state it goes to . From state it returns to . Every move is certain. We want the long-run share of time spent in each state, that is, the invariant distribution.
Step 1. Write what "unchanged after one step" means. Let the shares be . After one step, all of state 's share flows to state , all of state 's flows to state , and all of state 's flows to state . For the profile to stay the same we need the share arriving at each state to equal the share already there: (everything at lands on ), , and .
Step 2. Solve the balance. These say . The three shares are equal.
Step 3. Make them add to one. Three equal numbers summing to are each . So the invariant distribution is .
Step 4. Check against the return-time rule. Starting at state , the walk goes , returning after exactly steps. So the mean return time to state is . The promised identity says the equilibrium share equals one divided by the mean return time: . This matches , and by symmetry the same holds for states and .
What this tells us: the equilibrium share of a state is the reciprocal of how long, on average, the chain takes to come back to it. Here each return takes steps, so each state holds a share. A finite return time () is exactly what lets the shares add to one — this chain is positive recurrent.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is a time-homogeneous Markov chain on a countable state space with stochastic transition matrix , as in 37.05.01; classes, communication, and irreducibility are as in 37.05.02; hitting and return times are as in 37.05.03 and 37.05.04. We write , for probability and expectation conditioned on , and for the first return time to , with return probability . A state is recurrent if and transient if , per 37.05.04.
Definition (invariant measure and invariant distribution). A measure on is a row vector with . It is invariant (or stationary) for if An invariant measure is nonzero if for at least one . An invariant measure with total mass is an invariant distribution (or stationary distribution), customarily written . If with invariant, then for all , since the law of is .
Definition (excursion / return-time measure). Fix a reference state . The expected occupation measure of an excursion from is the row vector given by the expected number of visits to during one excursion from back to (counting the time- visit to , not the terminal return). By construction , and , the mean return time to .
Definition (positive and null recurrence). A recurrent state is positive recurrent if its mean return time is finite, and null recurrent if . By the solidarity theorem proved below, this property is constant on a communicating class, so one speaks of a positive recurrent class or a null recurrent class, and of a positive/null recurrent chain when it is irreducible.
The chain on cycling has for each , so it is positive recurrent with . The simple symmetric random walk on is recurrent (per 37.05.04) but has , so it is null recurrent; its invariant measure is the counting measure , which has infinite total mass and admits no normalisation.
Counterexamples to common slips Intermediate+
An invariant measure need not be normalisable. A nonzero invariant measure always exists for an irreducible recurrent chain, but may be infinite. Only when this sum is finite (positive recurrence) does an invariant distribution exist. The counting measure on the symmetric walk on is invariant and not normalisable.
Uniqueness is only up to a scalar, and only under irreducibility. For an irreducible recurrent chain the invariant measure is unique up to multiplication by a positive constant. A reducible chain can have several linearly independent invariant measures, one per recurrent class.
Recurrence does not by itself give an invariant distribution. Null recurrence is exactly the case where the chain is recurrent yet has no invariant distribution. "Comes home with probability one" and "comes home in finite expected time" are different statements; the second is positive recurrence.
The excursion sum stops at , not . Including the terminal return would double-count the reference state and break the invariance identity. The visit at time is the time- visit of the next excursion.
Key theorem with proof Intermediate+
Theorem (existence, invariance, and the Kac formula). Let be irreducible and recurrent, and fix a reference state .
(a) (Existence and invariance.) The excursion measure defined above is a nonzero invariant measure: for every , , and .
(b) (Uniqueness up to a scalar.) If is any nonzero invariant measure, then for all ; hence the invariant measure is unique up to a positive multiplicative constant.
(c) (Kac formula and the positive/null split.) The total mass is . The chain is positive recurrent (some, hence every, ) iff it admits an invariant distribution ; that distribution is unique and is given by
Proof. Part (a). Finiteness and positivity: since the chain is irreducible there are with and . Counting visits to in an excursion shows dominates a single-step contribution, giving ; finiteness follows once invariance is established together with , as we record below. The normalisation holds because in the window the chain is at exactly once, at .
Invariance is the heart. Write , using to re-index the time- term of onto the terminal visit at (legitimate because by recurrence). For any , where in the last step we used that is determined by and applied the Markov property at time : on the event has conditional probability . Summing over collapses the inner sum to , so Thus . Finiteness now follows: from and we get , so for the with .
Part (b). Let be a nonzero invariant measure. We show and then equality. Unfolding and isolating the reference state , Iterating this identity and tracking the paths that avoid until the last step reproduces, term by term, the excursion sum: one obtains and letting gives . Set . Then , , and . For any , irreducibility gives with , and forces . Hence .
Part (c). Summing invariance over all states is not what gives the mass; rather, by Tonelli, If then is an invariant distribution. Conversely, if an invariant distribution exists, then by part (b) , so , forcing and . The same applied at any state (using ) yields , the Kac formula, and shows is unique. Finiteness of one forces finiteness of all because for every .
Bridge. This theorem builds toward the convergence-to-equilibrium and ergodic theory of the chapter and appears again in every stationary-distribution computation, because it pins the equilibrium of an irreducible chain to a single excursion. The foundational reason an invariant measure exists at all is the strong Markov restart of 37.05.04: each return to launches an independent excursion, and the expected occupation of one excursion is exactly the quantity the one-step operator preserves. The Kac identity is exactly the renewal-theoretic statement that the long-run fraction of time at is the reciprocal mean gap between visits, and it is dual to the hitting-time calculus of 37.05.03, where the mean return time is computed by first-step analysis; putting these together, the excursion measure and the mean return time are the same object read two ways, and the central insight is that positive recurrence is precisely finiteness of that excursion length. The bridge is that the dichotomy recurrent-versus-transient of 37.05.04 refines into transient / null recurrent / positive recurrent according to whether diverges and whether is finite, and this generalises the finite-state fact that every finite irreducible chain is automatically positive recurrent.
Exercises Intermediate+
Advanced results Master
The excursion measure organises everything: an irreducible recurrent chain has, up to scale, exactly one invariant measure, and the finiteness of its total mass is the sole datum separating positive from null recurrence. The series criterion of 37.05.04, the Kac formula, and the convergence theory are three readings of this one object.
Theorem 1 (existence, uniqueness, and structure of invariant measures). For an irreducible chain: (i) if recurrent, there is a nonzero invariant measure , unique up to a positive scalar, strictly positive on ; (ii) the chain is positive recurrent iff , in which case there is a unique invariant distribution ; (iii) if transient there may be no invariant measure, one, or many — uniqueness fails outside recurrence. For a general (reducible) chain, the extreme invariant distributions are in bijection with the positive recurrent classes: each such class carries one supported on , and every invariant distribution is a convex combination with , . Transient and null recurrent classes carry no invariant mass.
Theorem 2 (positive recurrence and convergence to equilibrium). If is irreducible, aperiodic, and positive recurrent with stationary distribution , then for all , and more generally for every initial distribution . If instead the chain is null recurrent (or transient), then for all : there is no limiting distribution. Thus positive recurrence is exactly the regime in which the chain forgets its start and converges to a genuine equilibrium; the proof couples two independent copies, one started at and one at , and uses that the meeting time is finite a.s. precisely because the product chain is positive recurrent. Periodicity does not affect existence of but blocks pointwise convergence, replaced by Cesàro convergence .
Theorem 3 (ergodic theorem for Markov chains). Let be irreducible positive recurrent with stationary , and with . Then for any initial distribution, almost surely The proof partitions the trajectory into i.i.d. excursions between successive visits to a fixed state , applies the strong law of large numbers to the excursion-sums of and to the excursion lengths, and forms the ratio; the limit is . Taking recovers as the long-run fraction of time at . Null recurrence breaks the theorem: the denominator sends the time-average of any -integrable to .
Theorem 4 (reversibility and detailed balance). A measure satisfying the detailed-balance equations for all is automatically invariant; the converse fails. For an irreducible chain admitting a reversible measure, the transition operator is self-adjoint on , its spectrum is real, and the spectral gap controls the rate of convergence in Theorem 2. Birth-death chains are always reversible, which is why their stationary distributions solve a one-dimensional telescoping recursion rather than the full linear system; positive recurrence of a birth-death chain is then the convergence of the series , the reversible reading of the excursion mass.
Synthesis. The foundational reason the entire long-run theory collapses to one vector is the strong Markov restart of 37.05.04: each return to launches an independent excursion, so the expected occupation of a single excursion is the object the one-step operator fixes, and uniqueness up to a scalar is the statement that an irreducible recurrent chain has only this one degree of freedom. This is exactly the renewal structure that makes the Kac formula a tautology of renewal-reward, and it is dual to the hitting-time calculus of 37.05.03: the mean return time computed there by first-step analysis is the total mass of the excursion measure computed here, the same number read as a potential and as an occupation time. Putting these together, the recurrent-versus-transient dichotomy of 37.05.04 refines into a trichotomy — transient (), null recurrent ( but ), positive recurrent () — and the central insight is that only the third regime supports an equilibrium distribution and convergence to it. The ergodic theorem generalises the elementary to arbitrary observables, the convergence theorem upgrades it from time-averages to the marginal law under aperiodicity, and reversibility specialises it to a self-adjoint operator whose spectral gap quantifies the rate; across all four the bridge is the single excursion measure, whose finiteness is positive recurrence and whose reciprocal mass is the stationary distribution.
Full proof set Master
Proposition 1 (the excursion measure is invariant). For irreducible recurrent and fixed , the measure satisfies , , and for all .
Proof. Write , using under (valid as a.s. by recurrence) to shift the term of the diagonal entry onto . For any , applying the Markov property at time on the event , which is -measurable, So , hence for all . With (the chain is at exactly once in ) and irreducibility giving for some , yields ; and for some gives strict positivity.
Proposition 2 (uniqueness up to a scalar). If is irreducible recurrent and satisfies with , then .
Proof. From , substitute the same identity into each with repeatedly. After substitutions, tracking only the contributions of paths from that avoid at intermediate times, Let : . Put ; then and . For any pick with ; then , forcing . Hence .
Proposition 3 (Kac formula). For irreducible positive recurrent the unique invariant distribution is .
Proof. By Proposition 1, is invariant with and total mass (Tonelli, Exercise 4). Positive recurrence gives , so is an invariant distribution with . Uniqueness: any invariant distribution is, by Proposition 2 normalised, equal to .
Proposition 4 (positive recurrence is a class property; finite chains). Positive recurrence is constant on a communicating class, and every finite irreducible chain is positive recurrent.
Proof. On the (closed, irreducible) recurrent class , Proposition 2 gives a unique invariant measure up to scale; its total mass over is finite or infinite independently of the chosen normalisation, and finiteness is positive recurrence (Proposition 3 produces and for all ). Hence all states of share the property. For a finite irreducible chain, recurrence holds and has entries each finite by Proposition 1, so : positive recurrent.
Proposition 5 (ergodic time-average). For irreducible positive recurrent with stationary and -integrable , a.s.
Proof. Fix a reference ; by recurrence the visit times to are a.s. finite, and by the strong Markov property 37.05.04 the excursion blocks and lengths are i.i.d. for with and (finite by -integrability and ). Writing and squeezing between and where , the strong law gives and , so
Proposition 6 (null recurrence kills the marginal limit). If is irreducible and not positive recurrent (transient or null recurrent), then for all .
Proof. The transient case is the series criterion of 37.05.04: forces . For null recurrence, suppose toward contradiction that along a subsequence. A diagonal/tightness argument extracts a further subsequence along which for all , and with ; by Fatou . Proposition 2 forces for a scalar , so (null recurrence), contradicting . Hence and for all .
Connections Master
Hitting probabilities and expected hitting times
37.05.03supply the mean-return-time side of the Kac formula: the first-step decomposition computes the very quantity whose reciprocal is the stationary mass , so the minimal-nonnegative-solution theory there is the analytic dual of the excursion-occupation theory here, and the gambler's-ruin and birth-death mean times become explicit stationary distributions.The strong Markov property and recurrence/transience dichotomy
37.05.04is refined here from a dichotomy into a trichotomy: its geometric visit law and series criterion separate transient from recurrent, and the excursion decomposition it licenses is exactly what builds the invariant measure and splits recurrence into null () and positive (); every i.i.d.-excursion argument in this unit invokes its restart-at-a-stopping-time identity.Class structure, irreducibility, and periodicity
37.05.02determines where invariant mass can live: invariant distributions are supported on positive recurrent closed classes, one extreme point per class, and aperiodicity (versus the cyclic decomposition into period- subclasses) is what upgrades the existence of into pointwise convergence rather than mere Cesàro convergence.Discrete martingales, stopping times, and optional stopping
37.04.01underlies the ergodic and convergence theorems: the i.i.d. excursion sums are summed via the strong law, and the coupling proof of convergence to equilibrium runs the product chain until a finite meeting time, an optional-stopping-style argument on the stationary lift of the chain.
Historical & philosophical context Master
The notion of a stationary distribution traces to Andrei Markov's own 1906 study of chains with finitely many states, where the equilibrium vector and convergence to it were established for primitive transition matrices; the extension to countably infinite state spaces, where positive and null recurrence genuinely differ, belongs to the 1930s-1950s. Andrei Kolmogorov classified states into recurrent and transient and, in the recurrent case, into what later authors named positive (his "states with finite mean recurrence time") and null. The clean identity tying the stationary mass to the mean return time, , was given its definitive probabilistic form by Mark Kac in 1947 [Kac 1947], whose recurrence-time argument exhibits the stationary probability as the reciprocal of the expected first-return time and generalises to the set-version .
The excursion construction of the invariant measure as expected occupation between returns to a fixed state, and the uniqueness-up-to-a-scalar theorem for irreducible recurrent chains, are the form in which the theory was crystallised in the textbook tradition, presented cleanly by Norris [Norris 1997] following the renewal-theoretic viewpoint of William Feller and the potential-theoretic synthesis of Kai Lai Chung. The general-state-space lift — Harris recurrence, the existence of a unique invariant -finite measure, and the positive/null split via the finiteness of its total mass — is the program of Meyn and Tweedie [Meyn-Tweedie 2009], where the same excursion idea is run off a small set rather than a single point. The boundary example, simple symmetric random walk on , sits exactly at the recurrent/null-recurrent edge: recurrent by Pólya, null recurrent because its unique invariant measure is the non-normalisable counting measure, the discrete shadow of translation-invariance with no finite total mass.
Bibliography Master
@book{Norris1997,
author = {Norris, James R.},
title = {Markov Chains},
series = {Cambridge Series in Statistical and Probabilistic Mathematics},
publisher = {Cambridge University Press},
year = {1997}
}
@article{Kac1947,
author = {Kac, Mark},
title = {On the notion of recurrence in discrete stochastic processes},
journal = {Bulletin of the American Mathematical Society},
volume = {53},
number = {10},
year = {1947},
pages = {1002--1010}
}
@book{Durrett2019mc,
author = {Durrett, Rick},
title = {Probability: Theory and Examples},
edition = {5},
publisher = {Cambridge University Press},
year = {2019}
}
@book{LevinPeres2017,
author = {Levin, David A. and Peres, Yuval},
title = {Markov Chains and Mixing Times},
edition = {2},
publisher = {American Mathematical Society},
year = {2017}
}
@book{MeynTweedie2009,
author = {Meyn, Sean and Tweedie, Richard L.},
title = {Markov Chains and Stochastic Stability},
edition = {2},
publisher = {Cambridge University Press},
year = {2009}
}
@article{Markov1906,
author = {Markov, Andrei A.},
title = {Rasprostranenie zakona bol'shikh chisel na velichiny, zavisyashchie drug ot druga},
journal = {Izvestiya Fiziko-matematicheskogo obshchestva pri Kazanskom universitete},
volume = {15},
year = {1906},
pages = {135--156}
}