37.05.05 · probability / 05-markov-chains

Invariant Measures and Distributions; Positive and Null Recurrence

shipped3 tiersLean: none

Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §1.7-1.8; Durrett 2019 *Probability: Theory and Examples* 5e §5.5-5.6; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5-1.7; Meyn-Tweedie 2009 *Markov Chains and Stochastic Stability* 2e Ch. 10 (invariant measures, positive recurrence)

Intuition Beginner

Imagine pouring sand onto the states of a Markov chain so that some states get a deep pile and others a shallow one. Now run the chain for one step: each grain at a state gets redistributed to its neighbors according to the transition probabilities. For most ways of piling the sand, the heights change. But there is a special way to pile it so that after one step every pile has exactly the height it started with. The chain stirs the sand around, yet the overall profile stays put. That special profile is called an invariant measure. If the total amount of sand is one unit, so the heights are genuine probabilities, we call it an invariant distribution.

Why care? Because the invariant profile is the chain's long-run habit. If you let the chain wander for a very long time and ask what fraction of its visits landed on each state, the answer is the invariant distribution. It is the equilibrium the chain settles into and then keeps.

There is a beautiful way to build this profile by hand. Fix one home state. Start the chain at home, walk until you come back home, and during that single loop count how many times you stepped on each state. The expected count for each state is the invariant profile (up to an overall scale). The chain restarting fresh at each homecoming is what makes these counts fit together into something the chain preserves.

This construction also reveals a split among chains that always come home. Some come home quickly on average: the expected time for one loop is a finite number. For these the visit-counts add up to a finite total, so you can rescale them into honest probabilities — there is an invariant distribution, and we call the chain positive recurrent. Others always come home but take forever on average: the mean loop time is infinite. Then the counts add up to infinity and no finite rescaling makes them into probabilities. The chain still has an invariant profile of relative heights, but no equilibrium distribution. We call this null recurrent.

The takeaway: an invariant measure is the sand profile a chain leaves unchanged, you can build it from the expected visit-counts in one loop from home, and whether the loop has finite or infinite average length splits "always comes home" into positive recurrent (a real equilibrium exists) and null recurrent (only relative heights exist).

Visual Beginner

Picture one excursion from a home state, with the chain's visits to each other state tallied.

Top: one loop from home state $k$ back to $k$ . Counting how many times the path lands on each state during the loop, then averaging over many loops, gives the expected visit-count profile $γ$ . This profile is invariant: feeding it through one step of the chain returns the same profile. Bottom: when the average loop is short (finite mean return time) the counts have a finite total and rescale into an equilibrium distribution — positive recurrence. When the average loop is infinitely long the counts still form a valid relative profile but their total is infinite, so no equilibrium distribution exists — null recurrence.

Worked example Beginner

Take a chain on three states ${1, 2, 3}$ arranged in a cycle that can also pause. From state $1$ the walk goes to $2$ . From state $2$ it goes to $3$ . From state $3$ it returns to $1$ . Every move is certain. We want the long-run share of time spent in each state, that is, the invariant distribution.

Step 1. Write what "unchanged after one step" means. Let the shares be $π_{1}, π_{2}, π_{3}$ . After one step, all of state $1$ 's share flows to state $2$ , all of state $2$ 's flows to state $3$ , and all of state $3$ 's flows to state $1$ . For the profile to stay the same we need the share arriving at each state to equal the share already there: $π_{2} = π_{1}$ (everything at $1$ lands on $2$ ), $π_{3} = π_{2}$ , and $π_{1} = π_{3}$ .

Step 2. Solve the balance. These say $π_{1} = π_{2} = π_{3}$ . The three shares are equal.

Step 3. Make them add to one. Three equal numbers summing to $1$ are each $\frac{1}{3}$ . So the invariant distribution is $π_{1} = π_{2} = π_{3} = \frac{1}{3}$ .

Step 4. Check against the return-time rule. Starting at state $1$ , the walk goes $1 \to 2 \to 3 \to 1$ , returning after exactly $3$ steps. So the mean return time to state $1$ is $3$ . The promised identity says the equilibrium share equals one divided by the mean return time: $\frac{1}{3}$ . This matches $π_{1} = \frac{1}{3}$ , and by symmetry the same holds for states $2$ and $3$ .

What this tells us: the equilibrium share of a state is the reciprocal of how long, on average, the chain takes to come back to it. Here each return takes $3$ steps, so each state holds a $\frac{1}{3}$ share. A finite return time ( $3$ ) is exactly what lets the shares add to one — this chain is positive recurrent.

Check your understanding Beginner

Exercise (easy, multiple choice).

An invariant distribution of a Markov chain is a way of assigning probabilities to states so that:

A. Every state has the same probability B. After one step of the chain, the assignment of probabilities is unchanged C. The chain can never leave its starting state D. The probabilities all increase over time

Hint

"Invariant" means left unchanged by running the chain one step.

Answer

B. Unchanged after one step. An invariant distribution is preserved when you push it through one step of the chain: the share flowing into each state equals the share already there. Feedback-correct: invariance is exactly this fixed-profile condition. Feedback-wrong: A is only true for special symmetric chains, C describes an absorbing state, and D contradicts the total probability staying at one.

Formal definition Intermediate+

Throughout, $(X_{n})_{n \geq 0}$ is a time-homogeneous Markov chain on a countable state space $I$ with stochastic transition matrix $P = (p_{ij})$ , as in 37.05.01; classes, communication, and irreducibility are as in 37.05.02; hitting and return times are as in 37.05.03 and 37.05.04. We write $P_{i}$ , $E_{i}$ for probability and expectation conditioned on $X_{0} = i$ , and $T_{i} := in f {n \geq 1 : X_{n} = i}$ for the first return time to $i$ , with return probability $f_{ii} = P_{i} (T_{i} < \infty)$ . A state is recurrent if $f_{ii} = 1$ and transient if $f_{ii} < 1$ , per 37.05.04.

Definition (invariant measure and invariant distribution). A measure on $I$ is a row vector $ν = (ν_{i})_{i \in I}$ with $ν_{i} \in [0, \infty)$ . It is invariant (or stationary) for $P$ if $ν = ν P, that is ν_{j} = i \in I \sum ν_{i} p_{ij} for every j \in I .$ An invariant measure is nonzero if $ν_{i} > 0$ for at least one $i$ . An invariant measure with total mass $\sum_{i \in I} ν_{i} = 1$ is an invariant distribution (or stationary distribution), customarily written $π$ . If $X_{0} \sim π$ with $π$ invariant, then $X_{n} \sim π$ for all $n$ , since the law of $X_{n}$ is $π P^{n} = π$ .

Definition (excursion / return-time measure). Fix a reference state $k$ . The expected occupation measure of an excursion from $k$ is the row vector $γ^{k} = (γ_{i}^{k})_{i \in I}$ given by $γ_{i}^{k} := E_{k} [n = 0 \sum T_{k} - 1 1_{{X_{n} = i}}] = n \geq 0 \sum P_{k} (X_{n} = i, T_{k} > n),$ the expected number of visits to $i$ during one excursion from $k$ back to $k$ (counting the time- $0$ visit to $k$ , not the terminal return). By construction $γ_{k}^{k} = 1$ , and $\sum_{i \in I} γ_{i}^{k} = E_{k} [T_{k}]$ , the mean return time to $k$ .

Definition (positive and null recurrence). A recurrent state $i$ is positive recurrent if its mean return time $m_{i} := E_{i} [T_{i}]$ is finite, and null recurrent if $m_{i} = \infty$ . By the solidarity theorem proved below, this property is constant on a communicating class, so one speaks of a positive recurrent class or a null recurrent class, and of a positive/null recurrent chain when it is irreducible.

The chain on ${1, 2, 3}$ cycling $1 \to 2 \to 3 \to 1$ has $E_{i} [T_{i}] = 3 < \infty$ for each $i$ , so it is positive recurrent with $π = (\frac{1}{3}, \frac{1}{3}, \frac{1}{3})$ . The simple symmetric random walk on $Z$ is recurrent (per 37.05.04) but has $E_{0} [T_{0}] = \infty$ , so it is null recurrent; its invariant measure is the counting measure $ν_{i} \equiv 1$ , which has infinite total mass and admits no normalisation.

Counterexamples to common slips Intermediate+

An invariant measure need not be normalisable. A nonzero invariant measure $ν = ν P$ always exists for an irreducible recurrent chain, but $\sum_{i} ν_{i}$ may be infinite. Only when this sum is finite (positive recurrence) does an invariant distribution exist. The counting measure on the symmetric walk on $Z$ is invariant and not normalisable.
Uniqueness is only up to a scalar, and only under irreducibility. For an irreducible recurrent chain the invariant measure is unique up to multiplication by a positive constant. A reducible chain can have several linearly independent invariant measures, one per recurrent class.
Recurrence does not by itself give an invariant distribution. Null recurrence is exactly the case where the chain is recurrent yet has no invariant distribution. "Comes home with probability one" and "comes home in finite expected time" are different statements; the second is positive recurrence.
The excursion sum stops at $T_{k} - 1$ , not $T_{k}$ . Including the terminal return would double-count the reference state $k$ and break the invariance identity. The visit at time $T_{k}$ is the time- $0$ visit of the next excursion.

Key theorem with proof Intermediate+

Theorem (existence, invariance, and the Kac formula). Let $P$ be irreducible and recurrent, and fix a reference state $k$ .

(a) (Existence and invariance.) The excursion measure $γ^{k}$ defined above is a nonzero invariant measure: $0 < γ_{i}^{k} < \infty$ for every $i$ , $γ_{k}^{k} = 1$ , and $γ^{k} = γ^{k} P$ .

(b) (Uniqueness up to a scalar.) If $λ$ is any nonzero invariant measure, then $λ_{i} = λ_{k} γ_{i}^{k}$ for all $i$ ; hence the invariant measure is unique up to a positive multiplicative constant.

(c) (Kac formula and the positive/null split.) The total mass is $\sum_{i} γ_{i}^{k} = E_{k} [T_{k}] = m_{k}$ . The chain is positive recurrent (some, hence every, $m_{i} < \infty$ ) iff it admits an invariant distribution $π$ ; that distribution is unique and is given by $π_{i} = \frac{1}{E _{i} [ T _{i} ]} = \frac{1}{m _{i}} .$

Proof. Part (a). Finiteness and positivity: since the chain is irreducible there are $r, s \geq 1$ with $p_{k i}^{(r)} > 0$ and $p_{ik}^{(s)} > 0$ . Counting visits to $i$ in an excursion shows $γ^{k}$ dominates a single-step contribution, giving $γ_{i}^{k} > 0$ ; finiteness $γ_{i}^{k} < \infty$ follows once invariance is established together with $γ_{k}^{k} = 1$ , as we record below. The normalisation $γ_{k}^{k} = 1$ holds because in the window $0 \leq n < T_{k}$ the chain is at $k$ exactly once, at $n = 0$ .

Invariance is the heart. Write $γ_{i}^{k} = \sum_{n \geq 1} P_{k} (X_{n} = i, n \leq T_{k})$ , using $X_{0} = X_{T_{k}} = k$ to re-index the time- $0$ term of $i = k$ onto the terminal visit at $n = T_{k}$ (legitimate because $P_{k} (T_{k} < \infty) = 1$ by recurrence). For any $j$ , $i \sum γ_{i}^{k} p_{ij} = i \sum n \geq 1 \sum P_{k} (X_{n} = i, n \leq T_{k}) p_{ij} = n \geq 1 \sum i \sum P_{k} (X_{n - 1} = i, n - 1 < T_{k}) p_{ij},$ where in the last step we used that ${n \leq T_{k}} = {T_{k} > n - 1}$ is determined by $X_{0}, \dots, X_{n - 1}$ and applied the Markov property at time $n - 1$ : on ${T_{k} > n - 1}$ the event ${X_{n} = j}$ has conditional probability $p_{X_{n - 1} j}$ . Summing over $i$ collapses the inner sum to $P_{k} (X_{n} = j, n \leq T_{k})$ , so $i \sum γ_{i}^{k} p_{ij} = n \geq 1 \sum P_{k} (X_{n} = j, n \leq T_{k}) = γ_{j}^{k} .$ Thus $γ^{k} P = γ^{k}$ . Finiteness now follows: from $γ^{k} = γ^{k} P^{s}$ and $γ_{k}^{k} = 1$ we get $γ_{i}^{k} p_{ik}^{(s)} \leq γ_{k}^{k} = 1$ , so $γ_{i}^{k} \leq 1/ p_{ik}^{(s)} < \infty$ for the $s$ with $p_{ik}^{(s)} > 0$ .

Part (b). Let $λ = λ P$ be a nonzero invariant measure. We show $λ_{i} \geq λ_{k} γ_{i}^{k}$ and then equality. Unfolding $λ = λ P$ and isolating the reference state $k$ , $λ_{j} = λ_{k} p_{k j} + i \neq = k \sum λ_{i} p_{ij} .$ Iterating this identity and tracking the paths that avoid $k$ until the last step reproduces, term by term, the excursion sum: one obtains $λ_{j} \geq λ_{k} n = 0 \sum N P_{k} (X_{n} = j, T_{k} > n) for every N,$ and letting $N \to \infty$ gives $λ_{j} \geq λ_{k} γ_{j}^{k}$ . Set $μ := λ - λ_{k} γ^{k}$ . Then $μ \geq 0$ , $μ = μ P$ , and $μ_{k} = λ_{k} - λ_{k} \cdot 1 = 0$ . For any $i$ , irreducibility gives $r$ with $p_{ik}^{(r)} > 0$ , and $0 = μ_{k} = \sum_{l} μ_{l} p_{l k}^{(r)} \geq μ_{i} p_{ik}^{(r)} \geq 0$ forces $μ_{i} = 0$ . Hence $λ = λ_{k} γ^{k}$ .

Part (c). Summing invariance over all states is not what gives the mass; rather, by Tonelli, $i \sum γ_{i}^{k} = i \sum E_{k} [n = 0 \sum T_{k} - 1 1_{{X_{n} = i}}] = E_{k} [n = 0 \sum T_{k} - 1 1] = E_{k} [T_{k}] = m_{k} .$ If $m_{k} < \infty$ then $π := γ^{k} / m_{k}$ is an invariant distribution. Conversely, if an invariant distribution $π$ exists, then by part (b) $π = π_{k} γ^{k}$ , so $1 = \sum_{i} π_{i} = π_{k} m_{k}$ , forcing $m_{k} = 1/ π_{k} < \infty$ and $π_{k} = 1/ m_{k}$ . The same applied at any state $i$ (using $γ^{i}$ ) yields $π_{i} = 1/ m_{i}$ , the Kac formula, and shows $π$ is unique. Finiteness of one $m_{k}$ forces finiteness of all $m_{i}$ because $π_{i} = π_{k} γ_{i}^{k} > 0$ for every $i$ . $□$

Bridge. This theorem builds toward the convergence-to-equilibrium and ergodic theory of the chapter and appears again in every stationary-distribution computation, because it pins the equilibrium of an irreducible chain to a single excursion. The foundational reason an invariant measure exists at all is the strong Markov restart of 37.05.04: each return to $k$ launches an independent excursion, and the expected occupation of one excursion is exactly the quantity the one-step operator preserves. The Kac identity $π_{i} = 1/ m_{i}$ is exactly the renewal-theoretic statement that the long-run fraction of time at $i$ is the reciprocal mean gap between visits, and it is dual to the hitting-time calculus of 37.05.03, where the mean return time $m_{i} = 1 + \sum_{j} p_{ij} k_{j}^{{i}}$ is computed by first-step analysis; putting these together, the excursion measure and the mean return time are the same object read two ways, and the central insight is that positive recurrence is precisely finiteness of that excursion length. The bridge is that the dichotomy recurrent-versus-transient of 37.05.04 refines into transient / null recurrent / positive recurrent according to whether $\sum_{n} p_{ii}^{(n)}$ diverges and whether $m_{i}$ is finite, and this generalises the finite-state fact that every finite irreducible chain is automatically positive recurrent.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Show that for the simple symmetric random walk on $Z$ ( $p_{i, i + 1} = p_{i, i - 1} = \frac{1}{2}$ ) the counting measure $ν_{i} \equiv 1$ is invariant, and conclude (given that the walk is recurrent) that it is null recurrent.

Hint

Check $ν = ν P$ directly, then note that the total mass $\sum_{i} ν_{i}$ is infinite and invoke uniqueness up to a scalar.

Answer

For each $j$ , $(ν P)_{j} = \sum_{i} ν_{i} p_{ij} = ν_{j - 1} p_{j - 1, j} + ν_{j + 1} p_{j + 1, j} = 1 \cdot \frac{1}{2} + 1 \cdot \frac{1}{2} = 1 = ν_{j}$ , so $ν = ν P$ is invariant. The walk is irreducible and recurrent, so by uniqueness up to a scalar every invariant measure is a positive multiple of $ν$ . Each such multiple has infinite total mass $\sum_{i} c = \infty$ , so no invariant distribution exists. By the Key theorem the absence of an invariant distribution is exactly null recurrence: the walk is recurrent with $E_{0} [T_{0}] = \infty$ .

Exercise 4 (medium, symbolic).

Let $P$ be irreducible and recurrent with invariant measure $γ^{k}$ normalised by $γ_{k}^{k} = 1$ . Prove the Kac return-time formula $E_{k} [T_{k}] = \sum_{i} γ_{i}^{k}$ directly from the definition of $γ^{k}$ , without assuming positive recurrence.

Hint

Interchange the sum over states with the expectation using Tonelli's theorem.

Answer

By definition $γ_{i}^{k} = E_{k} [\sum_{n = 0}^{T_{k} - 1} 1_{{X_{n} = i}}]$ . Summing over $i$ and using Tonelli (all terms nonnegative) to exchange $\sum_{i}$ with $E_{k}$ , $i \sum γ_{i}^{k} = E_{k} [n = 0 \sum T_{k} - 1 i \sum 1_{{X_{n} = i}}] = E_{k} [n = 0 \sum T_{k} - 1 1] = E_{k} [T_{k}],$ since $\sum_{i} 1_{{X_{n} = i}} = 1$ for each $n$ and the number of integers $n$ with $0 \leq n \leq T_{k} - 1$ is $T_{k}$ . This holds whether or not $E_{k} [T_{k}]$ is finite; positive recurrence is the case where the common value is finite.

Exercise 5 (medium, numeric).

A birth-death chain on ${0, 1, 2, 3}$ has up-rates $p_{i, i + 1} = \frac{1}{2}$ and down-rates $p_{i, i - 1} = \frac{1}{2}$ in the interior, with reflecting ends $p_{0, 1} = 1$ , $p_{3, 2} = 1$ . Its invariant distribution satisfies detailed balance $π_{i} p_{i, i + 1} = π_{i + 1} p_{i + 1, i}$ . Find $π_{0}$ as a fraction.

Hint

From detailed balance compute $π_{1}, π_{2}, π_{3}$ in terms of $π_{0}$ using $π_{i + 1} = π_{i} p_{i, i + 1} / p_{i + 1, i}$ , then normalise.

Answer

Detailed balance: $π_{0} p_{0, 1} = π_{1} p_{1, 0}$ gives $π_{1} = π_{0} (1) / (\frac{1}{2}) = 2 π_{0}$ . Next $π_{1} p_{1, 2} = π_{2} p_{2, 1}$ gives $π_{2} = π_{1} (\frac{1}{2}) / (\frac{1}{2}) = π_{1} = 2 π_{0}$ . Then $π_{2} p_{2, 3} = π_{3} p_{3, 2}$ gives $π_{3} = π_{2} (\frac{1}{2}) / (1) = π_{0}$ . Summing: $π_{0} (1 + 2 + 2 + 1) = 6 π_{0} = 1$ , so $π_{0} = \frac{1}{6}$ . The full distribution is $(\frac{1}{6}, \frac{1}{3}, \frac{1}{3}, \frac{1}{6})$ , and the chain is positive recurrent with $E_{0} [T_{0}] = 6$ .

Exercise 6 (medium, multiple choice).

Which statement correctly distinguishes null from positive recurrence for an irreducible chain?

A. Positive recurrent means $f_{ii} = 1$ ; null recurrent means $f_{ii} < 1$ B. Positive recurrent means $E_{i} [T_{i}] < \infty$ ; null recurrent means $f_{ii} = 1$ but $E_{i} [T_{i}] = \infty$ C. Null recurrent chains are transient D. Positive recurrent chains have no invariant measure

Hint

Both are recurrent ( $f_{ii} = 1$ ); the split is on the mean return time.

Answer

B. Both positive and null recurrent states are recurrent ( $f_{ii} = 1$ , so option A confuses recurrence with the finer split). The split is on the mean return time: $E_{i} [T_{i}] < \infty$ is positive recurrence, $E_{i} [T_{i}] = \infty$ is null recurrence. Null recurrent chains are recurrent, not transient (so C is wrong), and positive recurrent chains have an invariant distribution — a normalisable invariant measure — so D is backwards.

Exercise 7 (hard, symbolic).

Prove that positive recurrence is a class property: if $i \leftrightarrow j$ and $i$ is positive recurrent, then $j$ is positive recurrent.

Hint

Use that an irreducible recurrent chain on the class has a unique invariant measure up to scaling; positive recurrence is finiteness of its total mass, a scale-independent property.

Answer

Restrict to the communicating class $C$ of $i$ and $j$ , which is closed (a recurrent class is closed, 37.05.04) and irreducible. By the Key theorem $C$ carries a nonzero invariant measure $γ$ , unique up to a positive scalar, with $γ_{l} > 0$ for all $l \in C$ . Positive recurrence of a state $l \in C$ is equivalent to $\sum_{m \in C} γ_{m} < \infty$ (then $π = γ / \sum γ$ is an invariant distribution and $m_{l} = 1/ π_{l} < \infty$ ). This total-mass condition does not depend on which state we singled out to normalise, because any two invariant measures differ by a positive scalar and so have simultaneously finite or infinite total mass. Hence $i$ positive recurrent $\Rightarrow \sum_{m} γ_{m} < \infty \Rightarrow j$ positive recurrent. Equivalently, $m_{j} < \infty$ for every $j \in C$ , so the property is constant on $C$ .

Exercise 8 (hard, symbolic).

Let $P$ be irreducible positive recurrent with stationary distribution $π$ , and let $A \subset I$ . Prove the general Kac formula: the mean return time to the set $A$ , starting from stationarity restricted to $A$ , satisfies $\sum_{i \in A} π_{i} E_{i} [T_{A}^{+}] = 1$ where $T_{A}^{+} = in f {n \geq 1 : X_{n} \in A}$ , and deduce the single-state form $π_{i} = 1/ E_{i} [T_{i}]$ .

Hint

Decompose the stationary path into excursions away from $A$ ; the long-run fraction of time the chain spends starting a fresh $A$ -excursion is $\sum_{i \in A} π_{i}$ , and each excursion lasts $E_{i} [T_{A}^{+}]$ on average.

Answer

Run the stationary chain. The long-run fraction of steps at which $X_{n} \in A$ is $π (A) = \sum_{i \in A} π_{i}$ by the ergodic theorem (visits to $A$ have density $π (A)$ ). Cut the trajectory at successive entry times to $A$ into excursions; by the strong Markov property 37.05.04 the excursion lengths starting from state $i \in A$ are i.i.d. with mean $E_{i} [T_{A}^{+}]$ , and the entry state has stationary frequency $π_{i} / π (A)$ among $A$ -visits. By renewal-reward the expected excursion length, averaged over entry states, is $\sum_{i \in A} \frac{π _{i}}{π ( A )} E_{i} [T_{A}^{+}]$ , and the visits to $A$ have density $π (A) = 1/ (mean excursion length)$ . Rearranging, $\sum_{i \in A} π_{i} E_{i} [T_{A}^{+}] = π (A) \cdot \frac{1}{π ( A )} = 1$ . Taking $A = {i}$ a single state gives $π_{i} E_{i} [T_{i}] = 1$ , i.e. $π_{i} = 1/ E_{i} [T_{i}]$ , recovering the Kac formula of the Key theorem.

Advanced results Master

The excursion measure organises everything: an irreducible recurrent chain has, up to scale, exactly one invariant measure, and the finiteness of its total mass is the sole datum separating positive from null recurrence. The series criterion of 37.05.04, the Kac formula, and the convergence theory are three readings of this one object.

Theorem 1 (existence, uniqueness, and structure of invariant measures). For an irreducible chain: (i) if recurrent, there is a nonzero invariant measure $γ^{k}$ , unique up to a positive scalar, strictly positive on $I$ ; (ii) the chain is positive recurrent iff $\sum_{i} γ_{i}^{k} = E_{k} [T_{k}] < \infty$ , in which case there is a unique invariant distribution $π_{i} = 1/ E_{i} [T_{i}]$ ; (iii) if transient there may be no invariant measure, one, or many — uniqueness fails outside recurrence. For a general (reducible) chain, the extreme invariant distributions are in bijection with the positive recurrent classes: each such class $C$ carries one $π^{C}$ supported on $C$ , and every invariant distribution is a convex combination $\sum_{C} a_{C} π^{C}$ with $a_{C} \geq 0$ , $\sum_{C} a_{C} = 1$ . Transient and null recurrent classes carry no invariant mass.

Theorem 2 (positive recurrence and convergence to equilibrium). If $P$ is irreducible, aperiodic, and positive recurrent with stationary distribution $π$ , then $p_{ij}^{(n)} \to π_{j}$ for all $i, j$ , and more generally $∥ μ P^{n} - π ∥_{TV} \to 0$ for every initial distribution $μ$ . If instead the chain is null recurrent (or transient), then $p_{ij}^{(n)} \to 0$ for all $i, j$ : there is no limiting distribution. Thus positive recurrence is exactly the regime in which the chain forgets its start and converges to a genuine equilibrium; the proof couples two independent copies, one started at $μ$ and one at $π$ , and uses that the meeting time is finite a.s. precisely because the product chain is positive recurrent. Periodicity does not affect existence of $π$ but blocks pointwise convergence, replaced by Cesàro convergence $\frac{1}{N} \sum_{n < N} p_{ij}^{(n)} \to π_{j}$ .

Theorem 3 (ergodic theorem for Markov chains). Let $P$ be irreducible positive recurrent with stationary $π$ , and $f : I \to R$ with $\sum_{i} π_{i} ∣ f (i) ∣ < \infty$ . Then for any initial distribution, almost surely $\frac{1}{N} n = 0 \sum N - 1 f (X_{n}) ⟶ i \sum π_{i} f (i) = E_{π} [f] .$ The proof partitions the trajectory into i.i.d. excursions between successive visits to a fixed state $k$ , applies the strong law of large numbers to the excursion-sums of $f$ and to the excursion lengths, and forms the ratio; the limit is $E_{k} [\sum_{n < T_{k}} f (X_{n})] / E_{k} [T_{k}] = \sum_{i} γ_{i}^{k} f (i) / m_{k} = \sum_{i} π_{i} f (i)$ . Taking $f = 1_{{i}}$ recovers $π_{i} = 1/ m_{i}$ as the long-run fraction of time at $i$ . Null recurrence breaks the theorem: the denominator $m_{k} = \infty$ sends the time-average of any $π$ -integrable $f$ to $0$ .

Theorem 4 (reversibility and detailed balance). A measure $ν$ satisfying the detailed-balance equations $ν_{i} p_{ij} = ν_{j} p_{j i}$ for all $i, j$ is automatically invariant; the converse fails. For an irreducible chain admitting a reversible measure, the transition operator is self-adjoint on $ℓ^{2} (ν)$ , its spectrum is real, and the spectral gap controls the rate of convergence in Theorem 2. Birth-death chains are always reversible, which is why their stationary distributions solve a one-dimensional telescoping recursion $π_{i + 1} = π_{i} p_{i, i + 1} / p_{i + 1, i}$ rather than the full linear system; positive recurrence of a birth-death chain is then the convergence of the series $\sum_{i} \prod_{m \leq i} p_{m - 1, m} / p_{m, m - 1}$ , the reversible reading of the excursion mass.

Synthesis. The foundational reason the entire long-run theory collapses to one vector is the strong Markov restart of 37.05.04: each return to $k$ launches an independent excursion, so the expected occupation of a single excursion is the object the one-step operator fixes, and uniqueness up to a scalar is the statement that an irreducible recurrent chain has only this one degree of freedom. This is exactly the renewal structure that makes the Kac formula $π_{i} = 1/ m_{i}$ a tautology of renewal-reward, and it is dual to the hitting-time calculus of 37.05.03: the mean return time $m_{i}$ computed there by first-step analysis is the total mass of the excursion measure computed here, the same number read as a potential and as an occupation time. Putting these together, the recurrent-versus-transient dichotomy of 37.05.04 refines into a trichotomy — transient ( $\sum_{n} p_{ii}^{(n)} < \infty$ ), null recurrent ( $\sum_{n} p_{ii}^{(n)} = \infty$ but $m_{i} = \infty$ ), positive recurrent ( $m_{i} < \infty$ ) — and the central insight is that only the third regime supports an equilibrium distribution and convergence to it. The ergodic theorem generalises the elementary $π_{i} = 1/ m_{i}$ to arbitrary observables, the convergence theorem upgrades it from time-averages to the marginal law under aperiodicity, and reversibility specialises it to a self-adjoint operator whose spectral gap quantifies the rate; across all four the bridge is the single excursion measure, whose finiteness is positive recurrence and whose reciprocal mass is the stationary distribution.

Full proof set Master

Proposition 1 (the excursion measure is invariant). For irreducible recurrent $P$ and fixed $k$ , the measure $γ_{i}^{k} = E_{k} [\sum_{n = 0}^{T_{k} - 1} 1_{{X_{n} = i}}]$ satisfies $γ^{k} = γ^{k} P$ , $γ_{k}^{k} = 1$ , and $0 < γ_{i}^{k} < \infty$ for all $i$ .

Proof. Write $γ_{i}^{k} = \sum_{n \geq 1} P_{k} (X_{n} = i, n \leq T_{k})$ , using $X_{T_{k}} = k = X_{0}$ under $P_{k}$ (valid as $T_{k} < \infty$ a.s. by recurrence) to shift the $n = 0$ term of the diagonal entry onto $n = T_{k}$ . For any $j$ , applying the Markov property at time $n - 1$ on the event ${T_{k} > n - 1} = {T_{k} \geq n}$ , which is $F_{n - 1}$ -measurable, $i \sum γ_{i}^{k} p_{ij} = n \geq 1 \sum i \sum P_{k} (X_{n - 1} = i, T_{k} \geq n) p_{ij} = n \geq 1 \sum P_{k} (X_{n} = j, T_{k} \geq n) = γ_{j}^{k} .$ So $γ^{k} = γ^{k} P$ , hence $γ^{k} = γ^{k} P^{m}$ for all $m$ . With $γ_{k}^{k} = 1$ (the chain is at $k$ exactly once in $0 \leq n < T_{k}$ ) and irreducibility giving $p_{ik}^{(s)} > 0$ for some $s$ , $1 = γ_{k}^{k} \geq γ_{i}^{k} p_{ik}^{(s)}$ yields $γ_{i}^{k} \leq 1/ p_{ik}^{(s)} < \infty$ ; and $γ_{i}^{k} \geq γ_{k}^{k} p_{k i}^{(r)} = p_{k i}^{(r)} > 0$ for some $r$ gives strict positivity. $□$

Proposition 2 (uniqueness up to a scalar). If $P$ is irreducible recurrent and $λ \geq 0$ satisfies $λ = λ P$ with $λ \neq = 0$ , then $λ = λ_{k} γ^{k}$ .

Proof. From $λ_{j} = λ_{k} p_{k j} + \sum_{i \neq = k} λ_{i} p_{ij}$ , substitute the same identity into each $λ_{i}$ with $i \neq = k$ repeatedly. After $N$ substitutions, tracking only the contributions of paths from $k$ that avoid $k$ at intermediate times, $λ_{j} \geq λ_{k} n = 0 \sum N P_{k} (X_{n} = j, T_{k} > n) .$ Let $N \to \infty$ : $λ_{j} \geq λ_{k} γ_{j}^{k}$ . Put $μ = λ - λ_{k} γ^{k} \geq 0$ ; then $μ = μ P$ and $μ_{k} = 0$ . For any $i$ pick $r$ with $p_{ik}^{(r)} > 0$ ; then $0 = μ_{k} = (μ P^{r})_{k} = \sum_{l} μ_{l} p_{l k}^{(r)} \geq μ_{i} p_{ik}^{(r)}$ , forcing $μ_{i} = 0$ . Hence $λ = λ_{k} γ^{k}$ . $□$

Proposition 3 (Kac formula). For irreducible positive recurrent $P$ the unique invariant distribution is $π_{i} = 1/ E_{i} [T_{i}]$ .

Proof. By Proposition 1, $γ^{i}$ is invariant with $γ_{i}^{i} = 1$ and total mass $\sum_{l} γ_{l}^{i} = E_{i} [T_{i}] = m_{i}$ (Tonelli, Exercise 4). Positive recurrence gives $m_{i} < \infty$ , so $π := γ^{i} / m_{i}$ is an invariant distribution with $π_{i} = γ_{i}^{i} / m_{i} = 1/ m_{i}$ . Uniqueness: any invariant distribution $π^{'}$ is, by Proposition 2 normalised, equal to $γ^{i} / (\sum_{l} γ_{l}^{i}) = π$ . $□$

Proposition 4 (positive recurrence is a class property; finite chains). Positive recurrence is constant on a communicating class, and every finite irreducible chain is positive recurrent.

Proof. On the (closed, irreducible) recurrent class $C$ , Proposition 2 gives a unique invariant measure up to scale; its total mass over $C$ is finite or infinite independently of the chosen normalisation, and finiteness is positive recurrence (Proposition 3 produces $π$ and $m_{l} = 1/ π_{l} < \infty$ for all $l \in C$ ). Hence all states of $C$ share the property. For a finite irreducible chain, recurrence holds and $γ^{k}$ has $∣ I ∣ < \infty$ entries each finite by Proposition 1, so $m_{k} = \sum_{i} γ_{i}^{k} < \infty$ : positive recurrent. $□$

Proposition 5 (ergodic time-average). For irreducible positive recurrent $P$ with stationary $π$ and $π$ -integrable $f$ , $\frac{1}{N} \sum_{n < N} f (X_{n}) \to E_{π} [f]$ a.s.

Proof. Fix a reference $k$ ; by recurrence the visit times $0 = τ_{0} < τ_{1} < \dots$ to $k$ are a.s. finite, and by the strong Markov property 37.05.04 the excursion blocks $W_{r} := \sum_{n = τ_{r - 1}}^{τ_{r} - 1} f (X_{n})$ and lengths $L_{r} := τ_{r} - τ_{r - 1}$ are i.i.d. for $r \geq 1$ with $E [L_{r}] = m_{k} < \infty$ and $E [W_{r}] = \sum_{i} γ_{i}^{k} f (i) = m_{k} E_{π} [f]$ (finite by $π$ -integrability and $π = γ^{k} / m_{k}$ ). Writing $S_{N} = \sum_{n < N} f (X_{n})$ and squeezing $N$ between $τ_{R}$ and $τ_{R + 1}$ where $R = R (N) \to \infty$ , the strong law gives $\frac{1}{R} \sum_{r \leq R} W_{r} \to E [W_{1}]$ and $\frac{1}{R} \sum_{r \leq R} L_{r} \to E [L_{1}]$ , so $\frac{S _{N}}{N} \sim \frac{\sum _{r \leq R} W _{r}}{\sum _{r \leq R} L _{r}} \to \frac{E [ W _{1} ]}{E [ L _{1} ]} = \frac{m _{k} E _{π} [ f ]}{m _{k}} = E_{π} [f] . □$

Proposition 6 (null recurrence kills the marginal limit). If $P$ is irreducible and not positive recurrent (transient or null recurrent), then $p_{ij}^{(n)} \to 0$ for all $i, j$ .

Proof. The transient case is the series criterion of 37.05.04: $\sum_{n} p_{ij}^{(n)} < \infty$ forces $p_{ij}^{(n)} \to 0$ . For null recurrence, suppose toward contradiction that $lim sup_{n} p_{i_{0} j_{0}}^{(n)} = c > 0$ along a subsequence. A diagonal/tightness argument extracts a further subsequence along which $p_{ij}^{(n)} \to ρ_{j}$ for all $j$ , and $ρ = ρP$ with $ρ \neq = 0$ ; by Fatou $\sum_{j} ρ_{j} \leq 1$ . Proposition 2 forces $ρ = c^{'} γ^{j_{0}}$ for a scalar $c^{'} > 0$ , so $\sum_{j} ρ_{j} = c^{'} m_{j_{0}} = \infty$ (null recurrence), contradicting $\sum_{j} ρ_{j} \leq 1$ . Hence $c = 0$ and $p_{ij}^{(n)} \to 0$ for all $i, j$ . $□$

Connections Master

Hitting probabilities and expected hitting times 37.05.03 supply the mean-return-time side of the Kac formula: the first-step decomposition $m_{i} = E_{i} [T_{i}] = 1 + \sum_{j} p_{ij} k_{j}^{{i}}$ computes the very quantity whose reciprocal is the stationary mass $π_{i}$ , so the minimal-nonnegative-solution theory there is the analytic dual of the excursion-occupation theory here, and the gambler's-ruin and birth-death mean times become explicit stationary distributions.
The strong Markov property and recurrence/transience dichotomy 37.05.04 is refined here from a dichotomy into a trichotomy: its geometric visit law and series criterion $\sum_{n} p_{ii}^{(n)}$ separate transient from recurrent, and the excursion decomposition it licenses is exactly what builds the invariant measure $γ^{k}$ and splits recurrence into null ( $m_{i} = \infty$ ) and positive ( $m_{i} < \infty$ ); every i.i.d.-excursion argument in this unit invokes its restart-at-a-stopping-time identity.
Class structure, irreducibility, and periodicity 37.05.02 determines where invariant mass can live: invariant distributions are supported on positive recurrent closed classes, one extreme point per class, and aperiodicity (versus the cyclic decomposition into period- $d$ subclasses) is what upgrades the existence of $π$ into pointwise convergence $p_{ij}^{(n)} \to π_{j}$ rather than mere Cesàro convergence.
Discrete martingales, stopping times, and optional stopping 37.04.01 underlies the ergodic and convergence theorems: the i.i.d. excursion sums are summed via the strong law, and the coupling proof of convergence to equilibrium runs the product chain until a finite meeting time, an optional-stopping-style argument on the stationary lift of the chain.

Historical & philosophical context Master

The notion of a stationary distribution traces to Andrei Markov's own 1906 study of chains with finitely many states, where the equilibrium vector and convergence to it were established for primitive transition matrices; the extension to countably infinite state spaces, where positive and null recurrence genuinely differ, belongs to the 1930s-1950s. Andrei Kolmogorov classified states into recurrent and transient and, in the recurrent case, into what later authors named positive (his "states with finite mean recurrence time") and null. The clean identity tying the stationary mass to the mean return time, $π_{i} = 1/ E_{i} [T_{i}]$ , was given its definitive probabilistic form by Mark Kac in 1947 ^{[Kac 1947]}, whose recurrence-time argument exhibits the stationary probability as the reciprocal of the expected first-return time and generalises to the set-version $\sum_{i \in A} π_{i} E_{i} [T_{A}^{+}] = 1$ .

The excursion construction of the invariant measure $γ^{k}$ as expected occupation between returns to a fixed state, and the uniqueness-up-to-a-scalar theorem for irreducible recurrent chains, are the form in which the theory was crystallised in the textbook tradition, presented cleanly by Norris ^{[Norris 1997]} following the renewal-theoretic viewpoint of William Feller and the potential-theoretic synthesis of Kai Lai Chung. The general-state-space lift — Harris recurrence, the existence of a unique invariant $σ$ -finite measure, and the positive/null split via the finiteness of its total mass — is the program of Meyn and Tweedie ^{[Meyn-Tweedie 2009]}, where the same excursion idea is run off a small set rather than a single point. The boundary example, simple symmetric random walk on $Z$ , sits exactly at the recurrent/null-recurrent edge: recurrent by Pólya, null recurrent because its unique invariant measure is the non-normalisable counting measure, the discrete shadow of translation-invariance with no finite total mass.

Bibliography Master

@book{Norris1997,
  author    = {Norris, James R.},
  title     = {Markov Chains},
  series    = {Cambridge Series in Statistical and Probabilistic Mathematics},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@article{Kac1947,
  author  = {Kac, Mark},
  title   = {On the notion of recurrence in discrete stochastic processes},
  journal = {Bulletin of the American Mathematical Society},
  volume  = {53},
  number  = {10},
  year    = {1947},
  pages   = {1002--1010}
}

@book{Durrett2019mc,
  author    = {Durrett, Rick},
  title     = {Probability: Theory and Examples},
  edition   = {5},
  publisher = {Cambridge University Press},
  year      = {2019}
}

@book{LevinPeres2017,
  author    = {Levin, David A. and Peres, Yuval},
  title     = {Markov Chains and Mixing Times},
  edition   = {2},
  publisher = {American Mathematical Society},
  year      = {2017}
}

@book{MeynTweedie2009,
  author    = {Meyn, Sean and Tweedie, Richard L.},
  title     = {Markov Chains and Stochastic Stability},
  edition   = {2},
  publisher = {Cambridge University Press},
  year      = {2009}
}

@article{Markov1906,
  author  = {Markov, Andrei A.},
  title   = {Rasprostranenie zakona bol'shikh chisel na velichiny, zavisyashchie drug ot druga},
  journal = {Izvestiya Fiziko-matematicheskogo obshchestva pri Kazanskom universitete},
  volume  = {15},
  year    = {1906},
  pages   = {135--156}
}

Prerequisites

37.05.03
37.05.04

Tier anchors

beginner: Norris 1997 *Markov Chains* (Cambridge) §1.7; informal picture of a long-run share of time spent in each state that the chain settles into and then preserves
intermediate: Norris 1997 *Markov Chains* (Cambridge) §1.7 (invariant measures, the excursion construction, positive and null recurrence); Durrett 2019 *Probability: Theory and Examples* 5e §5.5-5.6
master: Norris 1997 *Markov Chains* (Cambridge) §1.7-1.8; Durrett 2019 *Probability: Theory and Examples* 5e §5.5-5.6; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5-1.7; Meyn-Tweedie 2009 *Markov Chains and Stochastic Stability* 2e Ch. 10 (invariant measures, positive recurrence)

References

Norris — Markov Chains · Cambridge University Press 1997, §1.7 (invariant measures and the return-time construction), §1.8 (convergence to equilibrium)
Durrett — Probability: Theory and Examples, 5e · §5.5-5.6 (stationary measures, positive and null recurrence)
Kac — On the notion of recurrence in discrete stochastic processes · Bulletin of the American Mathematical Society 53 (1947), 1002-1010 (the identity pi_i = 1/E_i[T_i])
Levin-Peres — Markov Chains and Mixing Times, 2e · American Mathematical Society 2017, §1.5-1.7 (stationary distributions, hitting and return times)
Meyn-Tweedie — Markov Chains and Stochastic Stability, 2e · Cambridge University Press 2009, Ch. 10 (invariant measures and positive recurrence)

Estimated time

beginner: 18m
intermediate: 54m
master: 90m