37.05.10 · probability / 05-markov-chains

Recurrence, Invariant Distributions, and Convergence for Continuous-Time Chains

shipped3 tiersLean: none

Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §3.5-3.8; Asmussen 2003 *Applied Probability and Queues* 2e Ch. II §4-5; Liggett 2010 *Continuous Time Markov Processes* Ch. 2-3; Chung 1967 *Markov Chains with Stationary Transition Probabilities* 2e Ch. II (recurrence and the invariant measure)

Intuition Beginner

A continuous-time chain wanders among its states, holding each one for a random stretch of time before jumping to the next. Run it long enough and a striking regularity appears: the fraction of total elapsed time it has spent in each state stops drifting and settles to a fixed split. A small repair shop that cycles between idle, busy, and broken-down spends, over a year, some steady fraction of its hours in each condition, and that fraction does not depend on which condition it happened to start in. This unit is about that long-run split — when it exists, what fixes it, and how to compute it.

Two ideas from earlier units combine to pin the split down. The first is the jump chain: ignore the clock and just record the sequence of states visited. The second is the holding time: how long each visit lasts on average. The long-run time-fraction in a state is built from both — how often the jump chain visits the state, multiplied by how long it stays each time. A state visited rarely but held for a long while, and a state visited often but left quickly, can end up with the same share of the clock.

There is a second, more local way to find the same split, and it is the one practitioners reach for. In the long run the chain must enter and leave each state at equal rates, otherwise probability would pile up or drain away. Writing "rate of flow in equals rate of flow out" for every state gives a system of balance equations whose solution is exactly the long-run split. For many chains a stronger and simpler balance holds: between every pair of states the back-and-forth flow matches pair by pair, a condition called detailed balance, and when it holds the split can be read off almost by inspection.

Once the split is known, two convergence facts follow. The probability of finding the chain in a given state at a far-off time approaches that state's long-run share, with no parity worry that troubled the discrete-time version, because the random holding times smear out any rigid rhythm. And the time-average of any quantity along a single long run approaches the same share-weighted average — one path, watched long enough, samples the whole split.

The takeaway: a continuous-time chain has a long-run time-split among its states; you find it either by weighting jump-chain visits by holding times or by balancing the flow in and out of each state, and both the snapshot probabilities and the time-averages along one path converge to it.

Visual Beginner

Picture a single state of the chain as a tank with water flowing in from other states and out to other states; equilibrium is the level where inflow matches outflow.

The tanks are states; the pipes carry probability flow at the chain's rates. The long-run fill levels are the invariant split: they sit where every tank's inflow rate balances its outflow rate. The inset shows the special, easier situation of detailed balance, where each pipe-pair balances by itself. The timeline at the bottom is one run of the chain, blocks colored by state and as wide as the holding time; the long-run fraction of each color matches the fill levels.

Worked example Beginner

Take a machine that is either working or broken. While working it breaks down at rate (per day). While broken it gets repaired at rate (per day). We find the long-run fraction of time it is working.

Step 1. Read off the rates. From "working" the only move is to "broken," at rate . From "broken" the only move is to "working," at rate . So the chain leaves "working" at rate and leaves "broken" at rate .

Step 2. Balance the flow for the "working" tank. Let be the long-run fraction of time working and the fraction broken. Flow out of "working" is . Flow into "working" comes from "broken" at rate , so it is . Setting inflow equal to outflow: .

Step 3. Solve. The equation gives , so . The machine works of the time and is broken of the time.

Step 4. Check with holding times. The average holding time in "working" is day; in "broken" it is day. The jump chain just alternates working, broken, working, broken, so it visits each equally often. Weighting equal visits by holding time gives working : broken , so the working fraction is . Same answer by a different route.

What this tells us: the long-run split came out two ways. Balancing flow in and out of a state gave directly. Weighting how often the jump chain visits a state by how long it stays gave the same . The first method scales to many states as a system of balance equations; the second explains why those equations are right.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is a countable state space, a stable conservative -matrix with jump matrix and exit rates 37.05.08, the associated minimal right-continuous chain with explosion time , and its transition semigroup, , satisfying , , and the backward/forward equations of 37.05.09. The embedded jump chain is the discrete-time chain with matrix ; its recurrence, invariant measures, and positive/null classification are as in 37.05.02 and 37.05.05. We assume irreducible (equivalently irreducible) and, unless stated, non-explosive, so a.s. and is honest.

Definition (recurrence and transience). A state is recurrent for the continuous-time chain if , and transient if this probability is . For a non-explosive irreducible chain, is recurrent iff is recurrent for the jump chain ; recurrence and transience are class properties shared by all states. The return time to is , where is the first jump time. A recurrent state is positive recurrent if and null recurrent if .

Definition (invariant measure and invariant distribution). A measure , , is invariant for if equivalently (outflow rate inflow rate at ). An invariant distribution is an invariant measure that is a probability vector, . For a non-explosive chain, with holds iff for all (time-stationarity of the semigroup).

Definition (the jump-chain bridge). If is invariant for , then is invariant for the jump chain, ; conversely if then satisfies . Thus continuous-time invariance weights the jump-chain invariant measure by the mean holding time :

Definition (detailed balance / reversibility). The chain is in detailed balance with respect to a measure if A chain admitting a detailed-balance probability distribution is reversible: the stationary chain has the same law as its time-reversal .

Counterexamples to common slips Intermediate+

  • The continuous-time invariant is not the jump-chain invariant. and are different equations; they agree only when all exit rates are equal. Forgetting the holding-time weight is the most common error: a state the jump chain visits often but leaves quickly carries less clock-time than its visit count suggests.

  • Positive recurrence of the chain is not positive recurrence of the jump chain. The continuous-time chain is positive recurrent iff admits an invariant distribution, which can hold even when the jump chain is null recurrent (slow-but-long visits), and can fail when the jump chain is positive recurrent (fast visits with ). The holding times decouple the two notions.

  • No aperiodicity is required for convergence. In discrete time, periodicity blocks 37.05.06. In continuous time the exponential holding times have no period, so holds for every irreducible non-explosive positive-recurrent chain with no parity hypothesis.

  • Detailed balance is sufficient but not necessary for invariance. Summing over gives ; using (conservativity, with the term) yields . The converse fails: a directed cycle at equal rates has a uniform invariant distribution but no detailed-balance measure, since reverse rates vanish.

Key theorem with proof Intermediate+

Theorem (invariant distribution from the jump chain; convergence to equilibrium). Let be irreducible and non-explosive on . Then has an invariant distribution iff the chain is positive recurrent, in which case is unique and given by equivalently where is the (essentially unique) invariant measure of the jump chain . Moreover, for every and every initial distribution , with no aperiodicity hypothesis. [Norris 1997 §3.5-3.8]

Proof. Existence and the formula. Suppose the chain is positive recurrent. The jump chain is recurrent (recurrence is shared, since the chain visits a state unboundedly often in time iff returns infinitely often), so by 37.05.05 it has an invariant measure , unique up to scale, with . Set . Then for each , using for , , and . So . The total mass is , which is finite iff the chain is positive recurrent: by the renewal identity (Proposition 2 below) when is normalized so that counts mean jump-chain visits to per excursion from , whence and , which is (). Normalizing to total mass gives the invariant distribution . Uniqueness follows from uniqueness of up to scale.

Conversely, if has an invariant distribution , then is an invariant measure for with finite weighted mass , forcing positive recurrence.

Convergence to equilibrium. Fix a small and consider the -skeleton, the discrete-time chain with one-step matrix . Because has strictly positive diagonal (a state can stay put over with probability ) and is irreducible (irreducibility of gives for all when the chain is irreducible and non-explosive, as a positive holding-then-jumping path has positive density), the skeleton is irreducible and aperiodic. Its invariant distribution is , since from . By the discrete-time convergence theorem 37.05.06, as . To pass from the skeleton to continuous , note is uniformly continuous (it is Lipschitz with constant on bounded-rate skeleton steps, and in general uniformly continuous by the semigroup contraction inherited from each row being a distribution), so a sequence along the lattice that converges forces the continuous limit; varying over a rationally independent pair removes any lattice gap. Hence , and by dominated convergence over the countable sum. The exponential holding times supply the aperiodicity for free: has positive diagonal for every , so no periodicity obstruction can arise.

Bridge. This theorem builds toward the entire equilibrium theory of continuous-time Markov processes and appears again in queueing theory, where is the stationary queue-length law and () is the mean-busy-cycle formula. The foundational reason the continuous-time invariant differs from the jump-chain invariant is the holding-time weight : this is exactly the bridge that converts visit-frequency into clock-fraction, and it generalises the discrete reciprocal-return-time formula of 37.05.05 by inserting the exit rate. The convergence half is dual to the discrete coupling theorem 37.05.06 but cleaner, because the random holding times erase periodicity; putting these together, the snapshot law and the invariant are linked by the same positive-recurrence input that the next result reads pathwise, and the central insight is that one continuous-time chain carries two stationary objects — the time-stationary with and the jump-stationary with — bridged by the mean holding time.

Exercises Intermediate+

Advanced results Master

The long-run theory rests on four facts: recurrence is inherited from the jump chain but positive recurrence is a holding-time-weighted refinement; the invariant distribution solves and is the mean-holding-time reweighting of the jump-chain invariant measure; convergence to equilibrium holds with no periodicity hypothesis because exponential holding times carry no phase; and detailed balance is the reversible special case in which the invariant measure is read off pairwise.

Theorem 1 (recurrence dichotomy and the invariant measure). Let be irreducible and non-explosive. The chain is recurrent iff its jump chain is recurrent, and then has an invariant measure , unique up to scalar, with where . The chain is positive recurrent iff , iff an invariant distribution exists, iff for one (equivalently every) ; then . Positive and null recurrence are class properties, but the continuous-time class can differ from the jump-chain class: positive recurrent with gives a null-recurrent , and null recurrent with gives a positive-recurrent .

Theorem 2 (the two stationary measures and Asmussen's bridge). An irreducible recurrent carries two distinct stationary objects: the time-stationary measure with , governing and the long-run clock-occupation ; and the jump-stationary measure with , governing the long-run jump-count frequency . They are related by , the rate-conversion identity [Asmussen 2003]: visit-frequency times holding-time is clock-fraction. The PASTA principle and the rate-conservation law of queueing theory are corollaries — a Poisson arrival sees the time-stationary law , not the embedded jump law .

Theorem 3 (convergence with no aperiodicity; spectral form on finite ). For irreducible non-explosive positive-recurrent , for all , and for every , with no parity hypothesis. On a finite state space, 37.05.09 with supplying the rank-one stationary projection and the remaining eigenvalues having strictly negative real part for an irreducible , so convergence is exponential with rate the spectral gap . There is no imaginary-axis obstruction analogous to the discrete unit circle's roots of unity: irreducibility alone forces to be a simple eigenvalue and all others into the open left half-plane.

Theorem 4 (detailed balance, reversibility, and the cycle condition). admits a detailed-balance measure () iff Kolmogorov's cycle condition holds: for every finite cycle , When it holds, is invariant (Exercise 4) and the stationary chain is reversible: . Reversibility makes self-adjoint on , so its spectrum is real, the convergence rate is the spectral gap of a symmetric operator, and the powerful variational (Dirichlet-form) machinery applies. Birth-death chains are automatically reversible (every cycle is a back-and-forth, so the cycle condition is vacuous), which is why their invariant law () is computed by inspection.

Synthesis. The foundational reason continuous-time long-run behaviour splits cleanly is the embedding: the chain is the jump chain run on a clock with exponential holding times, and every long-run quantity factors into a jump-chain part and a holding-time part. This is exactly why recurrence is inherited from while positive recurrence is the holding-time-weighted refinement , and the bridge generalises the discrete reciprocal-return-time formula by inserting the exit rate — visit-frequency is dual to clock-fraction through the mean holding time. Putting these together, the invariant distribution , the convergence , and the ergodic average are three readings of the single positive-recurrence input : the first is its existence, the second its distributional pull, the third its pathwise law of large numbers via renewal-reward over excursions. The central insight is that the random clock buys two things the discrete theory cannot have for free — periodicity disappears, so convergence needs no aperiodicity, and reversibility, when Kolmogorov's cycle condition holds, makes self-adjoint on , so the same invariant that the renewal argument produces also organises the spectral theory, and the bridge between the analytic semigroup of 37.05.09 and the pathwise excursion structure of 37.05.08 is complete.

Full proof set Master

Proposition 1 (invariance is equivalent to time-stationarity). For a non-explosive chain, a probability vector satisfies iff for all .

Proof. If for all , differentiate at : , the derivative existing by 37.05.09. Conversely, suppose . The forward equation 37.05.09 gives (using , valid for the honest non-explosive chain). Set , a probability vector for each ; then with . The constant function also solves with . By uniqueness of the bounded solution of the forward system with given initial data (the -contraction estimate with Grönwall when is bounded, and the minimal-solution uniqueness of 37.05.09 in general), , so .

Proposition 2 (the renewal identity for the return time). For an irreducible positive-recurrent chain with jump-chain invariant measure normalized by (so ), the mean return time is , and hence .

Proof. Decompose the excursion from back to by the states the jump chain visits. Over one excursion the chain visits state a mean number of times (the regeneration formula for the jump chain's invariant measure 37.05.05), and each visit lasts an independent holding time of mean , independent of the visit count by the strong Markov property 37.05.08. By Wald's identity the total expected time of the excursion is . The invariant distribution is with , so , giving .

Proposition 3 (the -skeleton is irreducible and aperiodic). If is irreducible and non-explosive, then for every the matrix is irreducible and aperiodic, with invariant distribution whenever .

Proof. Irreducibility of gives, for each ordered pair , a path of positive jump probabilities; the event of taking these jumps within with all holding times summing to less than has positive probability, so . Hence is irreducible. For aperiodicity, , a positive self-loop, so the period of in divides and equals . Finally when is bounded equals by Proposition 1's series argument; in the unbounded case follows from (Proposition 1).

Proposition 4 (convergence to equilibrium). For irreducible non-explosive positive-recurrent with invariant , for all and for every .

Proof. By Proposition 3 the skeleton is irreducible aperiodic with invariant , so the discrete convergence theorem 37.05.06 gives as , for each fixed . The map is uniformly continuous: for , , and tends to uniformly as on any set where exit rates are bounded; in general uniform continuity follows from the total-variation contraction as for each row. Given , pick with the modulus of continuity below over and with for ; then for , . Hence . Summing, by dominated convergence (each term , dominated by summable to ).

Proposition 5 (continuous-time ergodic theorem). For irreducible positive-recurrent non-explosive and bounded , a.s.

Proof. As in Exercise 7, the successive returns to a fixed state cut the path into i.i.d. excursions by the strong Markov property 37.05.08. The cycle length has mean (positive recurrence) and the reward has mean , the occupation identity from Proposition 2. The renewal-reward theorem gives a.s., the partial last cycle negligible because is bounded and .

Proposition 6 (detailed balance characterises reversibility). A probability with satisfies detailed balance for all iff the stationary chain has the same finite-dimensional laws as its time-reversal .

Proof. The reversed stationary process is Markov with generator (the reversed rates: is the stationary flux relabelled). The reversal equals the forward chain in law iff for all , i.e. , i.e. — exactly detailed balance. (One checks is a conservative -matrix: -corrected since , using that is invariant, which detailed balance guarantees by Exercise 4.) Detailed balance further makes self-adjoint on : , so the spectrum of is real and the convergence rate is the spectral gap of a symmetric operator.

Connections Master

  • The Q-matrix, jump chain, and holding-time construction 37.05.08 is the substrate of every result here: recurrence is inherited from the jump chain , the mean holding time is the weight converting the jump-stationary into the time-stationary , and the i.i.d. excursion structure that the strong Markov property at return times produces is exactly what powers the renewal-reward proof of the ergodic theorem — the continuous-time return time and its finiteness are the positive-recurrence datum read pathwise.

  • The transition semigroup and Kolmogorov equations 37.05.09 supply the analytic side: is the infinitesimal form of time-stationarity , the forward equation gives the uniqueness argument identifying the two, and the matrix exponential on finite state spaces furnishes the spectral form whose term is the stationary projection and whose gap is the exponential convergence rate.

  • The discrete invariant-measure and positive/null-recurrence theory 37.05.05 is the template this unit reweights: the reciprocal-return-time formula becomes , the regeneration construction of the jump-chain invariant measure is imported wholesale, and the positive/null dichotomy is refined by the holding-time-weighted sum that can flip the classification between the chain and its jump chain.

  • The discrete convergence-to-equilibrium and coupling theory 37.05.06 is applied to the -skeleton to drive the continuous-time convergence, but with the periodicity hypothesis removed: the exponential holding times make every skeleton aperiodic, so the coupling theorem's aperiodicity requirement is automatically met, and the meeting-time tail there becomes the spectral-gap decay here.

Historical & philosophical context Master

The systematic theory of stationary and limiting distributions for continuous-time chains grew from Kolmogorov's analytic foundations of the 1930s, but the reversibility strand has a sharper origin. Andrei N. Kolmogorov's 1936 Mathematische Annalen note on the theory of Markov chains [Kolmogorov 1936] isolated the condition under which a chain's stationary process is statistically indistinguishable from its time-reversal, proving that reversibility is equivalent to the cycle condition on transition rates now called Kolmogorov's criterion. This converted a probabilistic symmetry into a checkable algebraic identity on the rate matrix.

The probabilistic computation of the invariant distribution through the embedded jump chain and the holding times — the bridge and the return-time formula — is the form given in Norris [Norris 1997], following the renewal-theoretic tradition of Feller and Chung. The distinction between the time-stationary measure with and the jump-stationary measure with , and its consequences for queueing networks (PASTA, rate conservation), is developed systematically by Asmussen [Asmussen 2003]. Frank Kelly's monograph on reversibility and stochastic networks [Kelly 1979] made detailed balance the organizing principle for product-form equilibrium distributions of interacting queueing systems, where the cycle condition's failure or success determines whether a network has a tractable stationary law.

Bibliography Master

@book{Norris1997,
  author    = {Norris, James R.},
  title     = {Markov Chains},
  series    = {Cambridge Series in Statistical and Probabilistic Mathematics},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@article{Kolmogorov1936,
  author  = {Kolmogorov, Andrei N.},
  title   = {Zur {T}heorie der {M}arkoffschen {K}etten},
  journal = {Mathematische Annalen},
  volume  = {112},
  year    = {1936},
  pages   = {155--160}
}

@book{Asmussen2003,
  author    = {Asmussen, S{\o}ren},
  title     = {Applied Probability and Queues},
  edition   = {2},
  series    = {Stochastic Modelling and Applied Probability},
  publisher = {Springer},
  year      = {2003}
}

@book{Liggett2010,
  author    = {Liggett, Thomas M.},
  title     = {Continuous Time Markov Processes: An Introduction},
  series    = {Graduate Studies in Mathematics},
  volume    = {113},
  publisher = {American Mathematical Society},
  year      = {2010}
}

@book{Kelly1979,
  author    = {Kelly, Frank P.},
  title     = {Reversibility and Stochastic Networks},
  publisher = {Wiley},
  year      = {1979}
}

@book{Chung1967,
  author    = {Chung, Kai Lai},
  title     = {Markov Chains with Stationary Transition Probabilities},
  edition   = {2},
  publisher = {Springer},
  year      = {1967}
}