Hitting Probabilities and Expected Hitting Times
Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §1.3-1.4; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5, Ch. 10 (hitting times); Grimmett-Stirzaker 2020 *Probability and Random Processes* 4e §6.3-6.4
Intuition Beginner
A Markov chain wanders among its states one step at a time. Two of the most natural questions you can ask are: starting from where I am now, will I ever reach a particular target set of states? And if I will, how long should I expect to wait? The first question is about a chance — the hitting probability. The second is about an average waiting time — the expected hitting time. Almost every concrete use of a Markov chain comes down to one of these two numbers.
Think of a token sitting on a row of squares numbered from to , with a wall at each end. At each step the token moves one square left or right by chance. The squares and are the targets: the game ends the moment the token reaches either wall. Starting from square , you might ask for the chance that the token reaches the right wall before the left wall . That is a hitting probability for the target set "right wall". You might also ask how many steps, on average, until the token hits either wall and the game ends. That is an expected hitting time.
The tool that cracks both questions is the same simple idea, called first-step analysis. Stand on your current square and look one move ahead. The chance of eventually reaching the target from here is the average, over the possible next squares, of the chance of reaching the target from each of those. The expected time to reach the target from here is one step (for the move you are about to take) plus the average of the expected times from wherever you land. In each case you relate the unknown at your square to the unknowns at the neighboring squares, and you get one equation per square. Solving the whole system of equations gives every answer at once.
There is one wrinkle worth flagging early. Sometimes these equations have more than one solution, and only one of them is the true answer. The true hitting probability and the true expected time are always the smallest non-negative solution of their equations. Picking the smallest is how you avoid spurious answers that the bare equations would otherwise allow.
The one-sentence takeaway: to find the chance of reaching a target set and the average time to reach it, set up one first-step equation per state and take the smallest non-negative solution.
Visual Beginner
Picture the numbered squares from to with walls at both ends and a token in the middle.
The two walls are the targets: once the token lands on or the walk stops. For the hitting probability of the right wall, the boundary values are fixed — chance if you start on , chance if you start on — and every interior square's value is the average of its two neighbors. For the expected time to reach a wall, the boundary values are both (you are already there) and every interior square's value is one plus the average of its neighbors. The "plus one" is the single extra step you always spend before landing somewhere new.
Worked example Beginner
Take the smaller walk on squares with walls at and . From each interior square the token moves left or right with chance one half each. We find the chance of reaching the right wall before the left wall , starting from each interior square.
Step 1. Name the unknowns. Let be the chances of hitting before , starting from squares . The boundary values are forced: (you start on the left wall, so you never reach the right one first) and (you start on the right wall).
Step 2. Write one first-step equation per interior square. From square the token goes to or with chance one half each, so . Likewise and .
Step 3. Solve. Substitute the expressions for and into the equation for : . So , giving .
Step 4. Back-substitute. Then and .
Step 5. Sanity check. The answers , , are exactly the starting square divided by . That matches the intuition that, in a fair walk, the chance of reaching the right wall first is proportional to how close you start to it.
What this tells us: by writing the chance at each square as the average of the chances at its neighbors and pinning the two boundary values, we turned a question about a random journey of unknown length into a small system of three equations. The hitting probability from square came out as , a clean answer that no amount of step-by-step simulation would have handed us so cleanly.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is a time-homogeneous Markov chain on a countable state space with stochastic transition matrix , in the sense of 37.05.01, and , denote probability and expectation for the chain started at . The class structure of 37.05.02 organizes which targets are reachable.
Definition (hitting time). Let be a target set. The hitting time of is the random variable with the convention , so on the event that the chain never enters . It is a stopping time for the filtration , since . When is a single state, write . The first return time differs from only by the lower index bound .
Definition (hitting probability). The hitting probability of from is The vector takes values in and satisfies for . For finite chains in which is reached with probability one from every state, ; the interesting case is when several disjoint targets compete, and is the absorption probability into .
Definition (mean hitting time). The mean (expected) hitting time of from is taking the value whenever , and otherwise the ordinary expectation of the integer-valued . Then for .
Definition (first-step / hitting system). The hitting-probability system associated with is the set of linear constraints on a vector : The mean-hitting-time system is A solution is non-negative if for all (and, for (K), values in are permitted). The central theorem identifies and as the minimal non-negative solutions of (H) and (K).
Counterexamples to common slips Intermediate+
The first-step equations alone do not determine . On the unrestricted random walk on with for and , the system (H) reads , whose solutions are the affine functions . The constraint and forces ; the genuine hitting probability is the minimal such, (recurrence), not the family . Minimality is what selects the right one.
Hitting probability and return probability differ. always (you start in ), whereas the return probability can be less than one. The lower index bound versus is the whole distinction; conflating with misstates recurrence.
A finite expected hitting time is strictly stronger than hitting with probability one. On the symmetric walk on , the origin is hit from any start with probability one (), yet the mean hitting time is . Solving (K) and obtaining a finite number presupposes the minimal solution is finite, which can fail even when (H) gives .
The sum in (K) runs over , not all . Once the chain enters it stops; the future steps from inside contribute nothing to . Writing instead of double-counts the terminal step and corrupts the system.
Key theorem with proof Intermediate+
Theorem (hitting probabilities are the minimal non-negative solution). The vector of hitting probabilities is the minimal non-negative solution of the system (H): it solves (H), and if is any non-negative solution of (H), then for all [Norris 1997 §1.3].
Proof. We first show solves (H), then prove minimality by an iterated-substitution argument.
Step 1 ( solves (H)). For , under , so . Fix . Then forces , and conditioning on the first step ,
By the Markov property of 37.05.01, conditionally on the post-time- chain is and independent of , and the event from time onward is the event that this restarted chain hits . Hence , and since ,
Thus satisfies (H).
Step 2 (minimality). Let solve (H). On , . Fix and unfold (H) repeatedly. Since for ,
Substitute the same expansion for each with :
using the finite-dimensional law of 37.05.01 to recognize . Iterating times,
The first sum is exactly , and the second (remainder) term is non-negative because and all are non-negative. Therefore
Letting , by continuity of measure along the increasing events . Hence . As was arbitrary, pointwise, which is minimality.
Bridge. This theorem builds toward the entire potential-theoretic and stationary theory of Markov chains, and the same minimal-solution principle appears again in the mean-hitting-time system (K), where is the minimal non-negative solution. The foundational reason both systems behave the same way is that first-step analysis is exactly the one-step conditioning afforded by the Markov property of 37.05.01: the unknown at a state is its target value plus a non-negative average over the next state, so iterating the equation reconstructs the chain's law up to time and leaves a non-negative remainder. This is exactly the discrete analogue of the boundary-value problem on the complement of with on , with the discrete generator playing the role of the Laplacian; the minimal-solution selection generalises to the Perron-Wiener-Brelot solution of the Dirichlet problem in classical potential theory, and the dichotomy " versus " is dual to the recurrence/transience split organized by the class structure of 37.05.02. The central insight is that absorption probabilities, recurrence, and expected costs are all minimal non-negative solutions of a linear system read off from , and putting these together, hitting theory is the harmonic analysis of the operator relative to a boundary set .
Exercises Intermediate+
Advanced results Master
Hitting theory is the discrete potential theory of the operator relative to a boundary set . The minimal-solution principle, the probabilistic representation of solutions, and the reward generalization organize the subject; the gambler's ruin and birth–death chains instantiate each closed form.
Theorem 1 (mean hitting times as the minimal non-negative solution). The vector of mean hitting times is the minimal non-negative solution in of the system (K): for and for . The proof parallels the hitting-probability case: first-step analysis shows solves (K), and for any non-negative solution of (K), iterated substitution yields for every , and letting via monotone convergence gives . The identity (tail-sum formula for a non-negative integer variable) is what converts the iterated remainder into the mean.
Theorem 2 (probabilistic solution of the Dirichlet–Poisson problem). Let and suppose for all . For a bounded boundary datum and a cost , the function is, when finite, the minimal non-negative-plus-bounded solution of the discrete Dirichlet–Poisson problem i.e. off with on . The hitting probability is the case on , "at infinity", ; the mean hitting time is the case , . The operator is the discrete Laplacian, and is its -Green-potential plus the harmonic extension of .
Theorem 3 (gambler's ruin, complete solution). For the walk on with absorbing endpoints, , , the probability of reaching before from is and the expected time to absorption at either wall is Both are the minimal non-negative solutions of (H) and (K), here unique because the finite chain hits with probability one. Letting with fixed gives the half-line ruin probability when and when , the recurrence/transience boundary at .
Theorem 4 (birth–death recurrence and absorption). For a birth–death chain on with , , , set , . The chain is recurrent (hits from every state with probability one) iff ; when the sum is finite the absorption probability at from is and escape to has positive probability. The potential coefficients are the discrete analogue of the scale function of a one-dimensional diffusion, and is the discrete analogue of an inaccessible boundary at infinity.
Synthesis. The foundational reason hitting probabilities and mean hitting times submit to the same method is that first-step analysis is the one-step conditioning of the Markov property of 37.05.01, so each unknown equals a boundary value plus a non-negative average of neighboring unknowns, and putting these together, iterating the equation reconstructs the law of the chain up to time with a non-negative remainder that is discarded only in the limit — which is exactly why the true answer is the minimal non-negative solution rather than any solution. This is exactly the discrete Dirichlet–Poisson problem off with on (Theorem 2): is the harmonic function with boundary value on , is the Green potential of the constant cost , and the central insight is that absorption, recurrence, and expected cost are three readings of one linear system built from , the discrete Laplacian dual to the diffusion generator of 02.15.03.
The gambler's ruin (Theorem 3) and birth–death chain (Theorem 4) are the solvable instances where the recurrence telescopes into a geometric series, and the potential coefficients generalise the scale function of a diffusion — the recurrence criterion is dual to an inaccessible boundary, so the discrete and continuous theories are two faces of the same potential theory. The minimal-solution principle is the bridge from the combinatorial class structure of 37.05.02 — where on a recurrent class and when a competing closed class drains probability — to the analytic equilibrium theory of 37.05.04, where the same operator governs stationary distributions through .
Full proof set Master
Proposition 1 (hitting probability solves (H) and is minimal). satisfies (H), and any non-negative solution of (H) dominates it: .
Proof. That solves (H) is Step 1 of the Key theorem (boundary value on ; first-step conditioning under the Markov property off ). For minimality, let solve (H). Iterating (H) and splitting each sum at membership in gives, for , the inequality because the remainder is a sum of non-negative terms. Continuity of measure along gives .
Proposition 2 (mean hitting time solves (K) and is minimal). satisfies (K), and any non-negative solution of (K) dominates it: .
Proof. First-step analysis (Exercise 3) gives for and on , so solves (K). For minimality, let solve (K). For , . Substituting (K) into itself times, each step contributing a fresh and leaving a non-negative remainder, using the tail-sum formula for a non-negative integer variable . Monotone convergence as gives .
Proposition 3 (gambler's ruin probability). For the walk on with , , , the absorption probability at from is .
Proof. (H) reads , , . Writing , the equation gives , so . Then , and fixes , yielding the stated formula. It is the unique solution meeting both boundary conditions, hence minimal.
Proposition 4 (gambler's ruin expected duration). For the same walk with , the expected time to absorption is .
Proof. (K) reads , . The particular solution satisfies . The homogeneous equation has characteristic roots and , so . Imposing () and gives , , producing the stated closed form.
Proposition 5 (symmetric limits). As the gambler's ruin formulas tend to and .
Proof. Put . By l'Hôpital in (or expanding ), , giving . For , write and expand with ; the first-order terms cancel against and the second-order terms give after collecting the contributions. Directly, is checked to solve since .
Proposition 6 (birth–death absorption formula). With , the finite-horizon absorption probability is , and .
Proof. (H) for is , , . With , the relation gives , so . Summing and yields . The events increase to as , so continuity of measure gives ; the limit is for all iff .
Connections Master
The Markov property, transition matrices, and Chapman–Kolmogorov
37.05.01supply the engine of first-step analysis: conditioning on and restarting the chain is the one-step form of the Markov property, and the iterated-substitution proof of minimality is the finite-dimensional law summed over non- paths. The hitting time is a stopping time for the filtration built there, and the strong Markov property at underlies the regenerative reading of repeated hits.The class structure, irreducibility, and periodicity
37.05.02determine the qualitative outcome of hitting theory: exactly when is reached from every state, which for a finite chain means meets every closed communicating class; when two closed classes compete, the absorption probabilities partition the unit mass among them, and the open (transient) classes are exactly where the mean hitting time can be finite. Recurrence of a class is the statement inside it.The stationary distribution and convergence theory of irreducible chains
37.05.04reuses the same operator : where hitting theory solves off with a boundary condition, equilibrium theory solves the adjoint , and the Kac return-time formula ties the mean return time computed by (K) directly to the stationary probability; positive recurrence is finiteness of that mean hitting time.The continuous-state diffusion generator
02.15.03is the analytic limit: the discrete Dirichlet–Poisson problem off becomes the boundary-value problem for the second-order generator , hitting probabilities become harmonic measure, mean hitting times become solutions of , and the birth–death potential coefficients become the scale function whose finiteness criterion classifies boundary behavior at infinity.
Historical & philosophical context Master
The gambler's ruin problem is among the oldest in probability, posed in the correspondence of Blaise Pascal and Pierre de Fermat in 1656 and given its first printed treatment by Christiaan Huygens, who appended it as one of five problems to his 1657 tract De ratiociniis in ludo aleae [Huygens 1657]. Huygens computed, for the symmetric and the biased game, the probability that one of two players is ruined before the other, effectively solving system (H) for the finite walk centuries before the matrix formalism existed; Jacob Bernoulli and Abraham de Moivre extended the analysis, de Moivre obtaining the closed geometric form for the biased case in the Doctrine of Chances.
The recasting of these computations as the solution of a linear system indexed by the states — first-step analysis — and the recognition that the genuine hitting probability is the minimal non-negative solution belong to the twentieth-century theory of Markov chains, consolidated in the textbook treatments of Feller and, in the form followed here, of Norris [Norris 1997], whose §1.3 isolates the minimality principle as the device that selects the probabilistic solution among the affine family the bare equations admit. The identification of hitting probabilities with harmonic functions and mean hitting times with Green potentials places the subject inside the discrete potential theory developed by Doob, Hunt, and Dynkin, where the Markov chain is the probabilistic counterpart of the Laplacian and the boundary-value problems of classical potential theory acquire a sample-path meaning.
Bibliography Master
@book{Norris1997,
author = {Norris, James R.},
title = {Markov Chains},
series = {Cambridge Series in Statistical and Probabilistic Mathematics},
publisher = {Cambridge University Press},
year = {1997}
}
@book{Huygens1657,
author = {Huygens, Christiaan},
title = {De ratiociniis in ludo aleae},
publisher = {Elsevier (Leiden)},
year = {1657}
}
@book{Durrett2019mc,
author = {Durrett, Rick},
title = {Probability: Theory and Examples},
edition = {5},
publisher = {Cambridge University Press},
year = {2019}
}
@book{LevinPeres2017,
author = {Levin, David A. and Peres, Yuval},
title = {Markov Chains and Mixing Times},
edition = {2},
publisher = {American Mathematical Society},
year = {2017}
}
@book{GrimmettStirzaker2020,
author = {Grimmett, Geoffrey R. and Stirzaker, David R.},
title = {Probability and Random Processes},
edition = {4},
publisher = {Oxford University Press},
year = {2020}
}
@book{Feller1968,
author = {Feller, William},
title = {An Introduction to Probability Theory and Its Applications, Vol. 1},
edition = {3},
publisher = {Wiley},
year = {1968}
}