37.05.03 · probability / 05-markov-chains

Hitting Probabilities and Expected Hitting Times

shipped3 tiersLean: none

Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §1.3-1.4; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5, Ch. 10 (hitting times); Grimmett-Stirzaker 2020 *Probability and Random Processes* 4e §6.3-6.4

Intuition Beginner

A Markov chain wanders among its states one step at a time. Two of the most natural questions you can ask are: starting from where I am now, will I ever reach a particular target set of states? And if I will, how long should I expect to wait? The first question is about a chance — the hitting probability. The second is about an average waiting time — the expected hitting time. Almost every concrete use of a Markov chain comes down to one of these two numbers.

Think of a token sitting on a row of squares numbered from $0$ to $10$ , with a wall at each end. At each step the token moves one square left or right by chance. The squares $0$ and $10$ are the targets: the game ends the moment the token reaches either wall. Starting from square $7$ , you might ask for the chance that the token reaches the right wall $10$ before the left wall $0$ . That is a hitting probability for the target set "right wall". You might also ask how many steps, on average, until the token hits either wall and the game ends. That is an expected hitting time.

The tool that cracks both questions is the same simple idea, called first-step analysis. Stand on your current square and look one move ahead. The chance of eventually reaching the target from here is the average, over the possible next squares, of the chance of reaching the target from each of those. The expected time to reach the target from here is one step (for the move you are about to take) plus the average of the expected times from wherever you land. In each case you relate the unknown at your square to the unknowns at the neighboring squares, and you get one equation per square. Solving the whole system of equations gives every answer at once.

There is one wrinkle worth flagging early. Sometimes these equations have more than one solution, and only one of them is the true answer. The true hitting probability and the true expected time are always the smallest non-negative solution of their equations. Picking the smallest is how you avoid spurious answers that the bare equations would otherwise allow.

The one-sentence takeaway: to find the chance of reaching a target set and the average time to reach it, set up one first-step equation per state and take the smallest non-negative solution.

Visual Beginner

Picture the numbered squares from $0$ to $10$ with walls at both ends and a token in the middle.

The two walls are the targets: once the token lands on $0$ or $10$ the walk stops. For the hitting probability of the right wall, the boundary values are fixed — chance $1$ if you start on $10$ , chance $0$ if you start on $0$ — and every interior square's value is the average of its two neighbors. For the expected time to reach a wall, the boundary values are both $0$ (you are already there) and every interior square's value is one plus the average of its neighbors. The "plus one" is the single extra step you always spend before landing somewhere new.

Worked example Beginner

Take the smaller walk on squares ${0, 1, 2, 3, 4}$ with walls at $0$ and $4$ . From each interior square $1, 2, 3$ the token moves left or right with chance one half each. We find the chance of reaching the right wall $4$ before the left wall $0$ , starting from each interior square.

Step 1. Name the unknowns. Let $h_{1}, h_{2}, h_{3}$ be the chances of hitting $4$ before $0$ , starting from squares $1, 2, 3$ . The boundary values are forced: $h_{0} = 0$ (you start on the left wall, so you never reach the right one first) and $h_{4} = 1$ (you start on the right wall).

Step 2. Write one first-step equation per interior square. From square $2$ the token goes to $1$ or $3$ with chance one half each, so $h_{2} = \frac{1}{2} h_{1} + \frac{1}{2} h_{3}$ . Likewise $h_{1} = \frac{1}{2} h_{0} + \frac{1}{2} h_{2} = \frac{1}{2} h_{2}$ and $h_{3} = \frac{1}{2} h_{2} + \frac{1}{2} h_{4} = \frac{1}{2} h_{2} + \frac{1}{2}$ .

Step 3. Solve. Substitute the expressions for $h_{1}$ and $h_{3}$ into the equation for $h_{2}$ : $h_{2} = \frac{1}{2} (\frac{1}{2} h_{2}) + \frac{1}{2} (\frac{1}{2} h_{2} + \frac{1}{2}) = \frac{1}{2} h_{2} + \frac{1}{4}$ . So $\frac{1}{2} h_{2} = \frac{1}{4}$ , giving $h_{2} = \frac{1}{2}$ .

Step 4. Back-substitute. Then $h_{1} = \frac{1}{2} h_{2} = \frac{1}{4}$ and $h_{3} = \frac{1}{2} h_{2} + \frac{1}{2} = \frac{3}{4}$ .

Step 5. Sanity check. The answers $h_{1} = 1/4$ , $h_{2} = 1/2$ , $h_{3} = 3/4$ are exactly the starting square divided by $4$ . That matches the intuition that, in a fair walk, the chance of reaching the right wall first is proportional to how close you start to it.

What this tells us: by writing the chance at each square as the average of the chances at its neighbors and pinning the two boundary values, we turned a question about a random journey of unknown length into a small system of three equations. The hitting probability from square $i$ came out as $i /4$ , a clean answer that no amount of step-by-step simulation would have handed us so cleanly.

Check your understanding Beginner

Exercise (easy, multiple choice).

In first-step analysis for a hitting probability, the value $h_{i}$ at an interior state $i$ is written as:

A. The sum of the values at all states B. The average of the values $h_{j}$ at the states $j$ you can step to, weighted by the one-step chances $p_{ij}$ C. The value at the target state only D. One plus the average of the neighboring values

Hint

Look one step ahead and average the chance of success over where you might land next. Hitting probabilities carry no "plus one".

Answer

B. A weighted average of the neighbors' values. From an interior state the chance of eventually hitting the target is the average, over the states you might step to next, of the chance of success from each, weighted by the one-step chance of going there. Feedback-correct: this averaging over the next state is the first-step equation. Feedback-wrong: D adds a "plus one" that belongs to expected times, not probabilities; A and C ignore the one-step structure entirely.

Formal definition Intermediate+

Throughout, $(X_{n})_{n \geq 0}$ is a time-homogeneous Markov chain on a countable state space $I$ with stochastic transition matrix $P = (p_{ij})$ , in the sense of 37.05.01, and $P_{i}$ , $E_{i}$ denote probability and expectation for the chain started at $X_{0} = i$ . The class structure of 37.05.02 organizes which targets are reachable.

Definition (hitting time). Let $A \subseteq I$ be a target set. The hitting time of $A$ is the random variable $H^{A} := in f {n \geq 0 : X_{n} \in A},$ with the convention $in f \emptyset = + \infty$ , so $H^{A} = \infty$ on the event that the chain never enters $A$ . It is a stopping time for the filtration $F_{n} = σ (X_{0}, \dots, X_{n})$ , since ${H^{A} \leq n} = ⋃_{m \leq n} {X_{m} \in A} \in F_{n}$ . When $A = {j}$ is a single state, write $H^{j}$ . The first return time $T_{j} := in f {n \geq 1 : X_{n} = j}$ differs from $H^{j}$ only by the lower index bound $n \geq 1$ .

Definition (hitting probability). The hitting probability of $A$ from $i$ is $h_{i}^{A} := P_{i} (H^{A} < \infty) = P_{i} (X_{n} \in A for some n \geq 0) .$ The vector $h^{A} = (h_{i}^{A})_{i \in I}$ takes values in $[0, 1]$ and satisfies $h_{i}^{A} = 1$ for $i \in A$ . For finite chains in which $A$ is reached with probability one from every state, $h^{A} \equiv 1$ ; the interesting case is when several disjoint targets compete, and $h_{i}^{A}$ is the absorption probability into $A$ .

Definition (mean hitting time). The mean (expected) hitting time of $A$ from $i$ is $k_{i}^{A} := E_{i} [H^{A}] = n \geq 0 \sum n P_{i} (H^{A} = n) + \infty \cdot P_{i} (H^{A} = \infty),$ taking the value $+ \infty$ whenever $P_{i} (H^{A} = \infty) > 0$ , and otherwise the ordinary expectation of the integer-valued $H^{A}$ . Then $k_{i}^{A} = 0$ for $i \in A$ .

Definition (first-step / hitting system). The hitting-probability system associated with $A$ is the set of linear constraints on a vector $x = (x_{i})_{i \in I}$ : $x_{i} = 1 (i \in A), x_{i} = j \in I \sum p_{ij} x_{j} (i \in / A) . (H)$ The mean-hitting-time system is $x_{i} = 0 (i \in A), x_{i} = 1 + j \in / A \sum p_{ij} x_{j} (i \in / A) . (K)$ A solution is non-negative if $x_{i} \geq 0$ for all $i$ (and, for (K), values in $[0, \infty]$ are permitted). The central theorem identifies $h^{A}$ and $k^{A}$ as the minimal non-negative solutions of (H) and (K).

Counterexamples to common slips Intermediate+

The first-step equations alone do not determine $h^{A}$ . On the unrestricted random walk on $Z_{\geq 0}$ with $p_{i, i + 1} = p_{i, i - 1} = 1/2$ for $i \geq 1$ and $A = {0}$ , the system (H) reads $x_{i} = \frac{1}{2} x_{i - 1} + \frac{1}{2} x_{i + 1}$ , whose solutions are the affine functions $x_{i} = a + bi$ . The constraint $x_{0} = 1$ and $0 \leq x_{i} \leq 1$ forces $b \leq 0$ ; the genuine hitting probability is the minimal such, $x_{i} \equiv 1$ (recurrence), not the family $1 - bi$ . Minimality is what selects the right one.
Hitting probability and return probability differ. $h_{j}^{j} = 1$ always (you start in $A = {j}$ ), whereas the return probability $P_{j} (T_{j} < \infty)$ can be less than one. The lower index bound $n \geq 0$ versus $n \geq 1$ is the whole distinction; conflating $H^{j}$ with $T_{j}$ misstates recurrence.
A finite expected hitting time is strictly stronger than hitting with probability one. On the symmetric walk on $Z$ , the origin is hit from any start with probability one ( $h = 1$ ), yet the mean hitting time is $+ \infty$ . Solving (K) and obtaining a finite number presupposes the minimal solution is finite, which can fail even when (H) gives $h \equiv 1$ .
The sum in (K) runs over $j \in / A$ , not all $j$ . Once the chain enters $A$ it stops; the future steps from inside $A$ contribute nothing to $H^{A}$ . Writing $\sum_{j \in I}$ instead of $\sum_{j \in / A}$ double-counts the terminal step and corrupts the system.

Key theorem with proof Intermediate+

Theorem (hitting probabilities are the minimal non-negative solution). The vector of hitting probabilities $h^{A} = (h_{i}^{A})_{i \in I}$ is the minimal non-negative solution of the system (H): it solves (H), and if $x = (x_{i})$ is any non-negative solution of (H), then $x_{i} \geq h_{i}^{A}$ for all $i$ ^{[Norris 1997 §1.3]}.

Proof. We first show $h^{A}$ solves (H), then prove minimality by an iterated-substitution argument.

Step 1 ( $h^{A}$ solves (H)). For $i \in A$ , $H^{A} = 0$ under $P_{i}$ , so $h_{i}^{A} = 1$ . Fix $i \in / A$ . Then $X_{0} = i \in / A$ forces $H^{A} \geq 1$ , and conditioning on the first step $X_{1} = j$ , $h_{i}^{A} = P_{i} (H^{A} < \infty) = j \in I \sum P_{i} (X_{1} = j) P_{i} (H^{A} < \infty ∣ X_{1} = j) .$ By the Markov property of 37.05.01, conditionally on $X_{1} = j$ the post-time- $1$ chain is $Markov (δ_{j}, P)$ and independent of $X_{0}$ , and the event ${H^{A} < \infty}$ from time $1$ onward is the event that this restarted chain hits $A$ . Hence $P_{i} (H^{A} < \infty ∣ X_{1} = j) = h_{j}^{A}$ , and since $P_{i} (X_{1} = j) = p_{ij}$ , $h_{i}^{A} = j \in I \sum p_{ij} h_{j}^{A} (i \in / A) .$ Thus $h^{A}$ satisfies (H).

Step 2 (minimality). Let $x \geq 0$ solve (H). On $A$ , $x_{i} = 1 = h_{i}^{A}$ . Fix $i \in / A$ and unfold (H) repeatedly. Since $x_{j} = 1$ for $j \in A$ , $x_{i} = j \sum p_{ij} x_{j} = j \in A \sum p_{ij} + j \in / A \sum p_{ij} x_{j} = P_{i} (X_{1} \in A) + j \in / A \sum p_{ij} x_{j} .$ Substitute the same expansion for each $x_{j}$ with $j \in / A$ : $x_{i} = P_{i} (X_{1} \in A) + j \in / A \sum p_{ij} (P_{j} (X_{1} \in A) + k \in / A \sum p_{j k} x_{k}) = P_{i} (X_{1} \in A) + P_{i} (X_{1} \in / A, X_{2} \in A) + j, k \in / A \sum p_{ij} p_{j k} x_{k},$ using the finite-dimensional law of 37.05.01 to recognize $\sum_{j \in / A} p_{ij} P_{j} (X_{1} \in A) = P_{i} (X_{1} \in / A, X_{2} \in A)$ . Iterating $n$ times, $x_{i} = m = 1 \sum n P_{i} (X_{1} \in / A, \dots, X_{m - 1} \in / A, X_{m} \in A) + j_{1}, \dots, j_{n} \in / A \sum p_{i j_{1}} p_{j_{1} j_{2}} \dots p_{j_{n - 1} j_{n}} x_{j_{n}} .$ The first sum is exactly $P_{i} (H^{A} \leq n)$ , and the second (remainder) term is non-negative because $x \geq 0$ and all $p$ are non-negative. Therefore $x_{i} \geq P_{i} (H^{A} \leq n) for every n .$ Letting $n \to \infty$ , $P_{i} (H^{A} \leq n) ↑ P_{i} (H^{A} < \infty) = h_{i}^{A}$ by continuity of measure along the increasing events ${H^{A} \leq n}$ . Hence $x_{i} \geq h_{i}^{A}$ . As $i$ was arbitrary, $x \geq h^{A}$ pointwise, which is minimality. $□$

Bridge. This theorem builds toward the entire potential-theoretic and stationary theory of Markov chains, and the same minimal-solution principle appears again in the mean-hitting-time system (K), where $k^{A}$ is the minimal non-negative solution. The foundational reason both systems behave the same way is that first-step analysis is exactly the one-step conditioning afforded by the Markov property of 37.05.01: the unknown at a state is its target value plus a non-negative average over the next state, so iterating the equation reconstructs the chain's law up to time $n$ and leaves a non-negative remainder. This is exactly the discrete analogue of the boundary-value problem $L h = 0$ on the complement of $A$ with $h = 1$ on $A$ , with the discrete generator $P - I_{d}$ playing the role of the Laplacian; the minimal-solution selection generalises to the Perron-Wiener-Brelot solution of the Dirichlet problem in classical potential theory, and the dichotomy " $h^{A} \equiv 1$ versus $h^{A} < 1$ " is dual to the recurrence/transience split organized by the class structure of 37.05.02. The central insight is that absorption probabilities, recurrence, and expected costs are all minimal non-negative solutions of a linear system read off from $P$ , and putting these together, hitting theory is the harmonic analysis of the operator $P$ relative to a boundary set $A$ .

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Derive the mean-hitting-time system (K) by first-step analysis: show that for $i \in / A$ , $k_{i}^{A} = 1 + \sum_{j \in / A} p_{ij} k_{j}^{A}$ , and explain why the sum excludes $j \in A$ .

Hint

Condition on $X_{1}$ . On ${X_{1} \in A}$ the chain has already hit $A$ after exactly one step.

Answer

For $i \in / A$ , $X_{0} \in / A$ forces $H^{A} \geq 1$ . Write $H^{A} = 1 + H^{A} \circ θ_{1}$ , where $θ_{1}$ is the time shift, valid because the hitting time of $A$ counted from time $1$ equals $H^{A} - 1$ on ${H^{A} \geq 1}$ . By the Markov property, conditional on $X_{1} = j$ the shifted hitting time $H^{A} \circ θ_{1}$ has the law of $H^{A}$ under $P_{j}$ , so $E_{i} [H^{A} \circ θ_{1} ∣ X_{1} = j] = k_{j}^{A}$ . Taking expectations, $k_{i}^{A} = 1 + \sum_{j \in I} p_{ij} k_{j}^{A}$ . But $k_{j}^{A} = 0$ for $j \in A$ , so those terms drop and $k_{i}^{A} = 1 + \sum_{j \in / A} p_{ij} k_{j}^{A}$ . The sum excludes $A$ precisely because hitting $A$ stops the clock: there is no further waiting time accumulated from a state already in $A$ .

Exercise 5 (medium, symbolic).

Solve the asymmetric gambler's ruin: a walk on ${0, 1, \dots, N}$ with absorbing endpoints, $p_{i, i + 1} = p$ , $p_{i, i - 1} = q = 1 - p$ , $p \neq = q$ . Find $h_{i}$ , the probability of reaching $N$ before $0$ from $i$ .

Hint

The difference $h_{i} - h_{i - 1}$ forms a geometric sequence with ratio $q / p$ . Sum the telescoping differences and use the boundary values $h_{0} = 0$ , $h_{N} = 1$ .

Answer

System (H) reads $h_{i} = p h_{i + 1} + q h_{i - 1}$ for $0 < i < N$ , with $h_{0} = 0$ , $h_{N} = 1$ . Rewrite as $p (h_{i + 1} - h_{i}) = q (h_{i} - h_{i - 1})$ , so the differences $d_{i} := h_{i} - h_{i - 1}$ satisfy $d_{i + 1} = (q / p) d_{i}$ , hence $d_{i} = (q / p)^{i - 1} d_{1}$ . Summing, $h_{i} = \sum_{m = 1}^{i} d_{m} = d_{1} \sum_{m = 1}^{i} (q / p)^{m - 1} = d_{1} \frac{1 - ( q / p ) ^{i}}{1 - ( q / p )}$ . The condition $h_{N} = 1$ fixes $d_{1}$ , yielding $h_{i} = \frac{1 - ( q / p ) ^{i}}{1 - ( q / p ) ^{N}} .$ Because $0 < h_{i} < 1$ strictly and this is the unique solution meeting both boundary conditions, it is in particular the minimal non-negative solution. As $p \to 1/2$ this limits to $h_{i} = i / N$ , recovering the symmetric case.

Exercise 6 (medium, multiple choice).

On the symmetric walk on $Z$ with target $A = {0}$ , the system (H) is $x_{i} = \frac{1}{2} x_{i - 1} + \frac{1}{2} x_{i + 1}$ for $i \neq = 0$ with $x_{0} = 1$ . Which statement is correct?

A. (H) has a unique bounded solution, so minimality is irrelevant B. The genuine hitting probability is $h_{i} \equiv 1$ , the minimal non-negative solution; other affine solutions $1 - b ∣ i ∣$ are spurious C. The hitting probability is $h_{i} = 1/ (1 + ∣ i ∣)$ D. (H) has no non-negative solution

Hint

Affine functions solve the averaging equation. Among non-negative solutions with $x_{0} = 1$ , which is smallest?

Answer

B. The bounded harmonic solutions of $x_{i} = \frac{1}{2} (x_{i - 1} + x_{i + 1})$ are affine on each side, and non-negativity with $x_{0} = 1$ admits the whole family $x_{i} = 1 - b ∣ i ∣$ for small $b \geq 0$ until positivity is violated. The minimal non-negative solution is $h_{i} \equiv 1$ , reflecting recurrence of the symmetric walk: the origin is hit with probability one from every start. Minimality is exactly what discards the spurious sub-one solutions, so A is false; C and D are simply wrong.

Exercise 7 (hard, symbolic).

For the asymmetric gambler's ruin on ${0, \dots, N}$ with $p \neq = q$ , compute the mean hitting time $k_{i}$ of the wall set ${0, N}$ from $i$ .

Hint

Solve the inhomogeneous recurrence $k_{i} = 1 + p k_{i + 1} + q k_{i - 1}$ with $k_{0} = k_{N} = 0$ . A particular solution is linear in $i$ ; the homogeneous part is spanned by $1$ and $(q / p)^{i}$ .

Answer

The system (K) is $k_{i} = 1 + p k_{i + 1} + q k_{i - 1}$ for $0 < i < N$ , $k_{0} = k_{N} = 0$ . A particular solution of $p k_{i + 1} - k_{i} + q k_{i - 1} = - 1$ is $k_{i}^{p} = i / (q - p)$ , since $p \frac{i + 1}{q - p} - \frac{i}{q - p} + q \frac{i - 1}{q - p} = \frac{( p + q ) i + ( p - q )}{q - p} = \frac{i - ( q - p )}{q - p} \cdot$ ; more directly, substituting $k_{i}^{p} = i / (q - p)$ gives $1 + (p i + p - i + q i - q) / (q - p) = 1 + (p - q) / (q - p) = 1 - 1 = 0$ as required. The homogeneous solutions of $p u_{i + 1} - u_{i} + q u_{i - 1} = 0$ are spanned by $1$ and $(q / p)^{i}$ . Hence $k_{i} = \frac{i}{q - p} + A + B (\frac{q}{p})^{i} .$ Imposing $k_{0} = 0$ gives $A + B = 0$ , and $k_{N} = 0$ gives $A + B (q / p)^{N} = - N / (q - p)$ . Solving, $B = \frac{N}{( q - p ) ( ( q / p ) ^{N} - 1 )}$ and $A = - B$ , so $k_{i} = \frac{i}{q - p} - \frac{N}{q - p} \cdot \frac{( q / p ) ^{i} - 1}{( q / p ) ^{N} - 1} .$ This is the unique solution with the two boundary values, hence the minimal non-negative one. As $p \to 1/2$ it tends to $i (N - i)$ , the symmetric formula.

Exercise 8 (hard, symbolic).

A birth–death chain on ${0, 1, 2, \dots}$ has $p_{i, i + 1} = p_{i}$ , $p_{i, i - 1} = q_{i}$ ( $q_{i} + p_{i} = 1$ , $q_{i} > 0$ for $i \geq 1$ ), and $0$ is absorbing. Let $h_{i} = P_{i} (ever hit 0)$ . Show $h_{i} \equiv 1$ for all $i$ iff $\sum_{i \geq 1} γ_{i} = \infty$ , where $γ_{i} = \prod_{m = 1}^{i} \frac{q _{m}}{p _{m}}$ (with $γ_{0} = 1$ ).

Hint

Work with the absorption probability at $0$ via the escape probability. Set $u_{i} = 1 - h_{i} = P_{i} (never hit 0)$ ; it is the minimal non-negative solution of the homogeneous system with $u_{0} = 0$ that is also bounded by considering hitting ${0}$ before ${N}$ and letting $N \to \infty$ .

Answer

Fix $N$ and let $h_{i}^{(N)} = P_{i} (hit 0 before N)$ . By the asymmetric-walk telescoping argument with state-dependent ratios, the differences $d_{i} = h_{i - 1}^{(N)} - h_{i}^{(N)}$ satisfy $p_{i} d_{i + 1} = q_{i} d_{i}$ , so $d_{i + 1} = (q_{i} / p_{i}) d_{i}$ and $d_{i} = γ_{i - 1} \cdot (const)$ after telescoping the product $\prod (q_{m} / p_{m})$ . Imposing $h_{0}^{(N)} = 1$ , $h_{N}^{(N)} = 0$ gives $h_{i}^{(N)} = \frac{\sum _{m = i}^{N - 1} γ _{m}}{\sum _{m = 0}^{N - 1} γ _{m}} .$ As $N \to \infty$ the events ${hit 0 before N}$ increase to ${hit 0}$ , so by continuity of measure $h_{i} = lim_{N} h_{i}^{(N)} = \frac{\sum _{m \geq i} γ _{m}}{\sum _{m \geq 0} γ _{m}}$ when the total sum converges, and $h_{i} = 1$ when $\sum_{m \geq 0} γ_{m} = \infty$ (the numerator and denominator both diverge, with the ratio tending to $1$ since the omitted head $\sum_{m < i} γ_{m}$ is finite). Hence $h_{i} \equiv 1$ for all $i$ iff $\sum_{i \geq 0} γ_{i} = \infty$ , i.e. $\sum_{i \geq 1} γ_{i} = \infty$ . This is the recurrence criterion for the birth–death chain, and the passage through the finite- $N$ problem followed by a monotone limit is the same minimality mechanism as in the Key theorem: $h_{i}$ is the increasing limit of finite-horizon absorption probabilities.

Advanced results Master

Hitting theory is the discrete potential theory of the operator $P$ relative to a boundary set $A$ . The minimal-solution principle, the probabilistic representation of solutions, and the reward generalization organize the subject; the gambler's ruin and birth–death chains instantiate each closed form.

Theorem 1 (mean hitting times as the minimal non-negative solution). The vector $k^{A} = (k_{i}^{A})_{i \in I}$ of mean hitting times is the minimal non-negative solution in $[0, \infty]$ of the system (K): $k_{i}^{A} = 0$ for $i \in A$ and $k_{i}^{A} = 1 + \sum_{j \in / A} p_{ij} k_{j}^{A}$ for $i \in / A$ . The proof parallels the hitting-probability case: first-step analysis shows $k^{A}$ solves (K), and for any non-negative solution $y$ of (K), iterated substitution yields $y_{i} \geq \sum_{m = 1}^{n} P_{i} (H^{A} \geq m) = E_{i} [H^{A} \land n]$ for every $n$ , and letting $n \to \infty$ via monotone convergence gives $y_{i} \geq E_{i} [H^{A}] = k_{i}^{A}$ . The identity $E_{i} [H^{A}] = \sum_{m \geq 1} P_{i} (H^{A} \geq m)$ (tail-sum formula for a non-negative integer variable) is what converts the iterated remainder into the mean.

Theorem 2 (probabilistic solution of the Dirichlet–Poisson problem). Let $A \subseteq I$ and suppose $P_{i} (H^{A} < \infty) = 1$ for all $i$ . For a bounded boundary datum $f : A \to R$ and a cost $g : I ∖ A \to R_{\geq 0}$ , the function $ϕ_{i} := E_{i} f (X_{H^{A}}) + n = 0 \sum H^{A} - 1 g (X_{n})$ is, when finite, the minimal non-negative-plus-bounded solution of the discrete Dirichlet–Poisson problem $ϕ_{i} = f (i) (i \in A), ϕ_{i} - j \sum p_{ij} ϕ_{j} = g (i) (i \in / A),$ i.e. $(I_{d} - P) ϕ = g$ off $A$ with $ϕ = f$ on $A$ . The hitting probability $h^{A}$ is the case $f \equiv 1$ on $A$ , $f \equiv 0$ "at infinity", $g \equiv 0$ ; the mean hitting time $k^{A}$ is the case $f \equiv 0$ , $g \equiv 1$ . The operator $I_{d} - P = - (P - I_{d})$ is the discrete Laplacian, and $ϕ$ is its $g$ -Green-potential plus the harmonic extension of $f$ .

Theorem 3 (gambler's ruin, complete solution). For the walk on ${0, 1, \dots, N}$ with absorbing endpoints, $p_{i, i + 1} = p$ , $p_{i, i - 1} = q = 1 - p$ , the probability of reaching $N$ before $0$ from $i$ is $h_{i} = ⎩ ⎨ ⎧ \frac{1 - ( q / p ) ^{i}}{1 - ( q / p ) ^{N}}, \frac{i}{N}, p \neq = q, p = q = \frac{1}{2},$ and the expected time to absorption at either wall is $k_{i} = ⎩ ⎨ ⎧ \frac{i}{q - p} - \frac{N}{q - p} \cdot \frac{( q / p ) ^{i} - 1}{( q / p ) ^{N} - 1}, i (N - i), p \neq = q, p = q = \frac{1}{2} .$ Both are the minimal non-negative solutions of (H) and (K), here unique because the finite chain hits ${0, N}$ with probability one. Letting $N \to \infty$ with $i$ fixed gives the half-line ruin probability $P_{i} (hit 0) = 1$ when $p \leq q$ and $(q / p)^{i}$ when $p > q$ , the recurrence/transience boundary at $p = 1/2$ .

Theorem 4 (birth–death recurrence and absorption). For a birth–death chain on $Z_{\geq 0}$ with $p_{i, i + 1} = p_{i}$ , $p_{i, i - 1} = q_{i}$ , $q_{i} > 0$ , set $γ_{i} = \prod_{m = 1}^{i} (q_{m} / p_{m})$ , $γ_{0} = 1$ . The chain is recurrent (hits $0$ from every state with probability one) iff $\sum_{i \geq 0} γ_{i} = \infty$ ; when the sum is finite the absorption probability at $0$ from $i$ is $h_{i} = (\sum_{m \geq i} γ_{m}) / (\sum_{m \geq 0} γ_{m}) < 1$ and escape to $+ \infty$ has positive probability. The potential coefficients $γ_{i}$ are the discrete analogue of the scale function of a one-dimensional diffusion, and $\sum γ_{i} = \infty$ is the discrete analogue of an inaccessible boundary at infinity.

Synthesis. The foundational reason hitting probabilities and mean hitting times submit to the same method is that first-step analysis is the one-step conditioning of the Markov property of 37.05.01, so each unknown equals a boundary value plus a non-negative average of neighboring unknowns, and putting these together, iterating the equation reconstructs the law of the chain up to time $n$ with a non-negative remainder that is discarded only in the limit — which is exactly why the true answer is the minimal non-negative solution rather than any solution. This is exactly the discrete Dirichlet–Poisson problem $(I_{d} - P) ϕ = g$ off $A$ with $ϕ = f$ on $A$ (Theorem 2): $h^{A}$ is the harmonic function with boundary value $1$ on $A$ , $k^{A}$ is the Green potential of the constant cost $1$ , and the central insight is that absorption, recurrence, and expected cost are three readings of one linear system built from $P - I_{d}$ , the discrete Laplacian dual to the diffusion generator of 02.15.03.

The gambler's ruin (Theorem 3) and birth–death chain (Theorem 4) are the solvable instances where the recurrence $p h_{i + 1} - h_{i} + q h_{i - 1} = 0$ telescopes into a geometric series, and the potential coefficients $γ_{i}$ generalise the scale function of a diffusion — the recurrence criterion $\sum γ_{i} = \infty$ is dual to an inaccessible boundary, so the discrete and continuous theories are two faces of the same potential theory. The minimal-solution principle is the bridge from the combinatorial class structure of 37.05.02 — where $h^{A} \equiv 1$ on a recurrent class and $h^{A} < 1$ when a competing closed class drains probability — to the analytic equilibrium theory of 37.05.04, where the same operator $P - I_{d}$ governs stationary distributions through $π (P - I_{d}) = 0$ .

Full proof set Master

Proposition 1 (hitting probability solves (H) and is minimal). $h^{A}$ satisfies (H), and any non-negative solution $x$ of (H) dominates it: $x \geq h^{A}$ .

Proof. That $h^{A}$ solves (H) is Step 1 of the Key theorem (boundary value $1$ on $A$ ; first-step conditioning under the Markov property off $A$ ). For minimality, let $x \geq 0$ solve (H). Iterating (H) and splitting each sum at membership in $A$ gives, for $i \in / A$ , $x_{i} = m = 1 \sum n P_{i} (X_{1} \in / A, \dots, X_{m - 1} \in / A, X_{m} \in A) + j_{1}, \dots, j_{n} \in / A \sum p_{i j_{1}} \dots p_{j_{n - 1} j_{n}} x_{j_{n}} \geq P_{i} (H^{A} \leq n),$ the inequality because the remainder is a sum of non-negative terms. Continuity of measure along ${H^{A} \leq n} ↑ {H^{A} < \infty}$ gives $x_{i} \geq h_{i}^{A}$ . $□$

Proposition 2 (mean hitting time solves (K) and is minimal). $k^{A}$ satisfies (K), and any non-negative solution $y \in [0, \infty]^{I}$ of (K) dominates it: $y \geq k^{A}$ .

Proof. First-step analysis (Exercise 3) gives $k_{i}^{A} = 1 + \sum_{j \in / A} p_{ij} k_{j}^{A}$ for $i \in / A$ and $k_{i}^{A} = 0$ on $A$ , so $k^{A}$ solves (K). For minimality, let $y \geq 0$ solve (K). For $i \in / A$ , $y_{i} = 1 + \sum_{j \in / A} p_{ij} y_{j} \geq 1 = P_{i} (H^{A} \geq 1)$ . Substituting (K) into itself $n$ times, each step contributing a fresh $P_{i} (H^{A} \geq m)$ and leaving a non-negative remainder, $y_{i} \geq m = 1 \sum n P_{i} (H^{A} \geq m) = E_{i} [H^{A} \land n],$ using the tail-sum formula $E [Z \land n] = \sum_{m = 1}^{n} P (Z \geq m)$ for a non-negative integer variable $Z$ . Monotone convergence as $n \to \infty$ gives $y_{i} \geq E_{i} [H^{A}] = k_{i}^{A}$ . $□$

Proposition 3 (gambler's ruin probability). For the walk on ${0, \dots, N}$ with $p_{i, i + 1} = p$ , $p_{i, i - 1} = q$ , $p \neq = q$ , the absorption probability at $N$ from $i$ is $h_{i} = (1 - (q / p)^{i}) / (1 - (q / p)^{N})$ .

Proof. (H) reads $h_{i} = p h_{i + 1} + q h_{i - 1}$ , $h_{0} = 0$ , $h_{N} = 1$ . Writing $d_{i} = h_{i} - h_{i - 1}$ , the equation $p (h_{i + 1} - h_{i}) = q (h_{i} - h_{i - 1})$ gives $d_{i + 1} = (q / p) d_{i}$ , so $d_{i} = (q / p)^{i - 1} d_{1}$ . Then $h_{i} = \sum_{m = 1}^{i} d_{m} = d_{1} \frac{1 - ( q / p ) ^{i}}{1 - ( q / p )}$ , and $h_{N} = 1$ fixes $d_{1} \frac{1}{1 - ( q / p )} = 1/ (1 - (q / p)^{N})$ , yielding the stated formula. It is the unique solution meeting both boundary conditions, hence minimal. $□$

Proposition 4 (gambler's ruin expected duration). For the same walk with $p \neq = q$ , the expected time to absorption is $k_{i} = \frac{i}{q - p} - \frac{N}{q - p} \cdot \frac{( q / p ) ^{i} - 1}{( q / p ) ^{N} - 1}$ .

Proof. (K) reads $k_{i} = 1 + p k_{i + 1} + q k_{i - 1}$ , $k_{0} = k_{N} = 0$ . The particular solution $k_{i}^{p} = i / (q - p)$ satisfies $1 + p \frac{i + 1}{q - p} - \frac{i}{q - p} + q \frac{i - 1}{q - p} = 1 + \frac{( p + q - 1 ) i + ( p - q )}{q - p} = 1 + \frac{p - q}{q - p} = 0$ . The homogeneous equation $p u_{i + 1} - u_{i} + q u_{i - 1} = 0$ has characteristic roots $1$ and $q / p$ , so $k_{i} = \frac{i}{q - p} + A + B (q / p)^{i}$ . Imposing $k_{0} = 0$ ( $A + B = 0$ ) and $k_{N} = 0$ gives $B = N / ((q - p) ((q / p)^{N} - 1))$ , $A = - B$ , producing the stated closed form. $□$

Proposition 5 (symmetric limits). As $p \to 1/2$ the gambler's ruin formulas tend to $h_{i} = i / N$ and $k_{i} = i (N - i)$ .

Proof. Put $r = q / p \to 1$ . By l'Hôpital in $r$ (or expanding $r = 1 + ϵ$ ), $\frac{1 - r ^{i}}{1 - r ^{N}} \to \frac{i}{N}$ , giving $h_{i} \to i / N$ . For $k_{i}$ , write $q - p = - (1 - 2 p)$ and expand $r^{i} = 1 + i ϵ + (2 i) ϵ^{2} + O (ϵ^{3})$ with $ϵ = r - 1$ ; the first-order terms cancel against $i / (q - p)$ and the second-order terms give $k_{i} \to \frac{1}{2} (N i - i^{2}) \cdot 2 = i (N - i)$ after collecting the $(2 i), (2 N)$ contributions. Directly, $k_{i} = i (N - i)$ is checked to solve $k_{i} = 1 + \frac{1}{2} k_{i + 1} + \frac{1}{2} k_{i - 1}$ since $\frac{1}{2} [(i + 1) (N - i - 1) + (i - 1) (N - i + 1)] = i (N - i) - 1$ . $□$

Proposition 6 (birth–death absorption formula). With $γ_{i} = \prod_{m = 1}^{i} (q_{m} / p_{m})$ , the finite-horizon absorption probability is $h_{i}^{(N)} = (\sum_{m = i}^{N - 1} γ_{m}) / (\sum_{m = 0}^{N - 1} γ_{m})$ , and $h_{i} = lim_{N} h_{i}^{(N)}$ .

Proof. (H) for $h_{i}^{(N)} = P_{i} (hit 0 before N)$ is $h_{i}^{(N)} = p_{i} h_{i + 1}^{(N)} + q_{i} h_{i - 1}^{(N)}$ , $h_{0}^{(N)} = 1$ , $h_{N}^{(N)} = 0$ . With $d_{i} = h_{i - 1}^{(N)} - h_{i}^{(N)} \geq 0$ , the relation $p_{i} (h_{i}^{(N)} - h_{i + 1}^{(N)}) = q_{i} (h_{i - 1}^{(N)} - h_{i}^{(N)})$ gives $d_{i + 1} = (q_{i} / p_{i}) d_{i}$ , so $d_{i + 1} = γ_{i} d_{1}$ . Summing $d_{i + 1} + \dots + d_{N} = h_{i}^{(N)} - h_{N}^{(N)} = h_{i}^{(N)}$ and $d_{1} + \dots + d_{N} = 1$ yields $h_{i}^{(N)} = (\sum_{m = i}^{N - 1} γ_{m}) / (\sum_{m = 0}^{N - 1} γ_{m})$ . The events ${hit 0 before N}$ increase to ${hit 0}$ as $N ↑ \infty$ , so continuity of measure gives $h_{i} = lim_{N} h_{i}^{(N)}$ ; the limit is $1$ for all $i$ iff $\sum_{m} γ_{m} = \infty$ . $□$

Connections Master

The Markov property, transition matrices, and Chapman–Kolmogorov 37.05.01 supply the engine of first-step analysis: conditioning on $X_{1}$ and restarting the chain is the one-step form of the Markov property, and the iterated-substitution proof of minimality is the finite-dimensional law $λ_{i_{0}} \prod p_{i_{r} i_{r + 1}}$ summed over non- $A$ paths. The hitting time $H^{A}$ is a stopping time for the filtration built there, and the strong Markov property at $H^{A}$ underlies the regenerative reading of repeated hits.
The class structure, irreducibility, and periodicity 37.05.02 determine the qualitative outcome of hitting theory: $h^{A} \equiv 1$ exactly when $A$ is reached from every state, which for a finite chain means $A$ meets every closed communicating class; when two closed classes compete, the absorption probabilities $h_{i}^{A}$ partition the unit mass among them, and the open (transient) classes are exactly where the mean hitting time can be finite. Recurrence of a class is the statement $h_{i}^{{j}} \equiv 1$ inside it.
The stationary distribution and convergence theory of irreducible chains 37.05.04 reuses the same operator $P - I_{d}$ : where hitting theory solves $(P - I_{d}) h = 0$ off $A$ with a boundary condition, equilibrium theory solves the adjoint $π (P - I_{d}) = 0$ , and the Kac return-time formula $E_{i} [T_{i}] = 1/ π_{i}$ ties the mean return time computed by (K) directly to the stationary probability; positive recurrence is finiteness of that mean hitting time.
The continuous-state diffusion generator 02.15.03 is the analytic limit: the discrete Dirichlet–Poisson problem $(I_{d} - P) ϕ = g$ off $A$ becomes the boundary-value problem $- L ϕ = g$ for the second-order generator $L$ , hitting probabilities become harmonic measure, mean hitting times become solutions of $L u = - 1$ , and the birth–death potential coefficients $γ_{i}$ become the scale function whose finiteness criterion classifies boundary behavior at infinity.

Historical & philosophical context Master

The gambler's ruin problem is among the oldest in probability, posed in the correspondence of Blaise Pascal and Pierre de Fermat in 1656 and given its first printed treatment by Christiaan Huygens, who appended it as one of five problems to his 1657 tract De ratiociniis in ludo aleae ^{[Huygens 1657]}. Huygens computed, for the symmetric and the biased game, the probability that one of two players is ruined before the other, effectively solving system (H) for the finite walk centuries before the matrix formalism existed; Jacob Bernoulli and Abraham de Moivre extended the analysis, de Moivre obtaining the closed geometric form $(q / p)^{i}$ for the biased case in the Doctrine of Chances.

The recasting of these computations as the solution of a linear system indexed by the states — first-step analysis — and the recognition that the genuine hitting probability is the minimal non-negative solution belong to the twentieth-century theory of Markov chains, consolidated in the textbook treatments of Feller and, in the form followed here, of Norris ^{[Norris 1997]}, whose §1.3 isolates the minimality principle as the device that selects the probabilistic solution among the affine family the bare equations admit. The identification of hitting probabilities with harmonic functions and mean hitting times with Green potentials places the subject inside the discrete potential theory developed by Doob, Hunt, and Dynkin, where the Markov chain is the probabilistic counterpart of the Laplacian and the boundary-value problems of classical potential theory acquire a sample-path meaning.

Bibliography Master

@book{Norris1997,
  author    = {Norris, James R.},
  title     = {Markov Chains},
  series    = {Cambridge Series in Statistical and Probabilistic Mathematics},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@book{Huygens1657,
  author    = {Huygens, Christiaan},
  title     = {De ratiociniis in ludo aleae},
  publisher = {Elsevier (Leiden)},
  year      = {1657}
}

@book{Durrett2019mc,
  author    = {Durrett, Rick},
  title     = {Probability: Theory and Examples},
  edition   = {5},
  publisher = {Cambridge University Press},
  year      = {2019}
}

@book{LevinPeres2017,
  author    = {Levin, David A. and Peres, Yuval},
  title     = {Markov Chains and Mixing Times},
  edition   = {2},
  publisher = {American Mathematical Society},
  year      = {2017}
}

@book{GrimmettStirzaker2020,
  author    = {Grimmett, Geoffrey R. and Stirzaker, David R.},
  title     = {Probability and Random Processes},
  edition   = {4},
  publisher = {Oxford University Press},
  year      = {2020}
}

@book{Feller1968,
  author    = {Feller, William},
  title     = {An Introduction to Probability Theory and Its Applications, Vol. 1},
  edition   = {3},
  publisher = {Wiley},
  year      = {1968}
}

Prerequisites

37.05.01
37.05.02

Tier anchors

beginner: Norris 1997 *Markov Chains* (Cambridge) §1.3; informal picture of a token wandering between two walls and the chance it reaches one before the other
intermediate: Norris 1997 *Markov Chains* (Cambridge) §1.3-1.4; Durrett 2019 *Probability: Theory and Examples* 5e §5.3 (exit distributions and times)
master: Norris 1997 *Markov Chains* (Cambridge) §1.3-1.4; Levin-Peres 2017 *Markov Chains and Mixing Times* 2e §1.5, Ch. 10 (hitting times); Grimmett-Stirzaker 2020 *Probability and Random Processes* 4e §6.3-6.4

References

Norris — Markov Chains · Cambridge University Press 1997, §1.3-1.4 (hitting probabilities, mean hitting times)
Huygens — De ratiociniis in ludo aleae · Leiden 1657 (gambler's ruin among the five problems appended)
Durrett — Probability: Theory and Examples, 5e · §5.3 (exit distributions and exit times)
Levin-Peres — Markov Chains and Mixing Times, 2e · American Mathematical Society 2017, §1.5, Ch. 10
Grimmett-Stirzaker — Probability and Random Processes, 4e · Oxford University Press 2020, §6.3-6.4 (first passage, gambler's ruin)

Estimated time

beginner: 18m
intermediate: 55m
master: 88m