The Poisson Process: Equivalent Characterizations
Anchor (Master): Norris 1997 *Markov Chains* (Cambridge) §2.4; Kingman 1993 *Poisson Processes* (Oxford) §2-§5 (superposition, thinning, the Poisson point process, order statistics); Daley-Vere-Jones 2003 *An Introduction to the Theory of Point Processes* 2e Vol. I Ch. 2
Intuition Beginner
A Poisson process is the simplest model of random events arriving in time: calls reaching a switchboard, clicks of a Geiger counter, customers entering a shop. You scatter points along a time line so events happen at a steady average rate but with no schedule. The single number that controls everything is the rate, written : the average number of events per unit time, so over an hour you expect about events, and the bigger is, the more crowded the line of points becomes.
What makes the model so clean is two assumptions you can picture separately. First, the average rate never changes with time, so a window of a given length is statistically the same wherever you slide it. Second, events in non-overlapping stretches of time have nothing to do with each other: knowing how many calls arrived this morning tells you nothing about how many arrive this afternoon. These two ideas, a steady rate and independent disjoint windows, pin the model down almost completely.
Remarkably, three different-looking descriptions all pin down the same process. You can say how many events fall in each window (the count in a window of length follows the Poisson recipe with average ). You can describe the process instant by instant (in a tiny slice of length , one event appears with chance about , and two-at-once essentially never happen). Or you can describe the gaps between consecutive events (each gap is the memoryless exponential clock, and the gaps are independent and identically distributed). Each description sounds like a separate rule, yet each forces the other two.
Two more operations keep you inside the family. Merge two independent Poisson streams and you get a Poisson stream whose rate is the sum (superposition). Flip a coin at each event and keep only the heads, and the kept events again form a Poisson stream, now with a thinned rate (thinning).
The one-sentence takeaway: a Poisson process drops random points on a time line at average rate with independent disjoint windows, and the same process can be described by Poisson counts, by a tiny-interval rate, or by exponential gaps — three names for one thing, stable under merging and coin-flip thinning.
Visual Beginner
Picture a horizontal time line with events marked as ticks, and read off the three ways to describe it.
Top: the count picture — disjoint windows hold independent Poisson counts with mean . Bottom-left: the gap picture — the spaces between ticks are independent exponential waits. Right: the instant picture — a tiny slice holds at most one event with chance about . The merging and coin-flip panels show that superposition adds rates and thinning scales the rate by the keep-probability.
Worked example Beginner
Suppose a help desk receives calls as a Poisson process at rate calls per hour. We compute a few quantities by hand.
Step 1. The expected number of calls in one hour is , and in a half hour it is . The count in any window of length averages .
Step 2. The chance of exactly calls in one hour uses the Poisson recipe: the chance of events when the average is is . For this is . So about a chance of a silent hour.
Step 3. The chance of exactly call in one hour is .
Step 4. The average gap between consecutive calls is of an hour, that is minutes. The gaps are exponential waits with rate .
Step 5. Now thin the stream: suppose each call is an emergency with probability , independently. The emergencies form their own Poisson process at the thinned rate emergencies per hour, so on average one emergency every hours.
What this tells us: the rate alone fixed the average count, the Poisson recipe gave the exact chances of and calls, the same rate gave the average -minute gap, and a coin flip at each event produced a smaller Poisson process. Every number came from the one parameter and the two simple assumptions.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, events occur in continuous time on a probability space . The counting process is , where records the number of events in ; it is right-continuous, non-decreasing, integer-valued, with , and increases by unit jumps. The arrival times are for (with ), and the inter-arrival times (gaps) are for . The counting process and the arrival sequence are inverse to one another: and . The recollections of 26.02.01 — the Poisson law with and the exponential law with tail — are used freely, as is the continuous-time Markov apparatus of 37.05.08.
Fix a rate . The three definitions below are stated separately; the Key theorem proves they describe the same law.
Definition A (independent stationary increments with Poisson marginals). has independent increments if for the increments are independent, and *stationary increments* if the law of depends only on . The process is a *Poisson process of rate * under Definition A if it has independent stationary increments and for all , .
Definition B (infinitesimal / Markov definition). is a Poisson process of rate under Definition B if it has independent stationary increments, , and as , uniformly in ,
Equivalently, is the continuous-time Markov chain on whose -matrix 37.05.08 has , , and otherwise: a pure birth chain with every birth rate equal to .
Definition C (i.i.d. exponential inter-arrival times). is a Poisson process of rate under Definition C if the inter-arrival times are independent and identically -distributed, and with . Equivalently, the arrival times are the partial sums of an i.i.d. sequence, so with density on .
Counterexamples to common slips Intermediate+
Stationary increments alone do not give Poisson. A process can have stationary independent increments yet have Bernoulli- or compound-distributed; the Poisson marginal is an additional input in Definition A. What Definition B shows is that the infinitesimal one-at-a-time condition forces the marginal to be Poisson, so the marginal is not free once the rare-jump assumption is imposed.
Memorylessness is what couples the three views. The exponential gaps of Definition C are the only continuous inter-arrival law compatible with the lack of memory that independent increments demand; replacing by any other gap law (a renewal process) keeps the i.i.d.-gap structure but destroys both the independent-increments property and the Poisson marginals.
The order-statistics property is conditional, not unconditional. Given , the arrival times are distributed as the order statistics of i.i.d. points — but unconditionally the arrival times are spread out by the Gamma laws, not uniform. The uniformity appears only after conditioning on the count.
Thinning needs independent marking. Keeping each event with probability yields a rate- Poisson process only when the marks are independent of each other and of the process. Marking that depends on the inter-arrival times (e.g. keep an event iff its gap exceeds a threshold) produces a non-Poisson kept stream.
Key theorem with proof Intermediate+
Theorem (equivalence of the three definitions). For a counting process with and a fixed rate , Definitions A, B, and C are equivalent: each implies the other two, and all three determine the same law on path space [Norris 1997 §2.4].
Proof. We prove C A B C.
Step 1 (C A: Poisson marginal from exponential gaps). Assume the gaps are i.i.d. , so . Using and conditioning on with independent, and , so , i.e. . For the increments and their independence, the memoryless property of is decisive: at any fixed time , conditional on the past, the residual gap is again and independent of , because erases the elapsed wait. Hence the post- process is a fresh copy of the same exponential-gap construction, independent of the past and with the same law shifted by ; this is precisely stationary independent increments, and the marginal computation gives . Thus C A.
Step 2 (A B: the infinitesimal rates). Assume Definition A. Stationary independent increments are inherited directly. From ,
the last by Taylor expansion . Stationarity makes these estimates uniform in . These are exactly the infinitesimal conditions of Definition B, and they identify the generator entries , : a pure birth chain with constant rate in the sense of 37.05.08. Thus A B.
Step 3 (B C: exponential gaps from the Markov structure). Assume Definition B. Read as the continuous-time Markov chain with -matrix , 37.05.08. Its holding time in each state is with , and the jump chain is deterministic, . By the jump-chain/holding-time construction of 37.05.08, the times spent in successive states are independent variables, and these times are exactly the inter-arrival gaps . Hence are i.i.d. and , which is Definition C. Since each birth rate equals the constant and , the chain is non-explosive 37.05.08, so for all almost surely and the construction is consistent on all of . Thus B C, closing the cycle.
Bridge. This equivalence builds toward the entire theory of point processes and appears again in the order-statistics property, superposition, and thinning, each of which is most transparent in whichever of the three pictures suits it. The foundational reason the three definitions coincide is that the memoryless exponential is the unique continuous inter-arrival law the independent-increments structure permits: this is exactly the same uniqueness that fixed the holding-time law in the continuous-time Markov chain of 37.05.08, so the Poisson process is the constant-rate pure birth chain viewed three ways. The count picture (Definition A) generalises to the spatial Poisson point process where time is replaced by a measure space; the infinitesimal picture (Definition B) is dual to the forward equation that the Poisson semigroup satisfies; the gap picture (Definition C) is the renewal-theoretic skeleton. Putting these together, the three definitions are one law, and the central insight is that the rate is the single invariant tying the Poisson count, the infinitesimal rate, and the exponential gap into a single object.
Exercises Intermediate+
Advanced results Master
Beyond the equivalence, the theory organizes around the conditioning (order-statistics) property, the closure operations of superposition and thinning, the compound process built by attaching marks, and the passage from the time line to a general measure space, where the Poisson process becomes the Poisson point process whose count picture is the only one of the three definitions that survives.
Theorem 1 (order-statistics / conditioning property). Let be a rate- Poisson process. Conditional on , the vector of arrival times has the distribution of the order statistics of i.i.d. random variables; equivalently, its conditional density is on the simplex . The conditional law is free of . This is the precise sense in which a Poisson process is "completely random": once the number of points in a window is fixed, their positions are as uniform and exchangeable as possible.
Theorem 2 (superposition). If are independent Poisson processes of rates , their superposition is a Poisson process of rate . The construction is associative and commutative, so the Poisson processes of all rates form a one-parameter convolution semigroup under merging, the rate being the additive label. Each contributing point can be tagged by which stream produced it, and given a point of the tag is , independent across points — the inverse operation of thinning.
Theorem 3 (thinning / colouring). Let be a rate- Poisson process and mark each point independently with one of colours, colour with probability (). Then the colour- points form a Poisson process of rate , and the coloured processes are mutually independent. The independence is the surprising half: although the coloured counts in a fixed window sum to the total count, decoupling them by independent marking removes all dependence, because conditional on the colour assignment is multinomial and the Poisson mixing diagonalises the joint generating function into a product.
Theorem 4 (compound Poisson process). Attach to the -th point an i.i.d. mark , independent of , and set . Then has stationary independent increments with characteristic function , where . Its mean is and variance . The compound Poisson laws are exactly the infinitely divisible laws with finite Lévy measure , and they are the pure-jump building blocks of the Lévy–Khintchine representation: every Lévy process is a Brownian motion with drift plus a (possibly infinite-activity) limit of compound Poisson processes.
Theorem 5 (spatial Poisson point process). Let be a -finite measure space. A Poisson point process with mean measure is a random integer-valued measure such that (i) for disjoint the counts are independent (complete independence), and (ii) for every with . Existence holds for every -finite , and when with this recovers the rate- Poisson process of Definitions A–C. On a general space only the count picture (Definition A) generalises: there is no linear order, so neither the infinitesimal-rate description (Definition B) nor the inter-arrival-gap description (Definition C) has an unconditional analogue, though the order-statistics property persists as the statement that, given , the points are i.i.d. with law .
Synthesis. The foundational reason the subject coheres is that one rate — promoted to a mean measure in the spatial setting — encodes the entire law, and putting these together, the three definitions are exactly the same information presented as a count law, an infinitesimal rate, and an exponential gap law, with the memoryless exponential the unique gap the independent-increments structure allows. The order-statistics property is the central insight that conditioning on the count strips out the rate and leaves uniform, exchangeable positions: this is exactly what makes superposition and thinning transparent, since merging adds rates and colouring multiplies the rate by the colour probability, and these two operations are dual — superposition is the inverse of thinning, the colour tag of a merged point being the rate share. The compound Poisson process generalises the count by attaching i.i.d. marks, and this is dual to the Lévy–Khintchine description in which compound Poisson laws are the finite-activity pure-jump pieces of every infinitely divisible law. The bridge to the general theory is the spatial Poisson point process, where the count definition is the foundational reason the construction survives the loss of a time order: complete independence plus Poisson marginals defines a random measure on any -finite space, and the rate- process on the half-line is its one-dimensional shadow with .
Full proof set Master
Proposition 1 (order-statistics property, general ). For a rate- Poisson process, conditional on the joint density of is on , the order-statistics law of i.i.d. points.
Proof. The gaps are i.i.d. with joint density . The map with has unit Jacobian, so has density on . The event is . Integrating out over gives , so the density of restricted to is on the simplex , constant in the . Dividing by yields the conditional density , independent of . This is the order-statistics density of i.i.d. variables.
Proposition 2 (sum of independent Poisson laws). If and are independent, then .
Proof. The probability generating function of is . By independence , the generating function of . Generating functions determine the law of a non-negative integer variable, so .
Proposition 3 (thinning produces independent Poisson streams). Let be rate- Poisson and colour each point red with probability , blue with probability , independently. Then are independent with , .
Proof. Condition on : the colours are i.i.d., so is , i.e. . Then using and . The joint mass factors into the product of the two marginals, so and are independent with the stated Poisson laws. Independent increments across disjoint windows transfer from because the colouring is independent across points.
Proposition 4 (compound Poisson characteristic function and moments). For with i.i.d. of law independent of the rate- process , , , and .
Proof. Condition on and use independence of the : . Averaging over , Write for the cumulant exponent. Then gives , and gives the second cumulant , since the variance is the second cumulant.
Proposition 5 (existence of the spatial Poisson point process). For any -finite measure on there exists a point process with complete independence and for .
Proof. First suppose . Draw and, given , place points i.i.d. with law ; set . For disjoint with union of complement-aware total, the conditional counts are multinomial, and the Poisson mixing factorises exactly as in Proposition 3 (the -colour version): are independent with . For -finite , partition with , build independent finite Poisson processes on each piece, and superpose: . Complete independence and the Poisson marginals are preserved under the disjoint superposition by Proposition 2 applied countably (the counts over any split as an independent sum of Poisson variables with summable means ).
Connections Master
The continuous-time Markov chain framework
37.05.08is the parent structure: the Poisson process is the constant-rate pure birth chain with -matrix , , so its exponential holding times are the inter-arrival gaps, its deterministic jump chain is the march , and its non-explosion () is what makes finite for all ; the equivalence B C is exactly the jump-chain/holding-time construction specialised to constant rate.The elementary probability rules and named distributions
26.02.01supply the Poisson and exponential laws, the Gamma arrival-time law as a sum of exponentials, the Binomial-Poisson thinning identity, and the generating-function and conditioning machinery (multiplication rule, total variance) that every proof here runs on; the Poisson process is the canonical lift of those static distributions into a process indexed by continuous time.The order-statistics property points forward to the general theory of exchangeability and to the spatial point process: given the count, the points are exchangeable and uniform, which is the discrete shadow of the statement that a Poisson point process conditioned on its count is an i.i.d. sample from the normalised mean measure, connecting this unit to the measure-theoretic point-process formalism
37.01.01.
Historical & philosophical context Master
The Poisson distribution appears in Siméon Denis Poisson's 1837 treatise on the probability of judgments [Poisson 1837], derived as the limit of the binomial when the number of trials grows and the success probability shrinks with their product held fixed. Poisson studied the distribution, not the process: the dynamical object — random points scattered in time with independent increments — was assembled later, as the rare-event limit of the binomial was recognised as the marginal of a process whose disjoint windows are independent.
The process itself crystallised in early-twentieth-century applied work. Ladislaus Bortkiewicz's 1898 Das Gesetz der kleinen Zahlen popularised the Poisson law for rare events, Agner Krarup Erlang's 1909 telephone-traffic studies introduced the exponential inter-arrival and Poisson-arrival model that founded queueing theory, and Ernest Rutherford and Hans Geiger's 1910 radioactive-decay counts gave the canonical physical realisation, with the exponential gaps between clicks. The unification of the count, infinitesimal, and gap descriptions into the single object treated here, and the closure properties of superposition and thinning, are the modern synthesis of Norris [Norris 1997] and Kingman [Kingman 1993].
The spatial generalisation is due in its modern form to John Frank Charles Kingman, whose 1993 monograph [Kingman 1993] organised the Poisson point process on an abstract measure space around complete independence and the Poisson marginal, with the mean measure replacing the rate; the compound Poisson process and its role as the finite-activity piece of the Lévy–Khintchine decomposition trace to the work on infinitely divisible laws and were brought into statistical modelling by, among others, Lewis's 1964 branching-Poisson failure model [Lewis 1964].
Bibliography Master
@book{Norris1997,
author = {Norris, James R.},
title = {Markov Chains},
series = {Cambridge Series in Statistical and Probabilistic Mathematics},
publisher = {Cambridge University Press},
year = {1997}
}
@book{Kingman1993,
author = {Kingman, John Frank Charles},
title = {Poisson Processes},
series = {Oxford Studies in Probability},
publisher = {Oxford University Press},
year = {1993}
}
@book{Poisson1837,
author = {Poisson, Sim\'eon Denis},
title = {Recherches sur la probabilit\'e des jugements en mati\`ere criminelle et en mati\`ere civile},
publisher = {Bachelier},
address = {Paris},
year = {1837}
}
@book{DaleyVereJones2003,
author = {Daley, Daryl J. and Vere-Jones, David},
title = {An Introduction to the Theory of Point Processes, Volume I: Elementary Theory and Methods},
edition = {2},
publisher = {Springer},
year = {2003}
}
@article{Lewis1964,
author = {Lewis, Peter A. W.},
title = {A branching {P}oisson process model for the analysis of computer failure patterns},
journal = {Journal of the Royal Statistical Society, Series B},
volume = {26},
year = {1964},
pages = {398--456}
}