Ergodicity, Unique Ergodicity, and Equidistribution
Anchor (Master): Walters 1982 *An Introduction to Ergodic Theory* (Springer GTM 79) Ch. 1, Ch. 6 (unique ergodicity and uniform convergence; Weyl's theorem); Einsiedler-Ward 2011 *Ergodic Theory with a View towards Number Theory* (Springer GTM 259) Ch. 4, Ch. 6; Cornfeld-Fomin-Sinai 1982 *Ergodic Theory* (Springer Grundlehren 245) Ch. 3 (toral automorphisms, spectral criterion); Glasner 2003 *Ergodic Theory via Joinings* (AMS) Ch. 4 (ergodic decomposition)
Intuition Beginner
A measure-preserving rule keeps the total volume of states fixed, but that alone does not stop the space from splitting into separate compartments that never talk to each other. Imagine a race track split into two lanes by a wall: a runner started in the inner lane circles there forever and never enters the outer one. The rule preserves length in each lane, yet a runner's long-run behaviour depends entirely on which lane you dropped them into. Ergodicity is the condition that forbids such walls: the only regions the rule leaves completely unchanged are the empty region and the whole space. There are no invariant compartments, so a single trajectory is free to explore everything.
When there are no walls, a beautiful thing happens. Track how often one long trajectory visits a given region. Because the trajectory cannot get trapped anywhere, the fraction of time it spends in a region settles down to the size of that region, the same answer no matter which starting point you chose (apart from a negligible set of exceptions). The average measured along one orbit over time equals the average taken over space all at once. This is the headline slogan of the subject: time average equals space average.
Now sharpen the question. Sometimes a rule preserves not just one notion of volume but exactly one — there is a single way to assign sizes that the rule respects, and no other. Such a rule is called uniquely ergodic, and it is even better behaved: the fraction-of-time answer converges for every starting point, with no exceptions at all, and it converges evenly across all starting points at once. The cleanest example is rotating a circle by an angle that is not a simple fraction of a full turn: the orbit of any point, with no exceptions, spreads perfectly evenly around the circle.
The takeaway: ergodicity rules out invariant walls and makes time-average equal space-average for almost every orbit; unique ergodicity is the stronger, single-invariant-measure version that makes the equality hold for every orbit and evenly. Its most famous payoff is that the points wrapped around the circle land everywhere with perfectly uniform density when is irrational.
Visual Beginner
Picture the circle drawn as a clock face and mark the points you get by repeatedly adding a fixed irrational step, wrapping around each time. Early on the marks look scattered; after many steps they fill the circle with even spacing.
The three panels show the same orbit at , , and steps: scattered, then spreading, then perfectly even. The shaded arc illustrates the punchline — the share of marks falling in any arc approaches that arc's length, identically for every starting point. That even filling is equidistribution, and it is the visible signature of unique ergodicity.
Worked example Beginner
We watch the points fill the circle, using the slightly irrational-looking step for a hand-computable run, and check that an arc collects its fair share.
Step 1. The system. Take the circle as the numbers from up to , wrapping so is the same point as . The rule adds and wraps. Starting from , the orbit is — it cycles through five evenly spaced points . (A truly irrational step never repeats; is rational, so it repeats every five steps, which is enough to show the counting.)
Step 2. The arc to watch. Let be the arc from up to , of length , that is two fifths of the circle. The five visited points include and inside (taking the arc as ), so two of the five points sit in .
Step 3. The fraction of time. Over one full cycle of five steps the orbit lands in on of them, a fraction . Because the orbit just repeats this cycle forever, the long-run fraction of visits to stays exactly .
Step 4. Compare to the size of . The arc has length , which is exactly the long-run fraction of visits we found. The time-average ( of the steps land in ) equals the space-average (the arc is of the circle).
What this tells us: even for this repeating rational example, the share of visits to an arc matches the arc's length. For a genuinely irrational step the orbit never repeats and fills the circle with no gaps, and the same equality — fraction of visits equals length of arc — holds for every starting point and every arc. That is exactly Weyl's equidistribution statement, and it is what unique ergodicity of the irrational rotation delivers.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is a probability space and is measure-preserving, in the sense of 38.04.01: for all . The Koopman operator is the associated isometry.
Definition (ergodicity). The system is ergodic if every almost-invariant set — one with — has . Equivalently (Proposition below), the invariant -algebra is -degenerate, every set in it having measure or .
Proposition (characterisations of ergodicity). The following are equivalent for a measure-preserving system:
- is ergodic (every almost-invariant set is null or conull).
- Every measurable with a.e. is a.e. constant.
- Every with is a.e. constant; i.e. the eigenvalue of is simple, its eigenspace being the constants.
- For all , .
- For every the Birkhoff averages converge a.e. to the constant .
Definition (unique ergodicity). Let be a compact metric space and continuous. Write for the Borel probability measures on and for the -invariant ones. The set is nonempty (Krylov-Bogolyubov, below), convex, and weak- compact. The system is uniquely ergodic if is a singleton: there is exactly one -invariant Borel probability measure . That unique is automatically ergodic, since the extreme points of are precisely the ergodic measures and a one-point convex set is its own unique extreme point.
Definition (equidistribution). A sequence in a compact metric space carrying a probability measure is equidistributed (or -uniformly distributed) if for every continuous ,
For with Lebesgue measure this is equivalent, by 21.15.02, to the Weyl criterion: for every nonzero integer .
Canonical examples. (i) Irrational rotation , : ergodic and uniquely ergodic for Lebesgue measure. (ii) Rational rotation : not ergodic — each orbit is a -point set and the union of finitely many such orbits is invariant. (iii) Doubling map : ergodic for Lebesgue measure but not uniquely ergodic, since Dirac masses on periodic orbits (e.g. ) are also invariant. (iv) Toral automorphism on with : ergodic for Lebesgue measure iff no eigenvalue of is a root of unity. (v) Bernoulli shift : ergodic (indeed mixing) but not uniquely ergodic.
Counterexamples to common slips Intermediate+
Ergodic does not mean uniquely ergodic. The doubling map and every Bernoulli shift are ergodic for their natural measure yet carry many other invariant measures (Dirac masses on periodic points, other Bernoulli laws). Unique ergodicity is the much rarer property that the invariant measure is the only one; it forces the dynamics to be, in a sense, minimal and rigid.
Ergodicity is a property of the pair , not of . The doubling map is ergodic for Lebesgue measure and also "ergodic" for the Dirac mass (vacuously, as has no invariant sets of intermediate measure), but these are different systems. Naming a map ergodic without naming the measure is the standard ambiguity.
The averaging criterion is Cesàro, not termwise. Condition (4) requires the averaged correlations to converge to ; the stronger termwise convergence is mixing, a strictly stronger property than ergodicity. An irrational rotation is ergodic but not mixing.
Equidistribution of needs irrational. For rational the sequence visits only the points and is not equidistributed for Lebesgue measure; it is equidistributed for the uniform measure on those points. The Weyl criterion fails at , where for all .
Unique ergodicity gives convergence for every and continuous only. The uniform-convergence theorem is stated for continuous test functions; for a discontinuous the convergence holds at every provided (a -continuity set), by approximating above and below by continuous functions.
Key theorem with proof Intermediate+
Theorem (Weyl equidistribution via unique ergodicity; Weyl 1916). Let be irrational and the rotation of . Then is uniquely ergodic, its unique invariant measure being Lebesgue measure , and consequently for every the orbit is equidistributed: for every continuous , In particular is equidistributed in .
Proof. First, preserves by translation invariance, and is ergodic for . To see ergodicity, let satisfy , i.e. a.e. Expand in the Fourier basis with . Then has Fourier coefficients , so invariance forces for every . For , irrationality of gives , hence and . Thus is a.e. constant, and by characterisation (3) above is ergodic.
Now we upgrade ergodicity to unique ergodicity together with the uniform convergence. Let be any -invariant Borel probability measure. Its Fourier coefficients satisfy, by invariance, . For , forces , so has the same Fourier coefficients as (, for ). By uniqueness of Fourier coefficients on the circle, . Hence is the only invariant measure: is uniquely ergodic.
For the uniform convergence, verify it first on trigonometric polynomials. For with , a geometric sum gives whose modulus is at most as , uniformly in , while . For the average is identically . By linearity the uniform convergence holds for all trigonometric polynomials. Given a continuous and , Weierstrass approximation supplies a trigonometric polynomial with ; since the Birkhoff averaging operator has operator norm on and likewise, the triangle inequality gives The last term tends to , so the limsup is at most for every , giving uniform convergence for . Taking to approximate from above and below (a Lebesgue-continuity set) yields , the equidistribution of .
Bridge. This theorem builds toward the general principle that unique ergodicity is the dynamical engine of equidistribution, and it appears again in the Master-tier equivalence between unique ergodicity and uniform convergence of Birkhoff averages for all continuous . The foundational reason the proof works is that an invariant measure is pinned down by its action on the characters , and irrationality kills every nonzero character — this is exactly the Fourier-analytic Weyl criterion 21.15.02 read as a statement about invariant measures, so the number-theoretic and the dynamical proofs of Weyl's theorem are dual to one another. The central insight is that ergodicity for gives a.e. convergence by Birkhoff 37.02.03, but unique ergodicity upgrades "almost every " to "every , uniformly," because there is no second invariant measure to host an exceptional orbit. Putting these together, recurrence and the Kac geometry of 38.04.01 guarantee orbits return, ergodicity makes the return frequencies equal to measure, and unique ergodicity makes those frequencies hold pointwise; the bridge is that minimality plus a unique invariant measure forces every orbit to be a faithful sampler of that measure.
Exercises Intermediate+
Advanced results Master
Theorem 1 (characterisations of ergodicity). For a measure-preserving system the conditions are equivalent: () every almost-invariant set is null or conull; () every -invariant measurable function is a.e. constant; () is a simple eigenvalue of the Koopman operator on , with the constants as eigenspace; () for all , ; () the Birkhoff averages of every converge a.e. to . The spectral condition (c) places ergodicity at the bottom of the spectral hierarchy: weak mixing is continuous spectrum on the orthocomplement of the constants, and (strong) mixing strengthens the Cesàro condition (d) to a termwise limit [Birkhoff 1931].
Theorem 2 (ergodicity of the irrational rotation and toral automorphisms). The rotation on is ergodic for Haar measure iff are rationally independent. A toral automorphism is ergodic for Haar measure iff no eigenvalue of is a root of unity, equivalently iff the dual action on has no finite orbit; ergodic toral automorphisms are exactly the hyperbolic (no eigenvalue on the unit circle) ones among those with a root-of-unity-free spectrum, and the hyperbolic ones are mixing of all orders and Bernoulli. The character-orbit proof is the Fourier shadow of the dynamical statement [Walters 1982].
Theorem 3 (unique ergodicity uniform convergence; Oxtoby 1952). A continuous map of a compact metric space is uniquely ergodic iff for every the Birkhoff averages converge uniformly to the constant . Equivalently, the empirical measures converge weak- to uniformly in . Unique ergodicity therefore removes the a.e.-exceptional set of Birkhoff's theorem entirely: every orbit equidistributes, and it does so at a rate independent of the basepoint [Oxtoby 1952].
Theorem 4 (Weyl equidistribution; Weyl 1916). For irrational , the sequence is equidistributed in ; more generally, for a polynomial with at least one of irrational, is equidistributed. The linear case is unique ergodicity of the rotation; the polynomial case is unique ergodicity of a unipotent affine map on (the skew product / Anzai skew shift) together with Weyl differencing 21.15.02. Equidistribution is the analytic face of unique ergodicity, and the Fourier-analytic Weyl criterion is its number-theoretic mirror [Weyl 1916].
Theorem 5 (ergodic decomposition; Krylov-Bogolyubov 1937). Let be a continuous (or Borel) map of a standard Borel space and . Then is a Choquet simplex whose extreme points are exactly the ergodic measures, and has a unique representation as a barycentre over the set of ergodic measures, where is a probability measure on . Concretely there is a measurable map (the conditional measures on the invariant -algebra) with each ergodic and , so that the Birkhoff limit samples the ergodic component through . The non-ergodic theory thereby reduces to the ergodic theory fibre by fibre [Krylov-Bogolyubov 1937].
Synthesis. The five results are one statement at successive depths, and the foundational reason they cohere is that each measures how badly the invariant -algebra can fail to be degenerate. Ergodicity is the demand that be -degenerate, which is exactly the simplicity of the Koopman eigenvalue and exactly the collapse of the Birkhoff limit to a constant; this is the central insight that an invariant set of intermediate measure, an invariant non-constant function, a surviving nonzero Fourier mode, and a non-extreme invariant measure are four names for one obstruction. Unique ergodicity is the case where is a single point, so the simplex of Theorem 5 degenerates and the a.e. statement of Birkhoff becomes the everywhere-and-uniform statement of Oxtoby — this is dual to the spectral picture, where having no invariant sets is replaced by having no invariant measures. Putting these together, Weyl equidistribution generalises from a fact about to the assertion that every orbit of a uniquely ergodic system faithfully samples the unique measure, and the bridge is that the character orbits the Fourier criterion 21.15.02 tracks are precisely the obstructions a unique invariant measure annihilates. The ergodic decomposition is the foundational reason the subject reduces to its ergodic case: any invariant measure is built from ergodic bricks, so theorems for ergodic systems propagate by integration, and the recurrence-and-tower technology of 38.04.01 together with Birkhoff 37.02.03 assembles into the structure of stationary dynamics.
Full proof set Master
Proposition 1 (ergodicity invariant functions constant). A measure-preserving is ergodic iff every with a.e. is a.e. constant.
Proof. () Let be invariant. The super-level sets satisfy (mod ), so each is almost invariant; ergodicity gives . The non-increasing right-continuous -valued function has a unique jump at , and for all forces a.e. () If is almost invariant, is invariant, hence a.e. constant, so .
Proposition 2 (irrational rotation is uniquely ergodic). For irrational , Lebesgue measure is the unique -invariant Borel probability measure of .
Proof. Let . For , invariance gives . For , (irrationality), so ; and . These coincide with the Fourier coefficients of , and a finite Borel measure on the circle is determined by its Fourier coefficients (the characters are dense in , so equality of and on characters extends to all ). Hence .
Proposition 3 (unique ergodicity forces uniform convergence on ). If is uniquely ergodic with invariant measure , then uniformly for every .
Proof. Suppose not: there are , , , and with . The empirical measures have a weak- cluster point . For , , so is invariant, hence by unique ergodicity. But stays from along the subsequence converging to , contradicting weak- convergence.
Proposition 4 (existence of invariant measures; Krylov-Bogolyubov). Every continuous map of a compact metric space has an invariant Borel probability measure.
Proof. For arbitrary set . By weak- compactness of choose . The telescoping identity gives for every ; weak- continuity of then yields .
Proposition 5 (ergodic measures are the extreme points of ). A measure is ergodic iff it is an extreme point of the convex set .
Proof. () Let be ergodic and with , . Then , and the Radon-Nikodym derivative is -invariant: for invariant test sets the pushforward identity gives a.e. Ergodicity makes a.e. constant, so and ; thus is extreme. () If is not ergodic there is an invariant set with ; the conditioned measures and are invariant, distinct, and is a proper convex combination of distinct measures, so is not extreme.
Connections Master
Measure-preserving systems, Poincaré recurrence, and the Kac formula
38.04.01supply the ambient framework: ergodicity is the indecomposability condition layered on top of measure-preservation, the Koopman operator and invariant-set language are imported directly, and the ergodic Kac formula is exactly the statement that an ergodic system's saturation of any positive-measure set is everything. This unit sharpens recurrence into equidistribution.The ergodic theorems of Birkhoff, von Neumann, and Kingman
37.02.03are the analytic engine: characterisation (e) of ergodicity is Birkhoff's theorem with a degenerate invariant -algebra, and the mean ergodic theorem is the statement that the Koopman averages converge to the projection onto the constants. Unique ergodicity is precisely what upgrades Birkhoff's a.e. convergence to the uniform convergence of Oxtoby's theorem.Weyl sums, Weyl differencing, and equidistribution
21.15.02give the number-theoretic mirror: the Weyl criterion is the Fourier-analytic test that this unit derives from unique ergodicity of the rotation, and Weyl differencing extends equidistribution from to polynomial sequences , which correspond dynamically to unipotent skew products on the torus.Mixing and the spectral theory of dynamical systems
38.05.01sit one level above ergodicity: weak mixing is continuous Koopman spectrum on the orthocomplement of the constants, strong mixing strengthens the Cesàro correlation decay (d) to a termwise limit, and the Halmos-von Neumann theorem identifies discrete-spectrum systems with group rotations — the uniquely ergodic rotations of this unit being the prototype.Topological dynamics and minimality
38.01.01interlocks with unique ergodicity: a uniquely ergodic system whose unique measure has full support is strictly ergodic, and minimal isometries (such as irrational rotations) are uniquely ergodic, so the equidistribution proved here is the measurable counterpart of topological minimality.
Historical & philosophical context Master
Hermann Weyl proved the equidistribution of and the general polynomial case in his 1916 Mathematische Annalen paper [Weyl 1916], introducing the exponential-sum criterion that bears his name. Weyl's argument was Fourier-analytic and predated the abstract ergodic theory by fifteen years; the recognition that his theorem is the unique ergodicity of the rotation came only after Birkhoff's 1931 pointwise theorem [Birkhoff 1931] and the operator-theoretic reframing of the early 1930s. The two proofs of Weyl's theorem — the direct estimation of exponential sums and the dynamical deduction from a single invariant measure — remain the standard illustration of the bridge between analytic number theory and ergodic theory, and the dynamical viewpoint generalised, through Furstenberg's work on the skew shift and later through Ratner's measure-classification theorems, into the modern theory of equidistribution on homogeneous spaces.
The notion of unique ergodicity and its equivalence with uniform convergence was isolated by John Oxtoby in 1952 [Oxtoby 1952], building on the Krylov-Bogolyubov existence theorem of 1937 [Krylov-Bogolyubov 1937], which guaranteed at least one invariant measure for any continuous map of a compact space and, with it, the convex-analytic setting in which the ergodic measures are the extreme points. The ergodic decomposition — that every invariant measure is a barycentre of ergodic ones — was developed in the same circle and given its definitive Choquet-simplex formulation in the 1950s and 1960s. The toral-automorphism criterion, that ergodicity is governed by the root-of-unity-freeness of the eigenvalues, was worked out by Halmos and others and connected the spectral theory of the Koopman operator to the arithmetic of the defining integer matrix, a thread that runs forward into the entropy theory of hyperbolic toral automorphisms.
Bibliography Master
@article{Weyl1916,
author = {Weyl, Hermann},
title = {\"Uber die Gleichverteilung von Zahlen mod. Eins},
journal = {Mathematische Annalen},
volume = {77},
number = {3},
year = {1916},
pages = {313--352}
}
@article{Birkhoff1931,
author = {Birkhoff, George D.},
title = {Proof of the ergodic theorem},
journal = {Proceedings of the National Academy of Sciences},
volume = {17},
number = {12},
year = {1931},
pages = {656--660}
}
@article{Oxtoby1952,
author = {Oxtoby, John C.},
title = {Ergodic sets},
journal = {Bulletin of the American Mathematical Society},
volume = {58},
number = {2},
year = {1952},
pages = {116--136}
}
@article{KrylovBogolyubov1937,
author = {Krylov, Nikolai and Bogolyubov, Nikolai},
title = {La th\'eorie g\'en\'erale de la mesure dans son application \`a l'\'etude des syst\`emes dynamiques de la m\'ecanique non lin\'eaire},
journal = {Annals of Mathematics},
volume = {38},
number = {1},
year = {1937},
pages = {65--113}
}
@book{Walters1982,
author = {Walters, Peter},
title = {An Introduction to Ergodic Theory},
publisher = {Springer},
series = {Graduate Texts in Mathematics},
volume = {79},
year = {1982}
}
@book{EinsiedlerWard2011,
author = {Einsiedler, Manfred and Ward, Thomas},
title = {Ergodic Theory with a View towards Number Theory},
publisher = {Springer},
series = {Graduate Texts in Mathematics},
volume = {259},
year = {2011}
}
@book{CornfeldFominSinai1982,
author = {Cornfeld, Isaac P. and Fomin, Sergei V. and Sinai, Yakov G.},
title = {Ergodic Theory},
publisher = {Springer},
series = {Grundlehren der mathematischen Wissenschaften},
volume = {245},
year = {1982}
}
@book{Glasner2003,
author = {Glasner, Eli},
title = {Ergodic Theory via Joinings},
publisher = {American Mathematical Society},
series = {Mathematical Surveys and Monographs},
volume = {101},
year = {2003}
}
@book{Petersen1983,
author = {Petersen, Karl},
title = {Ergodic Theory},
publisher = {Cambridge University Press},
year = {1983}
}