07.05.05 · representation-theory / symmetric

Random walk on a finite group; Upper Bound Lemma

shipped · 3 tiers · Lean: none

Anchor (Master): Diaconis-Shahshahani 1981 Z. Wahrsch. Verw. Gebiete 57; Diaconis 1988 Group Representations in Probability and Statistics; Saloff-Coste 2004 Random Walks on Finite Groups

Intuition [Beginner]

A random walk on a finite group works like this: pick a random group element at each step, and multiply it onto your current position. After many steps, your position becomes harder to predict — the distribution of where you might be gets closer and closer to uniform (every element equally likely). The mixing time is the number of steps needed to get "close enough" to uniform.

The surprise is that representation theory — the study of how groups act linearly on vector spaces — gives sharp answers to "how many steps?" The key tool is the character: a number attached to each group element that records how a representation acts. Characters of the non-principal representations act like "frequencies" that decay over the course of the walk, and the rate of decay controls the mixing time.

Why does this concept exist? Because the question "how fast does a random walk mix?" reduces to "how fast do the non-principal characters decay?", which is a purely representation-theoretic computation.

Visual [Beginner]

A diagram showing the distribution of a random walk on the cyclic group $\mathbb{Z}/5\mathbb{Z}$ converging toward the flat uniform line. After one step the distribution is concentrated on two points; after three steps it is spread out but uneven; after eight steps it is nearly flat.

The tall bars shrink and the short bars grow as the walk progresses, converging toward the uniform height of $1/5$ .

Worked example [Beginner]

Consider the random walk on the cyclic group $\mathbb{Z}/3\mathbb{Z} = \{0, 1, 2\}$ with addition modulo 3. At each step, move forward by 1 or backward by 1, each with probability $1/2$ . Start at 0.

Step 1. After one step: position is 1 with probability $1/2$ , position is 2 with probability $1/2$ . Uniform distribution would give probability $1/3$ to each position.

Step 2. After two steps: from position 1 you reach 0 or 2, and from position 2 you reach 0 or 1. So the probability of being at 0 is $1/2$ , at 1 is $1/4$ , at 2 is $1/4$ . The total variation distance from uniform is $\frac{1}{2} (∣1/2 - 1/3∣ + ∣1/4 - 1/3∣ + ∣1/4 - 1/3∣) = 1/6 \approx 0.167$ .

Step 3. After three steps: convolution gives probability $1/4$ at position 0 and $3/8$ at each of positions 1 and 2. The total variation distance is $1/12 \approx 0.083$ .

What this tells us: the distance from uniform halves at each step. The characters of $\mathbb{Z}/3\mathbb{Z}$ are complex numbers of modulus 1; the non-principal characters evaluate to $-1/2$ on the step distribution, and the distance from uniform is controlled by $∣ -1/2 ∣^{k} = (1/2)^{k}$ .
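The arithmetic of the three steps above can be replayed exactly. The following is a minimal sketch using Python's standard `fractions` module; the helper names `convolve` and `tv_to_uniform` are illustrative choices, not part of the unit.

```python
from fractions import Fraction

def convolve(p, q, n):
    """Convolution on Z/nZ: (p*q)(g) = sum over h of p(h) q(g - h)."""
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

def tv_to_uniform(p):
    """Total variation distance between p and the uniform distribution."""
    u = Fraction(1, len(p))
    return sum(abs(x - u) for x in p) / 2

n = 3
# One step: move by +1 or -1 (that is, land on 1 or 2), each with probability 1/2.
step = [Fraction(0), Fraction(1, 2), Fraction(1, 2)]

dist = step
distances = []
for k in range(1, 5):
    distances.append(tv_to_uniform(dist))
    print(k, [str(x) for x in dist], "TV =", distances[-1])
    dist = convolve(dist, step, n)
```

Exact rational arithmetic is used so that the halving of the distance is visible without rounding noise.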

Formal definition [Intermediate+]

Let $G$ be a finite group and $Q : G \to [0, 1]$ a probability measure on $G$ (so $Q (g) \geq 0$ for all $g$ and $\sum_{g} Q (g) = 1$ ). The random walk driven by $Q$ is the Markov chain with transition kernel $K (x, y) = Q (x^{- 1} y)$ . The distribution after $k$ steps starting from the identity is the $k$ -fold convolution power $Q^{* k}$ , defined recursively by

$$Q^{*1} = Q, \qquad Q^{*(k+1)}(g) = \sum_{h \in G} Q^{*k}(h)\, Q(h^{-1} g).$$

Let $U$ denote the uniform distribution on $G$ : $U (g) = 1/∣ G ∣$ for all $g$ . The total variation distance between $Q^{* k}$ and $U$ is

$$∥ Q^{*k} - U ∥_{TV} = \frac{1}{2} \sum_{g \in G} ∣ Q^{*k}(g) - U(g) ∣ = \max_{A \subseteq G} ∣ Q^{*k}(A) - U(A) ∣.$$

The mixing time for threshold $ε > 0$ is

$$t_{mix}(ε) = \min \{ k : ∥ Q^{*k} - U ∥_{TV} \leq ε \}.$$
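For a small cyclic group the mixing time can be found by brute force straight from these definitions. This is a sketch under stated assumptions: the group is $\mathbb{Z}/n\mathbb{Z}$, a measure is a list of length $n$, and `mixing_time` with its `max_steps` cutoff is a hypothetical helper introduced here.

```python
from fractions import Fraction

def convolve(p, q):
    """Convolution on Z/nZ, where n = len(p)."""
    n = len(p)
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

def tv_to_uniform(p):
    """Total variation distance to the uniform distribution."""
    u = Fraction(1, len(p))
    return sum(abs(x - u) for x in p) / 2

def mixing_time(step, eps, max_steps=10_000):
    """Smallest k with TV(step^{*k}, U) <= eps, or None if not reached in max_steps."""
    dist = step
    for k in range(1, max_steps + 1):
        if tv_to_uniform(dist) <= eps:
            return k
        dist = convolve(dist, step)
    return None

# The +1 / -1 walk on Z/3Z from the worked example.
step3 = [Fraction(0), Fraction(1, 2), Fraction(1, 2)]
print(mixing_time(step3, Fraction(1, 4)))
print(mixing_time(step3, Fraction(1, 100)))
```

The `max_steps` cutoff matters because a walk that is not ergodic never reaches a small threshold, and the search would otherwise not terminate.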

Definition (Fourier transform at a representation). Let $ρ$ be an irreducible unitary representation of $G$ of dimension $d_{ρ}$ . The Fourier transform of $Q$ at $ρ$ is the $d_{ρ} \times d_{ρ}$ matrix

$$\hat{Q}(ρ) = \sum_{g \in G} Q(g)\, ρ(g).$$

Convolution powers become matrix powers: $\widehat{Q^{*k}}(ρ) = \hat{Q}(ρ)^{k}$ (see Proposition 1).
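On a cyclic group every irreducible representation is a one-dimensional character, so the identity for convolution powers can be checked numerically with complex numbers. A minimal sketch, assuming the characters of $\mathbb{Z}/n\mathbb{Z}$ are $χ_m(g) = e^{2πi m g/n}$; the function names are illustrative.

```python
import cmath

def fourier(p, m):
    """Fourier transform of p at the character chi_m(g) = exp(2*pi*i*m*g/n)."""
    n = len(p)
    return sum(p[g] * cmath.exp(2j * cmath.pi * m * g / n) for g in range(n))

def convolve(p, q):
    """Convolution on Z/nZ, where n = len(p)."""
    n = len(p)
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

n, k = 5, 4
step = [0.0, 0.5, 0.0, 0.0, 0.5]      # move by +1 or -1 on Z/5Z

power = step                           # will hold the k-fold convolution power
for _ in range(k - 1):
    power = convolve(power, step)

for m in range(n):
    lhs = fourier(power, m)            # transform of the convolution power
    rhs = fourier(step, m) ** k        # k-th power of the transform
    print(m, abs(lhs - rhs) < 1e-12)
```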

Counterexamples to common slips

Convergence without generation. If the support of $Q$ does not generate all of $G$ , the walk never leaves the proper subgroup generated by $supp (Q)$ , so $Q^{* k}$ does not converge to $U$ . Generation is necessary for ergodicity but not sufficient: the walk must also be aperiodic.
TV versus $L^{2}$ . Total variation distance and $L^{2}$ distance measure different things. An $L^{2}$ bound, computed via Plancherel, implies a TV bound via Cauchy-Schwarz, but a TV bound does not imply an $L^{2}$ bound of comparable size.
Aperiodicity. Even when $supp (Q)$ generates $G$ , the walk may be periodic; this happens when $supp (Q)$ lies in a single coset of a proper normal subgroup. Symmetry of $Q$ does not prevent this: the random transposition walk without holding alternates between odd and even permutations (Exercise 5). Adding a holding probability $Q (e) > 0$ (where $e$ is the identity) guarantees aperiodicity.
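The first and third slips can be seen on $\mathbb{Z}/4\mathbb{Z}$. The sketch below compares three hypothetical step distributions chosen for illustration: one whose support does not generate the group, one that is periodic, and a lazy version that holds with probability $1/2$.

```python
from fractions import Fraction

def convolve(p, q):
    """Convolution on Z/nZ, where n = len(p)."""
    n = len(p)
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

def tv_to_uniform(p):
    """Total variation distance to the uniform distribution."""
    u = Fraction(1, len(p))
    return sum(abs(x - u) for x in p) / 2

def tv_after(step, k):
    """TV distance to uniform after k steps of the walk driven by step."""
    dist = step
    for _ in range(k - 1):
        dist = convolve(dist, step)
    return tv_to_uniform(dist)

half, quarter = Fraction(1, 2), Fraction(1, 4)
not_generating = [half, 0, half, 0]      # steps 0 or 2: support generates only {0, 2}
periodic = [0, half, 0, half]            # steps +1 or -1: parity alternates
lazy = [half, quarter, 0, quarter]       # hold w.p. 1/2, else +1 or -1

for name, step in [("not generating", not_generating),
                   ("periodic", periodic),
                   ("lazy", lazy)]:
    print(name, [str(tv_after(step, k)) for k in (1, 2, 5, 10)])
```

Only the lazy walk approaches uniform; the other two stay at distance $1/2$ for every $k$.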

Key theorem with proof [Intermediate+]

Theorem (Upper Bound Lemma — Diaconis-Shahshahani 1981). Let $G$ be a finite group, $Q$ a probability measure on $G$ , and $U$ the uniform distribution. Then

$$4 ∥ Q^{*k} - U ∥_{TV}^{2} \leq \sum_{ρ \neq 1} d_{ρ} ∥ \hat{Q}(ρ)^{k} ∥_{HS}^{2}$$

where the sum ranges over all non-principal irreducible representations $ρ$ of $G$ , $d_{ρ} = dim ρ$ , and $∥ \cdot ∥_{HS}$ is the Hilbert-Schmidt (Frobenius) norm.

Proof. Set $f = Q^{* k}$ . The argument has four steps.

Step 1 (TV to $L^{1}$ ). By definition,

$$∥ f - U ∥_{TV} = \frac{1}{2} ∥ f - U ∥_{1}, \qquad ∥ f - U ∥_{1} = \sum_{g \in G} ∣ f(g) - U(g) ∣.$$

Squaring both sides:

$$4 ∥ f - U ∥_{TV}^{2} = ∥ f - U ∥_{1}^{2}.$$

Step 2 ( $L^{1}$ to $L^{2}$ via Cauchy-Schwarz). Apply the Cauchy-Schwarz inequality to the sum $\sum_{g} ∣ f (g) - U (g) ∣$ against the constant function 1:

$$∥ f - U ∥_{1}^{2} \leq ∣G∣ \cdot \sum_{g \in G} ∣ f(g) - U(g) ∣^{2} = ∣G∣ \cdot ∥ f - U ∥_{2}^{2}.$$

Step 3 ( $L^{2}$ to characters via Plancherel). The Plancherel formula for a finite group states that for any function $h : G \to \mathbb{C}$ ,

$$\sum_{g \in G} ∣ h(g) ∣^{2} = \frac{1}{∣G∣} \sum_{ρ} d_{ρ} ∥ \hat{h}(ρ) ∥_{HS}^{2}.$$

Apply to $h = f - U$ . The Fourier transform of $f - U$ at $ρ$ is $\hat{f} (ρ) - \hat{U} (ρ)$ . For the principal representation $ρ_{1}$ : $\hat{f} (ρ_{1}) = \sum_{g} f (g) = 1$ (since $f$ is a probability measure) and $\hat{U} (ρ_{1}) = 1$ , so $\widehat{f - U} (ρ_{1}) = 0$ . For $ρ \neq 1$ : $\hat{U} (ρ) = \frac{1}{∣ G ∣} \sum_{g} ρ (g) = 0$ by orthogonality of matrix coefficients (see 07.01.04). Hence $\widehat{f - U} (ρ) = \hat{f} (ρ) = \hat{Q} (ρ)^{k}$ . Plancherel gives

$$∥ f - U ∥_{2}^{2} = \frac{1}{∣G∣} \sum_{ρ \neq 1} d_{ρ} ∥ \hat{Q}(ρ)^{k} ∥_{HS}^{2}.$$

Step 4 (Combine). Substituting the Plancherel identity of Step 3 into the Cauchy-Schwarz bound of Step 2, and using Step 1:

$$4 ∥ f - U ∥_{TV}^{2} \leq ∣G∣ \cdot ∥ f - U ∥_{2}^{2} = \sum_{ρ \neq 1} d_{ρ} ∥ \hat{Q}(ρ)^{k} ∥_{HS}^{2}. \qquad □$$
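The inequality can be checked numerically on a cyclic group, where every $d_{ρ} = 1$ and the Hilbert-Schmidt norm of a $1 \times 1$ matrix is an absolute value. A minimal sketch, assuming the $\pm 1$ walk on $\mathbb{Z}/5\mathbb{Z}$ as the test case; `ubl_rhs` is an illustrative name.

```python
import cmath

def convolve(p, q):
    """Convolution on Z/nZ, where n = len(p)."""
    n = len(p)
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

def tv_to_uniform(p):
    """Total variation distance to the uniform distribution."""
    return sum(abs(x - 1 / len(p)) for x in p) / 2

def fourier(p, m):
    """Fourier transform at the character chi_m(g) = exp(2*pi*i*m*g/n)."""
    n = len(p)
    return sum(p[g] * cmath.exp(2j * cmath.pi * m * g / n) for g in range(n))

def ubl_rhs(step, k):
    """Sum over non-principal characters of |Q^(chi_m)|^(2k); every d_rho is 1."""
    return sum(abs(fourier(step, m)) ** (2 * k) for m in range(1, len(step)))

step = [0.0, 0.5, 0.0, 0.0, 0.5]       # move by +1 or -1 on Z/5Z
dist = step
checks = []
for k in range(1, 11):
    lhs = 4 * tv_to_uniform(dist) ** 2  # left side of the Upper Bound Lemma
    rhs = ubl_rhs(step, k)              # right side of the Upper Bound Lemma
    checks.append(lhs <= rhs + 1e-12)
    print(k, round(lhs, 6), round(rhs, 6))
    dist = convolve(dist, step)
```

The gap between the two columns comes entirely from the Cauchy-Schwarz step; the Plancherel step is an equality.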

Bridge. The Upper Bound Lemma builds toward the riffle shuffle analysis where the character sum is evaluated for $S_{n}$ with representations indexed by partitions, and appears again in the cutoff phenomenon where the bound is shown to be sharp. The foundational reason the bound works is the Plancherel isomorphism between $L^{2} (G)$ and the direct product of matrix algebras, which is exactly the non-abelian Fourier transform of 07.01.09. The bridge is that the character orthogonality relations of 07.01.04 kill the principal representation contribution, leaving only the non-principal terms that decay under convolution powers.

Exercises [Intermediate+]

Exercise 3 (medium, symbolic).

Let $Q$ be a probability measure on a finite group $G$ supported on a single conjugacy class $C$ with $Q (c) = 1/∣ C ∣$ for $c \in C$ . Show that $\hat{Q} (ρ)$ is a scalar matrix for every irreducible representation $ρ$ , and express the scalar in terms of the character $χ_{ρ} (C)$ and the dimension $d_{ρ}$ .

Hint

Show that $\hat{Q} (ρ)$ commutes with every $ρ (h)$ by using that conjugation by $h$ permutes the elements of $C$ . Then invoke Schur's lemma.

Answer

For any $h \in G$ , conjugation by $h$ permutes $C$ , so $ρ (h) \hat{Q} (ρ) ρ (h)^{- 1} = \hat{Q} (ρ)$ . This means $\hat{Q} (ρ)$ commutes with every $ρ (h)$ , i.e., $\hat{Q} (ρ) \in End_{G} (ρ)$ . By Schur's lemma, $\hat{Q} (ρ) = c \cdot I_{d_{ρ}}$ for some scalar $c$ . Taking traces: $c \cdot d_{ρ} = tr (\hat{Q} (ρ)) = \sum_{g \in C} Q (g) χ_{ρ} (g) = \frac{∣ C ∣}{∣ C ∣} χ_{ρ} (C) = χ_{ρ} (C)$ . Hence $c = χ_{ρ} (C) / d_{ρ}$ and $\hat{Q} (ρ) = \frac{χ _{ρ} ( C )}{d _{ρ}} I_{d_{ρ}}$ .
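The conclusion of Exercise 3 can be checked by hand-sized computation on $S_{3}$. The sketch below builds the two-dimensional standard representation on the sum-zero plane of $\mathbb{Q}^{3}$; the basis $b_1 = e_0 - e_1$, $b_2 = e_1 - e_2$ is an assumption made here for convenience. It is not an orthonormal basis, so the matrices are not unitary, but being a scalar matrix does not depend on the basis.

```python
from fractions import Fraction
from itertools import permutations

G = list(permutations(range(3)))

def std_matrix(perm):
    """Matrix of perm on the sum-zero plane in the basis b1 = e0 - e1, b2 = e1 - e2."""
    columns = []
    for i, j in [(0, 1), (1, 2)]:
        v = [0, 0, 0]
        v[perm[i]] += 1              # perm sends e_i to e_{perm[i]}
        v[perm[j]] -= 1
        # a sum-zero vector (v0, v1, v2) equals v0*b1 + (v0 + v1)*b2
        columns.append((v[0], v[0] + v[1]))
    return [[columns[0][0], columns[1][0]],
            [columns[0][1], columns[1][1]]]

def class_fourier(conj_class):
    """Fourier transform at the standard rep of the uniform measure on conj_class."""
    weight = Fraction(1, len(conj_class))
    total = [[Fraction(0), Fraction(0)], [Fraction(0), Fraction(0)]]
    for g in conj_class:
        M = std_matrix(g)
        for r in range(2):
            for c in range(2):
                total[r][c] += weight * M[r][c]
    return total

def fixed_points(g):
    return sum(g[i] == i for i in range(3))

transpositions = [g for g in G if fixed_points(g) == 1]
three_cycles = [g for g in G if fixed_points(g) == 0]

for name, cls in [("transpositions", transpositions), ("3-cycles", three_cycles)]:
    chi = sum(std_matrix(cls[0])[r][r] for r in range(2))   # character = trace
    print(name, class_fourier(cls), "chi/d =", Fraction(chi, 2))
```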

Exercise 4 (medium, multiple choice).

In the Upper Bound Lemma $4∥ Q^{* k} - U ∥_{TV}^{2} \leq \sum_{ρ \neq 1} d_{ρ} ∥ \hat{Q} (ρ)^{k} ∥_{HS}^{2}$ , the sum excludes the principal representation $ρ_{1}$ . Why?

A. The principal representation has dimension 0
B. $\hat{Q} (ρ_{1}) = 1$ for any probability measure, so $\hat{Q} (ρ_{1})^{k} = 1$ and its contribution cancels with $U$
C. The Hilbert-Schmidt norm is undefined for the principal representation
D. The principal representation does not contribute to total variation distance by definition

Hint

Consider what happens when you compute the Fourier transform of $f - U$ at the principal representation.

Answer

B. The Fourier transform of $U$ at $ρ_{1}$ is 1, and the Fourier transform of $Q^{* k}$ at $ρ_{1}$ is also 1 (it sums the probability measure). So $\widehat{Q^{* k} - U} (ρ_{1}) = 0$ and the principal representation contributes nothing. The other options fail: A is false ( $d_{ρ_{1}} = 1$ ); C is false (the HS norm is defined for any matrix); D misstates the reason.

Exercise 5 (medium, numeric).

For the random transposition walk on $S_{3}$ , the non-principal irreducible representations are the sign representation (dimension 1, character $- 1$ on transpositions) and the standard representation (dimension 2, character $0$ on transpositions). Compute the upper bound on $∥ Q^{* k} - U ∥_{TV}$ from the Upper Bound Lemma for $k = 2$ . Give a numerical approximation to two decimal places.

Hint

For the sign representation: $\hat{Q} (ρ_{sgn}) = χ_{sgn} (transp) / d_{sgn} = - 1/1 = - 1$ . So $∥ \hat{Q} (ρ_{sgn})^{2} ∥_{HS}^{2} = ∣ (- 1)^{2} ∣^{2} = 1$ , and the contribution is $d_{sgn} \cdot 1 = 1$ . For the standard representation: $\hat{Q} (ρ_{std}) = 0/2 = 0$ .

Answer

For the sign representation: $\hat{Q} (ρ_{sgn}) = χ_{sgn} (C) / d_{sgn} = (- 1) /1 = - 1$ . Then $∥ \hat{Q} (ρ_{sgn})^{2} ∥_{HS}^{2} = ∣ (- 1)^{2} ∣^{2} = 1$ . Contribution: $d_{sgn} \cdot 1 = 1$ .

For the standard representation: $\hat{Q} (ρ_{std}) = 0/2 = 0$ . Contribution: $2 \cdot 0 = 0$ .

Upper bound: $4∥ Q^{* 2} - U ∥_{TV}^{2} \leq 1 + 0 = 1$ . So $∥ Q^{* 2} - U ∥_{TV} \leq 1/2$ . To two decimal places: 0.50.

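Note: the bound does not decrease with $k$ . The sign representation has Fourier coefficient $- 1$ , whose powers never decay, so the walk does not converge: it alternates between odd and even permutations. Here the bound is attained, since $Q^{* 2}$ is uniform on the alternating group $A_{3}$ , at total variation distance exactly $1/2$ from $U$ .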
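The parity obstruction can be confirmed by computing the convolution powers on $S_{3}$ exactly. A minimal sketch: permutations are tuples, `compose` and `convolve` are illustrative helpers, and the value of the bound is taken from the answer above.

```python
from fractions import Fraction
from itertools import permutations

G = list(permutations(range(3)))

def compose(s, t):
    """Composition (s o t)(i) = s[t[i]]."""
    return tuple(s[i] for i in t)

def convolve(P, Q):
    """(P * Q)(g) = sum_h P(h) Q(h^{-1} g), accumulated over products g = h s."""
    out = {g: Fraction(0) for g in G}
    for h, ph in P.items():
        for s, qs in Q.items():
            out[compose(h, s)] += ph * qs
    return out

def tv_to_uniform(P):
    u = Fraction(1, len(G))
    return sum(abs(P[g] - u) for g in G) / 2

# Uniform measure on the three transpositions (permutations with one fixed point).
transpositions = [g for g in G if sum(g[i] == i for i in range(3)) == 1]
Q = {g: Fraction(1, 3) if g in transpositions else Fraction(0) for g in G}

# Upper Bound Lemma: sign contributes 1 * |(-1)^k|^2 = 1, standard contributes 2 * 0,
# so 4 TV^2 <= 1 for every k.
bound = Fraction(1, 2)

dist = Q
tvs = []
for k in range(1, 7):
    tvs.append(tv_to_uniform(dist))
    print(k, tvs[-1], "<=", bound)
    dist = convolve(dist, Q)
```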

Exercise 8 (hard, symbolic).

Prove that if $Q$ is a probability measure on $G$ supported on a conjugacy class $C$ (so $Q (c) = 1/∣ C ∣$ for $c \in C$ ), then the Upper Bound Lemma simplifies to $4∥ Q^{* k} - U ∥_{TV}^{2} \leq \sum_{ρ \neq 1} d_{ρ}^{2} \left∣ \frac{χ_{ρ} (C)}{d_{ρ}} \right∣^{2 k}$ .

Hint

Use the result of Exercise 3: $\hat{Q} (ρ) = \frac{χ _{ρ} ( C )}{d _{ρ}} I_{d_{ρ}}$ . Then $\hat{Q} (ρ)^{k} = (\frac{χ _{ρ} ( C )}{d _{ρ}})^{k} I_{d_{ρ}}$ . Compute the Hilbert-Schmidt norm.

Answer

By Exercise 3, $\hat{Q} (ρ) = \frac{χ _{ρ} ( C )}{d _{ρ}} I_{d_{ρ}}$ , so $\hat{Q} (ρ)^{k} = (\frac{χ _{ρ} ( C )}{d _{ρ}})^{k} I_{d_{ρ}}$ . The Hilbert-Schmidt norm of a scalar matrix $c \cdot I_{d}$ satisfies $∥ c \cdot I_{d} ∥_{HS}^{2} = ∣ c ∣^{2} d$ , that is, $∥ c \cdot I_{d} ∥_{HS} = ∣ c ∣ \sqrt{d}$ . So $∥ \hat{Q} (ρ)^{k} ∥_{HS}^{2} = \left∣ \frac{χ _{ρ} ( C )}{d _{ρ}} \right∣^{2 k} \cdot d_{ρ}$ . Substituting into the Upper Bound Lemma: $\sum_{ρ \neq 1} d_{ρ} \cdot \left∣ \frac{χ _{ρ} ( C )}{d _{ρ}} \right∣^{2 k} \cdot d_{ρ} = \sum_{ρ \neq 1} d_{ρ}^{2} \left∣ \frac{χ _{ρ} ( C )}{d _{ρ}} \right∣^{2 k}$ .

Advanced results [Master]

Theorem 1 (Random transposition mixing time — Diaconis-Shahshahani 1981). For the random transposition walk on $S_{n}$ , the mixing time satisfies $t_{mix} (ε) = \frac{1}{2} n \log n + c (ε) n$ where $c (ε)$ depends only on $ε$ . In particular, as $n \to \infty$ with $k = \frac{1}{2} n \log n + c n$ , $∥ Q^{* k} - U ∥_{TV} \to 0$ if $c \to \infty$ and $∥ Q^{* k} - U ∥_{TV} \to 1$ if $c \to - \infty$ .

The proof evaluates the character sum $\sum_{λ \neq (n)} d_{λ}^{2} ∣ χ_{λ} (C) / d_{λ} ∣^{2 k}$ for the transposition conjugacy class $C$ on $S_{n}$ using the hook-length formula for $d_{λ}$ and the Frobenius character formula for $χ_{λ} (C)$ .
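For small $n$ this character sum can be evaluated by direct enumeration of partitions. The sketch below rests on two standard facts: the hook-length formula for $d_{λ}$, and the content formula $χ_{λ}(τ)/d_{λ} = \binom{n}{2}^{-1} \sum_{(i,j) \in λ} (j - i)$ for a transposition $τ$. The `hold` parameter is an assumption added here: it mixes in the identity with that probability, and `hold = 1/n` corresponds to choosing the two positions independently.

```python
from fractions import Fraction
from math import factorial

def partitions(n, largest=None):
    """Yield the partitions of n as weakly decreasing tuples."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def dimension(lam):
    """Hook-length formula: d_lambda = n! / (product of hook lengths)."""
    conj = [sum(1 for part in lam if part > j) for j in range(lam[0])]
    hooks = 1
    for i, row in enumerate(lam):
        for j in range(row):
            hooks *= (row - j - 1) + (conj[j] - i - 1) + 1   # arm + leg + 1
    return factorial(sum(lam)) // hooks

def transposition_ratio(lam):
    """chi_lambda(transposition) / d_lambda = (sum of cell contents) / C(n, 2)."""
    n = sum(lam)
    content_sum = sum(j - i for i, row in enumerate(lam) for j in range(row))
    return Fraction(content_sum, n * (n - 1) // 2)

def upper_bound_sum(n, k, hold=Fraction(0)):
    """Sum over lambda != (n) of d^2 * |hold + (1 - hold) * chi/d|^(2k)."""
    total = Fraction(0)
    for lam in partitions(n):
        if lam == (n,):
            continue                   # skip the principal representation
        eig = hold + (1 - hold) * transposition_ratio(lam)
        total += dimension(lam) ** 2 * eig ** (2 * k)
    return total

n = 6
for k in (5, 10, 15, 20):
    print(k, float(upper_bound_sum(n, k, hold=Fraction(1, n))))
```

With `hold = 0` the sign representation contributes 1 for every $k$, which is the parity obstruction of Exercise 5; a positive holding probability removes it.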

Theorem 2 (Plancherel bound is sharp for conjugacy class walks). For a conjugacy class walk on $S_{n}$ , the Upper Bound Lemma bound differs from the true total variation distance by at most a factor of 2. The dominant contribution comes from the standard representation $(n - 1, 1)$ .

This was established by Diaconis and Shahshahani in their 1981 paper by computing the asymptotics of the character sum and comparing with the lower bound from the second eigenvalue method.

Theorem 3 (Second eigenvalue bound). For a symmetric random walk on $G$ driven by $Q$ , if $\hat{Q} (ρ)$ has eigenvalues $β_{1} \geq β_{2} \geq \dots$ at each irreducible $ρ$ , then the mixing time satisfies $t_{mix} (ε) \geq \frac{\log (1/(2 ε))}{- \log β^{*}}$ where $β^{*} = \max_{ρ \neq 1} ∥ \hat{Q} (ρ) ∥_{op}$ is the largest non-principal operator norm.

This lower bound complements the Upper Bound Lemma's upper bound, pinching the mixing time between two character-theoretic expressions.
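The lower bound can be compared with a brute-force mixing time on a cyclic group, where $β^{*}$ is the largest modulus of a non-principal Fourier coefficient. A sketch under stated assumptions: the lazy $\pm 1$ walk on $\mathbb{Z}/12\mathbb{Z}$ is a hypothetical test case, the helper names are illustrative, and $ε < 1/2$ so that the bound is positive.

```python
import cmath
import math

def convolve(p, q):
    """Convolution on Z/nZ, where n = len(p)."""
    n = len(p)
    return [sum(p[h] * q[(g - h) % n] for h in range(n)) for g in range(n)]

def tv_to_uniform(p):
    """Total variation distance to the uniform distribution."""
    return sum(abs(x - 1 / len(p)) for x in p) / 2

def fourier(p, m):
    """Fourier transform at the character chi_m(g) = exp(2*pi*i*m*g/n)."""
    n = len(p)
    return sum(p[g] * cmath.exp(2j * cmath.pi * m * g / n) for g in range(n))

def mixing_time(step, eps, max_steps=10_000):
    """Smallest k with TV(step^{*k}, U) <= eps, or None if not reached."""
    dist = step
    for k in range(1, max_steps + 1):
        if tv_to_uniform(dist) <= eps:
            return k
        dist = convolve(dist, step)
    return None

def spectral_lower_bound(step, eps):
    """log(1/(2 eps)) / (-log beta*), beta* = largest non-principal |Fourier coefficient|."""
    beta = max(abs(fourier(step, m)) for m in range(1, len(step)))
    return math.log(1 / (2 * eps)) / (-math.log(beta))

# Lazy walk on Z/12Z: hold w.p. 1/2, move +1 or -1 w.p. 1/4 each.
step = [0.5, 0.25] + [0.0] * 9 + [0.25]

for eps in (0.25, 0.1, 0.01):
    print(eps, spectral_lower_bound(step, eps), mixing_time(step, eps))
```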

Theorem 4 (Comparison theorem — Diaconis-Saloff-Coste 1993). If $Q_{1}$ and $Q_{2}$ are two probability measures on the same group $G$ , then $∥ Q_{1}^{* k} - U ∥_{TV} \leq A \cdot ∥ Q_{2}^{* k A} - U ∥_{TV}$ where $A$ is a comparison constant depending on the support of $Q_{1}$ relative to $Q_{2}$ . This allows bounding the mixing time of one walk in terms of another.

Theorem 5 (Random adjacent transpositions — Wilson 2004). The random adjacent transposition walk on $S_{n}$ (step: swap positions $i$ and $i + 1$ uniformly at random) has mixing time $Θ (n^{3} \log n)$ . The upper bound uses the Upper Bound Lemma adapted to the generating set of adjacent transpositions; the lower bound uses Wilson's method.

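Theorem 6 (Product bound — Diaconis-Shahshahani). For a random walk on $G_{1} \times G_{2}$ driven by the product measure $Q = Q_{1} \otimes Q_{2}$ , with $U = U_{1} \otimes U_{2}$ , the total variation distance is subadditive: $∥ Q^{* k} - U ∥_{TV} \leq ∥ Q_{1}^{* k} - U_{1} ∥_{TV} + ∥ Q_{2}^{* k} - U_{2} ∥_{TV}$ . On the Fourier side, the irreducible representations of $G_{1} \times G_{2}$ are the tensor products $ρ_{1} \otimes ρ_{2}$ and $\hat{Q} (ρ_{1} \otimes ρ_{2}) = \hat{Q}_{1} (ρ_{1}) \otimes \hat{Q}_{2} (ρ_{2})$ . Writing $S$ , $S_{1}$ , $S_{2}$ for the right-hand side of the Upper Bound Lemma on $G_{1} \times G_{2}$ , $G_{1}$ , $G_{2}$ , the character sum therefore decomposes as $1 + S = (1 + S_{1}) (1 + S_{2})$ .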

Synthesis. The Upper Bound Lemma is the foundational reason that representation theory controls the rate of convergence of random walks on finite groups. The central insight is that the Plancherel isomorphism translates the probabilistic $L^{2}$ -norm into a sum over representations, and Cauchy-Schwarz carries this to the $L^{1}$ -norm that defines total variation distance. Putting these together with the character-theoretic machinery of 07.05.01, the bound becomes a finite computation: enumerate the non-principal irreducibles, compute character values on the support of $Q$ , and read off the mixing time. This is exactly the content that builds toward the riffle shuffle analysis, where the representations of $S_{n}$ indexed by partitions control the rate at which shuffles approach uniform. The bridge is between abstract harmonic analysis on groups and concrete probabilistic convergence; the pattern generalises from conjugacy class walks to arbitrary measures via the operator-norm formulation of Theorem 3, and identifies the mixing time with the slowest-decaying non-principal character.

Full proof set [Master]

Proposition 1 (Fourier transform of convolution). For probability measures $Q_{1}, Q_{2}$ on $G$ and any irreducible representation $ρ$ :

$$\widehat{Q_{1} * Q_{2}}(ρ) = \hat{Q}_{1}(ρ) \cdot \hat{Q}_{2}(ρ).$$

Proof. The convolution is $(Q_{1} * Q_{2}) (g) = \sum_{h} Q_{1} (h) Q_{2} (h^{- 1} g)$ . Fourier-transforming:

$$\widehat{Q_{1} * Q_{2}}(ρ) = \sum_{g} \sum_{h} Q_{1}(h)\, Q_{2}(h^{-1} g)\, ρ(g).$$

Substituting $g = h g^{'}$ and using $ρ (h g^{'}) = ρ (h) ρ (g^{'})$ :

$$= \sum_{h} Q_{1}(h)\, ρ(h) \sum_{g^{'}} Q_{2}(g^{'})\, ρ(g^{'}) = \hat{Q}_{1}(ρ) \cdot \hat{Q}_{2}(ρ). \qquad □$$
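The proof uses only that $ρ$ is a homomorphism, so the identity can be tested on $S_{3}$ with the three-dimensional permutation representation, which is not irreducible but is easy to write down. A minimal sketch: the two measures `Q1` and `Q2` are hypothetical examples, and the last line illustrates that the order of the factors matters on a non-abelian group.

```python
from fractions import Fraction
from itertools import permutations

G = list(permutations(range(3)))

def compose(s, t):
    """Composition (s o t)(i) = s[t[i]]."""
    return tuple(s[i] for i in t)

def inverse(s):
    inv = [0] * len(s)
    for i, si in enumerate(s):
        inv[si] = i
    return tuple(inv)

def rho(s):
    """Permutation matrix with rho(s) e_j = e_{s[j]}, so rho(s o t) = rho(s) rho(t)."""
    return [[1 if s[j] == i else 0 for j in range(3)] for i in range(3)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def fourier(Q):
    """Sum over g of Q(g) rho(g)."""
    out = [[Fraction(0)] * 3 for _ in range(3)]
    for g, q in Q.items():
        R = rho(g)
        for i in range(3):
            for j in range(3):
                out[i][j] += q * R[i][j]
    return out

def convolve(Q1, Q2):
    """(Q1 * Q2)(g) = sum_h Q1(h) Q2(h^{-1} g)."""
    return {g: sum(Q1[h] * Q2[compose(inverse(h), g)] for h in G) for g in G}

def measure(weights):
    """Extend a partial dict of weights by zero to all of G."""
    return {g: Fraction(weights.get(g, 0)) for g in G}

Q1 = measure({(1, 0, 2): Fraction(1, 2), (1, 2, 0): Fraction(1, 2)})
Q2 = measure({(0, 1, 2): Fraction(1, 3), (0, 2, 1): Fraction(2, 3)})

lhs = fourier(convolve(Q1, Q2))
rhs = matmul(fourier(Q1), fourier(Q2))
print(lhs == rhs)
print(matmul(fourier(Q2), fourier(Q1)) == rhs)   # reversed order of factors
```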

Proposition 2 (Character sum for conjugacy class walks). Let $Q$ be uniform on a conjugacy class $C$ of $G$ . Then

$$\sum_{ρ \neq 1} d_{ρ}^{2} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2k} \leq ∣G∣^{2} \cdot \max_{ρ \neq 1} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2k - 2}.$$

Proof. Split two powers of the character ratio off each summand:

$$\sum_{ρ \neq 1} d_{ρ}^{2} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2k} = \sum_{ρ \neq 1} d_{ρ}^{2} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2} \cdot \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2k - 2}.$$

Bounding each factor with exponent $2 k - 2$ by its maximum over $ρ \neq 1$ and extending the remaining sum to all $ρ$ :

$$\leq \left( \max_{ρ \neq 1} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2k - 2} \right) \sum_{ρ} d_{ρ}^{2} \left∣ \frac{χ_{ρ}(C)}{d_{ρ}} \right∣^{2}.$$

Since $∣ χ_{ρ} (C) ∣ \leq d_{ρ}$ , the remaining sum is at most $\sum_{ρ} d_{ρ}^{2} = ∣ G ∣ \leq ∣ G ∣^{2}$ , which gives the claim. More precisely, the remaining sum equals $\sum_{ρ} ∣ χ_{ρ} (C) ∣^{2} = ∣ G ∣ /∣ C ∣$ by column orthogonality of the character table, so the constant $∣ G ∣^{2}$ can be replaced by $∣ G ∣ /∣ C ∣$ . $□$

Connections [Master]

Non-abelian Fourier transform 07.01.09. The Fourier transform at irreducible representations, which underpins the Upper Bound Lemma, is exactly the non-abelian Fourier transform of 07.01.09. The Plancherel isomorphism that converts $L^{2}$ -norms into character sums is the analytic core of both units.
Character orthogonality 07.01.04. The vanishing of $\hat{U} (ρ)$ for non-principal $ρ$ in the proof of the Upper Bound Lemma is a direct application of the character orthogonality relations of 07.01.04. The orthogonality relations guarantee that the uniform distribution has no "energy" in non-principal representations.
Symmetric group representation 07.05.01. The primary application domain for the Upper Bound Lemma is the symmetric group $S_{n}$ . The representations indexed by partitions, their dimensions given by the hook-length formula, and the character values on cycle types developed in 07.05.01 are the computational ingredients needed to evaluate the character sum.
Schur-Weyl duality 07.05.04. The representation-theoretic structure of $S_{n}$ that makes the Upper Bound Lemma computationally effective is the same structure captured by Schur-Weyl duality. The irreducible representations of $S_{n}$ that appear in the character sum correspond bijectively to partitions, and the same indexing governs the decomposition of tensor powers.

Historical & philosophical context [Master]

Diaconis and Shahshahani introduced the Upper Bound Lemma in their 1981 paper Generating a Random Permutation with Random Transpositions ^{[DiaconisShahshahani1981]}, proving that about $\frac{1}{2} n \log n$ random transpositions are necessary and sufficient to bring a permutation close to uniformly distributed in total variation. Their innovation was to recast the classical probability question of mixing times in the language of character theory, using the Plancherel formula to bound total variation distance.

Diaconis expanded this programme in his 1988 monograph Group Representations in Probability and Statistics ^{[Diaconis1988]}, which systematised the Fourier-analytic approach to random walks on finite groups. The comparison theorems for bounding the mixing time of one walk in terms of another followed in the 1993 work of Diaconis and Saloff-Coste ^{[DiaconisSaloffCoste1993]}. Saloff-Coste 2004 provided a comprehensive survey of the field, integrating the representation-theoretic bounds with geometric and functional-analytic methods.

Bibliography [Master]

@article{DiaconisShahshahani1981,
  author = {Diaconis, Persi and Shahshahani, Mehrdad},
  title = {Generating a random permutation with random transpositions},
  journal = {Z. Wahrsch. Verw. Gebiete},
  volume = {57},
  year = {1981},
  pages = {159--179},
}

@book{Diaconis1988,
  author = {Diaconis, Persi},
  title = {Group Representations in Probability and Statistics},
  publisher = {Institute of Mathematical Statistics},
  year = {1988},
  series = {IMS Lecture Notes--Monograph Series},
  volume = {11},
}

@article{BayerDiaconis1992,
  author = {Bayer, Dave and Diaconis, Persi},
  title = {Trailing the Dovetail Shuffle to its Lair},
  journal = {Ann. Appl. Probab.},
  volume = {2},
  year = {1992},
  pages = {294--313},
}

@incollection{SaloffCoste2004,
  author = {Saloff-Coste, Laurent},
  title = {Random Walks on Finite Groups},
  booktitle = {Probability on Discrete Structures},
  publisher = {Springer},
  year = {2004},
  series = {Encyclopaedia of Mathematical Sciences},
  volume = {110},
}

@article{Wilson2004,
  author = {Wilson, David B.},
  title = {Mixing times of Lozenge tiling and card shuffling Markov chains},
  journal = {Ann. Appl. Probab.},
  volume = {14},
  year = {2004},
  pages = {274--325},
}

@article{DiaconisSaloffCoste1993,
  author = {Diaconis, Persi and Saloff-Coste, Laurent},
  title = {Comparison Theorems for Reversible Markov Chains},
  journal = {Ann. Appl. Probab.},
  volume = {3},
  year = {1993},
  pages = {696--730},
}

Prerequisites

07.01.09
07.05.01

Tier anchors

beginner: Diaconis Group Representations in Probability and Statistics Ch. 3 informal; 3Blue1Brown random-walk visualisation analogy
intermediate: Diaconis Group Representations in Probability and Statistics Ch. 3; Serre Linear Representations Part II
master: Diaconis-Shahshahani 1981 Z. Wahrsch. Verw. Gebiete 57; Diaconis 1988 Group Representations in Probability and Statistics; Saloff-Coste 2004 Random Walks on Finite Groups

References

Diaconis, P. and Shahshahani, M. — Generating a random permutation with random transpositions · Z. Wahrsch. Verw. Gebiete 57 (1981), 159-179; the Upper Bound Lemma
Diaconis, P. — Group Representations in Probability and Statistics · IMS Lecture Notes Vol. 11 (1988), Ch. 3
Bayer, D. and Diaconis, P. — Trailing the Dovetail Shuffle to its Lair · Ann. Appl. Probab. 2 (1992), 294-313
Saloff-Coste, L. — Random Walks on Finite Groups · Probability on Discrete Structures, Encyclopaedia Math. Sci. 110 (2004)

Reviewer

TBD

Estimated time

beginner: 16m
intermediate: 40m
master: 75m