07.05.13 · representation-theory / symmetric

Models for partially ranked data on S_n/S_{n-k}

shipped · 3 tiers · Lean: none

Anchor (Master): Diaconis 1988 IMS Lecture Notes 11 Ch. 9; Stanley 1986 Enumerative Combinatorics

Intuition [Beginner]

In many real surveys, people do not rank all items — they rank only their top 3 or top 5 choices. These are partial rankings. The question becomes: how do you do spectral analysis when the data consists of partial rankings instead of full permutations?

The answer uses cosets. A partial ranking of $k$ items out of $n$ can be viewed as an element of the coset space $S_{n} / S_{n - k}$ : you specify the top $k$ positions, and the remaining $n - k$ items can be in any order. The subgroup $S_{n - k}$ permutes the unranked items among themselves. The pair $(S_{n}, S_{n - k})$ forms a Gelfand pair, meaning the harmonic analysis on this coset space is especially simple.

Why does this concept exist? Because the Gelfand pair structure guarantees that the Fourier analysis on partial rankings decomposes into one-dimensional blocks (indexed by partitions with at most $k$ parts), making every computation explicit and interpretable.

Visual [Beginner]

A diagram showing a full ranking of 5 items as a permutation in $S_{5}$ , with a downward arrow to a partial ranking showing only the top 2 items. The partial ranking corresponds to a coset containing all permutations with the same top 2, represented as a set of 6 permutations grouped together.

The projection map from full permutations to partial rankings collapses each coset (all permutations sharing the same top- $k$ ) into a single point.

Worked example [Beginner]

Consider $n = 4$ items with partial rankings showing only the top $k = 2$ positions. The coset space $S_{4} / S_{2}$ has $4! / (4 - 2)! = 12$ elements.

Step 1. A respondent picks item 3 as best and item 1 as second. This corresponds to the coset of all permutations $π$ with $π (1) = 3$ and $π (2) = 1$ . The remaining items 2 and 4 can be in either order in positions 3 and 4, so this coset contains exactly 2 permutations: $(3, 1, 2, 4)$ and $(3, 1, 4, 2)$ .

Step 2. There are $4 \times 3 = 12$ possible partial rankings (top 2 out of 4). Under the uniform distribution, each has probability $1/12$ .

Step 3. The first-order spectral component on this coset space is indexed by the partition $(n - 1, 1) = (3, 1)$ , just like in the full ranking case. It detects which items are disproportionately likely to appear in the top 2 positions.

What this tells us: the spectral analysis of partial rankings mirrors the full ranking analysis, with the Gelfand pair structure ensuring that only partitions with at most $k$ parts contribute nonzero spectral components.
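Steps 1 and 2 of the worked example can be checked by brute force. The sketch below is illustrative only: it stores a full ranking as a tuple whose entry in position $i$ is the item ranked $i$-th, and the helper name `cosets_by_top_k` is hypothetical.

```python
from itertools import permutations
from collections import defaultdict

def cosets_by_top_k(n, k):
    """Group the permutations of items 1..n by their top-k prefix.

    Each key is a partial ranking (the items in positions 1..k); each value
    is the coset of full rankings that share that prefix.
    """
    cosets = defaultdict(list)
    for perm in permutations(range(1, n + 1)):
        cosets[perm[:k]].append(perm)
    return dict(cosets)

cosets = cosets_by_top_k(4, 2)
print(len(cosets))                            # number of partial rankings
print({len(c) for c in cosets.values()})      # sizes of the cosets
print(cosets[(3, 1)])                         # the coset from Step 1
```

Every coset has the same size $(n - k)!$, which is why the uniform distribution on full rankings pushes forward to the uniform distribution on partial rankings.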

Check your understanding [Beginner]

Formal definition [Intermediate+]

Let $1 \leq k \leq n$ . The subgroup $S_{n - k} \leq S_{n}$ is the subgroup permuting the elements $\{k + 1, \dots, n\}$ among themselves while fixing $\{1, \dots, k\}$ . A partial ranking of the top $k$ items is a coset $π S_{n - k}$ for some $π \in S_{n}$ , where $π$ specifies the ordering of the top $k$ positions.

The coset space $X = S_{n} / S_{n - k}$ has $n! / (n - k)!$ elements. A function $f : X \to \mathbb{C}$ on partial rankings lifts to a right- $S_{n - k}$ -invariant function $\tilde{f} : S_{n} \to \mathbb{C}$ via $\tilde{f} (π) = f (π S_{n - k})$ .
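The lift can be made concrete with a small sketch. It assumes a permutation is a 0-indexed tuple mapping positions to items, so right multiplication by an element of $S_{n - k}$ reshuffles only positions $k, \dots, n - 1$. The test function `f` (the item ranked first) and the helper names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Return p∘q for permutations stored as 0-indexed tuples: (p∘q)[i] = p[q[i]]."""
    return tuple(p[i] for i in q)

def lift(f, k):
    """Lift f (a dict keyed by top-k prefixes) to a function on full permutations."""
    return lambda perm: f[perm[:k]]

n, k = 4, 2
G = list(permutations(range(n)))
# S_{n-k}: permutations fixing positions 0..k-1 and permuting the rest.
K = [s for s in G if s[:k] == tuple(range(k))]

# Hypothetical function on the coset space: the item ranked first.
f = {perm[:k]: perm[0] for perm in G}
f_tilde = lift(f, k)

# Right invariance: f_tilde(p s) == f_tilde(p) for every s in S_{n-k}.
right_invariant = all(f_tilde(compose(p, s)) == f_tilde(p) for p in G for s in K)
print(right_invariant)
```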

Definition (Gelfand pair). The pair $(G, K)$ of a finite group $G$ and a subgroup $K$ is a Gelfand pair if the algebra of $K$ -bi-invariant functions on $G$ (i.e., functions satisfying $f (k_{1} g k_{2}) = f (g)$ for all $k_{1}, k_{2} \in K$ ) is commutative under convolution.
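The commutativity condition can be tested numerically on small groups. The following is a minimal sketch, not a general-purpose tool: it checks commutativity on the basis of double-coset indicator functions, which suffices by bilinearity of convolution. The two demonstration cases, $(S_{4}, S_{3})$ (the case $k = 1$) and $(S_{3}, \{e\})$, and all helper names are choices made here for illustration.

```python
from itertools import permutations

def compose(p, q):
    """Return p∘q for 0-indexed tuples: (p∘q)[i] = p[q[i]]."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, v in enumerate(p):
        inv[v] = i
    return tuple(inv)

def double_cosets(G, K):
    """Partition G into double cosets K g K."""
    seen, cosets = set(), []
    for g in G:
        if g not in seen:
            dc = frozenset(compose(compose(a, g), b) for a in K for b in K)
            seen |= dc
            cosets.append(dc)
    return cosets

def convolve(f1, f2, G):
    """Group convolution (f1 * f2)(g) = sum_h f1(g h^{-1}) f2(h)."""
    return {g: sum(f1[compose(g, inverse(h))] * f2[h] for h in G) for g in G}

def biinvariant_algebra_commutes(G, K):
    """Check commutativity on the basis of double-coset indicator functions."""
    basis = [{g: int(g in dc) for g in G} for dc in double_cosets(G, K)]
    return all(convolve(a, b, G) == convolve(b, a, G) for a in basis for b in basis)

S4 = list(permutations(range(4)))
S3_in_S4 = [g for g in S4 if g[0] == 0]   # fixes position 0, permutes the rest
S3 = list(permutations(range(3)))
trivial = [tuple(range(3))]

gelfand_k1 = biinvariant_algebra_commutes(S4, S3_in_S4)
gelfand_trivial = biinvariant_algebra_commutes(S3, trivial)
print(gelfand_k1, gelfand_trivial)
```

With the trivial subgroup every function is bi-invariant, so the second check is just a test of whether the group algebra of $S_{3}$ commutes.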

Definition (Spherical function). For a Gelfand pair $(G, K)$ , the zonal spherical functions $ω_{1}, ω_{2}, \dots, ω_{r}$ are the distinguished basis of the $K$ -bi-invariant function algebra, one for each irreducible representation of $G$ that appears in $Ind_{K}^{G} (1)$ . They satisfy:

$$ω_{i} (g) = \frac{1}{∣ K ∣} \sum_{k \in K} χ_{i} (g k)$$

where $χ_{i}$ is the character of the $i$ -th irreducible constituent.
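As a hedged illustration of the averaging formula, the sketch below evaluates it for the character $χ_{(n - 1, 1)} (π) = fix (π) - 1$ (fixed points minus one) in the small case $n = 4$ , $k = 1$ , and checks that the result is bi-invariant. The choice of case, the 0-indexed tuple encoding, and the helper names are assumptions made for this example.

```python
from fractions import Fraction
from itertools import permutations

def compose(p, q):
    """Return p∘q for 0-indexed tuples: (p∘q)[i] = p[q[i]]."""
    return tuple(p[i] for i in q)

def chi_standard(p):
    """Character of the (n-1, 1) representation: number of fixed points minus 1."""
    return sum(1 for i, v in enumerate(p) if i == v) - 1

def averaged_character(g, K, chi):
    """Average of chi over the coset gK, following the displayed formula."""
    return Fraction(sum(chi(compose(g, s)) for s in K), len(K))

n, k = 4, 1
G = list(permutations(range(n)))
K = [s for s in G if s[:k] == tuple(range(k))]   # copy of S_{n-k} inside S_n

omega = {g: averaged_character(g, K, chi_standard) for g in G}

# Bi-invariance: omega(a g b) == omega(g) for all a, b in K.
bi_invariant = all(
    omega[compose(compose(a, g), b)] == omega[g] for g in G for a in K for b in K
)
print(bi_invariant, sorted(set(omega.values())))
```

In this case the function takes one value on $K$ and another on its complement, matching the two double cosets of $S_{3}$ in $S_{4}$ .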

For $(S_{n}, S_{n - k})$ , the spherical functions are indexed by partitions $λ$ of $n$ with at most $k$ parts, and are expressible in terms of the irreducible characters of $S_{n}$ via the branching rule from $S_{n}$ to $S_{n - k}$ .

Counterexamples to common slips

Not every pair $(G, K)$ is a Gelfand pair. The Gelfand pair condition requires the double coset algebra to be commutative, which is a strong constraint. For example, $(S_{n}, S_{n - 1})$ is a Gelfand pair but $(S_{n}, \{e\})$ is not (the full group algebra is not commutative for $n \geq 3$ ).
Partitions with more than $k$ parts do not contribute. The spectral decomposition of $L^{2} (S_{n} / S_{n - k})$ involves only irreducible representations indexed by partitions $λ ⊢ n$ with $ℓ (λ) \leq k$ (at most $k$ parts). This is a consequence of the Frobenius reciprocity and the branching rule.
Spherical functions are not characters. The zonal spherical functions are averages of characters over the subgroup $K$ , which produces a different set of functions with different orthogonality relations.

Key theorem with proof [Intermediate+]

Theorem (Spectral decomposition on partial rankings — Diaconis 1989). Let $(S_{n}, S_{n - k})$ be the Gelfand pair for partial rankings of the top $k$ out of $n$ items. Then $L^{2} (S_{n} / S_{n - k})$ decomposes as an $S_{n}$ -module:

$$L^{2} (S_{n} / S_{n - k}) ≅ \bigoplus_{\substack{λ ⊢ n \\ ℓ (λ) \leq k}} V^{λ},$$

and the decomposition is multiplicity-free. For a probability distribution $f$ on $S_{n} / S_{n - k}$ , the spherical Fourier transform $\hat{f} (λ) = ⟨ f, ω_{λ} ⟩ / ⟨ ω_{λ}, ω_{λ} ⟩$ for each partition $λ$ with $ℓ (λ) \leq k$ gives a complete spectral description of $f$ , and $f$ is reconstructed by:

$$f (π S_{n - k}) = \sum_{\substack{λ ⊢ n \\ ℓ (λ) \leq k}} \hat{f} (λ) ω_{λ} (π) .$$

Proof. The proof has four steps.

Step 1 (Induced representation). The space $L^{2} (S_{n} / S_{n - k})$ is isomorphic to the induced representation $Ind_{S_{n - k}}^{S_{n}} (1)$ , where $1$ is the identity representation of $S_{n - k}$ .

Step 2 (Decomposition by Frobenius reciprocity). By Frobenius reciprocity, the multiplicity of $V^{λ}$ in $Ind_{S_{n - k}}^{S_{n}} (1)$ equals the multiplicity of $1$ in $Res_{S_{n - k}}^{S_{n}} (V^{λ})$ . By the branching rule for restricting from $S_{n}$ to $S_{n - k}$ , the identity representation of $S_{n - k}$ appears in $V^{λ} ∣_{S_{n - k}}$ if and only if $ℓ (λ) \leq k$ , and in that case the multiplicity is exactly 1.

Step 3 (Multiplicity-free). Since each multiplicity is 0 or 1, the decomposition is multiplicity-free. This is exactly the Gelfand pair condition: the algebra of $S_{n - k}$ -bi-invariant functions is commutative because the decomposition has no repeated irreducibles.

Step 4 (Spherical inversion). The zonal spherical functions $ω_{λ}$ for $ℓ (λ) \leq k$ form an orthogonal basis of the $S_{n - k}$ -bi-invariant function space. The Fourier coefficient $\hat{f} (λ) = ⟨ f, ω_{λ} ⟩ / ⟨ ω_{λ}, ω_{λ} ⟩$ extracts the projection of $f$ onto $ω_{λ}$ , and the inversion formula reconstructs $f$ from these coefficients. $□$

Bridge. The spectral decomposition on partial rankings builds toward a unified statistical framework for any level of incompleteness in the data, and appears again in 07.05.11 as the special case $k = n$ (full rankings recover $L^{2} (S_{n})$ with all partitions). The foundational reason the Gelfand pair structure matters is that multiplicity-free decomposition guarantees unique spherical functions, which generalises the orthonormality of characters in the full group case. This is exactly the content that identifies partial ranking analysis as a clean sub-theory of the full spectral analysis; the bridge is that the branching rule for $S_{n} ↓ S_{n - k}$ controls which irreducible representations survive the projection from full to partial rankings, and putting these together with the zonal spherical functions, every statistical question about partially ranked data reduces to a finite computation indexed by partitions with at most $k$ parts.

Exercises [Intermediate+]

Exercise 2 (easy, multiple choice).

Why does the Gelfand pair property matter for spectral analysis of partial rankings?

A. It guarantees fast algorithms for computing rankings
B. It ensures the spectral decomposition is multiplicity-free, so each component has a unique interpretation
C. It means the data is normally distributed
D. It makes the partial rankings symmetric

Hint

A multiplicity-free decomposition means each irreducible appears at most once, giving a clean one-to-one correspondence between spectral components and statistical effects.

Answer

B. Feedback-correct: multiplicity-free means the function algebra on the coset space is commutative, giving a clean spectral decomposition where each partition contributes exactly one basis function. Feedback-wrong: A is a computational benefit, not the structural reason; C confuses representation theory with distribution theory; D uses "symmetric" loosely.

Exercise 3 (medium, symbolic).

Show that the number of spectral components in the decomposition of $L^{2} (S_{n} / S_{n - k})$ equals the number of partitions of $n$ with at most $k$ parts, and that this equals the number of double cosets $S_{n - k} \backslash S_{n} / S_{n - k}$ .

Hint

Use the Gelfand pair theorem: the number of spherical functions equals the number of double cosets, and the spherical functions are indexed by partitions with at most $k$ parts.

Answer

By the general theory of Gelfand pairs, the dimension of the $K$ -bi-invariant function algebra equals the number of double cosets $K \backslash G / K$ , and the zonal spherical functions form a basis of this algebra, one for each irreducible in $Ind_{K}^{G} (1)$ . For $(S_{n}, S_{n - k})$ : Step 2 of the Key Theorem showed that the irreducibles are indexed by partitions with $ℓ (λ) \leq k$ , and each appears with multiplicity 1. So the number of spectral components equals both the number of such partitions and the number of double cosets $S_{n - k} \backslash S_{n} / S_{n - k}$ .

Exercise 5 (medium, multiple choice).

When $k = n$ (full rankings), the coset space is $S_{n} / S_{0} = S_{n}$ and the analysis for the pair $(S_{n}, \{e\})$ reduces to:

A. The group algebra $\mathbb{C} [S_{n}]$ with all irreducible representations
B. A single one-dimensional space
C. Only the identity representation
D. The sign representation

Hint

When $K = \{e\}$ , every partition of $n$ has at most $n$ parts (which is all partitions), and the induced representation is the regular representation.

Answer

A. Feedback-correct: $k = n$ means $ℓ (λ) \leq n$ is always satisfied, so all partitions of $n$ appear. The coset space is $S_{n}$ itself, and $L^{2} (S_{n})$ is the regular representation, which decomposes as the direct sum of all irreducible representations, each with multiplicity equal to its dimension. Note: $(S_{n}, \{e\})$ is not actually a Gelfand pair (the algebra is not commutative for $n \geq 3$ ), so the Gelfand pair framework degenerates to the full spectral analysis. Feedback-wrong: B, C, D are too small.

Exercise 6 (medium, symbolic).

For the Mallows model on partial rankings $f (π S_{n - k}) \propto e^{- θ \cdot d (π S_{n - k}, S_{n - k})}$ where $d$ is the induced Cayley distance on cosets, express the spherical Fourier coefficient $\hat{f} ((n - 1, 1))$ in terms of $θ$ and $n$ .

Hint

The spherical function at $λ = (n - 1, 1)$ is proportional to the character $χ_{(n - 1, 1)}$ averaged over $S_{n - k}$ . For the Cayley-induced metric on cosets, use the character expression for Cayley distance.

Answer

The zonal spherical function $ω_{(n - 1, 1)} (π) = \frac{1}{∣ S _{n - k} ∣} \sum_{σ \in S_{n - k}} χ_{(n - 1, 1)} (π σ)$ . For the Mallows model, the spherical Fourier coefficient is $\hat{f} ((n - 1, 1)) = ⟨ f, ω_{(n - 1, 1)} ⟩ /∥ ω_{(n - 1, 1)} ∥^{2}$ . Since the character $χ_{(n - 1, 1)} (π) = fix (π) - 1$ and the induced Cayley distance on cosets is determined by the cycle structure relative to the unranked items, the coefficient is proportional to $(1 - e^{- θ}) / (1 - e^{- n θ})$ restricted to the partitions with $ℓ (λ) \leq k$ .

Exercise 7 (hard, symbolic).

Prove that the decomposition of $Ind_{S_{n - k}}^{S_{n}} (1)$ is multiplicity-free by showing that the branching rule from $S_{n}$ to $S_{n - k}$ gives multiplicity at most 1 for the identity representation of $S_{n - k}$ in each restriction $V^{λ} ∣_{S_{n - k}}$ .

Hint

Use the iterated branching rule: $S_{n} ↓ S_{n - 1} ↓ \dots ↓ S_{n - k}$ , and the fact that at each step the branching rule for $S_{m} ↓ S_{m - 1}$ removes one box from the Young diagram. The identity representation of $S_{n - k}$ appears iff the Young diagram can be reduced to the single-row diagram by removing $k$ boxes.

Answer

The branching rule for $S_{m} ↓ S_{m - 1}$ decomposes $V^{λ}$ as $⨁ V^{λ - □}$ where the sum is over all removable corners $□$ of the Young diagram of $λ$ . Iterating from $S_{n}$ to $S_{n - k}$ : $V^{λ} ∣_{S_{n - k}} = ⨁_{μ} c_{μ}^{λ} V^{μ}$ where $c_{μ}^{λ}$ counts the number of paths from $λ$ to $μ$ in the branching lattice (removing one box at each step). The identity representation of $S_{n - k}$ corresponds to the single-row partition $(n - k)$ of $n - k$ (the partition $(1^{n - k})$ gives the sign representation instead). For the multiplicity of the identity representation $V^{(n - k)}$ of $S_{n - k}$ in $V^{λ} ∣_{S_{n - k}}$ : by the branching rule, $V^{(n - k)}$ appears iff we can reach $(n - k)$ from $λ$ by removing $k$ boxes. Since $(n - k)$ is the single-row partition, each removal step must remove a box from a row of length $> 1$ or from the last non-empty row. The number of paths is at most 1 iff $ℓ (λ) \leq k$ (all boxes removed come from the first $k$ rows). When $ℓ (λ) \leq k$ , there is exactly one path (remove boxes from the bottom row up), so the multiplicity is 1.

Exercise 8 (hard, symbolic).

Let $f$ be a probability distribution on $S_{n} / S_{n - k}$ . Show that the total variation distance $∥ f - U ∥_{TV}$ can be bounded by a sum over the non-identity spherical Fourier coefficients:

$$4∥ f - U ∥_{TV}^{2} \leq ∣ X ∣ \sum_{\substack{λ ⊢ n \\ ℓ (λ) \leq k \\ λ \neq (n)}} ∣ \hat{f} (λ) ∣^{2} ∥ ω_{λ} ∥_{2}^{2},$$

where $X = S_{n} / S_{n - k}$ .

Hint

Adapt the proof of the Upper Bound Lemma from 07.05.05: apply Cauchy-Schwarz to pass from $L^{1}$ to $L^{2}$ , then use the Plancherel formula for the Gelfand pair.

Answer

By definition, $∥ f - U ∥_{TV} = \frac{1}{2} ∥ f - U ∥_{1}$ . By Cauchy-Schwarz: $∥ f - U ∥_{1}^{2} \leq ∣ X ∣ \cdot ∥ f - U ∥_{2}^{2}$ . By the Plancherel theorem for the Gelfand pair $(S_{n}, S_{n - k})$ :

$$∥ f - U ∥_{2}^{2} = \sum_{\substack{λ ⊢ n \\ ℓ (λ) \leq k}} ∣ \widehat{f - U} (λ) ∣^{2} ∥ ω_{λ} ∥_{2}^{2} .$$

Since $\widehat{f - U} ((n)) = 0$ (both $f$ and $U$ have total mass 1) and $\hat{U} (λ) = 0$ for $λ \neq (n)$ , the sum runs over $λ \neq (n)$ and $\widehat{f - U} (λ) = \hat{f} (λ)$ there. Combining: $4∥ f - U ∥_{TV}^{2} \leq ∣ X ∣ \sum_{λ \neq (n)} ∣ \hat{f} (λ) ∣^{2} ∥ ω_{λ} ∥_{2}^{2}$ .
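The Cauchy-Schwarz step in this argument can be sanity-checked numerically. The sketch below uses the 12 partial rankings for $n = 4$ , $k = 2$ with counting-measure norms, as in the inequality $∥ f - U ∥_{1}^{2} \leq ∣ X ∣ \cdot ∥ f - U ∥_{2}^{2}$ . The distribution `f` is a made-up example, not data from any source.

```python
from itertools import permutations

n, k = 4, 2
# The coset space X: all ordered top-k prefixes of items 1..n.
X = sorted({perm[:k] for perm in permutations(range(1, n + 1))})

# Hypothetical distribution: favour partial rankings with item 1 ranked first.
weights = [3.0 if x[0] == 1 else 1.0 for x in X]
total = sum(weights)
f = [w / total for w in weights]
U = [1.0 / len(X)] * len(X)

l1 = sum(abs(a - b) for a, b in zip(f, U))
l2_sq = sum((a - b) ** 2 for a, b in zip(f, U))
tv = 0.5 * l1

lhs = 4 * tv ** 2        # equals ||f - U||_1^2
rhs = len(X) * l2_sq     # Cauchy-Schwarz upper bound
print(tv, lhs, rhs)
```

The Plancherel step then rewrites `l2_sq` as a sum over spherical Fourier coefficients; this sketch only covers the passage from $L^{1}$ to $L^{2}$ .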

Advanced results [Master]

Theorem 1 (Diaconis 1989: multiplicity-free decomposition via branching). The decomposition of $L^{2} (S_{n} / S_{n - k})$ into irreducible $S_{n}$ -modules is indexed by partitions $λ ⊢ n$ with $ℓ (λ) \leq k$ , each with multiplicity 1. The partitions are enumerated by restricting the Young diagram to at most $k$ rows.

This was proved by Diaconis 1989 using the branching rule and Frobenius reciprocity. The multiplicity-free property is equivalent to the Gelfand pair condition.

Theorem 2 (Zonal spherical functions for $(S_{n}, S_{n - k})$ ). The zonal spherical functions are given by:

$$ω_{λ} (π) = \frac{χ_{λ} (π \cdot S_{n - k})}{d_{λ}} = \frac{1}{d_{λ} \cdot ∣ S_{n - k} ∣} \sum_{σ \in S_{n - k}} χ_{λ} (π σ)$$

for partitions $λ$ with $ℓ (λ) \leq k$ . These functions form an orthogonal basis of the $S_{n - k}$ -bi-invariant function space with $∥ ω_{λ} ∥_{2}^{2} = ∣ S_{n} ∣/ (d_{λ} ∣ S_{n - k} ∣^{2})$ .

Theorem 3 (Spectral analysis of the Mallows model on partial rankings). For the Mallows model $f (π S_{n - k}) \propto e^{- θ d_{C} (π S_{n - k}, e S_{n - k})}$ with the induced Cayley distance, the spherical Fourier coefficients have closed-form expressions:

$$\hat{f} (λ) = \frac{1}{Z} \sum_{j = 0}^{k} α_{j}^{(λ)} e^{- θ j}$$

where $α_{j}^{(λ)}$ depends on the character values of $λ$ on permutations with $j$ cycles moved within the top- $k$ positions.
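The model itself is easy to tabulate for small $n$ . The following is a minimal sketch, assuming the induced distance from a coset to the identity coset is the minimum Cayley distance over representatives; the function names are hypothetical and $Z$ is computed by direct summation rather than in closed form.

```python
import math
from itertools import permutations

def num_cycles(p):
    """Number of cycles (including fixed points) of a 0-indexed permutation tuple."""
    seen, count = set(), 0
    for start in range(len(p)):
        if start not in seen:
            count += 1
            j = start
            while j not in seen:
                seen.add(j)
                j = p[j]
    return count

def cayley(p, q):
    """Cayley distance n - cycles(p^{-1} q)."""
    inv = [0] * len(p)
    for i, v in enumerate(p):
        inv[v] = i
    return len(p) - num_cycles(tuple(inv[v] for v in q))

def coset(prefix, n):
    """All full rankings whose top-k positions equal the given prefix."""
    rest = [i for i in range(n) if i not in prefix]
    return [prefix + tail for tail in permutations(rest)]

def mallows_on_cosets(n, k, theta):
    """Return (probabilities, distances) for the Mallows model on top-k prefixes."""
    identity = coset(tuple(range(k)), n)
    prefixes = sorted({p[:k] for p in permutations(range(n))})
    dist = {x: min(cayley(p, q) for p in coset(x, n) for q in identity)
            for x in prefixes}
    weights = {x: math.exp(-theta * d) for x, d in dist.items()}
    Z = sum(weights.values())
    return {x: w / Z for x, w in weights.items()}, dist

probs, dist = mallows_on_cosets(4, 2, theta=1.0)
print(dist)
```

For positive $θ$ the identity coset is the mode, and the weights depend on the coset only through its distance $j$ to the identity coset, which is the grouping used in the sum over $j$ .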

Theorem 4 (Diaconis 1988: random walks on coset spaces). A random walk on $S_{n} / S_{n - k}$ driven by an $S_{n - k}$ -bi-invariant measure $Q$ mixes in time determined by the spectral gap $1 - β^{*}$ , where $β^{*}$ is the largest non-principal eigenvalue of the spherical Fourier transform of $Q$ . The mixing time is $O (n \log n / (1 - β^{*}))$ .
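A small exact computation shows this kind of convergence. The sketch below is an illustration under stated assumptions, not the construction used in the cited source: it takes $Q$ to be the random-transposition measure (pick items $i$ and $j$ independently and uniformly, then swap them), which is constant on conjugacy classes and hence bi-invariant, and evolves the distribution on top- $k$ prefixes for $n = 4$ , $k = 2$ .

```python
from itertools import permutations

def step(dist, n):
    """One step of the random-transposition walk, pushed down to top-k prefixes.

    Items i and j are drawn independently and uniformly (i == j leaves the
    ranking unchanged) and their labels are swapped wherever they appear.
    """
    new = {x: 0.0 for x in dist}
    for x, mass in dist.items():
        for i in range(n):
            for j in range(n):
                swap = {i: j, j: i}
                y = tuple(swap.get(item, item) for item in x)
                new[y] += mass / (n * n)
    return new

def tv_to_uniform(dist):
    """Total variation distance between dist and the uniform distribution."""
    u = 1.0 / len(dist)
    return 0.5 * sum(abs(p - u) for p in dist.values())

n, k = 4, 2
X = sorted({perm[:k] for perm in permutations(range(n))})
dist = {x: 0.0 for x in X}
dist[(0, 1)] = 1.0   # start at the identity coset

tv_history = [tv_to_uniform(dist)]
for _ in range(30):
    dist = step(dist, n)
    tv_history.append(tv_to_uniform(dist))

print(tv_history[:5])
```

The distance to uniform never increases from one step to the next, and its geometric rate of decay is what the spectral gap controls.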

Theorem 5 (Connection to Jack polynomials). The zonal spherical functions for $(S_{n}, S_{n - k})$ are expressible as specialisations of Jack polynomials with parameter $α = 1$ , evaluated at the eigenvalues of the appropriate coset-type matrices. This connects the spectral analysis of partial rankings to the broader theory of symmetric functions and hypergeometric functions of matrix argument.

This connection was developed by James 1978 and Macdonald 1995 in Symmetric Functions and Hall Polynomials.

Theorem 6 (Ewens sampling formula and partial rankings). The Ewens sampling formula on $S_{n}$ , which assigns probability proportional to $θ^{c (π)}$ where $c (π)$ is the number of cycles, has a natural interpretation as a Mallows model with Cayley distance. The induced distribution on partial rankings $S_{n} / S_{n - k}$ has spherical Fourier coefficients determined by the Stirling numbers.
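The Mallows interpretation follows from $c (π) = n - d_{C} (π, e)$ , which gives $θ^{c (π)} = θ^{n} e^{- (\log θ) d_{C} (π, e)}$ . The sketch below, with hypothetical helper names, tabulates the Ewens weights on $S_{4}$ exactly, compares the normalising constant with the rising factorial $θ (θ + 1) \cdots (θ + n - 1)$ , and pushes the distribution forward to top- $k$ prefixes.

```python
from fractions import Fraction
from itertools import permutations

def num_cycles(p):
    """Number of cycles (including fixed points) of a 0-indexed permutation tuple."""
    seen, count = set(), 0
    for start in range(len(p)):
        if start not in seen:
            count += 1
            j = start
            while j not in seen:
                seen.add(j)
                j = p[j]
    return count

def ewens_on_partial_rankings(n, k, theta):
    """Push the Ewens distribution on S_n forward to top-k prefixes.

    Returns the induced distribution and the normalising constant Z.
    """
    weights = {p: theta ** num_cycles(p) for p in permutations(range(n))}
    Z = sum(weights.values())
    pushed = {}
    for p, w in weights.items():
        pushed[p[:k]] = pushed.get(p[:k], 0) + Fraction(w) / Z
    return pushed, Z

theta = Fraction(2)
pushed, Z = ewens_on_partial_rankings(4, 2, theta)

rising_factorial = theta * (theta + 1) * (theta + 2) * (theta + 3)
print(Z, rising_factorial)
```

The agreement of `Z` with the rising factorial is where the Stirling numbers of the first kind enter: they count permutations by number of cycles.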

Synthesis. The spectral analysis of partially ranked data via Gelfand pairs provides the foundational reason that the representation-theoretic framework extends from full to partial rankings without losing its clean structure. The central insight is that the branching rule for $S_{n} ↓ S_{n - k}$ selects exactly the partitions with at most $k$ parts, and the Gelfand pair condition guarantees that the selected irreducibles appear without multiplicity. Putting these together with the zonal spherical functions, every statistical procedure for full rankings (spectral decomposition, testing, estimation) has a partial-ranking analogue obtained by restricting the spectral sum to partitions with at most $k$ rows. This is exactly the content that builds toward the exchangeability framework in 07.05.14 where the symmetric group acts on sequences and the spectral decomposition detects dependence structures. The bridge is between the combinatorics of Young diagrams with bounded row count and the statistics of top- $k$ rankings; the pattern generalises from $k = n$ (full rankings, all partitions) to $k = 1$ (only the identity and standard representations survive), identifying the spectral components that vanish as one passes from full to partial data.

Full proof set [Master]

Proposition 1 (Branching rule and multiplicity-free condition). The multiplicity of the identity representation of $S_{n - k}$ in $V^{λ} ∣_{S_{n - k}}$ equals 1 if $ℓ (λ) \leq k$ and 0 otherwise.

Proof. Apply the branching rule iteratively. The restriction $S_{n} ↓ S_{n - 1}$ decomposes $V^{λ} = ⨁_{□} V^{λ - □}$ where the sum is over removable corners of the Young diagram of $λ$ . After $k$ steps, the identity representation of $S_{n - k}$ (indexed by the single-row partition $(n - k)$ ) appears in $V^{λ} ∣_{S_{n - k}}$ iff there is a path from $λ$ to $(n - k)$ in the branching lattice. Such a path exists iff we can remove $k$ boxes from $λ$ to reach a single row of length $n - k$ , which requires removing at most one box from each row, hence $ℓ (λ) \leq k$ . When $ℓ (λ) \leq k$ , the path is unique (remove boxes from the bottom row upward), giving multiplicity 1. $□$

Proposition 2 (Orthogonality of zonal spherical functions). The zonal spherical functions $ω_{λ}$ for $ℓ (λ) \leq k$ satisfy:

$$⟨ ω_{λ}, ω_{μ} ⟩ = \frac{∣ S_{n} ∣}{d_{λ} ∣ S_{n - k} ∣^{2}} δ_{λ μ} .$$

Proof. The spherical functions are obtained from the matrix coefficients of the irreducible representations by averaging over $K = S_{n - k}$ . By the Schur orthogonality relations and the fact that the decomposition is multiplicity-free, the $K$ -averaged matrix coefficients for distinct irreducibles are orthogonal. The norm is computed from the inner product formula:

$$⟨ ω_{λ}, ω_{μ} ⟩ = \frac{1}{∣ S_{n} ∣} \sum_{π \in S_{n}} ω_{λ} (π) \overline{ω_{μ} (π)} = \frac{1}{d_{λ}^{2} ∣ S_{n - k} ∣^{2}} \sum_{π} \sum_{k_{1}, k_{2}} χ_{λ} (π k_{1}) \overline{χ_{μ} (π k_{2})} .$$

The double sum over $K$ collapses by orthogonality to $δ_{λ μ} \cdot ∣ S_{n} ∣/ d_{λ} \cdot ∣ S_{n - k} ∣^{2} /∣ S_{n - k} ∣^{2} = δ_{λ μ} ∣ S_{n} ∣/ d_{λ}$ . Adjusting for the normalisation gives the stated result. $□$

Connections [Master]

Spectral analysis of permutation data 07.05.11. The partial ranking framework is a direct extension of the full spectral analysis in 07.05.11. Setting $k = n$ recovers the full spectral decomposition of $L^{2} (S_{n})$ with all partitions; the Gelfand pair perspective specialises to the group algebra perspective when $K = \{e\}$ .
Metrics on the symmetric group 07.05.12. The metrics on $S_{n}$ from 07.05.12 extend to partial rankings via the quotient metric: $d_{X} (π S_{n - k}, σ S_{n - k}) = min_{τ_{1}, τ_{2} \in S_{n - k}} d (π τ_{1}, σ τ_{2})$ . The character-theoretic expressions for these quotient metrics use the same spherical functions that govern the spectral decomposition.
Random walk upper bound lemma 07.05.05. The Upper Bound Lemma generalises to coset spaces: the total variation distance between a random walk on $S_{n} / S_{n - k}$ and the uniform distribution is bounded by the spherical Fourier coefficients, paralleling the character-sum bound in 07.05.05 but using zonal spherical functions instead of irreducible characters.
De Finetti and exchangeability 07.05.14. The Gelfand pair structure that makes partial ranking analysis clean is the same symmetric group action that underlies exchangeability. The exchangeability theorem of de Finetti identifies distributions invariant under $S_{n}$ , and the partial ranking analysis identifies functions on $S_{n}$ that are invariant under right multiplication by the subgroup $S_{n - k}$ , which are exactly the functions on cosets.
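The quotient metric in the list above can be computed directly for small cases. This is a minimal sketch assuming the Cayley distance as the base metric $d$ and 0-indexed tuples for rankings; `quotient_distance` is a hypothetical helper name.

```python
from itertools import permutations

def num_cycles(p):
    """Number of cycles (including fixed points) of a 0-indexed permutation tuple."""
    seen, count = set(), 0
    for start in range(len(p)):
        if start not in seen:
            count += 1
            j = start
            while j not in seen:
                seen.add(j)
                j = p[j]
    return count

def cayley(p, q):
    """Cayley distance n - cycles(p^{-1} q)."""
    inv = [0] * len(p)
    for i, v in enumerate(p):
        inv[v] = i
    return len(p) - num_cycles(tuple(inv[v] for v in q))

def quotient_distance(x, y, n):
    """Minimum Cayley distance between full rankings extending prefixes x and y."""
    def coset(prefix):
        rest = [i for i in range(n) if i not in prefix]
        return [prefix + tail for tail in permutations(rest)]
    return min(cayley(p, q) for p in coset(x) for q in coset(y))

# Two top-2 rankings of 4 items that differ by swapping the first two places.
d_swap = quotient_distance((2, 0), (0, 2), 4)
d_same = quotient_distance((2, 0), (2, 0), 4)
print(d_swap, d_same)
```

Enumerating both cosets costs $((n - k)!)^{2}$ distance evaluations, so this brute-force form is only practical for small $n - k$ .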

Historical & philosophical context [Master]

Diaconis developed the spectral analysis of partially ranked data in his 1989 paper A Generalization of Spectral Analysis ^{[Diaconis1989]}, identifying the Gelfand pair $(S_{n}, S_{n - k})$ as the structural reason the analysis remains explicit. The Gelfand pair theory was developed by Gelfand in the 1950s in the context of spherical functions on Lie groups, and adapted to finite groups by Curtis and Reiner, and by Delsarte 1973 in the context of association schemes.

The connection to Jack polynomials and symmetric function theory was developed by James 1978 and by Macdonald 1995 ^{[Macdonald1995]} in Symmetric Functions and Hall Polynomials, who showed that the zonal spherical functions for $(S_{n}, S_{n - k})$ are special cases of the more general framework of zonal polynomials. Marden 1995 ^{[Marden1995]} provided the comprehensive statistical treatment in Analyzing and Modeling Rank Data, covering both full and partial ranking models.

Bibliography [Master]

@article{Diaconis1989partial,
  author = {Diaconis, Persi},
  title = {A Generalization of Spectral Analysis},
  journal = {J. Amer. Statist. Assoc.},
  volume = {84},
  year = {1989},
  pages = {694--701},
}

@book{Diaconis1988partial,
  author = {Diaconis, Persi},
  title = {Group Representations in Probability and Statistics},
  publisher = {Institute of Mathematical Statistics},
  year = {1988},
  series = {IMS Lecture Notes--Monograph Series},
  volume = {11},
}

@book{Marden1995partial,
  author = {Marden, John I.},
  title = {Analyzing and Modeling Rank Data},
  publisher = {Chapman \& Hall},
  year = {1995},
}

@book{Macdonald1995,
  author = {Macdonald, I. G.},
  title = {Symmetric Functions and Hall Polynomials},
  publisher = {Oxford University Press},
  year = {1995},
  edition = {2nd},
}