40.03.02 · combinatorics / symmetric-functions-rsk

Schur Functions: the Combinatorial Definition and the Jacobi-Trudi Determinant

shipped3 tiersLean: none

Anchor (Master): Macdonald 1995 *Symmetric Functions and Hall Polynomials* 2e (Oxford) Ch. I §3, §5 (the Schur functions as the bialternant, Jacobi-Trudi and the dual Nägelsbach-Kostka identity, the Pieri and Littlewood-Richardson rules, skew Schur functions, the Hall inner product and orthonormality); Stanley 1999 *Enumerative Combinatorics, Volume 2* Ch. 7 and Appendix 2; Sagan 2001 *The Symmetric Group* 2e Ch. 4; Cauchy 1815 (the bialternant antisymmetric quotient); Jacobi 1841 *Crelle* 22 (the determinant of alternants); Schur 1901 (the identification with $\mathrm{GL}_n$ characters)

Intuition Beginner

Picture a staircase of empty boxes — a few boxes in the top row, no more in the second row, fewer still further down. This staircase shape is called a Young diagram, and its row lengths spell out a partition of a number, met in the previous unit. Now fill each box with a positive whole number, following two house rules: numbers never decrease as you read a row left to right, and they strictly increase as you read a column top to bottom. A filling obeying both rules is a semistandard tableau. The Schur function is the bookkeeping device that records every legal filling at once.

Why build such a thing? Each filling is a recipe for a single product of variables: a box holding the number $3$ contributes one factor of the third variable $x_{3}$ . Multiply across all boxes and you get one term; add up the terms over every legal filling and you get the Schur function of that shape. It is a sum that remembers, for each pattern of variable-counts, how many fillings produce it. The shape is the input; the symmetric expression is the output.

The pay-off is that these expressions are the single most important family in the whole subject. They are symmetric — relabelling the variables leaves them unchanged — even though the filling rules look lopsided, treating rows and columns differently. That hidden symmetry is a small miracle, and proving it is the first real theorem here. Once you have it, the Schur functions become the natural alphabet: every symmetric expression spells out uniquely in them, and they are the ones that count fillings, measure dimensions, and translate into the language of group symmetry later in the curriculum.

Visual Beginner

A semistandard filling of the staircase shape with rows of length $2$ and $1$ . Reading rule one (rows): left-to-right values do not decrease. Reading rule two (columns): top-to-bottom values strictly increase. The variable-record of a filling multiplies one $x$ per box, indexed by the box's number.

shape (boxes)	a legal filling	the record it contributes
row of 2, row of 1	top row $11$ , bottom row $2$	$x_{1} \cdot x_{1} \cdot x_{2} = x_{1}^{2} x_{2}$
row of 2, row of 1	top row $12$ , bottom row $2$	$x_{1} \cdot x_{2} \cdot x_{2} = x_{1} x_{2}^{2}$
row of 2, row of 1	top row $12$ , bottom row $3$	$x_{1} x_{2} x_{3}$

Notice the two rules pull in opposite directions: rows allow repeats, columns forbid them. The forbidding is what keeps the count finite in each shape and degree. If you collect every legal filling using only the variables $x_{1}, x_{2}, x_{3}$ and add their records, the messy-looking sum turns out to be symmetric: swapping the roles of $x_{1}$ and $x_{2}$ throughout permutes the fillings among themselves and leaves the total alone.

Worked example Beginner

Let us compute the Schur function of the shape with a single row of two boxes, using only two variables $x_{1}$ and $x_{2}$ , and watch the symmetry appear.

Step 1. List the legal fillings. One row of two boxes needs values that do not decrease left to right, with no column rule to worry about (each column has one box). With entries from ${1, 2}$ the choices are: $11$ , then $12$ , then $22$ . That is three fillings.

Step 2. Record each filling. The filling $11$ gives $x_{1} \cdot x_{1} = x_{1}^{2}$ . The filling $12$ gives $x_{1} \cdot x_{2}$ . The filling $22$ gives $x_{2}^{2}$ .

Step 3. Add. The Schur function of this shape is $x_{1}^{2} + x_{1} x_{2} + x_{2}^{2}$ .

Step 4. Test the symmetry. Swap the names $x_{1}$ and $x_{2}$ everywhere. The sum becomes $x_{2}^{2} + x_{2} x_{1} + x_{1}^{2}$ , which is the same three terms in a different order. The total did not change, so this expression is symmetric.

Step 5. Recognise it. The expression $x_{1}^{2} + x_{1} x_{2} + x_{2}^{2}$ is exactly the complete function $h_{2}$ in two variables from the previous unit. A single row of $r$ boxes always reproduces the complete function $h_{r}$ , and a single column of $r$ boxes reproduces the elementary function $e_{r}$ .

What this tells us: the Schur functions are not new strangers — the simplest shapes give back the building blocks already met. The single-row and single-column shapes are the complete and elementary functions; the interesting shapes are the staircases in between, and the rules that knit rows and columns together are what make the whole family symmetric.

Check your understanding Beginner

Formal definition Intermediate+

Work in the ring $Λ$ of symmetric functions over $Z$ of 40.03.01; all notation ( $m_{λ}, e_{λ}, h_{λ}, p_{λ}$ , the dominance order $⊵$ , the conjugate $λ^{'}$ , $z_{λ}$ , the involution $ω$ ) is inherited from that unit and recorded in _meta/NOTATION.md. Fix a partition $λ = (λ_{1} \geq \dots \geq λ_{ℓ} > 0)$ with Young diagram the array of cells $(i, j)$ with $1 \leq j \leq λ_{i}$ .

Definition (semistandard Young tableau). A semistandard Young tableau (SSYT) of shape $λ$ is a filling $T$ of the cells of $λ$ with entries from ${1, 2, 3, \dots}$ that is weakly increasing along each row ( $T (i, j) \leq T (i, j + 1)$ ) and strictly increasing down each column ( $T (i, j) < T (i + 1, j)$ ). Its content is the vector $(μ_{1}, μ_{2}, \dots)$ with $μ_{k} = # {cells holding k}$ , and its weight monomial is $x^{T} = \prod_{(i, j) \in λ} x_{T (i, j)} = x_{1}^{μ_{1}} x_{2}^{μ_{2}} \dots$ . Write $SSYT (λ)$ for the set of all such fillings (entries unbounded) and $SSYT (λ, n)$ for those with entries in ${1, \dots, n}$ .

Definition (Schur function, combinatorial). The Schur function of shape $λ$ is the formal sum $$ s_\lambda ;=; \sum_{T \in \mathrm{SSYT}(\lambda)} x^T ;\in; \mathbb{Z}[[x_1, x_2, \dots]], $$ homogeneous of degree $∣ λ ∣$ . The single-row shape $(r)$ gives $s_{(r)} = h_{r}$ ; the single-column shape $(1^{r})$ gives $s_{(1^{r})} = e_{r}$ . The skew Schur function $s_{λ / μ}$ for $μ \subseteq λ$ is the same sum over SSYT of the skew shape $λ / μ$ (the cells of $λ$ not in $μ$ ), with the identical row/column conditions.

Definition (the bialternant). In $n$ variables, set $δ = (n - 1, n - 2, \dots, 1, 0)$ and for a sequence $α = (α_{1}, \dots, α_{n})$ define the alternant $a_{α} = det (x_{i}^{α_{j}})_{1 \leq i, j \leq n}$ . The alternant $a_{δ} = det (x_{i}^{n - j}) = \prod_{i < j} (x_{i} - x_{j})$ is the Vandermonde determinant. The bialternant definition of the Schur polynomial is the ratio $$ s_\lambda(x_1, \dots, x_n) ;=; \frac{a_{\lambda + \delta}}{a_\delta} ;=; \frac{\det!\big(x_i^{\lambda_j + n - j}\big)}{\det!\big(x_i^{n - j}\big)}, $$ defined for partitions with at most $n$ parts. Each $a_{λ + δ}$ is alternating (a transposition of two variables changes its sign), hence divisible by $a_{δ}$ , so the quotient is a genuine polynomial. The Key theorem proves it equals the SSYT sum restricted to entries $\leq n$ .

Kostka expansion. Collecting SSYT by content gives $s_{λ} = \sum_{μ} K_{λ μ} m_{μ}$ , where the Kostka number $K_{λ μ} = # {T \in SSYT (λ) : content (T) = μ}$ counts SSYT of shape $λ$ and content $μ$ . One has $K_{λλ} = 1$ (the unique filling of row $i$ entirely by $i$ ) and $K_{λ μ} = 0$ unless $λ ⊵ μ$ in dominance order, so the transition ${s_{λ}} \to {m_{μ}}$ is unitriangular and ${s_{λ} : λ ⊢ d}$ is a $Z$ -basis of $Λ^{d}$ ^{[Macdonald §3]}.

Counterexamples to common slips Intermediate+

"The SSYT rules are symmetric in rows and columns." They are not: rows are weak ( $\leq$ ), columns are strict ( $<$ ). This asymmetry is exactly what makes a single row give $h_{r}$ (repetition allowed) and a single column give $e_{r}$ (repetition forbidden). The symmetry that does hold is of the output $s_{λ}$ under permuting variables, not of the rules.
" $s_{λ} (x_{1}, \dots, x_{n})$ is nonzero for every partition $λ$ ." In $n$ variables, $s_{λ} = 0$ as soon as $λ$ has more than $n$ parts: a column of height exceeding $n$ has no strictly increasing filling from ${1, \dots, n}$ , and correspondingly $λ + δ$ then has a repeated entry, killing the alternant.
"Jacobi-Trudi needs $λ$ and the matrix to be square of size $ℓ (λ)$ ." The determinant $det (h_{λ_{i} - i + j})$ may be taken over any $N \times N$ block with $N \geq ℓ (λ)$ ; padding $λ$ by trailing zeros adds rows/columns that are $δ_{ij}$ on the diagonal ( $h_{0} = 1$ ) and leave the determinant unchanged. The convention $h_{k} = 0$ for $k < 0$ is essential.

Key theorem with proof Intermediate+

The signature theorem is that the three definitions of $s_{λ}$ coincide: the combinatorial SSYT sum is symmetric, equals the bialternant, and equals the Jacobi-Trudi determinant in the complete functions.

Theorem (symmetry, bialternant, and Jacobi-Trudi). For every partition $λ$ with $ℓ (λ) \leq n$ , $$ \sum_{T \in \mathrm{SSYT}(\lambda, n)} x^T ;=; \frac{a_{\lambda+\delta}}{a_\delta} ;=; \det!\big(h_{\lambda_i - i + j}\big){1 \le i, j \le n}, $$ *with the convention $h_{0} = 1$ , $h_{k} = 0$ for $k < 0$ . In particular $s\lambda $i s a sy mm e t r i c p o l y n o mia l, an d i t ss t ab l e ($ n \to \infty $) l imi t i s t h esy mm e t r i c f u n c t i o n$ s_\lambda \in \Lambda $. D u a l l y,$ s_\lambda = \det(e_{\lambda'_i - i + j})$.* ^{[Stanley §7.16]}

Proof. Three identifications.

(i) Symmetry of the SSYT sum. It suffices to show $s_{λ}$ is invariant under each adjacent transposition $s_{k} = (k, k + 1)$ of variables, i.e. that swapping the exponents of $x_{k}$ and $x_{k + 1}$ permutes SSYT among themselves. The Bender-Knuth involution $β_{k}$ does this. In an SSYT $T$ , restrict attention to entries equal to $k$ or $k + 1$ . In each column at most one such entry occurs that is "free" — a $k$ with no $k + 1$ directly below in the same column-pair, or vice versa — while $k$ -over- $(k + 1)$ vertical pairs are "fixed". In each row the free $k$ s and free $(k + 1)$ s form a block $k^{a} (k + 1)^{b}$ ; replace it by $k^{b} (k + 1)^{a}$ . This swap preserves the row-weak and column-strict conditions (the fixed pairs guard the columns) and exchanges the multiplicities of $k$ and $k + 1$ , so $β_{k}$ is an involution on $SSYT (λ)$ with $x^{β_{k} (T)} = s_{k} \cdot x^{T}$ . Hence $s_{k}$ fixes the sum, and as the $s_{k}$ generate $S_{n}$ , $s_{λ}$ is symmetric.

(ii) Bialternant equals the SSYT sum. Both sides are symmetric polynomials of degree $∣ λ ∣$ ; compare them after multiplying by $a_{δ}$ . Expand $a_{λ + δ} = \sum_{w \in S_{n}} sgn (w) x^{w (λ + δ)}$ and $a_{δ} = \sum_{w} sgn (w) x^{w δ}$ . A Lindström-type sign-reversing involution on pairs (lattice point configurations / column-strict fillings carrying a sign) cancels all cross terms except those assembling a column-strict tableau: writing $a_{λ + δ} = a_{δ} \cdot (\sum_{T} x^{T})$ amounts to the Gessel-Viennot model below read in the variables $x_{i}$ . Concretely, the quotient $a_{λ + δ} / a_{δ}$ is the Weyl character of $GL_{n}$ at highest weight $λ$ , and the Weyl character is the trace over a basis indexed by SSYT (Gelfand-Tsetlin patterns), giving the same monomial sum.

(iii) Jacobi-Trudi by nonintersecting paths. Use the Lindström-Gessel-Viennot lemma of 40.01.04: in an acyclic weighted digraph, the signed sum over $N$ -tuples of paths between sources $A_{i}$ and sinks $B_{j}$ equals $det (M)$ with $M_{ij}$ the path-generating function $A_{i} \to B_{j}$ , and only the nonintersecting tuples survive. Take the planar lattice $Z^{2}$ with east and north steps, sources $A_{i} = (1 - i, 0)$ and sinks $B_{j} = (λ_{j} + N - j, N - 1)$ , weighting each east step at height $r$ by $x_{r}$ . The generating function $A_{i} \to B_{j}$ is then $h_{λ_{j} - j + i}$ (a single row of a tableau), so the LGV determinant is $det (h_{λ_{j} - j + i}) = det (h_{λ_{i} - i + j})^{T}$ . A nonintersecting tuple of such paths is precisely an SSYT of shape $λ$ (the $i$ -th path traces the cells holding entries $\geq i$ , the heights of its east steps reading off the columns), so the determinant equals $\sum_{T} x^{T} = s_{λ}$ . Applying $ω$ (which sends $h_{k} \mapsto e_{k}$ and, as shown in Advanced results, $s_{λ} \mapsto s_{λ^{'}}$ ) to the row form yields the dual column form $s_{λ} = det (e_{λ_{i}^{'} - i + j})$ . $□$

Bridge. This theorem is the foundational reason the Schur functions are inevitable: it shows that the combinatorial, the determinantal, and the representation-theoretic descriptions are one object, so any property provable in one language transfers to the others. It builds toward the Pieri and Littlewood-Richardson rules of Advanced results, where the SSYT picture controls the structure constants, and it appears again in the Hall inner product of 40.03.03, under which these same $s_{λ}$ are orthonormal. The bialternant side is exactly the Weyl character formula, which is dual to the Jacobi-Trudi side through the Vandermonde $a_{δ}$ ; this is the central insight that the antisymmetrisation $\sum_{w} sgn (w)$ in $a_{λ + δ}$ and the sign-reversing involution of the LGV lemma are the same cancellation seen twice. Putting these together, the SSYT definition supplies positivity (a sum of monomials), the bialternant supplies symmetry and the basis property, and Jacobi-Trudi supplies the algebra inside $Λ = Z [h_{1}, h_{2}, \dots]$ — three faces of one function.

Exercises Intermediate+

Exercise 6 (hard, symbolic).

Prove the general Pieri rule $s_{λ} h_{r} = \sum_{μ} s_{μ}$ , the sum over partitions $μ \supseteq λ$ with $∣ μ / λ ∣ = r$ and $μ / λ$ a horizontal strip (no two added cells in the same column).

Hint

Expand $s_{λ} h_{r}$ as a sum over SSYT of shape $λ$ times a single row of $r$ boxes, and use a tableau-insertion (or the SSYT-of-skew-shape) argument to glue the row onto $λ$ in all horizontal-strip ways.

Answer

Write $h_{r} = s_{(r)} = \sum_{T_{0}} x^{T_{0}}$ over single rows $T_{0}$ of length $r$ , and $s_{λ} = \sum_{T} x^{T}$ . A product term $x^{T} x^{T_{0}}$ is recorded by overlaying: insert the $r$ entries of $T_{0}$ into $λ$ to extend it to a shape $μ$ with $μ / λ$ filled column-strict-compatibly. Because $T_{0}$ is a single weakly increasing row, the added cells in any fixed column number at most one (a second cell in a column would force a strict column repeat or violate weak-row order on insertion), so $μ / λ$ is a horizontal strip of size $r$ . Conversely every SSYT of skew shape $μ / λ$ that is a horizontal strip arises uniquely, and the content bookkeeping gives $s_{μ / λ} = h_{r}$ -coefficient $= 1$ for each such $μ$ . Hence $s_{λ} h_{r} = \sum_{μ} s_{μ}$ over horizontal strips. (Equivalently, via the Hall inner product of 40.03.03, $⟨ s_{λ} h_{r}, s_{μ} ⟩ = ⟨ s_{λ}, s_{μ} h_{r}^{⊥} ⟩$ counts horizontal strips.) Rubric: full credit for the row-overlay argument, the horizontal-strip characterisation, and the multiplicity-one conclusion.

Exercise 7 (hard, symbolic).

Prove that ${s_{λ} : λ ⊢ d}$ is a $Z$ -basis of $Λ^{d}$ by establishing the Kostka unitriangularity $s_{λ} = m_{λ} + \sum_{μ ⊲ λ} K_{λ μ} m_{μ}$ .

Hint

Show $K_{λλ} = 1$ and $K_{λ μ} = 0$ unless $λ ⊵ μ$ . For the dominance bound, sum the column-strict inequalities over the top $j$ rows.

Answer

By definition $s_{λ} = \sum_{μ} K_{λ μ} m_{μ}$ with $K_{λ μ} = # {T \in SSYT (λ) : content (T) = μ}$ . The unique SSYT of content $λ$ fills row $i$ entirely with $i$ (column-strictness forces row $i$ 's entries to exceed row $(i - 1)$ 's, and the content budget leaves no slack), so $K_{λλ} = 1$ . For any $T$ of content $μ$ , the entries $\leq j$ occupy cells confined to the first $j$ rows (column-strictness: an entry $\leq j$ in row $i$ forces distinct smaller entries above it, using up rows), so $μ_{1} + \dots + μ_{j} = # {cells \leq j} \leq λ_{1} + \dots + λ_{j}$ for all $j$ ; that is $λ ⊵ μ$ . Hence $K_{λ μ} = 0$ unless $λ ⊵ μ$ , and the transition matrix from ${s_{λ}}$ to ${m_{μ}}$ is unitriangular for the dominance order with $1$ 's on the diagonal. An integer unitriangular matrix is invertible over $Z$ , and ${m_{μ}}$ is a $Z$ -basis of $Λ^{d}$ (40.03.01), so ${s_{λ}}$ is too. Rubric: full credit for $K_{λλ} = 1$ , the dominance bound via the row-confinement of small entries, and the unitriangular-invertibility conclusion.

Advanced results Master

The Schur basis carries the multiplicative and inner-product structure of $Λ$ , and the determinantal formulas extend to skew shapes, specialisations, and the Cauchy kernel.

Theorem 1 (Pieri, dual Pieri, and Littlewood-Richardson). The complete and elementary functions act on the Schur basis by adding strips: $$ s_\lambda, h_r = \sum_{\mu} s_\mu \ \ (\mu/\lambda \text{ a horizontal } r\text{-strip}), \qquad s_\lambda, e_r = \sum_{\mu} s_\mu \ \ (\mu/\lambda \text{ a vertical } r\text{-strip}). $$ These are the $ν = (r)$ and $ν = (1^{r})$ cases of the general structure constants $s_{λ} s_{μ} = \sum_{ν} c_{λ μ}^{ν} s_{ν}$ , where $c_{λ μ}^{ν}$ is the Littlewood-Richardson coefficient counting LR skew tableaux of shape $ν / λ$ and content $μ$ ; the $c_{λ μ}^{ν}$ are non-negative integers, developed in the co-produced 40.03.05. The two Pieri rules are conjugate to each other under $ω$ , since $ω (h_{r}) = e_{r}$ and $ω (s_{λ}) = s_{λ^{'}}$ ^{[Macdonald §5]}.

Theorem 2 (skew Jacobi-Trudi and Giambelli). For $μ \subseteq λ$ the skew Schur function satisfies the determinantal formulas $$ s_{\lambda/\mu} = \det!\big(h_{\lambda_i - \mu_j - i + j}\big){1 \le i,j \le \ell(\lambda)} = \det!\big(e{\lambda'i - \mu'j - i + j}\big), $$ again by the LGV lemma with sources displaced by $μ$ . The expansion $s{\lambda/\mu} = \sum\nu c^\lambda_{\mu\nu} s_\nu $ha s t h es am e L i ttl e w oo d - R i c ha r d so n coe f f i c i e n t s, sos k e w S c h u r f u n c t i o n s a r e S c h u r - p os i t i v e . T h e * * G iamb e l l i (h oo k) d e t er minan t * * w r i t es an y$ s_\lambda $in F r o b e ni u s h oo k coor d ina t es$ \lambda = (a_1, \dots, a_d \mid b_1, \dots, b_d) $a s$ s_\lambda = \det\big(s_{(a_i \mid b_j)}\big)$, a determinant of single-hook Schur functions — the Plücker/Schubert-calculus form.

Theorem 3 (Cauchy identity and orthonormality). The Schur functions diagonalise the Cauchy kernel: $$ \prod_{i,j} \frac{1}{1 - x_i y_j} = \sum_{\lambda} s_\lambda(x), s_\lambda(y), \qquad \prod_{i,j}(1 + x_i y_j) = \sum_\lambda s_\lambda(x), s_{\lambda'}(y). $$ Under the Hall inner product of the co-produced 40.03.03 — defined by $⟨ h_{λ}, m_{μ} ⟩ = δ_{λ μ}$ , equivalently $⟨ p_{λ}, p_{μ} ⟩ = z_{λ} δ_{λ μ}$ — the Cauchy identity is equivalent to $⟨ s_{λ}, s_{μ} ⟩ = δ_{λ μ}$ : the Schur functions are an orthonormal $Z$ -basis, the unique (up to sign) basis that is both orthonormal and unitriangular against ${m_{μ}}$ . The involution $ω$ is then an isometry with $ω (s_{λ}) = s_{λ^{'}}$ .

Theorem 4 (principal specialisation, hook-content and hook-length). Setting $x_{i} = q^{i - 1}$ for $i = 1, \dots, n$ and the rest $0$ gives the principal specialisation $$ s_\lambda(1, q, q^2, \dots, q^{n-1}) = q^{,n(\lambda)} \prod_{(i,j) \in \lambda} \frac{1 - q^{,n + c(i,j)}}{1 - q^{,h(i,j)}}, $$ the hook-content formula, where $c (i, j) = j - i$ is the content of a cell, $h (i, j)$ its hook length, and $n (λ) = \sum_{i} (i - 1) λ_{i}$ . Letting $q \to 1$ recovers $s_{λ} (1^{n}) = \prod_{(i, j)} \frac{n + c ( i , j )}{h ( i , j )}$ , the number of SSYT of shape $λ$ with entries $\leq n$ , i.e. the dimension of the irreducible $GL_{n}$ -representation; and the standard-tableau count $f^{λ} = \frac{∣ λ ∣ !}{\prod _{(i, j)} h ( i , j )}$ is the hook-length formula for the symmetric-group irreducible, cross-referenced to 07.05.02. The coefficient of the lowest power, $q^{n (λ)}$ , is the charge/cocharge statistic that recurs in the Kostka-Foulkes refinement.

Synthesis. Putting these together, the Schur basis is the fixed point at which every structure on $Λ$ becomes diagonal. The combinatorial definition makes each $s_{λ}$ manifestly Schur-positive and the Kostka unitriangularity makes them a $Z$ -basis; the Cauchy identity of Theorem 3 is exactly the statement that this basis is orthonormal for the Hall inner product of 40.03.03, so the foundational reason the $s_{λ}$ dominate the subject is that they are the self-dual basis, and $ω$ — conjugation of diagrams in 40.03.01 — is precisely the isometry $s_{λ} \mapsto s_{λ^{'}}$ . The central insight is that the antisymmetrisation of the bialternant, the sign-reversing cancellation of the LGV proof of Jacobi-Trudi, and the orthogonality of characters are one phenomenon: the Weyl character formula is dual to the Jacobi-Trudi determinant through the Vandermonde, and this is exactly why the Pieri rules of Theorem 1 (multiplication by $h_{r}$ and $e_{r}$ ) are conjugate under $ω$ and why the Littlewood-Richardson coefficients are symmetric. The bridge is the Frobenius characteristic of the co-produced 40.03.06: it carries the irreducible $S_{n}$ -characters to the $s_{λ}$ , so the hook-length formula of Theorem 4 — counting standard tableaux — generalises into the principal specialisation, and the entire determinantal calculus developed here reappears as the character theory of 07.05.02 and the Murnaghan-Nakayama rule of 07.05.10. The same diagram conjugation that defines $ω$ in 40.03.01 is the duality threading Pieri, Cauchy, and the hook-content formula.

Full proof set Master

Proposition 1 (the SSYT sum is symmetric). For every partition $λ$ , $s_{λ} = \sum_{T \in SSYT (λ)} x^{T}$ is a symmetric function.

Proof. Fix $k \geq 1$ ; we produce an involution $β_{k}$ on $SSYT (λ)$ exchanging the multiplicities of $k$ and $k + 1$ while fixing all others, which gives invariance under the adjacent transposition $s_{k}$ and hence (as the $s_{k}$ generate the symmetric group) full symmetry. In $T$ , call a cell holding $k$ or $k + 1$ paired if it lies directly above a $k + 1$ (for a $k$ ) or directly below a $k$ (for a $k + 1$ ) in the same column; paired cells form vertical $(k + 1 k)$ dominoes and are left untouched. The remaining free $k$ s and $(k + 1)$ s lie in horizontal runs within rows: in a given row the free entries equal to $k$ or $k + 1$ form a contiguous block $k^{a} (k + 1)^{b}$ (any intervening other value would separate them into independent runs, handled identically). Replace each such run by $k^{b} (k + 1)^{a}$ . Row-weak monotonicity is preserved because the run was the only place $k, k + 1$ were adjacent and free; column-strictness is preserved because the paired dominoes that could conflict were excluded from the free set. The map is an involution ( $a \leftrightarrow b$ twice returns the original) and replaces $μ_{k}, μ_{k + 1}$ by $μ_{k + 1}, μ_{k}$ , so $x^{β_{k} T} = s_{k} \cdot x^{T}$ . Summing, $s_{k} (s_{λ}) = s_{λ}$ . $□$

Proposition 2 (Jacobi-Trudi via Lindström-Gessel-Viennot). With $h_{0} = 1$ , $h_{m} = 0$ for $m < 0$ , and $N \geq ℓ (λ)$ , $s_{λ} = det (h_{λ_{i} - i + j})_{1 \leq i, j \leq N}$ .

Proof. Consider the directed lattice on $Z^{2}$ with unit east and north steps; weight an east step from $(\cdot, r)$ to $(\cdot + 1, r)$ by $x_{r + 1}$ , so a path's weight is $\prod x$ over its east steps. Place sources $A_{i} = (1 - i, 0)$ and sinks $B_{j} = (λ_{j} - j + 1, N - 1)$ for $1 \leq i, j \leq N$ (padding $λ$ with zeros to length $N$ ). The generating function of all paths $A_{i} \to B_{j}$ is the complete homogeneous function $h_{(λ_{j} - j + 1) - (1 - i)} = h_{λ_{j} - j + i}$ , since a monotone path making $m$ east steps across heights records a weakly increasing length- $m$ word, whose weight sum is $h_{m}$ . The Lindström-Gessel-Viennot lemma 40.01.04 gives $$ \sum_{w \in S_N} \mathrm{sgn}(w) \prod_{i} (\text{paths } A_i \to B_{w(i)}) = \det\big(h_{\lambda_j - j + i}\big) = \sum_{(P_1,\dots,P_N) \text{ nonintersecting}} \mathrm{sgn}(\sigma), x^{P_1}\cdots x^{P_N}, $$ and for this source/sink placement only the identity permutation admits nonintersecting tuples, all with sign $+ 1$ . A nonintersecting tuple is in content-preserving bijection with an SSYT of shape $λ$ : path $P_{i}$ contributes the east-step heights as the entries of the cells in the $i$ -th row read against the columns, the nonintersection condition translating exactly into the column-strict, row-weak conditions. Therefore $det (h_{λ_{j} - j + i}) = \sum_{T} x^{T} = s_{λ}$ , and transposing the matrix (a determinant is invariant under transpose) gives the stated $det (h_{λ_{i} - i + j})$ . $□$

Proposition 3 (bialternant equals Jacobi-Trudi). In $n$ variables, $a_{λ + δ} / a_{δ} = det (h_{λ_{i} - i + j})_{1 \leq i, j \leq n}$ .

Proof. Use the generating-function identity $\sum_{m} h_{m} t^{m} = \prod_{i = 1}^{n} (1 - x_{i} t)^{- 1} = 1/ H_{n} (t)^{- 1}$ , where here $H_{n} (t)^{- 1} = \prod_{i} (1 - x_{i} t)$ . In matrix form, the complete functions are the entries of the inverse of the Vandermonde-style matrix built from the $e$ 's; concretely, the Jacobi-Trudi determinant arises by Cramer's rule. Form the $n \times n$ matrices $X = (x_{i}^{n - j})$ (so $det X = a_{δ}$ ) and $X^{(λ)} = (x_{i}^{λ_{j} + n - j})$ (so $det X^{(λ)} = a_{λ + δ}$ ). Expanding $\prod_{k} (1 - x_{i} t)^{- 1}$ shows that the matrix $H = (h_{i - j})_{1 \leq i, j}$ (lower-triangular-unipotent in the appropriate truncation) satisfies $X^{(λ)} = X \cdot M_{λ}$ in each graded piece, where $M_{λ}$ is the matrix with $(M_{λ})_{j k} = h_{λ_{k} - k + j}$ shifted by the staircase; taking determinants, $a_{λ + δ} = a_{δ} \cdot det (h_{λ_{i} - i + j})$ . Equivalently, both sides are symmetric, agree in the stable limit by Proposition 2, and the finite-variable identity follows by setting $x_{n + 1} = \dots = 0$ , under which $h_{m}$ specialises correctly and a column of $λ$ exceeding $n$ kills both sides. Hence the ratio equals the determinant. $□$

Proposition 4 (Pieri rule). $s_{λ} h_{r} = μ : μ / λ horizontal r -strip \sum s_{μ}$ .

Proof. Pair the Jacobi-Trudi expression with the Hall inner product of 40.03.03, under which ${s_{μ}}$ is orthonormal, so the coefficient of $s_{μ}$ in $s_{λ} h_{r}$ is $⟨ s_{λ} h_{r}, s_{μ} ⟩ = ⟨ s_{λ}, h_{r}^{⊥} s_{μ} ⟩$ . The adjoint $h_{r}^{⊥}$ of multiplication by $h_{r}$ acts as the skewing operator $s_{μ} \mapsto s_{μ / (r)} = \sum_{ν} (coeff) s_{ν}$ ; by the dual-basis property $⟨ s_{λ}, s_{μ / (r)} ⟩$ is the multiplicity of $s_{λ}$ in $s_{μ / (r)}$ , which equals $1$ if $μ / λ$ is a horizontal $r$ -strip and $0$ otherwise. Combinatorially: $s_{λ} h_{r} = \sum_{T, T_{0}} x^{T} x^{T_{0}}$ over SSYT $T$ of shape $λ$ and single rows $T_{0}$ of length $r$ ; the RSK-free insertion of $T_{0}$ 's weakly increasing row onto $λ$ produces exactly the SSYT of skew shapes $μ / λ$ that are horizontal strips, each once, since two inserted cells in one column would force a strict-column violation. Grouping by $μ$ gives $\sum_{μ} s_{μ}$ over horizontal $r$ -strips. $□$

Proposition 5 (orthonormality from Cauchy). Under $⟨ h_{λ}, m_{μ} ⟩ = δ_{λ μ}$ , one has $⟨ s_{λ}, s_{μ} ⟩ = δ_{λ μ}$ .

Proof. The Cauchy kernel factors two ways: $\prod_{i, j} (1 - x_{i} y_{j})^{- 1} = \sum_{λ} h_{λ} (x) m_{λ} (y) = \sum_{λ} m_{λ} (x) h_{λ} (y)$ , exhibiting ${h_{λ}}$ and ${m_{μ}}$ as dual bases, which is the definition of the inner product. Two graded bases ${u_{λ}}, {v_{λ}}$ are dual ( $⟨ u_{λ}, v_{μ} ⟩ = δ_{λ μ}$ ) if and only if $\sum_{λ} u_{λ} (x) v_{λ} (y) = \prod_{i, j} (1 - x_{i} y_{j})^{- 1}$ . Write $s_{λ} = \sum_{μ} K_{λ μ} m_{μ}$ and the dual expansion $s_{λ} = \sum_{μ} K^{- 1} \dots$ ; substituting the Jacobi-Trudi/Kostka relations, $\sum_{λ} s_{λ} (x) s_{λ} (y) = \sum_{λ} (\sum_{μ} K_{λ μ} m_{μ} (x)) (\sum_{ν} K_{λ ν} m_{ν} (y))$ . The Kostka matrix $K$ is unitriangular, and the orthogonality $\sum_{λ} K_{λ μ} K_{λ ν}$ against the $h$ - $m$ duality collapses the double sum to $\sum_{μ} h_{μ} (x) m_{μ} (y)$ precisely when ${s_{λ}}$ is self-dual. Thus $\sum_{λ} s_{λ} (x) s_{λ} (y) = \prod_{i, j} (1 - x_{i} y_{j})^{- 1}$ is equivalent to $⟨ s_{λ}, s_{μ} ⟩ = δ_{λ μ}$ ; the Cauchy identity (proved by RSK in the co-produced 40.03.04) supplies the left equality, giving orthonormality. $□$

Connections Master

The Schur functions are built directly on the ring and bases of 40.03.01: the Kostka unitriangularity $s_{λ} = m_{λ} + \sum_{μ ⊲ λ} K_{λ μ} m_{μ}$ uses the dominance order and monomial basis defined there, the Jacobi-Trudi determinants live in $Λ = Z [h_{1}, h_{2}, \dots]$ , and the diagram conjugation that gives the involution $ω$ of 40.03.01 acts here as $ω (s_{λ}) = s_{λ^{'}}$ , interchanging the row Jacobi-Trudi formula with its dual column form. The single-row and single-column Schur functions recover the complete $h_{r}$ and elementary $e_{r}$ generators of that unit.
The Hall inner product making ${s_{λ}}$ orthonormal, and the Cauchy identity $\prod (1 - x_{i} y_{j})^{- 1} = \sum_{λ} s_{λ} (x) s_{λ} (y)$ that is equivalent to it, are the subject of the co-produced 40.03.03; the RSK correspondence that proves the Cauchy identity bijectively is the co-produced 40.03.04; and the Littlewood-Richardson coefficients $c_{λ μ}^{ν}$ generalising the Pieri rules proved here are the co-produced 40.03.05. The Frobenius characteristic of the co-produced 40.03.06 carries the Schur functions to the irreducible characters of the symmetric groups, turning every Schur identity into a character identity.
The bialternant $s_{λ} = a_{λ + δ} / a_{δ}$ is the Weyl character formula for $GL_{n}$ , so this unit is the combinatorial face of the representation theory in 07.05.02 (Young diagrams and the irreducibles) and 07.05.04 (Schur-Weyl duality, where $s_{λ}$ is simultaneously a $GL_{n}$ and an $S_{n}$ object); the hook-length formula $f^{λ} = ∣ λ ∣! / \prod h (i, j)$ of Theorem 4 is the dimension of the Specht module of 07.05.03, and the power-sum expansion of $s_{λ}$ is the Murnaghan-Nakayama rule of 07.05.10. The Jacobi-Trudi proof reuses the Lindström-Gessel-Viennot lemma of 40.01.04, the same nonintersecting-path determinant that computes transfer matrices and $P$ -partition generating functions there.

Historical & philosophical context Master

The bialternant quotient $a_{λ + δ} / a_{δ}$ that defines the Schur function appears first in Cauchy's 1815 memoir on alternating functions ^{[Cauchy 1815]}, where the ratio of an alternant by the Vandermonde is studied as the basic symmetric object built from a determinant; the determinant of alternants and the manipulation that yields what is now Jacobi-Trudi were developed by Jacobi in his 1841 paper De functionibus alternantibus in Crelle's Journal ^{[Jacobi 1841]}, from which the modern name of the identity descends. The functions acquired their combinatorial and representation-theoretic meaning in Issai Schur's 1901 Berlin dissertation, which identified $a_{λ + δ} / a_{δ}$ with the irreducible polynomial character of $GL_{n} (C)$ of highest weight $λ$ ; the name Schur function is due to the later literature.

The combinatorial definition by semistandard tableaux, the symmetry by what are now called the Bender-Knuth involutions, and the systematic Pieri and Littlewood-Richardson calculus belong to the tableau tradition running from Alfred Young's substitutional analysis (1900-1930s) through Dudley Littlewood and Archibald Richardson's 1934 paper stating the multiplication rule, to its rigorous proofs decades later by Schützenberger, Thomas, and others. The nonintersecting-lattice-path proof of the determinantal formulas is due to Lindström (1973) and, in the combinatorial form used here, to Gessel and Viennot (1985). The bialternant-to-tableau equivalence is the combinatorial shadow of the Weyl character formula proved by Hermann Weyl (1925-26) for compact Lie groups, of which the $GL_{n}$ case is the Schur polynomial.

Bibliography Master

@book{stanley1999ec2,
  author    = {Stanley, Richard P.},
  title     = {Enumerative Combinatorics, Volume 2},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {62},
  publisher = {Cambridge University Press},
  year      = {1999}
}

@book{macdonald1995symmetric,
  author    = {Macdonald, Ian G.},
  title     = {Symmetric Functions and Hall Polynomials},
  series    = {Oxford Mathematical Monographs},
  edition   = {2},
  publisher = {Oxford University Press},
  year      = {1995}
}

@book{fulton1997young,
  author    = {Fulton, William},
  title     = {Young Tableaux: With Applications to Representation Theory and Geometry},
  series    = {London Mathematical Society Student Texts},
  volume    = {35},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@article{jacobi1841functionibus,
  author  = {Jacobi, Carl Gustav Jacob},
  title   = {De functionibus alternantibus earumque divisione per productum e differentiis elementorum conflatum},
  journal = {Journal f\"{u}r die reine und angewandte Mathematik (Crelle)},
  volume  = {22},
  pages   = {360--371},
  year    = {1841}
}

@article{cauchy1815memoire,
  author  = {Cauchy, Augustin-Louis},
  title   = {M\'{e}moire sur les fonctions qui ne peuvent obtenir que deux valeurs \'{e}gales et de signes contraires par suite des transpositions op\'{e}r\'{e}es entre les variables qu'elles renferment},
  journal = {Journal de l'\'{E}cole Polytechnique},
  volume  = {10},
  pages   = {29--112},
  year    = {1815}
}

@phdthesis{schur1901,
  author  = {Schur, Issai},
  title   = {\"{U}ber eine Klasse von Matrizen, die sich einer gegebenen Matrix zuordnen lassen},
  school  = {Universit\"{a}t Berlin},
  year    = {1901}
}

@article{littlewoodrichardson1934,
  author  = {Littlewood, Dudley E. and Richardson, Archibald R.},
  title   = {Group characters and algebra},
  journal = {Philosophical Transactions of the Royal Society A},
  volume  = {233},
  pages   = {99--141},
  year    = {1934}
}

@article{gesselviennot1985,
  author  = {Gessel, Ira and Viennot, G\'{e}rard},
  title   = {Binomial determinants, paths, and hook length formulae},
  journal = {Advances in Mathematics},
  volume  = {58},
  number  = {3},
  pages   = {300--321},
  year    = {1985}
}

Prerequisites

40.03.01

Tier anchors

beginner: Stanley 1999 *Enumerative Combinatorics, Volume 2* (Cambridge) §7.10 opening (the Schur function as the weight generating function of semistandard tableaux, by example in two and three variables); Sagan 2001 *The Symmetric Group* 2e (Springer) §4.4 (semistandard tableaux and the combinatorial Schur function)
intermediate: Stanley 1999 *Enumerative Combinatorics, Volume 2* 2e (Cambridge) §7.10-7.16 (the SSYT definition, the bialternant/Weyl formula, the Jacobi-Trudi identities by Lindström-Gessel-Viennot, the Pieri rules, skew Schur functions, the dual Cauchy and orthonormality); Fulton 1997 *Young Tableaux* (Cambridge LMS Student Texts 35) Ch. 2, 4 (tableaux, the determinantal formulas, Pieri)
master: Macdonald 1995 *Symmetric Functions and Hall Polynomials* 2e (Oxford) Ch. I §3, §5 (the Schur functions as the bialternant, Jacobi-Trudi and the dual Nägelsbach-Kostka identity, the Pieri and Littlewood-Richardson rules, skew Schur functions, the Hall inner product and orthonormality); Stanley 1999 *Enumerative Combinatorics, Volume 2* Ch. 7 and Appendix 2; Sagan 2001 *The Symmetric Group* 2e Ch. 4; Cauchy 1815 (the bialternant antisymmetric quotient); Jacobi 1841 *Crelle* 22 (the determinant of alternants); Schur 1901 (the identification with $\mathrm{GL}_n$ characters)

References

Stanley, R. P. — Enumerative Combinatorics, Volume 2 · Cambridge Studies in Advanced Mathematics 62 (1999). Chapter 7 §7.10 defines the Schur function $s_\lambda = \sum_T x^{\mathrm{wt}(T)}$ as the sum over semistandard Young tableaux (column-strict, row-weakly-increasing fillings) of shape $\lambda$ with entries in $\{1,2,\dots\}$, weighted by the content monomial; the symmetry of $s_\lambda$ is proved via the Bender-Knuth involutions on tableaux (§7.10-7.11). §7.15 gives the bialternant (Weyl character) formula $s_\lambda(x_1,\dots,x_n) = a_{\lambda+\delta}/a_\delta$ with $a_\mu = \det(x_i^{\mu_j})$ and $\delta = (n-1,\dots,1,0)$, and the dual-basis/Kostka expansion $s_\lambda = \sum_\mu K_{\lambda\mu} m_\mu$. The Jacobi-Trudi identity $s_\lambda = \det(h_{\lambda_i - i + j})_{1\le i,j \le \ell}$ and its dual $s_\lambda = \det(e_{\lambda'_i - i + j})$ are §7.16, proved both algebraically and by the Lindström-Gessel-Viennot lemma on nonintersecting lattice paths. The Pieri rules $s_\lambda h_r = \sum_\mu s_\mu$ (sum over horizontal strips) and $s_\lambda e_r = \sum_\mu s_\mu$ (vertical strips), the orthonormality $\langle s_\lambda, s_\mu\rangle = \delta_{\lambda\mu}$ under the Hall inner product, skew Schur functions $s_{\lambda/\mu}$, the Cauchy identity, and the principal specialisation / hook-content formula are §7.12-7.21.
Macdonald, I. G. — Symmetric Functions and Hall Polynomials · Oxford Mathematical Monographs, 2nd edition (1995). Chapter I §3 defines the Schur function as the bialternant $s_\lambda = a_{\lambda+\delta}/a_\delta$, proves it is a symmetric polynomial, gives the combinatorial expansion $s_\lambda = \sum_T x^T$ over tableaux and the Kostka expansion $s_\lambda = \sum_\mu K_{\lambda\mu} m_\mu$ (unitriangular against dominance, so the $s_\lambda$ form a $\mathbb{Z}$-basis of $\Lambda$). The Jacobi-Trudi formulas (I.3 (3.4), (3.5)) express $s_\lambda$ as a determinant in the $h$ and (dually, via $\omega$) in the $e$ functions. §5 develops the Pieri rules, the Littlewood-Richardson rule, skew Schur functions $s_{\lambda/\mu} = \det(h_{\lambda_i - \mu_j - i + j})$, and the Hall inner product under which $\{s_\lambda\}$ is orthonormal, $\langle h_\lambda, m_\mu\rangle = \delta_{\lambda\mu}$, and $\omega(s_\lambda) = s_{\lambda'}$.
Fulton, W. — Young Tableaux: With Applications to Representation Theory and Geometry · Cambridge University Press, London Mathematical Society Student Texts 35 (1997). Develops the tableau combinatorics with the determinantal formulas in view: Chapter 2 on the bumping/insertion and the ring of symmetric functions, Chapter 4 on the Pieri and Giambelli (Jacobi-Trudi) formulas and their proof by nonintersecting paths, and the connection to the Schubert calculus on Grassmannians. The Schur polynomial $s_\lambda(x_1,\dots,x_n)$ is the character of the irreducible $\mathrm{GL}_n(\mathbb{C})$-representation of highest weight $\lambda$, and the hook-length / hook-content formulas count standard and semistandard tableaux.

Estimated time

beginner: 20m
intermediate: 50m
master: 90m