40.03.02 · combinatorics / symmetric-functions-rsk

Schur Functions: the Combinatorial Definition and the Jacobi-Trudi Determinant

shipped3 tiersLean: none

Anchor (Master): Macdonald 1995 *Symmetric Functions and Hall Polynomials* 2e (Oxford) Ch. I §3, §5 (the Schur functions as the bialternant, Jacobi-Trudi and the dual Nägelsbach-Kostka identity, the Pieri and Littlewood-Richardson rules, skew Schur functions, the Hall inner product and orthonormality); Stanley 1999 *Enumerative Combinatorics, Volume 2* Ch. 7 and Appendix 2; Sagan 2001 *The Symmetric Group* 2e Ch. 4; Cauchy 1815 (the bialternant antisymmetric quotient); Jacobi 1841 *Crelle* 22 (the determinant of alternants); Schur 1901 (the identification with $\mathrm{GL}_n$ characters)

Intuition Beginner

Picture a staircase of empty boxes — a few boxes in the top row, no more in the second row, fewer still further down. This staircase shape is called a Young diagram, and its row lengths spell out a partition of a number, met in the previous unit. Now fill each box with a positive whole number, following two house rules: numbers never decrease as you read a row left to right, and they strictly increase as you read a column top to bottom. A filling obeying both rules is a semistandard tableau. The Schur function is the bookkeeping device that records every legal filling at once.

Why build such a thing? Each filling is a recipe for a single product of variables: a box holding the number contributes one factor of the third variable . Multiply across all boxes and you get one term; add up the terms over every legal filling and you get the Schur function of that shape. It is a sum that remembers, for each pattern of variable-counts, how many fillings produce it. The shape is the input; the symmetric expression is the output.

The pay-off is that these expressions are the single most important family in the whole subject. They are symmetric — relabelling the variables leaves them unchanged — even though the filling rules look lopsided, treating rows and columns differently. That hidden symmetry is a small miracle, and proving it is the first real theorem here. Once you have it, the Schur functions become the natural alphabet: every symmetric expression spells out uniquely in them, and they are the ones that count fillings, measure dimensions, and translate into the language of group symmetry later in the curriculum.

Visual Beginner

A semistandard filling of the staircase shape with rows of length and . Reading rule one (rows): left-to-right values do not decrease. Reading rule two (columns): top-to-bottom values strictly increase. The variable-record of a filling multiplies one per box, indexed by the box's number.

shape (boxes) a legal filling the record it contributes
row of 2, row of 1 top row , bottom row
row of 2, row of 1 top row , bottom row
row of 2, row of 1 top row , bottom row

Notice the two rules pull in opposite directions: rows allow repeats, columns forbid them. The forbidding is what keeps the count finite in each shape and degree. If you collect every legal filling using only the variables and add their records, the messy-looking sum turns out to be symmetric: swapping the roles of and throughout permutes the fillings among themselves and leaves the total alone.

Worked example Beginner

Let us compute the Schur function of the shape with a single row of two boxes, using only two variables and , and watch the symmetry appear.

Step 1. List the legal fillings. One row of two boxes needs values that do not decrease left to right, with no column rule to worry about (each column has one box). With entries from the choices are: , then , then . That is three fillings.

Step 2. Record each filling. The filling gives . The filling gives . The filling gives .

Step 3. Add. The Schur function of this shape is .

Step 4. Test the symmetry. Swap the names and everywhere. The sum becomes , which is the same three terms in a different order. The total did not change, so this expression is symmetric.

Step 5. Recognise it. The expression is exactly the complete function in two variables from the previous unit. A single row of boxes always reproduces the complete function , and a single column of boxes reproduces the elementary function .

What this tells us: the Schur functions are not new strangers — the simplest shapes give back the building blocks already met. The single-row and single-column shapes are the complete and elementary functions; the interesting shapes are the staircases in between, and the rules that knit rows and columns together are what make the whole family symmetric.

Check your understanding Beginner

Formal definition Intermediate+

Work in the ring of symmetric functions over of 40.03.01; all notation (, the dominance order , the conjugate , , the involution ) is inherited from that unit and recorded in _meta/NOTATION.md. Fix a partition with Young diagram the array of cells with .

Definition (semistandard Young tableau). A semistandard Young tableau (SSYT) of shape is a filling of the cells of with entries from that is weakly increasing along each row () and strictly increasing down each column (). Its content is the vector with , and its weight monomial is . Write for the set of all such fillings (entries unbounded) and for those with entries in .

Definition (Schur function, combinatorial). The Schur function of shape is the formal sum $$ s_\lambda ;=; \sum_{T \in \mathrm{SSYT}(\lambda)} x^T ;\in; \mathbb{Z}[[x_1, x_2, \dots]], $$ homogeneous of degree . The single-row shape gives ; the single-column shape gives . The skew Schur function for is the same sum over SSYT of the skew shape (the cells of not in ), with the identical row/column conditions.

Definition (the bialternant). In variables, set and for a sequence define the alternant . The alternant is the Vandermonde determinant. The bialternant definition of the Schur polynomial is the ratio $$ s_\lambda(x_1, \dots, x_n) ;=; \frac{a_{\lambda + \delta}}{a_\delta} ;=; \frac{\det!\big(x_i^{\lambda_j + n - j}\big)}{\det!\big(x_i^{n - j}\big)}, $$ defined for partitions with at most parts. Each is alternating (a transposition of two variables changes its sign), hence divisible by , so the quotient is a genuine polynomial. The Key theorem proves it equals the SSYT sum restricted to entries .

Kostka expansion. Collecting SSYT by content gives , where the Kostka number counts SSYT of shape and content . One has (the unique filling of row entirely by ) and unless in dominance order, so the transition is unitriangular and is a -basis of [Macdonald §3].

Counterexamples to common slips Intermediate+

  • "The SSYT rules are symmetric in rows and columns." They are not: rows are weak (), columns are strict (). This asymmetry is exactly what makes a single row give (repetition allowed) and a single column give (repetition forbidden). The symmetry that does hold is of the output under permuting variables, not of the rules.

  • " is nonzero for every partition ." In variables, as soon as has more than parts: a column of height exceeding has no strictly increasing filling from , and correspondingly then has a repeated entry, killing the alternant.

  • "Jacobi-Trudi needs and the matrix to be square of size ." The determinant may be taken over any block with ; padding by trailing zeros adds rows/columns that are on the diagonal () and leave the determinant unchanged. The convention for is essential.

Key theorem with proof Intermediate+

The signature theorem is that the three definitions of coincide: the combinatorial SSYT sum is symmetric, equals the bialternant, and equals the Jacobi-Trudi determinant in the complete functions.

Theorem (symmetry, bialternant, and Jacobi-Trudi). For every partition with , $$ \sum_{T \in \mathrm{SSYT}(\lambda, n)} x^T ;=; \frac{a_{\lambda+\delta}}{a_\delta} ;=; \det!\big(h_{\lambda_i - i + j}\big){1 \le i, j \le n}, $$ *with the convention , for . In particular $s\lambdan \to \inftys_\lambda \in \Lambdas_\lambda = \det(e_{\lambda'_i - i + j})$.* [Stanley §7.16]

Proof. Three identifications.

(i) Symmetry of the SSYT sum. It suffices to show is invariant under each adjacent transposition of variables, i.e. that swapping the exponents of and permutes SSYT among themselves. The Bender-Knuth involution does this. In an SSYT , restrict attention to entries equal to or . In each column at most one such entry occurs that is "free" — a with no directly below in the same column-pair, or vice versa — while -over- vertical pairs are "fixed". In each row the free s and free s form a block ; replace it by . This swap preserves the row-weak and column-strict conditions (the fixed pairs guard the columns) and exchanges the multiplicities of and , so is an involution on with . Hence fixes the sum, and as the generate , is symmetric.

(ii) Bialternant equals the SSYT sum. Both sides are symmetric polynomials of degree ; compare them after multiplying by . Expand and . A Lindström-type sign-reversing involution on pairs (lattice point configurations / column-strict fillings carrying a sign) cancels all cross terms except those assembling a column-strict tableau: writing amounts to the Gessel-Viennot model below read in the variables . Concretely, the quotient is the Weyl character of at highest weight , and the Weyl character is the trace over a basis indexed by SSYT (Gelfand-Tsetlin patterns), giving the same monomial sum.

(iii) Jacobi-Trudi by nonintersecting paths. Use the Lindström-Gessel-Viennot lemma of 40.01.04: in an acyclic weighted digraph, the signed sum over -tuples of paths between sources and sinks equals with the path-generating function , and only the nonintersecting tuples survive. Take the planar lattice with east and north steps, sources and sinks , weighting each east step at height by . The generating function is then (a single row of a tableau), so the LGV determinant is . A nonintersecting tuple of such paths is precisely an SSYT of shape (the -th path traces the cells holding entries , the heights of its east steps reading off the columns), so the determinant equals . Applying (which sends and, as shown in Advanced results, ) to the row form yields the dual column form .

Bridge. This theorem is the foundational reason the Schur functions are inevitable: it shows that the combinatorial, the determinantal, and the representation-theoretic descriptions are one object, so any property provable in one language transfers to the others. It builds toward the Pieri and Littlewood-Richardson rules of Advanced results, where the SSYT picture controls the structure constants, and it appears again in the Hall inner product of 40.03.03, under which these same are orthonormal. The bialternant side is exactly the Weyl character formula, which is dual to the Jacobi-Trudi side through the Vandermonde ; this is the central insight that the antisymmetrisation in and the sign-reversing involution of the LGV lemma are the same cancellation seen twice. Putting these together, the SSYT definition supplies positivity (a sum of monomials), the bialternant supplies symmetry and the basis property, and Jacobi-Trudi supplies the algebra inside — three faces of one function.

Exercises Intermediate+

Advanced results Master

The Schur basis carries the multiplicative and inner-product structure of , and the determinantal formulas extend to skew shapes, specialisations, and the Cauchy kernel.

Theorem 1 (Pieri, dual Pieri, and Littlewood-Richardson). The complete and elementary functions act on the Schur basis by adding strips: $$ s_\lambda, h_r = \sum_{\mu} s_\mu \ \ (\mu/\lambda \text{ a horizontal } r\text{-strip}), \qquad s_\lambda, e_r = \sum_{\mu} s_\mu \ \ (\mu/\lambda \text{ a vertical } r\text{-strip}). $$ These are the and cases of the general structure constants , where is the Littlewood-Richardson coefficient counting LR skew tableaux of shape and content ; the are non-negative integers, developed in the co-produced 40.03.05. The two Pieri rules are conjugate to each other under , since and [Macdonald §5].

Theorem 2 (skew Jacobi-Trudi and Giambelli). For the skew Schur function satisfies the determinantal formulas $$ s_{\lambda/\mu} = \det!\big(h_{\lambda_i - \mu_j - i + j}\big){1 \le i,j \le \ell(\lambda)} = \det!\big(e{\lambda'i - \mu'j - i + j}\big), $$ again by the LGV lemma with sources displaced by . The expansion $s{\lambda/\mu} = \sum\nu c^\lambda_{\mu\nu} s_\nus_\lambda\lambda = (a_1, \dots, a_d \mid b_1, \dots, b_d)s_\lambda = \det\big(s_{(a_i \mid b_j)}\big)$, a determinant of single-hook Schur functions — the Plücker/Schubert-calculus form.

Theorem 3 (Cauchy identity and orthonormality). The Schur functions diagonalise the Cauchy kernel: $$ \prod_{i,j} \frac{1}{1 - x_i y_j} = \sum_{\lambda} s_\lambda(x), s_\lambda(y), \qquad \prod_{i,j}(1 + x_i y_j) = \sum_\lambda s_\lambda(x), s_{\lambda'}(y). $$ Under the Hall inner product of the co-produced 40.03.03 — defined by , equivalently — the Cauchy identity is equivalent to : the Schur functions are an orthonormal -basis, the unique (up to sign) basis that is both orthonormal and unitriangular against . The involution is then an isometry with .

Theorem 4 (principal specialisation, hook-content and hook-length). Setting for and the rest gives the principal specialisation $$ s_\lambda(1, q, q^2, \dots, q^{n-1}) = q^{,n(\lambda)} \prod_{(i,j) \in \lambda} \frac{1 - q^{,n + c(i,j)}}{1 - q^{,h(i,j)}}, $$ the hook-content formula, where is the content of a cell, its hook length, and . Letting recovers , the number of SSYT of shape with entries , i.e. the dimension of the irreducible -representation; and the standard-tableau count is the hook-length formula for the symmetric-group irreducible, cross-referenced to 07.05.02. The coefficient of the lowest power, , is the charge/cocharge statistic that recurs in the Kostka-Foulkes refinement.

Synthesis. Putting these together, the Schur basis is the fixed point at which every structure on becomes diagonal. The combinatorial definition makes each manifestly Schur-positive and the Kostka unitriangularity makes them a -basis; the Cauchy identity of Theorem 3 is exactly the statement that this basis is orthonormal for the Hall inner product of 40.03.03, so the foundational reason the dominate the subject is that they are the self-dual basis, and — conjugation of diagrams in 40.03.01 — is precisely the isometry . The central insight is that the antisymmetrisation of the bialternant, the sign-reversing cancellation of the LGV proof of Jacobi-Trudi, and the orthogonality of characters are one phenomenon: the Weyl character formula is dual to the Jacobi-Trudi determinant through the Vandermonde, and this is exactly why the Pieri rules of Theorem 1 (multiplication by and ) are conjugate under and why the Littlewood-Richardson coefficients are symmetric. The bridge is the Frobenius characteristic of the co-produced 40.03.06: it carries the irreducible -characters to the , so the hook-length formula of Theorem 4 — counting standard tableaux — generalises into the principal specialisation, and the entire determinantal calculus developed here reappears as the character theory of 07.05.02 and the Murnaghan-Nakayama rule of 07.05.10. The same diagram conjugation that defines in 40.03.01 is the duality threading Pieri, Cauchy, and the hook-content formula.

Full proof set Master

Proposition 1 (the SSYT sum is symmetric). For every partition , is a symmetric function.

Proof. Fix ; we produce an involution on exchanging the multiplicities of and while fixing all others, which gives invariance under the adjacent transposition and hence (as the generate the symmetric group) full symmetry. In , call a cell holding or paired if it lies directly above a (for a ) or directly below a (for a ) in the same column; paired cells form vertical dominoes and are left untouched. The remaining free s and s lie in horizontal runs within rows: in a given row the free entries equal to or form a contiguous block (any intervening other value would separate them into independent runs, handled identically). Replace each such run by . Row-weak monotonicity is preserved because the run was the only place were adjacent and free; column-strictness is preserved because the paired dominoes that could conflict were excluded from the free set. The map is an involution ( twice returns the original) and replaces by , so . Summing, .

Proposition 2 (Jacobi-Trudi via Lindström-Gessel-Viennot). With , for , and , .

Proof. Consider the directed lattice on with unit east and north steps; weight an east step from to by , so a path's weight is over its east steps. Place sources and sinks for (padding with zeros to length ). The generating function of all paths is the complete homogeneous function , since a monotone path making east steps across heights records a weakly increasing length- word, whose weight sum is . The Lindström-Gessel-Viennot lemma 40.01.04 gives $$ \sum_{w \in S_N} \mathrm{sgn}(w) \prod_{i} (\text{paths } A_i \to B_{w(i)}) = \det\big(h_{\lambda_j - j + i}\big) = \sum_{(P_1,\dots,P_N) \text{ nonintersecting}} \mathrm{sgn}(\sigma), x^{P_1}\cdots x^{P_N}, $$ and for this source/sink placement only the identity permutation admits nonintersecting tuples, all with sign . A nonintersecting tuple is in content-preserving bijection with an SSYT of shape : path contributes the east-step heights as the entries of the cells in the -th row read against the columns, the nonintersection condition translating exactly into the column-strict, row-weak conditions. Therefore , and transposing the matrix (a determinant is invariant under transpose) gives the stated .

Proposition 3 (bialternant equals Jacobi-Trudi). In variables, .

Proof. Use the generating-function identity , where here . In matrix form, the complete functions are the entries of the inverse of the Vandermonde-style matrix built from the 's; concretely, the Jacobi-Trudi determinant arises by Cramer's rule. Form the matrices (so ) and (so ). Expanding shows that the matrix (lower-triangular-unipotent in the appropriate truncation) satisfies in each graded piece, where is the matrix with shifted by the staircase; taking determinants, . Equivalently, both sides are symmetric, agree in the stable limit by Proposition 2, and the finite-variable identity follows by setting , under which specialises correctly and a column of exceeding kills both sides. Hence the ratio equals the determinant.

Proposition 4 (Pieri rule). .

Proof. Pair the Jacobi-Trudi expression with the Hall inner product of 40.03.03, under which is orthonormal, so the coefficient of in is . The adjoint of multiplication by acts as the skewing operator ; by the dual-basis property is the multiplicity of in , which equals if is a horizontal -strip and otherwise. Combinatorially: over SSYT of shape and single rows of length ; the RSK-free insertion of 's weakly increasing row onto produces exactly the SSYT of skew shapes that are horizontal strips, each once, since two inserted cells in one column would force a strict-column violation. Grouping by gives over horizontal -strips.

Proposition 5 (orthonormality from Cauchy). Under , one has .

Proof. The Cauchy kernel factors two ways: , exhibiting and as dual bases, which is the definition of the inner product. Two graded bases are dual () if and only if . Write and the dual expansion ; substituting the Jacobi-Trudi/Kostka relations, . The Kostka matrix is unitriangular, and the orthogonality against the - duality collapses the double sum to precisely when is self-dual. Thus is equivalent to ; the Cauchy identity (proved by RSK in the co-produced 40.03.04) supplies the left equality, giving orthonormality.

Connections Master

  • The Schur functions are built directly on the ring and bases of 40.03.01: the Kostka unitriangularity uses the dominance order and monomial basis defined there, the Jacobi-Trudi determinants live in , and the diagram conjugation that gives the involution of 40.03.01 acts here as , interchanging the row Jacobi-Trudi formula with its dual column form. The single-row and single-column Schur functions recover the complete and elementary generators of that unit.

  • The Hall inner product making orthonormal, and the Cauchy identity that is equivalent to it, are the subject of the co-produced 40.03.03; the RSK correspondence that proves the Cauchy identity bijectively is the co-produced 40.03.04; and the Littlewood-Richardson coefficients generalising the Pieri rules proved here are the co-produced 40.03.05. The Frobenius characteristic of the co-produced 40.03.06 carries the Schur functions to the irreducible characters of the symmetric groups, turning every Schur identity into a character identity.

  • The bialternant is the Weyl character formula for , so this unit is the combinatorial face of the representation theory in 07.05.02 (Young diagrams and the irreducibles) and 07.05.04 (Schur-Weyl duality, where is simultaneously a and an object); the hook-length formula of Theorem 4 is the dimension of the Specht module of 07.05.03, and the power-sum expansion of is the Murnaghan-Nakayama rule of 07.05.10. The Jacobi-Trudi proof reuses the Lindström-Gessel-Viennot lemma of 40.01.04, the same nonintersecting-path determinant that computes transfer matrices and -partition generating functions there.

Historical & philosophical context Master

The bialternant quotient that defines the Schur function appears first in Cauchy's 1815 memoir on alternating functions [Cauchy 1815], where the ratio of an alternant by the Vandermonde is studied as the basic symmetric object built from a determinant; the determinant of alternants and the manipulation that yields what is now Jacobi-Trudi were developed by Jacobi in his 1841 paper De functionibus alternantibus in Crelle's Journal [Jacobi 1841], from which the modern name of the identity descends. The functions acquired their combinatorial and representation-theoretic meaning in Issai Schur's 1901 Berlin dissertation, which identified with the irreducible polynomial character of of highest weight ; the name Schur function is due to the later literature.

The combinatorial definition by semistandard tableaux, the symmetry by what are now called the Bender-Knuth involutions, and the systematic Pieri and Littlewood-Richardson calculus belong to the tableau tradition running from Alfred Young's substitutional analysis (1900-1930s) through Dudley Littlewood and Archibald Richardson's 1934 paper stating the multiplication rule, to its rigorous proofs decades later by Schützenberger, Thomas, and others. The nonintersecting-lattice-path proof of the determinantal formulas is due to Lindström (1973) and, in the combinatorial form used here, to Gessel and Viennot (1985). The bialternant-to-tableau equivalence is the combinatorial shadow of the Weyl character formula proved by Hermann Weyl (1925-26) for compact Lie groups, of which the case is the Schur polynomial.

Bibliography Master

@book{stanley1999ec2,
  author    = {Stanley, Richard P.},
  title     = {Enumerative Combinatorics, Volume 2},
  series    = {Cambridge Studies in Advanced Mathematics},
  volume    = {62},
  publisher = {Cambridge University Press},
  year      = {1999}
}

@book{macdonald1995symmetric,
  author    = {Macdonald, Ian G.},
  title     = {Symmetric Functions and Hall Polynomials},
  series    = {Oxford Mathematical Monographs},
  edition   = {2},
  publisher = {Oxford University Press},
  year      = {1995}
}

@book{fulton1997young,
  author    = {Fulton, William},
  title     = {Young Tableaux: With Applications to Representation Theory and Geometry},
  series    = {London Mathematical Society Student Texts},
  volume    = {35},
  publisher = {Cambridge University Press},
  year      = {1997}
}

@article{jacobi1841functionibus,
  author  = {Jacobi, Carl Gustav Jacob},
  title   = {De functionibus alternantibus earumque divisione per productum e differentiis elementorum conflatum},
  journal = {Journal f\"{u}r die reine und angewandte Mathematik (Crelle)},
  volume  = {22},
  pages   = {360--371},
  year    = {1841}
}

@article{cauchy1815memoire,
  author  = {Cauchy, Augustin-Louis},
  title   = {M\'{e}moire sur les fonctions qui ne peuvent obtenir que deux valeurs \'{e}gales et de signes contraires par suite des transpositions op\'{e}r\'{e}es entre les variables qu'elles renferment},
  journal = {Journal de l'\'{E}cole Polytechnique},
  volume  = {10},
  pages   = {29--112},
  year    = {1815}
}

@phdthesis{schur1901,
  author  = {Schur, Issai},
  title   = {\"{U}ber eine Klasse von Matrizen, die sich einer gegebenen Matrix zuordnen lassen},
  school  = {Universit\"{a}t Berlin},
  year    = {1901}
}

@article{littlewoodrichardson1934,
  author  = {Littlewood, Dudley E. and Richardson, Archibald R.},
  title   = {Group characters and algebra},
  journal = {Philosophical Transactions of the Royal Society A},
  volume  = {233},
  pages   = {99--141},
  year    = {1934}
}

@article{gesselviennot1985,
  author  = {Gessel, Ira and Viennot, G\'{e}rard},
  title   = {Binomial determinants, paths, and hook length formulae},
  journal = {Advances in Mathematics},
  volume  = {58},
  number  = {3},
  pages   = {300--321},
  year    = {1985}
}