The Yang–Baxter equation and the star–triangle relation
Anchor (Master): Baxter Ch. 9 §9.6–9.8; Yang 1967; Baxter 1972 Ann. Phys. 70:193
Intuition Beginner
Imagine a factory line that builds a long lattice one row at a time. A machine reads the current row of states and stamps out the next row, with each local pattern carrying a weight. Doing this row after row builds the whole lattice, and the total weight of every finished lattice is the quantity physicists most want to compute. The stamping machine is the transfer matrix: feed it a row, it returns the next.
Now suppose you own two slightly different stamping machines, tuned to two different settings. Usually the order in which you apply machines matters, the way putting on socks then shoes differs from shoes then socks. The deep question is: when do these two machines give the same result no matter which one you run first? When they do, the machines commute, and the lattice becomes special. It can be solved exactly.
The condition that makes two such machines commute is a single local rule about how three weights fit together at a crossing. That rule is the star–triangle relation, and its modern face is the Yang–Baxter equation.
Visual Beginner
The figure shows the heart of the rule. On the left, three lines cross so that one line passes on the left of a meeting point of the other two; on the right, the same three lines cross so that line passes on the right. The star–triangle relation says these two pictures carry the same total weight when you sum over the internal states.
Reading it as machines: sliding one line past a crossing of the other two does not change the answer. That single sliding move, repeated, is what lets you reorder a whole stack of stamping machines, which is why the lattice becomes exactly solvable.
Worked example Beginner
Take the simplest version: three lines, each carrying an arrow that can point one of two ways, meeting at a point. A weight is attached to each small crossing depending on the two arrows that meet there. Call the three pairwise crossing-weights , , .
In the left picture the lines cross in the order: first the lower two, then the outer line with each. In the right picture they cross in the opposite order. To compare them, fix the three arrows coming in from outside and the three going out, then add up the weight over every choice of the arrows on the internal segments.
Suppose every weight is just when the two arrows agree and when they disagree. Then in both pictures the only surviving term forces all three lines to carry the same arrow straight through, and each picture sums to exactly . The two sides match. With richer weights the matching becomes a real constraint on , , — and that constraint is the star–triangle relation.
What this tells us: the relation is a balance condition on local crossing-weights. When it holds, reordering crossings is free, and a global reordering of transfer matrices follows.
Check your understanding Beginner
Formal definition Intermediate+
Fix a finite-dimensional vector space , the local state space sitting on each edge of the lattice. The basic object is the -matrix, a linear operator depending on a complex spectral parameter (equivalently a rapidity). For each ordered pair of factors in a tensor product , write for the operator that acts as on factors and and as the identity elsewhere.
The Yang–Baxter equation is the identity in
required to hold for all values of the spectral parameters , . The additive form of the argument, on the middle factor, is the convention for difference models, where depends only on differences of rapidities; this convention is fixed once here and carried through the unit.
In the lattice setting the same data appears as a vertex weight. Assign to each edge of the square lattice a state in , and to each vertex the Boltzmann weight depending on the four adjoining edge states (left, top, right, bottom). The components of the -matrix are exactly these vertex weights,
with the spectral parameter setting the anisotropy of the weights. The star–triangle relation is the component form of the Yang–Baxter equation: for three rapidities it equates the weighted sum over the internal edge of a "star" (three weights meeting at a point) with the weighted sum over a "triangle" (the same three lines re-routed). Onsager's name star–triangle records its origin in the – transformation of the triangular Ising lattice; Baxter's reading is that the same equation, parametrised by a spectral variable, controls every solvable vertex model [Baxter Ch. 9].
The row-to-row transfer matrix built from these weights is
an operator on the physical space obtained by multiplying -matrices along a row and tracing out the auxiliary space that threads the row (the monodromy matrix is the product before the trace). This is the construction of 08.03.02 cast in -matrix language. The spectral parameter labels a one-parameter family of transfer matrices on the same physical space.
Key theorem with proof Intermediate+
Theorem (Yang–Baxter implies commuting transfer matrices). Let satisfy the Yang–Baxter equation and let be the associated row-to-row transfer matrix on . If is invertible for generic , then
Proof. Introduce two auxiliary spaces, carrying and carrying , alongside the physical spaces. The product is the partial trace over both auxiliary spaces of the operator
the last equality holding because operators on disjoint physical factors commute, so the two products interleave site by site. Write and for the two monodromy matrices, so .
The Yang–Baxter equation, with the two auxiliary spaces playing the roles of factors and and a physical site playing factor , gives the intertwining (RTT) relation
where acts on the two auxiliary spaces only. This is proved by induction on : for it is the Yang–Baxter equation itself, written as ; the inductive step pushes through one more site using the same three-factor identity and leaves the already-treated sites untouched, since they act on different physical spaces.
Because is invertible for generic , multiply the intertwining relation on the left by :
Take the trace over both auxiliary spaces. The trace is cyclic on the auxiliary factors, so the conjugating drops out:
By continuity the identity extends from generic to all values, including coincident spectral parameters. Hence the family commutes.
Bridge. This theorem builds toward the eigenvalue analysis of every solvable model in this chapter: a commuting family is simultaneously diagonalisable, so a single -independent eigenvector basis diagonalises for all at once, and expanding in produces an infinite tower of mutually commuting conserved charges — the algebraic meaning of integrability that appears again in 05.02.03. The same factorised-scattering content reappears in 08.13.02, where the two-body phase that fixes the Bethe-ansatz amplitudes is exactly a matrix element of and the Bethe equations are the consistency of carrying one rapidity around the ring. Putting these together, the star–triangle relation is not one technique among many but the structural origin of exact solvability, and the spectral parameter is the device that turns a single algebraic identity into a continuous commuting family. The same intertwining relation organises the eight-vertex model, whose elliptic -matrix is the apex solution of the very equation proved here.
Exercises Intermediate+
Advanced results Master
The intertwining relation proved above carries more information than commutation alone. Writing the monodromy matrix in the auxiliary space as a matrix of operators on the physical space,
the single relation encodes sixteen scalar commutation relations among at shifted spectral parameters. Among them is , recovering the theorem, and the relations together with . These are the engine of the algebraic Bethe ansatz: creates excitations over a reference state, and the structure constants — themselves ratios of -matrix weights — reproduce the Bethe equations of 08.13.02 without ever solving a difference equation in position space. The factorised scattering of Yang's many-body problem [Yang 1967] is the statement that the -matrix of particles is an ordered product of two-body -matrices, consistent precisely because those two-body factors obey the Yang–Baxter equation; McGuire's earlier three-body analysis [McGuire 1964] is the special case that first exposed the consistency condition.
Two structural specialisations sit on either side of the spectral-parameter axis. Setting the spectral parameter to a fixed value (or taking a rational, trigonometric, or elliptic limit in which becomes -independent) reduces the Yang–Baxter equation to the braid relation. Conjugating by the permutation, , the unbraided equation becomes , which is precisely the defining relation of the Artin braid group acting on . Each solution of the constant Yang–Baxter equation therefore yields a representation of every braid group, and through the Markov trace a link invariant; the trigonometric six-vertex -matrix produces the Jones polynomial. On the other side, the spectral-parameter-dependent solutions are organised by quantum groups: the -matrix is the image, in a tensor product of representations, of the universal -matrix of a quasitriangular Hopf algebra , and the Yang–Baxter equation is the image of the abstract quantum-Yang–Baxter identity satisfied by that universal element [Baxter Ch. 9].
The classification of solutions is graded by the analytic dependence on . Rational -matrices (Yangian symmetry) govern the isotropic Heisenberg chain and Yang's delta-interaction gas; trigonometric -matrices (quantum affine ) govern the six-vertex model and the anisotropic XXZ chain; elliptic -matrices (Sklyanin algebra) govern the eight-vertex model and the XYZ chain. The elliptic case is the most general and the apex: Baxter's solution of the eight-vertex model is the parametrisation of by Jacobi theta functions of the spectral parameter such that the star–triangle relation becomes a theta-function addition theorem, linking this machinery laterally to 06.06.05.
A notation crosswalk fixes the conventions inherited from Baxter. The dimensionless coupling is (Baxter's convention), in contrast to the used in some shipped Ising units; the spectral parameter is written here (Baxter and the integrable-systems literature also use or, multiplicatively, ); the elliptic modulus and nome are and respectively, to be matched against the theta-function conventions of 06.06.05; and the eight-vertex weights refer to the four arrow-conserving-mod-2 vertex classes. These pins hold across 08.13.02 and the eight-vertex unit.
Synthesis. Putting these together, the Yang–Baxter equation identifies three subjects that look unrelated from a distance: lattice statistical mechanics, where it is the star–triangle relation that makes transfer matrices commute and the free energy computable; low-dimensional topology, where its constant limit is the braid relation and its Markov trace is a knot invariant; and the representation theory of quantum groups, where it is the image of a universal -matrix. The spectral parameter is the thread tying the first to the third, and the commuting family it generates is the structural origin of integrability that reappears in 05.02.03 as commuting Hamiltonians in involution. The same factorised-scattering content appears again in 08.13.02, whose two-body Bethe phase is a matrix element of , and the elliptic solution of the equation is the apex eight-vertex model whose non-universal critical exponents sharpen the universality picture of 08.05.01. The central identification is that solvability is a property of a single local equation, not of any individual model: a model is exactly solved when its Boltzmann weights solve the star–triangle relation.
Full proof set Master
Proposition (the braid limit). Let be a difference-form solution of the Yang–Baxter equation that admits a limit at some value (rational, trigonometric, or elliptic degeneration), and set . Then satisfies the braid relation , so copies generate a representation of the Artin braid group on .
Proof. In the difference form the three arguments collapse to a common value in the degeneration limit, so the spectral-parameter dependence drops and the Yang–Baxter equation becomes the constant identity . Multiply through by permutation operators using the intertwining identities and , which hold because conjugation by a permutation relabels the factors an -matrix acts on. Inserting and pushing every permutation to the right, the non-adjacent operators are converted to adjacent ones, and the constant Yang–Baxter equation rearranges exactly into . Because the for commute (disjoint factors) and adjacent ones satisfy the braid relation, the assignment extends to a homomorphism from .
Proposition (uniqueness of the conserved tower). Let be the commuting family of the theorem with analytic and invertible near , and suppose is any operator commuting with every . Then is a function of the conserved charges .
Proof. Since the commute and are simultaneously diagonalisable, decompose the physical space into joint eigenspaces; on each, acts by a scalar eigenvalue function , analytic in . The charges act by , and an operator commuting with all preserves each joint eigenspace, hence acts by a scalar there. Two joint eigenspaces carry distinct eigenvalue functions (else they merge), so the family of Taylor coefficients separates the eigenspaces; therefore the scalar by which acts is determined by those coefficients, i.e. for some function .
The eigenvalue functions themselves are constrained by a functional equation — Baxter's relation — whose derivation belongs to the eight-vertex unit; here it suffices that analyticity plus the commuting-family structure already forces the spectrum into one analytic family per joint eigenspace.
Connections Master
The transfer matrix, the trace formula , and the largest-eigenvalue free energy are imported from 08.03.02; this unit supplies the structural reason a family of such matrices can be diagonalised simultaneously, which that unit could not provide.
Integrability as the existence of commuting conserved quantities is the classical-mechanics statement of 05.02.03; the present theorem is its lattice incarnation, with the spectral-parameter expansion of producing the tower of commuting charges in place of Poisson-commuting integrals of motion. The translation between commuting Hamiltonians in involution and commuting transfer matrices is the bridge between the two chapters.
The six-vertex model and coordinate Bethe ansatz of 08.13.02 are the first application: the two-body scattering phase that fixes its plane-wave amplitudes is a matrix element of , the Bethe equations are the consistency of the intertwining relation around the ring, and the algebraic Bethe ansatz reconstructs the same eigenvectors from the operators of the monodromy matrix. The eight-vertex model extends the construction to an elliptic -matrix, parametrised by the theta functions of 06.06.05, and its non-universal critical exponents are the integrable counterexample that refines the universality discussion of 08.05.01.
The constant limit links to low-dimensional topology: the braid-group representations built from yield, via the Markov trace, polynomial invariants of knots and links, with the six-vertex -matrix producing the Jones polynomial. The spectral-parameter-dependent theory links forward to quantum groups, where is the image of the universal -matrix of .
Historical & philosophical context Master
The equation entered physics from two directions. On the lattice side, the star–triangle relation is the spectral-parametrised descendant of Onsager's – transformation, used in his 1944 solution of the two-dimensional Ising model and in the duality analysis of triangular and honeycomb lattices. Rodney Baxter recognised in 1971–1972 that the condition for two transfer matrices to commute is exactly a star–triangle relation on their Boltzmann weights, and that commutation supplies the infinite family of conserved quantities behind exact solvability; this is the organising insight of his solution of the eight-vertex model and of his 1982 monograph, which presents each solvable model through the algebra that diagonalises it [Baxter 1972].
On the quantum many-body side, J. B. McGuire showed in 1964 that the -body problem with delta-function interactions in one dimension has factorised, reflectionless scattering, with the three-body -matrix consistently decomposing into two-body factors [McGuire 1964]. Chen Ning Yang sharpened this in 1967 into the consistency condition on the two-body scattering matrix that now bears his name jointly with Baxter's, solving the repulsive delta-interaction gas by what became the nested Bethe ansatz [Yang 1967]. The two strands — Baxter's commuting transfer matrices and Yang's factorised scattering — were recognised as the same equation in the 1970s. Ludwig Faddeev, Evgeny Sklyanin and Leon Takhtajan abstracted it after 1979 into the quantum inverse scattering method, and Vladimir Drinfeld and Michio Jimbo found in 1985 that its spectral-parameter solutions are governed by quantum groups, the quasitriangular Hopf algebras whose universal -matrix solves the equation at the abstract level.
Bibliography Master
@article{McGuire1964,
author = {McGuire, J. B.},
title = {Study of Exactly Soluble One-Dimensional N-Body Problems},
journal = {Journal of Mathematical Physics},
volume = {5},
year = {1964},
pages = {622--636}
}
@article{Yang1967,
author = {Yang, Chen Ning},
title = {Some Exact Results for the Many-Body Problem in One Dimension with Repulsive Delta-Function Interaction},
journal = {Physical Review Letters},
volume = {19},
year = {1967},
pages = {1312--1315}
}
@article{Baxter1971eightvertex,
author = {Baxter, Rodney J.},
title = {Eight-Vertex Model in Lattice Statistics},
journal = {Physical Review Letters},
volume = {26},
year = {1971},
pages = {832--833}
}
@article{Baxter1972,
author = {Baxter, Rodney J.},
title = {Partition Function of the Eight-Vertex Lattice Model},
journal = {Annals of Physics},
volume = {70},
year = {1972},
pages = {193--228}
}
@book{Baxter1982,
author = {Baxter, Rodney J.},
title = {Exactly Solved Models in Statistical Mechanics},
publisher = {Academic Press},
address = {London},
year = {1982}
}
@article{FaddeevSklyaninTakhtajan1979,
author = {Faddeev, L. D. and Sklyanin, E. K. and Takhtajan, L. A.},
title = {The Quantum Inverse Problem Method. I},
journal = {Theoretical and Mathematical Physics},
volume = {40},
year = {1979},
pages = {688--706}
}
@article{Drinfeld1985,
author = {Drinfeld, V. G.},
title = {Hopf Algebras and the Quantum Yang--Baxter Equation},
journal = {Soviet Mathematics Doklady},
volume = {32},
year = {1985},
pages = {254--258}
}
@article{Jimbo1985,
author = {Jimbo, Michio},
title = {A $q$-Difference Analogue of $U(\mathfrak{g})$ and the Yang--Baxter Equation},
journal = {Letters in Mathematical Physics},
volume = {10},
year = {1985},
pages = {63--69}
}