Generalised symmetries (Lie-Bäcklund) and recursion operators
Anchor (Master): Olver §5.1-§5.2-§5.3; Olver 1977 (J. Math. Phys. 18); Fokas 1987 (Stud. Appl. Math. 77); Mikhailov-Shabat-Sokolov in *What is Integrability?* (1991)
Intuition Beginner
A symmetry of an equation is a way of nudging its solutions that turns each one into another solution. The earlier symmetries you met moved only the plain variables: shift the input, scale the output, rotate the picture. The nudge at a point depended only on where that point was. But there is no reason the nudge has to be so modest. You can let the nudge at a point depend also on the slope of the solution there, on its curvature, on how the curvature is changing, and so on up the ladder of derivatives.
When the nudge is allowed to read off the slope and higher bending of the curve, it is called a generalised symmetry, or a Lie-Bäcklund symmetry after the man who first studied transformations of this richer kind. These symmetries are invisible to the older point-by-point bookkeeping, yet they are often the more important ones. For the equations that physicists call integrable, the point symmetries are few, but the generalised symmetries form an endless tower, one for every rung of an infinite ladder.
The tool that climbs the ladder is a recursion operator. It is a fixed recipe that takes one symmetry as input and returns a new, higher symmetry as output. Feed it the simplest symmetry, apply it again and again, and out comes the whole tower for free. The recipe is what makes an integrable equation special.
Visual Beginner
Picture a solution drawn as a curve. At one point, attach not just an arrow saying which way to push, but a whole reading of the curve there: its height, its slope, its curvature. A generalised symmetry is a push whose direction is computed from that full reading. Two points with the same height but different slopes get pushed differently.
Beside the curve, draw a ladder. Each rung is one symmetry of the equation. The bottom rung is the plainest one. A single curved arrow, the recursion operator, hooks onto any rung and lifts you to the next rung up. Because the same arrow works at every level, one operator builds an infinite ladder. The picture captures the whole idea: symmetries may read the curve as deeply as they like, and one recipe manufactures all of them.
Worked example Beginner
Take the heat equation, the law that says the rate of change in time of a temperature equals how its profile bends in space. Write the temperature as , its rate of change in time as , and the bending in space as . The law is equals .
A plain symmetry is "shift in space": move the whole profile sideways and it still solves the law. The nudge here reads only position. Now try a deeper nudge: push the temperature at each point by the amount , the third derivative in space, the rate at which the bending changes.
Why is this a symmetry? Differentiate the law equals three times in space. The left side becomes the time rate of , and the right side becomes the space bending of . So obeys the very same heat law that does. Pushing every solution by its own therefore lands you on another solution.
What this tells us: the heat equation has a symmetry for every spatial derivative, , , , and onward, an infinite family. None of these except the first is a plain point symmetry, because each reads the curve through high derivatives. This endless family is exactly what generalised symmetries capture and what a recursion operator is built to generate.
Check your understanding Beginner
Formal definition Intermediate+
Work in the algebra of differential functions: smooth functions of the independent variables , the dependent variables , and finitely many derivative coordinates on the infinite jet bundle of 05.05.05, where ranges over symmetric multi-indices. The total derivative acts as a derivation of , and . The development follows Olver [Olver §5.1].
Definition (generalised vector field). A generalised vector field is a formal expression
$$
v = \xi^i(x, u^{(n)}),\frac{\partial}{\partial x^i} + \phi^\alpha(x, u^{(n)}),\frac{\partial}{\partial u^\alpha},
$$
whose coefficients may depend on derivatives of any finite order , in contrast to the point fields of 05.05.06 where depend on alone. Its characteristic is with , and its evolutionary representative is the vertical field
$$
v_Q = \sum_{\alpha} Q^\alpha,\frac{\partial}{\partial u^\alpha}, \qquad \mathrm{pr},v_Q = \sum_{\alpha, J} (D_J Q^\alpha),\frac{\partial}{\partial u^\alpha_J},
$$
the prolongation formula reducing to because has no horizontal part. The symbols , , and the evolutionary field are recorded in _meta/NOTATION.md.
Definition (generalised symmetry). Let be a system of PDEs. A generalised vector field with characteristic is a generalised symmetry (Lie-Bäcklund symmetry) of if $$ \mathrm{pr},v_Q(\Delta_\nu) = 0 \quad\text{on every solution of } \mathcal{S}, $$ that is, lies in the differential ideal generated by and their total derivatives. For an evolution equation written with , a -independent characteristic is a symmetry exactly when the characteristic bracket vanishes, $$ [K, Q] := \mathrm{pr},v_K(Q) - \mathrm{pr},v_Q(K) = 0, $$ so symmetries are the centraliser of in the characteristic Lie algebra. Here recovers the Fréchet derivative , the linearisation .
Definition (null characteristic; equivalence). A characteristic is null for (the standard literature also calls these vanishing-on-solutions characteristics) if on every solution, equivalently lies in the differential ideal of . Two generalised symmetries are equivalent if their characteristics differ by a null characteristic; the evolutionary field of a null characteristic acts as zero on solutions, so generalised symmetries are properly objects of the quotient of characteristics modulo the null ones. Within this quotient every generalised vector field is equivalent to its evolutionary representative , since and differ by the total field , which is tangent to all prolongations.
Definition (recursion operator). A linear operator on characteristics (generally a formal integro-differential operator, built from , multiplication by elements of , and the formal antiderivative ) is a recursion operator for the evolution equation if it satisfies the operator identity $$ R_t = [,K', R,] := K',R - R,K', $$ where differentiates the coefficients of along the flow and is the Fréchet derivative of . When is a symmetry, , then is again a symmetry. For autonomous (time-independent) the condition reads , the statement that commutes with the linearised flow.
A non-example fixes the meaning. The bare differential operator applied to the KdV symmetry returns , which is not a KdV symmetry on its own: fails the commutator identity for . The genuine KdV recursion operator must include the nonlocal correction and the multiplication term ; only the full satisfies the commutator condition.
Counterexamples to common slips
- A generalised symmetry need not be a point symmetry, and conversely. The characteristic for the heat equation is a genuine symmetry but corresponds to no transformation of alone; it lives only on the jet bundle.
- Null characteristics are not zero — they vanish only on solutions. For the characteristic is null: it vanishes on every solution and its evolutionary field acts as zero there, so it must be quotiented out before counting symmetries.
- is not the KdV recursion operator. Dropping the nonlocal tail breaks the commutator identity ; the differential part alone does not map symmetries to symmetries.
- is a formal antiderivative, not a bounded operator. Expressions act on the hierarchy because the integrands are total derivatives there; applying to an arbitrary element of is not a closed operation.
Key theorem with proof Intermediate+
Theorem (recursion operators map symmetries to symmetries). Let be an evolution equation with Fréchet derivative , and let be a recursion operator satisfying . If is a generalised symmetry, that is equivalently for a -independent symmetry along the flow, then is a generalised symmetry.
Proof. Recall the characterisation of a symmetry through the linearisation. A -independent characteristic is a symmetry of precisely when its evolutionary field commutes with the flow, which in linearised form is the equation $$ \frac{d}{dt},Q = K'[Q] $$ along solutions, where is the total time derivative carrying the explicit and implicit -dependence. For a -independent the explicit part drops and this is , identical to after using and the antisymmetric definition of the bracket.
Apply the operator and the product rule to the candidate . Its total time derivative is $$ \frac{d}{dt}(RQ) = \Big(\frac{d}{dt}R\Big)Q + R,\frac{d}{dt}Q = R_t,Q + R,K'[Q], $$ using for the coefficient evolution of and from the symmetry hypothesis. Substitute the recursion-operator identity : $$ \frac{d}{dt}(RQ) = (K'R - R K'),Q + R,K'[Q] = K'[RQ] - R,K'[Q] + R,K'[Q] = K'[RQ]. $$ The two middle terms cancel, leaving . This is precisely the linearised-symmetry equation for the characteristic , so satisfies the symmetry criterion and is a generalised symmetry of .
Bridge. This theorem is the engine of the symmetry approach to integrability. It builds toward the construction of an infinite hierarchy: seeding with the translation symmetry and iterating produces , all symmetries by induction. It appears again in 05.09.11, where the very same recursion operator is the object the master symmetry rescales through , so that the operator proved here to preserve symmetries is the one bracketing generates the hierarchy from. The construction reuses the prolongation calculus of 05.05.06 in a new guise: there tangency of to the equation defined a symmetry, and the central insight is that the recursion condition is exactly the statement that intertwines the linearised flow with itself, so symmetry-preservation is intertwining of the linearisation. Putting these together, the recursion operator, the Fréchet derivative, and the characteristic bracket are three readings of one structure: the commutator algebra of the linearised evolution.
Exercises Intermediate+
Lean formalization Intermediate+
Mathlib contains neither the differential algebra of differential functions on the infinite jet bundle, nor generalised vector fields, nor the formal integro-differential operators with the antiderivative that a recursion operator requires. The unit is formalisation-free; a meaningful Lean statement needs the infrastructure documented in Mathlib gap analysis. The aspirational target statement has the schematic form:
-- Aspirational, not currently realisable in Mathlib.
theorem recursion_maps_symmetries
(K : Characteristic) -- the evolution u_t = K
(R : IntegroDiffOperator) -- formal operator built from D, ·, D⁻¹
(hR : R.timeDeriv K = lieBracket (frechet K) R) -- R_t = [K', R]
(Q : Characteristic)
(hQ : isSymmetry K Q) : -- [K, Q] = 0
isSymmetry K (R.apply Q) := by
sorryThe statement needs Characteristic (differential functions modulo null characteristics), frechet K (the Fréchet derivative as a linear differential operator), isSymmetry K Q (the bracket condition ), and IntegroDiffOperator with a timeDeriv and a Lie bracket against . The closest existing Mathlib infrastructure is Derivation and LieAlgebra, but neither the jet-space differential algebra nor the pseudo-differential extension with exists. Tracked as a long-horizon contribution roadmap; the human reviewer is the correctness gate for the prolongation and commutator computations.
Advanced results Master
The generalised-symmetry framework converts the question "is this equation integrable?" into "does it possess an infinite hierarchy of higher symmetries?", and the recursion operator is the device that produces the hierarchy. Two worked systems and the structural results frame the theory; the exposition follows Olver [Olver §5.2] and the symmetry approach to integrability of Fokas [Fokas 1987].
The KdV hierarchy from the recursion operator. For the operator satisfies , where is the Fréchet derivative. Seeding with the translation characteristic , the iterates are higher symmetries by the symmetry-preservation theorem: (KdV itself), (the fifth-order flow), and onward. Each generates a commuting evolution , and the family is involutive, , the defining feature of an integrable hierarchy. The local conserved densities of KdV are produced alongside, one for each rung.
The Burgers hierarchy and linearisability. Burgers carries , a first-order recursion operator. The hierarchy is comparatively simple because the Cole-Hopf transform linearises Burgers to the heat equation , under which pulls back to the bare acting on . The higher symmetries of Burgers are the images of the elementary heat-equation symmetries . The contrast with KdV is structural: KdV is integrable but genuinely nonlinear, with a nonlocal recursion operator and a hierarchy that resists linearisation, while Burgers is C-integrable (linearisable by a change of variables), with an essentially simpler hierarchy whose recursion operator becomes local after that change.
Equivalence and the structure of the symmetry algebra. The characteristics form a Lie algebra under , and the symmetries of are the centraliser of , a subalgebra by the Jacobi identity. For an integrable equation this centraliser is infinite-dimensional, spanned by the hierarchy together with the master symmetries of 05.09.11; the higher symmetries form an abelian ideal on which the master symmetries act as a centerless Virasoro algebra. Generalised symmetries here are taken as elements of the quotient of characteristics by the null ones, within which each generalised vector field is equivalent to its evolutionary representative .
The symmetry test for integrability. Ibragimov and Shabat [Ibragimov-Shabat 1980] turned the existence of higher symmetries into a classification tool: an evolution equation admits a genuine Lie-Bäcklund symmetry of high order only under stringent constraints on , and these constraints, made algorithmic through the formal symmetry / canonical-density method of Mikhailov-Shabat-Sokolov, classify the integrable scalar evolution equations. The existence of a single higher symmetry of sufficiently high order forces the existence of the entire hierarchy and of a recursion operator, so "possessing one genuine generalised symmetry" is, for a broad class, equivalent to integrability.
Synthesis. The generalised-symmetry theory reduces integrability to a single structural feature, an infinite hierarchy of commuting higher symmetries, and the recursion operator is the operator that manufactures the hierarchy from one seed. The argument runs along four linked stages, each carried by the total derivative of 05.05.05. Generalised symmetries are defined as characteristics on the jet bundle with tangent to the equation, taken modulo null characteristics, so that the evolutionary representative is the canonical object. The symmetry criterion is rewritten through the Fréchet derivative as , the linearised-flow equation. A recursion operator is then exactly an operator intertwining the linearised flow with itself, , and the central computation shows this intertwining propagates the symmetry equation from to . Iterating from the translation seed generates the abelian tower , which is the integrable hierarchy, the same tower the master symmetry of 05.09.11 generates by bracketing and the bi-Hamiltonian pencil generates by . The KdV and Burgers hierarchies instantiate every stage, and the same operator that built the jet-bundle contact structure here defines the Fréchet derivative, the recursion operator, and the characteristic bracket.
Full proof set Master
Proposition (Fréchet-derivative form of the symmetry criterion). For an autonomous evolution equation , a -independent characteristic is a generalised symmetry if and only if , equivalently where .
Proof. The evolutionary field is a symmetry when its flow commutes with the flow of , equivalently when the Lie bracket of the prolonged evolutionary fields vanishes: . The bracket of two evolutionary fields is again evolutionary with characteristic ; this is the standard identity that the assignment is a Lie-algebra anti-homomorphism from to vector fields. Now by the chain rule on , the Fréchet derivative being evaluated in the direction of its argument; symmetrically . Hence , and the symmetry condition reads . Along the flow , the total time derivative of is ; combined with this gives , the linearised-symmetry equation.
Proposition (the recursion identity is intertwining of the linearised flow). An autonomous operator on characteristics satisfies the recursion condition if and only if maps the kernel of the linearised-symmetry operator into itself; consequently maps symmetries to symmetries.
Proof. Let act on -dependent characteristics, so that the symmetries are the -independent elements of along the flow. Compute the commutator acting on a characteristic : $$ [L, R]Q = L(RQ) - R(LQ) = \Big(\frac{d}{dt}(RQ) - K'[RQ]\Big) - R\Big(\frac{d}{dt}Q - K'[Q]\Big). $$ Expand with : $$ [L, R]Q = R_t Q + R\tfrac{d}{dt}Q - K'[RQ] - R\tfrac{d}{dt}Q + R,K'[Q] = R_t Q - K'[RQ] + R,K'[Q] = \big(R_t - [K', R]\big)Q. $$ Therefore as operators. The recursion condition holds if and only if , that is commutes with . A commuting operator preserves : if then . Restricting to -independent elements, maps symmetries to symmetries.
Proposition (the KdV operator satisfies the recursion identity). For with , the operator satisfies , where is the coefficient evolution along the KdV flow.
Proof. Along the flow, and , so $$ R_t = 4 u_t + 2 u_{xt} D^{-1} = 4(u_{xxx} + 6 u u_x) + 2(u_{4x} + 6 u_x^2 + 6 u u_{xx}) D^{-1}. $$ For the right side, expand with and , using the operator product rule for multiplication operators and the relations , in the formal pseudo-differential algebra. Collecting terms by differential order, the purely differential parts of and cancel through order , the multiplication-operator residue is , and the nonlocal residue multiplying is , matching term by term. The two sides agree, , so is a recursion operator for KdV. The computation is the explicit verification underlying Olver's general result [Olver §5.2].
Connections Master
The prolongation calculus of 05.05.06 is the substrate: the evolutionary field , the characteristic , and the tangency criterion are the point-symmetry constructions extended to characteristics of unbounded order. A generalised symmetry is what Lie's infinitesimal criterion becomes when the coefficient functions are allowed to read the whole jet.
The recursion operator proved here to map symmetries to symmetries is exactly the operator iterated in 05.09.11: the master symmetry satisfies , and the hierarchy it bracketing-generates is the abelian tower of higher symmetries this unit produces directly from the recursion identity . The two units describe the same operator from the symmetry-preservation side and the hierarchy-generation side.
The infinite hierarchy of commuting symmetries is the defining datum of an integrable system 05.02.03 in the infinite-dimensional setting: the higher symmetries are the commuting Hamiltonian flows and their conserved densities are the involutive integrals, the field-theoretic analogue of Liouville-Arnold integrability.
The bi-Hamiltonian factorisation and the heredity (Nijenhuis) condition that upgrades single symmetry-preservation to involutivity of the whole tower are developed in 05.09.11 and surface in the finite-gap integration of 05.09.09, where the stationary higher symmetries cut out the algebro-geometric solutions and the recursion operator's spectrum organises the spectral curve.
Historical & philosophical context Master
The notion that a transformation of a differential equation may legitimately depend on derivatives of any order originates with Albert Victor Bäcklund, whose 1880 study of surface transformations [Bäcklund 1880] (Mathematische Annalen 9) introduced transformations that mix a function with its first and higher derivatives, the higher-order tangent transformations now called Lie-Bäcklund transformations. Sophus Lie had shown that genuine contact transformations of finite-dimensional jet space involve at most first derivatives; Bäcklund's contribution was to recognise that admitting all orders opens a vastly larger class, at the cost of working on the infinite jet bundle. Emmy Noether's 1918 Invariante Variationsprobleme [Noether 1918] (Göttinger Nachrichten) already worked with symmetries in characteristic (evolutionary) form, pairing them with conservation laws, and her formalism is the one in which generalised symmetries are most naturally expressed.
The modern theory crystallised in the 1970s with the discovery that the Korteweg-de Vries equation possesses infinitely many symmetries. Peter Olver's 1977 paper Evolution equations possessing infinitely many symmetries [Olver 1977] (J. Math. Phys. 18) introduced the recursion operator as the explicit generator of the KdV hierarchy of higher symmetries and gave the operator condition characterising such recursion operators, later canonised in his 1986 / 1993 Applications of Lie Groups to Differential Equations. Nail Ibragimov and Alexey Shabat's 1980 work [Ibragimov-Shabat 1980] (Funct. Anal. Appl. 14) turned the existence of higher Lie-Bäcklund symmetries into a classification criterion for integrable evolution equations, the seed of the symmetry approach to integrability developed algorithmically by Mikhailov, Shabat, and Sokolov. Athanassios Fokas's 1987 survey Symmetries and integrability [Fokas 1987] (Stud. Appl. Math. 77) synthesised the recursion-operator, bi-Hamiltonian, and symmetry-classification viewpoints, establishing that for a wide class of scalar evolution equations the existence of a single genuine higher symmetry is equivalent to integrability.
Bibliography Master
@article{Backlund1880,
author = {B{\"a}cklund, Albert Victor},
title = {{\"U}ber Fl{\"a}chentransformationen},
journal = {Mathematische Annalen},
volume = {9},
year = {1880},
pages = {297--320}
}
@article{Noether1918,
author = {Noether, Emmy},
title = {Invariante Variationsprobleme},
journal = {Nachrichten von der Gesellschaft der Wissenschaften zu G{\"o}ttingen, Mathematisch-Physikalische Klasse},
year = {1918},
pages = {235--257}
}
@article{Olver1977,
author = {Olver, Peter J.},
title = {Evolution equations possessing infinitely many symmetries},
journal = {Journal of Mathematical Physics},
volume = {18},
year = {1977},
pages = {1212--1215}
}
@article{IbragimovShabat1980,
author = {Ibragimov, Nail H. and Shabat, Alexey B.},
title = {Evolutionary equations with a nontrivial {L}ie--{B}{\"a}cklund group},
journal = {Functional Analysis and its Applications},
volume = {14},
year = {1980},
pages = {19--28}
}
@article{Fokas1987,
author = {Fokas, Athanassios S.},
title = {Symmetries and integrability},
journal = {Studies in Applied Mathematics},
volume = {77},
year = {1987},
pages = {253--299}
}
@book{OlverLieGroups,
author = {Olver, Peter J.},
title = {Applications of Lie Groups to Differential Equations},
series = {Graduate Texts in Mathematics},
volume = {107},
publisher = {Springer},
year = {1993},
edition = {2nd}
}
@incollection{MikhailovShabatSokolov1991,
author = {Mikhailov, Alexander V. and Shabat, Alexey B. and Sokolov, Vladimir V.},
title = {The symmetry approach to classification of integrable equations},
booktitle = {What is Integrability?},
editor = {Zakharov, V. E.},
publisher = {Springer},
year = {1991},
pages = {115--184}
}
@book{Ibragimov1985,
author = {Ibragimov, Nail H.},
title = {Transformation Groups Applied to Mathematical Physics},
publisher = {Reidel},
year = {1985}
}