KAM theorem and chaos
Anchor (Master): Arnold, *Mathematical Methods of Classical Mechanics*, 2nd ed. (1989), §52; Arnold, *Geometric Methods in the Theory of ODEs*
Intuition [Beginner]
Integrable systems are the exception, not the rule. An integrable Hamiltonian system 09.06.01 pending has enough conserved quantities to confine motion to smooth tori in phase space. But add almost any perturbation and most of that clean structure breaks down. The motion becomes chaotic: trajectories that start close together diverge exponentially fast, and long-time prediction becomes impossible even though the equations are deterministic.
What does "chaos" mean in mechanics? Three signatures define it. First, sensitive dependence on initial conditions: change the starting point by a tiny amount and the trajectory eventually wanders far away — the butterfly effect. Second, mixing in phase space: a small region of initial conditions gets stretched, folded, and spread throughout the available phase space, like a drop of dye mixed into water. Third, unpredictable long-time behaviour: even with exact equations and exact arithmetic, you cannot say where the system will be after a long time without following the trajectory step by step.
The KAM theorem (Kolmogorov, Arnold, Moser) gives the partial good news: if the perturbation is small enough, most of the invariant tori survive. Not all — some break up — but a set of positive measure persists. The surviving tori still confine trajectories and prevent them from wandering through all of phase space. The tori that do break up create the chaotic regions visible between the surviving ones.
The analogy: think of a smoothly flowing river with some whirlpools. The river current is the regular motion on surviving tori. The whirlpools are the chaotic zones where tori have broken up and trajectories get trapped in complicated, swirling patterns. As the perturbation grows, more whirlpools appear and the regular current shrinks.
Visual: the standard map [Beginner]
The simplest system that shows KAM destruction is the standard map (also called the kicked rotor). It is a discrete map of the plane that models a pendulum kicked by periodic impulses. The map takes a point to a new point via
The parameter controls the perturbation strength.
When , every horizontal line constant is an invariant torus. The motion is purely horizontal — regular and predictable. As increases, some of these horizontal curves break apart while others persist. The broken ones dissolve into a scatter of points that fills a chaotic band. The surviving ones bend and wobble but remain intact as barriers.
At small (say ), most tori survive with thin chaotic layers visible between them. At large (say ), nearly all tori have been destroyed and the phase portrait is dominated by chaos. The transition is not uniform: the last torus to break is the "golden mean" torus whose rotation number equals where . This torus is the most resistant to perturbation because the golden ratio is the irrational number hardest to approximate by rationals.
Worked example [Beginner]
Compute five iterates of the standard map starting from with .
Step 1. First iterate. Compute . With , the coefficient is . The sine: . So .
Now .
Step 2. Second iterate. . Here . So . And .
Step 3. Continue the pattern. . Then .
Step 4. . Then (modulo 1).
Step 5. . Then .
After five iterates the point has moved from to approximately . The -coordinate nearly returned, but drifted upward. Over many iterates the trajectory may stay near a surviving torus or begin to wander diffusively through the chaotic zone, depending on the initial conditions.
Check your understanding [Beginner]
Formal definition [Intermediate+]
An -degree-of-freedom Hamiltonian system is Liouville-integrable 09.06.01 pending when it admits independent conserved quantities in involution. By the Liouville-Arnold theorem, the motion then lies on invariant -tori parametrised by action-angle variables , with and the flow , .
A near-integrable Hamiltonian has the form
where is integrable, is a smooth perturbation, and . The question answered by KAM theory is: what survives of the invariant tori when ?
Poincare sections. For a 2-degree-of-freedom system, fix energy and record the intersection of the trajectory with a surface (say). Each such intersection is a point in the plane — the Poincare section. On an invariant torus, successive points trace a smooth closed curve. In a chaotic zone, the points scatter diffusively. The Poincare section is the primary diagnostic tool for visualising KAM destruction.
Lyapunov exponents. The maximal Lyapunov exponent of a trajectory measures the exponential rate of separation of nearby orbits:
where is the separation between two initially close phase-space points evolved by the same flow. A positive Lyapunov exponent () is the quantitative signature of chaos. For a trajectory on a surviving KAM torus, .
Non-degeneracy (Kolmogorov condition). The unperturbed Hamiltonian must satisfy
This ensures the frequency map is a local diffeomorphism, so distinct tori have genuinely distinct frequencies.
The Henon-Heiles system
The Henon-Heiles Hamiltonian is
At low energy ( for ), the cubic perturbation is small and Poincare sections show mostly smooth invariant curves (surviving tori). Above , chaotic regions appear and grow rapidly. This system is the standard numerical demonstration of the KAM transition: the perturbation parameter is effectively the energy itself.
Key theorem with proof [Intermediate+]
Theorem (Kolmogorov-Arnold-Moser). Consider a near-integrable Hamiltonian on a -dimensional phase space, with real-analytic and satisfying the Kolmogorov non-degeneracy condition . Then for sufficiently small , the perturbed system possesses invariant -tori on which the flow is conjugate to a linear flow with Diophantine frequencies. The union of these surviving tori has positive Lebesgue measure in phase space.
Proof sketch. The proof follows a Newton-type iteration in function space — often called the superconvergent or quadratic scheme.
Step 1 (Ansatz). Seek a canonical transformation that maps the perturbed Hamiltonian back to integrable form on a single torus. The generating function must solve the homological equation
where collects the angle-dependent terms at first order.
Step 2 (Small divisors). The solution is , summing over Fourier modes . The denominators can be arbitrarily small — the small-divisor problem. The Diophantine condition excludes resonant tori and controls the rate at which denominators approach zero.
Step 3 (Quadratic convergence). A single step reduces the perturbation from to — this is the Newton iteration in function space. After steps the perturbation is , which converges super-exponentently. This rapid convergence overcomes the small-divisor losses at each step.
Step 4 (Measure estimate). For each Diophantine frequency vector, the corresponding torus survives. The set of Diophantine vectors has full measure in the frequency space, so a positive-measure set of tori persists.
The complete proof requires careful control of analyticity losses at each step, maintained by restricting the domain of analyticity at a controlled rate. Arnold's 1963 proof uses a modified scheme optimised for the Hamiltonian setting; Moser's 1962 proof (the twist theorem) handles the area-preserving map case with finite differentiability. ∎
Moser's twist theorem. For area-preserving twist maps of the annulus (the discrete-time analogue of 1.5-degree-of-freedom systems), Moser proved that invariant curves with Diophantine rotation numbers persist under small perturbations. This is the KAM theorem for maps and is the result that directly applies to the standard map.
Exercises [Intermediate+]
The KAM theorem in full detail [Master]
The precise statement requires several technical ingredients.
Diophantine frequencies. A frequency vector is -Diophantine if there exist , such that
The set of Diophantine vectors with parameters has Lebesgue measure as ; the complement (resonant vectors) has measure .
Kolmogorov's theorem (1954). Let be real-analytic and non-degenerate (). Then for each Diophantine there exists such that for the perturbed Hamiltonian has an invariant torus with frequency .
Arnold's theorem (1963). Under the same hypotheses, with the additional assumption that is real-analytic, the set of surviving invariant tori has positive Lebesgue measure in phase space, and the measure tends to full measure as .
Moser's twist theorem (1962). Let be an area-preserving twist map of the annulus of class with (later refined to by Hermann, then by Xia). Then for sufficiently small perturbations from integrable, every invariant circle with Diophantine rotation number persists. Moser's original proof required 333 derivatives; the modern smoothness threshold is .
The convergence scheme: Newton's method in function space
The KAM iteration is a Newton method on the space of generating functions. At step , one has an approximate invariant torus with a small error . One step of the Newton scheme (solve the linearised problem, correct the torus, update the error) reduces to . The quadratic convergence rate compensates for two losses:
- Small divisors: each step requires dividing by , which the Diophantine condition bounds from below at the cost of losing powers of .
- Domain shrinkage: at each step, the domain of analyticity in the angle variables shrinks by a fixed amount. The super-exponential convergence ensures the total shrinkage is finite.
After infinitely many steps, the sequence converges to a true invariant torus. The key estimate is that the total loss of analyticity is bounded: if the initial domain has width , the -th step has width .
Arnold diffusion and Nekhoroshev estimates
For degrees of freedom, the surviving KAM tori are -dimensional in -dimensional phase space — codimension — and do not separate the energy surface into disconnected components. Arnold diffusion is the phenomenon whereby trajectories in the chaotic zone drift in the action variables over exponentially long timescales, moving distances in phase space.
Arnold's original 1964 example constructed a Hamiltonian where orbits near a normally-hyperbolic invariant cylinder execute a heteroclinic excursion, gaining a net change in the action variable. The drift time is in Arnold's model.
The Nekhoroshev theorem (1977) provides the complementary upper bound: under a steepness (quasi-convexity) condition on , every trajectory satisfies
where depends on (typically ), and are constants. The stability time is exponentially long but finite; diffusion eventually occurs for trajectories not on KAM tori. The Nekhoroshev regime ends when , after which the action can drift by .
Symplectic maps and the standard map
The standard map is the prototypical symplectic map of the cylinder. Its KAM theory is governed by Moser's twist theorem rather than the Hamiltonian KAM theorem, but the structure is parallel.
The twist condition requires , which holds for all . The critical parameter marks the destruction of the last invariant circle (the golden-mean torus). For , no invariant circles span the cylinder and unbounded diffusion in becomes possible — a discrete analogue of Arnold diffusion, sometimes called Chirikov diffusion.
The Greene-MacKay numerics establish via the residue criterion: an invariant circle with rotation number (convergent of ) has a stability residue ; the circle exists if and only if at the critical parameter. The extrapolation at provides the best numerical value.
Connection to ergodic theory
The KAM theorem sits at the interface of Hamiltonian mechanics and ergodic theory. For a near-integrable system:
- The surviving KAM tori carry quasi-periodic motion — ergodic with respect to the uniform measure on the torus, but not mixing. The spectrum is pure point.
- The chaotic zones carry motion that is conjectured to be ergodic (on each connected component) with respect to the Liouville measure restricted to the zone. Rigorous results exist for specific systems (e.g., the stadium billiard is ergodic by Bunimovich's theorem).
- The phase space decomposes into a mixture of regular and chaotic zones, each of positive measure. This is the KAM picture: neither fully integrable nor fully chaotic, but a structured coexistence.
The Kolmogorov-Arnold theorem implies that generic near-integrable Hamiltonian systems are not ergodic on the full energy surface, because the KAM tori are invariant sets of positive measure that block ergodicity. This was a decisive answer to the Fermi-Pasta-Ulam question and refuted the early ergodic hypothesis in its strongest form.
Resonance overlap and the Chirikov criterion [Master]
A single resonance in a near-integrable Hamiltonian produces a thin chaotic layer around the separatrix of the resonant island. When multiple resonances are present — and they always are, because the Fourier expansion of contains infinitely many modes — the question becomes: when do the chaotic layers from distinct resonances merge to produce large-scale chaos?
Proposition (Resonance half-width). For a single resonance with mode amplitude , the half-width of the resonance zone in action space scales as
Proof. Near the resonant action where , introduce the resonant phase and expand the Hamiltonian to first order in . The frequency expands as , and the resonance condition kills , leaving the leading term for the unperturbed angle dynamics.
The resulting reduced Hamiltonian is that of a pendulum:
where is the effective mass and the Fourier amplitude of the resonant mode. The pendulum separatrix has energy . The maximum excursion in the momentum satisfies , giving and . Substituting yields the stated scaling.
The Chirikov criterion. Chirikov's 1979 insight [Chirikov 1979] was that global chaos sets in when the half-widths of neighbouring resonances overlap. For two primary resonances at actions with wave vectors , the overlap condition is
When this inequality holds, the separatrix layers of the two resonances intersect. A trajectory near one resonance can then cross into the domain of the other, initiating widespread chaotic diffusion through action space.
For the standard map, the primary resonances are at rational rotation numbers with half-widths . The Chirikov criterion predicts global chaos for , in rough agreement with the numerically determined critical value .
The Chirikov criterion is an asymptotic estimate, not a theorem. It overestimates the chaos threshold because it neglects higher-order resonances and the fine structure of the separatrix splitting. Refined versions (Lichtenberg-Lieberman, Regular and Chaotic Dynamics, 1992) account for the renormalisation of resonance widths by secondary resonances and give more accurate predictions. The criterion remains the standard engineering tool in plasma physics and accelerator design because it is simple, conservative, and applicable to systems where rigorous KAM estimates are unavailable.
Application to the standard map. The distance between the and resonances is (they sit at and ). The half-width of each is . The overlap condition gives , which overestimates by about 60 percent. The discrepancy arises because the golden-mean torus at lies between these two resonances and survives past the Chirikov threshold. The last torus persists until , well below the Chirikov prediction. This gap between the resonance-overlap estimate and the true critical value is a quantitative measure of how much stability the most robust KAM torus provides beyond the naive resonance-width picture.
Renormalisation and torus breakup universality [Master]
The destruction of the last KAM torus in the standard map exhibits universal scaling behaviour that is independent of the specific map, depending only on the arithmetic properties of the torus's rotation number. This universality is the dynamical-systems analogue of Feigenbaum universality in period-doubling cascades 02.12.17.
Greene's method. Greene (1979) established that an invariant circle with rotation number exists if and only if the periodic orbits with rotation numbers (the convergents of 's continued fraction) are all linearly stable. The stability residue of a periodic orbit of period is
where is the Jacobian of the -th iterate of the map evaluated on the orbit. For an area-preserving map, corresponds to linear stability (elliptic), to instability (hyperbolic), and is the marginal case.
The critical parameter for the destruction of the torus with rotation number is determined by the condition that the residues of the convergent orbits approach from below as . Greene's conjecture — confirmed for the golden-mean torus by de la Llave and collaborators — is that the torus exists at parameter if and only if .
Scaling at the breakup. As approaches from below, the residues converge to a universal function of and the distance . The convergence is geometric with ratio where is a universal constant. For the golden-mean torus, . This constant appears for any analytic area-preserving twist map whose golden-mean torus is at the point of breakup — it is a property of the golden mean's continued fraction , not of the specific dynamics. Different noble rotation numbers produce different universal constants, all sharing the same renormalisation mechanism.
Renormalisation-group interpretation. MacKay (1983) reformulated Greene's scaling as a renormalisation-group (RG) transformation on the space of area-preserving maps. The RG operator acts on a map by: (i) restricting attention to a neighbourhood of the torus, (ii) rescaling to unit size, and (iii) iterating by the number of steps given by the continued-fraction convergent. For the golden mean, corresponds to the self-similar substitution .
The fixed point of satisfies . The universal constant is the unstable eigenvalue of . The critical surface — the set of maps whose golden-mean torus is at the point of breakup — is the stable manifold of ; maps on one side have a surviving torus, maps on the other have a broken one.
Proposition (Universality of breakup scaling). For any analytic area-preserving twist map of the cylinder, the scaling of the distance to criticality with the residue convergence rate is governed by the universal constants (determined by the continued fraction of ). The residue sequence satisfies
for a universal function independent of the map.
Proof sketch. The renormalisation operator has a hyperbolic fixed point in the space of analytic area-preserving twist maps. By the stable-manifold theorem for , orbits of the RG dynamics near converge to along the stable manifold (parameterised by the critical surface) and diverge along the unstable direction at rate . The residue is a smooth functional on the space of maps; composing with gives . Linearising around : for near the critical surface, where measures the distance of from criticality. Hence , and rescaling gives the stated form.
The connection to Feigenbaum's period-doubling universality is structural: both arise from a fixed point of a renormalisation operator on the space of dynamical systems, with universality emerging from the eigenvalues of the linearised RG transformation. The difference is the RG transformation itself — Feigenbaum's acts on unimodal maps by composition and rescaling, while the KAM version acts on area-preserving maps by a continued-fraction-driven iteration and rescaling. The common thread is that self-similar structure in the bifurcation sequence produces scale-invariant critical behaviour.
Numerical determination of . The best current numerical value for the standard map's critical parameter is , determined by computing the residues for orbits with rotation numbers equal to the convergents of the golden mean (where are Fibonacci numbers) and extrapolating to using the scaling law. The convergence is geometric with ratio , giving about one additional decimal digit per two iterations. This value is consistent with the analytic bounds obtained by the parameter-exclusion method of de la Llave and Petrov (2002).
Resonance webs and heteroclinic connections [Master]
For systems with degrees of freedom, the resonance structure in action space forms a web of intersecting hypersurfaces. Each resonance defines a codimension-1 surface in the -dimensional action space. Where two resonances intersect, the codimension increases and the dynamics become more complex. The full structure — the resonance web — controls the topological possibility and the rate of Arnold diffusion.
Separatrix splitting. In an integrable system, the separatrix of a resonance is a smooth curve connecting the hyperbolic fixed points. Under perturbation, the stable and unstable manifolds of the hyperbolic points no longer coincide: they split transversely, creating a homoclinic tangle. The splitting is exponentially small in for analytic systems (a result due to Lazutkin and others), which is why it cannot be detected by any finite-order perturbation theory. The size of the splitting determines the width of the chaotic layer and the rate of diffusion along the resonance.
Proposition (Exponential smallness of the splitting). For the standard map with parameter , the splitting angle between the stable and unstable manifolds of the hyperbolic fixed point at the origin is of order for some constant , as .
This exponentially small splitting explains why numerical detection of chaos requires either sufficiently large or exponentially long integration times. For , the splitting is of order , which is numerically undetectable. For , the splitting is substantial and the chaotic layer is readily visible in Poincare sections.
The transition-chain mechanism. Arnold's 1964 example [Arnold 1964] constructs a near-integrable system in which orbits drift by in the action variables over exponentially long times. The mechanism has three ingredients: a normally-hyperbolic invariant cylinder (a "highway" in the phase space), heteroclinic connections between the cylinder's stable and unstable manifolds at different action levels, and a "transition chain" — a sequence of overlapping hyperbolic cylinders that the orbit follows, gaining a net displacement in action.
The drift time along a single transition chain is in Arnold's original model. Nekhoroshev's theorem provides the complementary upper bound: every orbit has its action confined within for times up to . Together, these results delineate the stability window: orbits are confined for exponentially long but not infinite times, and diffusion is possible but slow.
Modern results on Arnold diffusion. The existence of Arnold diffusion for generic systems was rigorously established by Cheng and Yan (2004) for the a priori stable case (both and fixed, the only parameter). The diffusion rate is bounded by for some , giving a minimum drift time to move in action. This is polynomial, not exponential — faster than Arnold's original estimate — but still very slow for small . The precise exponent depends on the dimension and the resonance structure; for degrees of freedom, is the current best bound.
The variational approach of Mather (1991–2004) recasts Arnold diffusion as a problem in global variational dynamics. Mather's -function and the associated action-minimising orbits (Aubry-Mather sets) provide a topological framework that avoids the explicit construction of transition chains. For generic Hamiltonians, Mather proved that diffusion occurs whenever there exists a "diffusion path" in the action space along which the Mather -function has a strict local minimum — a condition that is satisfied for generic perturbations.
Synthesis. The foundational reason the KAM framework controls long-time dynamics in near-integrable systems is the interplay between three structural features: the Diophantine tori that survive perturbation (KAM), the exponential confinement of all orbits between tori (Nekhoroshev), and the universal scaling at torus breakup (renormalisation). Putting these together identifies the transition from regular to chaotic motion as a critical phenomenon governed by renormalisation-group fixed points, and the central insight is that the arithmetic properties of rotation numbers — how well approximable they are by rationals — determine both which tori survive and how the surviving tori eventually break.
The bridge is between the abstract measure-theoretic persistence of KAM and the concrete numerical observation that the golden-mean torus is the last to go, with universal critical exponents that are independent of the mechanical system. This pattern recurs throughout Hamiltonian mechanics: stability is the norm for small perturbations, instability arrives through resonance overlap at a universal threshold, and the transition exhibits the same renormalisation universality as phase transitions in statistical mechanics 02.12.17. The generalises the notion of structural stability from the Kolmogorov-Arnold setting to the Nekhoroshev setting, where exponential — not eternal — confinement holds for every orbit, not just those on Diophantine tori.
Historical context [Master]
The story begins with Poincare's 1890 prize competition. King Oscar II of Sweden offered a prize for a solution to the -body problem. Poincare submitted a memoir claiming the convergence of perturbation series, which would have resolved the stability of the solar system. During revision he discovered that the series diverge due to resonances — the small-divisor problem. This discovery, contained in the corrected memoir (Les Methodes Nouvelles de la Mecanique Celeste, 1892-1899), is the origin of chaos theory. Poincare identified homoclinic tangles and the impossibility of convergent perturbation series for generic systems.
Kolmogorov (1954) announced the positive result at the ICM in Amsterdam: despite divergence of the formal series, most tori survive. His idea was to replace the divergent series by a convergent Newton iteration. The announcement was four pages in Doklady Akademii Nauk; the proof sketch was presented orally.
Arnold (1963) gave the first complete proof for the Hamiltonian case, published in Russian Mathematical Surveys 18 (1963), pp. 9-36 ("Proof of a theorem of A. N. Kolmogorov on the preservation of conditionally periodic motions under a small perturbation of the Hamiltonian"). Arnold's proof introduced the technique of fast convergence (quadratic scheme) in the analytic category and addressed the measure estimate rigorously.
Moser (1962) proved the corresponding result for area-preserving maps of the annulus ("On invariant curves of area-preserving mappings of an annulus", Nachr. Akad. Wiss. Gottingen, Math.-Phys. Kl. II 1962, 1-20). His proof used a Nash-style implicit-function theorem and required 333 derivatives. The subsequent reduction to is due to Herman and later Xia.
The KAM theorem resolved a century-old tension in celestial mechanics: the solar system is near-integrable, perturbation series diverge, but the system is not necessarily unstable. The surviving tori provide effective barriers (for 1.5 degrees of freedom) or exponentially long stability (Nekhoroshev), reconciling Poincare's divergence with observed planetary stability.
Connections [Master]
Integrable systems and action-angle variables
09.06.01pending: KAM theory is the perturbation theory for the Liouville-Arnold torus structure. Without the integrable backbone, there are no tori to perturb.Hamiltonian flow and symplectic geometry
09.04.02pending: The KAM theorem is a statement about symplectic maps. Moser's twist theorem is the symplectic-map version; the Arnold version uses canonical transformations at each Newton step.Dynamical systems / bifurcation theory
02.12.17: The breakup of KAM tori produces period-doubling cascades, homoclinic bifurcations, and strange attractors. The Feigenbaum universality for period-doubling is one manifestation of the cascade from regular to chaotic motion.Ergodic theory: The KAM theorem provides the definitive counterexample to the strong ergodic hypothesis. Generic Hamiltonian systems have mixed phase space — neither fully regular nor fully chaotic. This motivates the study of almost-invariant sets (Froyland's coherent structures) and stickiness near island chains.
Celestial mechanics: The stability of the solar system is a KAM question. Laskar's numerical integrations (1994-2009) show that the inner planets are chaotic with a Lyapunov time of about 5 Myr, but remain bounded for more than 1 Gyr — consistent with Nekhoroshev-type estimates rather than permanent KAM stability.
Plasma physics and accelerator physics: Magnetic confinement devices (tokamaks) are near-integrable Hamiltonian systems for charged-particle orbits. The destruction of KAM surfaces leads to orbit loss. The Chirikov overlap criterion for resonance overlap is the practical engineering tool derived from KAM theory.
Quantum chaos: The semiclassical limit of a classically chaotic Hamiltonian has level statistics described by random matrix theory (Bohigas-Giannoni-Schmit conjecture). The KAM transition from regular to chaotic is reflected in the Poisson-to-Wigner-Dyson transition in the spectral statistics.
Bibliography [Master]
Kolmogorov, A. N. "On the conservation of conditionally periodic motions under a small perturbation of the Hamiltonian." Dokl. Akad. Nauk SSSR 98 (1954): 527-530. The originator paper; four-page announcement of the theorem.
Arnold, V. I. "Proof of a theorem of A. N. Kolmogorov on the preservation of conditionally periodic motions under a small perturbation of the Hamiltonian." Uspekhi Mat. Nauk 18(5) (1963): 13-40. [English: Russian Math. Surveys 18(5), 9-36.] The first complete proof.
Moser, J. "On invariant curves of area-preserving mappings of an annulus." Nachr. Akad. Wiss. Gottingen, Math.-Phys. Kl. II (1962): 1-20. The area-preserving-map version.
Arnold, V. I. Mathematical Methods of Classical Mechanics, 2nd ed. Springer GTM 60, 1989. §52 (averaging) and §53 (perturbation theory). The canonical textbook treatment.
Arnold, V. I. "Instability of dynamical systems with many degrees of freedom." Dokl. Akad. Nauk SSSR 156 (1964): 9-12. The Arnold diffusion example.
Nekhoroshev, N. N. "An exponential estimate of the time of stability of nearly integrable Hamiltonian systems." Uspekhi Mat. Nauk 32(6) (1977): 5-66. [English: Russian Math. Surveys 32(6), 1-65.] The Nekhoroshev estimates.
Henon, M. and Heiles, C. "The applicability of the third integral of motion: some numerical experiments." Astron. J. 69 (1964): 73-79. The Henon-Heiles system.
Chirikov, B. V. "A universal instability of many-dimensional oscillator systems." Phys. Rep. 52 (1979): 263-379. The Chirikov resonance-overlap criterion and the standard map.
de la Llave, R. "A tutorial on KAM theory." In Smooth Ergodic Theory and its Applications, Proc. Sympos. Pure Math. 69, AMS (2001): 175-292. A modern expository account with full proofs.
Broer, H. W. "KAM theory: the legacy of Kolmogorov's 1954 paper." Bull. Amer. Math. Soc. 41(4) (2004): 507-521. Historical survey.