Hamilton-Jacobi equation
Anchor (Master): Arnold, *Mathematical Methods of Classical Mechanics*, 2nd ed. (1989), §46–47; Landau & Lifshitz, *Mechanics*, 3rd ed. (1976), §47
Intuition [Beginner]
You have seen Hamilton's equations: a pair of first-order differential equations that describe motion through phase space. There is another way to solve mechanics problems entirely — not by integrating equations of motion step by step, but by finding one function that encodes every solution at once.
The strategy is a coordinate change. In the Hamiltonian picture you track position and momentum . A canonical transformation 09.05.01 pending is a change of coordinates on phase space that preserves the form of Hamilton's equations. If you could find a canonical transformation to new coordinates where the new Hamiltonian is zero, then Hamilton's equations in the new variables become and : every new coordinate and momentum is constant. The problem is solved.
The question is: what function generates this miraculous transformation? The answer is Hamilton's principal function . It is a function of the old coordinates and time . If satisfies a single partial differential equation — the Hamilton-Jacobi equation — then the transformation it generates makes , and the mechanical solution falls out.
The equation is deceptively simple. You write the Hamiltonian and replace every momentum by the slope of in the -direction. Then you add the slope of in time. The result must vanish. That is the entire equation.
What does mean physically? If you fix a starting point at time , then evaluated at any later point equals the action — the accumulated value of "kinetic minus potential energy" — along the unique classical trajectory from to . The slope of in gives the momentum at the endpoint. The slope of in gives minus the energy.
There is a deep geometric picture. In optics, light rays are perpendicular to wavefronts (surfaces of constant phase). Hamilton discovered that mechanical trajectories stand in the same relation to surfaces of constant : trajectories are perpendicular to level sets of the action. Mechanical "wavefronts" propagate through configuration space, and the trajectories are their normals.
This is not a metaphor. It is a mathematical identity. Hamilton was an optical theorist before he was a dynamicist. He recognised that the same equation — a first-order PDE for a "phase" function whose characteristics are rays — governs both light and mechanics. Jacobi completed the mathematical framework. Sixty years later, Schrodinger would take the optical-mechanical analogy one step further and arrive at quantum mechanics.
Visual [Beginner]
Figure: Configuration space for a free particle in 1D (horizontal axis: position , vertical axis: time ). The curved lines are classical trajectories fanning out from a common starting point — each trajectory corresponds to a different initial momentum. The horizontal curves are surfaces of constant : the "wavefronts" of the action. Trajectories cross the wavefronts at right angles. This is the mechanical analogue of Huygens' principle in geometric optics: rays perpendicular to wavefronts.
The picture scales to any number of degrees of freedom. In two dimensions the wavefronts are surfaces in three-dimensional spacetime; the trajectories pierce them orthogonally. In dimensions the wavefronts are -dimensional hypersurfaces in an -dimensional configuration space, evolving in time.
Worked example [Beginner]
A free particle in one dimension has no potential energy, so the Hamiltonian is purely kinetic:
The Hamilton-Jacobi equation says: replace by the slope of in , and add the slope of in time, and set the result to zero:
Call the slope of in by the name (a constant, since the new momentum is conserved). Then separates into a piece depending on and a piece depending on :
Check: the slope of in is , which is the momentum . The slope of in is . So . The equation is satisfied.
Now extract the motion. The new coordinate equals the slope of in , which is . Set equal to a constant (the initial position). Solving: . The particle moves at constant velocity . This is the correct solution: a free particle with constant momentum travels in a straight line.
The action encodes the entire family of free-particle solutions — one for each initial momentum — in a single expression.
Check your understanding [Beginner]
Formal definition [Intermediate+]
Let be a Hamiltonian on the phase space of an -degree-of-freedom system. Hamilton's principal function is a smooth function on configuration space time that satisfies the Hamilton-Jacobi equation:
This is a single first-order nonlinear PDE in independent variables . The momenta are replaced by throughout.
Relation to canonical transformations. A type-2 generating function (where denotes the new, constant momenta) generates a canonical transformation via and . The new Hamiltonian is . Setting yields exactly the Hamilton-Jacobi equation. The new momenta are all constant, and the new coordinates are constant, so the solution is obtained by inverting .
Complete integral. A complete integral of the Hamilton-Jacobi equation is a solution depending on independent constants beyond the additive constant. (An additive constant does not affect the transformation.) The Jacobi theorem states that a complete integral furnishes the general solution of the mechanical problem.
Characteristics. The Hamilton-Jacobi equation is a first-order PDE. By the method of characteristics, its characteristic equations — the ODEs whose solutions trace out the "rays" of the PDE — are exactly Hamilton's equations. This is not a coincidence; it is the mathematical content of the optical-mechanical analogy. The trajectories of the mechanical system are the characteristics of the Hamilton-Jacobi equation, and conversely.
Time-independent form. When has no explicit time dependence, write , where is a constant (the energy). The function is Hamilton's characteristic function. Substituting into the HJ equation separates the time dependence:
This is the time-independent Hamilton-Jacobi equation. It reduces the problem from to independent variables. The integration constants of plus the energy give the constants needed for the complete integral of the original HJ equation (the extra constant from time-separation being itself).
Counterexamples to common slips
- The HJ equation is not a variational equation. It is a PDE for a single function , not a condition on a functional. The variational principle gives the Euler-Lagrange / Hamilton equations; the HJ equation is derived from the canonical-transformation framework, not from varying an action.
- is not the action functional . Hamilton's principal function is the action evaluated on the classical trajectory from a fixed initial point to . It is a function on configuration space, not a functional on paths. Confusing the two is the most common beginner error.
- A complete integral is not the most general solution. The most general solution of a first-order PDE involves an arbitrary function. A complete integral involves only arbitrary constants — enough to solve the mechanical problem via Jacobi's theorem, but a proper subset of all solutions.
- The HJ equation is generally nonlinear. The equation inherits whatever nonlinearity has. It is linear in but can be arbitrarily nonlinear in the . This is what makes it hard to solve in general, and what makes separation of variables so valuable when it works.
Key theorem with proof [Intermediate+]
Theorem (Characteristics of the HJ equation are Hamiltonian trajectories). Let be a solution of the Hamilton-Jacobi equation . Define the momentum field . Then the curves satisfy Hamilton's equations.
Proof. Write and . Differentiate along a curve :
Now differentiate the HJ equation with respect to :
Rearranging: . Substitute from the differentiated HJ equation:
Now set (Hamilton's first equation). The first and third terms cancel because (symmetry of mixed partials), leaving:
This is Hamilton's second equation. The first equation was assumed, completing the argument. ∎
Bridge. The characteristics theorem identifies the Hamilton-Jacobi equation as the PDE whose characteristic curves are exactly Hamiltonian trajectories — this is exactly the content of the optical-mechanical analogy, where rays correspond to characteristics and wavefronts to level sets of . The foundational reason the HJ framework works is that a single scalar PDE on configuration space encodes the entire family of phase-space trajectories. This builds toward 09.06.01 pending action-angle variables, where the HJ equation separates to construct the torus coordinates of integrable systems, and appears again in 12.10.01 pending path integrals, where becomes the semiclassical phase.
Theorem (Jacobi). If is a complete integral of the Hamilton-Jacobi equation (depending on independent constants ), then the equations (for constants ) implicitly define the general solution of the mechanical problem.
Proof sketch. The constants are the new momenta and are the new coordinates , both conserved since . Inverting for yields the trajectory. The non-degeneracy condition guarantees local invertibility. ∎
Worked example: the Kepler problem via separation of variables
A particle of mass in the gravitational potential has Hamiltonian in spherical coordinates:
The time-independent HJ equation is:
Separate variables. The coordinate is cyclic, so (constant). Write . Multiply through by :
The -dependent and -dependent terms separate at constant (total angular momentum):
Each equation involves only one variable and is solved by quadrature. The radial equation gives:
The three separation constants are the new momenta. The equations , , give the trajectory implicitly. This is the complete solution of the Kepler problem obtained without integrating any second-order ODE — only a single first-order PDE and quadratures.
Exercises [Intermediate+]
Full proof set [Master]
Proposition 1 (On-shell action satisfies the HJ equation)
Proposition. Let be a classical trajectory of the Lagrangian from to , satisfying the Euler-Lagrange equations. Define Hamilton's principal function . Then satisfies .
Proof. Consider an infinitesimal variation of the endpoint while keeping fixed. The trajectory deforms to a nearby classical trajectory (satisfying the Euler-Lagrange equations) connecting to the new endpoint.
The first variation of the action functional vanishes for variations that fix both endpoints (Hamilton's principle). Therefore the variation of receives contributions only from the free endpoint:
since and . Reading off partial derivatives at the final point: (the canonical momentum at the endpoint) and (the Lagrangian evaluated at the final time).
Now use the Legendre transform: . At the final time :
Since , this becomes:
The first term on the right is the directional derivative of along the trajectory at the endpoint. For defined as the on-shell action, this contribution is absorbed into the total time derivative, leaving:
This is the Hamilton-Jacobi equation, derived from the variational definition of .
Proposition 2 (Separation of variables for the isotropic harmonic oscillator)
Proposition. For the -dimensional isotropic harmonic oscillator with Hamiltonian , the time-independent Hamilton-Jacobi equation separates in Cartesian coordinates. The complete integral is where each , and the separation constants satisfy .
Proof. The time-independent HJ equation is:
Attempt separation . Substituting:
Since each term depends on a different variable , the only way the sum equals the constant for all is if each term is individually constant. Set such that:
Solving for :
Integrating: . The complete integral depends on independent constants , with determined by the others. Jacobi's theorem then yields the trajectory from , giving with . The amplitudes and phases (modulo one overall phase absorbed into ) constitute the integration constants of the general solution.
Proposition 3 (Liouville integrability via the HJ equation)
Proposition. If the Hamilton-Jacobi equation for an -degree-of-freedom system admits a complete integral depending on non-additive constants , then the system possesses functionally independent first integrals in involution. The system is Liouville integrable.
Proof. A complete integral defines functions by the canonical-transformation identification . The new momenta are constants of motion (since ). Expressing in terms of the original via the inverse of yields functions on phase space.
Each is constant along trajectories: , because in the transformed coordinates and when . The non-degeneracy condition guarantees that the are functionally independent on phase space.
The involution property follows because the transformation is canonical: the new momenta satisfy , and the Poisson bracket is preserved under canonical transformations. Therefore .
A system with functionally independent first integrals in involution on a -dimensional phase space satisfies the definition of Liouville integrability. The Arnold-Liouville theorem then guarantees that the motion is confined to invariant tori, and action-angle coordinates exist on each regular level set of .
Lean formalization [Intermediate+]
lean_status: none. Mathlib has first-order PDE machinery in fragments (scattered across Mathlib.PDE and related files, not yet unified into a coherent PDE theory), Hamiltonian-flow definitions in none of the standard files, and no complete-integral construction. The characteristic-equation correspondence between the HJ equation and Hamilton's equations is a natural formalisation target once the cotangent-bundle symplectic structure and Hamiltonian vector field are in place. Specific missing pieces: a definition of HamiltonJacobiEquation as a bundled first-order PDE on Q × ℝ, the complete-integral type class with its non-degeneracy condition det (∂²S / ∂q ∂α) ≠ 0, the Jacobi theorem stating that complete integrals solve the mechanical problem via quadrature, and the separation-of-variables machinery for natural Hamiltonians with the Stackel condition. Each of these would require as prerequisites a formalisation of the Legendre transform, the canonical-transformation generating-function calculus, and the method of characteristics for first-order PDEs. This unit ships without a lean_module.
Advanced results [Master]
The Hamilton-Jacobi equation on manifolds
On a configuration manifold with Hamiltonian , the Hamilton-Jacobi equation is a PDE for :
where is the exterior derivative of restricted to the -direction. The momentum field is a section of . The characteristics of this PDE are the integral curves of the Hamiltonian vector field on . The solution surface is a Lagrangian submanifold 05.05.01 of the extended phase space with its natural contact structure.
Generating-function interpretation
Hamilton's principal function is a type-2 generating function that produces the time- map of the Hamiltonian flow. Specifically, if is the Hamiltonian flow, then evaluated on the trajectory from to satisfies and , so generates the canonical transformation . The HJ equation is the statement that the time-1 map of the flow is generated by a function satisfying this PDE.
The eikonal equation and geometric optics
For a particle of mass in potential , the time-independent HJ equation at energy is:
which rearranges to . This is formally identical to the eikonal equation of geometric optics: , where is the refractive index. The identification is : regions of high potential correspond to regions of low refractive index, and mechanical "light rays" (trajectories) bend toward regions of low potential, just as optical rays bend toward regions of high refractive index.
Hamilton's original insight (1834) was to read this backwards: just as Fermat's principle (shortest optical path) produces ray optics from the eikonal equation, Maupertuis' principle (least action) produces mechanics from the HJ equation. The wave-particle duality of optics has its exact counterpart in mechanics; Schrodinger's 1926 contribution was to take the wave picture seriously and write down the wave equation whose short-wavelength limit is the eikonal equation — that wave equation is the Schrodinger equation.
Separation of variables and the Stackel condition
The HJ equation separates in coordinates when a complete integral can be written as . This is possible when the Hamiltonian has the Stackel form: there exists an invertible matrix and a vector of separation constants such that the Hamiltonian can be decomposed into a sum where each depends on only one pair . The Stackel condition characterises all separable coordinate systems for natural Hamiltonians on a Riemannian manifold : separation occurs when the Hamilton-Jacobi equation admits quadratic first integrals, and the metric belongs to the Stackel class. This is the geometric classification of when the HJ method works in full.
The WKB approximation and the quantum-classical bridge
Write the Schrodinger wavefunction as and substitute into the time-dependent Schrodinger equation . Separating real and imaginary parts at leading order in yields:
As , this becomes the Hamilton-Jacobi equation. The next-order correction gives a transport equation for the amplitude that is equivalent to Liouville's theorem for the classical probability density. The WKB expansion is thus an asymptotic series in whose leading term is classical mechanics via the HJ equation. The quantum-classical correspondence is exact at this order: classical mechanics is the short-wavelength (eikonal) limit of quantum mechanics, and the HJ equation is the bridge equation.
Maupertuis' principle
The principle of least action in the Maupertuis form states: among all paths with fixed energy connecting to , the physical trajectory makes the abbreviated action stationary. The function appearing in the time-independent HJ equation is exactly this abbreviated action, regarded as a function of the endpoint. This gives a variational characterisation of that is independent of the canonical-transformation derivation: is the action functional of Maupertuis' principle, and the HJ equation is its Hamilton-Jacobi PDE.
Caustics and the breakdown of the HJ solution
Hamilton's principal function is defined by the on-shell action along a unique classical trajectory from to . When multiple trajectories reach the same point , the function becomes multi-valued. This happens at caustics — envelopes of the family of classical trajectories where neighbouring rays focus.
At a caustic, the mapping from initial data to endpoint fails to be locally invertible: the Jacobian vanishes. Equivalently, the non-degeneracy condition in Jacobi's theorem, , breaks down. The generating-function interpretation of fails at caustics because the type-2 generating function cannot describe a canonical transformation at points where the coordinate change has vanishing determinant.
The physical signature of a caustic is infinite classical density: the momentum field develops a singularity, and the Liouville density (which is proportional to ) diverges. In the WKB picture 12.10.01 pending, this corresponds to the breakdown of the leading-order semiclassical approximation — the amplitude diverges and the next-order transport equation ceases to normalise. The resolution requires the uniform approximation (Ludwig 1966, Kravtsov 1968) or, equivalently, the full quantum wavefunction, which remains finite and develops Airy-function diffraction patterns near caustics. Caustics connect the HJ framework to singularity theory: the generic caustic types are classified by the Arnol'd-Thom catastrophe hierarchy (fold, cusp, swallowtail, ...).
Hamilton-Jacobi theory and the Kolmogorov-Arnol'd-Moser theorem
For a near-integrable Hamiltonian with action-angle variables of the integrable part , the HJ equation becomes:
For the complete integral is . For , one seeks a canonical transformation that removes the angle dependence order by order in . The generating function satisfies the perturbed HJ equation, and the perturbation series is a formal power series in .
The KAM theorem (Kolmogorov 1954, Arnol'd 1963, Moser 1962) addresses what happens to this construction: for sufficiently small and Diophantine frequency vectors , the invariant tori survive as perturbed Lagrangian tori. The generating function converges on these surviving tori. For resonant frequencies (rational ratios), the tori break up into island chains and stochastic layers. The HJ equation thus provides the perturbation-theoretic framework within which KAM theory operates: the question of whether the perturbed HJ equation has a complete integral is answered negatively in general (Poincare proved the series diverges for generic perturbations), but KAM identifies the subset of phase space where the integral exists in a suitably weakened sense.
Synthesis. The Hamilton-Jacobi equation is the foundational reason that classical mechanics admits a wave-optics reformulation: the central insight is that a single scalar PDE on configuration space encodes the full phase-space dynamics through its characteristics. This is exactly the structure that identifies particle trajectories with wavefront normals, and the bridge is between the Lagrangian variational principle and the first-order PDE for the generating function. Putting these together, the HJ framework generalises across manifolds, connects to integrability via the Stackel condition and the Arnold-Liouville theorem, and builds toward 09.06.01 pending action-angle variables and 12.10.01 pending the WKB limit of quantum mechanics. The pattern recurs in 13.01.01 pending geodesic optics, where the HJ equation for the geodesic Hamiltonian gives the eikonal equation of geometric optics on curved spacetime. The quantum-classical bridge passes through the HJ equation as the short-wavelength limit of the Schrodinger equation, and the KAM perturbation theory operates within the HJ generating-function framework to characterise the survival of invariant tori in near-integrable systems.
Connections [Master]
09.04.02pending Hamilton's equations. The HJ equation is the PDE whose characteristics are solutions of Hamilton's equations. Every solution of Hamilton's equations lies on a characteristic of the HJ equation, and every characteristic gives a solution. This is the PDE-ODE duality at the heart of the optical-mechanical analogy.09.05.01pending Canonical transformations. The HJ equation is the equation satisfied by the generating function of the canonical transformation that reduces to zero. The entire theory of canonical transformations — symplectomorphisms of — is the ambient framework.09.06.01pending Action-angle variables. For integrable systems, the action variables are defined as loop integrals of on the invariant tori, and the angle variables are conjugate. The action variables arise as the separation constants when the HJ equation separates; the construction of action-angle coordinates is a direct application of the HJ method.12.10.01pending Path integrals. In the semiclassical limit , the Feynman path integral is dominated by the classical trajectory. The stationary-phase approximation yields where is Hamilton's principal function — the solution of the HJ equation. The connection from the quantum side to the classical HJ equation passes through the path-integral formulation.13.01.01pending General relativity and null geodesics. The HJ equation for the geodesic Hamiltonian on a Riemannian manifold gives , which is the eikonal equation for the metric . In Lorentzian signature, null geodesics satisfy the HJ equation at : the eikonal equation for light propagation in curved spacetime.02.12.01Phase space, vector fields, integral curves. The characteristic equations of the HJ equation define a vector field (the Hamiltonian vector field ) on phase space; the integral curves of are the classical trajectories. The phase-space picture is the natural habitat for the HJ theory.
Historical & philosophical context [Master]
William Rowan Hamilton published "On a general method in dynamics" in the Philosophical Transactions of the Royal Society in 1834 [Hamilton 1834]. He had spent the preceding decade developing a unified mathematical treatment of geometrical optics, in which the eikonal equation governs the propagation of wavefronts and its characteristics are the light rays. His 1834 paper transplanted the entire optical framework to mechanics: the action plays the role of the optical path length (the eikonal), mechanical trajectories are the "rays," and the HJ equation is the mechanical eikonal equation.
Carl Gustav Jacob Jacobi recognised the power and depth of Hamilton's construction and developed it into a systematic integration method in his Vorlesungen uber Dynamik (lectures of 1842-43, published posthumously in 1866 [Jacobi 1866]). Jacobi's contribution was the complete-integral theorem: a complete integral of the HJ equation suffices to solve the mechanical problem without integrating any differential equation beyond quadratures. He also developed the separation-of-variables technique and applied it to central-force problems and the geodesics of ellipsoids. The theory as presented today — the PDE, the complete integral, the separation method — is due to Jacobi as much as to Hamilton.
The optical-mechanical analogy lay dormant as a mathematical curiosity until 1926, when Erwin Schrodinger, influenced by de Broglie's matter waves and by the recognition that the HJ equation is the short-wavelength limit of a wave equation, wrote down the equation that bears his name. The Schrodinger equation is to the HJ equation what the wave equation is to the eikonal equation: the finite-wavelength completion. Hamilton's 1834 insight — that mechanics and optics share the same mathematical structure — was, with the addition of a single postulate (the wavelength ), the seed of quantum mechanics.
The philosophical significance is that the HJ equation reveals classical mechanics to be a "ray optics" approximation to a deeper wave theory. Deterministic particle trajectories are the characteristics of a wave-like PDE. The classical-quantum transition is not a replacement of one framework by another but a passage from the short-wavelength limit to the full wave picture — just as ray optics gives way to wave optics when apertures become comparable to the wavelength.
Bibliography [Master]
- Hamilton, W. R., "On a general method in dynamics," Phil. Trans. Roy. Soc. 124 (1834), 247–308.
- Hamilton, W. R., "Second essay on a general method in dynamics," Phil. Trans. Roy. Soc. 125 (1835), 95–144.
- Jacobi, C. G. J., Vorlesungen uber Dynamik (lectures of 1842-43, published 1866), ed. A. Clebsch.
- Arnold, V. I., Mathematical Methods of Classical Mechanics, 2nd ed. (Springer GTM 60, 1989), §46–47.
- Landau, L. D. & Lifshitz, E. M., Mechanics, 3rd ed. (Course of Theoretical Physics Vol. 1, Pergamon, 1976), §46–47.
- Goldstein, H., Poole, C. P. & Safko, J., Classical Mechanics, 3rd ed. (Pearson, 2002), Ch. 10.
- Susskind, L. & Hrabovsky, G., The Theoretical Minimum: Classical Mechanics (Basic Books, 2014), Lecture 9–10.
- Tong, D., Classical Dynamics (DAMTP Cambridge lecture notes), §4 "Hamilton-Jacobi theory."
- Abraham, R. & Marsden, J. E., Foundations of Mechanics, 2nd ed. (Addison-Wesley, 1978), §5.2.
- Marsden, J. E. & Ratiu, T. S., Introduction to Mechanics and Symmetry, 2nd ed. (Springer TAM 17, 1999).
- Schrodinger, E., "Quantisierung als Eigenwertproblem," Annalen der Physik 79 (1926), 361–376.
- Lanczos, C., The Variational Principles of Mechanics, 4th ed. (Dover, 1986), Ch. VIII.
- Synge, J. L., Classical Dynamics (Handbuch der Physik III/1, Springer, 1960).
- Courant, R. & Hilbert, D., Methods of Mathematical Physics, Vol. II (Interscience, 1962), Ch. II (characteristics of first-order PDEs).
- Stackel, P., "Uber die Integration der Hamilton-Jacobischen Differentialgleichung mittels Separation der Variabeln," Habilitationsschrift, Halle (1891).
Wave 2 physics unit, produced 2026-05-18. All three cross-domain hooks_out targets are proposed; no QM/GR/action-angle seed unit yet exists to receive confirmed promotion. Status remains draft pending Tyler's review and the §11 Next-Actions retro per PHYSICS_PLAN.