Galerkin Existence and Finite Propagation Speed for Second-Order Hyperbolic Equations
Anchor (Master): Evans §7.2; Lions-Magenes Ch. 3-4; Wloka, Partial Differential Equations (Cambridge 1987), §29-§30; Ladyzhenskaya, The Boundary Value Problems of Mathematical Physics (Springer 1985), Ch. IV-V; Lax, Hyperbolic Partial Differential Equations (Courant Lecture Notes 14, AMS 2006); John, Partial Differential Equations, 4e (Springer 1982), §5-§6
Intuition Beginner
A wave equation describes a quantity that oscillates rather than settles: a plucked string, a drumhead, a sound pulse in air, a tremor through rock. The companion unit on the wave equation handed you one such equation on an infinite line, with constant wave speed, and one explicit formula, the splitting of a bump into a left-mover and a right-mover. But most real vibration happens in a bounded region, through a material whose stiffness varies from place to place and may even change in time. There is no tidy splitting formula for that. The sharp question returns: does a solution even exist, and is it the only one?
This unit answers yes by building the solution out of simple pieces instead of guessing a formula, exactly as the parabolic companion did for diffusion. Pick a handful of standard shapes a function on the region can have, the lowest vibration modes, and look for the best blend of just those few shapes. Each shape carries a strength that changes in time, and feeding the blend into the wave rule turns the partial differential equation into an ordinary system of rate equations for those strengths, the kind of system that always has a solution. Then add more shapes, and more, and watch the approximations settle toward an honest solution.
Why should adding shapes settle down? Because a vibrating system has a total energy, part stored in motion and part stored in stretch, and that total stays put. It does not leak away the way heat does; it only sloshes back and forth between the two stores, apart from whatever a driving force feeds in. That fixed energy budget turns into a uniform numerical ceiling on every approximation at once, no matter how many shapes were used, and a uniformly capped family cannot run off to infinity.
Here is the sharp difference from the diffusion companion. Diffusion drains energy, so its solutions only get smoother. A wave conserves energy, so a kink stays a kink: there is no smoothing, no regularity gained for free. Conservation also carries a new fact diffusion lacks. A wiggle started in one small region cannot be felt arbitrarily far away right away; it can spread only at a bounded speed, so after a given time it lives inside a cone. That bounded reach, finite propagation speed, is the second main result of this unit, and it holds even when the material varies.
Visual Beginner
Two pictures carry this unit: an energy that swaps between two stores without leaking, and a cone that bounds how far a disturbance can reach.
Read the left panel first. A vibrating system holds energy in two forms: the energy of motion, set by how fast each piece is moving, and the energy of stretch, set by how bent or strained the shape is. As the system vibrates, energy pours from one tank into the other and back, but the sealed lid means the total never falls on its own. Only an outside driving force, the small inlet, can change the total, and only by a controlled amount. This sealed-total picture is the conservation law, and it is the single energy ceiling that holds every shape-by-shape approximation at once.
The right panel is the cone. Start a disturbance inside a small interval at time zero. A signal can travel only so fast, so by a later time the disturbance can have reached only the points within that travel distance of where it began. Plotting space across and time up, the reachable region is a triangle widening as time grows, with slanted sides whose steepness is set by the fastest signal speed.
A point outside the triangle has not yet been touched: the medium there is still exactly at rest. That bounded reach is finite propagation speed, and it survives even when the material, and hence the local signal speed, varies from place to place, as long as the cone is drawn with a slope at least the largest local speed.
The contrast with the diffusion companion is the whole point. There the right-hand picture was a single energy curve sloping downward, energy leaking away and the solution smoothing. Here the energy curve is flat and the reach is bounded by a cone. Conservation in place of dissipation, a cone in place of an infinite instantaneous reach.
Worked example Beginner
We run the shape-by-shape recipe by hand on the simplest bounded vibration and watch the energy stay fixed. Take a string from zero to , pinned at zero displacement at both ends, governed by the wave rule "acceleration equals curvature", which is . Start it from the shape released from rest, so the initial velocity is zero.
Step 1. Choose the shapes. The natural shapes for a string pinned at both ends are the standing waves , , , and so on. Each keeps its shape under the wave rule and only changes in strength.
Step 2. Write the blend. Look for , a single shape since the start is a single shape. The starting strength is , and the starting rate of change is because the string is released from rest.
Step 3. Turn the rule into a rate equation. The curvature of is . Matching acceleration to curvature gives . This is the equation of a mass on a spring: the strength oscillates rather than decays.
Step 4. Solve it. The solution with strength and zero starting rate is . So . The whole string swings up and down in unison, a standing wave, returning to its start every time advances by .
Step 5. Watch the energy stay fixed. The energy of motion is set by the velocity , and the energy of stretch is set by the slope . Adding the totals of and across the string and using that and each total in square gives . The and add to one at every instant, so the total is the same constant for all time.
What this tells us: restricting to one shape turned the partial differential equation into a single spring equation we solved exactly, and the total energy, the sum of the motion part and the stretch part, never changed; it only poured back and forth between the two as and traded places. With no driving force, the energy ceiling is just the starting value, and every approximation stays under it forever. That fixed energy is the engine that, in the general case, keeps the shape-by-shape approximations from blowing up as more shapes are added, and the absence of any decay is exactly why a wave keeps its wiggles.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is open and bounded, is fixed, and the spatial operator at each time is the symmetric divergence-form second-order operator of 02.16.04,
with coefficients , , , the time-derivatives , and uniform ellipticity for a.e. and all , with . The symmetry and the absence of first-order drift make formally self-adjoint, the natural setting for a conserved energy. The time-dependent bilinear form is
bounded on with constant uniformly in , symmetric in , and Gårding-coercive, uniformly in , by the estimates of 02.16.04.
Definition (Gelfand triple and solution space). Let and , with continuous and dense, and form the Gelfand triple exactly as in 02.18.01, so and the duality pairing extends the inner product of . The Bochner spaces are those of 02.18.01. The second-order solution space for the hyperbolic problem is
where and are the weak (distributional) time-derivatives valued in and respectively. Every has, after modification on a null set, and ; under the sharper regularity of the existence theorem below one gains and , which is what gives the two initial conditions a meaning in and .
Definition (weak solution of the hyperbolic problem). Given , , and , a function with , , is a weak solution of the initial/boundary-value problem
if in , in , and
The equation is an identity in at a.e. time; because for . Two pieces of initial data are required, one for position in and one for velocity in , the hyperbolic analogue of needing both and for the d'Alembert problem of 02.13.04.
Definition (Galerkin approximation). Let be the orthonormal-in-, orthogonal-in- Dirichlet eigenbasis of 02.16.04, complete in both and . The -th Galerkin approximation is
whose coefficient vector solves the second-order finite-dimensional ODE system
Counterexamples to common slips Intermediate+
Conservation, not dissipation. For the hyperbolic problem the natural energy is not monotone; its derivative is controlled but not sign-definite, in contrast to the parabolic energy of
02.18.01that decays. Treating the hyperbolic energy as dissipative and expecting a smoothing gain is the central error: there is no parabolic regularization, so is no smoother than and no smoother than .The velocity lives in , the acceleration in $V^u'L^\infty(0,T;H)u' \in L^2(0,T;V^)u_m'|u_m'|_Hu''V^u'' \in L^2(0,T;H)$ is a genuine regularity gain requiring more of the data, not part of the definition.
Two initial conditions, not one. The equation is second-order in time, so alone underdetermines the solution; the velocity is independent data. A scheme that initializes only the position, copying the parabolic template, solves the wrong problem.
Finite propagation speed needs the right cone slope. The local energy on a backward cone is nonincreasing only when the cone's lateral slope is at least the largest local characteristic speed . A cone drawn too steeply (slope below the wave speed) can leak energy through its lateral boundary, and the monotonicity fails; the propagation speed is set by the coefficients, not chosen freely.
Key theorem with proof Intermediate+
Theorem (Galerkin existence, uniqueness, and energy estimate). Let be symmetric, uniformly elliptic with coefficients and on , so is bounded with constant , symmetric, and Gårding-coercive. Then for every , , and there is a unique weak solution of the hyperbolic problem with and it obeys the a priori estimate with and the time-coefficient bound [Evans 2010 §7.2] [Lions-Magenes 1972 Ch. 3-4].
Proof. Step 1 (the finite-dimensional system is solvable). Fix . With and , the Galerkin equations read , a linear second-order system with and . Rewriting as a first-order system in and applying the Carathéodory existence theorem gives a unique with the prescribed , hence a unique with .
Step 2 (energy estimate, uniform in ). Multiply the -th Galerkin equation by and sum over ; since this is the test choice : The first term is . For the second, symmetry of gives where uses and ; hence . Writing the kinetic-plus-form energy , Estimate the right side: by Young, and . By Gårding, , and . Collecting, and using from the fundamental theorem, Setting and absorbing, , so by the integral Grönwall inequality [Grönwall 1919], Finally , since are -projections of onto . From and coercivity, and are bounded uniformly in by .
Step 3 (estimate on the second time-derivative). Fix , , split with the -orthogonal projection, . Then , so . Taking the supremum identifies with , whence , bounded uniformly in .
Step 4 (passage to the limit). By Steps 2-3 the sequences , , are bounded in , , . The first two are duals of separable spaces, so by Banach-Alaoglu (weak-* sequential compactness) and reflexivity of a subsequence satisfies in , in , and in (weak limits of derivatives are derivatives of the weak limit). Fix , with , and with . For , multiply the Galerkin identity by and integrate by parts twice in time: Every term is linear and continuous for the weak-* topologies, so letting replaces by , by , by (projections converge in ). The resulting identity holds for all , by density for all , and undoing the integration by parts gives a.e. with , . The a priori estimate is the limit of the uniform bounds by weak-* lower semicontinuity.
Step 5 (uniqueness). Let be the difference of two solutions, so , , . The pairing identity (Proposition 4 below) makes absolutely continuous with , since the first pairing vanishes by the equation. Hence , and with , Grönwall on starting from forces , so and, with , .
Bridge. The hyperbolic energy estimate is the foundational reason the construction converges, and it is the Gårding boundedness and coercivity of 02.16.04 read through the conserved quantity rather than the dissipative one of 02.18.01: testing with the velocity replaces testing with , so the time-integral of the -norm is now controlled through the form energy and the -norm of the velocity is propagated by the data. This is exactly the parabolic scheme of 02.18.01 with dissipation traded for conservation: where the parabolic energy fell monotonically, the hyperbolic energy is merely Grönwall-bounded, and the loss of monotonicity is the precise statement that there is no smoothing. The construction builds toward the semigroup picture of 02.18.03, where the first-order reduction generates a unitary-up-to-Grönwall group rather than a contraction semigroup, and the finite-propagation-speed theorem below generalises the constant-coefficient backward-light-cone uniqueness of 02.13.04 to variable coefficients. Putting these together, the central insight is that a hyperbolic equation is an elliptic energy identity tested against the velocity and closed by Grönwall, so existence again costs no explicit kernel; this appears again in the localized form, where restricting the same energy to a shrinking cone yields the domain-of-dependence statement that the wave equation respects causality.
Exercises Intermediate+
Advanced results Master
The Galerkin/energy existence theorem for the hyperbolic problem sits inside a wider structure that runs in close parallel to the parabolic theory of 02.18.01, with conservation everywhere replacing dissipation. The first-order reduction recasts the autonomous case as a group rather than a semigroup; the local-energy argument upgrades the global estimate to the finite-propagation-speed and domain-of-dependence theorems; the symmetric-hyperbolic formalism of Friedrichs embeds the second-order equation into a first-order system to which the same energy method applies verbatim; improved regularity is bought by differencing the equation in time rather than by parabolic smoothing; and the eigenfunction expansion makes the autonomous solution an exact superposition of oscillating modes.
Theorem 1 (first-order reduction and the propagator group; autonomous case). Let be time-independent, self-adjoint with form domain . Writing on the energy space , the equation becomes with \mathcal{A}=\begin{psmallmatrix}0&I\\-L&0\end{psmallmatrix}, and generates, by the Stone/Hille-Yosida theory of 02.18.03, a strongly continuous group that is unitary on in the energy norm . The Galerkin solution coincides with the Duhamel/variation-of-parameters formula , the constant-coefficient instance being the d'Alembert/Kirchhoff representation of 02.13.04. Unitarity is the operator form of exact energy conservation; reversibility in (a group, not a one-sided semigroup) is the operator form of time-reversibility, the contrast with the analytic contraction semigroup of the parabolic case.
Theorem 2 (finite propagation speed and domain of dependence). Let be the weak solution with symmetric uniformly elliptic, , and set . If and vanish on the ball and vanishes on the spacetime cone , then on that cone; equivalently, depends only on the data in [Courant-Friedrichs-Lewy 1928] [Evans 2010 §7.2]. The proof is the local-energy flux inequality of Exercise 8 applied to the difference of two solutions: the local energy is nonincreasing once the cone slope exceeds the supremal characteristic speed, and vanishing initial local energy then propagates up the cone. The speed is intrinsic to the coefficients, the variable-coefficient replacement for the single constant of 02.13.04; sharper bounds use the pointwise top eigenvalue of to draw a curved cone of locally varying slope.
Theorem 3 (symmetric-hyperbolic embedding; Friedrichs 1954). Introducing and , the scalar equation becomes a first-order system for with the symmetric matrices [Friedrichs 1954]. Friedrichs' theory of symmetric hyperbolic systems gives existence, uniqueness, and the energy estimate directly from the symmetry of the by the same integration-by-parts that produced the scalar energy here, and the finite propagation speed is read off as the maximal eigenvalue of over unit covectors . This is the framework in which Maxwell's equations, linearized elasticity, and the linearized Einstein equations are all hyperbolic, and it is the natural home for the cone-of-dependence geometry of Theorem 2.
Theorem 4 (improved regularity). If in addition with , , , and the coefficients are in time, then the weak solution satisfies , , , with the second-energy estimate
[Evans 2010 §7.2] [Ladyzhenskaya 1985 Ch. V]. The mechanism is to differentiate the Galerkin equation in time and apply the first energy estimate to , testing with ; unlike the parabolic gain of 02.18.01, where testing with traded one time-derivative for one elliptic derivative through smoothing, here each differentiation in time costs one degree of regularity in the data, and there is no free gain. Higher regularity follows by repeated time-differencing plus elliptic regularity of 02.16.04, recovering the classical solution under compatibility conditions.
Theorem 5 (eigenfunction expansion; autonomous case). For time-independent with Dirichlet eigenpairs , , the solution of , , , is the exact superposition
with energy conserved term by term, each mode an undamped oscillator of frequency . This is the bounded-domain analogue of the Fourier representation behind d'Alembert's formula 02.13.04, and the convergence of the partial sums is the Galerkin convergence of the main theorem made explicit in the eigenbasis.
Synthesis. The hyperbolic energy estimate is the foundational reason the Galerkin scheme converges, and it is exactly the Gårding boundedness of 02.16.04 read through the conserved energy , tested against the velocity rather than the parabolic : the kinetic norm of the velocity is propagated and the form energy controls the -norm, so existence costs no kernel and the construction is dual to the elliptic theory in the same precise sense as the parabolic case, the frozen-time Lax-Milgram solvability making the Galerkin matrix invertible at each instant. This is exactly the parabolic scheme of 02.18.01 with one structural substitution, dissipation replaced by conservation: where the parabolic energy fell and bought smoothing, the hyperbolic energy is conserved and buys none, and the loss of the monotone sign is the central insight that hyperbolic regularity is exactly the regularity of the data, propagated.
Putting these together, the same energy identity generalises in two directions at once. Localized to a shrinking cone it gives the finite-propagation-speed theorem, the variable-coefficient promotion of the constant-speed backward-light-cone of 02.13.04, so the bridge from this unit reaches back to the d'Alembert causality it generalises; reduced to first order it gives the unitary propagator group of 02.18.03, the conservative twin of the analytic semigroup, so the bridge reaches forward to the generation theory the conserved energy feeds. The central insight is that one symmetric energy identity, tested against the velocity and closed by Grönwall, simultaneously yields existence, uniqueness, the absence of smoothing, and the cone of dependence, the four facts that distinguish hyperbolic evolution from its parabolic sibling.
Full proof set Master
Proposition 1 (uniform hyperbolic energy estimate). Under the hypotheses of the main theorem the Galerkin approximations satisfy .
Proof. Testing the Galerkin identity with gives . Symmetry of yields , so the energy obeys . Young bounds , and by Gårding. With , set to get . Grönwall (Exercise 4) gives , and . Coercivity converts the -bound into bounds on and . Finally (Step 3 of the theorem), so .
Proposition 2 (existence of a weak solution). A subsequence of converges weak-* in with in and in to a weak solution with , .
Proof. Proposition 1 and the separability of give, via Banach-Alaoglu, a subsequence with in , in , in . For , : and ; passing to the limit identifies and , so . Fix , with . For , integrating the Galerkin identity twice by parts in time gives . Each term is weak-* continuous in , and , in , so the limit reads with in place of . Density of extends this to all ; choosing recovers the equation a.e., and comparing the boundary terms for general forces , .
Proposition 3 (uniqueness and continuous dependence). The weak solution is unique, and .
Proof. The estimate is the weak-* lower-semicontinuous limit of Proposition 1, with the continuity in time supplied by the regularity , (Proposition 4). For uniqueness, the difference of two solutions with equal data solves , , . By Proposition 4 the energy is absolutely continuous with , the pairing vanishing by the equation. Then and , so ; Grönwall from gives , hence and .
Proposition 4 (regularity in time and the energy identity). Every weak solution with , , has representatives , , and is absolutely continuous with for a.e. .
Proof. The strong continuity , is not automatic from the function-space memberships and is proved by a mollification-in-time argument (Lions-Magenes): regularize to smooth in , for which the classical product rule gives . The right-hand side converges in as because in and in (the latter is where the regularity claim is used and proved, via the boundedness of the mollified energies and an Ascoli argument in upgraded to strong continuity by the continuity of ). Integrating and passing to the limit yields the absolute continuity of and the stated derivative; the same passage gives the pointwise-in-time values defining , .
Proposition 5 (finite propagation speed, variable coefficients). Let solve weakly with symmetric uniformly elliptic, , and let . If and vanish on , then on the cone .
Proof. By Proposition 4 (applied locally, using that is, for a.e. , an function and that the local energy below is finite and absolutely continuous in ) define . Differentiating with the domain shrinking at radial speed , Using a.e. and integrating the first interior term by parts, ; the interior gradient terms cancel against (symmetry ). Thus On , by the -Cauchy-Schwarz bound and Young. With the boundary integrals cancel up to sign, leaving . Grönwall with (data vanish on ) gives , so on . Hence and on , and with the vanishing initial data on .
Connections Master
The spatial engine is the elliptic weak theory of
02.16.04: the boundedness and Gårding coercivity proved there from uniform ellipticity are the exact inputs to the hyperbolic energy estimate, and the Dirichlet eigenbasis driving the Galerkin scheme and the eigenfunction expansion of Theorem 5 is the spectral decomposition of the compact resolvent built there. This unit owns the conservative time-evolution layer;02.16.04owns the frozen-time elliptic solvability the Galerkin matrix inherits at each instant.The parabolic sibling
02.18.01supplies the entire Bochner-space and Galerkin apparatus reused here; the two units differ by exactly one structural substitution, dissipation against conservation. The parabolic energy is tested against and decays, buying smoothing; the hyperbolic energy is tested against and is conserved, buying none. Reading the two units side by side is the cleanest way to see why parabolic solutions are instantly smooth while hyperbolic solutions merely propagate their initial regularity.The constant-coefficient whole-space wave equation of
02.13.04is the explicitly solvable special case: its d'Alembert, Kirchhoff, and Poisson formulas are the closed-form realizations of the abstract propagator group of Theorem 1, and its backward-light-cone uniqueness is the constant-speed instance of the finite-propagation-speed theorem proved here for variable coefficients. This unit is the variable-coefficient existence theory of which02.13.04is the one computable instance.The first-order reduction of Theorem 1 is developed in its own right in
02.18.03(Hille-Yosida and -semigroups), where the generator \mathcal{A}=\begin{psmallmatrix}0&I\\-L&0\end{psmallmatrix} is shown to generate a unitary group by Stone's theorem; the conserved hyperbolic energy of this unit is the unitarity hypothesis of that generation theorem, so the two units are the constructive and the generator-theoretic faces of the same conservative evolution.The variational/minimization theory of
02.18.04(the direct method) finds elliptic critical points as energy stationary points; the hyperbolic flow of this unit is the formal Hamiltonian flow of that same Dirichlet energy, with the kinetic term, so the conserved of this unit is the Hamiltonian whose stationary configurations are the variational solutions of02.18.04. Time-evolution existence and variational existence are thereby the dynamic and the static faces of one energy functional within the chapter.
Historical & philosophical context Master
The extension of the Galerkin spatial-projection idea from steady-state to time-dependent problems, due to Sandro Faedo in his 1949 Annali della Scuola Normale Superiore di Pisa memoir [Faedo 1949], applies verbatim to the hyperbolic case: project onto finitely many spatial modes, solve the resulting system of ordinary differential equations, and pass to the limit using a-priori energy bounds. For the second-order-in-time equation the bounds come from the conserved energy rather than the dissipated one, the only structural change from the parabolic construction. The functional-analytic completion in the Gelfand-triple framework belongs to Jacques-Louis Lions and Enrico Magenes, whose 1972 Non-Homogeneous Boundary Value Problems and Applications I [Lions-Magenes 1972] treats hyperbolic and parabolic evolution equations within one apparatus, proving the regularity-in-time and trace theorems that give the two hyperbolic initial conditions their meaning.
The finite-propagation-speed phenomenon was isolated in the 1928 Mathematische Annalen paper of Richard Courant, Kurt Friedrichs, and Hans Lewy [Courant-Friedrichs-Lewy 1928], whose domain-of-dependence analysis, introduced to prove convergence of finite-difference schemes, identified the backward characteristic cone as the carrier of all information reaching a spacetime point. The same cone became the geometric object underlying Friedrichs' 1954 theory of symmetric hyperbolic systems [Friedrichs 1954], which embeds the second-order scalar equation into a first-order system whose energy method and propagation speed are read directly from the symmetry and eigenvalues of the coefficient matrices, the framework that covers Maxwell, elasticity, and the linearized Einstein equations. The systematic energy-method treatment of mixed initial-boundary-value problems for hyperbolic equations on bounded domains is developed in Olga Ladyzhenskaya's Boundary Value Problems of Mathematical Physics [Ladyzhenskaya 1985], and the modern lecture-note synthesis is Peter Lax's Hyperbolic Partial Differential Equations [Lax 2006].
Bibliography Master
@book{Evans2010,
author = {Evans, Lawrence C.},
title = {Partial Differential Equations},
edition = {2},
series = {Graduate Studies in Mathematics},
number = {19},
publisher = {American Mathematical Society},
year = {2010}
}
@article{Faedo1949,
author = {Faedo, Sandro},
title = {Un nuovo metodo per l'analisi esistenziale e quantitativa dei problemi di propagazione},
journal = {Annali della Scuola Normale Superiore di Pisa, Serie 3},
volume = {1},
year = {1949},
pages = {1--41}
}
@book{LionsMagenes1972,
author = {Lions, Jacques-Louis and Magenes, Enrico},
title = {Non-Homogeneous Boundary Value Problems and Applications I},
series = {Grundlehren der mathematischen Wissenschaften},
number = {181},
publisher = {Springer},
year = {1972}
}
@article{CourantFriedrichsLewy1928,
author = {Courant, Richard and Friedrichs, Kurt and Lewy, Hans},
title = {\"Uber die partiellen Differenzengleichungen der mathematischen Physik},
journal = {Mathematische Annalen},
volume = {100},
year = {1928},
pages = {32--74}
}
@article{Friedrichs1954,
author = {Friedrichs, Kurt O.},
title = {Symmetric hyperbolic linear differential equations},
journal = {Communications on Pure and Applied Mathematics},
volume = {7},
year = {1954},
pages = {345--392}
}
@article{Gronwall1919,
author = {Gr\"onwall, Thomas H.},
title = {Note on the derivatives with respect to a parameter of the solutions of a system of differential equations},
journal = {Annals of Mathematics},
volume = {20},
year = {1919},
pages = {292--296}
}
@book{Ladyzhenskaya1985,
author = {Ladyzhenskaya, Olga A.},
title = {The Boundary Value Problems of Mathematical Physics},
series = {Applied Mathematical Sciences},
number = {49},
publisher = {Springer},
year = {1985}
}
@book{Lax2006,
author = {Lax, Peter D.},
title = {Hyperbolic Partial Differential Equations},
series = {Courant Lecture Notes in Mathematics},
number = {14},
publisher = {American Mathematical Society},
year = {2006}
}