Lax-Milgram and Existence of Weak Solutions of Elliptic Boundary-Value Problems
Anchor (Master): Evans §6.1-§6.3; Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §8.1-§8.6; Lions-Magenes, Non-Homogeneous Boundary Value Problems and Applications I (Springer 1972), Ch. 2; Nečas, Direct Methods in the Theory of Elliptic Equations (Springer 2012), Ch. 3
Intuition Beginner
A differential equation asks you to find a function whose slopes and curvatures obey a rule at every single point. That is a stringent demand, and for rough data there may be no function smooth enough to satisfy it point by point. The trick of this unit is to stop testing the rule at points and start testing it on average, by multiplying the equation by an arbitrary "test" function and adding everything up. This converts the equation into a single balance condition: a certain weighted sum built from the unknown must match a weighted sum built from the data, for every test function you try. A function that passes all these averaged tests is called a weak solution.
Why trade the sharp point-by-point demand for a blurry averaged one? Because the averaged demand is far easier to satisfy, and once you find a weak solution you can often prove afterward that it was secretly smooth all along. The averaged form also hides a beautiful structure: the weighted sum on the left behaves like a tilted, bowl-shaped energy, and solving the equation is the same as finding the bottom of the bowl.
Here is the engine that finds that bottom. Picture an energy landscape over all candidate functions. Two features guarantee a unique lowest point. First, the landscape is bounded in steepness: nearby functions have nearby energies, so nothing jumps. Second, the landscape genuinely curves upward in every direction and never flattens into a valley floor that runs off to infinity; this upward curving is called coercivity. A steepness cap plus guaranteed upward curving, set over a space with no missing limit points, forces exactly one minimum to exist.
The Lax-Milgram theorem is this picture made into a guarantee. Whenever the averaged balance condition is bounded in steepness and curves upward, there is one and only one weak solution, and it depends continuously on the data: nudge the data a little and the solution moves only a little. For the elliptic equations of this unit, the upward-curving comes from the ellipticity of the operator, the precise statement that the equation diffuses in every direction rather than along a thin set of favored ones.
Visual Beginner
The single picture to hold is a tilted bowl-shaped energy over the space of candidate functions, with one lowest point, and the two guarantees that force that point to exist and be unique.
Read the three panels left to right. The left panel is the move from a point-by-point rule to a single averaged balance condition. Multiplying by a test function and summing turns the demanding local equation into a softer global one, tested against every test function at once.
The middle panel is the heart of the matter: the averaged balance defines a tilted bowl over the space of candidate functions. The cap on steepness means the bowl has no cliffs, and the upward curving means it has no flat-bottomed trenches running to infinity. A bowl with neither cliffs nor trenches, sitting over a space that contains all its limit points, has exactly one lowest point, and that point is the weak solution.
The right panel is continuous dependence. Tilt or reshape the bowl a little by changing the data, and its lowest point slides only a little. This is what makes weak solutions a usable notion: the solution is not just unique, it is stable against small perturbations of the data.
Worked example Beginner
We turn a simple boundary-value problem into its averaged form and check the upward-curving by hand. Take the interval from zero to one, and the equation with held at zero at both ends. This is the one-dimensional version of the elliptic problems of this unit.
Step 1. Multiply and sum. Pick any test function that is also zero at both ends. Multiply the equation by and add up over the interval: the left side becomes the total of across the interval, and the right side becomes the total of .
Step 2. Move one slope across. Using integration by parts and the fact that vanishes at the two ends, the total of equals the total of . So the averaged balance reads: the total of equals the total of , for every test function . The second derivative has been traded for a product of first slopes, which is the weak form.
Step 3. Read off the energy. Setting in the left side gives the total of times itself, that is the total of . This is the "bowl" value at , and it can never be negative.
Step 4. Check the upward curving with a number. Take , which is zero at both ends. Its slope is . The total of over the interval works out to , a strictly positive number. The energy of any nonzero clamped function is strictly positive, never zero; this is the upward curving that coercivity demands. By the Poincaré comparison of the sibling unit, the height of is also controlled by this same slope total, so the bowl really does climb in every direction.
Step 5. See what coercivity buys. Because the energy of a nonzero clamped function is strictly positive and bounded below by the function's own size, the bowl cannot run flat to infinity. A capped steepness plus this strictly positive curving give exactly one lowest point: one weak solution.
What this tells us: the weak form replaces a second-derivative equation by a balance of first slopes, and the positivity of the slope-energy is precisely the curving that guarantees a unique solution. The clamping at the ends is what makes the energy strictly positive, exactly as it made the Poincaré inequality possible.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is open and bounded, is the Hilbert space of 02.16.03 and 24.01.01 with inner product (equivalent, by the Poincaré inequality of 02.16.03, to the full inner product) and dual . The duality pairing of with is written . All function spaces are real; the abstract theorem and its hypotheses are taken over a real Hilbert space, with 02.11.08 supplying completeness, the inner product, and the Riesz representation theorem.
Definition (bilinear form, boundedness, coercivity). Let be a real Hilbert space. A map is a bilinear form if it is linear in each argument separately. It is bounded (continuous) if there is with and **coercive** if there is with No symmetry is assumed: is not required. When is symmetric, coercivity makes an equivalent inner product on .
Definition (the elliptic operator and its weak form). Let with , and consider the second-order divergence-form operator
is uniformly elliptic if there is with
Multiplying by and integrating by parts moves one derivative off the leading term and defines the bilinear form associated to ,
which extends to all by density. Given , a function is a weak solution of the Dirichlet problem in , on , if
The boundary condition on is encoded by the membership (zero trace, 02.16.03); no pointwise regularity of is presumed.
Counterexamples to common slips Intermediate+
Coercivity is not automatic from ellipticity alone. Uniform ellipticity gives , but the lower-order terms can spoil positivity. With a large negative , e.g. for above the first Dirichlet eigenvalue , the form is negative on the first eigenfunction. Lax-Milgram does not apply; one shifts to and uses the Fredholm alternative instead.
Boundedness needs the coefficients in , not merely measurable. If some is unbounded, need not satisfy , and the form is not even defined on all of . The hypothesis is what makes a bounded bilinear form on the Sobolev space.
Symmetry is a convenience, not a requirement. The first-order drift makes non-symmetric. The symmetric Riesz/variational route (minimizing ) then fails, since a non-symmetric is not the second differential of an energy. Lax-Milgram is exactly the extension of Riesz representation that survives the loss of symmetry.
The wrong sign of the leading term destroys coercivity. Writing (the backward heat / anti-elliptic sign) flips the leading quadratic to , and no coercivity constant exists. The divergence-form minus sign is the convention that makes the principal part positive; this unit fixes that sign in the definition of .
Key theorem with proof Intermediate+
Theorem (Lax-Milgram). Let be a real Hilbert space and a bounded coercive bilinear form, with constants and as above. Then for every bounded linear functional there is a unique with and the solution obeys the a priori bound [Lax-Milgram 1954] [Evans 2010 §6.2].
Proof. For each fixed , the map is a bounded linear functional on , of norm at most . By the Riesz representation theorem 02.11.08 there is a unique element, call it , with for all . The map is linear (bilinearity of and uniqueness of the Riesz representative) and bounded: . Likewise has a Riesz representative with . The problem for all is therefore equivalent to the single equation in .
It remains to show is a bijection of . Coercivity gives, for every , so . This lower bound forces to be injective and to have closed range: if , then is Cauchy because , so and by continuity. Suppose the range were a proper closed subspace; then there is a nonzero , and in particular , forcing , a contradiction. Hence and is a continuous linear bijection with . Setting solves uniquely, and .
An equivalent and constructive proof replaces the closed-range argument by a contraction. For the affine map satisfies with ; choosing makes the bracket strictly below , so is a contraction on the complete space and the Banach fixed-point theorem yields the unique fixed point , i.e. .
Bridge. Lax-Milgram is exactly the Riesz representation theorem of 02.11.08 with symmetry removed: when is symmetric and coercive it is an equivalent inner product and the solution is the genuine Riesz representative of , while the general case substitutes the bounded inverse , built either by the closed-range argument or by the contraction, for that representative — this is the foundational reason existence here costs no compactness and no spectral theory. The a priori bound is the continuous-dependence statement of the Beginner tier made quantitative, and it builds toward the energy method for elliptic problems where coercivity is supplied by uniform ellipticity through Gårding's inequality. The abstract theorem appears again in 24.01.03 in the finite-dimensional Galerkin form that underlies the finite element method, where the same coercivity gives Céa's quasi-optimality estimate; putting these together, the central insight is that a bounded coercive bilinear form on a complete space is invertible by representation alone, and this is exactly what turns the averaged weak formulation into a solvable problem.
Exercises Intermediate+
Advanced results Master
The abstract theorem and the elliptic existence theorem sit inside a larger structure: the variational inequalities of Stampacchia that generalize the equation to convex constraints, the Babuška-Nečas inf-sup condition that replaces coercivity for saddle-point and non-coercive problems, the Gårding-plus-Fredholm dichotomy that handles the full range of zeroth-order coefficients, the Galerkin discretization that descends the whole apparatus to finite dimensions, and the higher regularity that recovers the classical solution from the weak one. Each refines the bounded-coercive-form argument of the Intermediate tier.
Theorem 1 (Stampacchia, variational inequalities). Let be a bounded coercive bilinear form on and a nonempty closed convex set. For every there is a unique with and depends Lipschitz-continuously on [Stampacchia 1964]. When the inequality forces equality and recovers Lax-Milgram; when is symmetric, minimizes over , the constrained Dirichlet principle. The proof iterates the projection onto (which exists and is a contraction because is closed convex in a Hilbert space) with the contraction of the Lax-Milgram fixed-point argument; the composition is still a contraction for small , and its fixed point solves the inequality. Obstacle problems and elliptic free-boundary problems are the standard instances.
Theorem 2 (Babuška-Nečas inf-sup; the coercivity-free generalization). Coercivity is sufficient but not necessary for well-posedness of . On Hilbert spaces (trial) and (test), the problem is well-posed for every if and only if is bounded, satisfies the inf-sup condition and is non-degenerate in the test variable ( for each ), with . Coercivity is the special case with the diagonal already realizing the supremum. The inf-sup framework is what makes mixed finite element methods (Stokes flow, Darcy flow, the Hellinger-Reissner elasticity formulation) well-posed, where the natural bilinear form is a saddle-point form with no coercivity on the whole space.
Theorem 3 (the full Fredholm alternative for ). Let be uniformly elliptic with coefficients on a bounded domain. There is (the Gårding constant) such that for , is boundedly invertible . Consequently exactly one of the following holds [Evans 2010 §6.2] [Gilbarg-Trudinger 1983 §8.6]:
(i) for every the problem has a unique weak solution; or
(ii) the homogeneous problem has a nonzero solution, the solution spaces of and of the formal adjoint are finite-dimensional of equal dimension, and is solvable iff annihilates that adjoint kernel. The set of for which has a nonzero solution — the spectrum — is at most countable, real when is self-adjoint, and discrete with no finite accumulation point, by the compactness of 02.16.03 applied to the resolvent. This is the elliptic analogue of the finite-dimensional dichotomy "a square matrix is either invertible or has a kernel."
Theorem 4 (Galerkin approximation and Céa's lemma). Let be a finite-dimensional subspace and the unique solution of for all (existence by Lax-Milgram applied on ). Then the Galerkin solution is quasi-optimal:
so the discrete error is, up to the condition-number factor , the best approximation error of from . The proof is Galerkin orthogonality combined with coercivity and boundedness. This is the theoretical foundation of the finite element method of 24.01.03: convergence of the numerical scheme is reduced to the approximation power of the subspaces .
Theorem 5 (interior regularity). If additionally , , and , then the weak solution of lies in with the estimate for , and holds a.e. [Evans 2010 §6.3] [Gilbarg-Trudinger 1983 §8.3]. The method is difference quotients: the translation estimate of 02.16.03 is run in reverse — a uniform bound on the difference quotients in promotes to , hence to . Bootstrapping with Schauder or theory recovers the classical solution when the data are smooth: the weak solution found by Lax-Milgram was the classical solution all along.
Synthesis. Lax-Milgram is the foundational reason the weak formulation is solvable: it is exactly the Riesz representation theorem with symmetry removed, and the existence of weak solutions of elliptic boundary-value problems is this single abstract fact fed the bilinear form whose coercivity comes from uniform ellipticity through Gårding's inequality. The a priori bound is dual to the coercivity constant, one read as stability of the solution and the other as positivity of the energy, and putting these together the central insight is that bounded-plus-coercive is the entire content of well-posedness in the coercive case — generalised by Babuška-Nečas to the inf-sup condition when the diagonal no longer realizes the supremum, and by Stampacchia to convex constraints when the equation becomes a variational inequality.
The compactness of the Sobolev embedding 02.16.03 is what upgrades the coercive shift into the Fredholm alternative for the full operator, so the existence theory and the spectral theory of are the same compactness read twice; this is exactly the structure that descends, through Galerkin orthogonality and Céa's quasi-optimality, to the finite element method of 24.01.03, where the same constants and that proved existence now control the numerical error. The whole edifice appears again wherever an elliptic problem must be solved before it can be analyzed: the higher regularity of Theorem 5 recovers the classical solution from the weak one by running the translation estimate of 02.16.03 in reverse, closing the loop from the rough averaged formulation back to the smooth point-by-point equation.
Full proof set Master
Proposition 1 (existence and uniqueness for coercive ). Let be uniformly elliptic on a bounded with , and suppose the associated form is coercive on (e.g. , , or above the threshold of Exercise 5). Then for every there is a unique weak solution of , with .
Proof. Boundedness of on is the estimate of Exercise 3, valid because the coefficients lie in and the Poincaré inequality of 02.16.03 controls by . Coercivity is assumed. The functional is bounded on by definition of . The Lax-Milgram theorem applied to the real Hilbert space (complete by 24.01.01, with inner product and Riesz representation from 02.11.08) yields a unique with for all , which is the definition of a weak solution, and the a priori bound is the Lax-Milgram estimate.
Proposition 2 (Gårding's inequality). Under uniform ellipticity with constant and coefficients in , there are and with for all .
Proof. Ellipticity gives . For the first-order term, Cauchy-Schwarz and Young's inequality with parameter give , choosing so the first term is . The zeroth-order term satisfies . Summing the three contributions, which is the claim with and . When and both subtracted terms vanish and is coercive outright.
Proposition 3 (solvability of the shifted problem and the Fredholm alternative). With as in Proposition 2, has a coercive form for every , so is a bounded operator ; moreover is compact, and the Fredholm alternative holds for .
Proof. For , , and is bounded, so by Proposition 1 the map solving is well-defined and bounded . Restricting the source to and post-composing with the compact embedding of 02.16.03, is the composition of a bounded and a compact map, hence compact. A function solves iff , i.e. , i.e. in . Since is compact and self-adjoint when is (positive coercive symmetric form), the Fredholm alternative and spectral theorem for compact operators 02.11.08 give: either is invertible and has a unique solution for all , or is an eigenvalue of of finite multiplicity, the kernels of and have equal finite dimension, and is solvable iff . The eigenvalues of accumulate only at , so the spectrum of is discrete with no finite accumulation point.
Proposition 4 (Céa quasi-optimality of the Galerkin solution). Let be finite-dimensional and the Galerkin solution. Then .
Proof. Lax-Milgram on the finite-dimensional (hence complete) subspace gives existence and uniqueness of with for all . Subtracting from yields Galerkin orthogonality for all . For any , coercivity and orthogonality give , where the middle equality inserts into the orthogonal slot. Dividing by gives , and taking the infimum over completes the proof.
Connections Master
The abstract engine is the Riesz representation theorem and the Hilbert-space structure of
02.11.08: Lax-Milgram is that theorem with symmetry dropped, and the whole existence theory of this unit is the representation of the data functional by a solution through the bounded inverse of the form-induced operator . This unit owns the non-symmetric extension and its elliptic application;02.11.08owns the symmetric inner-product representation.The coercivity that powers the energy method is the Poincaré inequality and the compact embedding of
02.16.03: Poincaré makes an equivalent norm so that uniform ellipticity yields coercivity, and the compactness is exactly what turns the coercive shift into the Fredholm alternative and the discrete spectrum for the full operator. The sibling unit supplies both the coercivity input and the compactness input to this existence theory.The Sobolev-space framework — the spaces and , weak derivatives, the trace giving meaning to the zero boundary condition — is built in
24.01.01, and the weak/variational formulation itself, together with the finite element discretization governed by Céa's lemma, is developed concretely for the Poisson case in24.01.03. This unit provides the general-operator existence theorem of which24.01.03treats the symmetric Poisson special case.The higher regularity that recovers a classical solution from the weak one runs the difference-quotient method, the reverse of the translation estimate proved in
02.16.03; the Schauder and De Giorgi-Nash-Moser theories of02.17.04and02.17.07then bootstrap the weak solution to full classical regularity, so the existence theorem of this unit is the entry point to the regularity chapter.The Fredholm alternative and the discreteness of the elliptic spectrum invoke the compact-operator and spectral theory of
02.11.05: the solution operator of the coercive shift composed with the compact Sobolev embedding is the single compact operator whose Fredholm theory governs solvability of , making this unit the place where abstract compact-operator theory meets the concrete elliptic boundary-value problem.
Historical & philosophical context Master
The weak formulation of elliptic problems grew out of the Dirichlet principle, the nineteenth-century idea — championed by Riemann and rehabilitated by Hilbert — of solving by minimizing the Dirichlet energy. The rigorous Hilbert-space form of the existence question crystallized after Frigyes Riesz's 1934 representation theorem for continuous linear functionals on a Hilbert space [Riesz 1934], which identified the symmetric case completely. Peter Lax and Arthur Milgram, in their 1954 paper in the Contributions to the Theory of Partial Differential Equations volume of the Annals of Mathematics Studies [Lax-Milgram 1954], extended representation to bounded coercive forms without symmetry, precisely the generality needed for elliptic operators carrying first-order drift; their motivation was parabolic equations, where the non-self-adjoint structure is unavoidable.
The coercivity that makes the method work for genuine elliptic operators is Lars Gårding's 1953 inequality [Gårding 1953], which derived the energy estimate from uniform ellipticity and thereby connected the algebraic positivity of the coefficient matrix to the functional-analytic coercivity of the form. The Fredholm alternative invoked for the non-coercive range traces to Erik Ivar Fredholm's 1903 Acta Mathematica theory of integral equations [Fredholm 1903], whose dichotomy for with compact is the abstract template realized here through the compact Sobolev embedding. Guido Stampacchia's 1964 Comptes Rendus note [Stampacchia 1964] extended the existence theory from equations to variational inequalities on convex sets, opening the modern theory of obstacle and free-boundary problems, and the inf-sup condition formalized by Ivo Babuška and Jindřich Nečas removed even coercivity, supplying the well-posedness theory for the saddle-point problems of mixed finite elements.
Bibliography Master
@incollection{LaxMilgram1954,
author = {Lax, Peter D. and Milgram, Arthur N.},
title = {Parabolic equations},
booktitle = {Contributions to the Theory of Partial Differential Equations},
series = {Annals of Mathematics Studies},
number = {33},
publisher = {Princeton University Press},
year = {1954},
pages = {167--190}
}
@article{Garding1953,
author = {G\aa{}rding, Lars},
title = {Dirichlet's problem for linear elliptic partial differential equations},
journal = {Mathematica Scandinavica},
volume = {1},
year = {1953},
pages = {55--72}
}
@article{Stampacchia1964,
author = {Stampacchia, Guido},
title = {Formes bilin\'eaires coercitives sur les ensembles convexes},
journal = {Comptes Rendus de l'Acad\'emie des Sciences Paris},
volume = {258},
year = {1964},
pages = {4413--4416}
}
@article{Fredholm1903,
author = {Fredholm, Erik Ivar},
title = {Sur une classe d'\'equations fonctionnelles},
journal = {Acta Mathematica},
volume = {27},
year = {1903},
pages = {365--390}
}
@article{Riesz1934,
author = {Riesz, Frigyes},
title = {Zur Theorie des Hilbertschen Raumes},
journal = {Acta Scientiarum Mathematicarum (Szeged)},
volume = {7},
year = {1934},
pages = {34--38}
}
@book{Necas2012,
author = {Ne\v{c}as, Ji\v{r}\'i},
title = {Direct Methods in the Theory of Elliptic Equations},
series = {Springer Monographs in Mathematics},
publisher = {Springer},
year = {2012}
}