Interior and Boundary H^2 Regularity of Weak Elliptic Solutions
Anchor (Master): Evans §6.3; Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 2001), §8.3-§8.4, §8.8-§8.13; Nirenberg, Remarks on strongly elliptic partial differential equations (Comm. Pure Appl. Math. 1955); Agmon-Douglis-Nirenberg I (Comm. Pure Appl. Math. 1959)
Intuition Beginner
A weak solution of an elliptic equation is found by a clever evasion. Instead of demanding that a function obey a differentiation rule at every point, the weak formulation only asks that it pass an averaged balance test against every probe function. This is much easier to satisfy, and the sibling unit shows it always has a unique answer. But it leaves a worry hanging: the function we found might be jagged, barely possessing one derivative, while the equation we wanted to solve speaks of second derivatives. Did we really solve the problem, or only a watered-down shadow of it?
The answer, for elliptic equations with smooth ingredients, is reassuring. The weak solution is secretly far smoother than the bare definition guarantees. If the data feeding the equation is square-summable, the solution turns out to have two square-summable derivatives. If the data is smoother still, the solution is smoother still, and the gain is always exactly two derivatives over the data. Push this all the way and a solution driven by infinitely smooth data is itself infinitely smooth. The watered-down weak solution was the real thing in disguise.
Why should averaging buy back smoothness? The secret is that elliptic equations control curvature in every direction at once; there is no favored thin direction along which roughness can hide. A non-elliptic equation, say one that diffuses only sideways, would let a solution stay rough in the unwatched direction. Ellipticity watches everything, so any roughness the solution might carry is immediately seen by the equation and forbidden by the data being smooth.
The tool that makes this precise is the divided difference: compare the solution's value at a point with its value a tiny step away, and divide by the step size. As the step shrinks this becomes a derivative. The trick is to show these divided differences stay bounded no matter how small the step, because a function whose divided differences stay bounded already has the next derivative. The whole proof is a careful accounting that the divided differences of the gradient cannot blow up, which hands us the second derivative for free.
Visual Beginner
The single picture to hold is a divided difference closing in on a derivative, paired with the idea that a uniform bound on those divided differences is exactly the next derivative waiting to be claimed.
Read the panels left to right. The left panel is the basic move: a divided difference is a secant slope, and shrinking the spacing tilts the secant toward the tangent. This is the everyday picture of a derivative as a limit of divided differences.
The middle panel is the engine of the whole subject. We do not take the limit first; we instead show that the secant slopes of the gradient stay trapped inside a fixed band no matter how small the spacing. A function whose divided differences never escape such a band is exactly a function that already owns the next derivative, and that derivative inherits the size of the band. Trapping the divided differences of the gradient therefore promotes the gradient to having its own derivatives, which are the second derivatives of the solution.
The right panel is the payoff. A weak solution starts life looking as though it might be rough, but feeding it through an elliptic equation with smooth data smooths it, gaining exactly two derivatives over whatever smoothness the data carries.
Worked example Beginner
We watch divided differences in action on a function we already understand, to see why a uniform bound on them is the same as owning the next derivative. Work on the line and take the smooth function .
Step 1. Form the divided difference. Pick a small step and compute the divided difference of : take the value a step to the right, subtract the value here, and divide by the step. This is the quantity , which for equals .
Step 2. Expand and simplify. The top is . Dividing by leaves . So the divided difference of at spacing is exactly .
Step 3. Read the bound. On the interval from to , for any step between and , the value stays between and . The divided differences are bounded by , and this bound does not depend on how small is. That uniform bound is the whole point.
Step 4. Take the limit. As the step shrinks to zero, approaches , which is the genuine derivative of . The bounded divided differences converged to an honest derivative, and the size of that derivative is no bigger than the bound we found.
Step 5. Run it backward. The logic also works in reverse, and that is the form used in the proof. Suppose all we knew was that the divided differences of some function stayed below a fixed number for every spacing. That alone forces the function to have a square-summable derivative bounded by the same number. We never need the formula; the uniform bound by itself delivers the next derivative.
What this tells us: a divided difference is a stand-in for a derivative that we can manipulate before the derivative is known to exist, and a bound on the divided differences that ignores the step size is exactly a certificate that the next derivative exists and is no larger. The regularity proof manufactures such a certificate for the gradient of the solution, and that certificate is the second derivative.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is open and bounded, and is the divergence-form operator of 02.16.04,
uniformly elliptic with ellipticity constant , meaning for a.e. and all . The Sobolev spaces , the local spaces , and the inequalities used below are those of 24.01.01 and 02.16.01. Given , a function is a weak solution of if
the bilinear form being that of 02.16.04; existence and uniqueness in are supplied by the Lax-Milgram theory of that unit.
Definition (difference quotient). For a function on , a coordinate direction , and a step , the -th difference quotient of size is defined for in any subdomain once . Write . The discrete product rule reads , and the discrete integration-by-parts (summation by parts) identity reads whenever one factor is supported away from at scale .
Definition (the estimate). The operator satisfies the interior estimate if for every there is with for every weak solution of . The same inequality with replaced by and by , valid when is and , is the boundary (global) estimate.
Counterexamples to common slips Intermediate+
Regularity is not free of hypotheses on the coefficients. The leading coefficients must be at least Lipschitz (here or better) for regularity; merely bounded measurable give Hölder regularity through De Giorgi-Nash-Moser
02.17.07but not . The difference-quotient argument differentiates the equation once, and that costs one derivative of the coefficients.The estimate is a priori, not a stand-alone bound. The inequality presumes is already a weak solution. It does not assert that any function with small is in ; the term on the right is genuinely needed and cannot be dropped (constants solve and require the lower-order term).
Interior regularity says nothing at the boundary by itself. A weak solution can lie in yet fail to be in if has a corner. The square domain's solution of the Dirichlet problem can have an unbounded second derivative at a reentrant corner; up-to-the-boundary requires (or convexity), and this is why the boundary argument is separated from the interior one.
The gain is exactly two derivatives, not more. With one gets , not . To reach one needs and ; demanding from alone is false, as the Newtonian potential of an density shows.
Key theorem with proof Intermediate+
Theorem (interior regularity; Nirenberg). Let with , let , let be uniformly elliptic with constant , and let . If is a weak solution of , then , the equation holds almost everywhere, and for every with depending only on , the norms of the , the norms of , and [Nirenberg 1955] [Evans 2010 §6.3].
Proof. It suffices to bound the tangential and mixed second derivatives by difference quotients and then recover the remaining second derivative from the equation. Fix and a cutoff with and on . Work first with the principal part; the lower-order terms are absorbed at the end. Write the weak form , where collects the source and the lower-order terms (these lie in because ).
Fix a direction and a step with , and insert the test function legitimate because the difference quotients of an function are again and localises the support. Using summation by parts and the discrete product rule, the left side becomes . Expanding and writing , the left side splits as For , expand and use uniform ellipticity of the shifted matrix on the leading piece: The cross term is bounded, by Cauchy-Schwarz and Young's inequality , by , where depends on , , . For , the factor is bounded in by (mean value theorem), and , so after the same Young's inequality. The right side is estimated by ; since , one more Young's inequality gives .
Collecting the four estimates and absorbing the three terms into the left-hand leaves with independent of . The right side is finite and independent of , so the family is bounded in uniformly in . By the Nirenberg difference-quotient lemma (Proposition 1 below), for every , that is , with Finally the norm on the right is itself controlled by through the Caccioppoli energy inequality (testing the weak form against and using ellipticity), giving the stated estimate. Because , one may integrate the weak form by parts in reverse to find for all , whence a.e.
Bridge. This interior estimate is exactly the difference-quotient method of the sibling unit run in reverse: the translation estimate of 02.16.01, read forwards, says an function has bounded difference quotients, and the foundational reason regularity works is that the converse also holds, so a uniform bound on the difference quotients of promotes to and to . The energy supplied by Lax-Milgram in 02.16.04 is the coercivity that, transplanted to the differentiated equation, produces the controlling bound ; this is exactly the same ellipticity inequality powering existence, now powering smoothness. The estimate builds toward the bootstrap and appears again in 24.01.01 as the regularity that makes finite-element error estimates of optimal order possible, and putting these together, the central insight is that ellipticity controls all second derivatives simultaneously, so existence and regularity are one coercivity estimate read at two different levels of differentiation.
Exercises Intermediate+
Advanced results Master
The interior estimate sits at the base of a tower: the up-to-the-boundary estimate on smooth domains, the analogue of Calderón-Zygmund that replaces the Hilbert-space energy by singular-integral bounds, the Schauder companion on the Hölder scale, the elliptic-systems generalisation of Agmon-Douglis-Nirenberg, and the analytic-hypoellipticity refinement for analytic data. Each is the same principle — ellipticity converts smoothness of the data into smoothness of the solution with a fixed two-derivative gain — read on a different scale of function spaces.
Theorem 1 (global regularity up to the boundary). Let , , , , and let be the weak solution of with zero boundary data. Then and with [Evans 2010 §6.3] [Gilbarg-Trudinger 2001 §8.12]. The proof flattens the boundary: near a boundary point a change of variables straightens to a piece of the hyperplane , the operator transforms to another uniformly elliptic operator with leading coefficients, and the difference quotients are taken only in the tangential directions — these preserve the homogeneous Dirichlet condition because translating tangentially keeps on the flattened boundary. The interior argument then bounds all tangential and mixed second derivatives with in . The single remaining pure normal second derivative is recovered algebraically from the equation: , and makes the coefficient invertible, so . A partition of unity glues the boundary charts to the interior estimate.
Theorem 2 ( (Calderón-Zygmund) regularity). Let , (or VMO), . Then the weak solution lies in with . The energy method is unavailable for because is not a Hilbert space; the proof instead represents through the second derivatives of the Newtonian potential, which is a Calderón-Zygmund singular integral, and invokes the boundedness of such operators. At this reduces, by Plancherel, to the energy estimate of this unit; the difference-quotient method is the shortcut that avoids the full singular-integral machinery. The borderline cases and fail, which is why data does not give solutions and Schauder theory uses Hölder spaces instead.
Theorem 3 (Schauder companion). If instead and for some , the weak solution is in with . This is the Hölder-scale twin of the estimate developed in 02.17.04; the two scales are linked by the Sobolev embedding for , so high-integrability regularity feeds Hölder regularity and conversely. The Schauder constant degenerates as or , mirroring the failure of theory at .
Theorem 4 (Agmon-Douglis-Nirenberg for elliptic systems). For a properly elliptic system of order with smooth coefficients and complementing (Lopatinskii-Shapiro) boundary conditions, the a priori estimate holds, with the same fixed gain of derivatives over the data [Agmon-Douglis-Nirenberg 1959]. The scalar second-order case of this unit is the prototype; the systems theory replaces uniform ellipticity by the invertibility of the principal symbol and replaces the single Dirichlet condition by the algebraic complementing condition that makes the boundary value problem Fredholm.
Theorem 5 (analytic and hypoellipticity). If the coefficients and are real-analytic on , the solution is real-analytic on (elliptic operators are analytic-hypoelliptic), strengthening the corollary; this is the elliptic case of the Petrowsky theorem. The version states that is hypoelliptic: on an open set forces there, regardless of how rough is globally. Ellipticity is what makes the operator invertible modulo smoothing on the symbol level, the pseudodifferential reading of regularity.
Synthesis. The difference-quotient estimate is the foundational reason the weak formulation loses nothing: existence in 02.16.04 and regularity here are the single coercivity inequality of ellipticity, read first at the level of the equation and then at the level of its derivative, and this is exactly why the rough averaged solution is the smooth classical solution in disguise. The translation estimate of 02.16.01 is dual to the Nirenberg lemma — one says implies bounded difference quotients, the other says bounded difference quotients imply — and putting these together the gain of exactly two derivatives is the central insight that organises every scale: the scale here, the Calderón-Zygmund scale of Theorem 2, and the Hölder scale of 02.17.04 are three readings of the same gain, generalising to the systems estimate of Agmon-Douglis-Nirenberg where the gain becomes the order of the operator. The boundary argument appears again wherever a finite-element or spectral method needs the solution to be smooth enough for its convergence rate, so the regularity proved here is dual to the numerical approximation theory of 24.01.01; the bridge is that ellipticity, expressed as invertibility of the principal symbol, is the structural fact that converts data regularity into solution regularity with a fixed gain on every function-space scale at once.
Full proof set Master
Proposition 1 (Nirenberg difference-quotient lemma). Let . Then with if and only if for every and every direction , .
Proof. () is Exercise 3: for , . () is Exercise 4: the uniform bound makes bounded in , so a subsequence weakly with ; testing the summation-by-parts identity against and letting identifies as the weak derivative , so with .
Proposition 2 (Caccioppoli energy inequality). Let be a weak solution of with , uniformly elliptic with bounded coefficients. For and a cutoff , on ,
Proof. Test the weak form against . The principal part gives . Uniform ellipticity bounds the first term below by . The second term, the drift, and the zeroth-order term are bounded above using Cauchy-Schwarz and Young's inequality with a small parameter , absorbing into the leading term; the source contributes . Collecting and absorbing leaves ; restricting the left integral to where gives the claim.
Proposition 3 (interior estimate, assembled). Under the hypotheses of the Key theorem, with for every .
Proof. The difference-quotient estimate of the Key theorem yields for each . Proposition 1 applied to (whose difference quotients are controlled by the previous bound) gives for all , hence with . Proposition 2 replaces by . Adding and to the left gives the full norm with the stated bound.
Proposition 4 (boundary via tangential difference quotients). Let and solve . Then with the global estimate of Theorem 1.
Proof. Cover by finitely many charts; in each a diffeomorphism flattens to , carrying to a uniformly elliptic with leading coefficients and to vanishing on . For the tangential difference quotient still vanishes on , so is an admissible test function in the half-ball, and the interior argument bounds uniformly in . By Proposition 1 the second derivatives with lie in . The pure normal derivative is recovered from the equation: , and gives . Transforming back and summing over the charts with a partition of unity, together with the interior estimate of Proposition 3 on the inner region, gives and the global bound.
Connections Master
The existence theory of
02.16.04produces the weak solution whose smoothness this unit establishes: Lax-Milgram supplies , and the difference-quotient method upgrades it to , so the two units are the existence and regularity halves of a single elliptic boundary-value theory. The coercivity that powered existence is the same ellipticity inequality that, transplanted to the differentiated equation, powers the regularity estimate; this unit owns the regularity gain,02.16.04owns the solvability.The Sobolev inequalities and embeddings of
02.16.01provide both the translation estimate that is the forward half of the Nirenberg characterisation and the embedding that converts the bootstrap into the corollary. The sibling unit supplies the difference-quotient input and the smoothness-output dictionary that bracket the regularity argument at both ends.The Sobolev-space framework , , traces, and extension of
24.01.01is the ambient setting in which the estimate is even stated, and the up-to-the-boundary regularity proved here is exactly the hypothesis that makes finite-element error estimates of optimal order in the energy norm and in rigorous, so this unit is the regularity backbone of the numerical theory of24.01.01.The maximum principle of
02.17.02is the pointwise complement to this regularity theory: where the maximum principle controls the size and sign of from boundary data, the estimate controls the size of from the source, and together they give both the qualitative and quantitative a priori control needed for the method of continuity in02.17.05.The Hölder-scale Schauder theory of
02.17.04and the Calderón-Zygmund theory of02.17.06are the two siblings of this estimate on other function-space scales; all three deliver the same two-derivative gain, are linked by Sobolev embedding, and feed the existence-by-a-priori-estimates scheme, making this unit the energy-method entry to the three-scale regularity package of the chapter.
Historical & philosophical context Master
The question of whether a weak or generalised solution is automatically smooth — Hilbert's nineteenth problem in its analytic form — drove the development of elliptic regularity through the first half of the twentieth century. Hermann Weyl's 1940 lemma, proved by the method of orthogonal projection [Weyl 1940], gave the first clean modern instance: a distribution that is weakly harmonic is smooth and classically harmonic, establishing that the averaged notion of a harmonic function loses nothing. Kurt Friedrichs extended the smoothness conclusion to general strongly elliptic operators in his 1953 paper on the differentiability of solutions [Friedrichs 1953], introducing mollification and the systematic use of a priori estimates in Sobolev norms.
The difference-quotient method that makes the theory elementary is due to Louis Nirenberg, whose 1955 paper on strongly elliptic equations [Nirenberg 1955] isolated the characterisation of membership through uniformly bounded difference quotients and used it to prove interior regularity by differentiating the equation discretely rather than through the Fourier transform. The method's economy — it needs only the energy inequality and weak compactness, no singular integrals — is what made it the standard textbook route. Sergei Agmon, Avron Douglis, and Nirenberg then established the definitive and Schauder estimates up to the boundary for general elliptic systems with complementing boundary conditions in their 1959 paper [Agmon-Douglis-Nirenberg 1959], fixing the now-standard form of the a priori estimate with its characteristic gain of the operator's order. The energy estimate of this unit is the Hilbert-space, second-order, scalar case of that general theory.
Bibliography Master
@article{Nirenberg1955,
author = {Nirenberg, Louis},
title = {Remarks on strongly elliptic partial differential equations},
journal = {Communications on Pure and Applied Mathematics},
volume = {8},
year = {1955},
pages = {649--675}
}
@article{ADN1959,
author = {Agmon, Shmuel and Douglis, Avron and Nirenberg, Louis},
title = {Estimates near the boundary for solutions of elliptic partial differential equations satisfying general boundary conditions I},
journal = {Communications on Pure and Applied Mathematics},
volume = {12},
year = {1959},
pages = {623--727}
}
@article{Friedrichs1953,
author = {Friedrichs, Kurt O.},
title = {On the differentiability of the solutions of linear elliptic differential equations},
journal = {Communications on Pure and Applied Mathematics},
volume = {6},
year = {1953},
pages = {299--326}
}
@article{Weyl1940,
author = {Weyl, Hermann},
title = {The method of orthogonal projection in potential theory},
journal = {Duke Mathematical Journal},
volume = {7},
year = {1940},
pages = {411--444}
}
@book{Evans2010,
author = {Evans, Lawrence C.},
title = {Partial Differential Equations},
edition = {2},
series = {Graduate Studies in Mathematics 19},
publisher = {American Mathematical Society},
year = {2010}
}
@book{GilbargTrudinger2001,
author = {Gilbarg, David and Trudinger, Neil S.},
title = {Elliptic Partial Differential Equations of Second Order},
edition = {2},
series = {Grundlehren der mathematischen Wissenschaften 224},
publisher = {Springer},
year = {2001}
}