02.16.01 · analysis / sobolev-weak-solutions

Sobolev Inequalities: the Gagliardo-Nirenberg-Sobolev and Morrey Inequalities

shipped3 tiersLean: none

Anchor (Master): Evans §5.6-§5.8; Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §7.7-§7.8; Adams-Fournier, Sobolev Spaces, 2e (Academic Press 2003), Ch. 4-5; Maz'ya, Sobolev Spaces, 2e (Springer 2011), Ch. 1; Lieb-Loss, Analysis, 2e (AMS 2001), §8.3 (sharp constants, Talenti-Aubin)

Intuition Beginner

A Sobolev inequality is a precise statement of an idea you already trust: if you know that a function does not change too fast, then the function itself cannot be too big or too wild. The rate of change is captured by the derivative, the gradient; the inequality says that controlling the size of the gradient automatically controls the size of the function. You give up information about the slope, and in exchange you get information about the height.

Why would anyone want this trade? Because in most of mathematical physics the natural quantity you can bound is an energy, and energy is built from the gradient. The energy of a stretched membrane, the energy stored in an electric field, the kinetic energy of a flow: each is an integral of the square of a derivative. A Sobolev inequality is the bridge that converts a bound on this energy into a bound on the thing you actually care about, the displacement, the potential, the velocity itself.

There are two faces of the same coin, and which one you get depends on a competition between two numbers: how many derivatives you control, and how many dimensions you live in. When dimension wins, the function need not even be bounded, but its overall size in an averaged sense improves: you started controlling the function in one averaging scale and you end up controlling it in a stronger one. This is the Gagliardo-Nirenberg-Sobolev regime. When the derivative wins, you get something much stronger and more tangible: the function is genuinely continuous, with no jumps, and in fact it cannot change value too quickly between any two nearby points. This is the Morrey regime.

The dividing line is set by comparing the number of derivatives you control against the dimension. Below the line you buy improved averaged size; above the line you buy honest continuity. Right on the line is a delicate borderline case where the function is almost bounded, missing it only by a whisker, and the correct statement involves an exponential rather than a power.

A useful everyday picture: imagine pouring a fixed amount of paint that must be spread so that its slope is never steep. In a cramped low-dimensional space, the paint has nowhere to hide and piles up into a visible, continuous coat. In a roomy high-dimensional space, the same slope budget lets the paint thin out and spread, so it need not form a continuous coat, but its total spread-out concentration still improves. Dimension is room to spread; derivative control is the leash on the slope. The Sobolev inequalities measure exactly how these two forces balance.

Visual Beginner

The single most useful picture is the competition between two numbers, drawn as a number line.

The number line is the whole story. Put your finger at the dimension. Slide left, where the integrability of the gradient is smaller than the dimension: you are in the regime where you buy averaged size, and the averaging scale you end up with, the critical exponent, is determined by a formula that gets larger and larger as you approach the dimension from below. Slide right, where the integrability of the gradient exceeds the dimension: you buy continuity, and a quantitative smoothness number that measures how gently the function is allowed to vary. The exact crossing point, where the two numbers are equal, is the borderline case, where the function is almost but not quite bounded.

The second picture is the paint cartoons. A fixed slope budget in a small room forces a visible continuous coat; the same slope budget in a large room lets the paint thin and spread without ever forming a coat. The Sobolev inequalities turn this qualitative intuition into an exact accounting.

Worked example Beginner

We test the central formula on a concrete shape and watch the bookkeeping work out. Take three-dimensional space, so the dimension is three. Suppose we control the gradient in the averaging scale with exponent two, that is, we control the integral of the squared gradient, the most common energy in physics. We want to know the improved averaging scale for the function itself.

Step 1. Identify the numbers. The dimension is three. The exponent controlling the gradient is two. We need the dimension to beat the exponent for this regime, and indeed three is bigger than two, so we are on the Gagliardo-Nirenberg-Sobolev side.

Step 2. Apply the critical-exponent formula. The improved averaging scale is the dimension times the exponent, divided by the dimension minus the exponent. Substitute: the numerator is three times two, which is six. The denominator is three minus two, which is one. So the improved averaging scale is six divided by one, namely six.

Step 3. Read the conclusion. Controlling the integral of the squared gradient of a function in three-dimensional space controls the integral of the sixth power of the function itself. We started controlling the function's slope at the level of squares and we ended up controlling the function at the level of sixth powers, a genuinely stronger grip.

Step 4. Sanity-check the direction. The improved number, six, is larger than the starting number, two. A larger averaging exponent is a stronger form of control, because it weighs tall narrow spikes more heavily and so forbids them more strictly. The trade gave us something better than we put in, which is the whole point.

Step 5. Watch the formula misbehave as a warning. Keep the dimension at three but imagine pushing the gradient exponent up toward three. The denominator, the dimension minus the exponent, shrinks toward zero, and the improved averaging scale blows up. This is the inequality announcing the borderline: when the gradient exponent equals the dimension, the clean power-law conclusion breaks and a subtler exponential statement takes over.

What this tells us: the formula is a bookkeeping device that converts control of the slope into stronger control of the function, with the exact strength dictated by a competition between the gradient exponent and the dimension. The blow-up as the two approach each other is not a flaw; it is the inequality pointing at its own boundary.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, is the dimension, is open, and . The Sobolev space and its norm are taken as already defined 24.01.01: means together with weak partial derivatives for , with , where and . We write for the closure of in . The machinery, Hölder's inequality, and Minkowski's inequality are taken as available 02.07.06.

Definition (critical Sobolev exponent). For , the *critical Sobolev exponent** is Note and $p^ \to \inftyp \uparrow n$.

The exponent is forced by scaling. If and for , then a change of variables gives and . For an inequality of the form to hold for all with a constant independent of , the two powers of must match: , which rearranges to . Any other exponent fails by sending or .

Definition (Hölder space). For and open, the Hölder space consists of bounded continuous for which the seminorm is finite, normed by . A function in is Hölder continuous with exponent ; is Lipschitz continuity.

The two regimes. The Gagliardo-Nirenberg-Sobolev (GNS) inequality covers and asserts a continuous embedding . The Morrey inequality covers (here is admissible, with the seminorm read off below) and asserts with , after redefinition of on a set of measure zero. The borderline falls under neither power-law conclusion: need not be bounded (the standard example near the origin in dimension lies in of a ball yet is unbounded), and the correct endpoint statement is exponential integrability in the sense of Trudinger-Moser, or membership in BMO [Trudinger 1967] [Brezis-Wainger 1980].

Counterexamples to common slips Intermediate+

  • The GNS exponent is not a free parameter. The embedding for holds with the single critical on all of ; lower exponents fail on because constants and slowly-decaying functions have finite gradient norm but infinite norm. On a bounded domain the full ladder for holds, because the finite measure lets Hölder interpolate down from , but the gain stops at .
  • Morrey requires strictly above . At the candidate Hölder exponent degenerates to zero, and continuity genuinely fails: has gradient in near the origin but is unbounded, so no Hölder estimate can hold.
  • The embeddings are for the right space. The GNS inequality with the gradient norm alone on the right needs either or decaying at infinity; for general on a bounded domain one needs the full norm on the right and a Lipschitz (or extension-domain) boundary, because the bare gradient cannot see additive constants.
  • Weak derivatives, not classical. All statements use the weak gradient. The Morrey conclusion is that the equivalence class of contains a (unique) Hölder-continuous representative; the original function may have been defined arbitrarily on a null set.

Key theorem with proof Intermediate+

Theorem (Gagliardo-Nirenberg-Sobolev inequality). Let . There is a constant such that for every , [Gagliardo 1958] [Nirenberg 1959] [Evans 2010 §5.6.1].

Proof. The heart is the case , where ; the general case follows by applying the result to a power of and using Hölder's inequality.

Step 1 (fundamental theorem of calculus on each axis). Fix . For each and each , integrate the -th partial derivative along the -th coordinate line from up to . Because has compact support, , and therefore where records all coordinates except the -th, and depends only on . This holds for every simultaneously.

Step 2 (multiply the estimates). Raise to the power and multiply over : The left side is the target exponent: . The right side is a product of functions, the -th of which is independent of the variable .

Step 3 (Loomis-Whitney / iterated generalized Hölder). Integrate the inequality of Step 2 over , one variable at a time, pulling out the factor that does not depend on the current integration variable and applying the generalized Hölder inequality with exponents each equal to to the remaining factors. Integrate first in . The factor is constant in and pulls out; the remaining factors () each depend on , and generalized Hölder with exponents gives Repeat the procedure successively in . After all integrations, each has been integrated over all of its variables exactly once inside a power, yielding the last step using and the arithmetic-geometric mean inequality on the factors. This is precisely the case: (The combinatorial estimate just executed is the Loomis-Whitney inequality: the size of a function is bounded by the geometric mean of the sizes of its coordinate marginals.)

Step 4 (bootstrap from to general ). For apply the inequality to with the exponent chosen below. Then almost everywhere, so where is the Hölder conjugate. The exponent is fixed by the requirement that the two powers of match: . Solving gives , and then . Dividing both sides by the common -integral factor (finite because ) leaves The exponent on the left simplifies to , giving with .

Bridge. The proof is the foundational reason the critical exponent is what it is: is not chosen for convenience but forced, first by the scaling identity that opened the formal-definition section and again here by the algebra that makes the two -powers in Step 4 coincide. This is exactly the same move — integrate a derivative back to the function, then balance exponents — that the Morrey estimate below performs with a single radial integration instead of axis integrations, so the two inequalities are dual faces of one fundamental-theorem-of-calculus argument. Putting these together with a density argument extends both from to all of , which builds toward the general embedding ladder for and the compactness refinement (Rellich-Kondrachov). The GNS inequality appears again in 02.16.02 as the existence engine for weak solutions of elliptic equations via the direct method, and the sharp-constant version generalises to the isoperimetric inequality, recovered as the case applied to the indicator of a smooth set.

Exercises Intermediate+

Advanced results Master

The first-order inequalities organize a much larger structure: the full Sobolev embedding ladder, the sharp constants and their extremals, the compactness refinement, the borderline endpoints, and the fractional and trace generalizations. Each is a refinement of the two fundamental-theorem-of-calculus arguments above.

Theorem 1 (general Sobolev embedding; Sobolev 1938). Let be open and bounded with boundary, , and . If , then for , the embedding being continuous. If , then , where and if , and any number in if [Sobolev 1938] [Evans 2010 §5.6.3]. The two clauses are the iterated GNS and iterated Morrey arguments of Exercise 8, with the reciprocal-exponent accounting supplying the unified bookkeeping: integrability and differentiability trade at the fixed rate per derivative.

Theorem 2 (Rellich-Kondrachov compactness; Rellich 1930, Kondrachov 1945). Let be bounded with boundary and . Then for every with (strictly subcritical), the embedding is compact: bounded sequences in have subsequences converging strongly in . Compactness fails at the critical exponent itself, where concentration (a bubble ) and translation to infinity both produce bounded non-convergent sequences. The loss of compactness at is the central analytic difficulty in critical elliptic problems and is quantified by the concentration-compactness principle.

Theorem 3 (sharp constant and extremals; Talenti 1976, Aubin 1976). For , the best constant in is and the extremal functions (where equality holds) are exactly the **Aubin-Talenti bubbles** [Talenti 1976] [Aubin 1976]. The extremals are unique modulo the symmetry group (translations, dilations, scalar multiples), and for they are the standard bubbles solving the critical Lane-Emden equation that appear in the Yamabe problem.

Theorem 4 (Trudinger-Moser borderline; Trudinger 1967, Moser 1971). At the borderline the embedding fails, but exponential integrability holds: there are constants such that for , bounded, with , and the constant (with the surface area of the unit sphere) is sharp: for any larger the supremum is infinite [Trudinger 1967] [Moser 1971]. The Trudinger-Moser inequality is the correct endpoint of the Sobolev ladder, replacing the failed bound by membership in the exponential Orlicz class; an alternative endpoint reading places in BMO and the John-Nirenberg space [Brezis-Wainger 1980].

Theorem 5 (fractional Sobolev and trace; Gagliardo 1957, Aronszajn-Slobodeckij). For and , the fractional Sobolev space , normed by the Gagliardo seminorm , embeds continuously into with , and the trace operator maps onto for . The trace theorem is the precise statement that boundary values of functions lose exactly of a derivative, the foundational fact for boundary-value problems in the weak formulation. The fractional scale interpolates the integer Sobolev spaces and is the natural setting for the nonlocal operators and for boundary integral equations.

Synthesis. The Sobolev inequalities are the foundational reason the calculus of variations and the weak theory of PDE work at all, and the entire structure is generated by a single principle made precise in two ways: integrate a derivative to recover the function, then balance exponents by scaling. The GNS inequality is exactly this with axis integrations and the Loomis-Whitney product estimate; the Morrey inequality is exactly this with one radial integration and a Riesz potential; and the general embedding ladder of Theorem 1 is the iteration of these two, with the reciprocal-exponent rule as the bridge that makes integrability and differentiability a single tradeable currency. Putting these together, the central insight is that the critical exponent is not a parameter but a scaling invariant, and this is exactly why compactness fails there (Theorem 2): the dilation symmetry that fixes also produces the non-compact bubbling sequences, and the same Aubin-Talenti bubbles that saturate the sharp constant (Theorem 3) are the bubbles that obstruct compactness and that solve the critical Euler-Lagrange equations.

The borderline generalises the power-law conclusion to the exponential Trudinger-Moser endpoint, and the whole edifice generalises further: to fractional orders and trace spaces (Theorem 5), which appears again in 02.16.04 as the device that gives meaning to boundary data; to Riemannian manifolds, where the sharp constant controls the Yamabe invariant; and to the abstract theory of interpolation spaces, where the Sobolev embedding is a single instance of the real and complex interpolation functors. The arc from Sobolev's 1938 averaging lemma to the modern concentration-compactness method is one continuous refinement of the same fundamental-theorem-of-calculus argument, generalised until it became the load-bearing inequality of twentieth-century analysis.

Full proof set Master

Proposition 1 (Loomis-Whitney inequality). Let be non-negative measurable functions on , and for write for with the -th coordinate deleted. Then

Proof. Induct on using the generalized Hölder inequality with equal exponents . For the claim is , an equality by Tonelli. For the inductive step, integrate first in . The factor does not depend on and pulls out; the remaining factors , , each depend on , and generalized Hölder with exponents equal to gives Now integrate the result over the remaining variables and apply the inductive hypothesis in dimension to the functions (each a function of of the variables, after the -integration), absorbing the leftover factor by Tonelli. The exponents collect to give the stated product bound.

Proposition 2 (GNS by density on ). The inequality extends from to all , .

Proof. Let . Smooth functions with compact support are dense in 24.01.01, so choose with in ; in particular in . The Key Theorem gives , so is Cauchy in , with limit . Passing to a subsequence, almost everywhere (from convergence) and almost everywhere (from convergence), so a.e. Then .

Proposition 3 (Morrey inequality). Let . There is so that every has, for all , and consequently (after redefinition on a null set, in the statement) lies in .

Proof. Fix , set , and let , a set of measure . For , the oscillation bound of Exercise 7 applied at and at gives and likewise at . Average over : Since and , each average over is bounded by a constant times the corresponding average over , hence by . Adding, . Taking on any fixed ball bounds the sup norm; for one applies the estimate to a smooth approximating sequence, which is then uniformly Hölder, hence uniformly convergent to a continuous representative.

Proposition 4 (failure at ). In dimension , .

Proof. Take near the origin (smoothly cut off near ). In polar coordinates, as . Then , which converges for by the substitution , . So . Yet as , so . Hence the embedding into fails exactly at the borderline , where the Morrey exponent degenerates to zero.

Connections Master

  • The Sobolev space scaffolding — the definition of , the weak derivative, the density of smooth functions, and the extension and approximation theorems — is supplied by 24.01.01, which surveys the embedding theorems proved here in full. This unit owns the deep embedding theorems; 24.01.01 owns the space. The split follows the spec discipline that the foundational unit defines the object and the downstream unit builds the heavy theory.

  • The apparatus on which every estimate rests — Hölder's inequality, the generalized Hölder inequality with several exponents, Minkowski's inequality, and completeness — is developed in 02.07.06. The Loomis-Whitney product estimate is iterated generalized Hölder, and the bootstrap from to general in the Key Theorem is a single application of Hölder with conjugate exponents and .

  • The fundamental-theorem-of-calculus and integration-along-segments steps, together with the chain rule used in the bootstrap, are the multivariable differentiation results of 02.05.04; the Morrey oscillation bound is a multivariable mean-value estimate along the segment joining two points.

  • The embeddings are the existence engine for weak solutions of elliptic boundary-value problems in 02.16.02: the direct method of the calculus of variations minimizes an energy over , and Rellich-Kondrachov compactness (Theorem 2) is exactly what upgrades a minimizing sequence's weak limit to a genuine minimizer. The trace theorem (Theorem 5) gives meaning to the boundary data, developed in 02.16.04.

  • The critical-exponent obstruction reappears in geometric analysis: the sharp constant (Theorem 3) is the Yamabe constant of the round sphere, and the Aubin-Talenti bubbles are the concentration profiles in the Yamabe problem and in critical semilinear equations , linking to the variational and geometric PDE material in 02.16.05.

Historical & philosophical context Master

Sergei Sobolev introduced the spaces now bearing his name and proved the foundational embedding lemma in his 1938 Matematicheskii Sbornik paper [Sobolev 1938], motivated by the Cauchy problem for hyperbolic equations and the need for a function-space framework in which weak solutions could be sought and bounded. Sobolev's averaging method (mollification by a smooth kernel) and his integral lemma gave the first general embedding , establishing the reciprocal-exponent trade that organizes the entire theory.

The sharp first-order inequalities were brought to their modern form by Emilio Gagliardo in his 1958 Ricerche di Matematica paper [Gagliardo 1958] and independently by Louis Nirenberg in his 1959 Annali della Scuola Normale Superiore di Pisa paper on elliptic equations [Nirenberg 1959]; the combinatorial product estimate at the heart of the case had appeared a decade earlier in the 1949 note of Lynn Loomis and Hassler Whitney [Loomis-Whitney 1949], who proved it as a discrete-geometric inequality bounding a set's measure by its coordinate projections. Charles Morrey's 1940 Duke Mathematical Journal paper [Morrey 1940] established the complementary regime, the embedding into Hölder spaces for , in his study of the differentiability of solutions of variational problems.

The sharp constants and extremal functions were found independently in 1976 by Giorgio Talenti [Talenti 1976], using symmetric decreasing rearrangement and the radial Euler-Lagrange equation, and by Thierry Aubin [Aubin 1976] in the context of the Yamabe problem in Riemannian geometry; the resulting Aubin-Talenti bubbles became the canonical concentration profiles of critical-exponent problems. The borderline was settled by Neil Trudinger's 1967 Journal of Mathematics and Mechanics paper [Trudinger 1967] establishing exponential integrability, with the sharp constant determined by Jürgen Moser's 1971 Indiana University Mathematics Journal paper [Moser 1971]. The complementary BMO and John-Nirenberg reading of the endpoint was clarified by Haïm Brezis and Stephen Wainger in 1980 [Brezis-Wainger 1980].

Bibliography Master

@article{Sobolev1938,
  author  = {Sobolev, Sergei L.},
  title   = {On a theorem of functional analysis},
  journal = {Matematicheskii Sbornik},
  volume  = {4(46)},
  year    = {1938},
  pages   = {471--497},
  note    = {English transl. AMS Translations (2) 34 (1963), 39--68}
}

@article{Gagliardo1958,
  author  = {Gagliardo, Emilio},
  title   = {Propriet\`a di alcune classi di funzioni in pi\`u variabili},
  journal = {Ricerche di Matematica},
  volume  = {7},
  year    = {1958},
  pages   = {102--137}
}

@article{Nirenberg1959,
  author  = {Nirenberg, Louis},
  title   = {On elliptic partial differential equations},
  journal = {Annali della Scuola Normale Superiore di Pisa, Serie 3},
  volume  = {13},
  year    = {1959},
  pages   = {115--162}
}

@article{Morrey1940,
  author  = {Morrey, Charles B.},
  title   = {Functions of several variables and absolute continuity, II},
  journal = {Duke Mathematical Journal},
  volume  = {6},
  year    = {1940},
  pages   = {187--215}
}

@article{LoomisWhitney1949,
  author  = {Loomis, Lynn H. and Whitney, Hassler},
  title   = {An inequality related to the isoperimetric inequality},
  journal = {Bulletin of the American Mathematical Society},
  volume  = {55},
  year    = {1949},
  pages   = {961--962}
}

@article{Talenti1976,
  author  = {Talenti, Giorgio},
  title   = {Best constant in {S}obolev inequality},
  journal = {Annali di Matematica Pura ed Applicata},
  volume  = {110},
  year    = {1976},
  pages   = {353--372}
}

@article{Aubin1976,
  author  = {Aubin, Thierry},
  title   = {Probl\`emes isop\'erim\'etriques et espaces de {S}obolev},
  journal = {Journal of Differential Geometry},
  volume  = {11},
  year    = {1976},
  pages   = {573--598}
}

@article{Trudinger1967,
  author  = {Trudinger, Neil S.},
  title   = {On imbeddings into {O}rlicz spaces and some applications},
  journal = {Journal of Mathematics and Mechanics},
  volume  = {17},
  year    = {1967},
  pages   = {473--483}
}

@article{Moser1971,
  author  = {Moser, J\"urgen},
  title   = {A sharp form of an inequality by {N}. {T}rudinger},
  journal = {Indiana University Mathematics Journal},
  volume  = {20},
  year    = {1971},
  pages   = {1077--1092}
}

@article{BrezisWainger1980,
  author  = {Brezis, Ha\"im and Wainger, Stephen},
  title   = {A note on limiting cases of {S}obolev embeddings and convolution inequalities},
  journal = {Communications in Partial Differential Equations},
  volume  = {5},
  year    = {1980},
  pages   = {773--789}
}

@book{Evans2010,
  author    = {Evans, Lawrence C.},
  title     = {Partial Differential Equations},
  edition   = {2},
  publisher = {American Mathematical Society},
  series    = {Graduate Studies in Mathematics},
  volume    = {19},
  year      = {2010}
}

@book{GilbargTrudinger1983,
  author    = {Gilbarg, David and Trudinger, Neil S.},
  title     = {Elliptic Partial Differential Equations of Second Order},
  edition   = {2},
  publisher = {Springer},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {224},
  year      = {1983}
}

@book{AdamsFournier2003,
  author    = {Adams, Robert A. and Fournier, John J. F.},
  title     = {Sobolev Spaces},
  edition   = {2},
  publisher = {Academic Press},
  year      = {2003}
}

@book{Mazya2011,
  author    = {Maz'ya, Vladimir},
  title     = {Sobolev Spaces},
  edition   = {2},
  publisher = {Springer},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {342},
  year      = {2011}
}

@book{LiebLoss2001,
  author    = {Lieb, Elliott H. and Loss, Michael},
  title     = {Analysis},
  edition   = {2},
  publisher = {American Mathematical Society},
  series    = {Graduate Studies in Mathematics},
  volume    = {14},
  year      = {2001}
}