02.16.01 · analysis / sobolev-weak-solutions

Sobolev Inequalities: the Gagliardo-Nirenberg-Sobolev and Morrey Inequalities

shipped3 tiersLean: none

Anchor (Master): Evans §5.6-§5.8; Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §7.7-§7.8; Adams-Fournier, Sobolev Spaces, 2e (Academic Press 2003), Ch. 4-5; Maz'ya, Sobolev Spaces, 2e (Springer 2011), Ch. 1; Lieb-Loss, Analysis, 2e (AMS 2001), §8.3 (sharp constants, Talenti-Aubin)

Intuition Beginner

A Sobolev inequality is a precise statement of an idea you already trust: if you know that a function does not change too fast, then the function itself cannot be too big or too wild. The rate of change is captured by the derivative, the gradient; the inequality says that controlling the size of the gradient automatically controls the size of the function. You give up information about the slope, and in exchange you get information about the height.

Why would anyone want this trade? Because in most of mathematical physics the natural quantity you can bound is an energy, and energy is built from the gradient. The energy of a stretched membrane, the energy stored in an electric field, the kinetic energy of a flow: each is an integral of the square of a derivative. A Sobolev inequality is the bridge that converts a bound on this energy into a bound on the thing you actually care about, the displacement, the potential, the velocity itself.

There are two faces of the same coin, and which one you get depends on a competition between two numbers: how many derivatives you control, and how many dimensions you live in. When dimension wins, the function need not even be bounded, but its overall size in an averaged sense improves: you started controlling the function in one averaging scale and you end up controlling it in a stronger one. This is the Gagliardo-Nirenberg-Sobolev regime. When the derivative wins, you get something much stronger and more tangible: the function is genuinely continuous, with no jumps, and in fact it cannot change value too quickly between any two nearby points. This is the Morrey regime.

The dividing line is set by comparing the number of derivatives you control against the dimension. Below the line you buy improved averaged size; above the line you buy honest continuity. Right on the line is a delicate borderline case where the function is almost bounded, missing it only by a whisker, and the correct statement involves an exponential rather than a power.

A useful everyday picture: imagine pouring a fixed amount of paint that must be spread so that its slope is never steep. In a cramped low-dimensional space, the paint has nowhere to hide and piles up into a visible, continuous coat. In a roomy high-dimensional space, the same slope budget lets the paint thin out and spread, so it need not form a continuous coat, but its total spread-out concentration still improves. Dimension is room to spread; derivative control is the leash on the slope. The Sobolev inequalities measure exactly how these two forces balance.

Visual Beginner

The single most useful picture is the competition between two numbers, drawn as a number line.

The number line is the whole story. Put your finger at the dimension. Slide left, where the integrability of the gradient is smaller than the dimension: you are in the regime where you buy averaged size, and the averaging scale you end up with, the critical exponent, is determined by a formula that gets larger and larger as you approach the dimension from below. Slide right, where the integrability of the gradient exceeds the dimension: you buy continuity, and a quantitative smoothness number that measures how gently the function is allowed to vary. The exact crossing point, where the two numbers are equal, is the borderline case, where the function is almost but not quite bounded.

The second picture is the paint cartoons. A fixed slope budget in a small room forces a visible continuous coat; the same slope budget in a large room lets the paint thin and spread without ever forming a coat. The Sobolev inequalities turn this qualitative intuition into an exact accounting.

Worked example Beginner

We test the central formula on a concrete shape and watch the bookkeeping work out. Take three-dimensional space, so the dimension is three. Suppose we control the gradient in the averaging scale with exponent two, that is, we control the integral of the squared gradient, the most common energy in physics. We want to know the improved averaging scale for the function itself.

Step 1. Identify the numbers. The dimension is three. The exponent controlling the gradient is two. We need the dimension to beat the exponent for this regime, and indeed three is bigger than two, so we are on the Gagliardo-Nirenberg-Sobolev side.

Step 2. Apply the critical-exponent formula. The improved averaging scale is the dimension times the exponent, divided by the dimension minus the exponent. Substitute: the numerator is three times two, which is six. The denominator is three minus two, which is one. So the improved averaging scale is six divided by one, namely six.

Step 3. Read the conclusion. Controlling the integral of the squared gradient of a function in three-dimensional space controls the integral of the sixth power of the function itself. We started controlling the function's slope at the level of squares and we ended up controlling the function at the level of sixth powers, a genuinely stronger grip.

Step 4. Sanity-check the direction. The improved number, six, is larger than the starting number, two. A larger averaging exponent is a stronger form of control, because it weighs tall narrow spikes more heavily and so forbids them more strictly. The trade gave us something better than we put in, which is the whole point.

Step 5. Watch the formula misbehave as a warning. Keep the dimension at three but imagine pushing the gradient exponent up toward three. The denominator, the dimension minus the exponent, shrinks toward zero, and the improved averaging scale blows up. This is the inequality announcing the borderline: when the gradient exponent equals the dimension, the clean power-law conclusion breaks and a subtler exponential statement takes over.

What this tells us: the formula is a bookkeeping device that converts control of the slope into stronger control of the function, with the exact strength dictated by a competition between the gradient exponent and the dimension. The blow-up as the two approach each other is not a flaw; it is the inequality pointing at its own boundary.

Check your understanding Beginner

Exercise (easy, multiple choice).

In the Gagliardo-Nirenberg-Sobolev regime, the critical exponent (the improved averaging scale for the function) is given by which formula, where $n$ is the dimension and $p$ is the gradient exponent with $p < n$ ?

A. $p^{*} = n + p$ B. $p^{*} = n p / (n - p)$ C. $p^{*} = (n - p) / (n p)$ D. $p^{*} = n - p$

Hint

The new exponent should grow without bound as $p$ rises toward $n$ , since the denominator must shrink to zero there.

Answer

B. $p^ = np / (n - p)$.* The critical Sobolev exponent puts the product $n p$ over the difference $n - p$ . As $p$ approaches $n$ from below, the denominator $n - p$ shrinks to zero and $p^{*}$ grows without bound, which is the signal that the borderline case $p = n$ needs separate treatment. Feedback-correct: this is the only formula that blows up at $p = n$ and gives a number larger than $p$ . Feedback-wrong: A and D give numbers no larger than $n$ and do not blow up; C is upside down and would shrink to zero, the opposite of improved control.

Formal definition Intermediate+

Throughout, $n \geq 1$ is the dimension, $U \subseteq R^{n}$ is open, and $1 \leq p < \infty$ . The Sobolev space $W^{1, p} (U)$ and its norm are taken as already defined 24.01.01: $u \in W^{1, p} (U)$ means $u \in L^{p} (U)$ together with weak partial derivatives $D_{i} u \in L^{p} (U)$ for $i = 1, \dots, n$ , with $∥ u ∥_{W^{1, p} (U)} = (∥ u ∥_{L^{p} (U)}^{p} + ∥ D u ∥_{L^{p} (U)}^{p})^{1/ p}$ , where $D u = (D_{1} u, \dots, D_{n} u)$ and $∣ D u ∣ = (\sum_{i} (D_{i} u)^{2})^{1/2}$ . We write $W_{0}^{1, p} (U)$ for the closure of $C_{c}^{\infty} (U)$ in $W^{1, p} (U)$ . The $L^{p}$ machinery, Hölder's inequality, and Minkowski's inequality are taken as available 02.07.06.

Definition (critical Sobolev exponent). For $1 \leq p < n$ , the *critical Sobolev exponent** is $p^{*} = \frac{n p}{n - p}, equivalently \frac{1}{p ^{*}} = \frac{1}{p} - \frac{1}{n} .$ Note $p^{*} > p$ and $p^ \to \infty $a s$ p \uparrow n$.

The exponent $p^{*}$ is forced by scaling. If $u \in C_{c}^{\infty} (R^{n})$ and $u_{λ} (x) = u (λ x)$ for $λ > 0$ , then a change of variables gives $∥ u_{λ} ∥_{L^{q}} = λ^{- n / q} ∥ u ∥_{L^{q}}$ and $∥ D u_{λ} ∥_{L^{p}} = λ^{1 - n / p} ∥ D u ∥_{L^{p}}$ . For an inequality of the form $∥ u ∥_{L^{q}} \leq C ∥ D u ∥_{L^{p}}$ to hold for all $u$ with a constant $C$ independent of $λ$ , the two powers of $λ$ must match: $- n / q = 1 - n / p$ , which rearranges to $q = p^{*}$ . Any other exponent fails by sending $λ \to 0$ or $λ \to \infty$ .

Definition (Hölder space). For $0 < γ \leq 1$ and $U$ open, the Hölder space $C^{0, γ} (\overset{ˉ}{U})$ consists of bounded continuous $u : \overset{ˉ}{U} \to R$ for which the seminorm $[u]_{C^{0, γ} (\overset{ˉ}{U})} = x, y \in U x \neq = y sup \frac{∣ u ( x ) - u ( y ) ∣}{∣ x - y ∣ ^{γ}}$ is finite, normed by $∥ u ∥_{C^{0, γ} (\overset{ˉ}{U})} = sup_{U} ∣ u ∣ + [u]_{C^{0, γ} (\overset{ˉ}{U})}$ . A function in $C^{0, γ}$ is Hölder continuous with exponent $γ$ ; $γ = 1$ is Lipschitz continuity.

The two regimes. The Gagliardo-Nirenberg-Sobolev (GNS) inequality covers $1 \leq p < n$ and asserts a continuous embedding $W^{1, p} (R^{n}) ↪ L^{p^{*}} (R^{n})$ . The Morrey inequality covers $n < p < \infty$ (here $p \leq \infty$ is admissible, with the seminorm read off below) and asserts $W^{1, p} (R^{n}) ↪ C^{0, γ} (R^{n})$ with $γ = 1 - n / p$ , after redefinition of $u$ on a set of measure zero. The borderline $p = n$ falls under neither power-law conclusion: $u$ need not be bounded (the standard example $u (x) = lo g lo g (1 + 1/∣ x ∣)$ near the origin in dimension $n$ lies in $W^{1, n}$ of a ball yet is unbounded), and the correct endpoint statement is exponential integrability in the sense of Trudinger-Moser, or membership in BMO ^{[Trudinger 1967]} ^{[Brezis-Wainger 1980]}.

Counterexamples to common slips Intermediate+

The GNS exponent is not a free parameter. The embedding $W^{1, p} ↪ L^{q}$ for $p < n$ holds with the single critical $q = p^{*}$ on all of $R^{n}$ ; lower exponents $q < p^{*}$ fail on $R^{n}$ because constants and slowly-decaying functions have finite gradient norm but infinite $L^{q}$ norm. On a bounded domain the full ladder $W^{1, p} ↪ L^{q}$ for $p \leq q \leq p^{*}$ holds, because the finite measure lets Hölder interpolate down from $p^{*}$ , but the gain stops at $p^{*}$ .
Morrey requires $p$ strictly above $n$ . At $p = n$ the candidate Hölder exponent $1 - n / p$ degenerates to zero, and continuity genuinely fails: $lo g lo g (1 + 1/∣ x ∣)$ has gradient in $L^{n}$ near the origin but is unbounded, so no Hölder estimate can hold.
The embeddings are for the right space. The GNS inequality $∥ u ∥_{L^{p^{*}}} \leq C ∥ D u ∥_{L^{p}}$ with the gradient norm alone on the right needs either $u \in W_{0}^{1, p}$ or $u$ decaying at infinity; for general $u \in W^{1, p} (U)$ on a bounded domain one needs the full norm $∥ u ∥_{W^{1, p}}$ on the right and a Lipschitz (or extension-domain) boundary, because the bare gradient cannot see additive constants.
Weak derivatives, not classical. All statements use the weak gradient. The Morrey conclusion is that the equivalence class of $u$ contains a (unique) Hölder-continuous representative; the original $L^{p}$ function may have been defined arbitrarily on a null set.

Key theorem with proof Intermediate+

Theorem (Gagliardo-Nirenberg-Sobolev inequality). Let $1 \leq p < n$ . There is a constant $C = C (n, p)$ such that for every $u \in C_{c}^{1} (R^{n})$ , $∥ u ∥_{L^{p^{*}} (R^{n})} \leq C ∥ D u ∥_{L^{p} (R^{n})}, p^{*} = \frac{n p}{n - p}$ ^{[Gagliardo 1958]} ^{[Nirenberg 1959]} ^{[Evans 2010 §5.6.1]}.

Proof. The heart is the case $p = 1$ , where $p^{*} = n / (n - 1)$ ; the general case follows by applying the $p = 1$ result to a power of $∣ u ∣$ and using Hölder's inequality.

Step 1 (fundamental theorem of calculus on each axis). Fix $u \in C_{c}^{1} (R^{n})$ . For each $i \in {1, \dots, n}$ and each $x \in R^{n}$ , integrate the $i$ -th partial derivative along the $i$ -th coordinate line from $- \infty$ up to $x_{i}$ . Because $u$ has compact support, $u (x) = \int_{- \infty}^{x_{i}} D_{i} u (x_{1}, \dots, t, \dots, x_{n}) d t$ , and therefore $∣ u (x) ∣ \leq \int_{- \infty}^{\infty} ∣ D_{i} u (x_{1}, \dots, t, \dots, x_{n}) ∣ d t =: g_{i} (\overset{x}{^}_{i}),$ where $\overset{x}{^}_{i} = (x_{1}, \dots, x_{i - 1}, x_{i + 1}, \dots, x_{n})$ records all coordinates except the $i$ -th, and $g_{i}$ depends only on $\overset{x}{^}_{i}$ . This holds for every $i$ simultaneously.

Step 2 (multiply the $n$ estimates). Raise to the power $1/ (n - 1)$ and multiply over $i$ : $∣ u (x) ∣^{n / (n - 1)} = i = 1 \prod n ∣ u (x) ∣^{1/ (n - 1)} \leq i = 1 \prod n g_{i} (\overset{x}{^}_{i})^{1/ (n - 1)} .$ The left side is the target exponent: $n / (n - 1) = 1^{*}$ . The right side is a product of $n$ functions, the $i$ -th of which is independent of the variable $x_{i}$ .

Step 3 (Loomis-Whitney / iterated generalized Hölder). Integrate the inequality of Step 2 over $R^{n}$ , one variable at a time, pulling out the factor that does not depend on the current integration variable and applying the generalized Hölder inequality with $n - 1$ exponents each equal to $n - 1$ to the remaining factors. Integrate first in $x_{1}$ . The factor $g_{1} (\overset{x}{^}_{1})^{1/ (n - 1)}$ is constant in $x_{1}$ and pulls out; the remaining $n - 1$ factors $g_{i} (\overset{x}{^}_{i})^{1/ (n - 1)}$ ( $i \geq 2$ ) each depend on $x_{1}$ , and generalized Hölder with exponents $n - 1$ gives $\int_{R} i = 1 \prod n g_{i}^{1/ (n - 1)} d x_{1} \leq g_{1}^{1/ (n - 1)} i = 2 \prod n (\int_{R} g_{i} d x_{1})^{1/ (n - 1)} .$ Repeat the procedure successively in $x_{2}, \dots, x_{n}$ . After all $n$ integrations, each $g_{i}$ has been integrated over all of its $n - 1$ variables exactly once inside a $1/ (n - 1)$ power, yielding $\int_{R^{n}} ∣ u ∣^{n / (n - 1)} d x \leq i = 1 \prod n (\int_{R^{n}} ∣ D_{i} u ∣ d x)^{1/ (n - 1)} \leq (\int_{R^{n}} ∣ D u ∣ d x)^{n / (n - 1)},$ the last step using $∣ D_{i} u ∣ \leq ∣ D u ∣$ and the arithmetic-geometric mean inequality on the $n$ factors. This is precisely the $p = 1$ case: $∥ u ∥_{L^{n / (n - 1)}} \leq \int_{R^{n}} ∣ D u ∣ d x = ∥ D u ∥_{L^{1}} .$ (The combinatorial estimate just executed is the Loomis-Whitney inequality: the $L^{1}$ size of a function is bounded by the geometric mean of the $L^{1}$ sizes of its $n$ coordinate marginals.)

Step 4 (bootstrap from $p = 1$ to general $p$ ). For $1 < p < n$ apply the $p = 1$ inequality to $v = ∣ u ∣^{γ}$ with the exponent $γ = p (n - 1) / (n - p) > 1$ chosen below. Then $∣ D v ∣ = γ ∣ u ∣^{γ - 1} ∣ D u ∣$ almost everywhere, so $(\int ∣ u ∣^{γ n / (n - 1)})^{(n - 1) / n} \leq γ \int ∣ u ∣^{γ - 1} ∣ D u ∣ \leq γ (\int ∣ u ∣^{(γ - 1) p^{'}})^{1/ p^{'}} (\int ∣ D u ∣^{p})^{1/ p},$ where $p^{'} = p / (p - 1)$ is the Hölder conjugate. The exponent $γ$ is fixed by the requirement that the two powers of $∣ u ∣$ match: $γ n / (n - 1) = (γ - 1) p^{'}$ . Solving gives $γ = p (n - 1) / (n - p)$ , and then $γ n / (n - 1) = p^{*}$ . Dividing both sides by the common $∣ u ∣$ -integral factor (finite because $u \in C_{c}^{1}$ ) leaves $(\int ∣ u ∣^{p^{*}})^{(n - 1) / n - 1/ p^{'}} \leq γ (\int ∣ D u ∣^{p})^{1/ p} .$ The exponent on the left simplifies to $1/ p^{*}$ , giving $∥ u ∥_{L^{p^{*}}} \leq C (n, p) ∥ D u ∥_{L^{p}}$ with $C (n, p) = γ = p (n - 1) / (n - p)$ . $□$

Bridge. The proof is the foundational reason the critical exponent is what it is: $p^{*}$ is not chosen for convenience but forced, first by the scaling identity that opened the formal-definition section and again here by the algebra that makes the two $∣ u ∣$ -powers in Step 4 coincide. This is exactly the same move — integrate a derivative back to the function, then balance exponents — that the Morrey estimate below performs with a single radial integration instead of $n$ axis integrations, so the two inequalities are dual faces of one fundamental-theorem-of-calculus argument. Putting these together with a density argument extends both from $C_{c}^{1}$ to all of $W^{1, p}$ , which builds toward the general embedding ladder for $W^{k, p}$ and the compactness refinement (Rellich-Kondrachov). The GNS inequality appears again in 02.16.02 as the existence engine for weak solutions of elliptic equations via the direct method, and the sharp-constant version generalises to the isoperimetric inequality, recovered as the $p = 1$ case applied to the indicator of a smooth set.

Exercises Intermediate+

Exercise 4 (medium, symbolic).

Carry out the $p = 1$ , $n = 2$ case of GNS by hand: for $u \in C_{c}^{1} (R^{2})$ show $∥ u ∥_{L^{2}} \leq ∥ D_{1} u ∥_{L^{1}}^{1/2} ∥ D_{2} u ∥_{L^{1}}^{1/2} \leq \frac{1}{2} ∥ D u ∥_{L^{1}} \cdot$ (constant), via the two-axis fundamental theorem of calculus.

Hint

$∣ u (x_{1}, x_{2}) ∣^{2} \leq g_{1} (x_{2}) g_{2} (x_{1})$ where $g_{1} (x_{2}) = \int ∣ D_{1} u ∣ d x_{1}$ and $g_{2} (x_{1}) = \int ∣ D_{2} u ∣ d x_{2}$ . Integrate over $x_{1}$ then $x_{2}$ and pull out the constant-in-the-variable factor each time.

Answer

From the fundamental theorem of calculus along each axis, $∣ u (x_{1}, x_{2}) ∣ \leq \int_{R} ∣ D_{1} u (t, x_{2}) ∣ d t =: g_{1} (x_{2})$ and $∣ u (x_{1}, x_{2}) ∣ \leq \int_{R} ∣ D_{2} u (x_{1}, s) ∣ d s =: g_{2} (x_{1})$ . Multiply: $∣ u ∣^{2} \leq g_{1} (x_{2}) g_{2} (x_{1})$ . Integrate in $x_{1}$ (with $g_{1} (x_{2})$ constant in $x_{1}$ ): $\int ∣ u ∣^{2} d x_{1} \leq g_{1} (x_{2}) \int g_{2} (x_{1}) d x_{1} = g_{1} (x_{2}) ∥ D_{2} u ∥_{L^{1}}$ . Now integrate in $x_{2}$ : $\int ∣ u ∣^{2} d x \leq ∥ D_{2} u ∥_{L^{1}} \int g_{1} (x_{2}) d x_{2} = ∥ D_{1} u ∥_{L^{1}} ∥ D_{2} u ∥_{L^{1}}$ . So $∥ u ∥_{L^{2}} \leq ∥ D_{1} u ∥_{L^{1}}^{1/2} ∥ D_{2} u ∥_{L^{1}}^{1/2}$ . By AM-GM, $∥ D_{1} u ∥_{L^{1}}^{1/2} ∥ D_{2} u ∥_{L^{1}}^{1/2} \leq \frac{1}{2} (∥ D_{1} u ∥_{L^{1}} + ∥ D_{2} u ∥_{L^{1}}) \leq ∥ D u ∥_{L^{1}}$ (the last from $∣ D_{1} u ∣ + ∣ D_{2} u ∣ \leq 2 ∣ D u ∣$ , then absorbing the $2$ ). This is GNS at $n = 2$ , $p = 1$ , $p^{*} = 2$ .

Exercise 6 (medium, symbolic).

Verify that $u (x) = ∣ x ∣^{- α}$ on the unit ball $B_{1} \subset R^{n}$ lies in $W^{1, p} (B_{1})$ exactly when $α < (n - p) / p = n / p - 1$ , and check that the borderline $α = (n - p) / p$ makes $u \in L^{p^{*}}$ fail, confirming sharpness of GNS.

Hint

In polar coordinates $\int_{B_{1}} ∣ x ∣^{- β} d x < \infty$ iff $β < n$ . The gradient of $∣ x ∣^{- α}$ has size $\sim ∣ x ∣^{- α - 1}$ .

Answer

In polar coordinates $\int_{B_{1}} ∣ x ∣^{- β} d x = ω_{n - 1} \int_{0}^{1} r^{- β} r^{n - 1} d r$ , finite iff $n - 1 - β > - 1$ , i.e. $β < n$ . For $u = ∣ x ∣^{- α}$ : $∣ u ∣^{p} \sim ∣ x ∣^{- α p}$ is integrable iff $α p < n$ ; $∣ D u ∣ \sim ∣ x ∣^{- α - 1}$ , so $∣ D u ∣^{p} \sim ∣ x ∣^{- (α + 1) p}$ is integrable iff $(α + 1) p < n$ , i.e. $α < n / p - 1 = (n - p) / p$ . This is the binding constraint, so $u \in W^{1, p} (B_{1})$ iff $α < (n - p) / p$ . At the borderline $α = (n - p) / p$ one computes $α p^{*} = \frac{n - p}{p} \cdot \frac{n p}{n - p} = n$ , so $∣ u ∣^{p^{*}} \sim ∣ x ∣^{- n}$ is not integrable: $u \in / L^{p^{*}}$ at the borderline. Thus no exponent larger than $p^{*}$ (and not even $p^{*}$ for this borderline family) can be inserted, confirming GNS is sharp at $p^{*}$ .

Exercise 7 (hard, symbolic).

Prove the Morrey estimate at a point in the form of an oscillation bound: for $u \in C^{1} (R^{n})$ and $p > n$ , the average of $u$ over a ball $B (x, r)$ satisfies $\fint_{B (x, r)} ∣ u (y) - u (x) ∣ d y \leq C (n) r (\fint_{B (x, r)} ∣ D u ∣^{p} d y)^{1/ p} r^{?},$ identifying the missing power and deducing $γ = 1 - n / p$ .

Hint

Write $u (y) - u (x) = \int_{0}^{1} D u (x + t (y - x)) \cdot (y - x) d t$ . Average over $y \in B (x, r)$ , swap order of integration, change variables, and bound the resulting potential $\int_{B (x, r)} ∣ D u (z) ∣∣ z - x ∣^{1 - n} d z$ by Hölder with exponent $p$ .

Answer

By the fundamental theorem of calculus along the segment, $u (y) - u (x) = \int_{0}^{1} D u (x + t (y - x)) \cdot (y - x) d t$ , so $∣ u (y) - u (x) ∣ \leq ∣ y - x ∣ \int_{0}^{1} ∣ D u (x + t (y - x)) ∣ d t$ . Average over $y \in B (x, r)$ and swap integrals; the substitution $z = x + t (y - x)$ converts the average into the Riesz-type potential $\fint_{B (x, r)} ∣ u (y) - u (x) ∣ d y \leq \frac{C ( n )}{r ^{n}} \int_{B (x, r)} \frac{∣ D u ( z ) ∣}{∣ z - x ∣ ^{n - 1}} d z .$ Apply Hölder with exponents $p$ and $p^{'} = p / (p - 1)$ : $\int_{B (x, r)} \frac{∣ D u ( z ) ∣}{∣ z - x ∣ ^{n - 1}} d z \leq ∥ D u ∥_{L^{p} (B (x, r))} (\int_{B (x, r)} ∣ z - x ∣^{- (n - 1) p^{'}} d z)^{1/ p^{'}} .$ The singular integral converges precisely because $p > n$ : in polar form the exponent is $- (n - 1) p^{'} + (n - 1) = (n - 1) (1 - p^{'})$ and one needs $(n - 1) p^{'} < n + (...)$ ; the radial integral evaluates to a constant times $r^{(n - (n - 1) p^{'}) / p^{'}} = r^{n / p^{'} - (n - 1)} = r^{n - n / p - (n - 1)} = r^{1 - n / p}$ . Collecting, $\fint_{B (x, r)} ∣ u (y) - u (x) ∣ d y \leq C (n, p) ∥ D u ∥_{L^{p}} r^{1 - n / p}$ , the missing power being $r^{1 - n / p}$ . Applying this oscillation bound to two nearby points $x, y$ with $r = ∣ x - y ∣$ and the triangle inequality yields $∣ u (x) - u (y) ∣ \leq C ∥ D u ∥_{L^{p}} ∣ x - y ∣^{1 - n / p}$ , i.e. Hölder continuity with $γ = 1 - n / p$ .

Exercise 8 (hard, symbolic).

Deduce the general embedding $W^{k, p} (R^{n}) ↪ L^{q} (R^{n})$ with $\frac{1}{q} = \frac{1}{p} - \frac{k}{n}$ (for $k p < n$ ) by iterating the first-order GNS inequality $k$ times, stating the exponent at each stage.

Hint

One application of GNS to $u \in W^{1, p}$ gives $u \in L^{p^{*}}$ with $1/ p^{*} = 1/ p - 1/ n$ . If $u \in W^{k, p}$ then $D^{k - 1} u \in W^{1, p}$ . Iterate the reciprocal-exponent bookkeeping.

Answer

If $u \in W^{k, p}$ with $k p < n$ , then each derivative $D^{k - 1} u$ of order $k - 1$ lies in $W^{1, p}$ , so GNS gives $D^{k - 1} u \in L^{p_{1}}$ with $\frac{1}{p _{1}} = \frac{1}{p} - \frac{1}{n}$ ; hence $u \in W^{k - 1, p_{1}}$ . Iterate: after $j$ steps $u \in W^{k - j, p_{j}}$ with $\frac{1}{p _{j}} = \frac{1}{p} - \frac{j}{n}$ , valid as long as $p_{j} < n$ , i.e. $j p < n$ in the reciprocal sense. After $k$ steps, $u \in W^{0, p_{k}} = L^{p_{k}}$ with $\frac{1}{p _{k}} = \frac{1}{p} - \frac{k}{n} = \frac{1}{q}$ . Thus $W^{k, p} ↪ L^{q}$ , $\frac{1}{q} = \frac{1}{p} - \frac{k}{n}$ . The bookkeeping is additive in reciprocal exponents: each derivative gained is worth exactly $1/ n$ of integrability, so $k$ derivatives are worth $k / n$ . When $k p > n$ the iteration crosses the Morrey threshold at the step where $p_{j} > n$ , and the conclusion upgrades to Hölder continuity $C^{m, γ}$ with $m = k - ⌊ n / p ⌋ - 1$ and $γ$ the fractional remainder, the general Sobolev embedding theorem.

Advanced results Master

The first-order inequalities organize a much larger structure: the full Sobolev embedding ladder, the sharp constants and their extremals, the compactness refinement, the borderline endpoints, and the fractional and trace generalizations. Each is a refinement of the two fundamental-theorem-of-calculus arguments above.

Theorem 1 (general Sobolev embedding; Sobolev 1938). Let $U \subseteq R^{n}$ be open and bounded with $C^{1}$ boundary, $1 \leq p < \infty$ , and $k \in N$ . If $k p < n$ , then $W^{k, p} (U) ↪ L^{q} (U)$ for $\frac{1}{q} = \frac{1}{p} - \frac{k}{n}$ , the embedding being continuous. If $k p > n$ , then $W^{k, p} (U) ↪ C^{m, γ} (\overset{ˉ}{U})$ , where $m = k - ⌊ n / p ⌋ - 1$ and $γ = ⌊ n / p ⌋ + 1 - n / p$ if $n / p \in / Z$ , and $γ$ any number in $(0, 1)$ if $n / p \in Z$ ^{[Sobolev 1938]} ^{[Evans 2010 §5.6.3]}. The two clauses are the iterated GNS and iterated Morrey arguments of Exercise 8, with the reciprocal-exponent accounting $\frac{1}{q} = \frac{1}{p} - \frac{k}{n}$ supplying the unified bookkeeping: integrability and differentiability trade at the fixed rate $1/ n$ per derivative.

Theorem 2 (Rellich-Kondrachov compactness; Rellich 1930, Kondrachov 1945). Let $U$ be bounded with $C^{1}$ boundary and $1 \leq p < n$ . Then for every $q$ with $1 \leq q < p^{*}$ (strictly subcritical), the embedding $W^{1, p} (U) ↪↪ L^{q} (U)$ is compact: bounded sequences in $W^{1, p}$ have subsequences converging strongly in $L^{q}$ . Compactness fails at the critical exponent $q = p^{*}$ itself, where concentration (a bubble $u_{ε} (x) = ε^{- (n - p) / p} ϕ (x / ε)$ ) and translation to infinity both produce bounded non-convergent sequences. The loss of compactness at $p^{*}$ is the central analytic difficulty in critical elliptic problems and is quantified by the concentration-compactness principle.

Theorem 3 (sharp constant and extremals; Talenti 1976, Aubin 1976). For $1 < p < n$ , the best constant in $∥ u ∥_{L^{p^{*}} (R^{n})} \leq C ∥ D u ∥_{L^{p} (R^{n})}$ is $C_{n, p} = π^{- 1/2} n^{- 1/ p} (\frac{p - 1}{n - p})^{1 - 1/ p} (\frac{Γ ( 1 + n /2 ) Γ ( n )}{Γ ( n / p ) Γ ( 1 + n - n / p )})^{1/ n},$ and the extremal functions (where equality holds) are exactly the **Aubin-Talenti bubbles** $u (x) = (a + b ∣ x - x_{0} ∣^{p / (p - 1)})^{- (n - p) / p}, a, b > 0, x_{0} \in R^{n}$ ^{[Talenti 1976]} ^{[Aubin 1976]}. The extremals are unique modulo the symmetry group (translations, dilations, scalar multiples), and for $p = 2$ they are the standard bubbles solving the critical Lane-Emden equation $- Δ u = u^{(n + 2) / (n - 2)}$ that appear in the Yamabe problem.

Theorem 4 (Trudinger-Moser borderline; Trudinger 1967, Moser 1971). At the borderline $p = n$ the embedding $W^{1, n} ↪ L^{\infty}$ fails, but exponential integrability holds: there are constants $α_{n}, C_{n} > 0$ such that for $u \in W_{0}^{1, n} (U)$ , $U$ bounded, with $∥ D u ∥_{L^{n}} \leq 1$ , $\int_{U} exp (α_{n} ∣ u ∣^{n / (n - 1)}) d x \leq C_{n} ∣ U ∣,$ and the constant $α_{n} = n ω_{n - 1}^{1/ (n - 1)}$ (with $ω_{n - 1}$ the surface area of the unit sphere) is sharp: for any larger $α$ the supremum is infinite ^{[Trudinger 1967]} ^{[Moser 1971]}. The Trudinger-Moser inequality is the correct endpoint of the Sobolev ladder, replacing the failed $L^{\infty}$ bound by membership in the exponential Orlicz class; an alternative endpoint reading places $W^{1, n}$ in BMO and the John-Nirenberg space ^{[Brezis-Wainger 1980]}.

Theorem 5 (fractional Sobolev and trace; Gagliardo 1957, Aronszajn-Slobodeckij). For $s \in (0, 1)$ and $s p < n$ , the fractional Sobolev space $W^{s, p} (R^{n})$ , normed by the Gagliardo seminorm $[u]_{s, p}^{p} = \iint \frac{∣ u ( x ) - u ( y ) ∣ ^{p}}{∣ x - y ∣ ^{n + s p}} d x d y$ , embeds continuously into $L^{q}$ with $\frac{1}{q} = \frac{1}{p} - \frac{s}{n}$ , and the trace operator $u \mapsto u ∣_{\partial U}$ maps $W^{1, p} (U)$ onto $W^{1 - 1/ p, p} (\partial U)$ for $p > 1$ . The trace theorem is the precise statement that boundary values of $W^{1, p}$ functions lose exactly $1/ p$ of a derivative, the foundational fact for boundary-value problems in the weak formulation. The fractional scale interpolates the integer Sobolev spaces and is the natural setting for the nonlocal operators $(- Δ)^{s}$ and for boundary integral equations.

Synthesis. The Sobolev inequalities are the foundational reason the calculus of variations and the weak theory of PDE work at all, and the entire structure is generated by a single principle made precise in two ways: integrate a derivative to recover the function, then balance exponents by scaling. The GNS inequality is exactly this with $n$ axis integrations and the Loomis-Whitney product estimate; the Morrey inequality is exactly this with one radial integration and a Riesz potential; and the general embedding ladder of Theorem 1 is the iteration of these two, with the reciprocal-exponent rule $\frac{1}{q} = \frac{1}{p} - \frac{k}{n}$ as the bridge that makes integrability and differentiability a single tradeable currency. Putting these together, the central insight is that the critical exponent $p^{*}$ is not a parameter but a scaling invariant, and this is exactly why compactness fails there (Theorem 2): the dilation symmetry that fixes $p^{*}$ also produces the non-compact bubbling sequences, and the same Aubin-Talenti bubbles that saturate the sharp constant (Theorem 3) are the bubbles that obstruct compactness and that solve the critical Euler-Lagrange equations.

The borderline $p = n$ generalises the power-law conclusion to the exponential Trudinger-Moser endpoint, and the whole edifice generalises further: to fractional orders and trace spaces (Theorem 5), which appears again in 02.16.04 as the device that gives meaning to boundary data; to Riemannian manifolds, where the sharp constant controls the Yamabe invariant; and to the abstract theory of interpolation spaces, where the Sobolev embedding is a single instance of the real and complex interpolation functors. The arc from Sobolev's 1938 averaging lemma to the modern concentration-compactness method is one continuous refinement of the same fundamental-theorem-of-calculus argument, generalised until it became the load-bearing inequality of twentieth-century analysis.

Full proof set Master

Proposition 1 (Loomis-Whitney inequality). Let $f_{1}, \dots, f_{n}$ be non-negative measurable functions on $R^{n - 1}$ , and for $x \in R^{n}$ write $\overset{x}{^}_{i}$ for $x$ with the $i$ -th coordinate deleted. Then $\int_{R^{n}} i = 1 \prod n f_{i} (\overset{x}{^}_{i})^{1/ (n - 1)} d x \leq i = 1 \prod n (\int_{R^{n - 1}} f_{i})^{1/ (n - 1)} .$

Proof. Induct on $n$ using the generalized Hölder inequality with $n - 1$ equal exponents $n - 1$ . For $n = 2$ the claim is $\int_{R^{2}} f_{1} (x_{2}) f_{2} (x_{1}) d x = (\int f_{1}) (\int f_{2})$ , an equality by Tonelli. For the inductive step, integrate first in $x_{n}$ . The factor $f_{n} (\overset{x}{^}_{n})^{1/ (n - 1)}$ does not depend on $x_{n}$ and pulls out; the remaining $n - 1$ factors $f_{i} (\overset{x}{^}_{i})^{1/ (n - 1)}$ , $i < n$ , each depend on $x_{n}$ , and generalized Hölder with $n - 1$ exponents equal to $n - 1$ gives $\int_{R} i = 1 \prod n f_{i}^{1/ (n - 1)} d x_{n} \leq f_{n}^{1/ (n - 1)} i = 1 \prod n - 1 (\int_{R} f_{i} d x_{n})^{1/ (n - 1)} .$ Now integrate the result over the remaining variables $(x_{1}, \dots, x_{n - 1})$ and apply the inductive hypothesis in dimension $n - 1$ to the $n - 1$ functions $g_{i} = \int_{R} f_{i} d x_{n}$ (each a function of $n - 2$ of the variables, after the $x_{n}$ -integration), absorbing the leftover $f_{n}$ factor by Tonelli. The exponents collect to give the stated product bound. $□$

Proposition 2 (GNS by density on $W^{1, p} (R^{n})$ ). The inequality $∥ u ∥_{L^{p^{*}}} \leq C (n, p) ∥ D u ∥_{L^{p}}$ extends from $C_{c}^{1} (R^{n})$ to all $u \in W^{1, p} (R^{n})$ , $1 \leq p < n$ .

Proof. Let $u \in W^{1, p} (R^{n})$ . Smooth functions with compact support are dense in $W^{1, p} (R^{n})$ 24.01.01, so choose $u_{m} \in C_{c}^{\infty}$ with $u_{m} \to u$ in $W^{1, p}$ ; in particular $D u_{m} \to D u$ in $L^{p}$ . The Key Theorem gives $∥ u_{m} - u_{ℓ} ∥_{L^{p^{*}}} \leq C ∥ D (u_{m} - u_{ℓ}) ∥_{L^{p}} \to 0$ , so $(u_{m})$ is Cauchy in $L^{p^{*}}$ , with limit $v \in L^{p^{*}}$ . Passing to a subsequence, $u_{m} \to u$ almost everywhere (from $L^{p}$ convergence) and $u_{m} \to v$ almost everywhere (from $L^{p^{*}}$ convergence), so $v = u$ a.e. Then $∥ u ∥_{L^{p^{*}}} = lim ∥ u_{m} ∥_{L^{p^{*}}} \leq lim C ∥ D u_{m} ∥_{L^{p}} = C ∥ D u ∥_{L^{p}}$ . $□$

Proposition 3 (Morrey inequality). Let $n < p \leq \infty$ . There is $C = C (n, p)$ so that every $u \in C^{1} (R^{n})$ has, for all $x, y \in R^{n}$ , $∣ u (x) - u (y) ∣ \leq C ∥ D u ∥_{L^{p} (R^{n})} ∣ x - y ∣^{1 - n / p},$ and consequently $u$ (after redefinition on a null set, in the $W^{1, p}$ statement) lies in $C^{0, 1 - n / p}$ .

Proof. Fix $x, y$ , set $r = ∣ x - y ∣$ , and let $W = B (x, r) \cap B (y, r)$ , a set of measure $\geq c (n) r^{n}$ . For $z \in W$ , the oscillation bound of Exercise 7 applied at $x$ and at $y$ gives $\fint_{B (x, r)} ∣ u - u (x) ∣ \leq C ∥ D u ∥_{L^{p}} r^{1 - n / p}$ and likewise at $y$ . Average $u (x) - u (y) = (u (x) - u (z)) + (u (z) - u (y))$ over $z \in W$ : $∣ u (x) - u (y) ∣ \leq \fint_{W} ∣ u (x) - u (z) ∣ d z + \fint_{W} ∣ u (z) - u (y) ∣ d z .$ Since $W \subseteq B (x, r)$ and $∣ W ∣ \geq c (n) ∣ B (x, r) ∣$ , each average over $W$ is bounded by a constant times the corresponding average over $B (x, r)$ , hence by $C ∥ D u ∥_{L^{p}} r^{1 - n / p}$ . Adding, $∣ u (x) - u (y) ∣ \leq C (n, p) ∥ D u ∥_{L^{p}} ∣ x - y ∣^{1 - n / p}$ . Taking $sup ∣ u ∣ \leq ∣ u (x_{0}) ∣ + C ∥ D u ∥_{L^{p}}$ on any fixed ball bounds the sup norm; for $u \in W^{1, p}$ one applies the estimate to a smooth approximating sequence, which is then uniformly Hölder, hence uniformly convergent to a continuous representative. $□$

Proposition 4 (failure at $p = n$ ). In dimension $n \geq 2$ , $W^{1, n} (B_{1}) \neq ↪ L^{\infty} (B_{1})$ .

Proof. Take $u (x) = lo g lo g (1 + 1/∣ x ∣)$ near the origin (smoothly cut off near $∣ x ∣ = 1$ ). In polar coordinates, $∣ D u (x) ∣ \sim \frac{1}{∣ x ∣ ( 1 + ∣ x ∣ ) l o g ( 1 + 1/∣ x ∣ )} \sim \frac{1}{∣ x ∣ l o g ( 1/∣ x ∣ )}$ as $∣ x ∣ \to 0$ . Then $\int_{B_{1/2}} ∣ D u ∣^{n} d x \sim ω_{n - 1} \int_{0}^{1/2} \frac{r ^{n - 1}}{r ^{n} ( l o g ( 1/ r ) ) ^{n}} d r = ω_{n - 1} \int_{0}^{1/2} \frac{d r}{r ( l o g ( 1/ r ) ) ^{n}}$ , which converges for $n \geq 2$ by the substitution $s = lo g (1/ r)$ , $\int^{\infty} s^{- n} d s < \infty$ . So $u \in W^{1, n} (B_{1})$ . Yet $u (x) \to \infty$ as $∣ x ∣ \to 0$ , so $u \in / L^{\infty}$ . Hence the embedding into $L^{\infty}$ fails exactly at the borderline $p = n$ , where the Morrey exponent $1 - n / p$ degenerates to zero. $□$

Connections Master

The Sobolev space scaffolding — the definition of $W^{k, p}$ , the weak derivative, the density of smooth functions, and the extension and approximation theorems — is supplied by 24.01.01, which surveys the embedding theorems proved here in full. This unit owns the deep embedding theorems; 24.01.01 owns the space. The split follows the spec discipline that the foundational unit defines the object and the downstream unit builds the heavy theory.
The $L^{p}$ apparatus on which every estimate rests — Hölder's inequality, the generalized Hölder inequality with several exponents, Minkowski's inequality, and completeness — is developed in 02.07.06. The Loomis-Whitney product estimate is iterated generalized Hölder, and the bootstrap from $p = 1$ to general $p$ in the Key Theorem is a single application of Hölder with conjugate exponents $p$ and $p^{'}$ .
The fundamental-theorem-of-calculus and integration-along-segments steps, together with the chain rule $∣ D ∣ u ∣^{γ} ∣ = γ ∣ u ∣^{γ - 1} ∣ D u ∣$ used in the bootstrap, are the multivariable differentiation results of 02.05.04; the Morrey oscillation bound is a multivariable mean-value estimate along the segment joining two points.
The embeddings are the existence engine for weak solutions of elliptic boundary-value problems in 02.16.02: the direct method of the calculus of variations minimizes an energy over $W^{1, p}$ , and Rellich-Kondrachov compactness (Theorem 2) is exactly what upgrades a minimizing sequence's weak limit to a genuine minimizer. The trace theorem (Theorem 5) gives meaning to the boundary data, developed in 02.16.04.
The critical-exponent obstruction reappears in geometric analysis: the $p = 2$ sharp constant (Theorem 3) is the Yamabe constant of the round sphere, and the Aubin-Talenti bubbles are the concentration profiles in the Yamabe problem and in critical semilinear equations $- Δ u = u^{(n + 2) / (n - 2)}$ , linking to the variational and geometric PDE material in 02.16.05.

Historical & philosophical context Master

Sergei Sobolev introduced the spaces now bearing his name and proved the foundational embedding lemma in his 1938 Matematicheskii Sbornik paper ^{[Sobolev 1938]}, motivated by the Cauchy problem for hyperbolic equations and the need for a function-space framework in which weak solutions could be sought and bounded. Sobolev's averaging method (mollification by a smooth kernel) and his integral lemma gave the first general embedding $W^{k, p} ↪ L^{q}$ , establishing the reciprocal-exponent trade $1/ q = 1/ p - k / n$ that organizes the entire theory.

The sharp first-order inequalities were brought to their modern form by Emilio Gagliardo in his 1958 Ricerche di Matematica paper ^{[Gagliardo 1958]} and independently by Louis Nirenberg in his 1959 Annali della Scuola Normale Superiore di Pisa paper on elliptic equations ^{[Nirenberg 1959]}; the combinatorial product estimate at the heart of the $p = 1$ case had appeared a decade earlier in the 1949 note of Lynn Loomis and Hassler Whitney ^{[Loomis-Whitney 1949]}, who proved it as a discrete-geometric inequality bounding a set's measure by its coordinate projections. Charles Morrey's 1940 Duke Mathematical Journal paper ^{[Morrey 1940]} established the complementary regime, the embedding into Hölder spaces for $p > n$ , in his study of the differentiability of solutions of variational problems.

The sharp constants and extremal functions were found independently in 1976 by Giorgio Talenti ^{[Talenti 1976]}, using symmetric decreasing rearrangement and the radial Euler-Lagrange equation, and by Thierry Aubin ^{[Aubin 1976]} in the context of the Yamabe problem in Riemannian geometry; the resulting Aubin-Talenti bubbles became the canonical concentration profiles of critical-exponent problems. The borderline $p = n$ was settled by Neil Trudinger's 1967 Journal of Mathematics and Mechanics paper ^{[Trudinger 1967]} establishing exponential integrability, with the sharp constant determined by Jürgen Moser's 1971 Indiana University Mathematics Journal paper ^{[Moser 1971]}. The complementary BMO and John-Nirenberg reading of the endpoint was clarified by Haïm Brezis and Stephen Wainger in 1980 ^{[Brezis-Wainger 1980]}.

Bibliography Master

@article{Sobolev1938,
  author  = {Sobolev, Sergei L.},
  title   = {On a theorem of functional analysis},
  journal = {Matematicheskii Sbornik},
  volume  = {4(46)},
  year    = {1938},
  pages   = {471--497},
  note    = {English transl. AMS Translations (2) 34 (1963), 39--68}
}

@article{Gagliardo1958,
  author  = {Gagliardo, Emilio},
  title   = {Propriet\`a di alcune classi di funzioni in pi\`u variabili},
  journal = {Ricerche di Matematica},
  volume  = {7},
  year    = {1958},
  pages   = {102--137}
}

@article{Nirenberg1959,
  author  = {Nirenberg, Louis},
  title   = {On elliptic partial differential equations},
  journal = {Annali della Scuola Normale Superiore di Pisa, Serie 3},
  volume  = {13},
  year    = {1959},
  pages   = {115--162}
}

@article{Morrey1940,
  author  = {Morrey, Charles B.},
  title   = {Functions of several variables and absolute continuity, II},
  journal = {Duke Mathematical Journal},
  volume  = {6},
  year    = {1940},
  pages   = {187--215}
}

@article{LoomisWhitney1949,
  author  = {Loomis, Lynn H. and Whitney, Hassler},
  title   = {An inequality related to the isoperimetric inequality},
  journal = {Bulletin of the American Mathematical Society},
  volume  = {55},
  year    = {1949},
  pages   = {961--962}
}

@article{Talenti1976,
  author  = {Talenti, Giorgio},
  title   = {Best constant in {S}obolev inequality},
  journal = {Annali di Matematica Pura ed Applicata},
  volume  = {110},
  year    = {1976},
  pages   = {353--372}
}

@article{Aubin1976,
  author  = {Aubin, Thierry},
  title   = {Probl\`emes isop\'erim\'etriques et espaces de {S}obolev},
  journal = {Journal of Differential Geometry},
  volume  = {11},
  year    = {1976},
  pages   = {573--598}
}

@article{Trudinger1967,
  author  = {Trudinger, Neil S.},
  title   = {On imbeddings into {O}rlicz spaces and some applications},
  journal = {Journal of Mathematics and Mechanics},
  volume  = {17},
  year    = {1967},
  pages   = {473--483}
}

@article{Moser1971,
  author  = {Moser, J\"urgen},
  title   = {A sharp form of an inequality by {N}. {T}rudinger},
  journal = {Indiana University Mathematics Journal},
  volume  = {20},
  year    = {1971},
  pages   = {1077--1092}
}

@article{BrezisWainger1980,
  author  = {Brezis, Ha\"im and Wainger, Stephen},
  title   = {A note on limiting cases of {S}obolev embeddings and convolution inequalities},
  journal = {Communications in Partial Differential Equations},
  volume  = {5},
  year    = {1980},
  pages   = {773--789}
}

@book{Evans2010,
  author    = {Evans, Lawrence C.},
  title     = {Partial Differential Equations},
  edition   = {2},
  publisher = {American Mathematical Society},
  series    = {Graduate Studies in Mathematics},
  volume    = {19},
  year      = {2010}
}

@book{GilbargTrudinger1983,
  author    = {Gilbarg, David and Trudinger, Neil S.},
  title     = {Elliptic Partial Differential Equations of Second Order},
  edition   = {2},
  publisher = {Springer},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {224},
  year      = {1983}
}

@book{AdamsFournier2003,
  author    = {Adams, Robert A. and Fournier, John J. F.},
  title     = {Sobolev Spaces},
  edition   = {2},
  publisher = {Academic Press},
  year      = {2003}
}

@book{Mazya2011,
  author    = {Maz'ya, Vladimir},
  title     = {Sobolev Spaces},
  edition   = {2},
  publisher = {Springer},
  series    = {Grundlehren der mathematischen Wissenschaften},
  volume    = {342},
  year      = {2011}
}

@book{LiebLoss2001,
  author    = {Lieb, Elliott H. and Loss, Michael},
  title     = {Analysis},
  edition   = {2},
  publisher = {American Mathematical Society},
  series    = {Graduate Studies in Mathematics},
  volume    = {14},
  year      = {2001}
}

Prerequisites

24.01.01
02.07.06
02.05.04

Tier anchors

beginner: Strogatz-style scaling-and-units intuition for why a function's size is controlled by the size of its slope; Tao's blog essay 'Amplitude-frequency dualities and the heat equation' for the heuristic that integrating a derivative recovers the function
intermediate: Evans, Partial Differential Equations, 2e (AMS GSM 19, 2010), §5.6.1-§5.6.2 (Gagliardo-Nirenberg-Sobolev and Morrey inequalities); Brezis, Functional Analysis, Sobolev Spaces and PDEs (Springer 2011), §9.3
master: Evans §5.6-§5.8; Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §7.7-§7.8; Adams-Fournier, Sobolev Spaces, 2e (Academic Press 2003), Ch. 4-5; Maz'ya, Sobolev Spaces, 2e (Springer 2011), Ch. 1; Lieb-Loss, Analysis, 2e (AMS 2001), §8.3 (sharp constants, Talenti-Aubin)

References

Sobolev — On a theorem of functional analysis · Matematicheskii Sbornik 4(46) (1938), 471-497; English transl. AMS Translations (2) 34 (1963), 39-68
Gagliardo — Proprietà di alcune classi di funzioni in più variabili · Ricerche di Matematica 7 (1958), 102-137
Nirenberg — On elliptic partial differential equations · Annali della Scuola Normale Superiore di Pisa (3) 13 (1959), 115-162
Morrey — Functions of several variables and absolute continuity, II · Duke Mathematical Journal 6 (1940), 187-215
Loomis-Whitney — An inequality related to the isoperimetric inequality · Bulletin of the American Mathematical Society 55 (1949), 961-962
Talenti — Best constant in Sobolev inequality · Annali di Matematica Pura ed Applicata 110 (1976), 353-372
Aubin — Problèmes isopérimétriques et espaces de Sobolev · Journal of Differential Geometry 11 (1976), 573-598
Trudinger — On imbeddings into Orlicz spaces and some applications · Journal of Mathematics and Mechanics 17 (1967), 473-483
Moser — A sharp form of an inequality by N. Trudinger · Indiana University Mathematics Journal 20 (1971), 1077-1092
Brezis-Wainger — A note on limiting cases of Sobolev embeddings and convolution inequalities · Communications in Partial Differential Equations 5 (1980), 773-789
Evans — Partial Differential Equations, 2e · AMS Graduate Studies in Mathematics 19 (2010), §5.6
Gilbarg-Trudinger — Elliptic Partial Differential Equations of Second Order, 2e · Springer Grundlehren 224 (1983), §7.7-§7.8
Adams-Fournier — Sobolev Spaces, 2e · Academic Press (2003), Ch. 4-5
Maz'ya — Sobolev Spaces, 2e · Springer Grundlehren 342 (2011), Ch. 1
Lieb-Loss — Analysis, 2e · AMS Graduate Studies in Mathematics 14 (2001), §8.3

Estimated time

beginner: 25m
intermediate: 65m
master: 105m