02.16.03 · analysis / sobolev-weak-solutions

The Rellich-Kondrachov Compactness Theorem and the Poincaré Inequalities

shipped3 tiersLean: none

Anchor (Master): Evans §5.7-§5.8; Adams-Fournier, Sobolev Spaces, 2e (Academic Press 2003), Ch. 6 (compact embeddings); Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §7.10; Maz'ya, Sobolev Spaces, 2e (Springer 2011), Ch. 1-2; Hanche-Olsen-Holden, The Kolmogorov-Riesz compactness theorem (Expositiones Mathematicae 2010)

Intuition Beginner

Imagine a family of functions that all live in a fixed box and all obey the same speed limit on their slope. They can wiggle, they can shift around, but they cannot be too tall and they cannot change too fast. The compactness theorem of this unit says that such a family is, in a strong sense, crowded: you can always pull out a sequence from it that settles down and converges. No matter how the functions in the family scatter, the slope budget plus the fixed box prevent them from running off to infinity or oscillating forever, so some sequence must bunch up around a single limiting function.

Why is this worth a theorem? Because most existence proofs in physics and geometry work by setting up a sequence that gets closer and closer to solving a problem, then extracting a limit and showing the limit is the answer. The danger is that the sequence might not have a limit at all: it could spread its energy thinner and thinner, or push a bump out toward the edge and lose it. Controlling the slope inside a bounded region is exactly the ticket that forbids these escapes and guarantees a usable limit.

The companion inequalities of this unit are the slope-budget bookkeeping made precise. The Poincaré inequality says that if a function is pinned to zero on the boundary of a bounded region, then its overall size is controlled entirely by the size of its slope: a function that starts at zero on the edge and never changes fast cannot grow large in the middle. The slope alone caps the height.

There is a wrinkle when the function is not pinned at the boundary. A constant function has zero slope but is not zero, so slope alone cannot control size. The fix is to measure size after subtracting the average value: the spread of a function around its own average is controlled by its slope. This is the Poincaré-Wirtinger inequality, and it says a function with a small slope budget cannot stray far from its mean.

A picture for all three: think of a taut sheet clamped along the rim of a frame. If the clamp holds the edge at height zero and the sheet cannot crease too sharply, the whole sheet stays close to zero. If instead the sheet floats free, it can sit at any height, but its bumps above and below its own resting level are still limited by how sharply it is allowed to crease. And a whole family of such sheets, all clamped in the same frame with the same crease limit, has the crowding property: some sequence of them must converge.

Visual Beginner

The single picture to hold is a bounded box holding a crowded family of slope-limited functions, with the two Poincaré statements drawn alongside.

Read the three panels left to right. The left panel is the compactness statement: a whole family of functions sharing one height cap and one slope limit inside a fixed box is so crowded that you can always extract a converging sequence. The thing that prevents escape is the combination of the box, which stops bumps from drifting away, and the slope limit, which stops endless oscillation.

The middle panel is the Poincaré inequality. The curve is clamped to zero at the edges of the frame, so it cannot float up; its only way to gain height is to climb away from the baseline, which costs slope. The shaded area, a stand-in for the total size of the function, is therefore bounded by how much slope the function spends.

The right panel is the Poincaré-Wirtinger inequality, the version for functions that are not clamped. Now the curve can float to any height, so we measure its size not from the baseline but from its own average, the dashed line. The bumps above and below that average line are again paid for in slope, so the spread around the average is capped by the slope budget.

Worked example Beginner

We test the Poincaré idea on the simplest clamped function and watch the slope-controls-size bookkeeping work out. Take the region to be the interval from zero to one on the number line, and the function $u (x) = x (1 - x)$ , a single hump that is zero at both ends, which is exactly the clamped-at-the-boundary condition.

Step 1. Confirm the clamp. At the left end, $u (0) = 0 \times 1 = 0$ . At the right end, $u (1) = 1 \times 0 = 0$ . The function is pinned to zero on the boundary of the interval, so the Poincaré inequality applies.

Step 2. Measure the size of the function. The peak is at the middle, where $u (1/2) = (1/2) (1/2) = 1/4$ . So the largest the function ever gets is one quarter. Its size is modest, capped at $0.25$ .

Step 3. Measure the size of the slope. The slope is $u^{'} (x) = 1 - 2 x$ . At the left end it is $1 - 0 = 1$ ; at the right end it is $1 - 2 = - 1$ . So the slope ranges between minus one and plus one, and its largest magnitude is one.

Step 4. Compare the two. The size of the function, one quarter, is smaller than the size of the slope, one. The Poincaré inequality is the promise that this direction of comparison always holds for clamped functions on this interval: the function's size is at most a fixed constant times the slope's size. Here a constant of one already works comfortably, with room to spare.

Step 5. See why the clamp matters. Replace the hump by the constant function $u (x) = 5$ . Its slope is zero everywhere, but its size is five, not zero. A constant times zero can never reach five, so no Poincaré inequality can hold for it. The escape is allowed precisely because the constant is not clamped to zero at the boundary; remove the clamp and slope stops controlling size.

What this tells us: when a function is held at zero on the edge of a bounded region, its slope budget alone caps how big it can get, and the constant in the cap depends only on the region. The constant-function failure in the last step is the warning that the clamp, or in the unclamped version the subtraction of the average, is what makes the inequality possible.

Check your understanding Beginner

Exercise (easy, multiple choice).

What does the Rellich-Kondrachov compactness theorem guarantee for a family of functions sharing one height cap and one slope limit inside a bounded region?

A. Every function in the family is the same. B. Some sequence drawn from the family converges to a limiting function. C. The functions all have the same average value. D. No function in the family can ever be zero.

Hint

Compactness is about crowding: a controlled family is so packed that a converging sequence can always be extracted.

Answer

B. Some sequence drawn from the family converges to a limiting function. The combination of a bounded region and a slope limit forbids escape by spreading out or by drifting bumps to the edge, so the family is crowded enough that a converging sequence can always be pulled out. Feedback-correct: compactness is the extraction of a convergent sequence. Feedback-wrong: A, C, and D assert sameness or constraints the theorem never claims; the functions may differ widely and still admit a converging subsequence.

Formal definition Intermediate+

Throughout, $n \geq 1$ , $1 \leq p < \infty$ , and $Ω \subseteq R^{n}$ is open and bounded. We write $W^{1, p} (Ω)$ for the Sobolev space with norm $∥ u ∥_{W^{1, p} (Ω)} = (∥ u ∥_{L^{p} (Ω)}^{p} + ∥ D u ∥_{L^{p} (Ω)}^{p})^{1/ p}$ , and $W_{0}^{1, p} (Ω)$ for the closure of $C_{c}^{\infty} (Ω)$ in this norm; both, together with the $L^{p}$ machinery, Hölder's inequality, and Minkowski's inequality, are taken as available 02.07.06. The critical Sobolev exponent $p^{*} = n p / (n - p)$ for $1 \leq p < n$ is from 02.16.01, and the bounded extension operator $E : W^{1, p} (Ω) \to W^{1, p} (R^{n})$ on a bounded $C^{1}$ domain is from 02.16.02. For $u \in L^{1} (Ω)$ and a measurable $A \subseteq Ω$ with $∣ A ∣ > 0$ we write $(u)_{A} = \fint_{A} u d x = \frac{1}{∣ A ∣} \int_{A} u d x$ for the average. For $h \in R^{n}$ the translation operator is $(τ_{h} u) (x) = u (x + h)$ .

Definition (compact embedding). A continuous embedding $X ↪ Y$ of Banach spaces is compact, written $X ↪↪ Y$ , if the inclusion map sends bounded subsets of $X$ to precompact subsets of $Y$ : every sequence bounded in the $X$ -norm has a subsequence converging in the $Y$ -norm. Equivalently, the inclusion is a compact operator in the sense of 02.11.05.

Definition (Fréchet-Kolmogorov / Kolmogorov-Riesz precompactness). A subset $F \subseteq L^{p} (R^{n})$ , $1 \leq p < \infty$ , is precompact (has compact closure) if and only if three conditions hold:

(boundedness) $sup_{f \in F} ∥ f ∥_{L^{p} (R^{n})} < \infty$ ;
(uniform equicontinuity of translation) $sup_{f \in F} ∥ τ_{h} f - f ∥_{L^{p} (R^{n})} \to 0$ as $h \to 0$ ;
(uniform decay at infinity) $sup_{f \in F} ∥ f ∥_{L^{p} (R^{n} ∖ B (0, R))} \to 0$ as $R \to \infty$ .

This is the $L^{p}$ analogue of the Arzelà-Ascoli theorem: equicontinuity of translation replaces equicontinuity of values, and uniform decay at infinity replaces a uniform domain bound ^{[Kolmogorov 1931]} ^{[Riesz 1933]} ^{[Hanche-Olsen-Holden 2010]}.

Definition (Poincaré inequality). $Ω$ admits a Poincaré inequality at exponent $p$ if there is a constant $C = C (n, p, Ω)$ with $∥ u ∥_{L^{p} (Ω)} \leq C ∥ D u ∥_{L^{p} (Ω)} for all u \in W_{0}^{1, p} (Ω) .$ The smallest admissible $C$ is the reciprocal of the square root of the first Dirichlet eigenvalue of $- Δ$ when $p = 2$ ; the inequality fails without the zero-boundary restriction, since constants are excluded only by it.

Definition (Poincaré-Wirtinger inequality). A bounded connected open $Ω$ with $C^{1}$ boundary (more generally an extension domain) admits a Poincaré-Wirtinger inequality at exponent $p$ if there is a constant $C = C (n, p, Ω)$ with $∥ u - (u)_{Ω} ∥_{L^{p} (Ω)} \leq C ∥ D u ∥_{L^{p} (Ω)} for all u \in W^{1, p} (Ω) .$ Connectedness is essential: on a disconnected $Ω$ a function constant on each piece has zero gradient but nonzero variance around the global mean, defeating the bound.

Counterexamples to common slips Intermediate+

Compactness is strictly subcritical. The embedding $W^{1, p} (Ω) ↪ L^{q} (Ω)$ is compact for $1 \leq q < p^{*}$ but only bounded, never compact, at $q = p^{*}$ . The dilating bubble $u_{ε} (x) = ε^{- (n - p) / p} ϕ (x / ε)$ stays bounded in $W^{1, p}$ yet has no $L^{p^{*}}$ -convergent subsequence: its mass concentrates at a point. Compactness is exactly the gap between the bound and the critical exponent.
Poincaré needs the boundary condition; Poincaré-Wirtinger needs the mean subtraction. On $W^{1, p} (Ω)$ without either device the constant $u \equiv 1$ has $D u = 0$ but positive $L^{p}$ norm, so neither inequality can hold with the bare gradient on the right. The zero trace (for Poincaré) and the subtraction of $(u)_{Ω}$ (for Poincaré-Wirtinger) are the two distinct ways to quotient out the constants.
Boundedness of $Ω$ is essential. On an unbounded $Ω$ the Poincaré inequality fails: rescaling a fixed bump $u_{λ} (x) = u (x / λ)$ makes $∥ u_{λ} ∥_{L^{p}} /∥ D u_{λ} ∥_{L^{p}} = λ ∥ u ∥_{L^{p}} /∥ D u ∥_{L^{p}} \to \infty$ as $λ \to \infty$ . A finite diameter is what gives the constant; on a slab bounded in one direction the inequality survives, using only the bounded direction.
Connectedness is essential for Poincaré-Wirtinger. On $Ω = B (0, 1) \cup B (10, 1)$ (two disjoint balls) the function equal to $0$ on the first ball and $1$ on the second has $D u = 0$ but $∥ u - (u)_{Ω} ∥_{L^{p}} > 0$ . No Poincaré-Wirtinger constant can hold; connectedness forbids exactly this locally-constant-but-globally-varying escape.

Key theorem with proof Intermediate+

Theorem (Rellich-Kondrachov). Let $Ω \subseteq R^{n}$ be bounded with $C^{1}$ boundary and $1 \leq p < n$ . Then for every $q$ with $1 \leq q < p^{*} = n p / (n - p)$ , the embedding $W^{1, p} (Ω) ↪↪ L^{q} (Ω)$ is compact: every sequence bounded in $W^{1, p} (Ω)$ has a subsequence converging strongly in $L^{q} (Ω)$ . (For $p \geq n$ the embedding $W^{1, p} (Ω) ↪↪ L^{q} (Ω)$ is compact for every $q \in [1, \infty)$ .) ^{[Rellich 1930]} ^{[Kondrachov 1945]} ^{[Evans 2010 §5.7]}.

Proof. Let $(u_{m})$ be bounded in $W^{1, p} (Ω)$ , say $∥ u_{m} ∥_{W^{1, p} (Ω)} \leq M$ .

Step 1 (extend and cut off). Apply the extension operator $E$ of 02.16.02: the functions $\overset{u}{ˉ}_{m} = E u_{m}$ are supported in a fixed bounded open set $V \supset\supset Ω$ and satisfy $∥ \overset{u}{ˉ}_{m} ∥_{W^{1, p} (R^{n})} \leq C M$ , with $\overset{u}{ˉ}_{m} = u_{m}$ on $Ω$ . It suffices to show $(\overset{u}{ˉ}_{m})$ is precompact in $L^{q} (V)$ ; restriction to $Ω$ then gives the conclusion. Because $V$ is bounded the uniform-decay-at-infinity condition is automatic, so by the Fréchet-Kolmogorov criterion it remains to establish the uniform translation estimate.

Step 2 (translation estimate for smooth functions). First suppose $v \in C_{c}^{\infty} (R^{n})$ . For $h \in R^{n}$ , write $v (x + h) - v (x) = \int_{0}^{1} D v (x + t h) \cdot h d t$ , so by Minkowski's integral inequality and a change of variables $∥ τ_{h} v - v ∥_{L^{1} (R^{n})} \leq ∣ h ∣ \int_{0}^{1} ∥ D v (\cdot + t h) ∥_{L^{1}} d t = ∣ h ∣ ∥ D v ∥_{L^{1} (R^{n})} .$ By density of $C_{c}^{\infty}$ in $W^{1, 1} (R^{n})$ and in $W^{1, p} (R^{n})$ , the same estimate holds for every $\overset{u}{ˉ}_{m}$ : $∥ τ_{h} \overset{u}{ˉ}_{m} - \overset{u}{ˉ}_{m} ∥_{L^{1} (R^{n})} \leq ∣ h ∣ ∥ D \overset{u}{ˉ}_{m} ∥_{L^{1} (V^{'})} \leq ∣ h ∣ ∣ V^{'} ∣^{1/ p^{'}} ∥ D \overset{u}{ˉ}_{m} ∥_{L^{p}} \leq C ∣ h ∣ M,$ where $V^{'}$ is a fixed bounded neighbourhood containing all the translates and Hölder's inequality converts the $L^{1}$ gradient norm to the $L^{p}$ one on the finite-measure set $V^{'}$ .

Step 3 (upgrade the translation estimate to $L^{q}$ by interpolation). Fix $q$ with $1 \leq q < p^{*}$ . The Sobolev embedding 02.16.01 gives $∥ \overset{u}{ˉ}_{m} ∥_{L^{p^{*}} (R^{n})} \leq C ∥ \overset{u}{ˉ}_{m} ∥_{W^{1, p}} \leq C M$ , hence $∥ τ_{h} \overset{u}{ˉ}_{m} - \overset{u}{ˉ}_{m} ∥_{L^{p^{*}}} \leq 2 C M$ as well. Choose $θ \in (0, 1]$ with $\frac{1}{q} = θ + \frac{1 - θ}{p ^{*}}$ (possible exactly because $q < p^{*}$ , so $q \geq 1$ interpolates between $L^{1}$ and $L^{p^{*}}$ ). The interpolation inequality for $L^{q}$ norms gives $∥ τ_{h} \overset{u}{ˉ}_{m} - \overset{u}{ˉ}_{m} ∥_{L^{q}} \leq ∥ τ_{h} \overset{u}{ˉ}_{m} - \overset{u}{ˉ}_{m} ∥_{L^{1}}^{θ} ∥ τ_{h} \overset{u}{ˉ}_{m} - \overset{u}{ˉ}_{m} ∥_{L^{p^{*}}}^{1 - θ} \leq (C ∣ h ∣ M)^{θ} (2 C M)^{1 - θ} = C^{'} M ∣ h ∣^{θ} .$ The right side is independent of $m$ and tends to zero as $h \to 0$ : the family $(\overset{u}{ˉ}_{m})$ has uniformly equicontinuous translation in $L^{q}$ .

Step 4 (apply Fréchet-Kolmogorov and extract). The family $(\overset{u}{ˉ}_{m})$ is bounded in $L^{q} (R^{n})$ (by the $L^{p^{*}}$ bound and finite measure of $V$ ), supported in the fixed bounded set $V$ (uniform decay), and uniformly equicontinuous under translation in $L^{q}$ (Step 3). By the Fréchet-Kolmogorov compactness criterion the family is precompact in $L^{q} (R^{n})$ , so $(\overset{u}{ˉ}_{m})$ has a subsequence converging in $L^{q}$ . Restricting that subsequence to $Ω$ gives a subsequence of $(u_{m})$ converging in $L^{q} (Ω)$ . $□$

Bridge. The compactness is exactly the strict-subcriticality gap of the Sobolev embedding 02.16.01 turned into a compact operator 02.11.05: at $q = p^{*}$ the embedding is only bounded, and the dilation symmetry that fixes $p^{*}$ also produces the concentrating bubbles that destroy compactness, so the foundational reason the theorem stops short of $p^{*}$ is the same scaling invariance that forced $p^{*}$ in the first place. The engine is the translation estimate $∥ τ_{h} u - u ∥_{L^{1}} \leq ∣ h ∣∥ D u ∥_{L^{1}}$ , which is the integral form of the fundamental theorem of calculus and is dual to the trace estimate of 02.16.02 — there one integrated the normal derivative to the boundary, here one integrates the gradient along the translation vector. Putting these together, the central insight is that a bounded slope budget converts a bounded family into a precompact one, and this is exactly what the direct method of the calculus of variations needs: it builds toward the existence of weak solutions in 02.16.04, where a minimizing sequence's weak $W^{1, p}$ limit is upgraded to a strong $L^{q}$ limit precisely by Rellich-Kondrachov, and it appears again in the spectral theory of the Laplacian, whose discrete spectrum is the compactness of this same embedding read through 02.11.05.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Prove the translation estimate that powers the compactness theorem: for $u \in W^{1, 1} (R^{n})$ and $h \in R^{n}$ , $∥ τ_{h} u - u ∥_{L^{1} (R^{n})} \leq ∣ h ∣ ∥ D u ∥_{L^{1} (R^{n})} .$ Reduce to $u \in C_{c}^{\infty}$ by density.

Hint

For smooth $u$ , write $u (x + h) - u (x) = \int_{0}^{1} \frac{d}{d t} u (x + t h) d t = \int_{0}^{1} D u (x + t h) \cdot h d t$ , integrate $∣ \cdot ∣$ over $x$ , and swap the order with Tonelli; the inner integral over $x$ of $∣ D u (x + t h) ∣$ is translation-invariant.

Answer

For $u \in C_{c}^{\infty} (R^{n})$ the fundamental theorem of calculus along the segment gives $u (x + h) - u (x) = \int_{0}^{1} D u (x + t h) \cdot h d t$ , so $∣ u (x + h) - u (x) ∣ \leq ∣ h ∣ \int_{0}^{1} ∣ D u (x + t h) ∣ d t$ . Integrating over $x$ and applying Tonelli to swap the order, $∥ τ_{h} u - u ∥_{L^{1}} \leq ∣ h ∣ \int_{0}^{1} (\int_{R^{n}} ∣ D u (x + t h) ∣ d x) d t = ∣ h ∣ \int_{0}^{1} ∥ D u ∥_{L^{1}} d t = ∣ h ∣ ∥ D u ∥_{L^{1}},$ using translation invariance of Lebesgue measure for the inner integral. For general $u \in W^{1, 1} (R^{n})$ , take $u_{m} \in C_{c}^{\infty}$ with $u_{m} \to u$ and $D u_{m} \to D u$ in $L^{1}$ ; both sides pass to the limit since translation is an $L^{1}$ -isometry, giving the estimate for $u$ . This is the equicontinuity-of-translation input to Fréchet-Kolmogorov.

Exercise 4 (medium, symbolic).

Prove the Poincaré inequality on $W_{0}^{1, p} (Ω)$ for bounded $Ω$ contained in a slab ${0 < x_{n} < d}$ , with the explicit constant $C = d$ : for $u \in C_{c}^{\infty} (Ω)$ , $∥ u ∥_{L^{p} (Ω)} \leq d ∥ D_{n} u ∥_{L^{p} (Ω)} \leq d ∥ D u ∥_{L^{p} (Ω)} .$

Hint

Extend $u$ by zero to the slab. Since $u (x^{'}, 0) = 0$ , write $u (x^{'}, x_{n}) = \int_{0}^{x_{n}} D_{n} u (x^{'}, t) d t$ , apply Hölder in $t$ to bound $∣ u ∣^{p}$ , then integrate over the slab.

Answer

Extend $u \in C_{c}^{\infty} (Ω)$ by zero to the slab $S = {0 < x_{n} < d}$ . Since $u (x^{'}, 0) = 0$ , the fundamental theorem of calculus gives $u (x^{'}, x_{n}) = \int_{0}^{x_{n}} D_{n} u (x^{'}, t) d t$ , so by Hölder with conjugate exponents $p, p^{'}$ , $∣ u (x^{'}, x_{n}) ∣^{p} \leq (\int_{0}^{x_{n}} ∣ D_{n} u (x^{'}, t) ∣ d t)^{p} \leq x_{n}^{p / p^{'}} \int_{0}^{x_{n}} ∣ D_{n} u (x^{'}, t) ∣^{p} d t \leq d^{p - 1} \int_{0}^{d} ∣ D_{n} u (x^{'}, t) ∣^{p} d t .$ Integrate over $x_{n} \in (0, d)$ : $\int_{0}^{d} ∣ u (x^{'}, x_{n}) ∣^{p} d x_{n} \leq d^{p - 1} \cdot d \int_{0}^{d} ∣ D_{n} u ∣^{p} d t = d^{p} \int_{0}^{d} ∣ D_{n} u ∣^{p} d t$ . Now integrate over $x^{'} \in R^{n - 1}$ : $∥ u ∥_{L^{p} (S)}^{p} \leq d^{p} ∥ D_{n} u ∥_{L^{p} (S)}^{p}$ , i.e. $∥ u ∥_{L^{p}} \leq d ∥ D_{n} u ∥_{L^{p}} \leq d ∥ D u ∥_{L^{p}}$ . Density of $C_{c}^{\infty} (Ω)$ in $W_{0}^{1, p} (Ω)$ extends the bound to the whole space. Only one bounded direction was used, so the inequality survives on slabs.

Exercise 6 (medium, symbolic).

Prove the Poincaré-Wirtinger inequality by a compactness-contradiction argument: assuming Rellich-Kondrachov on a bounded connected $C^{1}$ domain $Ω$ , show there is $C$ with $∥ u - (u)_{Ω} ∥_{L^{p}} \leq C ∥ D u ∥_{L^{p}}$ for all $u \in W^{1, p} (Ω)$ .

Hint

If no constant works, there are $u_{k}$ with $∥ u_{k} - (u_{k})_{Ω} ∥_{L^{p}} = 1$ but $∥ D u_{k} ∥_{L^{p}} \to 0$ . Set $v_{k} = u_{k} - (u_{k})_{Ω}$ , so $(v_{k})_{Ω} = 0$ , $∥ v_{k} ∥_{L^{p}} = 1$ , $∥ D v_{k} ∥_{L^{p}} \to 0$ . Use Rellich-Kondrachov to extract an $L^{p}$ -convergent subsequence and identify the limit.

Answer

Suppose no such $C$ exists. Then for each $k$ there is $u_{k}$ with $∥ u_{k} - (u_{k})_{Ω} ∥_{L^{p}} > k ∥ D u_{k} ∥_{L^{p}}$ . Set $v_{k} = (u_{k} - (u_{k})_{Ω}) /∥ u_{k} - (u_{k})_{Ω} ∥_{L^{p}}$ , so $(v_{k})_{Ω} = 0$ , $∥ v_{k} ∥_{L^{p}} = 1$ , and $∥ D v_{k} ∥_{L^{p}} < 1/ k \to 0$ . Then $(v_{k})$ is bounded in $W^{1, p} (Ω)$ , so by Rellich-Kondrachov a subsequence converges in $L^{p} (Ω)$ to some $v$ with $∥ v ∥_{L^{p}} = 1$ and $(v)_{Ω} = 0$ . Since $D v_{k} \to 0$ in $L^{p}$ and $v_{k} \to v$ in $L^{p}$ , for every $φ \in C_{c}^{\infty} (Ω)$ , $\int_{Ω} v D_{i} φ = lim \int_{Ω} v_{k} D_{i} φ = - lim \int_{Ω} D_{i} v_{k} φ = 0$ , so $D v = 0$ weakly. On a connected domain a function with vanishing weak gradient is constant, so $v \equiv c$ ; but $(v)_{Ω} = 0$ forces $c = 0$ , contradicting $∥ v ∥_{L^{p}} = 1$ . Hence the constant $C$ exists. Connectedness entered exactly at the " $D v = 0 \Rightarrow v$ constant" step.

Exercise 7 (hard, symbolic).

State and verify the Fréchet-Kolmogorov criterion on a fixed bounded set: show that a family $F \subseteq L^{p} (R^{n})$ supported in a fixed bounded $V$ , bounded in $L^{p}$ , with $sup_{f \in F} ∥ τ_{h} f - f ∥_{L^{p}} \to 0$ as $h \to 0$ , is precompact in $L^{p}$ . Sketch the mollification argument.

Hint

Mollify: $f * ρ_{δ}$ . The translation hypothesis bounds $∥ f * ρ_{δ} - f ∥_{L^{p}}$ uniformly in $f$ (it is an average of translations). For fixed $δ$ , the mollified family ${f * ρ_{δ}}$ is bounded and equicontinuous on the compact $\overset{ˉ}{V}$ , hence precompact in $C (\overset{ˉ}{V})$ by Arzelà-Ascoli, so precompact in $L^{p} (V)$ . Combine via a total-boundedness/ $ε$ -net argument.

Answer

Let $ρ_{δ} (y) = δ^{- n} ρ (y / δ)$ be a standard mollifier, $ρ \geq 0$ , $\int ρ = 1$ , $supp ρ \subseteq B (0, 1)$ . For $f \in F$ , $(f * ρ_{δ}) (x) - f (x) = \int_{B (0, δ)} ρ_{δ} (y) (f (x - y) - f (x)) d y$ , so by Minkowski's integral inequality $∥ f * ρ_{δ} - f ∥_{L^{p}} \leq \int ρ_{δ} (y) ∥ τ_{- y} f - f ∥_{L^{p}} d y \leq sup_{∣ y ∣ \leq δ} ∥ τ_{- y} f - f ∥_{L^{p}} =: ω (δ)$ , and $ω (δ) \to 0$ uniformly in $f$ by hypothesis. Fix $ε > 0$ and choose $δ$ with $ω (δ) < ε /2$ . For this $δ$ , the mollified family ${f * ρ_{δ} : f \in F}$ is uniformly bounded ( $∣ f * ρ_{δ} ∣ \leq ∥ ρ_{δ} ∥_{L^{p^{'}}} ∥ f ∥_{L^{p}} \leq C_{δ}$ ) and equicontinuous ( $∣ f * ρ_{δ} (x) - f * ρ_{δ} (x^{'}) ∣ \leq ∥ f ∥_{L^{p}} ∥ τ_{x - x^{'}} ρ_{δ} - ρ_{δ} ∥_{L^{p^{'}}}$ , small in $∣ x - x^{'} ∣$ uniformly in $f$ since $ρ_{δ}$ is fixed and smooth), so on the compact $\overset{ˉ}{V}$ Arzelà-Ascoli makes it precompact in $C (\overset{ˉ}{V})$ , hence in $L^{p} (V)$ : it has a finite $(ε /2)$ -net in $L^{p}$ . The same finite set is an $ε$ -net for $F$ itself, because each $f$ is within $ε /2$ of its mollification. Since $F$ admits a finite $ε$ -net for every $ε$ , it is totally bounded, hence precompact in the complete space $L^{p}$ . (Uniform decay at infinity is supplied here by the fixed support $V$ .)

Exercise 8 (hard, symbolic).

Show compactness fails at the critical exponent. For $1 \leq p < n$ and a fixed $ϕ \in C_{c}^{\infty} (B (0, 1))$ with $∥ ϕ ∥_{L^{p^{*}}} = 1$ , define $u_{ε} (x) = ε^{- (n - p) / p} ϕ (x / ε)$ . Show $(u_{ε})$ is bounded in $W^{1, p}$ but has no subsequence converging in $L^{p^{*}}$ , so $W^{1, p} (Ω) ↪ L^{p^{*}} (Ω)$ is not compact.

Hint

Compute $∥ u_{ε} ∥_{L^{p^{*}}}$ and $∥ D u_{ε} ∥_{L^{p}}$ by the change of variables $y = x / ε$ ; check both are $ε$ -independent. For non-convergence, note $u_{ε} \to 0$ a.e. but $∥ u_{ε} ∥_{L^{p^{*}}}$ stays equal to $1$ , so any $L^{p^{*}}$ limit would be $0$ , contradicting the norm.

Answer

With $y = x / ε$ , $d x = ε^{n} d y$ . The exponent $(n - p) / p$ is chosen so the $W^{1, p}$ norm is scale-invariant: $∥ u_{ε} ∥_{L^{p^{*}}}^{p^{*}} = ε^{- (n - p) p^{*} / p} \int ∣ ϕ (x / ε) ∣^{p^{*}} d x = ε^{- (n - p) p^{*} / p + n} ∥ ϕ ∥_{L^{p^{*}}}^{p^{*}}$ , and since $(n - p) p^{*} / p = (n - p) \cdot \frac{n}{n - p} = n$ , the exponent of $ε$ is $- n + n = 0$ , giving $∥ u_{ε} ∥_{L^{p^{*}}} = ∥ ϕ ∥_{L^{p^{*}}} = 1$ for all $ε$ . For the gradient, $D u_{ε} (x) = ε^{- (n - p) / p - 1} (D ϕ) (x / ε)$ , so $∥ D u_{ε} ∥_{L^{p}}^{p} = ε^{- (n - p) p / p - p + n} ∥ D ϕ ∥_{L^{p}}^{p} = ε^{- (n - p) - p + n} ∥ D ϕ ∥_{L^{p}}^{p} = ε^{0} ∥ D ϕ ∥_{L^{p}}^{p}$ , again $ε$ -independent. Since $ϕ$ has compact support and $∥ u_{ε} ∥_{L^{p}} = ε ∥ ϕ ∥_{L^{p}} \to 0$ , the family is bounded in $W^{1, p}$ . But for $x \neq = 0$ , $u_{ε} (x) = 0$ once $ε < ∣ x ∣$ , so $u_{ε} \to 0$ pointwise a.e.; an $L^{p^{*}}$ -convergent subsequence would have limit $0$ a.e., yet $∥ u_{ε} ∥_{L^{p^{*}}} = 1 \neq \to 0$ , a contradiction. Hence no $L^{p^{*}}$ -convergent subsequence exists and the critical embedding is not compact: the mass of $u_{ε}$ concentrates at the origin, the bubbling that obstructs compactness precisely at $p^{*}$ .

Advanced results Master

The compactness theorem and the Poincaré inequalities organize a larger structure: the sharp range of compact embeddings across the Sobolev scale, the spectral identification of the optimal constants, the failure profile at the critical exponent, the role of the embedding in Fredholm theory and the direct method, and the geometric content of the Poincaré constant. Each refines the translation-estimate and quotient-out-the-constants arguments of the Intermediate tier.

Theorem 1 (full Rellich-Kondrachov scale; Rellich 1930, Kondrachov 1945). Let $Ω$ be bounded with $C^{1}$ boundary, $1 \leq p < \infty$ , $k \in N$ . If $k p < n$ , the embedding $W^{k, p} (Ω) ↪↪ L^{q} (Ω)$ is compact for $1 \leq q < p_{k}^{*}$ , where $\frac{1}{p _{k}^{*}} = \frac{1}{p} - \frac{k}{n}$ ; if $k p = n$ it is compact into $L^{q}$ for all $q < \infty$ ; if $k p > n$ it is compact into $C^{0, γ} (\overset{ˉ}{Ω})$ for $0 \leq γ < k - n / p$ , and into $C^{m, γ}$ for the appropriate $m$ when $k - n / p > 1$ ^{[Rellich 1930]} ^{[Kondrachov 1945]} ^{[Adams-Fournier 2003]}. The strictness $q < p_{k}^{*}$ is sharp: at the endpoint $q = p_{k}^{*}$ the embedding is bounded (the Sobolev embedding of 02.16.01) but loses compactness to concentration. The compact embeddings are exactly the bounded ones at a strictly smaller integrability index, the difference between the two being the scaling-invariant endpoint.

Theorem 2 (spectral form; Poincaré 1890, Courant-Fischer). For $p = 2$ and bounded $Ω$ , the optimal Poincaré constant on $H_{0}^{1} (Ω)$ is $λ_{1} (Ω)^{- 1/2}$ , where $λ_{1} (Ω) = u \in H_{0}^{1} (Ω) ∖ {0} min \frac{∥ D u ∥ _{L^{2}}^{2}}{∥ u ∥ _{L^{2}}^{2}}$ is the first Dirichlet eigenvalue of $- Δ$ , with the minimizing $u$ the first Dirichlet eigenfunction ^{[Poincaré 1890]}. The optimal Poincaré-Wirtinger constant is $μ_{2} (Ω)^{- 1/2}$ , where $μ_{2}$ is the first nonzero Neumann eigenvalue (the spectral gap above the constant Neumann eigenfunction). That the minimum is attained — rather than merely an infimum — is precisely Rellich-Kondrachov: a minimizing sequence is bounded in $H_{0}^{1}$ , so a subsequence converges strongly in $L^{2}$ , and lower semicontinuity of the Dirichlet energy promotes the weak $H^{1}$ limit to a genuine minimizer. The discreteness of the entire Dirichlet spectrum follows by applying 02.11.05 to the compact resolvent $(- Δ)^{- 1}$ .

Theorem 3 (Rellich-Kondrachov and the Fredholm alternative). The compactness of $W^{1, 2} (Ω) ↪↪ L^{2} (Ω)$ makes the solution operator of a uniformly elliptic problem $Lu = f$ a compact perturbation of the identity on $L^{2} (Ω)$ : writing $L = - Δ + (L + Δ)$ and inverting the leading part, the lower-order terms factor through the compact embedding, so $(I - K)$ with $K$ compact governs solvability. The Fredholm alternative of 02.11.05 then applies: either $Lu = f$ has a unique weak solution for every $f$ , or the homogeneous problem $Lu = 0$ has a finite-dimensional solution space and $Lu = f$ is solvable exactly when $f$ is orthogonal to the cokernel. Compactness is the single structural input converting an infinite-dimensional boundary-value problem into the linear algebra of finite-rank obstructions.

Theorem 4 (Maz'ya capacitary and measure-theoretic characterizations). The Poincaré and compactness phenomena persist far beyond $C^{1}$ domains. Maz'ya's theory characterizes the domains and measures for which $W^{1, p} (Ω) ↪ L^{q} (Ω, μ)$ is bounded or compact in terms of isocapacitary and isoperimetric inequalities: boundedness corresponds to a capacity-volume inequality $μ (K) \leq C cap_{p} (K)^{q / p}$ uniformly over compact $K$ , and compactness to the same with the constant tending to zero on small sets ^{[Maz'ya 2011]}. For $p = 1$ the relevant inequality is isoperimetric, tying the Poincaré constant to the Cheeger constant $h (Ω) = in f_{A} \frac{∣ \partial A \cap Ω∣}{∣ A ∣}$ via $λ_{1} \geq h^{2} /4$ (Cheeger's inequality). Domains with sufficiently sharp outward cusps fail the extension property of 02.16.02 and can lose both the compact embedding and the Poincaré inequality.

Theorem 5 (concentration-compactness; the failure at $p^{*}$ made quantitative). At the critical exponent the loss of compactness is not arbitrary but structured: a $W^{1, p}$ -bounded sequence with $u_{m} ⇀ u$ weakly fails to converge strongly in $L^{p^{*}}$ only through a countable sum of concentrating bubbles, $∣ D u_{m} ∣^{p} ⇀ ∣ D u ∣^{p} + \sum_{j} ν_{j} δ_{x_{j}}$ and $∣ u_{m} ∣^{p^{*}} ⇀ ∣ u ∣^{p^{*}} + \sum_{j} μ_{j} δ_{x_{j}}$ as measures, with the masses linked by the Sobolev inequality $μ_{j}^{p / p^{*}} \leq S^{- 1} ν_{j}$ . This is the concentration-compactness principle: strong $L^{p^{*}}$ convergence holds if and only if no mass escapes into Dirac bubbles, and the bubbles are dilates of the Aubin-Talenti extremals of 02.16.01. It restores a usable substitute for compactness in critical variational problems, where Rellich-Kondrachov alone is unavailable.

Synthesis. The compactness theorem is the foundational reason the direct method of the calculus of variations produces actual minimizers rather than mere infima, and the whole structure is generated by the same single principle as the Sobolev inequalities of 02.16.01: a derivative integrated along a segment controls the function, here in the translation form $∥ τ_{h} u - u ∥_{L^{1}} \leq ∣ h ∣∥ D u ∥_{L^{1}}$ that feeds the Fréchet-Kolmogorov criterion. This translation estimate is dual to the normal-direction integration of the trace theorem 02.16.02, one estimate run along the translation vector and the other run inward to the boundary. The compact embedding is exactly the strictly-subcritical Sobolev embedding read as a compact operator through 02.11.05; the Poincaré inequality is this same control with the constants quotiented out by the zero-boundary condition, and the Poincaré-Wirtinger inequality quotients them out instead by subtracting the mean — putting these together, both optimal constants are eigenvalues, the Dirichlet $λ_{1}$ and the Neumann $μ_{2}$ . The central insight at this stage is that the attainment of those eigenvalues, rather than their being mere infima, is itself the compactness theorem applied to a minimizing sequence: this is exactly the upgrade from weak to strong convergence that the direct method demands.

The central insight is that a bounded slope budget on a bounded region is precisely a precompactness certificate, and this is exactly why the theorem stops at $p^{*}$ : the dilation symmetry that forces the critical exponent also manufactures the concentrating bubbles of Theorem 5 that no slope budget can prevent. From Poincaré's 1890 inequality through Rellich's 1930 mean-convergence lemma and Kondrachov's 1945 extension across the subcritical range to the Maz'ya capacitary theory and the concentration-compactness method, the subject is one continuous refinement of a single fundamental-theorem-of-calculus estimate, generalised until it became the compactness engine of elliptic Fredholm theory and the spectral theory of the Laplacian, and it appears again in 02.16.04 as the device that turns a minimizing sequence into a weak solution.

Full proof set Master

Proposition 1 (Poincaré inequality on a bounded domain, general constant). Let $Ω \subseteq R^{n}$ be bounded with diameter $d$ . Then $∥ u ∥_{L^{p} (Ω)} \leq d ∥ D u ∥_{L^{p} (Ω)}$ for all $u \in W_{0}^{1, p} (Ω)$ , $1 \leq p < \infty$ .

Proof. By density it suffices to treat $u \in C_{c}^{\infty} (Ω)$ , extended by zero to $R^{n}$ . Enclose $Ω$ in a slab ${a < x_{n} < a + d}$ of width $d$ (possible since $diam Ω = d$ ). For each fixed $x^{'} = (x_{1}, \dots, x_{n - 1})$ , the function $t \mapsto u (x^{'}, t)$ vanishes for $t \leq a$ , so $u (x^{'}, x_{n}) = \int_{a}^{x_{n}} D_{n} u (x^{'}, t) d t$ . By Hölder with conjugate exponents $p, p^{'}$ , $∣ u (x^{'}, x_{n}) ∣^{p} \leq (x_{n} - a)^{p / p^{'}} \int_{a}^{a + d} ∣ D_{n} u (x^{'}, t) ∣^{p} d t \leq d^{p - 1} \int_{a}^{a + d} ∣ D_{n} u (x^{'}, t) ∣^{p} d t .$ Integrate in $x_{n}$ over the slab width $d$ : $\int_{a}^{a + d} ∣ u (x^{'}, x_{n}) ∣^{p} d x_{n} \leq d^{p} \int_{a}^{a + d} ∣ D_{n} u (x^{'}, t) ∣^{p} d t$ . Integrating over $x^{'} \in R^{n - 1}$ and using $∣ D_{n} u ∣ \leq ∣ D u ∣$ yields $∥ u ∥_{L^{p} (Ω)}^{p} \leq d^{p} ∥ D u ∥_{L^{p} (Ω)}^{p}$ , i.e. $∥ u ∥_{L^{p}} \leq d ∥ D u ∥_{L^{p}}$ . The constant uses only the slab width, so boundedness in a single direction suffices. $□$

Proposition 2 (Poincaré-Wirtinger inequality). Let $Ω$ be bounded, connected, with $C^{1}$ boundary, $1 \leq p < \infty$ . There is $C = C (n, p, Ω)$ with $∥ u - (u)_{Ω} ∥_{L^{p} (Ω)} \leq C ∥ D u ∥_{L^{p} (Ω)}$ for all $u \in W^{1, p} (Ω)$ .

Proof. Argue by contradiction using Rellich-Kondrachov. If the inequality fails for every $C$ , choose $u_{k} \in W^{1, p} (Ω)$ with $∥ u_{k} - (u_{k})_{Ω} ∥_{L^{p}} > k ∥ D u_{k} ∥_{L^{p}}$ . Normalize $v_{k} = (u_{k} - (u_{k})_{Ω}) /∥ u_{k} - (u_{k})_{Ω} ∥_{L^{p}}$ , so $(v_{k})_{Ω} = 0$ , $∥ v_{k} ∥_{L^{p}} = 1$ , and $∥ D v_{k} ∥_{L^{p}} < 1/ k$ . Then $∥ v_{k} ∥_{W^{1, p}}^{p} = 1 + ∥ D v_{k} ∥_{L^{p}}^{p} \leq 2$ for $k \geq 1$ , so $(v_{k})$ is bounded in $W^{1, p} (Ω)$ . By the Key Theorem (Rellich-Kondrachov, taking $q = p < p^{*}$ , valid since $Ω$ is a $C^{1}$ extension domain) a subsequence converges strongly in $L^{p} (Ω)$ to some $v$ with $∥ v ∥_{L^{p}} = 1$ and $(v)_{Ω} = lim (v_{k})_{Ω} = 0$ . For every $φ \in C_{c}^{\infty} (Ω)$ and each $i$ , $\int_{Ω} v D_{i} φ d x = lim_{k} \int_{Ω} v_{k} D_{i} φ d x = - lim_{k} \int_{Ω} (D_{i} v_{k}) φ d x = 0$ , the last equality since $∥ D v_{k} ∥_{L^{p}} \to 0$ and $φ$ is bounded. Thus $D v = 0$ weakly. On the connected open set $Ω$ a weak gradient that vanishes forces $v$ constant a.e.; with $(v)_{Ω} = 0$ this constant is $0$ , contradicting $∥ v ∥_{L^{p}} = 1$ . The inequality therefore holds for some finite $C$ . $□$

Proposition 3 (translation estimate in $L^{p}$ ). For $u \in W^{1, p} (R^{n})$ , $1 \leq p < \infty$ , and $h \in R^{n}$ , $∥ τ_{h} u - u ∥_{L^{p} (R^{n})} \leq ∣ h ∣ ∥ D u ∥_{L^{p} (R^{n})}$ .

Proof. For $u \in C_{c}^{\infty} (R^{n})$ , $u (x + h) - u (x) = \int_{0}^{1} D u (x + t h) \cdot h d t$ . By Minkowski's integral inequality, viewing the difference as an integral over $t$ of the $L^{p}$ -valued map $x \mapsto D u (x + t h) \cdot h$ , $∥ τ_{h} u - u ∥_{L^{p}} \leq \int_{0}^{1} D u (\cdot + t h) \cdot h_{L^{p}} d t \leq ∣ h ∣ \int_{0}^{1} ∥ D u (\cdot + t h) ∥_{L^{p}} d t = ∣ h ∣ ∥ D u ∥_{L^{p}},$ using $∣ D u (\cdot + t h) \cdot h ∣ \leq ∣ h ∣ ∣ D u (\cdot + t h) ∣$ pointwise and translation invariance of the $L^{p}$ norm. For general $u \in W^{1, p}$ , approximate by $u_{m} \in C_{c}^{\infty}$ with $u_{m} \to u$ , $D u_{m} \to D u$ in $L^{p}$ ; translation is an $L^{p}$ -isometry, so both sides converge and the estimate persists. $□$

Proposition 4 (discreteness of the Dirichlet spectrum via compactness). Let $Ω$ be bounded with $C^{1}$ boundary. The operator $- Δ$ on $Ω$ with zero Dirichlet boundary data has compact resolvent, hence a discrete spectrum $0 < λ_{1} \leq λ_{2} \leq \dots \to \infty$ with an $L^{2} (Ω)$ -orthonormal basis of eigenfunctions.

Proof. For $f \in L^{2} (Ω)$ the weak problem $\int_{Ω} D u \cdot D v = \int_{Ω} f v$ for all $v \in H_{0}^{1} (Ω)$ has, by the Poincaré inequality of Proposition 1 (which makes $∥ D u ∥_{L^{2}}$ an equivalent norm on $H_{0}^{1}$ ) and the Riesz representation theorem 02.11.05, a unique solution $u =: G f \in H_{0}^{1} (Ω)$ with $∥ u ∥_{H_{0}^{1}} \leq C ∥ f ∥_{L^{2}}$ . Thus $G : L^{2} (Ω) \to H_{0}^{1} (Ω)$ is bounded. Composing with the compact inclusion $H_{0}^{1} (Ω) ↪↪ L^{2} (Ω)$ from the Key Theorem ( $q = 2 < 2^{*}$ ), the operator $G : L^{2} (Ω) \to L^{2} (Ω)$ is compact. It is also self-adjoint and positive (from the symmetric, coercive bilinear form). The spectral theorem for compact self-adjoint operators 02.11.05 gives an orthonormal eigenbasis ${ϕ_{k}}$ with eigenvalues $σ_{k} ↓ 0$ ; setting $λ_{k} = 1/ σ_{k}$ gives $- Δ ϕ_{k} = λ_{k} ϕ_{k}$ with $λ_{k} ↑ \infty$ , and $λ_{1} > 0$ by the Poincaré inequality. $□$

Connections Master

The compact embedding is the strictly-subcritical Sobolev embedding of 02.16.01 read as a compact operator: the boundedness $W^{1, p} (Ω) ↪ L^{p^{*}} (Ω)$ proved there is upgraded here to compactness for every $q < p^{*}$ , and the concentration phenomenon that obstructs compactness at $q = p^{*}$ is exactly the Aubin-Talenti bubbling of that unit's sharp-constant theory. This unit owns the compactness; 02.16.01 owns the boundedness and the sharp constant.
The extension operator $E : W^{1, p} (Ω) \to W^{1, p} (R^{n})$ of 02.16.02 is the indispensable first step of the compactness proof: it transfers the bounded sequence to a fixed bounded set in $R^{n}$ where the Fréchet-Kolmogorov criterion can be run, and it is the device whose failure on cuspidal domains makes both the compact embedding and the Poincaré inequality fail there. The trace operator of that unit also supplies the boundary control that the Poincaré inequality complements in the interior.
The abstract framework — compact operators, the Fredholm alternative, and the spectral theorem for compact self-adjoint operators — is supplied by 02.11.05; this unit provides the single concrete compact operator (the Sobolev embedding) that makes elliptic boundary-value problems Fredholm and gives $- Δ$ a discrete spectrum, so the abstract theory of 02.11.05 acquires its principal application here.
The $L^{p}$ apparatus on which the translation estimate and the interpolation step rest — Hölder's inequality, Minkowski's integral inequality, and the interpolation of $L^{q}$ norms between $L^{1}$ and $L^{p^{*}}$ — is developed in 02.07.06; the mollification argument behind Fréchet-Kolmogorov is the same approximate-identity machinery used there.
The compactness theorem is the existence engine for weak solutions of elliptic boundary-value problems in 02.16.04: the direct method minimizes an energy over $W^{1, p}$ , and Rellich-Kondrachov is exactly what upgrades a minimizing sequence's weak limit to a strong $L^{q}$ limit, while the Poincaré inequality supplies the coercivity that keeps the minimizing sequence bounded.

Historical & philosophical context Master

Henri Poincaré introduced the inequality bearing his name in his 1890 American Journal of Mathematics memoir on the partial differential equations of mathematical physics ^{[Poincaré 1890]}, where he needed to bound a function by its gradient to control the eigenvalue problems arising in heat conduction and potential theory. The one-dimensional sharp form, bounding a periodic mean-zero function by its derivative with the optimal constant attained by the first trigonometric mode, is attributed to Wilhelm Wirtinger and was popularized through Wilhelm Blaschke's 1916 Kreis und Kugel ^{[Wirtinger 1916]}; the combined name Poincaré-Wirtinger reflects this dual ancestry in potential theory and in the isoperimetric problem.

The compactness theorem originates with Franz Rellich's 1930 Göttingen note Ein Satz über mittlere Konvergenz ^{[Rellich 1930]}, which proved that an $H^{1}$ -bounded sequence on a bounded domain has an $L^{2}$ -convergent subsequence — the case $p = q = 2$ . Vladimir Kondrachov extended the result across the full subcritical range of exponents in his 1945 Doklady note ^{[Kondrachov 1945]}, establishing the compact embedding $W^{1, p} ↪ L^{q}$ for $q < p^{*}$ . The compactness criterion underlying the modern proof is the Fréchet-Kolmogorov / Kolmogorov-Riesz theorem, whose threads run from Maurice Fréchet's 1907 work on compact sets of functions ^{[Fréchet 1907]} through Andrei Kolmogorov's 1931 Göttingen note characterizing precompact families in $L^{p}$ by mean continuity ^{[Kolmogorov 1931]} and Marcel Riesz's 1933 Acta Szeged completion to general exponents ^{[Riesz 1933]}; the consolidated statement and its history are surveyed by Harald Hanche-Olsen and Helge Holden ^{[Hanche-Olsen-Holden 2010]}. Vladimir Maz'ya's capacitary theory later identified the exact geometric conditions on domains and measures for both boundedness and compactness, completing the picture begun by Rellich and Kondrachov.

Bibliography Master

@article{Rellich1930,
  author  = {Rellich, Franz},
  title   = {Ein Satz \"uber mittlere Konvergenz},
  journal = {Nachrichten von der Gesellschaft der Wissenschaften zu G\"ottingen, Mathematisch-Physikalische Klasse},
  year    = {1930},
  pages   = {30--35}
}

@article{Kondrachov1945,
  author  = {Kondrachov, Vladimir I.},
  title   = {Sur certaines propri\'et\'es des fonctions dans l'espace $L^p$},
  journal = {Doklady Akademii Nauk SSSR},
  volume  = {48},
  year    = {1945},
  pages   = {535--538}
}

@article{Poincare1890,
  author  = {Poincar\'e, Henri},
  title   = {Sur les \'equations aux d\'eriv\'ees partielles de la physique math\'ematique},
  journal = {American Journal of Mathematics},
  volume  = {12},
  year    = {1890},
  pages   = {211--294}
}

@incollection{Wirtinger1916,
  author    = {Blaschke, Wilhelm},
  title     = {Kreis und Kugel},
  publisher = {Veit, Leipzig},
  year      = {1916},
  note      = {\S1.6; the one-dimensional Wirtinger inequality}
}

@article{Kolmogorov1931,
  author  = {Kolmogorov, Andrei N.},
  title   = {\"Uber Kompaktheit der Funktionenmengen bei der Konvergenz im Mittel},
  journal = {Nachrichten von der Gesellschaft der Wissenschaften zu G\"ottingen},
  year    = {1931},
  pages   = {60--63}
}

@article{Riesz1933,
  author  = {Riesz, Marcel},
  title   = {Sur les ensembles compacts de fonctions sommables},
  journal = {Acta Scientiarum Mathematicarum (Szeged)},
  volume  = {6},
  year    = {1933},
  pages   = {136--142}
}

@article{HancheOlsenHolden2010,
  author  = {Hanche-Olsen, Harald and Holden, Helge},
  title   = {The Kolmogorov-Riesz compactness theorem},
  journal = {Expositiones Mathematicae},
  volume  = {28},
  year    = {2010},
  pages   = {385--394}
}

@book{Mazya2011,
  author    = {Maz'ya, Vladimir},
  title     = {Sobolev Spaces, with Applications to Elliptic Partial Differential Equations},
  edition   = {2},
  publisher = {Springer Grundlehren 342},
  year      = {2011}
}

Prerequisites

02.16.02
02.11.05
02.07.06

Tier anchors

beginner: Strogatz-style intuition for why a budget on a function's slope plus a fixed-size box forces a sequence of functions to bunch up rather than scatter; Tao's blog discussion of compactness as the principle that bounded families with controlled oscillation must have convergent subsequences
intermediate: Evans, Partial Differential Equations, 2e (AMS GSM 19, 2010), §5.7 (Rellich-Kondrachov) and §5.8.1 (Poincaré inequalities); Brezis, Functional Analysis, Sobolev Spaces and PDEs (Springer 2011), §9.3 and Theorem 9.16
master: Evans §5.7-§5.8; Adams-Fournier, Sobolev Spaces, 2e (Academic Press 2003), Ch. 6 (compact embeddings); Gilbarg-Trudinger, Elliptic Partial Differential Equations of Second Order, 2e (Springer 1983), §7.10; Maz'ya, Sobolev Spaces, 2e (Springer 2011), Ch. 1-2; Hanche-Olsen-Holden, The Kolmogorov-Riesz compactness theorem (Expositiones Mathematicae 2010)

References

Rellich — Ein Satz über mittlere Konvergenz · Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen, Mathematisch-Physikalische Klasse (1930), 30-35
Kondrachov — Sur certaines propriétés des fonctions dans l'espace L^p · Doklady Akademii Nauk SSSR 48 (1945), 535-538
Poincaré — Sur les équations aux dérivées partielles de la physique mathématique · American Journal of Mathematics 12 (1890), 211-294
Wirtinger — as reported in W. Blaschke, Kreis und Kugel · Veit, Leipzig (1916), §1.6 (the Poincaré-Wirtinger / one-dimensional Wirtinger inequality)
Kolmogorov — Über Kompaktheit der Funktionenmengen bei der Konvergenz im Mittel · Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen (1931), 60-63
Riesz, M. — Sur les ensembles compacts de fonctions sommables · Acta Scientiarum Mathematicarum (Szeged) 6 (1933), 136-142
Fréchet — Sur les ensembles de fonctions et les opérations linéaires · Comptes Rendus de l'Académie des Sciences Paris 144 (1907), 1414-1416
Hanche-Olsen, Holden — The Kolmogorov-Riesz compactness theorem · Expositiones Mathematicae 28 (2010), 385-394
Evans — Partial Differential Equations, 2e · AMS Graduate Studies in Mathematics 19 (2010), §5.7-§5.8
Adams-Fournier — Sobolev Spaces, 2e · Academic Press (2003), Ch. 6
Gilbarg-Trudinger — Elliptic Partial Differential Equations of Second Order, 2e · Springer Grundlehren 224 (1983), §7.8, §7.10
Brezis — Functional Analysis, Sobolev Spaces and Partial Differential Equations · Springer Universitext (2011), §9.3, Theorem 9.16

Estimated time

beginner: 25m
intermediate: 65m
master: 105m