02.20.01 · analysis / littlewood-paley-interpolation

BMO and the John-Nirenberg Inequality

shipped3 tiersLean: none

Anchor (Master): Stein 1993 *Harmonic Analysis* (Princeton) Ch. IV; Grafakos 2014 *Modern Fourier Analysis* 3e (Springer) §7.1-7.4; Duoandikoetxea 2001 *Fourier Analysis* (AMS) §6

Intuition Beginner

Imagine measuring the price of one product across thousands of shops. Two kinds of "spread" are possible. One is that prices everywhere stay within a fixed band — never below a floor, never above a ceiling. The other, looser kind says nothing about a ceiling: prices can climb arbitrarily high in a few places, but if you zoom into any neighbourhood and compare each shop to the local average, the typical gap from that local average stays modest. The second condition controls how much things wiggle locally, not how big they get globally.

Functions of bounded mean oscillation are exactly the second kind. For each box you compute the average of the function over that box, then measure how far the function sits from its own average, on average, inside the box. If that averaged gap is uniformly small across every box and every size, the function has bounded mean oscillation. A bounded function automatically qualifies, because it cannot stray far from any average. But the reverse fails: a function can have bounded mean oscillation while shooting off to infinity.

The headline example is the logarithm of distance to a point. Near that point the function plunges to negative infinity, so it is nowhere bounded. Yet over any box the logarithm's gap from its box-average stays controlled, because the logarithm grows so slowly that rescaling a box barely changes its shape. This single example shows the new class is genuinely bigger than the bounded functions, and it is the reason the class matters: it is the natural home for outputs of operations that just barely fail to keep bounded inputs bounded.

The one-sentence takeaway: bounded mean oscillation asks each box's typical deviation from its own average to be small, which lets functions blow up slowly like a logarithm while still being tightly controlled at every scale.

Visual Beginner

Picture the graph of a function over a single box, with a horizontal line drawn at the height equal to the function's average over that box. The mean oscillation is the average vertical distance between the graph and that horizontal line. A nearly flat graph hugs its average line, so its mean oscillation is tiny. A graph that swings wildly above and below the line has large mean oscillation. The defining demand is that no box, of any size or position, produces a large average gap.

The lower panel carries the punchline. Even though the logarithm dives to negative infinity at the marked point, when you draw any box around the region and compare the curve to its box-average, the typical gap stays the same modest size no matter how small the box gets. Boundedness of the function is a statement about the curve's height; bounded mean oscillation is a statement about the curve's spread around its sliding average. The second can hold while the first fails.

Worked example Beginner

We measure the mean oscillation of a simple step function over a box and read off the number by hand.

Step 1. Work on the line and let $f$ equal $0$ on the left half of the interval from $0$ to $2$ , and equal $4$ on the right half, that is $f = 0$ on $[0, 1)$ and $f = 4$ on $[1, 2]$ . Take the box to be the whole interval $Q = [0, 2]$ , of length $2$ .

Step 2. Compute the average of $f$ over $Q$ . The function is $0$ over a length $1$ and $4$ over a length $1$ , so the total area is $0 \times 1 + 4 \times 1 = 4$ , and the average is the area divided by the length, $4$ divided by $2$ , which is $2$ .

Step 3. Compute how far $f$ sits from this average of $2$ . On the left half $f = 0$ , so the gap is $∣0 - 2∣ = 2$ . On the right half $f = 4$ , so the gap is $∣4 - 2∣ = 2$ . The gap is $2$ everywhere.

Step 4. Average the gap over the box. Since the gap is the constant $2$ across the whole interval, its average is just $2$ . So the mean oscillation of $f$ over this box is $2$ .

What this tells us: the mean oscillation measures the typical distance from the local average, and here it came out to exactly half the jump height, because the function spends equal time on each side of its average. A function of bounded mean oscillation is one where this number stays below a fixed ceiling for every box at once; raising the jump height would raise the oscillation, while shrinking the box around a single level would lower it toward zero.

Check your understanding Beginner

Exercise (easy, multiple choice).

The mean oscillation of a function over a box measures:

A. The largest value the function takes on the box B. The average distance between the function and its own average over the box C. The difference between the maximum and minimum of the function on the box D. The average value of the function over the box

Hint

The word "oscillation" refers to spread around a centre, and the centre being used is the box-average. Which option measures spread around that centre?

Answer

B. The average distance between the function and its own average over the box.

Feedback-correct: mean oscillation first centres the function by subtracting its box-average, then averages the size of what remains. Feedback-wrong: A and D report a single value (a maximum or the centre itself), not a spread; C reports the full range, which is a cruder measure than the averaged deviation and is not what bounded mean oscillation controls.

Formal definition Intermediate+

Throughout, $Q$ ranges over cubes in $R^{n}$ with sides parallel to the axes (equivalently, over balls; the two families give comparable seminorms), $∣ Q ∣$ is the Lebesgue measure of $Q$ , and $f_{Q} = \fint_{Q} f = \frac{1}{∣ Q ∣} \int_{Q} f$ is the average of $f$ over $Q$ . All functions are real- or complex-valued and locally integrable.

Definition (bounded mean oscillation). For $f \in L_{loc}^{1} (R^{n})$ the mean oscillation seminorm is $∥ f ∥_{*} = Q sup \frac{1}{∣ Q ∣} \int_{Q} ∣ f (x) - f_{Q} ∣ d x,$ the supremum taken over all cubes $Q$ . The space of bounded mean oscillation is $BMO (R^{n}) = {f \in L_{loc}^{1} (R^{n}) : ∥ f ∥_{*} < \infty} .$ Since $∥ f ∥_{*} = 0$ if and only if $f$ is almost everywhere constant, $∥ \cdot ∥_{*}$ is a seminorm whose null space is the constants; $BMO$ is a Banach space once functions are identified modulo additive constants. The competitor centring $in f_{c} \fint_{Q} ∣ f - c ∣$ is comparable: $in f_{c} \fint_{Q} ∣ f - c ∣ \leq \fint_{Q} ∣ f - f_{Q} ∣ \leq 2 in f_{c} \fint_{Q} ∣ f - c ∣$ , so replacing $f_{Q}$ by the best constant changes $∥ f ∥_{*}$ by at most a factor $2$ .

Definition (sharp maximal function). The Fefferman-Stein sharp maximal function of $f$ is $M^{#} f (x) = Q ∋ x sup \frac{1}{∣ Q ∣} \int_{Q} ∣ f (y) - f_{Q} ∣ d y,$ the supremum over cubes containing $x$ . By definition $∥ f ∥_{*} = ∥ M^{#} f ∥_{L^{\infty}}$ , so $f \in BMO$ exactly when $M^{#} f \in L^{\infty}$ , the BMO seminorm being the $L^{\infty}$ -norm of the pointwise oscillation. One has $M^{#} f \leq 2 M f$ pointwise, where $M$ is the Hardy-Littlewood maximal operator 02.19.01.

Non-examples and the canonical example. The inclusion $L^{\infty} \subseteq BMO$ holds with $∥ f ∥_{*} \leq 2∥ f ∥_{\infty}$ , and it is strict: the function $f (x) = lo g ∣ x ∣$ on $R^{n}$ is unbounded yet lies in $BMO$ , with $∥ lo g ∣ x ∣ ∥_{*}$ bounded by a dimensional constant. The function $sgn (lo g ∣ x ∣)$ , by contrast, is bounded — hence in $BMO$ — even though it is the composition of a bounded function with a $BMO$ function, illustrating that $BMO$ is not preserved by all bounded nonlinearities. The point of the class is captured by the slogan: $BMO$ is the smallest rearrangement-stable space containing $L^{\infty}$ on which Calderón-Zygmund operators remain bounded.

Counterexamples to common slips Intermediate+

BMO is a seminorm space, not a norm space, until you quotient by constants. Adding a constant to $f$ leaves $∥ f ∥_{*}$ unchanged because $f - f_{Q}$ is insensitive to constants. Treating $∥ \cdot ∥_{*}$ as a norm on functions (rather than on cosets) gives a non-Hausdorff "space"; the genuine Banach space is $L_{loc}^{1} / C$ with $∥ \cdot ∥_{*}$ .
$lo g ∣ x ∣ \in BMO$ but $∣ lo g ∣ x ∣∣ \in / BMO$ is false — both are in BMO, yet $sgn (x) lo g ∣ x ∣$ can leave it. The subtle fact is that BMO is not a lattice: $f \in BMO$ does not force $∣ f ∣ \in BMO$ in general, though for $lo g ∣ x ∣$ it happens to hold. Pointwise nonlinear operations are not BMO-bounded in general.
The supremum is over all cubes, not a fixed scale. A function may have small oscillation over large cubes yet large oscillation over small ones, or vice versa. Checking one scale never certifies BMO; the defining supremum couples all positions and all sizes.
Cubes versus balls and centred versus best-constant change only the constant. Each admissible variant of the definition — cubes or balls, $f_{Q}$ or the infimising constant, $L^{1}$ or (after John-Nirenberg) $L^{p}$ averaging — yields a seminorm comparable to $∥ \cdot ∥_{*}$ up to dimensional factors. The space $BMO$ is the same; only the numerical seminorm shifts.

Key theorem with proof Intermediate+

Theorem (John-Nirenberg inequality; John-Nirenberg 1961 Comm. Pure Appl. Math. 14, 415). There are dimensional constants $C_{1}, C_{2} > 0$ such that for every $f \in BMO (R^{n})$ , every cube $Q$ , and every $λ > 0$ , ${x \in Q : ∣ f (x) - f_{Q} ∣ > λ} \leq C_{1} ∣ Q ∣ exp (- \frac{C _{2} λ}{∥ f ∥ _{*}}) .$ One may take $C_{1} = e$ and $C_{2} = (2^{n} e)^{- 1}$ (with the dyadic-cube normalization below).

Proof. By homogeneity (replace $f$ by $f /∥ f ∥_{*}$ ) and translation/dilation invariance of both sides, it suffices to assume $∥ f ∥_{*} = 1$ and prove $Φ (λ) := Q sup \frac{1}{∣ Q ∣} {x \in Q : ∣ f - f_{Q} ∣ > λ} \leq C_{1} e^{- C_{2} λ} .$

Step 1 (one Calderón-Zygmund step). Fix a cube $Q$ and set $g = (f - f_{Q}) χ_{Q}$ , so $\fint_{Q} ∣ g ∣ = \fint_{Q} ∣ f - f_{Q} ∣ \leq ∥ f ∥_{*} = 1$ . Apply the dyadic Calderón-Zygmund decomposition 02.19.02 to $∣ g ∣$ on $Q$ at height $α = 2^{n}$ : subdividing $Q$ dyadically and stopping at the maximal subcubes ${Q_{j}}$ where $\fint_{Q_{j}} ∣ g ∣ > 2^{n}$ , one obtains pairwise disjoint dyadic subcubes with $2^{n} < \fint_{Q_{j}} ∣ g ∣ \leq 2^{2 n}, j \sum ∣ Q_{j} ∣ \leq 2^{- n} \int_{Q} ∣ g ∣ \leq 2^{- n} ∣ Q ∣,$ the first from the unselected dyadic parent and the second from the stopping inequality $∣ Q_{j} ∣ < 2^{- n} \int_{Q_{j}} ∣ g ∣$ summed over the disjoint family. Off $⋃_{j} Q_{j}$ the Lebesgue differentiation theorem 02.19.01 gives $∣ g ∣ \leq 2^{n}$ a.e.

Step 2 (the averages barely move). On each selected cube $Q_{j}$ , $∣ f_{Q_{j}} - f_{Q} ∣ = \fint_{Q_{j}} (f - f_{Q}) \leq \fint_{Q_{j}} ∣ g ∣ \leq 2^{2 n} .$ Thus passing from $Q$ to a child $Q_{j}$ shifts the centring constant by at most $2^{2 n}$ . Consequently, for $x \in Q_{j}$ , $∣ f (x) - f_{Q} ∣ \leq ∣ f (x) - f_{Q_{j}} ∣ + ∣ f_{Q_{j}} - f_{Q} ∣ \leq ∣ f (x) - f_{Q_{j}} ∣ + 2^{2 n} .$

Step 3 (the recursion). The set where $∣ f - f_{Q} ∣ > 2^{2 n}$ inside $Q$ is, up to a null set, contained in $⋃_{j} Q_{j}$ : off the selected cubes $∣ g ∣ = ∣ f - f_{Q} ∣ \leq 2^{n} \leq 2^{2 n}$ a.e. Hence for any $λ > 0$ , ${x \in Q : ∣ f - f_{Q} ∣ > λ + 2^{2 n}} \leq j \sum {x \in Q_{j} : ∣ f - f_{Q_{j}} ∣ > λ} \leq j \sum Φ (λ) ∣ Q_{j} ∣ \leq 2^{- n} Φ (λ) ∣ Q ∣,$ using Step 2 to absorb the constant shift, the definition of $Φ$ on each $Q_{j}$ , and the measure bound $\sum_{j} ∣ Q_{j} ∣ \leq 2^{- n} ∣ Q ∣$ . Taking the supremum over $Q$ yields the functional inequality $Φ (λ + b) \leq 2^{- n} Φ (λ), b := 2^{2 n} .$

Step 4 (iteration to exponential decay). Since $Φ \leq 1$ always (it is the relative measure of a subset of $Q$ ), iterating $Φ (λ + k b) \leq 2^{- nk}$ for every integer $k \geq 0$ . For arbitrary $λ \geq 0$ write $λ = k b + s$ with $k = ⌊ λ / b ⌋$ and $0 \leq s < b$ ; monotonicity of $Φ$ gives $Φ (λ) \leq Φ (k b) \leq 2^{- nk} = 2^{- n ⌊ λ / b ⌋} \leq 2^{n} 2^{- nλ / b} = 2^{n} exp (- \frac{n lo g 2}{2 ^{2 n}} λ) .$ This is the claimed bound with $C_{1} = 2^{n}$ and $C_{2} = n 2^{- 2 n} lo g 2 > 0$ , after restoring $∥ f ∥_{*}$ by the initial normalization. $□$

Bridge. The John-Nirenberg inequality builds toward the identification of $BMO$ as the correct $p \to \infty$ endpoint of the $L^{p}$ scale, and it appears again in the Fefferman-Stein sharp-maximal theory and the $H^{1}$ - $BMO$ duality that anchor the rest of this chapter. The foundational reason the proof works is that the Calderón-Zygmund decomposition 02.19.02 is self-similar: one stopping-time step at height $2^{n}$ reproduces the same distributional question one scale down with a fixed measure contraction $2^{- n}$ and a fixed constant shift $2^{2 n}$ , and this is exactly the structure that turns a single linear oscillation bound into geometric — hence exponential — decay. The central insight is that bounded mean oscillation, an $L^{1}$ averaged condition, self-improves to exponential integrability: putting these together, $∥ f ∥_{*} < \infty$ forces $\fint_{Q} e^{c ∣ f - f_{Q} ∣} \leq C$ for small $c$ , so BMO functions are locally in every $L^{p}$ and in the Orlicz space $e^{L}$ . The bridge is that this is dual to the atomic $H^{1}$ estimate, the $e^{L}$ -integrability of $BMO$ generalises the plain $L^{\infty}$ bound, and the recursion here is exactly the iterated stopping-time selection flagged in the Calderón-Zygmund decomposition's own forward map.

Exercises Intermediate+

Exercise 3 (medium, symbolic).

Prove that $lo g ∣ x ∣ \in BMO (R^{n})$ , with $∥ lo g ∣ x ∣ ∥_{*} \leq C_{n}$ depending only on the dimension.

Hint

By dilation invariance of $∥ \cdot ∥_{*}$ reduce to cubes of side $1$ . Split into cubes near the origin (where $lo g ∣ x ∣$ is integrable) and cubes far from the origin (where $lo g ∣ x ∣$ is Lipschitz with small constant), and centre with a well-chosen constant rather than the exact average.

Answer

Write $f (x) = lo g ∣ x ∣$ . The seminorm is invariant under dilations $x \mapsto t x$ (which send $f$ to $f + lo g t$ , a constant shift) and under rotations, so it suffices to bound $in f_{c} \fint_{Q} ∣ f - c ∣$ over cubes of side length $1$ , since $∥ f ∥_{*} \leq 2 sup_{Q} in f_{c} \fint_{Q} ∣ f - c ∣$ .

Let $Q$ be a cube of side $1$ centred at $x_{0}$ with $∣ x_{0} ∣ = R$ . If $R \leq 2 n$ (the cube is near the origin), choose $c = 0$ : then $\fint_{Q} ∣ lo g ∣ x ∣∣ d x \leq \int_{∣ x ∣ \leq 3 n} ∣ lo g ∣ x ∣∣ d x \leq A_{n} < \infty,$ because $lo g ∣ x ∣$ is locally integrable on $R^{n}$ (its only singularity is the integrable logarithm at $0$ ), and $A_{n}$ is a dimensional constant. If $R > 2 n$ , then on $Q$ one has $∣ x ∣ \geq R - n /2 \geq R /2$ , and $\nabla f (x) = x /∣ x ∣^{2}$ has $∣\nabla f (x) ∣ = 1/∣ x ∣ \leq 2/ R$ . By the mean value inequality $f$ is Lipschitz on $Q$ with constant $2/ R$ , so with $c = f (x_{0})$ , $\fint_{Q} ∣ f - c ∣ \leq \frac{2}{R} \cdot diam (Q) = \frac{2 n}{R} \leq n .$ In both regimes $in f_{c} \fint_{Q} ∣ f - c ∣ \leq max (A_{n}, n) =: C_{n}^{'}$ , hence $∥ f ∥_{*} \leq 2 C_{n}^{'} =: C_{n}$ . The logarithm is the borderline of BMO: it is unbounded yet has bounded mean oscillation, and any faster growth (e.g. a power $∣ x ∣^{ε}$ ) leaves BMO.

Exercise 4 (medium, symbolic).

Using the John-Nirenberg inequality, prove that for $1 \leq p < \infty$ the seminorm $∥ f ∥_{*, p} = Q sup (\fint_{Q} ∣ f - f_{Q} ∣^{p})^{1/ p}$ is finite for every $f \in BMO$ and comparable to $∥ f ∥_{*}$ : there are dimensional constants with $∥ f ∥_{*} \leq ∥ f ∥_{*, p} \leq C_{n, p} ∥ f ∥_{*}$ .

Hint

The lower bound is Jensen ( $p \geq 1$ ). For the upper bound use the layer-cake formula $\fint_{Q} ∣ f - f_{Q} ∣^{p} = p \int_{0}^{\infty} λ^{p - 1} \fint_{Q} χ_{{∣ f - f_{Q} ∣ > λ}} d λ$ and the exponential distributional bound.

Answer

The lower bound $∥ f ∥_{*} = ∥ f ∥_{*, 1} \leq ∥ f ∥_{*, p}$ is Jensen's inequality, since $t \mapsto t^{p}$ is convex for $p \geq 1$ and so $\fint_{Q} ∣ f - f_{Q} ∣ \leq (\fint_{Q} ∣ f - f_{Q} ∣^{p})^{1/ p}$ .

For the upper bound, fix $Q$ and apply the layer-cake (Cavalieri) representation together with John-Nirenberg (normalising $∥ f ∥_{*} = 1$ ): $\fint_{Q} ∣ f - f_{Q} ∣^{p} = \frac{p}{∣ Q ∣} \int_{0}^{\infty} λ^{p - 1} {x \in Q : ∣ f - f_{Q} ∣ > λ} d λ \leq p C_{1} \int_{0}^{\infty} λ^{p - 1} e^{- C_{2} λ} d λ .$ The remaining integral is $Γ (p) C_{2}^{- p} = p! C_{2}^{- p}$ for integer $p$ and $Γ (p) C_{2}^{- p}$ in general, a finite constant $γ_{n, p}$ depending only on $n$ and $p$ . Hence $\fint_{Q} ∣ f - f_{Q} ∣^{p} \leq p C_{1} γ_{n, p} =: C_{n, p}^{p}$ uniformly in $Q$ , giving $∥ f ∥_{*, p} \leq C_{n, p} ∥ f ∥_{*}$ after undoing the normalization. Thus all the $L^{p}$ -averaged BMO seminorms define the same space with comparable seminorms — the John-Nirenberg self-improvement in quantitative form.

Exercise 5 (medium, symbolic).

Prove the exponential-integrability corollary: there is a dimensional constant $c_{n} > 0$ such that for every $f \in BMO$ with $∥ f ∥_{*} \leq 1$ and every cube $Q$ , $\fint_{Q} exp (c_{n} ∣ f - f_{Q} ∣) d x \leq 2.$

Hint

Expand the exponential, or integrate the John-Nirenberg tail bound directly: $\fint_{Q} e^{c ∣ f - f_{Q} ∣} = 1 + c \int_{0}^{\infty} e^{c λ} \fint_{Q} χ_{{∣ f - f_{Q} ∣ > λ}} d λ$ . Choose $c < C_{2}$ .

Answer

For $c > 0$ , the distribution-function identity gives $\fint_{Q} e^{c ∣ f - f_{Q} ∣} d x = 1 + \frac{c}{∣ Q ∣} \int_{0}^{\infty} e^{c λ} {x \in Q : ∣ f - f_{Q} ∣ > λ} d λ \leq 1 + c C_{1} \int_{0}^{\infty} e^{(c - C_{2}) λ} d λ,$ using John-Nirenberg with $∥ f ∥_{*} \leq 1$ . If $c < C_{2}$ the integral converges to $(C_{2} - c)^{- 1}$ , so $\fint_{Q} e^{c ∣ f - f_{Q} ∣} d x \leq 1 + \frac{c C _{1}}{C _{2} - c} .$ Choosing $c = c_{n}$ small enough that $c_{n} C_{1} \leq C_{2} - c_{n}$ — for instance $c_{n} = C_{2} / (1 + C_{1})$ — makes the second term at most $1$ , giving the bound $2$ . This is the cleanest face of John-Nirenberg: BMO functions are locally exponentially integrable, the Orlicz-space statement $f - f_{Q} \in exp (L)$ uniformly over cubes, which is the substitute for the boundedness that $L^{\infty}$ would give.

Exercise 6 (medium, symbolic).

Let $K$ be a Calderón-Zygmund kernel and $T$ the associated operator, bounded on $L^{2}$ with the Hörmander kernel condition 02.19.02. Sketch the proof that $T : L^{\infty} (R^{n}) \to BMO (R^{n})$ is bounded, i.e. $∥ T f ∥_{*} \leq C ∥ f ∥_{\infty}$ .

Hint

Fix a cube $Q$ with centre $x_{Q}$ . Split $f = f χ_{2 Q} + f χ_{(2 Q)^{c}}$ . For the local part use $L^{2}$ -boundedness and Cauchy-Schwarz; for the far part subtract the constant $T (f χ_{(2 Q)^{c}}) (x_{Q})$ and use the Hörmander condition.

Answer

Fix a cube $Q$ with centre $x_{Q}$ and side $ℓ$ . By definition of $∥ \cdot ∥_{*}$ it suffices to find, for each $Q$ , a constant $c_{Q}$ with $\fint_{Q} ∣ T f - c_{Q} ∣ \leq C ∥ f ∥_{\infty}$ (since $\fint_{Q} ∣ T f - (T f)_{Q} ∣ \leq 2 \fint_{Q} ∣ T f - c_{Q} ∣$ ). Write $f = f_{1} + f_{2}$ with $f_{1} = f χ_{2 Q}$ , $f_{2} = f χ_{(2 Q)^{c}}$ , and choose $c_{Q} = (T f_{2}) (x_{Q})$ .

Local part. By Cauchy-Schwarz and the $L^{2}$ -boundedness of $T$ , $\fint_{Q} ∣ T f_{1} ∣ \leq (\fint_{Q} ∣ T f_{1} ∣^{2})^{1/2} \leq ∣ Q ∣^{- 1/2} ∥ T f_{1} ∥_{2} \leq A ∣ Q ∣^{- 1/2} ∥ f_{1} ∥_{2} \leq A ∣ Q ∣^{- 1/2} ∥ f ∥_{\infty} ∣2 Q ∣^{1/2} = 2^{n /2} A ∥ f ∥_{\infty} .$

Far part. For $x \in Q$ and $y \in (2 Q)^{c}$ one has $∣ x - y ∣ > 2∣ x - x_{Q} ∣$ (since $∣ x - x_{Q} ∣ \leq n ℓ /2$ while $∣ x_{Q} - y ∣ \geq ℓ$ ), so the Hörmander condition applies: $∣ T f_{2} (x) - T f_{2} (x_{Q}) ∣ \leq \int_{(2 Q)^{c}} ∣ K (x, y) - K (x_{Q}, y) ∣ ∣ f (y) ∣ d y \leq ∥ f ∥_{\infty} \int_{∣ x_{Q} - y ∣ > 2∣ x - x_{Q} ∣} ∣ K (x, y) - K (x_{Q}, y) ∣ d y \leq B ∥ f ∥_{\infty} .$ Averaging over $x \in Q$ , $\fint_{Q} ∣ T f_{2} - c_{Q} ∣ \leq B ∥ f ∥_{\infty}$ . Adding the two parts, $\fint_{Q} ∣ T f - c_{Q} ∣ \leq (2^{n /2} A + B) ∥ f ∥_{\infty}$ uniformly in $Q$ , so $∥ T f ∥_{*} \leq 2 (2^{n /2} A + B) ∥ f ∥_{\infty}$ . This is the endpoint that replaces the false statement $T : L^{\infty} \to L^{\infty}$ : Calderón-Zygmund operators do not preserve $L^{\infty}$ , but they map it into $BMO$ .

Exercise 7 (hard, symbolic).

Prove the Fefferman-Stein inequality in the form: for $1 < p < \infty$ and $f$ with $f \in L^{p_{0}}$ for some $p_{0} < \infty$ , $∥ f ∥_{L^{p}} \leq C_{n, p} ∥ M^{#} f ∥_{L^{p}},$ assuming the good- $λ$ inequality $∣ {M f > 2 λ, M^{#} f \leq γ λ} ∣ \leq C γ ∣ {M f > λ} ∣$ for small $γ > 0$ , where $M$ is the Hardy-Littlewood maximal operator 02.19.01.

Hint

Integrate the good- $λ$ inequality against $p λ^{p - 1} d λ$ to get $∥ M f ∥_{p} \leq C ∥ M^{#} f ∥_{p}$ after absorbing a small term, then use $∣ f ∣ \leq M f$ a.e. (Lebesgue points).

Answer

Let $Λ (λ) = ∣ {M f > λ} ∣$ . Splitting the level set, $Λ (2 λ) \leq ∣ {M f > 2 λ, M^{#} f \leq γ λ} ∣ + ∣ {M^{#} f > γ λ} ∣ \leq C γ Λ (λ) + ∣ {M^{#} f > γ λ} ∣.$ Multiply by $p (2 λ)^{p - 1} \cdot 2$ and integrate in $λ$ over $(0, \infty)$ ; since $f \in L^{p_{0}}$ ensures $M f \in L^{p_{0}}$ and the layer-cake integrals are finite (a truncation argument makes this rigorous, integrating to $N$ and letting $N \to \infty$ ), $∥ M f ∥_{L^{p}}^{p} = p \int_{0}^{\infty} (2 λ)^{p - 1} Λ (2 λ) 2 d λ \leq 2^{p} C γ ∥ M f ∥_{L^{p}}^{p} + \frac{2 ^{p}}{γ ^{p}} p \int_{0}^{\infty} (γ λ)^{p - 1} ∣ {M^{#} f > γ λ} ∣ d λ .$ The last integral equals $γ^{- 1} ∥ M^{#} f ∥_{L^{p}}^{p}$ after the substitution $μ = γ λ$ . Choosing $γ$ so small that $2^{p} C γ \leq 1/2$ , the first term on the right is absorbed into the left: $\frac{1}{2} ∥ M f ∥_{L^{p}}^{p} \leq \frac{2 ^{p}}{γ ^{p + 1}} ∥ M^{#} f ∥_{L^{p}}^{p}, so ∥ M f ∥_{L^{p}} \leq C_{n, p} ∥ M^{#} f ∥_{L^{p}} .$ Finally $∣ f (x) ∣ \leq M f (x)$ at every Lebesgue point of $f$ 02.19.01, which is a.e., so $∥ f ∥_{L^{p}} \leq ∥ M f ∥_{L^{p}} \leq C_{n, p} ∥ M^{#} f ∥_{L^{p}}$ . The reverse $∥ M^{#} f ∥_{p} \leq 2∥ M f ∥_{p} \leq C ∥ f ∥_{p}$ is immediate from $M^{#} f \leq 2 M f$ , giving the equivalence $∥ f ∥_{L^{p}} \approx ∥ M^{#} f ∥_{L^{p}}$ .

Exercise 8 (hard, symbolic).

Use the John-Nirenberg inequality to prove the interpolation statement that if a linear operator $T$ is bounded on $L^{p_{0}}$ ( $1 < p_{0} < \infty$ ) and bounded from $L^{\infty}$ into $BMO$ , then $T$ is bounded on $L^{p}$ for all $p_{0} \leq p < \infty$ (the Stampacchia/Fefferman-Stein $L^{p}$ - $BMO$ interpolation).

Hint

Use the sharp-maximal control $∥ T f ∥_{p} \leq C ∥ M^{#} (T f) ∥_{p}$ from Exercise 7, the pointwise bound $M^{#} (T f) \leq ∥ T f ∥_{*}$ -type estimates, and interpolate the operator $f \mapsto M^{#} (T f)$ between the $L^{p_{0}}$ and the $L^{\infty} \to L^{\infty}$ (via BMO) endpoints by Marcinkiewicz 02.07.06.

Answer

Consider the sublinear operator $S f = M^{#} (T f)$ . It has two endpoint bounds. At $p_{0}$ : by Exercise 7's reverse inequality and the $L^{p_{0}}$ -boundedness of $T$ , $∥ S f ∥_{L^{p_{0}}} = ∥ M^{#} (T f) ∥_{L^{p_{0}}} \leq 2∥ M (T f) ∥_{L^{p_{0}}} \leq C ∥ T f ∥_{L^{p_{0}}} \leq C ∥ f ∥_{L^{p_{0}}}$ , so $S$ is of strong (hence weak) type $(p_{0}, p_{0})$ . At $\infty$ : by hypothesis $∥ T f ∥_{*} \leq C ∥ f ∥_{\infty}$ , and $∥ M^{#} (T f) ∥_{\infty} = ∥ T f ∥_{*}$ by the very definition of the sharp maximal function, so $∥ S f ∥_{L^{\infty}} \leq C ∥ f ∥_{L^{\infty}}$ , i.e. $S$ is of strong type $(\infty, \infty)$ .

Marcinkiewicz interpolation 02.07.06 applied to the sublinear $S$ between the weak- $(p_{0}, p_{0})$ and weak- $(\infty, \infty)$ endpoints yields $∥ S f ∥_{L^{p}} \leq C_{p} ∥ f ∥_{L^{p}}$ for all $p_{0} < p < \infty$ . Now invoke the forward Fefferman-Stein inequality (Exercise 7): for $f \in L^{p_{0}} \cap L^{p}$ , $∥ T f ∥_{L^{p}} \leq C_{n, p} ∥ M^{#} (T f) ∥_{L^{p}} = C_{n, p} ∥ S f ∥_{L^{p}} \leq C_{n, p} C_{p} ∥ f ∥_{L^{p}} .$ By density of $L^{p_{0}} \cap L^{p}$ in $L^{p}$ this extends to all of $L^{p}$ , proving $T$ is bounded on $L^{p}$ for $p_{0} \leq p < \infty$ . The role of John-Nirenberg is hidden but essential: it is what makes $M^{#}$ control the full $L^{p}$ -norm (Exercise 7), so that the BMO endpoint is a genuine substitute for an $L^{\infty}$ endpoint in the interpolation.

Advanced results Master

Theorem 1 (John-Nirenberg, exponential form and self-improvement; John-Nirenberg 1961 Comm. Pure Appl. Math. 14, 415). A locally integrable $f$ lies in $BMO$ if and only if there exist constants $c, C > 0$ with $\fint_{Q} exp (c ∣ f - f_{Q} ∣/∥ f ∥_{*}) \leq C$ for all cubes $Q$ . Consequently the seminorms $∥ f ∥_{*, p}$ for $1 \leq p < \infty$ are all comparable, and the BMO condition stated with $L^{1}$ -oscillation self-improves to $L^{p}$ - and to exponential-oscillation with no enlargement of the space. The constant $c$ is the reciprocal of the John-Nirenberg decay rate $C_{2}$ ^{[John-Nirenberg 1961]}.

Theorem 2 (Fefferman-Stein sharp maximal theorem; Fefferman-Stein 1972 Acta Math. 129, 137). For $1 < p < \infty$ and $f$ in a suitable dense class, $∥ f ∥_{L^{p}} \approx ∥ M^{#} f ∥_{L^{p}}$ with constants depending only on $n, p$ . The sharp maximal function thereby characterizes $L^{p}$ through pure mean-oscillation data, and $BMO = {M^{#} f \in L^{\infty}}$ is the $p \to \infty$ endpoint of the family ${M^{#} f \in L^{p}}$ . This is the precise sense in which $BMO$ replaces $L^{\infty}$ at the top of the scale ^{[Fefferman-Stein 1972]}.

Theorem 3 (Calderón-Zygmund operators on the endpoints). Every Calderón-Zygmund operator $T$ maps $L^{\infty} \to BMO$ boundedly, and by the duality of Theorem 4 its (formal) adjoint maps $H^{1} \to L^{1}$ boundedly. The pair $(L^{\infty} \to BMO, H^{1} \to L^{1})$ is the correct endpoint replacement for the failed $L^{\infty} \to L^{\infty}$ and $L^{1} \to L^{1}$ bounds, and interpolating either with the $L^{2}$ bound recovers the full $L^{p}$ range $1 < p < \infty$ ^{[Stein 1993]}.

Theorem 4 (Fefferman duality $(H^1)^ = \mathrm{BMO}$; Fefferman 1971 Bull. AMS 77, 587; Fefferman-Stein 1972 Acta Math. 129, 137).* The dual of the real Hardy space $H^{1} (R^{n})$ is $BMO (R^{n})$ : every $φ \in BMO$ defines a bounded linear functional on $H^{1}$ by $⟨ f, φ ⟩ = \int f φ$ (suitably interpreted on atoms), with $∥ φ ∥_{(H^{1})^{*}} \approx ∥ φ ∥_{*}$ , and every bounded functional on $H^{1}$ arises this way. The pairing is well defined because the atomic structure of $H^{1}$ (mean-zero, $L^{2}$ -normalized bumps supported on cubes) is exactly matched by the John-Nirenberg integrability of BMO, so that $\int a φ = \int a (φ - φ_{Q})$ converges absolutely by Cauchy-Schwarz ^{[Fefferman-Stein 1972]}.

Theorem 5 (Carleson-measure characterization). A function $φ \in L_{loc}^{2}$ lies in $BMO$ if and only if $∣\nabla u (x, t) ∣^{2} t d x d t$ is a Carleson measure on the upper half-space, where $u (x, t) = (P_{t} * φ) (x)$ is the Poisson extension: $sup_{Q} \frac{1}{∣ Q ∣} \int_{0}^{ℓ (Q)} \int_{Q} ∣\nabla u ∣^{2} t d x d t < \infty$ . This analytic characterization, equivalent to membership in $BMO$ up to constants, is the route by which $BMO$ enters the theory of the $\overset{ˉ}{\partial}$ -equation and the corona problem, and it is the form in which Fefferman first proved the duality ^{[Fefferman 1971]}.

Theorem 6 (commutators; Coifman-Rochberg-Weiss). For a Calderón-Zygmund operator $T$ and $b \in BMO$ , the commutator $[b, T] f = b T f - T (b f)$ is bounded on $L^{p}$ for $1 < p < \infty$ , with $∥ [b, T] ∥_{L^{p} \to L^{p}} \leq C ∥ b ∥_{*}$ ; conversely, for $T$ a Riesz transform, boundedness of $[b, T]$ on some $L^{p}$ forces $b \in BMO$ . The commutator thereby gives an operator-theoretic characterization of $BMO$ , and the proof runs through the John-Nirenberg exponential integrability to control the product $b \cdot T f$ ^{[Stein 1993]}.

Synthesis. The space $BMO$ is the foundational reason that the failure of Calderón-Zygmund operators to preserve $L^{\infty}$ and $L^{1}$ is a feature: $BMO$ and its predual $H^{1}$ are the correct endpoint spaces, and this is exactly what the four characterizations — mean-oscillation, sharp maximal, Carleson measure, commutator — assert from four directions. The central insight is the self-improvement engine of John-Nirenberg: a single $L^{1}$ -averaged oscillation bound, fed through the self-similar stopping-time recursion of 02.19.02, upgrades to exponential integrability, and this is exactly what makes the pairing $\int a φ$ converge against the $L^{2}$ -normalized atoms of $H^{1}$ . Putting these together, the picture generalises in three directions: vertically, $BMO$ caps the $L^{p}$ scale as the $p \to \infty$ endpoint via the Fefferman-Stein sharp maximal theorem (Theorem 2), bracketing the whole scale $1 \leq p \leq \infty$ by the dual pair $(H^{1}, BMO)$ ; horizontally, the Carleson-measure characterization (Theorem 5) ports the cube-oscillation picture to the half-space and is dual to the tent-space theory; structurally, the commutator characterization (Theorem 6) reveals $BMO$ as the symbol class for which multiplication interacts boundedly with singular integration. The bridge unifying them is the John-Nirenberg inequality: each equivalence restates that bounded mean oscillation is secretly exponential control, dual atom by atom to the cancellation built into $H^{1}$ .

Full proof set Master

Proposition 1 ( $∥ \cdot ∥_{*}$ is a seminorm with null space the constants). $∥ \cdot ∥_{*}$ is subadditive and absolutely homogeneous, and $∥ f ∥_{*} = 0$ if and only if $f$ is a.e. equal to a constant.

Proof. Absolute homogeneity is clear from $∣ (c f) - (c f)_{Q} ∣ = ∣ c ∣ ∣ f - f_{Q} ∣$ . For subadditivity, $(f + g)_{Q} = f_{Q} + g_{Q}$ , so $∣ (f + g) - (f + g)_{Q} ∣ \leq ∣ f - f_{Q} ∣ + ∣ g - g_{Q} ∣$ pointwise; averaging and taking the supremum over $Q$ gives $∥ f + g ∥_{*} \leq ∥ f ∥_{*} + ∥ g ∥_{*}$ . If $f \equiv c$ then $f_{Q} = c$ and the oscillation vanishes, so $∥ f ∥_{*} = 0$ . Conversely if $∥ f ∥_{*} = 0$ then $\fint_{Q} ∣ f - f_{Q} ∣ = 0$ for every cube, so $f = f_{Q}$ a.e. on each $Q$ ; since the constants $f_{Q}$ must agree on overlapping cubes, $f$ is a.e. a single constant. $□$

Proposition 2 ( $BMO$ is complete modulo constants). The quotient $BMO / C$ with seminorm $∥ \cdot ∥_{*}$ is a Banach space.

Proof. Let $(f_{k})$ be Cauchy in $∥ \cdot ∥_{*}$ ; normalize representatives so that $(f_{k})_{Q_{0}} = 0$ for a fixed unit cube $Q_{0}$ . For any fixed cube $Q$ , the averages $\fint_{Q} ∣ f_{j} - f_{k} ∣ \leq \fint_{Q} ∣ (f_{j} - f_{k}) - (f_{j} - f_{k})_{Q} ∣ + ∣ (f_{j} - f_{k})_{Q} ∣$ ; the first term is $\leq ∥ f_{j} - f_{k} ∥_{*}$ , and chaining $Q$ to $Q_{0}$ through finitely many overlapping cubes bounds $∣ (f_{j} - f_{k})_{Q} ∣$ by $C_{Q} ∥ f_{j} - f_{k} ∥_{*}$ . Hence $(f_{k})$ is Cauchy in $L^{1} (Q)$ for every $Q$ , so it converges in $L_{loc}^{1}$ to some $f$ . Lower semicontinuity of $\fint_{Q} ∣ \cdot - (\cdot)_{Q} ∣$ under $L^{1} (Q)$ -convergence gives $∥ f - f_{k} ∥_{*} \leq lim inf_{m} ∥ f_{m} - f_{k} ∥_{*} \to 0$ , so $f \in BMO$ and $f_{k} \to f$ . $□$

Proposition 3 (one Calderón-Zygmund step recursion, restated). With $Φ (λ) = sup_{Q} \frac{1}{∣ Q ∣} ∣ {x \in Q : ∣ f - f_{Q} ∣ > λ} ∣$ and $∥ f ∥_{*} = 1$ , one has $Φ (λ + 2^{2 n}) \leq 2^{- n} Φ (λ)$ for all $λ \geq 0$ .

Proof. This is Steps 1-3 of the Key Theorem. The decomposition of $g = (f - f_{Q}) χ_{Q}$ at height $2^{n}$ produces disjoint dyadic subcubes ${Q_{j}}$ with $\sum_{j} ∣ Q_{j} ∣ \leq 2^{- n} ∣ Q ∣$ and $∣ f_{Q_{j}} - f_{Q} ∣ \leq 2^{2 n}$ , off which $∣ f - f_{Q} ∣ \leq 2^{n}$ a.e. Hence ${x \in Q : ∣ f - f_{Q} ∣ > λ + 2^{2 n}} \subseteq ⋃_{j} {x \in Q_{j} : ∣ f - f_{Q_{j}} ∣ > λ}$ up to null sets, and summing the per-cube bound $Φ (λ) ∣ Q_{j} ∣$ yields the contraction. $□$

Proposition 4 (exponential decay from the recursion). $Φ (λ) \leq 2^{n} e^{- C_{2} λ}$ with $C_{2} = n 2^{- 2 n} lo g 2$ .

Proof. From $Φ \leq 1$ and Proposition 3, induction gives $Φ (k 2^{2 n}) \leq 2^{- nk}$ . For general $λ$ , write $k = ⌊ λ / 2^{2 n} ⌋$ and use monotonicity: $Φ (λ) \leq Φ (k 2^{2 n}) \leq 2^{- nk} \leq 2^{n} 2^{- nλ / 2^{2 n}} = 2^{n} e^{- C_{2} λ}$ . $□$

Proposition 5 ( $lo g ∣ x ∣ \in BMO$ , sharpness). $lo g ∣ x ∣ \in BMO (R^{n})$ , while $∣ x ∣^{ε} \in / BMO$ for any $ε > 0$ .

Proof. Membership is Exercise 3. For sharpness, test $f (x) = ∣ x ∣^{ε}$ on cubes $Q_{R}$ of side $R$ at the origin: $f_{Q_{R}} \sim R^{ε}$ and $\fint_{Q_{R}} ∣ f - f_{Q_{R}} ∣ \sim R^{ε} \to \infty$ as $R \to \infty$ , so $∥ f ∥_{*} = \infty$ . The dilation $f (R x) = R^{ε} f (x)$ scales the oscillation by $R^{ε}$ , which is bounded over scales only when $ε = 0$ ; the logarithm sits exactly at $ε = 0$ , where the dilation acts by an additive constant the seminorm ignores. $□$

Proposition 6 (sharp maximal pointwise bounds). $M^{#} f \leq 2 M f$ pointwise, and $∥ f ∥_{*} = ∥ M^{#} f ∥_{L^{\infty}}$ .

Proof. For a cube $Q ∋ x$ , $\fint_{Q} ∣ f - f_{Q} ∣ \leq \fint_{Q} ∣ f ∣ + ∣ f_{Q} ∣ \leq 2 \fint_{Q} ∣ f ∣ \leq 2 M f (x)$ after replacing the cube by a comparable centred ball; taking the supremum over $Q ∋ x$ gives $M^{#} f (x) \leq 2 M f (x)$ . The identity $∥ f ∥_{*} = ∥ M^{#} f ∥_{\infty}$ is immediate: $∥ M^{#} f ∥_{\infty} = ess sup_{x} sup_{Q ∋ x} \fint_{Q} ∣ f - f_{Q} ∣ = sup_{Q} \fint_{Q} ∣ f - f_{Q} ∣ = ∥ f ∥_{*}$ , since every cube contains some point and the essential supremum over $x$ of a quantity depending only on the cubes through $x$ recovers the supremum over all cubes. $□$

Connections Master

The Calderón-Zygmund decomposition 02.19.02. The direct prerequisite and the engine of the John-Nirenberg proof. One stopping-time step at height $2^{n}$ on $(f - f_{Q}) χ_{Q}$ produces the disjoint subcubes and the measure contraction $2^{- n}$ that make the distributional recursion self-similar; iterating that single decomposition is exactly what converts bounded mean oscillation into exponential decay, so the John-Nirenberg inequality is the decomposition applied to itself across scales.
Hardy-Littlewood maximal function and the Vitali covering lemma 02.19.01. The direct prerequisite supplying the sharp maximal function's domination $M^{#} f \leq 2 M f$ , the Lebesgue differentiation theorem used to control the good part off the selected cubes, and the pointwise bound $∣ f ∣ \leq M f$ a.e. that closes the Fefferman-Stein equivalence $∥ f ∥_{L^{p}} \approx ∥ M^{#} f ∥_{L^{p}}$ .
$L^{p}$ spaces, Hölder, Minkowski, Riesz-Fischer completeness 02.07.06. The direct prerequisite carrying the layer-cake distribution-function calculus behind the $L^{p}$ -BMO seminorm comparison and the exponential-integrability corollary, the Cauchy-Schwarz estimate in the $L^{\infty} \to BMO$ proof, and the Marcinkiewicz interpolation that turns the $L^{p}$ - $BMO$ endpoints into boundedness on the full range.
Singular integral operators and the Hilbert transform [forward: 02.19.03]. The principal application. Calderón-Zygmund operators fail to preserve $L^{\infty}$ , and $BMO$ is precisely the space into which they map it; the commutator $[b, T]$ is $L^{p}$ -bounded exactly when $b \in BMO$ , making BMO the natural symbol class for singular-integral multiplication.
Hardy spaces and atomic decomposition [forward: 02.20.02]. The dual partner. The Fefferman duality $(H^{1})^{*} = BMO$ pairs the mean-zero $L^{2}$ -normalized atoms of $H^{1}$ against BMO functions; the absolute convergence of the pairing $\int a (φ - φ_{Q})$ is supplied by the John-Nirenberg integrability proved here.
Littlewood-Paley theory and square functions [forward: 02.20.03]. The analytic face. The Carleson-measure characterization expresses $∥ φ ∥_{*}$ through the Littlewood-Paley square function of the Poisson extension, embedding BMO into the tent-space and square-function framework that organises the rest of this chapter.

Historical & philosophical context Master

The space $BMO$ and the inequality bearing their names were introduced by Fritz John and Louis Nirenberg in their 1961 Communications on Pure and Applied Mathematics paper On functions of bounded mean oscillation ^{[John-Nirenberg 1961]}. John had arrived at the mean-oscillation condition through his work on elasticity and quasi-isometric (bi-Lipschitz) deformations, where the logarithm of the Jacobian of such a map is the prototypical bounded-mean-oscillation function; the exponential integrability they proved was the tool needed to control such deformations. Their original argument is the Calderón-Zygmund stopping-time iteration reproduced above, run on the level sets of the oscillation.

The decisive structural role of $BMO$ in harmonic analysis emerged a decade later. Charles Fefferman announced the duality $(H^{1})^{*} = BMO$ in his 1971 Bulletin of the American Mathematical Society note Characterizations of bounded mean oscillation ^{[Fefferman 1971]}, identifying the dual of the real Hardy space — until then an analytically awkward object — with the concrete and computable space of mean oscillation, and supplying the Carleson-measure characterization through the Poisson extension. The full theory, including the sharp maximal function and the systematic use of BMO as the $L^{\infty}$ -substitute endpoint for singular integrals, was developed by Fefferman and Elias Stein in their 1972 Acta Mathematica paper $H^{p}$ spaces of several variables ^{[Fefferman-Stein 1972]}. The commutator characterization, completing the operator-theoretic picture, was established by Ronald Coifman, Richard Rochberg, and Guido Weiss in 1976. Within this lineage the John-Nirenberg inequality is the technical keystone: it is the self-improvement that makes mean oscillation strong enough to serve as a dual space and as an interpolation endpoint.

Bibliography Master

@article{JohnNirenberg1961,
  author  = {John, Fritz and Nirenberg, Louis},
  title   = {On functions of bounded mean oscillation},
  journal = {Communications on Pure and Applied Mathematics},
  volume  = {14},
  year    = {1961},
  pages   = {415--426}
}

@article{Fefferman1971,
  author  = {Fefferman, Charles},
  title   = {Characterizations of bounded mean oscillation},
  journal = {Bulletin of the American Mathematical Society},
  volume  = {77},
  year    = {1971},
  pages   = {587--588}
}

@article{FeffermanStein1972,
  author  = {Fefferman, Charles and Stein, Elias M.},
  title   = {$H^p$ spaces of several variables},
  journal = {Acta Mathematica},
  volume  = {129},
  year    = {1972},
  pages   = {137--193}
}

@article{CoifmanRochbergWeiss1976,
  author  = {Coifman, Ronald R. and Rochberg, Richard and Weiss, Guido},
  title   = {Factorization theorems for Hardy spaces in several variables},
  journal = {Annals of Mathematics},
  volume  = {103},
  year    = {1976},
  pages   = {611--635}
}

@book{Stein1993,
  author    = {Stein, Elias M.},
  title     = {Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Integrals},
  publisher = {Princeton University Press},
  year      = {1993}
}

@book{Grafakos2014Modern,
  author    = {Grafakos, Loukas},
  title     = {Modern Fourier Analysis},
  edition   = {3},
  publisher = {Springer},
  year      = {2014}
}

@book{Duoandikoetxea2001,
  author    = {Duoandikoetxea, Javier},
  title     = {Fourier Analysis},
  publisher = {American Mathematical Society},
  year      = {2001}
}

Prerequisites

02.19.02
02.19.01
02.07.06

Tier anchors

beginner: Stein-Shakarchi 2005 *Real Analysis* (Princeton) Ch. 3; informal averaged-oscillation picture of $\log|x|$
intermediate: Stein 1993 *Harmonic Analysis* (Princeton) Ch. IV §1; Grafakos 2014 *Modern Fourier Analysis* 3e (Springer) §7.1
master: Stein 1993 *Harmonic Analysis* (Princeton) Ch. IV; Grafakos 2014 *Modern Fourier Analysis* 3e (Springer) §7.1-7.4; Duoandikoetxea 2001 *Fourier Analysis* (AMS) §6

References

John-Nirenberg — On functions of bounded mean oscillation · Comm. Pure Appl. Math. 14 (1961), 415-426
Fefferman-Stein — H^p spaces of several variables · Acta Mathematica 129 (1972), 137-193
Stein — Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Integrals · Ch. IV, §1-2, BMO and the John-Nirenberg inequality
Grafakos — Modern Fourier Analysis, 3e · §7.1-7.4, BMO, John-Nirenberg, H^1-BMO duality
Duoandikoetxea — Fourier Analysis · §6, the space BMO
Fefferman — Characterizations of bounded mean oscillation · Bull. Amer. Math. Soc. 77 (1971), 587-588

Estimated time

beginner: 18m
intermediate: 55m
master: 90m