02.20.02 · analysis / littlewood-paley-interpolation

Real-Variable Hardy Spaces H^p and the Atomic Decomposition

shipped3 tiersLean: none

Anchor (Master): Stein 1993 *Harmonic Analysis* (Princeton) Ch. III-IV; Fefferman-Stein 1972 *Acta Math.* 129; Coifman-Weiss 1977 *Bull. AMS* 83; Grafakos 2014 *Modern Fourier Analysis* 3e §2.2-2.5, §6.4-6.7

Intuition Beginner

The space of absolutely-integrable functions — the ones whose total accumulated size is finite — is the natural home for a great deal of analysis. But it has a defect that shows up the moment you apply the most useful operations in the subject, the ones that recover a signal's twin or its derivatives from averaged data. Those operations take a perfectly fine integrable input and produce an output whose total size is infinite, by a hair: the output decays just slowly enough at infinity to escape being integrable. The integrable functions are not closed under the operations you most want to use.

Real Hardy space is the repair. It is a slightly smaller class of inputs, carved out by one extra demand: the function must have a kind of built-in balance. A bounded function automatically lives in the integrable world; a Hardy-space function must additionally average out to zero in a precise sense, and at the finest level it is glued together from tiny standardized pieces, each one a bump that integrates to zero. Because every piece is balanced, the dangerous slow tails that the useful operations produce get cancelled before they can accumulate to infinity.

The headline payoff: the operations that wreck the integrable class send Hardy space neatly back into the integrable class — and send Hardy space back into itself. The class of balanced inputs is exactly the right place to stand so that singular operations stay tame. And there is a dual side of the same coin: the functions that pair nicely against these balanced inputs are precisely the ones with bounded mean oscillation, the slowly-growing functions met in the previous unit.

The one-sentence takeaway: real Hardy space is the corrected version of the integrable functions — built from standardized balanced bumps — on which the singular operations of analysis stay bounded, exactly where the plain integrable class fails.

Visual Beginner

Picture a single building block: a bump supported in one small box, with equal area above and below the zero line, so its total signed area is zero. Stacking many such balanced bumps, scaled and shifted, with the sizes of the scalings adding up in a controlled way, reconstructs any Hardy-space function. The balance of each piece is the whole point; an unbalanced bump (all positive, like a plain spike) would not qualify.

The lower panel carries the punchline. A Hardy-space function is not measured by its raw height or its total size; it is measured by the cheapest way to write it as a balanced sum of these standardized boxed bumps, adding up the coefficient sizes. A plain integrable function can fail to have any such balanced decomposition, which is exactly why it can be wrecked by singular operations. A Hardy-space function, by construction, always does — and the operations act atom by atom, each balanced piece staying tame.

Worked example Beginner

We build the simplest possible balanced building block on the line and check the two defining properties by hand.

Step 1. Take the box to be the interval from to , of width . Define a bump that equals on the left half, from to , equals on the right half, from to , and equals everywhere outside the box.

Step 2. Check the balance (the zero-average property). The signed area is the value times the width on each half: on the left it is , on the right it is . Adding them, . The bump integrates to zero, so it is balanced.

Step 3. Check the size cap. A standardized building block on a box of width is required to be no taller than one divided by . Here , so the cap is one-half. But our bump has height , which is bigger than one-half. So itself is twice too tall; the genuine standardized block is divided by , namely the bump equal to on the left half and on the right half. That one is balanced and obeys the size cap.

Step 4. Read off the coefficient. To write the original as a coefficient times a standardized block, we use . The coefficient is . The cost we record for this one-piece decomposition is the size of the coefficient, namely .

What this tells us: every Hardy-space function is assembled from balanced boxed bumps like this standardized block, and the price of the assembly is the sum of the coefficient sizes. Balance plus a size cap are the only two demands on each piece. The plain absolute value used to measure integrable functions is replaced by this assembly cost, and the balance is what tames the singular operations.

Check your understanding Beginner

Formal definition Intermediate+

Throughout, unless stated otherwise; cubes have sides parallel to the axes, is Lebesgue measure, and . The Fourier transform is that of 02.10.04. Fix a Schwartz function with and write .

Definition (grand maximal function). For let . The grand maximal function of a tempered distribution is The non-tangential maximal function attached to a single is , and the *radial* (vertical) maximal function is .

Definition (real Hardy space, maximal characterization). For and large enough, the real Hardy space is This is a (quasi-)Banach space; for , is subadditive but is only a quasinorm. The Fefferman-Stein theorem (below) makes , , and define the same space with comparable (quasi)norms, so the choices of and are immaterial.

Definition (-atom). Let with , and set the moment order . A function is a -atom if there is a cube with:

  1. (Support.) .
  2. (Size.) .
  3. (Moments / cancellation.) for every multi-index with .

For the moment order is , so a -atom is a single-mean-zero bump with ; the case gives the height cap .

Definition (atomic Hardy space). The atomic Hardy space consists of all (convergence in ) with each a -atom and , normed by the infimum over all atomic representations.

Counterexamples to common slips Intermediate+

  • is not a space of functions for in the naïve sense; its elements are distributions whose maximal function is in . For even -functions need not be locally integrable, and the cancellation forces them to be genuinely oscillatory; treating as a subspace of is wrong — and for .

  • More vanishing moments are needed as decreases — a single mean-zero condition does not suffice below . The moment order grows without bound as . Using only for small breaks the atom-to-maximal estimate; the higher moments are essential.

  • The size normalization is tied to , not . Stating the size bound as only describes -atoms; the general -atom uses . All choices of give the same atomic space — a theorem, not a definition.

  • is not . Every function is in with , but the inclusion is strict: a single non-balanced bump (mean nonzero) is in . The defining difference is the global cancellation together with its maximal-function refinement.

Key theorem with proof Intermediate+

Theorem (atomic decomposition of ; Coifman 1974, Latter 1978). Fix and , . A tempered distribution lies in if and only if it admits a decomposition into -atoms with , and with constants depending only on . In particular all the atomic spaces () coincide with and with each other.

Proof. We prove the harder direction, the construction of atoms from the maximal-function definition (the ""), in the model case , ; the easy direction follows the Bridge. Assume with the radial maximal function; one may take with for the construction (the general distributional case is reached by approximation).

Step 1 (Calderón-Zygmund selection on the maximal function). For set the open level sets . Each has finite measure, , and . Apply a Whitney decomposition 02.19.03 to each , obtaining cubes with bounded overlap of their dilates , , and side length comparable to the distance to .

Step 2 (the bad/atomic split). Choose a partition of unity subordinate to with . Define the good part at level by freezing off and replacing it on each Whitney cube by the -weighted local mean, so that has . The telescoping difference is supported in , has integral zero on each cube, and — using that off and the bounded-overlap geometry — satisfies the pointwise bound .

Step 3 (atoms emerge). Write the difference as a sum over cubes of mean-zero bumps, Set and . Then is supported in the cube , has mean zero, and obeys — a -atom.

Step 4 (summability of coefficients). Since in as and as (because ), telescoping gives in , hence in . The coefficient sum is controlled by the maximal function: using (bounded overlap) and the level-set identity, Thus , which is the coefficient bound.

Bridge. This decomposition builds toward every boundedness theorem for , and the same atoms appear again in the Fefferman duality and in the continuity of Calderón-Zygmund operators. The foundational reason the construction works is that the Calderón-Zygmund stopping-time selection of 02.19.03, run on the level sets of the maximal function rather than on itself, manufactures the cancellation by hand: each telescoped difference is forced to have mean zero on every Whitney cube, so the atoms are balanced by fiat and their coefficient sum is exactly the layer-cake integral of . This is exactly the dual mechanism to the John-Nirenberg recursion of 02.20.01: there the stopping time upgraded mean oscillation to exponential integrability, here it disassembles a maximal-function bound into balanced unit pieces, and putting these together the pairing converges because the atom's cancellation meets the function's John-Nirenberg integrability. The easy direction — that any atomic sum has — is dual to this and generalises the single-atom estimate , the central insight being that one normalized atom has -quasinorm bounded by a constant, after which subadditivity of assembles the sum.

Exercises Intermediate+

Advanced results Master

Theorem 1 (maximal-function characterizations coincide; Fefferman-Stein 1972 Acta Math. 129, 137; Burkholder-Gundy-Silverstein 1971). For and sufficiently large, the grand, non-tangential, and radial maximal functions all define the same space, with constants depending only on . The radial maximal function alone suffices: a tempered distribution with already has all the stronger maximal functions in . This is the theorem that makes the maximal definition of robust [Fefferman-Stein 1972].

Theorem 2 (atomic decomposition; Coifman 1974 Studia Math. 51, 269; Latter 1978 Studia Math. 62, 93). For and any , , the atomic space equals with comparable (quasi)norms, and the moment order is sharp: -atoms with fewer vanishing moments do not generate . The independence from means one may always reduce to the most convenient atom size, or [Latter 1978].

Theorem 3 (Calderón-Zygmund operators on the endpoints). A Calderón-Zygmund operator bounded on with Hörmander kernel maps boundedly, and under the additional cancellation (i.e. maps atoms to atoms in a suitable sense) maps for . The pair is the correct endpoint replacement for the failed and bounds; interpolating with recovers for [Stein 1993].

Theorem 4 (Fefferman duality $(H^1)^ = \mathrm{BMO}$; Fefferman 1971; Fefferman-Stein 1972 Acta Math. 129, 137).* The dual of is : every defines a bounded functional on (interpreted via atoms) with , and every bounded functional on arises this way. The pairing converges because the atom's cancellation meets the John-Nirenberg integrability of 02.20.01, so is absolutely convergent [Fefferman-Stein 1972].

Theorem 5 (interpolation: the scale). The complex and real interpolation spaces between and (, with read as ) are again Hardy spaces: with . In particular interpolation between and reproduces the full scale , placing and as the genuine endpoints below and above [Fefferman-Stein 1972].

Theorem 6 (Coifman-Weiss spaces of homogeneous type; Coifman-Weiss 1977 Bull. AMS 83, 569). The atomic definition of requires only a quasimetric and a doubling measure: on any space of homogeneous type the atomic Hardy space is well defined for near , the - duality persists, and Calderón-Zygmund operators remain bounded. This abstraction reveals that the Euclidean Fourier-analytic apparatus is inessential: cancellation against a doubling geometry is the true content [Coifman-Weiss 1977].

Synthesis. The atomic decomposition is the foundational reason that real Hardy space behaves well under singular operations: it disassembles every element into balanced unit pieces, and this is exactly the structural device — Calderón-Zygmund stopping-time selection of 02.19.03 run on the maximal function's level sets — that converts a single maximal-function bound into a sum of mean-zero atoms with controlled coefficients. The central insight is a chain of dualities centered on cancellation. Putting these together, the four characterizations (grand maximal, non-tangential, atomic, Riesz-transform) are the same object seen from four directions, and the equivalence is what lets each boundedness proof choose its most convenient face: atoms for (Theorem 3), the maximal function for interpolation (Theorem 5). This is dual to the BMO theory of 02.20.01: the John-Nirenberg exponential integrability there is exactly what makes the atomic pairing converge here, so is the precise statement that atoms and oscillation are paired endpoints. The bridge unifying the picture is that and bracket the Lebesgue scale: interpolating between them generalises Plancherel into the continuum , and the whole edifice ports — by Coifman-Weiss (Theorem 6) — to any doubling geometry, revealing that the Euclidean Fourier transform was never the point; the central insight is that cancellation against a doubling measure is.

Full proof set Master

Proposition 1 ( for ). For and the radial maximal function, .

Proof. For : is not quite right because need not be radial-decreasing; instead dominate by a radial-decreasing majorant, giving pointwise. Since is bounded on for 02.19.03, . Conversely, in and a.e. as (approximate identity), so at Lebesgue points, whence .

Proposition 2 (a single atom has bounded -quasinorm). For and a -atom with the required vanishing moments, .

Proof. Let be supported in , , with vanishing moments up to order . Split , . On : , and by Hölder with exponent and the -boundedness of , Off : a Taylor expansion of to order , killed against the vanishing moments, yields after the size normalization; since , the exponent makes integrable off with . Adding, . The vanishing-moment order is exactly the smallest making the far tail -integrable.

Proposition 3 (atomic sums converge in ). If and each is a -atom, then converges in and .

Proof. By -subadditivity of the maximal functional (the maximal operator is sublinear, and is subadditive for ), pointwise in the sense that . Integrating and using Proposition 2, Convergence in follows because partial sums are Cauchy in and continuously. This is the easy direction; Theorem 2 supplies the converse.

Proposition 4 (). with , and the inclusion is strict.

Proof. For , a.e. (Lebesgue points of the approximate identity), so . Strictness: take on , a nonnegative bump with . Then for large , at the optimal scale , so , which is not integrable on . Hence . The obstruction is precisely the nonvanishing mean: forces a non-integrable maximal tail.

Proposition 5 (necessity of for ). If then .

Proof. Since (Proposition 4), is continuous and . For , ; choosing with , the bound with shows forces the mean of to vanish: were , the tail estimate of Proposition 4 gives , contradiction. Hence .

Proposition 6 (BMO functionals are well-defined on ). For the functional is well-defined and bounded on with .

Proof. This is Exercise 8: on finite -atomic sums , by the pairing bound (Exercise 6: using the atom size and John-Nirenberg -oscillation 02.20.01). Infimizing over representations and using (Theorem 2) gives ; density of finite atomic sums extends to all of . Representation-independence follows from the uniform bound.

Connections Master

  • BMO and the John-Nirenberg inequality 02.20.01. The dual partner and direct prerequisite. The Fefferman duality pairs the mean-zero atoms of against functions, and the absolute convergence of the pairing is supplied exactly by the John-Nirenberg - and exponential-oscillation integrability proved there; the BMO unit's Theorem 4 and this unit's Theorem 4 are the two faces of one duality.

  • Calderón-Zygmund singular integral operators 02.19.03. The direct prerequisite and the engine of the atomic decomposition. The Whitney/stopping-time selection that builds atoms from the maximal function is the Calderón-Zygmund decomposition run on level sets, and the same Hörmander kernel estimate that gives weak- there gives here — the endpoint that replaces the failed on which singular integrals are genuinely bounded.

  • spaces, Hölder, Minkowski, Riesz-Fischer completeness 02.07.06. The direct prerequisite carrying the layer-cake calculus behind the coefficient-sum bound, the Cauchy-Schwarz and Hölder estimates in the single-atom and pairing proofs, and the interpolation scaffolding that places and as the endpoints bracketing the scale via Theorem 5.

  • Fourier transform on and the Plancherel theorem 02.10.04. The direct prerequisite supplying the Riesz-transform symbols , the harmonic-conjugate (Stein-Weiss) system behind the Riesz characterization, and the mean-vanishing consequence that distinguishes from .

  • Littlewood-Paley theory and square functions 02.20.03. The analytic sibling. The square-function characterization of for the Littlewood-Paley square function — is the third major description of Hardy space, dual to the Carleson-measure characterization of and equivalent to both the maximal and atomic definitions developed here.

Historical & philosophical context Master

The Hardy spaces originate in complex analysis: G. H. Hardy and others studied, for the unit disc, the analytic functions whose boundary values have controlled means, and the conjugate-function theorems of M. Riesz and Kolmogorov gave the first and weak- bounds. The real-variable theory — freeing from analyticity and conjugate harmonic functions — was inaugurated by Elias Stein and Guido Weiss, who replaced the single analytic function by a system of conjugate harmonic functions (the Riesz-transform system) and defined for in higher dimensions through it.

The maximal-function characterization, which removed even the harmonic structure, was the achievement of Charles Fefferman and Elias Stein in their 1972 Acta Mathematica paper spaces of several variables [Fefferman-Stein 1972], building on the disc-model maximal theorem of Donald Burkholder, Richard Gundy, and Martin Silverstein [Burkholder-Gundy-Silverstein 1971]; the same paper established the duality . The atomic decomposition — reducing the entire space to balanced standardized pieces — was found by Ronald Coifman in dimension one [Coifman 1974] and extended to by Robert Latter [Latter 1978]. Ronald Coifman and Guido Weiss then showed that atoms require only a doubling quasimetric, defining on spaces of homogeneous type [Coifman-Weiss 1977] and severing the theory's last tie to the Fourier transform.

Bibliography Master

@article{FeffermanStein1972,
  author  = {Fefferman, Charles and Stein, Elias M.},
  title   = {$H^p$ spaces of several variables},
  journal = {Acta Mathematica},
  volume  = {129},
  year    = {1972},
  pages   = {137--193}
}

@article{BurkholderGundySilverstein1971,
  author  = {Burkholder, Donald L. and Gundy, Richard F. and Silverstein, Martin L.},
  title   = {A maximal function characterization of the class $H^p$},
  journal = {Transactions of the American Mathematical Society},
  volume  = {157},
  year    = {1971},
  pages   = {137--153}
}

@article{Coifman1974,
  author  = {Coifman, Ronald R.},
  title   = {A real variable characterization of $H^p$},
  journal = {Studia Mathematica},
  volume  = {51},
  year    = {1974},
  pages   = {269--274}
}

@article{Latter1978,
  author  = {Latter, Robert H.},
  title   = {A characterization of $H^p(\mathbb{R}^n)$ in terms of atoms},
  journal = {Studia Mathematica},
  volume  = {62},
  year    = {1978},
  pages   = {93--101}
}

@article{CoifmanWeiss1977,
  author  = {Coifman, Ronald R. and Weiss, Guido},
  title   = {Extensions of Hardy spaces and their use in analysis},
  journal = {Bulletin of the American Mathematical Society},
  volume  = {83},
  year    = {1977},
  pages   = {569--645}
}

@article{SteinWeiss1960,
  author  = {Stein, Elias M. and Weiss, Guido},
  title   = {On the theory of harmonic functions of several variables I},
  journal = {Acta Mathematica},
  volume  = {103},
  year    = {1960},
  pages   = {25--62}
}

@book{Stein1993,
  author    = {Stein, Elias M.},
  title     = {Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Integrals},
  publisher = {Princeton University Press},
  year      = {1993}
}

@book{Grafakos2014Modern,
  author    = {Grafakos, Loukas},
  title     = {Modern Fourier Analysis},
  edition   = {3},
  publisher = {Springer},
  year      = {2014}
}