Riesz and Bessel Potentials and the Hardy-Littlewood-Sobolev Inequality
Anchor (Master): Stein 1970 *Singular Integrals* (Princeton) Ch. V; Lieb-Loss 2001 *Analysis* 2e (AMS GSM 14) §4.3; Grafakos 2014 *Modern Fourier Analysis* 3e (Springer) §1.2-1.3
Intuition Beginner
Imagine a sharp signal — a spike of charge, a burst of heat, a jagged density. Differentiating sharpens it further; integrating smooths it out. The Riesz potential is a knob that lets you smooth by a fractional amount: you can integrate "one and a half times" instead of only once or twice. The number you turn the knob to is a smoothing order, written as a positive amount . Turn it a little and you round off the sharpest corners; turn it more and the signal spreads into a gentle hill. This single adjustable operation interpolates continuously between leaving a function alone and integrating it many whole times.
Why bother with fractional smoothing? Because the right amount of smoothing is often not a whole number. A function might be just barely too rough to be continuous, and the honest description of how rough it is uses a fraction. The Riesz potential gives you the vocabulary for "this function is exactly one-half of a derivative away from being nice," which whole-number derivatives cannot express.
The headline fact is a trade you get for free: smoothing by amount does not just make a function look nicer, it makes it live in a better space of functions. If you measure a function's size by how its values pile up — its averaged -th power — then after smoothing by you can measure with a more demanding exponent and still get a finite answer, with the new exponent tied to the old one by a clean bookkeeping rule. That guaranteed upgrade is the Hardy-Littlewood-Sobolev inequality, and it is the engine behind why solutions of equations are smoother than the data that produced them.
The one-sentence takeaway: the Riesz potential smooths a function by an adjustable fractional amount, and the price you pay in spread is repaid by a guaranteed improvement in how the function's size is measured — that guaranteed improvement is the Hardy-Littlewood-Sobolev inequality.
Visual Beginner
Picture a tall narrow spike sitting at the origin of a line. Apply a small amount of smoothing: the spike becomes a rounded bump, lower and wider. Apply more smoothing: the bump flattens into a broad gentle hill that decays slowly out to the sides. The amount of smoothing is the dial ; larger means more spreading. The width of the spread is governed by a kernel that decays like a power of the distance — close to the source it is large, far away it tapers off at a rate set by the dimension and by .
The companion picture is the bookkeeping rule. Draw a dial labelled with the input measuring-exponent on the left and the output measuring-exponent on the right; turning the smoothing knob to amount slides the output exponent up to a value determined by the equation that subtracts a fixed fraction from the reciprocal of . The Bessel potential is the same picture with the long power-law tails clipped to decay rapidly, which is the version that matches the everyday Sobolev spaces.
Worked example Beginner
We work out the smoothing-exponent bookkeeping in a concrete case so the rule is no longer abstract.
Step 1. Work in ordinary three-dimensional space, so the dimension is . Choose to smooth by the amount . We start with a function whose size is measured with the input exponent , meaning we look at the average of its values raised to the power .
Step 2. The bookkeeping rule says the reciprocal of the output exponent equals the reciprocal of the input exponent minus the smoothing amount divided by the dimension. In numbers: one over equals one over minus divided by .
Step 3. Compute the right side. One over is . The smoothing amount over the dimension is . Subtract: . So one over equals .
Step 4. Invert to find the output exponent: . The smoothing by one unit raised the measuring exponent from all the way to .
What this tells us: starting from a function controlled only in the weak averaged sense, one unit of fractional smoothing produces a function controlled in the much more demanding sense — a genuine upgrade in regularity bought purely by the smoothing operation. This is the exact arithmetic behind the Sobolev embedding: a function with one derivative in the sense automatically sits in the space of functions with finite cube-average, with no further hypotheses.
Check your understanding Beginner
Formal definition Intermediate+
Throughout, is the Fourier transform with the unitary normalisation, is Lebesgue measure, and . The Hardy-Littlewood maximal operator and its weak-type and strong-type bounds are taken from 02.19.01; the Sobolev space from 24.01.01; the Newtonian-potential prototype from 02.13.02.
Definition (Riesz potential). For the Riesz potential of order is the convolution operator
The kernel is locally integrable precisely because , and the normalisation is fixed so that on the Fourier side
i.e. as a Fourier multiplier. The Newtonian potential of 02.13.02 is the case , .
Definition (Bessel potential). For the Bessel potential of order is the operator defined by the Fourier multiplier Its convolution kernel , the Bessel kernel, is a positive integrable function: with , smooth away from the origin, with the same singularity as the Riesz kernel near but with exponential rather than power decay at infinity. Concretely , and the multiplier expands as at high frequency, so and differ by an operator bounded on every .
Definition (Bessel-potential space). For and the Bessel-potential space is Because is injective (its multiplier never vanishes), every has a unique , and the norm is well defined.
Definition (weak-type and the scaling line). A sublinear operator is of weak type if for all . The Sobolev scaling line for is the relation forced by dimensional analysis: under the dilation one has , and matching the norms of both sides leaves this single admissible relation.
Counterexamples to common slips Intermediate+
The Hardy-Littlewood-Sobolev inequality fails at . The scaling line at gives , but is not bounded ; only the weak-type estimate holds. Testing on an approximate identity concentrating at the origin shows the strong bound must fail, exactly as the Newtonian potential fails strong type at .
The inequality fails at the upper endpoint . There , i.e. , and does not map into : the Riesz potential of an function can be unbounded (a logarithmic blow-up). The correct endpoint statement is , the space of functions of bounded mean oscillation.
Riesz potentials do not preserve . For the operator is never bounded from to itself; it always changes the exponent. The Bessel potential does map to (its kernel is integrable), which is the structural reason , not , is the right tool for defining the function spaces .
The semigroup law has a range restriction. The composition law holds only while , because is undefined once the kernel exponent becomes non-positive. The Bessel potentials carry no such restriction: for all , since their multipliers multiply cleanly.
Key theorem with proof Intermediate+
Theorem (Hardy-Littlewood-Sobolev inequality; Hardy-Littlewood 1928 Math. Z. 27, Sobolev 1938 Mat. Sb. 4). Let and let satisfy the scaling relation . Then there is a constant with
Proof. Normalise ; by homogeneity it suffices to show . The argument splits the kernel at a radius to be optimised pointwise, bounds the near part by the maximal function and the far part by Hölder, and then runs Marcinkiewicz interpolation. Write, for fixed and any ,
Step 1 (near part is controlled by the maximal function). Decompose the near ball into dyadic annuli , . On the -th annulus , so
where and is the centred maximal function of 02.19.01. The -th term is , and summing the geometric series in gives
Step 2 (far part is controlled by Hölder). Let be the conjugate exponent, . By Hölder, The radial integral converges at infinity exactly when , i.e. when , equivalently — precisely the hypothesis . Computing in polar coordinates, Raising to the power and using , where and .
Step 3 (optimise the splitting radius). Combining Steps 1 and 2, The two terms balance when , i.e. at (any with ; where the function vanishes near and ). Substituting this , both terms become a constant multiple of , so using .
Step 4 (Marcinkiewicz / direct integration). Raise to the -th power and integrate:
where is the strong-type bound for from 02.19.01 (valid since ), and . Taking -th roots gives , the claim. (The pointwise bound together with the maximal theorem is the Hedberg form of the argument; it is the Marcinkiewicz interpolation between the weak endpoints made explicit.)
Bridge. The Hardy-Littlewood-Sobolev inequality builds toward the full Sobolev embedding theorem and appears again in the Calderón-Zygmund regularity theory of 02.13.02, where the second derivatives of the Newtonian potential are controlled in by exactly the same kind of singular-kernel estimate. The central insight is that fractional integration is tamed by the maximal function: the pointwise bound converts a global convolution estimate into a local averaging estimate, and this is exactly the foundational reason the proof needs no Fourier analysis at all. The bridge is that the maximal theorem of 02.19.01 and the Sobolev embedding of 24.01.01 are two faces of one inequality: putting these together, the weak-type bound for supplies, via the splitting-radius optimisation, the strong-type bound for , which generalises the Newtonian-potential mapping to every fractional order.
Exercises Intermediate+
Advanced results Master
Theorem 1 (strong-type HLS and its proof structure; Hardy-Littlewood 1928, Sobolev 1938). For and on the scaling line , the Riesz potential is bounded. The proof reduces to the Hedberg pointwise inequality and the maximal theorem of 02.19.01; no Fourier analysis or singular-integral machinery is required, which is the distinctive economy of the maximal-function route [Stein 1970].
Theorem 2 (sharp constant and extremals; Lieb 1983). The optimal constant in the HLS inequality in its bilinear form is attained, and in the conformally invariant diagonal case the extremals are exactly the translates, dilates, and scalar multiples of . Lieb computed the constant by symmetric-decreasing rearrangement, identifying the extremal profiles as the stereographic images of constants on the sphere [Lieb 1983]. The diagonal sharp HLS is dual to the sharp Sobolev inequality of Talenti and Aubin and underlies the Yamabe problem in conformal geometry.
Theorem 3 (Bessel-potential spaces equal Sobolev spaces; Calderón 1961, Aronszajn-Smith 1961). For and every non-negative integer , and for non-integer the Bessel-potential space is the natural fractional-order interpolant. The norm equivalence is the content of the Calderón-Zygmund theory applied to the Riesz-transform symbols , which are -bounded Fourier multipliers for [Calderon 1961] [Aronszajn-Smith 1961]. At the identification is exact by Plancherel: .
Theorem 4 (the borderline endpoint ). At the upper endpoint , the scaling line gives but fails to map into . The correct statement is that maps continuously into the space of functions of bounded mean oscillation:
This is the harmonic-analysis shadow of the Trudinger-Moser borderline of Sobolev embedding: at the critical exponent the embedding into fails, and the honest target is (or, for the exponential-integrability refinement, the Orlicz space ). The full John-Nirenberg theory of — that bounded mean oscillation forces exponential-tail distributional decay — is developed in 02.20.01 [Stein 1970].
Theorem 5 (Riesz transforms and the factoring of ). The Riesz transforms , the singular-integral operators with Fourier multipliers , factor the relation between fractional integration and differentiation. One has on the Fourier side, hence for , so a derivative of a Riesz potential is a Riesz transform of a Riesz potential of one lower order. Because each is bounded on every () by the Calderón-Zygmund theorem, the Sobolev mapping properties of transfer to all of its derivatives, which is the mechanism behind the regularity gain " orders of smoothness" being uniform across derivatives [Stein 1970].
Theorem 6 (the semigroup, fractional Laplacian, and stable processes). The Riesz potentials form the negative-power semigroup of the Laplacian: with for . Their inverses are the fractional Laplacians , the generators of -stable Lévy processes for , with the pointwise singular-integral representation . This identifies the analytic theory of Riesz potentials with the probabilistic theory of jump processes and with the modern theory of nonlocal elliptic equations [Riesz 1949].
Synthesis. The Riesz potential is the foundational reason that the gain of regularity in elliptic and Sobolev theory is fractional and scale-invariant rather than tied to whole derivatives, and this is exactly the structural fact that unifies the maximal-function, singular-integral, and Sobolev-embedding strands of the chapter. The central insight is the Hedberg dictionary: fractional integration is dominated pointwise by a power of the maximal function, so the entire theory of is dual to the weak-type theory of , and putting these together the Sobolev embedding of 24.01.01 is revealed as the case of one inequality whose engine is the covering lemma of 02.19.01.
The pattern generalises in three directions that recur throughout harmonic analysis: vertically, from the Riesz potential to the Bessel potential, where clipping the power-law tail to exponential decay produces the -bounded operator that identifies with and is dual to the Calderón-Zygmund multiplier theory; horizontally, from the strong-type interior of the scaling line to its two endpoints, where the lower endpoint degrades to weak type and the upper endpoint degrades to , the bridge to the John-Nirenberg theory of 02.20.01; and structurally, from negative powers to positive powers , the fractional Laplacians that generalise the local operator of 02.13.02 to the nonlocal generators of stable processes. The central insight binding all three is that inverts a fractional power of the Laplacian by convolution against a homogeneous kernel, exactly as the Newtonian potential inverts , and the Sobolev-scaling line is the unique exponent relation this homogeneity permits.
Full proof set Master
Proposition 1 (local integrability and well-definedness of on ). For and , the integral defining converges absolutely for almost every and every .
Proof. Split the kernel at radius : . The near piece lies in since , so its convolution with is in by Young, finite a.e. The far piece lies in exactly when , i.e. , since then ; its convolution with is therefore bounded by pointwise by Hölder, finite everywhere. The sum converges a.e.
Proposition 2 (Fourier-multiplier identity for ). On the Schwartz class, , and this fixes the constant .
Proof. The homogeneous distribution has Fourier transform a constant multiple of , by the Riesz formula for the Fourier transform of homogeneous functions: for , . Set , so and the prefactor is . Dividing the kernel by converts this prefactor into , giving . Since convolution becomes multiplication under , .
Proposition 3 (Hedberg pointwise inequality). For and , .
Proof. This is Exercise 4: split at radius , bound the near part by via dyadic annuli against , bound the far part by via Hölder, and optimise to balance the two. The balanced value is .
Proposition 4 (strong-type HLS from Hedberg and the maximal theorem). For and on the scaling line, .
Proof. Raise the Hedberg bound to the -th power: . The exponent since . Integrate:
The total power of is . Taking -th roots gives . The hypothesis is used precisely to invoke the strong-type maximal theorem of 02.19.01.
Proposition 5 (Bessel kernel is a positive probability density). The Bessel kernel satisfies , , .
Proof. Use the subordination formula obtained by writing the multiplier as a Laplace transform: for , Apply with and take inverse Fourier transforms, using (the heat kernel at time ): Every factor in the integrand is positive, so . Integrating in and using gives . Hence is an probability density, and is a contraction on every .
Proposition 6 ( identification ). For every , with equal norms, where .
Proof. By definition means , i.e. by Proposition 2's multiplier description applied to and Plancherel. That is exactly the defining condition of , and the norms coincide:
For integer this equals after expanding the binomial and matching each term to via Plancherel, recovering 24.01.01's definition at .
Connections Master
Hardy-Littlewood maximal function and Vitali covering
02.19.01. The direct engine of the proof. The Hedberg pointwise bound reduces the entire mapping theory of the Riesz potential to the strong-type maximal inequality, and the weak-type bound supplies the endpoint. The covering-lemma economy of that unit is exactly why HLS needs no Fourier analysis, and the dyadic-annulus splitting in Step 1 of the Key Theorem is the same scale-decomposition used there.spaces, Hölder, Minkowski, Marcinkiewicz interpolation
02.07.06. The function-space substrate. Hölder's inequality bounds the far part of the kernel split; the conjugate-exponent arithmetic determines the convergence threshold ; and Marcinkiewicz interpolation between the weak endpoints is the abstract form of the Hedberg balancing argument. Young's inequality from this unit gives the -boundedness of the Bessel potential.Sobolev spaces and
24.01.01. The destination. The Bessel-potential space is identified with , giving an intrinsic, Fourier-analytic definition of fractional Sobolev regularity; and the Sobolev embedding is re-derived here as the case of HLS applied to the gradient, complementing that unit's direct Gagliardo-Nirenberg-Sobolev proof.Poisson equation, fundamental solution, Newtonian potential
02.13.02. The prototype and the special case. The Newtonian potential is , so HLS upgrades the Newtonian-potential mapping from a special exponent to the whole scaling line, and the fractional Laplacians generalise that unit's local operator to nonlocal generators of stable processes.John-Nirenberg inequality and BMO [forward: 02.20.01]. The endpoint successor. The upper-endpoint statement derived here as a closing preview is completed there, where the John-Nirenberg theorem shows bounded mean oscillation forces exponential distributional decay, the borderline refinement of the Sobolev embedding into .
Historical & philosophical context Master
The fractional integral traces to the Riemann-Liouville fractional calculus of the nineteenth century, but its modern multidimensional theory begins with Marcel Riesz, who in his 1949 Acta Mathematica memoir L'intégrale de Riemann-Liouville et le problème de Cauchy [Riesz 1949] introduced the -dimensional Riesz potentials with the normalisation that makes them a convolution semigroup and identifies them as negative powers of the Laplacian. The boundedness theorem that bears the Hardy-Littlewood-Sobolev name was assembled in two stages: Godfrey Harold Hardy and John Edensor Littlewood proved the one-dimensional fractional-integration inequality in their 1928 Mathematische Zeitschrift paper Some properties of fractional integrals. I [Hardy-Littlewood 1928], and Sergei Lvovich Sobolev extended it to in his 1938 Matematicheskii Sbornik paper On a theorem of functional analysis [Sobolev 1938], in the course of establishing the embedding theorems that now carry his name.
The identification of the Bessel-potential spaces with the Sobolev spaces is due to Alberto Calderón, whose 1961 Proceedings of Symposia in Pure Mathematics article Lebesgue spaces of differentiable functions and distributions [Calderon 1961] used the Calderón-Zygmund multiplier theory to prove the norm equivalence for all , with the systematic theory of Bessel potentials developed in parallel by Nachman Aronszajn and Kennard Smith in their 1961 Annales de l'Institut Fourier paper [Aronszajn-Smith 1961]. The maximal-function proof of HLS presented here, which dispenses with Fourier analysis entirely, is the route taken by Elias Stein in his 1970 monograph [Stein 1970], where the Riesz and Bessel potentials open the chapter that culminates in the Sobolev embedding theorems. The sharp constants and the conformal extremals were found by Elliott Lieb in his 1983 Annals of Mathematics paper [Lieb 1983], connecting the inequality to the Yamabe problem and to symmetric-decreasing rearrangement.
Bibliography Master
@article{Riesz1949,
author = {Riesz, Marcel},
title = {L'int\'egrale de Riemann-Liouville et le probl\`eme de Cauchy},
journal = {Acta Mathematica},
volume = {81},
year = {1949},
pages = {1--223}
}
@article{HardyLittlewood1928,
author = {Hardy, G. H. and Littlewood, J. E.},
title = {Some properties of fractional integrals. I},
journal = {Mathematische Zeitschrift},
volume = {27},
year = {1928},
pages = {565--606}
}
@article{Sobolev1938,
author = {Sobolev, S. L.},
title = {On a theorem of functional analysis},
journal = {Matematicheskii Sbornik (N.S.)},
volume = {4},
year = {1938},
pages = {471--497}
}
@book{Stein1970,
author = {Stein, Elias M.},
title = {Singular Integrals and Differentiability Properties of Functions},
publisher = {Princeton University Press},
year = {1970}
}
@article{Calderon1961,
author = {Calder\'on, Alberto P.},
title = {Lebesgue spaces of differentiable functions and distributions},
journal = {Proceedings of Symposia in Pure Mathematics},
volume = {IV},
year = {1961},
pages = {33--49}
}
@article{AronszajnSmith1961,
author = {Aronszajn, Nachman and Smith, Kennard T.},
title = {Theory of Bessel potentials. I},
journal = {Annales de l'Institut Fourier},
volume = {11},
year = {1961},
pages = {385--475}
}
@article{Lieb1983,
author = {Lieb, Elliott H.},
title = {Sharp constants in the Hardy-Littlewood-Sobolev and related inequalities},
journal = {Annals of Mathematics},
volume = {118},
year = {1983},
pages = {349--374}
}
@book{LiebLoss2001,
author = {Lieb, Elliott H. and Loss, Michael},
title = {Analysis},
edition = {2},
publisher = {American Mathematical Society},
year = {2001}
}