02.07.08 · analysis / measure-theory

Absolute Continuity and the Radon-Nikodym Theorem

shipped3 tiersLean: partial

Anchor (Master): Bogachev, Measure Theory Vol. 1 §3.2; Rudin, Real and Complex Analysis 3e §6-7 (von Neumann Hilbert-space proof)

Intuition Beginner

A measure tells you how much "stuff" sits in each region of a space: Lebesgue measure on the real line tells you the length of an interval, counting measure tells you how many points it contains, and a probability measure tells you the chance of an outcome falling in that region. Suppose you have two measures on the same space and you ask: can the second measure be written as the first measure weighted by some density function? If the answer is yes, that density is what the Radon-Nikodym theorem produces, and the equation it satisfies is the foundational identity behind probability densities, mass distributions, and conditional expectation.

The picture to keep in mind is mass spread along a wire. The "background" measure is uniform length along the wire, and the "target" measure assigns each piece of the wire a different mass depending on how thick the wire is at that point. The local thickness is the density of mass relative to length, and it lets you recover the mass of any segment by integrating the thickness over the length of that segment. The Radon-Nikodym derivative is the abstract version of this thickness function: it is the density of one measure with respect to another.

The condition that makes the procedure work is called absolute continuity. The target measure must not place any mass on a region that the background measure says has length zero. If the target ever puts mass on a length-zero set, no density function defined pointwise can capture that mass — the integral of any density over a length-zero set is zero. So the assignment of a density is possible exactly when the target is absolutely continuous with respect to the background.

When the target measure does place mass on length-zero regions, the Lebesgue decomposition theorem says you can still split the target into two parts: an absolutely continuous part that has a density, plus a singular part concentrated entirely on length-zero pieces. The classic example is a probability distribution that has both a continuous bell curve and a delta spike at one specific point: the bell curve part has a density, and the delta spike is the singular part.

The one-sentence takeaway: when a target measure assigns no mass to background-null regions, the Radon-Nikodym theorem says the target can be written as the background measure weighted by a unique density function, and this density is the abstract version of a probability density or mass distribution.

Visual Beginner

Imagine two ways of measuring the same wire. The first method assigns each piece its length (the background). The second method weighs each piece of the wire to get its mass (the target). If the wire is thicker in some places and thinner in others, the mass per unit length varies along the wire.

Top: length measure on the wire is uniform. Bottom: mass measure depends on local thickness, captured by the density curve. The mass of any segment is the area under the density curve over that segment.

Mathematically, the mass of a segment equals the integral of the density (which is the Radon-Nikodym derivative of mass with respect to length) over that segment. The condition for this picture to be possible is absolute continuity: no piece of zero length carries any mass.

Worked example Beginner

We compute a Radon-Nikodym density between two probability measures and confirm it acts as a density.

Step 1. The setup. Let the background measure be the standard normal distribution on the real line, with density $φ (x) = e^{- x^{2} /2} / 2 π$ against Lebesgue measure. Let the target measure be a different normal distribution with mean $1$ and the same variance, with density $ψ (x) = e^{- (x - 1)^{2} /2} / 2 π$ against Lebesgue measure.

Step 2. The density of target against background. Since both measures have densities against Lebesgue measure, the density of the target against the background is the ratio of the two density functions: $ψ (x) / φ (x) = e^{- (x - 1)^{2} /2} / e^{- x^{2} /2} = e^{- (x - 1)^{2} /2 + x^{2} /2} = e^{x - 1/2}$ .

Step 3. Verify by integrating an indicator. Consider the interval $[0, 1]$ . The mass the background gives to this interval is the area under the standard-normal bell curve from $0$ to $1$ , roughly $0.341$ . The mass the target gives is the area under the shifted bell curve from $0$ to $1$ , roughly $0.341$ as well (by translation symmetry).

According to the Radon-Nikodym formula, the target mass should equal the area under the product of the density of target against background times the background density, taken over the interval: the area under $e^{x - 1/2} φ (x)$ from $0$ to $1$ . Plugging in gives the area under $e^{x - 1/2} \cdot e^{- x^{2} /2} / 2 π$ from $0$ to $1$ . The exponent simplifies via $x - 1/2 - x^{2} /2 = - (x - 1)^{2} /2$ , so the integrand becomes $e^{- (x - 1)^{2} /2} / 2 π = ψ (x)$ , and the resulting area is indeed the target mass.

Step 4. The probability picture. The function $e^{x - 1/2}$ is what statisticians call the likelihood ratio of the shifted normal against the standard normal. It tells you how much more likely each outcome is under the target than under the background. Importance sampling, hypothesis testing, and the change-of-measure technique in stochastic analysis all build on this Radon-Nikodym density.

Step 5. A singular example. Now consider a probability measure that is a half-and-half mix of the standard normal and a point mass at $x = 5$ : the measure that puts half its mass on the continuous bell curve and half on the single point ${5}$ . There is no density function with respect to Lebesgue measure that captures the point-mass half, because any function integrated over the single point ${5}$ gives zero — but the point mass assigns probability $1/2$ to that point. The Lebesgue decomposition splits this mixed measure into its absolutely continuous half (the bell curve, with density $φ (x) /2$ ) plus its singular half (the point mass concentrated on ${5}$ ).

What this tells us: when the target measure has no mass on background-null sets, the Radon-Nikodym derivative exists as an integrable function. When it does have such mass, the Lebesgue decomposition separates the well-behaved density-having part from the singular part that resists density representation.

Check your understanding Beginner

Exercise (easy, multiple choice).

Which condition guarantees that the target measure $ν$ can be written as a density times the background measure $μ$ ?

A. The target $ν$ is a probability measure B. The target $ν$ assigns zero mass to every set of $μ$ -measure zero C. The two measures $ν$ and $μ$ are equal D. The background $μ$ is Lebesgue measure

Hint

Recall the definition of absolute continuity and the statement of Radon-Nikodym.

Answer

B. The target assigns zero mass to every $μ$ -null set. This is the definition of absolute continuity of $ν$ with respect to $μ$ , written $ν ≪ μ$ . The Radon-Nikodym theorem says that under absolute continuity (plus $σ$ -finiteness of $μ$ ), there is a non-negative measurable density function $d ν / d μ$ such that the $ν$ -mass of every measurable set $E$ equals the area under the density curve over $E$ measured against $μ$ . Option A is unrelated; option C is too restrictive (and gives the constant density $1$ ); option D is too restrictive (any $σ$ -finite background works).

Formal definition Intermediate+

Let $(X, M)$ be a measurable space and let $μ, ν$ be measures on $M$ .

Definition (absolute continuity). The measure $ν$ is absolutely continuous with respect to $μ$ , written $ν ≪ μ$ , if every $μ$ -null measurable set is also $ν$ -null: $μ (E) = 0$ implies $ν (E) = 0$ for every $E \in M$ .

Definition (mutual singularity). The measures $μ$ and $ν$ are mutually singular, written $μ ⊥ ν$ , if there exists a measurable set $A \in M$ with $μ (A) = 0$ and $ν (A^{c}) = 0$ . Equivalently, $μ$ and $ν$ are concentrated on disjoint measurable sets.

Definition (signed measure). A signed measure on $(X, M)$ is a countably additive set-function $λ : M \to [- \infty, \infty]$ taking at most one of the values $+ \infty, - \infty$ , with $λ (\emptyset) = 0$ . The positive part $λ^{+}$ and negative part $λ^{-}$ are non-negative measures arising from the Hahn-Jordan decomposition (Theorem 1 below), and $λ = λ^{+} - λ^{-}$ .

Definition (Radon-Nikodym derivative). If $ν ≪ μ$ and both measures are $σ$ -finite, then there is a non-negative $M$ -measurable function $f : X \to [0, \infty)$ , unique up to $μ$ -almost everywhere equality, such that $ν (E) = \int_{E} f d μ for every E \in M .$ The function $f$ is the Radon-Nikodym derivative of $ν$ with respect to $μ$ , denoted $d ν / d μ$ . The defining identity rewrites as $d ν = f d μ$ in differential notation.

Definition (Lebesgue decomposition). For two $σ$ -finite measures $μ, ν$ on $M$ , there exist unique measures $ν_{a}, ν_{s}$ on $M$ with $ν = ν_{a} + ν_{s}$ , $ν_{a} ≪ μ$ , and $ν_{s} ⊥ μ$ . The pair $(ν_{a}, ν_{s})$ is the Lebesgue decomposition of $ν$ relative to $μ$ .

Counterexamples to common slips Intermediate+

Absolute continuity does not require equivalence. The relation $ν ≪ μ$ is one-directional: $ν$ may have density against $μ$ without $μ$ having density against $ν$ . For example, Lebesgue measure on $[0, 1]$ is absolutely continuous with respect to Lebesgue measure on $[0, 2]$ , but not conversely (the half $[1, 2]$ has positive mass on $[0, 2]$ but is outside the support of the first measure).
Without $σ$ -finiteness the Radon-Nikodym derivative may not exist. On the real line, let $μ$ be counting measure (which is not $σ$ -finite on $R$ ) and let $ν$ be Lebesgue measure. Every counting-null set is the empty set (counting measure of any non-empty set is at least $1$ ), so vacuously $ν ≪ μ$ . But there is no Borel-measurable function $f$ with $ν (E) = \int_{E} f d μ$ for every Borel $E$ : such a function would need $ν ({x}) = f (x) \cdot 1$ , forcing $f \equiv 0$ , but then $\int_{E} f d μ = 0 \neq = ν (E)$ for any $E$ of positive Lebesgue measure. The $σ$ -finiteness of $μ$ is load-bearing.
Mutual singularity is not the negation of absolute continuity. Two measures can fail to be absolutely continuous without being mutually singular. The example: on the real line, take $μ =$ Lebesgue measure on $[0, 2]$ and $ν =$ Lebesgue measure on $[1, 3]$ . Neither is $≪$ the other (each has support outside the other's), but they are not mutually singular either: they overlap on $[1, 2]$ where both are non-zero. The Lebesgue decomposition of $ν$ with respect to $μ$ here is $ν = (ν ∣_{[1, 2]}) + (ν ∣_{[2, 3]})$ , with the first piece $≪ μ$ and the second piece $⊥ μ$ .
The Radon-Nikodym derivative is a function, not a number. Common slip: writing $d ν / d μ = c$ for a constant $c$ when in fact the derivative depends on $x$ . The constant case happens precisely when $ν = c μ$ , that is when $ν$ is a scalar multiple of $μ$ . For typical pairs of measures the derivative varies pointwise.
Uniqueness is only $μ$ -a.e., not pointwise. Two functions $f, g$ that agree $μ$ -almost everywhere give the same Radon-Nikodym density. Any pointwise modification on a $μ$ -null set leaves the integral unchanged. So the Radon-Nikodym derivative is an equivalence class in $L_{loc}^{1} (μ)$ (or $L^{1} (μ)$ when $ν$ is finite), not a specific function.

Key theorem with proof Intermediate+

Theorem (Lebesgue-Radon-Nikodym; Radon 1913 Sitzungsber. Wien 122, 1295; Nikodym 1930 Fund. Math. 15, 131). Let $(X, M)$ be a measurable space and let $μ, ν$ be $σ$ -finite measures on $M$ . There exist unique $σ$ -finite measures $ν_{a}, ν_{s}$ on $M$ with:

(a) $ν = ν_{a} + ν_{s}$ ;

(b) $ν_{a} ≪ μ$ ;

Moreover, there exists a non-negative $M$ -measurable function $f : X \to [0, \infty)$ , unique up to $μ$ -a.e. equality, such that $ν_{a} (E) = \int_{E} f d μ for every E \in M .$ The function $f = d ν_{a} / d μ$ is the Radon-Nikodym derivative of $ν_{a}$ with respect to $μ$ .

Proof (von Neumann's Hilbert-space argument; von Neumann 1940 Bull. AMS 46, 376). We first treat the case where both $μ$ and $ν$ are finite; the $σ$ -finite case follows by a standard exhaustion.

Step 1 (the Hilbert space). Let $λ = μ + ν$ . Both $μ$ and $ν$ are dominated by $λ$ (each is $\leq λ$ ), and $λ$ is a finite measure since $μ, ν$ are finite. Form the Hilbert space $L^{2} (λ) = L^{2} (X, M, λ)$ with inner product $⟨ u, v ⟩_{λ} = \int u \overset{v}{ˉ} d λ$ . The space $L^{2} (λ)$ is a Hilbert space by Riesz-Fischer completeness 02.07.06.

Step 2 (a bounded linear functional). Define $T : L^{2} (λ) \to R$ by $T (u) = \int u d ν$ . The functional $T$ is well-defined and bounded: by the Cauchy-Schwarz inequality applied to $∣ T (u) ∣ = ∣ \int u \cdot 1 d ν ∣ \leq \int ∣ u ∣ d ν \leq \int ∣ u ∣ d λ \leq ∥ u ∥_{L^{2} (λ)} \cdot ∥1 ∥_{L^{2} (λ)} = ∥ u ∥_{L^{2} (λ)} \cdot λ (X)$ , using $ν \leq λ$ in the second inequality and Cauchy-Schwarz with the constant $1 \in L^{2} (λ)$ in the third. So $∥ T ∥_{L^{2} (λ)^{*}} \leq λ (X)$ .

Step 3 (Riesz representation). By the Riesz representation theorem for Hilbert spaces, there exists $g \in L^{2} (λ)$ with $T (u) = \int u d ν = \int ug d λ$ for every $u \in L^{2} (λ)$ . The function $g$ is uniquely determined as the orthogonal projection of $1$ onto the relevant subspace, and $0 \leq g \leq 1$ $λ$ -a.e. (the upper and lower bounds are forced by choosing $u = χ_{{g > 1}}$ and $u = χ_{{g < 0}}$ as test functions and observing the integral identities).

Step 4 (the decomposition sets). Define $A = {x \in X : g (x) < 1}$ and $B = {x \in X : g (x) = 1}$ . Both sets are measurable since $g$ is. The set $B$ is the support of the singular part: on $B$ , the identity $\int u d ν = \int ug d λ = \int u d λ$ for $u$ supported on $B$ rearranges (using $λ = μ + ν$ ) to $\int u d μ = 0$ for every such $u$ , hence $μ (B) = 0$ .

Step 5 (the absolutely continuous part). Define $ν_{a} (E) = ν (E \cap A)$ and $ν_{s} (E) = ν (E \cap B)$ . Then $ν = ν_{a} + ν_{s}$ . The singular part $ν_{s}$ is concentrated on $B$ , where $μ (B) = 0$ , so $ν_{s} ⊥ μ$ .

For the absolutely continuous part: on $A$ , $g < 1$ , so $1 - g > 0$ . Setting $u = χ_{E} \cdot (1 - g)^{- 1}$ for $E \subseteq A$ measurable (after verifying integrability), the Riesz identity gives $\int_{E} (1 - g)^{- 1} d ν = \int_{E} (1 - g)^{- 1} g d λ$ . Using $λ = μ + ν$ , the right-hand side splits as $\int_{E} (1 - g)^{- 1} g d μ + \int_{E} (1 - g)^{- 1} g d ν$ , and rearranging gives $\int_{E} (1 - g)^{- 1} (1 - g) d ν = \int_{E} (g / (1 - g)) d μ$ , i.e., $ν (E) = \int_{E} f d μ$ where $f = g / (1 - g)$ on $A$ (and $f = 0$ on $B$ ). So $ν_{a} (E) = ν (E \cap A) = \int_{E \cap A} f d μ = \int_{E} f d μ$ (the integral over $B$ vanishes since $μ (B) = 0$ ), establishing $ν_{a} = f d μ$ with $f = d ν_{a} / d μ = g / (1 - g)$ on $A$ and $f = 0$ on $B$ .

Step 6 ( $σ$ -finite case). Decompose $X = ⋃_{n} X_{n}$ with $μ (X_{n}), ν (X_{n}) < \infty$ and $X_{n}$ disjoint. Apply the finite-measure result to each $X_{n}$ to get densities $f_{n}$ on $X_{n}$ . Patch together: define $f (x) = f_{n} (x)$ for $x \in X_{n}$ . The patched function is $M$ -measurable and gives the global Radon-Nikodym density.

Step 7 (uniqueness). Suppose $ν_{a} = f d μ = f^{'} d μ$ both hold. Then $\int_{E} (f - f^{'}) d μ = 0$ for every $E$ , which forces $f = f^{'}$ $μ$ -almost everywhere by the standard argument (apply to $E = {f > f^{'}}$ and $E = {f < f^{'}}$ to deduce $μ ({f \neq = f^{'}}) = 0$ ).

$□$

Bridge. The von Neumann Hilbert-space proof of Lebesgue-Radon-Nikodym is one of the cleanest applications of the Riesz representation theorem on $L^{2}$ , replacing the older measure-theoretic arguments (Radon 1913, Nikodym 1930, Hahn 1921) with a functional-analytic argument that pivots on a single bounded linear functional and its orthogonal projection. The resulting density $f = d ν_{a} / d μ$ is the foundational object behind every density representation in probability theory: probability density functions against Lebesgue measure, the likelihood ratio between two probability measures, the conditional expectation $E [X ∣ G]$ defined as the $G$ -measurable Radon-Nikodym derivative of $X d P$ with respect to $P ∣_{G}$ , and the change-of-measure formula in Girsanov's theorem for Brownian motion under a drifted measure. Each of these probabilistic constructions is a special case of the abstract Radon-Nikodym derivative, with the underlying $σ$ -finite Hilbert-space argument running invisibly behind the explicit formulas. Beyond probability, Radon-Nikodym is the load-bearing tool in the duality $(L^{p})^{*} ≅ L^{q}$ for $1 \leq p < \infty$ on $σ$ -finite measure spaces, the foundation of $L^{p}$ functional analysis; in the construction of conditional expectation as the orthogonal projection onto an $L^{2}$ -subspace, the foundation of martingale theory; and in the chain rule $d ν / d μ = (d ν / d λ) / (d μ / d λ)$ for nested density representations, the foundation of pushforward measures in differential geometry and information theory.

Exercises Intermediate+

Exercise 2 (easy, true-false).

If $ν ≪ μ$ and $μ ≪ ν$ (the two measures are mutually absolutely continuous), then $d ν / d μ \cdot d μ / d ν = 1$ almost everywhere.

Hint

Apply the Radon-Nikodym identity $ν (E) = \int_{E} (d ν / d μ) d μ$ and then $μ (E) = \int_{E} (d μ / d ν) d ν$ , and chain-rule.

Answer

True. Under mutual absolute continuity, both derivatives exist as positive-almost-everywhere functions (positivity is forced by the symmetry of the relation). The chain rule for Radon-Nikodym derivatives gives $d ν / d ν = (d ν / d μ) \cdot (d μ / d ν)$ , and the left-hand side is identically $1$ . So the product is $1$ $ν$ -almost everywhere (equivalently $μ$ -almost everywhere by mutual absolute continuity). This is the foundational identity behind the change-of-variables formula in measure theory and the symmetry of the Kullback-Leibler divergence's two-sided log-likelihood-ratio integrand.

Exercise 3 (medium, symbolic).

Compute the Radon-Nikodym derivative of a normal distribution with mean $θ$ and variance $1$ against the standard normal (mean $0$ , variance $1$ ) on the real line.

Hint

Both measures have densities against Lebesgue measure; the derivative of one against the other is the ratio of these densities.

Answer

The density of the shifted normal against Lebesgue is $ψ (x) = e^{- (x - θ)^{2} /2} / 2 π$ ; the density of the standard normal is $φ (x) = e^{- x^{2} /2} / 2 π$ . The ratio: $\frac{d ψ}{d φ} (x) = \frac{ψ ( x )}{φ ( x )} = e^{- (x - θ)^{2} /2 + x^{2} /2} = e^{θ x - θ^{2} /2} .$ This is the likelihood ratio of the shifted-mean hypothesis against the null hypothesis $θ = 0$ , foundational in statistical hypothesis testing (Neyman-Pearson lemma) and in importance sampling: if you want to sample from the shifted distribution but you can only sample from the standard normal, you sample from the standard normal and reweight each sample by $e^{θ x - θ^{2} /2}$ .

Exercise 4 (medium, symbolic).

State and prove the chain rule for Radon-Nikodym derivatives: if $λ ≪ μ ≪ ν$ and all three are $σ$ -finite, then $d λ / d ν = (d λ / d μ) \cdot (d μ / d ν)$ $ν$ -almost everywhere.

Hint

Use the defining identity $λ (E) = \int_{E} (d λ / d μ) d μ$ and substitute the density of $μ$ against $ν$ .

Answer

By Radon-Nikodym applied to $λ ≪ μ$ and $μ ≪ ν$ , there exist non-negative measurable $f, g$ with $λ (E) = \int_{E} f d μ$ and $μ (E) = \int_{E} g d ν$ . Substitute the second into the first via the change-of-variables identity $\int_{E} h d μ = \int_{E} h \cdot g d ν$ for non-negative measurable $h$ (a standard MCT-extension of the defining identity from indicators to general non-negative functions). Taking $h = f$ : $λ (E) = \int_{E} f d μ = \int_{E} f \cdot g d ν .$ By uniqueness of the Radon-Nikodym derivative against $ν$ , $d λ / d ν = f \cdot g = (d λ / d μ) \cdot (d μ / d ν)$ $ν$ -almost everywhere.

The chain rule is the multiplicative version of the additive chain rule of calculus, and it is the foundational identity behind the pushforward-pullback algebra of densities in probability theory and differential geometry.

Exercise 5 (medium, symbolic).

Show that the conditional expectation $E [X ∣ G]$ of an integrable random variable $X$ on a probability space $(Ω, F, P)$ with respect to a sub- $σ$ -algebra $G \subseteq F$ exists and is unique up to $P$ -almost-everywhere equality, by identifying it as a Radon-Nikodym derivative.

Hint

Define a signed measure $ν (E) = \int_{E} X d P$ on $G$ . Apply Radon-Nikodym to $ν$ against $P ∣_{G}$ .

Answer

Define $ν : G \to R$ by $ν (E) = \int_{E} X d P$ . The set-function $ν$ is a signed measure on $(Ω, G)$ (countable additivity follows from the dominated convergence theorem applied to $X \cdot χ_{⨆_{n} E_{n}}$ ).

The signed measure $ν$ is absolutely continuous with respect to $P ∣_{G}$ : if $P (E) = 0$ for $E \in G$ , then $\int_{E} X d P = 0$ (the integral of any integrable function over a measure-zero set vanishes), so $ν (E) = 0$ .

Apply the signed-measure Radon-Nikodym theorem (Theorem 6 below — decompose $X = X^{+} - X^{-}$ and apply the unsigned theorem to each part): there exists a $G$ -measurable function $Y \in L^{1} (P ∣_{G})$ , unique up to $P$ -a.e. equality, with $ν (E) = \int_{E} Y d P$ for every $E \in G$ . Combining with the defining identity for $ν$ : $\int_{E} Y d P = \int_{E} X d P$ for every $E \in G$ .

The function $Y$ is the conditional expectation of $X$ given $G$ , denoted $E [X ∣ G]$ . The defining property (integral of $E [X ∣ G]$ against any $G$ -measurable indicator equals the corresponding integral of $X$ ) is the Radon-Nikodym identity translated into probability language.

This identification of conditional expectation with a Radon-Nikodym derivative is the foundational construction of modern probability theory (Kolmogorov 1933 Grundbegriffe der Wahrscheinlichkeitsrechnung), and it is the starting point for martingale theory, stochastic calculus, and Bayesian inference.

Exercise 6 (medium, numeric).

Let $ν$ on $[0, 1]$ be the sum of a continuous part with density $f (x) = x$ against Lebesgue measure plus a point mass of size $1/2$ at $x = 1/2$ . Compute the Lebesgue decomposition of $ν$ relative to Lebesgue measure $μ$ .

Hint

The continuous part is absolutely continuous; the point mass is singular.

Answer

Write $ν = ν_{a} + ν_{s}$ where:

$ν_{a}$ is the absolutely continuous part: $ν_{a} (E) = \int_{E} x d x$ , with Radon-Nikodym derivative $d ν_{a} / d μ = x$ .
$ν_{s}$ is the singular part: $ν_{s} = (1/2) δ_{1/2}$ , the point mass of size $1/2$ at $x = 1/2$ , concentrated on the Lebesgue-null set ${1/2}$ .

Verify the decomposition: total mass of $ν_{a}$ on $[0, 1]$ is $\int_{0}^{1} x d x = 1/2$ ; total mass of $ν_{s}$ is $1/2$ ; together $1$ , consistent with $ν$ being a probability measure. Absolute continuity of $ν_{a}$ : any Lebesgue-null set $N$ has $ν_{a} (N) = \int_{N} x d x = 0$ . Singularity of $ν_{s}$ with respect to $μ$ : take $A = {1/2}$ , so $μ (A) = 0$ and $ν_{s} (A^{c}) = 0$ (the point mass is concentrated on $A$ ).

The Lebesgue decomposition $ν = ν_{a} + ν_{s}$ is the unique splitting into a density-having piece plus a piece concentrated on a Lebesgue-null set, and it is the abstract version of separating "continuous" and "discrete" distributions in elementary probability.

Exercise 7 (hard, symbolic).

Use the Radon-Nikodym theorem to prove the Riesz representation theorem for the dual of $L^{p}$ on a $σ$ -finite measure space, for $1 \leq p < \infty$ : every bounded linear functional $Φ$ on $L^{p} (μ)$ is of the form $Φ (f) = \int f g d μ$ for a unique $g \in L^{q} (μ)$ with $1/ p + 1/ q = 1$ .

Hint

Define a signed measure $ν (E) = Φ (χ_{E})$ on sets of finite measure, show $ν ≪ μ$ , and apply Radon-Nikodym.

Answer

Step 1 (the signed measure). For a bounded linear functional $Φ$ on $L^{p} (μ)$ with $σ$ -finite $μ$ , define $ν (E) = Φ (χ_{E})$ for measurable $E$ of finite measure (so $χ_{E} \in L^{p}$ ). The set-function $ν$ is finitely additive on the algebra of finite-measure sets (by linearity of $Φ$ ) and $σ$ -additive (by continuity of $Φ$ and the dominated convergence theorem applied to the increasing union $χ_{⋃_{n} E_{n}} \to χ_{⋃ E_{n}}$ in $L^{p}$ for sets of finite measure). Extension via $σ$ -finiteness gives a signed measure on the full $σ$ -algebra.

Step 2 (absolute continuity). If $μ (E) = 0$ , then $χ_{E} = 0$ in $L^{p}$ ( $L^{p}$ identifies a.e.-equal functions), so $Φ (χ_{E}) = Φ (0) = 0$ , hence $ν (E) = 0$ . So $ν ≪ μ$ .

Step 3 (Radon-Nikodym). By the signed-measure Radon-Nikodym theorem (Theorem 6 below), there exists a $μ$ -measurable function $g$ with $ν (E) = \int_{E} g d μ$ for every measurable $E$ of finite measure.

Step 4 (linearity to general simple functions). By linearity of both sides, $Φ (s) = \int s g d μ$ for every simple function $s = \sum_{i} c_{i} χ_{E_{i}}$ in $L^{p}$ .

Step 5 (density). Simple functions in $L^{p}$ are dense (standard $L^{p}$ -density theorem). By continuity of $Φ$ and of the integral pairing $f \mapsto \int f g d μ$ (the latter requires $g \in L^{q}$ ), the identity extends to all $f \in L^{p}$ .

Step 6 ( $g \in L^{q}$ and norm equality). The integral identity $∣Φ (f) ∣ = ∣ \int f g d μ ∣ \leq ∥ f ∥_{p} ∥ g ∥_{q}$ (Hölder) gives $∥ g ∥_{q} \leq ∥Φ ∥_{(L^{p})^{*}}$ . Conversely, choosing $f = ∣ g ∣^{q - 1} sgn (g) /∥ g ∥_{q}^{q - 1}$ (the duality-extremising function) and computing $Φ (f)$ gives $∥ g ∥_{q} \geq ∥Φ ∥_{(L^{p})^{*}}$ . So $∥ g ∥_{q} = ∥Φ ∥_{(L^{p})^{*}}$ and $g \in L^{q}$ .

Step 7 (uniqueness). Two representers $g, g^{'}$ would give $\int f (g - g^{'}) d μ = 0$ for all $f \in L^{p}$ , forcing $g = g^{'}$ $μ$ -a.e. by testing on $f = ∣ g - g^{'} ∣^{q - 1} sgn (g - g^{'})$ .

Hence $(L^{p})^{*} ≅ L^{q}$ isometrically for $1 \leq p < \infty$ on $σ$ -finite measure spaces, with the isomorphism realised by Radon-Nikodym. This is the load-bearing role of Radon-Nikodym in functional analysis: it converts a structural-duality question into a density-existence question on signed measures.

Exercise 8 (hard, symbolic).

Show how the martingale convergence theorem (Doob 1953 Stochastic Processes Ch. VII) connects to Radon-Nikodym: if $(M_{n})$ is a uniformly integrable martingale on $(Ω, F, P)$ with respect to a filtration $(F_{n})$ , then $M_{n} \to M_{\infty}$ almost surely and in $L^{1}$ , where $M_{\infty} = E [M_{\infty} ∣ F_{\infty}]$ is identified via Radon-Nikodym as a density.

Hint

Each $M_{n}$ is the conditional expectation $E [M_{\infty} ∣ F_{n}]$ , which is the Radon-Nikodym derivative of the measure $ν_{n} (E) = \int_{E} M_{\infty} d P$ against $P ∣_{F_{n}}$ .

Answer

Step 1 (the martingale and its limit measure). A uniformly integrable martingale $(M_{n})$ adapted to filtration $(F_{n})$ satisfies $M_{n} = E [M_{n + 1} ∣ F_{n}]$ for each $n$ , and uniform integrability gives an $L^{1}$ -convergent subsequence by Dunford-Pettis. The full sequence converges almost surely by Doob's almost-sure convergence theorem (Doob 1940 Trans. AMS 47, 455).

Step 2 (the Radon-Nikodym connection). Define $ν_{n} (E) = \int_{E} M_{n} d P$ on $(Ω, F_{n})$ . The signed measure $ν_{n}$ is absolutely continuous with respect to $P ∣_{F_{n}}$ , with Radon-Nikodym derivative $d ν_{n} / d P ∣_{F_{n}} = M_{n}$ (this is the defining identity of conditional expectation, via Exercise 5).

The martingale property $M_{n} = E [M_{n + 1} ∣ F_{n}]$ translates: the Radon-Nikodym derivative of $ν_{n + 1}$ against $P ∣_{F_{n}}$ equals $M_{n}$ . So the sequence of signed measures $ν_{n}$ is consistent under restriction: $ν_{n + 1} ∣_{F_{n}} = ν_{n}$ .

Step 3 (Kolmogorov extension). By the Kolmogorov consistency theorem for signed measures, the consistent sequence $(ν_{n})$ extends to a single signed measure $ν_{\infty}$ on $F_{\infty} = σ (⋃_{n} F_{n})$ , with $ν_{\infty} ∣_{F_{n}} = ν_{n}$ for each $n$ .

Step 4 (Radon-Nikodym on $F_{\infty}$ ). Apply Radon-Nikodym to $ν_{\infty} ≪ P ∣_{F_{\infty}}$ (absolute continuity descends from each $F_{n}$ via the consistency relation, plus a density argument). The resulting derivative $M_{\infty} = d ν_{\infty} / d P ∣_{F_{\infty}}$ is the limit of the martingale: each $M_{n}$ is the conditional expectation $E [M_{\infty} ∣ F_{n}]$ , so the convergence $M_{n} \to M_{\infty}$ a.s. and in $L^{1}$ follows from Doob's almost-sure convergence theorem combined with uniform integrability.

Step 5 (probabilistic interpretation). The martingale convergence theorem is the time-asymptotic Radon-Nikodym identification of a sequence of consistent conditional expectations with a single limit density. The interpretation: the sequence $(M_{n})$ represents the evolving estimates of $M_{\infty}$ as more information $F_{n}$ accumulates, and the convergence theorem says these estimates do converge to the true value $M_{\infty}$ both almost surely and in $L^{1}$ , with the limit identified via Radon-Nikodym on the union $σ$ -algebra.

This is the foundational result of martingale theory and the abstract setting of every estimation procedure in mathematical statistics and stochastic finance (the Black-Scholes pricing of derivatives is a martingale-convergence-driven Radon-Nikodym change of measure under the risk-neutral pricing measure).

Lean formalization Intermediate+

lean_status: partial — Mathlib provides the central Lebesgue-Radon-Nikodym architecture. MeasureTheory.Measure.AbsolutelyContinuous is the relation $ν ≪ μ$ , with notation ν ≪ μ. MeasureTheory.Measure.MutuallySingular is the relation $ν ⊥ μ$ , with notation ν ⟂ₘ μ. MeasureTheory.Measure.haveLebesgueDecomposition_of_sigmaFinite gives existence of the Lebesgue decomposition on $σ$ -finite measures, with MeasureTheory.Measure.singularPart and MeasureTheory.Measure.rnDeriv extracting the singular part and the Radon-Nikodym derivative. The change-of-variables formula appears as MeasureTheory.lintegral_rnDeriv_mul. The von Neumann Hilbert-space proof is not explicitly reconstructed in Mathlib — Mathlib follows a more direct decomposition argument via the Hahn-Jordan decomposition — but the resulting density and Lebesgue decomposition are fully formalised in the companion module Codex.Analysis.MeasureTheory.AbsoluteContinuityRadonNikodym.

import Mathlib.MeasureTheory.Decomposition.RadonNikodym
import Mathlib.MeasureTheory.Decomposition.Lebesgue
import Mathlib.MeasureTheory.Measure.AbsolutelyContinuous

open MeasureTheory

variable {α : Type*} [MeasurableSpace α]
variable (μ ν : Measure α) [SigmaFinite μ] [SigmaFinite ν]

-- Absolute continuity: ν ≪ μ means every μ-null set is ν-null
example (h : ν ≪ μ) (E : Set α) (hE : MeasurableSet E) (hμ : μ E = 0) :
    ν E = 0 := h hμ

-- Lebesgue decomposition: ν = ν.singularPart μ + μ.withDensity (ν.rnDeriv μ)
example [μ.HaveLebesgueDecomposition ν] :
    ν = ν.singularPart μ + μ.withDensity (ν.rnDeriv μ) :=
  (Measure.haveLebesgueDecomposition_add ν μ).symm

-- Radon-Nikodym identity: when ν ≪ μ, integration against ν reduces to
-- integration against μ weighted by the RN derivative
example (h : ν ≪ μ) (f : α → ENNReal) (hf : Measurable f) :
    ∫⁻ x, f x ∂ν = ∫⁻ x, f x * ν.rnDeriv μ x ∂μ :=
  lintegral_rnDeriv_mul h hf.aemeasurable

Advanced results Master

The advanced theory of absolute continuity, mutual singularity, and the Radon-Nikodym derivative splits across nine strands: the Hahn-Jordan decomposition of signed measures, the Lebesgue decomposition into absolutely continuous plus singular parts, the unsigned and signed Radon-Nikodym theorems, the chain rule and change-of-variables for Radon-Nikodym derivatives, $σ$ -finiteness as a load-bearing hypothesis and explicit counterexamples without it, conditional expectation in probability theory, the Riesz representation of $(L^{p})^{*}$ , the martingale convergence theorem, and Girsanov-style changes of measure in stochastic analysis.

Theorem 1 (Hahn-Jordan decomposition; Hahn 1921 Ann. Sc. Norm. Sup. Pisa 9, 429). Let $(X, M)$ be a measurable space and let $λ$ be a signed measure on $M$ . There exist measurable sets $P, N \in M$ with $P \cup N = X$ , $P \cap N = \emptyset$ , and $λ (E) \geq 0$ for every measurable $E \subseteq P$ and $λ (E) \leq 0$ for every measurable $E \subseteq N$ . The decomposition $(P, N)$ is unique up to $λ$ -null modifications. Defining $λ^{+} (E) = λ (E \cap P)$ and $λ^{-} (E) = - λ (E \cap N)$ gives non-negative measures $λ^{+}, λ^{-}$ with $λ = λ^{+} - λ^{-}$ and $λ^{+} ⊥ λ^{-}$ .

The Hahn-Jordan decomposition is the structural foundation of signed-measure theory: every signed measure splits canonically into a positive part and a negative part, supported on disjoint sets. The total variation measure is $∣ λ ∣ = λ^{+} + λ^{-}$ , and the total variation norm is $∥ λ ∥_{T V} = ∣ λ ∣ (X)$ . The proof uses an extremal-set argument: take $P$ to be a set of maximum positive measure, that is one achieving the supremum $sup {λ (A) : A \in M}$ , and verify the positive-set property via a refinement argument that successively removes negative-mass subsets ^{[Hahn 1921]}.

Theorem 2 (Lebesgue decomposition; Lebesgue 1904 Leçons sur l'intégration §VII). Let $μ, ν$ be $σ$ -finite measures on $(X, M)$ . There exist unique $σ$ -finite measures $ν_{a}, ν_{s}$ on $M$ with $ν = ν_{a} + ν_{s}$ , $ν_{a} ≪ μ$ , and $ν_{s} ⊥ μ$ .

Lebesgue's 1904 Leçons sur l'intégration et la recherche des fonctions primitives ^{[Lebesgue 1904]} introduced the decomposition for the special case of measures on the real line, motivated by the structure of distribution functions: every monotone increasing function on $[a, b]$ has a unique decomposition as the sum of an absolutely continuous monotone function (with $L^{1}$ -density derivative), a singular continuous monotone function (with derivative zero a.e., concentrated on a Lebesgue-null set, like the Cantor function), and a jump function (a sum of step functions). The abstract measure-theoretic version (general $σ$ -finite $μ, ν$ ) was completed by Radon 1913 and Nikodym 1930. The proof is essentially Step 4-5 of the main theorem's proof: the decomposition sets are $A = {g < 1}$ and $B = {g = 1}$ from the Hilbert-space argument.

Theorem 3 (Radon-Nikodym, unsigned; Radon 1913, Nikodym 1930). Let $μ, ν$ be $σ$ -finite measures on $(X, M)$ with $ν ≪ μ$ . There exists a non-negative $M$ -measurable function $f : X \to [0, \infty)$ , unique up to $μ$ -a.e. equality, with $ν (E) = \int_{E} f d μ$ for every $E \in M$ .

Radon's 1913 paper ^{[Radon 1913]}, titled Theorie und Anwendungen der absolut additiven Mengenfunktionen, established the theorem for measures absolutely continuous with respect to Lebesgue measure on $R^{n}$ . Nikodym's 1930 Fundamenta Mathematicae paper ^{[Nikodym 1930]} extended it to arbitrary $σ$ -finite background measures on abstract measurable spaces, completing the modern statement. The proof is the von Neumann Hilbert-space argument (Step 1-7 of the main theorem) for the finite case, plus $σ$ -finite exhaustion.

Theorem 4 (Radon-Nikodym, signed and complex; Folland 2e Theorem 3.8). Let $μ$ be a $σ$ -finite measure on $(X, M)$ . For every $σ$ -finite signed measure $λ$ with $λ ≪ μ$ (defined via $∣ λ ∣ ≪ μ$ ), there exists an $M$ -measurable $f : X \to R$ in $L_{loc}^{1} (μ)$ , unique up to $μ$ -a.e. equality, with $λ (E) = \int_{E} f d μ$ for every $E \in M$ . The complex case extends by decomposing into real and imaginary parts.

The signed-and-complex Radon-Nikodym theorem reduces to the unsigned case via the Hahn-Jordan decomposition $λ = λ^{+} - λ^{-}$ : apply Theorem 3 to each of $λ^{+}, λ^{-}$ to get non-negative densities $f^{+}, f^{-}$ , then $f = f^{+} - f^{-}$ is the signed density. The complex case decomposes the complex measure $λ = λ_{r} + i λ_{i}$ into real and imaginary signed measures and applies the real case to each.

Theorem 5 (Chain rule for Radon-Nikodym; Folland 2e Theorem 3.7). Let $λ, μ, ν$ be $σ$ -finite measures on $(X, M)$ with $λ ≪ μ ≪ ν$ . Then $λ ≪ ν$ , and the Radon-Nikodym derivatives satisfy $d λ / d ν = (d λ / d μ) \cdot (d μ / d ν)$ $ν$ -almost everywhere.

The chain rule is the multiplicative analogue of the additive chain rule of calculus, and it is the foundational algebraic identity for nested density representations. The proof is Exercise 4 above: substitute the density of $μ$ against $ν$ into the defining identity of $λ$ against $μ$ and apply uniqueness of the resulting density against $ν$ .

A consequence: under mutual absolute continuity $ν ≪ μ ≪ ν$ , the derivatives are reciprocals: $d ν / d μ \cdot d μ / d ν = 1$ $μ$ -a.e. (and $ν$ -a.e.). This is the symmetric-density identity behind the Kullback-Leibler divergence and the change-of-coordinates formula in differential geometry.

Theorem 6 (Change of variables under Radon-Nikodym; Folland 2e Theorem 3.9). Let $μ, ν$ be $σ$ -finite with $ν ≪ μ$ and Radon-Nikodym derivative $f = d ν / d μ$ . For every non-negative $M$ -measurable function $g : X \to [0, \infty]$ : $\int_{X} g d ν = \int_{X} g \cdot f d μ .$ For signed/complex measurable $g$ , the identity holds when either side has finite absolute integral, with both being defined as $L^{1}$ -convergent integrals on a full-measure set.

The change-of-variables formula extends the defining identity $ν (E) = \int_{E} f d μ$ (which is the special case $g = χ_{E}$ ) to general non-negative measurable functions via the standard machine: indicators $\to$ simple $\to$ non-negative measurable (MCT) $\to$ signed via $g = g^{+} - g^{-}$ . The identity is the abstract version of the calculus substitution rule $\int g (y) d y = \int g (ϕ (x)) \cdot ∣ ϕ^{'} (x) ∣ d x$ for a smooth bijection $ϕ$ : in measure-theoretic language, the Jacobian determinant $∣ ϕ^{'} ∣$ is the Radon-Nikodym derivative of the pullback measure $ϕ_{*} (d y)$ against $d x$ .

Theorem 7 ($(L^p)^ \cong L^q$ via Radon-Nikodym; Riesz 1910 Math. Ann. 69, 449).* Let $(X, M, μ)$ be a $σ$ -finite measure space and $1 \leq p < \infty$ with conjugate exponent $1/ p + 1/ q = 1$ . The map $Ψ : L^{q} (μ) \to (L^{p} (μ))^{*}$ defined by $Ψ (g) (f) = \int f g d μ$ is an isometric isomorphism.

The Radon-Nikodym theorem is the load-bearing step in the proof: starting from a bounded linear functional $Φ : L^{p} \to C$ , define a signed measure $ν (E) = Φ (χ_{E})$ on sets of finite measure, observe $ν ≪ μ$ , apply Radon-Nikodym to get a density $g$ , and verify $g \in L^{q}$ with $∥ g ∥_{q} = ∥Φ ∥_{(L^{p})^{*}}$ . The full proof is Exercise 7. The duality identification $(L^{p})^{*} ≅ L^{q}$ is the foundational structural fact behind the weak topology on $L^{p}$ , the Banach-Alaoglu compactness of bounded sequences, and the reflexivity of $L^{p}$ for $1 < p < \infty$ (since $L^{p} ≅ L^{q} ≅ L^{p}$ via two applications of the duality, modulo Clarkson's inequality controlling the canonical embedding).

The endpoint $p = \infty$ fails: $(L^{\infty})^{*}$ is strictly larger than $L^{1}$ on a $σ$ -finite space, with the difference filled by "purely finitely additive" measures (Banach limits and finitely-additive set-functions). The construction of these non- $L^{1}$ functionals on $L^{\infty}$ requires the axiom of choice via the Hahn-Banach theorem ^{[Riesz 1910]}.

Theorem 8 (Conditional expectation as Radon-Nikodym derivative; Kolmogorov 1933 Grundbegriffe der Wahrscheinlichkeitsrechnung). Let $(Ω, F, P)$ be a probability space, $X \in L^{1} (P)$ , and $G \subseteq F$ a sub- $σ$ -algebra. There exists a $G$ -measurable function $Y \in L^{1} (P ∣_{G})$ , unique up to $P$ -a.e. equality, with $\int_{E} Y d P = \int_{E} X d P$ for every $E \in G$ . The function $Y$ is the conditional expectation of $X$ given $G$ , denoted $E [X ∣ G]$ .

The proof (Exercise 5) identifies $Y$ as the Radon-Nikodym derivative of the signed measure $ν (E) = \int_{E} X d P$ (on $G$ ) against $P ∣_{G}$ . This identification is the foundational construction of modern probability theory: conditional expectation is not defined pointwise but rather as an equivalence class of $G$ -measurable random variables satisfying a Radon-Nikodym-type integral identity.

The basic properties — tower property $E [E [X ∣ G] ∣ H] = E [X ∣ H]$ for nested $H \subseteq G$ , linearity, monotonicity, $L^{p}$ -boundedness for $p \in [1, \infty]$ (Jensen's inequality), and projection $E [\cdot ∣ G]$ being the $L^{2}$ -orthogonal projection onto the closed subspace $L^{2} (G, P ∣_{G}) ↪ L^{2} (F, P)$ — all derive from the Radon-Nikodym defining identity.

Theorem 9 (Doob martingale convergence; Doob 1940 Trans. AMS 47, 455; Doob 1953 Stochastic Processes Ch. VII). Let $(M_{n})$ be a uniformly integrable martingale on $(Ω, F, P)$ adapted to filtration $(F_{n})$ . Then $M_{n} \to M_{\infty}$ almost surely and in $L^{1}$ , where $M_{\infty} \in L^{1} (P)$ is $F_{\infty}$ -measurable and $M_{n} = E [M_{\infty} ∣ F_{n}]$ for every $n$ .

The proof (sketched in Exercise 8) identifies $M_{\infty}$ as the Radon-Nikodym derivative on $F_{\infty}$ of the limit signed measure $ν_{\infty}$ extending the consistent sequence $ν_{n} (E) = \int_{E} M_{n} d P$ . The almost-sure convergence comes from Doob's upcrossing inequality and the $L^{1}$ -convergence from uniform integrability via Dunford-Pettis ^{[Doob — Stochastic Processes]}.

Doob's theorem is the time-asymptotic Radon-Nikodym theorem, with the filtration $(F_{n})$ representing accumulating information and the martingale $(M_{n})$ representing the evolving estimates. The convergence theorem says the estimates do converge to the limit density on the union $σ$ -algebra, both almost surely and in $L^{1}$ . The result is the foundational convergence theorem of probability theory and the abstract underpinning of every recursive estimation procedure (Kalman filtering, stochastic approximation, Bayesian posterior updating).

Theorem 10 (Girsanov change of measure; Girsanov 1960 Theory Probab. Appl. 5, 285). Let $(B_{t})_{t \in [0, T]}$ be a Brownian motion on $(Ω, F, P)$ , $θ : [0, T] \times Ω \to R$ an adapted process with $\int_{0}^{T} θ_{s}^{2} d s < \infty$ a.s. and Novikov's condition $E [exp ((1/2) \int_{0}^{T} θ_{s}^{2} d s)] < \infty$ . Define $Q$ by $d Q / d P = exp (\int_{0}^{T} θ_{s} d B_{s} - (1/2) \int_{0}^{T} θ_{s}^{2} d s)$ . Then under $Q$ , the process $\tilde{B}_{t} = B_{t} - \int_{0}^{t} θ_{s} d s$ is a Brownian motion.

Girsanov's theorem is the change-of-measure formula in stochastic analysis: the Radon-Nikodym derivative $d Q / d P$ is the exponential martingale, and the change of measure shifts the drift of the Brownian motion by $θ$ . The construction is foundational in mathematical finance, where it underlies the risk-neutral pricing of derivatives (Black-Scholes 1973 J. Polit. Econ. 81, 637 and Harrison-Kreps 1979 J. Econ. Theory 20, 381): the equivalent martingale measure $Q$ is constructed via Girsanov to make the discounted asset price a martingale, and option prices are computed as $Q$ -expectations ^{[Folland 2e for Radon-Nikodym background; the Girsanov construction itself is downstream]}.

Synthesis. The Lebesgue-Radon-Nikodym architecture identifies one of the three most foundational pillars of measure theory (alongside Carathéodory's outer-measure construction and Fubini-Tonelli on product spaces). The central insight is the existence and uniqueness of a density function $f = d ν / d μ$ for absolutely continuous pairs of $σ$ -finite measures, together with the Lebesgue decomposition splitting general pairs into an absolutely continuous part plus a singular part. The von Neumann Hilbert-space proof reveals the structural reason the theorem works: the density is the orthogonal projection of the constant function $1$ onto a specific closed subspace of $L^{2} (μ + ν)$ , and the Riesz representation theorem on Hilbert spaces does all the work.

The pattern generalises in five directions. First, to probability theory: conditional expectation, regular conditional probabilities, disintegrations of measures, and martingales are all explicit Radon-Nikodym-derivative constructions. Second, to functional analysis: the duality $(L^{p})^{*} ≅ L^{q}$ for $1 \leq p < \infty$ on $σ$ -finite spaces is a direct Radon-Nikodym application. Third, to stochastic analysis: Girsanov's theorem and the risk-neutral pricing framework of mathematical finance are exponential-martingale Radon-Nikodym derivatives. Fourth, to information theory: the Kullback-Leibler divergence $D_{K L} (ν ∥ μ) = \int lo g (d ν / d μ) d ν$ is the integral of the log-density, the foundational information-theoretic measure of distance between probability distributions. Fifth, to differential geometry: the pullback and pushforward of measures under smooth maps are governed by Jacobian determinants, which are the Radon-Nikodym derivatives in coordinate charts.

Putting these together identifies Radon-Nikodym as the bridge between absolute continuity (a measure-theoretic relation) and density-existence (a function-theoretic representation), with the Hilbert-space proof revealing the bridge as a Riesz-representation projection in disguise — six structurally distinct but Radon-Nikodym-unified pillars of modern analysis, probability, and applied mathematics.

Full proof set Master

Proposition 1 (Hahn decomposition — full proof). Every signed measure $λ$ on $(X, M)$ admits a Hahn decomposition $X = P ⊔ N$ with $P$ positive and $N$ negative.

Proof. Assume $λ$ takes values in $[- \infty, \infty)$ (the case $λ$ takes values in $(- \infty, \infty]$ is symmetric). Let $M = sup {λ (A) : A \in M}$ . Choose sets $A_{n} \in M$ with $λ (A_{n}) \to M$ . The claim is that one can construct a measurable $P$ with $λ (P) = M$ (and $P$ positive).

Sub-step 1a (positivity lemma). For each $A_{n}$ , define $P_{n} \subseteq A_{n}$ to be a positive set (every measurable subset has non-negative $λ$ -measure) achieving $λ (P_{n}) \geq λ (A_{n})$ . This is constructed by transfinite removal of negative-mass measurable subsets: if $A_{n}$ contains a measurable subset $B \subseteq A_{n}$ with $λ (B) < 0$ , replace $A_{n}$ by $A_{n} ∖ B$ , which has $λ (A_{n} ∖ B) = λ (A_{n}) - λ (B) > λ (A_{n})$ . Iterate (countably or transfinitely as needed); the process terminates with a positive set $P_{n}$ , because at each step the removed-measure exceeds $- 1/ k_{n}$ for some increasing sequence $k_{n}$ , with the total removed measure bounded by $λ (A_{n}) - λ (P_{n}) \leq M - λ (A_{n})$ .

Sub-step 1b (Hahn set). Define $P = ⋃_{n} P_{n}$ . The union of positive sets is positive (the $λ$ -measure of a measurable subset $E \subseteq P$ equals $lim_{n} λ (E \cap P_{n}) \geq 0$ ). By construction $λ (P) \geq sup_{n} λ (P_{n}) \geq sup_{n} λ (A_{n}) - ϵ_{n} = M$ , and $λ (P) \leq M$ by the supremum definition. So $λ (P) = M$ and $P$ is positive.

Sub-step 1c (negative complement). Define $N = X ∖ P$ . Every measurable subset $E \subseteq N$ satisfies $λ (E) \leq 0$ : otherwise $P \cup E$ would have $λ (P \cup E) = M + λ (E) > M$ , contradicting the supremum. So $N$ is negative.

$□$

Proposition 2 (Lebesgue decomposition existence and uniqueness; finite-measure case). Let $μ, ν$ be finite measures on $(X, M)$ . There exist unique measures $ν_{a}, ν_{s}$ with $ν = ν_{a} + ν_{s}$ , $ν_{a} ≪ μ$ , and $ν_{s} ⊥ μ$ .

Proof. The existence is Step 4-5 of the main theorem's proof: the Hilbert-space argument produces measurable sets $A, B$ with $B$ being the singular support and $A$ being the absolutely continuous support. The measures $ν_{a} (E) = ν (E \cap A)$ and $ν_{s} (E) = ν (E \cap B)$ give the decomposition.

For uniqueness: suppose $ν = \tilde{ν}_{a} + \tilde{ν}_{s}$ is another decomposition. Then $ν_{a} - \tilde{ν}_{a} = \tilde{ν}_{s} - ν_{s}$ . The left-hand side is absolutely continuous against $μ$ (difference of two $≪ μ$ measures), and the right-hand side is mutually singular with $μ$ (difference of two $⊥ μ$ measures, both supported on $μ$ -null sets). The only signed measure that is both absolutely continuous against $μ$ and mutually singular with $μ$ is the zero measure (the absolutely continuous part vanishes on $μ$ -null sets, and the singular part is concentrated on a $μ$ -null set). So $ν_{a} = \tilde{ν}_{a}$ and $ν_{s} = \tilde{ν}_{s}$ .

$□$

Proposition 3 ( $σ$ -finite extension). The Lebesgue-Radon-Nikodym theorem extends from finite to $σ$ -finite measures via exhaustion.

Proof. Let $X = ⋃_{n} X_{n}$ with $X_{n}$ disjoint and $μ (X_{n}), ν (X_{n}) < \infty$ . Apply the finite-measure Radon-Nikodym (Proposition 2 plus the Hilbert-space density argument) to each $X_{n}$ : get densities $f_{n}$ on $X_{n}$ and singular parts $ν_{s, n}$ on $X_{n}$ . The patched functions $f (x) = f_{n} (x)$ for $x \in X_{n}$ and $ν_{s} (E) = \sum_{n} ν_{s, n} (E \cap X_{n})$ are the global density and singular part. Measurability is preserved because the patching is along a countable measurable partition.

$□$

Proposition 4 (Uniqueness of the Radon-Nikodym derivative). If $ν_{a} = f d μ = f^{'} d μ$ , then $f = f^{'}$ $μ$ -almost everywhere.

Proof. Set $h = f - f^{'}$ . The hypothesis gives $\int_{E} h d μ = 0$ for every $E \in M$ . Apply this with $E = {h > 0} \cap X_{n}$ on each $σ$ -finite piece $X_{n}$ : $\int_{E} h d μ = 0$ with non-negative integrand $h \cdot χ_{E}$ , so $h \cdot χ_{E} = 0$ $μ$ -a.e. on $X_{n}$ , hence $μ (E) = 0$ on $X_{n}$ . Similarly $μ ({h < 0} \cap X_{n}) = 0$ . Taking the union over $n$ gives $μ ({h \neq = 0}) = 0$ , so $f = f^{'}$ $μ$ -a.e.

$□$

Proposition 5 (Signed Radon-Nikodym). Let $μ$ be $σ$ -finite and $λ$ a signed measure with $∣ λ ∣ ≪ μ$ . Then there is $f \in L_{loc}^{1} (μ)$ with $λ (E) = \int_{E} f d μ$ for every $E \in M$ , unique up to $μ$ -a.e. equality.

Proof. By the Hahn-Jordan decomposition (Proposition 1), $λ = λ^{+} - λ^{-}$ with $λ^{+}, λ^{-}$ non-negative measures and $λ^{\pm} ≪ μ$ (since $∣ λ ∣ = λ^{+} + λ^{-} ≪ μ$ ). Apply the unsigned Radon-Nikodym (Proposition 3) to each: get $f^{+} = d λ^{+} / d μ$ and $f^{-} = d λ^{-} / d μ$ . Set $f = f^{+} - f^{-}$ . Then $λ (E) = λ^{+} (E) - λ^{-} (E) = \int_{E} f^{+} d μ - \int_{E} f^{-} d μ = \int_{E} f d μ$ by linearity of the integral. The function $f$ is in $L_{loc}^{1} (μ)$ because $f^{\pm}$ are. Uniqueness reduces to the unsigned case (Proposition 4) applied to $f^{+}$ and $f^{-}$ separately.

$□$

Proposition 6 (Chain rule). For $σ$ -finite measures $λ ≪ μ ≪ ν$ , the derivatives satisfy $d λ / d ν = (d λ / d μ) (d μ / d ν)$ $ν$ -a.e.

Proof. Let $f = d λ / d μ$ and $g = d μ / d ν$ . The change-of-variables formula (Theorem 6, Proposition 7 below) gives, for non-negative measurable $h$ : $\int_{X} h d λ = \int_{X} h f d μ = \int_{X} h f g d ν,$ where the second equality applies the change-of-variables formula with the density $g$ of $μ$ against $ν$ . Specialising to $h = χ_{E}$ : $λ (E) = \int_{E} f g d ν$ . By uniqueness of the Radon-Nikodym derivative of $λ$ against $ν$ , $d λ / d ν = f g = (d λ / d μ) (d μ / d ν)$ $ν$ -a.e.

$□$

Proposition 7 (Change of variables for Radon-Nikodym). For $σ$ -finite $μ, ν$ with $ν ≪ μ$ , density $f = d ν / d μ$ , and non-negative measurable $g : X \to [0, \infty]$ : $\int_{X} g d ν = \int_{X} g f d μ .$

Proof. The standard machine: indicators $\to$ simple $\to$ non-negative measurable.

Step 1 (indicators). For $g = χ_{E}$ : $\int_{X} χ_{E} d ν = ν (E) = \int_{E} f d μ = \int_{X} χ_{E} f d μ$ , where the second equality is the defining identity of the Radon-Nikodym derivative.

Step 2 (simple). For $g = \sum_{i} c_{i} χ_{E_{i}}$ a non-negative simple function: linearity of the integral on both sides gives $\int g d ν = \sum_{i} c_{i} \int χ_{E_{i}} d ν = \sum_{i} c_{i} \int χ_{E_{i}} f d μ = \int g f d μ$ .

Step 3 (non-negative measurable). Approximate $g$ by an increasing sequence of non-negative simple functions $s_{n} ↑ g$ via the standard simple-function approximation 02.07.03. By MCT 02.07.04 applied to both sides: $\int s_{n} d ν ↑ \int g d ν$ and $\int s_{n} f d μ ↑ \int g f d μ$ (note $s_{n} f ↑ g f$ pointwise since $f \geq 0$ ). The identity for simple $s_{n}$ (Step 2) passes to the limit, giving the identity for $g$ .

$□$

Proposition 8 ($(L^p)^ \cong L^q $f or$ 1 \leq p < \infty $o n$ \sigma $- f ini t e$ \mu$).* The map $Ψ : L^{q} (μ) \to (L^{p} (μ))^{*}$ defined by $Ψ (g) (f) = \int f g d μ$ is an isometric isomorphism.

Proof. Step 1-7 of Exercise 7. The injectivity and isometry $∥ g ∥_{q} = ∥Ψ (g) ∥_{(L^{p})^{*}}$ are immediate from Hölder and the duality-extremising function. The surjectivity uses Radon-Nikodym: starting from $Φ \in (L^{p})^{*}$ , define the signed measure $ν (E) = Φ (χ_{E})$ on finite-measure sets, observe $ν ≪ μ$ , apply Radon-Nikodym to get a density $g$ that recovers $Φ$ on simple functions, then extend by $L^{p}$ -density to all $f \in L^{p}$ .

The $σ$ -finiteness of $μ$ is load-bearing twice: once for the existence of the Radon-Nikodym derivative on the signed measure $ν$ , and once for the $L^{p}$ -density of simple functions of finite support (which fails on non- $σ$ -finite measure spaces, where the dual of $L^{p}$ is strictly larger than $L^{q}$ ).

$□$

Proposition 9 (Conditional expectation properties). For $X \in L^{1} (P)$ and $G \subseteq F$ , the conditional expectation $E [X ∣ G]$ defined via Radon-Nikodym satisfies:

(a) Linearity: $E [a X + bY ∣ G] = a E [X ∣ G] + b E [Y ∣ G]$ for $a, b \in R$ .

(b) Tower property: $E [E [X ∣ G] ∣ H] = E [X ∣ H]$ for $H \subseteq G$ .

(c) $L^{2}$ -projection: for $X \in L^{2} (P)$ , $E [X ∣ G]$ is the orthogonal projection of $X$ onto the closed subspace $L^{2} (G, P ∣_{G}) ↪ L^{2} (F, P)$ .

(d) Jensen: for convex $φ : R \to R$ with $φ (X) \in L^{1}$ , $φ (E [X ∣ G]) \leq E [φ (X) ∣ G]$ $P$ -a.s.

Proof sketch. All four properties derive from the defining Radon-Nikodym identity $\int_{E} E [X ∣ G] d P = \int_{E} X d P$ for $E \in G$ .

Linearity (a): both sides satisfy the defining identity with $a X + bY$ on the right. By uniqueness, they are $P$ -a.e. equal.

Tower (b): for $E \in H \subseteq G$ , $\int_{E} E [E [X ∣ G] ∣ H] d P = \int_{E} E [X ∣ G] d P = \int_{E} X d P = \int_{E} E [X ∣ H] d P$ . By uniqueness of the Radon-Nikodym derivative on $H$ , the two sides agree $P$ -a.s.

$L^{2}$ -projection (c): for $X \in L^{2}$ , $E [X ∣ G]$ is in $L^{2} (G)$ (Jensen with $φ (t) = t^{2}$ ). The orthogonality condition $⟨ X - E [X ∣ G], Z ⟩ = 0$ for all $Z \in L^{2} (G)$ reduces (by linearity in $Z$ ) to $\int_{E} (X - E [X ∣ G]) d P = 0$ for $E \in G$ , which is exactly the defining identity. So $E [X ∣ G]$ is the orthogonal projection.

Jensen (d): apply the convex inequality $φ (E [X ∣ G]) \leq E [φ (X) ∣ G]$ pointwise using the supporting-line characterisation of convex functions. The proof uses a representation $φ (t) = sup_{i} (a_{i} t + b_{i})$ over affine minorants, applies linearity and monotonicity of conditional expectation to each affine minorant, and takes the supremum.

$□$

Proposition 10 (Doob martingale convergence — outline). A uniformly integrable martingale $(M_{n})$ converges $P$ -a.s. and in $L^{1}$ to a limit $M_{\infty}$ , identified as the Radon-Nikodym derivative on $F_{\infty}$ .

Proof outline. Step 1 (almost-sure convergence): Doob's upcrossing inequality $E [U_{n}^{[a, b]}] \leq (E [∣ M_{n} ∣] + ∣ a ∣) / (b - a)$ on the number of upcrossings of any interval $[a, b]$ shows that with probability $1$ , the sequence $(M_{n} (ω))$ makes only finitely many oscillations across any rational interval, hence converges in $\overline{R}$ .

Step 2 ( $L^{1}$ -convergence under uniform integrability): the Dunford-Pettis theorem characterises uniformly integrable subsets of $L^{1}$ as relatively weakly compact, and combined with the a.s. limit gives $L^{1}$ -convergence (avoiding the Vitali convergence theorem direct path).

Step 3 (Radon-Nikodym identification): the consistent sequence of signed measures $ν_{n} (E) = \int_{E} M_{n} d P$ on $F_{n}$ extends to a single signed measure $ν_{\infty}$ on $F_{\infty}$ (Kolmogorov extension on the increasing union $σ$ -algebra). The Radon-Nikodym derivative $d ν_{\infty} / d P ∣_{F_{\infty}} = M_{\infty}$ , identifying the martingale limit as a density.

$□$

Connections Master

Lebesgue integral and monotone convergence 02.07.04. The MCT in the change-of-variables formula (Proposition 7) extends the defining identity $ν (E) = \int_{E} f d μ$ from indicators to non-negative measurable functions, the standard-machine step that bridges the measure-theoretic statement (the indicator-level identity) to the integral-theoretic statement (the general non-negative integrand).
Fatou's lemma and dominated convergence 02.07.05. The DCT is used in the proof of the $L^{p}$ -density theorem invoked in Proposition 8 (Riesz representation) and in the proof of $L^{1}$ -convergence in the martingale convergence theorem (Proposition 10): under uniform integrability the a.s. convergence implies $L^{1}$ -convergence via a DCT-style domination by the uniformly integrable family.
$L^{p}$ spaces and Riesz-Fischer completeness 02.07.06. The just-shipped peer in $L^{p}$ -theory. The Hilbert space $L^{2} (μ + ν)$ in the von Neumann proof of Radon-Nikodym is constructed via Riesz-Fischer completeness on $L^{2}$ . The duality $(L^{p})^{*} ≅ L^{q}$ (Proposition 8 / Exercise 7) is a direct Radon-Nikodym application and is the foundational structural theorem of $L^{p}$ -theory.
Fubini-Tonelli theorem and product measures 02.07.07. The immediate predecessor. Fubini-Tonelli and Radon-Nikodym together comprise the two pillars of modern integration theory beyond the basic Lebesgue-MCT-DCT framework: Fubini-Tonelli handles product spaces, while Radon-Nikodym handles density representations on single measure spaces. The two theorems combine in the construction of regular conditional probabilities, where disintegrations of a joint measure on a product space are obtained via Fubini-style fibrewise decomposition and Radon-Nikodym fibrewise density.
Lebesgue outer measure and Carathéodory construction 02.07.02. The construction of the singular part $ν_{s}$ in the Lebesgue decomposition relies on Carathéodory's outer-measure machinery applied to the measurable structure of the support set $B$ . Without Carathéodory the structural splitting into $σ$ -additive pieces is unavailable.
Riesz representation theorem in functional analysis [forward: 02.11.14]. Proposition 8's identification $(L^{p})^{*} ≅ L^{q}$ for $1 \leq p < \infty$ is one of the founding theorems of functional analysis, with Radon-Nikodym as the load-bearing step. The forward connection to functional-analytic representation theorems (Riesz-Markov for measures on locally compact spaces, Riesz-Fréchet on Hilbert spaces, Hahn-Banach extensions) all build on the Radon-Nikodym density-representation framework.
Conditional expectation and martingale theory [forward: probability theory]. Theorem 8 and Proposition 9 identify conditional expectation as a Radon-Nikodym derivative and develop the algebraic properties (linearity, tower, $L^{2}$ -projection, Jensen) from the defining identity. Doob's martingale convergence (Theorem 9, Proposition 10) is the time-asymptotic version of Radon-Nikodym, and it is the foundational convergence theorem of probability theory. Forward connections to stochastic calculus, Girsanov's theorem (Theorem 10), and mathematical finance all build on this Radon-Nikodym-driven framework.
Fourier analysis and PDE via weak derivatives [lateral: 02.10 Fourier; 02.13 PDE]. The Fourier transform of an $L^{1}$ -function is a bounded continuous function (Riemann-Lebesgue), and the inverse transform is defined via a density representation against the Fourier-conjugate measure. Weak derivatives in Sobolev space theory are defined via Radon-Nikodym-type integration-by-parts identities: $\int u \cdot \partial_{i} φ d x = - \int \partial_{i} u \cdot φ d x$ for test functions $φ$ , with $\partial_{i} u$ the unique $L^{1}$ -density satisfying the integral identity. The lateral connection to PDE theory is via this distributional integration-by-parts formulation, which is a Radon-Nikodym statement at heart.
Information theory: Kullback-Leibler divergence and entropy [lateral: information theory]. The Kullback-Leibler divergence $D_{K L} (ν ∥ μ) = \int lo g (d ν / d μ) d ν$ is the integral of the log-density, the foundational information-theoretic measure of distance between two probability distributions. Shannon entropy is the special case where one measure is the uniform reference. The lateral connection to coding theory, channel capacity, and statistical inference is via this Radon-Nikodym log-density framework.
Differential geometry: Jacobians and measure pullback [lateral: differential geometry]. Under a smooth bijection $ϕ$ , the pullback measure $ϕ^{*} (ν)$ has Radon-Nikodym derivative the Jacobian determinant: $d (ϕ^{*} ν) / d μ = ∣ det D ϕ ∣ \cdot (d ν / d μ) \circ ϕ$ . The lateral connection to coordinate changes in integration on manifolds is via this Jacobian-as-Radon-Nikodym-derivative framework.

Historical & philosophical context Master

Henri Lebesgue's 1904 Leçons sur l'intégration et la recherche des fonctions primitives ^{[Lebesgue 1904]} introduced the modern integration theory and along the way characterised the structure of monotone increasing functions on $[a, b]$ : every such function has a unique decomposition as the sum of an absolutely continuous part (with $L^{1}$ -density derivative), a singular continuous part (with derivative zero almost everywhere, like the Cantor function), and a jump part (a sum of step functions). Lebesgue's decomposition theorem for distribution functions was the historical germ of the Lebesgue decomposition theorem for measures, although the abstract formulation in terms of general $σ$ -finite measures appeared only with Radon 1913 and Nikodym 1930.

Johann Radon's 1913 Sitzungsberichte der Wiener Akademie paper ^{[Radon 1913]}, titled Theorie und Anwendungen der absolut additiven Mengenfunktionen (Theory and applications of absolutely additive set-functions), introduced the abstract measure-theoretic framework for "absolute additive set-functions" (in modern terminology, signed measures) and proved the existence of a density function for measures absolutely continuous with respect to Lebesgue measure on $R^{n}$ . Radon's paper was foundational in two ways: it axiomatised the modern signed-measure framework, and it established the density-existence theorem for Lebesgue background. The proof technique used a refinement-of-partitions argument that was technically intricate; the cleaner Hilbert-space proof of von Neumann came three decades later.

Hans Hahn's 1921 Annali della Scuola Normale Superiore di Pisa paper ^{[Hahn 1921]}, titled Über die Multiplikation total-additiver Mengenfunktionen (On the multiplication of totally additive set-functions), proved the Hahn-Jordan decomposition: every signed measure splits canonically into a positive part and a negative part, supported on disjoint measurable sets. The Hahn decomposition is the structural foundation of signed-measure theory, and it is the missing piece that lets one reduce the signed Radon-Nikodym theorem to the unsigned case via the Hahn-Jordan splitting.

Otton Nikodym's 1930 Fundamenta Mathematicae paper ^{[Nikodym 1930]}, titled Sur une généralisation des intégrales de M. J. Radon (On a generalisation of M. J. Radon's integrals), extended Radon's theorem to arbitrary $σ$ -finite background measures on abstract measurable spaces, completing the modern Lebesgue-Radon-Nikodym theorem. Nikodym's contribution was the recognition that the abstract measure-theoretic framework Radon had axiomatised did not require Lebesgue measure as background — any $σ$ -finite measure could play the role of the reference, and the density-existence theorem went through. Nikodym's paper also clarified the $σ$ -finiteness hypothesis as load-bearing, with explicit examples of failure under counting-measure-type non- $σ$ -finite backgrounds.

John von Neumann's 1940 Bulletin of the AMS paper ^{[von Neumann 1940]}, titled On rings of operators III (with the relevant Hilbert-space proof of Radon-Nikodym appearing in §6), introduced the elegant Hilbert-space proof presented in this unit's main theorem. Von Neumann's insight was that the Radon-Nikodym density $f = d ν / d μ$ could be identified as $g / (1 - g)$ where $g$ is the orthogonal projection of the constant function $1$ onto a specific closed subspace of $L^{2} (μ + ν)$ . The Riesz representation theorem on Hilbert spaces does all the work, reducing the original measure-theoretic existence question to a one-line application of the Riesz lemma. Von Neumann's proof became the standard textbook proof of Radon-Nikodym after 1950, displacing the older refinement-of-partitions arguments of Radon and Nikodym.

Paul Halmos's 1950 Measure Theory monograph and Walter Rudin's 1966 Real and Complex Analysis ^{[Rudin RCA; Folland 2e]} codified the modern textbook formulation of Lebesgue-Radon-Nikodym: the von Neumann Hilbert-space proof, the Hahn-Jordan decomposition for the signed case, the Lebesgue decomposition for the singular-plus-absolutely-continuous splitting, and the chain rule and change-of-variables for the Radon-Nikodym derivative. Joseph Doob's 1953 Stochastic Processes ^{[Doob — Stochastic Processes]} applied the framework to martingale theory, identifying conditional expectation as a Radon-Nikodym derivative and proving the martingale convergence theorem as a time-asymptotic version of Radon-Nikodym (Theorem 9). Igor Girsanov's 1960 Theory of Probability and its Applications paper extended the framework to stochastic analysis, with the exponential-martingale Radon-Nikodym derivative behind the change-of-measure formula for Brownian motion (Theorem 10), foundational in mathematical finance.

The philosophical thread: Lebesgue-Radon-Nikodym is the rigorous version of the question "can one measure be written as a density against another?" — a question that goes back at least to Newton and Leibniz's mass-density formulations in classical mechanics, and to Daniel Bernoulli's 1738 Hydrodynamica formulation of probability density. The seven-decade arc from Lebesgue 1904 to von Neumann 1940 to Doob 1953 to Girsanov 1960 tracks the maturation of the density-existence question from a special case for monotone functions on $[a, b]$ to an abstract structural theorem on $σ$ -finite measure spaces with applications across probability theory, functional analysis, stochastic analysis, and mathematical finance. The von Neumann Hilbert-space proof reveals the deep structural reason the theorem works: the density is an orthogonal projection in disguise, and the Riesz representation theorem on Hilbert spaces is the load-bearing step. This functional-analytic insight is one of the most elegant proofs in modern analysis and a model for how Hilbert-space arguments can illuminate measure-theoretic phenomena.

Bibliography Master

@book{Lebesgue1904,
  author    = {Lebesgue, Henri},
  title     = {Le\c{c}ons sur l'int\'egration et la recherche des fonctions primitives},
  publisher = {Gauthier-Villars},
  address   = {Paris},
  year      = {1904}
}

@article{Radon1913,
  author  = {Radon, Johann},
  title   = {Theorie und {A}nwendungen der absolut additiven {M}engenfunktionen},
  journal = {Sitzungsberichte der Kaiserlichen Akademie der Wissenschaften in Wien, Mathematisch-naturwissenschaftliche Klasse},
  volume  = {122},
  year    = {1913},
  pages   = {1295--1438}
}

@article{Hahn1921,
  author  = {Hahn, Hans},
  title   = {\"Uber die {M}ultiplikation total-additiver {M}engenfunktionen},
  journal = {Annali della Scuola Normale Superiore di Pisa},
  series  = {2},
  volume  = {9},
  year    = {1921},
  pages   = {429--452}
}

@article{Nikodym1930,
  author  = {Nikodym, Otton},
  title   = {Sur une g\'en\'eralisation des int\'egrales de {M}. {J}. {R}adon},
  journal = {Fundamenta Mathematicae},
  volume  = {15},
  year    = {1930},
  pages   = {131--179}
}

@article{vonNeumann1940,
  author  = {von Neumann, John},
  title   = {On rings of operators {III}},
  journal = {Annals of Mathematics},
  series  = {2},
  volume  = {41},
  year    = {1940},
  pages   = {94--161}
}

@article{Riesz1910,
  author  = {Riesz, Frigyes},
  title   = {Untersuchungen \"uber {S}ysteme integrierbarer {F}unktionen},
  journal = {Mathematische Annalen},
  volume  = {69},
  year    = {1910},
  pages   = {449--497}
}

@book{Halmos1950,
  author    = {Halmos, Paul R.},
  title     = {Measure Theory},
  publisher = {Van Nostrand},
  address   = {New York},
  year      = {1950}
}

@book{Folland1999,
  author    = {Folland, Gerald B.},
  title     = {Real Analysis: Modern Techniques and Their Applications},
  edition   = {2},
  publisher = {Wiley},
  year      = {1999}
}

@book{Rudin1987,
  author    = {Rudin, Walter},
  title     = {Real and Complex Analysis},
  edition   = {3},
  publisher = {McGraw-Hill},
  year      = {1987}
}

@book{Bogachev2007,
  author    = {Bogachev, Vladimir I.},
  title     = {Measure Theory, Volume 1},
  publisher = {Springer},
  year      = {2007}
}

@book{RoydenFitzpatrick2010,
  author    = {Royden, H. L. and Fitzpatrick, P. M.},
  title     = {Real Analysis},
  edition   = {4},
  publisher = {Pearson},
  year      = {2010}
}

@book{Doob1953,
  author    = {Doob, Joseph L.},
  title     = {Stochastic Processes},
  publisher = {Wiley},
  address   = {New York},
  year      = {1953}
}

@book{Kolmogorov1933,
  author    = {Kolmogorov, Andrey N.},
  title     = {Grundbegriffe der {W}ahrscheinlichkeitsrechnung},
  publisher = {Springer},
  address   = {Berlin},
  year      = {1933}
}

@article{Girsanov1960,
  author  = {Girsanov, Igor V.},
  title   = {On transforming a certain class of stochastic processes by absolutely continuous substitution of measures},
  journal = {Theory of Probability and its Applications},
  volume  = {5},
  year    = {1960},
  pages   = {285--301}
}

@book{Tao2016,
  author    = {Tao, Terence},
  title     = {Analysis II},
  edition   = {3},
  publisher = {Hindustan Book Agency},
  year      = {2016}
}

Prerequisites

02.07.04
02.07.05
02.07.06
02.07.07

Tier anchors

beginner: Tao, Analysis II §8.6; informal motivation as densities of one measure against another
intermediate: Folland, Real Analysis 2e §3.2; Royden, Real Analysis 4e §18
master: Bogachev, Measure Theory Vol. 1 §3.2; Rudin, Real and Complex Analysis 3e §6-7 (von Neumann Hilbert-space proof)

References

Radon — Theorie und Anwendungen der absolut additiven Mengenfunktionen · Sitzungsberichte der Kaiserlichen Akademie der Wissenschaften in Wien, Math.-naturwiss. Klasse 122 (1913), 1295-1438
Nikodym — Sur une généralisation des intégrales de M. J. Radon · Fundamenta Mathematicae 15 (1930), 131-179
Lebesgue — Leçons sur l'intégration et la recherche des fonctions primitives · Gauthier-Villars, Paris, 1904
von Neumann — On rings of operators III · Annals of Mathematics (2) 41 (1940), 94-161; see also Bulletin of the AMS 46 (1940), 376-381 for the Hilbert-space proof of Radon-Nikodym
Hahn — Über die Multiplikation total-additiver Mengenfunktionen · Annali della Scuola Normale Superiore di Pisa (2) 9 (1921), 429-452
Folland — Real Analysis: Modern Techniques and Their Applications, 2e · §3.2, Signed measures and the Radon-Nikodym theorem
Bogachev — Measure Theory, Vol. 1 · §3.2, Absolute continuity and the Radon-Nikodym theorem
Royden — Real Analysis, 4e · §18, The Daniell integral; §19, General measure and integration
Rudin — Real and Complex Analysis, 3e · §6, Complex measures; §7, Differentiation
Doob — Stochastic Processes · Wiley 1953, Ch. VII, Martingale theory

Lean module

Codex.Analysis.MeasureTheory.AbsoluteContinuityRadonNikodym

Estimated time

beginner: 18m
intermediate: 55m
master: 100m