12.07.01 · quantum / perturbation

Time-independent perturbation theory

draft3 tiersLean: nonepending prereqs

Anchor (Master): Kato, Perturbation Theory for Linear Operators (Springer, 1966); Epstein 1916

Intuition [Beginner]

Most quantum mechanics problems cannot be solved exactly. The hydrogen atom is a rare success — you get explicit energy levels and wavefunctions. But add a second electron (helium), place hydrogen in an electric field (Stark effect), or include the electron spin interacting with its orbital motion (fine structure), and closed-form solutions disappear.

Perturbation theory is the standard tool for these situations. Split the Hamiltonian into two pieces, $H = H_{0} + λV$ , where $H_{0}$ is exactly solvable and $λV$ is a small correction. The parameter $λ$ is a bookkeeping device; set $λ = 1$ at the end. The energies and eigenstates of $H_{0}$ are already known. Perturbation theory computes how they shift when you turn on $V$ .

Think of it as a Taylor expansion for eigenvalues. Expand the $n$ th energy as a power series in $λ$ : $E_{n} = E_{n}^{(0)} + λ E_{n}^{(1)} + λ^{2} E_{n}^{(2)} + \dots$ . The zeroth-order term $E_{n}^{(0)}$ is the unperturbed energy you already know. The first-order correction $E_{n}^{(1)} = ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩$ averages the perturbation over the unperturbed wavefunction.

The state itself also gets corrected. The perturbed state $∣ n ⟩$ is the original $∣ n^{(0)} ⟩$ plus small admixtures of every other state $∣ m^{(0)} ⟩$ . The amount of $∣ m^{(0)} ⟩$ mixed in is proportional to the coupling $⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩$ divided by the energy gap $E_{n}^{(0)} - E_{m}^{(0)}$ . Strong coupling and small energy gaps produce large corrections.

The second-order energy correction $E_{n}^{(2)}$ captures how these admixtures feed back into the energy. Each state $∣ m^{(0)} ⟩$ with $m \neq = n$ contributes $∣ ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩ ∣^{2} / (E_{n}^{(0)} - E_{m}^{(0)})$ . Two features stand out. First, if $E_{n}^{(0)}$ is the ground state energy, every denominator is negative, so $E_{n}^{(2)}$ is always negative: the second-order correction pushes the ground state down. Second, nearby states contribute the most because the energy gap sits in the denominator.

A crucial restriction: the formulas above require the unperturbed energy level to be non-degenerate. If two distinct states share the same unperturbed energy, the denominator $E_{n}^{(0)} - E_{m}^{(0)}$ vanishes and the expression diverges. Degenerate levels need a different strategy: diagonalize $V$ within the degenerate subspace first, splitting the level, then apply non-degenerate perturbation theory to each resulting state separately.

What does "small" mean quantitatively? The perturbation is small when $∣ ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩ ∣$ is much less than $∣ E_{n}^{(0)} - E_{m}^{(0)} ∣$ for all states that couple appreciably. When this condition holds, the first few terms approximate the true energy well. If the perturbation is comparable to the level spacing, the series diverges and the approach fails.

Three physical applications drive the theory. The Stark effect places an atom in a uniform electric field. For hydrogen's ground state, the first-order energy correction vanishes because the wavefunction has definite parity and the field perturbation is odd — the expectation value of an odd operator in an even state is zero.

The Zeeman effect splits degenerate magnetic sub-levels with a magnetic field, each splitting computed as a first-order correction. Hydrogen fine structure — spin-orbit coupling, relativistic kinetic-energy corrections, and the Darwin term — adds perturbative corrections smaller than the Bohr energies by a factor of $α^{2} \approx 5 \times 1 0^{- 5}$ , where $α$ is the fine-structure constant. Each application starts from the exactly solvable hydrogen atom and adds physically realistic corrections.

The framework is systematic: take a solvable system, add a small perturbation, compute corrections order by order. The rest of this unit develops the formalism, proves the key formulas, works through the Stark effect in detail, and extends the theory to degenerate levels where the standard formulas break down.

Visual [Beginner]

Picture a row of horizontal lines on a number line — these are the unperturbed energy levels $E_{1}^{(0)}, E_{2}^{(0)}, E_{3}^{(0)}, \dots$ from bottom to top. Now turn on a perturbation $V$ . Each level shifts: $E_{n}^{(0)}$ moves to $E_{n}^{(0)} + ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩$ at first order, then receives an additional smaller second-order push. Nearby levels shift more than distant ones because the energy gap controls the mixing strength.

The perturbed state $∣ n ⟩$ acquires thin contributions from every other state, visualized as arrows from $∣ n^{(0)} ⟩$ to each $∣ m^{(0)} ⟩$ with thickness proportional to $∣ ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩ ∣/ (E_{n}^{(0)} - E_{m}^{(0)})$ . Thick arrows connect nearby levels; distant levels contribute thin, nearly invisible arrows.

Worked example [Beginner]

Stark effect for the hydrogen ground state. Place a hydrogen atom in a uniform electric field $E$ pointing in the $z$ -direction. The perturbation is $V = e E z = e E r cos θ$ , where $e$ is the elementary charge and $r, θ$ are spherical coordinates. The unperturbed ground state is $∣1, 0, 0 ⟩$ with energy $E_{1}^{(0)} = - 13.6$ eV.

First-order correction. Compute $E_{1}^{(1)} = ⟨ 1, 0, 0∣ V ∣1, 0, 0 ⟩ = e E ⟨ 1, 0, 0∣ z ∣1, 0, 0 ⟩$ . The ground-state wavefunction $ψ_{100} (r, θ, ϕ)$ depends on $r$ alone — it is spherically symmetric. The operator $z = r cos θ$ is odd under parity ( $z \to - z$ ). Integrating an odd function times an even function over all space gives zero. So $E_{1}^{(1)} = 0$ .

The first-order Stark shift vanishes for the ground state. This is a general pattern: any non-degenerate state with definite parity has zero first-order energy shift under a perturbation that is odd under the same parity operation.

Second-order correction. With the first-order term gone, the leading Stark shift comes from $E_{1}^{(2)}$ , which adds contributions from every excited state $∣ m^{(0)} ⟩$ :

$E_{1}^{(2)} = \frac{∣ ⟨ 2 , 1 , 0∣ V ∣1 , 0 , 0 ⟩ ∣ ^{2}}{E _{1}^{(0)} - E _{2}^{(0)}} + \frac{∣ ⟨ 3 , 1 , 0∣ V ∣1 , 0 , 0 ⟩ ∣ ^{2}}{E _{1}^{(0)} - E _{3}^{(0)}} + \dots$

Each term is negative (every denominator is negative since $E_{1}^{(0)}$ is the lowest energy), so the ground state is pushed down. The explicit summation over all hydrogen bound states and the continuum gives $E_{1}^{(2)} = - \frac{9}{4} a_{0}^{3} E^{2}$ , where $a_{0}$ is the Bohr radius. The shift is quadratic in the field: the atom develops an induced electric dipole moment proportional to $E$ , and the energy is the dipole-field interaction, proportional to $E^{2}$ .

Check your understanding [Beginner]

Exercise (easy, multiple choice).

A quantum system has unperturbed energy $E_{n}^{(0)}$ and a perturbation $V \neq = 0$ . Can the first-order energy correction $E_{n}^{(1)} = ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩$ ever be zero?

A. No, because $V \neq = 0$ means the energy must shift. B. Yes, for example when the state has definite parity and $V$ has opposite parity. C. Only if $V$ is identically zero everywhere. D. Only for the ground state.

Hint

The expectation value of an operator can vanish even when the operator itself is nonzero. Think about symmetry.

Answer

B. The hydrogen ground state in an electric field is the canonical example: $⟨ 1, 0, 0∣ z ∣1, 0, 0 ⟩ = 0$ because the integrand is odd under parity even though $z$ is not zero. A nonzero perturbation can produce a zero first-order correction whenever the expectation value happens to vanish.

Exercise (easy, multiple choice).

Two unperturbed states $∣ a ⟩$ and $∣ b ⟩$ have the same energy $E^{(0)}$ . A perturbation $V$ couples them ( $⟨ a ∣ V ∣ b ⟩ \neq = 0$ ). What goes wrong with the non-degenerate perturbation formulas?

A. Nothing; the formulas work as stated. B. The energy denominators vanish, so the series diverges. C. The first-order correction becomes infinite. D. Both B and C.

Hint

The state correction involves $⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩ / (E_{n}^{(0)} - E_{m}^{(0)})$ . What happens when $E_{n}^{(0)} = E_{m}^{(0)}$ ?

Answer

D. When $E_{n}^{(0)} = E_{m}^{(0)}$ , the denominator in the first-order state correction vanishes while the numerator remains nonzero, producing an infinite correction. The second-order energy formula also diverges. Degenerate states require diagonalizing $V$ within the degenerate subspace first — this is degenerate perturbation theory, developed in the Master tier.

Formal definition [Intermediate+]

Let $H_{0}$ be a Hamiltonian with known discrete non-degenerate spectrum: $H_{0} ∣ n^{(0)} ⟩ = E_{n}^{(0)} ∣ n^{(0)} ⟩$ , where ${∣ n^{(0)} ⟩}_{n = 0}^{\infty}$ forms an orthonormal basis of the Hilbert space. The full Hamiltonian is

$H = H_{0} + λV,$

where $V$ is a Hermitian operator (the perturbation) and $λ \in [0, 1]$ is a dimensionless coupling parameter. The goal is to find the eigenvalues $E_{n}$ and eigenvectors $∣ n ⟩$ of $H$ as power series in $λ$ :

$E_{n} = k = 0 \sum \infty λ^{k} E_{n}^{(k)}, ∣ n ⟩ = k = 0 \sum \infty λ^{k} ∣ n^{(k)} ⟩ .$

Intermediate normalization. Fix the phase convention by requiring $⟨ n^{(0)} ∣ n ⟩ = 1$ . This implies $⟨ n^{(0)} ∣ n^{(k)} ⟩ = 0$ for all $k \geq 1$ . The norm of $∣ n ⟩$ is then $⟨ n ∣ n ⟩ = 1 + \sum_{k = 1}^{\infty} λ^{2 k} ⟨ n^{(k)} ∣ n^{(k)} ⟩ > 1$ ; normalization to unit norm can be imposed later by dividing.

Substitute the series into the time-independent Schrödinger equation $H ∣ n ⟩ = E_{n} ∣ n ⟩$ and collect powers of $λ$ .

Order $λ^{0}$ : $H_{0} ∣ n^{(0)} ⟩ = E_{n}^{(0)} ∣ n^{(0)} ⟩$ , satisfied by assumption.

Order $λ^{1}$ :

$(H_{0} - E_{n}^{(0)}) ∣ n^{(1)} ⟩ = (E_{n}^{(1)} - V) ∣ n^{(0)} ⟩ .$

Project onto $⟨ n^{(0)} ∣$ and use $⟨ n^{(0)} ∣ (H_{0} - E_{n}^{(0)}) = 0$ :

$E_{n}^{(1)} = ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩ .$

Expand $∣ n^{(1)} ⟩ = \sum_{m \neq = n} c_{m}^{(1)} ∣ m^{(0)} ⟩$ (no component along $∣ n^{(0)} ⟩$ by intermediate normalization). Projecting onto $⟨ m^{(0)} ∣$ with $m \neq = n$ :

$c_{m}^{(1)} = \frac{⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩}{E _{n}^{(0)} - E _{m}^{(0)}} .$

Order $λ^{2}$ : Projecting the second-order equation onto $⟨ n^{(0)} ∣$ yields

$E_{n}^{(2)} = m \neq = n \sum \frac{∣ ⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩ ∣ ^{2}}{E _{n}^{(0)} - E _{m}^{(0)}} .$

This sum runs over all unperturbed states (including the continuum, if present). Each term is the square of a coupling divided by an energy gap. States close in energy dominate the sum.

Validity. The expansion is well-defined when $∣ ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩ ∣ ≪ ∣ E_{n}^{(0)} - E_{m}^{(0)} ∣$ for all $m \neq = n$ with appreciable coupling. This is the non-degeneracy condition. When violated, the series diverges or converges too slowly for practical use, and one must resort to degenerate perturbation theory or non-perturbative methods.

Higher orders. The procedure continues: at order $λ^{k}$ , project onto $⟨ n^{(0)} ∣$ to extract $E_{n}^{(k)}$ , then project onto $⟨ m^{(0)} ∣$ for $m \neq = n$ to fix the components of $∣ n^{(k)} ⟩$ . Each order depends only on lower-order quantities. The third-order energy correction is

$E_{n}^{(3)} = m \neq = n \sum ℓ \neq = n \sum \frac{⟨ n ^{(0)} ∣ V ∣ m ^{(0)} ⟩ ⟨ m ^{(0)} ∣ V ∣ ℓ ^{(0)} ⟩ ⟨ ℓ ^{(0)} ∣ V ∣ n ^{(0)} ⟩}{( E _{n}^{(0)} - E _{m}^{(0)} ) ( E _{n}^{(0)} - E _{ℓ}^{(0)} )} - E_{n}^{(1)} m \neq = n \sum \frac{∣ ⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩ ∣ ^{2}}{( E _{n}^{(0)} - E _{m}^{(0)} ) ^{2}} .$

The algebraic complexity grows rapidly with order, but the systematic structure remains the same.

Key theorem with proof [Intermediate+]

Theorem (Rayleigh-Schrodinger perturbation series). Let $H_{0}$ have a discrete non-degenerate spectrum ${E_{n}^{(0)}}$ with normalized eigenstates ${∣ n^{(0)} ⟩}$ . Let $V$ be a Hermitian perturbation. Then for $λ$ sufficiently small, the perturbed Hamiltonian $H = H_{0} + λV$ has eigenvalues and eigenstates given by the formal power series

$E_{n} = E_{n}^{(0)} + λ ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩ + λ^{2} m \neq = n \sum \frac{∣ ⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩ ∣ ^{2}}{E _{n}^{(0)} - E _{m}^{(0)}} + O (λ^{3}),$

$∣ n ⟩ = ∣ n^{(0)} ⟩ + λ m \neq = n \sum \frac{⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩}{E _{n}^{(0)} - E _{m}^{(0)}} ∣ m^{(0)} ⟩ + O (λ^{2}) .$

Proof. Substitute $E_{n} = \sum_{k} λ^{k} E_{n}^{(k)}$ and $∣ n ⟩ = \sum_{k} λ^{k} ∣ n^{(k)} ⟩$ into $H ∣ n ⟩ = E_{n} ∣ n ⟩$ :

$(H_{0} + λV) k = 0 \sum \infty λ^{k} ∣ n^{(k)} ⟩ = (j = 0 \sum \infty λ^{j} E_{n}^{(j)}) (k = 0 \sum \infty λ^{k} ∣ n^{(k)} ⟩) .$

Collect terms at order $λ^{0}$ : $H_{0} ∣ n^{(0)} ⟩ = E_{n}^{(0)} ∣ n^{(0)} ⟩$ , the unperturbed eigenvalue equation, satisfied by construction.

At order $λ^{1}$ :

$H_{0} ∣ n^{(1)} ⟩ + V ∣ n^{(0)} ⟩ = E_{n}^{(0)} ∣ n^{(1)} ⟩ + E_{n}^{(1)} ∣ n^{(0)} ⟩ .$

Rearrange: $(H_{0} - E_{n}^{(0)}) ∣ n^{(1)} ⟩ = (E_{n}^{(1)} - V) ∣ n^{(0)} ⟩$ . Left-multiply by $⟨ n^{(0)} ∣$ and use the self-adjointness of $H_{0} - E_{n}^{(0)}$ together with the zeroth-order equation to kill the left side: $0 = E_{n}^{(1)} - ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩$ . This gives $E_{n}^{(1)}$ .

For the state correction, left-multiply by $⟨ m^{(0)} ∣$ with $m \neq = n$ : $(E_{m}^{(0)} - E_{n}^{(0)}) ⟨ m^{(0)} ∣ n^{(1)} ⟩ = - ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩$ . The non-degeneracy hypothesis $E_{m}^{(0)} \neq = E_{n}^{(0)}$ allows division, giving the first-order state coefficients.

At order $λ^{2}$ :

$H_{0} ∣ n^{(2)} ⟩ + V ∣ n^{(1)} ⟩ = E_{n}^{(0)} ∣ n^{(2)} ⟩ + E_{n}^{(1)} ∣ n^{(1)} ⟩ + E_{n}^{(2)} ∣ n^{(0)} ⟩ .$

Left-multiply by $⟨ n^{(0)} ∣$ . The $H_{0}$ and $E_{n}^{(0)}$ terms cancel by the same argument as above. What remains is

$⟨ n^{(0)} ∣ V ∣ n^{(1)} ⟩ = E_{n}^{(2)} .$

Substituting the first-order state $∣ n^{(1)} ⟩ = \sum_{m \neq = n} c_{m}^{(1)} ∣ m^{(0)} ⟩$ and using the expression for $c_{m}^{(1)}$ yields the second-order energy formula. $□$

Corollary (Hellmann-Feynman). If $H (λ) = H_{0} + λV$ , then $d E_{n} / d λ = ⟨ n ∣ V ∣ n ⟩$ at $λ = 0$ . The first-order perturbation formula is the first term in the Taylor expansion of $E_{n} (λ)$ . This connects perturbation theory to the variational principle: the first-order energy is the derivative of the eigenvalue with respect to the coupling.

Bridge. The Rayleigh-Schrodinger series builds toward 12.07.02 time-dependent perturbation theory, where the same matrix elements $V_{mn}$ govern transition rates between unperturbed states, and appears again in 14.04.01 pending for the helium atom's first-order electron-electron repulsion correction. The foundational reason perturbation theory works at all is that the spectrum of $H_{0}$ is discrete and isolated — this is exactly the hypothesis of the Kato-Rellich theorem (Master tier), which guarantees the perturbed eigenvalue is an analytic function of $λ$ . Putting these together, the power-series ansatz is not merely formal: the central insight is that analyticity in $λ$ converts the eigenvalue problem into a hierarchy of linear equations solvable order by order, and the bridge is between the algebraic recursion derived here and the resolvent-based analytic continuation that justifies it.

Worked example: the two-level system

Consider $H_{0} = (E_{1}^{(0)} 0 0 E_{2}^{(0)})$ with $E_{1}^{(0)} < E_{2}^{(0)}$ and perturbation $V = (0 v^{*} v 0)$ (real $v$ for simplicity).

First-order: $E_{1}^{(1)} = V_{11} = 0$ , $E_{2}^{(1)} = V_{22} = 0$ . The diagonal elements of $V$ are zero, so the first-order energy shift vanishes.

Second-order:

$E_{1}^{(2)} = \frac{∣ V _{12} ∣ ^{2}}{E _{1}^{(0)} - E _{2}^{(0)}} = - \frac{v ^{2}}{E _{2}^{(0)} - E _{1}^{(0)}}, E_{2}^{(2)} = \frac{∣ V _{21} ∣ ^{2}}{E _{2}^{(0)} - E _{1}^{(0)}} = \frac{v ^{2}}{E _{2}^{(0)} - E _{1}^{(0)}} .$

The lower level is pushed down and the upper level is pushed up: the perturbation repels the two levels. This is a general feature — second-order corrections tend to increase the spacing between levels.

Exact comparison: The full Hamiltonian $H = (E_{1}^{(0)} λ v λ v E_{2}^{(0)})$ has eigenvalues

$E_{\pm} = \frac{E _{1}^{(0)} + E _{2}^{(0)}}{2} \pm (\frac{E _{2}^{(0)} - E _{1}^{(0)}}{2})^{2} + λ^{2} v^{2} .$

Expanding the square root for small $λ v / (E_{2}^{(0)} - E_{1}^{(0)})$ recovers $E_{1}^{(0)} - λ^{2} v^{2} / (E_{2}^{(0)} - E_{1}^{(0)}) + O (λ^{4})$ , confirming the second-order result. The exact answer also shows that the perturbative expansion breaks down when $λ v \sim E_{2}^{(0)} - E_{1}^{(0)}$ .

Exercises [Intermediate+]

Exercise 1 (easy, symbolic).

A one-dimensional harmonic oscillator has $H_{0} = p^{2} / (2 m) + \frac{1}{2} m ω^{2} x^{2}$ with unperturbed energies $E_{n}^{(0)} = (n + \frac{1}{2}) ℏ ω$ . A perturbation $V = \frac{1}{2} α x^{2}$ is applied with $α ≪ m ω^{2}$ . Compute the first-order correction $E_{n}^{(1)}$ to all levels.

Hint

Express $⟨ n ∣ x^{2} ∣ n ⟩$ using the ladder operators, or use the known result $⟨ n ∣ x^{2} ∣ n ⟩ = (ℏ/2 mω) (2 n + 1)$ .

Answer

$E_{n}^{(1)} = \frac{1}{2} α ⟨ n ∣ x^{2} ∣ n ⟩ = \frac{1}{2} α \cdot \frac{ℏ}{2 mω} (2 n + 1) = \frac{α ℏ}{4 mω} (2 n + 1)$ . The correction shifts every level up, preserving the equal spacing. This makes sense: adding $α x^{2} /2$ to the potential just increases the spring constant from $m ω^{2}$ to $m ω^{2} + α$ , shifting the frequency to $ω^{'} = ω 1 + α / (m ω^{2}) \approx ω + α / (2 mω)$ . The exact energy is $(n + \frac{1}{2}) ℏ ω^{'}$ , and expanding to first order in $α$ gives the same result.

Exercise 3 (easy, multiple choice).

For a non-degenerate level $n$ with first-order correction $E_{n}^{(1)} \neq = 0$ , the second-order correction $E_{n}^{(2)}$ has what sign?

A. Always positive. B. Always negative. C. Can be positive, negative, or zero depending on the spectrum and couplings. D. Has the same sign as $E_{n}^{(1)}$ .

Hint

Look at the sum $E_{n}^{(2)} = \sum_{m \neq = n} ∣ V_{mn} ∣^{2} / (E_{n}^{(0)} - E_{m}^{(0)})$ . What determines the sign of each term?

Answer

C. For the ground state, every denominator is negative, so $E_{1}^{(2)} < 0$ . But for an excited state, there are both positive denominators (contributions from lower levels) and negative denominators (contributions from higher levels). The net sign depends on which dominates. For example, in the two-level system, $E_{2}^{(2)} = ∣ V_{12} ∣^{2} / (E_{2}^{(0)} - E_{1}^{(0)}) > 0$ .

Exercise 4 (medium, symbolic).

A particle in an infinite square well ( $0 < x < a$ ) has unperturbed energies $E_{n}^{(0)} = n^{2} π^{2} ℏ^{2} / (2 m a^{2})$ and wavefunctions $ψ_{n} (x) = 2/ a sin (nπ x / a)$ . A perturbation $V (x) = V_{0} sin (π x / a)$ is applied. Compute the first-order energy corrections $E_{n}^{(1)}$ for all $n$ .

Hint

Compute $E_{n}^{(1)} = V_{0} \cdot \frac{2}{a} \int_{0}^{a} sin^{2} (nπ x / a) sin (π x / a) d x$ . Use the product-to-sum identity for the integrand.

Answer

$E_{n}^{(1)} = V_{0} \cdot \frac{2}{a} \int_{0}^{a} sin (nπ x / a) sin (π x / a) sin (nπ x / a) d x = \frac{2 V _{0}}{a} \int_{0}^{a} sin^{2} (nπ x / a) sin (π x / a) d x$ .

Using $sin^{2} θ = (1 - cos 2 θ) /2$ and the product-to-sum identity, or computing directly for each $n$ :

For $n = 1$ : $E_{1}^{(1)} = \frac{2 V _{0}}{a} \int_{0}^{a} sin^{3} (π x / a) d x = \frac{2 V _{0}}{a} \cdot \frac{4 a}{3 π} = \frac{8 V _{0}}{3 π}$ .

For $n > 1$ : the integral $\int_{0}^{a} sin^{2} (nπ x / a) sin (π x / a) d x$ evaluates to zero for even $n$ , and to a value proportional to $1/ [1 - 4 n^{2}]$ for odd $n$ .

The key point: only states that are not orthogonal to $V (x)$ get a first-order shift, and the parity/symmetry of the perturbation selects which states couple.

Exercise 5 (medium, symbolic).

Show that if $V$ is an odd-parity operator ( $V (- x) = - V (x)$ in one dimension) and the unperturbed state $∣ n^{(0)} ⟩$ has definite parity, then $E_{n}^{(1)} = 0$ .

Hint

Write out the integral $E_{n}^{(1)} = \int_{- \infty}^{\infty} ∣ ψ_{n} (x) ∣^{2} V (x) d x$ and split the integration range into $(- \infty, 0]$ and $[0, \infty)$ .

Answer

Write $E_{n}^{(1)} = \int_{- \infty}^{\infty} ∣ ψ_{n} (x) ∣^{2} V (x) d x$ . Split: $= \int_{- \infty}^{0} ∣ ψ_{n} ∣^{2} V d x + \int_{0}^{\infty} ∣ ψ_{n} ∣^{2} V d x$ . Substitute $x \to - x$ in the first integral: $\int_{0}^{\infty} ∣ ψ_{n} (- x) ∣^{2} V (- x) d x = \int_{0}^{\infty} ∣ ψ_{n} (- x) ∣^{2} (- V (x)) d x$ .

If $∣ ψ_{n} (- x) ∣^{2} = ∣ ψ_{n} (x) ∣^{2}$ (definite parity), this equals $- \int_{0}^{\infty} ∣ ψ_{n} (x) ∣^{2} V (x) d x$ , which cancels the second integral. So $E_{n}^{(1)} = 0$ .

This is why the first-order Stark shift vanishes for the hydrogen ground state: $ψ_{100}$ has even parity and $V = e E z$ is odd.

Exercise 6 (medium, symbolic).

Compute the first-order state correction $∣ 0^{(1)} ⟩$ for the harmonic oscillator ground state under $V = ϵ x$ , where $ϵ$ is a small constant. Express the result in terms of the unperturbed states $∣ n^{(0)} ⟩$ .

Hint

Use the matrix element $⟨ n^{(0)} ∣ x ∣ 0^{(0)} ⟩ = ℏ/ (2 mω) δ_{n, 1}$ .

Answer

$∣ 0^{(1)} ⟩ = \sum_{n \neq = 0} \frac{⟨ n ^{(0)} ∣ ϵ x ∣ 0 ^{(0)} ⟩}{E _{0}^{(0)} - E _{n}^{(0)}} ∣ n^{(0)} ⟩$ . The selection rule for $x$ in the harmonic oscillator gives $⟨ n ∣ x ∣0 ⟩ = ℏ/ (2 mω) δ_{n, 1}$ . So only $n = 1$ contributes:

$∣ 0^{(1)} ⟩ = \frac{ϵ ℏ/ ( 2 mω )}{- ℏ ω} ∣ 1^{(0)} ⟩ = - \frac{ϵ}{ω 2 m ℏ ω} ∣ 1^{(0)} ⟩ .$

The perturbed ground state acquires a small admixture of the first excited state. Physically, the linear potential shifts the minimum of the harmonic well, and the state shifts with it — the $∣ 1^{(0)} ⟩$ component is the first-order signature of that displacement.

Exercise 7 (medium, symbolic).

For the two-level system of the key-theorem worked example, compute the second-order state correction $∣ 1^{(2)} ⟩$ and verify that $⟨ 1^{(0)} ∣ 1^{(2)} ⟩ = 0$ .

Hint

The first-order state correction $∣ 1^{(1)} ⟩ = \frac{v}{E _{1}^{(0)} - E _{2}^{(0)}} ∣ 2^{(0)} ⟩$ . Use the general formula for the second-order state correction, noting that only $∣ 2^{(0)} ⟩$ couples to $∣ 1^{(0)} ⟩$ .

Answer

Let $Δ = E_{2}^{(0)} - E_{1}^{(0)}$ . The first-order state correction is $∣ 1^{(1)} ⟩ = - (v /Δ) ∣ 2^{(0)} ⟩$ . The second-order state correction for a two-level system is:

$∣ 1^{(2)} ⟩ = m \neq = 1 \sum \frac{⟨ m ^{(0)} ∣ V ∣ 1 ^{(1)} ⟩ - E _{1}^{(1)} ⟨ m ^{(0)} ∣ 1 ^{(1)} ⟩}{E _{1}^{(0)} - E _{m}^{(0)}} ∣ m^{(0)} ⟩ .$

Only $m = 2$ is available: $⟨ 2^{(0)} ∣ V ∣ 1^{(1)} ⟩ = v \cdot (- v /Δ) = - v^{2} /Δ$ , and $E_{1}^{(1)} = V_{11} = 0$ . So $∣ 1^{(2)} ⟩ = \frac{- v ^{2} /Δ}{E _{1}^{(0)} - E _{2}^{(0)}} ∣ 2^{(0)} ⟩ = (v^{2} / Δ^{2}) ∣ 1^{(0)} ⟩ \cdot δ_{m, 1}$ ... Wait: only $m = 2$ appears, and the component along $∣ 1^{(0)} ⟩$ is excluded by intermediate normalization. So $∣ 1^{(2)} ⟩ = 0$ — in this two-level system, there is no state other than $∣ 2^{(0)} ⟩$ for the correction to go into, and the second-order component along $∣ 1^{(0)} ⟩$ is forbidden. So $∣ 1^{(2)} ⟩ = 0$ and $⟨ 1^{(0)} ∣ 1^{(2)} ⟩ = 0$ .

Exercise 8 (hard, symbolic).

Hellmann-Feynman theorem. Let $H (η)$ be a Hamiltonian depending smoothly on a real parameter $η$ , with non-degenerate eigenvalue $E_{n} (η)$ and normalized eigenstate $∣ n (η)⟩$ . Prove that

$\frac{d E _{n}}{d η} = ⟨ n \frac{\partial H}{\partial η} n ⟩ .$

Hint

Differentiate $H ∣ n ⟩ = E_{n} ∣ n ⟩$ with respect to $η$ , then left-multiply by $⟨ n ∣$ .

Answer

Differentiate $H (η) ∣ n (η)⟩ = E_{n} (η) ∣ n (η)⟩$ :

$\frac{\partial H}{\partial η} ∣ n ⟩ + H \frac{\partial ∣ n ⟩}{\partial η} = \frac{d E _{n}}{d η} ∣ n ⟩ + E_{n} \frac{\partial ∣ n ⟩}{\partial η} .$

Left-multiply by $⟨ n ∣$ :

$⟨ n ∣ \frac{\partial H}{\partial η} ∣ n ⟩ + ⟨ n ∣ H ∣ \partial_{η} n ⟩ = \frac{d E _{n}}{d η} + E_{n} ⟨ n ∣ \partial_{η} n ⟩ .$

The second term on the left is $⟨ n ∣ H ∣ \partial_{η} n ⟩ = E_{n} ⟨ n ∣ \partial_{η} n ⟩$ (using $H ∣ n ⟩ = E_{n} ∣ n ⟩$ and $H^{†} = H$ ). This cancels the second term on the right, leaving

$\frac{d E _{n}}{d η} = ⟨ n ∣ \partial H / \partial η ∣ n ⟩ .$

This is the Hellmann-Feynman theorem. It is a consequence of first-order perturbation theory: if $η = λ$ and $H = H_{0} + λV$ , then $\partial H / \partial λ = V$ , and the theorem gives $d E_{n} / d λ = ⟨ n ∣ V ∣ n ⟩$ , which is the first-order perturbation formula.

Exercise 9 (hard, symbolic).

Derive the third-order energy correction $E_{n}^{(3)}$ in terms of the unperturbed matrix elements $V_{mn} = ⟨ m^{(0)} ∣ V ∣ n^{(0)} ⟩$ and energy gaps $E_{n}^{(0)} - E_{m}^{(0)}$ .

Hint

Take the order- $λ^{3}$ equation and project onto $⟨ n^{(0)} ∣$ . The result involves $E_{n}^{(1)}$ , $∣ n^{(1)} ⟩$ , and $∣ n^{(2)} ⟩$ . Use intermediate normalization to simplify.

Answer

The order- $λ^{3}$ equation projected onto $⟨ n^{(0)} ∣$ gives

$⟨ n^{(0)} ∣ V ∣ n^{(2)} ⟩ = E_{n}^{(1)} ⟨ n^{(0)} ∣ n^{(2)} ⟩ + E_{n}^{(2)} ⟨ n^{(0)} ∣ n^{(1)} ⟩ + E_{n}^{(3)} .$

By intermediate normalization, $⟨ n^{(0)} ∣ n^{(1)} ⟩ = ⟨ n^{(0)} ∣ n^{(2)} ⟩ = 0$ . So $E_{n}^{(3)} = ⟨ n^{(0)} ∣ V ∣ n^{(2)} ⟩$ . Expanding $∣ n^{(2)} ⟩ = \sum_{m \neq = n} c_{m}^{(2)} ∣ m^{(0)} ⟩$ and computing the coefficients from the order- $λ^{2}$ equation projected onto $⟨ m^{(0)} ∣$ (with $m \neq = n$ ):

$c_{m}^{(2)} = ℓ \neq = n \sum \frac{V _{m ℓ} V _{ℓ n}}{( E _{n}^{(0)} - E _{m}^{(0)} ) ( E _{n}^{(0)} - E _{ℓ}^{(0)} )} - \frac{V _{nn} V _{mn}}{( E _{n}^{(0)} - E _{m}^{(0)} ) ^{2}} .$

Substituting back:

$E_{n}^{(3)} = m \neq = n \sum ℓ \neq = n \sum \frac{V _{nm} V _{m ℓ} V _{ℓ n}}{( E _{n}^{(0)} - E _{m}^{(0)} ) ( E _{n}^{(0)} - E _{ℓ}^{(0)} )} - V_{nn} m \neq = n \sum \frac{∣ V _{mn} ∣ ^{2}}{( E _{n}^{(0)} - E _{m}^{(0)} ) ^{2}} .$

The first term is a "three-hop" process: $n \to ℓ \to m \to n$ . The second subtracts an overcounting correction proportional to $V_{nn} = E_{n}^{(1)}$ .

Exercise 10 (hard, symbolic).

Dalgarno-Lewis method. The second-order energy $E_{n}^{(2)} = \sum_{m \neq = n} ∣ V_{mn} ∣^{2} / (E_{n}^{(0)} - E_{m}^{(0)})$ requires summing over all states, including the continuum. The Dalgarno-Lewis method avoids this by finding an operator $F$ satisfying $(F H_{0} - H_{0} F) ∣ n^{(0)} ⟩ = (V - V_{nn}) ∣ n^{(0)} ⟩$ . Show that $E_{n}^{(2)} = ⟨ n^{(0)} ∣ V F ∣ n^{(0)} ⟩$ .

Hint

Expand $F ∣ n^{(0)} ⟩ = \sum_{m} f_{m} ∣ m^{(0)} ⟩$ in the unperturbed basis and match coefficients in the defining equation. Compare with the second-order energy sum.

Answer

Write $F ∣ n^{(0)} ⟩ = \sum_{m} f_{m} ∣ m^{(0)} ⟩$ . The defining equation becomes

$m \sum f_{m} (E_{m}^{(0)} - E_{n}^{(0)}) ∣ m^{(0)} ⟩ = m \neq = n \sum V_{mn} ∣ m^{(0)} ⟩ .$

(The $m = n$ component of the right side is removed by subtracting $V_{nn}$ .) Matching coefficients: $f_{m} = V_{mn} / (E_{m}^{(0)} - E_{n}^{(0)})$ for $m \neq = n$ , and $f_{n}$ is arbitrary (set to zero for convenience). Then

$⟨ n^{(0)} ∣ V F ∣ n^{(0)} ⟩ = m \sum V_{nm} f_{m} = m \neq = n \sum \frac{V _{nm} V _{mn}}{E _{m}^{(0)} - E _{n}^{(0)}} = m \neq = n \sum \frac{∣ V _{mn} ∣ ^{2}}{E _{n}^{(0)} - E _{m}^{(0)}} = E_{n}^{(2)} .$

The power of the method: one solves the differential equation for $F$ directly, avoiding the infinite sum. For the hydrogen ground state with $V = e E z$ , the Dalgarno-Lewis equation can be solved analytically, yielding the polarizability $α = (9/2) a_{0}^{3}$ without summing over all hydrogenic states.

Lean formalization [Intermediate+]

Mathlib does not yet cover quantum perturbation theory. The closest layers are:

Mathlib.Analysis.InnerProductSpace.Spectrum: spectral theory for bounded operators on Hilbert spaces.
Mathlib.Analysis.Calculus.Deriv: derivatives of functions between normed spaces, needed for analytic perturbation theory.
Mathlib.LinearAlgebra.Matrix.GeneralLinearGroup: invertible matrices, relevant to the finite-dimensional case.

There is no Mathlib definition of "Rayleigh-Schrodinger perturbation series", no Kato-Rellich theorem, no resolvent-based expansion of eigenvalues, and no formalization of the intermediate normalization convention. The formalisation pathway is outlined in lean_mathlib_gap. The lean_status: none reflects this gap; no lean_module ships with this unit. Tyler's review attests intermediate-tier correctness.

Degenerate perturbation theory [Master]

The non-degenerate formulas break down when the unperturbed energy $E_{n}^{(0)}$ is $g$ -fold degenerate: there exist $g$ orthonormal states $∣ n, α^{(0)} ⟩$ ( $α = 1, \dots, g$ ) with $H_{0} ∣ n, α^{(0)} ⟩ = E_{n}^{(0)} ∣ n, α^{(0)} ⟩$ . The energy denominator $E_{n}^{(0)} - E_{n, α}^{(0)}$ vanishes within the degenerate subspace, and the first-order state correction diverges.

The resolution is to choose the right starting states before applying perturbation theory. Within the degenerate subspace $D$ , form the $g \times g$ matrix $W$ with entries

$W_{α β} = ⟨ n, α^{(0)} ∣ V ∣ n, β^{(0)} ⟩ .$

Diagonalize $W$ : solve the secular equation

$det (W - E^{(1)} I_{g}) = 0.$

The $g$ eigenvalues $E_{1}^{(1)}, \dots, E_{g}^{(1)}$ are the first-order energy corrections. The corresponding eigenvectors $∣ ϕ_{j}^{(0)} ⟩ = \sum_{α} c_{j α} ∣ n, α^{(0)} ⟩$ are the correct zeroth-order states: the particular linear combinations of degenerate states that perturbation theory selects as the physical starting points.

Theorem (first-order degenerate perturbation theory). Let $H_{0}$ have a $g$ -fold degenerate eigenvalue $E^{(0)}$ with orthonormal eigenstates ${∣ n, α^{(0)} ⟩}_{α = 1}^{g}$ . The first-order energy corrections are the eigenvalues of the $g \times g$ Hermitian matrix $W_{α β} = ⟨ n, α^{(0)} ∣ V ∣ n, β^{(0)} ⟩$ , and the correct zeroth-order states are the corresponding eigenvectors.

Proof. Within the degenerate subspace $D$ , write a general zeroth-order state as $∣ ψ^{(0)} ⟩ = \sum_{α} c_{α} ∣ n, α^{(0)} ⟩$ . The first-order equation $(H_{0} - E^{(0)}) ∣ ψ^{(1)} ⟩ = (E^{(1)} - V) ∣ ψ^{(0)} ⟩$ is well-defined because the left side annihilates any state in $D$ . Projecting onto $⟨ n, β^{(0)} ∣$ gives $0 = E^{(1)} c_{β} - \sum_{α} W_{β α} c_{α}$ , the eigenvalue equation $W c = E^{(1)} c$ . Since $V$ is Hermitian, $W$ is Hermitian and has $g$ real eigenvalues with orthonormal eigenvectors. $□$

If all $g$ eigenvalues of $W$ are distinct, the degeneracy is completely lifted at first order. If some eigenvalues coincide, the degeneracy is partially lifted, and higher-order degenerate perturbation theory applies within the remaining degenerate subspaces.

Once the correct zeroth-order states are found, higher-order corrections follow the same pattern as non-degenerate theory, with the sums over $m \neq = n$ now excluding only the states within the already-split degenerate subspace (those states are accounted for by the diagonalization of $W$ ).

Example: Stark effect for hydrogen $n = 2$ . The $n = 2$ shell of hydrogen is four-fold degenerate: $∣2, 0, 0 ⟩$ , $∣2, 1, 0 ⟩$ , $∣2, 1, 1 ⟩$ , $∣2, 1, - 1 ⟩$ . The perturbation $V = e E z = e E r cos θ$ has selection rules $Δ ℓ = \pm 1$ , $Δ m = 0$ . Within the $n = 2$ manifold, the only nonzero off-diagonal matrix element is

$⟨ 2, 0, 0∣ V ∣2, 1, 0 ⟩ = - 3 e E a_{0} .$

The states $∣2, 1, \pm 1 ⟩$ are decoupled from all others by the $Δ m = 0$ rule. The secular equation reduces to a $2 \times 2$ block:

$W = (0 - 3 e E a_{0} - 3 e E a_{0} 0),$

with eigenvalues $E^{(1)} = \pm 3 e E a_{0}$ and eigenvectors $(∣2, 0, 0 ⟩ \mp ∣2, 1, 0 ⟩) / 2$ . The four-fold degenerate level splits into three: one shifted up, one shifted down (linearly in the field), and two unshifted ( $∣2, 1, \pm 1 ⟩$ ). This is the linear Stark effect, possible only for degenerate states — in contrast to the ground state's quadratic Stark effect.

The variational principle and perturbation theory [Master]

The Rayleigh-Ritz variational principle states that for any normalized trial state $∣ ψ ⟩$ ,

$⟨ ψ ∣ H ∣ ψ ⟩ \geq E_{0},$

where $E_{0}$ is the exact ground state energy. This gives an upper bound on $E_{0}$ , non-perturbative and valid regardless of the size of the perturbation.

Perturbation theory complements this. For the ground state, second-order perturbation theory always gives $E_{0}^{PT} = E_{0}^{(0)} + E_{0}^{(1)} + E_{0}^{(2)}$ with $E_{0}^{(2)} < 0$ , which (when the series converges) is a lower bound on the true ground state energy at any finite order. Together, the variational principle and perturbation theory bracket the ground state:

$E_{0}^{PT} \leq E_{0} \leq ⟨ ψ ∣ H ∣ ψ ⟩ .$

The variational principle applies directly only to the ground state (excited-state bounds require orthogonalization constraints). Perturbation theory, by contrast, gives systematic expansions for all states but requires the perturbation to be small. In practice, the two methods are often combined: a variational calculation provides the trial state, and perturbation theory corrects it.

For systems where the perturbation is not small (strongly correlated electrons, deep quantum wells), the perturbative expansion diverges and only variational or other non-perturbative methods apply. The Hellmann-Feynman theorem (Exercise 8) provides a bridge: it is exact for any eigenstate, but its usefulness in computation relies on having a good approximation to the state.

The Kato-Rellich theorem and convergence [Master]

The formal power series derived above is meaningful only if it converges (or at least provides a useful asymptotic expansion). The rigorous foundation is the Kato-Rellich theorem ^{[Kato 1966]} .

Theorem (Rellich, 1937; Kato, 1949). Let $H_{0}$ be a self-adjoint operator on a Hilbert space with an isolated eigenvalue $E_{n}^{(0)}$ of finite multiplicity. Let $V$ be symmetric and relatively bounded with respect to $H_{0}$ with relative bound less than 1. Then there exists $λ_{0} > 0$ such that for $∣ λ ∣ < λ_{0}$ , the operator $H_{0} + λV$ has an eigenvalue $E_{n} (λ)$ that is an analytic function of $λ$ , with $E_{n} (0) = E_{n}^{(0)}$ .

The relative boundedness condition means there exist constants $a < 1$ and $b$ such that $∥ V ψ ∥ \leq a ∥ H_{0} ψ ∥ + b ∥ ψ ∥$ for all $ψ$ in the domain of $H_{0}$ . This is a regularity condition that excludes perturbations that are too singular.

The theorem guarantees that the perturbation series converges for small enough $λ$ , not merely that it is asymptotic. The radius of convergence depends on the distance from $E_{n}^{(0)}$ to the rest of the spectrum: $λ_{0} \geq d / (2∥ V ∥)$ where $d = dist (E_{n}^{(0)}, σ (H_{0}) ∖ {E_{n}^{(0)}})$ is the spectral gap. When the spectral gap is small, the radius of convergence is small, and perturbation theory requires many orders.

For degenerate eigenvalues, Kato's theorem generalizes: the degenerate level splits into $g$ analytic branches, and the splitting at first order is given by the secular equation above. The eigenvalues may undergo avoided crossings as $λ$ varies — they approach each other but do not cross, a phenomenon connected to the Wigner-von Neumann non-crossing rule.

The Wigner-von Neumann non-crossing rule [Master]

For a one-parameter family of real-symmetric (or Hermitian) matrices $H (λ)$ , two eigenvalues $E_{i} (λ)$ and $E_{j} (λ)$ that are distinct at $λ = 0$ generically avoid crossing as $λ$ increases. The repulsion is a consequence of dimensional counting: a crossing requires $E_{i} = E_{j}$ (one real condition) and $⟨ ψ_{i} ∣ V ∣ ψ_{j} ⟩ = 0$ (one additional real condition for Hermitian matrices, two for real-symmetric matrices with additional symmetry). In a one-parameter family, satisfying two conditions simultaneously is generically impossible.

This is visible in the two-level system: $E_{\pm} (λ) = (E_{1} + E_{2}) /2 \pm (Δ/2)^{2} + λ^{2} v^{2}$ , with minimum gap $2∣ v λ ∣$ — the levels approach but never cross. Perturbation theory captures the leading order of this repulsion: the second-order correction pushes the levels apart.

For systems with additional symmetries (e.g., different angular momentum quantum numbers), crossings can and do occur because the symmetry forbids coupling between the two states, setting the off-diagonal matrix element to zero identically. The non-crossing rule applies only to levels that are coupled by the perturbation.

Brillouin-Wigner perturbation theory [Master]

An alternative to the Rayleigh-Schrodinger expansion is Brillouin-Wigner perturbation theory, in which the exact energy appears in the denominators. The first few iterations give

$E_{n} = E_{n}^{(0)} + ⟨ n^{(0)} ∣ V ∣ n^{(0)} ⟩ + m \neq = n \sum \frac{∣ ⟨ m ^{(0)} ∣ V ∣ n ^{(0)} ⟩ ∣ ^{2}}{E _{n} - E _{m}^{(0)}} + \dots .$

The advantage is that the denominators use the exact $E_{n}$ rather than $E_{n}^{(0)}$ , which can improve convergence for larger perturbations. The disadvantage is that the equation is implicit ( $E_{n}$ appears on both sides) and must be solved iteratively. For sufficiently small $λ$ , both formulations agree order by order. The Brillouin-Wigner form is the starting point for many-body perturbation theory in condensed-matter and nuclear physics.

Resolvent formalism and the spectral projection [Master]

The rigorous treatment of perturbation theory uses the resolvent of the Hamiltonian, $R (z) = (z - H)^{- 1}$ , defined for $z$ in the resolvent set $ρ (H) = C ∖ σ (H)$ . For the unperturbed Hamiltonian $H_{0}$ with isolated eigenvalue $E_{n}^{(0)}$ , choose a contour $Γ$ in the complex plane enclosing $E_{n}^{(0)}$ but no other point of $σ (H_{0})$ . The spectral projection onto the eigenspace of $E_{n}^{(0)}$ is

$P_{n}^{(0)} = - \frac{1}{2 π i} \oint_{Γ} R_{0} (z) d z,$

where $R_{0} (z) = (z - H_{0})^{- 1}$ .

When $H = H_{0} + λV$ , the resolvent expands as a Neumann series:

$R (z) = R_{0} (z) k = 0 \sum \infty λ^{k} (V R_{0} (z))^{k},$

convergent for $∣ λ ∣ < 1/ (∥ V ∥ \cdot sup_{z \in Γ} ∥ R_{0} (z) ∥)$ . The perturbed spectral projection is

$P_{n} (λ) = - \frac{1}{2 π i} \oint_{Γ} R (z) d z,$

and for $∣ λ ∣$ small enough, $Γ$ encloses exactly one eigenvalue $E_{n} (λ)$ of $H$ with $E_{n} (0) = E_{n}^{(0)}$ . The perturbed eigenvalue is recovered by $E_{n} (λ) = tr (H P_{n} (λ)) / tr (P_{n} (λ))$ .

Theorem (analyticity of the perturbed eigenvalue). Under the hypotheses of the Kato-Rellich theorem, $E_{n} (λ)$ and $P_{n} (λ)$ are analytic functions of $λ$ in a disk $∣ λ ∣ < λ_{0}$ . The Taylor coefficients of $E_{n} (λ)$ at $λ = 0$ are the Rayleigh-Schrodinger corrections.

Proof sketch. Analyticity of $R (z)$ in $λ$ follows from the Neumann series: each term $λ^{k} R_{0} (z) (V R_{0} (z))^{k}$ is analytic in $λ$ , and uniform convergence on compact subsets preserves analyticity. The contour integral of an analytic function is analytic, giving analyticity of $P_{n} (λ)$ . The trace $tr (H P_{n} (λ))$ is then analytic as a product of analytic functions. Uniqueness follows from discreteness of the spectrum inside $Γ$ : for $λ = 0$ the enclosed eigenvalue is $E_{n}^{(0)}$ by construction, and analyticity plus continuity prevents eigenvalue exchange across the contour. $□$

The resolvent approach has two advantages over the direct power-series substitution. First, it yields explicit radius-of-convergence estimates via the Neumann-series bound. Second, it extends to degenerate eigenvalues without modification: the contour encloses the entire degenerate cluster, and $P_{n} (λ)$ captures all $g$ perturbed eigenvalues at once, splitting them through the eigenvalues of $P_{n} (λ) H P_{n} (λ)$ restricted to the perturbed subspace.

Synthesis. The formalism developed in this unit is the foundational reason that most quantitative predictions in atomic, molecular, and condensed-matter physics are computable at all: the central insight is that an exactly solvable reference system plus a small correction yields a systematic expansion for every observable. The Rayleigh-Schrodinger series identifies the first-order energy correction with a simple expectation value and the second-order correction with a sum-over-states formula, and this is exactly the structure that generalises to time-dependent transitions 12.07.02, to the many-body diagrams of Goldstone perturbation theory, and to the Brillouin-Wigner implicit summation. The degenerate extension identifies the correct diagonalisation of the perturbation within the degenerate subspace as the bridge between the vanishing-denominator pathology and the physical level-splitting observed in the Stark and Zeeman effects. Putting these together with the Kato-Rellich theorem, the resolvent formalism provides the analytic foundation: perturbation theory is not a formal trick but a convergent expansion whose radius is controlled by the spectral gap. The pattern recurs throughout quantum physics — from the helium atom 14.04.01 pending to quantum electrodynamics — that physically meaningful corrections emerge from well-defined mathematical objects governed by the analytic structure of the perturbed resolvent.

Full proof set [Master]

Proposition (negativity of the ground-state second-order correction). Let $H_{0}$ have discrete spectrum $E_{0}^{(0)} < E_{1}^{(0)} \leq E_{2}^{(0)} \leq \dots$ with $E_{0}^{(0)}$ non-degenerate. Then $E_{0}^{(2)} < 0$ for any nonzero Hermitian perturbation $V$ .

Proof. The second-order correction is $E_{0}^{(2)} = \sum_{m \neq = 0} ∣ V_{m 0} ∣^{2} / (E_{0}^{(0)} - E_{m}^{(0)})$ . Since $E_{0}^{(0)}$ is the ground state, every denominator $E_{0}^{(0)} - E_{m}^{(0)}$ is strictly negative. The numerators $∣ V_{m 0} ∣^{2}$ are non-negative. At least one numerator is strictly positive: if all $V_{m 0} = 0$ for $m \neq = 0$ , then $V ∣ 0^{(0)} ⟩ = V_{00} ∣ 0^{(0)} ⟩$ , so $∣ 0^{(0)} ⟩$ is an eigenstate of $V$ . But $V \neq = 0$ and the unperturbed basis is complete, so some off-diagonal matrix element is nonzero — meaning at least one coupling to a different state exists. Hence at least one term is strictly negative and no term is positive, giving $E_{0}^{(2)} < 0$ . $□$

Proposition (variational bound for the perturbative ground state). For any normalized $∣ ψ ⟩$ , $⟨ ψ ∣ H ∣ ψ ⟩ \geq E_{0}$ where $E_{0}$ is the exact ground-state energy. When the perturbation series converges, $E_{0}^{(0)} + E_{0}^{(1)} \geq E_{0} \geq E_{0}^{(0)} + E_{0}^{(1)} + E_{0}^{(2)}$ , so the first-order result is an upper bound and the second-order result is a lower bound.

Proof. Expand $∣ ψ ⟩ = \sum_{n} c_{n} ∣ n ⟩$ in the exact eigenbasis of $H$ with $H ∣ n ⟩ = E_{n} ∣ n ⟩$ . Then $⟨ ψ ∣ H ∣ ψ ⟩ = \sum_{n} ∣ c_{n} ∣^{2} E_{n} \geq \sum_{n} ∣ c_{n} ∣^{2} E_{0} = E_{0}$ . For the first-order perturbative estimate: $⟨ 0^{(0)} ∣ H ∣ 0^{(0)} ⟩ = E_{0}^{(0)} + E_{0}^{(1)}$ is the Rayleigh-Ritz functional evaluated on $∣ 0^{(0)} ⟩$ , hence an upper bound on $E_{0}$ . The second-order correction satisfies $E_{0}^{(2)} < 0$ by the previous proposition, so $E_{0}^{(0)} + E_{0}^{(1)} + E_{0}^{(2)}$ lies below the first-order estimate. For convergent perturbation series, this sum approaches $E_{0}$ from below, establishing the lower bound. $□$

Proposition (level repulsion in the two-level system). For the two-level Hamiltonian $H = (E_{1}^{(0)} λ v λ v E_{2}^{(0)})$ with $v \neq = 0$ and $Δ = E_{2}^{(0)} - E_{1}^{(0)} > 0$ , the gap between the two eigenvalues satisfies $E_{+} - E_{-} = 2 (Δ/2)^{2} + λ^{2} v^{2} > Δ$ for all $λ \neq = 0$ .

Proof. The exact eigenvalues are $E_{\pm} = \overset{ˉ}{E} \pm (Δ/2)^{2} + λ^{2} v^{2}$ with $\overset{ˉ}{E} = (E_{1}^{(0)} + E_{2}^{(0)}) /2$ . The gap is $2 (Δ/2)^{2} + λ^{2} v^{2}$ . Since $v \neq = 0$ and $λ \neq = 0$ , the term $λ^{2} v^{2} > 0$ , so the gap exceeds $Δ$ . The second-order perturbation corrections $E_{1}^{(2)} = - v^{2} /Δ$ and $E_{2}^{(2)} = v^{2} /Δ$ capture the leading-order repulsion, and this is exactly the Wigner-von Neumann non-crossing rule for a one-parameter Hermitian family. $□$

Connections [Master]

Hydrogen atom (12.06.01) is the primary testbed for perturbation theory in quantum mechanics. The Bohr energies and hydrogenic wavefunctions serve as the unperturbed starting point for Stark, Zeeman, and fine-structure calculations. Every perturbative correction to hydrogen relies on the integrals $⟨ n, ℓ, m ∣ r^{k} Y_{q}^{p} ∣ n^{'}, ℓ^{'}, m^{'} ⟩$ computed from the solutions of 12.06.01.
Multi-electron atoms (14.04.01) extend the hydrogenic framework via perturbation theory. The helium atom's ground state is treated by taking $H_{0}$ as two independent hydrogen Hamiltonians and $V$ as the electron-electron repulsion $e^{2} / (4 π ϵ_{0} ∣ r_{1} - r_{2} ∣)$ . The first-order correction gives a 34% error relative to the measured ionization energy — large, but systematically improvable.
Schrodinger equation (12.04.01) is the eigenvalue equation into which perturbation theory inserts its power-series ansatz. The entire formalism is a systematic procedure for solving $H ∣ ψ ⟩ = E ∣ ψ ⟩$ when $H = H_{0} + λV$ .
Hilbert space and operators (12.02.02) provides the mathematical framework. The Hermiticity of $V$ ensures real energy corrections, the completeness of the unperturbed basis underlies the expansion of state corrections, and the spectral theorem justifies the eigenvalue decomposition.
Variational methods provide a complementary approach. Where perturbation theory expands in a small parameter, the variational principle optimizes over trial states. The two bracket the ground state energy from opposite sides and are often used together.
Time-dependent perturbation theory extends the formalism to perturbations that depend on time, governing transitions between states. The time-independent theory developed here is the prerequisite: the transition rates involve the same matrix elements $⟨ m ∣ V ∣ n ⟩$ and energy denominators.
Kato's theory of linear operators ^{[Kato 1966]} places the Rayleigh-Schrodinger expansion on rigorous mathematical footing via the analytic theory of operator-valued functions.

Historical & philosophical context [Master]

Perturbation theory in the physical sciences predates quantum mechanics. Lord Rayleigh developed perturbative methods for acoustics and vibration theory in The Theory of Sound (1877, 2nd edition), computing corrections to the frequencies of vibrating systems with small inhomogeneities. His approach — expand the solution in a small parameter and match orders — is formally identical to what quantum mechanics uses, applied to the classical wave equation rather than the Schrodinger equation.

The quantum version was introduced by Schrodinger in his fourth communication on wave mechanics (1926) ^{[Schrodinger 1926]}, where he applied the Rayleigh method to the Schrodinger equation to compute the Stark effect. Independently, Epstein (1916) and Schwarzschild (1916) had computed the Stark effect using the old quantum theory of Bohr and Sommerfeld, obtaining results that agreed with the experimental measurements of Stark (1913). Schrodinger's wave-mechanical perturbation theory reproduced these results from a more systematic framework and extended them to problems the old quantum theory could not handle.

The method became known as Rayleigh-Schrodinger perturbation theory, acknowledging both the classical origin and the quantum reformulation. It remains the standard approach taught in every quantum mechanics course and used in atomic, molecular, and condensed-matter physics.

The mathematical foundations were developed over several decades. Rellich (1937-1942) proved the analyticity of isolated eigenvalues under perturbation for bounded operators. Kato (1949, 1966) extended this to unbounded operators — the physically relevant case, since quantum Hamiltonians are generically unbounded — and established the relative boundedness condition that guarantees the perturbation series converges for small coupling. Kato's Perturbation Theory for Linear Operators (1966) ^{[Kato 1966]} remains the definitive mathematical reference.

The Brillouin-Wigner formulation (Brillouin 1932, Wigner 1935) introduced the implicit form with exact energies in the denominators. It saw limited use in atomic physics but became the foundation of many-body perturbation theory through the work of Goldstone (1957), Hugenholtz (1957), and others, who derived diagrammatic perturbation expansions for interacting fermion systems.

Perturbation theory encodes a philosophical stance: the real world is a small correction to an idealized model. This is both its power and its limitation. When the correction is genuinely small (hydrogen fine structure, weak-field Stark and Zeeman effects), the method is extraordinarily precise. When it is not (strongly correlated systems, quantum chromodynamics at low energies), the expansion diverges and non-perturbative methods — variational principles, lattice simulations, renormalization group techniques — become necessary.

Bibliography [Master]

Primary literature and historical sources:

Schrodinger, E., "Quantisierung als Eigenwertproblem (Vierte Mitteilung)", Annalen der Physik 81 (1926), 109–139. [Wave-mechanical perturbation theory applied to the Stark effect.]
Epstein, P. S., "Zur Theorie des Starkeffektes", Annalen der Physik 50 (1916), 489–520. [Stark effect via old quantum theory.]
Schwarzschild, K., "Zur Quantenhypothese", Sitzungsber. Kgl. Preuss. Akad. Wiss. (1916), 548–568. [Independent Stark effect calculation.]
Rayleigh, J. W. S., The Theory of Sound, 2nd ed. (Macmillan, 1894), §90-95. [Classical perturbation theory for vibrating systems.]
Kato, T., "On the convergence of the perturbation method", J. Fac. Sci. Univ. Tokyo 6 (1951), 145–226.
Rellich, F., "Storungstheorie der Spektralzerlegung", Math. Annalen 113 (1937), 600–619; 116 (1939), 555–570; 117 (1940), 356–382; 118 (1942), 462–484.

Textbooks:

Griffiths, D. J. & Schroeter, D. F., Introduction to Quantum Mechanics, 3rd ed. (Cambridge, 2018), Ch. 7. [Pedagogical introduction; the standard undergraduate reference.]
Sakurai, J. J. & Napolitano, J., Modern Quantum Mechanics, 2nd ed. (Cambridge, 2017), Ch. 5.1-5.2. [Graduate-level treatment with degenerate perturbation theory.]
Landau, L. D. & Lifshitz, E. M., Quantum Mechanics: Non-Relativistic Theory, 3rd ed. (Pergamon, 1977), §38-41. [Concise and physically motivated.]
Shankar, R., Principles of Quantum Mechanics, 2nd ed. (Plenum, 1994), Ch. 17-18. [Detailed exposition with many worked examples.]
Cohen-Tannoudji, C., Diu, B. & Laloee, F., Quantum Mechanics, Vols. I-II (Wiley, 1977), Ch. XI. [Thorough coverage of both non-degenerate and degenerate cases.]
Messiah, A., Quantum Mechanics, Vols. I-II (North-Holland, 1961), Ch. XVI. [Systematic treatment at the graduate level.]
Merzbacher, E., Quantum Mechanics, 3rd ed. (Wiley, 1998), Ch. 18. [Compact but rigorous.]

Mathematical foundations:

Kato, T., Perturbation Theory for Linear Operators (Springer, 1966; repr. 1995). [The definitive mathematical reference on analytic perturbation theory.]
Reed, M. & Simon, B., Methods of Modern Mathematical Physics, Vol. IV: Analysis of Operators (Academic Press, 1978), Ch. XII-XIII. [Rigorous spectral analysis and perturbation of isolated eigenvalues.]
Friedrichs, K. O., Perturbation of Spectra in Hilbert Space (AMS, 1965). [Compact presentation of the analytic theory.]

Wave 3 physics unit, produced per PHYSICS_PLAN §5 runbook. Hooks_out targets 14.04.01 and 12.06.01 are both proposed; no receiving unit yet confirms them. Status remains draft pending Tyler's review and external QM reviewer sign-off.

Prerequisites

12.06.01 pending
12.04.01 pending
12.02.02 pending

Tier anchors

beginner: Griffiths & Schroeter, Introduction to Quantum Mechanics, 3e (2018), Ch. 7 intro
intermediate: Sakurai & Napolitano, Modern Quantum Mechanics, 2e (2017), Ch. 5.1-5.2
master: Kato, Perturbation Theory for Linear Operators (Springer, 1966); Epstein 1916

References

tong
raw/pdfs/qft/qft.pdf · §1 Perturbation theory, hydrogen fine structure
TODO_REF
Sakurai & Napolitano — Modern Quantum Mechanics, 2e (Cambridge, 2017) · Ch. 5 Approximation Methods, §5.1-5.2
TODO_REF
Griffiths & Schroeter — Introduction to Quantum Mechanics, 3e (Cambridge, 2018) · Ch. 7 Time-Independent Perturbation Theory
TODO_REF
Kato — Perturbation Theory for Linear Operators (Springer, 1966) · Ch. 1-2

Reviewer

Tyler (pending external QM reviewer per PHYSICS_PLAN §6)

Estimated time

beginner: 15m
intermediate: 40m
master: 50m