12.09.02 · quantum / identical-particles

Exchange interaction and the helium atom

shipped3 tiersLean: none

Anchor (Master): Landau, L. D. & Lifshitz, E. M., *Quantum Mechanics: Non-Relativistic Theory*, 3rd ed. (Pergamon, 1977), Ch. IX §§62–66 (exchange interaction, the helium atom, the Heisenberg spin Hamiltonian); Bethe, H. A. & Salpeter, E. E., *Quantum Mechanics of One- and Two-Electron Atoms* (Springer, 1957), §§32–33

Intuition Beginner

Two electrons are perfectly interchangeable. You cannot paint one red and the other blue and follow them around, because nature does not let you tell them apart. Quantum mechanics takes this seriously: swapping the two electrons must leave every measurable prediction unchanged. For electrons there is a sharp rule about how the swap acts on the wavefunction — it must flip the sign. This sign rule is the whole source of the strange effects in this unit, and it has a famous consequence: two electrons can never share the exact same state, the fact behind the structure of the periodic table.

The sign rule ties the electrons' positions to their spins. If the two electrons line up their spins the same way, the rule forces them to keep their positions apart. If they pair their spins as opposites, the rule lets them sit close together. So aligning the spins changes where the electrons can be, even though the spins themselves do not push or pull. This indirect link is the exchange interaction.

Why does this matter? The energy of two electrons depends on how close they get, because closer electrons repel more. Through the sign rule, their spin arrangement decides how close they get, and therefore their energy. A purely magnetic-sounding question — are the spins aligned? — turns into a question about electric repulsion energy. That hidden lever is what makes magnets magnetic and what splits the spectrum of helium into two families.

Visual Beginner

Picture two clouds, one for each electron, sharing the space around a nucleus. In the first picture the spins are opposite. The sign rule lets the two clouds overlap, piling up in the middle, so on average the electrons sit closer and feel more repulsion. In the second picture the spins are aligned. Now the rule forces a gap: the clouds pull away from each other, leaving a hole between them where neither electron likes to be. The electrons sit farther apart and feel less repulsion.

The picture shows the heart of the matter. No force acts directly between the spins. The spin pattern only sets the shape of the position cloud, and the shape sets the repulsion. The energy gap between the two pictures is the exchange energy, and its size is fixed by how much the two clouds would have overlapped.

Worked example Beginner

Take helium in a simple excited arrangement: one electron stays in the lowest orbit (call it the orbit) and the other sits in a higher orbit (call it ). We ask how the energy depends on whether the spins are aligned or opposite.

Step 1. Split the repulsion energy into two pieces. The first piece is the plain average repulsion of the two clouds, a positive number; call it . The second piece is an extra correction tied to the overlap of the two orbits; call it , also positive here.

Step 2. Write the two energies. When the spins are opposite, the energy gets . When the spins are aligned, it gets . The only difference is the sign in front of .

Step 3. Compare them. Since is positive, is smaller than . The aligned-spin arrangement has the lower energy.

Step 4. Put numbers in. Suppose for this orbit pair and (in electron-volts, above some baseline). Then opposite spins give and aligned spins give . The gap between the two is electron-volts.

What this tells us: aligning the spins lowers the energy by , purely because the sign rule keeps the aligned electrons farther apart. The spins never touch; the bookkeeping of position does all the work. This gap is exactly the split between the two families of helium spectral lines.

Check your understanding Beginner

Formal definition Intermediate+

Two electrons have the antisymmetrised state space , the antisymmetric subspace of , with one-particle space . A coordinate is , bundling position and spin. The antisymmetrisation postulate for fermions requires that the total wavefunction change sign under exchange: $$ \Psi(\mathbf x_1, \mathbf x_2) = -,\Psi(\mathbf x_2, \mathbf x_1). $$ This is the two-particle case of the spin-statistics rule fixed in the fermionic-Fock-space treatment 12.13.02. Because the non-relativistic Hamiltonian is spin-independent, eigenstates factor into a spatial part times a spin part, and antisymmetry of the product forces opposite symmetries on the factors.

Two electron spins live in , which decomposes under the total spin into the antisymmetric singlet () and the symmetric triplet (): $$ \chi_{\text{singlet}} = \tfrac{1}{\sqrt2}\big(!\uparrow\downarrow - \downarrow\uparrow\big), \qquad \chi_{\text{triplet}} \in \Big{\uparrow\uparrow,\ \tfrac{1}{\sqrt2}(\uparrow\downarrow + \downarrow\uparrow),\ \downarrow\downarrow\Big}. $$ Antisymmetry of the full state then pairs the symmetric spatial wavefunction with the singlet and the antisymmetric spatial wavefunction with the triplet: $$ \Psi_{+} = \psi_{\text{sym}}(\mathbf r_1,\mathbf r_2),\chi_{\text{singlet}}, \qquad \Psi_{-} = \psi_{\text{antisym}}(\mathbf r_1,\mathbf r_2),\chi_{\text{triplet}}. $$

Given two orthonormal spatial orbitals , the symmetric and antisymmetric spatial states are $$ \psi_{\pm}(\mathbf r_1,\mathbf r_2) = \tfrac{1}{\sqrt2}\big[\phi_a(\mathbf r_1)\phi_b(\mathbf r_2) \pm \phi_b(\mathbf r_1)\phi_a(\mathbf r_2)\big], $$ with for the singlet and for the triplet. The triplet spatial state is the spatial face of a Slater determinant of the two spin-orbitals; the singlet spatial state is its symmetric partner. For the Coulomb perturbation (atomic units, ), define the Coulomb integral and the exchange integral: $$ J = \iint \phi_a^(\mathbf r_1)\phi_b^(\mathbf r_2),\frac{1}{r_{12}},\phi_a(\mathbf r_1)\phi_b(\mathbf r_2),d^3r_1,d^3r_2, $$ $$ K = \iint \phi_a^(\mathbf r_1)\phi_b^(\mathbf r_2),\frac{1}{r_{12}},\phi_b(\mathbf r_1)\phi_a(\mathbf r_2),d^3r_1,d^3r_2. $$ The integral is the classical electrostatic repulsion of the two charge clouds and ; the integral has no classical counterpart and arises from the cross term in which the two orbitals trade places between the bra and the ket. The same reappears as the exchange operator of the self-consistent field 12.09.03.

Counterexamples to common slips

  • The exchange integral is not a magnetic spin-spin coupling. It is an electrostatic matrix element; spin enters only by selecting which spatial symmetry is allowed. A spin-independent Hamiltonian produces a spin-dependent spectrum through this selection alone.
  • is real and, for the kernel with real orbitals, positive. Writing , one has , a self-energy of the overlap density. Positivity of is what makes the triplet lie below the singlet for excited helium.
  • The split into is exact only to first order in perturbation theory and only when are the same fixed orbitals in both states. If the orbitals are allowed to relax differently in singlet and triplet, the clean form is corrected.

Key derivation Intermediate+

Theorem (singlet/triplet splitting and the helium energies). Let two electrons occupy orthonormal spatial orbitals , perturbed by . To first order in degenerate perturbation theory the symmetric (singlet) and antisymmetric (triplet) spatial states have energies $$ E_{\pm} = E_a + E_b + J \pm K, $$ with for the singlet and for the triplet, where are the unperturbed orbital energies and are the Coulomb and exchange integrals.

Proof. The unperturbed two-electron Hamiltonian has the product states and as a degenerate pair with energy (taking ). The correct zeroth-order states under the perturbation are the symmetry-adapted combinations , which diagonalise within the degenerate subspace because commutes with particle exchange. Compute the first-order shift as the diagonal matrix element $$ \langle \psi_\pm | V | \psi_\pm \rangle = \tfrac12 \big\langle \phi_a\phi_b \pm \phi_b\phi_a ,\big|, V ,\big|, \phi_a\phi_b \pm \phi_b\phi_a \big\rangle. $$ Expanding the four terms and using the symmetry (relabel the two integration variables) and , $$ \langle \psi_\pm | V | \psi_\pm \rangle = \tfrac12\big( J + J \pm K \pm K\big) = J \pm K. $$ Adding the unperturbed energy gives . The cross terms carry the orbital swap, so the swap-induced integral enters with the sign of the spatial symmetry: for the symmetric (singlet) state, for the antisymmetric (triplet) state.

Bridge. This computation builds toward the entire theory of magnetic ordering, and the bridge is the observation that a spin-independent operator has produced a spin-dependent energy purely by selecting spatial symmetry. The foundational reason is the antisymmetrisation postulate: it locks spatial symmetry to total spin, so the position integral becomes a function of the spin state. This is exactly the mechanism that reappears again in the Heisenberg spin Hamiltonian derived at Master tier, where the energies are repackaged as eigenvalues of , and it generalises to the -electron case where the same becomes the non-local exchange operator of the Hartree-Fock field 12.09.03. The central insight, putting these together, is that exchange is not a force but a constraint, and the energy it costs or saves is the electrostatic self-energy of the overlap density .

Exercises Intermediate+

Advanced results Master

Theorem (Dirac spin Hamiltonian for two electrons). The first-order Coulomb splitting of a two-electron configuration is reproduced exactly by the effective spin operator $$ \hat H_{\text{spin}} = \big(E_a + E_b + J - \tfrac12 K\big),\mathbb 1 ; -; 2K,\mathbf S_1\cdot\mathbf S_2, $$ acting on the four-dimensional spin space, where are spin- operators in units of . The proof uses the projector identity , with eigenvalues on the triplet and on the singlet. Substituting, the triplet receives and the singlet receives , matching and . Dirac obtained this spin-operator form in 1929, recognising that a spin-independent electrostatic interaction can be encoded as an effective coupling between spins whose strength is the exchange integral. The sign of decides the ground state: (the usual atomic case) favours the triplet, an effective ferromagnetic coupling; favours the singlet, an effective antiferromagnetic coupling.

Ortho- and para-helium. The two-electron exchange splitting partitions the helium spectrum into two non-combining systems. Para-helium is the singlet () ladder, including the ground state; ortho-helium is the triplet () ladder, whose lowest member is the metastable state. Since electric-dipole radiation does not change total spin to good approximation, transitions between the two systems are strongly suppressed, and early spectroscopists mistook them for two distinct elements. For every excited configuration the ortho (triplet) term lies below the corresponding para (singlet) term by , a direct spectroscopic readout of the exchange integral. The metastability of — it cannot decay to the singlet ground state without flipping a spin — gives it a lifetime of thousands of seconds, the longest of any neutral-atom excited state.

Heitler-London and the covalent bond. The same machinery applied to two hydrogen atoms gives the first quantum explanation of the chemical bond. Heitler and London in 1927 built symmetric and antisymmetric spatial combinations of two atomic orbitals centred on the two protons. Here the overlap makes large and, crucially, the symmetric (singlet) spatial state is lower because the relevant exchange integral is negative once the nuclear attraction and overlap normalisation are included: the bonding orbital piles electron density between the nuclei, screening their repulsion. The result is the singlet ground state of with a binding energy of order an electron-volt, with the triplet purely repulsive. The same structure thus explains both the antibonding/bonding split of a molecule and the ortho/para split of an atom; the sign of the effective exchange is what distinguishes them.

From two spins to a lattice: the Heisenberg model. Heisenberg's 1928 insight was that exchange, not magnetic dipole forces, is strong enough to order spins in a ferromagnet. Magnetic dipole-dipole energies between neighbouring atomic moments are of order eV, far too small to survive room-temperature thermal agitation ( eV). Exchange integrals are of order eV. Summing the two-electron Dirac form over neighbouring pairs on a lattice gives the Heisenberg Hamiltonian $$ \hat H = -\sum_{\langle i j\rangle} 2 K_{ij},\mathbf S_i\cdot\mathbf S_j = -\sum_{\langle ij\rangle} J^{\text{ex}}{ij},\mathbf S_i\cdot\mathbf S_j, $$ writing $J^{\text{ex}}{ij} = 2K_{ij}J^{\text{ex}}J^{\text{ex}}$ anti-aligns them (antiferromagnetism). This model, built atom by atom from the two-electron exchange computed in this unit, underlies the modern theory of magnetism, spin waves, and quantum phase transitions.

Synthesis. The exchange interaction is the foundational reason that a spinless electrostatic Hamiltonian governs the magnetic order of matter, and the central insight is that antisymmetry converts a question about spin alignment into a question about charge overlap, with the exchange integral as the exact conversion factor. Putting these together, the first-order energies , the Dirac operator , the ortho/para split of helium, and the Heitler-London bond are four faces of one computation; the sign and magnitude of select among them. This is exactly the structure that generalises to the lattice: the two-electron exchange coupling, summed over neighbours, builds toward the Heisenberg model, and the same integral that appears here as a perturbative matrix element reappears again in the Hartree-Fock self-consistent field 12.09.03 as the non-local exchange operator that cancels the electronic self-interaction. The bridge from this unit to all of magnetism and quantum chemistry is the recognition that exchange is a constraint masquerading as a force, whose energetic price is the self-energy of the overlap density.

Full proof set Master

Proposition (correct zeroth-order states diagonalise the Coulomb perturbation). Within the degenerate subspace spanned by and (), the symmetry-adapted states diagonalise with eigenvalues .

Proof. In the ordered basis , the perturbation matrix has entries and by relabelling the dummy integration variables, while and since and the kernel are real and symmetric under . The matrix $$ \begin{pmatrix} J & K \ K & J \end{pmatrix} $$ has eigenvalues with eigenvectors , which are exactly the coefficient vectors of . Because for the exchange operator , these eigenvectors are simultaneously exchange-symmetric () and exchange-antisymmetric (), so the diagonalisation respects the symmetry required by the antisymmetrisation postulate.

Proposition (exchange eigenvalues of ). On two spin- particles, has eigenvalue on the singlet and on the triplet (units of ), and the projectors onto singlet and triplet are and .

Proof. The identity follows from expanding . With and , the singlet () gives and the triplet () gives . For the projectors, note that an operator with two eigenvalues and has spectral projectors for . Substituting gives , and .

Proposition (uniqueness of the Dirac spin Hamiltonian on the two-spin space). Any Hermitian operator on that is rotationally invariant (commutes with ) and assigns energy to the singlet and to the triplet is uniquely , equal to .

Proof. A rotationally invariant Hermitian operator is constant on each irreducible -multiplet by Schur's lemma, so it is determined by its singlet and triplet eigenvalues and equals . Insert the projectors from the previous proposition: . Setting and gives coefficient on and constant , the Dirac form. Uniqueness is the Schur step: no other rotational invariant exists on this four-dimensional space beyond the two projectors.

Connections Master

  • Fermionic Fock space and the Pauli principle 12.13.02. The antisymmetrisation postulate used throughout this unit is the two-particle instance of the canonical anticommutation relations: the triplet spatial state is the configuration-space image of a single occupied state of two fermionic creation operators, and the vanishing of the antisymmetric spatial wavefunction when the two orbitals coincide is the Pauli exclusion principle expressed as . The exchange integral is the matrix element of the two-body interaction between the direct and swapped occupations of the same orbital pair.

  • Hartree-Fock self-consistent field 12.09.03. The exchange integral defined here as a perturbative matrix element reappears in the -electron self-consistent field as the non-local exchange operator , whose kernel is the off-diagonal density . The two-electron split is the smallest case of the Slater-Condon energy expression, and the cancellation that removes self-interaction in Hartree-Fock is the same algebra that makes the helium ground state () carry no exchange splitting.

  • Angular momentum and the structure of spin 12.05.01. The singlet/triplet decomposition is the Clebsch-Gordan series for two spin- representations, and the operator is diagonalised by the same total-spin eigenstates. The eigenvalues and the projector construction are pure representation theory of , applied here to convert an electrostatic energy into a spin coupling.

  • Time-independent perturbation theory 12.07.01. The helium energies are a textbook application of first-order degenerate perturbation theory: the Coulomb repulsion is the perturbation, the degenerate product states are the unperturbed manifold, and symmetry-adaptation supplies the correct zeroth-order states that diagonalise the perturbation within the degenerate subspace.

Historical & philosophical context Master

Werner Heisenberg introduced the exchange interaction in 1926 in Zeitschrift für Physik [Heisenberg 1926], treating the two-electron problem as a resonance between the configurations in which the electrons are interchanged. He found that the energy splits according to the symmetry of the wavefunction under exchange, and that this "exchange resonance" (Austauschresonanz) accounts for the previously mysterious division of the helium spectrum into ortho- and para-systems. The effect had no classical analogue: it followed entirely from the indistinguishability of identical particles and the antisymmetry that quantum mechanics demands of electrons.

Walter Heitler and Fritz London applied the same idea to two hydrogen atoms in 1927 [Heitler-London 1927], producing the first quantum-mechanical theory of the covalent bond and showing that the symmetric spatial (singlet) state binds while the antisymmetric (triplet) state is repulsive. Paul Dirac in 1929 [Dirac 1929] recast the exchange splitting as an effective spin coupling, writing the interaction as a function of and thereby translating an electrostatic computation into the language of spin operators. Heisenberg then turned the two-electron result into a theory of bulk magnetism in 1928 [Heisenberg 1928], arguing that the exchange integral, not the weak magnetic dipole interaction, supplies the energy scale needed for ferromagnetic ordering, and summing the pairwise coupling into what is now the Heisenberg model. Landau and Lifshitz give the unified treatment of symmetry, the helium atom, and the spin Hamiltonian in Chapter IX of Quantum Mechanics [Landau-Lifshitz 1977], deriving the exchange interaction from the antisymmetry postulate and the spin Hamiltonian from the projector identity.

Bibliography Master

@article{Heisenberg1926,
  author  = {Heisenberg, Werner},
  title   = {Mehrk{\"o}rperproblem und Resonanz in der Quantenmechanik},
  journal = {Zeitschrift f{\"u}r Physik},
  volume  = {38},
  number  = {6--7},
  year    = {1926},
  pages   = {411--426}
}

@article{Heisenberg1928,
  author  = {Heisenberg, Werner},
  title   = {Zur Theorie des Ferromagnetismus},
  journal = {Zeitschrift f{\"u}r Physik},
  volume  = {49},
  number  = {9--10},
  year    = {1928},
  pages   = {619--636}
}

@article{Dirac1929,
  author  = {Dirac, Paul A. M.},
  title   = {Quantum Mechanics of Many-Electron Systems},
  journal = {Proceedings of the Royal Society of London A},
  volume  = {123},
  number  = {792},
  year    = {1929},
  pages   = {714--733}
}

@article{HeitlerLondon1927,
  author  = {Heitler, Walter and London, Fritz},
  title   = {Wechselwirkung neutraler Atome und hom{\"o}opolare Bindung nach der Quantenmechanik},
  journal = {Zeitschrift f{\"u}r Physik},
  volume  = {44},
  number  = {6--7},
  year    = {1927},
  pages   = {455--472}
}

@book{LandauLifshitzQM1977,
  author    = {Landau, Lev D. and Lifshitz, Evgeny M.},
  title     = {Quantum Mechanics: Non-Relativistic Theory},
  edition   = {3},
  publisher = {Pergamon Press},
  year      = {1977},
  note      = {Course of Theoretical Physics, Vol. 3, Ch. IX}
}

@book{BetheSalpeter1957,
  author    = {Bethe, Hans A. and Salpeter, Edwin E.},
  title     = {Quantum Mechanics of One- and Two-Electron Atoms},
  publisher = {Springer},
  year      = {1957}
}

@book{GriffithsQM2005,
  author    = {Griffiths, David J.},
  title     = {Introduction to Quantum Mechanics},
  edition   = {2},
  publisher = {Pearson Prentice Hall},
  year      = {2005}
}

@book{BransdenJoachain2003,
  author    = {Bransden, Brian H. and Joachain, Charles J.},
  title     = {Physics of Atoms and Molecules},
  edition   = {2},
  publisher = {Pearson Education},
  year      = {2003}
}