Density matrix and pure / mixed states
Anchor (Master): Landau & Lifshitz, Quantum Mechanics, 3e (Pergamon, 1977), §14; von Neumann, Mathematische Grundlagen der Quantenmechanik (1932), Ch. IV; Fano, Rev. Mod. Phys. 29 (1957), 74-93
Intuition Beginner
In unit 12.02.01 a quantum state was a single vector — a ket in a Hilbert space. That picture works when you know the state exactly. But two everyday situations break it, and both are completely ordinary in a real laboratory.
The first is ignorance. Suppose a machine prepares a particle in state half the time and the other half, and it does not tell you which. You do not have one ket. You have a list of kets, each with a probability. A single vector cannot record "fifty-fifty between these two".
The second is more surprising. Take two particles prepared together so that their fates are linked. Even if the pair as a whole sits in one definite joint state, one particle on its own has no state vector of its own. Look at it alone and it behaves as if it were a probabilistic mixture.
The density matrix (also called the density operator) is the object that handles both cases with one piece of machinery. Write it . For a known pure state it is built from the ket as — a way of packaging the same information. For a mixture it is a probability-weighted sum, , where the numbers add to one.
Why is one object enough? Because everything you can ever measure is an average, and the density matrix is exactly the thing that produces averages. The recipe is short: to predict the average reading of any instrument , take the matrix product and add up its diagonal. That sum-of-the-diagonal operation is called the trace, written . So the one rule you need is the average equals .
There is a clean test that separates the two situations. A state is pure — a single honest ket — when acts like a projector: applying it twice gives back the same thing, . A state is mixed when applying it twice shrinks it, so the diagonal sum of drops below one. The number is called the purity, and it equals one for a genuine pure state and less than one for any real blend.
Think of it like a weather forecast. A pure state is "tomorrow is sunny, certainly". A mixed state is "seventy percent sunny, thirty percent rain". Both are legitimate descriptions of your knowledge, and you act on the forecast by computing expected outcomes. The density matrix is the forecast written in the language quantum mechanics actually uses.
Visual Beginner
For a single spin (a qubit) every density matrix is a point in a solid ball, the Bloch ball. The centre is the completely mixed state: every measurement comes out fifty-fifty. The surface is the set of pure states, the honest single kets. The length of the arrow from the centre measures how polarised the spin is; a half-length arrow is a partial mixture with purity between one half and one.
Worked example Beginner
Compare two spin-1/2 ensembles, both giving fifty-fifty for a measurement along the -axis, but for opposite reasons.
Ensemble one: an unpolarised beam. Half the spins point up, half point down, and you do not know which is which. The density matrix is
The purity is the diagonal sum of the square. Squaring gives , and the diagonal sum is . Purity , so this is mixed.
Ensemble two: a polarised beam. Every spin points along , the single ket . Its density matrix is
Squaring this matrix returns the same matrix (check the top-left entry: ). So the diagonal sum of the square is . Purity , so this is pure.
Both have the same diagonal — both read fifty-fifty along . The off-diagonal entries tell them apart. The polarised beam has off-diagonal terms of ; measure it along and every spin reads with certainty. The unpolarised beam has zero off-diagonal terms; measure it along any axis and you still get fifty-fifty.
What this tells us: the diagonal alone does not fix a quantum state. The off-diagonal entries, the coherences, carry the difference between a definite superposition and a genuine blend, and the purity number captures that difference in a single value.
Check your understanding Beginner
Formal definition Intermediate+
Let be a Hilbert space. A density operator (density matrix) on is a linear operator satisfying three conditions:
- Hermitian: .
- Positive semidefinite: for every . Written .
- Unit trace: .
The standard construction realises any such operator as a statistical ensemble. Given normalised states and probabilities with , set
This satisfies all three conditions, and the spectral theorem (unit 12.02.02) shows every operator meeting the three conditions arises this way: its eigenvalues are the probabilities and its eigenvectors the ensemble members. The ensemble realising a given is not unique; only the operator is physically meaningful, a point made precise in the Master tier.
A state is pure when is a rank-one projector, for some unit vector , equivalently . Otherwise the state is mixed. The purity is the functional
bounded by in dimension , with for pure states and for the maximally mixed state .
Expectation values. For an observable (a self-adjoint operator), the expected measurement value in state is
For a pure state this reduces to the familiar , since . The trace formula extends the Born rule from kets to ensembles without any new postulate: it is the probability-weighted average of the pure-state expectation values, .
The sign convention here is the physicist's: probabilities are eigenvalues of and are non-negative, the trace is unit, and time evolution carries a factor (fixed in the Key derivation). A reader switching to a convention must flip the sign of every commutator in the dynamics.
Counterexamples to common slips
Same diagonal, different state. The matrices and share a diagonal. The first is maximally mixed (purity ); the second is the pure state (purity ). The diagonal alone never determines purity.
A mixture is not a superposition. The ensemble is not the pure superposition . The first has no off-diagonal coherences; the second does. Writing a mixture as a single ket is the most common error in the subject.
Different ensembles, one operator. The qubit operator is realised by the up/down ensemble and equally by the left/right ensemble . No measurement separates them, because they are the same operator.
Key derivation Intermediate+
Theorem (Purity criterion). Let be a density operator on a finite-dimensional Hilbert space . Then , with equality if and only if is pure, i.e. for some unit vector .
Proof. Since is Hermitian and positive semidefinite with unit trace, the spectral theorem furnishes an orthonormal eigenbasis and eigenvalues with
The eigenvalues form a probability distribution. From the spectral form, , so
For any probability distribution, , because each forces , and summing gives . This establishes the bound.
Equality requires for every , hence each . With the constraint , exactly one eigenvalue equals and the rest vanish. Then is a rank-one projector — a pure state. Conversely, a pure state has eigenvalues , giving .
Corollary (expectation reproduces the Born rule). Writing , the trace formula gives , the classical average of quantum expectation values. The two layers of probability — classical ignorance () and quantum amplitude () — combine in one trace.
Bridge. This criterion is exactly the statement that the purity is a basis-independent fingerprint of how spread an ensemble is, and it builds toward the entropy that quantifies that spread in the Master tier. The foundational reason the trace formula needs no new postulate is that generalises by linearity in : putting these together, the convex set of density operators has the pure states as its extreme points, and every mixed state is an average of pure ones. The bridge is the eigenvalue distribution , which appears again in the von Neumann entropy , where the same numbers that measure purity measure information. The central insight is that classical probability and quantum amplitude are different axes of one object, and the density operator is where they meet.
Exercises Intermediate+
Lean formalization Intermediate+
A Lean statement of the purity criterion sits just outside current Mathlib. The three defining properties can be stated with Matrix.IsHermitian, Matrix.PosSemidef, and Matrix.trace, but the packaging and the spectral argument for the purity criterion are not yet assembled. The intended statement, with the proof gap recorded in Mathlib gap analysis, reads:
-- Intended statement; not yet in Mathlib (see lean_mathlib_gap).
-- A density matrix: Hermitian, positive semidefinite, unit trace.
structure DensityMatrix (n : ℕ) where
ρ : Matrix (Fin n) (Fin n) ℂ
herm : ρ.IsHermitian
posdef : ρ.PosSemidef
tr_one : ρ.trace = 1
-- Purity criterion: tr(ρ²) ≤ 1, equality iff ρ is a rank-one projector.
theorem purity_le_one {n : ℕ} (d : DensityMatrix n) :
(d.ρ * d.ρ).trace.re ≤ 1 := by
sorry -- spectral theorem + ∑ λₙ² ≤ (∑ λₙ)² for a probability vectorThe proof reduces to the eigenvalue inequality for a probability vector, which Mathlib can express but does not currently bundle with the density-matrix structure.
Advanced results Master
The reduced density matrix and the partial trace. Let the total system live on with state . The partial trace over is the linear map defined on product operators by and extended by linearity. The reduced density matrix is the unique operator on reproducing all local expectation values: for every observable on . This uniqueness is what makes the honest state of subsystem : it is the only object an experimenter confined to can ever access.
A pure global state can have a mixed reduced state, and this is the formal seat of entanglement. For the Bell state , the reduced state is
maximally mixed despite the pair being in a definite pure state. Knowing the whole with certainty and knowing each part only statistically coexist without contradiction.
The von Neumann (Liouville) equation. If the closed system evolves by the Schrödinger equation , then evolves by
The sign is opposite to the Heisenberg equation of motion for observables, the quantum echo of the classical Liouville equation with the Poisson bracket replaced by . The solution is conjugation by the unitary propagator, with .
Von Neumann entropy. The information content of a state is
the Shannon entropy of the eigenvalue distribution. It vanishes for pure states and reaches for the maximally mixed state. Under unitary evolution , because conjugation permutes nothing in the spectrum — closed-system entropy is conserved, and entropy increase is a statement about coarse-graining or tracing out, not about the microscopic dynamics.
Purification. Every mixed state is the shadow of a pure one. Given on , introduce an auxiliary space of dimension at least and define . Then . Any two purifications of the same differ only by a unitary on the auxiliary system.
The qubit Bloch ball. Every qubit density matrix is with and . Purity is , so pure states sit on the sphere and the centre is maximally mixed. Unitary evolution rotates rigidly; decoherence shrinks it toward the centre.
Synthesis. Putting these together, the density operator is the single object on which statistics, dynamics, and information all act compatibly. The foundational reason a subsystem of a pure whole looks mixed is that the partial trace discards the correlations, and this is exactly the content of the Bell-pair computation: the reduced state is dual to the maximal entanglement of the global state. The von Neumann equation generalises Schrödinger evolution from kets to operators, and its unitary solution is precisely what keeps the entropy constant — the central insight that microscopic reversibility and the second law live at different levels of description. Purification identifies every mixed state with a pure state on a larger space, so the bridge between the open-system picture and the closed-system picture is a partial trace in one direction and an enlargement in the other. The Bloch-ball geometry makes the convex structure visible: pure states on the boundary, mixed states inside, and the same eigenvalue distribution that fixes purity also fixes entropy, the recurring thread that this pattern recurs throughout quantum information.
Full proof set Master
Proposition 1 (Partial trace is well-defined and trace-preserving). The map defined on a fixed orthonormal basis by is independent of the basis and satisfies .
Proof. Let and be two orthonormal bases related by a unitary , . Substituting,
The unitarity relation collapses the sum to , the original expression. For trace preservation, take a basis of ; then , since is an orthonormal basis of the tensor product.
Proposition 2 (Reduced state of a maximally entangled pair is maximally mixed). For , the reduced state is .
Proof. Form , expanding into four terms. Apply . The cross terms and its conjugate contribute for both , so they drop. The surviving diagonal terms give
Maximal entanglement of the pair becomes maximal ignorance of the part.
Proposition 3 (Unitary invariance of von Neumann entropy). For any unitary , .
Proof. The operator has the same eigenvalues as : if , then , so is an eigenvector with the same eigenvalue . Since depends only on the multiset , and that multiset is preserved, the entropy is unchanged. As a corollary, von Neumann evolution conserves entropy for all .
Proposition 4 (Solution of the von Neumann equation). The unique solution of with time-independent and initial value is , .
Proof. Differentiate the proposed solution: . With and ,
which rearranges to . Uniqueness follows because the equation is a linear first-order ODE in the finite-dimensional space of operators with a Lipschitz right-hand side, so the Picard-Lindelöf theorem applies.
Connections Master
The reduced density matrix and partial trace introduced here are the exact apparatus on which the Schmidt decomposition and entanglement entropy of unit 12.17.02 are built; the eigenvalues of are the squared Schmidt coefficients, and the entanglement entropy is the von Neumann entropy of that reduced state.
The quantum-information treatment of the density matrix in unit 12.17.01 specialises this foundational formalism to quantum channels, the data-processing inequality, and the operational tasks of state discrimination; that unit takes the purity criterion and the trace formula proved here as given and pushes toward Kraus maps and accessible information.
The von Neumann equation derived here is the statistical-ensemble companion of the Schrödinger and Heisenberg pictures in unit 12.03.01: the opposite commutator sign relative to the Heisenberg equation of motion is precisely the distinction between evolving states and evolving observables, and the trace formula is invariant under moving the time dependence between the two.
The spectral theorem of unit 12.02.02 supplies the eigenbasis that turns every abstract density operator into a probability distribution over orthonormal states, which is the single step that makes the purity criterion and the entropy functional computable; without Hermiticity and positivity the trace formulas would not yield real, bounded expectation values.
The Bloch-ball parametrisation connects forward to the angular-momentum and spin formalism of unit 12.05.01, where the vector is the expectation of the spin operator and unitary evolution acts as a rigid rotation generated by the algebra.
Historical & philosophical context Master
The density matrix has a genuinely dual origin in 1927. Lev Landau, then nineteen, introduced it in his first published paper on radiation damping, where the description of a quantum subsystem interacting with a field forced an object beyond the wavefunction [Landau 1927]. Independently and in the same year, John von Neumann constructed the same operator from the thermodynamic side, seeking a quantum analogue of the Gibbs ensemble and a definition of entropy for quantum states [von Neumann 1927]. Von Neumann's 1932 monograph gave the formalism its now-standard axiomatic shape, fixing the three defining properties and the trace rule for expectation values [von Neumann 1932].
The operator-technique synthesis that physicists use today, including the multipole and Bloch-vector expansions, was systematised by Ugo Fano in 1957, whose review consolidated the density-matrix and statistical-tensor methods scattered across atomic and nuclear physics [Fano 1957]. Landau and Lifshitz present the formalism compactly in §14 of their quantum mechanics volume, deferring the fuller entropy treatment to their statistical-physics volume [Landau-Lifshitz 1977], a division that mirrors the historical split between Landau's dynamical motivation and von Neumann's thermodynamic one.
The distinction between a proper mixture (classical ignorance about which pure state was prepared) and an improper mixture (the reduced state of an entangled whole) became a fault line in the philosophy of measurement. Both are described by identical density operators, and no local measurement separates them, yet they encode different physical situations. This indistinguishability is the lever decoherence theory uses, and it is the reason the reduced density matrix sits at the centre of the modern measurement-problem literature.
Bibliography Master
@article{landau1927daempfung,
author = {Landau, Lev D.},
title = {Das D{\"a}mpfungsproblem in der Wellenmechanik},
journal = {Zeitschrift f{\"u}r Physik},
volume = {45},
pages = {430--441},
year = {1927}
}
@article{vonneumann1927thermo,
author = {von Neumann, John},
title = {Thermodynamik quantenmechanischer Gesamtheiten},
journal = {Nachrichten von der Gesellschaft der Wissenschaften zu G{\"o}ttingen, Mathematisch-Physikalische Klasse},
pages = {273--291},
year = {1927}
}
@book{vonneumann1932grundlagen,
author = {von Neumann, John},
title = {Mathematische Grundlagen der Quantenmechanik},
publisher = {Springer},
year = {1932},
note = {English translation: Mathematical Foundations of Quantum Mechanics, Princeton University Press, 1955, Ch. IV}
}
@article{fano1957density,
author = {Fano, Ugo},
title = {Description of States in Quantum Mechanics by Density Matrix and Operator Techniques},
journal = {Reviews of Modern Physics},
volume = {29},
pages = {74--93},
year = {1957}
}
@book{landau1977quantum,
author = {Landau, Lev D. and Lifshitz, Evgeny M.},
title = {Quantum Mechanics: Non-Relativistic Theory},
edition = {3rd},
publisher = {Pergamon Press},
year = {1977},
note = {Course of Theoretical Physics, Vol. 3, \S 14}
}
@book{nielsen2010quantum,
author = {Nielsen, Michael A. and Chuang, Isaac L.},
title = {Quantum Computation and Quantum Information},
edition = {10th Anniversary},
publisher = {Cambridge University Press},
year = {2010},
note = {\S 2.4, density operators and partial trace}
}