01.01.02 · foundations / linear-algebra

Dual space and double dual

shipped3 tiersLean: none

Anchor (Master): Halmos — Finite-Dimensional Vector Spaces §13–§17; Lang — Algebra Ch. III §6; Bourbaki — Algèbre Ch. II §2

Intuition Beginner

A dual space is the home of all the rulers you can hold up to a vector. A vector is an object you can add and stretch. A ruler is a rule that reads off a single number from any vector you point at, and reads it off in a way that respects adding and stretching: a longer vector gets a bigger reading, twice the vector gets twice the reading, and the sum of two vectors gets the sum of the two readings.

The collection of all such rulers for a given vector space is itself a vector space, called the dual space . You can add two rulers, you can scale a ruler, and the result is still a ruler. So the dual space is built from the same toolkit as the original, but its members are measuring devices rather than the things being measured.

Why does this matter? Every measurement of a physical vector is a ruler in disguise. Reading off one coordinate, integrating a function against a fixed weight, evaluating a polynomial at a fixed point — these are all rulers, and they all live in the dual space of whatever space the vectors come from.

Visual Beginner

A column of numbers on the left stands for a vector in . A row of numbers in the middle stands for a ruler in . Multiplying the row by the column collapses both into a single number — the reading. Two different rows give two different readings of the same column.

The picture also shows the second dual . Every vector in doubles as a ruler on rulers: hand it a ruler , and the vector reads back , the number the ruler reads on . Roles swap: vectors become the things doing the measuring, and rulers become the things being measured.

Worked example Beginner

Work in the coordinate space of length-three columns. Take the vector

A ruler on this space is a row of three numbers that reads off from any column .

Take the ruler . Applying to gives . So reads off the first coordinate of any column.

Take a second ruler . Applying to gives .

Now add the two rulers: . Applying to gives . Adding the rulers added the readings.

What this tells us: rulers are themselves vectors. The dual space is built from the same combine-and-scale operations as the original space, but its members are the measurement rules.

Check your understanding Beginner

Formal definition Intermediate+

Let be a vector space over a field . The dual space is the set $$ V^* = {f : V \to \mathbb{F} \mid f \text{ is } \mathbb{F}\text{-linear}} $$ of -linear maps from to . Pointwise addition and pointwise scalar multiplication make into an -vector space: and for , , . Elements of are called linear functionals, linear forms, or covectors; the value is often written to emphasise the symmetric pairing .

When is finite-dimensional with ordered basis , the dual basis of consists of the functionals defined on the basis by the Kronecker rule $$ e^i(e_j) = \delta^i_j = \begin{cases} 1 & i = j, \ 0 & i \ne j, \end{cases} $$ and extended to all of by linearity: . The functional reads off the -th coordinate of a vector relative to the chosen basis.

The double dual is , the space of linear functionals on the dual space. There is a canonical map $$ \mathrm{ev} : V \to V^{**}, \qquad v \mapsto \mathrm{ev}_v, \qquad \mathrm{ev}_v(f) = f(v). $$ The notation does not name a basis: is the rule that hands any functional back its value at . This map is -linear in (by linearity of ) and is natural in with respect to linear maps, in a sense made precise below.

For a subspace , the annihilator of is $$ W^\perp = {f \in V^* : f(w) = 0 \text{ for every } w \in W} \subseteq V^. $$ It is a subspace of $V^\Phi \subseteq V^*\Phi^\perp = {v \in V : f(v) = 0 \text{ for every } f \in \Phi} \subseteq VW \subseteq VV \cong V^{**}W$ exactly.

For a linear map , the transpose or dual map is $$ T^* : W^* \to V^, \qquad T^(g) = g \circ T. $$ The arrow direction reverses: sends to , while sends to . Composition reverses, , and identities are preserved, . In matrix terms with respect to dual bases, the matrix of is the transpose of the matrix of — the operation that gives the construction its name.

Counterexamples to common slips

  • The isomorphism in finite dimensions is real but is not canonical: it requires a choice of basis (or of an inner product). Two different bases of give two different identifications , and the difference is visible in coordinates. The isomorphism via requires no such choice.
  • For infinite-dimensional , the canonical map is still injective but is no longer surjective. The algebraic double dual is strictly larger than already for a countable-dimensional vector space, and the topological double dual of a Banach space is larger still unless the space is reflexive.
  • The annihilator is taken inside the dual space, not inside the original. The orthogonal complement that one meets in inner-product geometry is a different object: it depends on a chosen inner product, while the dual-space annihilator is canonical.

Key theorem with proof Intermediate+

Theorem (dual-basis and dimension; Halmos §14, Shilov §4). Let be a finite-dimensional vector space over with ordered basis . The functionals defined by form a basis of $V^\dim V^* = \dim V = n\mathrm{ev} : V \to V^{**}v \mapsto \mathrm{ev}_v$, is an isomorphism.*

Proof. Three claims to establish in sequence: the are linearly independent in , the span , and the evaluation map is an isomorphism.

Step 1: independence of the dual basis. Suppose scalars satisfy $$ c_1 e^1 + c_2 e^2 + \cdots + c_n e^n = 0 $$ in . Apply both sides to the basis vector for an arbitrary . The left-hand side gives $$ \sum_{i=1}^n c_i , e^i(e_j) = \sum_{i=1}^n c_i , \delta^i_j = c_j, $$ and the right-hand side gives . Hence for every , so the family is linearly independent.

Step 2: spanning. Let be arbitrary. Define scalars for , and consider the functional $$ g = \sum_{i=1}^n c_i , e^i \in V^. $$ For any basis vector , $$ g(e_j) = \sum_{i=1}^n c_i , e^i(e_j) = \sum_{i=1}^n c_i , \delta^i_j = c_j = f(e_j). $$ The two functionals and agree on a basis of , so they agree on all of by linearity. Therefore , establishing spanning. The form a basis of $V^\dim V^* = n = \dim V$.

Step 3: evaluation is an isomorphism. The map , , is -linear in : for and , $$ \mathrm{ev}(au + bv)(f) = f(au + bv) = a f(u) + b f(v) = a , \mathrm{ev}(u)(f) + b , \mathrm{ev}(v)(f), $$ so .

The map is injective: if in , then for every . Expanding and applying gives for every , so .

By Step 1 and Step 2 applied to , the dual of has dimension , so . An injective linear map between finite-dimensional spaces of the same dimension is an isomorphism (rank-nullity, 01.01.05). The evaluation map is an isomorphism.

Bridge. This builds toward every algebraic and analytic use of duality. The foundational reason the construction works is exactly the dual-basis pairing : it identifies and as two sides of one bilinear pairing , and that pairing is the structural input behind every subsequent duality theorem. The central insight is that the isomorphism is dual to a choice of basis — change the basis and the identification changes — while the isomorphism via is canonical and natural. This pattern recurs in 01.01.15 (bilinear forms), where a non-degenerate bilinear form is exactly a basis-free isomorphism , and it generalises to functional analysis, where the Riesz representation theorem identifies a Hilbert space with its own continuous dual under the inner product. Putting these together, the dual space identifies with the space of scalar measurements of , the double dual identifies with itself canonically, and the transpose map identifies a linear map with its action on rulers — three facets of one duality framework. The bridge is the recognition that algebraic duality, geometric pairing, and the index calculus of upper and lower indices in tensor algebra are different presentations of one biorthogonal pairing . This appears again in 03.01.01 (tensor product) and downstream in 01.01.15 (bilinear and quadratic forms), where the same pairing is the universal target of bilinear maps.

Exercises Intermediate+

Lean formalization Intermediate+

Mathlib provides the dual-space infrastructure as Module.Dual K V := V →ₗ[K] K and the canonical double-dual map via Module.Dual.eval and Module.evalEquiv. The intended formalisation reads:

import Mathlib.LinearAlgebra.Dual
import Mathlib.LinearAlgebra.FiniteDimensional

variable {K V : Type*} [Field K] [AddCommGroup V] [Module K V]

/-- Dual basis of a finite basis. -/
example (b : Basis (Fin n) K V) : Basis (Fin n) K (Module.Dual K V) :=
  b.dualBasis

/-- Dimension of dual equals dimension of original for finite-dim V. -/
example [FiniteDimensional K V] :
    FiniteDimensional.finrank K (Module.Dual K V) =
      FiniteDimensional.finrank K V :=
  Module.finrank_dual_eq_finrank K V

/-- Canonical map V → V** is an isomorphism in finite dimensions. -/
noncomputable example [FiniteDimensional K V] :
    V ≃ₗ[K] Module.Dual K (Module.Dual K V) :=
  Module.evalEquiv K V

/-- Transpose / dual map; functoriality. -/
example {W : Type*} [AddCommGroup W] [Module K W] (T : V →ₗ[K] W) :
    Module.Dual K W →ₗ[K] Module.Dual K V :=
  T.dualMap

The proof gap declared in Mathlib gap analysis lies one level downstream. Mathlib has the abstract dual-basis API but does not ship the explicit Lagrange-interpolation duality witness for paired with point-evaluation functionals at distinct nodes. The double-annihilator identity for a submodule of a finite-dimensional space is recoverable from Submodule.dualAnnihilator plus Module.evalEquiv plus rank-counting, but is not packaged as a single simp lemma. Both are short upstream-contribution targets.

Advanced results Master

Proposition (functoriality of ). The assignment $V \mapsto V^T \mapsto T^\mathbb{F}(T \circ S)^ = S^* \circ T^\mathrm{id}_V^ = \mathrm{id}_{V^}(V \oplus W)^ \cong V^* \oplus W^$, with the obvious projections / injections swapped.

The composite is therefore a covariant functor. The natural transformation has components , , and is a natural isomorphism on the full subcategory of finite-dimensional vector spaces. Naturality means: for every linear map , the square $$ \begin{array}{ccc} V & \xrightarrow{\mathrm{ev}_V} & V^{} \ T \downarrow & & \downarrow T^{} \ W & \xrightarrow{\mathrm{ev}_W} & W^{} \end{array} $$ commutes. The verification is one line: $T^{}(\mathrm{ev}_V(v))(g) = \mathrm{ev}_V(v)(T^(g)) = (T^(g))(v) = g(T(v)) = \mathrm{ev}_W(T(v))(g)V \cong V^{**}V \cong V^*$ does not.

Proposition (rank of the transpose). For a linear map between finite-dimensional vector spaces, $\mathrm{rank}, T^ = \mathrm{rank}, T\ker T^* = (\mathrm{im}, T)^\perp\mathrm{im}, T^* = (\ker T)^\perp$.*

The matrix-theoretic content of this proposition is the row-rank equals column-rank theorem: for any matrix , . The duality proof exhibits the matrix theorem as a corollary of a categorical fact about the dual functor.

Proposition (annihilator duality). For finite-dimensional and a subspace , the dual map of the inclusion has kernel , and the induced map $V^/W^\perp \to W^V \to V/W(V/W)^ \to V^W^\perpW^(V/W)^V^W^\perp$ via the short exact sequence* $$ 0 \to W^\perp \to V^* \to W^* \to 0. $$

Theorem (algebraic-versus-topological dual; Banach 1932). For an infinite-dimensional Banach space with continuous dual , the canonical evaluation map is an isometric embedding. The space is called reflexive when is surjective. Reflexive examples: every Hilbert space (by Riesz representation), every for , every finite-dimensional Banach space. Non-reflexive examples: for infinite has continuous dual whose continuous dual is strictly larger than ; and are also non-reflexive.

The algebraic double dual of an infinite-dimensional vector space is always strictly larger than the original, by a cardinality argument on Hamel bases. The topological double dual of a Banach space can be either larger than or canonically isomorphic to the original, with reflexivity the dividing line.

Theorem (Riesz representation; Hilbert-space case). Let be a Hilbert space over or with inner product . For every continuous linear functional there is a unique with for every , and . The map , , is an isometric anti-linear isomorphism (linear in the real case).

Riesz representation realises the inner product as a canonical identification in the topological category — the analogue, for Hilbert spaces, of the basis-free identification in the algebraic category. The inner-product structure plays the role that the choice of basis plays in the algebraic finite-dimensional case, but here the identification with is canonical, not basis-dependent, because the inner product itself is the canonical extra datum that distinguishes a Hilbert space from a general Banach space.

Theorem (Bourbaki dual pairing). Let be -vector spaces and let be a bilinear pairing. The pairing is non-degenerate if the induced maps $V \to W^v \mapsto \langle v, - \rangleW \to V^w \mapsto \langle -, w \rangle\dim V = \dim W$.

The non-degenerate bilinear pairing is the canonical input to all of bilinear-form theory: symmetric non-degenerate pairings give quadratic forms (01.01.15), antisymmetric non-degenerate pairings give symplectic forms, and the Bourbaki framing makes the universal-property structure transparent.

Theorem (tensor algebra incarnation). Let be a finite-dimensional vector space over . The space of bilinear forms is canonically identified with $V^ \otimes V^kV^k \to \mathbb{F}k(V^)^{\otimes k}(p, q)pqV^{\otimes p} \otimes (V^)^{\otimes q}V \cong V^{**}VV^V$.*

This proposition is the algebraic substrate of the abstract index calculus in differential geometry and general relativity: upper indices live in , lower indices live in , contraction is the pairing , and a metric tensor on provides the basis-free isomorphism that lowers and raises indices in a coordinate-free way.

Synthesis. Duality is the foundational reason linear algebra has the symmetric structure it does. The central insight is that every vector space carries a canonical companion of linear functionals, that the companion has the same dimension in the finite-dimensional case, and that the second iteration returns to via a canonical map — basis-free and natural in the functorial sense. This builds toward the entire theory of bilinear forms (01.01.15), tensor products (03.01.01), differential forms on manifolds, and the topological duality theory of Banach and locally convex spaces. The bridge is that algebraic duality and analytic duality are the same construction at different levels of structural enrichment: the algebraic dual is the universal vector space of linear functionals, while the topological dual restricts to continuous functionals, and the Riesz representation theorem identifies the topological dual of a Hilbert space with itself canonically, generalising the basis-free finite-dimensional identification . Putting these together, every classical pairing in linear algebra — the matrix transpose, the annihilator, the bilinear form, the inner product, the integration pairing on function spaces — is dual to itself in a precise sense: applying the dual construction twice returns the original, and the canonical map back is the universal datum that organises all of duality.

The duality also identifies several constructions that look distinct at first inspection. The transpose of a linear map and the matrix transpose of its matrix are not two different operations but one: the latter is the matrix presentation of the former in dual bases. The annihilator of a subspace and the orthogonal complement under an inner product are not two different operations but one: the latter is the image of the former under the Riesz isomorphism . The cotangent space at a point of a smooth manifold and the dual of the tangent space are not two different objects but one: differential forms are sections of the cotangent bundle, and the entire de Rham complex is a tensor-power construction over the dual functor. This is exactly the unifying observation that appears again in 01.01.15 (bilinear forms), where a non-degenerate bilinear form is by construction an isomorphism , and in 03.01.01 (tensor product), where the tensor algebra is built from and together as the natural ambient for all multilinear constructions on the underlying vector space.

Full proof set Master

Proposition (functoriality of ), proof. For linear maps and , the composite satisfies $$ (T \circ S)^(g) = g \circ (T \circ S) = (g \circ T) \circ S = T^(g) \circ S = S^(T^(g)) $$ for any . So . For , , so . Additivity: . Direct sum: the pair of projections and has duals and whose images are complementary subspaces of with intersection , exhibiting the splitting . The component identifications are: corresponds to functionals supported on the summand.

Proposition (rank of the transpose), proof. A functional lies in iff iff vanishes on iff . The dimension identity combined with rank-nullity for , i.e. — gives , so . For the image identity, is immediate from , which vanishes whenever ; equality follows by dimension count, .

Proposition (annihilator duality), proof. Let be the inclusion. The dual map restricts a functional on to a functional on . A functional lies in iff its restriction to is zero iff . So . The first isomorphism theorem applied to gives an injection . The image is all of : any functional extends to a functional on by extending a basis of to a basis of and assigning arbitrary values on the new basis elements; the extension restricts back to . So .

For the quotient case, the projection has dual taking a functional on to , a functional on that vanishes on (since kills ). So . Conversely, any factors through as with well-defined on cosets by the vanishing on . So , and gives an isomorphism . Combining the two identifications, the short exact sequence is the dual of the short exact sequence .

Theorem (algebraic-versus-topological dual; Banach 1932), proof sketch. The Hahn-Banach extension theorem [Hahn 1927] says: every bounded linear functional on a subspace of a normed space extends to a bounded linear functional on with the same norm. Applied to the one-dimensional subspace spanned by a fixed nonzero , with the functional of norm , Hahn-Banach produces a functional with and . The evaluation map , , then has , so , while the Hahn-Banach functional witnesses , giving . The map is an isometric embedding. Surjectivity is the definition of reflexivity and is not automatic: has continuous dual whose continuous dual is strictly larger than (Banach 1932 Ch. IV [Banach 1932]).

Theorem (Riesz representation), proof sketch. Let be a continuous linear functional on the Hilbert space . If , take . Otherwise is a closed proper subspace of ; by Hilbert-space orthogonality, and is one-dimensional (since restricted to it is a nonzero linear functional from a one-dimensional space to ). Pick a unit vector and let . Set . For any decompose with , . Then and (using and ). The two agree, and is the desired representative. Uniqueness: if for every , then for every , in particular for , giving and . The norm equality comes from Cauchy-Schwarz saturated at .

Proposition (tensor algebra incarnation), proof sketch. A bilinear form is by the universal property of (proved in 03.01.01) the same as a linear map , equivalently an element of in finite dimensions. The natural isomorphism in finite dimensions is verified by exhibiting bases: a basis of is dual to the basis of , and the dual functional pairs to exactly as would. The dimensions agree, , so the bilinear-functor argument gives an isomorphism. Iterating produces the identification of -multilinear forms with and of mixed tensors with .

Connections Master

  • Vector space 01.01.03. The dual space is built from the same field and the same combine-and-scale operations as , but its elements are scalar-valued linear maps rather than vectors. Without the vector-space axioms there is no notion of linear functional, and without the field as codomain there is no target for the functionals to land in. The dual construction is the simplest nontrivial functor out of the category of -vector spaces.

  • Subspace, basis, dimension 01.01.04. The dual-basis construction depends on having a basis to start with, and the dimension identity rests on the equivalence of finite-dimensional bases. The annihilator dimension formula is a direct consequence of extending a basis of to a basis of and reading off the dual-basis support — making the dimension theory of subspaces the foundation for the dimension theory of annihilators.

  • Linear transformation, rank-nullity 01.01.05. The transpose / dual map is a linear map between dual spaces, and rank-nullity for gives the rank equality that is the duality content of the row-rank equals column-rank theorem for matrices. The kernel-image identities and are the dual-space packaging of the four fundamental subspaces of a matrix.

  • Bilinear and quadratic forms 01.01.15. A bilinear form is the same datum as a linear map , and is non-degenerate iff this map is an isomorphism. So bilinear-form theory is built directly on top of the dual space: symmetric forms give quadratic forms, antisymmetric forms give symplectic forms, and Hermitian forms give inner products. The classification of bilinear forms is a classification of basis-free isomorphisms .

  • Tensor product 03.01.01. The tensor square is canonically identified with the space of bilinear forms on in finite dimensions, and more generally classifies -multilinear forms. The mixed tensor space is the abstract home of tensors of type in differential geometry, with index raising and lowering performed by a fixed bilinear form on . The dual space is half the data of every tensor construction.

Historical & philosophical context Master

The notion of a linear functional emerged in two strands. The algebraic strand began with the recognition that the row vectors and column vectors of a matrix were two presentations of related-but-distinct objects: rows acted on columns to produce scalars, and column-on-row was a different operation. Hermann Grassmann's Die lineale Ausdehnungslehre (1844) [Bourbaki] introduced the algebra of -dimensional extension that implicitly treated linear forms as objects distinct from vectors, though the formal vector-space concept stabilised only with Giuseppe Peano's Calcolo geometrico secondo l'Ausdehnungslehre di H. Grassmann (1888). Élie Cartan's tensor calculus, developed across his 1894–1922 papers on Lie groups and differential geometry, used the index notation with upper and lower indices to encode the distinction between vectors and covectors at the level of computation.

The functional-analytic strand grew out of integral equation theory. Salvatore Pincherle's Mémoire sur le calcul fonctionnel distributif (Math. Annalen 49, 1897, 325–382) [Pincherle 1897] developed an early calculus of linear operators on function spaces, treating linear functionals as fundamental objects. Maurice Fréchet's 1906 thesis introduced function spaces with norm-style structure, and David Hilbert's 1904–1910 lectures on integral equations developed the spectral theory that demanded a coherent duality framework. Hans Hahn's Über lineare Gleichungssysteme in linearen Räumen (J. reine angew. Math. 157, 1927, 214–229) [Hahn 1927] proved what is now the Hahn-Banach extension theorem, the foundational result that makes the topological dual of a Banach space a substantive object. Stefan Banach's Théorie des opérations linéaires (Monografie Matematyczne 1, 1932) [Banach 1932] gave the systematic treatment of normed-space duality, including reflexivity, the weak and weak-star topologies, and the topological double dual.

The modern algebraic framing was assembled by the Bourbaki collective. Algèbre Ch. II (1947) presented the dual of a vector space as the universal example of a contravariant additive functor, and Algèbre Multilinéaire (Ch. III) absorbed the tensor algebra into a single functor-of-many-arguments framework. Frigyes Riesz's 1907 paper on functionals and his 1909 paper on functionals (Bull. Soc. Math. France) gave the representation theorems that identify these spaces with their own continuous duals; Riesz-Fischer theory and the abstract Hilbert-space Riesz representation followed. Saunders Mac Lane's categorical framing in Categories for the Working Mathematician (1971) made naturality of the double-dual embedding a textbook example of a natural transformation. Halmos's Finite-Dimensional Vector Spaces (Princeton 1942, 2nd ed. 1958) became the standard pedagogical source for the algebraic finite-dimensional theory; Shilov's Linear Algebra §4–§5 (English ed. Dover 1977) is the immediate source for the present unit's framing of linear forms, dual bases, and biorthogonal systems.

Bibliography Master

@book{Shilov1977LinearAlgebra,
  author    = {Shilov, Georgii E.},
  title     = {Linear Algebra},
  publisher = {Dover Publications},
  year      = {1977},
  note      = {Translated from the 1971 Russian edition by Richard A. Silverman}
}

@book{Halmos1958FDVS,
  author    = {Halmos, Paul R.},
  title     = {Finite-Dimensional Vector Spaces},
  publisher = {Van Nostrand},
  edition   = {2},
  year      = {1958}
}

@book{LangAlgebra,
  author    = {Lang, Serge},
  title     = {Algebra},
  publisher = {Springer-Verlag},
  series    = {Graduate Texts in Mathematics},
  volume    = {211},
  edition   = {Revised 3rd},
  year      = {2002}
}

@book{BourbakiAlgebreII,
  author    = {Bourbaki, Nicolas},
  title     = {{\'E}l{\'e}ments de math{\'e}matique: Alg{\`e}bre, Chapitres 1 {\`a} 3},
  publisher = {Hermann},
  year      = {1970}
}

@article{Pincherle1897,
  author  = {Pincherle, Salvatore},
  title   = {M{\'e}moire sur le calcul fonctionnel distributif},
  journal = {Math. Annalen},
  volume  = {49},
  year    = {1897},
  pages   = {325--382}
}

@article{Hahn1927,
  author  = {Hahn, Hans},
  title   = {{\"U}ber lineare Gleichungssysteme in linearen R{\"a}umen},
  journal = {J. reine angew. Math.},
  volume  = {157},
  year    = {1927},
  pages   = {214--229}
}

@book{Banach1932,
  author    = {Banach, Stefan},
  title     = {Th{\'e}orie des op{\'e}rations lin{\'e}aires},
  publisher = {Monografie Matematyczne},
  volume    = {1},
  year      = {1932}
}

@article{Riesz1907,
  author  = {Riesz, Frigyes},
  title   = {Sur une esp{\`e}ce de g{\'e}om{\'e}trie analytique des syst{\`e}mes de fonctions sommables},
  journal = {C. R. Acad. Sci. Paris},
  volume  = {144},
  year    = {1907},
  pages   = {1409--1411}
}

@book{Grassmann1844,
  author    = {Grassmann, Hermann},
  title     = {Die lineale Ausdehnungslehre, ein neuer Zweig der Mathematik},
  publisher = {Otto Wigand, Leipzig},
  year      = {1844}
}

@book{Axler2015LADR,
  author    = {Axler, Sheldon},
  title     = {Linear Algebra Done Right},
  publisher = {Springer},
  edition   = {3},
  year      = {2015}
}

@book{HoffmanKunze1971,
  author    = {Hoffman, Kenneth and Kunze, Ray},
  title     = {Linear Algebra},
  publisher = {Prentice-Hall},
  edition   = {2},
  year      = {1971}
}