Linear manifolds, hyperplanes, and affine subspaces
Anchor (Master): Shilov *Linear Algebra* Ch. 2; Berger *Geometry I* Ch. 2–3 (affine spaces, affine maps, the affine group, the projective completion); Gallier *Geometric Methods and Applications* Ch. 2–3 (affine and projective geometry); Bourbaki *Algèbre* Ch. II §9 (affine and projective spaces); Artin *Geometric Algebra* Ch. II
Intuition Beginner
A subspace is a flat sheet that must pass through the origin: a line through the centre, a plane through the centre, the whole space. But most of the lines and planes you actually care about do not pass through the origin. The line a thrown ball traces, the plane of a tabletop sitting above the floor — these are flat, but they are shifted away from the centre. A linear manifold is exactly such a shifted flat: take a subspace through the origin and slide the whole thing by one fixed offset.
So every linear manifold is built from two pieces. One piece is a direction: which way the flat runs, a subspace that records the slopes but forgets the position. The other piece is a single anchor point that says where the flat actually sits. Two different anchor points on the same flat give the same direction, because sliding from one point of the flat to another never leaves the flat. The direction is forced by the flat; the anchor is your free choice.
The most important case is a flat that is one dimension short of filling the whole space — a line inside a plane, or a plane inside ordinary space. These are hyperplanes, and they are exactly the things cut out by a single linear equation. The solution set of one equation in three unknowns is a plane; the solution set of one equation in two unknowns is a line. A whole system of equations carves out the flat where all those single-equation flats overlap.
Visual Beginner
Picture a flat plane through the origin, tilted in space, and then picture a second plane parallel to it, floating a fixed distance above. The lower plane is a subspace — it contains the origin. The upper plane is a linear manifold — the same set of directions, but lifted off the origin by one offset arrow. Every point of the upper plane is the tip of that offset arrow plus some arrow lying in the lower plane.
Two things to read off. The two planes share their directions but not their position, so they are parallel. And the upper plane is the set of points where one fixed measurement — the height read along the perpendicular arrow — equals one fixed number. That single measurement equalling a constant is the linear equation that cuts out the flat.
Worked example Beginner
Take ordinary three-dimensional space with coordinates . Consider the flat described by the single equation $$ x + y + z = 6. $$ We will write the same flat in two ways: as an anchor point plus directions, and as a level set of one measurement.
Step 1. Find one point on . Try , , : then , so the point lies on . Call it the anchor.
Step 2. Find the directions. A direction is a step you can take from one point of to another without leaving . Stepping from to keeps the sum at , so is a direction. Stepping to also keeps the sum at , so is a second direction. These two arrows are not multiples of each other, so they span a plane of directions.
Step 3. Write the flat both ways. As anchor plus directions, $$ L = (6, 0, 0) + s,(-1, 1, 0) + t,(-1, 0, 1), $$ where and range over all numbers. As a level set, is the set of points where the measurement "add the three coordinates" returns the number .
Step 4. Check. Put , : the point is , and . The point lands on , as it must.
What this tells us: one linear equation in three unknowns describes a plane, and that plane is an anchor point plus a two-dimensional sheet of directions. The equation form and the anchor-plus-directions form are two descriptions of one flat.
Check your understanding Beginner
Formal definition Intermediate+
Let be a field and a -vector space, with subspaces, bases, and dimension as in 01.01.04 and linear functionals and the dual space as in 01.01.02.
Definition (linear manifold / affine subspace). A linear manifold (equivalently affine subspace or flat) of is a subset of the form $$ L = v_0 + W = {, v_0 + w : w \in W ,}, $$ where is a linear subspace, called the direction (or direction space) of , and is a base point. The empty set is included by convention only where stated; otherwise "linear manifold" means a nonempty coset of a subspace. The dimension of is . A flat of dimension is a point, of dimension a line, of dimension a plane, and of dimension a hyperplane.
The base point is not unique, but the direction is. If as sets, then , and is an admissible base point precisely when . Thus determines exactly, and determines only modulo .
Definition (affine combination). An affine combination of points is a linear combination whose coefficients satisfy the affine constraint . A set is affinely closed when every affine combination of finitely many points of again lies in .
Definition (affine independence and affine span). Points are affinely independent when the displacement vectors are linearly independent in ; this condition does not depend on which point is singled out as . The affine span is the smallest linear manifold containing them, namely . When the points are affinely independent, , and every point of the span has a unique tuple of barycentric coordinates with and .
Definition (hyperplane). A hyperplane of is a linear manifold whose direction has codimension , that is where is the direction of . In finite dimension this is .
Definition (parallelism, half-spaces). Two linear manifolds are parallel when the direction of one contains the direction of the other, or . Over , a hyperplane with nonzero determines two closed half-spaces and , with .
Notation: is the coset of through ; ; is the difference set, equal to the direction; is the affine span of ; is the level set of at ; is the quotient space, whose dimension is the codimension of .
Counterexamples to common slips
- A linear manifold is not a subspace unless it contains . The set is a line in but is not closed under addition: and lie on it, yet their sum does not. It is closed under affine combinations, not linear ones.
- The base point is not part of the data. Writing tempts one to treat as intrinsic, but any other point of serves equally; only the difference set is forced by .
- "Hyperplane" means codimension one, not dimension one. In a hyperplane is three-dimensional. The defining feature is a single linear equation, not a single direction.
- The defining functional of a hyperplane is unique only up to a common nonzero scalar. The equations and cut out the same line; and describe one hyperplane for every nonzero .
Key theorem with proof Intermediate+
Theorem (three faces of a linear manifold; Shilov Ch. 2 [source pending]; Berger Ch. 2 [source pending]). Let be a -vector space and a nonempty subset. The following are equivalent.
- is a coset of a subspace .
- is affinely closed: every affine combination of points of with lies in .
- For some, equivalently every, point , the difference set is a subspace of .
Moreover, is a hyperplane if and only if for some nonzero linear functional $f \in V^c \in K(f, c)L$ up to a common nonzero scalar factor.*
Proof. . Suppose with a subspace. Take points of , with , and scalars summing to . Then $$ \sum_i \lambda_i x_i = \sum_i \lambda_i (v_0 + w_i) = \Big(\sum_i \lambda_i\Big) v_0 + \sum_i \lambda_i w_i = v_0 + \sum_i \lambda_i w_i, $$ using in the last step. Since is a subspace, , so the affine combination equals . Thus is affinely closed.
. Assume is affinely closed and fix any ; set . To show is a subspace we check it contains and is closed under sums and scalar multiples. First . For closure under scaling, let with and let . The combination is an affine combination of (coefficients sum to ), so it lies in ; subtracting , $$ \big(\mu x + (1-\mu) v_0\big) - v_0 = \mu (x - v_0) = \mu w \in W. $$ For closure under addition, let and with . The point is the affine combination , whose coefficients sum to , so it lies in ; subtracting , $$ (x_1 + x_2 - v_0) - v_0 = (x_1 - v_0) + (x_2 - v_0) = w_1 + w_2 \in W. $$ Hence is a subspace. The same construction with any other base point yields , which differs from by the translation and is therefore the same subspace; so the condition holds for every point once it holds for one.
. If is a subspace, then by construction, which is statement .
Hyperplane correspondence. Suppose first with nonzero. Since there is with ; then satisfies , so and is nonempty. For , if and only if , that is . Hence , a coset of the subspace . By the rank-nullity theorem for the functional from 01.01.05, has dimension , so has dimension ; thus has codimension and is a hyperplane.
Conversely, let be a hyperplane, so . The quotient map followed by any linear isomorphism gives a nonzero functional with . Set and . For , means , that is ; so .
Uniqueness up to scalar. Suppose with nonzero. Both kernels equal the direction of , so is a codimension-one subspace. Two functionals with the same codimension-one kernel are proportional: choose with ; every decomposes as with , whence , so with . Evaluating at any gives . Thus .
Bridge. The coset model builds toward the quotient construction of 01.01.04: the points of the quotient are the parallel cosets, so the set of all flats with a fixed direction is exactly the vector space , and the dimension count is the source-side reading of . The hyperplane-as-level-set correspondence appears again in the dual-space pairing of 01.01.02, where a hyperplane through the origin in is precisely a point of the projectivised dual , the seed of the points-versus-hyperplanes duality of projective geometry. The affine-combination characterisation connects forward to convex geometry, where restricting the coefficients to turns affine spans into convex hulls and hyperplanes into the separating hyperplanes of the Hahn-Banach circle. And the solution-set reading specialises here the Kronecker-Capelli theorem of 01.01.06: every consistent linear system presents its solution flat as the intersection of the hyperplanes cut out by its individual equations.
Exercises Intermediate+
Advanced results Master
Theorem (the affine group and the structure of affine maps; Berger Ch. 2 [source pending]; Gallier Ch. 2 [source pending]). Let be a finite-dimensional -vector space. A map is an affine map — one preserving affine combinations, whenever — if and only if for a unique linear map , the linear part, and a unique vector . The invertible affine maps form the affine group $$ \operatorname{Aff}(V) = V \rtimes \operatorname{GL}(V), $$ the semidirect product of the translation group by the general linear group, with multiplication . The linear part is the homomorphism , , whose kernel is the translation subgroup. Affine maps send flats to flats, preserve dimension when invertible, preserve parallelism, and preserve barycentric coordinates — these last being the complete affine invariants, in the sense that ratios of collinear lengths are preserved while absolute lengths and angles are not. The affine group sits inside the projective group of the projective completion as the stabiliser of the hyperplane at infinity, which is the precise statement that affine geometry is projective geometry with one hyperplane distinguished.
Theorem (projective completion and points at infinity). Embed as the affine hyperplane . The projective space — the set of lines through the origin of — decomposes as $$ \mathbb{P}(V \oplus K) = \underbrace{V}{\text{affine part}} \ \sqcup\ \underbrace{\mathbb{P}(V)}{\text{hyperplane at infinity}}, $$ where the affine part is the image of the embedding and the hyperplane at infinity is the set of directions: each line of acquires exactly one point at infinity, namely its direction, and two affine lines meet at infinity precisely when they are parallel. In this completion a flat of closes up to a projective subspace whose points at infinity are , the directions of ; parallelism becomes incidence at infinity, and the troublesome case-splitting of affine intersection — meet, or be parallel — collapses into the uniform projective statement that any two projective subspaces meet in the expected dimension.
Theorem (points–hyperplanes duality). In with , the hyperplanes are exactly the projective subspaces of dimension , and the map sending a hyperplane to the line $[f] \in \mathbb{P}(V^)$ is a bijection*
$$
{\text{hyperplanes of } \mathbb{P}(V)} \ \xrightarrow{\ \sim\ }\ \mathbb{P}(V^).
$$
This bijection is a duality: it inverts incidence, sending the pencil of hyperplanes through a fixed point to the points of a fixed hyperplane in $\mathbb{P}(V^)\mathbb{P}(V^{**})\mathbb{P}(V)$ through the canonical evaluation of 01.01.02. Every theorem about points and lines in the plane thereby acquires a dual theorem with the words "point" and "line" interchanged — the symmetry that makes Desargues' theorem self-dual and pairs Pascal's theorem with Brianchon's.
Synthesis. A flat carries two separable pieces of data, a direction subspace and a position, and every elementary operation on flats acts on these two pieces independently. Dimension, parallelism, and the count of defining equations are properties of the direction alone, read in the quotient and its annihilator ; position enters only through a single base point modulo , which is why the set of flats of fixed direction is itself the vector space . The coset model and the affine-combination model are two presentations of this same separation: the coset displays the direction explicitly, the affine combination encodes it implicitly through the constraint that quotients out the position. The hyperplane is the boundary case codimension one, where the direction is the kernel of a single functional and the position is the single scalar value of that functional; this is what makes a linear system a list of hyperplanes and its solution set their common flat, the geometric content of Kronecker-Capelli 01.01.06.
Passing to the projective completion absorbs the position data into the geometry by adjoining the directions as honest points at infinity, so that the affine group is recovered as the stabiliser of the hyperplane at infinity and the affine case-splitting of intersection dissolves into uniform projective incidence. Duality then exchanges the two pieces one level up: a hyperplane in is a point in , the annihilator of a flat is the flat of its defining functionals, and the entire affine theory of flats becomes a shadow, on a distinguished hyperplane, of the symmetric projective theory of subspaces.
Full proof set Master
Proposition (the difference set is the direction, and the base point is determined modulo it). Let be a nonempty linear manifold. Then , and a vector is an admissible base point — meaning — if and only if .
Proof. Take , so and with ; then , giving . Conversely, for , the points and lie in and have difference , so . Hence , an invariant of independent of the base point. For the base-point claim: if then for some , and since . Conversely, if then .
Proposition (affine span and barycentric coordinates are well-defined). Let be affinely independent. Then has dimension , and each of its points has a unique tuple of barycentric coordinates .
Proof. Write for ; affine independence means are linearly independent, so they span a subspace of dimension . A point with rewrites, using , as $$ x = p_0 + \sum_{i=1}^k \lambda_i (p_i - p_0) = p_0 + \sum_{i=1}^k \lambda_i u_i \in p_0 + W, $$ and conversely every element of arises this way. Hence the set of affine combinations equals , a flat of dimension , and it is the smallest flat containing the because any flat containing them contains and all displacements , hence . For uniqueness, suppose with both coefficient tuples summing to . Subtracting and grouping at , $$ 0 = \sum_{i=1}^k (\lambda_i - \mu_i) u_i, $$ and linear independence of the forces for ; the constraint then forces . The barycentric coordinates are therefore unique.
Proposition (intersection of a finite family of flats is a flat or empty; the parallel dichotomy). Let be a finite family of linear manifolds with directions . If , then it is a linear manifold with direction ; in particular two flats , either meet, in which case for any common point , or are disjoint, and disjoint flats with or are parallel.
Proof. Suppose , so for every by the base-point proposition. A point lies in if and only if for every , that is . Thus , and is a subspace as an intersection of subspaces; so the intersection is a flat of direction . The two-flat statement is the case . For the dichotomy: when the intersection is by definition empty; the additional remark records that if moreover one direction contains the other the flats are parallel by definition, which is the disjoint configuration of nested directions — for instance two distinct parallel lines, where and the difference obstructs a common point.
Proposition (a hyperplane separates its complement into two pieces over an ordered field). Let be an ordered field, a -vector space, and a hyperplane with $f \in V^V \setminus HH^{+}{\circ} = {x : f(x) > c}H^{-}{\circ} = {x : f(x) < c}[x, y] = {(1-t)x + ty : 0 \le t \le 1}H^{+}{\circ}H^{-}{\circ}H$ in exactly one point.*
Proof. For , , so or but not both, by trichotomy in the ordered field ; this partitions into and disjointly. Convexity of : if and and , then , using that a convex combination of two quantities each exceeding exceeds ; likewise for . For the segment crossing, parametrise , an affine function of with and . Solving gives the unique $$ t_* = \frac{f(x) - c}{f(x) - f(y)} \in (0, 1), $$ the denominator nonzero since , and because numerator and denominator are both positive and the numerator is the smaller. The single crossing point lies on , and no other value of solves the affine equation .
Connections Master
The coset model is the geometric face of the quotient space of 01.01.04: the points of the quotient are exactly the parallel cosets of , so the family of all flats sharing a direction is itself a vector space, and the rank-nullity identity is the dimension count read on flats.
The hyperplane-as-level-set correspondence is dual to the dual-space theory of 01.01.02: a hyperplane through the origin is the kernel of a nonzero functional, the annihilator of a -flat's direction is an -dimensional space of defining functionals, and the points-versus-hyperplanes duality of projective space is the projectivisation of the canonical pairing . The same annihilator computation reappears in the four-fundamental-subspaces orthogonality of 01.01.10, where over an inner-product space the defining functional is realised by the normal vector and the half-spaces acquire a metric meaning.
The affine structure of the solution set is precisely the Kronecker-Capelli theorem of 01.01.06 read geometrically: a consistent linear system is an intersection of hyperplanes, its solution flat has direction and dimension , and the particular-plus-homogeneous decomposition is the choice of a base point plus the direction. This flat-of-solutions picture propagates to the geodesics and affine connections of 13.02.01, where the flat affine structure of is the local model that a connection curves, and the affine group reappears as the structure group of an affine bundle.
Historical & philosophical context Master
The systematic idea that geometry could be done with points described by weights, rather than by coordinates relative to an origin, is due to August Ferdinand Möbius, whose 1827 Der barycentrische Calcul introduced barycentric coordinates: a point of a triangle or simplex specified by the masses one would place at its vertices to balance there [Möbius 1827]. The barycentric description is intrinsically affine — it never names an origin — and it is the historical source of the affine-combination characterisation of flats used in this unit. The general -dimensional theory of linear extension, in which flats are spanned by points and carry a dimension independent of any fixed coordinate frame, was constructed by Hermann Grassmann in the 1844 Ausdehnungslehre [Grassmann 1844]. The separation of the affine from the projective and the metric, and the recognition that affine geometry is projective geometry with a distinguished hyperplane at infinity, belongs to the nineteenth-century projective school and was given its group-theoretic form in Felix Klein's Erlangen programme.
The treatment of linear manifolds as cosets of subspaces, with hyperplanes as level surfaces of a linear form and the solution set of a system as a manifold, is the form in Georgi Shilov's Linear Algebra (1971 English translation) followed here; the modern axiomatic affine-space framework, in which the difference of two points is a vector and the affine group is the semidirect product , is the presentation of Marcel Berger's Geometry I (1987) and of Jean Gallier's Geometric Methods [Berger 1987].
Bibliography Master
@book{Mobius1827,
author = {M\"obius, August Ferdinand},
title = {Der barycentrische Calcul},
publisher = {Johann Ambrosius Barth},
address = {Leipzig},
year = {1827}
}
@book{Grassmann1844,
author = {Grassmann, Hermann},
title = {Die lineale Ausdehnungslehre, ein neuer Zweig der Mathematik},
publisher = {Otto Wigand},
address = {Leipzig},
year = {1844}
}
@book{Shilov1977,
author = {Shilov, Georgi E.},
title = {Linear Algebra},
publisher = {Dover Publications},
address = {New York},
year = {1977},
note = {Translation of the 1971 Russian edition, transl. R. A. Silverman}
}
@book{Berger1987,
author = {Berger, Marcel},
title = {Geometry I},
series = {Universitext},
publisher = {Springer-Verlag},
address = {Berlin},
year = {1987},
note = {Translation of G\'eom\'etrie, Cedic/Nathan, 1977}
}
@book{Gallier2011,
author = {Gallier, Jean},
title = {Geometric Methods and Applications: For Computer Science and Engineering},
edition = {2nd},
series = {Texts in Applied Mathematics},
volume = {38},
publisher = {Springer},
year = {2011}
}
@book{Artin1957,
author = {Artin, Emil},
title = {Geometric Algebra},
publisher = {Interscience Publishers},
address = {New York},
year = {1957}
}
@book{Bourbaki1970,
author = {Bourbaki, Nicolas},
title = {Alg\`ebre, Chapitres 1 \`a 3},
publisher = {Hermann},
address = {Paris},
year = {1970}
}