A Deductive Calculus for First-Order Logic and Soundness
Anchor (Master): Enderton 2001 *A Mathematical Introduction to Logic* 2e §2.4-2.5 (the full Hilbert calculus with the equality axioms and normal models, the generalization theorem, the deduction theorem, the soundness theorem, and the syntactic notions of consistency and inconsistency that the completeness theorem will tie to satisfiability); Mendelson 2015 *Introduction to Mathematical Logic* 6e (CRC) Ch. 2 (the predicate calculus, its axioms and the generalization rule, the deduction theorem with its eigenvariable restriction); Shoenfield 1967 *Mathematical Logic* (Addison-Wesley) Ch. 2-3 (an economical Hilbert system, the equality and substitution axioms); van Dalen 2013 *Logic and Structure* 5e (Springer) Ch. 2-3 (natural deduction for first-order logic as an alternative calculus); Takeuti 1987 *Proof Theory* 2e (North-Holland) Ch. 1 (the sequent calculus LK, the cut rule, and the cut-elimination theorem)
Intuition Beginner
The previous unit answered when a sentence is true: you pick a world, fix what every symbol means, and check. That is a question about meaning. This unit answers a completely different question, one that never mentions meaning at all: when can a sentence be proved? A proof here is a chain of written lines, each one either an assumption you started with, or a fixed logical starting-line that is allowed no matter what, or a single mechanical step that reads two earlier lines and writes a third. The last line of the chain is what you proved.
The striking thing is that this rule-pushing game refuses to look at any world. It never asks what the objects are or what the symbols mean. It only checks the shape of the lines. A clerk who knew no mathematics could verify a proof by pattern-matching alone, and a machine certainly can. So we now have two separate notions sitting side by side: "true in every world," which is about meaning, and "reachable by the chain game," which is about shape.
A natural worry is whether the shape game might cheat — whether it could prove something that is actually false in some world. This unit shows it cannot. Every starting-line is true in every world, and the one mechanical step never turns truths into a falsehood. So everything you can prove really is true everywhere. That guarantee is called soundness. The harder reverse question — is everything true-everywhere also provable? — is saved for the next unit.
Visual Beginner
A proof is a vertical list of lines. Each line is justified by one of exactly three things: it is an assumption, it is a free logical starting-line, or it is built from two lines already above it by the single detachment step. The diagram shows a short proof and how each line earns its place.
THE ONLY MECHANICAL STEP (detachment):
if you have written A
and you have written A then B
you may write down B
A SHORT PROOF of q from the assumptions { p , p then q }:
line 1 p (an assumption)
line 2 p then q (an assumption)
line 3 q (detachment from lines 1 and 2)
^^^ the last line is what was proved
NOTHING in this proof ever asked what p or q MEAN.Read it top to bottom. Lines 1 and 2 are just the assumptions, copied in. Line 3 is the only real move: it sees the pattern "A" on line 1 and "A then B" on line 2, and writes "B." The whole proof is finite, every line is justified by a line above it or by a fixed rule, and at no point does anyone decide whether is true.
| line | what it says | why it is allowed |
|---|---|---|
| 1 | it is an assumption | |
| 2 | then | it is an assumption |
| 3 | detachment from lines 1 and 2 |
Because the rule only matches shapes, the same three-line proof works whatever and stand for.
Worked example Beginner
We build a complete proof of one specific sentence and then check, by hand, that the move we relied on is safe in every world. The target: from the single assumption "for-all , ( is happy)," prove " is happy," where is one fixed name.
Step 1. Write the assumption as the first line: "for-all , ( is happy)." This is our only starting material.
Step 2. Use a free logical starting-line. One of the allowed starting-lines says: from "for-all , (something about )" you may pass to "the same thing with the name put in for ." Written as a line it reads: "( for-all , ( is happy) ) then ( is happy)." We are allowed to write this down for free, because it is one of the fixed logical axioms.
Step 3. Now we have two lines: the assumption "for-all , ( is happy)," and the starting-line "( for-all , ( is happy) ) then ( is happy)." These match the detachment pattern: line 1 is "A," line 2 is "A then B." Apply detachment.
Step 4. Detachment writes the third line: " is happy." That is the target, so the proof is finished in three lines.
Step 5. Check the move is safe in any world. Suppose a world makes "for-all , ( is happy)" true. Then every object of that world is happy, and the object named is one of those objects, so " is happy" is true there too. The step never failed.
What this tells us: a single fixed starting-line ("drop the for-all to any name") plus detachment already proves a real fact, and the starting-line is true in every world for the same plain reason a universal claim covers each of its instances. The next sections turn "every starting-line is safe in every world" into the precise guarantee that proof can never outrun truth.
Check your understanding Beginner
Formal definition Intermediate+
Fix a first-order language with equality and the syntax of 42.01.04 pending. The deductive calculus is Hilbert-style: many logical axioms, one rule. The logical axioms are all generalizations (prefixing zero or more 's) of formulas of the following groups, where range over formulas, over variables, over terms substitutable for in , and over variables [Enderton §2.4]:
where in (A6) is atomic and arises from by replacing some occurrences of by . The single rule of inference is modus ponens (detachment): from and infer . There is no separate quantifier-introduction rule; generalization is recovered as a derived rule below.
A deduction of from a set of formulas is a finite sequence with in which each is a logical axiom, a member of , or follows from two earlier entries () by modus ponens. One writes (" proves "), and for (a theorem). The relation is the purely syntactic counterpart of the semantic of 42.01.04 pending, defined without any reference to structures or truth.
A set is inconsistent if for every formula ; equivalently (using a tautology axiom) if and for some . Otherwise is consistent. A structure with an assignment is a model of if for all ; when is read as genuine identity, is a normal model, the only kind the equality axioms (A5)–(A6) are designed to be sound for.
Counterexamples to common slips Intermediate+
"Generalization is a basic rule of the calculus." It is not a primitive rule here; the only primitive rule is modus ponens. Generalization is a derived rule with a side condition (the variable must not be free in the premises). Treating it as unconditional licenses the false deduction from a formula with free.
"The instantiation axiom (A2) holds for every term ." It requires substitutable for in . Without that, would be an instance with , which is false in any two-element structure. Substitutability blocks the variable capture.
"Consistency means proves no contradiction of the form ." Equivalently, but the working definition is that does not prove every formula. The two agree because a single proved contradiction, via the tautology (A1) and modus ponens, proves an arbitrary .
"The equality axioms make definable." They constrain its behavior to a congruence but do not define it. A structure may satisfy (A5)–(A6) with a coarser congruence than identity; such a non-normal model is repaired by quotienting, the maneuver the completeness theorem
42.01.06pending uses to manufacture a normal model.
Key theorem with proof Intermediate+
The load-bearing metatheorem of the syntax-versus-semantics match is soundness: nothing the calculus proves can fail in a model. It is the routine half of the bridge whose hard half is completeness 42.01.06 pending, and its proof is the template for every "induction on a deduction" argument in the subject.
Theorem (Soundness). If then . In particular every theorem is valid, and every satisfiable set of formulas is consistent.
Proof. It suffices to prove two lemmas: every logical axiom is valid (satisfied by every structure under every assignment), and modus ponens preserves satisfaction by a fixed pair . Granting these, fix with a deduction , and fix any with for all . Induct on to show . If this holds by choice of ; if is a logical axiom it holds by the validity lemma; if comes by modus ponens from earlier , then and by induction, so by the preservation lemma. At this is , so .
Validity of the axioms. A generalization is valid iff is, since for all iff for all ; so it is enough to validate each base form. (A1): a tautology is true under every assignment of truth values to its prime subformulas, hence under the actual truth values and supply, so it is satisfied by every . (A2): suppose ; then for all , in particular for , and the substitution lemma of 42.01.04 pending gives , using substitutability of . (A3): if and , then for every both and , whence for all , i.e. . (A4): if is not free in and , then by the coincidence lemma of 42.01.04 pending for every , so . (A5): always. (A6): if , then replacing some occurrences of by in an atomic leaves the tuple fed to the relation (or to identity) unchanged, so , and the conditional holds.
Preservation by modus ponens. If and , then by the satisfaction clause for either or ; the first is excluded, so .
Corollary (consistency from satisfiability). If has a model then is consistent. For if were inconsistent it would prove some and ; soundness gives and , so a model of satisfies both and , impossible.
Bridge. Soundness is the foundational reason the syntactic turnstile can never overshoot the semantic , and this is exactly the validity of axiom (A2) read off the substitution lemma of 42.01.04 pending: the lemma converts the semantic instantiation at into the syntactic substitution , so the one axiom that touches terms is correct precisely on the substitutability side condition isolated in the syntax unit. The argument builds toward the deduction theorem and the generalization theorem below, whose proofs reuse the same induction-on-deductions skeleton, and it appears again in the completeness theorem 42.01.06 pending, where the converse inclusion is proved by building a model out of a consistent set — making consistency and satisfiability coincide, the equivalence whose easy half is the corollary just proved. The central insight is that an inductive syntax (deductions built by one rule from fixed axioms) admits proof-by-induction of any property preserved by the rule and held by the axioms; this is exactly the schema soundness instantiates, and it generalises from "being valid" to consistency, conservativity, and the subformula property of the sequent calculus. Putting these together, the calculus is now pinned to the semantics from one side, and the entire weight of the chapter shifts to the single open inclusion that the next unit closes.
Exercises Intermediate+
Advanced results Master
The calculus fixed in the Formal definition supports four developments: the derived metarules (deduction theorem, generalization, generalization on constants, the rule of substitution) that make workable; the soundness theorem refined to the equality axioms and normal models; the syntactic theory of consistency and its soundness-given link to satisfiability; and the alternative calculi — natural deduction and the sequent calculus with cut — that compute the same by other means.
Theorem 1 (the derived rules: deduction, generalization, substitution). The deduction theorem holds with no side condition, because modus ponens is the sole rule [Enderton §2.4]. The generalization theorem states that if and is free in no member of , then ; the restriction is the eigenvariable condition, and dropping it falsifies . The generalization-on-constants theorem says that if and occurs in no member of , then for a fresh , the syntactic embodiment of " was arbitrary." The rule of substitution follows: from one obtains for substitutable , and from one obtains every instance by (A2) and modus ponens. These four turn the spartan axiomatic system into the calculus actually used in proofs.
Theorem 2 (soundness with equality and normal models). Soundness holds over the class of all structures interpreting as genuine identity [Enderton §2.4]. The equality axioms (A5) and (A6) are valid in exactly the normal models, where is identity; they fail if is read as a proper congruence, which is why a model produced syntactically must be quotiented by the provable-equality relation to become normal. The validity argument is the axiom-by-axiom check of the Key theorem, with (A6) reflecting that a relation cannot distinguish equal tuples. Soundness for sentences specializes to : every closed theorem is true in every normal model.
Theorem 3 (syntactic consistency and the easy half of the equivalence). A set is consistent iff it does not prove every formula, iff for no does and [Shoenfield Ch. 3]. Consistency is finitary: is inconsistent iff some finite is, since any deduction of a contradiction uses finitely many premises — the syntactic root of compactness 42.01.02 pending. Soundness yields the easy direction of the consistency–satisfiability equivalence: every satisfiable is consistent, equivalently every inconsistent is unsatisfiable. The hard direction — every consistent is satisfiable, by a Henkin model construction — is the completeness theorem 42.01.06 pending, and the two together make and extensionally identical and consistency a synonym for "has a model."
Theorem 4 (alternative calculi: natural deduction, sequent calculus, cut). The Hilbert turnstile is reproduced by natural deduction, whose introduction/elimination rules for carry eigenvariable (proper-variable) conditions; -introduction discharges a fresh-variable derivation exactly as the generalization theorem does [van Dalen Ch. 2]. Gentzen's sequent calculus LK manipulates sequents with left/right rules per connective and quantifier, the structural rules, and the cut rule — from and infer — which is modus ponens internalized [Takeuti Ch. 1]. The cut-elimination theorem (Gentzen's Hauptsatz) shows every LK derivation transforms into a cut-free one, whence the subformula property and a syntactic consistency proof of pure logic. All three calculi prove the same theorems; the full development of cut-elimination and its consequences belongs to proof theory 42.05.01 pending.
Synthesis. Soundness is the foundational reason is pinned below , and putting these together it organizes all four developments: it is exactly what turns a proved contradiction into an unsatisfiable theory, so the consistency–satisfiability link of Theorem 3 is its corollary, and the eigenvariable condition of Theorems 1 and 4 is the precise syntactic shadow of the semantic fact that a free variable held fixed by a premise cannot be universally generalized — this is the central insight that the generalization restriction and the (A4) vacuous-quantification axiom both encode. The deduction theorem generalises the propositional deduction theorem of 42.01.01 pending to the first-order setting unchanged, because the lone rule is still modus ponens, while the generalization and constants theorems are the genuinely first-order additions, dual to one another across " free" versus " fresh." The equality axioms and the quotient-to-normal-model maneuver are the bridge from the bare term model to a structure where is identity, the maneuver completeness 42.01.06 pending performs in full. And the alternative calculi — natural deduction's eigenvariable rules and LK's cut, whose elimination is the namesake theorem of proof theory 42.05.01 pending — show the same syntactic-consequence relation computed three ways, so that soundness, proved once for the Hilbert system, transfers by mutual simulation. This is the syntactic half of the architecture: a meaning-free engine for , sound for the Tarski semantics of 42.01.04 pending, standing ready to be proved complete in 42.01.06 pending.
Full proof set Master
Proposition 1 (soundness, full statement). If then .
Proof. Fix with for all and a deduction . Induct on to show . Premise lines hold by hypothesis. Axiom lines hold by the validity lemma below. A modus-ponens line from inherits and from the inductive hypothesis; the -clause forces . At , ; as was an arbitrary model of , .
Validity lemma. Every generalization of a base axiom is valid iff is, so check the bases. (A1): a propositional tautology stays true under the truth values assigns its prime subformulas. (A2): gives , and the substitution lemma of 42.01.04 pending rewrites this as . (A3): universal hypotheses on and at every give at every . (A4): with , coincidence makes entail for all . (A5): identity is reflexive. (A6): equalizes the tuples in an atomic versus .
Proposition 2 (deduction theorem). iff .
Proof. () Adjoin and apply modus ponens. () Induct on a deduction from , proving . If is an axiom or in : from and the (A1) tautology , modus ponens gives . If : is an (A1) tautology. If is modus ponens of : from the inductive and and the (A1) tautology , two modus ponens steps give . The lone rule being modus ponens, no generalization-restriction can be violated. At , .
Proposition 3 (generalization theorem). If and is free in no member of , then .
Proof. Induct on a deduction from , proving . If is a logical axiom, then is a logical axiom (closure under prefixing ), so . If , then by hypothesis, so the (A4) axiom with modus ponens on the premise gives . If is modus ponens of , then by induction and ; the (A3) axiom with two modus ponens steps gives . At , . The hypothesis is used exactly at the premise case, where (A4) demands not free in .
Proposition 4 (generalization on constants). If and the constant occurs in no member of , then and for a variable occurring nowhere in the chosen deduction.
Proof. Take a deduction from and a variable in none of its lines. Replace every by throughout. A premise line is unchanged (). A logical axiom maps to a logical axiom: the tautology shape (A1) is preserved under a uniform constant-for-variable replacement, (A2)'s substitutability is preserved because is new and so substitutable wherever was, and (A3)–(A6) are shape-closed under the replacement. A modus-ponens step maps to a modus-ponens step since the replacement commutes with the connective . Hence is a deduction of from , giving . As the fresh is free in no member of , so Proposition 3 gives .
Proposition 5 (consistency from satisfiability, and finitary inconsistency). Every satisfiable is consistent; and is inconsistent iff some finite is inconsistent.
Proof. If has a model but is inconsistent, then and for some ; Proposition 1 gives and , so and , a contradiction. Hence satisfiable sets are consistent. For finitariness: if is inconsistent, fix deductions of some and from ; each is finite and cites finitely many premises, and the union of those cited premises is a finite subset with and , so is inconsistent. Conversely an inconsistent finite makes inconsistent since deductions from are deductions from .
Connections Master
Structures and Tarski's definition of truth
42.01.04pending supplies the semantics this calculus is matched against. The validity of the instantiation axiom (A2) is exactly the substitution lemma of42.01.04pending — converting semantic instantiation at into syntactic substitution — and the validity of the vacuous-quantification axiom (A4) is the coincidence lemma there. That unit owns the meaning of formulas; this unit owns the meaning-free engine that derives them, the two meeting in the soundness proof, where each axiom's validity is read off a semantic lemma of42.01.04pending.The completeness theorem for first-order logic, co-produced as
42.01.06pending, proves the converse inclusion this unit leaves open: Gödel's theorem gives , so that with the soundness of this unit the turnstile and the semantic consequence of42.01.04pending coincide. Henkin's construction extends a consistent to a maximal consistent set with witnesses and reads a normal model off the term algebra quotiented by provable equality — the quotient the equality axioms (A5)–(A6) of this unit make a congruence — so consistency (defined here) becomes synonymous with satisfiability.Compactness for first-order logic
42.01.02pending rests on the finitariness of consistency proved here: a deduction of a contradiction uses finitely many premises, so an inconsistent theory has an inconsistent finite subtheory, and through completeness42.01.06pending this becomes "a theory with no model has a finite subtheory with no model." The syntactic finiteness of this unit's deductions is the proof-theoretic engine behind that semantic theorem.Proof theory and cut-elimination
42.05.01pending takes the alternative calculi previewed in Theorem 4 as its subject: Gentzen's sequent calculus LK, where the cut rule internalizes the modus ponens of this unit, and the Hauptsatz that eliminates cut to yield the subformula property and a syntactic consistency proof. Natural deduction's eigenvariable conditions are the proof-theoretic form of this unit's generalization restriction, and the mutual simulation of the three calculi transfers the soundness proved here.
Historical & philosophical context Master
The axiomatic style of this calculus descends from Gottlob Frege's Begriffsschrift (1879), the first formal system with explicit axioms, a conditional and negation, the universal quantifier, and a single detachment rule, and from the predicate-logic axiomatizations of David Hilbert and Wilhelm Ackermann's Grundzüge der theoretischen Logik (1928), which posed the completeness of the calculus as an open problem [Enderton §2.4]. Kurt Gödel's 1930 dissertation Die Vollständigkeit der Axiome des logischen Funktionenkalküls answered it, proving that the Hilbert-style first-order calculus derives exactly the valid formulas; the soundness half — that derivations cannot overshoot validity — is the routine direction his theorem packages with the substantive converse.
Gerhard Gentzen's Untersuchungen über das logische Schließen (1934-35) introduced both natural deduction, with its introduction and elimination rules and eigenvariable conditions, and the sequent calculus LK, proving the cut-elimination Hauptsatz that gives a constructive consistency argument for pure logic [Takeuti Ch. 1]. The economical Hilbert systems and the deduction-theorem technique were standardized by Stephen Kleene's Introduction to Metamathematics (1952), by Joseph Shoenfield's Mathematical Logic (1967) [Shoenfield Ch. 2], and by Elliott Mendelson's textbook [Mendelson Ch. 2], while the natural-deduction presentation became standard through Dirk van Dalen's Logic and Structure [van Dalen Ch. 2]; the equality axioms and the normal-model reading trace to the same period's treatment of identity as a logical, rather than nonlogical, relation.
Bibliography Master
@book{enderton2001logic,
author = {Enderton, Herbert B.},
title = {A Mathematical Introduction to Logic},
edition = {2},
publisher = {Harcourt/Academic Press},
year = {2001}
}
@book{mendelson2015logic,
author = {Mendelson, Elliott},
title = {Introduction to Mathematical Logic},
edition = {6},
publisher = {CRC Press},
year = {2015}
}
@book{shoenfield1967logic,
author = {Shoenfield, Joseph R.},
title = {Mathematical Logic},
publisher = {Addison-Wesley},
year = {1967}
}
@book{vandalen2013logic,
author = {van Dalen, Dirk},
title = {Logic and Structure},
edition = {5},
publisher = {Springer},
year = {2013}
}
@book{takeuti1987prooftheory,
author = {Takeuti, Gaisi},
title = {Proof Theory},
edition = {2},
publisher = {North-Holland},
year = {1987}
}
@phdthesis{godel1930completeness,
author = {G\"{o}del, Kurt},
title = {Die Vollst\"{a}ndigkeit der Axiome des logischen Funktionenkalk\"{u}ls},
school = {University of Vienna},
year = {1930},
note = {Published in Monatshefte f\"{u}r Mathematik und Physik 37 (1930), 349--360}
}
@article{gentzen1935untersuchungen,
author = {Gentzen, Gerhard},
title = {Untersuchungen \"{u}ber das logische Schlie{\ss}en},
journal = {Mathematische Zeitschrift},
volume = {39},
year = {1935},
pages = {176--210, 405--431}
}
@book{frege1879begriffsschrift,
author = {Frege, Gottlob},
title = {Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens},
publisher = {Louis Nebert, Halle},
year = {1879}
}
@book{hilbertackermann1928,
author = {Hilbert, David and Ackermann, Wilhelm},
title = {Grundz\"{u}ge der theoretischen Logik},
publisher = {Springer},
year = {1928}
}
@book{kleene1952metamathematics,
author = {Kleene, Stephen Cole},
title = {Introduction to Metamathematics},
publisher = {North-Holland},
year = {1952}
}