Representability of Recursive Functions in Arithmetic
Anchor (Master): Enderton 2001 *A Mathematical Introduction to Logic* 2e §3.3-3.5 (representability, the proof predicate, the road to incompleteness); Boolos, Burgess and Jeffrey 2007 *Computability and Logic* 5e (Cambridge) Ch. 16 (the arithmetisation of recursive functions and Q-representability); Hájek and Pudlák 1993 *Metamathematics of First-Order Arithmetic* (Springer) Ch. I (Q, PA, the $\Sigma_n$/$\Pi_n$ arithmetical hierarchy over $\Delta_0$, $\Sigma_1$-completeness, the MRDP route to $\Sigma_1$-definability); Smorynski 1977 'The incompleteness theorems' in the *Handbook of Mathematical Logic* (North-Holland)
Intuition Beginner
A computer can add, multiply, and compare whole numbers. A logical theory of arithmetic is a fixed list of basic rules about those same operations, together with everything you can prove from them. These look like two unrelated worlds: one is a machine grinding through steps, the other is a stack of statements and proofs. This unit shows they are secretly the same world. Whenever a machine settles on a definite answer for definite inputs, a very weak arithmetic can prove the matching numerical fact.
Picture a tiny rulebook for whole numbers. It knows that adding zero changes nothing, that adding one more is the successor, that multiplying builds up from repeated adding, and a few more facts of that kind. It is almost laughably weak: it cannot even prove that addition is commutative in general. Yet for any specific sum or product, with specific numbers written out, it can step through the rules and reach the right total. Ask it about and it will prove the answer is . Ask about and it will prove .
The deep point is that "compute a function" can be turned into "prove a statement of arithmetic." A function is represented by a formula when the rulebook proves the formula true on exactly the right inputs and outputs, and false on all the wrong ones. The surprising theorem is that every function a machine can compute can be represented this way, in that same weak rulebook. Computation can be mirrored inside arithmetic.
Why does this matter? Once arithmetic can mirror any computation, it can mirror the computation that checks proofs of arithmetic itself. A theory that can talk about its own proofs is a theory that can be made to talk about itself — and that is the doorway to the incompleteness results that come next.
Visual Beginner
The picture is a bridge with two banks. On the left bank, a machine computes; on the right bank, a weak arithmetic proves. Representability is the bridge: every computation on the left has a matching proof on the right.
LEFT BANK BRIDGE RIGHT BANK
(a machine computes) (representability) (weak arithmetic Q proves)
input 2, 3 numerals S S 0 , S S S 0
| |
v v
add -> 5 ===================> Q proves: "S S 0 + S S S 0 = S S S S S 0"
| |
output 5 a finished proof, line by line
the SAME fact, said two ways:
machine: add(2,3) = 5
arithmetic: Q |- ( 2 + 3 = 5 ) (with numbers written as S S ... 0)A numeral is how arithmetic writes a number: start from and apply "successor" (, meaning add one) that many times. So is written and is written . The machine's "add and to get " becomes the arithmetic's "prove that equals ."
| machine does | arithmetic proves | matches? |
|---|---|---|
| yes | ||
| yes | ||
| (so the wrong answer is disproved) | the wrong answer is refuted |
The bridge runs both ways for facts: the right answer gets proved, every wrong answer gets refuted.
Worked example Beginner
We watch the weak rulebook prove one specific sum, , using only its rules about adding. Write numbers as numerals: is and is . The rulebook has two facts about addition we will use. First, adding zero changes nothing: . Second, adding a successor pushes the successor outside: .
Step 1. Start from . The second number is , which is — a successor. Apply "" with and . This rewrites as .
Step 2. Look inside at . Again the second number is a successor, . Apply the same rule with . This rewrites as . So far the whole expression is .
Step 3. Now the inside is . The second number is , so apply "." This gives . The whole expression is now .
Step 4. Count the successors: is , which is the numeral for . The rulebook has proved , that is, .
What this tells us: the rulebook never needed any general theory of addition. It only peeled the second numeral down one successor at a time until it hit zero, then read off the answer. Every specific sum works this way, and the same style of step-by-step peeling handles every specific product too. That is exactly why a machine's definite answer always has a matching proof: the proof simply retraces the computation.
Check your understanding Beginner
Formal definition Intermediate+
Fix the language of arithmetic and its standard structure 42.01.04 pending. For each the numeral is the closed term , defined by and ; it is the canonical -name of , with .
Robinson arithmetic is the -theory axiomatised by the universal closures of [Enderton §3.3]:
Peano arithmetic extends (with Q3 then derivable) by the first-order induction schema: for every -formula , $$ \forall \bar z,\Big[\big(\varphi(0,\bar z) \wedge \forall x,(\varphi(x,\bar z) \to \varphi(Sx,\bar z))\big) \to \forall x,\varphi(x,\bar z)\Big]. $$ Both are sound for : and , so every theorem of either is true in the standard model.
A relation is represented in a theory by a formula when for all ,
$$
\bar a \in R \implies T \vdash \rho(\underline{a_1}, \dots, \underline{a_k}), \qquad \bar a \notin R \implies T \vdash \neg\rho(\underline{a_1}, \dots, \underline{a_k}).
$$
A function is represented in by when for all ,
$$
T \vdash \forall y,\big(\varphi(\underline{a_1}, \dots, \underline{a_k}, y) \leftrightarrow y = \underline{f(\bar a)}\big).
$$
Equivalently proves the correct instance and proves uniqueness . The notion of recursive function (closure of the zero, successor, and projection functions under composition, primitive recursion, and minimisation) and recursive relation is imported from the theory of computation 42.04.01; the present unit takes that definition as given and builds the formula that captures each closure step inside .
A formula is (bounded) when every quantifier in it is bounded, of the form or for a term not containing ; it is when of the form with , and when with . Bounded quantification is decided by finite search, so the truth value of a closed sentence in is mechanically computable.
Counterexamples to common slips Intermediate+
"Representable means true in ." Representability is a provability condition, strictly stronger than truth on numerals. A formula can be true of in the standard model yet fail to be proved or refuted by on some numerals. The theorem is that recursive relations admit a formula actually decides at every numeral, not merely one verifies.
" proves induction for specific numbers, so it is almost ." has no induction schema at all. It proves each numerical instance of a recursive computation by finite term-rewriting, but it cannot prove or . Representability deliberately needs only this induction-free strength.
"Function representability is just the graph being represented as a relation." Representing the graph as a relation gives the two implications on numerals; function representability additionally demands prove uniqueness, , an internal functionality that relation-representability does not supply.
"-completeness means proves every true sentence." It proves every true sentence. True sentences — for instance true but unprovable consistency statements — are exactly where completeness fails; the asymmetry between (existential, witness-checkable) and (universal) is the whole content.
Key theorem with proof Intermediate+
The signature theorem is that the weak, induction-free theory already represents every recursive function. Its engine is the arithmetisation of finite sequences: primitive recursion threads a function value through a growing list of earlier values, and to define it inside arithmetic one must quantify over such lists using only numbers. Gödel's -function does exactly this.
Lemma (-function). Define , the remainder of on division by . For every finite sequence of natural numbers there exist with for all . Moreover is representable in by a formula [Enderton §3.3].
Proof. Put where , and let for . The are pairwise coprime: if a prime divided both and with , it would divide their difference ; since forces , divides , so and , contradicting . Each because . By the Chinese remainder theorem there is with for all ; as , the least nonnegative residue is itself, so . For representability, is the unique with and , a condition; unfolds to the formula .
Theorem (representability). Every recursive function is representable in , and every recursive relation is representable in .
Proof. Induct on the construction of as a recursive function 42.04.01. The zero function is represented by ; the successor by ; the projection by . For each, proves the value at any numeral and proves uniqueness because the value formula already has the shape , which is its own functionality statement.
Composition. If are represented by and , set $$ \varphi(\bar x, y) := \exists w_1 \cdots \exists w_m,\big(\gamma_1(\bar x, w_1) \wedge \cdots \wedge \gamma_m(\bar x, w_m) \wedge \eta(w_1, \dots, w_m, y)\big). $$ Given , let and . Each forces provably, and then forces ; proves and the uniqueness, so is represented.
Primitive recursion. Suppose and with represented by . The value is determined by the sequence with . Encode that sequence by a single pair through the -lemma and set $$ \varphi(\bar x, t, y) := \exists c,\exists d,\Big[\beta(c,d,0) = \text{the value of } g \wedge \forall i < t,\big(\beta(c,d,Si) = h(\bar x, i, \beta(c,d,i))\big) \wedge \beta(c,d,t) = y\Big], $$ where each occurrence of and is replaced by its representing formula and by its formula. For numerals put and choose coding by the -lemma; proves each conjunct on the witnessing and proves , and uniqueness follows because proves the bounded conjunction pins to regardless of the coding pair. So is represented.
Minimisation. If with represented by and total where used, set $$ \varphi(\bar x, y) := \gamma(\bar x, y, 0) \wedge \forall z < y,\neg,\gamma(\bar x, z, 0). $$ For with , proves and, for each , ; using Q8 and the provable fact , collapses the bounded to that finite conjunction and proves it, giving and the uniqueness from the leastness. A recursive relation is represented by representing its characteristic function and reading membership off the value versus .
Bridge. This theorem is the foundational reason a theory of numbers can host a theory of computation: proves, numeral by numeral, exactly what a machine computes, so the recursive functions of 42.04.01 live inside arithmetic as provably-functional formulas. It builds toward the arithmetisation of syntax and the proof predicate , a primitive recursive relation that is therefore representable here, and it appears again in the first incompleteness theorem 42.01.09 pending, where the diagonal lemma feeds a representing formula back into itself. The -function is the central insight: by trading quantification over finite sequences for quantification over two numbers it makes primitive recursion arithmetically definable, which is exactly the step that lets unbounded computation be mirrored by a bounded coding. Putting these together, the weakness of — no induction at all — becomes a feature: incompleteness will follow already for any consistent theory extending this minimal base, so the phenomenon cannot be blamed on the strength of .
Exercises Intermediate+
Advanced results Master
The representability theorem supports four developments: the optimality of as a base, the exact strength of -completeness, the equivalence of -definability with recursive enumerability that runs through the MRDP theorem, and the arithmetisation of provability that opens onto incompleteness.
Theorem 1 (representable recursive in ; optimality of the base). A function is representable in iff it is recursive, and likewise for relations [Boolos-Burgess-Jeffrey Ch. 16]. The forward direction is the Key theorem; the reverse holds because a representing formula gives an effective procedure — search for a -proof of over , which by representability halts at — so representability in any recursively axiomatised consistent theory implies recursiveness. The base cannot be weakened past in the relevant sense: the eight axioms are exactly what the induction over composition, primitive recursion (via ), and minimisation (via Q8) consume, and any consistent theory in which all recursive functions are representable interprets . Representability is thus an absolute, theory-independent boundary coinciding with computability, and it holds already without induction.
Theorem 2 (-completeness and its sharp boundary). Every true sentence is a theorem of , and the bound is exact: correctly decides every closed sentence, true existentials are provable by numeral witnesses, but true sentences need not be provable [Hájek-Pudlák Ch. I]. The proof reduces a sentence to a closed instance at a numeral witness and runs the bounded-evaluation lemma, which proves by metatheoretic induction on bounded formulas that agrees with on every closed sentence. The asymmetry is the engine of incompleteness: a consistency statement is , true (when is consistent) but unprovable, and the first incompleteness sentence is for the same reason — its negation would be a false sentence, which -completeness would force to disprove.
Theorem 3 (the /recursively-enumerable identification and MRDP). A relation is -definable in iff it is recursively enumerable iff its characteristic-of-membership function is the domain of a partial recursive function [Hájek-Pudlák Ch. I]. The forward implication uses representability to convert a recursive enumeration into an existential over proof-or-computation codes; the converse is Kleene's normal form, over a Kleene- predicate. The Matiyasevich-Robinson-Davis-Putnam theorem sharpens this dramatically: every (equivalently c.e.) relation is Diophantine, defined by for polynomials with natural-number coefficients, so the existential quantifier prefix can be collapsed onto a single polynomial equation (co-produced as 42.04.07). Representability in is the textbook-level half of this tower; MRDP is its number-theoretic apex, and together they make "computably enumerable" and "arithmetically existential" one notion.
Theorem 4 (arithmetisation of syntax and the proof predicate). Fix a Gödel numbering assigning numbers to terms, formulas, and finite sequences of formulas. The relations " codes a formula," " codes a proof in of the formula coded by ," and the substitution function are all primitive recursive for any recursively axiomatised [Enderton §3.5]. By the representability theorem each is representable in : there is a formula and the provability predicate , and a representing formula for . This is the arithmetisation groundwork: the theory's own syntax becomes a recursive structure on numbers, and proves the matching numerical facts about it. The diagonal lemma — for every a sentence with — is then a one-line consequence of the representability of , and feeding yields the Gödel sentence (developed in the co-produced 42.01.09 pending).
Synthesis. The representability theorem is the foundational reason computation can be mirrored inside arithmetic, and putting these together it controls all four developments: the -function makes primitive recursion arithmetically definable, which is exactly what lets every recursive function become a provably-functional formula of , and this is the central insight that the whole incompleteness phenomenon rests on a theory with no induction. Theorem 1's identification of representable with recursive is dual to the computability-side fact that representability gives an effective halting search, so the boundary is the same object seen from logic and from computation 42.04.01. The -completeness of Theorem 2 generalises the numeral-by-numeral provability of the Key theorem from individual function values to all existential truths, and its failure is the bridge to the unprovable consistency statement; Theorem 3 widens -definability into the recursively enumerable sets and, through MRDP 42.04.07, onto Diophantine equations, so that "halts" (the halting problem of 42.04.02) and "satisfies a polynomial equation" and "is provable" are one notion in three languages. Theorem 4 turns the representing apparatus on the theory's own syntax: the proof predicate is primitive recursive, hence representable, hence arithmetic can speak of its own provability — and the diagonal lemma closes the loop, so that the self-reference defeating the halting checker 42.04.02 reappears as a sentence of arithmetic asserting its own unprovability 42.01.09 pending.
Full proof set Master
Proposition 1 (-function lemma). For and any there are with for ; and is -representable in .
Proof. Let , , . Coprimality: a common prime of () divides ; from and one gets , so , hence and , a contradiction. Bound: . CRT yields with ; since , . The graph is , all quantifiers bounded by , hence ; proves it on numerals because the division identity and the strict inequality are closed facts, decided by in accordance with .
Proposition 2 (initial functions and composition). Zero, successor, and projections are representable in , and the representable functions are closed under composition.
Proof. by , by , by ; in each case proves the unique value at any numeral tuple directly from the term identities and the reflexivity of equality. For composition with and representing formulas , set . At , write , : and the uniqueness force , and with its uniqueness force . Hence and .
Proposition 3 (primitive recursion). If are representable then so is defined by , .
Proof. Let represent . Using Proposition 1, set $$ \varphi(\bar x, t, y) := \exists c,\exists d,\Big[\exists v,(\beta(c,d,0) = v \wedge \gamma(\bar x, v)) \wedge \forall i < t,\exists u \exists u',(\beta(c,d,i) = u \wedge \beta(c,d,Si) = u' \wedge \eta(\bar x, i, u, u')) \wedge \beta(c,d,t) = y\Big]. $$ At numerals , set , , , and pick coding . proves the base conjunct via , each bounded- conjunct via and the -graph (the bounded collapsing to the explicit finite conjunction since ), and the final . So . Uniqueness: any with forces, through the same bounded conjunction and the provable functionality of and , that , hence ; thus .
Proposition 4 (minimisation). If is representable and is total, then is representable in .
Proof. Let represent and set . At with : gives ; for each , gives by representability of as a function (its value at is a numeral , and proves distinct numerals unequal). The bounded collapses to via (provable from Q8 and the numeral case analysis), which proves. Hence , and uniqueness follows because any satisfying has and is below none of the witnesses, forcing by the leastness and numeral trichotomy.
Proposition 5 (-completeness of ). Every true sentence is provable in .
Proof. First, the bounded-evaluation lemma: for every closed sentence , and , by metatheoretic induction on . Atomic closed equalities and inequalities between closed terms evaluate to numerals, and proves when and when (from Q1, Q2), and decides from Q8; Boolean connectives pass through; a bounded quantifier is equivalent over to the finite disjunction (using ), and to the finite conjunction, reducing to the inductive hypothesis on each closed instance. Now a sentence true in has a numeral witness with ; by the lemma , and existential introduction gives .
Proposition 6 (the proof predicate is representable; provability is ). For recursively axiomatised the relation is primitive recursive, hence representable in by a formula, and is .
Proof. Gödel numbering renders " codes an -formula" and " codes a finite sequence of formulas each of which is a logical axiom, an axiom of , or follows from earlier ones by a rule, the last being the formula coded by " as primitive recursive predicates: each clause is a bounded check over the components of , decidable by primitive recursion on the sequence-coding (itself built from the -function of Proposition 1). Being primitive recursive, is recursive, so by Propositions 2-4 it is representable in ; as the checking is bounded by the representing formula is . Then is by definition. By Proposition 5, every true instance — that is, every actual theorem — is -provably so in , the formal counterpart of "if then ."
Connections Master
Structures and Tarski semantics
42.01.04pending supply the standard model and the satisfaction relation against which truth of and sentences is measured, and the numeral is the closed term whose value makes "true on numerals" precise. That unit owns the semantics of the arithmetic language; this unit adds the provability layer over it, the gap between and that -completeness measures and that incompleteness exploits.Models of computation and the recursive functions
42.04.01define the recursive functions whose representability is the theorem of this unit; the inductive closure conditions — composition, primitive recursion, minimisation — are exactly the cases of the representability induction, and the equivalence "representable in iff recursive" identifies that unit's notion of computability with this unit's notion of provable-numeral-functionality. The -function is the device that imports primitive recursion, defined there, into the formula language here.The halting problem and the recursion theorem
42.04.02are the computational shadow of this arithmetisation: the self-application that defeats the halting checker is the same diagonal that, once the proof predicate is representable here, produces the Gödel sentence; and the undecidability of arithmetic — — is immediate once provability is and representability lets track computation. Church's theorem on the Entscheidungsproblem routes through exactly the representability established here.Gödel's incompleteness theorems
42.01.09pending, co-produced with this unit, are its destination: the diagonal lemma is a one-line consequence of the representability of the substitution function proved here, and the first incompleteness sentence is built from the provability predicate of Proposition 6. The second incompleteness theorem additionally needs the Hilbert-Bernays-Löb derivability conditions, whose first condition is precisely the -completeness of Proposition 5 formalised inside .
Historical & philosophical context Master
The representability of the recursive functions in a weak arithmetic is the technical core of Kurt Gödel's 1931 paper Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I, where the -function (Gödel's "") was introduced precisely to express primitive recursion inside arithmetic without a primitive sequence type, using the Chinese remainder theorem to code finite sequences by pairs of numbers [Enderton §3.3]. Gödel proved that the primitive recursive functions are definable in the arithmetic of Principia; the extension to all recursive functions and the isolation of the minimal theory followed from the convergence of the competing analyses of computability — Gödel and Herbrand's general recursive functions, Church's -definability, and Turing's machines — into the single class fixed by the Church-Turing thesis.
Raphael Robinson isolated the finitely axiomatised theory in 1950 (in "An essentially undecidable axiom system"), showing that this induction-free fragment already suffices for representability and hence for essential undecidability, so that the incompleteness and undecidability phenomena attach to a theory far weaker than full Peano arithmetic [Boolos-Burgess-Jeffrey Ch. 16]. Alfred Tarski, Andrzej Mostowski, and Robinson systematised the method of essentially undecidable theories in Undecidable Theories (1953), making -representability the standard route to undecidability results across mathematics. The sharpening of -definability to Diophantine definability — that every recursively enumerable set is the solution set of a polynomial equation — was completed by Yuri Matiyasevich in 1970, building on Julia Robinson, Martin Davis, and Hilary Putnam, resolving Hilbert's tenth problem in the negative [Hájek-Pudlák Ch. I].
Bibliography Master
@book{enderton2001logic,
author = {Enderton, Herbert B.},
title = {A Mathematical Introduction to Logic},
edition = {2},
publisher = {Harcourt/Academic Press},
year = {2001}
}
@article{godel1931,
author = {G\"{o}del, Kurt},
title = {\"{U}ber formal unentscheidbare S\"{a}tze der {Principia Mathematica} und verwandter Systeme {I}},
journal = {Monatshefte f\"{u}r Mathematik und Physik},
volume = {38},
year = {1931},
pages = {173--198}
}
@article{robinson1950essentially,
author = {Robinson, Raphael M.},
title = {An essentially undecidable axiom system},
journal = {Proceedings of the International Congress of Mathematicians (Cambridge, Mass.)},
volume = {1},
year = {1950},
pages = {729--730}
}
@book{tarskimostowskirobinson1953,
author = {Tarski, Alfred and Mostowski, Andrzej and Robinson, Raphael M.},
title = {Undecidable Theories},
publisher = {North-Holland},
year = {1953}
}
@book{boolosburgessjeffrey2007,
author = {Boolos, George S. and Burgess, John P. and Jeffrey, Richard C.},
title = {Computability and Logic},
edition = {5},
publisher = {Cambridge University Press},
year = {2007}
}
@book{hajekpudlak1993,
author = {H\'{a}jek, Petr and Pudl\'{a}k, Pavel},
title = {Metamathematics of First-Order Arithmetic},
series = {Perspectives in Mathematical Logic},
publisher = {Springer},
year = {1993}
}
@article{matiyasevich1970,
author = {Matiyasevich, Yuri V.},
title = {Enumerable sets are {Diophantine}},
journal = {Soviet Mathematics Doklady},
volume = {11},
year = {1970},
pages = {354--358}
}
@incollection{smorynski1977incompleteness,
author = {Smory\'{n}ski, Craig},
title = {The incompleteness theorems},
booktitle = {Handbook of Mathematical Logic},
editor = {Barwise, Jon},
publisher = {North-Holland},
year = {1977},
pages = {821--865}
}