Formal proofs

It is possible, though not very pleasant, to produce formal proofs in Peano arithmetic.

Consider, for example, the following proof of the fact that \({ 2 + 2 = 4 }\), which in the language we’re using is written as \({ [ ( 0 '' + 0 '' ) = 0 '''' ] }\).

\({ [ ∃ x . ( x = 0 ) ] }\)
\({ [ ∃ y . ( y = 0 ) ] }\)
\({ \{ ∀ x . [ ∃ y . ( y = x ' ) ] \} }\)
\({ \{ [ ∃ x . ( x = 0 ) ] ⊃ [ ∃ y . ( y = 0 ' ) ] \} }\)
\({ [ ∃ y . ( y = 0 ' ) ] }\)
\({ [ ∃ x . ( x = 0 ' ) ] }\)
\({ \{ [ ∃ x . ( x = 0 ' ) ] ⊃ [ ∃ y . ( y = 0 '' ) ] \} }\)
\({ [ ∃ y . ( y = 0 '' ) ] }\)
\({ [ ∃ x . ( x = 0 '' ) ] }\)
\({ \{ ∀ x . [ ( x + 0 ) = x ] \} }\)
\({ \{ [ ∃ x . ( x = 0 '' ) ] ⊃ [ ( 0 '' + 0 ) = 0 '' ] \} }\)
\({ [ ( 0 '' + 0 ) = 0 '' ] }\)
\({ [ ( 0 '' + 0 ) ' = 0 ''' ] }\)
\({ ( ∀ x . \{ ∀ y . [ ( x + y ' ) = ( x + y ) ' ] \} ) }\)
\({ ( [ ∃ x . ( x = 0 '' ) ] ⊃ \{ ∀ y . [ ( 0 '' + y ' ) = ( 0 '' + y ) ' ] \} ) }\)
\({ \{ ∀ y . [ ( 0 '' + y ' ) = ( 0 '' + y ) ' ] \} }\)
\({ \{ [ ∃ y . ( y = 0 ) ] ⊃ [ ( 0 '' + 0 ' ) = ( 0 '' + 0 ) ' ] \} }\)
\({ [ ( 0 '' + 0 ' ) = ( 0 '' + 0 ) ' ] }\)
\({ [ ( 0 '' + 0 ' ) = 0 ''' ] }\)
\({ [ ( 0 '' + 0 ' ) ' = 0 '''' ] }\)
\({ \{ [ ∃ y . ( y = 0 ' ) ] ⊃ [ ( 0 '' + 0 '' ) = ( 0 '' + 0 ' ) ' ] \} }\)
\({ [ ( 0 '' + 0 '' ) = ( 0 '' + 0 ' ) ' ] }\)
\({ [ ( 0 '' + 0 '' ) = 0 '''' ] }\)

Here 1 is an axiom, and 2 is the result of substitution on 1. 3 is an axiom, 4 is obtained from 3 by one of our quantifier rules, 5 is modus ponens applied to 1 and 4, and 6 is substitution on 6. 7 is a quantifier rule applied to 3, 8 is modus ponens applied to 6 and 9, and 9 is substitution on 8. 10 is an axiom, 11 is a quantifier rule applied to 10, and 12 is modus ponens on 9 and 11. 13 is derived from 12 by one of our arithmetic rules of inference. 14 is an axiom, 15 is derive from it by a quantifier rule, and 16 is modus ponens on 9 and 15. 17 is a quantifier rule applied to 16 and 18 is modus ponens on 2 and 17. 19 is derived from 13 and 18 by one of our rules of inference for equality. 20 comes from an arithmetic rule of inference on 19. 21 is a quantifier rule applied to 16 and 22 is modus ponens on 5 and 21. Finally, 23 is one of the rules of equality applied to 20 and 22.

Statements more complicated than \({ 2 + 2 = 4 }\) have correspondingly longer proofs. The following is a proof of \[ [ ∀ x . ( ∀ y . \{ ∃ z . [ z = ( x + y ) ] \} ] ) ] , \] i.e. the fact that the sum of any two natural numbers is a natural number.

\({ [ ∃ x . ( x = 0 ) ] }\)
\({ ( 0 = 0 ) }\)
\({ \{ [ ∃ x . ( x = 0 ) ] ∧ ( 0 = 0 ) \} }\)
\({ [ ∃ x . ( x = x ) ] }\)
\({ [ ∃ y . ( y = y ) ] }\)
\({ [ ∃ z . ( z = z ) ] }\)
\({ ( z = z ) }\)
\({ \{ [ ∃ z . ( z = z ) ] ∧ ( z = z ) \} }\)
\({ [ ∃ z . ( z = x ) ] }\)
\({ [ ∃ z . ( z = w ) ] }\)
\({ [ ∃ x . ( x = w ) ] }\)
\({ \{ ∀ x . [ ∃ y . ( y = x ' ) ] \} }\)
\({ \{ [ ∃ x . ( x = w ) ] ⊃ [ ∃ y . ( y = w ' ) ] \} }\)
\({ [ ∃ y . ( y = w ' ) ] }\)
\({ [ ∃ z . ( z = w ' ) ] }\)
\({ \{ ∀ x . [ ( x + 0 ) = x ] \} }\)
\({ \{ [ ∃ x . ( x = x ) ] ⊃ [ ( x + 0 ) = x ] \} }\)
\({ [ ( x + 0 ) = x ] }\)
\({ ( ∃ z . \{ [ z = ( x + 0 ) ] \} ) }\)
\({ . \quad ( ∃ z . \{ [ z = ( x + y ) ] \} ) }\)
\({ . \quad ( [ ∃ z . ( z = w ) ] ∧ \{ [ w = ( x + y ) ] \} ) }\)
\({ . \quad [ w = ( x + y ) ] }\)
\({ . \quad [ w ' = ( x + y ) ' ] }\)
\({ . \quad ( ∀ x . \{ ∀ y . [ ( x + y ' ) = ( x + y ) ' ] \} ) }\)
\({ . \quad ( [ ∃ x . ( x = x ) ] ⊃ \{ ∀ y . [ ( x + y ' ) = ( x + y ) ' ] \} ) }\)
\({ . \quad \{ ∀ y . [ ( x + y ' ) = ( x + y ) ' ] \} }\)
\({ . \quad \{ [ ∃ y . ( y = y ) ] ⊃ [ ( x + y ' ) = ( x + y ) ' ] \} }\)
\({ . \quad [ ( x + y ' ) = ( x + y ) ' ] }\)
\({ . \quad [ ( x + y ) ' = ( x + y ' ) ] }\)
\({ . \quad [ w ' = ( x + y ' ) ] }\)
\({ . \quad \{ [ ∃ z . ( z = w ' ) ] ∧ [ w ' = ( x + y ' ) ] \} }\)
\({ . \quad \{ ∃ z . [ z = ( x + y ' ) ] \} }\)
\({ \{ ∃ z . [ z = ( x + y ) ] \} ⊃ \{ ∃ z . [ z = ( x + y ' ) ] \} ) }\)
\({ \{ ∃ z . [ z = ( x + y ) ] \} }\)
\({ \{ ¬ [ ∃ y . ( y = y ) ] \} ∨ \{ ∃ z . [ z = ( x + y ) ] \} }\)
\({ [ ∃ y . ( y = y ) ] ⊃ \{ ∃ z . [ z = ( x + y ) ] \} }\)
\({ ( ∀ y . \{ ∃ z . [ z = ( x + y ) ] \} ) }\)
\({ [ \{ ¬ [ ∃ x . ( x = x ) ] \} ∨ ( ∀ y . \{ ∃ z . [ z = ( x + y ) ] \} ] ) }\)
\({ \{ [ ∃ x . ( x = x ) ] ⊃ ( ∀ y . \{ ∃ z . [ z = ( x + y ) ] \} ] ) \} }\)
\({ [ ∀ x . ( ∀ y . \{ ∃ z . [ z = ( x + y ) ] \} ] ) ] }\)

This one is somewhat more complicated than the previous one. A hypothesis is introduced at 20 and then discharged at 33. This, together with 19, allows us to use induction at 34.

Had we used a logic with existential presuppositions the proof could have been reduced to four lines. That’s largely because such a logic assumes that any expression we can write down refers to something in the domain, so the fact that we have a notation for addition already essentially assumes that the sum of any two natural numbers is a natural number. That’s very convenient, but unfortunately the same would apply to subtraction. If we introduce a notation for subtraction into a system based on such a logic then we can give a four line proof of \[ [ ∀ x . ( ∀ y . \{ ∃ z . [ z = ( x - y ) ] \} ] ) ] , \] but unfortunately this statement is false. The difference of natural numbers need not be a natural number. Formal systems for elementary arithmetic based on first order logic with existential suppositions avoid this problem by not having a notation for subtraction. But then there are important facts about elementary arithmetic which they can’t express. For example, the correct version of the statement above is \[ ( ∀ x . \{ ∀ y . [ ( x ≥ y ) ⊃ \{ ∃ z . [ z = ( x - y ) ] \} ] \} ) , \] which says that \({ x - y }\) is a natural number if \({ x ≥ y }\). This is a theorem in our system but can’t even be stated in a system with existential suppositions.

As we saw earlier, it’s possible to give a fairly concise statement within the language of Peano arithmetic of the fact that there are infinitely primes. It’s possible to give a formal proof as well, but it’s hardly an enjoyable exercise. Logicians, though, are generally much more interesting in figuring out what can or can’t be proved within a formal system than with actually supplying proofs. In other words, they tend to live in the world of semiformal proofs rather than formal proofs.