Lecture 15: Predicate Logic and Natural Deduction

Syntax

In propositional logic, the statements we are proving are completely abstract. To be able to prove programs correct, we need a logic that can talk about the things that programs compute on: integers, strings, tuples, datatype constructors, and functions. We'll enrich propositional logic with the ability to talk about these things, obtaining a version of predicate logic.

The syntax extends propositional logic with a few new expressions, shown in blue:

Predicates P,Q,R ::=
          ⊤             (* true *)
        | ⊥             (* false *)
        | ¬P            (* complement; equivalent to P ⇒ ⊥ *)
        | P ∧ Q         (* conjunction (and) *)
        | P ∨ Q         (* disjunction (or) *)
        | P ⇒ Q        (* implication (if-then) *) 
        | ∀x.P          (* P is true for all x. P can mention x*)
        | ∃x.P          (* There exists some x such that P is true *)
        | t₁ = t₂       (* t₁ is equal to t₂ *)
        | P(t₁,...,t_n)  (* n-ary predicate P is true for t₁,...,t_n *)

Terms t ::=     
          c             (* constants (integers, tuples, other values) *)
        | x             (* variables *)
        | f(t₁,...,t_n)  (* result of applying n-ary function f to t₁,...,t_n *)

Terms t stand for individual elements of some domain of objects we are reasoning about, such as the natural numbers. Predicates P are of type Boolean.

The formula ∀x.P means that the formula P is true for any choice of x. This is called universal quantification, and ∀ is the universal quantifier. The formula ∃x.P denotes existential quantification. It means that the formula P is true for some choice of x, though there may be more than one such x. Existential and universal quantifiers can be turned into each other using negation. The formula ∃x.P(x) implies ¬∀x.¬P(x), because if P is true of some x, then P cannot be false for all x. The converse is valid classically, but not intuitionistically. Similarly, the formula ∀x.P is equivalent to ¬∃x.¬P. These equivalences are generalizations of DeMorgan's laws to existential and universal quantifiers.

It is possible to restrict the range of quantifiers to quantify over some subset of the domain of possible values. For universal quantifiers, we use an implication ⇒, and for existential quantifiers, we use conjunction ∧. For example, if we wanted to say that all positive numbers x satisfy some property Q(x), we could write ∀x.x > 0 ⇒ Q(x). This works because the quantified formula is vacuously true for numbers not greater than 0. To say that there exists a positive number that satisfies Q, we can write ∃x.x > 0 ∧ Q(x).

Using quantifiers, we can express some interesting statements. For example, we can express the idea that a number n is prime in various logically equivalent ways:

Prime(n)	⇔	∀m. 1 < m ∧ m < n ⇒ ¬∃k. k*m = n
	⇔	¬∃m. 1 < m ∧ m < n ∧ ∃k. k*m = n
	⇔	¬∃m. ∃k. 1 < m ∧ m < n ∧ k*m = n

Rules for Quantifiers

Introduction and elimination rules can be defined for universal and existential quantifiers. In the following rules, P(a) refers to P(x) with all free occurrences of the variable x replaced by the term t and variable a, respectively.

rule name		intuition
∀	intro	() This rule can only be applied if x does not occur in Γ. We can conclude that P holds for all x if we choose an arbitrary* x and prove P(x).
∀	elim	If we've proven P holds for all x, we can conclude that P holds for any given x.
∃	intro	We can prove that there exists some x with property P by simply producing an a with property P.
∃	elim	(*) this rule can only be applied if a does not occur in Q. This is like ∨ elimination — we can only conclude something from the existence of an x if the conclusion Q doesn't depend on which x satisfies P.

The proviso (*) in the (∀-intro) and (∃-elim) rules is a restriction on the use of the rule. This restriction prevents us from doing unsound reasoning like the following:

This proof says that if a particular x is greater than 10, then every x is greater than 10, something that is clearly false! The problem is the use of ∀-intro: we are able to prove that x > 10, but not for an arbitrary x, only for the particular x we had already made assumptions about.

However, it is fine for the variable a to appear in an assumption that is made after the point where (∀-intro) is applied. For example,

In automated proof assistants that allow a user to develop natural deduction proofs by subgoaling, proofs are generated from bottom to top. In such systems, the proviso (*) can be enforced by generating a fresh variable a when either (∀-intro) or (∃-elim) is applied.

The rule (∀-elim) specializes the formula P(x) to a particular value t of x. (We require implicitly that t be of the right type to be substituted for x.) Since P holds for all x, it should hold for any particular choice of x, including t. The (∀-intro) rule formalizes the type of argument that starts, "Let a be an arbitrary element..." If one can prove a fact P(a) for arbitrarily chosen a, then P(x) holds for all x.

The rule (∃-intro) derives ∃x.P(x) because a witness t to the existential has been produced. Intuitively, if P(t) holds for some t, then certainly there exists an x such that P(x) holds. The idea behind rule (∃-elim) is that if Q can be shown without using any information about the witness a other than P(a), then the mere existence of an element satisfying P is enough to imply Q.

Reasoning with Equality

Predicate logic allows the use of arbitary predicates P. Equality (=) is such a predicate. It applies to two arguments; we can read t₁=t₂ as a predicate =(t₁,t₂). But in addition to the rules above for arbitrary predicates, equality has some special properties. The following three rules capture that equality is an equivalence relation: it is reflexive, symmetric, and transitive.

t = t

(reflexivity)

t₁ = t₂

t₂ = t₁

(symmetry)

t₁ = t₂	t₂ = t₃
t₁ = t₃

(transitivity)

Beyond being an equivalence relation, equality preserves meaning under substitution. If two things are equal, substituting one for the other in equal terms results in equal terms. This is known as Leibniz's rule (substitution of equals for equals):

t₁ = t₂
t{t₁/x} = t{t₂/x}

Leibniz's rule can also be applied to show propositions are logically equivalent :

t = t'
P{t/x} ⇔ P{t'/x}

For example, suppose we know y = x+1 and x(x+1)+(x+1) = (x+1)². Then we can use this rule to prove xy+(x+1) = y² by applying this rule with t = x+1, t' = y, and P = (xz+(x+1) = z²).

The same idea can be applied completely at the propositional level as well. If we can prove that two formulas are equivalent, they can be substituted for one another within any other formula.

Q ⇔ R
P{Q/A} ⇔ P{R/A}

This admissible rule can be very convenient for writing proofs, though anything we can prove with it can be proved using just the basic rules. It can be very handy when there is a large library of logical equivalences to draw upon, because it allows rewriting of deeply nested subformulas.

Reasoning on Integers and Other Sets

For reasoning about specific kinds of values, we need axioms that describe how those values behave. For example, the following axioms partly describe the integers and can be used to prove many facts about integers. In fact, they define a more general structure, a commutative ring, so anything proved with them holds for any commutative ring. These axioms are all considered to be implicitly universally quantified.

(x+y)+z = x+(y+z) (associativity of +)
x+y = y+x (commutativity of +)
(x*y)*z = x*(y*z) (associativity of *)
x*y = y*x (commutativity of *)
x*(y+z) = x*y+x*z (distributivity of * over +)
x + 0 = x (additive identity)
x + (-x) = 0 (additive inverse)
x*1 = x (multiplicative identity)
x*0 = 0 (annihilation)

These rules use a number of functions: +, *, -, 0, and 1 (we can think of 0 and 1 as functions that take zero arguments). These symbols are represented by the metavariable f in the grammar earlier.

Proving facts about arithmetic can be tedious. For our purposes, we will write proofs that do reasonable algebraic manipulations as a single step, e.g.:

(x+2)² = 2*x
x² = −2*x−4
(algebra)

This proof step can be done explicitly using the rules and axioms above, but it takes several steps.

(x+y)+z = x+(y+z)	(associativity of +)
x+y = y+x	(commutativity of +)
(xy)z = x(yz)	(associativity of *)
xy = yx	(commutativity of *)
x(y+z) = xy+x*z	(distributivity of * over +)
x + 0 = x	(additive identity)
x + (-x) = 0	(additive inverse)
x*1 = x	(multiplicative identity)
x*0 = 0	(annihilation)