SubSubtyping

Concepts

A Motivating Example

Suppose we are writing a program involving two record types defined as follows:
      Person  = {name:String, age:Nat}
      Student = {name:String, age:Nat, gpa:Nat}
Problem: In the pure STLC with records, the following term is not typable:
    (\r:Person. (r.age)+1) {name="Pat",age=21,gpa=1}
This is a shame.

Idea: Introduce subtyping, formalizing the observation that "some types are better than others."

Safe substitution principle:
  • S is a subtype of T, written S <: T, if a value of type S can safely be used in any context where a value of type T is expected.

Subtyping and Object-Oriented Languages

Subtyping plays a fundamental role in OO programming languages.
Roughly, an object can be thought of as a record of functions ("methods") and data values ("fields" or "instance variables").
  • Invoking a method m of an object o on some arguments a1..an consists of projecting out the m field of o and applying it to a1..an.
The type of an object is a class (or an interface).
Classes are related by the subclass relation.
  • An object belonging to a subclass must provide all the methods and fields of one belonging to a superclass, plus possibly some more.
  • Thus a subclass object can be used anywhere a superclass object is expected.
  • Very handy for organizing large libraries

Of course, real OO languages have lots of other features...
  • mutable fields
  • private and other visibility modifiers
  • method inheritance
  • static components
  • etc., etc.
We'll ignore all these and focus on core mechanisms.

The Subsumption Rule

Our goal for this chapter is to add subtyping to the simply typed lambda-calculus (with some of the basic extensions from MoreStlc). This involves two steps:
  • Defining a binary subtype relation between types.
  • Enriching the typing relation to take subtyping into account.
The second step is actually very simple. We add just a single rule to the typing relation: the so-called rule of subsumption:
Gamma ⊢ t ∈ S     S <: T (T_Sub)  

Gamma ⊢ t ∈ T
This rule says, intuitively, that it is OK to "forget" some of what we know about a term.

The Subtype Relation

The first step -- the definition of the relation S <: T -- is where all the action is. Let's look at each of the clauses of its definition.

Structural Rules

To start off, we impose two "structural rules" that are independent of any particular type constructor: a rule of transitivity, which says intuitively that, if S is better (richer, safer) than U and U is better than T, then S is better than T...
S <: U    U <: T (S_Trans)  

S <: T
... and a rule of reflexivity, since certainly any type T is as good as itself:
   (S_Refl)  

T <: T

Products

Now we consider the individual type constructors, one by one, beginning with product types. We consider one pair to be a subtype of another if each of its components is.
S1 <: T1    S2 <: T2 (S_Prod)  

S1 * S2 <: T1 * T2

Arrows

Suppose we have functions f and g with these types:
       f : CStudent
       g : (CPerson) → D
Is it safe to allow the application g f?
Yes.
So we want:
      CStudent <: CPerson
I.e., arrow is covariant in its right-hand argument.

Now suppose we have:
       f : PersonC
       g : (StudentC) → D
Is it safe to allow the application g f?
Again yes.
So we want:
      PersonC <: StudentC
I.e., arrow is contravariant in its left-hand argument.

Putting these together...
T1 <: S1    S2 <: T2 (S_Arrow)  

S1 -> S2 <: T1 -> T2

Suppose we have S <: T and U <: V. Which of the following subtyping assertions is false?
(1) S×U <: T×V
(2) TU <: SU
(3) (SU) (S×V) <: (SU) (T×U)
(4) (T×U) V <: (S×U) V
(5) SU <: SV

Suppose again that we have S <: T and U <: V. Which of the following is incorrect?
(1) (TT)*U <: (ST)*V
(2) TU <: SV
(3) (SU) (SV) <: (TU) (TV)
(4) (SV) V <: (TU) V
(5) S (VU) <: S (UU)

Records

What about subtyping for record types?

The basic intuition is that it is always safe to use a "bigger" record in place of a "smaller" one. That is, given a record type, adding extra fields will always result in a subtype. If some code is expecting a record with fields x and y, it is perfectly safe for it to receive a record with fields x, y, and z; the z field will simply be ignored. For example,
    {name:String, age:Nat, gpa:Nat} <: {name:String, age:Nat}
    {name:String, age:Nat} <: {name:String}
    {name:String} <: {}
This is known as "width subtyping" for records.

We can also create a subtype of a record type by replacing the type of one of its fields with a subtype. If some code is expecting a record with a field x of type T, it will be happy with a record having a field x of type S as long as S is a subtype of T. For example,
    {x:Student} <: {x:Person}
This is known as "depth subtyping".

Finally, although the fields of a record type are written in a particular order, the order does not really matter. For example,
    {name:String,age:Nat} <: {age:Nat,name:String}
This is known as "permutation subtyping".

We could formalize these requirements in a single subtyping rule for records as follows:
forall jk in j1..jn,
exists ip in i1..im, such that
jk=ip and Sp <: Tk (S_Rcd)  

{i1:S1...im:Sm} <: {j1:T1...jn:Tn}
That is, the record on the left should have all the field labels of the one on the right (and possibly more), while the types of the common fields should be in the subtype relation.
However, this rule is rather heavy and hard to read, so it is often decomposed into three simpler rules, which can be combined using S_Trans to achieve all the same effects.

First, adding fields to the end of a record type gives a subtype:
n > m (S_RcdWidth)  

{i1:T1...in:Tn} <: {i1:T1...im:Tm}
We can use S_RcdWidth to drop later fields of a multi-field record while keeping earlier fields, showing for example that {age:Nat,name:String} <: {age:Nat}.

Second, subtyping can be applied inside the components of a compound record type:
S1 <: T1 ... Sn <: Tn (S_RcdDepth)  

{i1:S1...in:Sn} <: {i1:T1...in:Tn}
For example, we can use S_RcdDepth and S_RcdWidth together to show that {y:Student, x:Nat} <: {y:Person}.

Third, subtyping can reorder fields. For example, we want {name:String, gpa:Nat, age:Nat} <: Person. (We haven't quite achieved this yet: using just S_RcdDepth and S_RcdWidth we can only drop fields from the end of a record type.) So we add:
{i1:S1...in:Sn} is a permutation of {j1:T1...jn:Tn} (S_RcdPerm)  

{i1:S1...in:Sn} <: {j1:T1...jn:Tn}

It is worth noting that full-blown language designs may choose not to adopt all of these subtyping rules. For example, in Java:
  • Each class member (field or method) can be assigned a single index, adding new indices "on the right" as more members are added in subclasses (i.e., no permutation for classes).
  • A class may implement multiple interfaces -- so-called "multiple inheritance" of interfaces (i.e., permutation is allowed for interfaces).
  • In early versions of Java, a subclass could not change the argument or result types of a method of its superclass (i.e., no depth subtyping or no arrow subtyping, depending how you look at it).

Top

Finally, it is convenient to give the subtype relation a maximum element -- a type that lies above every other type and is inhabited by all (well-typed) values. We do this by adding to the language one new type constant, called Top, together with a subtyping rule that places it above every other type in the subtype relation:
   (S_Top)  

S <: Top
The Top type is an analog of the Object type in Java and C#.

Summary

In summary, we form the STLC with subtyping by starting with the pure STLC (over some set of base types) and then...
  • adding a base type Top,
  • adding the rule of subsumption
    Gamma ⊢ t ∈ S     S <: T (T_Sub)  

    Gamma ⊢ t ∈ T
    to the typing relation, and
  • defining a subtype relation as follows:
    S <: U    U <: T (S_Trans)  

    S <: T
       (S_Refl)  

    T <: T
       (S_Top)  

    S <: Top
    S1 <: T1    S2 <: T2 (S_Prod)  

    S1 * S2 <: T1 * T2
    T1 <: S1    S2 <: T2 (S_Arrow)  

    S1 -> S2 <: T1 -> T2
    n > m (S_RcdWidth)  

    {i1:T1...in:Tn} <: {i1:T1...im:Tm}
    S1 <: T1 ... Sn <: Tn (S_RcdDepth)  

    {i1:S1...in:Sn} <: {i1:T1...in:Tn}
    {i1:S1...in:Sn} is a permutation of {j1:T1...jn:Tn} (S_RcdPerm)  

    {i1:S1...in:Sn} <: {j1:T1...jn:Tn}
Suppose we have S <: T and U <: V. Which of the following subtyping assertions is false?
(1) S×U <: Top
(2) {i1:S,i2:T}->U <: {i1:S,i2:T,i3:V}->U
(3) (ST) (Top Top) <: (ST) Top
(4) (Top Top) V <: Top V
(5) S {i1:U,i2:V} <: S {i2:V,i1:U}
How about these?
(1) {i1:Top} <: Top
(2) Top (Top Top) <: Top Top
(3) {i1:T} {i1:T} <: {i1:T,i2:S} Top
(4) {i1:T,i2:V,i3:V} <: {i1:S,i2:U} × {i3:V}
(5) Top {i1:U,i2:V} <: {i1:S} {i2:V,i1:V}

What is the smallest type T that makes the following assertion true?
       a:A ⊢ (\p:(A×T). (p.snd) (p.fst)) (a, \z:A.z) \in A
(1) Top
(2) A
(3) TopTop
(4) TopA
(5) AA
(6) ATop
What is the largest type T that makes the following assertion true?
       a:A ⊢ (\p:(A×T). (p.snd) (p.fst)) (a, \z:A.z) \in A
(1) Top
(2) A
(3) TopTop
(4) TopA
(5) AA
(6) ATop
"The type Bool has no proper subtypes." (I.e., the only type smaller than Bool is Bool itself.)
(1) True
(2) False
"Suppose S, T1, and T2 are types with S <: T1 T2. Then S itself is an arrow type -- i.e., S = S1 S2 for some S1 and S2 -- with T1 <: S1 and S2 <: T2."
(1) True
(2) False

Formal Definitions

Most of the definitions needed to formalize what we've discussed above -- in particular, the syntax and operational semantics of the language -- are identical to what we saw in the last chapter. We just need to extend the typing relation with the subsumption rule and add a new Inductive definition for the subtyping relation. Let's first do the identical bits.

Core Definitions

Syntax


(* (Omitting records, to avoid dealing with ... stuff.) *)

Inductive ty : Type :=
  | Top : ty
  | Bool : ty
  | Base : stringty
  | Arrow : tytyty
  | Unit : ty
.

Inductive tm : Type :=
  | var : stringtm
  | app : tmtmtm
  | abs : stringtytmtm
  | tru : tm
  | fls : tm
  | test : tmtmtmtm
  | unit : tm
.

Substitution

The definition of substitution remains exactly the same as for the pure STLC.

Fixpoint subst (x:string) (s:tm) (t:tm) : tm :=
  match t with
  | var y
      if eqb_string x y then s else t
  | abs y T t1
      abs y T (if eqb_string x y then t1 else (subst x s t1))
  | app t1 t2
      app (subst x s t1) (subst x s t2)
  | tru
      tru
  | fls
      fls
  | test t1 t2 t3
      test (subst x s t1) (subst x s t2) (subst x s t3)
  | unit
      unit
  end.

Notation "'[' x ':=' s ']' t" := (subst x s t) (at level 20).

Reduction

Likewise the definitions of the value property and the step relation.

Inductive value : tmProp :=
  | v_abs : x T t,
      value (abs x T t)
  | v_true :
      value tru
  | v_false :
      value fls
  | v_unit :
      value unit
.

Hint Constructors value.


Reserved Notation "t1 '-->' t2" (at level 40).

Inductive step : tmtmProp :=
  | ST_AppAbs : x T t12 v2,
         value v2
         (app (abs x T t12) v2) --> [x:=v2]t12
  | ST_App1 : t1 t1' t2,
         t1 --> t1'
         (app t1 t2) --> (app t1' t2)
  | ST_App2 : v1 t2 t2',
         value v1
         t2 --> t2'
         (app v1 t2) --> (app v1 t2')
  | ST_TestTrue : t1 t2,
      (test tru t1 t2) --> t1
  | ST_TestFalse : t1 t2,
      (test fls t1 t2) --> t2
  | ST_Test : t1 t1' t2 t3,
      t1 --> t1'
      (test t1 t2 t3) --> (test t1' t2 t3)
where "t1 '-->' t2" := (step t1 t2).

Hint Constructors step.

Subtyping

The definition of subtyping is just what we sketched in the motivating discussion.

Reserved Notation "T '<:' U" (at level 40).

Inductive subtype : tytyProp :=
  | S_Refl : T,
      T <: T
  | S_Trans : S U T,
      S <: U
      U <: T
      S <: T
  | S_Top : S,
      S <: Top
  | S_Arrow : S1 S2 T1 T2,
      T1 <: S1
      S2 <: T2
      (Arrow S1 S2) <: (Arrow T1 T2)
where "T '<:' U" := (subtype T U).
Note that we don't need any special rules for base types (Bool and Base): they are automatically subtypes of themselves (by S_Refl) and Top (by S_Top), and that's all we want.

Hint Constructors subtype.

Typing

The only change to the typing relation is the addition of the rule of subsumption, T_Sub.

Definition context := partial_map ty.

Reserved Notation "Gamma '⊢' t '∈' T" (at level 40).

Inductive has_type : contexttmtyProp :=
  (* Same as before *)
  | T_Var : Gamma x T,
      Gamma x = Some T
      Gammavar x \in T
  | T_Abs : Gamma x T11 T12 t12,
      (x > T11 ; Gamma) ⊢ t12 \in T12
      Gammaabs x T11 t12 \in Arrow T11 T12
  | T_App : T1 T2 Gamma t1 t2,
      Gammat1 \in Arrow T1 T2
      Gammat2 \in T1
      Gammaapp t1 t2 \in T2
  | T_True : Gamma,
       Gammatru \in Bool
  | T_False : Gamma,
       Gammafls \in Bool
  | T_Test : t1 t2 t3 T Gamma,
       Gammat1 \in Bool
       Gammat2 \in T
       Gammat3 \in T
       Gammatest t1 t2 t3 \in T
  | T_Unit : Gamma,
      Gammaunit \in Unit
  (* New rule of subsumption *)
  | T_Sub : Gamma t S T,
      Gammat \in S
      S <: T
      Gammat \in T

where "Gamma '⊢' t '∈' T" := (has_type Gamma t T).

Hint Constructors has_type.
The following hints help auto and eauto construct typing derivations. They are only used in a few places, but they give a nice illustration of what auto can do with a bit more programming. See chapter UseAuto for more on hints.

Hint Extern 2 (has_type _ (app _ _) _) ⇒
  eapply T_App; auto.
Hint Extern 2 (_ = _) ⇒ compute; reflexivity.

Properties

We want the same properties as always: progress + preservation.
  • Statements of these theorems don't need to change, compared to pure STLC
  • But proofs are a bit more involved, to account for the additional flexibility in the typing relation

Inversion Lemmas for Subtyping

Before we look at the properties of the typing relation, we need to establish a couple of critical structural properties of the subtype relation:
  • Bool is the only subtype of Bool, and
  • every subtype of an arrow type is itself an arrow type.
Formally:

Lemma sub_inversion_Bool : U,
     U <: Bool
     U = Bool.
Proof with auto.
  intros U Hs.
  remember Bool as V.
  (* FILL IN HERE *) Admitted.

Lemma sub_inversion_arrow : U V1 V2,
     U <: Arrow V1 V2
      U1 U2,
     U = Arrow U1 U2V1 <: U1U2 <: V2.
Proof with eauto.
  intros U V1 V2 Hs.
  remember (Arrow V1 V2) as V.
  generalize dependent V2. generalize dependent V1.
  (* FILL IN HERE *) Admitted.

Canonical Forms

The proof of progress uses facts of the form "every value belonging to an arrow type is an abstraction."
In the pure STLC, such facts are "immediate from the definition" (we can directly use the inversion tactic).
With subtyping, they require real inductive proofs...

Lemma canonical_forms_of_arrow_types : Gamma s T1 T2,
  Gammas \in Arrow T1 T2
  value s
   x S1 s2,
     s = abs x S1 s2.
Proof with eauto.
  (* FILL IN HERE *) Admitted.
Similarly, the canonical forms of type Bool are the constants tru and fls.

Lemma canonical_forms_of_Bool : Gamma s,
  Gammas \in Bool
  value s
  s = trus = fls.
Proof with eauto.
  intros Gamma s Hty Hv.
  remember Bool as T.
  induction Hty; try solve_by_invert...
  - (* T_Sub *)
    subst. apply sub_inversion_Bool in H. subst...
Qed.

Progress

Theorem (Progress): For any term t and type T, if empty t \in T then t is a value or t --> t' for some term t'.
Proof: Let t and T be given, with empty t \in T. Proceed by induction on the typing derivation.
The cases for T_Abs, T_Unit, T_True and T_False are immediate because abstractions, unit, tru, and fls are already values. The T_Var case is vacuous because variables cannot be typed in the empty context. The remaining cases are more interesting:
  • If the last step in the typing derivation uses rule T_App, then there are terms t1 t2 and types T1 and T2 such that t = t1 t2, T = T2, empty t1 \in T1 T2, and empty t2 \in T1. Moreover, by the induction hypothesis, either t1 is a value or it steps, and either t2 is a value or it steps. There are three possibilities to consider:
    • Suppose t1 --> t1' for some term t1'. Then t1 t2 --> t1' t2 by ST_App1.
    • Suppose t1 is a value and t2 --> t2' for some term t2'. Then t1 t2 --> t1 t2' by rule ST_App2 because t1 is a value.
    • Finally, suppose t1 and t2 are both values. By the canonical forms lemma for arrow types, we know that t1 has the form \x:S1.s2 for some x, S1, and s2. But then (\x:S1.s2) t2 --> [x:=t2]s2 by ST_AppAbs, since t2 is a value.
  • If the final step of the derivation uses rule T_Test, then there are terms t1, t2, and t3 such that t = test t1 then t2 else t3, with empty t1 \in Bool and with empty t2 \in T and empty t3 \in T. Moreover, by the induction hypothesis, either t1 is a value or it steps.
    • If t1 is a value, then by the canonical forms lemma for booleans, either t1 = tru or t1 = fls. In either case, t can step, using rule ST_TestTrue or ST_TestFalse.
    • If t1 can step, then so can t, by rule ST_Test.
  • If the final step of the derivation is by T_Sub, then there is a type S such that S <: T and empty t \in S. The desired result is exactly the induction hypothesis for the typing subderivation.
Formally:

Theorem progress : t T,
     emptyt \in T
     value t t', t --> t'.
Proof with eauto.
  intros t T Ht.
  remember empty as Gamma.
  revert HeqGamma.
  induction Ht;
    intros HeqGamma; subst...
  - (* T_Var *)
    inversion H.
  - (* T_App *)
    right.
    destruct IHHt1; subst...
    + (* t1 is a value *)
      destruct IHHt2; subst...
      × (* t2 is a value *)
        destruct (canonical_forms_of_arrow_types empty t1 T1 T2)
          as [x [S1 [t12 Heqt1]]]...
        subst. ([x:=t2]t12)...
      × (* t2 steps *)
        inversion H0 as [t2' Hstp]. (app t1 t2')...
    + (* t1 steps *)
      inversion H as [t1' Hstp]. (app t1' t2)...
  - (* T_Test *)
    right.
    destruct IHHt1.
    + (* t1 is a value *) eauto.
    + assert (t1 = trut1 = fls)
        by (eapply canonical_forms_of_Bool; eauto).
      inversion H0; subst...
    + inversion H. rename x into t1'. eauto.
Qed.

Inversion Lemmas for Typing

We also need inversion lemmas for some structural facts about the typing relation that were "obvious from the definition" in pure STLC.
Lemma: If Gamma \x:S1.t2 \in T, then there is a type S2 such that x>S1; Gamma t2 \in S2 and S1 S2 <: T.
(Notice that the lemma does not say, "then T itself is an arrow type" -- this is tempting, but false!)
Proof: Let Gamma, x, S1, t2 and T be given as described. Proceed by induction on the derivation of Gamma \x:S1.t2 \in T. Cases T_Var, T_App, are vacuous as those rules cannot be used to give a type to a syntactic abstraction.
  • If the last step of the derivation is a use of T_Abs then there is a type T12 such that T = S1 T12 and x:S1; Gamma t2 \in T12. Picking T12 for S2 gives us what we need: S1 T12 <: S1 T12 follows from S_Refl.
  • If the last step of the derivation is a use of T_Sub then there is a type S such that S <: T and Gamma \x:S1.t2 \in S. The IH for the typing subderivation tells us that there is some type S2 with S1 S2 <: S and x:S1; Gamma t2 \in S2. Picking type S2 gives us what we need, since S1 S2 <: T then follows by S_Trans.
Formally:

Lemma typing_inversion_abs : Gamma x S1 t2 T,
     Gamma ⊢ (abs x S1 t2) \in T
      S2,
       Arrow S1 S2 <: T
       ∧ (x > S1 ; Gamma) ⊢ t2 \in S2.
Proof with eauto.
  intros Gamma x S1 t2 T H.
  remember (abs x S1 t2) as t.
  induction H;
    inversion Heqt; subst; intros; try solve_by_invert.
  - (* T_Abs *)
     T12...
  - (* T_Sub *)
    destruct IHhas_type as [S2 [Hsub Hty]]...
  Qed.
Similarly...

Lemma typing_inversion_var : Gamma x T,
  Gamma ⊢ (var x) \in T
   S,
    Gamma x = Some SS <: T.
Proof with eauto.
  intros Gamma x T Hty.
  remember (var x) as t.
  induction Hty; intros;
    inversion Heqt; subst; try solve_by_invert.
  - (* T_Var *)
     T...
  - (* T_Sub *)
    destruct IHHty as [U [Hctx HsubU]]... Qed.

Lemma typing_inversion_app : Gamma t1 t2 T2,
  Gamma ⊢ (app t1 t2) \in T2
   T1,
    Gammat1 \in (Arrow T1 T2) ∧
    Gammat2 \in T1.
Proof with eauto.
  intros Gamma t1 t2 T2 Hty.
  remember (app t1 t2) as t.
  induction Hty; intros;
    inversion Heqt; subst; try solve_by_invert.
  - (* T_App *)
     T1...
  - (* T_Sub *)
    destruct IHHty as [U1 [Hty1 Hty2]]...
Qed.

Lemma typing_inversion_true : Gamma T,
  Gammatru \in T
  Bool <: T.
Proof with eauto.
  intros Gamma T Htyp. remember tru as tu.
  induction Htyp;
    inversion Heqtu; subst; intros...
Qed.

Lemma typing_inversion_false : Gamma T,
  Gammafls \in T
  Bool <: T.
Proof with eauto.
  intros Gamma T Htyp. remember fls as tu.
  induction Htyp;
    inversion Heqtu; subst; intros...
Qed.

Lemma typing_inversion_if : Gamma t1 t2 t3 T,
  Gamma ⊢ (test t1 t2 t3) \in T
  Gammat1 \in Bool
  ∧ Gammat2 \in T
  ∧ Gammat3 \in T.
Proof with eauto.
  intros Gamma t1 t2 t3 T Hty.
  remember (test t1 t2 t3) as t.
  induction Hty; intros;
    inversion Heqt; subst; try solve_by_invert.
  - (* T_Test *)
    auto.
  - (* T_Sub *)
    destruct (IHHty H0) as [H1 [H2 H3]]...
Qed.

Lemma typing_inversion_unit : Gamma T,
  Gammaunit \in T
    Unit <: T.
Proof with eauto.
  intros Gamma T Htyp. remember unit as tu.
  induction Htyp;
    inversion Heqtu; subst; intros...
Qed.
The inversion lemmas for typing and for subtyping between arrow types can be packaged up as a useful "combination lemma" telling us exactly what we'll actually require below.

Lemma abs_arrow : x S1 s2 T1 T2,
  empty ⊢ (abs x S1 s2) \in (Arrow T1 T2) →
     T1 <: S1
  ∧ (x > S1 ; empty) ⊢ s2 \in T2.
Proof with eauto.
  intros x S1 s2 T1 T2 Hty.
  apply typing_inversion_abs in Hty.
  inversion Hty as [S2 [Hsub Hty1]].
  apply sub_inversion_arrow in Hsub.
  inversion Hsub as [U1 [U2 [Heq [Hsub1 Hsub2]]]].
  inversion Heq; subst... Qed.

Context Invariance

The context invariance lemma follows the same pattern as in the pure STLC.

Inductive appears_free_in : stringtmProp :=
  | afi_var : x,
      appears_free_in x (var x)
  | afi_app1 : x t1 t2,
      appears_free_in x t1appears_free_in x (app t1 t2)
  | afi_app2 : x t1 t2,
      appears_free_in x t2appears_free_in x (app t1 t2)
  | afi_abs : x y T11 t12,
        yx
        appears_free_in x t12
        appears_free_in x (abs y T11 t12)
  | afi_test1 : x t1 t2 t3,
      appears_free_in x t1
      appears_free_in x (test t1 t2 t3)
  | afi_test2 : x t1 t2 t3,
      appears_free_in x t2
      appears_free_in x (test t1 t2 t3)
  | afi_test3 : x t1 t2 t3,
      appears_free_in x t3
      appears_free_in x (test t1 t2 t3)
.

Hint Constructors appears_free_in.

Lemma context_invariance : Gamma Gamma' t S,
     Gammat \in S
     ( x, appears_free_in x tGamma x = Gamma' x) →
     Gamma't \in S.
Proof with eauto.
  intros. generalize dependent Gamma'.
  induction H;
    intros Gamma' Heqv...
  - (* T_Var *)
    apply T_Var... rewrite <- Heqv...
  - (* T_Abs *)
    apply T_Abs... apply IHhas_type. intros x0 Hafi.
    unfold update, t_update. destruct (eqb_stringP x x0)...
  - (* T_Test *)
    apply T_Test...
Qed.

Lemma free_in_context : x t T Gamma,
   appears_free_in x t
   Gammat \in T
    T', Gamma x = Some T'.
Proof with eauto.
  intros x t T Gamma Hafi Htyp.
  induction Htyp;
      subst; inversion Hafi; subst...
  - (* T_Abs *)
    destruct (IHHtyp H4) as [T Hctx]. T.
    unfold update, t_update in Hctx.
    rewrite <- eqb_string_false_iff in H2.
    rewrite H2 in Hctx... Qed.

Substitution

The substitution lemma uses the inversion properties that we proved above.

Lemma substitution_preserves_typing : Gamma x U v t S,
     (x > U ; Gamma) ⊢ t \in S
     emptyv \in U
     Gamma ⊢ [x:=v]t \in S.
Proof with eauto.
  intros Gamma x U v t S Htypt Htypv.
  generalize dependent S. generalize dependent Gamma.
  induction t; intros; simpl.
  - (* var *)
    rename s into y.
    destruct (typing_inversion_var _ _ _ Htypt)
        as [T [Hctx Hsub]].
    unfold update, t_update in Hctx.
    destruct (eqb_stringP x y) as [Hxy|Hxy]; eauto;
    subst.
    inversion Hctx; subst. clear Hctx.
    apply context_invariance with empty...
    intros x Hcontra.
    destruct (free_in_context _ _ S empty Hcontra)
        as [T' HT']...
    inversion HT'.
  - (* app *)
    destruct (typing_inversion_app _ _ _ _ Htypt)
        as [T1 [Htypt1 Htypt2]].
    eapply T_App...
  - (* abs *)
    rename s into y. rename t into T1.
    destruct (typing_inversion_abs _ _ _ _ _ Htypt)
      as [T2 [Hsub Htypt2]].
    apply T_Sub with (Arrow T1 T2)... apply T_Abs...
    destruct (eqb_stringP x y) as [Hxy|Hxy].
    + (* x=y *)
      eapply context_invariance...
      subst.
      intros x Hafi. unfold update, t_update.
      destruct (eqb_string y x)...
    + (* x<>y *)
      apply IHt. eapply context_invariance...
      intros z Hafi. unfold update, t_update.
      destruct (eqb_stringP y z)...
      subst.
      rewrite <- eqb_string_false_iff in Hxy. rewrite Hxy...
  - (* tru *)
      assert (Bool <: S)
        by apply (typing_inversion_true _ _ Htypt)...
  - (* fls *)
      assert (Bool <: S)
        by apply (typing_inversion_false _ _ Htypt)...
  - (* test *)
    assert ((x > U ; Gamma) ⊢ t1 \in Bool
         ∧ (x > U ; Gamma) ⊢ t2 \in S
         ∧ (x > U ; Gamma) ⊢ t3 \in S)
      by apply (typing_inversion_if _ _ _ _ _ Htypt).
    inversion H as [H1 [H2 H3]].
    apply IHt1 in H1. apply IHt2 in H2. apply IHt3 in H3.
    auto.
  - (* unit *)
    assert (Unit <: S)
      by apply (typing_inversion_unit _ _ Htypt)...
Qed.

Preservation

The proof of preservation now proceeds pretty much as in earlier chapters, using the substitution lemma at the appropriate point and again using inversion lemmas from above to extract structural information from typing assumptions.
Theorem (Preservation): If t, t' are terms and T is a type such that empty t \in T and t --> t', then empty t' \in T.
Proof: Let t and T be given such that empty t \in T. We proceed by induction on the structure of this typing derivation, leaving t' general. The cases T_Abs, T_Unit, T_True, and T_False cases are vacuous because abstractions and constants don't step. Case T_Var is vacuous as well, since the context is empty.
  • If the final step of the derivation is by T_App, then there are terms t1 and t2 and types T1 and T2 such that t = t1 t2, T = T2, empty t1 \in T1 T2, and empty t2 \in T1.
    By the definition of the step relation, there are three ways t1 t2 can step. Cases ST_App1 and ST_App2 follow immediately by the induction hypotheses for the typing subderivations and a use of T_App.
    Suppose instead t1 t2 steps by ST_AppAbs. Then t1 = \x:S.t12 for some type S and term t12, and t' = [x:=t2]t12.
    By lemma abs_arrow, we have T1 <: S and x:S1 s2 \in T2. It then follows by the substitution lemma (substitution_preserves_typing) that empty [x:=t2] t12 \in T2 as desired.
    • If the final step of the derivation uses rule T_Test, then there are terms t1, t2, and t3 such that t = test t1 then t2 else t3, with empty t1 \in Bool and with empty t2 \in T and empty t3 \in T. Moreover, by the induction hypothesis, if t1 steps to t1' then empty t1' : Bool. There are three cases to consider, depending on which rule was used to show t --> t'.
      • If t --> t' by rule ST_Test, then t' = test t1' then t2 else t3 with t1 --> t1'. By the induction hypothesis, empty t1' \in Bool, and so empty t' \in T by T_Test.
      • If t --> t' by rule ST_TestTrue or ST_TestFalse, then either t' = t2 or t' = t3, and empty t' \in T follows by assumption.
  • If the final step of the derivation is by T_Sub, then there is a type S such that S <: T and empty t \in S. The result is immediate by the induction hypothesis for the typing subderivation and an application of T_Sub.

Theorem preservation : t t' T,
     emptyt \in T
     t --> t'
     emptyt' \in T.
Proof with eauto.
  intros t t' T HT.
  remember empty as Gamma. generalize dependent HeqGamma.
  generalize dependent t'.
  induction HT;
    intros t' HeqGamma HE; subst; inversion HE; subst...
  - (* T_App *)
    inversion HE; subst...
    + (* ST_AppAbs *)
      destruct (abs_arrow _ _ _ _ _ HT1) as [HA1 HA2].
      apply substitution_preserves_typing with T...
Qed.