Type Inference (Part I)

Overview
The Language
Polymorphic Type Inference: Definitions
Type Inference (Informal Approach)
- Test Yourself #1

For primitive functions
- Test Yourself #2
For user-defined functions

Overview

Most programming languages include the notion of a type system. This is because types help uncover logical errors. Although typechecking can be done either statically (at compile time) or dynamically (at run time), static typechecking has the advantage that if your program typechecks, you know that there will be no type violations on any run; if typechecking is done dynamically, the fact that one run produced no type errors generally provides no guarantees about what will happen on other runs.

There are two different approaches to static typechecking:

Static Typing: A language is said to be statically typed if the type of every expression in program can be determined at compile time.
Strong Typing: A language is said to use strong typing if the types of all expressions can be analyzed at compile time to see if they are mutually consistent (if so, there will be no runtime type errors); however, the actual types of some expressions may not be known until runtime.

Both static and strong typing require type inference: a technique that determines the type of every expression (possibly given declarations of the types for some variables and some user-defined functions). The goal of static typing is to assign a monotype (a single type) to each expression; in contrast, strong typing may assign some expressions a polytype (a type with type variables).

We will consider how to do strong typing in the presence of polymorphism, using Milner's polymorphic type inference algorithm, "Algorithm W" (see the paper by Lucca Cardelli on polymorphic typechecking).

Milner's algorithm was developed for the language ML. We'll use a simpler language (defined below). We'll start with an informal definition of how to do type-inference (in English), then we'll give a formal definition via axioms and rules of inference (so that an expression e has a type t iff there is a proof in this system). We'll see that algorithm W is:

sound: If it says that expression e has type t, then there is a proof in our formal system.
not quite complete: There are some expressions that do have types according to the formal system, yet the algorithm fails to find those types.
complete up to shallow types (which we'll define later).

The Language

The language we'll use is a subset of ML that is basically an enriched version of lambda calculus. A program will be an expression as defined by the following grammar:

exp → ID
| literal // int, bool, list, or pair
| λ ID . exp // function definition
| exp (exp) // function application
| if exp then exp else exp // normal if-then-else
| let ID = exp in exp // define a "macro" or a non-recursive fn
| let rec ID = exp in exp // define a recursive fn

The primitive types are:

int
bool
homogeneous list (all elements have the same type)
pair (not necessarily homogeneous)

The primitive functions are:

plus, iszero, succ, pred
cons (restricted to lists, not arbitrary s-expressions), car, cdr, null
pair (create a pair of objects - not necessarily of the same type)

We will assume that all functions are in curried form (i.e., take only one argument),we'll use functions instead of operators (e.g., "plus(1)(2)" instead of "1+2"), and we'll use square brackets for list literals (e.g., [1,2,3]).

Polymorphic Type Inference: Definitions

The goal of polymorphic type inference is:

given

find

most general type

A definition of "most general type" appears below.

The interesting part of polymorphic type inference is that types can include type variables. A type is defined as follows:

Base types (e.g., int, boolean) are types.
Type variables (denoted by lower-case Greek letters or by t1 t2 etc.) are types.
If T1 and T2 are types, so are:

If a type contains a variable it is a polytype; otherwise it is a monotype.

Here are the types of some of our primitive functions:

function type
succ int → int
iszero int → boolean
plus (uncurried form) (int x int) → int
plus (curried form) int → (int → int)
cons (uncurried form) (α x α-list) → α-list
cons (curried form) α → (α-list → α-list)
car α-list → α
pair α → (β → (α x β))

function	type
succ	int → int
iszero	int → boolean
plus (uncurried form)	(int x int) → int
plus (curried form)	int → (int → int)
cons (uncurried form)	(α x α-list) → α-list
cons (curried form)	α → (α-list → α-list)
car	α-list → α
pair	α → (β → (α x β))

Intuitively, a type variable means "any type", although if one type variable occurs multiple times in a type, then they all have to refer to the same type. For example, since we've restricted our attention to homogeneous lists, cons is restricted to operate on an object of some type and a list of objects of that same type, rather than an arbitrary object and an arbitrary list. Therefore, the type of cons is α → (α-list → α-list). We impose no such restriction on pair; its arguments can have unrelated types.

Recall that our goal is to find the most general type for each expression in a program. We'll use "T1 ⊇ T2" (where T1 and T2 are types) to mean: T1 is at least as general as T2, and we'll use "T1 ⊃ T2" to mean T1 is strictly more general than T2. Here's how the ordering is defined:

(α → α) ⇒ (int → int)

(α → β) ⇒ (α → bool) ⇒ (int → bool)

By definition:

(T1 ⊇ T2) iff (T1 ⇒* T2), and
(T1 ⊃ T2) iff (T1 ⊇ T2) and not (T2 ⊇ T1)

So for example:

Type Inference: Informal Approach

Let's start by looking at an example:

let rec

then

else

What do we know about the types of the expressions in this (incomplete) program?

The type of null is: α-list → bool.
The type of 0 is: int.
The type of succ is: int → int.
The type of cdr is: α-list → α-list.
The type of the condition part of an if must be: bool.
The types of the then and else parts of an if must be the same (so that the whole expression has a well-defined type).

Point 1 tells us that the type of L must be: α-list. Point 3 tells us that the type of length (cdr L) must be: int, and that the type of succ (length (cdr L)) is: int. The type of null(L) (namely bool) is consistent with point 5. Since both branches of the if have type int, the type of the whole if is int, and thus the type of length is: α-list → int.

TEST YOURSELF #1

Try the same, informal type-inference for the following function definition:

let rec

then

else

solution

Below are (informal) type-inference rules (one set of rules for each kind of expression).

(1) if cond then exp1 else exp2
(a) the type of cond must be bool
(b) the types of exp1 and exp2 must be the same
(c) the type of the whole expression must be the same as the types of exp1 and exp2

(2) function application: fn(arg)
(a) the type of fn must be α → β
(b) the type of arg must be α
(c) the type of the whole expression is β

(3) function abstraction: λ id.exp
(a) the type of id is α
(b) the type of exp is β
(c) the type of the whole expression is α → β

(4) let id = e1 in e2
let rec id = e1 in e2
(a) inside e2 the type of id is the type of e1
(b) the type of the whole expression is the type of e2

Given these rules, here's an informal algorithm for how to typecheck an expression (i.e., how to infer the types of all subexpressions, and make sure that everything is consistent); we assume that we're given the abstract-syntax tree representation of the expression:

Start with a type environment that maps the primitive functions to their types. As typechecking proceeds, the environment will be updated to include mappings of program IDs to type expressions.
Work in post-order in the AST (bottom-up, left-to-right), assigning a type to each node:
- For leaves:
  - if a literal, assign appropriate type
  - if an identifier, look it up in the type environment; if not there, add it with a new type variable.
- For internal nodes, use the inference rules; this may involve discovering some "forced" equalities; keep track of those, too
Reject the program if a type rule is violated.

Below is the AST for the length function, annotated to show the result of typechecking (the type of each node is shown in parentheses). The type environment is also shown, as are the forced equalities discovered during typechecking (shown in a table at the bottom right, and also as ** xx = yy ** at the point in the tree where they are discovered). Note that some of the types in the type environment are "not quite right." That issue is explained in the next section.

   let rec length = λ L. if null(L) then 0 else succ (length (cdr(L)))


            let rec  ** t1 = α-list → int **
           --------
         /          \
        /            \
 (t1) length         lambda   
                   (α-list → int)
                  /            \
                 /              \
          (t2)  L             if-then-else (int)
                               /       |        \
                              /        |         \     ** t3 = int**
       **t2=α-list** (bool) apply  0 (int)   apply (int) 
                             /  \               /     \
                           null  L            succ     apply  (t3)
                 (α-list → bool)  (t2)    (int → int)   /   \ **t1:β-list → t3**
                                                       /     \
                                                   length    apply (β-list)
                                                   (t1)      /   \ **α = β**
                                                            /     \
                                                         cdr       L (α-list)
                                              (β-list → β-list)
                                                    

              type env                        equalities
             ---------                       -----------
           *   null: α-list → bool           t2 = α-list
               succ: int → int               α = β
           *   cdr: β-list → β-list          t1 = β-list → t3
                                             t3 = int
           * length: t1
                  l: t2

             (Note: * means not quite right)

Generic and Non-Generic Type Variables

In the example given above, the types for null, cdr, and length were marked as "not quite right". In this section we explain the problem and the solution.

For primitive functions

As a motivating example, assume that length is a primitive function with type α-list → int. Consider the following function application (recall that list literals are enclosed in square brackets):

plus(length([1,2,3])) (length([true, false]))

This expression should typecheck, and should be discovered to have type int. However, using the approach defined so far, this code will be rejected. Here's a partially annotated AST; what we'd get after typechecking the left subtree (the application of plus to the result of length([1,2,3])).


                
                              apply
                             /     \
                (int → int) /       \
                        apply        apply
                       /   |          |   \
                      /    | (int)    |    \
                 plus     apply    length   [true,false]
       int → (int → int)  /  \    
                         /    \      
                     length   [1,2,3]
         (α-list → int)      (int-list)
          ** α = int **

              type env                    equalities
             ----------                  -----------
          plus: int → (int → int)          α = int
          length: α-list → int

Note that typechecking length([1,2,3]) caused us to infer that α = int (since length, of type α-list→α is applied to an int-list). That means that length is actually of type int-list→int. This is clearly wrong (since length should be applicable to any kind of list), and when we try to typecheck the right subtree (the application of length to the list [true, false]), we're in trouble! The application of length to the list [true,false] will be rejected, since you can't apply a function of type int-list→int to an argument of type bool-list.

The solution is to use generic type variables for polymorphic functions. A generic type variable is one that appears inside a "forall" quantifier. For example, the type of length should be the generic type:

∀ α. α-list → int

instead of the non-generic (unquantified) type we were using, and the type of pair should be:

∀ α. ∀ β. α → (β → (α x β))

When a function's type involves a generic type variable, it means that the type variable can take on different values each time the function occurs in the AST.

TEST YOURSELF #2

What are the correct types for the other primitive functions (plus and cons)?

solution

Here's how generic and non-generic types are used during typechecking: First, we initialize our type environment with the correct generic types for the primitive functions. Then, when an identifier appears as a leaf of an abstract-syntax tree that is being typechecked, the type is looked up in the type environment. If it is a non-generic type T, the tree node for the identifier is given type T. If it is a generic type of the form

∀ α.T then the tree node is given type

T[t/α] where t is a new type variable. Note that we're using substitution here (just like we did for the lambda calculus): T[t/α] means T with all free occurrences of α replaced by t. If the generic type has multiple quantifiers, then we do a substitution for each. For example, since pair has type

∀ α. ∀ β. α → (β → (α x β)) the first instance of pair in the AST would be given type

t1 → (t2 → (t1 x t2)) and the second instance would be given type

t3 → (t4 → (t3 x t4)) and so on.

Here's the example from above, this time using a generic type for length:

           

                
                                apply
                               /     \
               (int → int)   /       \
                        apply          apply
                       /   |            |   \
                      /    | (int)      |    \
                 plus     apply      length   [true,false]
      int → (int → int)    /  \   (t2-list → int) (bool-list)
                          /    \         ** t2 = bool **
                     length   [1,2,3]
           (t1-list → int)  (int-list)
                  ** t1 = int **

              type env                    equalities
             ----------                  -----------
           plus: int → (int → int)        t1 = int
           length: ∀ α.α-list → int      t2 = bool

Note that now the two occurrences of length have different types (one is t1-list→int and the other is t2-list→int), there is no conflict, and the whole expression will typecheck as it should.

For user-defined functions

What about user-defined functions? We certainly want to allow polymorphic user-defined functions (like length). However, we have to be careful. Consider the following expression, which defines an anonymous function whose argument, f, is also a function:

λ f. pair(f(3))(f(true))

If f is given a generic type "∀ α. α", then we would infer: t1→(t2 x t3) as the type of the whole expression. That may seem OK, but now consider:

(λf. pair(f(3))(f(true)))(succ)

This expression should not typecheck (because succ applied to true is an error). However, if f gets a generic type, then typechecking will succeed (will fail to find the type error); we will find that t1 = int→int, and that the whole expression has type (t2 x t3):


                                
                        apply  (t2 x t3) 
                       /     \
                      /       \
  ( t1 → (t2 x t3) ) λ      succ (int → int)
                       
                   ...

To prevent this, the ID in an expression of the form

exp

must be given a non-generic type to be used when typechecking exp. However, if the inferred type of exp includes type variables, then the type of the whole expression should in general be generic. For example, consider:

let

This expression should typecheck (and the type of the whole expression should be: int x bool). That only happens if the type of λa.a is generic: ∀α.α→α.

We might conclude that in general, for a let expression:

let

exp1

exp2

we should do the following:

Typecheck exp1, producing type T1.
Add the mapping (ID → T2) to the type environment, where T2 is T1 with "foralls" added to the front for every type variable in T1.
Typecheck exp2.

This is almost right, but we have to be a bit more careful about making the type variables in T1 generic. Consider one more example:

let

This is like our first example, we've just snuck in a let expression. Nevertheless, this expression should not typecheck. In this case, exp1 is "f = g", and the type T1 inferred for that expression is the type t1 of the variable g bound by the enclosing lambda. Because of that, we do not want f's type to be ∀t1.t1; it should simply be the non-generic t1.

So to typecheck a let expression we should do the three steps listed above except that we do not add a "forall" to the front of T1 for a type variable that is the type of an ID in some enclosing lambda.

What about recursive definitions; expressions of the form:

let rec

exp1

exp2

where f occurs in exp1 (as well as exp2). Recall that we can think of a recursive definition as the fixed point of the corresponding functional:

let

exp1

exp2

So we see that exp1 is really in the scope of a lambda, and thus f should be given a non-generic type when exp1 is typechecked.

T1 → T2	// function
T1 x T2	// pair
T1-list	// list of objects of type T1
(T1)	// as usual, parens can be used for grouping

exp	→	ID
	\|	literal	// int, bool, list, or pair
	\|	λ ID . exp	// function definition
	\|	exp (exp)	// function application
	\|	if exp then exp else exp	// normal if-then-else
	\|	let ID = exp in exp	// define a "macro" or a non-recursive fn
	\|	let rec ID = exp in exp	// define a recursive fn

(1)	if cond then exp1 else exp2
	(a) the type of cond must be bool
	(b) the types of exp1 and exp2 must be the same
	(c) the type of the whole expression must be the same as the types of exp1 and exp2
(2)	function application: fn(arg)
	(a) the type of fn must be α → β
	(b) the type of arg must be α
	(c) the type of the whole expression is β
(3)	function abstraction: λ id.exp
	(a) the type of id is α
	(b) the type of exp is β
	(c) the type of the whole expression is α → β
(4)	let id = e1 in e2
	let rec id = e1 in e2
	(a) inside e2 the type of id is the type of e1
	(b) the type of the whole expression is the type of e2

Type Inference (Part I)

Contents

Overview

The Language

Polymorphic Type Inference: Definitions

Type Inference: Informal Approach

Generic and Non-Generic Type Variables

For primitive functions

For user-defined functions