Lambda calculus definition

Last updated May 15, 2024

Lambda calculus is a formal mathematical system based on lambda abstraction and function application. Two definitions of the language are given here: a standard definition, and a definition using mathematical formulas.

Standard definition
Definition
Notation
Free and bound variables
Reduction
Normalization
Syntax definition in BNF
Definition as mathematical formulas
Semantics
Canonym - Canonical Names
Map operators
Substitution operator
Free and bound variable sets
Evaluation strategy
Derivation of standard from the math definition
Free and bound variables 2
Changes to the substitution operator
Transformation
References

Standard definition

This formal definition was given by Alonzo Church.

Definition

Lambda expressions are composed of

variables $v_{1}$ , $v_{2}$ , ..., $v_{n}$ , ...
the abstraction symbols lambda ' $\lambda$ ' and dot '.'
parentheses ( )

The set of lambda expressions, $\Lambda$ , can be defined inductively:

If $x$ is a variable, then $x\in \Lambda$
If $x$ is a variable and $M\in \Lambda$ , then $(\lambda x.M)\in \Lambda$
If $M,N\in \Lambda$ , then $(M\ N)\in \Lambda$

Instances of rule 2 are known as abstractions and instances of rule 3 are known as applications.^[1]

Notation

To keep the notation of lambda expressions uncluttered, the following conventions are usually applied.

Outermost parentheses are dropped: $M\ N$ instead of $(M\ N)$
Applications are assumed to be left-associative: $M\ N\ P$ may be written instead of $((M\ N)\ P)$ ^[2]
The body of an abstraction extends as far right as possible: $\lambda x.M\ N$ means $\lambda x.(M\ N)$ and not $(\lambda x.M)\ N$
A sequence of abstractions is contracted: $\lambda x.\lambda y.\lambda z.N$ is abbreviated as $\lambda xyz.N$ ^[3]^[4]

Free and bound variables

The abstraction operator, $\lambda$ , is said to bind its variable wherever it occurs in the body of the abstraction. Variables that fall within the scope of an abstraction are said to be bound. All other variables are called free. For example, in the following expression $y$ is a bound variable and $x$ is free: $\lambda y.x\ x\ y$ . Also note that a variable is bound by its "nearest" abstraction. In the following example the single occurrence of $x$ in the expression is bound by the second lambda: $\lambda x.y(\lambda x.z\ x)$

The set of free variables of a lambda expression, $M$ , is denoted as $\operatorname {FV} (M)$ and is defined by recursion on the structure of the terms, as follows:

$\operatorname {FV} (x)=\{x\}$ , where $x$ is a variable
$\operatorname {FV} (\lambda x.M)=\operatorname {FV} (M)\backslash \{x\}$
$\operatorname {FV} (M\ N)=\operatorname {FV} (M)\cup \operatorname {FV} (N)$ ^[5]

An expression that contains no free variables is said to be closed. Closed lambda expressions are also known as combinators and are equivalent to terms in combinatory logic.

Reduction

The meaning of lambda expressions is defined by how expressions can be reduced.^[6]

There are three kinds of reduction:

α-conversion: changing bound variables (alpha);
β-reduction: applying functions to their arguments (beta);
η-reduction: which captures a notion of extensionality (eta).

We also speak of the resulting equivalences: two expressions are β-equivalent, if they can be β-converted into the same expression, and α/η-equivalence are defined similarly.

The term redex, short for reducible expression, refers to subterms that can be reduced by one of the reduction rules. For example, $(\lambda x.M)\ N$ is a β-redex in expressing the substitution of $N$ for $x$ in $M$ ; if $x$ is not free in $M$ , $\lambda x.M\ x$ is an η-redex. The expression to which a redex reduces is called its reduct; using the previous example, the reducts of these expressions are respectively $M[x:=N]$ and $M$ .

α-conversion

Alpha-conversion, sometimes known as alpha-renaming,^[7] allows bound variable names to be changed. For example, alpha-conversion of $\lambda x.x$ might yield $\lambda y.y$ . Terms that differ only by alpha-conversion are called α-equivalent. Frequently in uses of lambda calculus, α-equivalent terms are considered to be equivalent.

The precise rules for alpha-conversion are not completely trivial. First, when alpha-converting an abstraction, the only variable occurrences that are renamed are those that are bound by the same abstraction. For example, an alpha-conversion of $\lambda x.\lambda x.x$ could result in $\lambda y.\lambda x.x$ , but it could not result in $\lambda y.\lambda x.y$ . The latter has a different meaning from the original.

Second, alpha-conversion is not possible if it would result in a variable getting captured by a different abstraction. For example, if we replace $x$ with $y$ in $\lambda x.\lambda y.x$ , we get $\lambda y.\lambda y.y$ , which is not at all the same.

In programming languages with static scope, alpha-conversion can be used to make name resolution simpler by ensuring that no variable name masks a name in a containing scope (see alpha renaming to make name resolution trivial).

Substitution

Substitution, written $E[V:=R]$ , is the process of replacing all free occurrences of the variable $V$ in the expression $E$ with expression $R$ . Substitution on terms of the lambda calculus is defined by recursion on the structure of terms, as follows (note: x and y are only variables while M and N are any λ expression).

{\begin{aligned}x[x:=N]&\equiv N\\y[x:=N]&\equiv y{\text{, if }}x\neq y\end{aligned}}

{\begin{aligned}(M_{1}\ M_{2})[x:=N]&\equiv (M_{1}[x:=N])\ (M_{2}[x:=N])\\(\lambda x.M)[x:=N]&\equiv \lambda x.M\\(\lambda y.M)[x:=N]&\equiv \lambda y.(M[x:=N]){\text{, if }}x\neq y{\text{, provided }}y\notin FV(N)\end{aligned}}

To substitute into a lambda abstraction, it is sometimes necessary to α-convert the expression. For example, it is not correct for $(\lambda x.y)[y:=x]$ to result in $(\lambda x.x)$ , because the substituted $x$ was supposed to be free but ended up being bound. The correct substitution in this case is $(\lambda z.x)$ , up to α-equivalence. Notice that substitution is defined uniquely up to α-equivalence.

β-reduction

β-reduction captures the idea of function application. β-reduction is defined in terms of substitution: the β-reduction of $((\lambda V.E)\ E')$ is $E[V:=E']$ .

For example, assuming some encoding of $2,7,\times$ , we have the following β-reduction: $((\lambda n.\ n\times 2)\ 7)\rightarrow 7\times 2$ .

η-reduction

η-reduction expresses the idea of extensionality, which in this context is that two functions are the same if and only if they give the same result for all arguments. η-reduction converts between $\lambda x.(fx)$ and $f$ whenever $x$ does not appear free in $f$ .

Normalization

The purpose of β-reduction is to calculate a value. A value in lambda calculus is a function. So β-reduction continues until the expression looks like a function abstraction.

A lambda expression that cannot be reduced further, by either β-redex, or η-redex is in normal form. Note that alpha-conversion may convert functions. All normal forms that can be converted into each other by α-conversion are defined to be equal. See the main article on Beta normal form for details.

Normal Form Type	Definition.
Normal Form	No β- or η-reductions are possible.
Head Normal Form	In the form of a lambda abstraction whose body is not reducible.
Weak Head Normal Form	In the form of a lambda abstraction.

Syntax definition in BNF

Lambda Calculus has a simple syntax. A lambda calculus program has the syntax of an expression where,

Name	BNF	Description
Abstraction	<expression>::= λ <variable-list> . <expression>	Anonymous function definition.
Application term	<expression>::=<application-term>
Application	<application-term>::=<application-term><item>	A function call.
Item	<application-term>::=<item>
Variable	<item>::=<variable>	E.g. x, y, fact, sum, ...
Grouping	<item>::= ( <expression> )	Bracketed expression.

The variable list is defined as,

<variable-list>::=<variable> | <variable>, <variable-list>

A variable as used by computer scientists has the syntax,

<variable>::=<alpha><extension><extension>::=<extension>::=<extension-char><extension><extension-char>::=<alpha> | <digit> | _

Mathematicians will sometimes restrict a variable to be a single alphabetic character. When using this convention the comma is omitted from the variable list.

A lambda abstraction has a lower precedence than an application, so;

\lambda x.y\ z=\lambda x.(y\ z)

Applications are left associative;

x\ y\ z=(x\ y)\ z

An abstraction with multiple parameters is equivalent to multiple abstractions of one parameter.

\lambda x.y.z=\lambda x.\lambda y.z

where,

x is a variable
y is a variable list
z is an expression

Definition as mathematical formulas

The problem of how variables may be renamed is difficult. This definition avoids the problem by substituting all names with canonical names, which are constructed based on the position of the definition of the name in the expression. The approach is analogous to what a compiler does, but has been adapted to work within the constraints of mathematics.

Semantics

The execution of a lambda expression proceeds using the following reductions and transformations,

α-conversion - $\operatorname {alpha-conv} (a)\to \operatorname {canonym} [A,P]=\operatorname {canonym} [a[A],P]$
β-reduction - $\operatorname {beta-redex} [\lambda p.b\ v]=b[p:=v]$
η-reduction - $x\not \in \operatorname {FV} (f)\to \operatorname {eta-redex} [\lambda x.(f\ x)]=f$

where,

canonym is a renaming of a lambda expression to give the expression standard names, based on the position of the name in the expression.
Substitution Operator, $b[p:=v]$ is the substitution of the name $p$ by the lambda expression $v$ in lambda expression $b$ .
Free Variable Set $\operatorname {FV} (f)$ is the set of variables that do not belong to a lambda abstraction in $f$ .

Execution is performing β-reductions and η-reductions on subexpressions in the canonym of a lambda expression until the result is a lambda function (abstraction) in the normal form.

All α-conversions of a λ-expression are considered to be equivalent.

Canonym - Canonical Names

Canonym is a function that takes a lambda expression and renames all names canonically, based on their positions in the expression. This might be implemented as,

{\begin{aligned}\operatorname {canonym} [L,Q]&=\operatorname {canonym} [L,O,Q]\\\operatorname {canonym} [\lambda p.b,M,Q]&=\lambda \operatorname {name} (Q).\operatorname {canonym} [b,M[p:=Q],Q+N]\\\operatorname {canonym} [X\ Y,x,Q]&=\operatorname {canonym} [X,x,Q+F]\ \operatorname {canonym} [Y,x,E+S]\\\operatorname {canonym} [x,M,Q]&=\operatorname {name} (M[x])\end{aligned}}

Where, N is the string "N", F is the string "F", S is the string "S", + is concatenation, and "name" converts a string into a name

Map operators

Map from one value to another if the value is in the map. O is the empty map.

$O[x]=x$
$M[x:=y][x]=y$
$x\neq z\to M[x:=y][z]=M[z]$

Substitution operator

If L is a lambda expression, x is a name, and y is a lambda expression; $L[x:=y]$ means substitute x by y in L. The rules are,

$(\lambda p.b)[x:=y]=\lambda p.b[x:=y]$
$(X\,Y)[x:=y]=X[x:=y]\,Y[x:=y]$
$z=x\to (z)[x:=y]=y$
$z\neq x\to (z)[x:=y]=z$

Note that rule 1 must be modified if it is to be used on non canonically renamed lambda expressions. See Changes to the substitution operator.

Free and bound variable sets

The set of free variables of a lambda expression, M, is denoted as FV(M). This is the set of variable names that have instances not bound (used) in a lambda abstraction, within the lambda expression. They are the variable names that may be bound to formal parameter variables from outside the lambda expression.

The set of bound variables of a lambda expression, M, is denoted as BV(M). This is the set of variable names that have instances bound (used) in a lambda abstraction, within the lambda expression.

The rules for the two sets are given below.^[5]

$\mathrm {FV} (M)$ - Free Variable Set	Comment	$\mathrm {BV} (M)$ - Bound Variable Set	Comment
$\mathrm {FV} (x)=\{x\}$	where x is a variable	$\mathrm {BV} (x)=\emptyset$	where x is a variable
$\mathrm {FV} (\lambda x.M)=\mathrm {FV} (M)\setminus \{x\}$	Free variables of M excluding x	$\mathrm {BV} (\lambda x.M)=\mathrm {BV} (M)\cup \{x\}$	Bound variables of M plus x.
$\mathrm {FV} (M\ N)=\mathrm {FV} (M)\cup \mathrm {FV} (N)$	Combine the free variables from the function and the parameter	$\mathrm {BV} (M\ N)=\mathrm {BV} (M)\cup \mathrm {BV} (N)$	Combine the bound variables from the function and the parameter

Usage;

The Free Variable Set, FV is used above in the definition of the η-reduction.
The Bound Variable Set, BV, is used in the rule for β-redex of non canonical lambda expression.

Evaluation strategy

This mathematical definition is structured so that it represents the result, and not the way it gets calculated. However the result may be different between lazy and eager evaluation. This difference is described in the evaluation formulas.

The definitions given here assume that the first definition that matches the lambda expression will be used. This convention is used to make the definition more readable. Otherwise some if conditions would be required to make the definition precise.

Running or evaluating a lambda expression L is,

\operatorname {eval} [\operatorname {canonym} [L,Q]]

where Q is a name prefix possibly an empty string and eval is defined by,

{\begin{aligned}\operatorname {eval} [x\ y]&=\operatorname {eval} [\operatorname {apply} [\operatorname {eval} [x]\ \operatorname {strategy} [y]]]\\\operatorname {apply} [(\lambda x.y)\ z]&=\operatorname {canonym} [\operatorname {beta-redex} [(\lambda x.y)\ z],x]\\\operatorname {apply} [x]&=x{\text{ if x does match the above.}}\\\operatorname {eval} [\lambda x.(f\ x)]&=\operatorname {eval} [\operatorname {eta-redex} [\lambda x.(f\ x)]]\\\operatorname {eval} [L]&=L\\\operatorname {lazy} [X]&=X\\\operatorname {eager} [X]&=\operatorname {eval} [X]\end{aligned}}

Then the evaluation strategy may be chosen as either,

{\begin{aligned}\operatorname {strategy} &=\operatorname {lazy} \\\operatorname {strategy} &=\operatorname {eager} \end{aligned}}

The result may be different depending on the strategy used. Eager evaluation will apply all reductions possible, leaving the result in normal form, while lazy evaluation will omit some reductions in parameters, leaving the result in "weak head normal form".

Normal form

All reductions that can be applied have been applied. This is the result obtained from applying eager evaluation.

{\begin{aligned}\operatorname {normal} [(\lambda x.y)\ z]&=\operatorname {false} \\\operatorname {normal} [\lambda x.(f\ x)]&=\operatorname {false} \\\operatorname {normal} [x\ y]&=\operatorname {normal} [x]\land \operatorname {normal} [y]\end{aligned}}

In all other cases,

\operatorname {normal} [x]=\operatorname {true}

Weak head normal form

(The definition below is flawed, it is in contradiction with the definition saying that weak head normal form is either head normal form or the term is an abstraction.^[8] The notion has been introduced by Simon Peyton Jones.^[9])

Reductions to the function (the head) have been applied, but not all reductions to the parameter have been applied. This is the result obtained from applying lazy evaluation.

{\begin{aligned}\operatorname {whnf} [(\lambda x.y)\ z]&=\operatorname {false} \\\operatorname {whnf} [\lambda x.(f\ x)]&=\operatorname {false} \\\operatorname {whnf} [x\ y]&=\operatorname {whnf} [x]\end{aligned}}

In all other cases,

\operatorname {whnf} [x]=\operatorname {true}

Derivation of standard from the math definition

The standard definition of lambda calculus uses some definitions which may be considered as theorems, which can be proved based on the definition as mathematical formulas.

The canonical naming definition deals with the problem of variable identity by constructing a unique name for each variable based on the position of the lambda abstraction for the variable name in the expression.

This definition introduces the rules used in the standard definition and relates explains them in terms of the canonical renaming definition.