Church encoding

In mathematics, Church encoding is a means of representing data and operators in the lambda calculus. The Church numerals are a representation of the natural numbers using lambda notation. The method is named for Alonzo Church, who first encoded data in the lambda calculus this way.

Terms that are usually considered primitive in other notations (such as integers, booleans, pairs, lists, and tagged unions) are mapped to higher-order functions under Church encoding. The Church–Turing thesis asserts that any computable operator (and its operands) can be represented under Church encoding. In the untyped lambda calculus the only primitive data type is the function.

Use
A straightforward implementation of Church encoding slows some access operations from $$O(1)$$ to $$O(n)$$, where $$n$$ is the size of the data structure, making Church encoding impractical. Research has shown that this can be addressed by targeted optimizations, but most functional programming languages instead expand their intermediate representations to contain algebraic data types. Nonetheless Church encoding is often used in theoretical arguments, as it is a natural representation for partial evaluation and theorem proving. Operations can be typed using higher-ranked types, and primitive recursion is easily accessible. The assumption that functions are the only primitive data types streamlines many proofs.

Church encoding is complete but only representationally. Additional functions are needed to translate the representation into common data types, for display to people. It is not possible in general to decide if two functions are extensionally equal due to the undecidability of equivalence from Church's theorem. The translation may apply the function in some way to retrieve the value it represents, or look up its value as a literal lambda term. Lambda calculus is usually interpreted as using intensional equality. There are potential problems with the interpretation of results because of the difference between the intensional and extensional definition of equality.

Church numerals
Church numerals are the representations of natural numbers under Church encoding. The higher-order function that represents natural number n is a function that maps any function $$f$$ to its n-fold composition. In simpler terms, the "value" of the numeral is equivalent to the number of times the function encapsulates its argument.


 * $$f^{\circ n} = \underbrace{f \circ f \circ \cdots \circ f}_{n\text{ times}}.\,$$

All Church numerals are functions that take two parameters. Church numerals 0, 1, 2, ..., are defined as follows in the lambda calculus.

Starting with 0 not applying the function at all, proceed with 1 applying the function once, 2 applying the function twice, 3 applying the function three times, etc.:



\begin{array}{r|l|l} \text{Number} & \text{Function definition} & \text{Lambda expression} \\ \hline 0 & 0\ f\ x = x & 0 = \lambda f.\lambda x.x \\ 1 & 1\ f\ x = f\ x & 1 = \lambda f.\lambda x.f\ x \\ 2 & 2\ f\ x = f\ (f\ x) & 2 = \lambda f.\lambda x.f\ (f\ x) \\ 3 & 3\ f\ x = f\ (f\ (f\ x)) & 3 = \lambda f.\lambda x.f\ (f\ (f\ x)) \\ \vdots & \vdots & \vdots \\ n & n\ f\ x = f^n\ x & n = \lambda f.\lambda x.f^{\circ n}\ x \end{array} $$

The Church numeral 3 represents the action of applying any given function three times to a value. The supplied function is first applied to a supplied parameter and then successively to its own result. The end result is not the numeral 3 (unless the supplied parameter happens to be 0 and the function is a successor function). The function itself, and not its end result, is the Church numeral 3. The Church numeral 3 means simply to do anything three times. It is an ostensive demonstration of what is meant by "three times".

Calculation with Church numerals
Arithmetic operations on numbers may be represented by functions on Church numerals. These functions may be defined in lambda calculus, or implemented in most functional programming languages (see converting lambda expressions to functions).

The addition function $$\operatorname{plus}(m, n)= m+n$$ uses the identity $$f^{\circ (m+n)}(x)=f^{\circ m}(f^{\circ n}(x))$$.


 * $$\operatorname{plus} \equiv \lambda m.\lambda n.\lambda f.\lambda x. m\ f\ (n\ f\ x)$$

The successor function $$\operatorname{succ}(n)=n+1$$ is β-equivalent to $$(\operatorname{plus}\ 1)$$.


 * $$\operatorname{succ} \equiv \lambda n.\lambda f.\lambda x. f\ (n\ f\ x)$$

The multiplication function $$\operatorname{mult}(m, n) = m*n$$ uses the identity $$f^{\circ (m*n)}(x) = (f^{\circ n})^{\circ m}(x)$$.


 * $$\operatorname{mult} \equiv \lambda m.\lambda n.\lambda f.\lambda x. m\ (n\ f)\ x$$

The exponentiation function $$\operatorname{exp}(m, n) = m^n$$ is given by the definition of Church numerals, $$n\ h\ x = h^n\ x $$. In the definition substitute $$ h \to m, x \to f$$ to get $$n\ m\ f = m^n\ f $$ and,
 * $$\operatorname{exp}\ m\ n = m^n = n\ m $$

which gives the lambda expression,
 * $$\operatorname{exp} \equiv \lambda m.\lambda n. n\ m$$

The $$\operatorname{pred}(n)$$ function is more difficult to understand.


 * $$\operatorname{pred} \equiv \lambda n.\lambda f.\lambda x. n\ (\lambda g.\lambda h. h\ (g\ f))\ (\lambda u. x)\ (\lambda u. u)$$

A Church numeral applies a function n times. The predecessor function must return a function that applies its parameter n - 1 times. This is achieved by building a container around f and x, which is initialized in a way that omits the application of the function the first time. See predecessor for a more detailed explanation.

The subtraction function can be written based on the predecessor function.


 * $$\operatorname{minus} \equiv \lambda m.\lambda n. (n \operatorname{pred})\ m$$

Derivation of predecessor function
The predecessor function used in the Church encoding is,


 * $$\operatorname{pred}(n) = \begin{cases} 0 & \mbox{if }n=0, \\ n-1 & \mbox{otherwise}\end{cases}$$.

We need a way of applying the function 1 fewer times to build the predecessor. A numeral $n$ applies the function $n$ $f$ times to $n$. The predecessor function must use the numeral $x$ to apply the function $n-1$ times.

Before implementing the predecessor function, here is a scheme that wraps the value in a container function. We will define new functions to use in place of $n$ and $f$, called $inc$ and $init$. The container function is called $value$. The left-hand side of the table shows a numeral $x$ applied to $inc$ and $init$.



\begin{array}{r|r|r} \text{Number} & \text{Using init} & \text{Using const} \\ \hline 0 & \operatorname{init} = \operatorname{value}\ x & \\ 1 & \operatorname{inc}\ \operatorname{init} = \operatorname{value}\ (f\ x) & \operatorname{inc}\ \operatorname{const} = \operatorname{value}\ x \\ 2 & \operatorname{inc}\ (\operatorname{inc}\ \operatorname{init}) = \operatorname{value}\ (f\ (f\ x)) & \operatorname{inc}\ (\operatorname{inc}\ \operatorname{const}) = \operatorname{value}\ (f\ x) \\ 3 & \operatorname{inc}\ (\operatorname{inc}\ (\operatorname{inc}\ \operatorname{init})) = \operatorname{value}\ (f\ (f\ (f\ x))) & \operatorname{inc}\ (\operatorname{inc}\ (\operatorname{inc}\ \operatorname{const})) = \operatorname{value}\ (f\ (f\ x)) \\ \vdots & \vdots & \vdots \\ n & n \operatorname{inc}\ \operatorname{init} = \operatorname{value}\ (f^n\ x) = \operatorname{value}\ (n\ f\ x) & n \operatorname{inc}\ \operatorname{const} = \operatorname{value}\ (f^{n-1}\ x) = \operatorname{value}\ ((n-1)\ f\ x) \\ \end{array} $$

The general recurrence rule is,
 * $$ \operatorname{inc}\ (\operatorname{value}\ v) = \operatorname{value}\ (f\ v)$$

If there is also a function to retrieve the value from the container (called $extract$),
 * $$ \operatorname{extract}\ (\operatorname{value}\ v) = v$$

Then $extract$ may be used to define the $samenum$ function as,
 * $$\operatorname{samenum} = \lambda n.\lambda f.\lambda x.\operatorname{extract}\ (n \operatorname{inc} \operatorname{init}) = \lambda n.\lambda f.\lambda x.\operatorname{extract}\ (\operatorname{value}\ (n\ f\ x)) = \lambda n.\lambda f.\lambda x.n\ f\ x = \lambda n.n$$

The $samenum$ function is not intrinsically useful. However, as $inc$ delegates calling of $n$ to its container argument, we can arrange that on the first application $inc$ receives a special container that ignores its argument allowing to skip the first application of $f$. Call this new initial container $const$. The right-hand side of the above table shows the expansions of $f$ $inc$ $const$. Then by replacing $init$ with $const$ in the expression for the $same$ function we get the predecessor function,


 * $$\operatorname{pred} = \lambda n.\lambda f.\lambda x.\operatorname{extract}\ (n \operatorname{inc} \operatorname{const}) = \lambda n.\lambda f.\lambda x.\operatorname{extract}\ (\operatorname{value}\ ((n-1)\ f\ x)) = \lambda n.\lambda f.\lambda x.(n-1)\ f\ x = \lambda n.(n-1)$$

As explained below the functions $inc$, $init$, $const$, $value$ and $extract$ may be defined as,
 * $$\begin{align}

\operatorname{value} &= \lambda v.(\lambda h.h\ v) \\ \operatorname{extract} k &= k\ \lambda u.u \\ \operatorname{inc} &= \lambda g.\lambda h.h\ (g\ f) \\ \operatorname{init} &= \lambda h.h\ x \\ \operatorname{const} &= \lambda u.x \end{align}$$

Which gives the lambda expression for $pred$ as,
 * $$\operatorname{pred} = \lambda n.\lambda f.\lambda x.n\ (\lambda g.\lambda h.h\ (g\ f))\ (\lambda u.x)\ (\lambda u.u) $$

{| class="wikitable"
 * align='top' |
 * align='top' |

Value container
The value container applies a function to its value. It is defined by,
 * $$ \operatorname{value}\ v\ h = h\ v $$

so,
 * $$ \operatorname{value} = \lambda v.(\lambda h.h\ v) $$

Inc
The $inc$ function should take a value containing $n$, and return a new value containing $v$.
 * $$ \operatorname{inc}\ (\operatorname{value}\ v) = \operatorname{value}\ (f\ v)$$

Letting g be the value container,
 * $$ g = \operatorname{value}\ v$$

then,
 * $$ g\ f = \operatorname{value}\ v\ f = f\ v$$

so,
 * $$ \operatorname{inc}\ g = \operatorname{value}\ (g\ f) $$
 * $$ \operatorname{inc} = \lambda g.\lambda h.h\ (g\ f) $$


 * align='top' |

Extract
The value may be extracted by applying the identity function,
 * $$ I = \lambda u.u $$

Using $f v$,
 * $$ \operatorname{value}\ v\ I = v $$

so,
 * $$ \operatorname{extract}\ k = k\ I $$

Const
To implement $pred$ the $init$ function is replaced with the $const$ that does not apply $I$. We need $const$ to satisfy,
 * $$ \operatorname{inc}\ \operatorname{const} = \operatorname{value}\ x $$
 * $$ \lambda h.h\ (\operatorname{const}\ f) = \lambda h.h\ x $$

Which is satisfied if,
 * $$ \operatorname{const}\ f = x $$

Or as a lambda expression,
 * $$ \operatorname{const} = \lambda u.x $$


 * }

Another way of defining pred
Pred may also be defined using pairs:
 * $$\begin{align}

\operatorname{f} =&\ \lambda p.\ \operatorname{pair}\ (\operatorname{second}\ p)\ (\operatorname{succ}\ (\operatorname{second}\ p)) \\ \operatorname{zero} =&\ (\lambda f.\lambda x.\ x) \\ \operatorname{pc0} =&\ \operatorname{pair}\ \operatorname{zero}\ \operatorname{zero} \\ \operatorname{pred} =&\ \lambda n.\ \operatorname{first}\ (n\ \operatorname{f}\ \operatorname{pc0}) \\ \end{align}$$

This is a simpler definition but leads to a more complex expression for pred. The expansion for $$\operatorname{pred} \operatorname{three}$$:
 * $$\begin{align}

\operatorname{pred} \operatorname{three} =&\ \operatorname{first}\ (\operatorname{f}\ (\operatorname{f}\ (\operatorname{f}\ (\operatorname{pair}\ \operatorname{zero}\ \operatorname{zero})))) \\ =&\ \operatorname{first}\ (\operatorname{f}\ (\operatorname{f}\ (\operatorname{pair}\ \operatorname{zero}\ \operatorname{one}))) \\ =&\ \operatorname{first}\ (\operatorname{f}\ (\operatorname{pair}\ \operatorname{one}\ \operatorname{two})) \\ =&\ \operatorname{first}\ (\operatorname{pair}\ \operatorname{two}\ \operatorname{three}) \\ =&\ \operatorname{two} \end{align}$$

Division
Division of natural numbers may be implemented by,


 * $$ n/m = \operatorname{if}\ n \ge m\ \operatorname{then}\ 1 + (n-m)/m\ \operatorname{else}\ 0 $$

Calculating $$n-m$$ takes many beta reductions. Unless doing the reduction by hand, this doesn't matter that much, but it is preferable to not have to do this calculation twice. The simplest predicate for testing numbers is IsZero so consider the condition.
 * $$ \operatorname{IsZero}\ (\operatorname{minus}\ n\ m) $$

But this condition is equivalent to $$ n \le m $$, not $$ n<m $$. If this expression is used then the mathematical definition of division given above is translated into function on Church numerals as,
 * $$ \operatorname{divide1}\ n\ m\ f\ x = (\lambda d.\operatorname{IsZero}\ d\ (0\ f\ x)\ (f\ (\operatorname{divide1}\ d\ m\ f\ x)))\ (\operatorname{minus}\ n\ m) $$

As desired, this definition has a single call to $$ \operatorname{minus}\ n\ m $$. However the result is that this formula gives the value of $$(n-1)/ m$$.

This problem may be corrected by adding 1 to n before calling divide. The definition of divide is then,
 * $$ \operatorname{divide}\ n = \operatorname{divide1}\ (\operatorname{succ}\ n) $$

divide1 is a recursive definition. The Y combinator may be used to implement the recursion. Create a new function called div by; to get,
 * In the left hand side $$ \operatorname{divide1} \rightarrow \operatorname{div} \ c$$
 * In the right hand side $$ \operatorname{divide1} \rightarrow c $$
 * $$ \operatorname{div} = \lambda c.\lambda n.\lambda m.\lambda f.\lambda x.(\lambda d.\operatorname{IsZero}\ d\ (0\ f\ x)\ (f\ (c\ d\ m\ f\ x)))\ (\operatorname{minus}\ n\ m) $$

Then,
 * $$ \operatorname{divide} = \lambda n.\operatorname{divide1}\ (\operatorname{succ}\ n) $$

where,
 * $$\begin{align}

\operatorname{divide1} &= Y\ \operatorname{div} \\ \operatorname{succ} &= \lambda n.\lambda f.\lambda x. f\ (n\ f\ x) \\ Y &= \lambda f.(\lambda x.f\ (x\ x))\ (\lambda x.f\ (x\ x)) \\ 0 &= \lambda f.\lambda x.x\\ \operatorname{IsZero} &= \lambda n.n\ (\lambda x.\operatorname{false})\ \operatorname{true} \end{align}$$
 * $$\begin{align}

\operatorname{true} &\equiv \lambda a.\lambda b.a\\ \operatorname{false} &\equiv \lambda a.\lambda b.b \end{align}$$
 * $$\begin{align}

\operatorname{minus} &= \lambda m.\lambda n.n \operatorname{pred} m\\ \operatorname{pred} &= \lambda n.\lambda f.\lambda x.n\ (\lambda g.\lambda h.h\ (g\ f))\ (\lambda u.x)\ (\lambda u.u) \end{align}$$ Gives,
 * $$\scriptstyle \operatorname{divide} = \lambda n.((\lambda f.(\lambda x.x\ x)\ (\lambda x.f\ (x\ x)))\ (\lambda c.\lambda n.\lambda m.\lambda f.\lambda x.(\lambda d.(\lambda n.n\ (\lambda x.(\lambda a.\lambda b.b))\ (\lambda a.\lambda b.a))\ d\ ((\lambda f.\lambda x.x)\ f\ x)\ (f\ (c\ d\ m\ f\ x)))\ ((\lambda m.\lambda n.n (\lambda n.\lambda f.\lambda x.n\ (\lambda g.\lambda h.h\ (g\ f))\ (\lambda u.x)\ (\lambda u.u)) m)\ n\ m)))\ ((\lambda n.\lambda f.\lambda x. f\ (n\ f\ x))\ n) $$

Or as text, using \ for $f$, divide = (\n.((\f.(\x.x x) (\x.f (x x))) (\c.\n.\m.\f.\x.(\d.(\n.n (\x.(\a.\b.b)) (\a.\b.a)) d ((\f.\x.x) f x) (f (c d m f x))) ((\m.\n.n (\n.\f.\x.n (\g.\h.h (g f)) (\u.x) (\u.u)) m) n m))) ((\n.\f.\x. f (n f x)) n))

For example, 9/3 is represented by divide (\f.\x.f (f (f (f (f (f (f (f (f x))))))))) (\f.\x.f (f (f x)))

Using a lambda calculus calculator, the above expression reduces to 3, using normal order. \f.\x.f (f (f (x)))

Signed numbers
One simple approach for extending Church Numerals to signed numbers is to use a Church pair, containing Church numerals representing a positive and a negative value. The integer value is the difference between the two Church numerals.

A natural number is converted to a signed number by,


 * $$\operatorname{convert}_s = \lambda x.\operatorname{pair}\ x\ 0 $$

Negation is performed by swapping the values.


 * $$\operatorname{neg}_s = \lambda x.\operatorname{pair}\ (\operatorname{second}\ x)\ (\operatorname{first}\ x) $$

The integer value is more naturally represented if one of the pair is zero. The OneZero function achieves this condition,


 * $$\operatorname{OneZero} = \lambda x.\operatorname{IsZero}\ (\operatorname{first}\ x)\ x\ (\operatorname{IsZero}\ (\operatorname{second}\ x)\ x\ (\operatorname{OneZero}\ \operatorname{pair}\ (\operatorname{pred}\ (\operatorname{first}\ x))\ (\operatorname{pred}\ (\operatorname{second}\ x))))$$

The recursion may be implemented using the Y combinator,
 * $$\operatorname{OneZ} = \lambda c.\lambda x.\operatorname{IsZero}\ (\operatorname{first}\ x)\ x\ (\operatorname{IsZero}\ (\operatorname{second}\ x)\ x\ (c\ \operatorname{pair}\ (\operatorname{pred}\ (\operatorname{first}\ x))\ (\operatorname{pred}\ (\operatorname{second}\ x))))$$


 * $$\operatorname{OneZero} = Y \operatorname{OneZ}$$

Plus and minus
Addition is defined mathematically on the pair by,
 * $$x + y = [x_p, x_n] + [y_p, y_n] = x_p - x_n + y_p - y_n = (x_p + y_p) - (x_n + y_n) = [x_p + y_p, x_n + y_n] $$

The last expression is translated into lambda calculus as,
 * $$\operatorname{plus}_s = \lambda x.\lambda y.\operatorname{OneZero}\ (\operatorname{pair}\ (\operatorname{plus}\ (\operatorname{first}\ x)\ (\operatorname{first}\ y))\ (\operatorname{plus}\ (\operatorname{second}\ x)\ (\operatorname{second}\ y))) $$

Similarly subtraction is defined,
 * $$x - y = [x_p, x_n] - [y_p, y_n] = x_p - x_n - y_p + y_n = (x_p + y_n) - (x_n + y_p) = [x_p + y_n, x_n + y_p] $$

giving,
 * $$\operatorname{minus}_s = \lambda x.\lambda y.\operatorname{OneZero}\ (\operatorname{pair}\ (\operatorname{plus}\ (\operatorname{first}\ x)\ (\operatorname{second}\ y))\ (\operatorname{plus}\ (\operatorname{second}\ x)\ (\operatorname{first}\ y))) $$

Multiply and divide
Multiplication may be defined by,
 * $$x*y = [x_p, x_n]*[y_p, y_n] =(x_p - x_n)*(y_p - y_n) = (x_p*y_p + x_n*y_n) - (x_p*y_n + x_n*y_p) = [x_p*y_p + x_n*y_n, x_p*y_n + x_n*y_p]$$

The last expression is translated into lambda calculus as,
 * $$\operatorname{mult}_s = \lambda x.\lambda y.\operatorname{pair}\

(\operatorname{plus}\ (\operatorname{mult}\ (\operatorname{first}\ x)\ (\operatorname{first}\ y))\ (\operatorname{mult}\ (\operatorname{second}\ x)\ (\operatorname{second}\ y)))\ (\operatorname{plus}\ (\operatorname{mult}\ (\operatorname{first}\ x)\ (\operatorname{second}\ y))\ (\operatorname{mult}\ (\operatorname{second}\ x)\ (\operatorname{first}\ y))) $$

A similar definition is given here for division, except in this definition, one value in each pair must be zero (see OneZero above). The divZ function allows us to ignore the value that has a zero component.
 * $$\operatorname{divZ} = \lambda x.\lambda y.\operatorname{IsZero}\ y\ 0 \ (\operatorname{divide}\ x\ y) $$

divZ is then used in the following formula, which is the same as for multiplication, but with mult replaced by divZ.


 * $$\operatorname{divide}_s = \lambda x.\lambda y.\operatorname{pair}\

(\operatorname{plus}\ (\operatorname{divZ}\ (\operatorname{first}\ x)\ (\operatorname{first}\ y))\ (\operatorname{divZ}\ (\operatorname{second}\ x)\ (\operatorname{second}\ y)))\ (\operatorname{plus}\ (\operatorname{divZ}\ (\operatorname{first}\ x)\ (\operatorname{second}\ y))\ (\operatorname{divZ}\ (\operatorname{second}\ x)\ (\operatorname{first}\ y))) $$

Rational and real numbers
Rational and computable real numbers may also be encoded in lambda calculus. Rational numbers may be encoded as a pair of signed numbers. Computable real numbers may be encoded by a limiting process that guarantees that the difference from the real value differs by a number which may be made as small as we need. The references given describe software that could, in theory, be translated into lambda calculus. Once real numbers are defined, complex numbers are naturally encoded as a pair of real numbers.

The data types and functions described above demonstrate that any data type or calculation may be encoded in lambda calculus. This is the Church–Turing thesis.

Translation with other representations
Most real-world languages have support for machine-native integers; the church and unchurch functions convert between nonnegative integers and their corresponding Church numerals. The functions are given here in Haskell, where the  corresponds to the λ of Lambda calculus. Implementations in other languages are similar.

Church Booleans
Church Booleans are the Church encoding of the Boolean values true and false. Some programming languages use these as an implementation model for Boolean arithmetic; examples are Smalltalk and Pico.

Boolean logic may be considered as a choice. The Church encoding of true and false are functions of two parameters:
 * true chooses the first parameter.
 * false chooses the second parameter.

The two definitions are known as Church Booleans:
 * $$\begin{align}

\operatorname{true} &\equiv \lambda a.\lambda b.a\\ \operatorname{false} &\equiv \lambda a.\lambda b.b \end{align}$$

This definition allows predicates (i.e. functions returning logical values) to directly act as if-clauses. A function returning a Boolean, which is then applied to two parameters, returns either the first or the second parameter:
 * $$\operatorname{predicate-}x\ \operatorname{then-clause}\ \operatorname{else-clause} $$

evaluates to then-clause if predicate-x evaluates to true, and to else-clause if predicate-x evaluates to false.

Because true and false choose the first or second parameter they may be combined to provide logic operators. Note that there are multiple possible implementations of not.
 * $$\begin{align}

\operatorname{and} &= \lambda p.\lambda q.p\ q\ p\\ \operatorname{or} &= \lambda p.\lambda q.p\ p\ q\\ \operatorname{not}_1 &= \lambda p.\lambda a.\lambda b.p\ b\ a\\ \operatorname{not}_2 &= \lambda p.p\ (\lambda a.\lambda b. b)\ (\lambda a.\lambda b. a) = \lambda p.p \operatorname{false} \operatorname{true}\\ \operatorname{xor} &= \lambda a.\lambda b.a\ (\operatorname{not}\ b)\ b\\ \operatorname{if} &= \lambda p.\lambda a.\lambda b.p\ a\ b \end{align}$$

Some examples:


 * $$\begin{align}

\operatorname{and} \operatorname{true} \operatorname{false} &= (\lambda p.\lambda q.p\ q\ p)\ \operatorname{true}\ \operatorname{false} = \operatorname{true} \operatorname{false} \operatorname{true} = (\lambda a.\lambda b.a) \operatorname{false} \operatorname{true} = \operatorname{false} \\ \operatorname{or} \operatorname{true} \operatorname{false} &= (\lambda p.\lambda q.p\ p\ q)\ (\lambda a.\lambda b.a)\ (\lambda a.\lambda b.b) = (\lambda a.\lambda b.a)\ (\lambda a.\lambda b.a)\ (\lambda a.\lambda b.b) = (\lambda a.\lambda b.a) = \operatorname{true} \\ \operatorname{not}_1\ \operatorname{true} &= (\lambda p.\lambda a.\lambda b.p\ b\ a) (\lambda a.\lambda b.a) = \lambda a.\lambda b.(\lambda a.\lambda b.a)\ b\ a = \lambda a.\lambda b.(\lambda c.b)\ a = \lambda a.\lambda b.b = \operatorname{false} \\ \operatorname{not}_2\ \operatorname{true} &= (\lambda p.p\ (\lambda a.\lambda b. b) (\lambda a.\lambda b. a)) (\lambda a.\lambda b. a) = (\lambda a.\lambda b. a) (\lambda a.\lambda b. b) (\lambda a.\lambda b. a) = (\lambda b. (\lambda a.\lambda b. b))\ (\lambda a.\lambda b. a) = \lambda a.\lambda b.b = \operatorname{false} \end{align}$$

Predicates
A predicate is a function that returns a Boolean value. The most fundamental predicate is $$\operatorname{IsZero}$$, which returns $$\operatorname{true}$$ if its argument is the Church numeral $$0$$, and $$\operatorname{false}$$ if its argument is any other Church numeral:
 * $$\operatorname{IsZero} = \lambda n.n\ (\lambda x.\operatorname{false})\ \operatorname{true}$$

The following predicate tests whether the first argument is less-than-or-equal-to the second:
 * $$\operatorname{LEQ} = \lambda m.\lambda n.\operatorname{IsZero}\ (\operatorname{minus}\ m\ n)$$,

Because of the identity,
 * $$ x = y \equiv (x \le y \land y \le x) $$

The test for equality may be implemented as,
 * $$\operatorname{EQ} = \lambda m.\lambda n.\operatorname{and}\ (\operatorname{LEQ}\ m\ n)\ (\operatorname{LEQ}\ n\ m) $$

Church pairs
Church pairs are the Church encoding of the pair (two-tuple) type. The pair is represented as a function that takes a function argument. When given its argument it will apply the argument to the two components of the pair. The definition in lambda calculus is,


 * $$\begin{align}

\operatorname{pair} &\equiv \lambda x.\lambda y.\lambda z.z\ x\ y \\ \operatorname{first} &\equiv \lambda p.p\ (\lambda x.\lambda y.x) \\ \operatorname{second} &\equiv \lambda p.p\ (\lambda x.\lambda y.y) \end{align} $$

For example,
 * $$\begin{align}

& \operatorname{first}\ (\operatorname{pair}\ a\ b) \\ = & (\lambda p.p\ (\lambda x.\lambda y.x))\ ((\lambda x.\lambda y.\lambda z.z\ x\ y)\ a\ b) \\ = & (\lambda p.p\ (\lambda x.\lambda y.x))\ (\lambda z.z\ a\ b) \\ = & (\lambda z.z\ a\ b)\ (\lambda x.\lambda y.x) \\ = & (\lambda x.\lambda y.x)\ a\ b = a \end{align} $$

List encodings
An (immutable) list is constructed from list nodes. The basic operations on the list are;

We give four different representations of lists below:
 * Build each list node from two pairs (to allow for empty lists).
 * Build each list node from one pair.
 * Represent the list using the right fold function.
 * Represent the list using Scott's encoding that takes cases of match expression as arguments

Two pairs as a list node
A nonempty list can be implemented by a Church pair;
 * First contains the head.
 * Second contains the tail.

However this does not give a representation of the empty list, because there is no "null" pointer. To represent null, the pair may be wrapped in another pair, giving three values:
 * First - the null pointer (empty list).
 * Second.First contains the head.
 * Second.Second contains the tail.

Using this idea the basic list operations can be defined like this:

In a nil node second is never accessed, provided that head and tail are only applied to nonempty lists.

One pair as a list node
Alternatively, define
 * $$\begin{align}

\operatorname{cons} &\equiv \operatorname{pair} \\ \operatorname{head} &\equiv \operatorname{first} \\ \operatorname{tail} &\equiv \operatorname{second} \\ \operatorname{nil}  &\equiv \operatorname{false} \\ \operatorname{isnil} &\equiv \lambda l.l (\lambda h.\lambda t.\lambda d.\operatorname{false}) \operatorname{true} \end{align}$$ where the last definition is a special case of the general
 * $$\operatorname{process-list} \equiv \lambda l.l (\lambda h.\lambda t.\lambda d. \operatorname{head-and-tail-clause}) \operatorname{nil-clause} $$

Represent the list using right fold
As an alternative to the encoding using Church pairs, a list can be encoded by identifying it with its right fold function. For example, a list of three elements x, y and z can be encoded by a higher-order function that when applied to a combinator c and a value n returns c x (c y (c z n)).



\begin{align} \operatorname{nil} &\equiv \lambda c.\lambda n.n\\ \operatorname{isnil} &\equiv \lambda l.l\ (\lambda h.\lambda t.\operatorname{false})\ \operatorname{true}\\ \operatorname{cons} &\equiv \lambda h.\lambda t.\lambda c.\lambda n.c\ h\ (t\ c\ n)\\ \operatorname{head} &\equiv \lambda l.l\ (\lambda h.\lambda t.h)\ \operatorname{false}\\

\operatorname{tail} &\equiv \lambda l.\lambda c.\lambda n.l\ (\lambda h.\lambda t.\lambda g.g\ h\ (t\ c))\ (\lambda t.n)\ (\lambda h.\lambda t.t) \end{align}$$

This list representation can be given type in System F.

Represent the list using Scott encoding
An alternative representation is Scott encoding, which uses the idea of continuations and can lead to simpler code. (see also Mogensen–Scott encoding).

In this approach, we use the fact that lists can be observed using pattern matching expression. For example, using Scala notation, if  denotes a value of type   with empty list   and constructor   we can inspect the list and compute   in case the list is empty and consCode(h, t) when the list is not empty: The list is given by how it acts upon nilCode and consCode. We therefore define a list as a function that accepts such nilCode and consCode as arguments, so that instead of the above pattern match we may simply write:

\operatorname{list}\ \operatorname{nilCode}\ \operatorname{consCode} $$

Let us denote by n the parameter corresponding to nilCode and by c the parameter corresponding to consCode. The empty list is the one that returns the nil argument:

\operatorname{nil} \equiv \lambda n. \lambda c.\ n $$ The non-empty list with head h and tail t is given by

\operatorname{cons}\ h\ t\ \ \equiv\ \ \lambda n.\lambda c.\ c\ h\ t $$

More generally, an algebraic data type with $$m$$ alternatives becomes a function with $$m$$ parameters. When the $$i$$th constructor has $$n_i$$ arguments, the corresponding parameter of the encoding takes $$n_i$$ arguments as well.

Scott encoding can be done in untyped lambda calculus, whereas its use with types requires a type system with recursion and type polymorphism. A list with element type E in this representation that is used to compute values of type C would have the following recursive type definition, where '=>' denotes function type: A list that can be used to compute arbitrary types would have a type that quantifies over. A list generic in   would also take   as the type argument.