Implementation of mathematics in set theory

This article examines the implementation of mathematical concepts in set theory. The implementation of a number of basic mathematical concepts is carried out in parallel in ZFC (the dominant set theory) and in NFU, the version of Quine's New Foundations shown to be consistent by R. B. Jensen in 1969 (here understood to include at least axioms of Infinity and Choice).

What is said here applies also to two families of set theories: on the one hand, a range of theories including Zermelo set theory near the lower end of the scale and going up to ZFC extended with large cardinal hypotheses such as "there is a measurable cardinal"; and on the other hand a hierarchy of extensions of NFU which is surveyed in the New Foundations article. These correspond to different general views of what the set-theoretical universe is like, and it is the approaches to implementation of mathematical concepts under these two general views that are being compared and contrasted.

It is not the primary aim of this article to say anything about the relative merits of these theories as foundations for mathematics. The reason for the use of two different set theories is to illustrate that multiple approaches to the implementation of mathematics are feasible. Precisely because of this approach, this article is not a source of "official" definitions for any mathematical concept.

Preliminaries
The following sections carry out certain constructions in the two theories ZFC and NFU and compare the resulting implementations of certain mathematical structures (such as the natural numbers).

Mathematical theories prove theorems (and nothing else). So saying that a theory allows the construction of a certain object means that it is a theorem of that theory that that object exists. This is a statement about a definition of the form "the x such that $$\phi$$ exists", where $$\phi$$ is a formula of our language: the theory proves the existence of "the x such that $$\phi$$" just in case it is a theorem that "there is one and only one x such that $$\phi$$". (See Bertrand Russell's theory of descriptions.) Loosely, the theory "defines" or "constructs" this object in this case. If the statement is not a theorem, the theory cannot show that the object exists; if the statement is provably false in the theory, it proves that the object cannot exist; loosely, the object cannot be constructed.

ZFC and NFU share the language of set theory, so the same formal definitions "the x such that $$\phi$$" can be contemplated in the two theories. A specific form of definition in the language of set theory is set-builder notation: $$\{x \mid \phi\}$$ means "the set A such that for all x, $$x \in A \leftrightarrow \phi$$" (A cannot be free in $$\phi$$). This notation admits certain conventional extensions: $$\{x \in B \mid \phi\}$$ is synonymous with $$\{x \mid x \in B \wedge \phi\}$$; $$\{f(x_1,\ldots,x_n) \mid \phi\}$$ is defined as $$\{z \mid \exists x_1,\ldots,x_n\,(z=f(x_1,\dots,x_n) \wedge \phi)\}$$, where $$f(x_1,\ldots,x_n)$$ is an expression already defined.

Expressions definable in set-builder notation make sense in both ZFC and NFU: it may be that both theories prove that a given definition succeeds, or that neither do (the expression $$\{x \mid x\not\in x\}$$ fails to refer to anything in any set theory with classical logic; in class theories like NBG this notation does refer to a class, but it is defined differently), or that one does and the other doesn't. Further, an object defined in the same way in ZFC and NFU may turn out to have different properties in the two theories (or there may be a difference in what can be proved where there is no provable difference between their properties).

Further, set theory imports concepts from other branches of mathematics (in intention, all branches of mathematics). In some cases, there are different ways to import the concepts into ZFC and NFU. For example, the usual definition of the first infinite ordinal $$\omega$$ in ZFC is not suitable for NFU because the object (defined in purely set theoretical language as the set of all finite von Neumann ordinals) cannot be shown to exist in NFU. The usual definition of $$\omega$$ in NFU is (in purely set theoretical language) the set of all infinite well-orderings all of whose proper initial segments are finite, an object which can be shown not to exist in ZFC. In the case of such imported objects, there may be different definitions, one for use in ZFC and related theories, and one for use in NFU and related theories. For such "implementations" of imported mathematical concepts to make sense, it is necessary to be able to show that the two parallel interpretations have the expected properties: for example, the implementations of the natural numbers in ZFC and NFU are different, but both are implementations of the same mathematical structure, because both include definitions for all the primitives of Peano arithmetic and satisfy (the translations of) the Peano axioms. It is then possible to compare what happens in the two theories as when only set theoretical language is in use, as long as the definitions appropriate to ZFC are understood to be used in the ZFC context and the definitions appropriate to NFU are understood to be used in the NFU context.

Whatever is proven to exist in a theory clearly provably exists in any extension of that theory; moreover, analysis of the proof that an object exists in a given theory may show that it exists in weaker versions of that theory (one may consider Zermelo set theory instead of ZFC for much of what is done in this article, for example).

Empty set, singleton, unordered pairs and tuples
These constructions appear first because they are the simplest constructions in set theory, not because they are the first constructions that come to mind in mathematics (though the notion of finite set is certainly fundamental). Even though NFU also allows the construction of set ur-elements yet to become members of a set, the empty set is the unique set with no members:
 * $$\left.\varnothing\right. \, \overset{\mathrm{def.}}{=} \left\{x : x \neq x\right\}$$

For each object $$x$$, there is a set $$\{x\}$$ with $$x$$ as its only element:
 * $$\left\{x\right\} \overset{\mathrm{def.}}{=} \left\{y : y = x\right\}$$

For objects $$x$$ and $$y$$, there is a set $$\{x,y\}$$ containing $$x$$ and $$y$$ as its only elements:
 * $$\left\{x,y\right\} \overset{\mathrm{def.}}{=} \left\{z : z=x \vee z = y\right\}$$

The union of two sets is defined in the usual way:
 * $$\left.x \cup y\right. \, \overset{\mathrm{def.}}{=} \left\{z : z \in x \vee z \in y\right\}$$

This is a recursive definition of unordered $$n$$-tuples for any concrete $$n$$ (finite sets given as lists of their elements:)
 * $$\left\{x_1, \ldots, x_n, x_{n+1}\right\} \overset{\mathrm{def.}}{=} \left\{x_1, \ldots, x_n\right\} \cup \left\{x_{n+1}\right\}$$

In NFU, all the set definitions given work by stratified comprehension; in ZFC, the existence of the unordered pair is given by the Axiom of Pairing, the existence of the empty set follows by Separation from the existence of any set, and the binary union of two sets exists by the axioms of Pairing and Union ($$x \cup y = \bigcup\{x,y\}$$).

Ordered pair
First, consider the ordered pair. The reason that this comes first is technical: ordered pairs are needed to implement relations and functions, which are needed to implement other concepts which may seem to be prior. The first definition of the ordered pair was the definition $$(x,y) \overset{\mathrm{def}}{=} \{\{\{x\},\emptyset\},\{\{y\}\}\}$$ proposed by Norbert Wiener in 1914 in the context of the type theory of Principia Mathematica. Wiener observed that this allowed the elimination of types of n-ary relations for n > 1 from the system of that work. It is more usual now to use the definition $$(x,y) \overset{\mathrm{def.}}{=} \{\{x\},\{x,y\}\}$$, due to Kuratowski. Either of these definitions works in either ZFC or NFU. In NFU, these two definitions have a technical disadvantage: the Kuratowski ordered pair is two types higher than its projections, while the Wiener ordered pair is three types higher. It is common to postulate the existence of a type-level ordered pair (a pair $$(x,y)$$ which is the same type as its projections) in NFU. It is convenient to use the Kuratowski pair in both systems until the use of type-level pairs can be formally justified. The internal details of these definitions have nothing to do with their actual mathematical function. For any notion $$(x,y)$$ of ordered pair, the thing that matters is that it satisfies the defining condition
 * $$(x,y)=(z,w) \ \equiv \ x=z \wedge y=w$$

…and that it be reasonably easy to collect ordered pairs into sets.

Relations
Relations are sets whose members are all ordered pairs. Where possible, a relation $$R$$ (understood as a binary predicate) is implemented as $$\{(x,y) \mid x R y\}$$ (which may be written as $$\{z \mid \pi_1(z) R \pi_2(z)\}$$). When $$R$$ is a relation, the notation $$xRy$$ means $$\left(x, y\right) \in R$$.

In ZFC, some relations (such as the general equality relation or subset relation on sets) are 'too large' to be sets (but may be harmlessly reified as proper classes). In NFU, some relations (such as the membership relation) are not sets because their definitions are not stratified: in $$\{(x,y) \mid x \in y\}$$, $$x$$ and $$y$$ would need to have the same type (because they appear as projections of the same pair), but also successive types (because $$x$$ is considered as an element of $$y$$).

Related definitions
Let $$R$$ and $$S$$ be given binary relations. Then the following concepts are useful:

The converse of $$R$$ is the relation $$\left\{\left(y, x\right) : xRy\right\}$$.

The domain of $$R$$ is the set $$\left\{x : \exists y \left(xRy\right)\right\}$$.

The range of $$R$$ is the domain of the converse of $$R$$. That is, the set $$\left\{y : \exists x \left(xRy\right)\right\}$$.

The field of $$R$$ is the union of the domain and range of $$R$$.

The preimage of a member $$x$$ of the field of $$R$$ is the set $$\left\{y : yRx\right\}$$ (used in the definition of 'well-founded' below.)

The downward closure of a member $$x$$ of the field of $$R$$ is the smallest set $$D$$ containing $$x$$, and containing each $$zRy$$ for each $$y \in D$$ (i.e., including the preimage of each of its elements with respect to $$R$$ as a subset.)

The relative product $$R|S$$ of $$R$$ and $$S$$ is the relation $$\left\{\left(x, z\right) : \exists y\,\left(xRy \wedge ySz\right)\right\}$$.

Notice that with our formal definition of a binary relation, the range and codomain of a relation are not distinguished. This could be done by representing a relation $$R$$ with codomain $$B$$ as $$\left(R, B\right)$$, but our development will not require this.

In ZFC, any relation whose domain is a subset of a set $$A$$ and whose range is a subset of a set $$B$$ will be a set, since the Cartesian product $$A \times B = \left\{\left(a, b\right) : a \in A \wedge b \in B\right\}$$ is a set (being a subclass of $$\mathcal{P}\!\left(A \cup B\right)$$), and Separation provides for the existence of $$\left\{\left(x, y\right) \in A \times B : xRy\right\}$$. In NFU, some relations with global scope (such as equality and subset) can be implemented as sets. In NFU, bear in mind that $$x$$ and $$y$$ are three types lower than $$R$$ in $$xRy$$ (one type lower if a type-level ordered pair is used).

Properties and kinds of relations
A binary relation $$R$$ is:
 * Reflexive if $$xRx$$ for every $$x$$ in the field of $$R$$.
 * Symmetric if $$\forall x, y \,(xRy \to yRx)$$.
 * Transitive if $$\forall x, y, z \,(xRy \wedge yRz \rightarrow xRz)$$.
 * Antisymmetric if $$\forall x, y \,(xRy \wedge yRx \rightarrow x=y)$$.
 * Well-founded if for every set $$S$$ which meets the field of $$R$$, $$\ \exists x \in S$$ whose preimage under $$R$$ does not meet $$S$$.
 * Extensional if for every $$x, y$$ in the field of $$R$$, $$x = y$$ if and only if $$x$$ and $$y$$ have the same preimage under $$R$$.

Relations having certain combinations of the above properties have standard names. A binary relation $$R$$ is:


 * An equivalence relation if $$R$$ is reflexive, symmetric, and transitive.
 * A partial order if $$R$$ is reflexive, antisymmetric, and transitive.
 * A linear order if $$R$$ is a partial order and for every $$x, y$$ in the field of $$R$$, either $$xRy$$ or $$yRx$$.
 * A well-ordering if $$R$$ is a linear order and well-founded.
 * A set picture if $$R$$ is well-founded and extensional, and the field of $$R$$ either equals the downward closure of one of its members (called its top element), or is empty.

Functions
A functional relation is a binary predicate $$F$$ such that $$\forall x, y, z\,\left(xFy \wedge xFz \to y = z\right).$$ Such a relation (predicate) is implemented as a relation (set) exactly as described in the previous section. So the predicate $$F$$ is implemented by the set $$\left\{\left(x, y\right) : xFy\right\}$$. A relation $$F$$ is a function if and only if $$\forall x, y, z\,\left(\left(x, y\right) \in F \wedge \left(x, z\right) \in F \to y = z\right).$$ It is therefore possible to define the value function $$F\!\left(x\right)$$ as the unique object $$y$$ such that $$xFy$$&thinsp;–&thinsp;i.e.: $$x$$ is $$F$$-related to $$y$$ such that the relation $$f$$ holds between $$x$$ and $$y$$&thinsp;–&thinsp;or as the unique object $$y$$ such that $$\left(x, y\right) \in F$$. The presence in both theories of functional predicates which are not sets makes it useful to allow the notation $$F\!\left(x\right)$$ both for sets $$F$$ and for important functional predicates. As long as one does not quantify over functions in the latter sense, all such uses are in principle eliminable.

Outside of formal set theory, we usually specify a function in terms of its domain and codomain, as in the phrase "Let $$f: A \to B$$ be a function". The domain of a function is just its domain as a relation, but we have not yet defined the codomain of a function. To do this we introduce the terminology that a function is from $$A$$ to $$B$$ if its domain equals $$A$$ and its range is contained in $$B$$. In this way, every function is a function from its domain to its range, and a function $$f$$ from $$A$$ to $$B$$ is also a function from $$A$$ to $$C$$ for any set $$C$$ containing $$B$$.

Indeed, no matter which set we consider to be the codomain of a function, the function does not change as a set since by definition it is just a set of ordered pairs. That is, a function does not determine its codomain by our definition. If one finds this unappealing then one can instead define a function as the ordered pair $$(f, B)$$, where $$f$$ is a functional relation and $$B$$ is its codomain, but we do not take this approach in this article (more elegantly, if one first defines ordered triples - for example as $$(x, y, z) = (x, (y, z))$$- then one could define a function as the ordered triple $$(f, A, B)$$ so as to also include the domain). Note that the same issue exists for relations: outside of formal set theory we usually say "Let $$R \subseteq A \times B$$ be a binary relation", but formally $$R$$ is a set of ordered pairs such that $$\text{dom}\,R \subseteq A$$ and $$\text{ran}\,R \subseteq B$$.

In NFU, $$x$$ has the same type as $$F\!\left(x\right)$$, and $$F$$ is three types higher than $$F\!\left(x\right)$$ (one type higher, if a type-level ordered pair is used). To solve this problem, one could define $$F\left[A\right]$$ as $$\left\{y : \exists x\,\left(x \in A \wedge y = F\!\left(x\right)\right)\right\}$$ for any set $$A$$, but this is more conveniently written as $$\left\{F\!\left(x\right) : x \in A\right\}$$. Then, if $$A$$ is a set and $$F$$ is any functional relation, the Axiom of Replacement assures that $$F\left[A\right]$$ is a set in ZFC. In NFU, $$F\left[A\right]$$ and $$A$$ now have the same type, and $$F$$ is two types higher than $$F\left[A\right]$$ (the same type, if a type-level ordered pair is used).

The function $$I$$ such that $$I\!\left(x\right) = x$$ is not a set in ZFC because it is "too large". $$I$$ is however a set in NFU. The function (predicate) $$S$$ such that $$S\!\left(x\right) = \left\{x\right\}$$ is neither a function nor a set in either theory; in ZFC, this is true because such a set would be too large, and, in NFU, this is true because its definition would not be stratified. Moreover, $$S$$ can be proved not to exist in NFU (see the resolution of Cantor's paradox in New Foundations.)

Operations on functions
Let $$f$$ and $$g$$ be arbitrary functions. The composition of $$f$$ and $$g$$, $$g \circ f$$, is defined as the relative product $$f\,|\,g$$, but only if this results in a function such that $$g \circ f$$ is also a function, with $$\left(g \circ f\right)\!\left(x\right) = g\!\left(f\!\left(x\right)\right)$$, if the range of $$f$$ is a subset of the domain of $$g$$. The inverse of $$f$$, $$f^\left(-1\right)$$, is defined as the converse of $$f$$ if this is a function. Given any set $$A$$, the identity function $$i_A$$ is the set $$\left\{\left(x, x\right) \mid x \in A\right\}$$, and this is a set in both ZFC and NFU for different reasons.

Special kinds of function
A function is an injective (also called one-to-one) if it has an inverse function.

A function $$f$$ from $$A$$ to $$B$$ is a:
 * Injection from $$A$$ to $$B$$ if the images under $$f$$ of distinct members of $$A$$ are distinct members of $$B$$.
 * Surjection from $$A$$ to $$B$$ if the range of $$f$$ is $$B$$.
 * Bijection from $$A$$ to $$B$$ if $$f$$ is both an injection and a surjection.

Defining functions as ordered pairs $$(f, B)$$ or ordered triples $$(f, A, B)$$ has the advantages that we do not have to introduce the terminology of being a function "from $$A$$ to $$B$$", and that we can speak of "being surjective" outright as opposed to only being able to speak of "being surjective onto $$B$$".

Size of sets
In both ZFC and NFU, two sets A and B are the same size (or are equinumerous) if and only if there is a bijection f from A to B. This can be written as $$|A|=|B|$$, but note that (for the moment) this expresses a relation between A and B rather than a relation between yet-undefined objects $$|A|$$ and $$|B|$$. Denote this relation by $$A \sim B$$ in contexts such as the actual definition of the cardinals where even the appearance of presupposing abstract cardinals should be avoided.

Similarly, define $$|A| \leq |B|$$ as holding if and only if there is an injection from A to B.

It is straightforward to show that the relation of equinumerousness is an equivalence relation: equinumerousness of A with A is witnessed by $$i_A$$; if f witnesses $$|A|=|B|$$, then $$f^{-1}$$ witnesses $$|B|=|A|$$; and if f witnesses $$|A|=|B|$$ and g witnesses $$|B|=|C|$$, then $$g\circ f$$ witnesses $$|A|=|C|$$.

It can be shown that $$|A| \leq |B|$$ is a linear order on abstract cardinals, but not on sets. Reflexivity is obvious and transitivity is proven just as for equinumerousness. The Schröder–Bernstein theorem, provable in ZFC and NFU in an entirely standard way, establishes that (this establishes antisymmetry on cardinals), and follows in a standard way in either theory from the axiom of choice.
 * $$|A| \leq |B| \wedge |B| \leq |A| \rightarrow |A| = |B|$$
 * $$|A| \leq |B| \vee |B| \leq |A|$$

Finite sets and natural numbers
Natural numbers can be considered either as finite ordinals or finite cardinals. Here consider them as finite cardinal numbers. This is the first place where a major difference between the implementations in ZFC and NFU becomes evident.

The Axiom of Infinity of ZFC tells us that there is a set A which contains $$\emptyset$$ and contains $$y \cup \{y\}$$ for each $$y \in A$$. This set A is not uniquely determined (it can be made larger while preserving this closure property): the set N of natural numbers is which is the intersection of all sets which contain the empty set and are closed under the "successor" operation $$y \mapsto y \cup \{y\}$$.
 * $$\{x \in A \mid \forall B\,(\emptyset \in B \wedge \forall y\,(y \in B \rightarrow y \cup \{y\} \in B) \rightarrow x \in B)\}$$

In ZFC, a set $$A$$ is finite if and only if there is $$n \in N$$ such that $$|n|=|A|$$: further, define $$|A|$$ as this n for finite A. (It can be proved that no two distinct natural numbers are the same size).

The usual operations of arithmetic can be defined recursively and in a style very similar to that in which the set of natural numbers itself is defined. For example, + (the addition operation on natural numbers) can be defined as the smallest set which contains $$((x,\emptyset),x)$$ for each natural number $$x$$ and contains $$((x,y \cup \{y\}),z \cup \{z\})$$ whenever it contains $$((x,y),z)$$.

In NFU, it is not obvious that this approach can be used, since the successor operation $$y \cup \{y\}$$ is unstratified and so the set N as defined above cannot be shown to exist in NFU (it is consistent for the set of finite von Neumann ordinals to exist in NFU, but this strengthens the theory, as the existence of this set implies the Axiom of Counting (for which see below or the New Foundations article)).

The standard definition of the natural numbers, which is actually the oldest set-theoretic definition of natural numbers, is as equivalence classes of finite sets under equinumerousness. Essentially the same definition is appropriate to NFU (this is not the usual definition, but the results are the same): define Fin, the set of finite sets, as
 * $$\{A \mid \forall F\,(\emptyset \in F \wedge \forall x,y\,(x \in F \rightarrow x \cup \{y\} \in F) \rightarrow A \in F)\}$$

For any set $$A \in Fin$$, define $$|A|$$ as $$\{B \mid A \sim B\}$$. Define N as the set $$\{|A| \mid A \in Fin\}$$.

The Axiom of Infinity of NFU can be expressed as $$V \not\in Fin$$: this is enough to establish that each natural number has a nonempty successor (the successor of $$|A|$$ being $$|A \cup \{x\}|$$ for any $$x \not\in A$$) which is the hard part of showing that the Peano axioms of arithmetic are satisfied.

The operations of arithmetic can be defined in a style similar to the style given above (using the definition of successor just given). They can also be defined in a natural set theoretical way: if A and B are disjoint finite sets, define |A|+|B| as $$|A \cup B|$$. More formally, define m+n for m and n in N as
 * $$\{A \mid \exists B,C\,(B \in m \wedge C \in n \wedge B \cap C = \emptyset \wedge A = B \cup C)\}$$

(But note that this style of definition is feasible for the ZFC numerals as well, but more circuitous: the form of the NFU definition facilitates set manipulations while the form of the ZFC definition facilitates recursive definitions, but either theory supports either style of definition).

The two implementations are quite different. In ZFC, choose a representative of each finite cardinality (the equivalence classes themselves are too large to be sets); in NFU the equivalence classes themselves are sets, and are thus an obvious choice for objects to stand in for the cardinalities. However, the arithmetic of the two theories is identical: the same abstraction is implemented by these two superficially different approaches.

Equivalence relations and partitions
A general technique for implementing abstractions in set theory is the use of equivalence classes. If an equivalence relation R tells us that elements of its field A are alike in some particular respect, then for any set x, regard the set $$[x]_R=\{y \in A \mid x R y\}$$ as representing an abstraction from the set x respecting just those features (identify elements of A up to R).

For any set A, a set $$P$$ is a partition of A if all elements of P are nonempty, any two distinct elements of P are disjoint, and $$A=\bigcup P$$.

For every equivalence relation R with field A, $$\{[x]_R \mid x \in A\}$$ is a partition of A. Moreover, each partition P of A determines an equivalence relation $$\{(x,y) \mid \exists A \in P\,(x \in A \wedge y \in A)\}$$.

This technique has limitations in both ZFC and NFU. In ZFC, since the universe is not a set, it seems possible to abstract features only from elements of small domains. This can be circumvented using a trick due to Dana Scott: if R is an equivalence relation on the universe, define $$[x]_R$$ as the set of all y such that $$y R x$$ and the rank of y is less than or equal to the rank of any $$z R x$$. This works because the ranks are sets. Of course, there still may be a proper class of $$[x]_R$$'s. In NFU, the main difficulty is that $$[x]_R$$ is one type higher than x, so for example the "map" $$x \mapsto [x]_R$$ is not in general a (set) function (though $$\{x\} \mapsto [x]_R$$ is a set). This can be circumvented by the use of the Axiom of Choice to select a representative from each equivalence class to replace $$[x]_R$$, which will be at the same type as x, or by choosing a canonical representative if there is a way to do this without invoking Choice (the use of representatives is hardly unknown in ZFC, either). In NFU, the use of equivalence class constructions to abstract properties of general sets is more common, as for example in the definitions of cardinal and ordinal number below.

Ordinal numbers
Two well-orderings $$W_1$$ and $$W_2$$ are similar and write $$W_1 \sim W_2$$ just in case there is a bijection f from the field of $$W_1$$ to the field of $$W_2$$ such that $$x W_1 y \leftrightarrow f(x)W_2f(y)$$ for all x and y.

Similarity is shown to be an equivalence relation in much the same way that equinumerousness was shown to be an equivalence relation above.

In New Foundations (NFU), the order type of a well-ordering W is the set of all well-orderings which are similar to W. The set of ordinal numbers is the set of all order types of well-orderings.

This does not work in ZFC, because the equivalence classes are too large. It would be formally possible to use Scott's trick to define the ordinals in essentially the same way, but a device of von Neumann is more commonly used.

For any partial order $$\leq$$, the corresponding strict partial order &lt; is defined as $$\{(x,y) \mid x \leq y \wedge x \neq y\}$$. Strict linear orders and strict well-orderings are defined similarly.

A set A is said to be transitive if $$\bigcup A \subseteq A$$: each element of an element of A is also an element of A. A (von Neumann) ordinal is a transitive set on which membership is a strict well-ordering.

In ZFC, the order type of a well-ordering W is then defined as the unique von Neumann ordinal which is equinumerous with the field of W and membership on which is isomorphic to the strict well-ordering associated with W. (the equinumerousness condition distinguishes between well-orderings with fields of size 0 and 1, whose associated strict well-orderings are indistinguishable).

In ZFC there cannot be a set of all ordinals. In fact, the von Neumann ordinals are an inconsistent totality in any set theory: it can be shown with modest set theoretical assumptions that every element of a von Neumann ordinal is a von Neumann ordinal and the von Neumann ordinals are strictly well-ordered by membership. It follows that the class of von Neumann ordinals would be a von Neumann ordinal if it were a set: but it would then be an element of itself, which contradicts the fact that membership is a strict well-ordering of the von Neumann ordinals.

The existence of order types for all well-orderings is not a theorem of Zermelo set theory: it requires the Axiom of replacement. Even Scott's trick cannot be used in Zermelo set theory without an additional assumption (such as the assumption that every set belongs to a rank which is a set, which does not essentially strengthen Zermelo set theory but is not a theorem of that theory).

In NFU, the collection of all ordinals is a set by stratified comprehension. The Burali-Forti paradox is evaded in an unexpected way. There is a natural order on the ordinals defined by $$\alpha\leq \beta$$ if and only if some (and so any) $$W_1 \in \alpha$$ is similar to an initial segment of some (and so any) $$W_2\in \beta$$. Further, it can be shown that this natural order is a well-ordering of the ordinals and so must have an order type $$\Omega$$. It would seem that the order type of the ordinals less than $$\Omega$$ with the natural order would be $$\Omega$$, contradicting the fact that $$\Omega$$ is the order type of the entire natural order on the ordinals (and so not of any of its proper initial segments). But this relies on one's intuition (correct in ZFC) that the order type of the natural order on the ordinals less than $$\alpha$$ is $$\alpha$$ for any ordinal $$\alpha$$. This assertion is unstratified, because the type of the second $$\alpha$$ is four higher than the type of the first (two higher if a type level pair is used). The assertion which is true and provable in NFU is that the order type of the natural order on the ordinals less than $$\alpha$$ is $$T^4(\alpha)$$ for any ordinal $$\alpha$$, where $$T(\alpha)$$ is the order type of $$W^{\iota}=\{(\{x\},\{y\})\mid xWy\}$$ for any $$W \in \alpha$$ (it is easy to show that this does not depend on the choice of W; note that T raises type by one). Thus the order type of the ordinals less than $$\Omega$$ with the natural order is $$T^4(\Omega)$$, and $$T^4(\Omega)<\Omega$$. All uses of $$T^4$$ here can be replaced with $$T^2$$ if a type-level pair is used.

This shows that the T operation is nontrivial, which has a number of consequences. It follows immediately that the singleton map $$x \mapsto \{x\}$$ is not a set, as otherwise restrictions of this map would establish the similarity of W and $$W^{\iota}$$ for any well-ordering W. T is (externally) bijective and order-preserving. Because of this, the fact $$T^4(\Omega)<\Omega$$ establishes that $$\Omega > T(\Omega) > T^2(\Omega) \ldots$$ is a "descending sequence" in the ordinals which cannot be a set.

Ordinals fixed by T are called Cantorian ordinals, and ordinals which dominate only cantorian ordinals (which are easily shown to be cantorian themselves) are said to be strongly cantorian. There can be no set of cantorian ordinals or set of strongly cantorian ordinals.

Digression: von Neumann ordinals in NFU
It is possible to reason about von Neumann ordinals in NFU. Recall that a von Neumann ordinal is a transitive set A such that the restriction of membership to A is a strict well-ordering. This is quite a strong condition in the NFU context, since the membership relation involves a difference of type. A von Neumann ordinal A is not an ordinal in the sense of NFU, but $$\in\lceil A$$ belongs to an ordinal $$\alpha$$ which may be termed the order type of (membership on) A. It is easy to show that the order type of a von Neumann ordinal A is cantorian: for any well-ordering W of order type $$\alpha$$, the induced well-ordering of initial segments of W by inclusion has order type $$T(\alpha)$$ (it is one type higher, thus the application of T): but the order types of the well-ordering of a von Neumann ordinal A by membership and the well-ordering of its initial segments by inclusion are clearly the same because the two well-orderings are actually the same relation, so the order type of A is fixed under T. Moreover, the same argument applies to any smaller ordinal (which will be the order type of an initial segment of A, also a von Neumann ordinal) so the order type of any von Neumann ordinal is strongly cantorian.

The only von Neumann ordinals which can be shown to exist in NFU without additional assumptions are the concrete finite ones. However, the application of a permutation method can convert any model of NFU to a model in which every strongly cantorian ordinal is the order type of a von Neumann ordinal. This suggests that the concept "strongly cantorian ordinal of NFU" might be a better analogue to "ordinal of ZFC" than is the apparent analogue "ordinal of NFU".

Cardinal numbers
Cardinal numbers are defined in NFU in a way which generalizes the definition of natural number: for any set A, $$|A| \,\overset{\mathrm{def}}{=} \left\{B \mid B \sim A\right\}$$.

In ZFC, these equivalence classes are too large as usual. Scott's trick could be used (and indeed is used in ZF), $$|A|$$ is usually defined as the smallest order type (here a von Neumann ordinal) of a well-ordering of A (that every set can be well-ordered follows from the Axiom of Choice in the usual way in both theories).

The natural order on cardinal numbers is seen to be a well-ordering: that it is reflexive, antisymmetric (on abstract cardinals, which are now available) and transitive has been shown above. That it is a linear order follows from the Axiom of Choice: well-order two sets and an initial segment of one well-ordering will be isomorphic to the other, so one set will have cardinality smaller than that of the other. That it is a well-ordering follows from the Axiom of Choice in a similar way.

With each infinite cardinal, many order types are associated for the usual reasons (in either set theory).

Cantor's theorem shows (in both theories) that there are nontrivial distinctions between infinite cardinal numbers. In ZFC, one proves $$|A|<|P(A)|.$$ In NFU, the usual form of Cantor's theorem is false (consider the case A=V), but Cantor's theorem is an ill-typed statement. The correct form of the theorem in NFU is $$|P_1(A)|<|P(A)|$$, where $$P_1(A)$$ is the set of one-element subsets of A. $$|P_1(V)|<|P(V)|$$ shows that there are "fewer" singletons than sets (the obvious bijection $$x \mapsto \{x\}$$ from $$P_1(V)$$ to V has already been seen not to be a set). It is actually provable in NFU + Choice that $$|P_1(V)|<|P(V)|\ll|V|$$ (where $$\ll$$ signals the existence of many intervening cardinals; there are many, many urelements!). Define a type-raising T operation on cardinals analogous to the T operation on ordinals: $$T(|A|) = |P_1(A)|$$; this is an external endomorphism of the cardinals just as the T operation on ordinals is an external endomorphism of the ordinals.

A set A is said to be cantorian just in case $$|A| = |P_1(A)| = T(|A|)$$; the cardinal $$|A|$$ is also said to be a cantorian cardinal. A set A is said to be strongly cantorian (and its cardinal to be strongly cantorian as well) just in case the restriction of the singleton map to A ($$(x \mapsto \{x\})\lceil A$$) is a set. Well-orderings of strongly cantorian sets are always strongly cantorian ordinals; this is not always true of well-orderings of cantorian sets (though the shortest well-ordering of a cantorian set will be cantorian). A cantorian set is a set which satisfies the usual form of Cantor's theorem.

The operations of cardinal arithmetic are defined in a set-theoretically motivated way in both theories. $$|A| + |B| = \{C \cup D \mid C \sim A \wedge D \sim B \wedge C \cap D = \emptyset\}$$. One would like to define $$|A|\cdot|B|$$ as $$|A \times B|$$, and one does this in ZFC, but there is an obstruction in NFU when using the Kuratowski pair: one defines $$|A|\cdot|B|$$ as $$T^{-2}(|A \times B|)$$ because of the type displacement of 2 between the pair and its projections, which implies a type displacement of two between a cartesian product and its factors. It is straightforward to prove that the product always exists (but requires attention because the inverse of T is not total).

Defining the exponential operation on cardinals requires T in an essential way: if $$B^A$$ was defined as the collection of functions from A to B, this is three types higher than A or B, so it is reasonable to define $$|B|^{|A|}$$ as $$T^{-3}(|B^A|)$$ so that it is the same type as A or B ($$T^{-1}$$ replaces $$T^{-3}$$ with type-level pairs). An effect of this is that the exponential operation is partial: for example, $$2^{|V|}$$ is undefined. In ZFC one defines $$|B|^{|A|}$$ as $$|B^A|$$ without difficulty.

The exponential operation is total and behaves exactly as expected on cantorian cardinals, since T fixes such cardinals and it is easy to show that a function space between cantorian sets is cantorian (as are power sets, cartesian products, and other usual type constructors). This offers further encouragement to the view that the "standard" cardinalities in NFU are the cantorian (indeed, the strongly cantorian) cardinalities, just as the "standard" ordinals seem to be the strongly cantorian ordinals.

Now the usual theorems of cardinal arithmetic with the axiom of choice can be proved, including $$\kappa \cdot \kappa = \kappa$$. From the case $$|V|\cdot |V| = |V|$$ the existence of a type level ordered pair can be derived: $$|V| \cdot |V| = T^{-2}(|V \times V|)$$ is equal to $$|V|$$ just in case $$|V \times V| = T^2(|V|) = |P_1^2(V)|$$, which would be witnessed by a one-to-one correspondence between Kuratowski pairs $$(a,b)$$ and double singletons $$\{\{c\}\}$$: redefine $$(a,b)$$ as the c such that $$\{\{c\}\}$$ is associated with the Kuratowski $$(a,b)$$: this is a type-level notion of ordered pair.

The Axiom of Counting and subversion of stratification
So there are two different implementations of the natural numbers in NFU (though they are the same in ZFC): finite ordinals and finite cardinals. Each of these supports a T operation in NFU (basically the same operation). It is easy to prove that $$T(n)$$ is a natural number if n is a natural number in NFU + Infinity + Choice (and so $$|N|$$ and the first infinite ordinal $$\omega$$ are cantorian) but it is not possible to prove in this theory that $$T(n)=n$$. However, common sense indicates that this should be true, and so it can be adopted as an axiom: One natural consequence of this axiom (and indeed its original formulation) is All that can be proved in NFU without Counting is $$|\{1,\ldots,n\}| = T^2(n)$$.
 * Rosser's Axiom of Counting: For each natural number n, $$T(n)=n$$.
 * $$|\{1,\ldots,n\}| = n$$ for each natural number n.

A consequence of Counting is that N is a strongly cantorian set (again, this is an equivalent assertion).

Properties of strongly cantorian sets
The type of any variable restricted to a strongly cantorian set A can be raised or lowered as desired by replacing references to $$a \in A$$ with references to $$\bigcup f(a)$$ (type of a raised; this presupposes that it is known that a is a set; otherwise one must say "the element of $$f(a)$$" to get this effect) or $$f^{-1}(\{a\})$$ (type of a lowered) where $$f(a) = \{a\}$$ for all $$a \in A$$, so it is not necessary to assign types to such variables for purposes of stratification.

Any subset of a strongly cantorian set is strongly cantorian. The power set of a strongly cantorian set is strongly cantorian. The cartesian product of two strongly cantorian sets is strongly cantorian.

Introducing the Axiom of Counting means that types need not be assigned to variables restricted to N or to P(N), R (the set of reals) or indeed any set ever considered in classical mathematics outside of set theory.

There are no analogous phenomena in ZFC. See the main New Foundations article for stronger axioms that can be adjoined to NFU to enforce "standard" behavior of familiar mathematical objects.

Familiar number systems: positive rationals, magnitudes, and reals
Represent positive fractions as pairs of positive natural numbers (0 is excluded): $$\frac pq$$ is represented by the pair $$(p,q)$$. To make $$\frac pq = \frac rs \leftrightarrow ps=qr$$, introduce the relation $$\sim$$ defined by $$(p,q)\sim (r,s) \leftrightarrow ps=qr$$. It is provable that this is an equivalence relation: define positive rational numbers as equivalence classes of pairs of positive natural numbers under this relation. Arithmetic operations on positive rational numbers and the order relation on positive rationals are defined just as in elementary school and proved (with some effort) to have the expected properties.

Represent magnitudes (positive reals) as nonempty proper initial segments of the positive rationals with no largest element. The operations of addition and multiplication on magnitudes are implemented by elementwise addition of the positive rational elements of the magnitudes. Order is implemented as set inclusion.

Represent real numbers as differences $$m-n$$ of magnitudes: formally speaking, a real number is an equivalence class of pairs $$(m,n)$$ of magnitudes under the equivalence relation $$\sim$$ defined by $$(m,n) \sim (r,s) \leftrightarrow m+s = n+r$$. The operations of addition and multiplication on real numbers are defined just as one would expect from the algebraic rules for adding and multiplying differences. The treatment of order is also as in elementary algebra.

This is the briefest sketch of the constructions. Note that the constructions are exactly the same in ZFC and in NFU, except for the difference in the constructions of the natural numbers: since all variables are restricted to strongly cantorian sets, there is no need to worry about stratification restrictions. Without the Axiom of Counting, it might be necessary to introduce some applications of T in a full discussion of these constructions.

Operations on indexed families of sets
In this class of constructions it appears that ZFC has an advantage over NFU: though the constructions are clearly feasible in NFU, they are more complicated than in ZFC for reasons having to do with stratification.

Throughout this section assume a type-level ordered pair. Define $$(x_1,x_2,\ldots,x_n)$$ as $$(x_1,(x_2,\ldots,x_n))$$. The definition of the general n-tuple using the Kuratowski pair is trickier, as one needs to keep the types of all the projections the same, and the type displacement between the n-tuple and its projections increases as n increases. Here, the n-tuple has the same type as each of its projections.

General cartesian products are defined similarly: $$A_1 \times A_2 \times \ldots \times A_n = A_1 \times (A_2 \times \ldots \times A_n)$$

The definitions are the same in ZFC but without any worries about stratification (the grouping given here is opposite to that more usually used, but this is easily corrected for).

Now consider the infinite cartesian product $$\Pi_{i \in I}A_i$$. In ZFC, this is defined as the set of all functions f with domain I such that $$f(i) \in A_i$$ (where A is implicitly understood as a function taking each i to $$A_i$$).

In NFU, this is requires attention to type. Given a set I and set valued function A whose value at $$\{i\}$$ in $$P_1(I)$$ is written $$A_i$$, Define $$\Pi_{i \in I}A_i$$ as the set of all functions f with domain I such that $$f(i) \in A_i$$: notice that $$f(i) \in A_i = A(\{i\})$$ is stratified because of our convention that A is a function with values at singletons of the indices. Note that the very largest families of sets (which cannot be indexed by sets of singletons) will not have cartesian products under this definition. Note further that the sets $$A_i$$ are at the same type as the index set I (since one type higher than its elements); the product, as a set of functions with domain I (so at the same type as I) is one type higher (assuming a type-level ordered pair).

Now consider the product $$\Pi_{i \in I}|A_i|$$ of the cardinals of these sets. The cardinality |$$\Pi_{i \in I}A_i$$| is one type higher than the cardinals $$|A_i|$$, so the correct definition of the infinite product of cardinals is $$T^{-1}(|\Pi_{i \in I}A_i|)$$ (because the inverse of T is not total, it is possible that this may not exist).

Repeat this for disjoint unions of families of sets and sums of families of cardinals. Again, let A be a set-valued function with domain $$P_1(I)$$: write $$A_i$$ for $$A(\{i\})$$. The disjoint union $$\Sigma_{i \in I}A_i$$ is the set $$\{(i,a) \mid a \in A_i\}$$. This set is at the same type as the sets $$A_i$$.

The correct definition of the sum $$\Sigma_{i \in I}|A_i|$$ is thus $$|\Sigma_{i \in I}A_i|$$, since there is no type displacement.

It is possible to extend these definitions to handle index sets which are not sets of singletons, but this introduces an additional type level and is not needed for most purposes.

In ZFC, define the disjoint union $$\Sigma_{i \in I}A_i$$ as $$\{(i,a) \mid a \in A_i\}$$, where $$A_i$$ abbreviates $$A(i)$$.

Permutation methods can be used to show relative consistency with NFU of the assertion that for every strongly cantorian set A there is a set I of the same size whose elements are self-singletons: $$i = \{i\}$$ for each i in I.

The cumulative hierarchy
In ZFC, define the cumulative hierarchy as the ordinal-indexed sequence of sets satisfying the following conditions: $$V_0 = \emptyset$$; $$V_{\alpha+1} = P(V_{\alpha})$$; $$V_{\lambda} = \bigcup\{V_{\beta} \mid \beta<\lambda\}$$ for limit ordinals $$\lambda$$. This is an example of a construction by transfinite recursion. The rank of a set A is said to be $$\alpha$$ if and only if $$A \in V_{\alpha+1}-V_{\alpha}$$. The existence of the ranks as sets depends on the axiom of replacement at each limit step (the hierarchy cannot be constructed in Zermelo set theory); by the axiom of foundation, every set belongs to some rank.

The cardinal $$|P(V_{\omega + \alpha})|$$ is called $$\beth_{\alpha}$$.

This construction cannot be carried out in NFU because the power set operation is not a set function in NFU ($$P(A)$$ is one type higher than A for purposes of stratification).

The sequence of cardinals $$\beth_{\alpha}$$ can be implemented in NFU. Recall that $$2^{|A|}$$ is defined as $$T^{-1}(|\{0,1\}^A|)$$, where $$\{0,1\}$$ is a convenient set of size 2, and $$|\{0,1\}^A|=|P(A)|$$. Let $$\beth$$ be the smallest set of cardinals which contains $$|N|$$ (the cardinality of the set of natural numbers), contains the cardinal $$2^{|A|}$$ whenever it contains $$|A|$$, and which is closed under suprema of sets of cardinals.

A convention for ordinal indexing of any well-ordering $$W_\alpha$$ is defined as the element x of the field of $$W$$ such that the order type of the restriction of $$W$$ to $$\{y \mid y W x\}$$ is $$\alpha$$; then define $$\beth_{\alpha}$$ as the element with index $$\alpha$$ in the natural order on the elements of $$\beth$$. The cardinal $$\aleph_{\alpha}$$ is the element with index $$\alpha$$ in the natural order on all infinite cardinals (which is a well-ordering, see above). Note that $$\aleph_0 = |N|$$ follows immediately from this definition. In all these constructions, notice that the type of the index $$\alpha$$ is two higher (with type-level ordered pair) than the type of $$W_{\alpha}$$.

Each set A of ZFC has a transitive closure $$TC(A)$$ (the intersection of all transitive sets which contains A). By the axiom of foundation, the restriction of the membership relation to the transitive closure of A is a well-founded relation. The relation $$\in \lceil TC(A)$$ is either empty or has A as its top element, so this relation is a set picture. It can be proved in ZFC that every set picture is isomorphic to some $$\in \lceil TC(A)$$.

This suggests that (an initial segment of) the cumulative hierarchy can be studied by considering the isomorphism classes of set pictures. These isomorphism classes are sets and make up a set in NFU. There is a natural set relation analogous to membership on isomorphism classes of set pictures: if $$x$$ is a set picture, write $$[x]$$ for its isomorphism class and define $$[x] E [y]$$ as holding if $$[x]$$ is the isomorphism class of the restriction of y to the downward closure of one of the elements of the preimage under y of the top element of y. The relation E is a set relation, and it is straightforward to prove that it is well-founded and extensional. If the definition of E is confusing, it can be deduced from the observation that it is induced by precisely the relationship which holds between the set picture associated with A and the set picture associated with B when $$A \in B$$ in the usual set theory.

There is a T operation on isomorphism classes of set pictures analogous to the T operation on ordinals: if x is a set picture, so is $$x^{\iota} = \{(\{a\},\{b\})\mid (a,b) \in x\}$$. Define $$T([x])$$ as $$[x^{\iota}]$$. It is easy to see that $$[x]E[y] \leftrightarrow T([x])=T([y])$$.

An axiom of extensionality for this simulated set theory follows from E's extensionality. From its well-foundedness follows an axiom of foundation. There remains the question of what comprehension axiom E may have. Consider any collection of set pictures $$\{x^{\iota}\mid x \in S\}$$ (collection of set pictures whose fields are made up entirely of singletons). Since each $$x^{\iota}$$ is one type higher than x (using a type-level ordered pair), replacing each element $$\{a\}$$ of the field of each $$x^{\iota}$$ in the collection with $$(x,\{a\})$$ results in a collection of set pictures isomorphic to the original collection but with their fields disjoint. The union of these set pictures with a new top element yields a set picture whose isomorphism type will have as its preimages under E exactly the elements of the original collection. That is, for any collection of isomorphism types $$[x^{\iota}] = T([x])$$, there is an isomorphism type $$[y]$$ whose preimage under E is exactly this collection.

In particular, there will be an isomorphism type [v] whose preimage under E is the collection of all T[x]'s (including T[v]). Since T[v] E v and E is well-founded, $$T[v] \neq v$$. This resembles the resolution of the Burali–Forti paradox discussed above and in the New Foundations article, and is in fact the local resolution of Mirimanoff's paradox of the set of all well-founded sets.

There are ranks of isomorphism classes of set pictures just as there are ranks of sets in the usual set theory. For any collection of set pictures A, define S(A) as the set of all isomorphism classes of set pictures whose preimage under E is a subset of A; call A a "complete" set if every subset of A is a preimage under E. The collection of "ranks" is the smallest collection containing the empty set and closed under the S operation (which is a kind of power set construction) and under unions of its subcollections. It is straightforward to prove (much as in the usual set theory) that the ranks are well-ordered by inclusion, and so the ranks have an index in this well-order: refer to the rank with index $$\alpha$$ as $$R_{\alpha}$$. It is provable that $$|R_{\alpha}|=\beth_{\alpha}$$ for complete ranks $$R_{\alpha}$$. The union of the complete ranks (which will be the first incomplete rank) with the relation E looks like an initial segment of the universe of Zermelo-style set theory (not necessarily like the full universe of ZFC because it may not be large enough). It is provable that if $$R_{\alpha}$$ is the first incomplete rank, then $$R_{T(\alpha)}$$ is a complete rank and thus $$T(\alpha)<\alpha$$. So there is a "rank of the cumulative hierarchy" with an "external automorphism" T moving the rank downward, exactly the condition on a nonstandard model of a rank in the cumulative hierarchy under which a model of NFU is constructed in the New Foundations article. There are technical details to verify, but there is an interpretation not only of a fragment of ZFC but of NFU itself in this structure, with $$[x]\in_{NFU}[y]$$ defined as $$T([x]) E [y] \wedge [y] \in R_{T(\alpha)+1}$$: this "relation" $$E_{NFU}$$ is not a set relation but has the same type displacement between its arguments as the usual membership relation $$\in$$.

So there is a natural construction inside NFU of the cumulative hierarchy of sets which internalizes the natural construction of a model of NFU in Zermelo-style set theory.

Under the Axiom of Cantorian Sets described in the New Foundations article, the strongly cantorian part of the set of isomorphism classes of set pictures with the E relation as membership becomes a (proper class) model of ZFC (in which there are n-Mahlo cardinals for each n; this extension of NFU is strictly stronger than ZFC). This is a proper class model because the strongly cantorian isomorphism classes do not make up a set.

Permutation methods can be used to create from any model of NFU a model in which every strongly cantorian isomorphism type of set pictures is actually realized as the restriction of the true membership relation to the transitive closure of a set.