New Foundations

In mathematical logic, New Foundations (NF) is a non-well-founded, finitely axiomatizable set theory conceived by Willard Van Orman Quine as a simplification of the theory of types of Principia Mathematica.

Definition
The well-formed formulas of NF are the standard formulas of propositional calculus with two primitive predicates equality ($$=$$) and membership ($$\in$$). NF can be presented with only two axiom schemata:
 * Extensionality: Two objects with the same elements are the same object; formally, given any set A and any set B, if for every set X, X is a member of A if and only if X is a member of B, then A is equal to B.
 * A restricted axiom schema of comprehension: $$\{x \mid \phi \}$$ exists for each stratified formula $$\phi$$.

A formula $$\phi$$ is said to be stratified if there exists a function f from pieces of $$\phi$$'s syntax to the natural numbers, such that for any atomic subformula $$x \in y$$ of $$\phi$$ we have f(y) = f(x) + 1, while for any atomic subformula $$x=y$$ of $$\phi$$, we have f(x) = f(y).

Finite axiomatization
NF can be finitely axiomatized. One advantage of such a finite axiomatization is that it eliminates the notion of stratification. The axioms in a finite axiomatization correspond to natural basic constructions, whereas stratified comprehension is powerful but not necessarily intuitive. In his introductory book, Holmes opted to take the finite axiomatization as basic, and prove stratified comprehension as a theorem. The precise set of axioms can vary, but includes most of the following, with the others provable as theorems:


 * Extensionality: If $$A$$ and $$B$$ are sets, and for each object $$x$$, $$x$$ is an element of $$A$$ if and only if $$x$$ is an element of $$B$$, then $$A = B$$. This can also be viewed as defining the equality symbol.
 * Singleton: For every object $$x$$, the set $$\iota(x) = \{x\} = \{y | y = x\}$$ exists, and is called the singleton of $$x$$.
 * Cartesian Product: For any sets $$A$$, $$B$$, the set $$A \times B = \{(a, b) | a \in A \text{ and } b \in B\}$$, called the Cartesian product of $$A$$ and $$B$$, exists. This can be restricted to the existence of one of the cross products $$A \times V$$ or $$V \times B$$.
 * Converse: For each relation $$R$$, the set $$R^{-1} = \{(x, y) | (y,x) \in R\}$$ exists; observe that $$x R^{-1} y$$ exactly if $$y R x$$.
 * Singleton Image: For any relation $$R$$, the set $$R\iota = \{(\{x\}, \{y\}) | (x,y) \in R\}$$, called the singleton image of $$R$$, exists.
 * Domain: If $$R$$ is a relation, the set $$\text{dom}(R) = \{x | \exists y. (x,y) \in R\}$$, called the domain of $$R$$, exists. This can be defined using the operation of type lowering.
 * Inclusion: The set $$[\subseteq] = \{(x, y) | x \subseteq y\}$$ exists. Equivalently, we may consider the set $$[\in] = [\subseteq] \cap (1 \times V) = \{(\{x\}, y) | x \in y\}$$.


 * Complement: For each set $$A$$, the set $$A^c = \{x | x \notin A\}$$, called the complement of $$A$$, exists.
 * (Boolean) Union: If $$A$$ and $$B$$ are sets, the set $$A \cup B = \{x | x \in A \text{ or } x \in B \text{ or both}\}$$, called the (Boolean) union of $$A$$ and $$B$$, exists.
 * Universal Set: $$V = \{x | x = x\}$$ exists. It is straightforward that for any set $$x$$, $$x \cup x^c = V$$.
 * Ordered Pair: For each $$a$$, $$b$$, the ordered pair of $$a$$ and $$b$$, $$(a, b)$$, exists; $$(a, b) = (c, d)$$ exactly if $$a = c$$ and $$b = d$$. This and larger tuples can be a definition rather than an axiom if an ordered pair construction is used.
 * Projections: The sets $$\pi_1 = \{((x, y), x) | x, y \in V \}$$ and $$\pi_2 = \{((x, y), y) | x, y \in V \}$$ exist (these are the relations which an ordered pair has to its first and second terms, which are technically referred to as its projections).
 * Diagonal: The set $$[=] = \{(x, x) | x \in V \}$$ exists, called the equality relation.
 * Set Union: If $$A$$ is a set all of whose elements are sets, the set $$\bigcup [A] = \{x | \text{for some } B, x \in B \text{ and } B \in A\}$$, called the (set) union of $$A$$, exists.
 * Relative Product: If $$R$$, $$S$$ are relations, the set $$(R|S) = \{(x, y) | \text{for some } z, x R z \text{ and } z S y\}$$, called the relative product of $$R$$ and $$S$$, exists.


 * Anti-intersection: $$x|y = \{z : \neg(z \in x \land z \in y) \}$$ exists. This is equivalent to complement and union together, with $$x^c = x|x$$ and $$x \cup y =x^c|y^c$$.
 * Cardinal one: The set $$1$$ of all singletons, $$\{ x | \exists y : (\forall w : w \in x \leftrightarrow w = y) \}$$, exists.
 * Tuple Insertions: For a relation $$R$$, the sets $$I_2(R) = \{ (z, w, t) : (z, t) \in R \}$$ and $$I_3(R) = \{ (z, w, t) : (z, w) \in R \}$$ exist.
 * Type lowering: For any set $$S$$, the set $$\text{TL}(S) = \{ z : \forall w : (w, \{z\}) \in S \}$$ exists.

Typed Set Theory
New Foundations is closely related to Russellian unramified typed set theory (TST), a streamlined version of the theory of types of Principia Mathematica with a linear hierarchy of types. In this many-sorted theory, each variable and set is assigned a type. It is customary to write the type indices as superscripts: $$x^n$$ denotes a variable of type n. Type 0 consists of individuals otherwise undescribed. For each (meta-) natural number n, type n+1 objects are sets of type n objects; objects connected by identity have equal types and sets of type n have members of type n-1. The axioms of TST are extensionality, on sets of the same (positive) type, and comprehension, namely that if $$\phi(x^n)$$ is a formula, then the set $$\{x^n \mid \phi(x^n)\}^{n+1}\!$$ exists. In other words, given any formula $$\phi(x^n)\!$$, the formula $$\exists A^{n+1} \forall x^n [ x^n \in A^{n+1} \leftrightarrow \phi(x^n) ]$$ is an axiom where $$A^{n+1}\!$$ represents the set $$\{x^n \mid \phi(x^n)\}^{n+1}\!$$ and is not free in $$\phi(x^n)$$. This type theory is much less complicated than the one first set out in the Principia Mathematica, which included types for relations whose arguments were not necessarily all of the same types.

There is a correspondence between New Foundations and TST in terms of adding or erasing type annotations. In NF's comprehension schema, a formula is stratified exactly when the formula can be assigned types according to the rules of TST. This can be extended to map every NF formula to a set of corresponding TST formulas with various type index annotations. The mapping is one-to-many because TST has many similar formulas. For example, raising every type index in a TST formula by 1 results in a new, valid TST formula.

Tangled Type Theory
Tangled Type Theory (TTT) is an extension of TST where each variable is typed by an ordinal rather than a natural number. The well-formed atomic formulas are $$x^{n} = y^{n}$$ and $$x^{m} \in y^{n}$$ where $$m<n$$. The axioms of TTT are those of TST where each variable of type $$i$$ is mapped to a variable $$s(i)$$ where $$s$$ is an increasing function.

TTT is considered a "weird" theory because each type is related to each lower type in the same way. For example, type 2 sets have both type 1 members and type 0 members, and extensionality axioms assert that a type 2 set is determined uniquely by either its type 1 members or its type 0 members. Whereas TST has natural models where each type $$i + 1$$ is the power set of type $$i$$, in TTT each type is being interpreted as the power set of each lower type simultaneously. Regardless, a model of NF can be easily converted to a model of TTT, because in NF all the types are already one and the same. Conversely, with a more complicated argument, it can also be shown that the consistency of TTT implies the consistency of NF.

NFU and other variants
NF with urelements (NFU) is an important variant of NF due to Jensen and clarified by Holmes. Urelements are objects that are not sets and do not contain any elements, but can be contained in sets. One of the simplest forms of axiomatization of NFU regards urelements as multiple, unequal empty sets, thus weakening the extensionality axiom of NF to:
 * Weak extensionality: Two non-empty objects with the same elements are the same object; formally,
 * $$\forall x y w. (w \in x) \to (x = y \leftrightarrow (\forall z. z \in x \leftrightarrow z \in y)))$$

In this axiomatization, the comprehension schema is unchanged, although the set $$\{x \mid \phi(x)\}$$ will not be unique if it is empty (i.e. if $$\phi(x)$$ is unsatisfiable).

However, for ease of use, it is more convenient to have a unique, "canonical" empty set. This can be done by introducing a sethood predicate $$\mathrm{set}(x)$$ to distinguish sets from atoms. The axioms are then:
 * Sets: Only sets have members, i.e. $$\forall x y. x \in y \to \mathrm{set}(y).$$
 * Extensionality: Two sets with the same elements are the same set, i.e. $$\forall y z. (\mathrm{set}(y) \wedge \mathrm{set}(z) \wedge (\forall x. x \in y \leftrightarrow x \in z)) \to y = z.$$
 * Comprehension: The set $$\{x \mid \phi(x) \}$$ exists for each stratified formula $$\phi(x)$$, i.e. $$\exists A. \mathrm{set}(A) \wedge (\forall x. x \in A \leftrightarrow \phi(x)).$$

NF3 is the fragment of NF with full extensionality (no urelements) and those instances of comprehension which can be stratified using at most three types. NF4 is the same theory as NF.

Mathematical Logic (ML) is an extension of NF that includes proper classes as well as sets. ML was proposed by Quine and revised by Hao Wang, who proved that NF and the revised ML are equiconsistent.

Constructions
This section discusses some problematic constructions in NF. For a further development of mathematics in NFU, with a comparison to the development of the same in ZFC, see implementation of mathematics in set theory.

Ordered pairs
Relations and functions are defined in TST (and in NF and NFU) as sets of ordered pairs in the usual way. For purposes of stratification, it is desirable that a relation or function is merely one type higher than the type of the members of its field. This requires defining the ordered pair so that its type is the same as that of its arguments (resulting in a type-level ordered pair). The usual definition of the ordered pair, namely $$(a, \ b)_K \; := \ \{ \{ a \}, \ \{ a, \ b \} \}$$, results in a type two higher than the type of its arguments a and b. Hence for purposes of determining stratification, a function is three types higher than the members of its field. NF and related theories usually employ Quine's set-theoretic definition of the ordered pair, which yields a type-level ordered pair. However, Quine's definition relies on set operations on each of the elements a and b, and therefore does not directly work in NFU.

As an alternative approach, Holmes takes the ordered pair (a, b) as a primitive notion, as well as its left and right projections $$\pi_1$$ and $$\pi_2$$, i.e., functions such that $$\pi_1((a, b)) = a$$ and $$\pi_2((a, b)) = b$$ (in Holmes' axiomatization of NFU, the comprehension schema that asserts the existence of $$\{x \mid \phi \}$$ for any stratified formula $$\phi$$ is considered a theorem and only proved later, so expressions like $$\pi_1 = \{((a, b), a) \mid a, b \in V\}$$ are not considered proper definitions). Fortunately, whether the ordered pair is type-level by definition or by assumption (i.e., taken as primitive) usually does not matter.

Natural numbers and the axiom of infinity
The usual form of the axiom of infinity is based on the von Neumann construction of the natural numbers, which is not suitable for NF, since the description of the successor operation (and many other aspects of von Neumann numerals) is necessarily unstratified. The usual form of natural numbers used in NF follows Frege's definition, i.e., the natural number n is represented by the set of all sets with n elements. Under this definition, 0 is easily defined as $$\{\varnothing\}$$, and the successor operation can be defined in a stratified way: $$S(A) = \{a \cup \{x\} \mid a \in A \wedge x \notin a\}.$$ Under this definition, one can write down a statement analogous to the usual form of the axiom of infinity. However, that statement would be trivially true, since the universal set $$V$$ would be an inductive set.

Since inductive sets always exist, the set of natural numbers $$\mathbf{N}$$ can be defined as the intersection of all inductive sets. This definition enables mathematical induction for stratified statements $$P(n)$$, because the set $$\{n \in \mathbf{N} \mid P(n)\}$$ can be constructed, and when $$P(n)$$ satisfies the conditions for mathematical induction, this set is an inductive set.

Finite sets can then be defined as sets that belong to a natural number. However, it is not trivial to prove that $$V$$ is not a "finite set", i.e., that the size of the universe $$|V|$$ is not a natural number. Suppose that $$|V| = n \in \mathbf{N}$$. Then $$n = \{V\}$$ (it can be shown inductively that a finite set is not equinumerous with any of its proper subsets), $$n + 1 = S(n) = \varnothing$$, and each subsequent natural number would be $$\varnothing$$ too, causing arithmetic to break down. To prevent this, one can introduce the axiom of infinity for NF: $$\varnothing \notin \mathbf{N}.$$

It may intuitively seem that one should be able to prove Infinity in NF(U) by constructing any "externally" infinite sequence of sets, such as $$\varnothing, \{\varnothing\}, \{\{{\varnothing}\}\}, \ldots$$. However, such a sequence could only be constructed through unstratified constructions (evidenced by the fact that TST itself has finite models), so such a proof could not be carried out in NF(U). In fact, Infinity is logically independent of NFU: There exists models of NFU where $$|V|$$ is a non-standard natural number. In such models, mathematical induction can prove statements about $$|V|$$, making it impossible to "distinguish" $$|V|$$ from standard natural numbers.

However, there are some cases where Infinity can be proven (in which cases it may be referred to as the theorem of infinity):


 * In NF (without urelements), Specker has shown that the axiom of choice is false. Since it can be proved through induction that every finite set has a choice function (a stratified condition), it follows that $$V$$ is infinite.
 * In NFU with axioms asserting the existence of a type-level ordered pair, $$V$$ is equinumerous with its proper subset $$V \times \{0\}$$, which implies Infinity. Conversely, NFU + Infinity + Choice proves the existence of a type-level ordered pair. NFU + Infinity interprets NFU + "there is a type-level ordered pair" (they are not quite the same theory, but the differences are inessential).

Stronger axioms of infinity exist, such as that the set of natural numbers is a strongly Cantorian set, or NFUM = NFU + Infinity + Large Ordinals + Small Ordinals which is equivalent to Morse–Kelley set theory plus a predicate on proper classes which is a κ-complete nonprincipal ultrafilter on the proper class ordinal κ.

Large sets
NF (and NFU + Infinity + Choice, described below and known consistent) allow the construction of two kinds of sets that ZFC and its proper extensions disallow because they are "too large" (some set theories admit these entities under the heading of proper classes):
 * The universal set V. Because $$x=x$$ is a stratified formula, the universal set V = {x | x=x} exists by Comprehension. An immediate consequence is that all sets have complements, and the entire set-theoretic universe under NF has a Boolean structure.
 * Cardinal and ordinal numbers. In NF (and TST), the set of all sets having n elements (the circularity here is only apparent) exists. Hence Frege's definition of the cardinal numbers works in NF and NFU: a cardinal number is an equivalence class of sets under the relation of equinumerosity: the sets A and B are equinumerous if there exists a bijection between them, in which case we write $$A \sim B$$. Likewise, an ordinal number is an equivalence class of well-ordered sets.

Cartesian closure
The category whose objects are the sets of NF and whose arrows are the functions between those sets is not Cartesian closed; Since NF lacks Cartesian closure, not every function curries as one might intuitively expect, and NF is not a topos.

Resolution of set-theoretic paradoxes
NF may seem to run afoul of problems similar to those in naive set theory, but this is not the case. For example, the existence of the impossible Russell class $$\{x \mid x \not\in x\}$$ is not an axiom of NF, because $$ x \not\in x $$ cannot be stratified. NF steers clear of the three well-known paradoxes of set theory in drastically different ways than how those paradoxes are resolved in well-founded set theories such as ZFC. Many useful concepts that are unique to NF and its variants can be developed from the resolution of those paradoxes.

Russell's paradox
The resolution of Russell's paradox is trivial: $$x \not\in x$$ is not a stratified formula, so the existence of $$\{x \mid x \not\in x\}$$ is not asserted by any instance of Comprehension. Quine said that he constructed NF with this paradox uppermost in mind.

Cantor's paradox and Cantorian sets
Cantor's paradox boils down to the question of whether there exists a largest cardinal number, or equivalently, whether there exists a set with the largest cardinality. In NF, the universal set $$V$$ is obviously a set with the largest cardinality. However, Cantor's theorem says (given ZFC) that the power set $$P(A)$$ of any set $$A$$ is larger than $$A$$ (there can be no injection (one-to-one map) from $$P(A)$$ into $$A$$), which seems to imply a contradiction when $$A = V$$.

Of course there is an injection from $$P(V)$$ into $$V$$ since $$V$$ is the universal set, so it must be that Cantor's theorem (in its original form) does not hold in NF. Indeed, the proof of Cantor's theorem uses the diagonalization argument by considering the set $$B = \{x \in A \mid x \notin f(x)\}$$. In NF, $$x$$ and $$f(x)$$ should be assigned the same type, so the definition of $$B$$ is not stratified. Indeed, if $$f: P(V) \to V$$ is the trivial injection $$x \mapsto x$$, then $$B$$ is the same (ill-defined) set in Russell's paradox.

This failure is not surprising since $$|A| < |P(A)|$$ makes no sense in TST: the type of $$P(A)$$ is one higher than the type of $$A$$. In NF, $$|A| < |P(A)|$$ is a syntactical sentence due to the conflation of all the types, but any general proof involving Comprehension is unlikely to work.

The usual way to correct such a type problem is to replace $$A$$ with $$P_1(A)$$, the set of one-element subsets of $$A$$. Indeed, the correctly typed version of Cantor's theorem $$|P_1(A)| < |P(A)|$$ is a theorem in TST (thanks to the diagonalization argument), and thus also a theorem in NF. In particular, $$|P_1(V)| < |P(V)|$$: there are fewer one-element sets than sets (and so fewer one-element sets than general objects, if we are in NFU). The "obvious" bijection $$x \mapsto \{x\}$$ from the universe to the one-element sets is not a set; it is not a set because its definition is unstratified. Note that in all models of NFU + Choice it is the case that $$|P_1(V)| < |P(V)| \ll |V|$$; Choice allows one not only to prove that there are urelements but that there are many cardinals between $$|P(V)|$$ and $$|V|$$.

However, unlike in TST, $$|A| = |P_1(A)|$$ is a syntactical sentence in NF(U), and as shown above one can talk about its truth value for specific values of $$A$$ (e.g. when $$A = V$$ it is false). A set $$A$$ which satisfies the intuitively appealing $$|A| = |P_1(A)|$$ is said to be Cantorian: a Cantorian set satisfies the usual form of Cantor's theorem. A set $$A$$ which satisfies the further condition that $$(x \mapsto \{x\})\lceil A$$, the restriction of the singleton map to A, is a set is not only Cantorian set but strongly Cantorian.

Burali-Forti paradox and the T operation
The Burali-Forti paradox of the largest ordinal number is resolved in the opposite way: In NF, having access to the set of ordinals does not allow one to construct a "largest ordinal number". One can construct the ordinal $$\Omega$$ that corresponds to the natural well-ordering of all ordinals, but that does not mean that $$\Omega$$ is larger than all those ordinals.

To formalize the Burali-Forti paradox in NF, it is necessary to first formalize the concept of ordinal numbers. In NF, ordinals are defined (in the same way as in naive set theory) as equivalence classes of well-orderings under isomorphism. This is a stratified definition, so the set of ordinals $$\mathrm{Ord}$$ can be defined with no problem. Transfinite induction works on stratified statements, which allows one to prove that the natural ordering of ordinals ($$\alpha \le \beta$$ iff there exists well-orderings $$R \in \alpha, S \in \beta$$ such that $$S$$ is a continuation of $$R$$) is a well-ordering of $$\mathrm{Ord}$$. By definition of ordinals, this well-ordering also belongs to an ordinal $$\Omega \in \mathrm{Ord}$$. In naive set theory, one would go on to prove by transfinite induction that each ordinal $$\alpha$$ is the order type of the natural order on the ordinals less than $$\alpha$$, which would imply an contradiction since $$\Omega$$ by definition is the order type of all ordinals, not any proper initial segment of them.

However, the statement "$$\alpha$$ is the order type of the natural order on the ordinals less than $$\alpha$$" is not stratified, so the transfinite induction argument does not work in NF. In fact, "the order type $$\beta$$ of the natural order $$R_\alpha$$ on the ordinals less than $$\alpha$$" is at least two types higher than $$\alpha$$: The order relation $$R_\alpha = \{(x, y) \mid x \le y < \alpha\}$$ is one type higher than $$\alpha$$ assuming that $$(x, y)$$ is a type-level ordered pair, and the order type (equivalence class) $$\beta = \{S \mid S \sim R_\alpha\}$$ is one type higher than $$R_\alpha$$. If $$(x, y)$$ is the usual Kuratowski ordered pair (two types higher than $$x$$ and $$y$$), then $$\beta$$ would be four types higher than $$\alpha$$.

To correct such a type problem, one needs the T operation, $$T(\alpha)$$, that "raises the type" of an ordinal $$\alpha$$, just like how $$P_1(A)$$ "raises the type" of the set $$A$$. The T operation is defined as follows: If $$W \in \alpha$$, then $$T(\alpha)$$ is the order type of the order $$W^{\iota} = \{(\{x\},\{y\}) \mid xWy\}$$. Now the lemma on order types may be restated in a stratified manner:
 * The order type of the natural order on the ordinals $$ < \alpha$$ is $$T^2(\alpha)$$ or $$T^4(\alpha)$$, depending on which ordered pair is used.

Both versions of this statement can be proven by transfinite induction; we assume the type level pair hereinafter. This means that $$T^2(\alpha)$$ is always less than $$\Omega$$, the order type of all ordinals. In particular, $$T^2(\Omega)<\Omega$$.

Another (stratified) statement that can be proven by transfinite induction is that T is a strictly monotone (order-preserving) operation on the ordinals, i.e., $$T(\alpha) < T(\beta)$$ iff $$\alpha < \beta$$. Hence the T operation is not a function: The collection of ordinals $$\{\alpha \mid T(\alpha) < \alpha\}$$ cannot have a least member, and thus cannot be a set. More concretely, the monotonicity of T implies $$\Omega > T^2(\Omega) > T^4(\Omega)\ldots$$, a "descending sequence" in the ordinals which also cannot be a set.

One might assert that this result shows that no model of NF(U) is "standard", since the ordinals in any model of NFU are externally not well-ordered. This is a philosophical question, not a question of what can be proved within the formal theory. Note that even within NFU it can be proven that any set model of NFU has non-well-ordered "ordinals"; NFU does not conclude that the universe $$V$$ is a model of NFU, despite $$V$$ being a set, because the membership relation is not a set relation.

Consistency
Some mathematicians have questioned the consistency of NF, partly because it is not clear why it avoids the known paradoxes. A key issue was that Specker proved NF combined with the Axiom of Choice is inconsistent. The proof is complex and involves T-operations. However, since 2010, Holmes has claimed to have shown that NF is consistent relative to the consistency of standard set theory (ZFC). In 2024, Sky Wilshaw confirmed Holmes' proof using the Lean proof assistant.

Although NFU resolves the paradoxes similarly to NF, it has a much simpler consistency proof. The proof can be formalized within Peano Arithmetic (PA), a theory weaker than ZF that most mathematicians accept without question. This does not conflict with Gödel's second incompleteness theorem because NFU does not include the Axiom of Infinity and therefore PA cannot be modeled in NFU, avoiding a contradiction. PA also proves that NFU with Infinity and NFU with both Infinity and Choice are equiconsistent with TST with Infinity and TST with both Infinity and Choice, respectively. Therefore, a stronger theory like ZFC, which proves the consistency of TST, will also prove the consistency of NFU with these additions. In simpler terms, NFU is generally seen as weaker than NF because, in NFU, the collection of all sets (the power set of the universe) can be smaller than the universe itself, especially when urelements are included, as required by NFU with Choice.

Models of NFU
Jensen's proof gives a fairly simple method for producing models of NFU in bulk. Using well-known techniques of model theory, one can construct a nonstandard model of Zermelo set theory (nothing nearly as strong as full ZFC is needed for the basic technique) on which there is an external automorphism j (not a set of the model) which moves a rank $$V_{\alpha}$$ of the cumulative hierarchy of sets. We may suppose without loss of generality that $$j(\alpha)<\alpha$$.

The domain of the model of NFU will be the nonstandard rank $$V_{\alpha}$$. The basic idea is that the automorphism j codes the "power set" $$V_{\alpha+1}$$ of our "universe" $$V_{\alpha}$$ into its externally isomorphic copy $$V_{j(\alpha)+1}$$ inside our "universe." The remaining objects not coding subsets of the universe are treated as urelements. Formally, the membership relation of the model of NFU will be $$x \in_{NFU} y \equiv_{def} j(x) \in y \wedge y \in V_{j(\alpha)+1}.$$

It may now be proved that this actually is a model of NFU. Let $$\phi$$ be a stratified formula in the language of NFU. Choose an assignment of types to all variables in the formula which witnesses the fact that it is stratified. Choose a natural number N greater than all types assigned to variables by this stratification. Expand the formula $$\phi$$ into a formula $$\phi_1$$ in the language of the nonstandard model of Zermelo set theory with automorphism j using the definition of membership in the model of NFU. Application of any power of j to both sides of an equation or membership statement preserves its truth value because j is an automorphism. Make such an application to each atomic formula in $$\phi_1$$ in such a way that each variable x assigned type i occurs with exactly $$N-i$$ applications of j. This is possible thanks to the form of the atomic membership statements derived from NFU membership statements, and to the formula being stratified. Each quantified sentence $$(\forall x \in V_{\alpha}.\psi(j^{N-i}(x)))$$ can be converted to the form $$(\forall x \in j^{N-i}(V_{\alpha}).\psi(x))$$ (and similarly for existential quantifiers). Carry out this transformation everywhere and obtain a formula $$\phi_2$$ in which j is never applied to a bound variable. Choose any free variable y in $$\phi$$ assigned type i. Apply $$j^{i-N}$$ uniformly to the entire formula to obtain a formula $$\phi_3$$ in which y appears without any application of j. Now $$\{y \in V_{\alpha} \mid \phi_3\}$$ exists (because j appears applied only to free variables and constants), belongs to $$V_{\alpha+1}$$, and contains exactly those y which satisfy the original formula $$\phi$$ in the model of NFU. $$j(\{y \in V_{\alpha} \mid \phi_3\})$$ has this extension in the model of NFU (the application of j corrects for the different definition of membership in the model of NFU). This establishes that Stratified Comprehension holds in the model of NFU.

To see that weak Extensionality holds is straightforward: each nonempty element of $$V_{j(\alpha)+1}$$ inherits a unique extension from the nonstandard model, the empty set inherits its usual extension as well, and all other objects are urelements.

If $$\alpha$$ is a natural number n, one gets a model of NFU which claims that the universe is finite (it is externally infinite, of course). If $$\alpha$$ is infinite and the Choice holds in the nonstandard model of ZFC, one obtains a model of NFU + Infinity + Choice.

Self-sufficiency of mathematical foundations in NFU
For philosophical reasons, it is important to note that it is not necessary to work in ZFC or any related system to carry out this proof. A common argument against the use of NFU as a foundation for mathematics is that the reasons for relying on it have to do with the intuition that ZFC is correct. It is sufficient to accept TST (in fact TSTU). In outline: take the type theory TSTU (allowing urelements in each positive type) as a metatheory and consider the theory of set models of TSTU in TSTU (these models will be sequences of sets $$T_i$$ (all of the same type in the metatheory) with embeddings of each $$P(T_i)$$ into $$P_1(T_{i+1})$$ coding embeddings of the power set of $$T_i$$ into $$T_{i+1}$$ in a type-respecting manner). Given an embedding of $$T_0$$ into $$T_1$$ (identifying elements of the base "type" with subsets of the base type), embeddings may be defined from each "type" into its successor in a natural way. This can be generalized to transfinite sequences $$T_{\alpha}$$ with care.

Note that the construction of such sequences of sets is limited by the size of the type in which they are being constructed; this prevents TSTU from proving its own consistency (TSTU + Infinity can prove the consistency of TSTU; to prove the consistency of TSTU+Infinity one needs a type containing a set of cardinality $$\beth_{\omega}$$, which cannot be proved to exist in TSTU+Infinity without stronger assumptions). Now the same results of model theory can be used to build a model of NFU and verify that it is a model of NFU in much the same way, with the $$T_{\alpha}$$'s being used in place of $$V_{\alpha}$$ in the usual construction. The final move is to observe that since NFU is consistent, we can drop the use of absolute types in our metatheory, bootstrapping the metatheory from TSTU to NFU.

Facts about the automorphism j
The automorphism j of a model of this kind is closely related to certain natural operations in NFU. For example, if W is a well-ordering in the nonstandard model (we suppose here that we use Kuratowski pairs so that the coding of functions in the two theories will agree to some extent) which is also a well-ordering in NFU (all well-orderings of NFU are well-orderings in the nonstandard model of Zermelo set theory, but not vice versa, due to the formation of urelements in the construction of the model), and W has type α in NFU, then j(W) will be a well-ordering of type T(α) in NFU.

In fact, j is coded by a function in the model of NFU. The function in the nonstandard model which sends the singleton of any element of $$V_{j(\alpha)}$$ to its sole element, becomes in NFU a function which sends each singleton {x}, where x is any object in the universe, to j(x). Call this function Endo and let it have the following properties: Endo is an injection from the set of singletons into the set of sets, with the property that Endo( {x} ) = {Endo( {y} ) | y∈x} for each set x. This function can define a type level "membership" relation on the universe, one reproducing the membership relation of the original nonstandard model.

History
In 1914, Norbert Wiener showed how to code the ordered pair as a set of sets, making it possible to eliminate the relation types of Principia Mathematica in favor of the linear hierarchy of sets in TST. The usual definition of the ordered pair was first proposed by Kuratowski in 1921. Willard Van Orman Quine first proposed NF as a way to avoid the "disagreeable consequences" of TST in a 1937 article titled New Foundations for Mathematical Logic; hence the name. Quine extended the theory in his book Mathematical Logic, whose first edition was published in 1940. In the book, Quine introduced the system "Mathematical Logic" or "ML", an extension of NF that included proper classes as well as sets. The first edition's set theory married NF to the proper classes of NBG set theory and included an axiom schema of unrestricted comprehension for proper classes. However, J. Barkley Rosser proved that the system was subject to the Burali-Forti paradox. Hao Wang showed how to amend Quine's axioms for ML so as to avoid this problem. Quine included the resulting axiomatization in the second and final edition, published in 1951.

In 1944, Theodore Hailperin showed that Comprehension is equivalent to a finite conjunction of its instances, In 1953, Ernst Specker showed that the axiom of choice is false in NF (without urelements). In 1969, Jensen showed that adding urelements to NF yields a theory (NFU) that is provably consistent. That same year, Grishin proved NF3 consistent. Specker additionally showed that NF is equiconsistent with TST plus the axiom scheme of "typical ambiguity". NF is also equiconsistent with TST augmented with a "type shifting automorphism", an operation (external to the theory) which raises type by one, mapping each type onto the next higher type, and preserves equality and membership relations.

In 1983, Marcel Crabbé proved consistent a system he called NFI, whose axioms are unrestricted extensionality and those instances of comprehension in which no variable is assigned a type higher than that of the set asserted to exist. This is a predicativity restriction, though NFI is not a predicative theory: it admits enough impredicativity to define the set of natural numbers (defined as the intersection of all inductive sets; note that the inductive sets quantified over are of the same type as the set of natural numbers being defined). Crabbé also discussed a subtheory of NFI, in which only parameters (free variables) are allowed to have the type of the set asserted to exist by an instance of comprehension. He called the result "predicative NF" (NFP); it is, of course, doubtful whether any theory with a self-membered universe is truly predicative. Holmes has shown that NFP has the same consistency strength as the predicative theory of types of Principia Mathematica without the axiom of reducibility.

The Metamath database implemented Hailperin's finite axiomatization for New Foundations. Since 2015, several candidate proofs by Randall Holmes of the consistency of NF relative to ZF were available both on arXiv and on the logician's home page. His proofs were based on demonstrating the equiconsistency of a "weird" variant of TST, "tangled type theory with λ-types" (TTTλ), with NF, and then showing that TTTλ is consistent relative to ZF with atoms but without choice (ZFA) by constructing a class model of ZFA which includes "tangled webs of cardinals" in ZF with atoms and choice (ZFA+C). These proofs were "difficult to read, insanely involved, and involve the sort of elaborate bookkeeping which makes it easy to introduce errors". In 2024, Sky Wilshaw formalized a version of Holmes' proof using the proof assistant Lean, finally resolving the question of NF's consistency. Timothy Chow characterized Wilshaw's work as showing that the reluctance of peer reviewers to engage with a difficult to understand proof can be addressed with the help of proof assistants.