Axiom of limitation of size



In set theory, the axiom of limitation of size was proposed by John von Neumann in his 1925 axiom system for sets and classes. It formalizes the limitation of size principle, which avoids the paradoxes encountered in earlier formulations of set theory by recognizing that some classes are too big to be sets. Von Neumann realized that the paradoxes are caused by permitting these big classes to be members of a class. A class that is a member of a class is a set; a class that is not a set is a proper class. Every class is a subclass of V, the class of all sets. The axiom of limitation of size says that a class is a set if and only if it is smaller than V—that is, there is no function mapping it onto V. Usually, this axiom is stated in the equivalent form: A class is a proper class if and only if there is a function that maps it onto V.

Von Neumann's axiom implies the axioms of replacement, separation, union, and global choice. It is equivalent to the combination of replacement, union, and global choice in Von Neumann–Bernays–Gödel set theory (NBG) and Morse–Kelley set theory. Later expositions of class theories—such as those of Paul Bernays, Kurt Gödel, and John L. Kelley—use replacement, union, and a choice axiom equivalent to global choice rather than von Neumann's axiom. In 1930, Ernst Zermelo defined models of set theory satisfying the axiom of limitation of size.

Abraham Fraenkel and Azriel Lévy have stated that the axiom of limitation of size does not capture all of the "limitation of size doctrine" because it does not imply the power set axiom. Michael Hallett has argued that the limitation of size doctrine does not justify the power set axiom and that "von Neumann's explicit assumption [of the smallness of power-sets] seems preferable to Zermelo's, Fraenkel's, and Lévy's obscurely hidden implicit assumption of the smallness of power-sets."

Formal statement
The usual version of the axiom of limitation of size—a class is a proper class if and only if there is a function that maps it onto V—is expressed in the formal language of set theory as:
 * $$\begin{align}

\forall C \Bigl[ \lnot \exist D \left(C \in D\right) \iff \exist F \bigl[&\,\forall y \bigl(\exist D(y \in D) \implies \exist x [\,x \in C \land (x, y) \in F\,]\bigr) \\ &\, \land \, \forall x \forall y \forall z \bigl(\,[\,(x, y) \in F \land (x, z) \in F\,] \implies y = z\bigr)\,\bigr]\,\Bigr] \end{align}$$

Gödel introduced the convention that uppercase variables range over all the classes, while lowercase variables range over all the sets. This convention allows us to write: indent=1
 * $$\exist y\, \varphi(y)$$ instead of $$\exist y \bigl(\exist D (y \in D) \land \varphi(y)\bigr)$$
 * $$\forall y\, \varphi(y)$$ instead of $$\forall y \bigl(\exist D (y \in D) \implies \varphi(y)\bigr)$$

With Gödel's convention, the axiom of limitation of size can be written:
 * $$\begin{align}

\forall C \Bigl[ \lnot \exist D \left( C \in D\right) \iff \exist F \bigl[&\,\forall y \exist x \bigl( x \in C \land (x, y) \in F \bigr) \\ &\, \land \, \forall x \forall y \forall z \bigl(\,[\,(x, y) \in F \land (x, z) \in F\,] \implies y = z\bigr)\,\bigr]\,\Bigr] \end{align}$$

Implications of the axiom
Von Neumann proved that the axiom of limitation of size implies the axiom of replacement, which can be expressed as: If F is a function and A is a set, then F(A) is a set. This is proved by contradiction. Let F be a function and A be a set. Assume that F(A) is a proper class. Then there is a function G that maps F(A) onto V. Since the composite function G ∘ F maps A onto V, the axiom of limitation of size implies that A is a proper class, which contradicts A being a set. Therefore, F(A) is a set. Since the axiom of replacement implies the axiom of separation, the axiom of limitation of size implies the axiom of separation.

Von Neumann also proved that his axiom implies that V can be well-ordered. The proof starts by proving by contradiction that Ord, the class of all ordinals, is a proper class. Assume that Ord is a set. Since it is a transitive set that is strictly well-ordered by ∈, it is an ordinal. So Ord ∈ Ord, which contradicts Ord being strictly well-ordered by ∈. Therefore, Ord is a proper class. So von Neumann's axiom implies that there is a function F that maps Ord onto V. To define a well-ordering of V, let G be the subclass of F consisting of the ordered pairs (α, x) where α is the least β such that (β, x) ∈ F; that is, G = {(α, x) ∈ F: ∀β((β, x) ∈ F ⇒ α ≤ β)}. The function G is a one-to-one correspondence between a subset of Ord and V. Therefore, x < y if G−1(x) < G−1(y) defines a well-ordering of V. This well-ordering defines a global choice function: Let Inf(x) be the least element of a non-empty set x. Since Inf(x) ∈ x, this function chooses an element of x for every non-empty set x. Therefore, Inf(x) is a global choice function, so Von Neumann's axiom implies the axiom of global choice.

In 1968, Azriel Lévy proved that von Neumann's axiom implies the axiom of union. First, he proved without using the axiom of union that every set of ordinals has an upper bound. Then he used a function that maps Ord onto V to prove that if A is a set, then ∪A is a set.

The axioms of replacement, global choice, and union (with the other axioms of NBG) imply the axiom of limitation of size. Therefore, this axiom is equivalent to the combination of replacement, global choice, and union in NBG or Morse–Kelley set theory. These set theories only substituted the axiom of replacement and a form of the axiom of choice for the axiom of limitation of size because von Neumann's axiom system contains the axiom of union. Lévy's proof that this axiom is redundant came many years later.

The axioms of NBG with the axiom of global choice replaced by the usual axiom of choice do not imply the axiom of limitation of size. In 1964, William B. Easton used forcing to build a model of NBG with global choice replaced by the axiom of choice. In Easton's model, V cannot be linearly ordered, so it cannot be well-ordered. Therefore, the axiom of limitation of size fails in this model. Ord is an example of a proper class that cannot be mapped onto V because (as proved above) if there is a function mapping Ord onto V, then V can be well-ordered.

The axioms of NBG with the axiom of replacement replaced by the weaker axiom of separation do not imply the axiom of limitation of size. Define $$\omega_\alpha$$ as the $$\alpha$$-th infinite initial ordinal, which is also the cardinal $$\aleph_\alpha$$; numbering starts at $$0$$, so $$\omega_0 = \omega.$$ In 1939, Gödel pointed out that Lω ω, a subset of the constructible universe, is a model of ZFC with replacement replaced by separation. To expand it into a model of NBG with replacement replaced by separation, let its classes be the sets of Lω ω+1, which are the constructible subsets of Lω ω. This model satisfies NBG's class existence axioms because restricting the set variables of these axioms to Lω ω produces instances of the axiom of separation, which holds in L. It satisfies the axiom of global choice because there is a function belonging to Lω ω+1 that maps ωω onto Lω ω, which implies that Lω ω is well-ordered. The axiom of limitation of size fails because the proper class {ωn : n ∈ ω} has cardinality $\aleph_0$, so it cannot be mapped onto Lω ω, which has cardinality $$\aleph_\omega$$.

In a 1923 letter to Zermelo, von Neumann stated the first version of his axiom: A class is a proper class if and only if there is a one-to-one correspondence between it and V. The axiom of limitation of size implies von Neumann's 1923 axiom. Therefore, it also implies that all proper classes are equinumerous with V.

$$

Zermelo's models and the axiom of limitation of size


In 1930, Zermelo published an article on models of set theory, in which he proved that some of his models satisfy the axiom of limitation of size. These models are built in ZFC by using the cumulative hierarchy Vα, which is defined by transfinite recursion:
 * 1) V0 = ∅.
 * 2) Vα+1 = Vα ∪ P(Vα). That is, the union of Vα and its power set.
 * 3) For limit β: Vβ = ∪α < β Vα. That is, Vβ is the union of the preceding Vα.

Zermelo worked with models of the form Vκ where κ is a cardinal. The classes of the model are the subsets of Vκ, and the model's ∈-relation is the standard ∈-relation. The sets of the model are the classes X such that X ∈ Vκ. Zermelo identified cardinals κ such that Vκ satisfies:
 * Theorem 1. A class X is a set if and only if |X| < κ.
 * Theorem 2. |Vκ| = κ.

Since every class is a subset of Vκ, Theorem 2 implies that every class X has cardinality ≤ κ. Combining this with Theorem 1 proves: every proper class has cardinality κ. Hence, every proper class can be put into one-to-one correspondence with Vκ. This correspondence is a subset of Vκ, so it is a class of the model. Therefore, the axiom of limitation of size holds for the model Vκ.

The theorem stating that Vκ has a well-ordering can be proved directly. Since κ is an ordinal of cardinality κ and |Vκ| = κ, there is a one-to-one correspondence between κ and Vκ. This correspondence produces a well-ordering of Vκ. Von Neumann's proof is indirect. It uses the Burali-Forti paradox to prove by contradiction that the class of all ordinals is a proper class. Hence, the axiom of limitation of size implies that there is a function that maps the class of all ordinals onto the class of all sets. This function produces a well-ordering of Vκ.

The model Vω
To demonstrate that Theorems 1 and 2 hold for some Vκ, we first prove that if a set belongs to Vα then it belongs to all subsequent Vβ, or equivalently: Vα ⊆ Vβ for α ≤ β. This is proved by transfinite induction on β: Sets enter the cumulative hierarchy through the power set P(Vβ) at step β+1. The following definitions will be needed:
 * 1) β = 0: V0 ⊆ V0.
 * 2) For β+1: By inductive hypothesis, Vα ⊆ Vβ. Hence, Vα ⊆ Vβ ⊆ Vβ ∪ P(Vβ) = Vβ+1.
 * 3) For limit β: If α < β, then Vα ⊆ ∪ξ < β Vξ = Vβ. If α = β, then Vα ⊆ Vβ.
 * If x is a set, rank(x) is the least ordinal β such that x ∈ Vβ+1.
 * The supremum of a set of ordinals A, denoted by sup A, is the least ordinal β such that α ≤ β for all α ∈ A.

Zermelo's smallest model is Vω. Mathematical induction proves that Vn is finite for all n < ω:
 * |V0| = 0.
 * |Vn+1| = |Vn ∪ P(Vn)| ≤ |Vn| + 2 undefined, which is finite since Vn is finite by inductive hypothesis.

Proof of Theorem 1: A set X enters Vω through P(Vn) for some n < ω, so X ⊆ Vn. Since Vn is finite, X is finite. Conversely: If a class X is finite, let N = sup {rank(x): x ∈ X}. Since rank(x) ≤ N for all x ∈ X, we have X ⊆ VN+1, so X ∈ VN+2 ⊆ Vω. Therefore, X ∈ Vω.

Proof of Theorem 2: Vω is the union of countably infinitely many finite sets of increasing size. Hence, it has cardinality $$\aleph_0$$, which equals ω by von Neumann cardinal assignment.

The sets and classes of Vω satisfy all the axioms of NBG except the axiom of infinity.

The models Vκ where κ is a strongly inaccessible cardinal
Two properties of finiteness were used to prove Theorems 1 and 2 for Vω: To find models satisfying the axiom of infinity, replace "finite" by "< κ" to produce the properties that define strongly inaccessible cardinals. A cardinal κ is strongly inaccessible if κ > ω and: These properties assert that κ cannot be reached from below. The first property says κ cannot be reached by power sets; the second says κ cannot be reached by the axiom of replacement. Just as the axiom of infinity is required to obtain ω, an axiom is needed to obtain strongly inaccessible cardinals. Zermelo postulated the existence of an unbounded sequence of strongly inaccessible cardinals.
 * 1) If λ is a finite cardinal, then 2λ is finite.
 * 2) If A is a set of ordinals such that |A| is finite, and α is finite for all α ∈ A, then sup A is finite.
 * 1) If λ is a cardinal such that λ < κ, then 2λ < κ.
 * 2) If A is a set of ordinals such that |A| < κ, and α < κ for all α ∈ A, then sup A < κ.

If κ is a strongly inaccessible cardinal, then transfinite induction proves |Vα| < κ for all α < κ:
 * 1) α = 0: |V0| = 0.
 * 2) For α+1: |Vα+1| = |Vα ∪ P(Vα)| ≤ |Vα| + 2 undefined = 2 undefined < κ. Last inequality uses inductive hypothesis and κ being strongly inaccessible.
 * 3) For limit α: |Vα| = |∪ξ < α Vξ| ≤ sup {|Vξ| : ξ < α} < κ. Last inequality uses inductive hypothesis and κ being strongly inaccessible.

Proof of Theorem 1: A set X enters Vκ through P(Vα) for some α < κ, so X ⊆ Vα. Since |Vα| < κ, we obtain |X| < κ. Conversely: If a class X has |X| < κ, let β = sup {rank(x): x ∈ X}. Because κ is strongly inaccessible, |X| < κ and rank(x) < κ for all x ∈ X imply β = sup {rank(x): x ∈ X} < κ. Since rank(x) ≤ β for all x ∈ X, we have X ⊆ Vβ+1, so X ∈ Vβ+2 ⊆ Vκ. Therefore, X ∈ Vκ.

Proof of Theorem 2: |Vκ| = |∪α < κ Vα| ≤ sup {|Vα| : α < κ}. Let β be this supremum. Since each ordinal in the supremum is less than κ, we have β ≤ κ. Assume β < κ. Then there is a cardinal λ such that β < λ < κ; for example, let λ = 2undefined. Since λ ⊆ Vλ and |Vλ| is in the supremum, we have λ ≤ |Vλ| ≤ β. This contradicts β < λ. Therefore, |Vκ| = β = κ.

The sets and classes of Vκ satisfy all the axioms of NBG.

Limitation of size doctrine
The limitation of size doctrine is a heuristic principle that is used to justify axioms of set theory. It avoids the set theoretical paradoxes by restricting the full (contradictory) comprehension axiom schema:
 * $$\forall w_1,\ldots,w_n \, \exists x \, \forall u \, ( u \in x \iff \varphi(u, w_1, \ldots, w_n) )$$

to instances "that do not give sets 'too much bigger' than the ones they use."

If "bigger" means "bigger in cardinal size," then most of the axioms can be justified: The axiom of separation produces a subset of x that is not bigger than x. The axiom of replacement produces an image set f(x) that is not bigger than x. The axiom of union produces a union whose size is not bigger than the size of the biggest set in the union times the number of sets in the union. The axiom of choice produces a choice set whose size is not bigger than the size of the given set of nonempty sets.

The limitation of size doctrine does not justify the axiom of infinity:
 * $$\exists y \, [\empty \in y \, \land \, \forall x (x \in y \implies x \cup \{x\} \in y)],$$

which uses the empty set and sets obtained from the empty set by iterating the ordinal successor operation. Since these sets are finite, any set satisfying this axiom, such as ω, is much bigger than these sets. Fraenkel and Lévy regard the empty set and the infinite set of natural numbers, whose existence is implied by the axioms of infinity and separation, as the starting point for generating sets.

Von Neumann's approach to limitation of size uses the axiom of limitation of size. As mentioned in, von Neumann's axiom implies the axioms of separation, replacement, union, and choice. Like Fraenkel and Lévy, von Neumann had to add the axiom of infinity to his system since it cannot be proved from his other axioms. The differences between von Neumann's approach to limitation of size and Fraenkel and Lévy's approach are:
 * Von Neumann's axiom puts limitation of size into an axiom system, making it possible to prove most set existence axioms. The limitation of size doctrine justifies axioms using informal arguments that are more open to disagreement than a proof.
 * Von Neumann assumed the power set axiom since it cannot be proved from his other axioms. Fraenkel and Lévy state that the limitation of size doctrine justifies the power set axiom.

There is disagreement on whether the limitation of size doctrine justifies the power set axiom. Michael Hallett has analyzed the arguments given by Fraenkel and Lévy. Some of their arguments measure size by criteria other than cardinal size—for example, Fraenkel introduces "comprehensiveness" and "extendability." Hallett points out what he considers to be flaws in their arguments.

Hallett then argues that results in set theory seem to imply that there is no link between the size of an infinite set and the size of its power set. This would imply that the limitation of size doctrine is incapable of justifying the power set axiom because it requires that the power set of x is not "too much bigger" than x. For the case where size is measured by cardinal size, Hallett mentions Paul Cohen's work. Starting with a model of ZFC and $$\aleph_\alpha$$, Cohen built a model in which the cardinality of the power set of ω is $$\aleph_\alpha$$ if the cofinality of $$\aleph_\alpha$$ is not ω; otherwise, its cardinality is $$\aleph_{\alpha+1}$$. Since the cardinality of the power set of ω has no bound, there is no link between the cardinal size of ω and the cardinal size of P(ω).

Hallett also discusses the case where size is measured by "comprehensiveness," which considers a collection "too big" if it is of "unbounded comprehension" or "unlimited extent." He points out that for an infinite set, we cannot be sure that we have all its subsets without going through the unlimited extent of the universe. He also quotes John L. Bell and Moshé Machover: "... the power set P(u) of a given [infinite] set u is proportional not only to the size of u but also to the 'richness' of the entire universe ..." After making these observations, Hallett states: "One is led to suspect that there is simply no link between the size (comprehensiveness) of an infinite a and the size of P(a)."

Hallett considers the limitation of size doctrine valuable for justifying most of the axioms of set theory. His arguments only indicate that it cannot justify the axioms of infinity and power set. He concludes that "von Neumann's explicit assumption [of the smallness of power-sets] seems preferable to Zermelo's, Fraenkel's, and Lévy's obscurely hidden implicit assumption of the smallness of power-sets."

History
Von Neumann developed the axiom of limitation of size as a new method of identifying sets. ZFC identifies sets via its set building axioms. However, as Abraham Fraenkel pointed out: "The rather arbitrary character of the processes which are chosen in the axioms of Z [ZFC] as the basis of the theory, is justified by the historical development of set-theory rather than by logical arguments."

The historical development of the ZFC axioms began in 1908 when Zermelo chose axioms to eliminate the paradoxes and to support his proof of the well-ordering theorem. In 1922, Abraham Fraenkel and Thoralf Skolem pointed out that Zermelo's axioms cannot prove the existence of the set {Z0, Z1, Z2, ...} where Z0 is the set of natural numbers, and Zn+1 is the power set of Zn. They also introduced the axiom of replacement, which guarantees the existence of this set. However, adding axioms as they are needed neither guarantees the existence of all reasonable sets nor clarifies the difference between sets that are safe to use and collections that lead to contradictions.

In a 1923 letter to Zermelo, von Neumann outlined an approach to set theory that identifies sets that are "too big" and might lead to contradictions. Von Neumann identified these sets using the criterion: "A set is 'too big' if and only if it is equivalent with the set of all things." He then restricted how these sets may be used: "... in order to avoid the paradoxes those [sets] which are 'too big' are declared to be impermissible as elements." By combining this restriction with his criterion, von Neumann obtained his first version of the axiom of limitation of size, which in the language of classes states: A class is a proper class if and only if it is equinumerous with V. By 1925, Von Neumann modified his axiom by changing "it is equinumerous with V" to "it can be mapped onto V", which produces the axiom of limitation of size. This modification allowed von Neumann to give a simple proof of the axiom of replacement. Von Neumann's axiom identifies sets as classes that cannot be mapped onto V. Von Neumann realized that, even with this axiom, his set theory does not fully characterize sets.

Gödel found von Neumann's axiom to be "of great interest":


 * "In particular I believe that his [von Neumann's] necessary and sufficient condition which a property must satisfy, in order to define a set, is of great interest, because it clarifies the relationship of axiomatic set theory to the paradoxes. That this condition really gets at the essence of things is seen from the fact that it implies the axiom of choice, which formerly stood quite apart from other existential principles.  The inferences, bordering on the paradoxes, which are made possible by this way of looking at things, seem to me, not only very elegant, but also very interesting from the logical point of view. Moreover I believe that only by going farther in this direction, i.e., in the direction opposite to constructivism, will the basic problems of abstract set theory be solved."