The axioms of Zermelo-Fraenkel set theory, the role of choice, and the use of axioms as a foundation for mathematics.
Set theory becomes a foundation for mathematics only after its basic principles are stated as axioms. Informal set language is useful, but it leads to contradictions if every definable collection is allowed to be a set. The Zermelo-Fraenkel (ZF) axioms provide a controlled formal framework in which sets may be built, compared, and used without unrestricted comprehension.
Motivation
A naive view says that every collection described by a property should be a set. For example, one might try to form:
$$R = \{x : x \notin x\}.$$
This is the collection of all sets that are not elements of themselves. If $R$ were a set, then one could ask whether:
$$R \in R.$$
By the definition of $R$, we would have:
$$R \in R \iff R \notin R.$$
This is impossible. The contradiction shows that sets cannot be formed by arbitrary properties without restriction.
The axioms of ZF set theory avoid this problem by allowing only certain controlled ways of forming sets. Instead of saying that every property defines a set, ZF says that sets can be constructed from existing sets by specific operations.
Definition 7.98 (Language of Set Theory)
The language of set theory is a first-order language with one nonlogical binary relation symbol:
$$\in$$
The formula:
$$x \in y$$
is read as “$x$ is an element of $y$”.
Equality is part of first-order logic, so formulas may also use:
$$x = y.$$
All objects in the theory are sets. Thus set theory does not begin with separate primitive kinds such as numbers, functions, or relations. These objects are later represented as sets.
Axiom 7.99 (Extensionality)
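Sets with the same elements are equal:
$$\forall x\,\forall y\,\bigl(\forall z\,(z \in x \leftrightarrow z \in y) \rightarrow x = y\bigr).$$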
This axiom says that a set is completely determined by its elements. There is no additional hidden structure attached to a set beyond membership.
Example 7.100
By extensionality:
$$\{a, b\} = \{b, a\}$$
because both sets have the same elements.
Also:
$$\{a, a\} = \{a\}$$
because repetition does not create a new element.
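Extensionality has a direct finite analogue in programming languages whose set types ignore order and repetition. The following minimal Python sketch uses `frozenset` as a stand-in for finite sets; the element names `a` and `b` are placeholders chosen for illustration.

```python
# frozenset equality compares membership only, mirroring Extensionality.
a, b = "a", "b"

# Listing the elements in a different order gives the same set.
same_despite_order = frozenset([a, b]) == frozenset([b, a])

# Listing an element twice gives the same set as listing it once.
same_despite_repetition = frozenset([a, a]) == frozenset([a])

print(same_despite_order, same_despite_repetition)  # True True
```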
Axiom 7.101 (Empty Set)
There exists a set with no elements:
$$\exists x\,\forall y\,(y \notin x).$$
Such a set is denoted:
$$\varnothing.$$
The empty set is unique by extensionality.
Lemma 7.102 (Uniqueness of the Empty Set)
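If $x$ and $y$ have no elements, then:
$$x = y.$$
Proof
Assume:
$$\forall z\,(z \notin x)$$
and:
$$\forall z\,(z \notin y).$$
Then for every $z$, both statements $z \in x$ and $z \in y$ are false. Therefore:
$$z \in x \leftrightarrow z \in y$$
for every $z$. By extensionality:
$$x = y.$$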
Axiom 7.103 (Pairing)
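For any sets $a$ and $b$, there exists a set whose elements are exactly $a$ and $b$:
$$\forall a\,\forall b\,\exists c\,\forall x\,\bigl(x \in c \leftrightarrow (x = a \vee x = b)\bigr).$$
The set is denoted:
$$\{a, b\}.$$
If $a = b$, then:
$$\{a, b\} = \{a\}.$$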
This axiom allows the formation of unordered pairs and singleton sets.
Definition 7.104 (Singleton)
The singleton containing $a$ is:
$$\{a\} = \{a, a\}.$$
It is the unique set whose only element is $a$:
$$\forall x\,(x \in \{a\} \leftrightarrow x = a).$$
Axiom 7.105 (Union)
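For every set $A$, there exists a set whose elements are exactly the elements of elements of $A$:
$$\forall A\,\exists U\,\forall x\,\bigl(x \in U \leftrightarrow \exists y\,(y \in A \wedge x \in y)\bigr).$$
This set is denoted:
$$\bigcup A.$$
The union axiom allows many sets already collected inside $A$ to be merged into one set.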
Example 7.106
If:
$$A = \{\{a, b\}, \{b, c\}\}$$
then:
$$\bigcup A = \{a, b, c\}$$
because the elements of elements of $A$ are exactly $a$, $b$, and $c$.
Definition 7.107 (Binary Union)
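The union of two sets $A$ and $B$ is defined by:
$$A \cup B = \bigcup \{A, B\}.$$
Thus:
$$x \in A \cup B \iff (x \in A \vee x \in B).$$
This definition uses Pairing to form $\{A, B\}$ and Union to merge its elements.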
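The way binary union is assembled from Pairing and Union can be mirrored for hereditarily finite sets. The sketch below models such sets as nested Python `frozenset` values, which is an assumption of this illustration and not part of ZF; the function names `pair`, `big_union`, and `binary_union` are hypothetical.

```python
# Hereditarily finite sets modeled as nested frozensets (illustrative assumption).
EMPTY = frozenset()

def pair(a, b):
    """Pairing: the set whose elements are exactly a and b."""
    return frozenset({a, b})

def big_union(A):
    """Union: the set of all elements of elements of A."""
    return frozenset(x for y in A for x in y)

def binary_union(A, B):
    """Binary union, defined as the union of the pair {A, B}."""
    return big_union(pair(A, B))

one = pair(EMPTY, EMPTY)   # the singleton {∅}, since pair(a, a) has one element
two = pair(EMPTY, one)     # {∅, {∅}}

# {∅} ∪ {{∅}} = {∅, {∅}}
print(binary_union(one, frozenset({one})) == two)  # True
```

Defining `binary_union` through `pair` and `big_union`, rather than through Python's built-in `|`, keeps the code parallel to the definition in the text.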
Axiom 7.108 (Power Set)
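For every set $A$, there exists a set whose elements are exactly the subsets of $A$:
$$\forall A\,\exists P\,\forall x\,(x \in P \leftrightarrow x \subseteq A).$$
This set is denoted:
$$\mathcal{P}(A).$$
Thus:
$$x \in \mathcal{P}(A) \iff x \subseteq A.$$
The power set axiom is one of the main sources of large sets, because Cantor's theorem shows:
$$|A| < |\mathcal{P}(A)|$$
for every set $A$.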
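For finite sets, both the power set and the diagonal argument behind Cantor's theorem can be checked directly. The sketch below is only a finite illustration in Python; the helper names `power_set` and `diagonal` and the sample map `f` are assumptions of this example.

```python
from itertools import combinations

def power_set(A):
    """Return the set of all subsets of the finite set A."""
    items = list(A)
    return frozenset(
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    )

def diagonal(A, f):
    """Diagonal set {x in A : x not in f(x)} for a map f from A to subsets of A."""
    return frozenset(x for x in A if x not in f[x])

A = frozenset({0, 1, 2})
P = power_set(A)

# A sample map from A into its power set, given as a dict (hypothetical data).
f = {0: frozenset(), 1: frozenset({0, 1}), 2: frozenset({0})}
D = diagonal(A, f)

print(len(A), len(P))            # 3 8
print(D in P, D in f.values())   # True False
```

The diagonal set is a subset of `A` that differs from every value `f[x]` at the point `x`, so no map from `A` to its power set can be surjective.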
Axiom Schema 7.109 (Separation)
Let $\varphi(x)$ be any formula of the language of set theory, possibly with parameters. For every set $A$, there exists a set containing exactly those elements of $A$ that satisfy $\varphi$:
$$\forall A\,\exists B\,\forall x\,\bigl(x \in B \leftrightarrow (x \in A \wedge \varphi(x))\bigr).$$
This set is denoted:
$$\{x \in A : \varphi(x)\}.$$
Separation is an axiom schema because there is one axiom for each formula $\varphi$.
The restriction $x \in A$ is essential. Separation does not say that every definable collection is a set. It only says that a definable subcollection of an existing set is a set.
Example 7.110
If:
$$A = \{0, 1, 2, 3, 4, 5\}$$
then Separation allows the formation of:
$$\{x \in A : x \text{ is even}\}$$
which is:
$$\{0, 2, 4\}.$$
The property “is even” is used only to select elements from a set already known to exist.
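Separation corresponds closely to a filtered comprehension over an existing collection. The Python sketch below is a finite analogue only; the function name `separation` and the sample set `A` (the integers 0 through 5) are illustrative assumptions.

```python
def separation(A, phi):
    """Return {x in A : phi(x)}; the result is always a subset of A."""
    return frozenset(x for x in A if phi(x))

# An existing set, chosen here for illustration.
A = frozenset(range(6))

# Select the even elements of A; nothing outside A can ever be selected.
evens = separation(A, lambda x: x % 2 == 0)

print(sorted(evens))  # [0, 2, 4]
```

The predicate alone never produces a set here: `separation` always requires an ambient set `A` to filter, which is exactly the restriction that blocks unrestricted comprehension.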
Lemma 7.111 (Russell Collection Is Not a Set)
There is no set:
$$R = \{x : x \notin x\}.$$
Proof
Suppose such a set $R$ exists. Then by its defining property:
$$x \in R \iff x \notin x$$
for every set $x$.
Taking $x = R$, we obtain:
$$R \in R \iff R \notin R.$$
This is impossible. Therefore no such set exists.
This argument explains why Separation is restricted to subsets of already existing sets.
Axiom Schema 7.112 (Replacement)
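Let $\varphi(x, y)$ be a formula such that for each $x$ in a set $A$ there is a unique $y$ satisfying $\varphi(x, y)$. Then the collection of all such $y$ is a set.
Formally:
$$\forall A\,\Bigl[\forall x \in A\ \exists!\,y\ \varphi(x, y) \rightarrow \exists B\,\forall y\,\bigl(y \in B \leftrightarrow \exists x \in A\ \varphi(x, y)\bigr)\Bigr].$$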
Replacement says that the image of a set under a definable function is again a set.
This axiom is needed for many transfinite constructions, because it allows a family of sets indexed by an ordinal, however large, to be collected into a single set.
Example 7.113
Suppose $A$ is a set and the formula $\varphi(x, y)$ defines:
$$y = F(x).$$
Then Replacement gives the set:
$$\{F(x) : x \in A\}.$$
This is the image of $A$ under the operation $F$.
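For finite sets, Replacement amounts to taking the image of a set under a function. The following Python sketch is a finite illustration; the name `replacement` and the two sample operations are assumptions made for this example.

```python
def replacement(A, F):
    """Return the image {F(x) : x in A} of the finite set A under F."""
    return frozenset(F(x) for x in A)

A = frozenset({0, 1, 2, 3})

# Image under the operation x -> {x}: one singleton per element of A.
singletons = replacement(A, lambda x: frozenset({x}))

# Image under x -> x mod 2: distinct inputs may share an output.
parities = replacement(A, lambda x: x % 2)

print(len(singletons), len(parities))  # 4 2
```

The second call shows that the image can have fewer elements than `A`, since the operation need not be injective.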
Axiom 7.114 (Infinity)
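There exists a set containing the empty set and closed under the successor operation:
$$\exists I\,\bigl(\varnothing \in I \wedge \forall x\,(x \in I \rightarrow x \cup \{x\} \in I)\bigr).$$
The successor of a set $x$ is:
$$S(x) = x \cup \{x\}.$$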
The axiom of infinity guarantees the existence of an infinite set. It is the axiom that allows the construction of the natural numbers as sets.
Definition 7.115 (Natural Numbers as Sets)
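The natural numbers are represented by finite ordinals:
$$0 = \varnothing, \quad 1 = \{0\} = \{\varnothing\}, \quad 2 = \{0, 1\} = \{\varnothing, \{\varnothing\}\}, \quad 3 = \{0, 1, 2\},$$
and in general:
$$n + 1 = n \cup \{n\} = \{0, 1, \ldots, n\}.$$
The set of all natural numbers is denoted:
$$\omega.$$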
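The finite ordinals can be built explicitly by iterating the successor operation from the empty set. The Python sketch below models them as nested `frozenset` values, which is an assumption of this illustration; the names `successor` and `von_neumann` are hypothetical.

```python
EMPTY = frozenset()

def successor(x):
    """Return x ∪ {x}."""
    return x | frozenset({x})

def von_neumann(n):
    """Return the set representing the natural number n."""
    x = EMPTY
    for _ in range(n):
        x = successor(x)
    return x

three = von_neumann(3)

# The set representing n has exactly n elements: 0, 1, ..., n - 1.
print(len(three))                                           # 3
print(three == frozenset(von_neumann(k) for k in range(3)))  # True
# For these sets, m < n corresponds to membership.
print(von_neumann(2) in three)                              # True
```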
Axiom 7.116 (Foundation)
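Every nonempty set has an element disjoint from it:
$$\forall x\,\bigl(x \neq \varnothing \rightarrow \exists y\,(y \in x \wedge y \cap x = \varnothing)\bigr).$$
This axiom is also called the axiom of regularity.
It prevents infinitely descending membership chains such as:
$$x_0 \ni x_1 \ni x_2 \ni \cdots$$
It also rules out sets satisfying:
$$x \in x.$$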
Lemma 7.117 (No Set Is an Element of Itself)
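Assuming Foundation, no set $x$ satisfies:
$$x \in x.$$
Proof
Suppose, for contradiction, that:
$$x \in x.$$
Consider the singleton:
$$\{x\}.$$
This set is nonempty. By Foundation, there exists an element $y \in \{x\}$ such that:
$$y \cap \{x\} = \varnothing.$$
Since $y \in \{x\}$ and the only element of $\{x\}$ is $x$, we have:
$$y = x.$$
Thus:
$$x \cap \{x\} = \varnothing.$$
But $x \in x$ and $x \in \{x\}$, so:
$$x \in x \cap \{x\}.$$
This contradicts:
$$x \cap \{x\} = \varnothing.$$
Therefore:
$$x \notin x$$
for every set $x$.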
Axiom 7.118 (Choice)
The axiom of choice says that for every set of nonempty pairwise disjoint sets, there is a set containing exactly one element from each of them.
One common formulation is:
For every set $X$ whose elements are nonempty sets, there exists a function $f$ with domain $X$ such that:
$$f(A) \in A$$
for every $A \in X$.
Such a function is called a choice function.
Definition 7.119 (ZF and ZFC)
ZF is the theory consisting of the following axioms and axiom schemas:
- Extensionality
- Empty Set
- Pairing
- Union
- Power Set
- Separation
- Replacement
- Infinity
- Foundation
ZFC is ZF together with the axiom of choice.
Thus:
$$\mathrm{ZFC} = \mathrm{ZF} + \mathrm{AC},$$
where $\mathrm{AC}$ denotes the axiom of choice.
Example 7.120
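Let:
$$X = \{\{1, 2\}, \{3, 4\}, \{5\}\}.$$
A choice function on $X$ may be given by:
$$f(\{1, 2\}) = 1, \quad f(\{3, 4\}) = 3, \quad f(\{5\}) = 5.$$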
The axiom of choice becomes important when the family of nonempty sets is infinite and no explicit rule is given for choosing one element from each set.
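When an explicit rule is available, a choice function can simply be written down, and no appeal to the axiom of choice is needed. The Python sketch below uses the rule "take the least element" on a small family of sets of integers; the family `X` and the name `choice_by_minimum` are illustrative assumptions.

```python
def choice_by_minimum(X):
    """Return a dict sending each nonempty set A in X to its least element."""
    return {A: min(A) for A in X}

# A finite family of nonempty sets of integers (sample data).
X = frozenset({frozenset({1, 2}), frozenset({3, 4}), frozenset({5})})

f = choice_by_minimum(X)

# The defining property of a choice function: f(A) is an element of A.
print(all(f[A] in A for A in X))  # True
```

The rule works here because every nonempty finite set of integers has a least element. For an arbitrary infinite family with no such uniform rule, the existence of a choice function is what the axiom asserts.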
Theorem 7.121 (Equivalent Forms of Choice)
Over ZF, the following principles are equivalent:
- The axiom of choice.
- Every set can be well ordered.
- Every surjective function has a right inverse.
- Every vector space has a basis.
- Every product of nonempty sets is nonempty.
Proof
We do not prove all equivalences here, but we explain the main connections.
The axiom of choice gives a way to choose elements during the construction of a well ordering, which yields the well ordering theorem.
If every set can be well ordered, then given any family of nonempty sets, choose the least element of each member under a well ordering of the union. This gives a choice function.
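If every surjection has a right inverse, then let $X$ be a family of nonempty sets and apply this to the map $(A, a) \mapsto A$ from the set of pairs $(A, a)$ with $a \in A \in X$ onto $X$. A right inverse sends each $A \in X$ to a pair $(A, a)$ with $a \in A$, and taking second coordinates gives a choice function.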
The equivalence with bases of vector spaces and products of nonempty sets shows that the axiom of choice appears throughout algebra, topology, and analysis, not only in set theory.
Cumulative Hierarchy
ZF is often understood through the cumulative hierarchy. Sets are built in stages.
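Start with:
$$V_0 = \varnothing.$$
At a successor stage:
$$V_{\alpha+1} = \mathcal{P}(V_\alpha).$$
At a limit stage $\lambda$:
$$V_\lambda = \bigcup_{\alpha < \lambda} V_\alpha.$$
The universe of sets is then informally described as:
$$V = \bigcup_{\alpha \in \mathrm{Ord}} V_\alpha.$$
This is not a set union over all ordinals inside ZF, but a class-sized construction. It expresses the idea that every set appears at some stage of the hierarchy.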
Definition 7.122 (Rank)
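The rank of a set $x$ is the least ordinal $\alpha$ such that:
$$x \subseteq V_\alpha.$$
Equivalently, $\alpha$ is the least ordinal such that $x$ appears by stage $\alpha + 1$:
$$x \in V_{\alpha+1}.$$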
Rank measures how deeply a set is built from the empty set.
Example 7.123
The empty set has rank:
$$0.$$
The set:
$$\{\varnothing\}$$
has rank:
$$1.$$
The set:
$$\{\varnothing, \{\varnothing\}\}$$
has rank:
$$2.$$
Finite ordinals have finite rank, while sets involving $\omega$ may have infinite rank.
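For hereditarily finite sets, both the finite stages of the hierarchy and the rank function can be computed directly. The Python sketch below models such sets as nested `frozenset` values, an assumption of this illustration; the names `power_set`, `stage`, and `rank` are hypothetical, and only finite stages are covered.

```python
from itertools import combinations

EMPTY = frozenset()

def power_set(A):
    """Return the set of all subsets of the finite set A."""
    items = list(A)
    return frozenset(
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    )

def stage(n):
    """Return the n-th stage for a natural number n, starting from the empty set."""
    V = EMPTY
    for _ in range(n):
        V = power_set(V)
    return V

def rank(x):
    """Least n such that x is a subset of stage(n): the sup of rank(y) + 1 over y in x."""
    return max((rank(y) + 1 for y in x), default=0)

one = frozenset({EMPTY})
two = frozenset({EMPTY, one})

print(rank(EMPTY), rank(one), rank(two))  # 0 1 2
print(len(stage(3)), len(stage(4)))       # 4 16
```

The recursive formula for `rank` terminates because membership chains in these nested values are finite, which is the computational counterpart of Foundation.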
Classes and Sets
Not every definable collection is a set. A collection described by a formula may be too large to be a set. Such a collection is often called a proper class.
Examples include:
$$V = \{x : x = x\},$$
the collection of all sets, and:
$$\mathrm{Ord} = \{\alpha : \alpha \text{ is an ordinal}\},$$
the collection of all ordinals.
These collections can be discussed informally or in class theories, but they are not sets in ZF.
Lemma 7.124 (There Is No Set of All Sets)
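There is no set $V$ such that:
$$\forall x\,(x \in V).$$
Proof
Suppose such a set $V$ exists. By Separation, form:
$$R = \{x \in V : x \notin x\}.$$
Since $V$ contains every set, the set $R$ is an element of $V$. Therefore the defining condition applies to $R$:
$$R \in R \iff R \notin R.$$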
This is impossible. Hence there is no set of all sets.
Lemma 7.125 (There Is No Set of All Ordinals)
There is no set whose elements are exactly all ordinals.
Proof
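Suppose, for contradiction, that:
$$\mathrm{Ord} = \{\alpha : \alpha \text{ is an ordinal}\}$$
is a set.
Since every element of $\mathrm{Ord}$ is an ordinal and ordinals are linearly ordered by membership, the union:
$$\gamma = \bigcup \mathrm{Ord}$$
is an ordinal. Because $\mathrm{Ord}$ contains all ordinals:
$$\gamma \in \mathrm{Ord}.$$
Now $\gamma$ is an ordinal, so its successor:
$$\gamma + 1 = \gamma \cup \{\gamma\}$$
is also an ordinal. Hence:
$$\gamma + 1 \in \mathrm{Ord}.$$
But every element of $\mathrm{Ord}$ is a subset of $\bigcup \mathrm{Ord} = \gamma$, so:
$$\gamma + 1 \subseteq \gamma.$$
This is impossible, since:
$$\gamma \in \gamma + 1$$
but:
$$\gamma \notin \gamma.$$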
Therefore the ordinals cannot form a set.
Role of ZF and ZFC
ZF provides the standard background theory for most ordinary mathematics. It supports the construction of numbers, ordered pairs, relations, functions, algebraic structures, topological spaces, and many other objects used throughout mathematics.
ZFC is stronger because it includes the axiom of choice. Many standard theorems depend on choice, including the well-ordering theorem, Tychonoff's theorem for arbitrary products, and the existence of bases for arbitrary vector spaces.
At the same time, some statements cannot be decided from ZF or ZFC alone, assuming these theories are consistent. Later chapters study this phenomenon through constructibility, forcing, independence, and large cardinals.