Equivalent forms of the axiom of choice, including Zorn's lemma, the well ordering theorem, maximal principles, and right inverses of surjections.
The axiom of choice has many equivalent forms. Some of these forms speak about selecting elements from sets, some speak about ordering arbitrary sets, and some speak about finding maximal objects in partially ordered sets. Although these statements look different, they express the same underlying principle over the usual axioms of ZF set theory.
The purpose of this section is to explain the most important equivalent formulations in detail. The main examples are the product form of choice, the statement that every surjection has a right inverse, the well ordering theorem, Zorn’s lemma, and the Hausdorff maximal principle.
The Product Form of Choice
The axiom of choice says that every family of nonempty sets has a choice function. This can be restated in terms of products.
Let be an indexed family of sets. The product of this family is:
An element of this product is exactly a choice function. It chooses one element from each set .
Proposition 8.21 (Product Form)
The axiom of choice is equivalent to the statement that whenever is a family of nonempty sets, the product:
is nonempty.
Proof
Assume the axiom of choice. Let be a family of nonempty sets. By the axiom of choice, there exists a function such that:
for every . By the definition of the product, this function is an element of:
Therefore the product is nonempty.
Conversely, assume that every product of nonempty sets is nonempty. Let be any family of nonempty sets. By assumption:
so there exists:
By the definition of product, this means:
for every . Thus is a choice function for the family, and the axiom of choice follows.
This formulation is often the most compact way to state choice. It says that an arbitrary product of nonempty sets is nonempty.
Right Inverses of Surjections
Another useful form of choice is stated in terms of surjective functions. A surjective function sends at least one element of its domain to every element of its codomain. A right inverse chooses one preimage for each point in the codomain.
Definition 8.22 (Right Inverse)
Let:
be a function. A right inverse of is a function:
such that:
for every .
Equivalently:
The function chooses, for each , one element such that .
Proposition 8.23
The axiom of choice is equivalent to the statement that every surjective function has a right inverse.
Proof
Assume the axiom of choice. Let:
be a surjective function.
For each , define the fiber over :
Since is surjective, each fiber is nonempty:
for every .
The family:
is therefore a family of nonempty sets. By the axiom of choice, there exists a choice function:
such that:
for every .
Since , the definition of gives:
for every . Hence:
and is a right inverse of .
Conversely, assume every surjective function has a right inverse. Let:
be a family of nonempty sets. We must construct a choice function.
Define:
Define:
by:
We show that is surjective. Let . Since is nonempty, choose an element:
Then:
and:
Therefore every lies in the image of , so is surjective.
By assumption, has a right inverse:
such that:
for every .
Since , there exist and such that:
But:
and:
Thus:
Therefore, for each , the element has the form:
for some:
Define:
Then:
for every , so is a choice function. This proves the axiom of choice.
Well Ordered Sets
The well ordering theorem is another equivalent form of choice. It says that every set can be ordered so strongly that every nonempty subset has a first element.
Definition 8.24 (Total Order)
A total order on a set is a relation such that, for all :
.
If and , then .
If and , then .
Either or .
The first three conditions say that is a partial order. The fourth condition says that any two elements are comparable.
Definition 8.25 (Well Order)
A well order on a set is a total order such that every nonempty subset of has a least element.
That means that if:
and:
then there exists such that:
for every .
The usual order on is a well order. Every nonempty subset of has a least element.
The usual order on is not a well order, because itself has no least element.
The usual order on is not a well order, because the interval:
has no least element.
Theorem 8.26 (Well Ordering Theorem)
Every set can be well ordered.
This means that for every set , there exists some relation on such that is a well ordered set.
The theorem does not say that the well order is natural. It only says that some well order exists.
Proposition 8.27
The axiom of choice implies the well ordering theorem.
Proof
Assume the axiom of choice. Let be a set. If:
then the empty relation is a well order on , so suppose that is nonempty.
By the axiom of choice, there exists a choice function on the family of all nonempty subsets of . Thus there is a function:
such that:
for every nonempty subset .
We now build an enumeration of the elements of by transfinite recursion. At each stage, if there are elements not yet chosen, we choose one of them.
Suppose elements have already been chosen for all . Let:
If:
define:
Thus is chosen from the remaining elements of .
This construction cannot continue through all ordinals while producing distinct elements of , because is a set and the class of all ordinals is not a set. Hence there is some ordinal such that:
Then:
Define an order on by:
if and only if:
This order is total because the ordinals below are totally ordered.
It is a well order because every nonempty subset corresponds to a nonempty set of ordinals:
Every nonempty set of ordinals has a least element. Let:
Then is the least element of under the order just defined.
Therefore can be well ordered.
Proposition 8.28
The well ordering theorem implies the axiom of choice.
Proof
Assume every set can be well ordered. Let:
be a family of nonempty sets. We must construct a choice function.
Let:
By the well ordering theorem, there exists a well order on .
For each , the set is a nonempty subset of . Since is a well order, has a least element.
Define:
Then:
for every .
Thus is a choice function for the family, so the axiom of choice holds.
Corollary 8.29
The axiom of choice is equivalent to the well ordering theorem.
Proof
Proposition 8.27 proves that the axiom of choice implies the well ordering theorem. Proposition 8.28 proves that the well ordering theorem implies the axiom of choice. Therefore the two statements are equivalent.
Partially Ordered Sets
Zorn’s lemma is stated using partially ordered sets. Before stating it, we recall the relevant definitions.
Definition 8.30 (Partial Order)
A partial order on a set is a relation satisfying reflexivity, antisymmetry, and transitivity.
Thus, for all :
.
If and , then .
If and , then .
A set equipped with a partial order is called a partially ordered set, or poset.
Unlike a total order, a partial order does not require every two elements to be comparable.
Definition 8.31 (Chain)
Let be a partially ordered set. A chain in is a subset such that for all , either:
or:
Thus a chain is a subset on which the partial order becomes a total order.
Definition 8.32 (Upper Bound)
Let be a partially ordered set, and let . An upper bound for is an element such that:
for every .
The upper bound does not need to belong to . It only needs to lie above every element of .
Definition 8.33 (Maximal Element)
Let be a partially ordered set. An element is maximal if:
implies:
for every .
Equivalently, there is no element such that:
A maximal element is not necessarily above every element of . It simply cannot be extended upward inside the order.
Lemma 8.34 (Greatest Elements and Maximal Elements)
Every greatest element is maximal, but a maximal element need not be greatest.
Proof
Suppose is a greatest element of . Then:
for every .
If:
then since is greatest, we also have:
By antisymmetry:
Therefore is maximal.
For the converse, consider the set:
with the partial order given only by:
and:
Then both and are maximal, because neither has a strictly larger element above it. However, there is no greatest element, since and are incomparable.
Zorn’s Lemma
Zorn’s lemma is one of the most useful equivalent forms of the axiom of choice. It is used to prove the existence of maximal objects.
Theorem 8.35 (Zorn’s Lemma)
Let be a nonempty partially ordered set. Suppose that every chain in has an upper bound in . Then has a maximal element.
The hypothesis says that every totally ordered subcollection can be extended upward to some element of the poset. The conclusion says that there exists an object that cannot be enlarged further.
Proposition 8.36
The well ordering theorem implies Zorn’s lemma.
Proof
Assume the well ordering theorem. Let be a nonempty partially ordered set in which every chain has an upper bound.
By the well ordering theorem, choose a well order of the underlying set . This well order is not required to agree with the partial order . It is used only to make canonical choices.
We construct a chain by adding elements whenever possible.
Given a chain , say that an element is an extension of if:
is still a chain and is not already in .
If extensions exist, choose the least extension and add it. If no extensions exist, stop.
More formally, one performs this construction by transfinite recursion, using the well order to choose the least possible next element whenever one exists.
The construction must eventually stop, because it cannot add distinct elements of the set through all ordinals. Let be the chain obtained when no further extension is possible.
We claim that has an upper bound . This follows from the hypothesis, since every chain in has an upper bound.
We now show that is maximal. Suppose:
for some .
Since is an upper bound for , every satisfies:
Together with:
this gives:
for every .
Thus:
is a chain, and is an extension of unless .
But if , then because is an upper bound for , we have:
which contradicts:
Therefore is a genuine extension of , contradicting the fact that the construction stopped.
Hence no such exists, and is maximal.
Proposition 8.37
Zorn’s lemma implies the axiom of choice.
Proof
Assume Zorn’s lemma. Let:
be a family of nonempty sets. We must construct a choice function.
Let be the set of all partial choice functions. An element of is a function such that:
.
for every .
Order by extension:
if and only if:
and:
for every .
This means that extends .
The set is nonempty, since the empty function belongs to .
Let be a chain in . Define:
We show that is a partial choice function. Since is linearly ordered by extension, any two functions in agree on their common domain. Therefore the union is a function.
If , then for some . Since is a partial choice function:
Thus .
Also, every is extended by , so is an upper bound for in .
By Zorn’s lemma, has a maximal element .
We claim that:
Suppose not. Then there exists:
Since is nonempty, choose:
Define:
Then is a partial choice function extending , and it is strictly larger than . This contradicts the maximality of .
Therefore:
Hence is a choice function for the whole family. This proves the axiom of choice.
Corollary 8.38
The axiom of choice, the well ordering theorem, and Zorn’s lemma are equivalent.
Proof
The axiom of choice implies the well ordering theorem by Proposition 8.27. The well ordering theorem implies Zorn’s lemma by Proposition 8.36. Zorn’s lemma implies the axiom of choice by Proposition 8.37. Therefore all three statements are equivalent.
Hausdorff Maximal Principle
The Hausdorff maximal principle is another maximal principle equivalent to the axiom of choice. It says that every partially ordered set contains a maximal chain.
Theorem 8.39 (Hausdorff Maximal Principle)
Every partially ordered set has a maximal chain.
A maximal chain is a chain that cannot be enlarged by adding more elements while remaining a chain.
Proposition 8.40
Zorn’s lemma implies the Hausdorff maximal principle.
Proof
Let be a partially ordered set. Let be the set of all chains in , ordered by inclusion.
Thus:
if and only if:
We apply Zorn’s lemma to .
Let be a chain in . This means that is a collection of chains in , and any two members of are comparable by inclusion.
Define:
We show that is a chain in .
Let . Then there exist chains such that:
and:
Since is a chain under inclusion, either:
or:
If , then both and are in , and since is a chain, and are comparable.
If , the same argument applies using .
Thus is a chain in .
Also:
for every , so is an upper bound for in .
By Zorn’s lemma, has a maximal element. Such an element is exactly a maximal chain in .
Proposition 8.41
The Hausdorff maximal principle implies Zorn’s lemma.
Proof
Assume the Hausdorff maximal principle. Let be a nonempty partially ordered set in which every chain has an upper bound.
By the Hausdorff maximal principle, there exists a maximal chain:
By hypothesis, this chain has an upper bound:
Thus:
for every .
We claim that is maximal in .
Suppose not. Then there exists such that:
For every , we have:
Hence:
for every .
Therefore:
is a chain.
Moreover, . If , then since is an upper bound of , we would have:
which contradicts:
Thus is a strictly larger chain than , contradicting the maximality of .
Therefore no such exists, and is maximal. Hence Zorn’s lemma holds.
Corollary 8.42
Zorn’s lemma and the Hausdorff maximal principle are equivalent.
Proof
Proposition 8.40 proves that Zorn’s lemma implies the Hausdorff maximal principle. Proposition 8.41 proves that the Hausdorff maximal principle implies Zorn’s lemma. Therefore the two principles are equivalent.
Maximal Objects in Algebra
Zorn’s lemma is widely used because many mathematical objects are naturally ordered by inclusion.
For example, let be a vector space, and consider the set of all linearly independent subsets of , ordered by inclusion. A maximal element in this poset is a linearly independent set that cannot be enlarged while preserving linear independence. Such a set is a basis.
The use of Zorn’s lemma follows a common pattern:
Define a poset of partial objects.
Show that every chain has an upper bound.
Apply Zorn’s lemma to obtain a maximal object.
Prove that the maximal object has the desired stronger property.
This pattern appears throughout algebra, topology, and analysis.
Proposition 8.43 (Every Vector Space Has a Basis)
Assume Zorn’s lemma. Every vector space has a basis.
Proof
Let be a vector space over a field . Let be the set of all linearly independent subsets of , ordered by inclusion.
The set is nonempty because:
Let be a chain in . Define:
We show that is linearly independent.
Let:
and suppose:
For each , since , there exists such that:
Because is a chain under inclusion, among the finitely many sets:
there is one that contains all the others. Call it .
Then:
Since is linearly independent, we have:
Therefore is linearly independent.
Thus , and is an upper bound for the chain .
By Zorn’s lemma, has a maximal element . Then is a maximal linearly independent subset of .
We show that spans . Suppose not. Then there exists:
such that:
Then:
is linearly independent. Indeed, if:
is a finite linear relation with , then if , we could solve for as a linear combination of elements of , contradicting . Hence , and then all remaining coefficients are zero because is linearly independent.
Thus is a larger linearly independent set, contradicting the maximality of .
Therefore:
So is both linearly independent and spanning, hence is a basis of .
Summary of Equivalent Forms
Over ZF, the following principles are equivalent:
The axiom of choice.
Every product of nonempty sets is nonempty.
Every surjective function has a right inverse.
Every set can be well ordered.
Zorn’s lemma.
The Hausdorff maximal principle.
Each form is useful in a different setting. The product form is closest to the original statement of choice. The right inverse form is useful in category theoretic and function theoretic arguments. The well ordering theorem is useful for transfinite recursion and comparison of sizes. Zorn’s lemma and the Hausdorff maximal principle are useful for proving existence of maximal objects.