Skip to content

8.1 Axiom of Choice

The axiom of choice, choice functions, indexed families, and first consequences in axiomatic set theory.

The axiom of choice is an additional principle of set theory which says that, given any family of nonempty sets, it is possible to choose one element from each set in the family. The statement sounds simple, but it has deep consequences because it applies even when the family is infinite and no explicit rule for making the choices is given.

In ordinary finite mathematics, choosing one element from each of several nonempty sets causes no difficulty, because the choices can be made one after another. The axiom of choice becomes important when there may be infinitely many sets and no definable procedure is available for selecting the elements.

Indexed Families

Before stating the axiom, we fix the language used to describe a family of sets.

Definition 8.1 (Indexed Family)

An indexed family of sets is a function:

A:IV A : I \to V

where II is a set, and for each iIi \in I, the value A(i)A(i) is a set.

We usually write:

Ai A_i

instead of A(i)A(i), and we write the family as:

(Ai)iI (A_i)_{i \in I}

The set II is called the index set, and each AiA_i is called a member of the family.

The family is nonempty pointwise if:

Ai A_i \ne \varnothing

for every iIi \in I.

This means that every set in the family contains at least one element, although the elements may be different for different indices.

Definition 8.2 (Choice Function)

Let (Ai)iI(A_i)_{i \in I} be an indexed family of nonempty sets. A choice function for this family is a function:

f:IiIAi f : I \to \bigcup_{i \in I} A_i

such that:

f(i)Ai f(i) \in A_i

for every iIi \in I.

Thus a choice function selects one element from each set AiA_i.

The condition f(i)Aif(i) \in A_i is the essential part of the definition. It says that the chosen value at index ii must actually come from the set indexed by ii.

Axiom 8.3 (Axiom of Choice)

For every indexed family (Ai)iI(A_i)_{i \in I} of nonempty sets, there exists a choice function:

f:IiIAi f : I \to \bigcup_{i \in I} A_i

such that:

f(i)Ai f(i) \in A_i

for every iIi \in I.

Equivalently, if every AiA_i is nonempty, then one can choose an element:

aiAi a_i \in A_i

for every iIi \in I.

The axiom does not provide a formula for aia_i. It only asserts that a function making all these choices exists.

Example 8.4 (A Finite Family)

Let:

A1={2,4,6},A2={1,3,5},A3={0,10} A_1 = \{2,4,6\}, \quad A_2 = \{1,3,5\}, \quad A_3 = \{0,10\}

A choice function may be defined by:

f(1)=2,f(2)=3,f(3)=0 f(1)=2, \quad f(2)=3, \quad f(3)=0

This is a choice function because:

2A1,3A2,0A3 2 \in A_1, \quad 3 \in A_2, \quad 0 \in A_3

No special axiom is needed here, because there are only finitely many choices to make.

Example 8.5 (A Family with a Definable Choice Rule)

Let:

An={mN:m>n} A_n = \{m \in \mathbb{N} : m > n\}

for each nNn \in \mathbb{N}.

Each AnA_n is nonempty. A choice function is:

f(n)=n+1 f(n)=n+1

Indeed:

n+1An n+1 \in A_n

for every nNn \in \mathbb{N}.

This example also does not require the full strength of the axiom of choice, because a concrete rule for choosing elements is available.

Example 8.6 (Where Choice Becomes Nontrivial)

Suppose (Ai)iI(A_i)_{i \in I} is an arbitrary family of nonempty sets, and suppose no ordering, rule, or additional structure is given on the sets AiA_i.

For each ii, the statement:

Ai A_i \ne \varnothing

says that there exists some element of AiA_i.

However, the separate assertions:

iI, x, xAi \forall i \in I,\ \exists x,\ x \in A_i

do not by themselves give a single function ff satisfying:

iI, f(i)Ai \forall i \in I,\ f(i) \in A_i

The axiom of choice is precisely the principle that allows us to pass from these many separate existence statements to one simultaneous choice function.

The Product Form

The axiom of choice is often stated using Cartesian products.

Definition 8.7 (Product of a Family of Sets)

Let (Ai)iI(A_i)_{i \in I} be an indexed family of sets. The product of the family is:

iIAi={f:IiIAi:f(i)Ai for every iI} \prod_{i \in I} A_i = \{f : I \to \bigcup_{i \in I} A_i : f(i) \in A_i \text{ for every } i \in I\}

An element of:

iIAi \prod_{i \in I} A_i

is exactly a choice function for the family (Ai)iI(A_i)_{i \in I}.

Thus the product is the set of all possible simultaneous choices.

Proposition 8.8 (Product Form of the Axiom of Choice)

The axiom of choice is equivalent to the statement that for every indexed family (Ai)iI(A_i)_{i \in I}, if:

Ai A_i \ne \varnothing

for every iIi \in I, then:

iIAi \prod_{i \in I} A_i \ne \varnothing

Proof

Assume the axiom of choice. Let (Ai)iI(A_i)_{i \in I} be a family of nonempty sets. By the axiom of choice, there exists a function ff such that:

f(i)Ai f(i) \in A_i

for every iIi \in I. By the definition of the product, this function ff is an element of:

iIAi \prod_{i \in I} A_i

Therefore:

iIAi \prod_{i \in I} A_i \ne \varnothing

Conversely, assume that every product of a family of nonempty sets is nonempty. Let (Ai)iI(A_i)_{i \in I} be a family of nonempty sets. By the assumed product statement:

iIAi \prod_{i \in I} A_i \ne \varnothing

so there exists an element:

fiIAi f \in \prod_{i \in I} A_i

By the definition of the product, this means that ff is a function with domain II and:

f(i)Ai f(i) \in A_i

for every iIi \in I. Hence ff is a choice function for the family. This is exactly the axiom of choice.

Finite Choice

Finite choice can be proved without the axiom of choice. This is important because it separates ordinary finite selection from the stronger infinite principle.

Proposition 8.9 (Choice for Two Sets)

If AA and BB are nonempty sets, then there exists a function:

f:{0,1}AB f : \{0,1\} \to A \cup B

such that:

f(0)A f(0) \in A

and:

f(1)B f(1) \in B

Proof

Since AA is nonempty, there exists an element:

aA a \in A

Since BB is nonempty, there exists an element:

bB b \in B

Define:

f(0)=a f(0)=a

and:

f(1)=b f(1)=b

Then ff is a function with domain {0,1}\{0,1\}, and it satisfies:

f(0)Aandf(1)B f(0) \in A \quad \text{and} \quad f(1) \in B

as required.

Proposition 8.10 (Finite Choice)

Let nNn \in \mathbb{N}, and let:

A0,A1,,An1 A_0, A_1, \dots, A_{n-1}

be nonempty sets. Then there exists a function:

f:{0,1,,n1}k<nAk f : \{0,1,\dots,n-1\} \to \bigcup_{k<n} A_k

such that:

f(k)Ak f(k) \in A_k

for every k<nk<n.

Proof

We prove the statement by induction on nn.

For n=0n=0, the index set is empty. The empty function is a function from the empty set to the empty union, and it vacuously satisfies the condition that f(k)Akf(k) \in A_k for every k<0k<0, since there are no such indices.

Assume the statement holds for nn. Let:

A0,A1,,An A_0, A_1, \dots, A_n

be nonempty sets. By the induction hypothesis, there exists a function:

f:{0,1,,n1}k<nAk f : \{0,1,\dots,n-1\} \to \bigcup_{k<n} A_k

such that:

f(k)Ak f(k) \in A_k

for every k<nk<n.

Since AnA_n is nonempty, there exists:

anAn a_n \in A_n

Define a new function gg on {0,1,,n}\{0,1,\dots,n\} by:

g(k)=f(k) g(k)=f(k)

for k<nk<n, and:

g(n)=an g(n)=a_n

Then g(k)Akg(k) \in A_k for every k<nk<n by the induction hypothesis, and g(n)Ang(n) \in A_n by the choice of ana_n. Hence:

g(k)Ak g(k) \in A_k

for every knk \le n.

Therefore the statement holds for n+1n+1, and the result follows by induction.

Countable Choice

A weaker form of the axiom of choice is countable choice.

Definition 8.11 (Countable Choice)

The axiom of countable choice says that every countable family of nonempty sets has a choice function.

In symbols, if:

(An)nN (A_n)_{n \in \mathbb{N}}

is a family of nonempty sets, then there exists a function:

f:NnNAn f : \mathbb{N} \to \bigcup_{n \in \mathbb{N}} A_n

such that:

f(n)An f(n) \in A_n

for every nNn \in \mathbb{N}.

Countable choice is weaker than the full axiom of choice because it only applies to families indexed by the natural numbers.

Proposition 8.12

The axiom of choice implies countable choice.

Proof

Assume the axiom of choice. Let:

(An)nN (A_n)_{n \in \mathbb{N}}

be a countable family of nonempty sets.

This is an indexed family whose index set is:

N \mathbb{N}

Since each AnA_n is nonempty, the axiom of choice gives a choice function:

f:NnNAn f : \mathbb{N} \to \bigcup_{n \in \mathbb{N}} A_n

such that:

f(n)An f(n) \in A_n

for every nNn \in \mathbb{N}.

This is exactly the statement of countable choice.

Choice and Surjections

The axiom of choice is also closely related to right inverses of surjective functions.

Definition 8.13 (Right Inverse)

Let:

f:XY f : X \to Y

be a function. A right inverse of ff is a function:

g:YX g : Y \to X

such that:

f(g(y))=y f(g(y))=y

for every yYy \in Y.

Equivalently:

fg=idY f \circ g = \mathrm{id}_Y

A right inverse chooses, for each yYy \in Y, one element of the fiber over yy.

Proposition 8.14

Assume the axiom of choice. Every surjective function has a right inverse.

Proof

Let:

f:XY f : X \to Y

be a surjective function. For each yYy \in Y, define the fiber:

Xy={xX:f(x)=y} X_y = \{x \in X : f(x)=y\}

Since ff is surjective, for every yYy \in Y there exists xXx \in X such that:

f(x)=y f(x)=y

Therefore:

Xy X_y \ne \varnothing

for every yYy \in Y.

The family:

(Xy)yY (X_y)_{y \in Y}

is therefore a family of nonempty sets indexed by YY. By the axiom of choice, there exists a choice function:

g:YyYXy g : Y \to \bigcup_{y \in Y} X_y

such that:

g(y)Xy g(y) \in X_y

for every yYy \in Y.

Since g(y)Xyg(y) \in X_y, by the definition of XyX_y we have:

f(g(y))=y f(g(y))=y

for every yYy \in Y.

Thus:

fg=idY f \circ g = \mathrm{id}_Y

Hence gg is a right inverse of ff.

Proposition 8.15

If every surjective function has a right inverse, then the axiom of choice holds.

Proof

Assume that every surjective function has a right inverse.

Let:

(Ai)iI (A_i)_{i \in I}

be a family of nonempty sets. We must construct a choice function.

Define:

X={(i,a):iI and aAi} X = \{(i,a) : i \in I \text{ and } a \in A_i\}

Define:

π:XI \pi : X \to I

by:

π(i,a)=i \pi(i,a)=i

We claim that π\pi is surjective. Let iIi \in I. Since AiA_i is nonempty, there exists:

aAi a \in A_i

Then:

(i,a)X (i,a) \in X

and:

π(i,a)=i \pi(i,a)=i

Thus every iIi \in I is in the image of π\pi, so π\pi is surjective.

By assumption, π\pi has a right inverse:

s:IX s : I \to X

such that:

π(s(i))=i \pi(s(i))=i

for every iIi \in I.

Since s(i)Xs(i) \in X, there are elements jIj \in I and aAja \in A_j such that:

s(i)=(j,a) s(i)=(j,a)

But:

π(s(i))=i \pi(s(i))=i

and:

π(j,a)=j \pi(j,a)=j

Therefore:

j=i j=i

So for each iIi \in I, the value s(i)s(i) has the form:

s(i)=(i,ai) s(i)=(i,a_i)

for some:

aiAi a_i \in A_i

Define:

c(i)=ai c(i)=a_i

Then:

c(i)Ai c(i) \in A_i

for every iIi \in I, and therefore cc is a choice function for the family (Ai)iI(A_i)_{i \in I}.

Hence the axiom of choice holds.

Corollary 8.16

The axiom of choice is equivalent to the statement that every surjective function has a right inverse.

Proof

Proposition 8.14 shows that the axiom of choice implies that every surjective function has a right inverse. Proposition 8.15 shows that if every surjective function has a right inverse, then the axiom of choice holds. Therefore the two statements are equivalent.

Choice and Families of Pairwise Disjoint Sets

The axiom of choice is often easiest to picture when the sets in the family are pairwise disjoint.

Definition 8.17 (Selector)

Let A\mathcal{A} be a set whose members are nonempty sets. A selector for A\mathcal{A} is a set SS such that:

SA=1 |S \cap A| = 1

for every AAA \in \mathcal{A}.

Thus SS contains exactly one element from each member of A\mathcal{A}.

Proposition 8.18

Assume the axiom of choice. Every set of pairwise disjoint nonempty sets has a selector.

Proof

Let A\mathcal{A} be a set of pairwise disjoint nonempty sets.

By the axiom of choice, there exists a function:

f:AA f : \mathcal{A} \to \bigcup \mathcal{A}

such that:

f(A)A f(A) \in A

for every AAA \in \mathcal{A}.

Let:

S={f(A):AA} S = \{f(A) : A \in \mathcal{A}\}

We show that:

SA=1 |S \cap A| = 1

for every AAA \in \mathcal{A}.

First, since f(A)Af(A) \in A and f(A)Sf(A) \in S, we have:

f(A)SA f(A) \in S \cap A

Thus SAS \cap A is nonempty.

Now suppose:

xSA x \in S \cap A

Since xSx \in S, there exists BAB \in \mathcal{A} such that:

x=f(B) x=f(B)

Since f(B)Bf(B) \in B, we have:

xB x \in B

Also xAx \in A. Therefore:

xAB x \in A \cap B

Because the members of A\mathcal{A} are pairwise disjoint, this implies:

A=B A=B

Hence:

x=f(A) x=f(A)

So the only element of SAS \cap A is f(A)f(A), and therefore:

SA=1 |S \cap A| = 1

Why the Axiom Is Not Merely a Definition

The axiom of choice does not define a choice function. It asserts that such a function exists.

This distinction matters. If the family has a natural rule for choosing, then a choice function can often be explicitly defined. For example, every nonempty subset of N\mathbb{N} has a least element, so for a family of nonempty subsets of N\mathbb{N} one may define:

f(A)=minA f(A)=\min A

In that case, no use of the full axiom of choice is needed, because the ordering of N\mathbb{N} provides a concrete selection rule.

For arbitrary sets, there may be no given ordering, no canonical representative, and no definable way to select an element. The axiom of choice asserts that a global selection function nevertheless exists.

Example 8.19 (Choosing from Nonempty Subsets of Natural Numbers)

Let A\mathcal{A} be a set of nonempty subsets of N\mathbb{N}. Define:

f(A)=minA f(A)=\min A

for each AAA \in \mathcal{A}.

This is a choice function because every nonempty subset of N\mathbb{N} has a least element.

Proof

Let AAA \in \mathcal{A}. Since AA is a nonempty subset of N\mathbb{N}, the well ordering property of N\mathbb{N} implies that AA has a least element.

Therefore:

minAA \min A \in A

By definition:

f(A)=minA f(A)=\min A

Thus:

f(A)A f(A) \in A

Since this holds for every AAA \in \mathcal{A}, the function ff is a choice function.

Example 8.20 (Choosing from Nonempty Finite Sets of Natural Numbers)

Let A\mathcal{A} be a family of nonempty finite subsets of N\mathbb{N}. The same rule:

f(A)=minA f(A)=\min A

defines a choice function.

The finiteness of the sets is not needed here, because every nonempty subset of N\mathbb{N} has a least element. The important point is that N\mathbb{N} has a fixed well ordering.

Dependence on the Background Theory

In Zermelo Fraenkel set theory, usually abbreviated ZF, the axiom of choice is not included by default. When the axiom of choice is added to ZF, the resulting theory is called ZFC.

Thus:

ZFC=ZF+AC \mathrm{ZFC} = \mathrm{ZF} + \mathrm{AC}

Many ordinary theorems in algebra, topology, and analysis use some form of choice, sometimes explicitly and sometimes indirectly. For example, the statement that every vector space has a basis requires the axiom of choice in full generality.

The axiom of choice is therefore both a set theoretic principle and a practical tool for large parts of modern mathematics.