UKvardate \monthname[\THEMONTH] \THEYEAR
A Dedekind-style axiomatization and the corresponding universal property of an ordinal number system
Abstract
In this paper, we give an axiomatization of the ordinal number system, in the style of Dedekind’s axiomatization of the natural number system. The latter is based on a structure consisting of a set , a distinguished element and a function . The structure in our axiomatization is a triple , where is a class, is a class function defined on all -closed ‘subsets’ of , and is a class function . In fact, we develop the theory relative to a Grothendieck-style universe (minus the power set axiom), as a way of bringing the natural and the ordinal cases under one framework. We also establish a universal property for the ordinal number system, analogous to the well-known universal property for the natural number system.
Introduction
The introduction and study of ordinal numbers goes back to the pioneering works of Cantor in set theory [2, 3]. In modern language, Cantor’s ordinal numbers are isomorphic classes of well-ordered sets, see e.g. [5]. There is also a ‘concrete’ definition of an ordinal number as a transitive set which is well-ordered under the element relation – see e.g. [7]. Such sets are usually called von Neumann ordinals. Natural numbers can be seen concretely as the finite ordinal numbers. In Dedekind’s approach to the natural number system described in [4], the natural numbers are not defined as concrete objects, but rather as abstract entities organized in a certain structure; namely, a triple consisting of a set (a set of ‘abstract’ natural numbers), a distinguished element of (in [4], the distinguished element is 1), and a function , which names the ‘successor’ of each natural number. The axioms that such a system should satisfy were formulated by Dedekind, but are often referred to as Peano axioms today (see e.g. [13] for some historical background):
-
•
does not belong to the image of .
-
•
is injective.
-
•
for any subset of that is closed under and contains .
It is an observation due to Lawvere [10] that these axioms identify the natural number systems as initial objects in the category of all triples where is a set, and is a function . This ‘universal property’ of the natural number system, freed from its category-theoretic formulation, is actually the ‘definition by induction’ theorem already contained in [4]. A morphism of such triples is defined as a function such that:
-
•
,
-
•
for all .
The ‘definition by induction’ theorem states that there is exactly one morphism to any other triple from the triple satisfying the axioms stated above. This theorem is of course well known because of its practical use: it says that recursively defined functions exist and are uniquely determined by the recursion. Intuitively, the theorem can be understood as follows. A triple can be viewed as an abstraction of the concept of counting – is the set of figures used in counting, is where counting begins and the function names increments when counting. Without further restrictions on such ‘counting systems’, there are many non-isomorphic ones, some of which are quite different from the natural number system, but still useful; for instance, hours on a clock, where counting loops back to once we pass . A morphism of these triples can be viewed as a ‘translation’ of one counting system to another. The universal property of the natural number system presents it as a ‘universal’ counting system, in the sense that it has a unique translation to any other counting system. Incidentally, such intuition is not particular to the natural number system: many structures in mathematics can be defined by natural universal properties – see [11].
Ordinal numbers exhibit a similar structure to natural numbers – there is a ‘starting’ ordinal (the natural number ), and every ordinal number has a successor. The natural numbers are the first ordinal numbers. This set is closed under succession. The ordinal number system allows for another type of succession that can be applied to sets of ordinal numbers closed under succession, giving rise to the so-called ‘limit’ ordinal numbers. The infinite sequence of natural numbers is succeeded by a limit ordinal number, usually denoted by . Now, we can take the ‘usual’ successor of , call it , and keep taking its successors until we get another set that is closed under succession, after which we introduce another limit ordinal number – it will be . The next limit ordinal number will be . At some point, we reach , then , and so on until we reach , then , and so on…The process is supposed to continue until all ordinal numbers that we have named no longer form a set. Let us also recall that von Neumann ordinals are defined as sets of preceding ordinals. Thus the first ordinal number, the number , is defined as and the successor of an ordinal number is defined as . A limit ordinal number is one that is the union of all preceding ordinal numbers. We can, in particular, think of as a limit ordinal number given by the union of its predecessors, since the empty union equals the empty set. Equivalently, limit ordinal numbers are those whose sets of predecessors are closed under succession.
We may also think of the ordinal number system in terms of a triple – this time, is a class (since the collection of all ordinal numbers is no longer a set), is a (class) function that specifies limit ordinals and is defined for those subclasses of which form sets closed under the class function , which specifies the successor of each ordinal. In this paper we show that the following three axioms on such a triple are suitable as analogues of the three Dedekind-Peano axioms for the ordinal number system:
-
•
does not belong to the image of , and also , for any such that is defined.
-
•
is injective and if and only if and are defined, where and denote closures of and , respectively, under and predecessors.
-
•
for any subclass of that is closed under and that contains for each such that is defined.
In particular, we prove that:
-
•
The system of von Neumann ordinals constitutes a triple satisfying the three axioms above. There is nothing surprising here, as the result relies on the well-known properties of ordinal numbers.
-
•
Any triple satisfying these three axioms has an order which makes it order-isomorphic to the system of ordinal numbers. The order, in fact, is the specialization order of the topology given by the closure operator in the second axiom (without those axioms, this order is merely a preorder).
-
•
The triple satisfying the three axioms above is an initial object in the category of all triples such that implies , whenever those are defined (with the domain of being the class of all -closed subsets of ).
The idea for defining an ordinal number system abstractly goes back to Zermelo (see ‘Seven notes on ordinal numbers and large cardinals’ in [15]). His approach is to define it as a particular type of well-ordered class . The following axioms would suffice:
-
•
For every , the class is a set.
-
•
For each subset of , the class is non-empty.
There is, of course, an analogous presentation (although less known than the one given by Peano axioms) of the natural number system as a well-ordered set satisfying the following conditions:
-
•
For each , the set is finite.
-
•
For each , the set is non-empty.
This is an alternative approach to that of Dedekind, where there is greater emphasis on the order structure. The difference between our approach to ordinal numbers and the traditional approaches is similar, where in our approach we try to make minimal use of the order structure. Universal properties of the ordinal number system emerging from the more order-based approach have been established in [8]. Our universal property is different from those.
The main new results of the paper are given in the last two sections. Before that, we redevelop the basic theory of ordinal numbers relative to the set-theoretic context in which these results are proved, ensuring that the paper is self-contained.
1 The context
There are a number of alternatives for a context in which the theory that we lay down in this paper could be developed. Elaboration of those contexts and comparison of results across the contexts as a future development of our work would certainly be worthwhile. In this paper, we have decided to stick to what we believe to be technically the most simple and intuitive context, given by a ‘universe’ inside the standard Zermelo-Fraenkel axiomatic set theory, including the axioms of foundation and choice (see e.g. [7]). Developing mathematics relative to a universe is typical in those subjects where sets of different sizes are needed. For instance, this is the approach followed in the exposition of category theory in [11]. The universes we work with, however, are slightly more general than the more commonly used Grothendieck universes [1, 6, 14]. The main difference is that our universes do not require closure under power sets and can be empty. Our context is in fact a particular instance of the quite general category-theoretic context used in [8]. Generalization of our results to that context is left for future work.
We remark that the definitions, theorems and their proofs contained in this paper could be adapted, after a straightforward modification to their formulation to ‘absolute’ set theory, where our ‘sets’ could be replaced with ‘classes’ and elements of the fixed universe with ‘sets’. We would then get the form of the definitions and theorems given in the Introduction.
For a set , by we denote the power set of , and by we denote the union of all elements of . By we denote the set of natural numbers. While we do not rely on any prior knowledge of facts about ordinal numbers (proofs of all needed facts are included in the paper), we do rely on knowledge of basic set-theoretic properties of the natural number system. In particular, we will make use of mathematical induction, definition by recursion, as well as the fact that any infinite set has a subset bijective to .
Recall that a set is said to be transitive when , or equivalently, when . For a function and a set , we write to denote the image of under :
Definition 1.
A universe is a set satisfying the following:
-
(U1)
is a transitive set.
-
(U2)
If then .
-
(U3)
for any and any function .
These axioms imply that is closed under the following standard set-theoretic constructions:
-
•
Singletons. Trivially, since .
-
•
Union. Because .
-
•
Subsets. Let . If and , then because is a transitive set (thanks to the axiom of foundation). If then let . Consider the function defined by
Then and so .
-
•
Cartesian products (binary). Let and . Then
for each and . For each define a function by . Then for each . Now define a function by . Then, .
-
•
Disjoint union. Given , the disjoint union can be defined as
Then where is defined by .
-
•
Quotient sets. Let and let be an equivalence relation on . Then , where is the function defined by .
From this it follows of course that when is not empty, it contains all natural numbers, assuming that they are defined by the recursion
Furthermore, when contains at least one infinite set, it also contains the set of all natural numbers (as defined above).
The empty set is a universe. The sets whose transitive closure have cardinality less than a fixed infinite cardinal form a universe in the sense of the definition above (by Lemma 6.4 in [9]). In particular, hereditarily finite sets form a universe, as do hereditarily countable sets. The so-called Grothendieck universes are exactly those universes in our sense, which are closed under power sets, i.e. if , then .
For any two sets and , we write
when there is a bijection from to . The restricted power set of a set relative to a universe is the set
The following lemmas will be useful.
Lemma 2.
When is non-empty, for any set and its finite subset , we have .
Proof.
This follows from the fact that when is non-empty, it contains a set of each finite size. ∎
Lemma 3.
If and , then .
Proof.
If , then is bijective to a subset of . ∎
Lemma 4.
If then . Also, if , then for any function , we have: .
Proof.
Let and where . Then is bijective to a suitable quotient set of a disjoint union of and . Next, to prove the second part of the lemma, suppose and for each , suppose . Let denote a bijection . Define a function by . Then . The union will then be bijective to a suitable disjoint union of . ∎
Lemma 5.
Given a function ,
Proof.
If , then is bijective to a suitable quotient of . ∎
Lemma 6.
If and , then .
Proof.
This follows from the fact that is closed under cartesian products. ∎
Lemma 7.
.
Proof.
Let , and . Write for a bijection . Consider the function defined by . Then and so . ∎
2 Abstract ordinals
In this section we introduce an abstract notion of an ordinal number system relative to a universe and establish its basic properties. Consider a partially ordered set . The relation for the partial order, as a relation from to , induces a Galois connection from to itself given by the mappings
We call the lower complement of , and the upper complement of . Note that by ‘’ above we mean ‘’, as usual. Since the two mappings above form a Galois connection, we have
for any . We define the incremented join of a subset of (when it exists) as follows:
Note that since , the incremented join of is never an element of . It will be convenient to use the following abbreviations (where and ):
Note that for any such that exists for all ,
Furthermore, the following basic laws are self-evident:
-
(L1)
,
-
(L2)
there is no such that ,
-
(L3)
,
-
(L4)
(for a total order),
-
(L5)
(for a total order),
-
(L6)
(for a total order),
-
(L7)
(for a total order).
The following (easy) lemma will also be useful:
Lemma 8.
Let be a poset and let . Then:
-
(i)
If does not have a largest element, then exists if and only if exists, and when they exist they are equal, . Conversely, if then does not have a largest element.
-
(ii)
If has a largest element , then exists if and only if exists, and when they exist, . Conversely, when is a total order, if then .
-
(iii)
If exists then . Conversely, when is a total order, if exists, then .
Proof.
-
(i)
If has no largest element, then
and so
exists if and only if
exists, and when they exist, they are equal. Now suppose . Since is never an element of , we conclude that does not have a largest element.
-
(ii)
Let . Then , and so exists if and only if exists, and when they exist, they are equal. Now suppose, in the case of total order, that for some . Then can only have elements that are strictly smaller than , and thus each element of is less than or equal to (L5). Also, since , it does not hold that , and thus is not strictly larger than every element of . We can conclude that .
-
(iii)
Suppose exists. Since , we must have for each . This together with the fact that cannot be an element of forces to be an element of . Hence . Suppose now that is a total order and exists. Consider an element . Then is not an element of and so we cannot have . Therefore, . This proves . ∎
Definition 9.
An ordinal system relative to a universe is a partially ordered set satisfying the following axioms:
-
(O1)
For all , if , then .
-
(O2)
exists for each .
We refer to elements of as ordinals.
Note that if is non-empty, then Axiom (O1) forces the universe to be non-empty as well. So for any ordinal we have (Lemma 2). Note also that Axiom (O1) is equivalent to its weaker form (the equivalence does not require (O2) and relies on Lemma 3):
-
(O1′)
for all .
Axiom (O2) implies that the mapping is a function . We call this function the successor function of the ordinal system . For each , an element of that has the form is called a successor ordinal and the successor of . We call an ordinal that is not a successor ordinal a limit ordinal.
Recall that a poset is a well-ordered set when each of its nonempty subsets has smallest element.
Theorem 10.
A poset is an ordinal system relative to a universe if and only if it is a well-ordered set (and consecutively, a totally ordered set) satisfying (O1′) and such that for all .
Proof.
Consider an ordinal system relative to a universe . Let be a non-empty subset of . Then by (O1), and thus exists by (O2). By Lemma 8(iii), . This proves the ‘only if’ part of the theorem. Note that any well-ordered set is totally ordered: having a smallest element of a two-element subset forces and to be comparable. The ‘if’ part is quite obvious. ∎
We will use this theorem often without referring to it. One of its consequences is that each non-empty ordinal system has a smallest element. We denote this element by . Note that is a limit ordinal. Since by the same theorem an ordinal system is a total order, the properties (L4–7) above apply to an ordinal system. In particular, we get that the successor function is injective. We also get the following:
Lemma 11.
In an ordinal system, for an ordinal the following conditions are equivalent:
-
(i)
is a limit ordinal.
-
(ii)
is closed under successors.
-
(iii)
.
Proof.
We have for any ordinal (L4). If is a limit ordinal then for each we have (L3). So (i)(ii). If (ii) holds, by by (L1), we get that does not have a largest element. So (Lemma 8). This gives (ii)(iii). Suppose now . If were a successor ordinal , then by (L5), would be the join of . However, by (L1), and therefore, must be a limit ordinal. Thus, (iii)(i). ∎
The following theorem gives yet another way of thinking about an abstract ordinal number system.
Theorem 12.
A poset is an ordinal system relative to a universe if and only if (O1) holds along with the following axioms:
-
(Oa)
For all , the join exists.
-
(Ob)
exists for each .
Proof.
This can easily be proved using (i) and (ii) of Lemma 8. ∎
Transfinite induction and recursion are well known for well-ordered sets. We formulate them here (in one particular form, out of many possibilities) in the case of ordinal systems, since we will use them later on in the paper. We have included our own direct proofs, for the sake of completeness, but we do not expect these proofs to have any new arguments that do not already exist in the literature.
Theorem 13 (transfinite induction).
Let be an ordinal system and let satisfy the following conditions:
-
(I1)
;
-
(I2)
for every limit ordinal , if then .
Then .
Proof.
Since is well-ordered, if is non-empty then it has a smallest element . By (I1), cannot be a successor of any in . By (I2), it also cannot be a limit ordinal. This is a contradiction since a limit ordinal is defined as one that is not a successor ordinal. ∎
Here is one of the immediate consequences of transfinite induction (the proof will require also the total order of an ordinal system, as well as the properties (L3) and (L4)):
Corollary 14.
Let be an ordinal system relative to a universe and let satisfy the following conditions:
-
(S1)
is down-closed in , i.e., if then , for all ;
-
(S2)
is an ordinal system relative to under the restriction of the order of .
Then .
Theorem 15 (transfinite recursion).
Let be an ordinal system and let , where is a set, is a function and is a function . Then there exists a unique function that satisfies the following conditions:
-
(R1)
for any ,
-
(R2)
for any limit ordinal .
Proof.
For each ordinal , let be the set consisting of all functions
that satisfy the following conditions:
-
(i)
for any successor ordinal , and
-
(ii)
for any limit ordinal .
Note that any function satisfying (R1–2) must have a subfunction in each set . Also, for any , a function must have a subfunction in each set where . We prove by transfinite induction that for each , the set contains exactly one function .
- Successor case.
-
Suppose that for some ordinal and each there exists a unique function . Then satisfies (i,ii) and thus .
Now consider any function . Since and must both have the unique function as a subfunction, and are identical on the domain . But since and both satisfy condition (i), we get that and thus .
- Limit case.
-
Suppose that for some limit ordinal and each there exists a unique function . Then for any two ordinals , the function is a subfunction of , which is in turn a subfunction of every function in . The relation
is then a function over the domain and moreover, it is easy to see that .
Now consider any function . Since for each ordinal the unique function is a subfunction of both and , we see that they are identical on the domain , and since they both satisfy condition (ii), the following holds:
We can conclude that and thus is the unique function in .
We showed that for each ordinal , exactly one function exists. Construct a function as follows: for each ordinal ,
It satisfies (R1–2) because each satisfies (i,ii). Since any function satisfying (R1–2) must have a subfunction in each set , we can conclude that is the unique function satisfying (R1–2). ∎
3 Concrete ordinals
For a universe , define a -ordinal to be a transitive set that belongs to the universe and is a well-ordered set under the relation
Thus, a -ordinal is nothing but a usual von Neumann ordinal number [12] that happens to be an element of . In other words, it is a von Neumann ordinal number internal to the universe . Thus, at least one direction in the following theorem is well known. We include a full proof for completeness.
Theorem 16.
A set is the set of all -ordinals if and only if the following conditions hold:
-
(i)
provided (), and ,
-
(ii)
is a transitive set,
-
(iii)
is an ordinal system relative to under the relation .
When these conditions hold, the incremented join of is given by .
Proof.
Let be the set of all -ordinals. Then (i) follows easily from the definition of a -ordinal. To show (ii), let . We want to show that is a -ordinal. Since also by (U1). Since and is well-ordered (under ), so is . Next, we must prove that is transitive. Let . Then since is transitive. We want to show , so suppose . Then . By the fact that is an order on , we must have . We cannot have , since , and so . Next, we prove (iii). It is easy to see that the relation is a partial order on : reflexivity is obvious, while transitivity follows from each element of being a transitive set. To prove (O1′), consider . Then we have:
Since is a -ordinal, and so (Lemma 3). Note that by (ii) we actually have , although we did not need this to establish (O1′).
For what follows, we will first establish the following:
- Fact 1.
-
for any two -ordinals and such that .
Let . Then . Hence . By the well-ordering of , we then get that either or . By transitivity of and the fact that , the second option is excluded. So . This shows . Now suppose . Then , by transitivity of . If , then which would give , which is impossible. So . This proves . Hence .
From Fact 1 we easily get the following:
- Fact 2.
-
if and only if , for any two -ordinals and .
This in turn implies that is totally ordered under . Indeed, consider . If then either or is non-empty. Without loss of generality, suppose is non-empty. By Fact 1, . Then, by Fact 2, either or , and since we can conclude that and thus .
We now show that is well-ordered under the relation . Let be nonempty. Take any . If , then for all we have and thus by total ordering. If , then exists, since . For all , it holds that and thus (by total ordering and Fact 2), which in turn implies . We can conclude that for all , and thus .
We will now complete the proof of (iii) and prove the last statement of the theorem simultaneously. Consider . By (i) and Lemma 7, . Then . To show that is a -ordinal, we need to prove that it is a transitive set, well-ordered under the relation . Since each element of is transitive, we have , which implies transitivity of :
It follows from (ii) that each element of is a -ordinal. So the fact that is well-ordered under the relation follows from the fact that is well-ordered under the same relation, as we have already proved. Thus, is a -ordinal. Any -ordinal such that for every element will have and thus by Fact 2. To conclude that is the incremented join of , it remains to make a trivial remark that for each we have .
It remains to prove that if (i–iii) hold then is the set of all -ordinals. Let be any set satisfying (i–iii). First we prove that every element of is a -ordinal. Let . By (iii), the set is ordered under the relation . In this ordered set, the set consists of those elements of which are also elements of . By (ii), this is all elements of and so . By (i) and (iii), . Furthermore, for any the following holds:
This shows that is transitive. Since is well-ordered under (by (iii) and Theorem 10) and , is also well-ordered under the same relation. Thus, every element of is a -ordinal. Now we need to establish that every -ordinal is in . We already proved that the set of all -ordinals is an ordinal system, and since is a set of -ordinals, . The equality then follows from Corollary 14. ∎
4 A Dedekind-style axiomatization
A limit-successor system is a triple where is a set, is a partial function called the limit function and is a function called the successor function. A successor-closed subset of a limit-successor system is a subset of such that . For such subset, write to denote
and to denote
A subset is said to be closed when
It is not difficult to see that closed subsets form a topology on ; in fact, an Alexandrov topology. We denote the closure of a subset in this topology by . Recall that the corresponding ‘specialization preorder’ given by
is a preorder (as it is for any Alexandrov topology) and that if and only if for some .
We abbreviate the operator composed with itself times as , with the case giving the identity operator. We write for the operator defined by
and for the operator . It is easy to see that the closure of a subset can be computed as
This means that the specialization preorder ‘breaks up’ into two relations and , each determined by and alone, as explained in what follows. These relations are defined by:
We then have if and only if
for some , where can be any natural number . Note that is both reflexive and transitive, although the same cannot be claimed for .
Definition 17.
Given a universe , a -counting system is a limit-successor system satisfying the following conditions:
-
(C1)
The domain of is the set of all successor-closed subsets .
-
(C2)
If and belong to the domain of and , then .
The structure above is the one that will be used for formulating the universal property of an ordinal system in the next section. In this section we give a characterization of ordinal systems as counting systems having further internal properties.
Lemma 18.
Let be a universe and let be a -counting system. Then is transitive and furthermore,
for all .
Proof.
To prove transitivity, suppose and . Then , , and for some successor-closed . Then the union (Lemma 4) is also successor-closed, and hence it belongs to the domain of by (C1). Since , we get that . This implies that . By (C2), . Having and means that . This completes the proof of transitivity. To prove the second property, suppose . Then and with , for some and successor-closed . This proof follows a similar idea where we expand , this time adding to it all elements of the form , where . The resulting set
is clearly successor-closed and belongs to (Lemmas 2 and 4). Then, since , we get that by (C2). This implies . ∎
This lemma gives that in a -counting system , for any we have
and hence the closure of a subset is given by
Theorem 19.
The specialization preorder of a -counting system makes an ordinal system relative to , provided the following conditions hold:
-
(C3)
for all such that is defined.
-
(C4)
is injective and has the property that if then .
-
(C5)
for any successor-closed set having the property that every time is defined.
When these conditions hold, is the successor function of the ordinal system and whenever is defined; moreover, the limit ordinals are exactly the elements of of the form . Furthermore, the closure of is given by . Finally, any ordinal system relative to arises this way from a (unique) -counting system satisfying (C3–5).
Before proving the theorem, let us illustrate axioms (C1–5) in the case when is the universe of hereditarily finite sets (i.e., sets which are elements of a finite transitive set). Then is only defined on finite successor-closed sets. Injectivity of in (C4) forces every element of such set to have the property . At the same time, by (C3), such cannot lie in the image of . So
has the second property in (C5). Moreover, by injectivity of again, is also successor-closed. Then, by (C5), and so can only be defined on the empty set. With this provision, the triple becomes a triple where is the unique element in the image of , . The axioms (C1–2) then trivially hold, while (C3–5) take the form of the axioms of Dedekind for a natural number system:
-
•
The first equality in (C3) states that does not belong to the image of , while the second equality holds trivially.
-
•
(C4) just states that is injective.
-
•
(C5) becomes the usual principle of mathematical induction.
Proof of Theorem 19.
Suppose the conditions (C1–5) hold.
Step 1. As a first step, we prove that the specialization preorder is antisymmetric, i.e., that it is a partial order.
For this, we first show that is ‘antireflexive’: it is impossible to have . Indeed, suppose and . Since is successor-closed by (C1), . But then , which is impossible by (C3).
Next, we show antisymmetry of . Suppose and . Then we get that for . We will now show that this is not possible. In fact, we establish a slightly stronger property, which will be useful later on as well:
- Property 0.
-
for all and .
Actually, we have already established this property in the remark after the theorem. Here is a more detailed argument. Consider the set of all such that for all . We will use (C5) to show that . First, we show that is successor-closed. Let . Suppose for some . Then by injectivity of (which is required in (C4)), , which is impossible. So , showing that is successor-closed. Now let be successor-closed (by (C1), is such if and only if is defined). Then for all by the first equality in (C3). So and we can apply (C5) to get , as desired.
Antisymmetry of has thus been established.
We are now ready to prove the antisymmetry of . Suppose and . There are four cases to consider:
- Case 1.
-
. Then by antisymmetry of .
- Case 2.
-
for some . By transitivity of and Lemma 18, in this case we get , which we have shown not to be possible.
- Case 3.
-
for some . Similar to the previous case, in this case we get , which is impossible.
- Case 4.
-
for some . In this case too we get the impossible (this case relies in addition on transitivity of , established also in Lemma 18).
We have thus shown that the specialization preorder is antisymmetric. We will now establish the following two properties, which will be useful later on.
- Property 1.
-
If then , for all .
- Property 2.
-
If then , for all .
To prove the first property, suppose . There are two cases:
- Case 1.
-
; then clearly (as ).
- Case 2.
-
for some . Then and for a successor-closed , by (C1). So and thus . With this gives .
So in both cases we get , as required. To prove Property 2, suppose . We have again two cases:
- Case 1.
-
. This with gives by injectivity of from (C4).
- Case 2.
-
. Since by the first equality in (C3), by injectivity of from (C4) we must have . This will give .
We get the required conclusion in both cases.
Step 2. Next, we want to prove that the specialization order is a total order.
We will prove this by simultaneously establishing the following:
- Property 3.
-
Let . If for all such that is defined, then necessarily .
Let be the set of all such that for every either or . We will use (C5) to show that . For this, we first prove that is successor-closed. Let . Consider any . Since , if then . If then by Property 1, . This proves that is successor-closed. Consider now , where . To prove that , we proceed as follows. Let . If for at least one , then , since . Thus, it suffices to prove that the set of all such that if for all then , is the entire . This we prove using (C5). First, we show that is successor-closed. Suppose . If for all , then by Property 2, for all . If then, since is successor-closed by (C1), we will have , which will violate the assumption that for all . So we get that for all . Then , since . This implies , thus proving that is successor-closed. Now let be such that is defined. Suppose for all . From the first equality in (C3) we get that for each . So for each , there is such that and . This implies that for each (by (C4)), and so . Since , each element of is comparable with each element of . If for every we have such that , then . This would then give , and so by (C2), , showing that , as desired. In contrast, if there is such that for every , then (since ) . This together with will give . We have thus shown that has the required properties in order for us to apply (C5) to conclude that . This then shows that , and so has the required properties to conclude that . The proof of the specialization order being a total order is then complete. At the same time, since and for each such that is defined, , we have also established Property 3.
Step 3. We now show that whenever is defined and for all .
Properties 0 and 3 show that is the join of , for any such that is defined. Indeed, if for all , then for each , also . Since , as clearly and by Property 0, , we get: for all . Then by Property 3, thus showing that is a join of . Furthermore, the property together with Property 1 implies that , for each . Thus, once we prove that is an ordinal system under the specialization order, we have that is its successor function and is given by join. Furthermore, when is defined, is successor-closed and so it cannot have a largest element, by Property 0. Then the join of must also be the incremented join of (Lemma 8).
Step 4. We show that is an ordinal system under the specialization order where limit ordinals are exactly the elements of the form .
Consider the set of all such that . If , then by Property 2, (Lemmas 2 and 4) and so . Suppose is such that is defined. Since the specialization order is a total order and is the join of elements in , we have
and so . By (C5), .
To prove that is an ordinal system under the specialization order, it remains to prove that for any , the incremented join of exists in . If has a largest element, then the successor of that element is the incremented join of (Lemma 8). If does not contain an infinite set, then is finite and so it has a largest element. Now consider that has no largest element, with containing an infinite set. We define:
This is of course the closure of under . Then (Lemmas 5 and 6) and so is defined by (C1). Since , it holds that for all . Let . We prove by induction on that for each , we have for some . For , this follows from the fact that does not have a largest element. Suppose for some . Then . Since cannot be the largest element of , we must have for some . Then . What we have shown implies that the incremented join of is also the incremented join of .
We have thus proved that the specialization (pre)order of a -counting system satisfying (C3–5) makes it an ordinal system relative to , with as its successor function and given equivalently by join and by incremented join. This also establishes that if an ordinal system relative to arises this way from a -counting system satisfying (C3–5), then this -counting system is unique. We now prove the existence of such a -counting system. Actually, before doing that, note that by (C3), no element of of the form can be a successor ordinal, and so it must be a limit ordinal. Conversely, for a limit ordinal we have (Lemma 11). This shows that limit ordinals are precisely the ordinals of the form .
For an ordinal system relative to , consider the limit-successor system , where is the usual join restricted on a domain as required by (C1).
Step 5. We show that (C1–5) hold for the limit-successor system and that the corresponding specialization order matches with the order of . In this step we show as well that holds for each .
By Theorem 12, is indeed defined over the entire domain required in (C1). To prove (C2), first we establish that
for each . It is easy to see that is closed, so . To show , let . We have well-ordering and hence total order by Theorem 10. Then and so for some . Consider
We consider two cases:
- Case 1.
-
is a successor ordinal. Then for some ordinal . Since , we must have . Then and so (L3). This gives and so .
- Case 2.
-
is a limit ordinal. Then is successor-closed. Furthermore, we have
By closure of , we get . Since for all , we get . This gives and so .
We have thus established that the equality holds for each . From this it follows that the specialization preorder matches with the order of . We then get that (C2) holds by the fact that if down-closures of two subsets of a poset are equal, then so are their joins. Thus is a -counting system.
It remains to show that (C3–5) hold. Consider a successor-closed . has no maximum element thanks to (L1) and thus, by Lemma 8. By the same lemma, cannot be a successor if has no maximum. Thus we have , which is the first part of (C3). Since for each , we have that , which means the second part of (C3) also holds, i.e. .
We already know that is injective, so to see that (C4) holds, consider another successor-closed . If , then
Thus (C4) holds. Finally, consider a successor-closed subset of where for all successor-closed subsets of such that . Then if it satisfies (I1) and (I2) in our formulation of transfinite induction. We check both:
-
(I1)
follows from the fact that is successor-closed.
-
(I2)
Let be a limit ordinal such that . Then (Lemma 11).
Since both of these conditions hold, we can conclude that , and thus (C5) holds. This completes the proof. ∎
5 The corresponding universal property
Given two -counting systems and , a function that preserves the successor function () automatically preserves successor-closed subsets, so for any successor-closed , both sides of the equality
are defined (Lemma 5). When this equality holds for any such , we say that is a morphism of -counting systems and represent as an arrow
It is not difficult to see that -counting systems and morphisms between them form a category, under the usual composition of functions. Isomorphisms in this category are bijections between -counting systems which preserve both succession and limiting. Call a -counting system an ordinal -counting system when conditions (C3–5) hold. Clearly, the property of being an ordinal -counting system is stable under isomorphism of -counting systems. By Theorems 16 and 19, an ordinal -counting system exists and is given by the -ordinals. We will now see that ordinal -counting systems are precisely the initial objects in the category of -counting systems.
Theorem 20.
For any universe , a -counting system is an initial object in the category of -counting systems if and only if it is an ordinal -counting system.
Proof.
Since we know that an ordinal -counting system exists (Theorem 16) and that the property of being an ordinal -counting system is stable under isomorphism, it suffices to show that any ordinal -counting system is an initial object in the category of -counting systems. By Theorem 19, an ordinal -counting system has the form , where is an ordinal system relative to and is the join defined for exactly the successor-closed subsets in the -counting system.
For any -counting system , if a morphism exists, it must be the unique function defined by the transfinite recursion
-
(i)
for any ;
-
(ii)
for any limit ordinal (Lemma 11).
We now prove that the function defined by the recursion above is a morphism. It preserves succession by (i). Consider closed under successors. Then is a limit ordinal and , by Theorem 19. By definition of , we then have
We will now prove . We clearly have , so it suffices to show that . This is equivalent to showing , which would follow if we prove is closed. If , then . Therefore, and so . If , then (as is a limit ordinal by Theorem 19)
which implies . This gives . Note that we have the first of these two subset inclusions due to the fact that thanks to Theorem 19. This proves that is closed. So . We therefore get , showing that is indeed a morphism . ∎
Consider the case when every element in is a finite set (e.g., could be the universe of hereditarily finite sets). Then every triple , where is a set, is a function , and , can be seen as a -counting system for the same , with for each finite . A morphism between such -counting systems is a function such that and . The natural number system , with its usual successor function , is an initial object in the category of such -counting systems. The theorem above presents the natural number system as an initial object in the category of all -counting systems. It is not surprising that the natural number system is initial in this larger category too, since the empty set is the only finite successor-closed subset of .
References
- [1] M. Artin, A. Grothendieck, and J.-L. Verdier. Théorie des topos et cohomologie etale des schémas: Séminaire de géométrie algébrique du Bois Marie (SGA 4), Tome 1. 1963–1964.
- [2] G. Cantor. Beiträge zur Begründung der transfiniten Mengenlehre(erster Artikel). Math. Ann., 46(4):481–512, 1895.
- [3] G. Cantor. Beiträge zur Begründung der transfiniten Mengenlehre (zweiter Artikel). Math. Ann., 49(2):207–246, 1897.
- [4] R. Dedekind. Was sind und was sollen die Zahlen? Vieweg, Braunschweig, 1888.
- [5] A. H. Fraenkel. Abstract set theory. Bull. Amer. Math. Soc, 59:584–585, 1953.
- [6] P. Gabriel. Des catégories abéliennes. Bull. Soc. Math. France, 90:323–448, 1962.
- [7] T. Jech. Set theory. Springer, Berlin, Heidelberg, 2003.
- [8] A. Joyal and I. Moerdijk. Algebraic set theory, volume 220. Cambridge University Press, 1995.
- [9] K. Kunen. Set theory: an introduction to independence proofs. Elsevier Science Publishers, 1980.
- [10] F. W. Lawvere. An elementary theory of the category of sets. Proc. Nat. Acad. Sci. U.S.A., 52(6):1506–1511, 1964.
- [11] S. Mac Lane. Categories for the working mathematician, volume 5 of Graduate Texts in Mathematics. Springer, second edition, 1998.
- [12] J. von Neumann. Zur einführung der transfiniten Zahlen. Acta Litterarum ac Scientiarum Regiae Universitatis Hungaricae Francisco-Josephinae, sectio scientiarum mathematicarum, 1:199–208, 1923.
- [13] H. Wang. The axiomatization of arithmetic. J. Symb. Log., 22(2):145–158, 1957.
- [14] N. H. Williams. On Grothendieck universes. Compos. Math., 21(1):1–3, 1969.
- [15] E. Zermelo. Ernst Zermelo: Collected Works/Gesammelte Werke: Volume I/Band I – Set Theory, Miscellanea/Mengenlehre, Varia, volume 21 of Schriften der Mathematisch-naturwissenschaftlichen Klasse. Springer, 2010.