Tychonoff’s theorem proves that the product (even infinite) of compact spaces is also compact. The proof makes judicious use of Zorn’s lemma. In fact, it uses it so well, that I gained an appreciation for how fun the lemma can be.
In this post, we’ll be going over the notion of product space and compact space, building up to the final result.
I think that the final proof makes elegant use of a powerful tool: Zorn’s lemma. This use is so elegant, in fact, that as soon as I first read the proof, I felt compelled to write a blog post exposing the proof, and trying to describe its elegance to others.
All the proofs are inspired by, and sometimes even taken from, Munkres 1, which is the standard introductory text on Topology. I won’t be going over the basics of Topological spaces and continuous functions here, nor will I be exposing all of the details around product and compact spaces, so I recommend this book if you’d like to see these fundamentals, or learn more about Topology.
Compactness
The first notion we need to develop is that of a Compact space. In some sense, a Compact space is analogous to the closed interval . An interval like this has an uncountable number of points, but is still “small” in some sense: we can’t keep squeezing stuff into it.
Formally:
A space is Compact precisely when any collection of open sets that cover , (i.e. ), a finite sub-collection suffices to cover .
The intuition here goes back to the closed interval. We could try and cover in such a way that no finite sub-cover can exist. For example, by making smaller and smaller advances towards one side of the interval:
The problem is that we never reach the other side, which is included in our interval. So this fails to cover the entire interval. If we had the half open interval instead, then this attempt would work. In some sense, openness makes things not compact, because you can inch towards some point that you reach “in the limit”, but that you miss at every single step. With a closed interval, you need to actually reach every point to cover the interval, not just in a limiting way.
Finite Intersection Property
An opposite, but equivalent formulation of compactness can be given in terms of closed sets and intersections.
First, a definition:
A collection of subsets has the Finite Intersection Property (FIP, for short) precisely when any finite intersection of sets in this collection is non-empty.
Theorem: A space is Compact for any collection of closed sets satisfying the FIP, the intersection of this collection is non-empty.
Proof:
This is an exercise in rewriting definitions.
Mathematically, for a space to be compact, we need:
All of these are equivalent, and the last one says that for every collection of closed subsets in which no finite intersection is empty, the intersection of the collection is also not empty.
This is precisely the statement we wanted to show as equivalent to compactness.
Back to our example of a closed interval of the real line, instead of having a growing open cover that can’t keep growing forever without missing an endpoint, we now have receding closed sets, which must have a common region:
Products
The next notion we need to develop is that of a product space.
As a motivating example, take our closed interval , and consider the set , whose elements are pairs with :
If we have two open sets , then the product:
is also open:
This collection is only a basis for this topology.
More generally, set is open precisely when every point contains such a product set:
Finite Products
We can go ahead an provide a formal definition of this construction, for finite products.
Given Topological spaces and , the cartesian product can be endowed with a topology, whose basis consists of products , with open in , and open in .
The standard projections:
can be shown to be continuous. This is because:
In fact, these slices form a sub-basis: finite intersections of these slices form a basis for this topology.
With this in mind, one way of looking at the product topology is as the simplest way to to make the projections and continuous. For the projections to be continuous, all of the slices need to be open. Having declared these slices to be open, the product topology emerges freely using the axioms of a topological space.
Extending to the infinite case
We’ve just seen how the topology can be defined in terms of slices as a sub-basis. This characterization extends naturally to the case where we have an arbitrary collection of spaces, and consider the infinite product .
The elements of this set are dependent functions for some indexing set .
We have a set function . And we can define the slices
as our sub-basis.
This means that finite intersections of these slices form our basis. But a finite intersection of slices can be more precisely described as a product with open in , and , for all but finitely many .
Formally, the product topology is defined by taking these special products as our basis. In the case of a finite product, the ” almost everywhere” disappears, since it’s not possible to violate this condition, taking a finite product.
Another way of looking at the infinite product of spaces is by arranging the spaces one after the other as a big cake. The points in this space are then paths through each layer. We can choose a finite number of slices, restricting the paths in that layer, but the other layers remain unconstrained:
Tychonoff’s Theorem
We’ve seen the notion of a compactness, and product spaces. A natural question to ask is whether these properties are compatible:
Given a collection of compact spaces, is the product also a compact space?
Tychnoff’s theorem answers this with a definitive yes.
A rough idea, and a problem
It’s more convenient to use the finite intersection version of compactness here. Given a collection of closed subsets of our product space, with the finite intersection property, we need to find a point in the intersection .
One idea is to try finding this point component by component, using that fact that each part of our product space is already known to be compact.
Concretely, consider the product . Let and . Our collection consists of rectangles centered along the segment :
This has the finite intersection property. This is because for any two rectangles centered on this line, the smaller one will always be contained in the larger one. It then follows, by induction, that the intersection of any number of these rectangles is also in the collection, and thus non-empty.
In fact, we can see the final intersection of this collection. It is the segment itself:
If we project each rectangle down to one of the intervals, we get a nested set of closed intervals, eventually reaching :
Now, the rough idea we had earlier was to pick a point in the intersection of each projection, and then use that. Let’s say we pick and . The problem is that is not in the intersection , in the product :
It’s possible to make the “wrong” choice when considering each component in isolation. We need to use the way that our collection puts constraints on each component at the same time, when finding our point. We’ve gotten rid of this global information by limiting ourselves to considering just the component collections , and .
We can amend this by choosing a larger collection of sets that contains . Let’s now add in all the rectangles centered on the same segment , but where the “top” can vary as well:
Now this collection “hones in” precisely on the point , and our method of construction works, since the projections of this set have to end up at as well.
The reason our original collection didn’t work is because in general, and especially not in this case. When we project down the line segment, and then glue each projection back up, we end up with something way too big:
On the other hand, when we have a single point, things work out just fine:
Lessons Learned
In a sense, the collection was not strict enough to force each component to be coherent, and so it was possible to make the “wrong” choice. We have too much freedom in our choice for each component, so it’s possible to not respect the conditions that imposes between the different components.
Now, in this specific case, it’s easy to find a collection that that’s strong enough to force our components to be coherent. But in general, we can’t leave this up to chance or observation, we need a fool-proof method to make this work.
What we’re going to do is simply consider all such collections. If we consider the biggest set with the finite intersection property that contains , we’ll be able to use all of the insight we could ever gleam from picking a collection by hand.
Another way to think about this is that for any collection , we can find an intersection of by compactness. If we restrict ourselves to just doing this for the one collection we’ve been given, we’re not exercising the full power of compactness. The more sets are in a collection the more “difficult” it is to find a point in the intersection of these sets. By using the biggest possible collection containing , we exercise the full power of our assumption that is compact.
A general lesson about proving theorems is that you should try and use exactly the power you’ve assumed. If you can prove something without one of your assumptions, you should remove the assumption, and obtain a more general theorem. If you don’t exercise the full power you’ve acquired, you’re making the proof harder to obtain.
Zorn’s Lemma
Given some collection , with the finite intersection property, we’d like to show that there’s a “biggest” collection containing , and satisfying the finite intersection property.
The problem with trying to construct this object is that the “super-collection”:
is pretty unwieldy. In fact, it’s very uncountable. Super uncountable, if you will. Trying to build it directly, or using induction, is going to fail immediately.
On the other hand, there’s never any obstacle stopping us from buildind this mega-set. If at some point we’re working on our collection , and we’ve realized that there’s an even bigger that works, then we can just use that instead, and keep chugging on.
If we ever encounter some kind of obstacle, we can swiftly adjust our course around it, or incorporate whatever information it’s signalling to us.
This is the kind of situation in which Zorn’s lemma applies. We’re trying to build a big object, that requires considering an infinite number of steps, but where no obstacle to our construction ever pops up.
Theorem: (Zorn’s Lemma) Given a partially ordered set , if every chain (totally ordered subset) has an upper bound in , then has a maximal element.
Note:
Note that “maximal” simply means that nothing is bigger. i.e. maximal .
In particular, this does not imply .
Proving this is far beyond the scope of this post. Essentially, this theorem is taken as an axiom. At least, that’s how I think of it. Using set theory, and assuming the axiom of choice, you can prove this lemma. In fact, this lemma is equivalent to the axiom of choice. Morally speaking, the axiom of choice is inoffensive, and used constantly throughout Topology, but Zorn’s lemma seems much more powerful, and used, just not constantly.
But, this feeling is just an illusion, because once you’ve assumed the axiom of choice, you have no choice but to accept Zorn’s lemma as well.
Thankfully, I have no moral qualms about using Zorn’s lemma, and I actually find it kind of cool when I do get to use it, so let me show you how to apply it in our case.
Another way of thinking about this is that if you want to show me that there’s always a bigger object that exists, you’d need to come up with some kind of chain in this order that goes on forever:
The thing is, if I can show that any chain, by virtue of being a chain, without any assumption of finiteness, or any other condition on its size, is bounded, then I can prevent you from ever finding a witness to the non-existence of my mega-object.
This is chiefly non-constructive, since we’re saying that an object exists by virtue of our inability to disprove its existence, but I digress.
The biggest FIP
Concretely, we’ll be proving the following theorem:
Theorem: Given a set , and a collection of subsets of , with the FIP property, there exists a collection containing and with the FIP property, that is not strictly contained in any other such collection.
Proof:
As hinted at, we’ll be using Zorn’s lemma. The first step in applying it is to find a partially ordered set. Consider:
For the partial order , we’ll be using .
A maximal element of this set would be a collection satisfying all the properties we want. We can use Zorn’s lemma to find this collection, provided that every chain has an upper bound.
Let’s say we have a chain . This means that for any either or . We need to show that this super-collection has an upper bound in .
If this chain is empty, simply take as an upper bound.
If it is not empty, let’s just take the union of all of the collections:
Certainly, this is an upper bound, so we just need to check that it is contained in .
First, it obviously contains . Because the chain is non-empty, we have some . By assumption , since we’ve taken a chain in .
Secondly, it satisfies the FIP property. Given sets , there exists, by definition, collections , with . Since the indices are arbitrary, assume . Then, as well, alongside all of the others. Their intersection, is non-empty, since has the FIP.
Having shown that every chain in has an upper bound, we can now apply Zorn’s lemma, and obtain a maximal collection .
This collection is closed under finite intersection:
Lemma: Given , the intersection is also in .
Proof:
Given , define:
We now show that has the FIP, which implies , and thus , by maximality.
Let’s say we have . If these consist entirely of elements of , then we have no work to do, since their intersection is non-empty, by assumption.
So, we only need to consider the case where we have: .
We have:
This is a finite intersection of elements of , so non-empty.
We now conclude that .
This collection is so all-encompassing, that if a set touches every set in the collection, it must itself be in the collection.
Lemma: If is a subset of , intersecting every element of , then
Proof:
We show that has the FIP, which makes it equal to , meaning in the first place.
We use a similar strategy as before.
Given , we need to show that their intersection is non-empty.
If each is in , then this is true by assumption.
Therefore, assume we’re in the situation with . By the previous lemma the intersection:
is in . Then we have that is non-empty by assumption, since intersects every .
Proving our Theorem
We can now use this strategy to prove our theorem:
Tychnoff’s Theorem: Given a collection of Compact spaces, the product is also a Compact space.
Proof: Let be a collection of closed sets with the FIP. We show that the intersection:
is non-empty. If this were a collection of closed sets, this would be the intersection . Compactness thus follows from showing this.
Now, we can find a maximal collection containing , and satisfying the FIP, as we’ve developed earlier.
If is non-empty, then certainly would also be non-empty.
Consider the collection:
This set satisfies the FIP, because:
(this observation holds for any function )
By compactness of each , we can then choose a point such that:
Let by the product of these components. We need to show that .
If we have a sub-basis element , then we can show that it intersects every .
Since by definition, so we have intersecting in some point , with . It’s clear that .
Since intersects every , it belongs to , because of being maximal, as we proved earlier.
Since basis elements in the Product Topology consist of finite intersections of these subsets, basis elements containing are also in , since is closed under finite intersection, as we showed earlier.
Since has the FIP, any element in intersects every other element. Thus, any basis element containing intersects every . This means that any neighborhood of intersects every , that is to say:
This means that
which implies that is compact.
Conclusion
This post likely wasn’t all that enlightening unless you were already familiar with products and compactness. If you liked the illustrations nonetheless, and are curious to learn more, I’d recommend reading a bit of Munkres, which is linked in the next section.