Zorn's lemma

From Wikipedia, the free encyclopedia
Zorn's lemma can be used to show that every connected graph has a spanning tree. The set of all sub-graphs that are trees is ordered by inclusion, and the union of a chain is an upper bound. Zorn's lemma says that a maximal tree must exist, which is a spanning tree since the graph is connected.[1] Zorn's lemma is not needed for finite graphs, such as the one pictured here.

Zorn's lemma, also known as the Kuratowski–Zorn lemma, is a proposition of set theory. It states that a partially ordered set containing upper bounds for every chain (that is, every totally ordered subset) necessarily contains at least one maximal element.

The lemma was proven (assuming the axiom of choice) by Kazimierz Kuratowski in 1922 and independently by Max Zorn in 1935.[2] It occurs in the proofs of several theorems of crucial importance, for instance the Hahn–Banach theorem in functional analysis, the theorem that every vector space has a basis,[3] Tychonoff's theorem in topology stating that every product of compact spaces is compact, and the theorems in abstract algebra that in a ring with identity every proper ideal is contained in a maximal ideal and that every field has an algebraic closure.[4]

Zorn's lemma is equivalent to the well-ordering theorem and also to the axiom of choice, in the sense that within ZF (Zermelo–Fraenkel set theory without the axiom of choice) any one of the three is sufficient to prove the other two.[5] An earlier formulation of Zorn's lemma is the Hausdorff maximal principle which states that every totally ordered subset of a given partially ordered set is contained in a maximal totally ordered subset of that partially ordered set.[6]

Motivation

To prove the existence of a mathematical object that can be viewed as a maximal element of some partially ordered set, one can assume that there is no maximal element and use transfinite induction, together with the assumptions of the situation, to derive a contradiction. Zorn's lemma tidies up the conditions a situation needs to satisfy for such an argument to work, so that mathematicians need not repeat the transfinite induction argument by hand each time and can instead just check the conditions of Zorn's lemma.

If you are building a mathematical object in stages and find that (i) you have not finished even after infinitely many stages, and (ii) there seems to be nothing to stop you continuing to build, then Zorn’s lemma may well be able to help you.

— William Timothy Gowers, "How to use Zorn’s lemma"[7]

Statement of the lemma

Preliminary notions:

Partially ordered set
A set P equipped with a binary relation ≤ that is reflexive (x ≤ x for every x), antisymmetric (if both x ≤ y and y ≤ x hold, then x = y), and transitive (if x ≤ y and y ≤ z then x ≤ z) is said to be (partially) ordered by ≤. Given two elements x and y of P with x ≤ y, y is said to be greater than or equal to x. The word "partial" is meant to indicate that not every pair of elements of a partially ordered set is required to be comparable under the order relation, that is, in a partially ordered set P with order relation ≤ there may be elements x and y with neither x ≤ y nor y ≤ x. An ordered set in which every pair of elements is comparable is called totally ordered.
Chain
Every subset S of a partially ordered set P can itself be seen as partially ordered by restricting the order relation inherited from P to S. A subset S of a partially ordered set P is called a chain (in P) if it is totally ordered in the inherited order.
Maximal element
An element m of a partially ordered set P with order relation ≤ is maximal (with respect to ≤) if there is no other element of P greater than m, that is, there is no s in P with s ≠ m and m ≤ s. Depending on the order relation, a partially ordered set may have any number of maximal elements. However, a totally ordered set can have at most one maximal element.
Upper bound
Given a subset S of a partially ordered set P, an element u of P is an upper bound of S if it is greater than or equal to every element of S. Here, S is not required to be a chain, and u is required to be comparable to every element of S but need not itself be an element of S.
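
The four notions above can be checked mechanically on a finite example. The following is a minimal sketch, assuming a small illustrative poset (a few integers ordered by divisibility); the list P and the helper names are hypothetical and chosen only for this illustration.

```python
from itertools import combinations

# Hypothetical example poset: a few positive integers ordered by divisibility.
P = [2, 3, 4, 6, 8, 12]

def leq(x, y):
    """x ≤ y in the divisibility order: x divides y."""
    return y % x == 0

def is_chain(S):
    """A subset is a chain if every two of its elements are comparable."""
    return all(leq(x, y) or leq(y, x) for x, y in combinations(S, 2))

def upper_bounds(S):
    """Elements of P that are ≥ every element of S (they need not lie in S)."""
    return [u for u in P if all(leq(s, u) for s in S)]

def maximal_elements():
    """Elements m for which there is no s ≠ m in P with m ≤ s."""
    return [m for m in P if not any(s != m and leq(m, s) for s in P)]

print(is_chain([2, 4, 8]))   # True: 2 divides 4 and 4 divides 8
print(is_chain([2, 3]))      # False: 2 and 3 are incomparable
print(upper_bounds([2, 3]))  # [6, 12]: S need not be a chain to have upper bounds
print(maximal_elements())    # [8, 12]: a poset may have several maximal elements
```

Here neither 8 nor 12 is a greatest element, since they are incomparable; each is merely not exceeded by anything else.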

Zorn's lemma can then be stated as:

Zorn's lemma[8][9] Let P be a partially ordered set that satisfies the following two properties:

  1. P is nonempty;
  2. Every chain in P has an upper bound in P.

Then P has at least one maximal element.

In fact, property (1) is redundant, since property (2) says, in particular, that the empty chain has an upper bound in P, implying P is nonempty. However, in practice, one often checks (1) and then verifies (2) only for nonempty chains, since the case of the empty chain is taken care of by (1).

In the terminology of Bourbaki, a partially ordered set is called inductive if each chain has an upper bound in the set (in particular, the set is then nonempty).[10] Then the lemma can be stated as:

Zorn's lemma[11] Each inductive set has a maximal element.

For some applications, the following variant may be useful.

Corollary[12] Let P be a partially ordered set in which every chain has an upper bound, and let a be an element of P. Then there exists a maximal element b in P such that b ≥ a.

Indeed, let Q = {x ∈ P : x ≥ a} with the partial ordering from P. Then, for a nonempty chain in Q, an upper bound in P is in Q (and the empty chain is bounded by a), so Q satisfies the hypothesis of Zorn's lemma, and a maximal element in Q is a maximal element in P as well.

Remark:[13] Zorn's lemma can fail for a partially ordered class that is not a set. Indeed, let P be the class of all ordinals. Then it satisfies the hypothesis of the lemma (it can be shown that the union of a chain of ordinals is again an ordinal; roughly, initial segments glue). However, P has no maximal element: if α were a maximal ordinal, its successor would be strictly larger. (The fact that the class of ordinals is not a set is known as the Burali-Forti paradox.)

Example applications

Every vector space has a basis

Zorn's lemma can be used to show that every vector space V has a basis.[14]

If V = {0}, then the empty set is a basis for V. Now, suppose that V ≠ {0}. Let P be the set consisting of all linearly independent subsets of V. Since V is not the zero vector space, there exists a nonzero element v of V, so P contains the linearly independent subset {v}. Furthermore, P is partially ordered by set inclusion (see inclusion order). Finding a maximal linearly independent subset of V is the same as finding a maximal element in P.

To apply Zorn's lemma, take a chain T in P (that is, T is a subset of P that is totally ordered). If T is the empty set, then {v} is an upper bound for T in P. Suppose then that T is non-empty. We need to show that T has an upper bound, that is, there exists a linearly independent subset B of V containing all the members of T.

Take B to be the union of all the sets in T. We wish to show that B is an upper bound for T in P. To do this, it suffices to show that B is a linearly independent subset of V.

Suppose otherwise, that B is not linearly independent. Then there exist vectors v1, v2, ..., vk ∈ B and scalars a1, a2, ..., ak, not all zero, such that

a1v1 + a2v2 + ⋯ + akvk = 0.

Since B is the union of all the sets in T, there are some sets S1, S2, ..., Sk ∈ T such that vi ∈ Si for every i = 1, 2, ..., k. As T is totally ordered, one of the sets S1, S2, ..., Sk must contain the others, so there is some set Si that contains all of v1, v2, ..., vk. This tells us there is a linearly dependent set of vectors in Si, contradicting that Si is linearly independent (because it is a member of P).

The hypothesis of Zorn's lemma has been checked, and thus there is a maximal element in P, in other words a maximal linearly independent subset B of V.

Finally, we show that B is indeed a basis of V. It suffices to show that B is a spanning set of V. Suppose for the sake of contradiction that B is not spanning. Then there exists some v ∈ V not covered by the span of B. This says that B ∪ {v} is a linearly independent subset of V that is larger than B, contradicting the maximality of B. Therefore, B is a spanning set of V, and thus, a basis of V.
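
In finite dimensions no appeal to Zorn's lemma is needed, and the "keep enlarging a linearly independent set until nothing more can be added" idea can be carried out directly. The following is a minimal sketch over the rationals, assuming a finite list of candidate vectors; the function names rank, is_independent and extend_to_maximal are hypothetical helpers, not part of any library.

```python
from fractions import Fraction

def rank(vectors):
    """Rank of a list of vectors over the rationals, by Gaussian elimination."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(ncols):
        # Look for a row at or below position r with a nonzero entry in this column.
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # Clear this column in every other row.
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def is_independent(vectors):
    """A list of vectors is linearly independent iff its rank equals its length."""
    return rank(vectors) == len(vectors)

def extend_to_maximal(independent, candidates):
    """Greedily add candidates that keep the set linearly independent."""
    B = list(independent)
    for v in candidates:
        if is_independent(B + [v]):
            B.append(v)
    return B

candidates = [(1, 0, 0), (2, 0, 0), (1, 1, 0), (0, 1, 0), (1, 1, 1)]
B = extend_to_maximal([], candidates)
print(B)  # [(1, 0, 0), (1, 1, 0), (1, 1, 1)]
```

The result is maximal among linearly independent subsets of the candidates: adding any further candidate makes the set dependent, which is the finite counterpart of the spanning argument above.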

Every nontrivial ring with unity contains a maximal ideal

Zorn's lemma can be used to show that every nontrivial ring R with unity contains a maximal ideal.

Let P be the set consisting of all proper ideals in R (that is, all ideals in R except R itself). Since R is non-trivial, the set P contains the trivial ideal {0}. Furthermore, P is partially ordered by set inclusion. Finding a maximal ideal in R is the same as finding a maximal element in P.

To apply Zorn's lemma, take a chain T in P. If T is empty, then the trivial ideal {0} is an upper bound for T in P. Assume then that T is non-empty. It is necessary to show that T has an upper bound, that is, there exists an ideal I ⊆ R containing all the members of T but still smaller than R (otherwise it would not be a proper ideal, so it is not in P).

Take I to be the union of all the ideals in T. We wish to show that I is an upper bound for T in P. We will first show that I is an ideal of R. For I to be an ideal, it must satisfy three conditions:

  1. I is a nonempty subset of R,
  2. For every x, y ∈ I, the sum x + y is in I,
  3. For every r ∈ R and every x ∈ I, the product rx is in I.

#1 - I is a nonempty subset of R.

Because T contains at least one element, and that element contains at least 0, the union I contains at least 0 and is not empty. Every element of T is a subset of R, so the union I only consists of elements in R.

#2 - For every x, y ∈ I, the sum x + y is in I.

Suppose x and y are elements of I. Then there exist two ideals J, K ∈ T such that x is an element of J and y is an element of K. Since T is totally ordered, we know that J ⊆ K or K ⊆ J. Without loss of generality, assume the first case. Both x and y are members of the ideal K, therefore their sum x + y is a member of K, which shows that x + y is a member of I.

#3 - For every r ∈ R and every x ∈ I, the product rx is in I.

Suppose x is an element of I. Then there exists an ideal J ∈ T such that x is in J. If r ∈ R, then rx is an element of J and hence an element of I. Thus, I is an ideal in R.

Now, we show that I is a proper ideal. An ideal is equal to R if and only if it contains 1. (It is clear that if it is R then it contains 1; on the other hand, if it contains 1 and r is an arbitrary element of R, then r · 1 = r is an element of the ideal, and so the ideal is equal to R.) So, if I were equal to R, then it would contain 1, and that means one of the members of T would contain 1 and would thus be equal to R – but R is explicitly excluded from P.

The hypothesis of Zorn's lemma has been checked, and thus there is a maximal element in P, in other words a maximal ideal in R.
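
For a finite ring the set P of proper ideals can simply be enumerated, which makes the objects in the argument concrete. The following is a minimal sketch, assuming the illustrative ring Z/12Z (a hypothetical choice) and using the fact that every ideal of Z/nZ is generated by a single element.

```python
n = 12  # hypothetical example: the ring Z/12Z

def ideal(d):
    """The ideal of Z/nZ generated by d, as a frozenset of residues."""
    return frozenset((d * k) % n for k in range(n))

# Every ideal of Z/nZ is principal, so this enumerates all ideals.
ideals = {ideal(d) for d in range(n)}
R = frozenset(range(n))

# P: all proper ideals, partially ordered by set inclusion.
proper = [I for I in ideals if I != R]

# A proper ideal is maximal if no other proper ideal strictly contains it.
maximal = [I for I in proper if not any(I < J for J in proper)]

print(sorted(sorted(I) for I in maximal))
# [[0, 2, 4, 6, 8, 10], [0, 3, 6, 9]]
```

The two maximal ideals are the multiples of 2 and the multiples of 3, and no proper ideal contains 1, in line with the properness criterion used above.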

Proof

Assuming the axiom of choice, Zorn's lemma can be proved in multiple ways.

Proof by transfinite recursion

Suppose Zorn's lemma is false. Then there exists a partially ordered set, or poset, P such that every totally ordered subset has an upper bound, and that for every element in P there is another element bigger than it. For every totally ordered subset T we may then define a bigger element b(T), because T has an upper bound, and that upper bound has a bigger element. To actually define the function b, we need to employ the axiom of choice (explicitly: let B(T) = {p ∈ P : p > t for all t ∈ T}, which is non-empty by the argument above. The axiom of choice furnishes a function b with b(T) ∈ B(T) for every totally ordered subset T).

Using the function b, we are going to define elements a0 < a1 < a2 < a3 < ... < aω < aω+1 <…, in P. This uncountable sequence is really long: the indices are not just the natural numbers, but all ordinals. In fact, the sequence is too long for the set P; there are too many ordinals (a proper class), more than there are elements in any set (in other words, given any set of ordinals, there exists a larger ordinal), and the set P will be exhausted before long and then we will run into the desired contradiction.

The ai are defined by transfinite recursion: we pick a0 in P arbitrarily (this is possible, since P contains an upper bound for the empty set and is thus not empty) and for any other ordinal w we set aw = b({av : v < w}). Because the av are totally ordered, this is a well-founded definition.

The above proof can be formulated without explicitly referring to ordinals by considering the initial segments {av : v < w} as subsets of P. Such sets can be easily characterized as well-ordered chains S ⊆ P where each x ∈ S satisfies x = b({y ∈ S : y < x}). Contradiction is reached by noting that we can always find a "next" initial segment either by taking the union of all such S (corresponding to the limit ordinal case) or by appending b(S) to the "last" S (corresponding to the successor ordinal case).[15]

This proof shows that actually a slightly stronger version of Zorn's lemma is true:

Lemma. If P is a poset in which every well-ordered subset has an upper bound, and if x is any element of P, then P has a maximal element greater than or equal to x. That is, there is a maximal element which is comparable to x.
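
In a finite poset the climbing process behind this proof stops after finitely many steps, and neither ordinals nor the axiom of choice are needed. The following is a minimal sketch of that finite case only; the function climb and the example poset (integers under divisibility) are hypothetical illustrations, not a rendering of the transfinite argument itself.

```python
def climb(P, less, x):
    """Starting from x, keep moving to a strictly larger element until none exists."""
    current = x
    while True:
        bigger = [y for y in P if less(current, y)]
        if not bigger:
            return current   # nothing lies strictly above: current is maximal
        current = bigger[0]  # an arbitrary pick; a finite list needs no choice axiom

# Hypothetical example poset: integers ordered by divisibility.
P = [2, 3, 4, 6, 8, 12]

def less(x, y):
    """Strict divisibility: x divides y and x ≠ y."""
    return x != y and y % x == 0

print(climb(P, less, 2))  # 8, a maximal element that is ≥ 2
print(climb(P, less, 3))  # 12, a maximal element that is ≥ 3
```

Each step moves strictly upward, so no element is visited twice and the loop terminates; the element returned is maximal and greater than or equal to the starting point, as in the lemma.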

Proof from the Hausdorff maximal principle

The Hausdorff maximal principle is an alternative formulation of Zorn's lemma asserting that every partially ordered set has a maximal chain with respect to set inclusion. (In fact, the usual form of the Hausdorff maximal principle is slightly stronger, stating that for every chain C0 there exists a maximal chain C such that C0 ⊆ C.)

The usual form of Zorn's lemma follows from the Hausdorff maximal principle, since if P satisfies the hypothesis of Zorn's lemma, then its maximal chain C also has an upper bound x in P. This x is a maximal element since if y > x, then C ∪ {y} is a strictly larger chain than C, contradicting the maximality of C.[16]
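
For a finite poset, the passage from a maximal chain to a maximal element can be traced step by step. The following is a minimal sketch, assuming a hypothetical example poset (integers under divisibility); maximal_chain and top_of are illustrative helper names.

```python
def maximal_chain(P, leq, start=()):
    """Greedily extend the chain `start` until no element of P can be added."""
    chain = list(start)
    for x in P:
        # x may join only if it is comparable with everything already chosen.
        if x not in chain and all(leq(x, c) or leq(c, x) for c in chain):
            chain.append(x)
    return chain

def top_of(chain, leq):
    """The largest element of a nonempty finite chain (an upper bound lying in it)."""
    return next(c for c in chain if all(leq(d, c) for d in chain))

# Hypothetical example poset: integers ordered by divisibility.
P = [2, 3, 4, 6, 8, 12]

def leq(x, y):
    return y % x == 0

C = maximal_chain(P, leq)
print(C)               # [2, 4, 8]
print(top_of(C, leq))  # 8, which is maximal in P

print(maximal_chain(P, leq, start=(3,)))  # [3, 6, 12], a maximal chain containing {3}
```

A single pass suffices: an element rejected at some point was incomparable with an element already in the chain, and that element is never removed.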

Conversely, the Hausdorff maximal principle also follows from Zorn's lemma by regarding the set of admissible chains (chains that contain a given chain C0) as a partially ordered set ordered by set inclusion. In fact, this specific case only needs the following weak form of Zorn's lemma:[17]

Lemma 1. Let P be a partially ordered set in which each chain has a least upper bound in P. Then P has a maximal element.

Or the following even weaker form:

Lemma 2. Let F be a set consisting of subsets of some fixed set such that F satisfies the following properties:

  1. F is nonempty.
  2. The union of each totally ordered subset of F is in F, where the ordering is with respect to set inclusion.
  3. For each set S in F, each subset of S is in F.

Then F has a maximal element with respect to set inclusion.

(Note that, strictly speaking, (1) is redundant since (2) implies the empty set is in F.) This is a weaker form, since the union of each chain in F is a least upper bound of that chain. This cycle of implications (Zorn's lemma ⇒ Lemma 1 ⇒ Lemma 2 ⇒ Hausdorff maximal principle ⇒ Zorn's lemma) shows that all these forms are in fact equivalent.

Lemma 2 can be directly proved from the axiom of choice, as shown in Hausdorff maximal principle § Proof 1.

The Bourbaki–Witt theorem can also be used to give a proof of the Hausdorff maximal principle; see Hausdorff maximal principle § Proof 2.

Zorn's lemma implies the axiom of choice

A proof that Zorn's lemma implies the axiom of choice illustrates a typical application of Zorn's lemma.[18] (The structure of the proof is exactly the same as the one for the Hahn–Banach theorem.)

Given a set X of nonempty sets and its union U = ⋃X (which exists by the axiom of union), we want to show there is a function

f : X → U

such that f(S) ∈ S for each S ∈ X. For that end, consider the set

E = {f : X′ → U | X′ ⊆ X and f(S) ∈ S for each S ∈ X′}.

It is partially ordered by extension; i.e., f ≤ g if and only if f is the restriction of g. If fi : Xi → U is a chain in E, then we can define the function f on the union X′ = ⋃ Xi by setting f(S) = fi(S) when S ∈ Xi. This is well-defined since if Xi ⊆ Xj, then fi is the restriction of fj. The function f is also an element of E and is a common extension of all fi's. Thus, we have shown that each chain in E has an upper bound in E. Hence, by Zorn's lemma, there is a maximal element f in E that is defined on some X′ ⊆ X. We want to show X′ = X. Suppose otherwise; then there is a set S ∈ X − X′. As S is nonempty, it contains an element s. We can then extend f to a function g by setting g|X′ = f and g(S) = s. (Note this step does not need the axiom of choice.) The function g is in E and f < g, a contradiction to the maximality of f.
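
The extension step at the heart of this proof (enlarging a partial choice function by one more set) can be made concrete for a finite family, where repeating it finitely often already reaches all of X and Zorn's lemma is not needed. The following is a minimal sketch under that finiteness assumption; extend_choice and the sample family are hypothetical.

```python
def extend_choice(X):
    """Build a choice function on a finite family X of nonempty sets."""
    f = {}                     # the empty partial choice function, defined on X' = {}
    for S in X:
        if S not in f:
            s = next(iter(S))  # S is nonempty, so it contains some element s
            f[S] = s           # the extension g with g|X' = f and g(S) = s
    return f

# Hypothetical finite family of nonempty sets (frozensets, so they can be dict keys).
X = [frozenset({1, 2}), frozenset({2, 3}), frozenset({"a"})]
f = extend_choice(X)

# f is defined on all of X and picks an element of each member.
print(all(f[S] in S for S in X))  # True
```

Each pass through the loop mirrors the step from f to g in the proof; for an infinite family there is no last step, which is where the maximal element supplied by Zorn's lemma takes over.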

Essentially the same proof also shows that Zorn's lemma implies the well-ordering theorem: take E to be the set of all well-ordered subsets of a given set X and then show that a maximal element of E is X.[19]

History

The Hausdorff maximal principle is an early statement similar to Zorn's lemma.

Kazimierz Kuratowski proved in 1922[20] a version of the lemma close to its modern formulation (it applies to sets ordered by inclusion and closed under unions of well-ordered chains). Essentially the same formulation (weakened by using arbitrary chains, not just well-ordered) was independently given by Max Zorn in 1935,[21] who proposed it as a new axiom of set theory replacing the well-ordering theorem, exhibited some of its applications in algebra, and promised to show its equivalence with the axiom of choice in another paper, which never appeared.

The name "Zorn's lemma" appears to be due to John Tukey, who used it in his book Convergence and Uniformity in Topology in 1940. Bourbaki's Théorie des Ensembles of 1939 refers to a similar maximal principle as "le théorème de Zorn".[22] The name "Kuratowski–Zorn lemma" prevails in Poland and Russia.

Equivalent forms of Zorn's lemma

Zorn's lemma is equivalent (in ZF) to three main results:

  1. Hausdorff maximal principle
  2. Axiom of choice
  3. Well-ordering theorem.

A well-known joke alluding to this equivalency (which may defy human intuition) is attributed to Jerry Bona: "The Axiom of Choice is obviously true, the well-ordering principle obviously false, and who can tell about Zorn's lemma?"[23]

Zorn's lemma is also equivalent to the strong completeness theorem of first-order logic.[24]

Moreover, Zorn's lemma (or one of its equivalent forms) implies some major results in other mathematical areas. For example,

  1. Banach's extension theorem which is used to prove one of the most fundamental results in functional analysis, the Hahn–Banach theorem
  2. Every vector space has a basis, a result from linear algebra (to which it is equivalent[25]). In particular, the real numbers, as a vector space over the rational numbers, possess a Hamel basis.
  3. Every commutative unital ring has a maximal ideal, a result from ring theory known as Krull's theorem, to which Zorn's lemma is equivalent[26]
  4. Tychonoff's theorem in topology (to which it is also equivalent[27])
  5. Every proper filter is contained in an ultrafilter, a result that yields the completeness theorem of first-order logic[28]

In this sense, Zorn's lemma is a powerful tool, applicable to many areas of mathematics.

Analogs under weakenings of the axiom of choice

A weakened form of Zorn's lemma can be proven from ZF + DC (Zermelo–Fraenkel set theory with the axiom of choice replaced by the axiom of dependent choice). Indeed, a nonempty partially ordered set with no maximal element is one whose strict order relation is entire (every element has a strictly larger one), which allows us to apply the axiom of dependent choice to construct an infinite countable chain. As a result, any nonempty partially ordered set with exclusively finite chains must have a maximal element.[29]

More generally, strengthening the axiom of dependent choice to higher ordinals allows us to generalize the statement in the previous paragraph to higher cardinalities.[29] In the limit where we allow arbitrarily large ordinals, we recover the proof of the full Zorn's lemma using the axiom of choice in the preceding section.

Preorder version

There is a version of Zorn's lemma for a preordered set. In that case, we need to be a bit careful about the definition of a maximal element. Precisely, it states

Zorn's lemma[30] Let P be a preordered set such that each chain in it has an upper bound. Then P contains an element x such that y ≤ x for each element y in P with x ≤ y.

This version trivially follows from the usual Zorn's lemma. Indeed, consider the quotient

P/∼

where x ∼ y means x ≤ y and y ≤ x. Then P/∼ is a partially ordered set satisfying the hypothesis of Zorn's lemma and thus has a maximal element, and any representative of that maximal class is the desired element x.
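
The difference between a preorder and a partial order, and the effect of passing to the quotient, can be seen on a small example. The following is a minimal sketch, assuming a hypothetical preorder on a few integers (comparison by absolute value); it takes "maximal" in the preorder sense, namely that x ≤ y implies y ≤ x.

```python
# Hypothetical example: integers preordered by absolute value.
P = [-3, -2, -1, 0, 1, 2, 3]

def leq(x, y):
    """A preorder that is not antisymmetric: -3 ≤ 3 and 3 ≤ -3, yet -3 ≠ 3."""
    return abs(x) <= abs(y)

# Maximal in the preorder sense: whenever x ≤ y, also y ≤ x.
maximal = [x for x in P if all(leq(y, x) for y in P if leq(x, y))]
print(maximal)  # [-3, 3]

# Equivalence classes of x ~ y (x ≤ y and y ≤ x) are the elements of the quotient.
classes = {frozenset(y for y in P if leq(x, y) and leq(y, x)) for x in P}
print(len(classes))  # 4 classes: {0}, {-1, 1}, {-2, 2}, {-3, 3}
```

The quotient is an honest partial order (here even a total one) whose maximal element is the class {-3, 3}; both of its representatives are maximal in the original preorder.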

In popular culture

The 1970 film Zorns Lemma is named after the lemma.

The lemma was referenced on The Simpsons in the episode "Bart's New Friend".[31]

Notes

  1. ^ Serre, Jean-Pierre (2003), Trees, Springer Monographs in Mathematics, Springer, p. 23
  2. ^ Moore 2013, p. 168
  3. ^ Wilansky, Albert (1964). Functional Analysis. New York: Blaisdell. pp. 16–17.
  4. ^ Jech 2008, ch. 2, §2 Some applications of the Axiom of Choice in mathematics
  5. ^ Jech 2008, p. 9
  6. ^ Moore 2013, p. 168
  7. ^ William Timothy Gowers (12 August 2008). "How to use Zorn's lemma".
  8. ^ Halmos 1960, § 16.
  9. ^ Lang, Serge (2002). Algebra. Graduate Texts in Mathematics. Vol. 211 (Revised 3rd ed.). Springer-Verlag. p. 880. ISBN 978-0-387-95385-4; Dummit, David S.; Foote, Richard M. (1998). Abstract Algebra (2nd ed.). Prentice Hall. p. 875. ISBN 978-0-13-569302-5; and Bergman, George M. (2015). An Invitation to General Algebra and Universal Constructions. Universitext (2nd ed.). Springer-Verlag. p. 162. ISBN 978-3-319-11477-4.
  10. ^ Bourbaki 1970, Ch. III., §2., no. 4., Definition 3.
  11. ^ Bourbaki 1970, Ch. III., §2., no. 4., Théorème 2.
  12. ^ Bourbaki 1970, Ch. III., §2., no. 4., Corollaire 1.
  13. ^ Remark 12 in https://terrytao.wordpress.com/2009/01/28/245b-notes-7-well-ordered-sets-ordinals-and-zorns-lemma-optional/
  14. ^ Smits, Tim. "A Proof that every Vector Space has a Basis" (PDF). Archived from the original (PDF) on 20 March 2023. Retrieved 14 August 2022.
  15. ^ Lewin, Jonathan W. (1991). "A simple proof of Zorn's lemma". The American Mathematical Monthly. 98 (4): 353–354. doi:10.1080/00029890.1991.12000768.
  16. ^ Halmos 1960, § 16. NB: in the reference, this deduction is by noting there is an order-preserving embedding
    and that the "passage" allows to deduce the existence of a maximal element of or equivalently, that of from the weak form of Zorn's lemma. The meaning of passage there was unclear and so here we gave an alternative reasoning.
  17. ^ Halmos 1960, § 16. Exercise.
  18. ^ Halmos 1960, § 16. Exercise.
  19. ^ Halmos 1960, § 17. Exercise.
  20. ^ Kuratowski, Casimir (1922). "Une méthode d'élimination des nombres transfinis des raisonnements mathématiques" [A method of disposing of transfinite numbers of mathematical reasoning] (PDF). Fundamenta Mathematicae (in French). 3: 76–108. doi:10.4064/fm-3-1-76-108. Retrieved 24 April 2013.
  21. ^ Zorn, Max (1935). "A remark on method in transfinite algebra". Bulletin of the American Mathematical Society. 41 (10): 667–670. doi:10.1090/S0002-9904-1935-06166-X.
  22. ^ Campbell 1978, p. 82.
  23. ^ Krantz, Steven G. (2002), "The Axiom of Choice", Handbook of Logic and Proof Techniques for Computer Science, Springer, pp. 121–126, doi:10.1007/978-1-4612-0115-1_9, ISBN 978-1-4612-6619-8.
  24. ^ J.L. Bell & A.B. Slomson (1969). Models and Ultraproducts. North Holland Publishing Company. Chapter 5, Theorem 4.3, page 103.
  25. ^ Blass, Andreas (1984). "Existence of bases implies the Axiom of Choice". Axiomatic Set Theory. Contemporary Mathematics. Vol. 31. pp. 31–33. doi:10.1090/conm/031/763890. ISBN 9780821850268.
  26. ^ Hodges, W. (1979). "Krull implies Zorn". Journal of the London Mathematical Society. s2-19 (2): 285–287. doi:10.1112/jlms/s2-19.2.285.
  27. ^ Kelley, John L. (1950). "The Tychonoff product theorem implies the axiom of choice". Fundamenta Mathematicae. 37: 75–76. doi:10.4064/fm-37-1-75-76.
  28. ^ J.L. Bell & A.B. Slomson (1969). Models and Ultraproducts. North Holland Publishing Company.
  29. ^ a b Wolk, Elliot S. (1983), "On the principle of dependent choices and some forms of Zorn's lemma", Canadian Mathematical Bulletin, 26 (3): 365–367, doi:10.4153/CMB-1983-062-5
  30. ^ https://math.iisc.ac.in/~gadgil/proofs-and-programs-2023/doc/Mathlib/Order/Zorn.html
  31. ^ "Zorn's Lemma | The Simpsons and their Mathematical Secrets".
