Compactness theorem

Last updated January 20, 2024

In mathematical logic, the compactness theorem states that a set of first-order sentences has a model if and only if every finite subset of it has a model. This theorem is an important tool in model theory, as it provides a useful (but generally not effective) method for constructing models of any set of sentences that is finitely consistent.

The compactness theorem for the propositional calculus is a consequence of Tychonoff's theorem (which says that the product of compact spaces is compact) applied to compact Stone spaces,^[1] hence the theorem's name. Likewise, it is analogous to the finite intersection property characterization of compactness in topological spaces: a collection of closed sets in a compact space has a non-empty intersection if every finite subcollection has a non-empty intersection.

The compactness theorem is one of the two key properties, along with the downward Löwenheim–Skolem theorem, that are used in Lindström's theorem to characterize first-order logic. Although there are some generalizations of the compactness theorem to non-first-order logics, the compactness theorem itself does not hold in them, except for a very limited number of examples.^[2]

History

Kurt Gödel proved the countable compactness theorem in 1930. Anatoly Maltsev proved the uncountable case in 1936.^[3]^[4]

Applications

The compactness theorem has many applications in model theory; a few typical results are sketched here.

Robinson's principle

The compactness theorem implies the following result, stated by Abraham Robinson in his 1949 dissertation.

Robinson's principle:^[5]^[6] If a first-order sentence holds in every field of characteristic zero, then there exists a constant $p$ such that the sentence holds for every field of characteristic larger than $p.$ This can be seen as follows: suppose $\varphi$ is a sentence that holds in every field of characteristic zero. Then its negation $\lnot \varphi ,$ together with the field axioms and the infinite sequence of sentences

$1+1\neq 0,\;\;1+1+1\neq 0,\;\ldots$

is not satisfiable (because there is no field of characteristic 0 in which $\lnot \varphi$ holds, and the infinite sequence of sentences ensures any model would be a field of characteristic 0). Therefore, by the compactness theorem, there is a finite subset $A$ of these sentences that is not satisfiable. $A$ must contain $\lnot \varphi$ because otherwise it would be satisfiable. Because adding more sentences to $A$ does not change unsatisfiability, we can assume that $A$ contains the field axioms and, for some $k,$ every sentence of the form $1+1+\cdots +1\neq 0$ with at most $k$ summands. Let $B$ contain all the sentences of $A$ except $\lnot \varphi .$ Then any field with a characteristic greater than $k$ is a model of $B,$ and $\lnot \varphi$ together with $B$ is not satisfiable. This means that $\varphi$ must hold in every model of $B,$ and in particular $\varphi$ holds in every field of characteristic greater than $k.$ This completes the proof.
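As a small illustration of the principle, consider the sentence $\exists x\,((1+1+1+1+1+1)\cdot x=1),$ which says that 6 is invertible and which holds in every field of characteristic zero. The sketch below checks this sentence by brute force in the prime fields $\mathbb {F} _{p}$ only; that restriction is an assumption made for simplicity (the truth of this particular sentence depends only on the characteristic), and the function names are hypothetical.

```python
def is_prime(n: int) -> bool:
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n ** 0.5) + 1))


def six_is_invertible(p: int) -> bool:
    """Check 'exists x with (1+1+1+1+1+1)*x = 1' in F_p by brute force."""
    six = sum([1] * 6) % p  # the term 1+1+1+1+1+1 evaluated in F_p
    return any((six * x) % p == 1 for x in range(p))


primes = [p for p in range(2, 60) if is_prime(p)]
# Characteristics in which the sentence fails.
failures = [p for p in primes if not six_is_invertible(p)]
```

For this sentence the only failures are in characteristics 2 and 3, so the bound promised by Robinson's principle can be taken to be 3.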

The Lefschetz principle, one of the first examples of a transfer principle, extends this result. A first-order sentence $\varphi$ in the language of rings is true in some (or equivalently, in every) algebraically closed field of characteristic 0 (such as the complex numbers) if and only if there exist infinitely many primes $p$ for which $\varphi$ is true in some algebraically closed field of characteristic $p,$ in which case $\varphi$ is true in all algebraically closed fields of sufficiently large non-0 characteristic $p.$^[5] One consequence is the following special case of the Ax–Grothendieck theorem: every injective complex polynomial map $\mathbb {C} ^{n}\to \mathbb {C} ^{n}$ is surjective^[5] (indeed, it can even be shown that its inverse is also a polynomial map).^[7] In fact, the surjectivity conclusion remains true for any injective polynomial map $F^{n}\to F^{n}$ where $F$ is a finite field or the algebraic closure of such a field.^[7]
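The finite-field case mentioned above is the easy one: an injective map from a finite set to itself is automatically surjective. The following sketch checks this by brute force for two sample polynomial maps on $\mathbb {F} _{5}^{2}$; the maps `shear` and `square` and all helper names are hypothetical examples chosen here, not taken from the cited sources.

```python
from itertools import product


def is_injective(f, p: int, n: int) -> bool:
    """Check injectivity of f: F_p^n -> F_p^n by enumerating all points."""
    seen = set()
    for v in product(range(p), repeat=n):
        w = f(v, p)
        if w in seen:
            return False
        seen.add(w)
    return True


def is_surjective(f, p: int, n: int) -> bool:
    """Check that every point of F_p^n is hit by f."""
    points = set(product(range(p), repeat=n))
    return {f(v, p) for v in points} == points


def shear(v, p):
    """(x, y) -> (x + y^2, y): an injective polynomial map."""
    x, y = v
    return ((x + y * y) % p, y)


def square(v, p):
    """(x, y) -> (x^2, y): not injective for odd p, since x and -x collide."""
    x, y = v
    return ((x * x) % p, y)


shear_ok = is_injective(shear, 5, 2) and is_surjective(shear, 5, 2)
square_injective = is_injective(square, 5, 2)
```

The injective map `shear` is also surjective, as the theorem requires, while `square` shows that the injectivity hypothesis cannot simply be dropped.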

Upward Löwenheim–Skolem theorem

A second application of the compactness theorem shows that any theory that has arbitrarily large finite models, or a single infinite model, has models of arbitrarily large cardinality (this is the upward Löwenheim–Skolem theorem). So for instance, there are nonstandard models of Peano arithmetic with uncountably many 'natural numbers'. To achieve this, let $T$ be the initial theory and let $\kappa$ be any cardinal number. Add to the language of $T$ one constant symbol for every element of $\kappa .$ Then add to $T$ a collection of sentences that say that the objects denoted by any two distinct constant symbols from the new collection are distinct (this is a collection of $\kappa ^{2}$ sentences). Since every finite subset of this new theory is satisfiable by a sufficiently large finite model of $T,$ or by any infinite model, the entire extended theory is satisfiable. But any model of the extended theory has cardinality at least $\kappa .$
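In symbols, the extended theory can be written as follows. The names $T'$ and $c_{\alpha }$ are notation introduced here for illustration; they do not appear in the text above.

```latex
T' \;=\; T \,\cup\, \{\, c_{\alpha} \neq c_{\beta} \;:\; \alpha, \beta \in \kappa,\ \alpha \neq \beta \,\}
```

A finite subset of $T'$ mentions only finitely many of the new constants, say $m$ of them, so any model of $T$ with at least $m$ elements can be expanded to a model of that subset by interpreting those constants as distinct elements.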

Non-standard analysis

A third application of the compactness theorem is the construction of nonstandard models of the real numbers, that is, consistent extensions of the theory of the real numbers that contain "infinitesimal" numbers. To see this, let $\Sigma$ be a first-order axiomatization of the theory of the real numbers. Consider the theory obtained by adding a new constant symbol $\varepsilon$ to the language and adjoining to $\Sigma$ the axiom $\varepsilon >0$ and the axioms $\varepsilon <{\tfrac {1}{n}}$ for all positive integers $n.$ Clearly, the standard real numbers $\mathbb {R}$ are a model for every finite subset of these axioms, because the real numbers satisfy everything in $\Sigma$ and, by suitable choice of $\varepsilon ,$ can be made to satisfy any finite subset of the axioms about $\varepsilon .$ By the compactness theorem, there is a model ${}^{*}\mathbb {R}$ that satisfies $\Sigma$ and also contains an infinitesimal element $\varepsilon .$
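The "suitable choice of $\varepsilon$" for a finite subset of the axioms can be made explicit. The sketch below assumes the finite subset is given by the list of indices $n$ of the selected axioms $\varepsilon <{\tfrac {1}{n}}$; the function name `witness_epsilon` is hypothetical, and exact rational arithmetic is used to avoid rounding issues.

```python
from fractions import Fraction


def witness_epsilon(ns):
    """Return a rational epsilon with 0 < epsilon < 1/n for every n in the finite list ns."""
    largest = max(ns, default=1)
    return Fraction(1, largest + 1)


finite_subset = [3, 10, 7, 1000]  # indices n of the selected axioms epsilon < 1/n
eps = witness_epsilon(finite_subset)
# The standard reals satisfy this finite subset when epsilon is interpreted as eps.
satisfied = eps > 0 and all(eps < Fraction(1, n) for n in finite_subset)
```

No single standard real works for all $n$ at once, which is why the model of the full axiom set produced by compactness must be nonstandard.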

A similar argument, this time adjoining the axioms $\omega >0,\;\omega >1,\;\ldots ,$ shows that the existence of numbers with infinitely large magnitudes cannot be ruled out by any axiomatization $\Sigma$ of the reals.^[8]

It can be shown that the hyperreal numbers ${}^{*}\mathbb {R}$ satisfy the transfer principle:^[9] a first-order sentence is true of $\mathbb {R}$ if and only if it is true of ${}^{*}\mathbb {R} .$

Proofs

One can prove the compactness theorem using Gödel's completeness theorem, which establishes that a set of sentences is satisfiable if and only if no contradiction can be proven from it. Since proofs are always finite and therefore involve only finitely many of the given sentences, the compactness theorem follows. In fact, the compactness theorem is equivalent to Gödel's completeness theorem, and both are equivalent to the Boolean prime ideal theorem, a weak form of the axiom of choice.^[10]
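The argument from completeness can be laid out as a chain of implications, proving the contrapositive of the nontrivial direction. Here $\vdash$ denotes provability in a fixed sound and complete deductive system and $\bot$ a contradiction; this notation is an assumption of the sketch.

```latex
\begin{aligned}
\Sigma \text{ has no model}
  &\implies \Sigma \vdash \bot
  && \text{(completeness)} \\
  &\implies \Sigma_0 \vdash \bot \text{ for some finite } \Sigma_0 \subseteq \Sigma
  && \text{(proofs are finite)} \\
  &\implies \Sigma_0 \text{ has no model}
  && \text{(soundness)}
\end{aligned}
```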

Gödel originally proved the compactness theorem in just this way, but later some "purely semantic" proofs of the compactness theorem were found; that is, proofs that refer to truth but not to provability. One of those proofs relies on ultraproducts hinging on the axiom of choice as follows:

Proof: Fix a first-order language $L,$ and let $\Sigma$ be a collection of $L$-sentences such that every finite subcollection $i\subseteq \Sigma$ has a model ${\mathcal {M}}_{i}.$ Let $I$ be the collection of finite subsets of $\Sigma ,$ and let ${\textstyle \prod _{i\subseteq \Sigma }{\mathcal {M}}_{i}}$ be the direct product of the structures, indexed by $I.$ For each $i\in I,$ let $A_{i}=\{j\in I:j\supseteq i\}.$ Since $A_{i}\cap A_{i'}=A_{i\cup i'}$ for all $i,i'\in I$ and each $A_{i}$ contains $i,$ the family of all of these sets $A_{i}$ generates a proper filter, so there is an ultrafilter $U$ containing all sets of the form $A_{i}.$

Now for any sentence $\varphi$ in $\Sigma$:

the set $A_{\{\varphi \}}$ is in $U$;
whenever $j\in A_{\{\varphi \}},$ then $\varphi \in j,$ hence $\varphi$ holds in ${\mathcal {M}}_{j}$;
the set of all $j$ with the property that $\varphi$ holds in ${\mathcal {M}}_{j}$ is a superset of $A_{\{\varphi \}},$ hence also in $U.$

Łoś's theorem now implies that $\varphi$ holds in the ultraproduct ${\textstyle \prod _{i\subseteq \Sigma }{\mathcal {M}}_{i}/U.}$ So this ultraproduct satisfies all formulas in $\Sigma .$
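The filter property of the sets $A_{i}$ used in the proof rests on the identity $A_{i}\cap A_{j}=A_{i\cup j}.$ The sketch below sanity-checks that identity on a toy three-element stand-in for $\Sigma$; for a finite $\Sigma$ compactness is trivial, so this only illustrates the combinatorics and is not a proof. All names are hypothetical.

```python
from itertools import combinations


def finite_subsets(sigma):
    """All subsets of a finite set sigma, as frozensets (the index set I of the proof)."""
    items = list(sigma)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]


def A(i, index_set):
    """The set A_i = { j in I : j is a superset of i }."""
    return frozenset(j for j in index_set if j >= i)


sigma = {"phi1", "phi2", "phi3"}  # stand-in labels for sentences
index_set = finite_subsets(sigma)

# Finite intersections of the A_i are again of the form A_k, and none is empty,
# so the family generates a proper filter.
closed_under_intersection = all(
    A(i, index_set) & A(j, index_set) == A(i | j, index_set)
    for i in index_set
    for j in index_set
)
all_nonempty = all(i in A(i, index_set) for i in index_set)
```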

Notes

[1] See Truss (1997).
[2] J. Barwise, S. Feferman, eds., Model-Theoretic Logics (New York: Springer-Verlag, 1985), in particular, Makowsky, J. A. Chapter XVIII: Compactness, Embeddings and Definability. 645–716, see Theorems 4.5.9, 4.6.12 and Proposition 4.6.9. For compact logics for an extended notion of model see Ziegler, M. Chapter XV: Topological Model Theory. 557–577. For logics without the relativization property it is possible to have simultaneously compactness and interpolation, while the problem is still open for logics with relativization. See Xavier Caicedo, A Simple Solution to Friedman's Fourth Problem, J. Symbolic Logic, Volume 51, Issue 3 (1986), 778–784. doi:10.2307/2274031. JSTOR 2274031.
[3] Vaught, Robert L.: "Alfred Tarski's work in model theory". Journal of Symbolic Logic 51 (1986), no. 4, 869–882.
[4] Robinson, A.: Non-standard analysis. North-Holland Publishing Co., Amsterdam 1966. page 48.
[5] Marker 2002, pp. 40–43.
[6] Gowers, Barrow-Green & Leader 2008, pp. 639–643.
[7] Tao, Terence (7 March 2009). "Infinite fields, finite fields, and the Ax-Grothendieck theorem".
[8] Goldblatt 1998, pp. 10–11.
[9] Goldblatt 1998, p. 11.
[10] See Hodges (1993).

References

Boolos, George; Jeffrey, Richard; Burgess, John (2004). Computability and Logic (fourth ed.). Cambridge University Press.
Chang, C.C.; Keisler, H. Jerome (1989). Model Theory (third ed.). Elsevier. ISBN 0-7204-0692-7.
Dawson, John W. junior (1993). "The compactness of first-order logic: From Gödel to Lindström". History and Philosophy of Logic. 14: 15–37. doi:10.1080/01445349308837208.
Hodges, Wilfrid (1993). Model theory. Cambridge University Press. ISBN 0-521-30442-3.
Goldblatt, Robert (1998). Lectures on the Hyperreals. New York: Springer Verlag. ISBN 0-387-98464-X.
Gowers, Timothy; Barrow-Green, June; Leader, Imre (2008). The Princeton Companion to Mathematics. Princeton: Princeton University Press. pp. 635–646. ISBN 978-1-4008-3039-8. OCLC 659590835.
Marker, David (2002). Model Theory: An Introduction. Graduate Texts in Mathematics. Vol. 217. Springer. ISBN 978-0-387-98760-6. OCLC 49326991.
Robinson, J. A. (1965). "A Machine-Oriented Logic Based on the Resolution Principle". Journal of the ACM. Association for Computing Machinery (ACM). 12 (1): 23–41. doi:10.1145/321250.321253. ISSN 0004-5411. S2CID 14389185.
Truss, John K. (1997). Foundations of Mathematical Analysis. Oxford University Press. ISBN 0-19-853375-6.

External links

Compactness Theorem, Internet Encyclopedia of Philosophy.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
