Loading [MathJax]/jax/element/mml/optable/MathOperators.js
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Mathematics LibreTexts

3.4: Substructures and the Löwenheim-Skolem Theorems

( \newcommand{\kernel}{\mathrm{null}\,}\)

In this section we will discuss a relation between structures. A given set of sentences may have many different models, and it will turn out that in some cases those models are related in surprising ways. We begin by defining the notion of a substructure.

Definition 3.4.1.

If A and B are two L-structures, we will say that A is a substructure of B, and write AB, if:

  1. AB.
  2. For every constant symbol c, cA=cB.
  3. For every n-ary relation symbol R, RA=RBAn.
  4. For every n-ary function symbol f, fA=fBAn. In other words, for every n-ary function symbol f and every aA, fA(a)=fB(a). (This is called the restriction of the function fB to the set An.)

Thus a substructure of B is completely determined by its universe, and this universe can be any nonempty subset of B that contains the constants and is closed under every function f.

Example 3.4.2:

Suppose that we try to build a substructure A of the structure N=(N,0,S,+,,E,<). Since A must be closed under the functions and contain the constants, the number 0 must be an element of the universe A. But now, since the substructure must be closed under the function S, it is clear that every natural number must be an element of A. Thus N has no proper substructures.

Example 3.4.3:

Now, suppose that we try to find some substructures of the structure B=(N,0,<), with the usual interpretations of 0 and <. Since there are no function symbols, any nonempty subset of N that includes the number 0 can serve as the universe of a substructure AB.

Suppose that we let A=({0},0,<). Then notice that even though AB, there are plenty of sentences that are true in one structure that are not true in the other structure. For example, (x)(y)x<y is false in A and true in B. It will not be hard for you to find an example of a sentence that is true in A and false in B.

As Example 3.4.3 shows, if we are given two structures such that AB, most of the time you would expect that A and B would be very different, and there would be lots of sentences that would be true in one of the structures that would not be true in the other.

Sometimes, however, truth in the smaller structure is more closely tied to truth in the larger structure.

Definition 3.4.4.

Suppose that A and B are L-structures and AB. We say that A is an elementary substructure of B (equivalently, B is an elementary extension of A), and write AB, if for every s:VarsA and for every L-formula ϕ,

Aϕ[s]if and only ifBϕ[s].

Chaff: Notice that if we want to prove AB, we need only prove Aϕ[s]Bϕ[s], since once we have done that, the other direction comes for free by using the contrapositive and negations.

Proposition 3.4.5.

Suppose that AB. Then a sentence σ is true in A if and only if it is true in B.

proof

Exercise 5.

Example 3.4.6:

We saw earlier that the structure B=(N,0,<) has lots of substructures. However, B has no proper elementary substructures. For suppose that AB. Certainly, 0A, as A is a substructure. Since the sentence (y)[0<y(x)(0<xyx)] is true in B, it must be true in A as well. So

A(y)[0<y(x)(0<xyx)].

Thus, for any assignment function s:VarsA there is some aA such that

A[0<y(x)(0<xyx)][s[y|a]].

Fix such an s and such an aA. Now we use elementarity again. Since AB and s[y|a]:VarsA, we know that

B[0<y(x)(0<xyx)][s[y|a]].

But in the structure B, there is a unique element that makes the formula [0<y(x)(0<xyx)] true, namely the number 1. So a must be the number 1, and so 1 must be an element of A. Similarly, you can show that 2A, 3A, and so on. Thus NA, and A will not be a proper elementary substructure of B.

This example shows that when building an elementary substructure of a given structure B, we need to make sure that witnesses for each existential sentence true in B must be included in the universe of the elementary substructure A. That idea will be the core of the proof of the Downward Löwenheim-Skolem Theorem, Theorem 3.4.8. In fact, the next lemma says that making sure that such witnesses are elements of A is all that is needed to ensure that A is an elementary substructure of B.

Lemma 3.4.7.

Suppose that AB and that for every formula α and every s:VarsA such that Bxα[s] there is an aA such that Bα[s[x|a]]. Then AB.

proof

We will show, given the assumptions of the lemma, that if ϕ is any formula and s is any variable assignment function into A, Aϕ[s] if and only if Bϕ[s], and thus AB.

This is an easy proof by induction on the complexity of ϕ, which we will make even easier by noting that we can replace the inductive step by an inductive step, as can be defined in terms of .

So for the base case, assume that ϕ is atomic. For example, if ϕ is R(x,y), then Aϕ[s] if and only if (s(x),s(y))RA. But RA=RBA2, so (s(x),s(y))RA if and only if (s(x),s(y))RB. But (s(x),s(y))RB if and only if Bϕ[s], as needed.

For the inductive clauses, assume that ϕ is ¬α. Then

\[\begin{array}{rll} \mathfrak{A} \models \phi \left[ s \right] & \text{if and only if} \: \mathfrak{A} \models \neg \alpha \left[ s \right] & \\ & \text{if and only if} \: \mathfrak{A} \not\models \alpha \left[ s \right] & \\ & \text{if and only if} \: \mathfrak{B} \not\models \alpha \left[ s \right] & \text{inductive hypothesis \\ & \text{if and only if} \: \mathfrak{B} \models \neg \alpha \left[ s \right] & \\ & \text{if and only if} \: \mathfrak{B} \models \phi \left[ s \right]. \end{array}\]

The second inductive clause, if ϕ is αβ, is similar.

For the last inductive clause, suppose that ϕ is xα. Suppose also that Aϕ[s]; in other words, Axα[s]. Then, for some aA, Aα[s[x|a]]. Since s[x|a] is a function mapping variables into A, by our inductive hypothesis, Bα[s[x|a]]. But then Bxα[s], as needed. For the other direction, assume that Bxα[s], where s:VarsA. We use the assumption of the lemma to find an aA such that Bα[s[x|a]]. As s[x|a] is a function with codomain A, by the inductive hypothesis Aα[s[x|a]], and thus Bxα[s], and the proof is complete.

Chaff: We are now going to look at the Löwenheim-Skolem Theorems, which were published in 1915. To understand these theorems, you need to have at least a basic understanding of cardinality, a topic that is outlined in the Appendix. However, if you are in a hurry, it will suffice if you merely remember that there are many different sizes of infinite sets. An infinite set A is countable if there is a bijection between A and the set of natural numbers N, otherwise, the set is uncountable. Examples of countable sets include the integers and the set of rational numbers. The set of real numbers is uncountable, in that there is no bijection between R and N. So there are more reals than natural numbers. There are infinitely many different sizes of infinite sets. The smallest infinite size is countable.

Theorem 3.4.8: Downward löwenheim-skolem theorem

Suppose that L is a countable language and B is an L-structure. Then B has a countable elementary substructure.

proof

If B is finite or countable infinite, then B is its own countable elementary substructure, so assume that B is uncountable. As the language L is countable, there are only countably many L-formulas, and thus only countably many formulas of the form xα.

Let A0 be any nonempty countable subset of B. We show how to build A1 such that A0A1, and A1 is countable. The idea is to add to A0 witnesses for the truth (in B) of existential statements.

Notice that as A0 is countable, there are only countably many functions s:VarsA0 that are eventually constant, by which we mean there is a natural number k such that if i,j>k, then s(vi)=s(vj). (This is a nice exercise for those of you have had a course in set theory or are reasonably comfortable with cardinality arguments.) Also, if we are given any ϕ and any s:VarsA0, we can find an eventually constant s:VarsA0 such that s and s agree on the free variables of ϕ, and thus Bϕ[s] if and only if Bϕ[s].

The construction of A1: For each formula of the form xα and each s:VarsA0 such that Bxα[s], find an eventually constant s:VarsA0 such that s and s agree on the free variables of xα. Pick an element aα,sB such that Bα[s[x|aα,s]], and let

A1=A0{aα,s}allα,s:VarsA0.

Notice that A1 is countable, as there are only countably many α's and countably many s.

Continue this construction, iteratively building An+1 from An. Let A=n=0An. As A is a countable union of countable sets, A is countable.

Now we have constructed a potential universe A for a substructure for B. We have to prove that A is closed under the functions of B (by the remarks following Definition 3.4.1 this shows that A is a substructure of B), and we have to show that A satisfies the criteria set out in Lemma 3.4.7, so we will know that A is an elementary substructure of B.

First, to show that A is closed under the functions of B, suppose that aA and f is a unary function symbol (the general case is identical) and that b=fB(a). We must show that bA. Fix an n so large that aAn, let ϕ be the formula (y)y=f(x), and let s be any assignment function into A such that s(x)=a. We know that B(y)y=f(x)[s], and we know that if B(y=f(x))[s[y|d]], then d=b. So, in our construction of An+1 we must have used ay=f(x),s=b, so bAn+1, and bA, as needed.

In order to use Lemma 3.4.7, we must show that if α is a formula and s:VarsA is such that Bα[s[x|a]]. So, fix such an α and such an s. Find an eventually constant s:VarsA such that s and s agree on all the free variables of α. Thus Bxα[s], and all of the values of s are elements of some fixed An, as s takes on only a finite number of values. But then by construction of An+1 such that Bα[s[x|a]], as needed.

So we have met the hypotheses of Lemma 3.4.7, and thus A is a countable elementary substructure of B, as needed.

Chaff: We would like to look at a bit of this proof a little more closely. In the construction of A1, what we did was to find an a,s for each formula xα and each s:VarsA, and the point was that a,s would be a witness to the truth in B of the existential statement xα. So we have constructed a function which, given an existential formula xα and an assignment function, finds a value for x that makes the formula α true. A function of this sort is called a Skolem function, and the construction of A in the proof of the Downward Löwenheim-Skolem Theorem can thus be summarized: Let A0 be a countable subset of B, and form the closure of A0 under the set of all Skolem functions. Then show that this closure is an elementary substructure of B.

Example 3.4.9:

We saw an indication in Exercise 4 in Section 2.8 that the axioms of Zermelo-Fraenkel set theory (known as ZF) can be formalized in first-order logic. Accepting that as true (which it is), we know that if the axioms are consistent they have a model, and then by the Downward Löwenheim-Skolem Theorem, there must be a countable model for set theory. But this is interesting, as the following are all theorems of ZF:

  • There is a countably infinite set.
  • If a set a exists, then the collection of subsets of a exists.
  • If a is countably infinite, then the collection of subsets of a is uncountable. (This is Cantor's Theorem).

Now, let us suppose that A is our countable model of ZF, and suppose that a is an element of A and is countably infinite. If b is the set of all of the subsets of a, we know that b is uncountable (by Cantor's Theorem) and yet b must be countable, as all of the elements of b are in the model A, and A is countable! So b must be both countable and uncountable! This is called (somewhat incorrectly) Skolem's paradox, and Exercise 8 asks you to figure out the solution to the paradox

Probably the way to think about the Downward Löwenheim-Skolem Theorem is that it guarantees that if there are any infinite models of a given set of formulas, then there is a small (countably infinite means small) model of that set of formulas. It seems reasonable to ask if there is a similar guarantee about big models, and there is.

Proposition 3.4.10.

Suppose that Σ is a set of L-formulas with an infinite model. If κ is an infinite cardinal, then there is a model of Σ of cardinality greater than or equal to κ.

proof

This is an easy application of the Compactness Theorem. Expand L to include κ new constant symbols ci, and let Γ=Σ{cicj|ij}. Then Γ is finitely satisfiable, as we can take our given infinite model of Σ and interpret the ci in that model in such a way that cicj for any finite set of constant symbols. By the Compactness Theorem, there is a structure A that is a model of Γ, and thus certainly the cardinality of A is greater than or equal to κ. If we restrict A to the original language, we get a model of Σ of the required cardinality.

corollary 3.4.11.

If Σ is a set of formulas from a countable language with an infinite model, and if κ is an infinite cardinal, then there is a model of Σ of cardinality κ.

proof

First, use Proposition 3.4.10 to get B, a model of Σ of cardinality greater than or equal to κ. Then, mimic the proof of the Downward Löwenheim-Skolem Theorem, starting with a set A0B of cardinality exactly κ. Then the A that is constructed in that proof also will have cardinality κ, and as AB, A will be a model of Σ of cardinality κ.

corollary 3.4.12

If A is an infinite L-structure, then there is no set of first-order formulas that characterize A up to isomorphism.

proof

More precisely, the corollary says that there is no set of formulas Σ such that BΣ if and only if AB. We know that there are models of Σ of all cardinalities, and we know that there are no bijections between sets of different cardinalities. So there must be many models of Σ that are not isomorphic to A.

Chaff: There are sets of axioms that do characterize infinite structures. For example, the second-order axioms of Peano Arithmetic include axioms to ensure that addition and multiplication behave normally, and they also include the principle of mathematical induction: If M is a set of numbers, if 0M, and if S(n)M for every n such that nM, then (n)(nM).

Any model of Peano Arithmetic is isomorphic to the natural numbers, but notice that we used two notions (sets of numbers and the elementhood relation) that are not part of our description of N. By introducing sets of numbers we have left the world of first-order logic and have entered second-order logic, and it is only by using second-order logic that we are able to characterize N. For a nice discussion of this topic, see [Bell and Machover 77, Chapter 7, Section 2].

The results from Proposition 3.4.10 to Corollary 3.4.12 give us models that are large, but they have a slightly different flavor from the Downward Löwenheim-Skolem Theorem, in that they do not guarantee that the small model is an elementary substructure of the large model. That is the content of the Upward Löwenheim-Skolem Theorem, a proof of which is outlined in the Exercises.

theorem 3.4.13: Upward löwenheim-Skolem theorem

If L is a countable language, A is an infinite L-structure, and κ is a cardinal, then A has an elementary extension B such that the cardinality of B is greater than or equal to κ.

Exercises

  1. Suppose that BA, that ϕ is of the form (x)ψ, where ψ is quantifier-free, and that Aϕ. Prove that Bϕ. The short version of this fact is, "Universal sentences are preserved downward." Formulate and prove the corresponding fact for existential sentences.
  2. Justify the Chaff following Definition 3.4.4.
  3. Show that if AB and CB and AC, then AC.
  4. Suppose that we have an elementary chain, a set of L-structures such that
    A1A2A3
    and let A=i=1Ai. So the universe A of A is the union of the universes Ai, RA=i=1RAi, etc. Show that AiA for each i. [Suggestion: To show that AiA is pretty easy by the definition. To get that A is an elementary extension, you have to use induction on the complexity of formulas. Notice by the comments following Definition 3.4.4 that you need only prove one direction. You may find it easier to use rather than in the quantifier part of the inductive step of the proof.]
  5. Prove Proposition 3.4.5.
  6. Show that if AB and if there is an element bB and a formula ϕ(x) such that Bϕ[s[x|b]] and for every other ˆbB, B, then b \in A. [Suggestion: This is very similar to Example 3.4.6.]
  7. Suppose that \mathfrak{B} = \{ \mathbb{N}, +, \cdot \}, and let A_0 = \{ 2, 3 \}. Let F be the set of Skolem functions \{ f_{\alpha, s} \} corresponding to \alpha's of the form \left( \exists x \right) x = yz. Find the closure of A_0 under F. [Suggestion: Do not forget that the assignment functions s that you need to consider are functions mapping into A_0 at first, then A_1, and so on. You probably want to explicitly write out A_1, then A_2, etc. We are using the notation here corresponding to the proof of Theorem 3.4.8.]
  8. To say that a set a is countable means that there is a function with domain the natural numbers and codomain a that is a bijection. Notice that this is an existential statement, saying that a certain kind of function exists. Now, think about Example 3.4.9 and see if you can figure out why it is not really a contradiction that the set b is both countable and uncountable. In particular, think about what it means for an existential statement to be true in a structure \mathfrak{A}, as opposed to true in the real world (whatever that means!).
  9. (Toward the Proof of the Upward Löwenheim-Skolem Theorem) If \mathfrak{A} is an (\mathcal{L}\)-structure, let \mathcal{L} \left( A \right) = \mathcal{L} \cup \{ \bar{a} | a \in A \}, where each \bar{a} is a new constant symbol. Then, let \bar{\mathfrak{A}} be the \mathcal{L} \left( A \right)-structure having the same universe as \mathfrak{A} and the same interpretation of the symbols of \mathcal{L} as \mathfrak{A}, and interpreting each \bar{a} as a. Then we define the complete diagram of \mathfrak{A} as
    Th \left( \bar{\mathfrak{A}} \right) = \{ \sigma | \sigma \: \text{is an} \: \mathcal{L} \left( A \right) \text{-formula such that} \: \bar{\mathfrak{A}} \models \sigma \}.
    Show that if \bar{\mathfrak{B}} is any model of Th \left( \bar{\mathfrak{A}} \right), and if \mathfrak{B} = \bar{\mathfrak{B}} \upharpoonright_\mathcal{L}, then \mathfrak{A} is isomorphic to an elementary substructure of \mathfrak{B}. [Suggestion: Let h : A \rightarrow B be given by h \left( a \right) = \bar{a}^\mathfrak{B}. Let C be the range of h. Show C is closed under f^\mathfrak{B} for every f in \mathcal{L}, and thus C is the universe of \mathfrak{C}, a substructure of \mathfrak{B}. Then show h is an isomorphism between \mathfrak{A} and \mathfrak{C}. Finally, show that \mathfrak{C} \prec \mathfrak{B}.]
  10. Use Exercise 9 to prove the Upward Löwenheim-Skolem Theorem by finding a model \bar{\mathfrak{B}} of the complete diagram of the given model \mathfrak{A} such that the cardinality of \bar{B} is greater than or equal to \kappa.
  11. We can now fill in some of the details of our discussion of nonstandard analysis from Example 3.3.5. As the language \mathcal{L}_\mathbb{R} of that example already includes constant symbols for each real number, the complete diagram of \mathfrak{R} is nothing more than Th \left( \mathfrak{R} \right). Explain how Exercise 9 shows that there is an isomorphic copy of the real line living inside the structure \mathfrak{A}.

This page titled 3.4: Substructures and the Löwenheim-Skolem Theorems is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Christopher Leary and Lars Kristiansen (OpenSUNY) via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?