5.3: Divisibility Statements and Other Proofs Using PMI


Page ID: 19390


There is a very famous result known as Fermat’s Little Theorem. This would probably be abbreviated FLT except for two things. In science fiction FLT means “faster than light travel” and there is another theorem due to Fermat that goes by the initials FLT: Fermat’s Last Theorem. Fermat’s Last Theorem states that equations of the form \(a^n + b^n = c^n\), where \(n\) is a positive natural number, only have integer solutions that are trivial (like \(0^3 + 1^3 = 1^3\)) when \(n\) is greater than \(2\). When \(n\) is \(1\), there are lots of integer solutions. When \(n\) is \(2\), there are still plenty of integer solutions: these are the so-called Pythagorean triples, for example \(3\), \(4\) & \(5\) or \(5\), \(12\) & \(13\). It is somewhat unfair that this statement is known as Fermat’s Last Theorem since he didn’t prove it (or at least we can’t be sure that he proved it). Five years after his death, Fermat’s son published a translated¹ version of Diophantus’s Arithmetica containing his father’s annotations. One of those annotations, near the place where Diophantus was discussing the equation \(x^2 + y^2 = z^2\) and its solution in whole numbers, was the statement of what is now known as Fermat’s Last Theorem as well as the following claim:

Cuius rei demonstrationem mirabilem sane detexi hanc marginis exiguitas non caperet.

In English:

I have discovered a truly remarkable proof of this that the margin of this page is too small to contain.

Between \(1670\) and \(1994\) a lot of famous mathematicians worked on FLT but never found the “demonstrationem mirabilem.” Finally, in \(1994\), Andrew Wiles of Princeton announced a proof of FLT. In Wiles’s own words, however, his is “a twentieth century proof”; it can’t be the proof Fermat had in mind.

These days most people believe that Fermat was mistaken. Probably he thought a proof technique that works for small values of \(n\) could be generalized. It remains a tantalizing question: can a proof of FLT be accomplished using only methods available in the \(17^{\text{th}}\) century?

Part of the reason that so many people spent so much effort on FLT over the centuries is that Fermat had an excellent record of being correct about his theorems and proofs. The result known as Fermat’s little theorem is an example of a theorem and proof that Fermat got right. It is probably known as his “little” theorem because its statement is very short, but it is actually a fairly deep result.

Theorem \(\PageIndex{1}\): Fermat's Little Theorem

For every prime number \(p\), and for all integers \(x\), the \(p\)-th power of \(x\) and \(x\) itself are congruent \(\text{mod } p\). Symbolically:

\[x^p ≡ x \pmod{p}\]
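A statement like this is easy to test by machine before trying to prove it. The Python sketch below is only an illustration added here: the helper names `is_prime` and `fermat_little_holds` are made up for the example, and the ranges tested are arbitrary. It checks the congruence for every prime below \(50\) and every integer \(x\) from \(-20\) to \(20\), and then tries the composite modulus \(4\) to show that the hypothesis "\(p\) is prime" cannot be dropped.

```python
def is_prime(p: int) -> bool:
    """Trial-division primality test, adequate for small p."""
    if p < 2:
        return False
    d = 2
    while d * d <= p:
        if p % d == 0:
            return False
        d += 1
    return True


def fermat_little_holds(p: int, x: int) -> bool:
    """Return True when x**p and x leave the same remainder mod p."""
    # pow(x, p, p) is Python's built-in modular exponentiation;
    # like x % p, it returns a value in the range 0..p-1.
    return pow(x, p, p) == x % p


# Arbitrary test ranges chosen for this illustration.
small_primes = [p for p in range(2, 50) if is_prime(p)]
all_ok = all(
    fermat_little_holds(p, x) for p in small_primes for x in range(-20, 21)
)

print(all_ok)                     # True
print(fermat_little_holds(4, 2))  # False: 2**4 = 16 is 0 mod 4, but 2 is 2 mod 4
```

A finite check of this kind is evidence, not a proof. It cannot replace an argument that covers every prime and every integer.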

A slight restatement of Fermat’s little theorem is that \(p\) is always a divisor of \(x^p − x\) (assuming \(p\) is a prime and \(x\) is an integer). Math professors enjoy using their knowledge of Fermat’s little theorem to cook up divisibility results that can be proved using mathematical induction. For example, consider the following:

\(∀n ∈ \mathbb{N}, 3|(n^3 + 2n + 6).\)

This is really just the \(p = 3\) case of Fermat’s little theorem with a little camouflage added: \(n^3 + 2n + 6 = (n^3 − n) + 3(n + 2)\). But let’s have a look at proving this statement using PMI.

Theorem \(\PageIndex{2}\)

\[∀n ∈ \mathbb{N}, 3|(n^3 + 2n + 6)\]

Proof

(By mathematical induction)

Basis: When \(n = 0\) the quantity \(n^3 + 2n + 6\) is \(6\), and clearly \(3|6\).

Inductive step:

(We need to show that \(3|(k^3 + 2k + 6) \implies 3|((k + 1)^3 + 2(k + 1) + 6)\).)

Consider the quantity \((k + 1)^3 + 2(k + 1) + 6\).

\(\begin{array}{rl} (k + 1)^3 + 2(k + 1) + 6 &= (k^3 + 3k^2 + 3k + 1) + (2k + 2) + 6 \\ &= (k^3 + 2k + 6) + 3k^2 + 3k + 3 \\ &= (k^3 + 2k + 6) + 3(k^2 + k + 1). \end{array}\)

By the inductive hypothesis, \(3\) is a divisor of \(k^3 + 2k + 6\) so there is an integer \(m\) such that \(k^3 + 2k + 6 = 3m\). Thus,

\(\begin{array}{rl} (k + 1)^3 + 2(k + 1) + 6 &= 3m + 3(k^2 + k + 1) \\ &= 3(m + k^2 + k + 1), \end{array}\)

so \(3\) is a divisor of \((k + 1)^3 + 2(k + 1) + 6\).

Q.E.D.
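The one place where a proof like this can go wrong is the algebra in the inductive step, so it is worth spot-checking the identity \((k + 1)^3 + 2(k + 1) + 6 = (k^3 + 2k + 6) + 3(k^2 + k + 1)\) numerically. The following Python sketch does that. The function names and the cutoff of \(200\) are choices made for this illustration only.

```python
def cubic_expression(n: int) -> int:
    """The quantity from the theorem: n^3 + 2n + 6."""
    return n**3 + 2 * n + 6


def step_identity_holds(k: int) -> bool:
    """Check the key line of the inductive step for one value of k."""
    return cubic_expression(k + 1) == cubic_expression(k) + 3 * (k * k + k + 1)


# Arbitrary cutoff: test k = 0, 1, ..., 199.
identity_ok = all(step_identity_holds(k) for k in range(200))
divisible_ok = all(cubic_expression(n) % 3 == 0 for n in range(200))

print(identity_ok, divisible_ok)  # True True
```

If the identity failed for even one \(k\), that would signal an arithmetic slip in the expansion. Agreement on a finite range is reassuring but proves nothing by itself.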

Practice

Devise an inductive proof of the statement, \(∀x ∈ \mathbb{N}, 5|(x^5+4x−10)\).

There is one other subtle trick for devising statements to be proved by PMI that you should know about. An example should suffice to make it clear. Notice that \(7\) is congruent to \(1 \pmod{6}\); it follows that any power of \(7\) is also congruent to \(1 \pmod{6}\). So, if we subtract \(1\) from some power of \(7\) we will have a number that is divisible by \(6\).

The proof (by PMI) of a statement like this requires another subtle little trick. Somewhere along the way in the proof you’ll need the identity \(7 = 6+1\).

Theorem \(\PageIndex{3}\)

\[∀n ∈ \mathbb{N}, 6|7^n − 1\]

Proof

(By PMI)

Basis: Note that \(7^0 − 1\) is \(0\) and also that \(6|0\).

Inductive step:

(We need to show that if \(6|7^k − 1\) then \(6|7^{k+1} − 1\).)

Consider the quantity \(7^{k+1} − 1\).

\(\begin{array}{rl} 7^{k+1} − 1 &= 7 · 7^k − 1 \\ &= (6 + 1) · 7^k − 1 \\ &= 6 · 7^k + 1 · 7^k − 1 \\ &= 6(7^k) + (7^k − 1) \end{array}\)

By the inductive hypothesis, \(6 | 7^k − 1\) so there is an integer \(m\) such that \(7^k − 1 = 6m\). It follows that

\(7^{k+1} − 1 = 6(7^k) + 6m\).

So, clearly, \(6\) is a divisor of \(7^{k+1} − 1\).

Q.E.D.
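Both ingredients of this argument can be watched in action: the fact that every power of \(7\) leaves remainder \(1\) on division by \(6\), and the decomposition \(7^{k+1} − 1 = 6(7^k) + (7^k − 1)\) that comes from writing \(7 = 6 + 1\). The Python sketch below is an illustration with made-up helper names and an arbitrary range of exponents.

```python
def seven_power_minus_one(n: int) -> int:
    """The quantity from the theorem: 7^n - 1."""
    return 7**n - 1


def decomposition_holds(k: int) -> bool:
    """Check 7^(k+1) - 1 == 6 * 7^k + (7^k - 1) for one value of k."""
    return seven_power_minus_one(k + 1) == 6 * 7**k + seven_power_minus_one(k)


# Remainders of 7^0, 7^1, ..., 7^9 on division by 6.
remainders = [pow(7, n, 6) for n in range(10)]

decomposition_ok = all(decomposition_holds(k) for k in range(100))
divisible_ok = all(seven_power_minus_one(n) % 6 == 0 for n in range(100))

print(remainders)                      # [1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
print(decomposition_ok, divisible_ok)  # True True
```

The same pattern works for any base \(b\) and modulus \(b − 1\), since \(b = (b − 1) + 1\).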

Mathematical induction can often be used to prove inequalities. There are quite a few examples of families of statements where there is an inequality for every natural number. Often such statements seem to be true and yet devising a proof can be elusive. If such is the case, try using PMI. One hint: it is fairly typical that the inductive step in a PMI proof of an inequality will involve reasoning that isn’t particularly sharp. Just remember that if you have an inequality and you make the big side even bigger, the resulting statement is certainly still true!

Consider the sequences \(2^n\) and \(n!\).

\(n\)	\(0\)	\(1\)	\(2\)	\(3\)
\(2^n\)	\(1\)	\(2\)	\(4\)	\(8\)
\(n!\)	\(1\)	\(1\)	\(2\)	\(6\)

As the table illustrates, for small values of \(n\), \(2^n ≥ n!\). But from \(n = 4\) onward the inequality is reversed.

Theorem \(\PageIndex{4}\)

\[∀n ≥ 4 ∈ \mathbb{N}, 2^n < n!\]

Proof

(By mathematical induction)

Basis: When \(n = 4\) we have \(2^4 < 4!\), which is certainly true \((16 < 24)\).

Inductive step: Suppose that \(k\) is a natural number with \(k ≥ 4\), and that \(2^k < k!\). Since \(k ≥ 4\), we also have \(2 < k + 1\). Multiply the left-hand side of the inequality \(2^k < k!\) by \(2\) and the right-hand side by \(k + 1\)² to get

\(2 · 2^k < (k + 1) · k!\).

That is, \(2^{k+1} < (k + 1)!\).

Q.E.D.
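It can be instructive to see exactly where the inequality starts to hold and why the basis has to be \(n = 4\). The Python sketch below is an added illustration: the function name `power_below_factorial` and the upper limit of \(200\) are arbitrary choices. It evaluates \(2^n < n!\) for the small cases and then for a long run of larger ones.

```python
from math import factorial


def power_below_factorial(n: int) -> bool:
    """Return True when 2^n < n! for the given natural number n."""
    return 2**n < factorial(n)


# The inequality fails for n = 0, 1, 2, 3 ...
small_cases = {n: power_below_factorial(n) for n in range(4)}

# ... and holds for every n tested from 4 onward (arbitrary upper limit).
holds_from_4 = all(power_below_factorial(n) for n in range(4, 200))

print(small_cases)   # {0: False, 1: False, 2: False, 3: False}
print(holds_from_4)  # True
```

The inductive step explains why the gap only widens: each time \(n\) increases by one, the left side is multiplied by \(2\) while the right side is multiplied by \(n + 1\), which is at least \(5\).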

The observant Calculus student will certainly be aware of the fact that, asymptotically, exponential functions grow faster than polynomial functions. That is, if you have a base \(b\) which is greater than \(1\), the function \(b^x\) is eventually larger than any polynomial \(p(x)\). This may seem a bit hard to believe if \(b = 1.001\) and \(p(x) = 500x^{10}\). The graph of \(y = 1.001^x\) is practically indistinguishable from the line \(y = 1\) (at first), whereas the graph of \(y = 500x^{10}\) has already reached the astronomical value of five trillion \((5,000,000,000,000)\) when \(x\) is just \(10\). Nevertheless, the exponential will eventually outstrip the polynomial. We can use the methods of this section to get started on proving the fact mentioned above. Consider the two sequences \(n^2\) and \(2^n\).

\(n\)	\(0\)	\(1\)	\(2\)	\(3\)	\(4\)	\(5\)	\(6\)
\(n^2\)	\(0\)	\(1\)	\(4\)	\(9\)	\(16\)	\(25\)	\(36\)
\(2^n\)	\(1\)	\(2\)	\(4\)	\(8\)	\(16\)	\(32\)	\(64\)

If we think of a “race” between the sequences \(n^2\) and \(2^n\), notice that \(2^n\) starts out with the lead. The two sequences are tied when \(n = 2\). Briefly, \(n^2\) goes into the lead but they are tied again when \(n = 4\). After that, it would appear that \(2^n\) recaptures the lead for good. Of course, we’re making a rather broad presumption – is it really true that \(n^2\) never catches up with \(2^n\) again? Well, if we’re right then the following theorem should be provable:

Theorem \(\PageIndex{5}\)

For all natural numbers \(n\), if \(n ≥ 4\) then \(n^2 ≤ 2^n\).

Proof

Basis: When \(n = 4\) we have \(4^2 ≤ 2^4\), which is true since both numbers are \(16\).

Inductive step: (In the inductive step we assume that \(k^2 ≤ 2^k \) and then show that \((k + 1)^2 ≤ 2^{k+1}\).)

The inductive hypothesis tells us that

\(k^2 ≤ 2^k\).

If we add \(2k + 1\) to the left-hand side of this inequality and \(2^k\) to the right-hand side we will produce the desired inequality, because \(k^2 + 2k + 1 = (k + 1)^2\) and \(2^k + 2^k = 2^{k+1}\). Thus our proof will follow provided that we know that \(2k + 1 ≤ 2^k\). Indeed, it is sufficient to show that \(2k + 1 ≤ k^2\) since we already know (by the inductive hypothesis) that \(k^2 ≤ 2^k\).

So the result remains in doubt unless you can complete the exercise that follows. . .

Q.E.D.???

Practice

Prove the lemma: For all \(n ∈ \mathbb{N}\), if \(n ≥ 4\) then \(2n + 1 ≤ n^2\).
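Before attempting the proof, it may help to confirm numerically what the inductive step of Theorem 5 needs. Since \((k + 1)^2 = k^2 + 2k + 1\), the missing piece is the inequality \(2n + 1 ≤ n^2\) for \(n ≥ 4\). The Python sketch below checks that inequality together with the theorem itself over an arbitrary finite range. The names `lemma_holds` and `theorem_holds` are invented for this illustration.

```python
def lemma_holds(n: int) -> bool:
    """The inequality the inductive step needs: 2n + 1 <= n^2."""
    return 2 * n + 1 <= n * n


def theorem_holds(n: int) -> bool:
    """The claim of Theorem 5: n^2 <= 2^n."""
    return n * n <= 2**n


# Arbitrary finite range starting at the basis n = 4.
lemma_ok = all(lemma_holds(n) for n in range(4, 500))
theorem_ok = all(theorem_holds(n) for n in range(4, 500))

print(lemma_ok, theorem_ok)  # True True
print(theorem_holds(3))      # False: 9 > 8, so the basis cannot be moved down to 3
```

This is numerical evidence only. The exercise still asks for an inductive argument that covers every \(n ≥ 4\).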

Exercises:

Give inductive proofs of the following:

Exercise \(\PageIndex{1}\)

\(∀x ∈ \mathbb{N}, 3|x^3 − x\)

Exercise \(\PageIndex{2}\)

\(∀x ∈ \mathbb{N}, 3|x^3 + 5x\)

Exercise \(\PageIndex{3}\)

\(∀x ∈ \mathbb{N}, 11|x^{11} + 10x\)

Exercise \(\PageIndex{4}\)

\(∀n ∈ \mathbb{N}, 3|4^n − 1\)

Exercise \(\PageIndex{5}\)

\(∀n ∈ \mathbb{N}, 6|(3n^2 + 3n − 12)\)

Exercise \(\PageIndex{6}\)

\(∀n ∈ \mathbb{N}, 5|(n^5 − 5n^3 + 14n)\)

Exercise \(\PageIndex{7}\)

\(∀n ∈ \mathbb{N}, 4|(13^n + 4n − 1)\)

Exercise \(\PageIndex{8}\)

\(∀n ∈ \mathbb{N}, 7|8^n + 6\)

Exercise \(\PageIndex{9}\)

\(∀n ∈ \mathbb{N}, 6|2n^3 − 2n − 12\)

Exercise \(\PageIndex{10}\)

\(∀n ≥ 3 ∈ \mathbb{N}, 3n^2 + 3n + 1 < 2n^3\)

Exercise \(\PageIndex{11}\)

\(∀n > 3 ∈ \mathbb{N}, n^3 < 3^n\)

Exercise \(\PageIndex{12}\)

\(∀n ≥ 3 ∈ \mathbb{N}, n^3 + 3 > n^2 + 3n + 1\)

Exercise \(\PageIndex{13}\)

\(∀x ≥ 4 ∈ \mathbb{N}, x^{2} 2^x ≤ 4^x\)

Notes

1. The translation from Greek into Latin was done by Claude Bachet.

2. It might be smoother to justify this step by first proving the lemma that \(∀a, b, c, d ∈ \mathbb{R}^+, a < b ∧ c < d \implies ac < bd\).
