14.1 Proof by Induction

\( \newcommand{\NOT}{\neg} \newcommand{\AND}{\wedge} \newcommand{\OR}{\vee} \newcommand{\XOR}{\oplus} \newcommand{\IMP}{\Rightarrow} \newcommand{\IFF}{\Leftrightarrow} \newcommand{\TRUE}{\text{True}\xspace} \newcommand{\FALSE}{\text{False}\xspace} \newcommand{\IN}{\,{\in}\,} \newcommand{\NOTIN}{\,{\notin}\,} \newcommand{\TO}{\rightarrow} \newcommand{\DIV}{\mid} \newcommand{\NDIV}{\nmid} \newcommand{\MOD}[1]{\pmod{#1}} \newcommand{\MODS}[1]{\ (\text{mod}\ #1)} \newcommand{\N}{\mathbb N} \newcommand{\Z}{\mathbb Z} \newcommand{\Q}{\mathbb Q} \newcommand{\R}{\mathbb R} \newcommand{\C}{\mathbb C} \newcommand{\cA}{\mathcal A} \newcommand{\cB}{\mathcal B} \newcommand{\cC}{\mathcal C} \newcommand{\cD}{\mathcal D} \newcommand{\cE}{\mathcal E} \newcommand{\cF}{\mathcal F} \newcommand{\cG}{\mathcal G} \newcommand{\cH}{\mathcal H} \newcommand{\cI}{\mathcal I} \newcommand{\cJ}{\mathcal J} \newcommand{\cL}{\mathcal L} \newcommand{\cK}{\mathcal K} \newcommand{\cN}{\mathcal N} \newcommand{\cO}{\mathcal O} \newcommand{\cP}{\mathcal P} \newcommand{\cQ}{\mathcal Q} \newcommand{\cS}{\mathcal S} \newcommand{\cT}{\mathcal T} \newcommand{\cV}{\mathcal V} \newcommand{\cW}{\mathcal W} \newcommand{\cZ}{\mathcal Z} \newcommand{\emp}{\emptyset} \newcommand{\bs}{\backslash} \newcommand{\floor}[1]{\left \lfloor #1 \right \rfloor} \newcommand{\ceil}[1]{\left \lceil #1 \right \rceil} \newcommand{\abs}[1]{\left | #1 \right |} \newcommand{\xspace}{} \newcommand{\proofheader}[1]{\underline{\textbf{#1}}} \)

In this chapter, we’re going to learn about two intertwined concepts: the mathematical proof technique of induction, and the programming technique of recursion. Even though these techniques may seem unrelated, we’ll see over the course of this chapter that they are truly two sides of the same coin. Both are rooted in a fundamental way of representing data and problems, which can be summarized in this general strategy: identify how an entity or problem can be broken down into smaller instances of that entity or problem with the same structure.

A proof by induction

Let’s start with an example of a common use of induction in mathematics: proving the correctness of various summation/product formulas. For example, the formulas found in Appendix C.1 can all be proved using induction.

Let $f: \N \to \N$ be defined as $\displaystyle{f(n) = \sum_{i=0}^n i}$. Prove that for all $n \in \N$, $f(n) = \frac{n(n + 1)}{2}$.

It is possible to verify that this formula is true for any concrete number like $n = 100$ just by computing the value of $f(n)$ and the fraction $\frac{n(n + 1)}{2}$ and checking that these two values are equal. But of course a universally-quantified statement can’t be proven just with specific exapmles, so we need a different approach: induction.

The principle of induction

The first explicit formulation of the principle of induction was given by the mathematician Blaise Pascal in 1665. However, its uses have been traced as far back as Plato (370 BC), and a variation of Euclid’s proof of the existence of infinitely many primes (from around the same time period). We cannot stress enough the importance of the induction principle—it is the powerhorse behind nearly all proofs. The principle of induction applies to universal statements over the natural numbers—that is, statements of the form $\forall n \in \N,~ P(n)$. It cannot be used to prove statements of any other form! Note however that $P(n)$ can be quite complicated and can involve other possibly nested quantifiers.

In this course, we will only study rigorously the most basic form of induction, commonly called simple induction.In CSC236/CSC240, you’ll learn about different forms of induction. There are two steps to proving a statement $\forall n \in \N,~ P(n)$ by induction:

The base case is a proof that the statement is True for the first natural number $n=0$; that is, a proof that $P(0)$ holds.
The inductive step is a proof that for all $k \in \N$, if $P(k)$ is true, then $P(k+1)$ is also true. Our convention will be to use $k$ as the variable for the inductive step, but you may prefer using $n$ or some other variable name. That is: \[\forall k \in \N,~ P(k) \Rightarrow P(k+1).\]

Once the base case and inductive step are proven, by the principle of induction, one can conclude $\forall n \in \N,~ P(n)$.

Typical structure of a proof by induction.

Given statement to prove: $\forall n \in \N,~ P(n)$.

We prove this by induction on $n$.

Base case: Let $n = 0$.

[Proof that $P(0)$ is True.]

Inductive step: Let $k \in \N$, and assume that $P(k)$ is true. (The assumption that $P(k)$ is true is called the induction hypothesis.)

[Proof that $P(k+1)$ is True.]

Back to our example

Let’s see induction in action with our above example, proving that $\forall n \in \N,~ f(n) = \frac{n(n + 1)}{2}$.

We prove this statement by induction on $n$.

Base case: Let $n = 0$.

In this case, $f(0) = \displaystyle{\sum_{i=0}^{0} i} = 0$, and $\frac{0(0 + 1)}{2} = 0$. So the two sides of the equation are equal.

Inductive step: Let $k \in \N$ and assume that $f(k) = \frac{k(k + 1)}{2}$. This assumption is the induction hypothesis (I.H.) for our proof. We want to prove that $f(k + 1) = \frac{(k + 1)(k + 2)}{2}$.

Mini-discussion. As with any proof by induction, the key part of completing the induction step is to determine how to use the induction hypothesis. Remember what we said at the beginning of this section: induction is all about breaking down an entity into smaller instances of the same entity. In our case, we know that $\displaystyle{f(k + 1) = \sum_{i=0}^{k + 1} i}$, which can be broken down by “taking out” the last term: $\displaystyle{\left(\sum_{i=0}^k i \right) + (k + 1)}$.

Back to our proof. We’ll start with the left-hand side of the equation:

\[\begin{align*} f(k + 1) &= \sum_{i=0}^{k + 1} i \tag{definition of $f$} \\ &= \left(\sum_{i=0}^{k} i \right) + (k + 1) \tag{taking out last term} \\ &= f(k) + (k + 1) \tag{definition of $f$} \\ &= \frac{k(k + 1)}{2} + (k + 1) \tag{by the $\textbf{I.H.}$} \\ &= \frac{(k + 1)(k + 2)}{2} \end{align*}\]

How does induction “work”?

Proofs by induction often feel like cheating when you first see them. Rather than directly proving a predicate (e.g., $f(n) = \frac{n(n + 1)}{2}$) for an arbitrary $n$, we actually assume that the predicate is true for an arbitrary $k$ instead! The reason why induction works is that the inductive step allows us to form a chain of implications going from 0 to every other natural number. We often visualize induction as a line of dominos. Pushing over the first domino topples the second domino, then the third, etc.

Image of dominos

In a proof by induction, our base case tells us that $P(0)$ is True, unconditionally (no assumptions). And from our induction step, we know that $P(0) \Rightarrow P(1)$, and so $P(1)$ must also be True. And since $P(1)$ is True, using our induction step again, $P(1) \Rightarrow P(2)$, and so $P(2)$ is also True. This chain of implications extends infinitely far out, covering all natural numbers.

References

(MAT137 videos) 1.14 Proof by induction