2.3: Polynomial Fitting
( \newcommand{\kernel}{\mathrm{null}\,}\)
Investigate!
A standard 8×8 chessboard contains 64 squares. Actually, this is just the number of unit squares. How many squares of all sizes are there on a chessboard? Start with smaller boards: 1×1, 2×2, 3×3, etc. Find a formula for the total number of squares in an n×n board.
So far we have seen methods for finding the closed formulas for arithmetic and geometric sequences. Since we know how to compute the sum of the first n terms of arithmetic and geometric sequences, we can compute the closed formulas for sequences which have an arithmetic (or geometric) sequence of differences between terms. But what if we consider a sequence which is the sum of the first n terms of a sequence which is itself the sum of an arithmetic sequence?
Before we get too carried away, let's consider an example: How many squares (of all sizes) are there on a chessboard? A chessboard consists of 64 squares, but we also want to consider squares of longer side length. Even though we are only considering an 8×8 board, there is already a lot to count. So instead, let us build a sequence: the first term will be the number of squares on a 1×1 board, the second term will be the number of squares on a 2×2 board, and so on. After a little thought, we arrive at the sequence
1,5,14,30,55,…
This sequence is not arithmetic (or geometric for that matter), but perhaps it's sequence of differences is. For differences we get
4,9,16,25,…
Not a huge surprise: one way to count the number of squares in a 4×4 chessboard is to notice that there are 16 squares with side length 1, 9 with side length 2, 4 with side length 3 and 1 with side length 4. So the original sequence is just the sum of squares. Now this sequence of differences is not arithmetic since it's sequence of differences (the differences of the differences of the original sequence) is not constant. In fact, this sequence of second differences is
5,7,9,…
which is an arithmetic sequence (with constant difference 2). Notice that our original sequence had third differences (that is, differences of differences of differences of the original) constant. We will call such a sequence Δ3-constant. The sequence 1,4,9,16,… has second differences constant, so it will be a Δ2-constant sequence. In general, we will say a sequence is a Δk-constant sequence if the kth differences are constant.
Example 2.3.1
Which of the following sequences are Δk-constant for some value of k?
- 2,3,7,14,24,37,….
- 1,8,27,64,125,216,….
- 1,2,4,8,16,64,128,….
- Solution
-
- This is the sequence from Example 2.2.6, in which we found a closed formula by recognizing the sequence as the sequence of partial sums of an arithmetic sequence. Indeed, the sequence of first differences is 1,4,7,10,13,…, which itself has differences 3,3,3,3,…. Thus 2,3,7,14,24,37,… is a Δ2-constant sequence.
- These are the perfect cubes. The sequence of first differences is 7,19,37,61,91,…; the sequence of second differences is 12,18,24,30,…; the sequence of third differences is constant: 6,6,6,…. Thus the perfect cubes are a Δ3-constant sequence.
- If we take first differences we get 1,2,4,8,16,…. Wait, what? That's the sequence we started with. So taking second differences will give us the same sequence again. No matter how many times we repeat this we will always have the same sequence, which in particular means no finite number of differences will be constant. Thus this sequence is not Δk-constant for any k.
The Δ0-constant sequences are themselves constant, so a closed formula for them is easy to compute (it's just the constant). The Δ1-constant sequences are arithmetic and we have a method for finding closed formulas for them as well. Every Δ2-constant sequence is the sum of an arithmetic sequence so we can find formulas for these as well. But notice that the format of the closed formula for a Δ2-constant sequence is always quadratic. For example, the square numbers are Δ2-constant with closed formula an=n2. The triangular numbers (also Δ2-constant) have closed formula an=n(n+1)2, which when multiplied out gives you an n2 term as well. It appears that every time we increase the complexity of the sequence, that is, increase the number of differences before we get constants, we also increase the degree of the polynomial used for the closed formula. We go from constant to linear to quadratic. The sequence of differences between terms tells us something about the rate of growth of the sequence. If a sequence is growing at a constant rate, then the formula for the sequence will be linear. If the sequence is growing at a rate which itself is growing at a constant rate, then the formula is quadratic. You have seen this elsewhere: if a function has a constant second derivative (rate of change) then the function must be quadratic.
This works in general:
Finite Differences
The closed formula for a sequence will be a degree k polynomial if and only if the sequence is Δk-constant (i.e., the kth sequence of differences is constant).
This tells us that the sequence of numbers of squares on a chessboard, 1,5,14,30,55,…, which we saw to be Δ3-constant, will have a cubic (degree 3 polynomial) for its closed formula.
Now once we know what format the closed formula for a sequence will take, it is much easier to actually find the closed formula. In the case that the closed formula is a degree k polynomial, we just need k+1 data points to “fit” the polynomial to the data.
Example 2.3.2
Find a formula for the sequence 3,7,14,24,…. Assume a1=3.
- Solution
-
First, check to see if the formula has constant differences at some level. The sequence of first differences is 4,7,10,… which is arithmetic, so the sequence of second differences is constant. The sequence is Δ2-constant, so the formula for an will be a degree 2 polynomial. That is, we know that for some constants a, b, and c,
an=an2+bn+c.Now to find a, b, and c. First, it would be nice to know what a0 is, since plugging in n=0 simplifies the above formula greatly. In this case, a0=2 (work backwards from the sequence of constant differences). Thus
a0=2=a⋅02+b⋅0+c,so c=2. Now plug in n=1 and n=2. We get
a1=3=a+b+2 a2=7=a4+b2+2.At this point we have two (linear) equations and two unknowns, so we can solve the system for a and b (using substitution or elimination or even matrices). We find a=32 and b=−12, so an=32n2−12n+2.
Example 2.3.3
Find a closed formula for the number of squares on an n×n chessboard.
- Solution
-
We have seen that the sequence 1,5,14,30,55,… is Δ3-constant, so we are looking for a degree 3 polynomial. That is,
an=an3+bn2+cn+d.We can find d if we know what a0 is. Working backwards from the third differences, we find a0=0 (unsurprisingly, since there are no squares on a 0×0 chessboard). Thus d=0. Now plug in n=1, n=2, and n=3:
1=a+b+c5=8a+4b+2c14=27a+9b+3c.If we solve this system of equations we get a=13, b=12 and c=16. Therefore the number of squares on an n×n chessboard is an=13n3+12n2+16n.
Note: Since the squares-on-a-chessboard problem is really asking for the sum of squares, we now have a nice formula for n∑k=1k2.
Not all sequences will have polynomials as their closed formula. We can use the theory of finite differences to identify these.
Example 2.3.4
Determine whether the following sequences can be described by a polynomial, and if so, of what degree.
- 1,2,4,8,16,…
- 0,7,50,183,484,1055,…
- 1,1,2,3,5,8,13,…
- Solution
-
- As we saw in Example 2.3.1, this sequence is not Δk-constant for any k. Therefore the closed formula for the sequence is not a polynomial. In fact, we know the closed formula is an=2n, which grows faster than any polynomial (so is not a polynomial).
- The sequence of first differences is 7,43,133,301,571,…. The second differences are: 36,90,168,270,…. Third difference: 54,78,102,…. Fourth differences: 24,24,…. As far as we can tell, this sequence of differences is constant so the sequence is Δ4-constant and as such the closed formula is a degree 4 polynomial.
- This is the Fibonacci sequence. The sequence of first differences is 0,1,1,2,3,5,8,…, the second differences are 1,0,1,1,2,3,5…. We notice that after the first few terms, we get the original sequence back. So there will never be constant differences, so the closed formula for the Fibonacci sequence is not a polynomial.