3.1: Invertibility
In previous sections, we have found solutions to linear systems using the Gaussian elimination algorithm. We will now investigate another way of finding solutions to a specific type of equation Ax=b when the matrix A has the same number of rows and columns. To get started, let's look at some familiar examples.
Preview Activity 3.1.1.
- Explain how you would solve the equation 3x=5 without using the concept of division.
- Find the \(2\times2\) matrix \(A\) that rotates vectors counterclockwise by \(90^\circ\).
- Find the \(2\times2\) matrix \(B\) that rotates vectors clockwise by \(90^\circ\).
- What do you expect the product \(BA\) to be? Explain the reasoning behind your expectation and then compute \(BA\) to verify it.
- Solve the equation \(Ax=\begin{bmatrix}3\\-2\end{bmatrix}\) using Gaussian elimination.
- Explain why your solution may also be found by computing \(x=B\begin{bmatrix}3\\-2\end{bmatrix}\).
Invertible matrices
The preview activity began with a familiar type of equation, \(3x=5\), and asked for a strategy to solve it. One possible response is to divide both sides by 3; instead, let's rephrase this as multiplying by \(3^{-1}=\frac13\), the multiplicative inverse of 3.
Now that we are interested in solving equations of the form \(Ax=b\), we might try to find a similar approach. Is there a matrix \(A^{-1}\) that plays the role of the multiplicative inverse? Of course, we can't expect every matrix to have a multiplicative inverse; after all, the real number 0 doesn't have an inverse. We will see, however, that many matrices do.
An \(n\times n\) matrix \(A\) is called invertible if there is a matrix \(B\) such that \(BA=I_n\), where \(I_n\) is the \(n\times n\) identity matrix. The matrix \(B\) is called the inverse of \(A\) and denoted \(A^{-1}\).
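In the preview activity, we considered the matrices
\[A=\begin{bmatrix}0&-1\\1&0\end{bmatrix},\qquad B=\begin{bmatrix}0&1\\-1&0\end{bmatrix},\]
since \(A\) rotates vectors in \(\mathbb R^2\) by \(90^\circ\) and \(B\) rotates vectors by \(-90^\circ\). It's easy to check that
\[BA=\begin{bmatrix}0&1\\-1&0\end{bmatrix}\begin{bmatrix}0&-1\\1&0\end{bmatrix}=\begin{bmatrix}1&0\\0&1\end{bmatrix}=I.\]
This shows that \(B=A^{-1}\).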
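The preview also indicates the use of matrix inverses. Since we have \(A^{-1}A=I\), we can solve the equation \(Ax=b\) by multiplying both sides on the left by \(A^{-1}\):
\[\begin{aligned}A^{-1}(Ax)&=A^{-1}b\\Ix&=A^{-1}b\\x&=A^{-1}b.\end{aligned}\]
Notice that this is similar to finding the solution to \(3x=5\) as \(x=\frac13\cdot5\).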
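The rotation example can be checked numerically. The following is a minimal sketch in plain Python rather than Sage, so that it runs without any extra software; the helper `matmul` is a name introduced here for illustration and is not part of the text.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[0, -1],
     [1,  0]]    # rotates vectors counterclockwise by 90 degrees
B = [[0,  1],
     [-1, 0]]    # rotates vectors clockwise by 90 degrees

identity = [[1, 0], [0, 1]]
BA = matmul(B, A)          # rotating one way and then back changes nothing

# Solve A x = b by multiplying both sides on the left by B = A^{-1}.
b = [[3], [-2]]            # column vector from the preview activity
x = matmul(B, b)
Ax = matmul(A, x)          # should reproduce b

print("BA =", BA)
print("x  =", x)
print("Ax =", Ax)
```

Multiplying by B plays the same role here that multiplying by 1/3 plays in solving 3x=5.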
Activity 3.1.2.
Let's consider the matrices
- Define these matrices in Sage and verify that \(BA=I\) so that \(B=A^{-1}\).
- Find the solution to the equation Ax=(4−14) using A−1.
- Using your Sage cell above, multiply A and B in the opposite order; that is, what do you find when you evaluate AB?
- Suppose that \(A\) is an \(n\times n\) invertible matrix with inverse \(A^{-1}\). This means that every equation of the form \(Ax=b\) has a solution, namely, \(x=A^{-1}b\). What can you conclude about the span of the columns of \(A\)?
- What can you conclude about the pivot positions of the matrix A?
- If A is an invertible 4×4 matrix, what is its reduced row echelon form?
This activity demonstrates a few important things. First, we said that \(A\) is invertible if there is a matrix \(B\) such that \(BA=I\). In general, multiplying matrices requires care because the product depends on the order in which the matrices are multiplied. However, in this case, we can check that \(BA=I\) implies that \(AB=I\) as well. This means that \(B\) is also invertible and that \(A=B^{-1}\). This is the subject of Exercise 3.1.5.9.
Also, if the matrix \(A\) is invertible, then every equation \(Ax=b\) has a solution \(x=A^{-1}b\). This means that the span of the columns of \(A\) is \(\mathbb R^n\) so that \(A\) has a pivot in every row. Since the matrix \(A\) has \(n\) rows and \(n\) columns, there must be a pivot in every row and every column. Therefore, the reduced row echelon form of \(A\) is the identity matrix:
\[A\sim\begin{bmatrix}1&0&\cdots&0\\0&1&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&1\end{bmatrix}=I_n.\]
This provides us with a useful characterization of invertible matrices.
Constructing a matrix inverse
We have seen that an invertible matrix A has the property that its reduced row echelon form is the identity; that is, A∼I. Here, we will use this fact to construct the inverse of a matrix A.
Activity 3.1.3.
In this activity, we will begin with the matrix
and construct its inverse \(A^{-1}\). For the time being, let's denote the inverse by \(B\) so that \(B=A^{-1}\).
- We know that \(AB=I\). If we write \(B=\begin{bmatrix}b_1&b_2\end{bmatrix}\), then we have
\[AB=\begin{bmatrix}Ab_1&Ab_2\end{bmatrix}=\begin{bmatrix}e_1&e_2\end{bmatrix}=I.\]
This means that we need to solve the equations
\[Ab_1=e_1,\qquad Ab_2=e_2.\]
Using the Sage cell below, solve these equations for the columns of \(B\).
- What is the matrix B? Check that AB=I and BA=I.
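- To find the columns of \(B\), we solved two equations, \(Ab_1=e_1\) and \(Ab_2=e_2\). We could do this by augmenting \(A\) two separate times, forming the matrices
\[\left[\begin{array}{c|c}A&e_1\end{array}\right],\qquad\left[\begin{array}{c|c}A&e_2\end{array}\right]\]
and finding their reduced row echelon forms. But instead of solving these two equations separately, we could also solve them together by forming the augmented matrix \(\left[\begin{array}{c|cc}A&e_1&e_2\end{array}\right]\) and finding its reduced row echelon form. In other words, we augment \(A\) by the matrix \(I\) to form \(\left[\begin{array}{c|c}A&I\end{array}\right]\).
Form this augmented matrix and find its reduced row echelon form to find \(A^{-1}\).
Assuming \(A\) is invertible, we have shown that
\[\left[\begin{array}{c|c}A&I\end{array}\right]\sim\left[\begin{array}{c|c}I&A^{-1}\end{array}\right].\]
- If you have defined a matrix A in Sage, you can find its inverse as A.inverse(). Use Sage to find the inverse of the matrix
\[A=\begin{bmatrix}1&-2&-1\\-1&5&6\\5&-4&6\end{bmatrix}.\]
- What happens when we try to find the inverse of the matrix
\[\begin{bmatrix}-4&2\\-2&1\end{bmatrix}?\]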
- Suppose that \(n\times n\) matrices \(C\) and \(D\) are both invertible. What do you find when you simplify the product \((D^{-1}C^{-1})(CD)\)? Explain why the product \(CD\) is invertible and \((CD)^{-1}=D^{-1}C^{-1}\).
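Finding the inverse of an \(n\times n\) matrix \(A\) requires us to solve \(n\) equations. If we write the inverse as
\[A^{-1}=B=\begin{bmatrix}b_1&b_2&\cdots&b_n\end{bmatrix},\]
then we need to solve
\[Ab_1=e_1,\quad Ab_2=e_2,\quad\ldots,\quad Ab_n=e_n.\]
We can, of course, solve each equation separately, but it is more efficient to bundle the equations together by forming the augmented matrix \(\left[\begin{array}{c|c}A&I\end{array}\right]\) and finding its reduced row echelon form. We then find
\[\left[\begin{array}{c|c}A&I\end{array}\right]\sim\left[\begin{array}{c|c}I&A^{-1}\end{array}\right].\]
We saw earlier that, if \(A\) has an inverse, then \(A\sim I\). We have now seen that, if \(A\sim I\), then \(A\) has an inverse.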
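The procedure of augmenting by the identity and row reducing can be sketched in code. This is an illustrative plain-Python version, not the routine Sage uses internally; the function names `inverse` and `matmul` are assumptions made for this example. It uses exact fractions to avoid rounding, and it is applied to the two matrices that appear in Activity 3.1.3.

```python
from fractions import Fraction

def inverse(A):
    """Row reduce [A | I]; return the right-hand block, or None if A is not invertible."""
    n = len(A)
    # Build the augmented matrix [A | I] with exact rational entries.
    M = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(A)]
    for col in range(n):
        # Look for a nonzero entry in this column, at or below the diagonal.
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return None                      # a column without a pivot
        M[col], M[pivot] = M[pivot], M[col]  # row interchange
        p = M[col][col]
        M[col] = [v / p for v in M[col]]     # scale so the pivot is 1
        for r in range(n):
            if r != col and M[r][col] != 0:  # row replacement clears the column
                f = M[r][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    # The left block is now I, so the right block is the inverse.
    return [row[n:] for row in M]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[1, -2, -1],
     [-1, 5, 6],
     [5, -4, 6]]
A_inv = inverse(A)
I3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# This matrix has no pivot in its second column, so no inverse is found.
no_inverse = inverse([[-4, 2],
                      [-2, 1]])

print(A_inv)
print(no_inverse)
```

Returning None for the second matrix reflects the characterization above: a matrix that is not row equivalent to the identity has no inverse.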
Finally, we see that the product of two invertible matrices \(A\) and \(B\) is also invertible. This is because
\[(B^{-1}A^{-1})(AB)=B^{-1}(A^{-1}A)B=B^{-1}IB=B^{-1}B=I.\]
Therefore, we have \((AB)^{-1}=B^{-1}A^{-1}\). Because the matrix product depends on the order in which we multiply matrices, use care when applying this relationship. The inverse of a product is the product of the inverses with the order of multiplication reversed.
Properties of invertible matrices.
- An n×n matrix A is invertible if and only if A∼I.
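- If \(A\) is invertible, then the solution to the equation \(Ax=b\) is given by \(x=A^{-1}b\).
- We can find \(A^{-1}\) by finding the reduced row echelon form of \(\left[\begin{array}{c|c}A&I\end{array}\right]\); namely,
\[\left[\begin{array}{c|c}A&I\end{array}\right]\sim\left[\begin{array}{c|c}I&A^{-1}\end{array}\right].\]
- If \(A\) and \(B\) are two invertible \(n\times n\) matrices, then their product \(AB\) is also invertible and \((AB)^{-1}=B^{-1}A^{-1}\).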
There is a simple formula for finding the inverse of a \(2\times2\) matrix:
\[\begin{bmatrix}a&b\\c&d\end{bmatrix}^{-1}=\frac{1}{ad-bc}\begin{bmatrix}d&-b\\-c&a\end{bmatrix},\]
which can be easily checked. The condition that \(A\) be invertible is, in this case, reduced to the condition that \(ad-bc\neq0\). We will understand this condition better once we have explored determinants in Section 3.4. There is a similar formula for the inverse of a \(3\times3\) matrix, but there is not a good reason to write it here.
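As a quick check of the \(2\times2\) formula and of the rule that the inverse of a product reverses the order, here is a small plain-Python sketch. The function names and the two sample matrices are made up for illustration; they do not come from the text.

```python
from fractions import Fraction

def inverse_2x2(M):
    """Invert [[a, b], [c, d]] with integer entries using the ad - bc formula."""
    (a, b), (c, d) = M
    det = a * d - b * c
    if det == 0:
        raise ValueError("ad - bc = 0, so the matrix is not invertible")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

# Two illustrative invertible matrices (hypothetical examples).
A = [[2, 1],
     [5, 3]]
B = [[1, 2],
     [3, 4]]

A_inv = inverse_2x2(A)
B_inv = inverse_2x2(B)

lhs = inverse_2x2(matmul(A, B))          # (AB)^{-1}
reversed_order = matmul(B_inv, A_inv)    # B^{-1} A^{-1}
same_order = matmul(A_inv, B_inv)        # A^{-1} B^{-1}, generally different

print(lhs == reversed_order)
print(lhs == same_order)
```

For these particular matrices the product of the inverses in the original order does not give the inverse of AB, which illustrates why the order must be reversed.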
Triangular matrices and Gaussian elimination
Generally speaking, solving an equation \(Ax=b\) by first finding \(A^{-1}\) and then evaluating \(x=A^{-1}b\) is not the best strategy since row reducing the augmented matrix \(\left[\begin{array}{c|c}A&b\end{array}\right]\) involves considerably less work. This becomes clear once we remember that finding the inverse \(A^{-1}\) requires us to solve \(n\) equations of this form.
For the class of triangular matrices, however, finding inverses is relatively efficient and useful, as we will see in Section 5.1.
We say that a matrix \(A\) is lower triangular if all its entries above the diagonal are zero. Similarly, \(A\) is upper triangular if all the entries below the diagonal are zero.
For example, the matrix L below is a lower triangular matrix while U is an upper triangular one.
We can develop a simple test to determine whether an n×n lower triangular matrix is invertible. Let's use Gaussian elimination to find the reduced row echelon form of the lower triangular matrix
Because the entries on the diagonal are nonzero, we find a pivot position in every row, which tells us that the matrix is invertible. If, however, there is a zero entry on the diagonal, the matrix cannot be invertible. Considering the matrix below, we see that having a zero on the diagonal leads to a row without a pivot position.
An n×n triangular matrix is invertible if and only if the entries on the diagonal are all nonzero.
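The diagonal test is simple enough to express directly in code. The following plain-Python sketch uses function names and sample matrices that are invented for illustration; only the test itself comes from the statement above.

```python
def is_lower_triangular(M):
    """True when every entry above the diagonal is zero."""
    n = len(M)
    return all(M[i][j] == 0 for i in range(n) for j in range(i + 1, n))

def is_upper_triangular(M):
    """True when every entry below the diagonal is zero."""
    n = len(M)
    return all(M[i][j] == 0 for i in range(n) for j in range(i))

def triangular_is_invertible(M):
    """Apply the diagonal test; it is only valid for triangular matrices."""
    if not (is_lower_triangular(M) or is_upper_triangular(M)):
        raise ValueError("the diagonal test only applies to triangular matrices")
    return all(M[i][i] != 0 for i in range(len(M)))

# Hypothetical examples: nonzero diagonal, then a zero on the diagonal.
L_nonzero_diagonal = [[1, 0, 0],
                      [-2, -2, 0],
                      [3, -5, -4]]
L_zero_on_diagonal = [[1, 0, 0],
                      [-2, 0, 0],
                      [3, -5, -4]]
U_nonzero_diagonal = [[2, 1, 5],
                      [0, 3, -1],
                      [0, 0, 7]]

print(triangular_is_invertible(L_nonzero_diagonal))
print(triangular_is_invertible(L_zero_on_diagonal))
print(triangular_is_invertible(U_nonzero_diagonal))
```

The function refuses non-triangular input on purpose: a general square matrix with nonzero diagonal entries need not be invertible.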
Up to this point, our primary tool for studying linear systems, sets of vectors, and matrices has been Gaussian elimination. As the next activity demonstrates, we can express the row operations performed in Gaussian elimination in terms of matrix multiplication. In Section 5.1, we will use this observation to create an efficient way to solve equations of the form Ax=b.
Activity 3.1.4.
As an example, we will consider the matrix
\[A=\begin{bmatrix}1&2&1\\2&0&-2\\-1&2&-1\end{bmatrix}.\]
When performing Gaussian elimination on \(A\), we first apply a row replacement operation in which we multiply the first row by \(-2\) and add it to the second row. After this step, we have a new matrix \(A_1\).
- Show that multiplying \(A\) by the lower triangular matrix
\[L_1=\begin{bmatrix}1&0&0\\-2&1&0\\0&0&1\end{bmatrix}\]
has the same effect as this row operation; that is, show that \(L_1A=A_1\).
- Explain why \(L_1\) is invertible and find its inverse \(L_1^{-1}\).
- You should see that there is a simple relationship between \(L_1\) and \(L_1^{-1}\). Describe this relationship and explain why it holds.
- To continue the Gaussian elimination algorithm, we need to apply two more row replacements to bring \(A\) into a triangular form \(U\) where
\[A=\begin{bmatrix}1&2&1\\2&0&-2\\-1&2&-1\end{bmatrix}\sim\begin{bmatrix}1&2&1\\0&-4&-4\\0&0&-4\end{bmatrix}=U.\]
Find the matrices \(L_2\) and \(L_3\) that perform these row replacement operations so that \(L_3L_2L_1A=U\).
- Explain why the matrix product \(L_3L_2L_1\) is invertible and use this fact to write \(A=LU\). What is the matrix \(L\) that you find? Why do you think we denote it by \(L\)?
- Row replacement operations may always be performed by multiplying by a lower triangular matrix. It turns out the other two row operations, scaling and interchange, may also be performed using matrix multiplication. For instance, consider the two matrices
\[S=\begin{bmatrix}1&0&0\\0&3&0\\0&0&1\end{bmatrix},\qquad P=\begin{bmatrix}0&0&1\\0&1&0\\1&0&0\end{bmatrix}.\]
Show that multiplying A by S performs a scaling operation and that multiplying by P performs a row interchange.
- Explain why the matrices S and P are invertible and state their inverses.
We will demonstrate the ideas in this activity again using the matrix
After performing three row replacement operations, we find the row equivalent upper triangular matrix U:
The first row replacement operation multiplies the first row by 3 and adds the result to the second row. We can perform this operation by multiplying \(A\) by the lower triangular matrix \(L_1\) where
\[L_1=\begin{bmatrix}1&0&0\\3&1&0\\0&0&1\end{bmatrix}.\]
The next two row replacement operations are performed by the matrices
so that \(L_3L_2L_1A=U\).
Notice that the inverse of \(L_1\) has the simple form:
\[L_1^{-1}=\begin{bmatrix}1&0&0\\-3&1&0\\0&0&1\end{bmatrix}.\]
This makes sense; if we want to undo the operation of multiplying the first row by 3 and adding to the second row, we should multiply the first row by \(-3\) and add it to the second row. This is the effect of \(L_1^{-1}\).
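The same pattern can be verified numerically. The sketch below uses the matrix A and the matrix L1 from Activity 3.1.4 rather than the matrix in this discussion; the matrices L2 and L3 are one choice of row replacements that reaches the stated U, worked out for this example, and the helper `matmul` is a name introduced here. It is written in plain Python so that it runs without Sage.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[1, 2, 1],
     [2, 0, -2],
     [-1, 2, -1]]

# Each row replacement is an identity matrix with one extra multiplier.
L1 = [[1, 0, 0], [-2, 1, 0], [0, 0, 1]]   # add -2 * row 1 to row 2
L2 = [[1, 0, 0], [0, 1, 0], [1, 0, 1]]    # add  1 * row 1 to row 3
L3 = [[1, 0, 0], [0, 1, 0], [0, 1, 1]]    # add  1 * row 2 to row 3

U = matmul(L3, matmul(L2, matmul(L1, A)))

# Undoing a row replacement negates the multiplier.
L1_inv = [[1, 0, 0], [2, 1, 0], [0, 0, 1]]
L2_inv = [[1, 0, 0], [0, 1, 0], [-1, 0, 1]]
L3_inv = [[1, 0, 0], [0, 1, 0], [0, -1, 1]]

# Since L3 L2 L1 A = U, we have A = L U with L = L1^{-1} L2^{-1} L3^{-1}.
L = matmul(L1_inv, matmul(L2_inv, L3_inv))

print("U =", U)
print("L =", L)
print("LU == A:", matmul(L, U) == A)
```

Note that L collects the negated multipliers below its diagonal, so it is itself lower triangular.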
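The other row operations we use in implementing Gaussian elimination can also be performed by multiplying by an invertible matrix. In particular, if we scale a row by a nonzero number \(s\), we can undo this operation by scaling by \(\frac1s\). This leads to the invertible diagonal matrices, such as
\[S=\begin{bmatrix}1&0&0\\0&s&0\\0&0&1\end{bmatrix},\qquad S^{-1}=\begin{bmatrix}1&0&0\\0&\frac1s&0\\0&0&1\end{bmatrix}.\]
Similarly, a row interchange leads to a matrix \(P\), which is its own inverse. An example is
\[P=\begin{bmatrix}0&0&1\\0&1&0\\1&0&0\end{bmatrix}.\]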
Summary
In this section, we found conditions guaranteeing that a matrix has an inverse. When these conditions hold, we also found an algorithm for finding the inverse.
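- The \(n\times n\) matrix \(A\) is invertible if and only if it is row equivalent to \(I_n\), the \(n\times n\) identity matrix.
- If a matrix \(A\) is invertible, then the solution to the equation \(Ax=b\) is \(x=A^{-1}b\).
- If a matrix \(A\) is invertible, we can use Gaussian elimination to find its inverse:
\[\left[\begin{array}{c|c}A&I\end{array}\right]\sim\left[\begin{array}{c|c}I&A^{-1}\end{array}\right].\]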
- The row operations used in performing Gaussian elimination can be performed by multiplying by invertible matrices. More specifically, a row replacement operation may be performed by multiplying by an invertible lower triangular matrix.
Exercises 3.1.5
Consider the matrix
- Explain why A has an inverse.
- Find the inverse of \(A\) by augmenting by the identity \(I\) to form \(\left[\begin{array}{c|c}A&I\end{array}\right]\).
- Use your inverse to solve the equation Ax=(32−3−1).
In this exercise, we will consider 2×2 matrices as defining linear transformations.
- Write the matrix \(A\) that performs a \(45^\circ\) rotation. What geometric operation undoes this rotation? Find the matrix that performs this operation and verify that it is \(A^{-1}\).
- Write the matrix \(A\) that performs a \(180^\circ\) rotation. Verify that \(A^2=I\) so that \(A^{-1}=A\), and explain geometrically why this is the case.
- Find three more matrices \(A\) that satisfy \(A^2=I\).
Suppose that A is an n×n matrix.
- Suppose that \(A^2=AA\) is invertible with inverse \(B\). This means that \(BA^2=BAA=I\). Explain why \(A\) must be invertible with inverse \(BA\).
- Suppose that \(A^{100}\) is invertible with inverse \(B\). Explain why \(A\) is invertible. What is \(A^{-1}\) in terms of \(A\) and \(B\)?
Our definition of an invertible matrix requires that A be a square n×n matrix. Let's examine what happens when A is not square. For instance, suppose that
- Verify that \(BA=I_2\). In this case, we say that \(B\) is a left inverse of \(A\).
- If \(A\) has a left inverse \(B\), we can still use it to find solutions to linear equations. If we know there is a solution to the equation \(Ax=b\), we can multiply both sides of the equation by \(B\) to find \(x=Bb\).
Suppose you know there is a solution to the equation \(Ax=\begin{bmatrix}-1\\-3\\6\end{bmatrix}\). Use the left inverse \(B\) to find \(x\) and verify that it is a solution.
- Now consider the matrix
\[C=\begin{bmatrix}1&-1&0\\-2&1&0\end{bmatrix}\]
and verify that \(C\) is also a left inverse of \(A\). This shows that the matrix \(A\) may have more than one left inverse.
- When A is a square matrix, we said that BA=I implies that AB=I. In this problem, we have a non-square matrix A with BA=I. What happens when we compute AB?
If a matrix \(A\) is invertible, there is a sequence of row operations that transforms \(A\) into the identity matrix \(I\). We have seen that every row operation can be performed by matrix multiplication. If the \(j\)th step in the Gaussian elimination process is performed by multiplying by \(E_j\), and there are \(p\) steps in all, then we have
\[E_p\cdots E_2E_1A=I,\]
which means that
\[A^{-1}=E_p\cdots E_2E_1.\]
For each of the following matrices, find a sequence of row operations that transforms the matrix to the identity \(I\). Write the matrices \(E_j\) that perform the steps and use them to find \(A^{-1}\).
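- \[A=\begin{bmatrix}0&2&0\\-3&0&0\\0&0&1\end{bmatrix}.\]
- \[A=\begin{bmatrix}1&0&0&0\\2&1&0&0\\0&-3&1&0\\0&0&2&1\end{bmatrix}.\]
- \[A=\begin{bmatrix}1&1&1\\0&1&1\\0&0&2\end{bmatrix}.\]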
Determine whether the following statements are true or false and explain your reasoning.
- If A is invertible, then the columns of A are linearly independent.
- If A is a square matrix whose diagonal entries are all nonzero, then A is invertible.
- If A is an invertible n×n matrix, then the columns of A span Rn.
- If A is invertible, then there is a nontrivial solution to the homogeneous equation Ax=0.
- If A is an n×n matrix and the equation Ax=b has a solution for every vector b, then A is invertible.
Provide a justification for your response to the following questions.
- Suppose that A is a square matrix with two identical columns. Can A be invertible?
- Suppose that A is a square matrix with two identical rows. Can A be invertible?
- Suppose that A is an invertible matrix and that AB=AC. Can you conclude that B=C?
- Suppose that \(A\) is an invertible \(n\times n\) matrix. What can you say about the span of the columns of \(A^{-1}\)?
- Suppose that A is an invertible matrix and that B is row equivalent to A. Can you guarantee that B is invertible?
Suppose that we start with the 3×3 matrix A and perform the following sequence of row operations:
- Multiply row 1 by -2 and add to row 2.
- Multiply row 1 by 4 and add to row 3.
- Scale row 2 by 1/2.
- Multiply row 2 by -1 and add to row 3.
Suppose we arrive at the upper triangular matrix
- Write the matrices \(E_1\), \(E_2\), \(E_3\), and \(E_4\) that perform the four row operations.
- Find the matrix \(E=E_4E_3E_2E_1\).
- We then have \(E_4E_3E_2E_1A=EA=U\). Now that we have the matrix \(E\), find the original matrix \(A=E^{-1}U\).
We defined an \(n\times n\) matrix \(A\) to be invertible if there is a matrix \(B\) such that \(BA=I_n\). In this exercise, we will explain why \(B\) is also invertible and why \(AB=I\). This means that, if \(B=A^{-1}\), then \(A=B^{-1}\).
- Given the fact that \(BA=I_n\), explain why the matrix \(B\) must also be a square \(n\times n\) matrix.
- Suppose that \(b\) is a vector in \(\mathbb R^n\). Since we have \(BA=I\), it follows that \(B(Ab)=b\). Use this to explain why the columns of \(B\) span \(\mathbb R^n\). What does this say about the pivot positions of \(B\)?
- Explain why the equation \(Bx=0\) has only the trivial solution.
- Beginning with the equation \(BA=I\), multiply both sides on the right by \(B\) to obtain \(BAB=B\). We will rearrange this equation:
\[\begin{aligned}BAB&=B\\BAB-B&=0\\B(AB-I)&=0.\end{aligned}\]
Since the homogeneous equation \(Bx=0\) has only the trivial solution, explain why \(AB-I=0\) and therefore \(AB=I\).