Loading [MathJax]/jax/output/HTML-CSS/jax.js
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Mathematics LibreTexts

3.1: Invertibility

( \newcommand{\kernel}{\mathrm{null}\,}\)

In previous sections, we have found solutions to linear systems using the Gaussian elimination algorithm. We will now investigate another way of finding solutions to a specific type of equation Ax=b when the matrix A has the same number of rows and columns. To get started, let's look at some familiar examples.

Preview Activity 3.1.1.

  1. Explain how you would solve the equation 3x=5 without using the concept of division.
  2. Find the 2×2 matrix A that rotates vectors counterclockwise by 90.
  3. Find the 2×2 matrix B that rotates vectors clockwise by 90.
  4. What do you expect the product BA to be? Explain the reasoning behind your expectation and then compute BA to verify it.
  5. Solve the equation Ax=(32) using Gaussian elimination.
  6. Explain why your solution may also be found by computing x=B(32).

Invertible matrices

The preview activity began with a familiar type of equation, 3x=5, and asked for a strategy to solve it. One possible response is to divide both sides by 3; instead, let's rephrase this as multiplying by 31=13, the multiplicative inverse of 3.

Now that we are interested in solving equations of the form Ax=b, we might try to find a similar approach. Is there a matrix A1 that plays the role of the multiplicative inverse? Of course, we can't expect every matrix to have a multiplicative inverse; after all, the real number 0 doesn't have an inverse. We will see, however, that many matrices do.

Definition 3.1.1

An n×n matrix A is called invertible if there is a matrix B such that BA=In, where In is the n×n identity matrix. The matrix B is called the inverse of A and denoted A1.

In the preview activity, we considered the matrices

A=[0110],B=[0110],

since A rotates vectors in R2 by 90 and B rotates vectors by 90. It's easy to check that

BA=[0110][0110]=[1001]=I.

This shows that B=A1.

The preview also indicates the use of matrix inverses. Since we have A1A=I, we can solve the equation Ax=b by multiplying both sides on the left by A1:

A1(Ax)=A1b(A1A)x=A1bIx=A1bx=A1b.

Notice that this is similar to finding the solution to 3x=5 as x=135.

Activity 3.1.2.

Let's consider the matrices

A=[102221111],B=[124113012].
  1. Define these matrices in Sage and verify that BA=I so that B=A1.
  2. Find the solution to the equation Ax=(414) using A1.
  3. Using your Sage cell above, multiply A and B in the opposite order; that is, what do you find when you evaluate AB?
  4. Suppose that A is an n×n invertible matrix with inverse A1. This means that every equation of the form Ax=b has a solution, namely, x=A1b. What can you conclude about the span of the columns of A?
  5. What can you conclude about the pivot positions of the matrix A?
  6. If A is an invertible 4×4 matrix, what is its reduced row echelon form?

This activity demonstrates a few important things. First, we said that A is invertible if there is a matrix B such that BA=I. In general, multiplying matrices requires care because the product depends on the order in which the matrices are multiplied. However, in this case, we can check that BA=I implies that AB=I as well. This means that B is also invertible and that A=B1. This is the subject of Exercise 3.1.5.9.

Also, if the matrix A is invertible, then every equation Ax=b has a solution x=A1b. This means that the span of the columns of A is Rn so that A has a pivot in every row. Since the matrix A has n rows and n columns, there must be a pivot in every row and every column. Therefore, the reduced row echelon form of A is

A[100010001]=I.

This provides us with a useful characterization of invertible matrices.

Constructing a matrix inverse

We have seen that an invertible matrix A has the property that its reduced row echelon form is the identity; that is, AI. Here, we will use this fact to construct the inverse of a matrix A.

Activity 3.1.3.

In this activity, we will begin with the matrix

A=[1213]

and construct its inverse A1. For the time being, let's denote the inverse by B so that B=A1.

  1. We know that AB=I. If we write B=[b1b2], then we have
    AB=[Ab1Ab2]=[e1e2]=I.

    This means that we need to solve the equations

    Ab1=e1Ab2=e2.

    Using the Sage cell below, solve these equations for the columns of B.

  2. What is the matrix B? Check that AB=I and BA=I.
  3. To find the columns of B, we solved two equations, Ab1=e1 and Ab2=e2. We could do this by augmenting A two separate times, forming matrices
    [Ae1][Ae2]

    and finding their reduced row echelon forms. But instead of solving these two equations separately, we could also solve them together by forming the augmented matrix [Ae1e2] and finding the row reduced echelon form. In other words, we augment A by the matrix I to form [AI].

    Form this augmented matrix and find its reduced row echelon form to find A1.

    Assuming A is invertible, we have shown that

    [AI][IA1].
  4. If you have defined a matrix A in Sage, you can find it's inverse as A.inverse(). Use Sage to find the inverse of the matrix
    A=[121156546].
  5. What happens when we try to find the inverse of the matrix
    [4221]?
  6. Suppose that n×n matrices C and D are both invertible. What do you find when you simplify the product (D1C1)(CD)? Explain why the product CD is invertible and (CD)1=D1C1.

Finding the inverse of an n×n matrix A requires us to solve n equations. If we write the inverse as

B=[b1b2bn],

then we need to solve

Ab1=e1Ab2=e2Abn=en.

We can, of course, solve each equation separately, but it is more efficient to bundle the equations together by forming the augmented matrix [AI] and finding its row reduced echelon form. We then find

[AI]=[Ae1e2en][Ib1b2bn]=[IA1].

We saw earlier that, if A has an inverse, then AI. We have now seen that, if AI, then A has an inverse.

Finally, we see that the product of two invertible matrices A and B is also invertible. This is because

(B1A1)(AB)=B1(A1A)B=B1IB=B1B=I.

Therefore, we have (AB)1=B1A1. Because the matrix product depends on the order in which we multiply matrices, use care when applying this relationship. The inverse of a product is the product of the inverses with the order of multiplication reversed.

Properties of invertible matrices.

  • An n×n matrix A is invertible if and only if AI.
  • If A is invertible, then the solution to the equation Ax=b is given by x=A1b.
  • We can find A1 by finding the reduced row echelon form of [AI]; namely,
    [AI][IA1].
  • If A and B are two invertible n×n matrices, then their product AB is also invertible and (AB)1=B1A1.

There is a simple formula for finding the inverse of a 2×2 matrix:

[abcd]1=1adbc[dbca],

which can be easily checked. The condition that A be invertible is, in this case, reduced to the condition that adbc0. We will understand this condition better once we have explored determinants in Section 3.4. There is a similar formula for the inverse of a 3×3 matrix, but there is not a good reason to write it here.

Triangular matrices and Gaussian elimination

Generally speaking, solving an equation Ax=b by first finding A1 and then evaluating x=A1b is not the best strategy since row reducing the augmented matrix [Ab] involves considerably less work. This becomes clear once we remember that finding the inverse A1 requires us to solve n equations of this form.

For the class of triangular matrices, however, finding inverses is relatively efficient and useful, as we will see in Section 5.1.

Definition 3.1.2

We say that a matrix A is lower triangular if all its entries above the diagaonal are zero. Similarly, A is upper triangular if all the entries below the diagonal are zero.

For example, the matrix L below is a lower triangular matrix while U is an upper triangular one.

L=[000000],U=[000000].

We can develop a simple test to determine whether an n×n lower triangular matrix is invertible. Let's use Gaussian elimination to find the reduced row echelon form of the lower triangular matrix

[100220344][100020044][100020004][100010001].

Because the entries on the diagonal are nonzero, we find a pivot position in every row, which tells us that the matrix is invertible. If, however, there is a zero entry on the diagonal, the matrix cannot be invertible. Considering the matrix below, we see that having a zero on the diagonal leads to a row without a pivot position.

[100200344][100000044][100011000].
Proposition 3.1.3.

An n×n triangular matrix is invertible if and only if the entries on the diagonal are all nonzero.

Up to this point, our primary tool for studying linear systems, sets of vectors, and matrices has been Gaussian elimination. As the next activity demonstrates, we can express the row operations performed in Gaussian elimination in terms of matrix multiplication. In Section 5.1, we will use this observation to create an efficient way to solve equations of the form Ax=b.

Activity 3.1.4.

As an example, we will consider the matrix

A=[121202121].

When performing Gaussian elimination on A, we first apply a row replacement operation in which we multiply the first row by 2 and add to the second row. After this step, we have a new matrix A1.

A=[121202121][121044121]=A1.
  1. Show that multiplying A by the lower triangular matrix
    L1=[100210001]

    has the same effect as this row operation; that is, show that L1A=A1.

  2. Explain why L1 is invertible and find its inverse L11.
  3. You should see that there is a simple relationship between L1 and L11. Describe this relationship and explain why it holds.
  4. To continue the Gaussian elimination algorithm, we need to apply two more row replacements to bring A into a triangular form U where
    A=[121202121][121044004]=U.

    Find the matrices L2 and L3 that perform these row replacement operations so that L3L2L1A=U.

  5. Explain why the matrix product L3L2L1 is invertible and use this fact to write A=LU. What is the matrix L that you find? Why do you think we denote it by L?
  6. Row replacement operations may always be performed by multiplying by a lower triangular matrix. It turns out the other two row operations, scaling and interchange, may also be performed using matrix multiplication. For instance, consider the two matrices
    S=[100030001],P=[001010100].

    Show that multiplying A by S performs a scaling operation and that multiplying by P performs a row interchange.

  7. Explain why the matrices S and P are invertible and state their inverses.

We will demonstrate the ideas in this activity again using the matrix

A=[132363202].

After performing three row replacement operations, we find the row equivalent upper triangular matrix U:

A=[132363201][132033201]=A1[132033063]=A2[132033003]=U.

The first row replacement operation multiplies the first row by 3 and adds the result to the second row. We can perfom this operation by multiplying A by the lower triangular matrix L1 where

L1A=[100310001][132363202]=[132033201]=A1.

The next two row replacement operations are performed by the matrices

L2=[100010201],L3=[100010021]

so that L3L2L1A=U.

Notice that the inverse of L1 has the simple form:

L1=[100310001],L11=[100310001].

This makes sense; if we want to undo the operation of multiplying the first row by 3 and adding to the second row, we should multiply the first row by 3 and add it to the second row. This is the effect of L11.

The other row operations we use in implementing Gaussian elimination can also be performed by multiplying by an invertible matrix. In particular, if we scale a row by a nonzero number s, we can undo this operation by scaling by 1s. This leads to the invertible diagonal matrices, such as

S=[s00010001],S1=[1s00010001].

Similarly, a row interchange leads to a matrix P, which is its own inverse. An example is

P=[010100001]=P1.

Summary

In this section, we found conditions guaranteeing that a matrix has an inverse. When these conditions hold, we also found an algorithm for finding the inverse.

  • The n×n matrix A is invertible if and only if it is row equivalent to In, the n×n identity matrix.
  • If a matrix A is invertible, then the solution to the equation Ax=b is x=A1b.
  • If a matrix A is invertible, we can use Gaussian elimination to find its inverse:
    [AI][IA1].
  • The row operations used in performing Gaussian elimination can be performed by multiplying by invertible matrices. More specifically, a row replacement operation may be performed by multiplying by an invertible lower triangular matrix.

Exercises 3.1.5Exercises

1

Consider the matrix

A=[3114023121023012].
  1. Explain why A has an inverse.
  2. Find the inverse of A by augmenting by the identity I to form [AI].
  3. Use your inverse to solve the equation Ax=(3231).
2

In this exercise, we will consider 2×2 matrices as defining linear transformations.

  1. Write the matrix A that performs a 45 rotation. What geometric operation undoes this rotation? Find the matrix that perform this operation and verify that it is A1.
  2. Write the matrix A that performs a 180 rotation. Verify that A2=I so that A1=A, and explain geometrically why this is the case.
  3. Find three more matrices A that satisfy A2=I.
3

Suppose that A is an n×n matrix.

  1. Suppose that A2=AA is invertible with inverse B. This means that BA2=BAA=I. Explain why A must be invertible with inverse BA.
  2. Suppose that A100 is invertible with inverse B. Explain why A is invertible. What is A1 in terms of A and B?
4

Our definition of an invertible matrix requires that A be a square n×n matrix. Let's examine what happens when A is not square. For instance, suppose that

A=[112130],B=[221121].
  1. Verify that BA=I2. In this case, we say that B is a left inverse of A.
  2. If A has a left inverse B, we can still use it to find solutions to linear equations. If we know there is a solution to the equation Ax=b, we can multiply both sides of the equation by B to find x=Bb.

    Suppose you know there is a solution to the equation Ax=(136). Use the left inverse B to find x and verify that it is a solution.

  3. Now consider the matrix
    C=[110210]

    and verify that C is also a left inverse of A. This shows that the matrix A may have more than one left inverse.

  4. When A is a square matrix, we said that BA=I implies that AB=I. In this problem, we have a non-square matrix A with BA=I. What happens when we compute AB?
5

If a matrix A is invertible, there is a sequence of row operations that transform A into the identity matrix I. We have seen that every row operation can be performed by matrix multiplication. If the jth step in the Gaussian elimination process is performed by multiplying by Ej, then we have

EpE2E1A=I,

which means that

A1=EpE2E1.

For each of the following matrices, find a sequence of row operations that transforms the matrix to the identity I. Write the matrices Ej that perform the steps and use them to find A1.

  1. A=[020300001].
  2. A=[1000210003100021].
  3. A=[111011002].
6

Determine whether the following statements are true or false and explain your reasoning.

  1. If A is invertible, then the columns of A are linearly independent.
  2. If A is a square matrix whose diagonal entries are all nonzero, then A is invertible.
  3. If A is an invertible n×n matrix, then the columns of A span Rn.
  4. If A is invertible, then there is a nontrivial solution to the homogeneous equation Ax=0.
  5. If A is an n×n matrix and the equation Ax=b has a solution for every vector b, then A is invertible.
7

Provide a justification for your response to the following questions.

  1. Suppose that A is a square matrix with two identical columns. Can A be invertible?
  2. Suppose that A is a square matrix with two identical rows. Can A be invertible?
  3. Suppose that A is an invertible matrix and that AB=AC. Can you conclude that B=C?
  4. Suppose that A is an invertible n×n matrix. What can you say about the span of the columns of A1?
  5. Suppose that A is an invertible matrix and that B is row equivalent to A. Can you guarantee that B is invertible?
8

Suppose that we start with the 3×3 matrix A and perform the following sequence of row operations:

  1. Multiply row 1 by -2 and add to row 2.
  2. Multiply row 1 by 4 and add to row 3.
  3. Scale row 2 by 1/2.
  4. Multiply row 2 by -1 and add to row 3.

Suppose we arrive at the upper triangular matrix

U=[321013004].
  1. Write the matrices E1, E2, E3, and E4 that perform the four row operations.
  2. Find the matrix E=E4E3E2E1.
  3. We then have E4E3E2E1A=EA=U. Now that we have the matrix E, find the original matrix A=E1U.
9

We defined an n×n matrix to be invertible if there is a matrix B such that BA=In. In this exercise, we will explain why B is also invertible and that AB=I. This means that, if B=A1, then A=B1.

  1. Given the fact that BA=In, explain why the matrix B must also be a square n×n matrix.
  2. Suppose that b is a vector in Rn. Since we have BA=I, it follows that B(Ab)=b. Use this to explain why the columns of B span Rn. What does this say about the pivot positions of B?
  3. Explain why the equation Bx=0 has only the trivial solution.
  4. Beginning with the equation, BA=I, multiply both sides by B to obtain BAB=B. We will rearrange this equation:
    BAB=BBABB=0B(ABI)=0.

    Since the homogeneous equation Bx=0 has only the trivial solution, explain why ABI=0 and therefore, AB=I.


This page titled 3.1: Invertibility is shared under a CC BY 4.0 license and was authored, remixed, and/or curated by David Austin via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?