Processing math: 100%
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Mathematics LibreTexts

2.8: Elementary Matrices

( \newcommand{\kernel}{\mathrm{null}\,}\)

We now turn our attention to a special type of matrix called an elementary matrix. An elementary matrix is always a square matrix. Recall the row operations given in Definition 1.3.2. Any elementary matrix, which we often denote by E, is obtained from applying one row operation to the identity matrix of the same size.

For example, the matrix E=[0110] is the elementary matrix obtained from switching the two rows. The matrix E=[100030001] is the elementary matrix obtained from multiplying the second row of the 3×3 identity matrix by 3. The matrix E=[1031] is the elementary matrix obtained from adding 3 times the first row to the third row.

You may construct an elementary matrix from any row operation, but remember that you can only apply one operation.

Consider the following definition.

Definition 2.8.1: Elementary Matrices and Row Operations

Let E be an n×n matrix. Then E is an elementary matrix if it is the result of applying one row operation to the n×n identity matrix In.

Those which involve switching rows of the identity matrix are called permutation matrices.

Therefore, E constructed above by switching the two rows of I2 is called a permutation matrix.

Elementary matrices can be used in place of row operations and therefore are very useful. It turns out that multiplying (on the left hand side) by an elementary matrix E will have the same effect as doing the row operation used to obtain E.

The following theorem is an important result which we will use throughout this text.

Theorem 2.8.1: Multiplication by an Elementary Matrix and Row Operations

To perform any of the three row operations on a matrix A it suffices to take the product EA, where E is the elementary matrix obtained by using the desired row operation on the identity matrix.

Therefore, instead of performing row operations on a matrix A, we can row reduce through matrix multiplication with the appropriate elementary matrix. We will examine this theorem in detail for each of the three row operations given in Definition 1.3.2.

First, consider the following lemma.

Lemma 2.8.1: Action of Permutation Matrix

Let Pij denote the elementary matrix which involves switching the ith and the jth rows. Then Pij is a permutation matrix and PijA=B where B is obtained from A by switching the ith and the jth rows.

We will explore this idea more in the following example.

Example 2.8.1: Switching Rows with an Elementary Matrix

Let P12=[010100001],A=[abgdef]

Find B where B=P12A.

Solution

You can see that the matrix P12 is obtained by switching the first and second rows of the 3×3 identity matrix I.

Using our usual procedure, compute the product P12A=B. The result is given by

B=[gdabef]

Notice that B is the matrix obtained by switching rows 1 and 2 of A. Therefore by multiplying A by P12, the row operation which was applied to I to obtain P12 is applied to A to obtain B.

Theorem 2.8.1 applies to all three row operations, and we now look at the row operation of multiplying a row by a scalar. Consider the following lemma.

Lemma 2.8.2: Multiplication by a Scalar and Elementary Matrices

Let E(k,i) denote the elementary matrix corresponding to the row operation in which the ith row is multiplied by the nonzero scalar, k. Then

E(k,i)A=B

where B is obtained from A by multiplying the ith row of A by k.

We will explore this lemma further in the following example.

Example 2.8.2: Multiplication of a Row by 5 Using Elementary Matrix

Let

E(5,2)=[100050001],A=[abcdef]

Find the matrix B where B=E(5,2)A

Solution

You can see that E(5,2) is obtained by multiplying the second row of the identity matrix by 5.

Using our usual procedure for multiplication of matrices, we can compute the product E(5,2)A. The resulting matrix is given by

B=[ab5c5def]

Notice that B is obtained by multiplying the second row of A by the scalar 5.

There is one last row operation to consider. The following lemma discusses the final operation of adding a multiple of a row to another row.

Lemma 2.8.3: Adding Multiples of Rows and Elementary Matrices

Let E(k×i+j) denote the elementary matrix obtained from I by adding k times the ith row to the jth. Then

E(k×i+j)A=B

where B is obtained from A by adding k times the ith row to the jth row of A.

Consider the following example.

Example 2.8.3: Adding Two Times the First Row to the Last

Let

E(2×1+3)=[100010201],A=[abcdef]

Find B where B=E(2×1+3)A.

Solution

You can see that the matrix E(2×1+3) was obtained by adding 2 times the first row of I to the third row of I.

Using our usual procedure, we can compute the product E(2×1+3)A. The resulting matrix B is given by B=[abcd2a+e2b+f]

You can see that B is the matrix obtained by adding 2 times the first row of A to the third row.

Suppose we have applied a row operation to a matrix A. Consider the row operation required to return A to its original form, to undo the row operation. It turns out that this action is how we find the inverse of an elementary matrix E.

Consider the following theorem.

Theorem 2.8.2: Elementary Matrices and Inverses

Every elementary matrix is invertible and its inverse is also an elementary matrix.

In fact, the inverse of an elementary matrix is constructed by doing the reverse row operation on I. E1 will be obtained by performing the row operation which would carry E back to I.

  • If E is obtained by switching rows i and j, then E1 is also obtained by switching rows i and j.
  • If E is obtained by multiplying row i by the scalar k, then E1 is obtained by multiplying row i by the scalar 1k.
  • If E is obtained by adding k times row i to row j, then E1 is obtained by subtracting k times row i from row j.

Consider the following example.

Example 2.8.4: Inverse of an Elementary Matrix

Let E=[1002]

Find E1.

Solution

Consider the elementary matrix E given by

E=[1002]

Here, E is obtained from the 2×2 identity matrix by multiplying the second row by 2. In order to carry E back to the identity, we need to multiply the second row of E by 12. Hence,

E1 is given by E1=[10012]

We can verify that EE1=I. Take the product EE1, given by

EE1=[1002][10012]=[1001]

This equals I so we know that we have compute E1 properly.

Suppose an m×n matrix A is row reduced to its reduced row-echelon form. By tracking each row operation completed, this row reduction can be completed through multiplication by elementary matrices.

Consider the following definition.

Definition 2.8.2: The Form B=UA

Let A be an m×n matrix and let B be the reduced row-echelon form of A. Then we can write B=UA where U is the product of all elementary matrices representing the row operations done to A to obtain B.

Consider the following example.

Example 2.8.5: The Form B=UA

Let A=[011020]. Find B, the reduced row-echelon form of A and write it in the form B=UA.

Solution

To find B, row reduce A. For each step, we will record the appropriate elementary matrix. First, switch rows 1 and 2.

[011020][100120]

The resulting matrix is equivalent to finding the product of P12=[010100001] and A.

Next, add (2) times row 1 to row 3.

[100120][100100]

This is equivalent to multiplying by the matrix E(2×1+3)=[100010201]. Notice that the resulting matrix is B, the required reduced row-echelon form of A.

We can then write

B=E(2×1+2)(P12A)=(E(2×1+2)P12)A=UA

It remains to find the matrix U.

U=E(2×1+2)P12=[100010201][010100001]=[010100021]

We can verify that B=UA holds for this matrix U: UA=[010100021][011020]=[100100]=B

While the process used in the above example is reliable and simple when only a few row operations are used, it becomes cumbersome in a case where many row operations are needed to carry A to B. The following theorem provides an alternate way to find the matrix U.

Theorem 2.8.3: Finding the Matrix U

Let A be an m×n matrix and let B be its reduced row-echelon form. Then B=UA where U is an invertible m×m matrix found by forming the matrix [A|Im] and row reducing to [B|U].

Let’s revisit the above example using the process outlined in Theorem 2.8.3.

Example 2.8.6: The Form B=UA, Revisited

Let A=[011020]. Using the process outlined in Theorem 2.8.3, find U such that B=UA.

Solution

First, set up the matrix [A|Im]. [011001001020001] Now, row reduce this matrix until the left side equals the reduced row-echelon form of A.

[011001001020001][100100110020001][100100110000021]

The left side of this matrix is B, and the right side is U. Comparing this to the matrix U found above in Example 2.8.5, you can see that the same matrix is obtained regardless of which process is used.

Recall from Algorithm 2.7.1 that an n×n matrix A is invertible if and only if A can be carried to the n×n identity matrix using the usual row operations. This leads to an important consequence related to the above discussion.

Suppose A is an n×n invertible matrix. Then, set up the matrix [A|In] as done above, and row reduce until it is of the form [B|U]. In this case, B=In because A is invertible.

B=UAIn=UAU1=A

Now suppose that U=E1E2Ek where each Ei is an elementary matrix representing a row operation used to carry A to I. Then,

U1=(E1E2Ek)1=E1kE12E11

Remember that if Ei is an elementary matrix, so too is E1i. It follows that

A=U1=E1kE12E11

and A can be written as a product of elementary matrices.

Theorem 2.8.4: Product of Elementary Matrices

Let A be an n×n matrix. Then A is invertible if and only if it can be written as a product of elementary matrices.

Consider the following example.

Example 2.8.7: Product of Elementary Matrices

Let A=[010110021]. Write A as a product of elementary matrices.

Solution

We will use the process outlined in Theorem 2.8.3 to write A as a product of elementary matrices. We will set up the matrix [A|I] and row reduce, recording each row operation as an elementary matrix.

First:

[010100110010021001][110010010100021001]

represented by the elementary matrix E1=[010100001].

Secondly:

[110010010100021001][100110010100021001]

represented by the elementary matrix E2=[110010001].

Finally:

[100110010100021001][100110010100001201]

represented by the elementary matrix E3=[100010021].

Notice that the reduced row-echelon form of A is I. Hence I=UA where U is the product of the above elementary matrices. It follows that A=U1. Since we want to write A as a product of elementary matrices, we wish to express U1 as a product of elementary matrices. 

U1=(E3E2E1)1=E11E12E13=[010100001][110010001][100010021]=A

This gives A written as a product of elementary matrices. By Theorem 2.8.4 it follows that A is invertible.


This page titled 2.8: Elementary Matrices is shared under a CC BY 4.0 license and was authored, remixed, and/or curated by Ken Kuttler (Lyryx) via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?