Diagonalization
Similar Matrices

We have seen that the commutative property does not hold for matrix multiplication, so if \(A\) is an \(n \times n\) matrix and \(P\) is a nonsingular \(n \times n\) matrix, then \(P^{-1}AP\) is not necessarily equal to \(A\). For different nonsingular matrices \(P\), the expression \(P^{-1}AP\) represents different matrices. However, all such matrices share some important properties, as we shall soon see.

Let \(A\) and \(B\) be \(n \times n\) matrices. Then \(A\) is similar to \(B\) if there is a nonsingular matrix \(P\) with

\( B = P^{-1}AP \)
Example

Consider the matrices

\( A = \begin{pmatrix} 2 & -1 \\ 1 & 5 \end{pmatrix} \qquad P = \begin{pmatrix} 3 & 4 \\ 4 & 5 \end{pmatrix} \)

Since \(\det P = -1\), \(P\) is nonsingular with \( P^{-1} = \begin{pmatrix} -5 & 4 \\ 4 & -3 \end{pmatrix} \). Then

\( B = P^{-1}AP = \begin{pmatrix} -5 & 4 \\ 4 & -3 \end{pmatrix} \begin{pmatrix} 2 & -1 \\ 1 & 5 \end{pmatrix} \begin{pmatrix} 3 & 4 \\ 4 & 5 \end{pmatrix} = \begin{pmatrix} 82 & 101 \\ -61 & -75 \end{pmatrix} \)

is similar to \(A\).
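As a quick numerical check (not part of the original text), we can carry out this similarity computation with NumPy, using the matrices \(A\) and \(P\) above:

```python
import numpy as np

A = np.array([[2.0, -1.0],
              [1.0,  5.0]])
P = np.array([[3.0, 4.0],
              [4.0, 5.0]])

# det(P) = 15 - 16 = -1, so P is nonsingular and P^{-1}AP is defined.
B = np.linalg.inv(P) @ A @ P   # B == [[82, 101], [-61, -75]]

# Similar matrices share trace and determinant:
assert np.isclose(np.trace(B), np.trace(A))
assert np.isclose(np.linalg.det(B), np.linalg.det(A))
print(np.round(B))
```

The shared trace and determinant are a first hint at the deeper fact, proved below, that similar matrices have the same characteristic polynomial.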
Notice the following three facts:

1. Every matrix \(A\) is similar to itself (take \(P = I\)).
2. If \(A\) is similar to \(B\), then \(B\) is similar to \(A\) (if \(B = P^{-1}AP\), then \(A = PBP^{-1} = (P^{-1})^{-1}B\,P^{-1}\)).
3. If \(A\) is similar to \(B\) and \(B\) is similar to \(C\), then \(A\) is similar to \(C\).

We call a relation with these three properties an equivalence relation. We will prove the third property. If \(A\) is similar to \(B\) and \(B\) is similar to \(C\), then there are nonsingular matrices \(P\) and \(Q\) with

\( B = P^{-1}AP \qquad \text{and} \qquad C = Q^{-1}BQ \)

We need to find a nonsingular matrix \(R\) with \( C = R^{-1}AR \). Letting \(R = PQ\), we have

\( C = Q^{-1}BQ = Q^{-1}(P^{-1}AP)Q = (Q^{-1}P^{-1})A(PQ) = (PQ)^{-1}A(PQ) = R^{-1}AR \)

There is a wonderful fact that we state below.
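The transitivity argument can be confirmed numerically. The sketch below (NumPy, with randomly generated matrices standing in for \(A\), \(P\), and \(Q\); none of these come from the text) verifies that \(R = PQ\) does the job:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
P = rng.standard_normal((3, 3))   # random matrices are almost surely nonsingular
Q = rng.standard_normal((3, 3))

B = np.linalg.inv(P) @ A @ P      # B is similar to A
C = np.linalg.inv(Q) @ B @ Q      # C is similar to B

R = P @ Q                         # the R from the proof
# C = (PQ)^{-1} A (PQ), so A is similar to C
assert np.allclose(C, np.linalg.inv(R) @ A @ R)
print("transitivity verified")
```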
Theorem

If \(A\) and \(B\) are similar matrices, then they have the same eigenvalues.
Proof

It is enough to show that \(A\) and \(B\) have the same characteristic polynomial. If \(B = P^{-1}AP\), then

\( \det(\lambda I - B) = \det(\lambda I - P^{-1}AP) = \det(P^{-1}\lambda I P - P^{-1}AP) = \det(P^{-1}(\lambda I - A)P) = \det(P^{-1})\det(\lambda I - A)\det(P) = \det(\lambda I - A) \)

Diagonalizable Matrices

The easiest matrices to deal with are diagonal matrices: their determinants are simple to compute, their eigenvalues are just the diagonal entries, and their eigenvectors are just the standard basis vectors. Even the inverse is a piece of cake (if the matrix is nonsingular). Although most matrices are not diagonal, many are diagonalizable, that is, similar to a diagonal matrix.

A matrix \(A\) is diagonalizable if \(A\) is similar to a diagonal matrix \(D\), that is, if

\( D = P^{-1}AP \)

for some nonsingular matrix \(P\). The following theorem tells us when a matrix is diagonalizable and, if it is, how to find the similar diagonal matrix \(D\).

Theorem

Let \(A\) be an \(n \times n\) matrix. Then \(A\) is diagonalizable if and only if \(A\) has \(n\) linearly independent eigenvectors. If so, then \( D = P^{-1}AP \), where, if \(\{v_1, \dots, v_n\}\) are the linearly independent eigenvectors of \(A\) and \(\{\lambda_1, \dots, \lambda_n\}\) are the corresponding eigenvalues, \(v_j\) is the \(j\)th column of \(P\) and \([D]_{jj} = \lambda_j\).
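The theorem can be illustrated with NumPy's eigendecomposition. The 2×2 matrix below is a hypothetical example chosen for this sketch (it does not appear in the text); its eigenvalues are 2 and 5:

```python
import numpy as np

# Hypothetical example matrix with distinct eigenvalues 2 and 5.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

# np.linalg.eig returns the eigenvalues and a matrix whose columns are
# the corresponding eigenvectors -- exactly the P of the theorem.
eigvals, P = np.linalg.eig(A)
D = np.diag(eigvals)

# A is diagonalizable iff its n eigenvectors are linearly independent,
# i.e. P has full rank (equivalently, P is nonsingular).
assert np.linalg.matrix_rank(P) == 2

# And then D = P^{-1} A P, with [D]_jj the eigenvalue for column j of P.
assert np.allclose(np.linalg.inv(P) @ A @ P, D)
print(np.round(np.sort(eigvals), 6))
```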
Example

In the last discussion, we saw that the matrix

\( A = \begin{pmatrix} 1 & 3 \\ 2 & 2 \end{pmatrix} \)

has \(-1\) and \(4\) as eigenvalues, with associated eigenvectors

\( v_{-1} = \begin{pmatrix} 3 \\ -2 \end{pmatrix} \qquad v_{4} = \begin{pmatrix} 1 \\ 1 \end{pmatrix} \)

Hence

\( P = \begin{pmatrix} 3 & 1 \\ -2 & 1 \end{pmatrix} \qquad D = \begin{pmatrix} -1 & 0 \\ 0 & 4 \end{pmatrix} \)

You can verify that \( D = P^{-1}AP \).

Proof of the Theorem

First suppose that \( D = P^{-1}AP \) for some diagonal matrix \(D\) and nonsingular matrix \(P\). Then \( AP = PD \). Let \(v_j\) be the \(j\)th column of \(P\) and \([D]_{jj} = \lambda_j\). Then the \(j\)th column of \(AP\) is \(Av_j\) and the \(j\)th column of \(PD\) is \(\lambda_j v_j\). Hence

\( Av_j = \lambda_j v_j \)

so that \(v_j\) is an eigenvector of \(A\) with corresponding eigenvalue \(\lambda_j\). Since \(P\) is nonsingular, \(\operatorname{rank}(P) = n\), so the columns of \(P\) (the eigenvectors of \(A\)) are linearly independent.

Conversely, suppose that \(A\) has \(n\) linearly independent eigenvectors, and form \(D\) and \(P\) as above. Since \( Av_j = \lambda_j v_j \), the \(j\)th column of \(AP\) equals the \(j\)th column of \(PD\), hence \( AP = PD \). Since the columns of \(P\) are linearly independent, \(P\) is nonsingular, so

\( D = P^{-1}AP \)

Theorem

Let \(A\) be an \(n \times n\) matrix with \(n\) real and distinct eigenvalues. Then \(A\) is diagonalizable.
Proof

By the previous theorem, it is enough to show that eigenvectors corresponding to distinct eigenvalues are linearly independent. Let \(\{\lambda_1, \dots, \lambda_n\}\) be the (distinct) eigenvalues of \(A\) with corresponding eigenvectors \(\{v_1, \dots, v_n\}\), and suppose, for a contradiction, that these eigenvectors are linearly dependent. Then some eigenvector can be written as a linear combination of the others; choosing a dependence involving the fewest vectors and relabeling, we may assume

\( v_1 = c_2 v_2 + \cdots + c_k v_k \qquad (1) \)

where \(\{v_2, \dots, v_k\}\) is linearly independent and each \(c_i \ne 0\). Multiplying both sides of (1) by \(A\) gives

\( \lambda_1 v_1 = c_2 A v_2 + \cdots + c_k A v_k = c_2 \lambda_2 v_2 + \cdots + c_k \lambda_k v_k \qquad (2) \)

Multiply (1) by \(\lambda_1\) and subtract it from (2) to get

\( c_2(\lambda_2 - \lambda_1)v_2 + \cdots + c_k(\lambda_k - \lambda_1)v_k = 0 \)

Since \(\{v_2, \dots, v_k\}\) is linearly independent, every coefficient \(c_i(\lambda_i - \lambda_1)\) must be zero, and since the \(\lambda\)'s are distinct, each \(c_i\) must be zero. This contradicts \(c_i \ne 0\). Hence the \(n\) eigenvectors are linearly independent, and the result follows.
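This theorem is easy to check numerically. In the sketch below (NumPy, with a hypothetical triangular matrix not taken from the text, whose eigenvalues are its diagonal entries 1, 2, 3), the eigenvector matrix has full rank, as the proof guarantees:

```python
import numpy as np

# Hypothetical upper-triangular matrix: its eigenvalues are the diagonal
# entries 1, 2, 3 -- real and distinct.
A = np.array([[1.0, 5.0, 7.0],
              [0.0, 2.0, 9.0],
              [0.0, 0.0, 3.0]])

eigvals, V = np.linalg.eig(A)

# Distinct eigenvalues give linearly independent eigenvectors ...
assert np.linalg.matrix_rank(V) == 3
# ... so A is diagonalizable: V^{-1} A V is the diagonal matrix of eigenvalues.
assert np.allclose(np.linalg.inv(V) @ A @ V, np.diag(eigvals))
print("A is diagonalizable")
```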
Note that the converse certainly does not hold. For example, the identity matrix \(I\) has \(1\) as its only eigenvalue (repeated \(n\) times), yet it is diagonalizable (it is already diagonal).
Steps to Diagonalize a Matrix

1. Find the characteristic polynomial \(\det(\lambda I - A)\) and its roots, the eigenvalues of \(A\).
2. For each eigenvalue \(\lambda\), find a basis for the associated eigenspace, that is, for the null space of \(\lambda I - A\).
3. If these bases together contain \(n\) vectors, form \(P\) with those vectors as its columns; otherwise \(A\) is not diagonalizable.
4. Form the diagonal matrix \(D\) whose \(j\)th diagonal entry is the eigenvalue corresponding to the \(j\)th column of \(P\). Then \(D = P^{-1}AP\).
Example

Diagonalize the matrix

\( A = \begin{pmatrix} 3 & 1 & -1 \\ 0 & 1 & 0 \\ 2 & 1 & 0 \end{pmatrix} \)

Solution

We find the characteristic polynomial

\( \det(\lambda I - A) = \det\begin{pmatrix} \lambda - 3 & -1 & 1 \\ 0 & \lambda - 1 & 0 \\ -2 & -1 & \lambda \end{pmatrix} = (\lambda - 3)(\lambda - 1)\lambda + 2(\lambda - 1) = (\lambda - 1)(\lambda^2 - 3\lambda + 2) = (\lambda - 1)^2(\lambda - 2) \)

The roots are \(1\) (with multiplicity 2) and \(2\) (with multiplicity 1). Now we find the eigenspaces associated with these eigenvalues. We have
\( 1I - A = \begin{pmatrix} -2 & -1 & 1 \\ 0 & 0 & 0 \\ -2 & -1 & 1 \end{pmatrix} \qquad \operatorname{rref}(1I - A) = \begin{pmatrix} 1 & \frac{1}{2} & -\frac{1}{2} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \)

A basis for the null space is

\( V_1 = \left\{ \begin{pmatrix} -1 \\ 2 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 \\ 0 \\ 2 \end{pmatrix} \right\} \)

Next we find a basis for the eigenspace associated with the eigenvalue 2. We have

\( 2I - A = \begin{pmatrix} -1 & -1 & 1 \\ 0 & 1 & 0 \\ -2 & -1 & 2 \end{pmatrix} \qquad \operatorname{rref}(2I - A) = \begin{pmatrix} 1 & 0 & -1 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix} \)

A basis for the null space is

\( V_2 = \left\{ \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix} \right\} \)

Now put this all together to get

\( P = \begin{pmatrix} -1 & 1 & 1 \\ 2 & 0 & 0 \\ 0 & 2 & 1 \end{pmatrix} \qquad D = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2 \end{pmatrix} \)
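As a closing check (NumPy, not part of the original solution), we can confirm that the \(P\) and \(D\) found in this example satisfy \(D = P^{-1}AP\):

```python
import numpy as np

A = np.array([[3.0, 1.0, -1.0],
              [0.0, 1.0,  0.0],
              [2.0, 1.0,  0.0]])
P = np.array([[-1.0, 1.0, 1.0],
              [ 2.0, 0.0, 0.0],
              [ 0.0, 2.0, 1.0]])
D = np.diag([1.0, 1.0, 2.0])

# det(P) = 2, so the three eigenvector columns are linearly independent
# and A is diagonalizable.
assert np.isclose(np.linalg.det(P), 2.0)
assert np.allclose(np.linalg.inv(P) @ A @ P, D)
print("D = P^{-1} A P verified")
```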

