Search

Text Color

Margin Size

Font Type

Enable Dyslexic Font

12.1: Invariant Directions

Last updated

Jul 27, 2023
Save as PDF
- 12: Eigenvalues and Eigenvectors
- 12.2: The Eigenvalue-Eigenvector Equation

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\id}{\mathrm{id}}$ $\newcommand{\Span}{\mathrm{span}}$

( \newcommand{\kernel}{\mathrm{null}\,}\) $\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$ $\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$ $\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\id}{\mathrm{id}}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\kernel}{\mathrm{null}\,}$

$\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$

$\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$

$\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$ $\newcommand{\AA}{\unicode[.8,0]{x212B}}$

$\newcommand{\vectorA}[1]{\vec{#1}} % arrow$

$\newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow$

$\newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vectorC}[1]{\textbf{#1}}$

$\newcommand{\vectorD}[1]{\overrightarrow{#1}}$

$\newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}$

$\newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}}$

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\avec}{\mathbf a}$

$\newcommand{\bvec}{\mathbf b}$

$\newcommand{\cvec}{\mathbf c}$

$\newcommand{\dvec}{\mathbf d}$

$\newcommand{\dtil}{\widetilde{\mathbf d}}$

$\newcommand{\evec}{\mathbf e}$

$\newcommand{\fvec}{\mathbf f}$

$\newcommand{\nvec}{\mathbf n}$

$\newcommand{\pvec}{\mathbf p}$

$\newcommand{\qvec}{\mathbf q}$

$\newcommand{\svec}{\mathbf s}$

$\newcommand{\tvec}{\mathbf t}$

$\newcommand{\uvec}{\mathbf u}$

$\newcommand{\vvec}{\mathbf v}$

$\newcommand{\wvec}{\mathbf w}$

$\newcommand{\xvec}{\mathbf x}$

$\newcommand{\yvec}{\mathbf y}$

$\newcommand{\zvec}{\mathbf z}$

$\newcommand{\rvec}{\mathbf r}$

$\newcommand{\mvec}{\mathbf m}$

$\newcommand{\zerovec}{\mathbf 0}$

$\newcommand{\onevec}{\mathbf 1}$

$\newcommand{\real}{\mathbb R}$

$\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}$

$\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}$

$\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}$

$\newcommand{\laspan}[1]{\text{Span}\{#1\}}$

$\newcommand{\bcal}{\cal B}$

$\newcommand{\ccal}{\cal C}$

$\newcommand{\scal}{\cal S}$

$\newcommand{\wcal}{\cal W}$

$\newcommand{\ecal}{\cal E}$

$\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}$

$\newcommand{\gray}[1]{\color{gray}{#1}}$

$\newcommand{\lgray}[1]{\color{lightgray}{#1}}$

$\newcommand{\rank}{\operatorname{rank}}$

$\newcommand{\row}{\text{Row}}$

$\newcommand{\col}{\text{Col}}$

$\renewcommand{\row}{\text{Row}}$

$\newcommand{\nul}{\text{Nul}}$

$\newcommand{\var}{\text{Var}}$

$\newcommand{\corr}{\text{corr}}$

$\newcommand{\len}[1]{\left|#1\right|}$

$\newcommand{\bbar}{\overline{\bvec}}$

$\newcommand{\bhat}{\widehat{\bvec}}$

$\newcommand{\bperp}{\bvec^\perp}$

$\newcommand{\xhat}{\widehat{\xvec}}$

$\newcommand{\vhat}{\widehat{\vvec}}$

$\newcommand{\uhat}{\widehat{\uvec}}$

$\newcommand{\what}{\widehat{\wvec}}$

$\newcommand{\Sighat}{\widehat{\Sigma}}$

$\newcommand{\lt}{<}$

$\newcommand{\gt}{>}$

$\newcommand{\amp}{&}$

$\definecolor{fillinmathshade}{gray}{0.9}$

Have a look at the linear transformation $L$ depicted below:

$invariant.jpg$

It was picked at random by choosing a pair of vectors $L(e_{1})$ and $L(e_{2})$ as the outputs of $L$ acting on the canonical basis vectors. Notice how the unit square with a corner at the origin is mapped to a parallelogram. The second line of the picture shows these superimposed on one another. Now look at the second picture on that line. There, two vectors $f_{1}$ and $f_{2}$ have been carefully chosen such that if the inputs into $L$ are in the parallelogram spanned by $f_{1}$ and $f_{2}$ , the outputs also form a parallelogram with edges lying along the same two directions. Clearly this is a very special situation that should correspond to interesting properties of $L$ .

Now lets try an explicit example to see if we can achieve the last picture:

Example $\PageIndex{1}$ :

Consider the linear transformation $L$ such that $L\begin{pmatrix}1\\0\end{pmatrix}=\begin{pmatrix}-4\\-10\end{pmatrix}\, \mbox{ and }\, L\begin{pmatrix}0\\1\end{pmatrix}=\begin{pmatrix}3\\7\end{pmatrix}\, ,$ so that the matrix of $L$ is

$\begin{pmatrix} -4 & 3 \\ -10 & 7 \\ \end{pmatrix}\, .$

Recall that a vector is a direction and a magnitude; $L$ applied to $\begin{pmatrix}1\\0\end{pmatrix}$ or $\begin{pmatrix}0\\1\end{pmatrix}$ changes both the direction and the magnitude of the vectors given to it.

Notice that $L\begin{pmatrix}3\\5\end{pmatrix}=\begin{pmatrix}-4\cdot 3+3\cdot 5 \\ -10\cdot 3+7\cdot 5\end{pmatrix}=\begin{pmatrix}3\\5\end{pmatrix}\, .$ Then $L$ fixes the direction (and actually also the magnitude) of the vector $v_{1}=\begin{pmatrix}3\\5\end{pmatrix}$ .

Now, notice that any vector with the same direction as $v_{1}$ can be written as $cv_{1}$ for some constant $c$ . Then $L(cv_{1})=cL(v_{1})=cv_{1}$ , so $L$ fixes every vector pointing in the same direction as $v_{1}$ .

Also notice that

$L\begin{pmatrix}1\\2\end{pmatrix}=\begin{pmatrix}-4\cdot 1+3\cdot 2 \\ -10\cdot 1+7\cdot 2\end{pmatrix}=\begin{pmatrix}2\\4\end{pmatrix}=2\begin{pmatrix}1\\2\end{pmatrix}\, ,$

so $L$ fixes the direction of the vector $v_{2}=\begin{pmatrix}1\\2\end{pmatrix}$ but stretches $v_{2}$ by a factor of $2$ . Now notice that for any constant $c$ , $L(cv_{2})=cL(v_{2})=2cv_{2}$ . Then $L$ stretches every vector pointing in the same direction as $v_{2}$ by a factor of $2$ .

In short, given a linear transformation $L$ it is sometimes possible to find a vector $v\neq 0$ and constant $\lambda\neq 0$ such that $Lv=\lambda v.$

$eigeneqn.jpg$

We call the direction of the vector $v$ an $\textit{invariant direction}$ . In fact, any vector pointing in the same direction also satisfies this equation because $L(cv)=cL(v)=\lambda cv$ . More generally, any $\textit{non-zero}$ vector $v$ that solves

$Lv=\lambda v$

is called an $\textit{eigenvector}$ of $L$ , and $\lambda$ (which now need not be zero) is an $\textit{eigenvalue}$ . Since the direction is all we really care about here, then any other vector $cv$ (so long as $c\neq 0$ ) is an equally good choice of eigenvector. Notice that the relation " $u$ and $v$ point in the same direction'' is an equivalence relation.

In our example of the linear transformation $L$ with matrix

$\begin{pmatrix} -4 & 3 \\ -10 & 7 \\ \end{pmatrix}\, ,$

we have seen that $L$ enjoys the property of having two invariant directions, represented by eigenvectors $v_{1}$ and $v_{2}$ with eigenvalues $1$ and $2$ , respectively.

It would be very convenient if we could write any vector $w$ as a linear combination of $v_{1}$ and $v_{2}$ . Suppose $w=rv_{1}+sv_{2}$ for some constants $r$ and $s$ . Then:

$L(w)=L(rv_{1}+sv_{2})=rL(v_{1})+sL(v_{2})=rv_{1}+2sv_{2}.$

Now $L$ just multiplies the number $r$ by $1$ and the number $s$ by $2$ . If we could write this as a matrix, it would look like:

$\begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}\begin{pmatrix}s\\t\end{pmatrix}$

which is much slicker than the usual scenario

$L\!\begin{pmatrix} x\\ y \end{pmatrix}\!=\!\begin{pmatrix} \!a&b\! \\ \!c&d\! \end{pmatrix} \! \!\begin{pmatrix} x \\ y \end{pmatrix}\!=\! \begin{pmatrix} \!ax+by\! \\ \!cx+dy\! \end{pmatrix}\, .$

Here, $s$ and $t$ give the coordinates of $w$ in terms of the vectors $v_{1}$ and $v_{2}$ . In the previous example, we multiplied the vector by the matrix $L$ and came up with a complicated expression. In these coordinates, we see that $L$ has a very simple diagonal matrix, whose diagonal entries are exactly the eigenvalues of $L$ .

This process is called diagonalization. It makes complicated linear systems much easier to analyze.

Now that we've seen what eigenvalues and eigenvectors are, there are a number of questions that need to be answered.

How do we find eigenvectors and their eigenvalues?
How many eigenvalues and (independent) eigenvectors does a given linear transformation have?
When can a linear transformation be diagonalized?

We'll start by trying to find the eigenvectors for a linear transformation.

Example $\PageIndex{2}$ :

Let $L \colon \Re^{2}\rightarrow \Re^{2}$ such that $L(x,y)=(2x+2y, 16x+6y)$ . First, we find the matrix of $L$ :

$\begin{pmatrix}x\\y\end{pmatrix}\stackrel{L}{\longmapsto} \begin{pmatrix} 2 & 2 \\ 16 & 6 \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}.$

We want to find an invariant direction $v=\begin{pmatrix}x\\y\end{pmatrix}$ such that

$Lv=\lambda v$

or, in matrix notation,
$\begin{equation*} \begin{array}{lrcl} &\begin{pmatrix} 2 & 2 \\ 16 & 6 \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}&=&\lambda \begin{pmatrix}x\\y\end{pmatrix} \\ \Leftrightarrow &\begin{pmatrix} 2 & 2 \\ 16 & 6 \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}&=&\begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix} \begin{pmatrix}x\\y\end{pmatrix} \\ \Leftrightarrow& \begin{pmatrix} 2-\lambda & 2 \\ 16 & 6-\lambda \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}&=& \begin{pmatrix}0\\0\end{pmatrix}\, . \end{array} \end{equation*}$
This is a homogeneous system, so it only has solutions when the matrix $\begin{pmatrix} 2-\lambda & 2 \\ 16 & 6-\lambda \end{pmatrix}$ is singular. In other words,
$\begin{equation*} \begin{array}{lrcl} &\det \begin{pmatrix} 2-\lambda & 2 \\ 16 & 6-\lambda \end{pmatrix}&=&0 \\ \Leftrightarrow& (2-\lambda)(6-\lambda)-32&=&0 \\ \Leftrightarrow &\lambda^{2}-8\lambda-20&=&0\\ \Leftrightarrow &(\lambda-10)(\lambda+2)&=&0 \end{array} \end{equation*}$

For any square $n\times n$ matrix $M$ , the polynomial in $\lambda$ given by $P_{M}(\lambda)=\det (\lambda I-M)=(-1)^{n} \det (M-\lambda I)$ is called the $\textit{characteristic polynomial}$ of $M$ , and its roots are the eigenvalues of $M$ .

In this case, we see that $L$ has two eigenvalues, $\lambda_{1}=10$ and $\lambda_{2}=-2$ . To find the eigenvectors, we need to deal with these two cases separately. To do so, we solve the linear system $\begin{pmatrix} 2-\lambda & 2 \\ 16 & 6-\lambda \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}= \begin{pmatrix}0\\0\end{pmatrix}$ with the particular eigenvalue $\lambda$ plugged in to the matrix.

1. [ $\underline{\lambda=10}$ :] We solve the linear system

$\begin{pmatrix} -8 & 2 \\ 16 & -4 \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}= \begin{pmatrix}0\\0\end{pmatrix}.$
Both equations say that $y=4x$ , so any vector $\begin{pmatrix}x\\4x\end{pmatrix}$ will do. Since we only need the direction of the eigenvector, we can pick a value for $x$ . Setting $x=1$ is convenient, and gives the eigenvector $v_{1}=\begin{pmatrix}1\\4\end{pmatrix}$ .

2. [ $\underline{\lambda=-2}$ :] We solve the linear system

$\begin{pmatrix} 4 & 2 \\ 16 & 8 \end{pmatrix}\begin{pmatrix}x\\y\end{pmatrix}= \begin{pmatrix}0\\0\end{pmatrix}.$
Here again both equations agree, because we chose $\lambda$ to make the system singular. We see that $y=-2x$ works, so we can choose $v_{2}=\begin{pmatrix}1\\-2\end{pmatrix}$ .

Our process was the following:

Find the characteristic polynomial of the matrix $M$ for $L$ , given by $\det (\lambda I-M)$ .
Find the roots of the characteristic polynomial; these are the eigenvalues of $L$ .
For each eigenvalue $\lambda_{i}$ , solve the linear system $(M-\lambda_{i} I)v=0$ to obtain an eigenvector $v$ associated to $\lambda_{i}$ .

Contributor

David Cherney, Tom Denton, and Andrew Waldron (UC Davis)

Contributor

Support Center

How can we help?