3.2: Bases and coordinate systems

Last updated

Jun 19, 2024
Save as PDF
- 3.1: Invertibility
- 3.3: Image Compression

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\id}{\mathrm{id}}$ $\newcommand{\Span}{\mathrm{span}}$

( \newcommand{\kernel}{\mathrm{null}\,}\) $\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$ $\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$ $\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\id}{\mathrm{id}}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\kernel}{\mathrm{null}\,}$

$\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$

$\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$

$\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$ $\newcommand{\AA}{\unicode[.8,0]{x212B}}$

$\newcommand{\vectorA}[1]{\vec{#1}} % arrow$

$\newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow$

$\newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vectorC}[1]{\textbf{#1}}$

$\newcommand{\vectorD}[1]{\overrightarrow{#1}}$

$\newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}$

$\newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}}$

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\avec}{\mathbf a}$

$\newcommand{\bvec}{\mathbf b}$

$\newcommand{\cvec}{\mathbf c}$

$\newcommand{\dvec}{\mathbf d}$

$\newcommand{\dtil}{\widetilde{\mathbf d}}$

$\newcommand{\evec}{\mathbf e}$

$\newcommand{\fvec}{\mathbf f}$

$\newcommand{\nvec}{\mathbf n}$

$\newcommand{\pvec}{\mathbf p}$

$\newcommand{\qvec}{\mathbf q}$

$\newcommand{\svec}{\mathbf s}$

$\newcommand{\tvec}{\mathbf t}$

$\newcommand{\uvec}{\mathbf u}$

$\newcommand{\vvec}{\mathbf v}$

$\newcommand{\wvec}{\mathbf w}$

$\newcommand{\xvec}{\mathbf x}$

$\newcommand{\yvec}{\mathbf y}$

$\newcommand{\zvec}{\mathbf z}$

$\newcommand{\rvec}{\mathbf r}$

$\newcommand{\mvec}{\mathbf m}$

$\newcommand{\zerovec}{\mathbf 0}$

$\newcommand{\onevec}{\mathbf 1}$

$\newcommand{\real}{\mathbb R}$

$\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}$

$\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}$

$\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}$

$\newcommand{\laspan}[1]{\text{Span}\{#1\}}$

$\newcommand{\bcal}{\cal B}$

$\newcommand{\ccal}{\cal C}$

$\newcommand{\scal}{\cal S}$

$\newcommand{\wcal}{\cal W}$

$\newcommand{\ecal}{\cal E}$

$\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}$

$\newcommand{\gray}[1]{\color{gray}{#1}}$

$\newcommand{\lgray}[1]{\color{lightgray}{#1}}$

$\newcommand{\rank}{\operatorname{rank}}$

$\newcommand{\row}{\text{Row}}$

$\newcommand{\col}{\text{Col}}$

$\renewcommand{\row}{\text{Row}}$

$\newcommand{\nul}{\text{Nul}}$

$\newcommand{\var}{\text{Var}}$

$\newcommand{\corr}{\text{corr}}$

$\newcommand{\len}[1]{\left|#1\right|}$

$\newcommand{\bbar}{\overline{\bvec}}$

$\newcommand{\bhat}{\widehat{\bvec}}$

$\newcommand{\bperp}{\bvec^\perp}$

$\newcommand{\xhat}{\widehat{\xvec}}$

$\newcommand{\vhat}{\widehat{\vvec}}$

$\newcommand{\uhat}{\widehat{\uvec}}$

$\newcommand{\what}{\widehat{\wvec}}$

$\newcommand{\Sighat}{\widehat{\Sigma}}$

$\newcommand{\lt}{<}$

$\newcommand{\gt}{>}$

$\newcommand{\amp}{&}$

$\definecolor{fillinmathshade}{gray}{0.9}$

$\newcommand{\zerovec}{\mathbf 0}$ $\newcommand{\twovec}[2]{\begin{pmatrix} #1 \\ #2 \end{pmatrix} }$ $\newcommand{\threevec}[3]{\begin{pmatrix} #1 \\ #2 \\ #3 \end{pmatrix} }$ $\newcommand{\fourvec}[4]{\begin{pmatrix} #1 \\ #2 \\ #3 \\ #4 \end{pmatrix} }$ $\newcommand{\fivevec}[5]{\begin{pmatrix} #1 \\ #2 \\ #3 \\ #4 \\ #5 \end{pmatrix} }$ When working in the plane, we are used to thinking about standard Cartesian coordinates. If we mention the point $(4,3)\text{,}$ we know that we arrive at this point from the origin by moving four units to the right and three units up.

Sometimes, however, it is more natural to work in a different coordinate system. Suppose, for instance, that you live in the city whose map is shown in Figure 3.2.1 and that you would like to give a guest directions for getting from your house to the store. You would probably say something like, "Go four blocks up Maple. Then turn left on Main for three blocks." The grid of streets in the city gives a more natural coordinate system than standard north-south, east-west coordinates.

Figure 3.2.1. A city map.

In this section, we will develop the concept of a basis through which we will create new coordinate systems in $\mathbb R^m\text{.}$ We will see that the right choice of a coordinate system provides a more natural way to approach some problems.

Preview Activity 3.2.1.

Consider the vectors

$\begin{equation*} \mathbf v_1 = \twovec{2}{1}, \mathbf v_2 = \twovec{1}{2} \end{equation*}$

in $\mathbb R^2\text{.}$

Indicate the linear combination $\mathbf v_1 - 2\mathbf v_2$ on Figure 3.2.2.

Figure 3.2.2. Linear combinations of

$\mathbf v_1$ and

$\mathbf v_2\text{.}$

Express the vector $\twovec{-3}{0}$ as a linear combination of $\mathbf v_1$ and $\mathbf v_2\text{.}$
Find the linear combination $10\mathbf v_1 - 13\mathbf v_2\text{.}$
Express the vector $\twovec{16}{-4}$ as a linear combination of $\mathbf v_1$ and $\mathbf v_2\text{.}$
Explain why every vector in $\mathbb R^2$ can be written as a linear combination of $\mathbf v_1$ and $\mathbf v_2$ in exactly one way.

In the preview activity, we worked with a set of two vectors in $\mathbb R^2$ and found that we could express any vector in $\mathbb R^2$ in two different ways: in the usual way where the components of the vector describe horizontal and vertical changes, and in a new way as a linear combination of $\mathbf v_1$ and $\mathbf v_2\text{.}$ We could also translate between these two different descriptions. This example illustrates the central idea of this section.

Bases

In the preview activity, we created a new coordinate system for $\mathbb R^2$ using linear combinations of a set of vectors. As we work to do this more generally, the following definition will guide us.

Definition 3.2.3

A set of vectors $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n$ in $\mathbb R^m$ is called a basis for $\mathbb R^m$ if the set of vectors spans $\mathbb R^m$ and is linearly independent.

We will look at some examples of bases in the following activity.

Activity 3.2.2.

In the preview activity, we considered a set of vectors in $\mathbb R^2\text{:}$
$\begin{equation*} \mathbf v_1 = \twovec{2}{1}, \mathbf v_2 = \twovec{1}{2}\text{.} \end{equation*}$

Explain why these vectors form a basis for $\mathbb R^2\text{.}$
Consider the set of vectors in $\mathbb R^3$
$\begin{equation*} \mathbf v_1 = \threevec{1}{1}{1}, \mathbf v_2 = \threevec{0}{1}{-1}, \mathbf v_3 = \threevec{1}{0}{-1}\text{.} \end{equation*}$

and determine whether they form a basis for $\mathbb R^3\text{.}$
Do the vectors
$\begin{equation*} \mathbf v_1 = \threevec{-2}{1}{3}, \mathbf v_2 = \threevec{3}{0}{-1}, \mathbf v_3 = \threevec{1}{1}{0}, \mathbf v_4 = \threevec{0}{3}{-2} \end{equation*}$

form a basis for $\mathbb R^3\text{?}$
Explain why the vectors $\mathbf e_1,\mathbf e_2,\mathbf e_3$ form a basis for $\mathbb R^3\text{.}$
If a set of vectors $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n$ forms a basis for $\mathbb R^m\text{,}$ what can you guarantee about the pivot positions of the matrix
$\begin{equation*} \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_n \end{array}\right]\text{?} \end{equation*}$
If the set of vectors $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n$ is a basis for $\mathbb R^{10}\text{,}$ how many vectors must be in the set?

We can develop a test to determine if a set of vectors $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n$ forms a basis for $\mathbb R^m$ by considering the matrix

$\begin{equation*} A = \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_n \end{array}\right]\text{.} \end{equation*}$

To be a basis, this set of vectors must span $\mathbb R^m$ and be linearly independent.

We know that the set of vectors spans $\mathbb R^m$ if and only if $A$ has a pivot position in every row. We also know that the set of vectors is linearly independent if and only if $A$ has a pivot position in every column. This means that a set of vectors forms a basis if and only if $A$ has a pivot in every row and every column. Therefore, $A$ must be row equivalent to the identify matrix $I\text{:}$

$\begin{equation*} A \sim \left[\begin{array}{cccc} 1 & 0 & \ldots & 0 \\ 0 & 0 & \ldots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \ldots & 1 \end{array}\right] = I\text{.} \end{equation*}$

In addition to helping identify bases, this fact tells us something important about the number of vectors in a basis. Since the matrix $A$ has a pivot position in every row and every column, it must have the same number of rows as columns. Therefore, the number of vectors in a basis for $\mathbb R^m$ must be $m\text{.}$ For example, a basis for $\mathbb R^{10}$ must have exactly 10 vectors.

Example 3.2.4

It is worth pointing out that we first encountered a basis long ago when we considered the vectors in $\mathbb R^3\text{:}$

$\begin{equation*} \mathbf e_1 = \threevec{1}{0}{0}, \mathbf e_2 = \threevec{0}{1}{0}, \mathbf e_3 = \threevec{0}{0}{1}\text{.} \end{equation*}$

We see that these vectors are, in fact, the columns of the $3\times3$ identity matrix, which confirms that this set forms a basis.

More generally, the set of vectors $\mathbf e_1,\mathbf e_2,\ldots,\mathbf e_m$ forms a basis for $\mathbb R^m\text{,}$ which we call the standard basis for $\mathbb R^m\text{.}$

Coordinate systems

If we have a basis for $\mathbb R^m\text{,}$ we can use it to form a coordinate system as we will now describe. Rather than continuing to write a list of vectors, we will find it convenient to denote a basis using a single symbol, such as

$\begin{equation*} \mathcal{B} = \{\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_m\} \end{equation*}$

Example 3.2.5

In this section's preview activity, we considered the vectors

$\begin{equation*} \mathbf v_1 = \twovec{2}{1}, \mathbf v_2 = \twovec{1}{2}\text{,} \end{equation*}$

which form a basis $\mathcal{B}=\{\mathbf v_1,\mathbf v_2\}$ for $\mathbb R^2\text{.}$

In the standard coordinate system, the point $(2,-3)$ is found by moving 2 units to the right and 3 units down. We would like to define a new coordinate system where we interpret $(2,-3)$ to mean we move two times along $\mathbf v_1$ and 3 times along $-\mathbf v_2\text{.}$ As we see in the figure, doing so leaves us at the point $(1,-4)\text{,}$ expressed in the usual coordinate system.

We have seen that

$\begin{equation*} \mathbf x = \twovec{1}{-4} = 2\mathbf v_1 - 3\mathbf v_2 \text{.} \end{equation*}$

The coordinates of the vector $\mathbf x$ in the new coordinate system are the weights that we use to create $\mathbf x$ as a linear combination of $\mathbf v_1$ and $\mathbf v_2\text{.}$

Since we now have two descriptions of the vector $\mathbf x\text{,}$ we need some notation to keep track of which coordinate system we are using. Because $\twovec{1}{-4} = 2\mathbf v_1 - 3\mathbf v_2\text{,}$ we will write

$\begin{equation*} \{{\twovec{1}{-4}}\}_{\mathcal{B}} = \twovec{2}{-3} \text{.} \end{equation*}$

More generally, $\{{\mathbf x}\}_{\mathcal{B}}$ will denote the coordinates of $\mathbf x$ in the basis $\mathcal{B}\text{;}$ that is, $\{{\mathbf x}\}_{\mathcal{B}}$ is the vector $\twovec{c_1}{c_2}$ of weights such that

$\begin{equation*} \mathbf x = c_1\mathbf v_1 + c_2 \mathbf v_2\text{.} \end{equation*}$

To illustrate, if the coordinates of $\mathbf x$ in the basis $\mathcal{B}$ are

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = \twovec{5}{-2}\text{,} \end{equation*}$

then

$\begin{equation*} \mathbf x = 5\mathbf v_1 - 2\mathbf v_2 = 5\twovec{2}{1}-2\twovec{1}{2} = \twovec{8}{3}\text{.} \end{equation*}$

We conclude that

$\begin{equation*} \{{\twovec{8}{3}}\}_{\mathcal{B}} = \twovec{5}{-2}\text{.} \end{equation*}$

This demonstrates how we can translate coordinates in the basis $\mathcal{B}$ into standard coordinates.

Suppose we know the expression of a vector $\mathbf x$ in standard coordinates. How can we find its coordinates in the basis $\mathcal{B}\text{?}$ For instance, suppose $\mathbf x=\twovec{-8}{2}$ and that we would like to find $\{{\mathbf x}\}_{\mathcal{B}}\text{.}$ We have

$\begin{equation*} \{{\twovec{-8}{2}}\}_{\mathcal{B}}=\twovec{c_1}{c_2} \end{equation*}$

where

$\begin{equation*} \twovec{-8}{2} = c_1\mathbf v_1 + c_2\mathbf v_2 \end{equation*}$

$\begin{equation*} c_1 \twovec{2}{1} + c_2 \twovec{1}{2} = \twovec{-8}{2}. \end{equation*}$

This linear system for the weights defines an augmented matrix

$\begin{equation*} \left[\begin{array}{rr|r} 2 & 1 & -8 \\ 1 & 2 & 2 \\ \end{array}\right] \sim \left[\begin{array}{rr|r} 1 & 0 & -6 \\ 0 & 1 & 4 \\ \end{array}\right]\text{.} \end{equation*}$

Therefore,

$\begin{equation*} \{{\twovec{-8}{2}}\}_{\mathcal{B}} = \twovec{-6}{4}\text{.} \end{equation*}$

This example illustrates how a basis in $\mathbb R^2$ provides a new coordinate system for $\mathbb R^2$ and shows how we may translate between this coordinate system and the standard one.

More generally, suppose that $\mathcal{B}=\{\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_m\}$ is a basis for $\mathbb R^m\text{.}$ We know that the vectors span $\mathbb R^m\text{,}$ which implies that any vector $\mathbf x$ in $\mathbb R^m$ can be written as a linear combination of the vectors. In addition, we know that the vectors are linearly independent, which means that we can write $\mathbf x$ as a linear combination of the vectors in exactly one way. Therefore, we have

$\begin{equation*} \mathbf x = c_1\mathbf v_1 + c_2\mathbf v_2 + \ldots + c_m\mathbf v_m \end{equation*}$

where the weights $c_1, c_2,\ldots, c_m$ are unique. In this case, we write the coordinate description of $\mathbf x$ in the basis $\mathcal{B}$ as

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = \fourvec{c_1}{c_2}{\vdots}{c_m}\text{.} \end{equation*}$

Activity 3.2.3.

Let's begin with the basis $\mathcal{B} = \{\mathbf v_1,\mathbf v_2\}$ of $\mathbb R^2$ where

$\begin{equation*} \mathbf v_1 = \twovec{3}{-2}, \mathbf v_2 = \twovec{2}{1}\text{.} \end{equation*}$

If the coordinates of $\mathbf x$ in the basis $\mathcal{B}$ are $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{-2}{4}\text{,}$ what is the vector $\mathbf x\text{?}$
If $\mathbf x = \twovec{3}{5}\text{,}$ find the coordinates of $\mathbf x$ in the basis $\mathcal{B}\text{;}$ that is, find $\{{\mathbf x}\}_{\mathcal{B}}\text{.}$
Find a matrix $A$ such that, for any vector $\mathbf x\text{,}$ we have $\mathbf x = A\{{\mathbf x}\}_{\mathcal{B}}\text{.}$ Explain why this matrix is invertible.
Using what you found in the previous part, find a matrix $B$ such that, for any vector $\mathbf x\text{,}$ we have $\{{\mathbf x}\}_{\mathcal{B}} = B\mathbf x\text{.}$ What is the relationship between the two matrices you have found in this and the previous part? Explain why this relationship holds.
Suppose we also consider the basis
$\begin{equation*} \mathcal{C} = \left\{\twovec{1}{2}, \twovec{-2}{1}\right\}\text{.} \end{equation*}$

Find a matrix $C$ that converts coordinates in the basis $\mathcal{C}$ into coordinates in the basis $\mathcal{B}\text{;}$ that is,

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = C \{{\mathbf x}{\mathcal{C}}\text{.} \end{equation*}$

You may wish to think about converting coordinates from the basis $\mathcal{C}$ into the standard coordinate system and then into the basis $\mathcal{B}\text{.}$
Suppose we consider the standard basis
$\begin{equation*} \mathcal{E} = \{\mathbf e_1,\mathbf e_2\}\text{.} \end{equation*}$

What is the relationship between $\mathbf x$ and $\{{\mathbf x}\}_{\mathcal{E}}\text{?}$

This activity demonstrates how we can efficiently convert between coordinate systems defined by different bases. Let's consider a basis $\mathcal{B} = \{\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_m\}$ and a vector $\mathbf x\text{.}$ We know that

$\begin{equation*} \begin{aligned} \mathbf x & {}={} c_1\mathbf v_1 +c_2\mathbf v_2+\ldots+c_m\mathbf v_m \\ \\ & {}={} \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_m \end{array}\right] \fourvec{c_1}{c_2}{\vdots}{c_m} \\ \\ & {}={} \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_m \end{array}\right] \{{\mathbf x}\}_{\mathcal{B}}\text{.} \end{aligned} \end{equation*}$

If we use $C_{\mathcal{B}}$ to denote the matrix whose columns are the basis vectors, then we find that

$\begin{equation*} \mathbf x = C_{\mathcal{B}}\{{\mathbf x}\}_{\mathcal{B}} \end{equation*}$

where $C_{\mathcal{B}} = \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_m \end{array}\right]\text{.}$ This means that the matrix $C_{\mathcal{B}}$ converts coordinates in the basis $\mathcal{B}$ into standard coordinates.

Since the columns of $C_{\mathcal{B}}$ are the basis vectors $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_m\text{,}$ we know that $C_{\mathcal{B}} \sim I_m$ because this set of vectors is linearly independent and spans $\mathbb R^m\text{.}$ Therefore, $C_{\mathcal{B}}$ is invertible. Since we have

$\begin{equation*} \mathbf x = C_{\mathcal{B}}\{{\mathbf x}\}_{\mathcal{B}}\text{,} \end{equation*}$

we must also have

$\begin{equation*} C_{\mathcal{B}}^{-1}\mathbf x = \{{\mathbf x}\}_{\mathcal{B}}\text{.} \end{equation*}$

To summarize, we see that $C_{\mathcal{B}}$ converts coordinates in the basis $\mathcal{B}$ into standard coordinates, and $C_{\mathcal{B}}^{-1}$ converts standard coordinates into coordinates in the basis $\mathcal{B}\text{.}$

If we have another basis $\mathcal{C}\text{,}$ we find, in the same way, that $\mathbf x = C_{\mathcal{C}}\{{\mathbf x}\}_{\mathcal{C}}$ for the conversion between coordinates in the basis $\mathcal{C}$ into standard coordinates. We then have

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = C^{-1}_{\mathcal{B}} \mathbf x = C_{\mathcal{B}}^{-1}(C_{\mathcal{C}} \{{\mathbf x}\}_{\mathcal{C}}) = (C_{\mathcal{B}}^{-1}C_{\mathcal{C}}) \{{\mathbf x}\}_{\mathcal{C}}\text{.} \end{equation*}$

Therefore, $C_{\mathcal{B}}^{-1}C_{\mathcal{C}}$ is the matrix that converts $\mathcal{C}$ -coordinates into $\mathcal{B}$ -coordinates.

In spite of the fact that much of what we are doing here seems new, we have been using the standard basis all along. For example, if $\mathbf x$ is a vector, then

$\begin{equation*} \mathbf x = \fourvec{c_1}{c_2}{\vdots}{c_m} = c_1\mathbf e_1 + c_2\mathbf e_2 + \ldots + c_m\mathbf e_m = C_{\mathcal{E}}\{{\mathbf x}\}_{\mathcal{E}}\text{.} \end{equation*}$

The matrix $C_{\mathcal{E}}$ is, of course, the identity.

Examples of bases

We will now look at some examples of bases and begin to see the usefulness of looking at a problem in a different coordinate system.

Example 3.2.6

Let's consider the basis of $\mathbb R^3\text{:}$

$\begin{equation*} \mathcal{B} = \left\{ \threevec{1}{0}{-2}, \threevec{-2}{1}{0}, \threevec{1}{1}{2} \right\}\text{.} \end{equation*}$

It is relatively straightforward to convert a vector's representation in this basis to the standard basis, using the matrix whose columns are the basis vectors:

$\begin{equation*} C_{\mathcal{B}} = \left[\begin{array}{rrr} 1 & -2 & 1 \\ 0 & 1 & 1 \\ -2 & 0 & 2 \\ \end{array}\right]\text{.} \end{equation*}$

For example, suppose that the vector $\mathbf x$ is described in the coordinate system defined by the basis as $\{{\mathbf x}\}_{\mathcal{B}} = \threevec{2}{-2}{1}\text{.}$ We then have

$\begin{equation*} \mathbf x = C_{\mathcal{B}}\{{\mathbf x}\}_{\mathcal{B}} = \left[\begin{array}{rrr} 1 & -2 & 1 \\ 0 & 1 & 1 \\ -2 & 0 & 2 \\ \end{array}\right] \threevec{2}{-2}{1} = \threevec{7}{-1}{2}\text{.} \end{equation*}$

Consider now the vector $\mathbf x=\threevec{3}{1}{-2}\text{.}$ If we would like to express $\mathbf x$ in the coordinate system defined by $\mathcal{B}\text{,}$ then we compute

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = C^{-1}_{\mathcal{B}}\mathbf x = \left[\begin{array}{rrr} \frac14 & \frac 12 & -\frac38 \\ -\frac14 & \frac12 & -\frac18 \\ \frac14 & \frac12 & \frac18 \\ \end{array}\right] \threevec{3}{1}{-2} = \threevec{2}{0}{1}\text{.} \end{equation*}$

Example 3.2.7

Suppose we work for a company that records its quarterly revenue, in millions of dollars, as:

Table 3.2.8. Quarterly revenue

Quarter	Revenue
1	10.3
2	13.1
3	7.5
4	8.2

Rather than using a table to record the data, we could display it in a graph or write it as a vector in $\mathbb R^4\text{:}$

$\begin{equation*} \mathbf x=\fourvec{10.3}{13.1}{7.5}{8.2}\text{.} \end{equation*}$

Let's now consider a new basis $\mathcal{B}$ for $\mathbb R^4$ using vectors

$\begin{equation*} \mathbf v_1=\fourvec{1}{1}{1}{1}, \mathbf v_2=\fourvec{1}{1}{-1}{-1}, \mathbf v_3=\fourvec{1}{-1}{0}{0}, \mathbf v_4=\fourvec{0}{0}{1}{-1}\text{.} \end{equation*}$

We may view these basis elements graphically, as in Figure 3.2.9

Figure 3.2.9. A representation of the basis elements of

$\mathcal{B}\text{.}$

As we wish to convert our revenue vectors into the coordinates given by $\mathcal{B}\text{,}$ we form the matrices:

$\begin{equation*} C_{\mathcal{B}} = \left[\begin{array}{rrrr} 1 & 1 & 1 & 0 \\ 1 & 1 & -1 & 0 \\ 1 & -1 & 0 & 1 \\ 1 & -1 & 0 & -1 \\ \end{array}\right], C_{\mathcal{B}}^{-1} = \left[\begin{array}{rrrr} \frac14 & \frac14 & \frac14 & \frac14 \\ \frac14 & \frac14 & -\frac14 & -\frac14 \\ \frac12 & -\frac12 & 0 & 0 \\ 0 & 0 & \frac12 & -\frac12 \\ \end{array}\right] \end{equation*}$

and compute

$\begin{equation*} \{{\mathbf x}\}_{\mathcal{B}} = C_{\mathcal{B}}^{-1} \mathbf x = C_{\mathcal{B}}^{-1} \fourvec{10.3}{13.1}{7.5}{8.2} = \fourvec{9.775}{1.925}{-1.400}{-0.350}\text{.} \end{equation*}$

This means that our revenue vector is

$\begin{equation*} \mathbf x = 9.775 \mathbf v_1 + 1.925 \mathbf v_2 - 1.400\mathbf v_3 - 0.350 \mathbf v_4\text{.} \end{equation*}$

We will think about what these coordinates mean by adding the basis vectors together one at a time.

The first coordinate gives us the average revenue over the year: $9.775\mathbf v_1\text{.}$

Adding in the second component shows how the averages in the first and second halves of year differ from the annual average: $9.775\mathbf v_1 + 1.925\mathbf v_2\text{.}$

The third and fourth components break down the behavior in the first and second halves of the year into quarters:

$\begin{equation*} \begin{aligned} \mathbf x = & 9.775 \mathbf v_1 + 1.925 \mathbf v_2 \\ & - 1.400\mathbf v_3 - 0.350 \mathbf v_4\text{.} \end{aligned} \end{equation*}$

If we write $\{{\mathbf x}\}_{\mathcal{B}} = \fourvec{c_1}{c_2}{c_3}{c_4}\text{,}$ we see that the coefficient $c_1$ measures the average revenue over the year, $c_2$ measures the deviation from the annual average in the first and second halves of the year, and $c_3$ measures how the revenue in the first and second quarter differs from the average in the first half of the year. In this way, the coefficients provide a view of the revenue over different time scales, from an annual summary to a finer view of quarterly behavior.

This basis is sometimes called a Haar wavelet basis, and the change of basis is known as a Haar wavelet transform. In the next section, we will see how this basis provides a useful way to store digital images.

Activity 3.2.4. Edge detection.

An important problem in the field of computer vision is to detect edges in a digital photograph, as is shown in Figure 3.2.10. Edge detection algorithms are useful when, say, we want a robot to locate an object in its field of view. Graphic designers also use these algorithms to create artist effects.

Figure 3.2.10. A canyon wall in Capitol Reef National Park and the result of an edge detection algorithm.

We will consider a very simple version of an edge detection algorithm to give a sense of how this works. Rather than considering a two-dimensional photograph, we will think about a one-dimensional row of pixels in a photograph. The grayscale values of a pixel measure the brightness of a pixel; a grayscale value of 0 corresponds to black, and a value of 255 corresponds to white.

Suppose, for simplicity, that the grayscale values for a row of six pixels are represented by a vector $\mathbf x$ in $\mathbb R^6\text{:}$

$\begin{equation*} \mathbf x = \left[\begin{array}{r} 25 \\ 34 \\ 30 \\ 45 \\ 190 \\ 200 \end{array}\right]\text{.} \end{equation*}$

We can easily see that there is a jump in brightness between pixels 4 and 5, but how can we detect it computationally? We will introduce a new basis $\mathcal{B}$ for $\mathbb R^6$ with vectors:

$\begin{equation*} \mathbf v_1=\left[\begin{array}{r} 1 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{array}\right], \mathbf v_2=\left[\begin{array}{r} 1 \\ 1 \\ 0 \\ 0 \\ 0 \\ 0 \end{array}\right], \mathbf v_3=\left[\begin{array}{r} 1 \\ 1 \\ 1 \\ 0 \\ 0 \\ 0 \end{array}\right], \mathbf v_4=\left[\begin{array}{r} 1 \\ 1 \\ 1 \\ 1 \\ 0 \\ 0 \end{array}\right], \mathbf v_5=\left[\begin{array}{r} 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 0 \end{array}\right], \mathbf v_6=\left[\begin{array}{r} 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{array}\right]\text{.} \end{equation*}$

Construct the matrix $C_\mathcal{B}$ that relates the standard coordinate system with the coordinates in the basis $\mathcal{B}\text{.}$
Determine the matrix $C_\mathcal{B}^{-1}$ that converts the representation of $\mathbf x$ in standard coordinates into the coordinate system defined by $\mathcal{B}\text{.}$
Suppose the vectors are expressed in general terms as
$\begin{equation*} \mathbf x = \left[\begin{array}{r} x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5 \\ x_6 \end{array}\right], \{{\mathbf x}\}_{\mathcal{B}} = \left[\begin{array}{r} c_1 \\ c_2 \\ c_3 \\ c_4 \\ c_5 \\ c_6 \end{array}\right]\text{.} \end{equation*}$

Using the relationship $\{{\mathbf x}\}_{\mathcal{B}} = C_{\mathcal{B}}^{-1}\mathbf x\text{,}$ determine an expression for the coefficient $c_2$ in terms of $x_1,x_2,\ldots,x_6\text{.}$ What does $c_2$ measure in terms of the grayscale values of the pixels? What does $c_4$ measure in terms of the grayscale values of the pixels?
Now for the specific vector
$\begin{equation*} \mathbf x = \left[\begin{array}{r} 25 \\ 34 \\ 30 \\ 45 \\ 190 \\ 200 \end{array}\right]\text{,} \end{equation*}$

determine the representation of $\mathbf x$ in the $\mathcal{B}$ -coordinate system.
Explain how the coefficients in $\{{\mathbf x}\}_{\mathcal{B}}$ determine the location of the jump in brightness in the grayscale values represented by the vector $\mathbf x\text{.}$

Readers who are familiar with calculus may recognize that this change of basis converts a vector $\mathbf x$ into $\{{\mathbf x}\}_{\mathcal{B}}\text{,}$ the set of changes in $\mathbf x\text{.}$ This process is similar to differentiation in calculus. Similarly, the process of converting $\{{\mathbf x}\}_{\mathcal{B}}$ into the vector $\mathbf x$ adds together the changes in a process similar to integration. This change of basis, therefore, represents a linear algebraic version of the Fundamental Theorem of Calculus.

Summary

We defined a basis to be a set of vectors $\mathcal{B} = \{\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n\}$ that spans $\mathbb R^m$ and is linearly independent.

A set of vectors forms a basis for $\mathbb R^m$ if and only if the matrix
$\begin{equation*} A = \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_n \end{array}\right] \sim I\text{.} \end{equation*}$

This means there must be $m$ vectors in a basis for $\mathbb R^m\text{.}$
If $\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_m$ forms a basis for $\mathbb R^m\text{,}$ then any vector in $\mathbb R^m$ can be written as a linear combination of the vectors in exactly one way.
We used the basis $\mathcal{B}$ to define a coordinate system in which $\{{\mathbf x}\}_{\mathcal{B}} = \fourvec{c_1}{c_2}{\vdots}{c_n} \text{,}$ the coordinates of $\mathbf x$ in the basis $\mathcal{B}\text{,}$ are defined by
$\begin{equation*} \mathbf x = c_1\mathbf v_1+c_2\mathbf v_2 + \ldots + c_n\mathbf v_m\text{.} \end{equation*}$
Forming the matrix $C_{\mathcal{B}}$ whose columns are the basis vectors, we can convert between coordinate systems:
$\begin{equation*} \begin{aligned} x & {}={} C_{\mathcal{B}}\{{\mathbf x}\}_{\mathcal{B}} \\ C_{\mathcal{B}}^{-1} x & {}={} \{{\mathbf x}\}_{\mathcal{B}} \\ \end{aligned}\text{.} \end{equation*}$

Exercises 3.2.5Exercises

1

Shown in Figure 3.2.11 are two vectors $\mathbf v_1$ and $\mathbf v_2$ in the plane $\mathbb R^2\text{.}$

Figure 3.2.11. Vectors

$\mathbf v_1$ and

$\mathbf v_2$ in

$\mathbb R^2\text{.}$

Explain why $\mathcal{B}=\{\mathbf v_1,\mathbf v_2\}$ is a basis for $\mathbb R^2\text{.}$
Using Figure 3.2.11, indicate the vectors x such that
1. $\displaystyle \{{\mathbf x}\}_{\mathcal{B}} = \twovec{2}{-1}$
2. $\displaystyle \{{\mathbf x}\}_{\mathcal{B}} = \twovec{-1}{-2}$
3. $\displaystyle \{{\mathbf x}\}_{\mathcal{B}} = \twovec{0}{3}$
Using Figure 3.2.11, find the representation {x}B if
1. $\mathbf x = \twovec{-2}{-1}\text{.}$
2. $\mathbf x = \twovec{2}{4}\text{.}$
3. $\mathbf x = \twovec{2}{-5}\text{.}$
Find $\{{\mathbf x}\}_{\mathcal{B}}$ if $\mathbf x=\twovec{60}{90}\text{.}$

2

Consider vectors

$\begin{equation*} \begin{aligned} \mathbf v_1=\twovec{1}{2}, & \mathbf v_2=\twovec{1}{-3} \\ \mathbf w_1=\twovec{2}{3}, & \mathbf w_2=\twovec{-1}{-2} \text{.} \\ \end{aligned} \end{equation*}$

and let $\mathcal{B} = \{\mathbf v_1,\mathbf v_2\}$ and $\mathcal{C} = \{\mathbf w_1,\mathbf w_2\}\text{.}$

Explain why $\mathcal{B}$ and $\mathcal{C}$ are both bases of $\mathbb R^2\text{.}$
If $\mathbf x = \twovec{5}{-3}\text{,}$ find $\{{\mathbf x}\}_{\mathcal{B}}$ and $\{{\mathbf x}\}_{\mathcal{C}}\text{.}$
If $\{{\mathbf x}\}_{\mathcal{B}}=\twovec{2}{-4}\text{,}$ find $\mathbf x$ and $\{{\mathbf x}\}_{\mathcal{C}}\text{.}$
If $\{{\mathbf x}\}_{\mathcal{C}}=\twovec{-3}{2}\text{,}$ find $\mathbf x$ and $\{{\mathbf x}\}_{\mathcal{B}}\text{.}$
Find a matrix $D$ such that $\{{\mathbf x}\}_{\mathcal{B}} = D\{{\mathbf x}\}_{\mathcal{C}}\text{.}$

3

Consider the following vectors in $\mathbb R^4\text{:}$

$\begin{equation*} \mathbf v_1 = \fourvec{1}{1}{1}{1}, \mathbf v_2 = \fourvec{0}{1}{1}{1}, \mathbf v_3 = \fourvec{0}{0}{1}{1}, \mathbf v_4 = \fourvec{0}{0}{0}{1}\text{.} \end{equation*}$

Explain why $\mathcal{B}=\{\mathbf v_1,\mathbf v_2,\mathbf v_3,\mathbf v_4\}$ forms a basis for $\mathbb R^4\text{.}$
Explain how to convert $\{{\mathbf x}\}_{\mathcal{B}}\text{,}$ the representation of a vector $\mathbf x$ in the coordinates defined by $\mathcal{B}\text{,}$ into $\mathbf x\text{,}$ its representation in the standard coordinate system.
Explain how to convert the vector $\mathbf x$ into, $\{{\mathbf x}\}_{\mathcal{B}}\text{,}$ its representation in the coordinate system defined by $\mathcal{B}\text{.}$
If $\mathbf x=\fourvec{23}{12}{10}{19}\text{,}$ find $\{{\mathbf x}\}_{\mathcal{B}}\text{.}$
If $\{{\mathbf x}\}_{\mathcal{B}}=\fourvec{3}{1}{-3}{-4}\text{,}$ find $\mathbf x\text{.}$

4

Consider the following vectors in $\mathbb R^3\text{:}$

$\begin{equation*} \mathbf v_1=\threevec{1}{3}{2}, \mathbf v_2=\threevec{0}{1}{4}, \mathbf v_3=\threevec{-2}{-5}{0}, \mathbf v_4=\threevec{-2}{-1}{-1}, \mathbf v_5=\threevec{1}{-2}{-1}\text{.} \end{equation*}$

Do these vectors form a basis for $\mathbb R^3\text{?}$ Explain your thinking.
Find a subset of these vectors that forms a basis of $\mathbb R^3\text{.}$
Suppose you have a set of vectors $\mathbf v_1, \mathbf v_2,\ldots,\mathbf v_6$ in $\mathbb R^4$ such
$\begin{equation*} \left[\begin{array}{rrrr} \mathbf v_1 & \mathbf v_2 & \ldots & \mathbf v_6 \end{array}\right] \sim \left[\begin{array}{rrrrrr} 1 & 0 & -2 & 0 & 1 & 0 \\ 0 & 1 & 3 & 0 & -4 & 0 \\ 0 & 0 & 0 & 1 & 2 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \\ \end{array}\right]\text{.} \end{equation*}$

Find a subset of the vectors that span $\mathbb R^4\text{.}$

5

This exercise involves a simple Fourier transform, which will play an important role in the next section.

Suppose that we have the vectors

$\begin{equation*} \mathbf v_1=\threevec{1}{1}{1}, \mathbf v_2=\threevec{\cos\left(\frac\pi6\right)} {\cos\left(\frac{3\pi}6\right)} {\cos\left(\frac{5\pi}6\right)} \mathbf v_3=\threevec{\cos\left(\frac{2\pi}6\right)} {\cos\left(\frac{6\pi}6\right)} {\cos\left(\frac{10\pi}6\right)}\text{.} \end{equation*}$

Explain why $\mathcal{B}=\{\mathbf v_1,\mathbf v_2,\mathbf v_3\}$ is a basis for $\mathbb R^3\text{.}$
If $\mathbf x=\threevec{15}{15}{15}\text{,}$ find $\{{\mathbf x}\}_{\mathcal{B}}\text{.}$
Find the matrices $C_{\mathcal{B}}$ and $C_{\mathcal{B}}^{-1}\text{.}$ If $\mathbf x=\threevec{x_1}{x_2}{x_3}$ and $\{{\mathbf x}\}_{\mathcal{B}} = \threevec{c_1}{c_2}{c_3}\text{,}$ explain why $c_1$ is the average of $x_1\text{,}$ $x_2\text{,}$ and $x_3\text{.}$

6

Determine whether the following statements are true or false and provide a justification for your response.

If the columns of a matrix $A$ form a basis for $\mathbb R^m\text{,}$ then $A$ is invertible.
There must be 125 vectors in a basis for $\mathbb R^{125}\text{.}$
If $\mathcal{B}=\{\mathbf v_1,\mathbf v_2,\ldots,\mathbf v_n\}$ is a basis of $\mathbb R^m\text{,}$ then every vector in $\mathbb R^m$ can be expressed as a linear combination of basis vectors.
The coordinates $\{{\mathbf x}\}_{\mathcal{B}}$ are the weights that form $\mathbf x$ as a linear combination of basis vectors.
If the basis vectors form the columns of the matrix $C_{\mathcal{B}}\text{,}$ then $\{{\mathbf x}\}_{\mathcal{B}} = C_{\mathcal{B}}\mathbf x\text{.}$

7

Provide a justification for your response to each of the following questions.

Suppose you have $m$ linearly independent vectors in $\mathbb R^m\text{.}$ Can you guarantee that they form a basis of $\mathbb R^m\text{?}$
If $A$ is an invertible $m\times m$ matrix, do the columns necessarily form a basis of $\mathbb R^m\text{?}$
Suppose we have an invertible $m\times m$ matrix $A\text{,}$ and we perform a sequence of row operations on $A$ to form a matrix $B\text{.}$ Can you guarantee that the columns of $B$ form a basis for $\mathbb R^m\text{?}$

8

Crystallographers find it convenient to use coordinate systems that are adapted to the specific geometry of a crystal. As a two-dimensional example, consider a layer of graphite in which carbon atoms are arranged in regular hexagons to form the crystalline structure shown in Figure 3.2.12.

Figure 3.2.12. A layer of carbon atoms in a graphite crystal.

The origin of the coordinate system is at the carbon atom labeled by “0”. It is convenient to choose the basis $\mathcal{B}$ defined by the vectors $\mathbf v_1$ and $\mathbf v_2$ and the coordinate system it defines.

Locate the points x for which
1. $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{1}{0}\text{,}$
2. $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{0}{1}\text{,}$
3. $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{2}{1}\text{.}$
Find the coordinates $\{{\mathbf x}\}_{\mathcal{B}}$ for all the carbon atoms in the hexagon whose lower left vertex is labeled “0”.
What are the coordinates $\{{\mathbf x}\}_{\mathcal{B}}$ of the center of that hexagon, which is labeled “C”?
How do the coordinates of the atoms in the hexagon whose lower left corner is labeled “1” compare to the coordinates in the hexagon whose lower left corner is labeled "0"?
Does the point $\mathbf x$ whose coordinates are $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{16}{4}$ correspond to a carbon atom or the center of a hexagon?

9

Suppose that $A=\left[\begin{array}{rr} 2 & 1 \\ 1& 2 \\ \end{array}\right]$ and

$\begin{equation*} \mathbf v_1=\twovec{1}{1}, \mathbf v_2=\twovec{1}{-1}\text{.} \end{equation*}$

Explain why $\mathcal{B}=\{\mathbf v_1,\mathbf v_2\}$ is a basis for $\mathbb R^2\text{.}$
Find $A\mathbf v_1$ and $A\mathbf v_2\text{.}$
Use what you found in the previous part of this problem to find $\{{A\mathbf v_1}{\mathcal{B}}$ and $\{{A\mathbf v_2}{\mathcal{B}}\text{.}$
If $\{{\mathbf x}\}_{\mathcal{B}} = \twovec{1}{-5}\text{,}$ find $\{{A\mathbf x}{\mathcal{B}} \text{.}$
Find a matrix $D$ such that $\{{A\mathbf x}{\mathcal{B}} = D\{{\mathbf x}\}_{\mathcal{B}}\text{.}$

You should find that the matrix $D$ is a very simple matrix, which means that this basis $\mathcal{B}$ is well suited to study the effect of multiplication by $A\text{.}$ This observation is the central idea of the next chapter.