9.5: The Gram-Schmidt Orthogonalization procedure

Last updated

Mar 5, 2021


We now come to a fundamentally important algorithm, which is called the Gram-Schmidt orthogonalization procedure. This algorithm makes it possible to construct, for each list of linearly independent vectors (resp. basis), a corresponding orthonormal list (resp. orthonormal basis).

Theorem 9.5.1

If $(v_{1}, \dots, v_{m})$ is a list of linearly independent vectors in $V$ , then there exists an orthonormal list $(e_{1}, \dots, e_{m})$ such that

$\begin{matrix} (9.5.1) & \operatorname{span}(v_{1}, \dots, v_{k}) = \operatorname{span}(e_{1}, \dots, e_{k}) \quad \text{for all } k = 1, \dots, m. \end{matrix}$

Proof

The proof is constructive, that is, we will actually construct vectors $e_{1}, \dots, e_{m}$ having the desired properties. Since $(v_{1}, \dots, v_{m})$ is linearly independent, $v_{k} \neq 0$ for each $k = 1, 2, \dots, m$ . Set $e_{1} = \frac{v_{1}}{‖ v_{1} ‖}$ . Then $e_{1}$ is a vector of norm 1 and satisfies Equation (9.5.1) for $k = 1$ . Next, set

$e_{2} = \frac{v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1}}{‖ v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1} ‖} .$

This is, in fact, the normalized version of the orthogonal decomposition in Equation (9.3.1); that is, $e_{2} = w / ‖ w ‖$ with

$w = v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1},$

where $w ⊥ e_{1}$. Note that $‖ e_{2} ‖ = 1$ and $\operatorname{span}(e_{1}, e_{2}) = \operatorname{span}(v_{1}, v_{2})$.

Now, suppose that $e_{1}, \dots, e_{k - 1}$ have been constructed such that $(e_{1}, \dots, e_{k - 1})$ is an orthonormal list and $\operatorname{span}(v_{1}, \dots, v_{k - 1}) = \operatorname{span}(e_{1}, \dots, e_{k - 1})$. Then define
$e_{k} = \frac{v_{k} - ⟨ v_{k}, e_{1} ⟩ e_{1} - ⟨ v_{k}, e_{2} ⟩ e_{2} - \dots - ⟨ v_{k}, e_{k - 1} ⟩ e_{k - 1}}{‖ v_{k} - ⟨ v_{k}, e_{1} ⟩ e_{1} - ⟨ v_{k}, e_{2} ⟩ e_{2} - \dots - ⟨ v_{k}, e_{k - 1} ⟩ e_{k - 1} ‖} .$

Since $(v_{1}, \dots, v_{k})$ is linearly independent, we know that $v_{k} \notin \operatorname{span}(v_{1}, \dots, v_{k - 1})$. Hence, we also know that $v_{k} \notin \operatorname{span}(e_{1}, \dots, e_{k - 1})$. It follows that the norm in the definition of $e_{k}$ is not zero, and so $e_{k}$ is well-defined (i.e., we are not dividing by zero). Note that a vector divided by its norm has norm 1, so that $‖ e_{k} ‖ = 1$. Furthermore,

$\begin{aligned} ⟨ e_{k}, e_{i} ⟩ & = ⟨ \frac{v_{k} - ⟨ v_{k}, e_{1} ⟩ e_{1} - ⟨ v_{k}, e_{2} ⟩ e_{2} - \dots - ⟨ v_{k}, e_{k - 1} ⟩ e_{k - 1}}{‖ v_{k} - ⟨ v_{k}, e_{1} ⟩ e_{1} - ⟨ v_{k}, e_{2} ⟩ e_{2} - \dots - ⟨ v_{k}, e_{k - 1} ⟩ e_{k - 1} ‖}, e_{i} ⟩ \\ & = \frac{⟨ v_{k}, e_{i} ⟩ - ⟨ v_{k}, e_{i} ⟩}{‖ v_{k} - ⟨ v_{k}, e_{1} ⟩ e_{1} - ⟨ v_{k}, e_{2} ⟩ e_{2} - \dots - ⟨ v_{k}, e_{k - 1} ⟩ e_{k - 1} ‖} = 0, \end{aligned}$

for each $1 \leq i < k$ . Hence, $(e_{1}, \dots, e_{k})$ is orthonormal.

From the definition of $e_{k}$, we see that $v_{k} \in \operatorname{span}(e_{1}, \dots, e_{k})$. Together with the assumption $\operatorname{span}(v_{1}, \dots, v_{k - 1}) = \operatorname{span}(e_{1}, \dots, e_{k - 1})$, this gives $\operatorname{span}(v_{1}, \dots, v_{k}) \subset \operatorname{span}(e_{1}, \dots, e_{k})$. Since both lists $(e_{1}, \dots, e_{k})$ and $(v_{1}, \dots, v_{k})$ are linearly independent, they span subspaces of the same dimension $k$, and therefore the two subspaces coincide. Hence Equation (9.5.1) holds.

$◻$
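
The construction in the proof translates almost line by line into code. The following is a minimal sketch, assuming vectors in $\mathbb{R}^{n}$ with the standard dot product as inner product; the names `inner`, `gram_schmidt`, and the tolerance `tol` are illustrative choices, not part of the text.

```python
import math


def inner(u, v):
    """Standard dot product on R^n (an assumption; the text allows any inner product)."""
    return sum(a * b for a, b in zip(u, v))


def gram_schmidt(vectors, tol=1e-12):
    """Return an orthonormal list (e_1, ..., e_m) built from a linearly independent list."""
    basis = []
    for v in vectors:
        # Numerator of e_k: subtract from v_k its components along e_1, ..., e_{k-1}.
        w = list(v)
        for e in basis:
            c = inner(v, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = math.sqrt(inner(w, w))
        # A (numerically) zero norm means v_k lies in the span of the earlier vectors.
        if norm < tol:
            raise ValueError("input list is not linearly independent")
        basis.append([wi / norm for wi in w])
    return basis
```

The zero-norm check mirrors the step in the proof showing that $e_{k}$ is well-defined: for a linearly independent input it never triggers in exact arithmetic, and the floating-point threshold is only a practical stand-in for that argument.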

Example 9.5.2

Take $v_{1} = (1, 1, 0)$ and $v_{2} = (2, 1, 1)$ in $\mathbb{R}^{3}$. The list $(v_{1}, v_{2})$ is linearly independent (as you should verify!). To illustrate the Gram-Schmidt procedure, we begin by setting
$e_{1} = \frac{v_{1}}{‖ v_{1} ‖} = \frac{1}{\sqrt{2}} (1, 1, 0) .$
Next, set
$e_{2} = \frac{v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1}}{‖ v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1} ‖} .$
The inner product is $⟨ v_{2}, e_{1} ⟩ = \frac{1}{\sqrt{2}} ⟨ (2, 1, 1), (1, 1, 0) ⟩ = \frac{3}{\sqrt{2}}$,
so
$u_{2} = v_{2} - ⟨ v_{2}, e_{1} ⟩ e_{1} = (2, 1, 1) - \frac{3}{2} (1, 1, 0) = \frac{1}{2} (1, - 1, 2) .$
Calculating the norm of $u_{2}$ , we obtain $‖ u_{2} ‖ = \sqrt{\frac{1}{4} (1 + 1 + 4)} = \frac{\sqrt{6}}{2}$ .
Hence, normalizing this vector, we obtain
$e_{2} = \frac{u_{2}}{‖ u_{2} ‖} = \frac{1}{\sqrt{6}} (1, - 1, 2) .$
The list $(e_{1}, e_{2})$ is therefore orthonormal and has the same span as $(v_{1}, v_{2})$.
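
The arithmetic in this example can be checked numerically. The short script below is a sketch that repeats the two steps with floating-point numbers, assuming the standard dot product on $\mathbb{R}^{3}$; the helper names `dot` and `normalize` are illustrative.

```python
import math


def dot(u, v):
    """Standard dot product on R^3."""
    return sum(a * b for a, b in zip(u, v))


def normalize(u):
    """Divide a nonzero vector by its norm."""
    n = math.sqrt(dot(u, u))
    return tuple(a / n for a in u)


v1 = (1.0, 1.0, 0.0)
v2 = (2.0, 1.0, 1.0)

e1 = normalize(v1)                              # (1/sqrt(2)) (1, 1, 0)
c = dot(v2, e1)                                 # <v2, e1> = 3/sqrt(2)
u2 = tuple(a - c * b for a, b in zip(v2, e1))   # (1/2) (1, -1, 2)
e2 = normalize(u2)                              # (1/sqrt(6)) (1, -1, 2)

print("e1 =", e1)
print("e2 =", e2)
print("<e1, e2> =", dot(e1, e2))
```

Up to rounding error, the printed inner product is zero and both vectors have norm 1, in agreement with the hand computation.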

Corollary 9.5.3

Every finite-dimensional inner product space has an orthonormal basis.

Proof

Let $(v_{1}, \dots, v_{n})$ be any basis for $V$. This list is linearly independent and spans $V$. Apply the Gram-Schmidt procedure to this list to obtain an orthonormal list $(e_{1}, \dots, e_{n})$, which still spans $V$ by construction. By Proposition 9.4.2, this list is linearly independent and hence a basis of $V$.

Corollary 9.5.4

Every orthonormal list of vectors in $V$ can be extended to an orthonormal basis of $V$ .

Proof

Let $(e_{1}, \dots, e_{m})$ be an orthonormal list of vectors in $V$. By Proposition 9.4.2, this list is linearly independent and hence can be extended to a basis $(e_{1}, \dots, e_{m}, v_{1}, \dots, v_{k})$ of $V$ by the Basis Extension Theorem. Now apply the Gram-Schmidt procedure to obtain a new orthonormal basis $(e_{1}, \dots, e_{m}, f_{1}, \dots, f_{k})$. The first $m$ vectors do not change since they are already orthonormal. The list still spans $V$ and is linearly independent by Proposition 9.4.2, and therefore forms a basis.
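
For $V = \mathbb{R}^{n}$ with the standard dot product, the extension can be sketched in code. Instead of first invoking the Basis Extension Theorem, the variant below tries the standard basis vectors one at a time and keeps a candidate only if it has a nonzero component orthogonal to the vectors collected so far. This is an illustrative shortcut under those assumptions, not the argument of the proof; the function name and the tolerance `tol` are hypothetical.

```python
import math


def dot(u, v):
    """Standard dot product on R^n."""
    return sum(a * b for a, b in zip(u, v))


def extend_to_orthonormal_basis(ortho, n, tol=1e-10):
    """Extend an orthonormal list in R^n to an orthonormal basis of R^n."""
    basis = [list(e) for e in ortho]   # the given vectors are kept unchanged
    for j in range(n):
        if len(basis) == n:
            break
        # Candidate: the j-th standard basis vector.
        cand = [1.0 if i == j else 0.0 for i in range(n)]
        # Gram-Schmidt step: remove the components along the current list.
        w = list(cand)
        for e in basis:
            c = dot(cand, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = math.sqrt(dot(w, w))
        # Skip candidates that already lie (numerically) in the current span.
        if norm > tol:
            basis.append([wi / norm for wi in w])
    return basis
```

Calling it with the two vectors $e_{1}, e_{2}$ from Example 9.5.2 and `n = 3` returns those two vectors followed by one additional unit vector orthogonal to both.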

Recall Theorem 7.5.3: given an operator $T \in \mathcal{L}(V, V)$ on a complex vector space $V$, there exists a basis $B$ for $V$ such that the matrix $M (T)$ of $T$ with respect to $B$ is upper triangular. We would like to extend this result to require the additional property of orthonormality.

Corollary 9.5.5

Let $V$ be an inner product space over $\mathbb{F}$ and $T \in \mathcal{L}(V, V)$. If $T$ is upper-triangular with respect to some basis, then $T$ is upper-triangular with respect to some orthonormal basis.

Proof

Let $(v_{1}, \dots, v_{n})$ be a basis of $V$ with respect to which $T$ is upper-triangular. Apply the Gram-Schmidt procedure to obtain an orthonormal basis $(e_{1}, \dots, e_{n})$ , and note that

$\begin{matrix} (9.5.2) & \operatorname{span}(e_{1}, \dots, e_{k}) = \operatorname{span}(v_{1}, \dots, v_{k}) \quad \text{for all } 1 \leq k \leq n. \end{matrix}$

We proved before that $T$ is upper-triangular with respect to a basis $(v_{1}, \dots, v_{n})$ if and only if $\operatorname{span}(v_{1}, \dots, v_{k})$ is invariant under $T$ for each $1 \leq k \leq n$. Since these spans are unchanged by the Gram-Schmidt procedure, $T$ is still upper-triangular with respect to the corresponding orthonormal basis.
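
A small numerical illustration of this corollary, offered as a sketch under stated assumptions: the matrix `T` and the basis `vs` below are made-up examples on $\mathbb{R}^{3}$ with the standard dot product, chosen so that $T v_{1} = v_{1}$, $T v_{2} = v_{1} + 2 v_{2}$, and $T v_{3} = v_{2} + 3 v_{3}$. Hence $T$ is upper-triangular with respect to $(v_{1}, v_{2}, v_{3})$, although its matrix in the standard basis is not.

```python
import math


def dot(u, v):
    """Standard dot product on R^n."""
    return sum(a * b for a, b in zip(u, v))


def gram_schmidt(vectors):
    """Orthonormalize a linearly independent list (no independence check here)."""
    basis = []
    for v in vectors:
        w = list(v)
        for e in basis:
            c = dot(v, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = math.sqrt(dot(w, w))
        basis.append([wi / norm for wi in w])
    return basis


def apply(T, x):
    """Apply the matrix T (given as a list of rows) to the vector x."""
    return [dot(row, x) for row in T]


def matrix_in_basis(T, es):
    """Matrix of T in an orthonormal basis: entry (i, k) is <T e_k, e_i>."""
    n = len(es)
    return [[dot(apply(T, es[k]), es[i]) for k in range(n)] for i in range(n)]


# Hypothetical example data (not from the text).
T = [[2, -1, 2], [1, 0, 1], [-2, 2, 4]]
vs = [[1, 1, 0], [2, 1, 1], [0, 0, 1]]

M = matrix_in_basis(T, gram_schmidt(vs))
below_diagonal = [M[i][k] for i in range(3) for k in range(i)]
print(below_diagonal)  # all entries vanish up to rounding error
```

The entries of `M` below the diagonal vanish up to rounding error, which is what the corollary predicts, since the Gram-Schmidt procedure leaves each $\operatorname{span}(v_{1}, \dots, v_{k})$ unchanged.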

Contributors

Both hardbound and softbound versions of this textbook are available online at WorldScientific.com.
