Search

Text Color

Margin Size

Font Type

Enable Dyslexic Font

38.3: Pseudoinverse

Last updated

Sep 17, 2022
Save as PDF
- 38.2: Finding the best solution in an overdetermined system
- 39: 20 In-Class Assignment - Least Squares Fit (LSF)

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\id}{\mathrm{id}}$ $\newcommand{\Span}{\mathrm{span}}$

( \newcommand{\kernel}{\mathrm{null}\,}\) $\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$ $\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$ $\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\id}{\mathrm{id}}$

$\newcommand{\Span}{\mathrm{span}}$

$\newcommand{\kernel}{\mathrm{null}\,}$

$\newcommand{\range}{\mathrm{range}\,}$

$\newcommand{\RealPart}{\mathrm{Re}}$

$\newcommand{\ImaginaryPart}{\mathrm{Im}}$

$\newcommand{\Argument}{\mathrm{Arg}}$

$\newcommand{\norm}[1]{\| #1 \|}$

$\newcommand{\inner}[2]{\langle #1, #2 \rangle}$

$\newcommand{\Span}{\mathrm{span}}$ $\newcommand{\AA}{\unicode[.8,0]{x212B}}$

$\newcommand{\vectorA}[1]{\vec{#1}} % arrow$

$\newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow$

$\newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vectorC}[1]{\textbf{#1}}$

$\newcommand{\vectorD}[1]{\overrightarrow{#1}}$

$\newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}$

$\newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}}$

$\newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }$

$\newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}}$

$\newcommand{\avec}{\mathbf a}$

$\newcommand{\bvec}{\mathbf b}$

$\newcommand{\cvec}{\mathbf c}$

$\newcommand{\dvec}{\mathbf d}$

$\newcommand{\dtil}{\widetilde{\mathbf d}}$

$\newcommand{\evec}{\mathbf e}$

$\newcommand{\fvec}{\mathbf f}$

$\newcommand{\nvec}{\mathbf n}$

$\newcommand{\pvec}{\mathbf p}$

$\newcommand{\qvec}{\mathbf q}$

$\newcommand{\svec}{\mathbf s}$

$\newcommand{\tvec}{\mathbf t}$

$\newcommand{\uvec}{\mathbf u}$

$\newcommand{\vvec}{\mathbf v}$

$\newcommand{\wvec}{\mathbf w}$

$\newcommand{\xvec}{\mathbf x}$

$\newcommand{\yvec}{\mathbf y}$

$\newcommand{\zvec}{\mathbf z}$

$\newcommand{\rvec}{\mathbf r}$

$\newcommand{\mvec}{\mathbf m}$

$\newcommand{\zerovec}{\mathbf 0}$

$\newcommand{\onevec}{\mathbf 1}$

$\newcommand{\real}{\mathbb R}$

$\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}$

$\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}$

$\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}$

$\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}$

$\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}$

$\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}$

$\newcommand{\laspan}[1]{\text{Span}\{#1\}}$

$\newcommand{\bcal}{\cal B}$

$\newcommand{\ccal}{\cal C}$

$\newcommand{\scal}{\cal S}$

$\newcommand{\wcal}{\cal W}$

$\newcommand{\ecal}{\cal E}$

$\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}$

$\newcommand{\gray}[1]{\color{gray}{#1}}$

$\newcommand{\lgray}[1]{\color{lightgray}{#1}}$

$\newcommand{\rank}{\operatorname{rank}}$

$\newcommand{\row}{\text{Row}}$

$\newcommand{\col}{\text{Col}}$

$\renewcommand{\row}{\text{Row}}$

$\newcommand{\nul}{\text{Nul}}$

$\newcommand{\var}{\text{Var}}$

$\newcommand{\corr}{\text{corr}}$

$\newcommand{\len}[1]{\left|#1\right|}$

$\newcommand{\bbar}{\overline{\bvec}}$

$\newcommand{\bhat}{\widehat{\bvec}}$

$\newcommand{\bperp}{\bvec^\perp}$

$\newcommand{\xhat}{\widehat{\xvec}}$

$\newcommand{\vhat}{\widehat{\vvec}}$

$\newcommand{\uhat}{\widehat{\uvec}}$

$\newcommand{\what}{\widehat{\wvec}}$

$\newcommand{\Sighat}{\widehat{\Sigma}}$

$\newcommand{\lt}{<}$

$\newcommand{\gt}{>}$

$\newcommand{\amp}{&}$

$\definecolor{fillinmathshade}{gray}{0.9}$

If you have already signed in, please refresh the page.

%matplotlib inline
import matplotlib.pylab as plt
import numpy as np
import sympy as sym
import time
sym.init_printing(use_unicode=True)

%matplotlib inline
import matplotlib.pylab as plt
import numpy as np
import sympy as sym
import time
sym.init_printing(use_unicode=True)

In this class we often talk about solving problems of the form:

$Ax = b \nonumber$

Currently we have determined that this problem becomes very nice when the $n \times n$ matrix $A$ has an inverse. We can easily multiply each side by the inverse:

$A^{-1}Ax = A^{-1}b \nonumber$

Since $A^{-1}A = I$ the solution for $x$ is simply:

$x = A^{-1}b \nonumber$

Now, let us consider a a more general problem where the $m \times n$ matrix $A$ is not square, i.e. $m \neq n$ and its rank $r$ maybe less than $m$ and/or $n$ . In this case we want to find a Pseudoinverse (which we denote as $A^+$ ) which acts like an inverse for a non-square matrix. In other words we want to find an $A^+$ for $A$ such that:

$A^+A \approx I \nonumber$

Assuming we can find the $n \times m$ matrix $A^+$ , we should then be able to solve for $x$ as follows:

$Ax = b \nonumber$

$A^+Ax = A^+b \nonumber$

$x \approx A^+b \nonumber$

How do we know there is a Pseudoinverse

Assuming the general case of a $m \times n$ matrix $A$ where its rank $r$ maybe less than $m$ and/or $n$ ( $r\leq m$ and $r\leq n$ ). We can conclude the following about the fundamental spaces of $A$ :

The rowspace of $A$ is in $R^n$ with dimension $r$
The columnspace of $A$ is in $R^m$ also with dimension $r$ .
The nullspace of $A$ is in $R^n$ with dimension $n−r$
The nullspace of $A^{\top}$ is in $R^m$ with dimension $m−r$ .

Because the rowspace of $A$ and the column space $A$ have the same dimension then $A$ is a the one-to-one mapping from the row space to the columnspace. In other words:

For any $x$ in the rowspace, we have that $Ax$ is one point in the columnspace. If $x'$ is another point in the row space different from $x$ , we have $Ax\neq Ax'$ (The mapping is one-to-one).
For any $y$ in the columnspace, we can find $x$ in the rowspace such that $Ax=y$ (The mapping is onto).

The above is not really a proof but hopefully there is sufficient information to convince yourself that this is true.

How to compute pseudoinverse

We want to find the $n \times m$ matrix that maps from columnspace to the rowspace of $A$ , and $x=A^+Ax$ , if $x$ is in the rowspace.

Let's apply SVD on A: A=UΣV⊤, where U is a m×m matrix, V⊤ is a n×n matrix, and Σ is a diagonal m×n matrix. We can decompose the matrices as A=[⋮⋮U1U2⋮⋮][Σ1000][⋯V⊤1⋯⋯V⊤2⋯]. Here U1 is of m×r, U2 is of m×(m−r), Σ1 is of r×r, V⊤1 is of r×n, and V⊤2 is of (n−r)×n.
- The columnspace of $U_1$ is the columnspace of $A$ , and columnspace of $U_2$ is the nullspace of $A^{\top}$ .
- The rowspace of $V_1$ is the rowspace of $A$ , and rowspace of $V_2$ is the nullspace of $A$ .
If x is in the rowspace of A, we have that V⊤2x=0. We have Ax=U1Σ1V⊤1x.
- If we define a matrix $B=V_1\Sigma_1^{-1}U_1^\top$ , we have that $BAx=V_1\Sigma_1^{-1}U_1^\top U_1\Sigma_1 V_1^\top x=V_1V_1^\top x$ . That is $BAx=x$ is $x$ is in the rowspace of $A$ .
The matrix $B$ is the pseudoinverse of matrix $A$ . $A^+ = V_1\Sigma_1^{-1}U_1^\top$ $A^+ = \begin{bmatrix}\vdots & \vdots \\ V_1 & V_2 \\ \vdots &\vdots\end{bmatrix} \begin{bmatrix}\Sigma_1^{-1} & 0 \\ 0 & 0\end{bmatrix} \begin{bmatrix}\cdots & U_1^\top & \cdots \\ \cdots & U_2^\top &\cdots \end{bmatrix}$ .

Example

Let $A=[1,2]$ , we know that $r=m=1$ and $n=2$ .

If you have already signed in, please refresh the page.

A = np.matrix([[1,2]])

A = np.matrix([[1,2]])

Todo

Calculate the pseudoinverse $A^+$ of $A$ using the numpy.linalg function pinv:

If you have already signed in, please refresh the page.

#put your code here

#put your code here

Do This

Compute $AA^{+}$ and $A^{+}A$

If you have already signed in, please refresh the page.

#put your code here

#put your code here

Question

If $x$ is in the nullspace of $A$ what is the effect of $A^{+}Ax$ ?

Question

If $x$ is in the rowspace of $A$ what is the effect of $A^{+}Ax$ ?

Left inverse is pseudoinverse

We can compute the left inverse of $A$ if $r = n \leq m$ . In this case, we may have more rows than columns, and the matrix $A$ has full column rank.

In this case, the SVD of $A$ is $A = U \Sigma V^{\top}$ . Here $U$ is of $m \times n$ , $\Sigma$ is of $n \times n$ and nonsingular, $V^{\top}$ is of $n \times n$ . The pseudoinverse of $A$ is $A^+ = V\Sigma^{-1}U^\top$ .

The left inverse of $A$ is $(A^\top A)^{-1}A^\top= (V\Sigma U^\top U\Sigma V^\top )^{-1} V\Sigma U^\top = V(\Sigma \Sigma )^{-1} V^\top V\Sigma U^\top = V\Sigma ^{-1} U^\top =A^+$ .

Example

Let $A=\begin{bmatrix}1\\2\end{bmatrix}$ , we know that $r=n=1$ and $m=2$ . Then we have the left inverse.

If you have already signed in, please refresh the page.

A = np.matrix([[1],[2]])
A

A = np.matrix([[1],[2]])
A

Do This

Calculate the pseudoinverse $A^{\top}$ of $A$ .

Do This

Calculate the left inverse of $A$ , and verify that it is the same as $A^{\top}$ .

How do we know there is a Pseudoinverse

How to compute pseudoinverse

Example

Todo

Do This

Question

Question

Left inverse is pseudoinverse

Example

Do This

Do This

Support Center

How can we help?