14.3: Fundamental Theorem of Affine Geometry
- Page ID
- 23670
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Further we assume knowledge of vector algebra, namely multiplication by a real number, addition and the parallelogram rule.
Show that affine transformations map parallelograms to parallelograms. Conclude that if \(P\mapsto P'\) is an affine transformation, then
\(\overrightarrow{XY}=\overrightarrow{AB}\), \(\text{if and only if}\) \(\overrightarrow{X'Y'} =\overrightarrow{A'B'}.\)
- Hint
-
Apply twice Exercise 14.1.1 and the parallelogram rule. (Note that the case if \(A, B, X\), and \(Y\) are collinear requires extra work.)
Let \(P \mapsto P'\) be an affine transformation of the Euclidean plane. Then, for any triple of points \(O\), \(X\), \(P\), we have
\[\overrightarrow{OP}=t \cdot \overrightarrow{OX} \ \ \ \ \text{if and only if}\ \ \ \ \overrightarrow{O'P'} =t \cdot \overrightarrow{O'X'}.\]
- Proof
-
Observe that the affine transformations described in Exercise 14.1.2 as well as all motions satisfy the condition 14.3.1. Therefore a given affine transformation \(P \mapsto P'\) satisfies 14.3.1 if and only if its composition with motions and scalings satisfies 14.3.1.
Applying this observation, we can reduce the problem to its partial case. Namely, we may assume that \(O'=O\), \(X'=X\), the point \(O\) is the origin of a coordinate system, and \(X\) has coordinates \((1,0)\).
In this case, \(\overrightarrow{OP}=t\cdot \overrightarrow{OX}\) if and only if \(P=(t,0)\). Since \(O\) and \(X\) are fixed, the transformation maps the \(x\)-axis to itself. That is, \(P'=(f(t),0)\) for some function \(t\mapsto f(t)\), or, equivalently, \(\overrightarrow{O'P'}=f(t)\cdot \overrightarrow{O'X'}\). It remains to show that
\[f(t)=t \]
for any \(t\).
Since \(O'=O\) and \(X'=X\), we get that \(f(0)=0\) and \(f(1)=1\). Further, according to Exercise 14.2.2, we have that \(f(x\cdot y)=f(x)\cdot f(y)\) and \(f(x+y)=f(x)+f(y)\) for any \(x,y \in \mathbb{R}\). By the algebraic lemma (proved below, see Lemma 14.4.1), these conditions imply 14.3.2.
Suppose an affine transformation maps a nondegenerate triangle \(OXY\) to a triangle \(O'X'Y'\). Then \(\triangle O'X'Y'\) is nondegenerate, and
\(\overrightarrow{OP}=x \cdot \overrightarrow{OX} + y \cdot \overrightarrow{OY} \ \ \ \text{if and only if}\ \ \ \ \overrightarrow{O'P'} =x\cdot\overrightarrow{O'X'}+y\cdot\overrightarrow{O'Y'}.\)
- Proof
-
Since an affine transformation maps lines to lines, the triangle \(O'X'Y'\) is nondegenerate.
If \(x=0\) or \(y=0\), then the second statement follows directly from the proposition. Otherwise consider points \(V\) and \(W\) defined by
\(\overrightarrow{OV} = x \cdot \overrightarrow{OX}, \ \ \ \overrightarrow{OW} = y \cdot \overrightarrow{OY}.\)
By the proposition,
\(\overrightarrow{O'V'}=x\cdot\overrightarrow{O'X'}, \ \ \ \overrightarrow{O'W'} =y \cdot \overrightarrow{O'Y'}.\)
Note that
\(\overrightarrow{OP}=\overrightarrow{OV}+\overrightarrow{OW},\)
or, equivalently, \(\square OVPW\) is a parallelogram. According to Exercise 14.1.1, \(\square O'V'P'W'\) is a parallelogram as well. Therefore
\(\begin{array} {rcl} {\overrightarrow{O'P'}} & = & {\overrightarrow{O'V'} + \overrightarrow{O'W'} =} \\ {} & = & {x \cdot \overrightarrow{O'X'} + y \cdot \overrightarrow{O'Y'}} \end{array}\)
Show that any affine transformation is continuous.
- Hint
-
Fix a coordinate system and apply the fundamental theorem of affine geometry (Theorem \(\PageIndex{1}\)) for the points \(O = (0, 0), X = (1, 0)\) and \(Y = (0, 1)\).
The following exercise provides the converse to Exercise Exercise 14.1.2(d).
Show that any affine transformation can be written in coordinates as
\((x,y)\mapsto(a\cdot x+b\cdot y+r,c\cdot x+d\cdot y+s)\)
for constants \(a,b,c,d,r,s\) such that the matrix \((\begin{smallmatrix}a&b\\c&d\end{smallmatrix})\) is invertible.
- Hint
-
Fix a coordinate system and apply the fundamental theorem of affine geometry (Theorem \(\PageIndex{1}\)) for the points \(O = (0, 0), X = (1, 0)\) and \(Y = (0, 1)\).