12.5: Special operations on matrices
- Page ID
- 298
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)In this section, we define three important operations on matrices called the transpose, conjugate transpose, and the trace. These will then be seen to interact with matrix multiplication and invertibility in order to form special classes of matrices that are extremely important to applications of Linear Algebra.
A.5.1 Transpose and conjugate transpose
Given positive integers \(m, n \in \mathbb{Z}_+\) and any matrix \(A = (a_{ij} ) \in \mathbb{F}^{m \times n} ,\) we define the transpose \(A^T = ((a^T )_{ij} ) \in \mathbb{F}^{n \times m}\) and the conjugate transpose \(A^{\ast} = ((a^{\ast} )_{ij} ) \in \mathbb{F}^{n \times m}\) by
\[(a^T )_{ij} = a_{ji} \rm{~and~} (a^{\ast} )_{ij} = \overline{a_{ji}} , \]
where \(\overline{a_{ji}}\) denotes the complex conjugate of the scalar \(a_{ji} \in \mathbb{F}.\) In particular, if \(A \in \mathbb{R}^{m \times n} \),then note that \(A^T = A^{\ast} .\)
Example \(\PageIndex{1}\):
With notation as in Example A.1.3,
\[ A^T= \left[ \begin{array}{ccc} 3 & -1 & 1 \end{array} \right],B^T=\left[ \begin{array}{cc} 4 & 0 \\ -1 & 2 \end{array} \right], C^T = \left[ \begin{array}{c} 1 \\ 4 \\ 2 \end{array} \right],\\ D^T= \left[ \begin{array}{ccc} 1 & -1 & 3 \\ 5 & 0 & 2 \\ 2 & 1 & 4 \end{array} \right], E^T = \left[ \begin{array}{ccc} 6 & -1 & 4 \\ 1 & 1 & 1 \\3 & 2 & 3 \end{array} \right]. \]
One of the motivations for defining the operations of transpose and conjugate transpose is that they interact with the usual arithmetic operations on matrices in a natural manner. We summarize the most fundamental of these interactions in the following theorem.
Theorem A.5.2. Given positive integers \(m, n \in \mathbb{Z}_+\) and any matrices \(A, B \in \mathbb{F}^{m \times n} ,\)
- \((A^T )^T = A {\it{~and~}} (A^{\ast} )^{\ast} = A.\)
- \((A + B)^T = A^T + B^T {\it{~and~}} (A + B)^{\ast} = A^{\ast} + B^{\ast} .\)
- \((\alpha A)^T = \alpha A^T {\it{~and~}} (\alpha A)^{\ast} = \alpha A^{\ast} ,\) where \(\alpha \in \mathbb{F}\) is any scalar.
- \((AB)^T = B^T A^T .\)
- if \(m = n\) and \(A \in GL(n, \mathbb{F})\), then \(A^T , A^* \in GL(n, \mathbb{F})\) with respective inverses given by
\[(A^T )^{-1} = (A^{-1} )^T {\it{~and~}} (A^* )^{-1} = (A^{-1} )^* .\]
Another motivation for defining the transpose and conjugate transpose operations is that they allow us to define several very special classes of matrices.
Definition A.5.3. Given a positive integer \(n \in \mathbb{Z}_+\) , we say that the square matrix \(A \in \mathbb{F}^{n \times n}\)
- is symmetric if \(A = A^T .\)
- is Hermitian if \(A = A^* .\)
- is orthogonal if \(A \in GL(n, \mathbb{R})\) and \(A^{-1} = A^T .\) Moreover, we define the (real) orthogonal group to be the set \(O(n) = \{A \in GL(n, \mathbb{R})~ |~ A^{-1} = A^T \}.\)
- is unitary if \(A \in GL(n, \mathbb{C})\) and \(A^{-1} = A^* \). Moreover, we define the (complex)
unitary group to be the set \(U(n) = \{A \in GL(n, \mathbb{C}) ~|~ A^{-1 }= A^* \}.\)
A lot can be said about these classes of matrices. Both \(O(n)\) and \(U(n)\), for example, form a group under matrix multiplication. Additionally, real symmetric and complex Hermitian matrices always have real eigenvalues. Moreover, given any matrix \(A \in \mathbb{R}^{m \times n} , AA^T\) is a symmetric matrix with real, non-negative eigenvalues. Similarly, for \(A \in \mathbb{C}^{m \times n} , AA^*\) is Hermitian with real, non-negative eigenvalues.
A.5.2 The trace of a square matrix
Given a positive integer \(n \in \mathbb{Z}_+\) and any square matrix \(A = (a_{ij} ) \in \mathbb{F}^{n \times n} ,\) we define the trace of \(A\) to be the scalar
\[trace(A) = \sum_{k=1}^n a_{kk} \in \mathbb{F}.\]
Example \(\PageIndex{2}\)
With notation as in Example A.1.3 above,
\[trace(B) = 4 + 2 = 6, trace(D) = 1 + 0 + 4 = 5, {\it{~and~}} trace(E) = 6 + 1 + 3 = 10.\]
Note, in particular, that the traces of \(A\) and \(C\) are not defined since these are not square matrices.
We summarize some of the most basic properties of the trace operation in the following theorem, including its connection to the transpose operations defined in the previous section.
Theorem A.5.5. Given positive integers \(m, n \in \mathbb{Z}_+\) and square matrices \(A, B \in \mathbb{F}^{n \times n} ,\)
- \(trace(\alpha A) = \alpha trace(A),\) for any scalar \(\alpha \in \mathbb{F}.\)
- \(trace(A + B) = trace(A) + trace(B).\)
- \(trace(A^T ) = trace(A) {\it{~and~}} trace(A^* ) = \overline{trace(A)}.\)
- \(trace(AA^* ) = \sum_{k=1}^{n} \sum_{l=1}^{n} |a_{kl} |^2 \). In particular, \(trace(AA^* ) = 0\) if and only if \(A = 0^{n \times n}\) .
- \(trace(AB) = trace(BA).\) More generally, given matrices \(A_1 , \ldots , A_m \in \mathbb{F}^{n \times n} ,\) the trace operation has the so-called cyclic property, meaning that \[trace(A_1 \cdots A_m ) = trace(A_2 \cdots A_m A_1 ) = \cdots = trace(A_m A_1 \cdots A_{m-1} ).\]
Moreover, if we define a linear map \(T : \mathbb{F}^n \rightarrow \mathbb{F}^n\) by setting \(T (v) = Av\) for each \(v \in \mathbb{F}^n\) and if \(T\) has distinct eigenvalues \(\lambda_1 , \ldots , \lambda_n \), then \(trace(A) = \sum_{k=1}^n \lambda_{k} .\)
Contributors
- Isaiah Lankham, Mathematics Department at UC Davis
- Bruno Nachtergaele, Mathematics Department at UC Davis
- Anne Schilling, Mathematics Department at UC Davis
Both hardbound and softbound versions of this textbook are available online at WorldScientific.com.