We’ll now begin our study of the homogeneous system
\[\label{eq:10.4.1} {\bf y}'=A{\bf y}, \]
where \(A\) is an \(n\times n\) constant matrix. Since \(A\) is continuous on \((-\infty,\infty)\), Theorem 10.2.1 implies that all solutions of Equation \ref{eq:10.4.1} are defined on \((-\infty,\infty)\). Therefore, when we speak of solutions of \({\bf y}'=A{\bf y}\), we’ll mean solutions on \((-\infty,\infty)\).
In this section we assume that all the eigenvalues of \(A\) are real and that \(A\) has a set of \(n\) linearly independent eigenvectors. In the next two sections we consider the cases where some of the eigenvalues of \(A\) are complex, or where \(A\) does not have \(n\) linearly independent eigenvectors.
In Example 10.3.2 we showed that the vector functions
but we did not show how we obtained \({\bf y}_1\) and \({\bf y}_2\) in the first place. To see how these solutions can be obtained we write Equation \ref{eq:10.4.2} as
The trivial solution \(x_1=x_2=0\) of this system isn’t useful, since it corresponds to the trivial solution \(y_1\equiv y_2\equiv0\) of Equation \ref{eq:10.4.3}, which can’t be part of a fundamental set of solutions of Equation \ref{eq:10.4.2}. Therefore we consider only those values of \(\lambda\) for which Equation \ref{eq:10.4.5} has nontrivial solutions. These are the values of \(\lambda\) for which the determinant of Equation \ref{eq:10.4.5} is zero; that is,
which implies that \(x_1=-x_2/2\), where \(x_2\) can be chosen arbitrarily. Choosing \(x_2=2\) yields the solution \(y_1=-e^{2t}\), \(y_2=2e^{2t}\) of Equation \ref{eq:10.4.3}. We can write this solution in vector form as
so \(x_1=-x_2\). Taking \(x_2=1\) here yields the solution \(y_1=-e^{-t}\), \(y_2=e^{-t}\) of Equation \ref{eq:10.4.3}. We can write this solution in vector form as
In Equation \ref{eq:10.4.6} and Equation \ref{eq:10.4.7} the constant coefficients in the arguments of the exponential functions are the eigenvalues of the coefficient matrix in Equation \ref{eq:10.4.2}, and the vector coefficients of the exponential functions are associated eigenvectors. This illustrates the next theorem.
Theorem 10.4.1
Suppose the \(n\times n\) constant matrix \(A\) has \(n\) real eigenvalues \(\lambda_1,\lambda_2,\ldots,\lambda_n\) (which need not be distinct) with associated linearly independent eigenvectors \({\bf x}_1,{\bf x}_2,\ldots,{\bf x}_n\). Then the functions
Since the columns of the determinant on the right are \({\bf x}_1\), \({\bf x}_2\), …, \({\bf x}_n\), which are assumed to be linearly independent, the determinant is nonzero. Therefore Theorem 10.3.3 implies that \(\{{\bf y}_1,{\bf y}_2,\ldots,{\bf y}_n\}\) is a fundamental set of solutions of \({\bf y}'=A{\bf y}\).
Example 10.4.1
Find the general solution of \[\label{eq:10.4.8} {\bf y}'=\left[\begin{array}{cc}{2}&{4}\\[4pt]{4}&{2}\end{array} \right] {\bf y}. \]
Solve the initial value problem \[\label{eq:10.4.9} {\bf y}'=\left[\begin{array}{cc}{2}&{4}\\[4pt]{4}&{2}\end{array} \right] {\bf y},\quad{\bf y}(0)=\left[\begin{array}{r}5 \\[4pt]-1 \end{array}\right]. \]
Solution a
The characteristic polynomial of the coefficient matrix \(A\) in Equation \ref{eq:10.4.8} is
Find the general solution of \[\label{eq:10.4.12} {\bf y}'=\left[\begin{array}{rrr}3&-1&-1\\[4pt]-2& 3& 2\\[4pt]4&-1&-2\end{array}\right]{\bf y}. \]
Solve the initial value problem \[\label{eq:10.4.13} {\bf y}'=\left[\begin{array}{rrr}3&-1&-1\\[4pt]-2&3& 2\\[4pt]4&-1&-2\end{array} \right]{\bf y},\quad{\bf y}(0)=\left[\begin{array}{r}2\\[4pt] -1\\[4pt]8\end{array}\right]. \]
Solution a
The characteristic polynomial of the coefficient matrix \(A\) in Equation \ref{eq:10.4.12} is
Hence, \(\lambda_1=1\) is an eigenvalue of multiplicity \(1\), while \(\lambda_2=-5\) is an eigenvalue of multiplicity \(2\). Eigenvectors associated with \(\lambda_1=1\) are solutions of the system with augmented matrix
Hence, the components of these eigenvectors need only satisfy the single condition
\[x_1+x_2+x_3=0.\nonumber \]
Since there’s only one equation here, we can choose \(x_2\) and \(x_3\) arbitrarily. We obtain one eigenvector by choosing \(x_2=0\) and \(x_3=1\), and another by choosing \(x_2=1\) and \(x_3=0\). In both cases \(x_1=-1\). Therefore
\[\left[\begin{array}{r}-1\\[4pt]0\\[4pt]1\end{array}\right]\quad \mbox{ and }\quad\left[\begin{array}{r}-1\\[4pt]1\\[4pt]0 \end{array}\right]\nonumber \]
are linearly independent eigenvectors associated with \(\lambda_2= -5\), and the corresponding solutions of Equation \ref{eq:10.4.16} are
\[{\bf y}_2=\left[\begin{array}{r}-1\\[4pt]0\\[4pt]1\end{array} \right]e^{-5t}\quad \mbox{ and }\quad{\bf y}_3=\left[\begin{array}{r}-1\\[4pt]1\\[4pt] 0\end{array}\right]e^{-5t}.\nonumber \]
Because of this and Equation \ref{eq:10.4.17}, Theorem 10.4.1
implies that the general solution of Equation \ref{eq:10.4.16} is
It is convenient to think of a “\(y_1\)-\(y_2\) plane," where a point is identified by rectangular coordinates \((y_1,y_2)\). If \({\bf y}=\left[\begin{array}{c}{y_{1}}\\[4pt]{y_{2}}\end{array} \right]\) is a non-constant solution of Equation \ref{eq:10.4.18}, then the point \((y_1(t),y_2(t))\) moves along a curve \(C\) in the \(y_1\)-\(y_2\) plane as \(t\) varies from \(-\infty\) to \(\infty\). We call \(C\) the trajectory of \({\bf y}\). (We also say that \(C\) is a trajectory of the system Equation \ref{eq:10.4.18}.) I’s important to note that \(C\) is the trajectory of infinitely many solutions of Equation \ref{eq:10.4.18}, since if \(\tau\) is any real number, then \({\bf y}(t-\tau)\) is a solution of Equation \ref{eq:10.4.18}, and \((y_1(t-\tau),y_2(t-\tau))\) also moves along \(C\) as \(t\) varies from \(-\infty\) to \(\infty\). Moreover, Exercise 10.4.28c implies that distinct trajectories of Equation \ref{eq:10.4.18} can’t intersect, and that two solutions \({\bf y}_1\) and \({\bf y}_2\) of Equation \ref{eq:10.4.18} have the same trajectory if and only if \({\bf y}_2(t)={\bf y}_1(t-\tau)\) for some \(\tau\).
From Exercise 10.4.28a, a trajectory of a nontrivial solution of Equation \ref{eq:10.4.18} can’t contain \((0,0)\), which we define to be the trajectory of the trivial solution \({\bf y}\equiv0\). More generally, if \({\bf y}=\left[\begin{array}{c}{k_{1}}\\[4pt]{k_{2}}\end{array} \right] \ne{\bf 0}\) is a constant solution of Equation \ref{eq:10.4.18} (which could occur if zero is an eigenvalue of the matrix of Equation \ref{eq:10.4.18}), we define the trajectory of \({\bf y}\) to be the single point \((k_1,k_2)\).
To be specific, this is the question: What do the trajectories look like, and how are they traversed? In this section we’ll answer this question, assuming that the matrix
of Equation \ref{eq:10.4.18} has real eigenvalues \(\lambda_1\) and \(\lambda_2\) with associated linearly independent eigenvectors \({\bf x}_1\) and \({\bf x}_2\). Then the general solution of Equation \ref{eq:10.4.18} is
We’ll consider other situations in the next two sections.
We leave it to you to classify the trajectories of Equation \ref{eq:10.4.18} if zero is an eigenvalue of \(A\). We’ll confine our attention here to the case where both eigenvalues are nonzero. In this case the simplest situation is where \(\lambda_1=\lambda_2\ne0\), so Equation \ref{eq:10.4.19} becomes
Since \({\bf x}_1\) and \({\bf x}_2\) are linearly independent, an arbitrary vector \({\bf x}\) can be written as \({\bf x}=c_1{\bf x}_1+c_2{\bf x}_2\). Therefore the general solution of Equation \ref{eq:10.4.18} can be written as \({\bf y}={\bf x}e^{\lambda_1 t}\) where \({\bf x}\) is an arbitrary \(2\)-vector, and the trajectories of nontrivial solutions of Equation \ref{eq:10.4.18} are half-lines through (but not including) the origin. The direction of motion is away from the origin if \(\lambda_1>0\) (Figure 10.4.1
), toward it if \(\lambda_1<0\) (Figure 10.4.2
). (In these and the next figures an arrow through a point indicates the direction of motion along the trajectory through the point.)
Now suppose \(\lambda_2>\lambda_1\), and let \(L_1\) and \(L_2\) denote lines through the origin parallel to \({\bf x}_1\) and \({\bf x}_2\), respectively. By a half-line of \(L_1\) (or \(L_2\)), we mean either of the rays obtained by removing the origin from \(L_1\) (or \(L_2\)).
Letting \(c_2=0\) in Equation \ref{eq:10.4.19} yields \({\bf y}=c_1{\bf x}_1e^{\lambda_1 t}\). If \(c_1\ne0\), the trajectory defined by this solution is a half-line of \(L_1\). The direction of motion is away from the origin if \(\lambda_1>0\), toward the origin if \(\lambda_1<0\). Similarly, the trajectory of \({\bf y}=c_2{\bf x}_2e^{\lambda_2 t}\) with \(c_2\ne0\) is a half-line of \(L_2\).
Henceforth, we assume that \(c_1\) and \(c_2\) in Equation \ref{eq:10.4.19} are both nonzero. In this case, the trajectory of Equation \ref{eq:10.4.19} can’t intersect \(L_1\) or \(L_2\), since every point on these lines is on the trajectory of a solution for which either \(c_1=0\) or \(c_2=0\). (Remember: distinct trajectories can’t intersect!). Therefore the trajectory of Equation \ref{eq:10.4.19} must lie entirely in one of the four open sectors bounded by \(L_1\) and \(L_2\), but do not any point on \(L_1\) or \(L_2\). Since the initial point \((y_1(0),y_2(0))\) defined by
Since the right side of Equation \ref{eq:10.4.20} approaches \(c_2{\bf x}_2\) as \(t\to\infty\), the trajectory is asymptotically parallel to \(L_2\) as \(t\to\infty\). Since the right side of Equation \ref{eq:10.4.21} approaches \(c_1{\bf x}_1\) as \(t\to-\infty\), the trajectory is asymptotically parallel to \(L_1\) as \(t\to-\infty\).
The shape and direction of traversal of the trajectory of Equation \ref{eq:10.4.19} depend upon whether \(\lambda_1\) and \(\lambda_2\) are both positive, both negative, or of opposite signs. We’ll now analyze these three cases.
Henceforth \(\|{\bf u}\|\) denote the length of the vector \({\bf u}\).
Case 1: λ₂ > λ₁
Figure 10.4.4
shows some typical trajectories. In this case, \(\lim_{t\to-\infty}\|{\bf y}(t)\|=0\), so the trajectory is not only asymptotically parallel to \(L_1\) as \(t\to-\infty\), but is actually asymptotically tangent to \(L_1\) at the origin. On the other hand, \(\lim_{t\to\infty}\|{\bf y}(t)\|=\infty\) and
so, although the trajectory is asymptotically parallel to \(L_2\) as \(t\to\infty\), it is not asymptotically tangent to \(L_2\). The direction of motion along each trajectory is away from the origin.
Figure 10.4.4
: Two positive eigenvalues; motion away from origin.
Case 2: 0 > λ₂ > λ₁
Figure 10.4.5
shows some typical trajectories. In this case, \(\lim_{t\to\infty}\|{\bf y}(t)\|=0\), so the trajectory is asymptotically tangent to \(L_2\) at the origin as \(t\to\infty\). On the other hand, \(\lim_{t\to-\infty}\|{\bf y}(t)\|=\infty\) and
so, although the trajectory is asymptotically parallel to \(L_1\) as \(t\to-\infty\), it is not asymptotically tangent to it. The direction of motion along each trajectory is toward the origin.
Figure 10.4.5
: Two negative eigenvalues; motion toward the origin.
Case 3: λ₂ > 0 > λ₁
Figure 10.4.6
shows some typical trajectories. In this case,
so the trajectory is asymptotically tangent to \(L_1\) as \(t\to-\infty\). The direction of motion is toward the origin on \(L_1\) and away from the origin on \(L_2\). The direction of motion along any other trajectory is away from \(L_1\), toward \(L_2\).