# 1.9: First Order Linear PDE

- Page ID
- 71365

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)We only considered ODE so far, so let us solve a linear first order PDE. Consider the equation \[a(x,t) \, u_x + b(x,t) \, u_t + c(x,t) \, u = g(x,t), \qquad u(x,0) = f(x) , \qquad -\infty < x < \infty, \quad t > 0 , \nonumber \] where \(u(x,t)\) is a function of \(x\) and \(t\). The *initial condition* \(u(x,0) = f(x)\) is now a function of \(x\) rather than just a number. In these problems, it is useful to think of \(x\) as position and \(t\) as time. The equation describes the evolution of a function of \(x\) as time goes on. Below, the coefficients \(a\), \(b\), \(c\), and the function \(g\) are mostly going to be constant or zero. The method we describe works with nonconstant coefficients, although the computations may get difficult quickly.

This method we use is the . The idea is that we find lines along which the equation is an ODE that we solve. We will see this technique again for second order PDE when we encounter the wave equation in Section 4.8.

Consider the equation \[u_t + \alpha u_x = 0, \qquad u(x,0) = f(x) . \nonumber \] This particular equation, \(u_t + \alpha u_x = 0\), is called the *transport equation*.

The data will propagate along curves called characteristics. The idea is to change to the so-called *characteristic coordinates*. If we change to these coordinates, the equation simplifies. The change of variables for this equation is

\[\xi = x - \alpha t , \qquad s = t . \nonumber \]

Let’s see what the equation becomes. Remember the chain rule in several variables.

\[\begin{align}\begin{aligned} & u_t = u_\xi \xi_t + u_s s_t = - \alpha u_\xi + u_s , \\ & u_x = u_\xi \xi_x + u_s s_x = u_\xi .\end{aligned}\end{align} \nonumber \]

The equation in the coordinates \(\xi\) and \(s\) becomes

\[\underbrace{(- \alpha u_\xi + u_s)}_{u_t} + \alpha \underbrace{(u_\xi)}_{u_x} = 0 , \nonumber \]

or in other words \[u_s = 0 . \nonumber \]

That is trivial to solve. Treating \(\xi\) as simply a parameter, we have obtained the ODE \(\frac{d u}{d s} = 0\).

The solution is a function that does not depend on \(s\) (but it does depend on \(\xi\)). That is, there is some function \(A\) such that \[u = A(\xi) = A(x - \alpha t) . \nonumber \] The initial condition says that: \[f(x) = u(x,0) = A(x - \alpha 0) = A(x) , \nonumber \] so \(A=f\). In other words, \[u(x,t) = f(x-\alpha t) . \nonumber \] Everything is simply moving right at speed \(\alpha\) as \(t\) increases. The curve given by the equation \[\xi = \text{constant} \nonumber \]

is called the characteristic. See Figure \(\PageIndex{1}\). In this case, the solution does not change along the characteristic.

In the \((x,t)\) coordinates, the characteristic curves satisfy \(t = \frac{1}{\alpha} ( x- \xi)\), and are in fact lines. The slope of characteristic lines is \(\frac{1}{\alpha}\), and for each different \(\xi\) we get a different characteristic line.

We see why \(u_t + \alpha u_x = 0\) is called the transport equation: everything travels at some constant speed. Sometimes this is called . An example application is material being moved by a river where the material does not diffuse and is simply carried along. In this setup, \(x\) is the position along the river, \(t\) is the time, and \(u(x,t)\) the concentration the material at position \(x\) and time \(t\). See Figure \(\PageIndex{2}\) for an example.

We use similar idea in the more general case: \[a u_x + b u_t + c u = g, \qquad u(x,0) = f(x) . \nonumber \] We change coordinates to the characteristic coordinates. Let us call these coordinates \((\xi,s)\). These are coordinates where \(a u_x + b u_t\) becomes differentiation in the \(s\) variable.

Along the characteristic curves (where \(\xi\) is constant), we get a new ODE in the \(s\) variable. In the transport equation, we got the simple \(\frac{du}{ds} = 0\). In general, we get the linear equation \[\label{eq:fopde:charode} \frac{du}{ds} + c u = g. \]

We think of everything as a function of \(\xi\) and \(s\), although we are thinking of \(\xi\) as a parameter rather than an independent variable. So the equation is an ODE. It is a linear ODE that we can solve using the integrating factor.

To find the characteristics, think of a curve given parametrically \(\bigl(x(s),t(s)\bigr)\). We try to have the curve satisfy \[\frac{dx}{ds} = a, \qquad \frac{dt}{ds} = b . \nonumber \] Why? Because when we think of \(x\) and \(t\) as functions of \(s\) we find, using the chain rule, \[\frac{du}{ds} + c u = \underbrace{\left( u_x \frac{dx}{ds} + u_t \frac{dt}{ds}\right)}_{\frac{du}{ds}} + c u = a u_x + b u_t + c u = g . \nonumber \]

So we get the ODE \(\eqref{eq:fopde:charode}\), which then describes the value of the solution \(u\) of the PDE along this characteristic curve. It is also convenient to make sure that \(s=0\) corresponds to \(t=0\), that is \(t(0) = 0\). It will be convenient also for \(x(0) = \xi\). See Figure \(\PageIndex{3}\).

Consider \[u_x + u_t + u = x, \qquad u(x,0) = e^{-x^2} . \nonumber \] We find the characteristics, that is, the curves given by \[\frac{dx}{ds} = 1, \qquad \frac{dt}{ds} = 1 . \nonumber \] So \[x = s + c_1, \qquad t = s+ c_2 , \nonumber \] for some \(c_1\) and \(c_2\). At \(s=0\) we want \(t=0\), and \(x\) should be \(\xi\). So we let \(c_1 = \xi\) and \(c_2 = 0\): \[x = s + \xi, \qquad t = s . \nonumber \]

The ODE is \(\frac{du}{ds} + u = x\), and \(x = s+\xi\). So, the ODE to solve along the characteristic is \[\frac{du}{ds} + u = s+ \xi . \nonumber \] The general solution of this equation, treating \(\xi\) as a parameter, is \(u = C e^{-s}+s+\xi-1\), for some constant \(C\). At \(s=0\), our initial condition is that \(u\) is \(e^{-\xi^2}\), since at \(s=0\) we have \(x=\xi\). Given this initial condition, we find \(C=e^{-\xi^2} - \xi +1\). So,

\[\begin{align}\begin{aligned} u & = \bigl(e^{-\xi^2} - \xi +1\bigr) e^{-s}+s+\xi-1 \\ & = e^{-\xi^2-s} + (1 - \xi) e^{-s} +s+\xi-1 . \end{aligned}\end{align} \nonumber \]

Substitute \(\xi = x-t\) and \(s=t\) to find \(u\) in terms of \(x\) and \(t\): \[\begin{align}\begin{aligned} u & = e^{-\xi^2-s} + (1 - \xi) e^{-s} +s+\xi-1 \\ & = e^{-{(x-t)}^2-t} + (1 - x + t) e^{-t} +x-1 . \end{aligned}\end{align} \nonumber \] See Figure \(\PageIndex{4}\) for a plot of \(u(x,t)\) as a function of two variables.

When the coefficients are not constants, the characteristic curves are not going to be straight lines anymore.

Consider the following variable coefficient equation: \[x u_x + u_t + 2 u = 0, \qquad u(x,0) = \cos(x) . % , \qquad -\infty < x < \infty, %\quad t > 0 . \nonumber \]

We find the characteristics, that is, the curves given by \[\frac{dx}{ds} = x, \qquad \frac{dt}{ds} = 1 . \nonumber \] So \[x = c_1 e^{s} , \qquad t = s+ c_2 . \nonumber \]

At \(s=0\), we wish to get the line \(t=0\), and \(x\) should be \(\xi\). So \[x = \xi e^s, \qquad t = s . \nonumber \]

OK, the ODE we need to solve is \[\frac{du}{ds} + 2 u = 0 . \nonumber \]

This is for a fixed \(\xi\). At \(s=0\), we should get that \(u\) is \(\cos(\xi)\), so that is our initial condition. Consequently, \[u = e^{-2s} \cos(\xi)= e^{-2t} \cos(xe^{-t}) . \nonumber \]

We make a few closing remarks. One thing to keep in mind is that we would get into trouble if the coefficient in front of \(u_t\), that is the \(b\), is ever zero. Let us consider a quick example of what can go wrong: \[u_x + u = 0, \qquad u(x,0) = \sin(x). \nonumber \] This problem has no solution. If we had a solution, it would imply that \(u_x(x,0) = \cos(x)\), but \(u_x(x,0) + u(x,0) = \cos(x) + \sin(x) \not= 0\). The problem is that the characteristic curve is now the line \(t=0\), and the solution is already provided on that line!

As long as \(b\) is nonzero, it is convenient to ensure that \(b\) is positive by multiplying by \(-1\) if necessary, so that positive \(s\) means positive \(t\).

Another remark is that if \(a\) or \(b\) in the equation are variable, the computations can quickly get out of hand, as the expressions for the characteristic coordinates become messy and then solving the ODE becomes even messier. In the examples above, \(b\) was always \(1\), meaning we got \(s=t\) in the characteristic coordinates. If \(b\) is not constant, your expression for \(s\) will be more complicated.

Finding the characteristic coordinates is really a system of ODE in general if \(a\) depends on \(t\) or if \(b\) depends on \(x\). In that case, we would need techniques of systems of ODE to solve, see Chapter 3 or Chapter 8. In general, if \(a\) and \(b\) are not linear functions or constants, finding closed form expressions for the characteristic coordinates may be impossible.

Finally, the method of characteristics applies to nonlinear first order PDE as well. In the nonlinear case, the characteristics depend not only on the differential equation, but also on the initial data. This leads to not only more difficult computations, but also the formation of singularities where the solution breaks down at a certain point in time. An example application where first order nonlinear PDE come up is traffic flow theory, and you have probably experienced the formation of singularities: traffic jams. But we digress.