Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Mathematics LibreTexts

13.2: Sturm-Liouville Problems

( \newcommand{\kernel}{\mathrm{null}\,}\)

1 newcommand dy , mathrmdy newcommand dx , mathrmdx newcommand dyx , frac mathrmdy mathrmdx newcommand ds , mathrmds newcommand dt , mathrmdt newcommand\dst , frac mathrmds mathrmdt

In this section we consider eigenvalue problems of the form

(13.2.1)P0(x)y+P1(x)y+P2(x)y+λR(x)y=0,B1(y)=0,B2(y)=0,

where

B1(y)=αy(a)+βy(a)andB2(y)=ρy(b)+δy(b).

As in Section 13.1, α, β, ρ, and δ are real numbers, with

α2+β2>0andρ2+δ2>0,

P0, P1, P2, and R are continuous, and P0 and R are positive on [a,b].

We say that λ is an eigenvalue of Equation 13.2.1 if Equation 13.2.1 has a nontrivial solution y. In this case, y is an eigenfunction associated with λ, or a λ-eigenfunction. Solving the eigenvalue problem means finding all eigenvalues and associated eigenfunctions of Equation 13.2.1.

Example 13.2.1

Solve the eigenvalue problem

(13.2.2)y+3y+2y+λy=0,y(0)=0,y(1)=0.

Solution

The characteristic equation of Equation 13.2.2 is

r2+3r+2+λ=0,

with zeros

r1=3+14λ2andr2=314λ2.

If λ<1/4 then r1 and r2 are real and distinct, so the general solution of the differential equation in Equation 13.2.2 is

y=c1er1t+c2er2t.

The boundary conditions require that

c1er1+c2er2=0c1er1+c2er2=0.

Since the determinant of this system is er2er10, the system has only the trivial solution. Therefore λ isn’t an eigenvalue of Equation 13.2.2.

If λ=1/4 then r1=r2=3/2, so the general solution of Equation 13.2.2 is

y=e3x/2(c1+c2x).

The boundary condition y(0)=0 requires that c1=0, so y=c2xe3x/2 and the boundary condition y(0) requires that c2=0. Therefore λ=1/4 isn’t an eigenvalue of Equation 13.2.2.

If λ>1/4 then

r1=32+iωandr2=32iω,

with

(13.2.3)ω=4λ12or equivalentlyλ=1+4ω24.

In this case the general solution of the differential equation in Equation 13.2.2 is

y=e3x/2(c1cosωx+c2sinωx).

The boundary condition y(0)=0 requires that c1=0, so y=c2e3x/2sinωx, which holds with c20 if and only if ω=nπ, where n is an integer. We may assume that n is a positive integer. (Why?). From Equation 13.2.3, the eigenvalues are λn=(1+4n2π2)/4, with associated eigenfunctions

yn=e3x/2sinnπx,n=1,2,3,.

Example 13.2.2

Solve the eigenvalue problem

(13.2.4)x2y+xy+λy=0,y(1)=0,y(2)=0.

Solution

If λ=0, the differential equation in Equation 13.2.4 reduces to x(xy)=0, so xy=c1,

y=c1x,andy=c1lnx+c2.

The boundary condition y(1)=0 requires that c2=0, so y=c1lnx. The boundary condition y(2)=0 requires that c1ln2=0, so c1=0. Therefore zero isn’t an eigenvalue of Equation 13.2.4.

If λ<0, we write λ=k2 with k>0, so Equation 13.2.4 becomes

x2y+xyk2y=0,

an Euler equation (Section 7.4) with indicial equation

r2k2=(rk)(r+k)=0.

Therefore

y=c1xk+c2xk.

The boundary conditions require that

2kc1+2kc2=02kc1+2kc2=0.

Since the determinant of this system is 2k2k0, c1=c2=0. Therefore Equation 13.2.4 has no negative eigenvalues.

If λ>0 we write λ=k2 with k>0. Then Equation 13.2.4 becomes

x2y+xy+k2y=0,

an Euler equation with indicial equation

r2+k2=(rik)(r+ik)=0,

so

y=c1cos(klnx)+c2sin(klnx).

The boundary condition y(1)=0 requires that c1=0. Therefore y=c2sin(klnx). This holds with c20 if and only if k=nπ/ln2, where n is a positive integer. Hence, the eigenvalues of Equation 13.2.4 are λn=(nπ/ln2)2, with associated eigenfunctions

yn=sin(nπln2lnx),n=1,2,3,.

For theoretical purposes, it is useful to rewrite the differential equation in Equation 13.2.1 in a different form, provided by the next theorem.

Theorem 13.2.1

If P0, P1, P2, and R are continuous and P0 and R are positive on a closed interval [a,b], then the equation

(13.2.5)P0(x)y+P1(x)y+P2(x)y+λR(x)y=0

can be rewritten as

(13.2.6)(p(x)y)+q(x)y+λr(x)y=0,

where p, p, q and r are continuous and p and r are positive on [a,b].

Proof

We begin by rewriting Equation 13.2.5 as

(13.2.7)y+u(x)y+v(x)y+λR1(x)y=0,

with u=P1/P0, v=P2/P0, and R1=R/P0. (Note that R1 is positive on [a,b].) Now let p(x)=eU(x), where U is any antiderivative of u. Then p is positive on [a,b] and, since U=u,

(13.2.8)p(x)=p(x)u(x)

is continuous on [a,b]. Multiplying Equation 13.2.7 by p(x) yields

(13.2.9)p(x)y+p(x)u(x)y+p(x)v(x)y+λp(x)R1(x)y=0.

Since p is positive on [a,b], this equation has the same solutions as Equation 13.2.5. From Equation 13.2.8,

(p(x)y)=p(x)y+p(x)y=p(x)y+p(x)u(x)y,

so Equation 13.2.9 can be rewritten as in Equation 13.2.6, with q(x)=p(x)v(x) and r(x)=p(x)R1(x). This completes the proof.

It is to be understood throughout the rest of this section that p, q, and r have the properties stated in Theorem 13.2.1 . Moreover, whenever we write Ly in a general statement, we mean

Ly=(p(x)y)+q(x)y.

The differential equation Equation 13.2.6 is called a Sturm-Liouville equation, and the eigenvalue problem

(13.2.10)(p(x)y)+q(x)y+λr(x)y=0,B1(y)=0,B2(y)=0,

which is equivalent to Equation 13.2.1, is called a Sturm-Liouville problem.

Example 13.2.3

Rewrite the eigenvalue problem

(13.2.11)y+3y+(2+λ)y=0,y(0)=0,y(1)=0

of Theorem 13.2.1 as a Sturm-Liouville problem.

Solution

Comparing Equation 13.2.11 to Equation 13.2.7 shows that u(x)=3, so we take U(x)=3x and p(x)=e3x. Multiplying the differential equation in Equation 13.2.11 by e3x yields

e3x(y+3y)+2e3xy+λe3xy=0.

Since

e3x(y+3y)=(e3xy),

Equation 13.2.11 is equivalent to the Sturm–Liouville problem

(13.2.12)(e3xy)+2e3xy+λe3xy=0,y(0)=0,y(1)=0.

Example 13.2.4

Rewrite the eigenvalue problem

(13.2.13)x2y+xy+λy=0,y(1)=0,y(2)=0

of Theorem 13.2.2 as a Sturm-Liouville problem.

Solution

Dividing the differential equation in Equation 13.2.13 by x2 yields

y+1xy+λx2y=0.

Comparing this to Equation 13.2.7 shows that u(x)=1/x, so we take U(x)=lnx and p(x)=elnx=x. Multiplying the differntial equation by x yields

xy+y+λxy=0.

Since

xy+y=(xy),

Equation 13.2.13 is equivalent to the Sturm–Liouville problem

(13.2.14)(xy)+λxy=0,y(1)=0,y(2)=0.

Problems 1–4 of Section 11.1 are Sturm–Liouville problems. (Problem 5 isn’t, although some authors use a definition of Sturm-Liouville problem that does include it.) We were able to find the eigenvalues of Problems 1-4 explicitly because in each problem the coefficients in the boundary conditions satisfy αβ=0 and ρδ=0; that is, each boundary condition involves either y or y, but not both. If this isn’t true then the eigenvalues can’t in general be expressed exactly by simple formulas; rather, approximate values must be obtained by numerical solution of equations derived by requiring the determinants of certain 2×2 systems of homogeneous equations to be zero. To apply the numerical methods effectively, graphical methods must be used to determine approximate locations of the zeros of these determinants. Then the zeros can be computed accurately by numerical methods.

Example 13.2.5

Solve the Sturm–Liouville problem

(13.2.15)y+λy=0,y(0)+y(0)=0,y(1)+3y(1)=0.

Solution

If λ=0, the differential equation in Equation 13.2.15 reduces to y=0, with general solution y=c1+c2x. The boundary conditions require that

c1+4c2=0c1+4c2=0,

so c1=c2=0. Therefore zero isn’t an eigenvalue of Equation 13.2.15.

If λ<0, we write λ=k2 where k>0, and the differential equation in Equation 13.2.15 becomes yk2y=0, with general solution

(13.2.16)y=c1coshkx+c2sinhkx,

so

y=k(c1sinhkx+c2coshkx).

The boundary conditions require that

(13.2.17)c1+kc2=0(coshk+3ksinhk)c1+(sinhk+3kcoshk)c2=0

The determinant of this system is

DN(k)=|1kcoshk+3ksinhksinhk+3kcoshk|=(13k2)sinhk+2kcoshk.

Therefore the system Equation 13.2.17 has a nontrivial solution if and only if DN(k)=0 or, equivalently,

(13.2.18)tanhk=2k13k2.

The graph of the right side (Figure 13.2.1 ) has a vertical asymptote at k=1/3. Since the two sides have different signs if k<1/3, this equation has no solution in (0,1/3). Figure 13.2.1 shows the graphs of the two sides of Equation 13.2.18 on an interval to the right of the vertical asymptote, which is indicated by the dashed line. You can see that the two curves intersect near k0=1.2, Given this estmate, you can use Newton’s to compute k0 more accurately. We computed k01.1219395. Therefore k021.2587483 is an eigenvalue of Equation 13.2.15. From Equation 13.2.16 and the first equation in Equation 13.2.17,

y0=k0coshk0xsinhk0x.

fig130201.svg
Figure 13.2.1 : u=tanhk and u=2k/(13k2)

If λ>0 we write λ=k2 where k>0, and differential equation in Equation 13.2.15 becomes y+k2y=0, with general solution

(13.2.19)y=coskx+c2sinkx,

so

y=k(c1sinkx+c2coskx).

The boundary conditions require that

(13.2.20)c1+kc2=0(cosk3ksink)c1+(sink+3kcosk)c2=0.

The determinant of this system is

DP(k)=|1kcosk3ksinksink+3kcosk|=(1+3k2)sink+2kcosk.

The system Equation 13.2.20 has a nontrivial solution if and only if DP(k)=0 or, equivalently,

tank=2k1+3k2.

Figure 13.2.2 shows the graphs of the two sides of this equation. You can see from the figure that the graphs intersect at infinitely many points knnπ ( n=1, 2, 3,…), where the error in this approximation approaches zero as n. Given this estimate, you can use Newton’s method to compute kn more accurately. We computed

k112.9256856,k216.1765914,k319.3538959,k412.5132570.

The estimates of the corresponding eigenvalues λn=kn2 are

λ1158.5596361,λ2538.1502809,λ3587.4953676,λ4156.5815998.

From Equation 13.2.19 and the first equation in Equation 13.2.20,

yn=kncosknxsinknx

is an eigenfunction associated with λn

fig130202.svg
Figure 13.2.2 : u=tank and u=2k/(1+k)

Since the differential equations in Equation 13.2.12 and Equation 13.2.14 are more complicated than those in Equation 13.2.11 and Equation 13.2.13 respectively, what is the point of Theorem 13.2.1 ? The point is this: to solve a specific problem, it may be better to deal with it directly, as we did in Examples 13.2.1 and 13.2.2 ; however, we’ll see that transforming the general eigenvalue problem Equation 13.2.1 to the Sturm–Liouville problem Equation 13.2.10 leads to results applicable to all eigenvalue problems of the form Equation 13.2.1.

Theorem 13.2.2

If

Ly=(p(x)y)+q(x)y

and u and v are twice continuously functions on [a,b] that satisfy the boundary conditions B1(y)=0 and B2(y)=0, then

(13.2.21)ab[u(x)Lv(x)v(x)Lu(x)]dx=0.

Proof

Integration by parts yields

ab[u(x)Lv(x)v(x)Lu(x)]dx=ab[u(x)(p(x)v(x))v(x)(p(x)u(x))]dx=p(x)[u(x)v(x)u(x)v(x)]|ababp(x)[u(x)v(x)u(x)v(x)]dx.

Since the last integral equals zero,

(13.2.22)ab[u(x)Lv(x)v(x)Lu(x)]dx=p(x)[u(x)v(x)u(x)v(x)]|ab.

By assumption, B1(u)=B1(v)=0 and B2(u)=B2(v)=0. Therefore

αu(a)+βu(a)=0αv(a)+βv(a)=0andρu(b)+δu(b)=0.ρv(b)+δv(b)=0.

Since α2+β2>0 and ρ2+δ2>0, the determinants of these two systems must both be zero; that is,

u(a)v(a)u(a)v(a)=u(b)v(b)u(b)v(b)=0.

This and Equation 13.2.22 imply Equation 13.2.21, which completes the proof.

The next theorem shows that a Sturm–Liouville problem has no complex eigenvalues.

Theorem 13.2.3

If λ=p+qi with q0 then the boundary value problem

Ly+λr(x)y=0,B1(y)=0,B2(y)=0

has only the trivial solution.

Proof

For this theorem to make sense, we must consider complex-valued solutions of

(13.2.23)Ly+(p+iq)r(x,y)y=0.

If y=u+iv where u and v are real-valued and twice differentiable, we define y=u+iv and y=u+iv. We say that y is a solution of Equation 13.2.23 if the real and imaginary parts of the left side of Equation 13.2.23 are both zero. Since Ly=(p(x)y)+q(x)y and p, q, and r are real-valued,

Ly+λr(x)y=L(u+iv)+(p+iq)r(x)(u+iv)=Lu+r(x)(puqv)+i[Lv+r(x)(pu+qv)],

so Ly+λr(x)y=0 if and only if

Lu+r(x)(puqv)=0Lv+r(x)(qu+pv)=0.

Multiplying the first equation by v and the second by u yields

vLu+r(x)(puvqv2)=0uLv+r(x)(qu2+puv)=0.

Subtracting the first equation from the second yields

uLvvLu+qr(x)(u2+v2)=0,

so

(13.2.24)ab[u(x)Lv(x)v(x)Lu(x)]dx+abr(x)[u2(x)+v2(x)]dx=0.

Since

B1(y)=B1(u+iv)=B1(u)+iB1(v)

and

B2(y)=B2(u+iv)=B2(u)+iB2(v),

B1(y)=0 and B2(y)=0 implies that

B1(u)=B2(u)=B1(v)=B2(v)=0.

Therefore Theorem 13.2.2 implies that first integral in Equation 13.2.24 equals zero, so Equation 13.2.24 reduces to

qabr(x)[u2(x)+v2(x)]dx=0.

Since r is positive on [a,b] and q0 by assumption, this implies that u0 and v0 on [a,b]. Therefore y0 on [a,b], which completes the proof.

Theorem 13.2.4

If λ1 and λ2 are distinct eigenvalues of the Sturm–Liouville problem

(13.2.25)Ly+λr(x)y=0,B1(y)=0,B2(y)=0

with associated eigenfunctions u and v respectively, then

(13.2.26)abr(x)u(x)v(x)dx=0.

Proof

Since u and v satisfy the boundary conditions in Equation 13.2.25, Theorem 13.2.2 implies that

ab[u(x)Lv(x)v(x)Lu(x)]dx=0.

Since Lu=λ1ru and Lv=λ2rv, this implies that

(λ1λ2)abr(x)u(x)v(x)dx=0.

Since λ1λ2, this implies Equation 13.2.26, which completes the proof.

If u and v are any integrable functions on [a,b] and

abr(x)u(x)v(x)dx=0,

we say that u and v orthogonal on [a,b] with respect to r=r(x).

Theorem 13.1.1 implies the next theorem.

Theorem 13.2.5

If u0 and v both satisfy

Ly+λr(x)y=0,B1(y)=0,B2(y)=0,

then v=cu for some constant c.

We’ve now proved parts of the next theorem. A complete proof is beyond the scope of this book.

Theorem 13.2.6

The set of all eigenvalues of the Sturm–Liouville problem

Ly+λr(x)y=0,B1(y)=0,B2(y)=0

can be ordered as

λ1<λ2<<λn<,

and

limnλn=.

For each n, if yn is an arbitrary λn-eigenfunction, then every λn-eigenfunction is a constant multiple of yn. If mn, ym and yn are orthogonal [a,b] with respect to r=r(x); that is,

(13.2.27)abr(x)ym(x)yn(x)dx=0.

You may want to verify Equation 13.2.27 for the eigenfunctions obtained in Examples 13.2.1 and 13.2.2 .

In conclusion, we mention the next theorem. The proof is beyond the scope of this book.

Theorem 13.2.7

Let λ1<λ2<<λn< be the eigenvalues of the Sturm–Liouville problem

Ly+λr(x)y=0,B1(y)=0,B2(y)=0,

with associated eigenvectors y1, y2, …, yn,. Suppose f is piecewise smooth (Definition 11.2.3) on [a,b]. For each n, let

cn=abr(x)f(x)yn(x)dxabr(x)yn2(x)dx.

Then

f(x)+f(x+)2=n=1cnyn(x)

for all x in the open interval (a,b).


This page titled 13.2: Sturm-Liouville Problems is shared under a CC BY-NC-SA 3.0 license and was authored, remixed, and/or curated by William F. Trench via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?