B.2: The Complex Exponential
( \newcommand{\kernel}{\mathrm{null}\,}\)
Definition and Basic Properties
There are two equivalent standard definitions of the exponential, ez, of the complex number z=x+iy. For the more intuitive definition, one simply replaces the real number x in the Taylor series expansion 1 ex=∑∞n=0xnn! with the complex number z, giving
ez=∞∑n=0znn!
We instead highlight the more computationally useful definition.
For any complex number z=x+iy, with x and y real, the exponential ez, is defined by
ex+iy = excosy+iexsiny
In particular 2, eiy = cosy+isiny.
We will not fully prove that the intuitive definition (EZ) and the computational Definition B.2.1 are equivalent. But we will do so in the special case that z=iy, with y real. Under (EZ),
eiy=1+iy+(iy)22!+(iy)33!+(iy)44!+(iy)55!+(iy)66!+⋯
The even terms in this expansion are
1+(iy)22!+(iy)44!+(iy)66!+⋯=1−y22!+y44!−y66!+⋯=cosy
and the odd terms in this expansion are
iy+(iy)33!+(iy)55!+⋯=i(y−y33!+y55!+⋯)=isiny
Adding the even and odd terms together gives us that, under (EZ), eiy is indeed equal to cosy+isiny. 3 It is obvious that, in the special case that z=x with x real, the definitions (EZ) and B.2.1 are equivalent. So to complete the proof of equivalence in the general case z=x+iy, it suffices to prove that ex+iy=exeiy under both (EZ) and Definition B.2.1. For Definition B.2.1, this follows from Lemma B.2.2, below. As a consequence, we have
eiπ=−1
which gives an amazing linking between calculus (e), geometry (π), algebra (i) and the basic number −1.
In the next lemma we verify that the complex exponential obeys a couple of familiar computational properties.
- For any complex numbers z1 and z2,
ez1+z2=ez1ez2
- For any complex number c,
ddtect=cect
- Proof
-
- For any two complex numbers z1=x1+iy1 and z2=x2+iy2, with x1, y1, x2, y2 real,
ez1ez2=ex1(cosy1+isiny1)ex2(cosy2+isiny2)=ex1+x2(cosy1+isiny1)(cosy2+isiny2)=ex1+x2{(cosy1cosy2−siny1siny2)+i(cosy1siny2+cosy2siny1)}=ex1+x2{cos(y1+y2)+isin(y1+y2)}
by the trig identities of Appendix A.8. This says exactly thatez1ez2=e(x1+x2)+i(y1+y2)=ez1+z2
and that the familiar multiplication formula also applies to complex exponentials. - For any real number t and any complex number c=α+iβ, with α, β real,
ect=eαt+iβt=eαt[cos(βt)+isin(βt)]
so that the derivative with respect to tddtect=αeαt[cos(βt)+isin(βt)]+eαt[−βsin(βt)+iβcos(βt)]=(α+iβ)eαt[cos(βt)+isin(βt)]=cect
is also the familiar one.
- For any two complex numbers z1=x1+iy1 and z2=x2+iy2, with x1, y1, x2, y2 real,
Relationship with sin and cos
When θ is a real number
eiθ=cosθ+isinθe−iθ=cosθ−isinθ=¯eiθ
are complex numbers of modulus one.
Solving for cosθ and sinθ (by adding and subtracting the two equations) gives
These formulae make it easy derive trig identities. For example,
cosθcosϕ=14(eiθ+e−iθ)(eiϕ+e−iϕ)=14(ei(θ+ϕ)+ei(θ−ϕ)+ei(−θ+ϕ)+e−i(θ+ϕ))=14(ei(θ+ϕ)+e−i(θ+ϕ)+ei(θ−ϕ)+ei(−θ+ϕ))=12(cos(θ+ϕ)+cos(θ−ϕ))
and, using (a+b)3=a3+3a2b+3ab2+b3,
sin3θ=−18i(eiθ−e−iθ)3=−18i(ei3θ−3eiθ+3e−iθ−e−i3θ)=3412i(eiθ−e−iθ)−1412i(ei3θ−e−i3θ)=34sinθ−14sin(3θ)
and
cos(2θ)=ℜ(e2θi)=ℜ(eiθ)2=ℜ(cosθ+isinθ)2=ℜ(cos2θ+2isinθcosθ−sin2θ)=cos2θ−sin2θ
Polar Coordinates
Let z=x+iy be any complex number. Writing x and y in polar coordinates in the usual way, i.e. x=rcos(θ), y=rsin(θ), gives
x+iy=rcosθ+irsinθ=reiθ
See the figure on the left below. In particular
1= ei0=e2πi=e2kπifor k=0,±1,±2,⋯−1= eiπ=e3πi=e(1+2k)πifor k=0,±1,±2,⋯i= eiπ/2=e52πi=e(12+2k)πifor k=0,±1,±2,⋯−i=e−iπ/2=e32πi=e(−12+2k)πifor k=0,±1,±2,⋯
See the figure on the right below.
The polar coordinate θ=arctanyx associated with the complex number z=x+iy, i.e. the point (x,y) in the xy-plane, is also called the argument of z.
The polar coordinate representation makes it easy to find square roots, third roots and so on. Fix any positive integer n. The nth roots of unity are, by definition, all solutions z of
zn = 1
Writing z=reiθ
rnenθi = 1e0i
The polar coordinates (r,θ) and (r′,θ′) represent the same point in the xy-plane if and only if r=r′ and θ=θ′+2kπ for some integer k. So zn=1 if and only if rn=1, i.e. r=1, and nθ=2kπ for some integer k. The nth roots of unity are all the complex numbers e2πikn with k integer. There are precisely n distinct nth roots of unity because e2πikn=e2πik′n if and only if 2πkn−2πk′n=2πk−k′n is an integer multiple of 2π. That is, if and only if k−k′ is an integer multiple of n. The n distinct nth roots of unity are
1 , e2πi1n , e2πi2n , e2πi3n , ⋯ , e2πin−1n
For example, the 6th roots of unity are depicted below.
Exploiting Complex Exponentials in Calculus Computations
You have learned how to evaluate integrals involving trigonometric functions by using integration by parts, various trigonometric identities and various substitutions. It is often much easier to just use (B.2.3) and (B.2.4). Part of the utility of complex numbers comes from how well they interact with calculus through the exponential function. Here are two examples.
∫excosx dx=12∫ex[eix+e−ix] dx=12∫[e(1+i)x+e(1−i)x] dx=12[11+ie(1+i)x+11−ie(1−i)x]+C
This form of the indefinite integral looks a little weird because of the i's. While it looks complex because of the i's, it is actually purely real (and correct), because 11−ie(1−i)x is the complex conjugate of 11+ie(1+i)x. We can convert the indefinite integral into a more familiar form just by subbing back in e±ix=cosx±isinx, 11+i=1−i(1+i)(1−i)=1−i2 and 11−i=¯11+i=1+i2.
∫excosx dx=12ex[11+ieix+11−ie−ix]+C=12ex[1−i2(cosx+isinx)+1+i2(cosx−isinx)]+C=12excosx+12exsinx+C
You can quickly verify this by differentiating (or by comparing with Example 1.7.11).
Evaluating the integral ∫cosnx dx using the methods of Section 1.8 can be a real pain. It is much easier if we convert to complex exponentials. Using (a+b)4=a4+4a3b+6a2b2+4ab3+b4,
∫cos4x dx=124∫[eix+e−ix]4 dx=124∫[e4ix+4e2ix+6+4e−2ix+e−4ix] dx=124[14ie4ix+42ie2ix+6x+4−2ie−2ix+1−4ie−4ix]+C=124[1212i(e4ix−e−4ix)+42i(e2ix−e−2ix)+6x]+C=124[12sin4x+4sin2x+6x]+C=132sin4x+14sin2x+38x+C
Exploiting Complex Exponentials in Differential Equation Computations
Complex exponentials are also widely used to simplify the process of guessing solutions to ordinary differential equations. We'll start with (possibly a review of) some basic definitions and facts about differential equations.
- A differential equation is an equation for an unknown function that contains the derivatives of that unknown function. For example y″(t)+y(t)=0 is a differential equation for the unknown function y(t).
- In the differential calculus text CLP-1, we treated only derivatives of functions of one variable. Such derivatives are called ordinary derivatives. A differential equation is called an ordinary differential equation (often shortened to “ODE”) if only ordinary derivatives appear. That is, if the unknown function has only a single independent variable.
In CLP-3 we will treat derivatives of functions of more than one variable. For example, let f(x,y) be a function of two variables. If you treat y as a constant and take the derivative of the resulting function of the single variable x, the result is called the partial derivative of f with respect to x. A differential equation is called a partial differential equation (often shortened to “PDE”) if partial derivatives appear. That is, if the unknown function has more than one independent variable. For example y″(t)+y(t)=0 is an ODE while ∂2u∂t2(x,t)=c2∂2u∂x2(x,t) is a PDE.
- The order of a differential equation is the order of the highest derivative that appears. For example y″(t)+y(t)=0 is a second order ODE.
- An ordinary differential equation that is of the form
a0(t)y(n)(t)+a1(t)y(n−1)(t)+⋯+an(t)y(t)=F(t)
with given coefficient functions a0(t), ⋯, an(t) and F(t) is said to be linear. Otherwise, the ODE is said to be nonlinear. For example, y′(t)2+y(t)=0, y′(t)y″(t)+y(t)=0 and y′(t)=ey(t) are all nonlinear.
- The ODE (B.2.1) is said to have constant coefficients if the coefficients a0(t), a1(t), ⋯, an(t) are all constants. Otherwise, it is said to have variable coefficients. For example, the ODE y″(t)+7y(t)=sint is constant coefficient, while y″(t)+ty(t)=sint is variable coefficient.
- The ODE (B.2.1) is said to be homogeneous if F(t) is identically zero. Otherwise, it is said to be inhomogeneous or nonhomogeneous. For example, the ODE y″(t)+7y(t)=0 is homogeneous, while y″(t)+7y(t)=sint is inhomogeneous. A homogeneous ODE always has the trivial solution y(t)=0.
- An initial value problem is a problem in which one is to find an unknown function y(t) that satisfies both a given ODE and given initial conditions, like y(t0)=1, y′(t0)=0. Note that all of the conditions involve the function y(t) (or its derivatives) evaluated at a single time t=t0.
- A boundary value problem is a problem in which one is to find an unknown function y(t) that satisfies both a given ODE and given boundary conditions, like y(t0)=0, y(t1)=0. Note that the conditions involve the function y(t) (or its derivatives) evaluated at two different times.
The following theorem gives the form of solutions to the linear 4 ODE (B.2.1).
Assume that the coefficients a0(t), a1(t), ⋯, an−1(t), an(t) and F(t) are continuous functions and that a0(t) is not zero.
- The general solution to the linear ODE (B.2.1) is of the form
y(t)=yp(t)+C1y1(t)+C2y2(t)+⋯+Cnyn(t)
where
- n is the order of (B.2.1)
- yp(t) is any solution to (B.2.1)
- C1, C2, ⋯, Cn are arbitrary constants
- y1, y2, ⋯, yn are n independent solutions to the homogenous equation
a0(t)y(n)(t)+a1(t)y(n−1)(t)+⋯+an−1(t)y′(t)+an(t)y(t)=0
associated to (B.2.1). “Independent” just means that no yi can be written as a linear combination of the other yj's. For example, y1(t) cannot be expressed in the form b2y2(t)+⋯+bnyn(t).
In (B.2.2), yp is called the “particular solution” and C1y1(t)+C2y2(t)+⋯+Cnyn(t) is called the “complementary solution”.
- Given any constants b0, ⋯, bn−1 there is exactly one function y(t) that obeys the ODE (B.2.1) and the initial conditions
y(0)=b0y′(0)=b1⋯y(n−1)(0)=bn−1
In the following example we'll derive one widely used linear constant coefficient ODE.
As an example of how ODE's arise, we consider the RLC circuit, which is the electrical circuit consisting of a resistor of resistance R, a coil (or solenoid) of inductance L, a capacitor of capacitance C and a voltage source arranged in series, as shown below. Here R, L and C are all nonnegative constants.
We're going to think of the voltage x(t) as an input signal, and the voltage y(t) as an output signal. The goal is to determine the output signal produced by a given input signal. If i(t) is the current flowing at time t in the loop as shown and q(t) is the charge on the capacitor, then the voltages across R, L and C, respectively, at time t are Ri(t), Ldidt(t) and y(t)=q(t)C. By the Kirchhoff's law 5 that says that the voltage between any two points has to be independent of the path used to travel between the two points, these three voltages must add up to x(t) so that
Ri(t)+Ldidt(t)+q(t)C=x(t)
Assuming that R, L, C and x(t) are known, this is still one differential equation in two unknowns, the current i(t) and the charge q(t). Fortunately, there is a relationship between the two. Because the current entering the capacitor is the rate of change of the charge on the capacitor
i(t)=dqdt(t)=Cy′(t)
This just says that the capacitor cannot create or destroy charge on its own; all charging of the capacitor must come from the current. Substituting (B.2.4) into (B.2.3) gives
LCy″(t)+RCy′(t)+y(t)=x(t)
which is a second order linear constant coefficient ODE. As a concrete example, we'll take an ac voltage source and choose the origin of time so that x(0)=0, x(t)=E0sin(ωt). Then the differential equation becomes
LCy″(t)+RCy′(t)+y(t)=E0sin(ωt)
Finally, here are two examples in which we use complex exponentials to solve an ODE.
By Theorem B.2.9(a), the general solution to the ordinary differential equation
y″(t)+4y′(t)+5y(t)=0
is of the form C1u1(t)+C2u2(t) with u1(t) and u2(t) being two (independent) solutions to (ODE) and with C1 and C2 being arbitrary constants. The easiest way to find u1(t) and u2(t) is to guess them. And the easiest way to guess them is to try 6 The reason that y(t)=ert is a good guess is that, with this guess, all of y(t), y′(t) and y″(t) are constants times ert. So the left hand side of the differential equation is also a constant, that depends on r, times ert. So we just have to choose r so that the constant is zero. y(t)=ert, with r being a constant to be determined. Substituting y(t)=ert into (ODE) gives
r2ert+4rert+5ert=0⟺(r2+4r+5)ert=0⟺r2+4r+5=0
This quadratic equation for r can be solved either by using the high school formula or by completing the square.
r2+4r+5=0⟺(r+2)2+1=0⟺(r+2)2=−1⟺r+2=±i⟺r=−2±i
So the general solution to (ODE) is
y(t)=C1e(−2+i)t+C2e(−2−i)t
This is one way to write the general solution, but there are many others. In particular there are quite a few people in the world who are (foolishly) afraid 7 Embracing the complexity leads to simplicity. of complex exponentials. We can hide them by using (B.2.3) and (B.2.4).
y(t)=C1e(−2+i)t+C2e(−2−i)t=C1e−2teit+C2e−2te−it=C1e−2t(cost+isint)+C2e−2t(cost−isint)=(C1+C2)e−2tcost+(iC1−iC2)e−2tsint=D1e−2tcost+D2e−2tsint
with D1=C1+C2 and D2=iC1−iC2 being two other arbitrary constants. Don't make the mistake of thinking that D2 must be complex because i appears in the formula D2=iC1−iC2 relating D2 and C1,C2. No one said that C1 and C2 are real numbers. In fact, in typical applications, the arbitrary constants are determined by initial conditions and often D1 and D2 turn out to be real and C1 and C2 turn out to be complex. For example, the initial conditions y(0)=0, y′(0)=2 force
0=y(0)=C1+C22=y′(0)=(−2+i)C1+(−2−i)C2
The first equation gives C2=−C1 and then the second equation gives
(−2+i)C1−(−2−i)C1=2⟺2iC1=2⟺iC1=1⟺C1=−i, C2=i
and
D1=C1+C2=0D2=iC1−iC2=2
We shall now guess one solution (i.e. a particular solution) to the differential equation
y″(t)+2y′(t)+3y(t)=cost
Equations like this arise, for example, in the study of the RLC circuit. We shall simplify the computation by exploiting that cost=ℜeit. First, we shall guess a function Y(t) obeying
Y″+2Y′+3Y=eit
Then, taking complex conjugates gives
ˉY″+2ˉY′+3ˉY=e−it
which we shall call (¯ODE2). Then, adding 12(ODE2) and 12(¯ODE2) together will give
(ℜY)″+2(ℜY)′+3(ℜY)=ℜeit=cost
which shows that ℜY(t) is a solution to (ODE1). Let's try Y(t)=Aeit, with A a constant to be determined. This is a solution of (ODE2) if and only if
d2dt2(Aeit)+2ddt(Aeit)+3Aeit=eit⟺(i2+2i+3)Aeit=eit⟺A=12+2i
So eit2+2i is a solution to (ODE2) and ℜeit2+2i is a solution to (ODE1). To simplify this, write 2+2i in polar coordinates. From the sketch below we have 2+2i=2√2eiπ4.
So
eit2+2i=eit2√2eiπ4=12√2ei(t−π4)⟹ℜeit2+2i=12√2ℜei(t−π4)=12√2cos(t−π4)