5.3: Integration by Substitution

Last updated
Save as PDF

Page ID: 4322

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

Learning Objectives

In this section, we strive to understand the ideas generated by the following important questions:

How can we begin to find algebraic formulas for antiderivatives of more complicated algebraic functions?
What is an indefinite integral and how is its notation used in discussing antiderivatives?
How does the technique of u-substitution work to help us evaluate certain indefinite integrals, and how does this process rely on identifying function-derivative pairs?

In Section 4.4, we learned the key role that antiderivatives play in the process of evaluating definite integrals exactly. In particular, the Fundamental Theorem of Calculus tells us that if \(F\) is any antiderivative of \(f\), then

\[\int^b_a f (x) dx = F(b) − F(a).\]

Furthermore, we realized that each elementary derivative rule developed in Chapter 2 leads to a corresponding elementary antiderivative, as summarized in Table 4.1. Thus, if we wish to evaluate an integral such as

\[\int_0^1 x 3 − √ x + 5 x dx \label{eq5.3}\]

it is straightforward to do so, since we can easily antidifferentiate f (x) = x 3 − √ x + 5 x . In particular, since a function \(F\) whose derivative is \(f\) is given by

\[F(x) = 1 4 x 4− 2 3 x 3/2+ 1 \ln (5) 5 x \]

the Fundamental Theorem of Calculus tells us that

\[\int_0^1 x 3 − √ x + 5 x dx = 1 4 x 4 − 2 3 x 3/2 + 1 \ln (5) 5 x 1 0 = 1 4 (1) 4 − 2 3 (1) 3/2 + 1 \ln (5) 5 1 ! − 0 − 0 + 1 \ln (5) 5 0 ! = − 5 12 + 4 \ln (5) .\]

Because an algebraic formula for an antiderivative of f enables us to evaluate the definite integral

\[\int ^b_a f (x) dx\]

exactly, we see that we have a natural interest in being able to find such algebraic antiderivatives. Note that we emphasize algebraic antiderivatives, as opposed to any antiderivative, since we know by the Second Fundamental Theorem of Calculus that

\[G(x) = \int^x_a f (t) dt\]

is indeed an antiderivative of the given function \(f\), but one that still involves a definite integral. One of our main goals in this section and the one following is to develop understanding, in select circumstances, of how to “undo” the process of differentiation in order to find an algebraic antiderivative for a given function.

Preview Activity \(\PageIndex{1}\)

In Section 2.5, we learned the Chain Rule and how it can be applied to find the derivative of a composite function. In particular, if \(u\) is a differentiable function of \(x\), and \(f\) is a differentiable function of \(u(x)\), then

\[\dfrac{d}{dx} [ f (u(x))] = f' (u(x)) · u 0 (x).\]

In words, we say that the derivative of a composite function c(x) = f (u(x)), where f is considered the “outer” function and u the “inner” function, is “the derivative of the outer function, evaluated at the inner function, times the derivative of the inner function.”

(a) For each of the following functions, use the Chain Rule to find the function’s derivative. Be sure to label each derivative by name (e.g., the derivative of g(x) should be labeled g' (x)).

g(x) = e 3x
h(x) = sin(5x + 1)
p(x) = arctan(2x)
q(x) = (2 − 7x) 4
r(x) = 3 4−11x

(b) For each of the following functions, use your work in (a) to help you determine the general antiderivative3 of the function. Label each antiderivative by name (e.g., the antiderivative of m should be called M). In addition, check your work by computing the derivative of each proposed antiderivative.

m(x) = e 3x
n(x) = cos(5x + 1)
s(x) = 1 1+4x 2 3Recall that the general antiderivative of a function includes “+C” to reflect the entire family of functions that share the same derivative.
v(x) = (2 − 7x) 3 v. w(x) = 3 4−11x

(c) Based on your experience in parts (a) and (b), conjecture an antiderivative for each of the following functions. Test your conjectures by computing the derivative of each proposed antiderivative.

a(x) = cos(πx)
b(x) = (4x + 7) 11
c(x) = xex 2 ./

Reversing the Chain Rule: First Steps

In Preview Activity \(\PageIndex{1}\), we saw that it is usually straightforward to antidifferentiate a function of the form h(x) = f (u(x)), whenever f is a familiar function whose antiderivative is known and u(x) is a linear function. For example, if we consider h(x) = (5x − 3) 6 , in this context the outer function f is f (u) = u 6 , while the inner function is u(x) = 5x − 3. Since the antiderivative of f is

\[F(u) = 1 7 u 7 + C,\]

we see that the antiderivative of h is

\[H(x) = 1 7 (5x − 3) 7 · 1 5 + C = 1 35 (5x − 3) 7 + C.\]

The inclusion of the constant 1 5 is essential precisely because the derivative of the inner function is u 0 (x) = 5. Indeed, if we now compute \(H' (x)\), we find by the Chain Rule (and Constant Multiple Rule) that

\[H ' (x) = 1 35 · 7(5x − 3) 6 · 5 = (5x − 3) 6 = h(x), \]

and thus H is indeed the general antiderivative of \(h\). Hence, in the special case where the outer function is familiar and the inner function is linear, we can antidifferentiate composite functions according to the following rule. If \(h(x) = f (ax + b)\) and \(F\) is a known algebraic antiderivative of f , then the general antiderivative of h is given by

\[H(x) = 1 a F(ax + b) + C. \]

When discussing antiderivatives, it is often useful to have shorthand notation that indicates the instruction to find an antiderivative. Thus, in a similar way to how the notation d dx [ f (x)] represents the derivative of f (x) with respect to x, we use the notation of the indefinite integral, Z f (x) dx to represent the general antiderivative of \(f\) with respect to x. For instance, returning to the earlier example with h(x) = (5x − 3) 6 above, we can rephrase the relationship between h and its antiderivative H through the notation

\[\int (5x − 3) 6 dx = 1 35 (5x − 6) 7 + C.\]

When we find an antiderivative, we will often say that we evaluate an indefinite integral; said differently, the instruction to evaluate an indefinite integral means to find the general antiderivative. Just as the notation d dx [] means “find the derivative with respect to x of ,” the notation R dx means “find a function of x whose derivative is .”

Activity \(\PageIndex{2}\)

Evaluate each of the following indefinite integrals. Check each antiderivative that you find by differentiating.

\(\displaystyle \int sin(8 − 3x) dx\)
\(\displaystyle \int sec2 (4x) dx\)
\(\displaystyle \int 1 11x−9 dx\)
\(\displaystyle \int csc(2x + 1) cot(2x + 1) dx\)
\(\displaystyle \int 1 √ 1−16x 2 dx\)
\(\displaystyle \int 5 −x dx\)

Reversing the Chain Rule

u-substitution Of course, a natural question arises from our recent work: what happens when the inner function is not a linear function? For example, can we find antiderivatives of such functions as \(g(x) = xex^2\) and \(h(x) = e x^2\)? It is important to explicitly remember that differentiation and antidifferentiation are essentially inverse processes; that they are not quite inverse processes is due to the +C that arises when antidifferentiating. This close relationship enables us to take any known derivative rule and translate it to a corresponding rule for an indefinite integral. For example, since d dx x 5 = 5x 4 , we can equivalently write Z 5x 4 dx = x 5 + C. Recall that the Chain Rule states that

\[\dfrac{d}{dx} [ f (g(x))] = f' (g(x)) · g' (x).\]

Restating this relationship in terms of an indefinite integral,

\[\int f' (g(x))g' (x) dx = f (g(x)) + C. \label{5.5} \]

Hence, Equation \ref{5.5} tells us that if we can take a given function and view its algebraic structure as f' (g(x))g' (x) for some appropriate choices of f and g, then we can antidifferentiate the function by reversing the Chain Rule. It is especially notable that both g(x) and g' (x) appear in the form of f' (g(x))g' (x); we will sometimes say that we seek to identify a function-derivative pair when trying to apply the rule in Equation \ref{5.5}. In the situation where we can identify a function-derivative pair, we will introduce a new variable u to represent the function g(x). Observing that with u = g(x), it follows in Leibniz notation that du dx = g' (x), so that in terms of differentials4 , du = g' (x) dx. Now converting the indefinite integral of interest to a new one in terms of u, we have

\[\int f' (g(x))g' (x) dx = \int f' (u) du.\]

Provided that f' is an elementary function whose antiderivative is known, we can now 4 If we recall from the definition of the derivative that du dx ≈ 4u 4x and use the fact that du dx = g' (x), then we see that g' (x) ≈ 4u 4x . Solving for 4u, 4u ≈ g' (x)4x. It is this last relationship that, when expressed in “differential” notation enables us to write

\[du = g' (x) dx\]

in the change of variable formula.

easily evaluate the indefinite integral in u, and then go on to determine the desired overall antiderivative of f' (g(x))g' (x). We call this process u-substitution. To see u-substitution at work, we consider the following example.

Example \(\PageIndex{1}\):

Evaluate the indefinite integral

\[\int x^3 · sin(7x 4 + 3) dx\]

and check the result by differentiating.

Solution

We can make two key algebraic observations regarding the integrand, x 3 · sin(7x 4 + 3). First, sin(7x 4 + 3) is a composite function; as such, we know we’ll need a more sophisticated approach to antidifferentiating.

Second, x 3 is almost the derivative of (7x 4 + 3); the only issue is a missing constant. Thus, x 3 and (7x 4 + 3) are nearly a function-derivative pair. Furthermore, we know the antiderivative of f (u) = sin(u). The combination of these observations suggests that we can evaluate the given indefinite integral by reversing the chain rule through u-substitution. Letting u represent the inner function of the composite function sin(7x 4 + 3), we have u = 7x 4 + 3, and thus du dx = 28x 3 . In differential notation, it follows that

\[du = 28x 3 dx,\]

and thus

\[x 3 dx = 1 28 du.\]

We make this last observation because the original indefinite integral may now be written Z sin(7x 4 + 3) · x 3 dx, and so by substituting the expressions in u for x (specifically u for 7x 4 + 3 and 1 28 du for x 3 dx), it follows that

\[\int sin(7x 4 + 3) · x 3 dx = Z sin(u) · 1 28 du.\]

Now we may evaluate the original integral by first evaluating the easier integral in u, followed by replacing u by the expression 7x 4 + 3. Doing so, we find

\[\int sin(7x 4 + 3) · x 3 dx = Z sin(u) · 1 28 du = 1 28 Z sin(u) du = 1 28 (− cos(u)) + C = − 1 28 cos(7x 4 + 3) + C.\]

To check our work, we observe by the Chain Rule that

\[\dfrac{d}{dx} − 1 28 cos(7x 4 + 3) + C = − 1 28 · (−1)sin(7x 4 + 3) · 28x 3 = sin(7x 4 + 3) · x 3\]

which is indeed the original integrand. An essential observation about our work in Example 5.2 is that the u-substitution only worked because the function multiplying sin(7x 4 + 3) was x 3 . If instead that function was x 2 or x 4 , the substitution process may not (and likely would not) have worked. This is one of the primary challenges of antidifferentiation: slight changes in the integrand make tremendous differences. For instance, we can use u-substitution with u = x 2 and du = 2xdx to find that

\[\int xex 2 dx = Z e u · 1 2 du = 1 2 Z e u du = 1 2 e u + C = 1 2 e x 2 + C.\]

If, however, we consider the similar indefinite integral

\[\int e x 2 dx,\]

the missing x to multiply e x 2 makes the u-substitution u = x 2 no longer possible. Hence, part of the lesson of u-substitution is just how specialized the process is: it only applies to situations where, up to a missing constant, the integrand that is present is the result of applying the Chain Rule to a different, related function.

Activity \(\PageIndex{3}\)

Evaluate each of the following indefinite integrals by using these steps:

Find two functions within the integrand that form (up to a possible missing constant) a function-derivative pair;
Make a substitution and convert the integral to one involving u and du;
Evaluate the new integral in u;
Convert the resulting function of u back to a function of x by using your earlier substitution;
Check your work by differentiating the function of x. You should come up with the integrand originally given.

\(displaystyle \int x 2 5x 3 + 1 dx\)
\(displaystyle \int e x sin(e x ) dx\)
\(displaystyle \int cos( √ x) √ x dx C\)

Evaluating Definite Integrals via u-substitution

We have just introduced u-substitution as a means to evaluate indefinite integrals of functions that can be written, up to a constant multiple, in the form f (g(x))g' (x). This same technique can be used to evaluate definite integrals involving such functions, though we need to be careful with the corresponding limits of integration. Consider, for instance, the definite integral

\[\int^5_2 xex 2 dx.\]

Whenever we write a definite integral, it is implicit that the limits of integration correspond to the variable of integration. To be more explicit, observe that

\[\int^5_2 xex 2 dx = Z x=5 x=2 xex 2 dx.\]

When we execute a u-substitution, we change the variable of integration; it is essential to note that this also changes the limits of integration. For instance, with the substitution u = x 2 and du = 2x dx, it also follows that when x = 2, u = 2 2 = 4, and when x = 5, u = 5 2 = 25. Thus, under the change of variables of u-substitution, we now have

\[\int^{x=5}_{x=2} xex 2 dx = Z u=25 u=4 e u · 1 2 du = 1 2 e u u=25 u=4 = 1 2 e 25 − 1 2 e 4 \]

Alternatively, we could consider the related indefinite integral R xex 2 dx, find the antiderivative 1 2 e x 2 through u-substitution, and then evaluate the original definite integral.

From that perspective, we’d have

\[\int^5_2 xex 2 dx = 1 2 e x 2 5 2 = 1 2 e 25 − 1 2 e 4 \]

which is, of course, the same result.

Activity \(\PageIndex{1}\)

Evaluate each of the following definite integrals exactly through an appropriate usubstitution.

Z 2 1 x 1 + 4x 2 dx
\int_0^1 e −x (2e −x + 3) 9 dx
Z 4/π 2/π cos 1 x x 2 dx C

Summary

In this section, we encountered the following important ideas:

To begin to find algebraic formulas for antiderivatives of more complicated algebraic functions, we need to think carefully about how we can reverse known differentiation rules. To that end, it is essential that we understand and recall known derivatives of basic functions, as well as the standard derivative rules.
The indefinite integral provides notation for antiderivatives. When we write “R f (x) dx,” we mean “the general antiderivative of f .” In particular, if we have functions f and F such that f' = f , the following two statements say the exact thing: d dx [F(x)] = f (x) and Z f (x) dx = F(x) + C. That is, f is the derivative of F, and F is an antiderivative of f.
The technique of R u-substitution helps us evaluate indefinite integrals of the form f (g(x))g' (x) dx through the substitutions u = g(x) and du = g' (x) dx, so that Z f (g(x))g' (x) dx = Z f (u) du. A key part of choosing the expression in x to be represented by u is the identification of a function-derivative pair. To do so, we often look for an “inner” function g(x) that is part of a composite function, while investigating whether g' (x) (or a constant multiple of g' (x)) is present as a multiplying factor of the integrand.

Search

Text Color

Text Size

Margin Size

Font Type