# 2.5: The Chain Rule

- Page ID
- 4303

Learning Objectives

In this section, we strive to understand the ideas generated by the following important questions:

- What is a composite function and how do we recognize its structure algebraically?
- Given a composite function \(C(x) = f (g(x))\) that is built from differentiable functions \(f\) and \(g\), how do we compute \(C'(x)\) in terms of \(f\), \(g\), \(f'\), and \(g'\)?
- What is the statement of the Chain Rule?

In addition to learning how to differentiate a variety of basic functions, we have also been developing our ability to understand how to use rules to differentiate certain algebraic combinations of them. For example, we not only know how to take the derivative of \(f (x) = sin(x)\) and \(g(x) = x^2 \), but now we can quickly find the derivative of each of the following combinations of \(f\) and \(g\):

\[s(x) = 3x^2 - 5 \sin(x)\]

\[p(x) = x^2 \sin(x),\]

and

\[q(x) = \sin(x) x^2.\]

Finding \(s'\) uses the sum and constant multiple rules, determining \(p'\) requires the product rule, and \(q'\) can be attained with the quotient rule. Again, we note the importance of recognizing the algebraic structure of a given function in order to find its derivative:

\[s(x) = 3g(x) - 5f (x)\]

\[p(x) = g(x) \cdot f(x),\]

and

\[q(x) = \dfrac{ f(x)}{g(x)}.\]

There is one more natural way to algebraically combine basic functions, and that is by composing them. For instance, let’s consider the function

\[C(x) = \sin(x^2)\]

and observe that any input \(x\) passes through a chain of functions. In particular, in the process that defines the function \(C(x)\), \(x\) is first squared, and then the sine of the result is taken. Using an arrow diagram,

\[ x → x^2 → \sin(x^2 ).\]

In terms of the elementary functions \(f\) and \(g\), we observe that \(x\) is first input in the function \(g\), and then the result is used as the input in \(f\). Said differently, we can write

\[C(x) = f(g(x)) = \sin(x^2 )\]

and say that \(C\) is the *composition *of \(f\) and \(g\). We will refer to \(g\), the function that is first applied to \(x\), as the *inner function*, while \(f\), the function that is applied to the result, is the *outer function*.

The main question that we answer in the present section is: given a composite function \(C(x) = f(g(x))\) that is built from differentiable functions \(f\) and g, how do we compute C'(x) in terms of \(f\), \(g\), \(f'\), and \(g'\)? In the same way that the rate of change of a product of two functions,

\[p(x) = f(x) · g(x),\]

depends on the behavior of both \(f\) and \(g\), it makes sense intuitively that the rate of change of a composite function \(C(x) = f (g(x))\) will also depend on some combination of \(f\) and \(g\) and their derivatives. The rule that describes how to compute \(C'\) in terms of \(f\) and \(g\) and their derivatives will be called the **chain rule**. But before we can learn what the chain rule says and why it works, we first need to be comfortable decomposing composite functions so that we can correctly identify the inner and outer functions, as we did in the example above with

\[C(x) = \sin(x^2).\]

Preview Activity \(\PageIndex{1}\):

For each function given below, identify its fundamental algebraic structure. In particular, is the given function a sum, product, quotient, or composition of basic functions? If the function is a composition of basic functions, state a formula for the inner function \(g\) and the outer function \(f\) so that the overall composite function can be written in the form \(f (g(x))\). If the function is a sum, product, or quotient of basic functions, use the appropriate rule to determine its derivative.

- \(h(x) = tan(2^x )\)
- \(p(x) = 2^x \tan(x)\)
- \(r(x) = (\tan(x))^2\)
- \(m(x) = e^{\tan(x)}\)
- \(w(x) = \sqrt{x} + \tan(x) \)
- \(z(x) = \sqrt{ \tan(x)}\)

## The Chain Rule

One of the challenges of differentiating a composite function is that it often cannot be written in an alternate algebraic form. For instance, the function

\[C(x) = \sin(x^2)\]

cannot be expanded or otherwise rewritten, so it presents no alternate approaches to taking the derivative. But other composite functions can be expanded or simplified, and these present a way to begin to explore how the chain rule might have to work. To that end, we consider two examples of composite functions that present alternate means of finding the derivative.

Example \(\PageIndex{1}\): Derivative of a Composite Function of Polynomials

Let \(f (x) = -4x + 7\) and \(g(x) = 3x - 5\). Determine a formula for \(C(x) = f(g(x))\) and compute \(C'(x)\). How is \(C'\) related to \(f\) and \(g\) and their derivatives?

**Solution**

By the rules given for \(f\) and \(g\),

\[ \begin{align} C(x) &= f(g(x)) \nonumber \\ &= f(3x - 5) \nonumber\\ &= -4(3x - 5) + 7 \nonumber\\ &= -12x + 20 + 7 \nonumber\\ &= -12x + 27. \nonumber\end{align}\]

Thus, \(C'(x) = -12\). Noting that \(f'(x) = -4\) and \(g' (x) = 3\), we observe that \(C'\) appears to be the product of \(f'\) and \(g'\).

From one perspective, Example \(\PageIndex{1}\) may be too elementary. Linear functions are the simplest of all functions, and perhaps composing linear functions (which yields another linear function) does not exemplify the true complexity that is involved with differentiating a composition of more complicated functions. At the same time, we should remember the perspective that any differentiable function is locally linear, so any function with a derivative behaves like a line when viewed up close. From this point of view, the fact that the derivatives of \(f\) and \(g\) are multiplied to find the derivative of their composition turns out to be a key insight. We now consider a second example involving a nonlinear function to gain further understanding of how differentiating a composite function involves the basic functions that combine to form it.

Example \(\PageIndex{2}\): Derivative of a Composite Function involving trigonometric functiosn

Let \(C(x) = \sin(2x)\). Use the double angle identity to rewrite \(C\) as a product of basic functions, and use the product rule to find \(C'\). Rewrite \(C'\) in the simplest form possible.

**Solution**

By the double angle identity for the sine function

\[C(x) = \sin(2x) = 2 \sin(x) \cos(x).\]

Applying the product rule and simplifying,

\[C'(x) = 2 \sin(x)(- \sin(x)) + \cos(x)(2 \cos(x)) = 2(\cos^2 (x) - \sin^2 (x)). \]

Next, we recall that one of the double angle identities for the cosine function tells us that

\[\cos(2x) = \cos^2 (x) - \sin^2 (x).\]

Substituting this result in our expression for \(C'(x)\), we now have that \(C'(x) = 2 \cos(2x)\).

So from Example \(\PageIndex{2}\), we see that if \(C(x) = \sin(2x)\), then \(C'(x) = 2 \cos(2x)\). Letting \(g(x) = 2x\) and \(f(x) = \sin(x)\), we observe that \(C(x) = f(g(x))\). Moreover, with \(g'(x) = 2\) and \(f'(x) = \cos(x)\), it follows that we can view the structure of \(C'(x)\) as

\[C'(x) = 2 \cos(2x) = g'(x) f'(g(x)). \]

In this example, we see that for the composite function \(C(x) = f(g(x))\), the derivative \(C'\) is (as in the example involving linear functions) constituted by multiplying the derivatives of \(f\) and \(g\), but with the special condition that \(f'\) is evaluated at \(g(x)\), rather than at \(x\).

It makes sense intuitively that these two quantities are involved in understanding the rate of change of a composite function: if we are considering \(C(x) = f(g(x))\) and asking how fast \(C\) is changing at a given \(x\) value as \(x\) changes, it clearly matters how fast \(g\) is changing at \(x\), as well as how fast \(f\) is changing at the value of \(g(x)\). It turns out that this structure holds not only for the functions in Examples \(\PageIndex{1}\) and \(\PageIndex{2}\), but indeed for all differentiable functions as is stated in the Chain Rule (Like other differentiation rules, the Chain Rule can be proved formally using the limit definition of the derivative).

Chain Rule

If \(g\) is differentiable at \(x\) and \(f\) is differentiable at \(g(x)\), then the composite function \(C\) defined by \(C(x) = f(g(x))\) is differentiable at \(x\) and

\[C'(x) = f' (g(x))g'(x). \label{chain}\]

As with the product and quotient rules, it is often helpful to think verbally about what the chain rule says:

If \(C\) is a composite function defined by an outer function \(f\) and an inner function \(g\), then \(C'\) is given by the derivative of the outer function, evaluated at the inner function, times the derivative of the inner function.

At least initially in working particular examples requiring the chain rule, it can also be helpful to clearly identify the inner function \(g\) and outer function \(f\), compute their derivatives individually, and then put all of the pieces together to generate the derivative of the overall composite function. To see what we mean by this, consider the function

\[r(x) = (\tan(x))^2 .\]

The function \(r\) is composite, with inner function \(g(x) = \tan(x)\) and outer function \(f (x) = x^2\). Organizing the key information involving \(f\), \(g\), and their derivatives, we have

- \(f (x) = x ^2\)
- \(g(x) = \tan(x)\)
- \(f'(x) = 2x \)
- \(g'(x) = \text{sec}^2 (x) \)
- \(f'(g(x)) = 2 \tan(x)\)

Applying the chain rule (Equation \ref{chain}), which tells us that

\[r' (x) = f'(g(x))g'(x),\]

we find that for \(r(x) = (\tan(x))^2\), its derivative is

\[r'(x) = 2 \tan(x)\text{sec}^2 (x).\]

As a side note, we remark that another way to write \(r(x)\) is \(r(x) = \tan^2 (x).\) Observe that in this format, the composite nature of the function is more implicit, but this is common notation for powers of trigonometric functions: \(cos^4 (x)\), \(sin^5 (x)\), and \(\text{sec}^2 (x)\) are all composite functions, with the outer function a power function and the inner function a trigonometric one.

The chain rule now substantially expands the library of functions we can differentiate, as the following activity demonstrates.

Activity \(\PageIndex{1}\): Inner vs. Outer Functions

For each function given below, identify an inner function \(g\) and outer function \(f\) to write the function in the form \(f (g(x))\). Then, determine \(f'(x)\), \(g'(x)\), and \(f'(g(x))\), and finally apply the chain rule (Equation \ref{chain}) to determine the derivative of the given function.

- \(h(x) = \cos(x^4 )\)
- \(p(x) = \sqrt{\tan(x)}\)
- \(s(x) = 2^{\sin(x)}\)
- \(z(x) = \cot^5 (x)\)
- \(m(x) = (\text{sec}(x) + e^x )^9\)

Using multiple rules simultaneously The chain rule now joins the sum, constant multiple, product, and quotient rules in our collection of the different techniques for finding the derivative of a function through understanding its algebraic structure and the basic functions that constitute it. It takes substantial practice to get comfortable with navigating multiple rules in a single problem; using proper notation and taking a few extra steps can be particularly helpful as well. We demonstrate with an example and then provide further opportunity for practice in the following activity.

Example \(\PageIndex{3}\): A more Complex Composite Function

Find a formula for the derivative of \[h(t) = 3^{t^2+2t} \text{sec}^4 (t).\]

**Solution **

We first observe that the most basic structure of \(h\) is that it is the product of two functions:

\[h(t) = a(t) \cdot b(t)\]

where \(a(t) = 3^{t^2+2t}\) and \(b(t) = \text{sec}^4 (t)\). Therefore, we see that we will need to use the product rule to differentiate \(h\). When it comes time to differentiate \(a\) and \(b\) in their roles in the product rule, we observe that since each is a composite function, the chain rule will be needed. We therefore begin by working separately to compute \(a'(t)\) and \(b'(t)\).

Writing

\[a(t) = f(g(t)) = 3^{t^2+2t}\]

and finding the derivatives of \(f\) and \(g\), we have

- \(f (t) = 3^t
- \(g(t) = t^2 + 2t\)
- \(f'(t) = 3^t \ln (3)\)
- \(g'(t) = 2t + 2\)
- \(f'(g(t)) = 3^{t^2+2 t} \ln (3) \)

Thus, by the chain rule, it follows that

\[a' (t) = f'(g(t))g'(t) = 3^{t^2+2t} \ln (3)(2t + 2). \]

Turning next to \(b\), we write

\[b(t) = r(s(t)) = \text{sec}^4 (t)\]

and find the derivatives of \(r\) and \(g\). Doing so,

- \(r(t) = t^4\)
- \(s(t) = \text{sec}(t)\)
- \(r' (t) = 4t^3\)
- \(s' (t) = \text{sec}(t)\tan(t)\)
- \(r' (s(t)) = 4 \text{sec}^3 (t)\)

By the chain rule (Equation \ref{chain}), we now know that

\[b' (t) = r' (s(t))s' (t) = 4 \text{sec}^3 (t)\text{sec}(t)\tan(t) = 4 \text{sec}^4 (t)\tan(t).\]

Now we are finally ready to compute the derivative of the overall function \(h\). Recalling that

\[h(t) = 3^{t^2+2t} \text{sec}^4 (t),\]

by the product rule we have

\[h ' (t) = 3 t 2+2t \dfrac{d}{dt} [\text{sec}^4 (t)] + \text{sec}^4 (t) \dfrac{d }{dt} \left[3^{ t^2+2t} \right].\]

From our work above with \(a\) and \(b\), we know the derivatives of \(3^{t^2+2t}\) and \(\text{sec}^4 (t)\), and therefore

\[h' (t) = 3^{ t^2+2t} 4 \text{sec}^4 (t)\tan(t) + \text{sec}^4 (t)3^{ t^2+2t} \ln (3)(2t + 2). \]

Activity \(\PageIndex{2}\)

For each of the following functions, find the function’s derivative. State the rule(s) you use, label relevant derivatives appropriately, and be sure to clearly identify your overall answer.

- \(p(r) = 4 \sqrt{ r^6 + 2e^r}\)
- \(m(v) = \sin(v^2) \cos(v^3 )\)
- \(h(y) = \dfrac{\cos(10y)}{e^{4y} + 1}\)
- \(s(z) = 2^{z^2 \text{sec}(z)}\)
- \(c(x) = \sin(e^{x^2})\)

The chain rule now adds substantially to our ability to do different familiar problems that involve derivatives. Whether finding the equation of the tangent line to a curve, the instantaneous velocity of a moving particle, or the instantaneous rate of change of a certain quantity, if the function under consideration involves a composition of other functions, the chain rule is indispensable.

Activity \(\PageIndex{3}\)

Use known derivative rules, including the chain rule, as needed to answer each of the following questions.

- Find an equation for the tangent line to the curve \(y = \sqrt{ e^x + 3}\) at the point where \(x = 0\).
- If \[s(t) = \dfrac{1}{(t^2 + 1)^3}\] represents the position function of a particle moving horizontally along an axis at time \(t\) (where \(s\) is measured in inches and t in seconds), find the particle’s instantaneous velocity at \(t = 1\). Is the particle moving to the left or right at that instant?
- At sea level, air pressure is 30 inches of mercury. At an altitude of \(h\) feet above sea level, the air pressure, \(P\), in inches of mercury, is given by the function \[P = 30e^{-0.0000323h}.\] Compute \(dP/dh\) and explain what this derivative function tells you about air pressure, including a discussion of the units on \(dP/dh\). In addition, determine how fast the air pressure is changing for a pilot of a small plane passing through an altitude of 1000 feet.
- Suppose that \(f (x)\) and \(g(x)\) are differentiable functions and that the following information about them is known:

x |
f (x) |
f'(x) |
g(x) |
g'(x) |
---|---|---|---|---|

-1 | 2 | -5 | -3 | 4 |

2 | -3 | 4 | -1 | 2 |

If \(C(x)\) is a function given by the formula \(f (g(x))\), determine \(C'(2)\). In addition, if \(D(x)\) is the function \(f ( f(x))\), find \(D' (-1)\).

## The Composite Version of Basic Function Rules

As we gain more experience with differentiating complicated functions, we will become more comfortable in the process of simply writing down the derivative without taking multiple steps. We demonstrate part of this perspective here by showing how we can find a composite rule that corresponds to two of our basic functions. For instance, we know that

\[\dfrac{d}{dx} [\sin(x)] = \cos(x).\]

If we instead want to know

\[\dfrac{d}{dx} [\sin(u(x))],\]

where \(u\) is a differentiable function of \(x\), then this requires the chain rule with the sine function as the outer function. Applying the chain rule (Equation \ref{chain}),

\[\dfrac{d}{dx} [\sin(u(x))] = \cos(u(x)) \cdot u' (x). \]

Similarly, since

\[\dfrac{d}{dx} [a^x ] = a^x \ln (a),\]

it follows by the chain rule that

\[\dfrac{d}{dx} \left[a^{u(x)} \right] = a^{u(x)} \ln (a) \cdot u' (x). \]

In the process of getting comfortable with derivative rules, an excellent exercise is to write down a list of all basic functions whose derivatives are known, list those derivatives, and then write the corresponding chain rule for the composite version with the inner function being an unknown function \(u(x)\) and the outer function being the known basic function. These versions of the chain rule are particularly simple when the inner function is linear, since the derivative of a linear function is a constant. For instance,

\[ \dfrac{d}{dx} (5x + 7)^{10} = 10(5x + 7)^9 \cdot 5\]

\[ \dfrac{d}{dx} \left[\tan(17x) \right] = 17 \text{sec}^2 (17x),\]

and

\[\dfrac{d}{dx} \left[ e^{-3x} \right] = -3e^{-3x} .\]

## Summary

In this section, we encountered the following important ideas:

- A composite function is one where the input variable x first passes through one function, and then the resulting output passes through another. For example, the function \[h(x) = 2^{\sin(x)}\] is composite since \[x → \sin(x) → 2^{\sin(x)}.\]
- Given a composite function \(C(x) = f(g(x))\) that is built from differentiable functions \(f\) and \(g\), the chain rule tells us that we compute \(C'(x)\) in terms of \(f\), \(g\), \(f'\), and \(g'\) according to the formula \[C'(x) = f'(g(x))g'(x).\]