Skip to main content
Mathematics LibreTexts

4.7: Fitting Exponential Models to Data

  • Page ID
    13856
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    In the previous section, we saw number lines using logarithmic scales. It is also common to see two dimensional graphs with one or both axes using a logarithmic scale.

    One common use of a logarithmic scale on the vertical axis is to graph quantities that are changing exponentially, since it helps reveal relative differences. This is commonly used in stock charts, since values historically have grown exponentially over time. Both stock charts below show the Dow Jones Industrial Average, from 1928 to 2010.

    A chart showing the Dow Jones Industrial Average from 1928 to 2010. The vertical axis goes from 0 to 14 thousand, equally spaced by 2 thousand.  The graph appears nearly flat from 1928 to 1980, then looks to curve up until 2000 where it drops down before rising up again until about 2008, when it drops again.

    A chart showing the Dow Jones Industrial Average from 1928 to 2010. The vertical axis goes from 0 to 14 thousand, with tick marks logarithmically spaced  The graph appears roughly to be increasing linear, but with a large drop in 1930, and smaller looking drops in 2000 and 2008.

    Both charts have a linear horizontal scale, but the first graph has a linear vertical scale, while the second has a logarithmic vertical scale. The first scale is the one we are more familiar with, and shows what appears to be a strong exponential trend, at least up until the year 2000.

    Example \(\PageIndex{1}\)

    There were stock market drops in 1929 and 2008. Which was larger?

    Solution

    In the first graph, the stock market drop around 2008 looks very large, and in terms of dollar values, it was indeed a large drop. However, the second graph shows relative changes, and the drop in 2009 seems less major on this graph, and in fact the drop starting in 1929 was, percentage-wise, much more significant.

    Specifically, in 2008, the Dow value dropped from about 14,000 to 8,000, a drop of 6,000. This is obviously a large value drop, and amounts to about a 43% drop. In 1929, the Dow value dropped from a high of around 380 to a low of 42 by July of 1932. While value-wise this drop of 338 is much smaller than the 2008 drop, it corresponds to a 89% drop, a much larger relative drop than in 2008. The logarithmic scale shows these relative changes.

    The second graph above, in which one axis uses a linear scale and the other axis uses a logarithmic scale, is an example of a semi-log graph.

    Definition: Semi-log and log-log GRAPHS

    A semi-log graph is a graph with one axis using a linear scale and one axis using a logarithmic scale.

    A log-log graph is a graph with both axes using logarithmic scales.

    Example \(\PageIndex{2}\)

    Plot 5 points on the graph of\(f(x)=3(2)^{x}\) on a semi-log graph with a logarithmic scale on the vertical axis.

    Solution

    To do this, we need to find 5 points on the graph, then calculate the logarithm of the output value. Arbitrarily choosing 5 input values,

    \(x\) \(f(x)\) log(\(f(x)\))
    -3 \(3(2)^{-1} = \dfrac{3}{8}\) -0.426
    -1 \(3(2)^{-1} = \dfrac{3}{2}\) 0.176
    0 \(3(2)^{0} = 3\) 0.477
    2 \(3(2)^{2} = 12\) 1.079
    5 \(3(2)^{5} = 96\) 1.982

    Plotting these values on a semi-log graph,

    A graph with the horizontal axis labeled x and the vertical axis labeled log of f of x.  Dots are plotted at approximately negative 3 comma negative 0.4, negative 1 comma 0.2, 0 comma 0.5, 2 comma 1.1, and 5 comma 2

    Notice that on this semi-log scale, values from the exponential function appear linear. We can show this behavior is expected by utilizing logarithmic properties. For the function \(f(x)=ab^{x}\), finding log(\(f(x)\)) gives

    \[\log \left(f(x)\right)=\log \left(ab^{x} \right)\nonumber\] Utilizing the sum property of logs,
    \[\log \left(f(x)\right)=\log \left(a\right)+\log \left(b^{x} \right)\nonumber\] Now utilizing the exponent property,
    \[\log \left(f(x)\right)=\log \left(a\right)+x\log \left(b\right)\nonumber\]

    This relationship is linear, with log(a) as the vertical intercept, and log(b) as the slope. This relationship can also be utilized in reverse.

    Example \(\PageIndex{3}\)

    An exponential graph is plotted on semi-log axes. Find a formula for the exponential function \(g(x)\) that generated this graph.

    A graph with the horizontal axis labeled x and the vertical axis labeled log of f of x.  A line is drawn passing through 0 comma 1 and 4 comma 4

    Solution

    The graph is linear, with vertical intercept at (0, 1). Looking at the change between the points (0, 1) and (4, 4), we can determine the slope of the line is \(\dfrac{3}{4}\). Since the output is log(\(g(x)\)), this leads to the equation \(\log \left(g(x)\right)=1+\dfrac{3}{4} x\).

    We can solve this formula for \(g(x)\) by rewriting in exponential form and simplifying:

    \[\log \left(g(x)\right)=1+\dfrac{3}{4} x\nonumber\] Rewriting as an exponential,
    \[g(x)=10^{1+\dfrac{3}{4} x}\nonumber\] Breaking this apart using exponent rules,
    \[g(x)=10^{1} \cdot 10^{\dfrac{3}{4} x}\nonumber\] Using exponent rules to group the second factor,
    \[g(x)=10^{1} \cdot \left(10^{\dfrac{3}{4} } \right)^{x}\nonumber\] Evaluating the powers of 10,
    \[g(x)=10\left(5.623\right)^{x}\nonumber\]

    Exercise \(\PageIndex{1}\)

    An exponential graph is plotted on a semi-log graph below. Find a formula for the exponential function \(g(x)\) that generated this graph.

    A graph with the horizontal axis labeled x and the vertical axis labeled log of f of x.  A line is drawn passing through 0 comma 2 and 2 comma 1

    Answer

    \[g(x) = 10^{2 - 0.5x} = 10^2 (10^{-0.5})^{x},\quad f(x) = 100 (0.3162)^x\nonumber\]

    Fitting Exponential Functions to Data

    Some technology options provide dedicated functions for finding exponential functions that fit data, but many only provide functions for fitting linear functions to data. The semi-log scale provides us with a method to fit an exponential function to data by building upon the techniques we have for fitting linear functions to data.

    to fit an exponential function to a set of data using linearization

    1. Find the log of the data output values
    2. Find the linear equation that fits the (input, log(output)) pairs. This equation will be of the form log(\(f(x)\)) = \(b + mx\).
    3. Solve this equation for the exponential function \(f(x)\)

    Example \(\PageIndex{4}\)

    The table below shows the cost in dollars per megabyte of storage space on computer hard drives from 1980 to 2004, and the data is shown on a standard graph to the right, with the input changed to years after 1980.

    \(\begin{array}{|l|l|}
    \hline \text { Year } & \text { Cost per MB } \\
    \hline 1980 & 192.31 \\
    \hline 1984 & 87.86 \\
    \hline 1988 & 15.98 \\
    \hline 1992 & 4 \\
    \hline 1996 & 0.173 \\
    \hline 2000 & 0.006849 \\
    \hline 2004 & 0.001149 \\
    \hline
    \end{array}\)

    A graph with horizontal axis labeled t and vertical axis labeled Cost. Dots are plotted at approximately 0 comma 192, 4 comma 88, 8 comma 16, 12 comma 4, 16 comma 0.2, 20 comma 0.007, 24 comma 0.001

    This data appears to be decreasing exponentially. To find a function that models this decay, we would start by finding the log of the costs.

    \(\begin{array}{|l|l|l|l|}
    \hline \text { Year } & t & \text { Cost per MB } & \log (\text { Cost }) \\
    \hline 1980 & 0 & 192.31 & 2.284002 \\
    \hline 1984 & 4 & 87.86 & 1.943791 \\
    \hline 1988 & 8 & 15.98 & 1.203577 \\
    \hline 1992 & 12 & 4 & 0.60206 \\
    \hline 1996 & 16 & 0.173 & -0.76195 \\
    \hline 2000 & 20 & 0.006849 & -2.16437 \\
    \hline 2004 & 24 & 0.001149 & -2.93952 \\
    \hline
    \end{array}\)

    A graph with horizontal axis labeled t and vertical axis labeled log of Cost. Dots are plotted at approximately 0 comma 2.3, 4 comma 1.9, 8 comma 1.2, 12 comma 0.6, 16 comma negative 0.8, 20 comma negative 2.2, 24 comma negative 2.9

    Solution

    As hoped, the graph of the log of costs appears fairly linear, suggesting an exponential function will fit the original data will fit reasonably well. Using technology, we can find a linear equation to fit the log(Cost) values. Using \(t\) as years after 1980, linear regression gives the equation:

    \[\log (C(t))=2.794-0.231t\nonumber\]

    Solving for \(C(t)\),

    \[C(t)=10^{2.794-0.231t}\nonumber\]
    \[C(t)=10^{2.794} \cdot 10^{-0.231t}\nonumber\]
    \[C(t)=10^{2.794} \cdot \left(10^{-0.231} \right)^{t}\nonumber\]
    \[C(t)=622\cdot \left(0.5877\right)^{t}\nonumber\]

    This equation suggests that the cost per megabyte for storage on computer hard drives is decreasing by about 41% each year.

    Using this function, we could predict the cost of storage in the future. Predicting the cost in the year 2020 (\(t = 40\)):

    \(C(40) =622\left(0.5877\right)^{40} \approx 0.000000364\) dollars per megabyte, a really small number. That is equivalent to $0.36 per terabyte of hard drive storage.

    Comparing the values predicted by this model to the actual data, we see the model matches the original data in order of magnitude, but the specific values appear quite different. This is, unfortunately, the best exponential model that can fit the data. It is possible that a non-exponential model would fit the data better, or there could just be wide enough variability in the data that no relatively simple model would fit the data any better.

    Year Actual Cost per MB Cost predicted by model
    1980 192.31 622.3
    1984 87.86 74.3
    1988 15.98 8.9
    1992 4 1.1
    1996 0.173 0.13
    2000 0.006849 0.015
    2004 0.001149 0.0018

    Exercise \(\PageIndex{2}\)

    The table below shows the value \(V\), in billions of dollars, of US imports from China \(t\) years after 2000.

    year 2000 2001 2002 2003 2004 2005
    \(t\) 0 1 2 3 4 5
    \(V\) 100 102.3 125.2 152.4 196  

    This data appears to be growing exponentially. Linearize this data and build a model to predict how many billions of dollars of imports were expected in 2011.

    Answer

    \(V(t) = 90.545 (1.2078)^t\). Predicting in 2011, \(V(11) = 722.45\) billion dollars.

    Important Topics of this Section

    • Semi-log graph
    • Log-log graph
    • Linearizing exponential functions
    • Fitting an exponential equation to data

    This page titled 4.7: Fitting Exponential Models to Data is shared under a CC BY-SA 4.0 license and was authored, remixed, and/or curated by David Lippman & Melonie Rasmussen (The OpenTextBookStore) via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.