Mathematics LibreTexts

6.2.3: Describing Trends in Scatter Plots


    Lesson

    Let's look for associations between variables.

    Exercise \(\PageIndex{1}\): Which One Doesn't Belong - Scatter Plots

    Which one doesn't belong?

    clipboard_e31310349f745598336347e379ac8f854.png
    Figure \(\PageIndex{1}\)

    Exercise \(\PageIndex{2}\): Fitting Lines

    Experiment with finding lines to fit the data. Drag the points to move the line. You can close the expressions list by clicking on the double arrow.

    1. Here is a scatter plot. Experiment with different lines to fit the data. Pick the line that you think best fits the data. Compare it with a partner’s.
    2. Here is a different scatter plot. Experiment with drawing lines to fit the data. Pick the line that you think best fits the data. Compare it with a partner’s.
    3. In your own words, describe what makes a line fit a data set well.

    Exercise \(\PageIndex{3}\): Good Fit Bad Fit

    The scatter plots both show the year and price for the same 17 used cars. However, each scatter plot shows a different model for the relationship between year and price.

    clipboard_e6b06cf2d2b92f7fb890a64098ce92e41.png
    Figure \(\PageIndex{2}\)
    1. Look at Diagram A.
      1. For how many cars does the model in Diagram A make a good prediction of its price?
      2. For how many cars does the model underestimate the price?
      3. For how many cars does it overestimate the price?
    2. Look at Diagram B.
      1. For how many cars does the model in Diagram B make a good prediction of its price?
      2. For how many cars does the model underestimate the price?
      3. For how many cars does it overestimate the price?
    3. For how many cars does the prediction made by the model in Diagram A differ from the actual price by more than $3,000? What about the model in Diagram B?
    4. Which model does a better job of predicting the price of a used car from its year?
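
    The counting in these questions can also be sketched in code. The Python example below is only an illustration: the (year, price) pairs and the linear model are invented and are not read from Diagram A or Diagram B, and the names `cars`, `model_price`, `classify`, and the $1,000 tolerance for a "good" prediction are assumptions made for this sketch.

```python
# Hypothetical data: (year, actual price in dollars). These values are
# invented for illustration and are NOT taken from Diagram A or Diagram B.
cars = [(2001, 2000), (2003, 8000), (2005, 8500), (2007, 15000), (2009, 12000)]


def model_price(year):
    """Hypothetical linear model: $1,000 in 2000, rising $1,500 per year."""
    return 1500 * (year - 2000) + 1000


def classify(cars, model, tolerance=1000):
    """Count good predictions, underestimates, and overestimates.

    A prediction counts as "good" when it is within `tolerance` dollars
    of the actual price (the tolerance is an assumed cutoff).
    """
    counts = {"good": 0, "under": 0, "over": 0}
    for year, actual in cars:
        error = model(year) - actual  # positive means the model overestimates
        if abs(error) <= tolerance:
            counts["good"] += 1
        elif error < 0:
            counts["under"] += 1
        else:
            counts["over"] += 1
    return counts


counts = classify(cars, model_price)

# Number of cars whose predicted price misses the actual price by more than $3,000.
far_off = sum(1 for year, actual in cars if abs(model_price(year) - actual) > 3000)

print(counts, far_off)
```

    Changing `model_price` to a different line and rerunning the counts is one way to compare two candidate models on the same data.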

    Exercise \(\PageIndex{4}\): Practice Fitting Lines

    1. Is this line a good fit for the data? Explain your reasoning.

    clipboard_e1caefb5f49a8c57465ebea879c21e666.png
    Figure \(\PageIndex{3}\)

    2. Draw a line that fits the data better.

    clipboard_eb9af66f11e94a3ff619b490130affb2f.png
    Figure \(\PageIndex{4}\)

    3. Is this line a good fit for the data? Explain your reasoning.

    clipboard_e81e2ac7c764d32a6f248c77b9105b0de.png
    Figure \(\PageIndex{5}\)

    4. Draw a line that fits the data better.

    clipboard_e9fb104cc0eec3fa826b4fed60d157c2d.png
    Figure \(\PageIndex{6}\)

    Are you ready for more?

    clipboard_e7235146a745fd316b24ba286b9e5b422.png
    Figure \(\PageIndex{7}\)
    clipboard_e603119b10bf2d28f7781347d716ca95f.png
    Figure \(\PageIndex{8}\)
    clipboard_e0f30f6242b0cdcdd081ac36651660795.png
    Figure \(\PageIndex{9}\)

    These scatter plots were created by multiplying the \(x\)-coordinate by 3, then adding a random number between two values to get the \(y\)-coordinate. The first scatter plot added a random number between -0.5 and 0.5 to the \(y\)-coordinate. The second scatter plot added a random number between -2 and 2 to the \(y\)-coordinate. The third scatter plot added a random number between -10 and 10 to the \(y\)-coordinate.

    1. For each scatter plot, draw a line that fits the data.
    2. Explain why some were easier to do than others.
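
    The construction described above can be imitated in a few lines of Python. This is a sketch under assumptions: the \(x\)-values 0 through 19, the fixed random seed, and the helper names `make_points` and `fit_line` are all invented here. The least-squares formula is a stand-in for drawing a line by eye; it is not the method this lesson asks for.

```python
import random


def make_points(spread, n=20, seed=0):
    """Return points (x, 3x + noise), with noise uniform in [-spread, spread].

    The x-values 0..n-1 and the seed are assumptions for this sketch.
    """
    rng = random.Random(seed)
    return [(x, 3 * x + rng.uniform(-spread, spread)) for x in range(n)]


def fit_line(points):
    """Least-squares slope and intercept (assumed method; the lesson fits by eye)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


for spread in (0.5, 2, 10):
    points = make_points(spread)
    slope, intercept = fit_line(points)
    # Largest vertical distance from any point to the fitted line.
    worst = max(abs(y - (slope * x + intercept)) for x, y in points)
    print(f"spread {spread}: slope {slope:.2f}, "
          f"intercept {intercept:.2f}, worst miss {worst:.2f}")
```

    Comparing the printed slope and worst miss for each spread is one way to explore the second question.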

    Summary

    When a linear function fits data well, we say there is a linear association between the variables. For example, the scatter plot shows the relationship between height and weight for 25 dogs, together with the graph of a linear function that models it.

    clipboard_e569c9d15b105db4aa58213a8d2395313.png
    Figure \(\PageIndex{10}\)

    Because the model fits the data well and because the slope of the line is positive, we say that there is a positive association between dog height and dog weight.

    What do you think the association between the weight of a car and its fuel efficiency is?

    clipboard_ebf333e9f0f74261234e71436035d02a7.png
    Figure \(\PageIndex{11}\)

    Because the slope of a line that fits the data well is negative, we say that there is a negative association between the fuel efficiency and weight of a car.
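
    One way to make this idea concrete is to compute the slope of a fitted line and read off its sign. The Python sketch below is illustrative only: the data values are invented and are not taken from the figures, the function names `fitted_slope` and `association` are hypothetical, and least squares is assumed as the fitting method, which this lesson does not specify. The sketch also does not check whether the line fits the data well, which should be judged from the scatter plot before describing an association as linear.

```python
def fitted_slope(xs, ys):
    """Slope of the least-squares line through the points (assumed method)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def association(xs, ys):
    """Label the association by the sign of the fitted slope."""
    slope = fitted_slope(xs, ys)
    if slope > 0:
        return "positive"
    if slope < 0:
        return "negative"
    return "neither"


# Invented example data, NOT read from the figures in this lesson.
dog_heights = [10, 15, 20, 25, 30]            # inches
dog_weights = [12, 30, 45, 70, 95]            # pounds
car_weights = [1000, 1250, 1500, 1750, 2000]  # kilograms
car_mpg = [38, 33, 30, 25, 21]                # miles per gallon

print(association(dog_heights, dog_weights))
print(association(car_weights, car_mpg))
```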

    Glossary Entries

    Definition: Negative Association

    A negative association is a relationship between two quantities where one tends to decrease as the other increases. In a scatter plot, the data points tend to cluster around a line with negative slope.

    Different stores across the country sell a book for different prices.

    The scatter plot shows that there is a negative association between the price of the book in dollars and the number of books sold at that price.

    clipboard_eab65766890cf1492613cd690560b3992.png
    Figure \(\PageIndex{12}\)

    Definition: Outlier

    An outlier is a data value that is far from the other values in the data set.

    Here is a scatter plot that shows lengths and widths of 20 different left feet. The foot whose length is 24.5 cm and width is 7.8 cm is an outlier.

    clipboard_e8c41b607f7acd2fad49b581c0621199d.png
    Figure \(\PageIndex{13}\)

    Definition: Positive Association

    A positive association is a relationship between two quantities where one tends to increase as the other increases. In a scatter plot, the data points tend to cluster around a line with positive slope.

    The relationship between height and weight for 25 dogs is shown in the scatter plot. There is a positive association between dog height and dog weight.

    clipboard_ee59595d412dd9cac9e902713a4d8844c.png
    Figure \(\PageIndex{14}\)

    Practice

    Exercise \(\PageIndex{5}\)

    1. Draw a line that you think is a good fit for this data. For this data, the inputs are the horizontal values, and the outputs are the vertical values.

    clipboard_e68126312d1f1010826c63ce1870b202e.png
    Figure \(\PageIndex{15}\)

    2. Use your line of fit to estimate what you would expect the output value to be when the input is 10.

    Exercise \(\PageIndex{6}\)

    Here is a scatter plot that shows the most popular videos in a 10-year span.

    clipboard_e033f70d2ca21a6d5240b9a1395bb6226.png
    Figure \(\PageIndex{16}\)
    1. Use the scatter plot to estimate the number of views for the most popular video in this 10-year span.
    2. Estimate when the 4th most popular video was released.

    (From Unit 6.2.1)

    Exercise \(\PageIndex{7}\)

    A recipe for bread calls for 1 teaspoon of yeast for every 2 cups of flour.

    1. Name two quantities in this situation that are in a functional relationship.
    2. Write an equation that represents the function.
    3. Draw the graph of the function. Label at least two points with input-output pairs.

    (From Unit 5.3.1)


    This page titled 6.2.3: Describing Trends in Scatter Plots is shared under a CC BY license and was authored, remixed, and/or curated by Illustrative Mathematics.
