Skip to main content
Mathematics LibreTexts

9.2: Measures of Central Tendency

  • Page ID
    139299
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    The "Center" is an important aspect of a distribution. When we refer to the "Center" we are trying to get an idea of the location of the middle of the data set. There are three common measures of central tendency. They are the mean, median, and mode.

    Let's begin by trying to find the most "typical" value of a data set.

    Note that we just used the word "typical" although in many cases you might think of using the word "average." We need to be careful with the word "average" as it means different things to different people in different contexts. One of the most common uses of the word "average" is what mathematicians and statisticians call the arithmetic mean, or just plain old mean for short. "Arithmetic mean" sounds rather fancy, but you have likely calculated a mean many times without realizing it; the mean is what most people think of when they use the word "average".

    Mean

    The mean of a set of data is the sum of the data values divided by the number of values.

    Example \(\PageIndex{1}\)

    Marci’s exam scores for her last math class were: 79, 86, 82, 94. The mean of these
    values would be:

    Solution

    \(\dfrac{79+86+82+94}{4}=85.25\). Typically we round means to one more decimal place than the original data had. In this case, we would round 85.25 to 85.3 .

    Example \(\PageIndex{2}\)

    The number of touchdown (TD) passes thrown by each of the 31 teams in the National Football League in the 2000 season are shown below.

    37

    33

    33

    32

    29

    28

    28

    23

    22

    22

    22

    21

    21

    21

    20

     

    20

    19

    19

    18

    18

    18

    18

    16

    15

    14

    14

    14

    12

    12

    9

    6

    Solution

    Adding these values, we get 634 total TDs. Dividing by 31, the number of data values, we get 634/31 = 20.4516. It would be appropriate to round this to 20.5.

    It would be most correct for us to report that “The mean number of touchdown passes thrown in the NFL in the 2000 season was 20.5 passes,” but it is not uncommon to see the more casual word “average” used in place of “mean.”

    You Try It \(\PageIndex{1}\)

    The price of a jar of peanut butter at 5 stores was: $3.29, $3.59, $3.79, $3.75, and $3.99. Find the mean price.

    Answer

    The range of the data is $0.70.

    Example \(\PageIndex{3}\)

    The one hundred families in a particular neighborhood are asked their annual household income, to the nearest $5 thousand dollars. The results are summarized in a frequency table below.

    Calculating the mean by hand could get tricky if we try to type in all 100 values:
    \[
    \dfrac{\overbrace{15+\cdots+15}^{6 \text { temms }}+\overbrace{20+\cdots+20}^{8 \text { tems }}+\overbrace{25+\cdots+25}^{11 \text { tems }}+\cdots}{100}
    \]
    We could calculate this more easily by noticing that adding 15 to itself six times is the same as \(15 \cdot 6=90\). Using this simplification, we get
    \[
    \dfrac{15 \cdot 6+20 \cdot 8+25 \cdot 11+30 \cdot 17+35 \cdot 19+40 \cdot 20+45 \cdot 12+50 \cdot 7}{100}=\dfrac{3390}{100}=33.9
    \]
    The mean household income of our sample is 33.9 thousand dollars \((\$ 33,900)\).

    Example \(\PageIndex{4}\)

    Extending off the last example, suppose a new family moves into the neighborhood example that has a household income of \(\$ 5\) million ( \(\$ 5000\) thousand).

    Income (thousands of dollars)

    Frequency

    15

    6

    20

    8

    25

    11

    30

    17

    35

    19

    40

    20

    45

    12

    50

    7

    Solution

    Adding this to our sample, our mean is now:
    \[
    \dfrac{15 \cdot 6+20 \cdot 8+25 \cdot 11+30 \cdot 17+35 \cdot 19+40 \cdot 20+45 \cdot 12+50 \cdot 7+5000 \cdot 1}{101}=\dfrac{8390}{101}=83.069
    \]
    While 83.1 thousand dollars \((\$ 83,069)\) is the correct mean household income, it no longer represents a "typical" value.

    Imagine the data values on a see-saw or balance scale. The mean is the value that keeps the data in balance, like in the picture below.

    clipboard_e435bab3605acb8659f427aff0d93fa37.png

    If we graph our household data, the $5 million data value is so far out to the right that the mean has to adjust up to keep things in balance

    clipboard_e2988dfd888848b3e4a2e9197b8c7e605.png

    For this reason, when working with data that have outliers – as described in the previous section - it is common to use a different measure of center, the median.

    Median

    The median of a set of data is the value in the middle when the data is in order

    To find the median, begin by listing the data in order from smallest to largest.

    If the number of data values, n, is odd, then the median is the middle data value. This position in the ordered list of this value can be found by rounding n/2 up to the next whole number.

    If the number of data values is even, there is no one middle value, so we find the mean of the two middle values. The two middle numbers will be in the n/2 and the (n/2) + 1 positions in the ordered list.

    Example \(\PageIndex{5}\)

    Returning to the football touchdown data, we would start by listing the data in order from largest to smallest (we could have listed them from smallest to largest, it makes no difference to the answer)

    Solution

    37

    33

    33

    32

    29

    28

    28

    23

    22

    22

    22

    21

    21

    21

    20

     

    20

    19

    19

    18

    18

    18

    18

    16

    15

    14

    14

    14

    12

    12

    9

    6

    Solution

    Since there are 31 data values, an odd number, the median will be the middle number, the 16th data value (31/2 = 15.5, round up to 16, leaving 15 values below and 15 above). The 16th data value is 20, so the median number of touchdown passes in the 2000 season was 20 passes. Notice that for this data, the median is fairly close to the mean we calculated earlier, 20.5.

    Example \(\PageIndex{6}\)

    Find the median of these quiz scores: 15 20 18 16 14 18 12 15 17 17.

    Solution

    We start by listing the data in order: 12 14 15 15 16 17 17 18 18 20

    Since there are 10 data values, an even number, there is no one middle number. The median will be the mean of the numbers in positions n/2 and (n/2) + 1.

    Since n = 10, n/2 = 5 and (n/2) + 1 = 6, so we need the mean of the 5th and 6th numbers in the ordered list.

    The 5th number in the ordered list is 16, and the 6th number in the ordered list is 17. So we find the mean of the two middle numbers, 16 and 17, and get (16+17)/2 = 16.5.

    The median quiz score was 16.5.

    You Try It \(\PageIndex{2}\)

    The price of a jar of peanut butter at 5 stores was: $3.29, $3.59, $3.79, $3.75, and $3.99. Find the median price.

    Answer

    First we put the data in order: $3.29, $3.59, $3.75, $3.79, $3.99. Since there are an odd number of data, the median will be the middle value, $3.75.

    Example \(\PageIndex{7}\)

    Let us return now to our original household income data

    Income (thousands of dollars)

    Frequency

    15

    6

    20

    8

    25

    11

    30

    17

    35

    19

    40

    20

    45

    12

    50

    7

    Solution
    There are 6 data values of $15, so → Values 1 to 6 are $15 thousand
    The next 8 data values are $20, so → Values 7 to (6+8)=14 are $20 thousand
    The next 11 data values are $25, so → Values 15 to (14+11)=25 are $25 thousand
    The next 17 data values are $30, so → Values 26 to (25+17)=42 are $30 thousand
    The next 19 data values are $35, so → Values 43 to (42+19)=61 are $35 thousand

    From this we can tell that values 50 and 51 will be $35 thousand, and the mean of these two values is $35 thousand. The median income in this neighborhood is $35 thousand.

    Example \(\PageIndex{8}\)

    If we add in the new neighbor with a $5 million household income, then there will be 101 data values, and the 51st value will be the median. As we discovered in the last example, the 51st value is $35 thousand. Notice that the new neighbor did not affect the median in this case. The median is not swayed as much by outliers as the mean is.

    In addition to the mean and the median, there is one other common measurement of the "typical" value of a data set: the mode.

    Mode

    The mode is the element of the data set that occurs most frequently.

    The mode is fairly useless with data like weights or heights where there are a large number of possible values. The mode is most commonly used for categorical data, for which median and mean cannot be computed.

    Example \(\PageIndex{9}\)

    In our vehicle color survey, we collected the data

    Color

    Frequency

    Blue

    3

    Green

    5

    Red

    4

    White

    3

    Black

    2

    Grey

    3

    Solution

    For this data, Green is the mode, since it is the data value that occurred the most frequently.

    It is possible for a data set to have more than one mode if several categories have the same frequency, or no modes if each every category occurs only once.

    You Try It \(\PageIndex{3}\)

    Reviewers were asked to rate a product on a scale of 1 to 5. Find

    Rating

    Frequency

    1

    4

    2

    8

    3

    7

    4

    3

    5

    1

    a. The mean rating
    b. The median rating
    c. The mode rating

    Answer

    There are 23 ratings.

    1. The mean is \(\frac{1 \cdot 4+2 \cdot 8+3 \cdot 7+4 \cdot 3+5 \cdot 1}{23} \approx 2.5\)
    2. There are 23 data values, so the median will be the \(12^{\text {th }}\) data value. Ratings of 1 are the first 4 values, while a rating of 2 are the next 8 values, so the \(12^{\text {th }}\) value will be a rating of 2. The median is 2 .
    3. The mode is the most frequent rating. The mode rating is 2 .

    9.2: Measures of Central Tendency is shared under a CC BY-NC-SA license and was authored, remixed, and/or curated by LibreTexts.

    • Was this article helpful?