28 Mean, Median, Mode
You may use a calculator throughout this module.
We often describe data using a measure of central tendency. This is a number that we use to describe the typical data value. In this module, we will look at three measures of central tendency: the mean, the median, and the mode. Each of these has pros and cons, depending on the particular data set.
Mean
The mean of a set of data is what we commonly call the average: add up all of the numbers and then divide by how many numbers there were.
Exercises
- The table below shows the amount of time, rounded to the nearest half minute, it took Marty to complete the Friday crossword puzzle in the New York Times. Calculate the mean completion time for these thirteen puzzles.
Oct 6, 2023 Oct 13, 2023 Oct 20, 2023 Oct 27, 2023 Nov 3, 2023 Nov 10, 2023 Nov 17, 2023 Nov 24, 2023 Dec 1, 2023 Dec 8, 2023 Dec 15, 2023 Dec 22, 2023 Dec 29, 2023 - The table below shows the average price of a gallon of regular unleaded gasoline in the Seattle metro area for ten weeks in late 2023.[1] Compute the mean price over this time period.
Oct 23, 2023 Oct 30, 2023 Nov 6, 2023 Nov 13, 2023 Nov 20, 2023 Nov 27, 2023 Dec 4, 2023 Dec 11, 2023 Dec 18, 2023 Dec 25, 2023
Median
The median is the middle number in a set of data; it has an equal number of data values below it as above it. The numbers must be arranged in order, usually smallest to largest but largest to smallest would also work. Then we can count in from both ends of the list and find the median in the middle.
If there are an odd number of data values, there will be one number in the middle, which is the median.
If there are an even number of data values, there will be two numbers in the middle. The mean of these two numbers is the median.
Exercises
- Here are Marty’s Friday crossword puzzle completion times again, in minutes, listed in order from fastest to slowest. What is the median completion time?
- Here are the Seattle gas prices again, listed in order from lowest to highest. What is the median price? , , , , , , , , ,
The five houses on a block have these property values: ; ; ; ; .
- Find the mean property value.
- Find the median property value.
A new house is built on the block, making the property values ; ; ; ; ; .
- Find the mean property value.
- Find the median property value.
- Which of these measures appears to give a more accurate representation of the typical house on the block?
The mean is better to work with when we do more complicated statistical analysis, but it is sensitive to extreme values; in other words, one very large or very small number can have a significant effect on the mean. The median is not sensitive to extreme values, which can make it a better measure to use when describing data that has one or two numbers very different from the remainder of the data.
Mode
The mode is the value that appears most frequently in the data set. On the game show Family Feud, the goal is to guess the mode: the most popular answer.
If no numbers are repeated, then the data set has no mode. If there are two values that are tied for most frequently occurring, then they are both considered a mode and the data set is called bimodal. If there are more than two values tied for the lead, we usually say that there is no mode.[2] (It’s like in sports: there is usually one MVP, but occasionally there are two co-MVPs. Having three or more MVPs would start to get ridiculous.)
Exercises
- Here are Marty’s Friday crossword puzzle completion times one last time, in minutes. What is the mode of the completion times?
- Here are the Seattle gas prices one last time. What is the mode of the prices? , , , , , , , , ,
- One hundred cell phone owners are asked which carrier they use. What is the mode of the data?
AT&T Mobility Verizon Wireless T-Mobile US Dish Wireless U.S. Cellular - Fifty people are asked what their favorite type of Girl Scout cookie is. What is the mode?
S’Mores Samoas Tagalongs Trefoils Thin Mints
Let’s put it all together and find the mean, median, and mode of some data sets. Sportsball!
Exercises
, , , , , , , , , , , , , , , , , , .
- Find the mean number of games won from 2001 to 2019.
- Find the median number of games won from 2001 to 2019.
- Find the mode of the number of games won from 2001 to 2019.
- Do any of these measures appear to be misleading, or do they all represent the data fairly well?
From 2001-2019, these are the numbers of games won by the Buffalo Bills each NFL season.[4]
, , , , , , , , , , , , , , , , , , .
- Find the mean number of games won from 2001 to 2019.
- Find the median number of games won from 2001 to 2019.
- Find the mode of the number of games won from 2001 to 2019.
- Do any of these measures appear to be misleading, or do they all represent the data fairly well?
Some sets of data may not be easy to describe with one measure of central tendency.
Exercises
Thirteen clementines are weighed. Their masses, in grams, are
, , , , , , , , , , , , .
- Determine the mean. Does the mean appear to represent the mass of a typical clementine?
- Determine the median. Does the median appear to represent the mass of a typical clementine?
- Determine the mode. Does the mode appear to represent the mass of a typical clementine?
Suppose that the -gram clementine is a tiny bit heavier and the masses are actually
, , , , , , , , , , , , .
- Determine the new mean. Is the new mean different from the original mean?
- Determine the new median. Is the new median different from the original median?
- Determine the new mode. Is the new mode different from the original mode? Does it represent the mass of a typical clementine?
- Source: https://www.eia.gov/petroleum/gasdiesel/ ↵
- The concepts of trimodal and multimodal data exist, but we aren't going to consider anything beyond bimodal in this textbook. ↵
- Source: https://www.pro-football-reference.com/teams/nwe/index.htm ↵
- Source: https://www.pro-football-reference.com/teams/buf/index.htm ↵