GMAT Statistics: Mean, Median, Mode, and Standard Deviation
Quick Takeaways
- Outliers: Pull the Mean (avg) but don't touch the Median (middle).
- Standard Deviation: Don't calculate! Just know it means 'spread'.
- Weighted Avg: Use 'Tug-of-War' visual (closer to heavy weight).
- Range: Max - Min (Simplest measure of spread).
- Relation: Mean = Median? Distribution is symmetrical.
Statistics: More Than Just Numbers
The GMAT tests statistics not because business school requires you to be a mathematician, but because it requires you to be a smart interpreter of data. You can expect to see 2-3 statistics-based questions on the Quant section, covering concepts from basic averages to the more complex idea of standard deviation. The key to success is understanding what these measures represent conceptually, not just how to calculate them.
Measures of Center: Mean, Median, and Mode
These three measures describe the 'center' of a data set in different ways.
Mean (or Average)
The mean is simply the sum of all the values divided by the number of values. The key formula to know is Sum = Mean × Number of values. The GMAT often tests your ability to manipulate this formula to find a missing value or the sum of a set.
Median
The median is the middle value in a data set when it is ordered from least to greatest. If there's an even number of values, the median is the average of the two middle values. The most important property of the median is that it is not sensitive to outliers (extremely high or low values).
Mode
The mode is the value that appears most frequently in a data set. A set can have one mode, more than one mode, or no mode at all. This is the least commonly tested of the three measures.
Measures of Spread: Range and Standard Deviation
These measures tell you how spread out or dispersed the data points are.
Range
The range is the simplest measure of spread: Range = Highest Value - Lowest Value.
Standard Deviation (SD)
Standard deviation is a more sophisticated measure of how spread out the data is from the mean. A low SD means the data points are clustered tightly around the mean, while a high SD means they are spread far apart. You will never have to calculate the actual standard deviation on the GMAT. Instead, you need to understand it conceptually:
- If all the numbers in a set are the same, the SD is 0.
- Adding or subtracting the same number to every value in a set does not change the SD.
- Multiplying every value in a set by the same number does change the SD by that same factor.
The GMAT's Favorite Trick: Weighted Averages
Weighted average problems are a GMAT staple. They occur when you need to find the average of two or more subgroups that have different sizes or 'weights'. For example, if you have the average score for men and the average score for women in a class, the overall average will be 'pulled' closer to the average of the larger group. A number line or a 'tug-of-war' approach can be a powerful visual tool for solving these problems. On OpenPrep, weighted average questions are tagged separately from simple mean questions — useful because the error patterns differ significantly. Students who confuse simple and weighted averages almost always make the same type of error (ignoring group sizes), which a targeted drill can fix in one or two focused sessions.
Common Statistics Traps to Avoid
- Confusing Mean and Median: The GMAT loves to create scenarios where an outlier pulls the mean in one direction while the median stays the same. Always ask yourself if extreme values are in play.
- Forgetting to Order the Set for Median: You cannot find the median until you arrange the data set in ascending or descending order. This is a simple but common mistake.
- Assuming a Simple Average: When you see the word 'average,' especially in a problem with different groups, your first thought should be: 'Is this a weighted average?'
Worked examples: statistics in GMAT-style questions
These examples cover the four statistics question types most commonly tested on the GMAT Focus Edition, with the specific traps each one is designed to trigger.
Example 1: Mean manipulation — using Sum = Mean × Count
Question: The average (mean) score of 5 students on a test is 82. When a 6th student takes the test, the average rises to 84. What was the 6th student's score?
Solution: Sum = Mean × Count. Sum of original 5 students = 82 × 5 = 410. Sum of all 6 students = 84 × 6 = 504. 6th student's score = 504 − 410 = 94. Memorise: Sum = Mean × Count. Every GMAT mean question is solved by working with sums, not averages.
Example 2: Median with an even set — order matters
Question: Set S = {3, 7, 2, 9, 5, 1}. What is the median of S?
Solution: Step 1 — Order the set: {1, 2, 3, 5, 7, 9}. Step 2 — With 6 values (even count), the median is the average of the 3rd and 4th values: (3 + 5) / 2 = 4. Common error: forgetting to order the set first, then averaging the 3rd and 4th values of the unordered set (which would give (2 + 9) / 2 = 5.5 — wrong).
Example 3: Weighted average — where the result "lives"
Question: Class A has 20 students with an average score of 70. Class B has 30 students with an average score of 80. What is the combined average score?
Wrong approach: (70 + 80) / 2 = 75. This ignores the different class sizes. Correct approach: Total score = (20 × 70) + (30 × 80) = 1,400 + 2,400 = 3,800. Total students = 50. Combined average = 3,800 / 50 = 76. Notice that 76 is closer to 80 than to 70 — the larger class (B, 30 students) pulls the average toward its mean. The "tug-of-war" visual: the combined average always sits between the two group averages, pulled toward the heavier group.
Example 4: Standard deviation — conceptual, no calculation
Question: Set P = {2, 4, 6, 8, 10}. If every element is multiplied by 3, which of the following changes? (A) Mean (B) Median (C) Standard deviation (D) Range (E) All of the above
Solution: When every element is multiplied by the same constant (3): Mean changes (6, 12, 18, 24, 30 → new mean = 18 vs old mean = 6). Median changes (6 → 18). Range changes (was 10−2=8, now 30−6=24). Standard deviation changes (the spread scales by 3 — SD triples). Answer: (E). The key rule: adding the same constant to every element does NOT change SD or Range; multiplying every element by a constant changes SD, Range, Mean, and Median proportionally.