πŸ““ SELF-STUDY WORKBOOK

Statistics & Data
Analysis

Mean Β· Median Β· Mode Β· Range Β· IQR Β· Variability
Box Plots Β· Stem-and-Leaf Β· Dot Plots

πŸ“š 20 Questions ⭐ Tricky Traps Included βœ… Instant Feedback
Progress
0 / 20 Score: 0
Β§1
Measures of Center

MEAN = add 'em all Γ· count  |  MEDIAN = middle value (sort first!)  |  MODE = most frequent
⚠️ Trap: outlier pulls the mean but NOT the median!

πŸ“ Quick Example

Data: 4, 7, 7, 9, 13  β†’  Mean = (4+7+7+9+13)Γ·5 = 8  |  Median = 7 (middle)  |  Mode = 7

⭐

Q1. A student scored 72, 85, 85, 90, 68 on five tests. What is the mean score?

πŸ’‘ Explanation

Add all scores: 72 + 85 + 85 + 90 + 68 = 400
Divide by 5: 400 Γ· 5 = 80 βœ“
Watch out! 85 is the MODE (appears twice), not the mean.


⭐⭐

Q2. The ages of 5 kids are 8, 10, 10, 11, 6. Which value is the median?

πŸ’‘ Explanation

First SORT: 6, 8, 10, 10, 11
With 5 values, the middle is the 3rd value β†’ 10 βœ“
⚠️ Always sort BEFORE finding the median!


⭐⭐⭐

Q3. A class has test scores: 70, 72, 74, 75, 76, 100. A new student joins with score 100. Which measure of center changes the most?

πŸ’‘ Explanation

100 is an outlier (far from the rest).
The mean is pulled strongly by outliers.
The median barely moves β€” it's resistant to outliers! βœ“

Β§2
Range & Interquartile Range (IQR)

RANGE = Max βˆ’ Min (measures total spread)
IQR = Q3 βˆ’ Q1 (middle 50% spread β€” ignores outliers!)
Q1 = median of lower half  |  Q3 = median of upper half
⚠️ Trap: do NOT include the median when finding Q1 & Q3 (odd data set)

πŸ“ Quick Example

Data (sorted): 2, 5, 7, 9, 12, 15, 18
Median = 9  β†’  Lower half: 2, 5, 7 β†’ Q1 = 5
Upper half: 12, 15, 18 β†’ Q3 = 15
IQR = 15 βˆ’ 5 = 10  |  Range = 18 βˆ’ 2 = 16

⭐

Q4. The heights (cm) of 7 students are: 142, 155, 158, 163, 167, 171, 180. What is the range?

πŸ’‘ Explanation

Range = Max βˆ’ Min = 180 βˆ’ 142 = 38 cm βœ“


⭐⭐⭐

Q5. Using the same data: 142, 155, 158, 163, 167, 171, 180, what is the IQR?

πŸ’‘ Explanation

Median = 163 (4th value)
Lower half: 142, 155, 158 β†’ Q1 = 155
Upper half: 167, 171, 180 β†’ Q3 = 171
IQR = 171 βˆ’ 155 = 16 βœ“


⭐⭐

Q6. A teacher says "I want to describe the spread of the middle half of students' scores, ignoring the very high and very low scores." Which measure should she use?

πŸ’‘ Explanation

The IQR describes the middle 50% of data.
It is resistant to outliers β€” perfect for ignoring extremes! βœ“
Range includes all values (affected by outliers).

Β§3
Box Plots (Box-and-Whisker)

A box plot shows: Min β†’ Q1 β†’ Median β†’ Q3 β†’ Max
Box = Q1 to Q3 (IQR)  |  Whiskers = to min & max
⚠️ Trap: the box does NOT have to be centered β€” skewed data makes it lopsided!

πŸ“ Box Plot Visual
Min
10
Q1
25
Med
35
Q3
50
Max
70
⭐

Q7. From the box plot above (Min=10, Q1=25, Median=35, Q3=50, Max=70), what is the IQR?

πŸ’‘ Explanation

IQR = Q3 βˆ’ Q1 = 50 βˆ’ 25 = 25 βœ“
The box itself visually represents the IQR!


⭐⭐⭐

Q8. A box plot has: Min=5, Q1=10, Median=12, Q3=20, Max=45. Which statement is BEST supported?

πŸ’‘ Explanation

Max=45 is FAR from Q3=20 β€” this is an outlier!
The long right whisker shows the data is skewed right.
The median (12) is closer to Q1 than Q3, confirming right skew. βœ“


⭐⭐

Q9. Two box plots show quiz scores for Class A and Class B. Both have the same median (75). Class A's IQR = 8, Class B's IQR = 22. What can you conclude?

πŸ’‘ Explanation

Same median β†’ same center. But IQR measures spread.
Class A: IQR = 8 β†’ scores are clustered closely together.
Class B: IQR = 22 β†’ scores are more spread out. βœ“
Larger IQR = more variability!

Β§4
Stem-and-Leaf & Dot Plots

STEM = tens digit  |  LEAF = ones digit
To find median from stem-and-leaf: count from both ends inward
DOT PLOT = each dot = one data value above a number line

⭐⭐

Q10. Use this stem-and-leaf plot:

4
2 5 8 5
0 3 3 7 6
1 4

Key: 4|2 = 42

What is the median?

πŸ’‘ Explanation

List all values: 42, 45, 48, 50, 53, 53, 57, 61, 64 β†’ 9 values
Median = 5th value = 53 βœ“


⭐

Q11. The dot plot below shows the number of books read by 8 students:

1
2
3
4
5

What is the mode?

πŸ’‘ Explanation

Mode = most frequent value = tallest stack of dots.
The value 3 has 3 dots β€” the most! βœ“

Β§5
Measures of Variability

VARIABILITY = how spread out data is
RANGE & IQR both measure variability (spread)
MEAN & MEDIAN measure center β€” NOT variability!
⚠️ Trap: "How close together" = variability (small = close, large = spread)

⭐⭐

Q12. Which question could be answered using a measure of variability?

πŸ’‘ Explanation

Variability answers "how spread out?" questions.
"Difference between tallest and shortest" β†’ Range β†’ variability βœ“
"Average" β†’ mean (center). "Most common" β†’ mode (center).


⭐⭐⭐

Q13. A coach wants to know how close together all runners' 100m times were. Which measure should the coach use?

πŸ’‘ Explanation

"How close together" = how small the spread is β†’ use a measure of variability.
Both Range and IQR work, but IQR is better as it ignores outliers.
βœ“ IQR tells us how bunched the middle half of runners are.


⭐⭐

Q14. Dataset A: 10, 10, 10, 10, 10  |  Dataset B: 2, 6, 10, 14, 18. Compare their ranges.

πŸ’‘ Explanation

Dataset A: Range = 10 βˆ’ 10 = 0 (no spread β€” all same!)
Dataset B: Range = 18 βˆ’ 2 = 16
A has ZERO variability; B has much more spread. βœ“

Β§6
Choosing the Right Measure

Use MEAN when: data is symmetric, no outliers
Use MEDIAN when: data has outliers or is skewed
Use MODE when: data is categorical (colors, names, etc.)
⚠️ Trap: real estate prices β†’ use MEDIAN (billionaires skew mean up!)

⭐⭐⭐

Q15. 5 people's salaries: $30K, $32K, $34K, $35K, $200K. A journalist wants to report the "typical" salary. Which is BEST?

πŸ’‘ Explanation

Mean = (30+32+34+35+200)Γ·5 = $66.2K β€” pulled up by the outlier!
Median = middle value = $34K β€” much more "typical" βœ“
With outliers, MEDIAN represents "typical" better.


⭐⭐

Q16. Survey: "What is your favorite sport?" Results: SoccerΓ—8, BasketballΓ—5, TennisΓ—3, SwimmingΓ—2. Which measure of center is appropriate?

πŸ’‘ Explanation

This is categorical data (sport names β€” not numbers!).
You can't find a mean or median of categories.
Use MODE β†’ most frequent = Soccer βœ“


⭐⭐⭐

Q17. Adding a very large outlier to a dataset will:

πŸ’‘ Explanation

A large outlier increases the mean significantly.
The median barely changes (shifts by 1 position at most).
The range increases (new max is the outlier).
Key: Mean is sensitive; Median is resistant. βœ“

Β§7
Mixed Practice β€” Tricky Questions!

πŸ“Š 5-Number Summary: Min, Q1, Median, Q3, Max
Each "section" of a box plot contains 25% of data.
⚠️ Even if a section LOOKS wider, it still holds 25% of the data!

⭐⭐⭐

Q18. A box plot has Min=10, Q1=15, Median=20, Q3=30, Max=60. What percent of data falls between Q1 and Q3?

πŸ’‘ Explanation

Q1 to Q3 = the BOX = the middle 50% of all data βœ“
Q1 to Median = 25%  |  Median to Q3 = 25%
Total: 25% + 25% = 50%


⭐⭐⭐

Q19. Dataset: 3, 5, 7, 9, 11, 13 (6 values). What is the median?

πŸ’‘ Explanation

6 values (even number) β†’ median = average of 3rd and 4th values.
3rd value = 7, 4th value = 9
Median = (7 + 9) Γ· 2 = 8 βœ“
⚠️ Even number of values: AVERAGE the two middle values!


⭐⭐⭐

Q20. The mean of 4 numbers is 15. Three of the numbers are 10, 14, and 18. What is the 4th number?

πŸ’‘ Explanation

Mean = Sum Γ· Count β†’ Sum = Mean Γ— Count = 15 Γ— 4 = 60
Known sum: 10 + 14 + 18 = 42
4th number = 60 βˆ’ 42 = 18 βœ“