US Grade 11 · Statistics

Master Stats.

Mean · Median · Box Plot · Standard Deviation · Normal Distribution · z-Scores
Questions0 / 20
Correct0
0 / 20
Section 1 · Mean & Median
Q·01
Beginner
MEAN = sum ÷ count · MEDIAN = middle value (sorted)
Core Concept
Mean (average): Add all values, divide by the number of values.
Median: Sort the data. If odd number of values → middle one. If even → average of two middle values.
Key difference: The mean is affected by outliers; the median is resistant to them.
A student recorded the number of hours spent studying each day for one week:
2, 5, 3, 8, 4, 3, 6

What is the mean number of study hours per day?
Explanation
Add all values: 2 + 5 + 3 + 8 + 4 + 3 + 6 = 31
Divide by the number of values (7):
Mean = 31 ÷ 7 ≈ 4.43 hours Why not C (4)? — 4 is the median, not the mean. Sorted: 2, 3, 3, 4, 5, 6, 8 → middle value = 4.
Always distinguish: mean = arithmetic average, median = positional middle.
🔄 Try This
What is the median of the same data set? What happens to the mean if the value "8" is replaced by "80"? Does the median change?
Q·02
Beginner
OUTLIER shifts mean, NOT median → median is "resistant"
A small company has 5 employees with annual salaries (in thousands of dollars):
42, 45, 43, 47, 210

Which statement best describes the relationship between the mean and median salary?
Explanation
Median: Sorted: 42, 43, 45, 47, 210 → middle = $45K
Mean: (42+43+45+47+210) ÷ 5 = 387 ÷ 5 = $77.4K

The outlier ($210K) pulls the mean toward it but does NOT move the median. This is why:
→ Skewed-right data: mean > median
→ The median is called a resistant measure — it doesn't care about extreme values.
🔄 Try This
If a 6th employee earning $44K joins, what is the new median? (Remember: even number of values → average of two middle values.)
Q·03
Medium
SKEW: right-skew → mean > median · left-skew → mean < median
A teacher reports that the median test score in her class is 78, but the mean is 69.

What does this tell us about the shape of the distribution of test scores?
Explanation
The key rule: mean < median → left skew (negatively skewed).

Here mean (69) < median (78), so the tail pulls to the left. This means a few students scored very low, dragging the mean down below the median.

Memory trick: The mean "chases the tail."
→ Tail on right → mean is pulled right → mean > median → right skew
→ Tail on left → mean is pulled left → mean < median → left skew
🔄 Try This
If a data set has mean = 50 and median = 50, what shape would you expect? If 5 more high-scoring outliers were added, which direction would the mean move? Would the median move much?
Section 2 · Box Plots & Five-Number Summary
Q·04
Beginner
5-NUMBER: Min · Q1 · Median · Q3 · Max · IQR = Q3 − Q1
Box Plot Guide
Five-number summary: Minimum, Q1 (25th percentile), Median (Q2), Q3 (75th percentile), Maximum
IQR = Q3 − Q1  (the width of the box)
Box = middle 50% of data  |  Whiskers = extend to min & max (or 1.5×IQR fence)
Outlier rule: Any point below Q1 − 1.5·IQR or above Q3 + 1.5·IQR is an outlier.
A data set has the following five-number summary:
Min = 12  |  Q1 = 20  |  Median = 28  |  Q3 = 35  |  Max = 50

What is the IQR (Interquartile Range)?
Explanation
IQR = Q3 − Q1 = 35 − 20 = 15 Common mistakes:
→ A: That's the range (Max − Min), not IQR.
→ D: That's Median − Q1, which has no standard name.

The IQR tells us the spread of the middle 50% of the data. A larger IQR means the data is more spread out in the middle.
🔄 Try This
Using the outlier rule (1.5 × IQR fence), is the value 50 an outlier in this data set? Calculate: Upper fence = Q3 + 1.5 × IQR = ?
Q·05
Medium
BOX WIDTH = IQR · WHISKER LENGTH shows range · OUTLIER = dot beyond whisker
Study the box plot below, which shows the scores of 40 students on a math quiz:
0 10 20 30 40 50 60 70 80 90 100 Min=30 Q1=50 Med=65 Q3=75 Max=95
Based on this box plot, which statement is correct?
Explanation
A — Actually true! The box (Q1 to Q3) always contains the middle 50%. ✓ But wait — option C is the best and most insightful statement.

B — Wrong! Range = Max − Min = 95 − 30 = 65, not 25. (25 is the IQR = 75 − 50.)

C ✓ — The median (65) is closer to Q3 (75) than to Q1 (50). This means the upper half of the box is compressed → left skew (the longer whisker is on the left).

D — The median is the 50th percentile. 50% of students scored above 65, not 75%.
🔄 Try This
Calculate the outlier fences. Is a score of 5 an outlier? Lower fence = Q1 − 1.5 × IQR = 50 − 1.5 × 25 = ?
Q·06
Medium
PERCENTILE: Q1=25th · Q2=50th · Q3=75th · above Q3 = top 25%
A college's admissions office reports that the SAT scores of admitted students have:
Q1 = 1200  |  Median = 1350  |  Q3 = 1450

A student scored 1450 on the SAT. What is the most accurate interpretation?
Explanation
1450 = Q3, which is the 75th percentile. This means the student scored at or above approximately 75% of all admitted students.

A — Wrong. Only above Q3 doesn't mean above all — 25% of students scored higher.
B/C — Wrong. 50% describes the median (Q2 = 1350), not Q3.
D ✓ — Correct. Q3 = 75th percentile means 75% of the data falls at or below this value.
🔄 Try This
If you scored at Q1 (1200), what percentile is that? What percentage of admitted students scored higher than you?
Section 3 · Standard Deviation · Sample vs Population
Q·07
Beginner
σ = population (÷N) · s = sample (÷(n−1)) · bigger SD = more spread
Population vs Sample
Population (σ): You have data on EVERY member of the group. Divide by N.
Sample (s): You have data on only SOME members (a subset). Divide by (n − 1).
Why n−1? → Called Bessel's correction. Sample variance tends to underestimate population variance, so we use n−1 to correct this bias.
Rule of thumb: In most real-life studies, we use sample statistics (s).
A researcher studies the heights of all 8 students in a small class (not a sample — this IS the entire group of interest).

Which formula should be used to calculate the standard deviation?
Explanation
The key question is: Are these all the individuals you care about, or just a subset?

Here, the 8 students ARE the entire group of interest → use population SD (σ), divide by N.

B/D — Wrong. Use n−1 only when you are drawing conclusions about a larger population from a subset.
C — Wrong. They give different (close but not equal) results.

Memory trick: Population = P = σ (sigma) | Sample = S = s
🔄 Try This
A doctor surveys 50 patients from a hospital of 10,000 to study blood pressure. Should she use σ or s? Why?
Q·08
Medium
SD steps: find mean → subtract each → square → average → square root
A data set has values: 4, 7, 7, 10

The mean is 7. Calculate the population standard deviation (σ).
Value (x)Deviation (x − μ)(x − μ)²
44 − 7 = −39
77 − 7 = 00
77 − 7 = 00
1010 − 7 = 39
Sum18
Using the table above, which is the correct σ?
Explanation
Population variance: σ² = (sum of squared deviations) ÷ N = 18 ÷ 4 = 4.5
σ = √4.5 ≈ 2.12 A — Wrong. σ² is the variance; σ requires a square root.
C — Wrong. Dividing by n−1 = 3 gives the sample standard deviation (s ≈ 2.45), not σ.
D — Wrong. 4.5 is the variance (σ²), not the standard deviation (σ).

Always remember: Standard Deviation = √(Variance)
🔄 Try This
What would the sample standard deviation (s) be for the same data? Use n−1 = 3 in the denominator. Compare s vs σ — which is larger, and why?
Q·09
Medium
SD interpretation: 68% within 1σ · 95% within 2σ · 99.7% within 3σ
Word Problem: Two statistics classes each took the same exam. Both classes had a mean score of 75, but:
• Class A had a standard deviation of 3
• Class B had a standard deviation of 15

Which statement is the most accurate conclusion?
Explanation
Standard deviation measures spread/variability, not performance level.

Class A (SD = 3): most scores likely between 72 and 78. Consistent!
Class B (SD = 15): scores likely range from ~45 to ~105. Very spread out.

B — Wrong. Higher SD doesn't mean higher scores; the means are equal.
C — Misleading. Same mean ≠ same performance. The spread tells a very different story.
D — Wrong. SD has no relationship to the number of students (sample size = n).
🔄 Try This
Using the 68-95-99.7 rule for a normal distribution: In Class B (mean=75, SD=15), approximately what percentage of students scored between 45 and 105?
Section 4 · Normal Distribution
Q·10
Beginner
NORMAL: bell curve · symmetric · mean=median=mode · 68-95-99.7 rule
The Empirical Rule (68-95-99.7 Rule)
In a normal distribution with mean μ and standard deviation σ:
• About 68% of data falls within 1σ of the mean (μ ± 1σ)
• About 95% of data falls within 2σ of the mean (μ ± 2σ)
• About 99.7% of data falls within 3σ of the mean (μ ± 3σ)
The curve is perfectly symmetric around the mean.
The heights of adult men in a city are normally distributed with a mean of 70 inches and a standard deviation of 3 inches.

Approximately what percentage of men are between 64 and 76 inches tall?
Explanation
First, count standard deviations from the mean:
70 − 64 = 6 inches = below the mean
76 − 70 = 6 inches = above the mean
So the range 64 to 76 = μ ± 2σ → 95% of data Quick check: 70 ± 1×3 = [67, 73] → 68% | 70 ± 2×3 = [64, 76] → 95% ✓ | 70 ± 3×3 = [61, 79] → 99.7%
🔄 Try This
Using the same distribution (μ=70, σ=3), approximately what percentage of men are taller than 73 inches? Hint: 73 = μ + 1σ. What percent is in the right tail beyond 1σ?
Q·11
Medium
SYMMETRY: total=100% · left half=50% · 1σ tail = (100−68)/2 = 16%
Word Problem: IQ scores are normally distributed with a mean of 100 and a standard deviation of 15.

What percentage of the population has an IQ score above 130?
Explanation
130 − 100 = 30 = (since σ = 15)
Within 2σ (between 70 and 130): 95% of the population.
That leaves 5% outside the range — split equally in both tails (symmetry!):
Above 130 = 5% ÷ 2 = 2.5% A — 5% is in both tails combined, not just the upper one.
C — Wrong. 16% applies to 1σ above mean (score = 115), not 2σ.
D — Wrong. 3σ from mean = 100 ± 45 = [55, 145], not 130.
🔄 Try This
What percentage of the population has an IQ between 85 and 115? What about between 70 and 130? (Hint: 85 = μ − 1σ, 115 = μ + 1σ)
Section 5 · Standard Normal Distribution & z-Scores
Q·12
Beginner
z = (x − μ) / σ · z=0 is mean · z=+1 is 1σ above · z=−2 is 2σ below
The z-Score Formula
A z-score tells you how many standard deviations a value is from the mean:
   z = (x − μ) / σ
• z = 0 → the value equals the mean
• z = +2 → the value is 2 standard deviations above the mean
• z = −1.5 → the value is 1.5 standard deviations below the mean
The standard normal distribution has μ = 0 and σ = 1.
A student scored 85 on a test where the class mean is 75 and the standard deviation is 5.

What is the student's z-score?
Explanation
z = (x − μ) / σ = (85 − 75) / 5 = 10 / 5 = 2.0 This means the student scored 2 standard deviations above the mean.

A — Wrong. z=1.0 would mean scoring 75 + 1×5 = 80, not 85.
D — Wrong. Always subtract the mean first before dividing by σ.
🔄 Try This
Another student scored 60 on the same test. What is their z-score? Is it positive or negative? What does that tell you about their performance relative to the class?
Q·13
Medium
COMPARE across distributions: always use z-scores, NOT raw scores
Word Problem: Sarah took two different standardized tests:
• Math test: she scored 680 (class mean = 650, SD = 30)
• English test: she scored 75 (class mean = 65, SD = 8)

On which test did Sarah perform relatively better compared to her class?
Explanation
Calculate both z-scores:
Math z = (680 − 650) / 30 = 30/30 = 1.0 English z = (75 − 65) / 8 = 10/8 = 1.25 Sarah's English z-score (1.25) > Math z-score (1.0).
She performed relatively better in English compared to her peers.

A — NEVER compare raw scores across different tests. They're on different scales!
Key lesson: z-scores allow fair comparison across different distributions.
🔄 Try This
If Sarah wants to improve her weakest subject, which should she focus on? If she studies Math and raises her score to 710, what would her new Math z-score be?
Q·14
Hard
z-TABLE: area to LEFT of z · P(X<x) = Φ(z) · P(X>x) = 1 − Φ(z)
The weight of apples from a farm is normally distributed with a mean of 150 grams and a standard deviation of 20 grams.

A store only accepts apples weighing more than 170 grams. Using the z-table value that P(Z < 1.0) = 0.8413, approximately what percentage of apples are accepted?
Explanation
First compute the z-score: z = (170 − 150) / 20 = 1.0

The z-table gives P(Z < 1.0) = 0.8413 — this is the area to the LEFT (below 170g).
We want apples above 170g:
P(X > 170) = 1 − P(Z < 1.0) = 1 − 0.8413 = 0.1587 = 15.87% A — Wrong. 84.13% is the proportion below 170g — those are rejected, not accepted.
Critical tip: The z-table always gives area to the LEFT. For "greater than," always subtract from 1.
🔄 Try This
What percentage of apples weigh between 130g and 170g? (Hint: find P(−1 < Z < 1). Use symmetry — P(Z < −1.0) = 1 − 0.8413 = 0.1587)
Section 6 · Harder Word Problems & Mixed Concepts
Q·15
Hard
FIND x FROM z: x = μ + z·σ · reverse the z-score formula
Word Problem: A teacher grades on a curve. She wants the top 16% of students to receive an "A." Exam scores are normally distributed with a mean of 72 and a standard deviation of 10.

What is the minimum score needed to receive an "A"? (Recall: 16% above 1σ from mean)
Explanation
From the Empirical Rule: 68% of data is within ±1σ. That means 32% is outside, and by symmetry, 16% is above 1σ from the mean.

So "top 16%" corresponds to z = +1.0.
x = μ + z·σ = 72 + (1.0)(10) = 82 B — Top 2.5% would need z = +2, giving x = 72 + 2(10) = 92. That's too strict.
Remember: To find a cutoff value, use x = μ + z·σ (the reverse z-score formula).
🔄 Try This
The teacher also wants to give a "D" to the bottom 2.5% of students. What is the maximum score for a "D"? (Hint: bottom 2.5% → z = −2.0)
Q·16
Hard
PERCENTILE rank: use z-table → area to LEFT = percentile rank
Word Problem: The time it takes students to complete a standardized test is normally distributed with a mean of 45 minutes and a standard deviation of 8 minutes.

Given P(Z < −1.25) ≈ 0.1056, what is the probability that a randomly selected student finishes in less than 35 minutes?
Explanation
z = (35 − 45) / 8 = −10/8 = −1.25 The question asks P(X < 35) = P(Z < −1.25).
The z-table directly gives us the area to the left: 0.1056 = 10.56%

B — 89.44% is P(X > 35), not P(X < 35).
D — Wrong. Do not double the value unless finding a two-tailed probability.
🔄 Try This
What percentage of students take between 35 and 53 minutes? P(35 < X < 53) = P(−1.25 < Z < ?) — first find the z-score for 53 minutes.
Q·17
Hard
BETWEEN two values: P(a<X<b) = P(Z<z₂) − P(Z<z₁)
Word Problem: The daily temperature in a city in June is normally distributed with μ = 78°F and σ = 6°F.

Given: P(Z < 0.33) ≈ 0.63 and P(Z < −0.67) ≈ 0.25

What is the probability that a randomly chosen June day has a temperature between 74°F and 80°F?
Explanation
Step 1 — Find z-scores:
z for 80°F: (80 − 78) / 6 = 2/6 ≈ 0.33 z for 74°F: (74 − 78) / 6 = −4/6 ≈ −0.67 Step 2 — Use subtraction:
P(74 < X < 80) = P(Z < 0.33) − P(Z < −0.67) = 0.63 − 0.25 = 0.38 B — NEVER add z-table values. Always subtract (bigger minus smaller).
D — The range is NOT symmetric: 80−78 = 2, but 78−74 = 4 (different distances from mean).
🔄 Try This
What is the probability that a June day is above 84°F? Find the z-score for 84°F, then use 1 − (z-table value).
Q·18
Exam Level
OUTLIER in boxplot: beyond 1.5×IQR fence · in normal dist: beyond 3σ is unusual
Word Problem: A quality control inspector measures the diameter (in mm) of bolts produced by a machine. The data is:
10.1, 10.3, 10.2, 10.4, 10.1, 10.3, 10.2, 10.5, 10.1, 14.8

The inspector creates a box plot and marks outliers. Which statement about 14.8 mm is correct?
Explanation
Sorted data (excluding 14.8): 10.1, 10.1, 10.1, 10.2, 10.2, 10.3, 10.3, 10.4, 10.5
Q1 ≈ 10.1, Q3 ≈ 10.4, IQR = 0.3
Upper fence = Q3 + 1.5 × IQR = 10.4 + 0.45 = 10.85 mm
14.8 mm >> 10.85 mm → definitely an outlier

B — Wrong. Any single data point can be an outlier.
C — The opposite is true! Outliers make the median a better (more resistant) measure.
D — Wrong. Outliers strongly inflate the standard deviation (because squaring the large deviation makes it enormous).
🔄 Try This
Compare the mean and median of this data set with and without 14.8 mm. How much does removing the outlier change each? Which measure changes more?
Q·19
Exam Level
z-SCORE to PERCENTILE: z=+1→84th · z=+2→97.5th · z=−1→16th
Hard Word Problem: A student's score on a national exam places her at the 97.5th percentile. The exam scores are normally distributed with a mean of 500 and a standard deviation of 100.

What was her approximate score?
Explanation
97.5th percentile means 97.5% of students scored below her.
From the Empirical Rule: 95% of data is within 2σ. The upper 2.5% tail begins at z = +2.
x = μ + z·σ = 500 + (2.0)(100) = 700 A — z = +1 corresponds to the 84th percentile, not 97.5th.
D — Never convert percentile directly to raw score without using μ and σ. That logic is completely wrong.

Key percentile-z links to memorize:
84th ↔ z=+1 | 97.5th ↔ z=+2 | 99.85th ↔ z=+3 | 16th ↔ z=−1
🔄 Try This
A student scored 400 on the same exam. What percentile is that? Calculate the z-score, then use the Empirical Rule to identify the percentile.
Q·20
Exam Level
SYNTHESIS: read context carefully · choose correct measure · explain tradeoffs
Final Challenge — Word Problem:

A real estate analyst collects the selling prices of homes in two neighborhoods (in $1,000s):

Neighborhood A: 220, 235, 240, 245, 250, 255, 260, 265, 270, 280
Neighborhood B: 150, 180, 195, 210, 245, 250, 260, 280, 420, 890

A buyer's agent says: "Both neighborhoods have a similar median (~$247K), so they are equally affordable."

Which statement best critiques this claim?
Explanation
This is the classic trap: same median ≠ same distribution.

Neighborhood A: prices clustered tightly between $220K–$280K. Low SD ≈ $18K. Predictable.
Neighborhood B: includes extreme values ($420K, $890K). Much higher SD ≈ $215K. Very unpredictable.

The medians may be similar, but the risk and variability are completely different.

A — Wrong. Two distributions with identical medians can look totally different (see above).
B — Wrong. With outliers, the mean of B is inflated (~$308K), making median better, not worse.
C — Partially true but incomplete — the key issue is overall variability, not just the max.

Lesson: Always report both a measure of center AND a measure of spread. Never judge a distribution by one number alone.
🔄 Try This
Calculate the actual mean for Neighborhood B. How different is it from the median? Which neighborhood would you choose if you had a strict budget of $260K? Why might the IQR be more useful than the SD here?
Your Final Score

Review each explanation carefully — statistics is about understanding context, not just calculation.