20 core concept problems — from descriptive stats to inference. Each wrong answer shows a full explanation.
20Problems
8Topics
⚡Memory Keys
0 / 20
Score: 0
Unit 1 · Describing Data
Q01
Mean vs. Median
A dataset of 5 exam scores is: 70, 72, 75, 78, 150. Which statement is correct?
SKEW → PULL MEAN away from median · outlier = use median
Correct: B. Mean = (70+72+75+78+150)/5 = 89. Median = 75 (middle value). The extreme outlier 150 pulls the mean upward. The median is resistant to outliers — it only depends on the middle rank, not the actual value. Always prefer median when data is skewed or has outliers.
Q02
Standard Deviation
Two classes took the same test. Class A scores: 70, 75, 80, 85, 90. Class B scores: 79, 79, 80, 81, 81. Which has a larger standard deviation and why?
SD = spread from mean · more spread = bigger SD
Correct: B. Both classes have mean = 80. But Class A's values range from 70 to 90 (wider spread), while Class B clusters between 79–81 (narrow spread). Standard deviation measures how far values deviate from the mean on average. Larger deviations → larger SD. Equal means ≠ equal spread!
Q03
z-Score Interpretation
A student scores 88 on a test where \(\mu = 80\) and \(\sigma = 4\). What is the student's z-score, and what does it mean?
\[z = \frac{x - \mu}{\sigma}\]
z = HOW MANY SDs above/below mean · z=0 means AT the mean
Correct: B. \(z = (88 - 80)/4 = 8/4 = 2\). A z-score of +2 means the student scored 2 standard deviations above the mean. Positive z = above mean; negative z = below mean; z = 0 = exactly at the mean. About 95% of data falls within \(|z| \leq 2\) in a normal distribution.
Unit 2 · Displaying & Exploring Data
Q04
Boxplot & IQR
A dataset has: Min=10, Q1=20, Median=30, Q3=50, Max=90. What is the IQR, and which values would be classified as outliers?
\[\text{IQR} = Q_3 - Q_1\qquad \text{Outlier if } x < Q_1 - 1.5\cdot\text{IQR} \text{ or } x > Q_3 + 1.5\cdot\text{IQR}\]
Correct: C. IQR = Q3 − Q1 = 50 − 20 = 30. Wait — let's recalculate properly: Lower fence = 20 − 1.5(30) = 20 − 45 = −25. Upper fence = 50 + 1.5(30) = 50 + 45 = 95. Since 90 < 95, Max = 90 is NOT an outlier. Min = 10 > −25, also not an outlier. So there are no outliers. Answer C correctly identifies IQR = 30 and no outlier at 90. Always check BOTH fences!
Q05
Shape of Distribution
A histogram shows most values bunched on the right side, with a long tail stretching to the left. This distribution is:
TAIL direction = skew direction · left tail = left-skewed (negatively skewed)
Correct: B. Skewness is named after the direction of the tail, NOT the peak. Long tail to the left → left-skewed (negatively skewed). In left-skewed distributions, mean < median < mode. A common trap: students say "peak is on right, so right-skewed" — wrong! Follow the tail!
Unit 3 · Probability
Q06
Conditional Probability
In a class, 40% study math (M), 30% study science (S), and 15% study both. Given a student studies math, what is the probability they also study science?
\[P(S \mid M) = \frac{P(S \cap M)}{P(M)}\]
CONDITIONAL = restrict sample space · divide by the given event
Correct: A. \(P(S \mid M) = P(S \cap M) / P(M) = 0.15 / 0.40 = \mathbf{0.375}\). Think of it as: "Among the 40% who study math, 15% also study science." So \(15/40 = 37.5\%\). The common mistake is using 0.30 (unconditional P(S)) or 0.15 (the intersection) without dividing.
Q07
Independence vs. Mutual Exclusivity
Events A and B satisfy \(P(A) = 0.4\), \(P(B) = 0.3\), and \(P(A \cap B) = 0\). Which statement is TRUE?
MUTUALLY EXCLUSIVE ≠ independent · ME means P(A∩B)=0 · Independent means P(A∩B)=P(A)·P(B)
Correct: C. Mutually exclusive means \(P(A \cap B) = 0\) ✓ (given). For independence, we need \(P(A \cap B) = P(A) \cdot P(B) = 0.4 \times 0.3 = 0.12\). But \(P(A \cap B) = 0 \neq 0.12\), so NOT independent. In fact, mutually exclusive events (when both have positive probabilities) are always dependent — if A occurs, B cannot!
Q08
Complement Rule
The probability of rain on any given day is 0.35. What is the probability it does NOT rain for 3 consecutive days? (Assume independence.)
\[P(\text{no rain 3 days}) = P(\text{no rain})^3 = (1 - 0.35)^3\]
Correct: B. P(no rain on one day) = 1 − 0.35 = 0.65. For 3 independent days: \(0.65^3 = 0.65 \times 0.65 \times 0.65 \approx \mathbf{0.274}\). Common errors: (A) only calculates one day; (C) incorrectly cubes 0.35; (D) multiplies 0.65 × 3 (addition instead of multiplication for independent AND events).
Unit 4 · Random Variables & Distributions
Q09
Expected Value
A game pays $10 with probability 0.2, $5 with probability 0.5, and loses $3 with probability 0.3. What is the expected value?
\[E(X) = \sum x_i \cdot P(x_i)\]
E(X) = weighted average · multiply each value × probability then ADD
Correct: A. \(E(X) = (10)(0.2) + (5)(0.5) + (-3)(0.3) = 2 + 2.5 - 0.9 = \mathbf{\$3.60}\)... Wait, that gives B! Let me recheck: \(2.0 + 2.5 - 0.9 = 3.6\). Correct answer is B: $3.60. Don't forget: losses are negative values! Multiply carefully and keep track of signs.
Q10
Binomial Distribution
A fair coin is flipped 6 times. What is the probability of getting exactly 4 heads?
\[P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}\]
BINS: Binary · Independent · N fixed · Same probability · BINOM check all 4!
Correct: A. \(P(X=4) = \binom{6}{4}(0.5)^4(0.5)^2 = 15 \times 0.0625 \times 0.25 = 15 \times 0.015625 = \mathbf{0.234}\). Note: \(\binom{6}{4} = \binom{6}{2} = 15\). The "choose" (combination) accounts for all the different orderings of 4 heads — without it, you'd only count one specific sequence.
Unit 5 · Sampling Distributions
Q11
Central Limit Theorem
A population has mean \(\mu = 50\) and standard deviation \(\sigma = 20\). For samples of size \(n = 100\), the sampling distribution of \(\bar{x}\) has what standard deviation?
\[\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}\]
CLT: n≥30 → approx. normal · SE = σ/√n · bigger n = less spread
Correct: B. \(\sigma_{\bar{x}} = 20/\sqrt{100} = 20/10 = \mathbf{2}\). This is called the Standard Error (SE). Larger samples reduce variability of the sample mean. SE ≠ SD — SE is the standard deviation of the sampling distribution (how much \(\bar{x}\) varies from sample to sample), not the original population's spread.
Q12
Bias vs. Variability
A sample consistently gives estimates that are too high, but those estimates are very close to each other. This sampling method has:
BIAS = consistently off-target · VARIABILITY = spread of estimates · want LOW both
Correct: C. "Consistently too high" = high bias (systematic error, hitting the wrong target). "Very close to each other" = low variability (precise, consistent results). Think of it like an archery target: high bias = arrows cluster away from the bullseye; low variability = arrows cluster tightly together (even if in the wrong spot).
Unit 6 · Confidence Intervals
Q13
Interpreting a Confidence Interval
A 95% confidence interval for a population mean is (42, 58). Which interpretation is CORRECT?
CI = PROCEDURE captures μ 95% of time · NOT "95% chance μ is inside THIS interval"
Correct: B. This is the most commonly missed concept in AP Stats. The true μ is a fixed (unknown) value — it's either in (42, 58) or it's not. The "95%" refers to the procedure: if we repeat the sampling many times, 95% of the intervals constructed this way will contain μ. Never say "95% probability that μ is in THIS specific interval."
Q14
Margin of Error
To cut the margin of error in half while keeping the same confidence level, you should:
\[\text{ME} = z^* \cdot \frac{\sigma}{\sqrt{n}}\]
ME ∝ 1/√n · to HALF ME → QUADRUPLE n (×4)
Correct: B. ME = \(z^* \cdot \sigma/\sqrt{n}\). To halve ME: \(\sigma/\sqrt{n_{\text{new}}} = \frac{1}{2} \cdot \sigma/\sqrt{n_{\text{old}}}\), which gives \(\sqrt{n_{\text{new}}} = 2\sqrt{n_{\text{old}}}\), so \(n_{\text{new}} = 4n_{\text{old}}\). The square root makes the relationship non-linear — halving ME requires 4× the sample size, not 2×. This surprises many students!
Unit 7 · Hypothesis Testing
Q15
Type I vs. Type II Error
A court uses "innocent until proven guilty." Convicting an innocent person corresponds to which type of error?
Type I = FALSE POSITIVE (reject true H₀) · Type II = FALSE NEGATIVE (fail to reject false H₀)
Correct: B. H₀: person is innocent (true). We "rejected" H₀ (convicted them) even though H₀ was true → Type I error (false positive). Type I error rate = α (significance level). Type II error = acquitting a guilty person (fail to reject false H₀). Reducing α decreases Type I errors but increases Type II errors — there's always a tradeoff.
Q16
p-value Interpretation
A test gives a p-value of 0.03 with significance level α = 0.05. Which conclusion is correct?
p-value = P(data this extreme | H₀ true) · p < α → reject H₀ · p ≠ P(H₀ is true)
Correct: C. Since p = 0.03 < α = 0.05, we reject H₀. The p-value is NOT the probability that H₀ is true (option A — very common mistake!). It's the probability of observing results this extreme assuming H₀ is true. Also, we never "prove" anything in statistics (D) — we only find sufficient evidence to reject or insufficient evidence to reject.
Unit 8 · Linear Regression & Correlation
Q17
Correlation vs. Causation
A study finds a strong positive correlation (r = 0.91) between ice cream sales and drowning deaths. Can we conclude that ice cream causes drowning?
CORRELATION ≠ CAUSATION · lurking variable = confounding · need EXPERIMENT not observation
Correct: B. The lurking variable is summer heat — it increases both ice cream sales AND swimming (thus drowning risk). High r only tells us there's a linear relationship, not its cause. To establish causation: you need a randomized controlled experiment. Observational studies can NEVER prove causation, regardless of how high r is.
Q18
Interpreting the Slope
The LSRL for predicting test score from study hours is: \(\hat{y} = 50 + 8x\). What does the slope 8 mean?
SLOPE = for each 1-unit increase in x, y changes by b₁ · in CONTEXT always!
Correct: B. The slope b₁ = 8 means: for each 1-unit increase in x (study hours), the predicted y (test score) increases by 8. The intercept 50 means a student with 0 study hours is predicted to score 50. Always state slope interpretation in context using "for each additional [x-unit], [y] is predicted to [increase/decrease] by [slope]."
Q19
Coefficient of Determination r²
For the regression of test scores on study hours, r = 0.8. The value r² = 0.64 tells us:
r² = % of variation in y EXPLAINED by the linear relationship with x
Correct: C. r² = 0.64 means 64% of the variability in test scores is explained by the linear relationship with study hours. The remaining 36% is due to other factors. Full template: "[r² × 100]% of the variation in [y] is explained by the linear relationship with [x]." Never skip "variation" or "linear relationship" in your answer!
Q20
Residual Analysis
After fitting a linear regression, the residual plot shows a curved (U-shaped) pattern. What does this indicate?
\[\text{Residual} = \text{Observed} - \text{Predicted} = y - \hat{y}\]
RESIDUAL PLOT: random scatter = good fit · pattern = linear model inappropriate
Correct: C. A curved residual plot means the linear model is systematically missing a curve in the data — it's underfitting a non-linear relationship. A good residual plot should look like random scatter around zero with no pattern. Patterns (U-shape, funnel, wave) all signal that the linear model is inappropriate. Try a transformation (like log or square root) or a polynomial model.