20 essential questions linking every major unit — from data exploration to inference.
0 of 20 answered
Score: 0 / 20
Unit 1Exploring Data
Question 01
A dataset of exam scores has a mean of 72 and a median of 68.
Which statement is most likely true about the distribution's shape?
Key Word Mean > Median → right-skewed (pulled by high outliers)
5
Why not correct?
When mean > median, high-value outliers pull the mean rightward. Left-skew would give mean < median. Skew alone cannot confirm bimodality.
Question 02
The IQR (interquartile range) of a dataset is 12. Using the 1.5 × IQR rule,
any value below Q1 = 40 or above Q3 = 52 is classified as:
Formula Fence = Q1 − 1.5·IQR / Q3 + 1.5·IQR
5
Why not correct?
Lower fence = 40 − 1.5(12) = 40 − 18 = 22. Upper fence = 52 + 1.5(12) = 52 + 18 = 70. Any point outside [22, 70] is an outlier.
Unit 2Exploring Two-Variable Data
Question 03
A scatterplot shows hours studied (x) vs. test score (y) with
r = 0.87. The LSRL (least-squares regression line) is:
ŷ = 45 + 5.2x
A student who studies 0 hours is predicted to score:
Key Word y-intercept = predicted y when x = 0
5
Why not correct?
Plug x = 0: ŷ = 45 + 5.2(0) = 45. The y-intercept (45) is the predicted value when x = 0. D sounds intuitive but is not what the model says.
Question 04
The coefficient of determination r² = 0.76. This means:
Key Word r² = % variation in y explained by x
5
Why not correct?
r² ≠ r (A). r² describes explained variation, not slope (C). Correlation never implies causation and is never a "probability" of causation (D).
Unit 3Collecting Data
Question 05
A researcher assigns 30 patients to one of three treatments by randomly drawing names from a hat.
This is an example of:
Key Word Random Assignment → Experiment. Random Selection → Generalize
5
Why not correct?
Random assignment to treatments defines an experiment (C). Sampling methods (A, B) describe how subjects are selected from a population, not how they are assigned. Observational studies never impose a treatment (D).
Question 06
Which sampling method divides the population into non-overlapping groups (e.g., grade levels)
and then randomly selects from each group?
Key Word STRATA → pick from every group
5
Why not correct?
Cluster (A): randomly select entire groups. SRS (C): every individual equally likely, no grouping. Convenience (D): whoever is easiest — biased. Stratified (B): sample from each stratum.
Unit 4Probability
Question 07
Event A and Event B are independent.
If P(A) = 0.4 and P(B) = 0.5, find
P(A and B).
Formula Independent: P(A∩B) = P(A)·P(B)
5
Why not correct?
Independence → multiply: 0.4 × 0.5 = 0.20. A (0.9) uses addition = P(A∪B). 0.1 = 0.5 − 0.4 (subtraction, no rule). 0.45 = average of probabilities — meaningless here.
Question 08
Using the general addition rule:
P(A or B) = P(A) + P(B) − P(A and B)
If P(A)=0.6, P(B)=0.4, and P(A and B)=0.2, find P(A or B).
Trick Subtract overlap once to avoid double-counting
5
Why not correct?
0.6 + 0.4 − 0.2 = 0.80. A (1.0) forgets to subtract overlap. C (0.24) multiplies instead. D (0.76) subtracts 0.24 from 1 — circular error.
Unit 5Sampling Distributions
Question 09
The Central Limit Theorem states that as sample size n increases,
the sampling distribution of x̄ becomes approximately normal.
Which condition must hold?
Magic Number n ≥ 30 → CLT kicks in (regardless of population shape)
5
Why not correct?
The CLT's power is that no matter what shape the population has, large enough n produces a normal sampling distribution. A is the very thing CLT frees us from needing.
Question 10
The standard error (SE) of the sample mean is:
SE = σ / √n
If σ = 20 and n = 100, the SE equals:
Key Word SE shrinks as n grows — more data = more precision
5
Why not correct?
SE = 20 / √100 = 20/10 = 2. A forgets to divide. B divides by 100 instead of √100. D multiplies instead of divides.
Unit 6Inference for Proportions
Question 11
Before constructing a one-proportion z-interval, which condition checks that the
normal approximation is valid?
Check np̂ ≥ 10 AND n(1−p̂) ≥ 10 → Large Counts
5
Why not correct?
For proportions, normality is checked via the large counts condition (C), not n ≥ 30 (A, which applies to means). B applies to t-procedures, not proportions. D is never achievable with a sample.
Question 12
A 95% confidence interval for a proportion is (0.42, 0.58).
Which interpretation is correct?
Trap "95% confident" = about the METHOD, NOT about this specific interval
5
Why not correct?
A is the classic error — the true proportion is fixed, not random; the interval either captures it or doesn't. "95% confidence" describes the long-run performance of the method, not this one interval. B and D misinterpret what varies.
Unit 7Inference for Means
Question 13
A one-sample t-test is used (instead of a z-test) for means when:
Rule Unknown σ → use t-distribution with df = n − 1
5
Why not correct?
The t-distribution exists specifically because σ is unknown (C) and we substitute the sample s. With very large n the t and z converge, but we still formally use t. Proportions use z-procedures (D).
Question 14
The p-value of a hypothesis test is 0.03 and the significance level is α = 0.05.
What should the researcher conclude?
Rule p-value < α → Reject H₀ (evidence for Hₐ)
5
Why not correct?
0.03 < 0.05 → reject H₀ (B). We never "accept" or "prove" H₀ (C) — we only fail to reject it. D is backwards. A would apply only if p ≥ α.
Unit 8Chi-Square Tests
Question 15
A chi-square test for goodness of fit tests whether:
Key Word GOF = ONE variable vs. a claimed distribution
5
Why not correct?
A describes χ² test for independence/association (two-way table). B is a t-test for means. D is a one-proportion z-test. Goodness of fit (C) compares one variable's observed counts to expected counts from a stated model.
Question 16
For a chi-square test, expected counts must satisfy:
E = (row total × column total) / grand total
What is the minimum expected count required in each cell?
Check All expected counts ≥ 5 (large counts for χ²)
5
Why not correct?
The chi-square large counts condition requires all expected (not observed) counts ≥ 5. "At least 10" (A) is the large counts condition for proportions (np̂ ≥ 10). 30 (D) is for CLT of means.
Unit 9Inference for Slopes
Question 17
In a significance test for the slope β of a regression line,
the null hypothesis is always:
Key Word H₀: β = 0 → "No linear relationship exists"
5
Why not correct?
β = 0 means the line is flat — x has no predictive power over y. Although β = 0 ↔ r = 0, we state the null in terms of β (not r) for the slope test. D is actually the alternative hypothesis.
Question 18
In a regression output, the residual for a data point is defined as:
Formula residual = actual − predicted = y − ŷ
5
Why not correct?
Residual = y − ŷ (B). A reverses the sign. C is deviation from the mean (used in other formulas). D is a ratio — not a residual. A positive residual means the point lies above the line.
BonusError Types & Power
Question 19
A drug is actually effective, but the hypothesis test fails to reject H₀.
This is a:
Cheat Sheet Type I = reject true H₀ (false alarm) · Type II = keep false H₀ (miss)
5
Why not correct?
The drug works (H₀ false), yet we fail to reject H₀ — that's a Type II error (B). Type I (A) would be rejecting H₀ when it's actually true. Low power makes Type II errors more likely, but the error itself is Type II, not "a power failure" (D).
Question 20
Statistical power is the probability of:
Power = 1 − β
Which action increases power?
Key Word More n → less SE → bigger t/z → easier to detect effect
5
Why not correct?
Larger n reduces SE, making it easier to detect a real effect → higher power (C). Smaller n (A) reduces power. Stricter α (B) makes it harder to reject → reduces power. Higher σ (D) adds noise → reduces power.