01
Normal Distributions & Z-Scores
Bell curve ยท Standardization ยท Empirical Rule ยท Percentiles
Quick-Recall Keys
- Z-score = (x โ ฮผ) / ฯ
- 68-95-99.7 Rule 1ฯ / 2ฯ / 3ฯ
- ฮผ = mean center of bell
- ฯ = std dev width of bell
- Z=0 = exactly the mean
- Positive Z โ above mean
- Negative Z โ below mean
A normal distribution is best described as a distribution that is...
๐ก Explanation
A normal distribution is always symmetric and bell-shaped, centered at its mean (ฮผ). Option C describes the standard normal distribution specifically โ not all normal distributions. Option A describes a right-skewed distribution.The formula for a Z-score is:
Z = (x โ ฮผ) / ฯ
If a student scores 85 on a test where the mean is 75 and the standard deviation is 10, what is the Z-score?
๐ก Explanation
Z = (85 โ 75) / 10 = 10 / 10 = 1.0. A Z-score of +1.0 means the student scored exactly one standard deviation above the mean. Always plug in: (your value โ mean) รท std dev.According to the Empirical Rule (68โ95โ99.7 Rule), approximately what percentage of data falls within 2 standard deviations of the mean in a normal distribution?
๐ก Explanation
Memorize: 68 โ 95 โ 99.7 for 1ฯ โ 2ฯ โ 3ฯ. Within ยฑ2ฯ: about 95% of all data. This is one of the most tested facts in statistics. Think: "2 standard deviations catches almost everyone."Heights of adult women in a city are normally distributed with a mean of 64 inches and a standard deviation of 3 inches. What is the Z-score for a woman who is 70 inches tall?
๐ก Explanation
Z = (70 โ 64) / 3 = 6 / 3 = 2.0. This woman is exactly 2 standard deviations above the mean. Using the Empirical Rule, only about 2.5% of women are taller than her (since 95% fall within ยฑ2ฯ, and 5% fall outside โ split evenly, 2.5% on each side).๐ Word Problem: The weight of bags of chips produced by a factory is normally distributed with ฮผ = 150g and ฯ = 5g. Quality control rejects bags with a Z-score below โ2 or above +2.
Approximately what percentage of bags will be rejected?
๐ก Explanation
95% of bags fall within Z = ยฑ2, meaning they are accepted. Therefore, 100% โ 95% = 5% are rejected. Common mistake: choosing 0.3% (that's the area outside ยฑ3ฯ) or confusing accepted with rejected.๐ Word Problem: Scores on a standardized exam are normally distributed with ฮผ = 500 and ฯ = 100. A university admits students who score in the top 16%.
What is the minimum score needed for admission? (Use the Empirical Rule.)
๐ก Explanation
68% falls within ยฑ1ฯ โ 32% falls outside ยฑ1ฯ โ 16% falls above Z = +1 (by symmetry). So the top 16% starts at Z = +1, which is: x = ฮผ + 1ฯ = 500 + 100 = 600. Key insight: "top 16%" means you need to find the score at Z = +1.๐ Tricky Concept: Two students take different tests. Student A scores 78 on a test with ฮผ = 70, ฯ = 8. Student B scores 82 on a test with ฮผ = 75, ฯ = 4.
Who performed better relative to their class?
๐ก Explanation
Student A: Z = (78โ70)/8 = 1.0. Student B: Z = (82โ75)/4 = 7/4 = 1.75. Higher Z-score means better performance relative to the class. This is exactly WHY Z-scores exist โ to compare across different tests fairly. Raw score alone is misleading.02
Hypothesis Testing
Hโ & Hโ ยท p-value ยท Significance level ยท Type I & II Errors
Quick-Recall Keys
- Hโ = null (no effect, status quo)
- Hโ = alternative (our claim)
- p-value small โ reject Hโ
- ฮฑ = 0.05 = significance level
- Type I Error = false alarm (reject true Hโ)
- Type II Error = missed signal (keep false Hโ)
- p < ฮฑ โ statistically significant
In hypothesis testing, the null hypothesis (Hโ) always states that...
๐ก Explanation
Hโ (null hypothesis) always represents the "boring, default" position: nothing is happening, no effect exists. Think of Hโ as the prosecution saying "innocent until proven guilty." You only reject it if evidence is strong enough.A researcher tests whether a new drug lowers blood pressure. The significance level is ฮฑ = 0.05. The study produces a p-value of 0.03. What should the researcher conclude?
๐ก Explanation
p-value (0.03) < ฮฑ (0.05) โ Reject Hโ. A small p-value means the data is very unlikely if Hโ were true. Important: we never "accept Hโ" โ we only fail to reject it. Option C is always wrong wording in statistics.A teacher claims the average score in her class is above 80. Which pair of hypotheses correctly sets up this test?
๐ก Explanation
Hโ always uses "=" (equality). The teacher's claim goes into Hโ โ and since she claims scores are "above 80," Hโ: ฮผ > 80. This is a one-tailed (right-tailed) test. Option B would be a two-tailed test (โ means "different in either direction").๐ Word Problem: A factory claims its machines produce bolts with a mean diameter of exactly 10mm. An engineer suspects the bolts are not exactly 10mm (could be too big or too small). She samples 50 bolts and runs a hypothesis test with ฮฑ = 0.05. The p-value is 0.08.
What is the correct conclusion?
๐ก Explanation
p-value (0.08) > ฮฑ (0.05) โ Fail to reject Hโ. The evidence is not strong enough. Since the engineer suspects a difference in either direction, this is a two-tailed test โ do NOT halve the p-value. Option D is wrong here.A Type I Error occurs when you...
๐ก Explanation
Type I = False Positive = "false alarm" โ you cry wolf when there is no wolf. Its probability equals ฮฑ. Type II = False Negative = "missed it" โ the wolf was there but you didn't notice. Its probability is ฮฒ. Memory trick: Type I โ "I cried wolf" (rejected truth).๐ Word Problem: A pharmaceutical company tests a new vaccine. Hโ: The vaccine has no effect. Hโ: The vaccine reduces infections.
A Type II Error here would mean:
A Type II Error here would mean:
Which real-world consequence best describes a Type II Error in this context?
๐ก Explanation
Type II Error = Fail to reject a false Hโ. Here Hโ says "no effect," but in reality the vaccine works. Failing to reject Hโ means we conclude "no effect" โ and a working vaccine gets shelved. Option A describes a Type I Error (approving something ineffective = false positive).๐ Tricky Question: A study finds p = 0.049 with ฮฑ = 0.05.
Which statement is the most accurate interpretation?
๐ก Explanation
The p-value is a conditional probability: given Hโ is true, how likely is this data? It is NOT the probability that Hโ is true (Option A), and it does not prove anything (Option D). Option C confuses the p-value with power. Option B is the precise, correct definition.03
Margin of Error
Confidence Intervals ยท Critical Values ยท Sample Size ยท Interpretation
Quick-Recall Keys
- MoE = z* ร (ฯ / โn)
- CI = xฬ ยฑ MoE
- 90% CI โ z* = 1.645
- 95% CI โ z* = 1.96
- 99% CI โ z* = 2.576
- nโ โ MoEโ (more precise)
- CI wider = more confident
A poll reports: "47% of voters support Candidate A, with a margin of error of ยฑ3%." What does this mean?
๐ก Explanation
The confidence interval is: 47% ยฑ 3% = [44%, 50%]. The margin of error is not about survey mistakes or uncertainty in responses โ it quantifies the sampling variability. We are (typically 95%) confident the true population value lies in this range.Which of the following will decrease the margin of error?
๐ก Explanation
MoE = z* ร (ฯ / โn). A larger n makes โn bigger, making ฯ/โn smaller โ MoE decreases. Options A and C both increase MoE. Option B decreases n โ increases MoE. Always remember: more data = more precision = smaller MoE.๐ Word Problem: A researcher surveys 100 students and finds the average study time is 3.5 hours per day, with a population standard deviation of ฯ = 1.0 hours. The critical value for a 95% confidence interval is z* = 1.96.
What is the margin of error?
๐ก Explanation
MoE = z* ร (ฯ/โn) = 1.96 ร (1.0/โ100) = 1.96 ร (1.0/10) = 1.96 ร 0.1 = 0.196 hours. So the 95% CI is: 3.5 ยฑ 0.196 โ [3.304, 3.696]. Common mistake: forgetting to take โn (just dividing by n = 100 instead of 10).A 95% confidence interval for a population mean is reported as (42, 58). Which of the following interpretations is correct?
๐ก Explanation
This is the most commonly misunderstood concept! Once an interval is computed, the true mean either IS or ISN'T in it (probability is 0 or 1). The "95%" refers to the process: if we repeated this study many times, 95% of those intervals would capture the true mean. Option A sounds right but is technically wrong โ the true mean is a fixed value, not random.๐ Word Problem: A market researcher wants to estimate the proportion of customers who prefer Brand X, with a margin of error of no more than 4% at a 95% confidence level (z* = 1.96). She assumes maximum variability (p = 0.5).
What is the minimum sample size needed?
๐ก Explanation
For proportions: MoE = z* ร โ(p(1โp)/n). Solving for n: n = (z*/MoE)ยฒ ร p(1โp) = (1.96/0.04)ยฒ ร 0.5 ร 0.5 = (49)ยฒ ร 0.25 = 2401 ร 0.25 = 600.25 โ round up to 601. Note: n = 385 is the answer for MoE = 5%, a very common mix-up on exams.๐ Connecting All Three Units: A health agency conducts a study and finds that the average daily sodium intake of adults is xฬ = 2,400 mg, with a 95% CI of (2,250 mg, 2,550 mg). The recommended daily limit is 2,300 mg.
Hโ: ฮผ = 2,300 mg Hโ: ฮผ โ 2,300 mg
Hโ: ฮผ = 2,300 mg Hโ: ฮผ โ 2,300 mg
Based on the confidence interval, what should the agency conclude about the hypothesis test at ฮฑ = 0.05?
๐ก Explanation
Key insight: A 95% CI is directly equivalent to a two-tailed hypothesis test at ฮฑ = 0.05. The CI is [2,250; 2,550]. The null value 2,300 falls inside this interval... wait โ 2,300 IS inside [2,250, 2,550]. So we fail to reject? Actually: 2,250 < 2,300 < 2,550 โ โ 2,300 is inside the interval, so Fail to reject Hโ. Option A is correct! This is an intentionally tricky "trick question" โ always check whether the null value is inside or outside the CI carefully.๐
Quiz Complete!
0/20