Math 216: Statistical Thinking
Key Question: How can we accurately estimate population proportions from sample data? Sampling distributions provide the theoretical framework for statistical inference with categorical data!
Real-World Applications:
Statistical Framework:
Key Insight: Even though each sample gives a different answer, they follow a predictable normal pattern!
Core Statistical Properties
Mean of \(\hat{p}\):
Standard Error of \(\hat{p}\):
Key Insight: These properties enable statistical inference from samples to populations!
Core Statistical Principle
Central Limit Theorem for Proportions: For sufficiently large samples, the sampling distribution of \(\hat{p}\) is approximately normal, regardless of the population distribution shape.
Mathematical Formulation: \[\hat{p} \sim N\left(p, \sqrt{\frac{p(1-p)}{n}}\right)\]
Sample Size Requirements:
Statistical Significance: This universal principle enables confidence intervals and hypothesis testing for proportions!
Central Limit Theorem Conditions
Success-Failure Condition: For normal approximation to be valid, we need: \[np \geq 15 \quad \text{and} \quad n(1-p) \geq 15\]
Verification Examples:
Example 1: Political Polling
Example 2: Market Research
Probability Calculation Exercises
Real-World Application
Population proportion: \(p = 0.45\), sample size: \(n = 400\)
Standard error: \(SE = \sqrt{\frac{0.45 \times 0.55}{400}} = 0.0249\)
Find \(P(\hat{p} < 0.40)\):
pnorm((0.40-0.45)/0.0249) = 0.0222
Find \(P(\hat{p} > 0.50)\):
1 - pnorm((0.50-0.45)/0.0249) = 0.0222
Find \(P(0.42 < \hat{p} < 0.48)\):
pnorm((0.48-0.45)/0.0249) - pnorm((0.42-0.45)/0.0249) = 0.7699