← back to statistics

Foundations for Inference

OpenIntro Statistics · Ch. 5 · openintro.org/book/os

Statistical inference draws conclusions about a population from a sample. Two main tools: confidence intervals estimate a parameter with a margin of error, and hypothesis tests evaluate whether evidence supports a claim. Both rest on the sampling distribution.

Point estimates and the sampling distribution

A point estimate is a single number computed from data (like the sample mean). Different samples give different estimates. The distribution of all possible estimates is the sampling distribution. By the 🎰 Central Limit Theorem, sample means are approximately normal for large n.

reject reject fail to reject -z* μ₀ +z* α/2 α/2 Two-tailed hypothesis test at significance level α
Scheme

Confidence intervals

A 95% confidence interval is: point estimate ± z* × SE, where z* = 1.96 for 95% confidence. Interpretation: if we repeated the sampling many times, 95% of intervals would contain the true parameter.

Scheme

Hypothesis testing and p-values

Start with a null hypothesis H₀ (no effect). Compute a test statistic and its p-value: the probability of seeing data this extreme if H₀ is true. If p < α (typically 0.05), reject H₀.

Scheme

Significance level and errors

The significance level α is the threshold for rejecting H₀. Type I error: rejecting H₀ when it is true (probability = α). Type II error: failing to reject H₀ when it is false (probability = β). Power = 1 - β: the probability of correctly detecting a real effect.

Scheme
Neighbors

Foundations (Wikipedia)

Translation notes

OpenIntro builds inference from the sampling distribution of the sample proportion, then generalizes. We use the sample mean for clarity. The original covers decision errors with medical testing analogies and uses simulation to build intuition for sampling distributions. For large samples, the Z-test is equivalent to the t-test. Small-sample inference uses the t-distribution (covered in Ch. 7).

Want the full treatment? Read OpenIntro Statistics, Ch. 5.