← back to statistics

Inference for Means

OpenIntro Statistics · CC BY-SA 3.0 · Chapter 7

When data are numerical, inference targets the mean. The t-distribution replaces the normal when the population standard deviation is unknown. ANOVA generalizes the two-sample test to any number of groups.

x̄1 x̄2 Group 1 Group 2

One-sample t-test

Test whether a sample mean differs from a hypothesized value. The t-statistic replaces Z when sigma is unknown: t = (x-bar - mu0) / (s / sqrt(n)). It follows a wpt-distribution with n - 1 degrees of freedom. As n grows, t approaches Z.

Scheme

Two-sample t-test

Compare means from two independent groups. The standard error combines variability from both samples: SE = sqrt(s1^2/n1 + s2^2/n2). Degrees of freedom come from Welch's approximation when variances are unequal.

Scheme

Paired t-test

When observations come in pairs (before/after, matched subjects), compute the differences and run a one-sample t-test on those differences. This controls for subject-level variation that would inflate the two-sample SE.

Scheme

ANOVA and the F-distribution

Analysis of variance tests whether any group mean differs among k groups. It decomposes total variation into between-group (MSG) and within-group (MSE) components. The F-statistic = MSG / MSE follows an F-distribution with (k-1, n-k) degrees of freedom. A large F means the group means spread more than within-group noise would predict.

Scheme

Notation reference

Symbol Formula Meaning
t(x̄ - μ0) / (s/√n)t-statistic
dfn - 1Degrees of freedom (one-sample)
MSGSSG / (k-1)Mean square between groups
MSESSE / (n-k)Mean square within groups
FMSG / MSEF-statistic for ANOVA
Neighbors

Cross-references

Foundations (Wikipedia)