When data are numerical, inference targets the mean. The t-distribution replaces the normal when the population standard deviation is unknown. ANOVA generalizes the two-sample test to any number of groups.
One-sample t-test
Test whether a sample mean differs from a hypothesized value. The t-statistic replaces Z when sigma is unknown: t = (x-bar - mu0) / (s / sqrt(n)). It follows a t-distribution with n - 1 degrees of freedom. As n grows, t approaches Z.
Scheme
; One-sample t-test; H0: mu = 100, sample mean = 105, s = 15, n = 25
(define x-bar 105)
(define mu0 100)
(define s 15)
(define n 25)
(define se (/ s (sqrt n)))
(define t-stat (/ (- x-bar mu0) se))
(display "x-bar = ") (display x-bar) (newline)
(display "SE = ") (display se) (newline)
(display "t = ") (display t-stat) (newline)
(display "df = ") (display (- n 1)) (newline)
; Critical t at alpha=0.05, df=24, two-tailed: ~2.064
(display "Reject H0? ")
(display (if (> (abs t-stat) 2.064) "Yes" "No"))
Python
# One-sample t-test
import math
x_bar, mu0, s, n = 105, 100, 15, 25
se = s / math.sqrt(n)
t = (x_bar - mu0) / se
print(f"t = {t:.3f}, df = {n-1}")
print("Critical t (alpha=0.05, df=24) = 2.064")
print(f"Reject H0? {'Yes' if abs(t) > 2.064 else 'No'}")
Two-sample t-test
Compare means from two independent groups. The standard error combines variability from both samples: SE = sqrt(s1^2/n1 + s2^2/n2). Degrees of freedom come from Welch's approximation when variances are unequal.
# Two-sample t-test
import math
x1, s1, n1 = 82, 10, 30
x2, s2, n2 = 78, 12, 35
se = math.sqrt(s1**2/n1 + s2**2/n2)
t = (x1 - x2) / se
print(f"t = {t:.3f}, SE = {se:.3f}")
# Conservative df = min(n1, n2) - 1 = 29 gives critical t ~ 2.045 at alpha=0.05
print(f"Reject H0? {'Yes' if abs(t) > 2.045 else 'No'}")
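The critical value 2.045 uses the conservative choice df = min(n1, n2) - 1 = 29. Welch's approximation, mentioned above, usually gives a larger (less conservative) df. A minimal sketch that computes it from the same sample statistics:

```python
# Welch's degrees-of-freedom approximation for the two-sample test above
x1, s1, n1 = 82, 10, 30
x2, s2, n2 = 78, 12, 35
v1, v2 = s1**2 / n1, s2**2 / n2  # each sample's variance of the mean
df = (v1 + v2)**2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))
print(f"Welch df = {df:.1f}")  # ~63, vs the conservative 29
```

With df near 63, the two-tailed critical t drops to roughly 2.00, so the Welch test is slightly more powerful than the conservative version.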
Paired t-test
When observations come in pairs (before/after, matched subjects), compute the differences and run a one-sample t-test on those differences. This controls for subject-level variation that would inflate the two-sample SE.
Scheme
; Paired t-test; Before: (85 90 78 92 88), After: (90 94 80 96 91); Test H0: mean difference = 0
(define before (list 85 90 78 92 88))
(define after (list 90 94 80 96 91))
(define diffs (map - after before))
(display "Differences: ") (display diffs) (newline)
(define n (length diffs))
(define d-bar (/ (apply + diffs) n))
; Sample SD of differences
(define ss (apply + (map (lambda (d) (* (- d d-bar) (- d d-bar))) diffs)))
(define s-d (sqrt (/ ss (- n 1))))
(define se (/ s-d (sqrt n)))
(define t-stat (/ (exact->inexact d-bar) se))
(display "d-bar = ") (display (exact->inexact d-bar)) (newline)
(display "s_d = ") (display s-d) (newline)
(display "t = ") (display t-stat) (newline)
(display "df = ") (display (- n 1)) (newline)
; Critical t at alpha=0.05, df=4: ~2.776
(display "Reject H0? ")
(display (if (> (abs t-stat) 2.776) "Yes" "No"))
Python
# Paired t-test
import math
before = [85, 90, 78, 92, 88]
after = [90, 94, 80, 96, 91]
diffs = [a - b for a, b in zip(after, before)]
n = len(diffs)
d_bar = sum(diffs) / n
s_d = math.sqrt(sum((d - d_bar)**2 for d in diffs) / (n - 1))
t = d_bar / (s_d / math.sqrt(n))
print(f"Diffs: {diffs}")
print(f"d_bar = {d_bar}, s_d = {s_d:.3f}")
print(f"t = {t:.3f}, df = {n-1}")
print(f"Reject H0? {'Yes' if abs(t) > 2.776 else 'No'}")
ANOVA and the F-distribution
Analysis of variance tests whether any of k group means differ. It decomposes total variation into a between-group component (MSG) and a within-group component (MSE). The F-statistic F = MSG / MSE follows an F-distribution with (k - 1, n - k) degrees of freedom under H0. A large F means the group means spread more than within-group noise alone would predict.
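The decomposition can be computed by hand, in the same style as the t-test snippets above. A sketch with three made-up groups (illustrative data, not from the text):

```python
# One-way ANOVA from scratch: F = MSG / MSE
groups = [
    [80, 85, 90, 88, 82],  # group 1
    [75, 78, 74, 80, 77],  # group 2
    [88, 92, 90, 85, 89],  # group 3
]
k = len(groups)
n = sum(len(g) for g in groups)
grand_mean = sum(sum(g) for g in groups) / n
means = [sum(g) / len(g) for g in groups]

# Between-group sum of squares: group sizes times squared mean deviations
ssg = sum(len(g) * (m - grand_mean)**2 for g, m in zip(groups, means))
# Within-group sum of squares: deviations from each group's own mean
sse = sum(sum((x - m)**2 for x in g) for g, m in zip(groups, means))

msg = ssg / (k - 1)   # between-group mean square
mse = sse / (n - k)   # within-group mean square
F = msg / mse
print(f"F = {F:.3f}, df = ({k - 1}, {n - k})")
# Critical F(2, 12) at alpha=0.05 is about 3.89
print(f"Reject H0? {'Yes' if F > 3.89 else 'No'}")
```

Here the group means (85.0, 76.8, 88.8) are far apart relative to the within-group spread, so F comes out large and H0 is rejected.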