โ† back to probability

Discrete Probability

Grinstead & Snell ยท GFDL ยท PDF

A probability function assigns a number between 0 and 1 to every outcome, and the numbers must add up to 1. That is the entire foundation. Everything else in probability is built on these two constraints.

Sample spaces and events

An experiment has a set of possible outcomes called the sample space (written Ω). An event is any subset of the sample space. Rolling a die: Ω = {1, 2, 3, 4, 5, 6}. "Rolling even" is the event {2, 4, 6}.

Ω E = even 1 3 5 2 4 6
Scheme

The probability function axioms

A probability function P assigns a weight to each outcome. Three axioms define it:

  1. Non-negative: P(ω) ≥ 0 for every outcome ω
  2. Sums to 1: the weights of all outcomes add to 1
  3. Additive: for disjoint events A and B, P(A ∪ B) = P(A) + P(B)

The probability of an event is the sum of the probabilities of its outcomes.

Scheme

Uniform distribution on finite sets

When every outcome is equally likely, P(ω) = 1/n where n = |Ω|. The probability of an event E is just |E|/|Ω|. Counting favorable outcomes over total outcomes is the oldest probability calculation there is.

Scheme

Coin flips and repeated trials

Flip a fair coin n times. The sample space has 2^n outcomes, each equally likely. The number of sequences with exactly k heads is "n choose k" (a preview of Chapter 3). For 3 flips, the probability of exactly 2 heads is 3/8.

Scheme

Notation reference

Textbook Scheme Meaning
Ωdie, two-diceSample space
E ⊆ Ω(filter pred omega)Event (subset)
P(ω)(P outcome)Probability of outcome
P(E) = |E|/|Ω|(uniform-prob event omega)Uniform probability
∑ P(ω) = 1(apply + (map cdr weights))Normalization axiom
Neighbors

Next chapters

Paper pages

  • ๐Ÿž Fritz 2020 โ€” Markov categories axiomatize probability with morphisms instead of measure theory
  • ๐Ÿž Staton 2025 โ€” probabilistic programming composes distributions like programs

Foundations (Wikipedia)

Ready for the real thing? Read Ch 1 of Grinstead & Snell.