← back to scientific method

A System of Logic

John Stuart Mill · 1843 · Project Gutenberg
Book III, Chapters 8-10: "Of the Four Methods of Experimental Inquiry" (later expanded to five).

Five methods for extracting causes from observations. You cannot prove causation from data alone, but you can isolate it by systematically varying conditions and watching what changes.

The method of difference

Mill's most influential method. Two cases identical in every respect except one factor. If the effect differs, that factor is the cause. This is the logic underneath every controlled experiment.

Case 1 A B C effect e Case 2 A B -- no e same conditions, one factor removed, effect disappears: C causes e

The five methods

Method How it works
AgreementMultiple cases share one common factor and the same effect. That factor is probably the cause.
DifferenceTwo cases identical except one factor. The effect appears only when that factor is present.
Joint methodAgreement and difference combined. Positive cases share the factor; negative cases lack it.
ResiduesSubtract known cause-effect pairs from a complex phenomenon. Whatever remains is caused by whatever factor remains.
Concomitant variationWhen a factor varies in degree, the effect varies in degree. The precursor to correlation analysis.

Mill's practical answer to Hume

Hume showed that no amount of observation proves the next instance. Induction cannot be justified by logic. Mill accepted this and sidestepped it. His methods do not prove causation in Hume's strict sense. They isolate it. If you hold everything constant and vary one factor, and the effect tracks that factor, you have extracted the cause from the noise.

The move is subtraction. Agreement finds what all positive cases share. Difference removes everything except the candidate. Residues subtract what you already know. Each method narrows the field of possible causes by elimination rather than by accumulating confirmations. Mill understood that Hume's critique applied to confirmation, not to elimination.

Fisher formalized exactly this insight ninety years later. Randomization is the method of difference made rigorous: assign subjects to treatment and control at random, so every factor except the treatment is (in expectation) identical between the two groups. Fisher never cited Mill directly, but the logic is the same. Vary one thing, hold the rest constant. The randomized controlled trial is Mill's second canon with a probability model bolted on.

Boole, Mill's contemporary, took the complementary path. Where Mill asked how to extract causes from observations, Boole asked how to make reasoning itself mechanical. Between them, Mill and Boole built the two pillars that twentieth-century science would stand on: controlled experiment and formal inference.

Limitations Mill saw

Mill was candid about what his methods could not do. The method of agreement can only identify necessary conditions, not sufficient ones. The method of difference requires that you can actually hold everything else constant, which in practice is hard. Concomitant variation detects correlation but cannot distinguish direct from indirect causation. The method of residues assumes you already know most of the causal structure, which means it works only in mature sciences.

These are not flaws in Mill's thinking. They are honest boundary conditions. Fisher's randomization solved the "hold everything constant" problem probabilistically. Modern causal inference (wpPearl, Rubin) solved the "direct vs. indirect" problem structurally. But every solution still operates within the framework Mill laid out: isolate causes by systematic variation and elimination.

Neighbors