← back to The Commons

As We May Think

Vannevar Bush · The Atlantic · July 1945 · Read the original

Bush proposes the memex—a desk that stores and links all of a researcher's documents by associative trails rather than hierarchical index. The real argument: science's bottleneck is not producing knowledge but selecting what matters from the flood.

The moment

Vannevar Bush ran the Office of Scientific Research and Development during World War II, coordinating roughly 6,000 American scientists. By the summer of 1945, the war was ending. The institutional question was practical: what happens when those scientists go back to peacetime work?

Bush's answer was not administrative. He looked at the knowledge those scientists had already produced and asked a harder question: can anyone actually find it? The publication rate had outrun every retrieval system in existence. Libraries filed by subject. Journals filed by date. Neither matched the way researchers actually thought.

The investigator is staggered by the findings and conclusions of thousands of other workers—conclusions which he cannot find time to grasp, much less to remember, as they appear.

The memex

Bush imagined a device the size of a desk. Inside: all of a person's books, records, correspondence, stored on microfilm. Two viewing screens. A keyboard for annotation. The storage technology was speculative but grounded in existing microfilm capabilities pushed to their limits.

The interesting part was not the hardware. It was the linking. A user reading one document could tie it to another with a named trail. That trail was permanent. The next time the user opened either document, the link was there, ready to follow. The memex stored not just information but the paths a mind had taken through it.

Wholly new forms of encyclopedias will appear, ready-made with a mesh of associative trails running through them, ready to be dropped into the memex and there amplified.

This was not alphabetical order. Not the Dewey Decimal System. Not any hierarchy at all. Bush was proposing what Ted Nelson would later call hypertext, twenty years before Nelson coined the term. The web wouldn't arrive for another half century.

Trails as thought

Bush grounded the memex in a claim about cognition. The human mind does not work by index. It works by association. One thought triggers the next through connections that are contextual, idiosyncratic, often surprising. A filing cabinet fights this tendency. The memex would mirror it.

The trail was the key concept. A researcher studying, say, the elastic properties of materials might build a trail linking a Turkish bow to a short English bow to the decline of mounted archery. That trail would persist. Another researcher could pick it up, extend it, branch from it. The trails themselves became documents worth sharing.

The human mind . . . operates by association. With one item in its grasp, it snaps instantly to the next that is suggested by the association of thoughts, in accordance with some intricate web of trails carried by the cells of the brain.

Bush understood that the trail a researcher leaves through a body of knowledge is itself a form of knowledge. The path encodes judgment: what mattered, what connected, what was worth following up. Destroy the path and you lose more than convenience. You lose the researcher's interpretation of the material.

Selection, not production

This is the hinge of the essay:

The difficulty seems to be, not so much that we publish unduly in view of the extent and variety of present-day interests, but rather that publication has been extended far beyond our present ability to make real use of the record.

The tools for creating knowledge had raced ahead. Laboratories, measurement instruments, printing presses — all scaled. The tools for finding knowledge had not. Card catalogs. Manual cross-referencing. Memory. No incremental improvement would close the gap.

Eighty years later, the gap has only widened. arXiv adds 16,000 papers a month. Search engines retrieve by keyword. Recommendation algorithms surface what is popular. Neither captures the associative, trail-building cognition Bush wanted to augment. The record exists. Making real use of it remains the unsolved problem.

Wrong and right

The microfilm was wrong. The desk was wrong. The mechanical selectors were wrong. Bush was extrapolating analog technologies that digital ones would overtake within a decade. He also imagined the memex as personal — each researcher's trails private by default, sharing by physical copy. He did not foresee a network connecting all the memexes into a shared space.

The diagnosis was exact. Knowledge retrieval is the bottleneck. Associative linking beats hierarchical filing. Trails encode thought. Selection is harder than production. These claims survive because they are claims about cognition, not about microfilm.

Neighbors
  • 📡 IT Ch.3 — channel capacity: Shannon's compression is the mathematical answer to Bush's selection problem — keep what matters, discard what doesn't
  • ⚖ Statute of Anne 1710 — next in the Commons collection: if Bush asked how we find knowledge, the Statute asked who gets to own it
  • 🧠 Cognitive Science Ch.1 — memory and cognitive architecture: the memex is a cognitive prosthetic — Bush's diagnosis of memory failure anticipates working/long-term memory distinctions in cognitive architecture
  • ⚖ Berners-Lee 1989 — the web as memex: the world wide web implements Bush's associative links — hyperlinks are trails, URLs are the index, and browsers are the memex

Bush imagined Ramanujan and Hardy finding each other through trails—two minds describing the same problem in different vocabularies, finally linked by association rather than index. jkPageLeft Manifesto imagines the same thing through embeddings: the trails are vectors now, but the problem Bush diagnosed has not changed.