Integrity
The method already exists. What's missing is anyone practicing it.
Long content
Gwern Branwen doesn't write blog posts. Blog posts are "the triumph of the hare over the tortoise, meant to be read by a few people on a weekday in 2004 and never again." He writes long content: essays that improve over decades, designed for a 60-year lifespan. The intended audience is his future self, "who is intelligent and interested, but has forgotten."
Every page carries a confidence tag: certain, likely, possible, unlikely. Every page carries a completion status: notes, draft, in progress, finished. Predictions are registered publicly and scored against reality. When evidence contradicts a claim, the claim gets a correction and a link to why. He cites Darwin's method: "whenever a published fact, a new observation or thought came across me, which was opposed to my general results, to make a memorandum of it without fail and at once."
He runs randomized self-experiments on nootropics and publishes the data, including the nulls. He writes 50,000-word literature reviews and updates them when new evidence arrives. He follows reporting standards designed for clinical trials (STROBE, CONSORT, PRISMA) even though nobody requires it. Every cited URL is archived against link rot. The full version history is public: append .md to any page URL and read the source.
Everything is released CC-0. Public domain. His reasoning: "Obscurity is a far greater threat to authors than piracy." The license reduces friction, encourages copying, and costs him nothing.
No PhD. No lab. No grant. No peer review. No gatekeeper. A website and the discipline to be honest on it, for twenty years.
What he's doing that they described
Bacon said: report what you observe, not what you expect. Gwern tags every page with its confidence level and updates when the evidence changes. Bacon's tables of instances, maintained in public.
Chamberlin said: hold multiple hypotheses or your favorite will blind you. Gwern includes contradictory evidence systematically. "The only solution is to be diligent to include criticism, so even if you do not escape brainwashing, at least your readers have a chance."
Popper said: specify in advance what would disprove your theory. Gwern registers predictions on PredictionBook.com. Embedded predictions in essays provide built-in feedback: as time passes, predictions are validated or falsified. Accountability that most scientists avoid and most pundits refuse.
Feynman said: report the results that didn't work. Gwern publishes null results alongside positive ones. His treadmill desk experiment found it "distracting, chronically unpleasant." Published anyway. That's Feynman's first principle, practiced rather than preached.
Mayo said: your test doesn't count unless it could have caught you being wrong. Gwern follows clinical trial reporting standards voluntarily. His quality control checklist reruns statistical code for reproducibility. Severity as personal discipline.
He also publishes the embarrassing stuff: anime predictions, treadmill desk reviews, experiments that went nowhere. A curated trail can still be Goodharted. A complete one can't.
That this is remarkable says more about the institution than about him.
The novelty trap
Every thinker on the timeline assumed scientists want truth. The modern incentive structure wants something else: novelty. Journals reject replications. Tenure committees count publications, not corrections.
P-hacking: run twenty tests, report the one that hits p < 0.05. HARKing: write the introduction after you see the data, pretending you predicted what you found. Francesca Gino built a career at Harvard Business School on fabricated data. The fraud went undetected for years because her results were novel and her publications were prestigious. When Data Colada exposed the fabrication in 2023, the institutional response was slow, protective, conflicted.
Gino is the output of an optimization function, not the exception to it. Reward novelty, punish replication, and you get cargo cult science at industrial scale. Feynman described the disease. Ioannidis quantified it. The replication crisis is the system working as designed.
Gwern opted out. No journal. No tenure track. No incentive to p-hack. His incentive is whether his future self, re-reading the page in ten years, will find it honest. That turns out to be a better optimization function than impact factor.
Honest work sits behind paywalls or on personal websites that Google may or may not surface. PageLeft is a bet on a different structure: index the web by meaning, filter by license (copyleft only), rank by citation. A replication becomes a citation. A negative result becomes a link that says "this didn't hold." Replication builds rank.
—
june.kim
Scientia
Science was never supposed to be a career strategy. Bacon didn't tabulate instances to get tenure. Ramanujan didn't write to Hardy for a publication credit. Gwern doesn't tag his pages with confidence levels to impress an interviewer. The word itself comes from scientia: knowledge, not credential. Pursuing truth together. Not keeping it to yourself. Not using it as social leverage.
The method works when knowledge is shared freely and tested severely. It breaks the moment someone has more to gain from a novel claim than from an honest null. Chamberlin's multiple hypotheses, Popper's falsification, Mayo's severity: the method already exists. What's missing is a structure that rewards it.
The next frontier of the scientific method won't come from philosophy. It will come from architecture.
june.kim