โ† back to game theory

Repeated Prisoner's Dilemma

Nordstrom, Introduction to Game Theory ยท Section 4.4 ยท CC BY-SA 4.0

Play once: defect. Play forever: cooperate. Finite repetition unravels by backward induction: the last round has no future, so defect; but then the second-to-last round is effectively last, so defect; and so on. Infinite repetition breaks the chain. The shadow of the future makes tit-for-tat viable.

cumulative payoff over rounds 1 2 3 4 5 6 ... always defect (10) tit-for-tat (18) payoffs diverge cumulative payoff

Finite repetition: backward induction unravels

If both players know the game ends after round N, work backward. In round N there is no future punishment, so both defect. But then round N-1 is effectively the last real round, so both defect there too. The logic cascades all the way to round 1. Finite repetition gives the same result as playing once.

Scheme

Infinite repetition: tit-for-tat cooperates

When the game has no known endpoint, the future matters. Tit-for-tat: cooperate on round 1, then copy your opponent's last move. It is responsive (punishes defection), nice (starts cooperative), and forgiving (returns to cooperation). Axelrod's tournaments showed it outperforms pure strategies. Tit-for-tat is a closed feedback loop: perceive the opponent's last move, remember it, act on it. This is the same structure that makes jkgeneral intelligence work -- consolidation over repeated interactions.

Scheme

The shadow of the future

Cooperation holds when the expected value of future cooperation exceeds the one-time gain from defecting. With a discount factor d (probability the game continues), cooperation via tit-for-tat is sustainable when the present value of cooperation outweighs the temptation to defect. As d approaches 1, the future looms large and cooperation stabilizes.

Scheme

Notation reference

Term Scheme Meaning
Tit-for-tat(tit-for-tat r hist)Cooperate first, then mirror opponent
Discount factor d(coop-value d)Probability game continues next round
Backward induction(finite-game rounds)Solve from the last round backward
Shadow of the future(cooperation-holds? d)Future payoff makes cooperation rational
Neighbors

Game theory foundations

Paper connections

  • ๐Ÿž Hedges 2018 โ€” compositional game theory makes repeated games composable via sequential composition of open games

Foundations (Wikipedia)

Translation notes

The payoff matrix here uses Nordstrom's convention (CC=3, CD=0, DC=5, DD=1), which is the standard form from Axelrod's tournaments. The discount factor analysis simplifies the folk theorem to its core insight: cooperation in repeated games requires d above a threshold determined by the payoff ratios. Axelrod's three properties of successful strategies (responsive, nice, unexploitable) are demonstrated by the tit-for-tat implementation.

← Volunteer's Dilemma fin ยท 14 of 14