Repeated Prisoner's Dilemma
Nordstrom, Introduction to Game Theory ยท Section 4.4 ยท CC BY-SA 4.0
Play once: defect. Play forever: cooperate. Finite repetition unravels by backward induction: the last round has no future, so defect; but then the second-to-last round is effectively last, so defect; and so on. Infinite repetition breaks the chain. The shadow of the future makes tit-for-tat viable.
Finite repetition: backward induction unravels
If both players know the game ends after round N, work backward. In round N there is no future punishment, so both defect. But then round N-1 is effectively the last real round, so both defect there too. The logic cascades all the way to round 1. Finite repetition gives the same result as playing once.
Infinite repetition: tit-for-tat cooperates
When the game has no known endpoint, the future matters. Tit-for-tat: cooperate on round 1, then copy your opponent's last move. It is responsive (punishes defection), nice (starts cooperative), and forgiving (returns to cooperation). Axelrod's tournaments showed it outperforms pure strategies. Tit-for-tat is a closed feedback loop: perceive the opponent's last move, remember it, act on it. This is the same structure that makes
general intelligence work -- consolidation over repeated interactions.
The shadow of the future
Cooperation holds when the expected value of future cooperation exceeds the one-time gain from defecting. With a discount factor d (probability the game continues), cooperation via tit-for-tat is sustainable when the present value of cooperation outweighs the temptation to defect. As d approaches 1, the future looms large and cooperation stabilizes.
Notation reference
| Term | Scheme | Meaning |
|---|---|---|
| Tit-for-tat | (tit-for-tat r hist) | Cooperate first, then mirror opponent |
| Discount factor d | (coop-value d) | Probability game continues next round |
| Backward induction | (finite-game rounds) | Solve from the last round backward |
| Shadow of the future | (cooperation-holds? d) | Future payoff makes cooperation rational |
Neighbors
Game theory foundations
- ๐ฒ Nordstrom 12 โ Prisoner's Dilemma and Chicken (the one-shot case)
- ๐ฒ Nordstrom 13 โ Volunteer's Dilemma
Paper connections
- ๐ Hedges 2018 โ compositional game theory makes repeated games composable via sequential composition of open games
Foundations (Wikipedia)
Translation notes
The payoff matrix here uses Nordstrom's convention (CC=3, CD=0, DC=5, DD=1), which is the standard form from Axelrod's tournaments. The discount factor analysis simplifies the folk theorem to its core insight: cooperation in repeated games requires d above a threshold determined by the payoff ratios. Axelrod's three properties of successful strategies (responsive, nice, unexploitable) are demonstrated by the tit-for-tat implementation.