Subgame perfect equilibrium

Subgame Perfect Equilibrium
Subgame Perfect Equilibrium
Solution concept in game theory
Relationship
Subset of	Nash equilibrium
Intersects with	Evolutionarily stable strategy
Significance
Proposed by	Reinhard Selten (1965)
Used for	Extensive form games
Example	Ultimatum game

Last updated February 20, 2025

In game theory, a subgame perfect equilibrium (SPE), or subgame perfect Nash equilibrium(SPNE), is a refinement of the Nash equilibrium concept, specifically designed for dynamic games where players make sequential decisions. A strategy profile is an SPE if it represents a Nash equilibrium in every possible subgame of the original game. Informally, this means that at any point in the game, the players' behavior from that point onward should represent a Nash equilibrium of the continuation game (i.e. of the subgame), no matter what happened before. This ensures that strategies are credible and rational throughout the entire game, eliminating non-credible threats.

Every finite extensive game with perfect recall (each player remembers all their previous actions and knowledge throughout the game) has a subgame perfect equilibrium.^[1] A common method for finding SPE in finite games is backward induction, where one starts by analyzing the last actions the final mover should take to maximize his/her utility and works backward. While backward induction is a common method for finding SPE in finite games, it is not always applicable to games with infinite horizons, or those with imperfect or incomplete information. In infinite horizon games, other techniques, like the one-shot deviation principle, are often used to verify SPE.

Subgame perfect equilibrium necessarily satisfies the one-shot deviation principle and is always a subset of the Nash equilibria for a given game. The ultimatum game is a classic example of a game with fewer subgame perfect equilibria than Nash equilibria.

Example

Determining the subgame perfect equilibrium by using backward induction is shown below in Figure 1. Strategies for Player 1 are given by {Up, Uq, Dp, Dq}, whereas Player 2 has the strategies among {TL, TR, BL, BR}. There are 4 subgames in this example, with 3 proper subgames.

Using the backward induction, the players will take the following actions for each subgame:

Subgame for actions p and q: Player 1 will take action p with payoff (3, 3) to maximize Player 1's payoff, so the payoff for action L becomes (3,3).
Subgame for actions L and R: Player 2 will take action L for 3 > 2, so the payoff for action D becomes (3, 3).
Subgame for actions T and B: Player 2 will take action T to maximize Player 2's payoff, so the payoff for action U becomes (1, 4).
Subgame for actions U and D: Player 1 will take action D to maximize Player 1's payoff.

Thus, the subgame perfect equilibrium is {Dp, TL} with the payoff (3, 3).

An extensive-form game with incomplete information is presented below in Figure 2. Note that the node for Player 1 with actions A and B, and all succeeding actions is a subgame. Player 2's nodes are not a subgame as they are part of the same information set.

The first normal-form game is the normal form representation of the whole extensive-form game. Based on the provided information, (UA, X), (DA, Y), and (DB, Y) are all Nash equilibria for the entire game.

The second normal-form game is the normal form representation of the subgame starting from Player 1's second node with actions A and B. For the second normal-form game, the Nash equilibrium of the subgame is (A, X).

For the entire game Nash equilibria (DA, Y) and (DB, Y) are not subgame perfect equilibria because the move of Player 2 does not constitute a Nash equilibrium. The Nash equilibrium (UA, X) is subgame perfect because it incorporates the subgame Nash equilibrium (A, X) as part of its strategy.^[2]

To solve this game, first find the Nash Equilibria by mutual best response of Subgame 1. Then use backwards induction and plug in (A,X) → (3,4) so that (3,4) become the payoffs for Subgame 2.^[2]

The dashed line indicates that player 2 does not know whether player 1 will play A or B in a simultaneous game.

Player 1 chooses U rather than D because 3 > 2 for Player 1's payoff. The resulting equilibrium is (A, X) → (3,4).

Solution of Subgame Perfect Equilibrium Subgame-Perfect-Solution.svg — Solution of Subgame Perfect Equilibrium

Thus, the subgame perfect equilibrium through backwards induction is (UA, X) with the payoff (3, 4).

Repeated games

For finitely repeated games, if a stage game has only one unique Nash equilibrium, the subgame perfect equilibrium is to play without considering past actions, treating the current subgame as a one-shot game. An example of this is a finitely repeated Prisoner's dilemma game. The Prisoner's dilemma gets its name from a situation that contains two guilty culprits. When they are interrogated, they have the option to stay quiet or defect. If both culprits stay quiet, they both serve a short sentence. If both defect, they both serve a moderate sentence. If they choose opposite options, then the culprit that defects is free and the culprit who stays quiet serves a long sentence. Ultimately, using backward induction, the last subgame in a finitely repeated Prisoner's dilemma requires players to play the unique Nash equilibrium (both players defecting). Because of this, all games prior to the last subgame will also play the Nash equilibrium to maximize their single-period payoffs.^[3] If a stage-game in a finitely repeated game has multiple Nash equilibria, subgame perfect equilibria can be constructed to play non-stage-game Nash equilibrium actions, through a "carrot and stick" structure. One player can use the one stage-game Nash equilibrium to incentivize playing the non-Nash equilibrium action, while using a stage-game Nash equilibrium with lower payoff to the other player if they choose to defect.^[4]

Finding subgame-perfect equilibria

Reinhard Selten proved that any game which can be broken into "sub-games" containing a sub-set of all the available choices in the main game will have a subgame perfect Nash Equilibrium strategy (possibly as a mixed strategy giving non-deterministic sub-game decisions). Subgame perfection is only used with games of complete information. Subgame perfection can be used with extensive form games of complete but imperfect information.

The subgame-perfect Nash equilibrium is normally deduced by "backward induction" from the various ultimate outcomes of the game, eliminating branches which would involve any player making a move that is not credible (because it is not optimal) from that node. One game in which the backward induction solution is well known is tic-tac-toe, but in theory even Go has such an optimum strategy for all players. The problem of the relationship between subgame perfection and backward induction was settled by Kaminski (2019), who proved that a generalized procedure of backward induction produces all subgame perfect equilibria in games that may have infinite length, infinite actions as each information set, and imperfect information if a condition of final support is satisfied.

The interesting aspect of the word "credible" in the preceding paragraph is that taken as a whole (disregarding the irreversibility of reaching sub-games) strategies exist which are superior to subgame perfect strategies, but which are not credible in the sense that a threat to carry them out will harm the player making the threat and prevent that combination of strategies. For instance in the game of "chicken" if one player has the option of ripping the steering wheel from their car they should always take it because it leads to a "sub game" in which their rational opponent is precluded from doing the same thing (and killing them both). The wheel-ripper will always win the game (making his opponent swerve away), and the opponent's threat to suicidally follow suit is not credible.

References

↑ Osborne, M. J. (2004). An Introduction to Game Theory. Oxford University Press.
1 2 Joel., Watson (2013-05-09). Strategy : an introduction to game theory (Third ed.). New York. ISBN 9780393918380. OCLC 842323069.{{cite book}}: CS1 maint: location missing publisher (link)
↑ Yildiz, Muhamet (2012). "12 Repeated Games". 14.12 Economic Applications of Game Theory. Massachusetts Institute of Technology: MIT OpenCourseWare. Retrieved April 27, 2021.
↑ Takako, Fujiwara-Greve (27 June 2015). Non-cooperative game theory. Tokyo. ISBN 9784431556442. OCLC 911616270.{{cite book}}: CS1 maint: location missing publisher (link)

External links

Selten, R. (1965). Spieltheoretische behandlung eines oligopolmodells mit nachfrageträgheit. Zeitschrift für die gesamte Staatswissenschaft/Journal of Institutional and Theoretical Economics, (H. 2), 301-324, 667-689. [in German - part 1, part 2]
Example of Extensive Form Games with imperfect information
Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game from gametheory.net.
Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game from gametheory.net.
Kaminski, M.M. Generalized Backward Induction: Justification for a Folk Algorithm. Games 2019, 10, 34.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Osborne2004-1] Osborne, M. J. (2004). An Introduction to Game Theory. Oxford University Press.

[:0-2] 1 2 Joel., Watson (2013-05-09). Strategy : an introduction to game theory (Third ed.). New York. ISBN 9780393918380. OCLC 842323069.{{cite book}}: CS1 maint: location missing publisher (link)

[3] Yildiz, Muhamet (2012). "12 Repeated Games". 14.12 Economic Applications of Game Theory. Massachusetts Institute of Technology: MIT OpenCourseWare. Retrieved April 27, 2021.

[4] Takako, Fujiwara-Greve (27 June 2015). Non-cooperative game theory. Tokyo. ISBN 9784431556442. OCLC 911616270.{{cite book}}: CS1 maint: location missing publisher (link)

[1]

[2]

[3]

[4]

v t e Topics of game theory
Definitions	Congestion game Cooperative game Determinacy Escalation of commitment Extensive-form game First-player and second-player win Game complexity Graphical game Hierarchy of beliefs Information set Normal-form game Preference Sequential game Simultaneous game Simultaneous action selection Solved game Succinct game Mechanism design
Equilibrium concepts	Bayes correlated equilibrium Bayesian Nash equilibrium Berge equilibrium Core Correlated equilibrium Coalition-proof Nash equilibrium Epsilon-equilibrium Evolutionarily stable strategy Gibbs equilibrium Mertens-stable equilibrium Markov perfect equilibrium Nash equilibrium Pareto efficiency Perfect Bayesian equilibrium Proper equilibrium Quantal response equilibrium Quasi-perfect equilibrium Risk dominance Satisfaction equilibrium Self-confirming equilibrium Sequential equilibrium Shapley value Strong Nash equilibrium Subgame perfection Trembling hand equilibrium
Strategies	Appeasement Backward induction Bid shading Collusion Cheap talk De-escalation Deterrence Escalation Forward induction Grim trigger Markov strategy Pairing strategy Dominant strategies Pure strategy Mixed strategy Strategy-stealing argument Tit for tat
Classes of games	Auction Bargaining problem Global game Intransitive game Mean-field game n-player game Perfect information Large Poisson game Potential game Repeated game Screening game Signaling game Strictly determined game Stochastic game Symmetric game Zero-sum game
Games	Go Chess Infinite chess Checkers All-pay auction Prisoner's dilemma Gift-exchange game Optional prisoner's dilemma Traveler's dilemma Coordination game Chicken Centipede game Lewis signaling game Volunteer's dilemma Dollar auction Battle of the sexes Stag hunt Matching pennies Ultimatum game Electronic mail game Rock paper scissors Pirate game Dictator game Public goods game Blotto game War of attrition El Farol Bar problem Fair division Fair cake-cutting Bertrand competition Cournot competition Stackelberg competition Deadlock Diner's dilemma Guess 2/3 of the average Kuhn poker Nash bargaining game Induction puzzles Trust game Princess and monster game Rendezvous problem Pursuit game
Theorems	Aumann's agreement theorem Folk theorem Minimax theorem Nash's theorem Negamax theorem Purification theorem Revelation principle Sprague–Grundy theorem Zermelo's theorem
Key figures	Albert W. Tucker Amos Tversky Antoine Augustin Cournot Ariel Rubinstein Claude Shannon Daniel Kahneman David K. Levine David M. Kreps Donald B. Gillies Drew Fudenberg Eric Maskin Harold W. Kuhn Herbert Simon Hervé Moulin John Conway Jean Tirole Jean-François Mertens Jennifer Tour Chayes John Harsanyi John Maynard Smith John Nash John von Neumann Kenneth Arrow Kenneth Binmore Leonid Hurwicz Lloyd Shapley Melvin Dresher Merrill M. Flood Olga Bondareva Oskar Morgenstern Paul Milgrom Peyton Young Reinhard Selten Robert Axelrod Robert Aumann Robert B. Wilson Roger Myerson Samuel Bowles Suzanne Scotchmer Thomas Schelling William Vickrey
Search optimizations	Alpha–beta pruning Aspiration window Principal variation search max^n algorithm Paranoid algorithm Lazy SMP
Miscellaneous	Bounded rationality Combinatorial game theory Confrontation analysis Coopetition Evolutionary game theory Glossary of game theory List of game theorists List of games in game theory No-win situation Topological game Tragedy of the commons