Divine equilibrium

Last updated

The Divinity Criterion or Divine Equilibrium or Universal Divinity is a refinement of Perfect Bayesian equilibrium in a signaling game proposed by Banks and Sobel (1987). [1] One of the most widely applied refinement is the D1-Criterion.

It is a restriction of receiver's beliefs to the type of senders for whom deviating towards an off-the-equilibrium message could improve their outcome compared to the equilibrium payoff. [2] In addition to the restriction suggested by the Intuitive Criterion, the Divinity Criterion considers only those types which are most likely to send the off-the-equilibrium message.

If more than one sender could benefit from the deviation, the Intuition Criterion assigns equal probabilities for all the senders, whereas the D1-Criterion considers different probabilities.

Example

The following example is adapted from the original Banks and Sobel (1987).

Consider a case of sequential settlement. The defendant (sender of messages) has two types: t1 and t2. Type t1 is not negligent and type t2 is negligent. The defendant can offer a high settlement or low settlement (messages) to the plaintiff. The plaintiff (receiver of messages) can either accept or reject the settlement offer. The payoff is illustrated below.

Low settlement
plaintiff

defendant
Accept offerReject offer
t1-3,3-6,0
t2-3,3-11,5
High settlement
plaintiff

defendant
Accept offerReject offer
t1-5,5-6,0
t2-5,5-11,5

Here the payoff shows that if the plaintiff accepts the high settlement, the money transfer is larger (5 instead of 3 in the low settlement case). If the plaintiff rejects the offer, then the negligent type t2 receives a higher penalty from the court (-11 comparing with -6 for type t1). However, since the plaintiff does not know the type, he is better off to accept the settlement when the defendant is not negligent.

Consider the prior probably to be half and half, meaning that before seeing the settlement offer the plaintiff believes that with 50 percent of the probability that the defendant is negligent. In this case, the game has two pooling equilibria. The first equilibrium (E1) is that both types of defendant choose a low settlement, and the plaintiff accepts the offer. The second equilibrium (E2) is that both types of defendant choose a high settlement, and the plaintiff accepts the offer. However, in order to support these equilibria, one also has to specify what are the beliefs if the plaintiff sees a different type of offer (commonly referred as off-equilibrium messages).

From the payoff matrix we can see that under E1, the defendant is already getting his highest possible payoff (-3). Thus, regardless of what the plaintiff thinks, the defendant does not have incentive to deviate. This reasoning is not true if the equilibrium is E2. The idea of equilibrium refinement is indeed to provide a reasonable argument to select the most "intuitive" outcome.

To see how the refinement works, we first need to check what kind of beliefs can support E2. In this case, in order for the defendant not to have incentive to deviate to low settlement, the plaintiff has to respond with "Reject" if he sees a low settlement, which translates into that the plaintiff has to have a belief that the defendant is negligent (type t2) with probability larger than 60 percent (if this is true, then "accept" gives the plaintiff a payoff of 3, and "reject" gives the plaintiff a payoff of slightly more than 3=5*60%). Implicitly, the belief says that type t2 is more likely to deviate than t1. The D1 criterion (or the more advanced version of divinity) is built on the idea of deciding which type is actually more likely to deviate.

If we look again at the payoff matrix under the low settlement case, then regardless of how the plaintiff assigns his probability of choosing "accept" or "reject", the payoff for type t1 is higher than type t2 (for example, if the plaintiff plays a strategy of choosing equally between "accept" and "reject", then type t1 gets -4.5, whereas type t2 gets -7). Thus, comparing with the equilibrium payoff of -5 under E2, one can argue that whenever t2 wants to deviate, type t1 also wants to deviate. Thus, a reasonable belief should assign a higher probability to type t1. The D1 criterion pushes this type of reasoning to the extreme and requires that the plaintiff believes that the deviation (if observed) should come from t1. As a result, E2 is not plausible because it contradicts with the refinement.

Related Research Articles

In game theory, the Nash equilibrium, named after the mathematician John Forbes Nash Jr., is the most common way to define the solution of a non-cooperative game involving two or more players. In a Nash equilibrium, each player is assumed to know the equilibrium strategies of the other players, and no player has anything to gain by changing only their own strategy. The principle of Nash equilibrium dates back to the time of Cournot, who in 1838 applied it to competing firms choosing outputs.

A coordination game is a type of simultaneous game found in game theory. It describes the situation where a player will earn a higher payoff when he selects the same course of action as another player. The game is not one of pure conflict, which results in multiple pure strategy Nash equilibria in which players choose matching strategies. Figure 1 shows a 2-player example.

Matching pennies is the name for a simple game used in game theory. It is played between two players, Even and Odd. Each player has a penny and must secretly turn the penny to heads or tails. The players then reveal their choices simultaneously. If the pennies match, then Even keeps both pennies, so wins one from Odd. If the pennies do not match Odd keeps both pennies, so receives one from Even.

In game theory, cheap talk is communication between players that does not directly affect the payoffs of the game. Providing and receiving information is free. This is in contrast to signaling in which sending certain messages may be costly for the sender depending on the state of the world.

Signaling game

In game theory, a signaling game is a simple type of a dynamic Bayesian game.

In game theory, a player's strategy is any of the options which they choose in a setting where the outcome depends not only on their own actions but on the actions of others. The discipline mainly concerns the action of a player in a game affecting the behavior or actions of other players. Some examples of "games" include chess, bridge, poker, monopoly, diplomacy or battleship. A player's strategy will determine the action which the player will take at any stage of the game. In studying game theory, economists enlist a more rational lens in analyzing decisions rather than the psychological or sociological perspectives taken when analyzing relationships between decisions of two or more parties in different disciplines.

Solution concept

In game theory, a solution concept is a formal rule for predicting how a game will be played. These predictions are called "solutions", and describe which strategies will be adopted by players and, therefore, the result of the game. The most commonly used solution concepts are equilibrium concepts, most famously Nash equilibrium.

An extensive-form game is a specification of a game in game theory, allowing for the explicit representation of a number of key aspects, like the sequencing of players' possible moves, their choices at every decision point, the information each player has about the other player's moves when they make a decision, and their payoffs for all possible game outcomes. Extensive-form games also allow for the representation of incomplete information in the form of chance events modeled as "moves by nature".

In game theory, a Perfect Bayesian Equilibrium (PBE) is an equilibrium concept relevant for dynamic games with incomplete information. It is a refinement of Bayesian Nash equilibrium (BNE). A perfect Bayesian equilibrium has two components -- strategies and beliefs:

In game theory, a Bayesian game is a game that models the outcome of player interactions using aspects of Bayesian probability. Bayesian games are notable because they allowed, for the first time in game theory, for the specification of the solutions to games with incomplete information.

Backward induction is the process of reasoning backwards in time, from the end of a problem or situation, to determine a sequence of optimal actions. It proceeds by examining the last point at which a decision is to be made and then identifying what action would be most optimal at that moment. Using this information, one can then determine what to do at the second-to-last time of decision. This process continues backwards until one has determined the best action for every possible situation at every point in time. Backward induction was first used in 1875 by Arthur Cayley, who uncovered the method while trying to solve the infamous Secretary problem.

In game theory, trembling hand perfect equilibrium is a refinement of Nash equilibrium due to Reinhard Selten. A trembling hand perfect equilibrium is an equilibrium that takes the possibility of off-the-equilibrium play into account by assuming that the players, through a "slip of the hand" or tremble, may choose unintended strategies, albeit with negligible probability.

In game theory, folk theorems are a class of theorems describing an abundance of Nash equilibrium payoff profiles in repeated games. The original Folk Theorem concerned the payoffs of all the Nash equilibria of an infinitely repeated game. This result was called the Folk Theorem because it was widely known among game theorists in the 1950s, even though no one had published it. Friedman's (1971) Theorem concerns the payoffs of certain subgame-perfect Nash equilibria (SPE) of an infinitely repeated game, and so strengthens the original Folk Theorem by using a stronger equilibrium concept: subgame-perfect Nash equilibria rather than Nash equilibria.

In game theory, a correlated equilibrium is a solution concept that is more general than the well known Nash equilibrium. It was first discussed by mathematician Robert Aumann in 1974. The idea is that each player chooses their action according to their private observation of the value of the same public signal. A strategy assigns an action to every possible observation a player can make. If no player would want to deviate from their strategy, the distribution from which the signals are drawn is called a correlated equilibrium.

Quantal response equilibrium (QRE) is a solution concept in game theory. First introduced by Richard McKelvey and Thomas Palfrey, it provides an equilibrium notion with bounded rationality. QRE is not an equilibrium refinement, and it can give significantly different results from Nash equilibrium. QRE is only defined for games with discrete strategies, although there are continuous-strategy analogues.

Risk dominance and payoff dominance are two related refinements of the Nash equilibrium (NE) solution concept in game theory, defined by John Harsanyi and Reinhard Selten. A Nash equilibrium is considered payoff dominant if it is Pareto superior to all other Nash equilibria in the game. When faced with a choice among equilibria, all players would agree on the payoff dominant equilibrium since it offers to each player at least as much payoff as the other Nash equilibria. Conversely, a Nash equilibrium is considered risk dominant if it has the largest basin of attraction. This implies that the more uncertainty players have about the actions of the other player(s), the more likely they will choose the strategy corresponding to it.

Proper equilibrium is a refinement of Nash Equilibrium due to Roger B. Myerson. Proper equilibrium further refines Reinhard Selten's notion of a trembling hand perfect equilibrium by assuming that more costly trembles are made with significantly smaller probability than less costly ones.

In game theory, a stochastic game, introduced by Lloyd Shapley in the early 1950s, is a repeated game with probabilistic transitions played by one or more players. The game is played in a sequence of stages. At the beginning of each stage the game is in some state. The players select actions and each player receives a payoff that depends on the current state and the chosen actions. The game then moves to a new random state whose distribution depends on the previous state and the actions chosen by the players. The procedure is repeated at the new state and play continues for a finite or infinite number of stages. The total payoff to a player is often taken to be the discounted sum of the stage payoffs or the limit inferior of the averages of the stage payoffs.

The intuitive criterion is a technique for equilibrium refinement in signaling games. It aims to reduce possible outcome scenarios by restricting the possible sender types to types who could obtain higher utility levels by deviating to off-the-equilibrium messages, and to types for which the off-the-equilibrium message is not equilibrium dominated.

Behavioral game theory analyzes interactive strategic decisions and behavior using the methods of game theory, experimental economics, and experimental psychology. Experiments include testing deviations from typical simplifications of economic theory such as the independence axiom and neglect of altruism, fairness, and framing effects. As a research program, the subject is a development of the last three decades.

References

  1. Banks, Jeffrey, and Joel Sobel, 1987, "Equilibrium selection in signaling games", Econometrica, 55(3), 647-661.
  2. The Intuitive and Divinity Criterion: Interpretation and Step-by-Step Examples Felix Munoz-Garcia, Ana Espinola-Arredondo, Journal of Industrial Organization Education. Volume 5, Issue 1, Pages 1–20, ISSN (Online) 1935-5041, DOI: 10.2202/1935-5041.1024, March 2011