Cognitive hierarchy theory

Last updated

Cognitive hierarchy theory (CHT) is a behavioral model originating in behavioral economics and game theory that attempts to describe human thought processes in strategic games. CHT aims to improve upon the accuracy of predictions made by standard analytic methods (including backwards induction and iterated elimination of dominated strategies), which can deviate considerably from actual experimental outcomes.

Contents

The Level-k Framework

Level-k theory is a competing theory to Cognitive Hierarchy Theory [1] but is similar to Cognitive Hierarchy Theory in the sense that player types are drawn from a hierarchy of levels of iterated rationalizability.

The hierarchy begins with some very naive type. This completely non-strategic "level-zero" player will choose actions without regard to the actions of other players. Such a player is said to have zero-order beliefs.

A one level higher sophisticated type believe the population consists of all naive types. This slightly more sophisticated (the level one) player believes that the other players will act non-strategically; his or her action will be the best response consistent with those first-order beliefs.

The next level believes the population consists of the first level. This more sophisticated (level two) player acts on the belief that the other players are level one. This pattern continues for higher-level players, but each player has only a finite depth of reasoning, meaning that individual players have a limit to the depth to which they can reason strategically.

Econometrically, [2] a Mixture Model is typically used to identify subpopulations. Within each subpopulation, deviation from the prescribed action for the type can be captured either as computation errors or as within-type heterogeneity in beliefs. [3]

Level-k theory assumes that players in strategic games base their decisions on their predictions about the likely actions of other players. According to level-k, players in strategic games can be categorized by the "depth" of their strategic thought. [4] It is thus heavily focused on bounded rationality.

In its basic form, level-k theory implies that each player believes that they are the most sophisticated person in the game. Players at some level k will neglect the fact that other players could also be level-k, or even higher. This has been attributed to many factors, such as "maintenance costs" or simply overconfidence. [5]

The Cognitive Hierarchy Framework

Some theorists [6] [5] have noted that players do not necessarily fall under the archetypes above. Instead, a player can act under the assumption that some percentage of the population fits each archetype, and act accordingly to find the best response. For example, in the Keynesian Beauty Contest described below, a player might believe that half the players are level-zero, and half are level-one. This player would select a number about halfway between the guesses of the archetypal level-one and level-two players. It is also argued that if the players are able to believe that there are others that can do the same level of reasoning, leading to an inclusive cognitive hierarchy, the framework could be helpful in capturing behavior games (e.g., expansive games) that are not dominance-solvable. [7]

Example: The Keynesian beauty contest

In the Keynesian beauty contest, participants are asked to choose a number that will be as close as possible to some fraction of the average of all participants' guesses. Suppose there are many players, each attempting to guess ½ of the average from the range 1-100.

A level zero player will select a number non-strategically. That number might be selected at random, or may have special significance to the player (in which case it is indistinguishable from a random number by other players).

A level one player will choose the number consistent with the belief that all other players are level zero. If all other players in the game are level zero, the average of those guesses would be about 50. Therefore, a level one player will choose 25.

A level two player will choose the number consistent with the belief that all other players are level one. Since a level one player will choose 25, a level two player should choose 13. This process repeats for higher-level players.

Example: The centipede game

In the centipede game, two players take turns choosing either to expand a slowly increasing pot, or to end the game and keep a larger fraction of the pot. In this example, the players are Alice and Bob. Alice chooses first, and also has the highest reward if Bob chooses to expand the pot on the final round.

If Alice is non-strategic (level zero), she will compare the payoffs at each possible endpoint of the game and note that her highest reward results from Bob expanding the pot on the final round. Alice will thus choose to expand the pot at every turn.

If Alice is level one, she will correctly identify her optimal outcome. However, she will also note that this outcome is not feasible because Bob's optimal outcome results from him ending the game on his last turn, rather than expanding the pot. As a result, she will choose to end the game on her last round rather than expanding the pot.

If Alice is level two, she will predict that Bob expects her to end the game on her last round, and will try to end the game just before she does. As a result, Alice will choose to end the game on her second to last round.

Comparison to standard theory and experimental evidence

Theories of behavior often assume that players think strategically, meaning that players will base their actions on the probable decisions of other players in a way that will serve their objectives. However, many games, both real and contrived, do not result in the equilibrium predicted by standard analytic methods.

The standard solution to the Keynesian Beauty Contest is determined by iterated elimination of dominated strategies. Using the example above, a fully rational player will observe that the most the number could be is 50. This player will also predict that the other players know that as well and will behave accordingly, so the maximum feasible number becomes 25. But, again, other players should know that, too. This process repeats indefinitely, and concludes with all players selecting 0, the Nash equilibrium for this game.

This solution is inconsistent with experimental evidence, which finds that most players choose numbers around either 25 or 13. These guesses are consistent with first- and second-order depth of reasoning, supporting CHT. A small proportion of players exhibit depths of reasoning greater than second order. [6] [4]

The standard solution to the centipede game is determined by backward induction. According to this method, if Bob reaches his final decision, he will prefer to keep a larger share of a smaller pot to the smaller share of a larger pot, so he will end the game instead of expanding the pot. Alice knows that Bob will end the game on his last move, so she decides to end the game one step before then. However, Bob knows that Alice will end early, so he decides to end just before she does. This process repeats until Alice is confronted with her decision on the first round; knowing that Bob will end the game at the first opportunity, Alice ends the game on the first round, and they walk home with the smallest possible total payoff.

Thus, standard analytic methods predict that all players will defect as soon as they have the opportunity, despite the higher payoffs that would accrue to more cooperative play. In actual experimental settings, however, cooperative behavior is observed, but only for a limited number of rounds. While the benefits to cooperation persist (and in fact grow), most games end prematurely, with the defection of a player who had previously been cooperative.

Comparison to alternative models

Many alternative models have been proposed to explain the discrepancies between standard theory and experimental results. For example, the temporary cooperation in the centipede game has been ascribed to altruism and either error or the anticipation of errors by players. In the case of altruism, a player opposed by an altruist will cooperate temporarily to increase the size of the payoff, with the intention of defecting later. In the case of error, a player does not appreciate the vulnerabilities created by cooperative play. If a player anticipates that the opponent is prone to making such errors, it will be in that player's interest to cooperate until just before the opponent recognizes the error.

While these alternative explanations are descriptive and plausible, they are also non-predictive and non-falsifiable, which limits their usefulness as behavioral models. They are also speculative: given an observation that deviates from a prediction, economists are unable to distinguish between errors, social preferences, intentional strategies, or other causes.

Cognitive Hierarchy Theory explains the observed pattern of opportunistic cooperation found in many games, without being susceptible to speculation about players' traits, such as intelligence or motivations. In the centipede game, the eventual defection of most players signifies that most players are strategic and non-altruistic. This suggests that players cooperate on a temporary basis because they are seeking their own self-interest, and only cooperate as long as they expect it to serve them, suggesting that CHT describes human behavior better than these alternatives. Furthermore, because researchers are able to preserve the common assumption that players are self-interested, CHT can be incorporated into existing models rather than replacing them outright.

CHT can offer reasonably accurate predictions about human behavior while acknowledging stronger forms of bounded rationality and opportunism than standard theory. Unlike methods such as backwards induction, it does not assume that players possess an unrealistically developed ability to process information, especially under conditions of uncertainty, dependence on other players, and time constraints. Furthermore, by incorporating stronger assumptions of opportunism, it is able to explain why a player will cooperate and then defect, instead of consistent cooperation or defection.

Related Research Articles

Game theory is the study of mathematical models of strategic interactions among rational agents. It has applications in many fields of social science, used extensively in economics as well as in logic, systems science and computer science. Initially game theory addressed two-person zero-sum games, in which a participant's gains or losses are exactly balanced by the losses and gains of the other participant. In the 1950’s it was extended to the study of non zero-sum games and was eventually game applied to a wide range of behavioral relations, and is now an umbrella term for the science of rational decision making in humans, animals, as well as computers.

The prisoner's dilemma is a game theory thought experiment that involves two rational agents, each of whom can cooperate for mutual benefit or betray their partner ("defect") for individual reward. This dilemma was originally framed by Merrill Flood and Melvin Dresher in 1950 while they worked at the RAND Corporation. Albert W. Tucker later formalized the game by structuring the rewards in terms of prison sentences and named it the "prisoner's dilemma".

In game theory, the Nash equilibrium is the most commonly-used solution concept for non-cooperative games. A Nash equilibrium is a situation where no player could gain by changing their own strategy. The idea of Nash equilibrium dates back to the time of Cournot, who in 1838 applied it to his model of competition in an oligopoly.

In game theory, the centipede game, first introduced by Robert Rosenthal in 1981, is an extensive form game in which two players take turns choosing either to take a slightly larger share of an increasing pot, or to pass the pot to the other player. The payoffs are arranged so that if one passes the pot to one's opponent and the opponent takes the pot on the next round, one receives slightly less than if one had taken the pot on this round, but after an additional switch the potential payoff will be higher. Therefore, although at each round a player has an incentive to take the pot, it would be better for them to wait. Although the traditional centipede game had a limit of 100 rounds, any game with this structure but a different number of rounds is called a centipede game.

Matching pennies is a non-cooperative game studied in game theory. It is played between two players, Even and Odd. Each player has a penny and must secretly turn the penny to heads or tails. The players then reveal their choices simultaneously. If the pennies match, then Even wins and keeps both pennies. If the pennies do not match, then Odd wins and keeps both pennies.

A non-cooperative game is a form of game under the topic of game theory. Non-cooperative games are used in situations where there are competition between the players of the game. In this model, there are no external rules that enforces the cooperation of the players therefore it is typically used to model a competitive environment. This is stated in various accounts most prominent being John Nash's paper.

The dictator game is a popular experimental instrument in social psychology and economics, a derivative of the ultimatum game. The term "game" is a misnomer because it captures a decision by a single player: to send money to another or not. Thus, the dictator has the most power and holds the preferred position in this “game.” Although the “dictator” has the most power and presents a take it or leave it offer, the game has mixed results based on different behavioral attributes. The results – where most "dictators" choose to send money – evidence the role of fairness and norms in economic behavior, and undermine the assumption of narrow self-interest when given the opportunity to maximise one's own profits.

<span class="mw-page-title-main">Public goods game</span> Experimental economics game

The public goods game is a standard of experimental economics. In the basic game, subjects secretly choose how many of their private tokens to put into a public pot. The tokens in this pot are multiplied by a factor and this "public good" payoff is evenly divided among players. Each subject also keeps the tokens they do not contribute.

Backward induction is the process of determining a sequence of optimal choices by reasoning from the end point of a problem or situation back to its beginning via individual events or actions. Backward induction involves examining the final point in a series of decisions and identifying the most optimal process or action required to arrive at that point. This process continues backward until the best action for every possible point along the sequence is determined. Backward induction was first utilized in 1875 by Arthur Cayley, who discovered the method while attempting to solve the secretary problem.

In game theory, a focal point is a solution that people tend to choose by default in the absence of communication in order to avoid coordination failure. The concept was introduced by the American economist Thomas Schelling in his book The Strategy of Conflict (1960). Schelling states that "[p]eople can often concert their intentions or expectations with others if each knows that the other is trying to do the same" in a cooperative situation, so their action would converge on a focal point which has some kind of prominence compared with the environment. However, the conspicuousness of the focal point depends on time, place and people themselves. It may not be a definite solution.

The chain store paradox is an apparent game theory paradox describing the decisions a chain store might make, where a "deterrence strategy" appears optimal instead of the backward induction strategy of standard game theory reasoning.

In game theory, "guess 2/3 of the average" is a game that explores how a player’s strategic reasoning process takes into account the mental process of others in the game.

Quantal response equilibrium (QRE) is a solution concept in game theory. First introduced by Richard McKelvey and Thomas Palfrey, it provides an equilibrium notion with bounded rationality. QRE is not an equilibrium refinement, and it can give significantly different results from Nash equilibrium. QRE is only defined for games with discrete strategies, although there are continuous-strategy analogues.

In game theory, the traveler's dilemma is a non-zero-sum game in which each player proposes a payoff. The lower of the two proposals wins; the lowball player receives the lowball payoff plus a small bonus, and the highball player receives the same lowball payoff, minus a small penalty. Surprisingly, the Nash equilibrium is for both players to aggressively lowball. The traveler's dilemma is notable in that naive play appears to outperform the Nash equilibrium; this apparent paradox also appears in the centipede game and the finitely-iterated prisoner's dilemma.

Strong reciprocity is an area of research in behavioral economics, evolutionary psychology, and evolutionary anthropology on the predisposition to cooperate even when there is no apparent benefit in doing so. This topic is particularly interesting to those studying the evolution of cooperation, as these behaviors seem to be in contradiction with predictions made by many models of cooperation. In response, current work on strong reciprocity is focused on developing evolutionary models which can account for this behavior. Critics of strong reciprocity argue that it is an artifact of lab experiments and does not reflect cooperative behavior in the real world.

<span class="mw-page-title-main">Simultaneous game</span>

In game theory, a simultaneous game or static game is a game where each player chooses their action without knowledge of the actions chosen by other players. Simultaneous games contrast with sequential games, which are played by the players taking turns. In other words, both players normally act at the same time in a simultaneous game. Even if the players do not act at the same time, both players are uninformed of each other's move while making their decisions. Normal form representations are usually used for simultaneous games. Given a continuous game, players will have different information sets if the game is simultaneous than if it is sequential because they have less information to act on at each step in the game. For example, in a two player continuous game that is sequential, the second player can act in response to the action taken by the first player. However, this is not possible in a simultaneous game where both players act at the same time.

Social preferences describe the human tendency to not only care about one's own material payoff, but also the reference group's payoff or/and the intention that leads to the payoff. Social preferences are studied extensively in behavioral and experimental economics and social psychology. Types of social preferences include altruism, fairness, reciprocity, and inequity aversion. The field of economics originally assumed that humans were rational economic actors, and as it became apparent that this was not the case, the field began to change. The research of social preferences in economics started with lab experiments in 1980, where experimental economists found subjects' behavior deviated systematically from self-interest behavior in economic games such as ultimatum game and dictator game. These experimental findings then inspired various new economic models to characterize agent's altruism, fairness and reciprocity concern between 1990 and 2010. More recently, there are growing amounts of field experiments that study the shaping of social preference and its applications throughout society.

Behavioral game theory seeks to examine how people's strategic decision-making behavior is shaped by social preferences, social utility and other psychological factors. Behavioral game theory analyzes interactive strategic decisions and behavior using the methods of game theory, experimental economics, and experimental psychology. Experiments include testing deviations from typical simplifications of economic theory such as the independence axiom and neglect of altruism, fairness, and framing effects. As a research program, the subject is a development of the last three decades.

Subjective expected relative similarity (SERS) is a normative and descriptive theory that predicts and explains cooperation levels in a family of games termed Similarity Sensitive Games (SSG), among them the well-known Prisoner's Dilemma game (PD). SERS was originally developed in order to (i) provide a new rational solution to the PD game and (ii) to predict human behavior in single-step PD games. It was further developed to account for: (i) repeated PD games, (ii) evolutionary perspectives and, as mentioned above, (iii) the SSG subgroup of 2×2 games. SERS predicts that individuals cooperate whenever their subjectively perceived similarity with their opponent exceeds a situational index derived from the game's payoffs, termed the similarity threshold of the game. SERS proposes a solution to the rational paradox associated with the single step PD and provides accurate behavioral predictions. The theory was developed by Prof. Ilan Fischer at the University of Haifa.

References

  1. Stahl, D. O. (1993). Evolution of Smartn Players. Games and Economic Behavior, 5(4), 604-617.
  2. Stahl II, D. O., & Wilson, P. W. (1994). Experimental evidence on players' models of other players. Journal of Economic Behavior & Organization, 25(3), 309-327.
  3. Ernan Haruvy, Dale O. Stahl, & Paul W. Wilson (2001). Modeling and testing for heterogeneity in observed strategic behavior. Review of Economics and Statistics, 83(1), 146-157
  4. 1 2 Nagel, Rosemarie. "Unraveling in Guessing Games: An Experimental Study". The American Economic Review , Vol. 85, Issue 5. December 1995
  5. 1 2 Stahl, Dale and Wilson, Paul. "On Players' Models of Other Players: Theory and Experimental Evidence". Games and Economic Behavior . 10, 1995
  6. 1 2 Camerer, Colin F., Teck-Hua Ho and Juin-Kuan Chong. "A Cognitive Hierarchy Model of Games". The Quarterly Journal of Economics , August 2004
  7. Koriyama, Yukio; Ozkes, Ali I. (June 2021). "Inclusive Cognitive Hierarchy". Journal of Economic Behavior and Organization . 186 (1): 458. doi: 10.1016/j.jebo.2021.04.016 .

See also