Stackelberg competition

Last updated

The Stackelberg leadership model is a strategic game in economics in which the leader firm moves first and then the follower firms move sequentially (hence, it is sometimes described as the "leader-follower game"). It is named after the German economist Heinrich Freiherr von Stackelberg who published Marktform und Gleichgewicht [Market Structure and Equilibrium] in 1934, which described the model. In game theory terms, the players of this game are a leader and a follower and they compete on quantity. The Stackelberg leader is sometimes referred to as the Market Leader.

Contents

There are some further constraints upon the sustaining of a Stackelberg equilibrium. The leader must know ex ante that the follower observes its action. The follower must have no means of committing to a future non-Stackelberg leader's action and the leader must know this. Indeed, if the 'follower' could commit to a Stackelberg leader action and the 'leader' knew this, the leader's best response would be to play a Stackelberg follower action.

Firms may engage in Stackelberg competition if one has some sort of advantage enabling it to move first. More generally, the leader must have commitment power. Moving observably first is the most obvious means of commitment: once the leader has made its move, it cannot undo it—it is committed to that action. Moving first may be possible if the leader was the incumbent monopoly of the industry and the follower is a new entrant. Holding excess capacity is another means of commitment.

Subgame perfect Nash equilibrium

The Stackelberg model can be solved to find the subgame perfect Nash equilibrium or equilibria (SPNE), i.e. the strategy profile that serves best each player, given the strategies of the other player and that entails every player playing in a Nash equilibrium in every subgame.

In very general terms, let the price function for the (duopoly) industry be ; price is simply a function of total (industry) output, so is where the subscript represents the leader and represents the follower. Suppose firm has the cost structure . The model is solved by backward induction. The leader considers what the best response of the follower is, i.e. how it will respond once it has observed the quantity of the leader. The leader then picks a quantity that maximises its payoff, anticipating the predicted response of the follower. The follower actually observes this and in equilibrium picks the expected quantity as a response.

To calculate the SPNE, the best response functions of the follower must first be calculated (calculation moves 'backwards' because of backward induction).

The profit of firm (the follower) is revenue minus cost. Revenue is the product of price and quantity and cost is given by the firm's cost structure, so profit is: . The best response is to find the value of that maximises given , i.e. given the output of the leader (firm ), the output that maximises the follower's profit is found. Hence, the maximum of with respect to is to be found. First differentiate with respect to :

Setting this to zero for maximisation:

The values of that satisfy this equation are the best responses. Now the best response function of the leader is considered. This function is calculated by considering the follower's output as a function of the leader's output, as just computed.

The profit of firm (the leader) is , where is the follower's quantity as a function of the leader's quantity, namely the function calculated above. The best response is to find the value of that maximises given , i.e. given the best response function of the follower (firm ), the output that maximises the leader's profit is found. Hence, the maximum of with respect to is to be found. First, differentiate with respect to :

Setting this to zero for maximisation:

Examples

The following example is very general. It assumes a generalised linear demand structure

and imposes some restrictions on cost structures for simplicity's sake so the problem can be resolved.

and

for ease of computation.

The follower's profit is:

The maximisation problem resolves to (from the general case):

Consider the leader's problem:

Substituting for from the follower's problem:

The maximisation problem resolves to (from the general case):

Now solving for yields , the leader's optimal action:

This is the leader's best response to the reaction of the follower in equilibrium. The follower's actual can now be found by feeding this into its reaction function calculated earlier:

The Nash equilibria are all . It is clear (if marginal costs are assumed to be zero – i.e. cost is essentially ignored) that the leader has a significant advantage. Intuitively, if the leader was no better off than the follower, it would simply adopt a Cournot competition strategy.

Plugging the follower's quantity , back into the leader's best response function will not yield . This is because once leader has committed to an output and observed the followers it always wants to reduce its output ex-post. However its inability to do so is what allows it to receive higher profits than under Cournot.

Economic analysis

An extensive-form representation is often used to analyze the Stackelberg leader-follower model. Also referred to as a “decision tree”, the model shows the combination of outputs and payoffs both firms have in the Stackelberg game.

A Stackelberg game represented in extensive form Extensive form game 4.JPG
A Stackelberg game represented in extensive form

The image on the left depicts in extensive form a Stackelberg game. The payoffs are shown on the right. This example is fairly simple. There is a basic cost structure involving only marginal cost (there is no fixed cost). The demand function is linear and price elasticity of demand is 1. However, it illustrates the leader's advantage.

The follower wants to choose to maximise its payoff . Taking the first order derivative and equating it to zero (for maximisation) yields as the maximum value of .

The leader wants to choose to maximise its payoff . However, in equilibrium, it knows the follower will choose as above. So in fact the leader wants to maximise its payoff (by substituting for the follower's best response function). By differentiation, the maximum payoff is given by . Feeding this into the follower's best response function yields . Suppose marginal costs were equal for the firms (so the leader has no market advantage other than first move) and in particular . The leader would produce 2000 and the follower would produce 1000. This would give the leader a profit (payoff) of two million and the follower a profit of one million. Simply by moving first, the leader has accrued twice the profit of the follower. However, Cournot profits here are 1.78 million apiece (strictly, apiece), so the leader has not gained much, but the follower has lost. However, this is example-specific. There may be cases where a Stackelberg leader has huge gains beyond Cournot profit that approach monopoly profits (for example, if the leader also had a large cost structure advantage, perhaps due to a better production function). There may also be cases where the follower actually enjoys higher profits than the leader, but only because it, say, has much lower costs. This behaviour consistently work on duopoly markets even if the firms are asymmetrical.

Credible and non-credible threats by the follower

If, after the leader had selected its equilibrium quantity, the follower deviated from the equilibrium and chose some non-optimal quantity it would not only hurt itself, but it could also hurt the leader. If the follower chose a much larger quantity than its best response, the market price would lower and the leader's profits would be stung, perhaps below Cournot level profits. In this case, the follower could announce to the leader before the game starts that unless the leader chooses a Cournot equilibrium quantity, the follower will choose a deviant quantity that will hit the leader's profits. After all, the quantity chosen by the leader in equilibrium is only optimal if the follower also plays in equilibrium. The leader is, however, in no danger. Once the leader has chosen its equilibrium quantity, it would be irrational for the follower to deviate because it too would be hurt. Once the leader has chosen, the follower is better off by playing on the equilibrium path. Hence, such a threat by the follower would not be credible.

However, in an (indefinitely) repeated Stackelberg game, the follower might adopt a punishment strategy where it threatens to punish the leader in the next period unless it chooses a non-optimal strategy in the current period. This threat may be credible because it could be rational for the follower to punish in the next period so that the leader chooses Cournot quantities thereafter.

Stackelberg compared with Cournot

The Stackelberg and Cournot models are similar because in both competition is on quantity. However, as seen, the first move gives the leader in Stackelberg a crucial advantage. There is also the important assumption of perfect information in the Stackelberg game: the follower must observe the quantity chosen by the leader, otherwise the game reduces to Cournot. With imperfect information, the threats described above can be credible. If the follower cannot observe the leader's move, it is no longer irrational for the follower to choose, say, a Cournot level of quantity (in fact, that is the equilibrium action). However, it must be that there is imperfect information and the follower is unable to observe the leader's move because it is irrational for the follower not to observe if it can once the leader has moved. If it can observe, it will so that it can make the optimal decision. Any threat by the follower claiming that it will not observe even if it can is as uncredible as those above. This is an example of too much information hurting a player. In Cournot competition, it is the simultaneity of the game (the imperfection of knowledge) that results in neither player ( ceteris paribus ) being at a disadvantage.

Game-theoretic considerations

As mentioned, imperfect information in a leadership game reduces to Cournot competition. However, some Cournot strategy profiles are sustained as Nash equilibria but can be eliminated as incredible threats (as described above) by applying the solution concept of subgame perfection. Indeed, it is the very thing that makes a Cournot strategy profile a Nash equilibrium in a Stackelberg game that prevents it from being subgame perfect.

Consider a Stackelberg game (i.e. one which fulfills the requirements described above for sustaining a Stackelberg equilibrium) in which, for some reason, the leader believes that whatever action it takes, the follower will choose a Cournot quantity (perhaps the leader believes that the follower is irrational). If the leader played a Stackelberg action, (it believes) that the follower will play Cournot. Hence it is non-optimal for the leader to play Stackelberg. In fact, its best response (by the definition of Cournot equilibrium) is to play Cournot quantity. Once it has done this, the best response of the follower is to play Cournot.

Consider the following strategy profiles: the leader plays Cournot; the follower plays Cournot if the leader plays Cournot and the follower plays Stackelberg if the leader plays Stackelberg and if the leader plays something else, the follower plays an arbitrary strategy (hence this actually describes several profiles). This profile is a Nash equilibrium. As argued above, on the equilibrium path play is a best response to a best response. However, playing Cournot would not have been the best response of the leader were it that the follower would play Stackelberg if it (the leader) played Stackelberg. In this case, the best response of the leader would be to play Stackelberg. Hence, what makes this profile (or rather, these profiles) a Nash equilibrium (or rather, Nash equilibria) is the fact that the follower would play non-Stackelberg if the leader were to play Stackelberg.

However, this very fact (that the follower would play non-Stackelberg if the leader were to play Stackelberg) means that this profile is not a Nash equilibrium of the subgame starting when the leader has already played Stackelberg (a subgame off the equilibrium path). If the leader has already played Stackelberg, the best response of the follower is to play Stackelberg (and therefore it is the only action that yields a Nash equilibrium in this subgame). Hence the strategy profile – which is Cournot – is not subgame perfect.

Comparison with other oligopoly models

In comparison with other oligopoly models,

Applications

The Stackelberg concept has been extended to dynamic Stackelberg games. [1] [2] With the addition of time as a dimension, phenomena not found in static games were discovered, such as violation of the principle of optimality by the leader. [2]

In recent years, Stackelberg games have been applied in the security domain. [3] In this context, the defender (leader) designs a strategy to protect a resource, such that the resource remains safe irrespective of the strategy adopted by the attacker (follower). Stackelberg differential games are also used to model supply chains and marketing channels. [4] Other applications of Stackelberg games include heterogeneous networks, [5] genetic privacy, [6] [7] robotics, [8] [9] autonomous driving, [10] [11] electrical grids, [12] [13] and integrated energy systems. [14]

See also

Related Research Articles

A duopoly is a type of oligopoly where two firms have dominant or exclusive control over a market, and most of the competition within that market occurs directly between them.

An oligopoly is a market in which pricing control lies in the hands of a few sellers.

In game theory, the Nash equilibrium is the most commonly-used solution concept for non-cooperative games. A Nash equilibrium is a situation where no player could gain by changing their own strategy. The idea of Nash equilibrium dates back to the time of Cournot, who in 1838 applied it to his model of competition in an oligopoly.

<span class="mw-page-title-main">Fourier series</span> Decomposition of periodic functions into sums of simpler sinusoidal forms

A Fourier series is an expansion of a periodic function into a sum of trigonometric functions. The Fourier series is an example of a trigonometric series, but not all trigonometric series are Fourier series. By expressing a function as a sum of sines and cosines, many problems involving the function become easier to analyze because trigonometric functions are well understood. For example, Fourier series were first used by Joseph Fourier to find solutions to the heat equation. This application is possible because the derivatives of trigonometric functions fall into simple patterns. Fourier series cannot be used to approximate arbitrary functions, because most functions have infinitely many terms in their Fourier series, and the series do not always converge. Well-behaved functions, for example smooth functions, have Fourier series that converge to the original function. The coefficients of the Fourier series are determined by integrals of the function multiplied by trigonometric functions, described in Common forms of the Fourier series below.

Thermal conduction is the diffusion of thermal energy (heat) within one material or between materials in contact. The higher temperature object has molecules with more kinetic energy; collisions between molecules distributes this kinetic energy until an object has the same kinetic energy throughout. Thermal conductivity, frequently represented by k, is a property that relates the rate of heat loss per unit area of a material to its rate of change of temperature. Essentially, it is a value that accounts for any property of the material that could change the way it conducts heat. Heat spontaneously flows along a temperature gradient. For example, heat is conducted from the hotplate of an electric stove to the bottom of a saucepan in contact with it. In the absence of an opposing external driving energy source, within a body or between bodies, temperature differences decay over time, and thermal equilibrium is approached, temperature becoming more uniform.

<span class="mw-page-title-main">Profit maximization</span> Process to determine the highest profits for a firm

In economics, profit maximization is the short run or long run process by which a firm may determine the price, input and output levels that will lead to the highest possible total profit. In neoclassical economics, which is currently the mainstream approach to microeconomics, the firm is assumed to be a "rational agent" which wants to maximize its total profit, which is the difference between its total revenue and its total cost.

<span class="mw-page-title-main">Economic equilibrium</span> Situation where economic forces are balanced

In economics, economic equilibrium is a situation in which economic forces such as supply and demand are balanced and in the absence of external influences the values of economic variables will not change. For example, in the standard text perfect competition, equilibrium occurs at the point at which quantity demanded and quantity supplied are equal.

The Basel problem is a problem in mathematical analysis with relevance to number theory, concerning an infinite sum of inverse squares. It was first posed by Pietro Mengoli in 1650 and solved by Leonhard Euler in 1734, and read on 5 December 1735 in The Saint Petersburg Academy of Sciences. Since the problem had withstood the attacks of the leading mathematicians of the day, Euler's solution brought him immediate fame when he was twenty-eight. Euler generalised the problem considerably, and his ideas were taken up more than a century later by Bernhard Riemann in his seminal 1859 paper "On the Number of Primes Less Than a Given Magnitude", in which he defined his zeta function and proved its basic properties. The problem is named after Basel, hometown of Euler as well as of the Bernoulli family who unsuccessfully attacked the problem.

Economic order quantity (EOQ), also known as financial purchase quantity or economic buying quantity, is the order quantity that minimizes the total holding costs and ordering costs in inventory management. It is one of the oldest classical production scheduling models. The model was developed by Ford W. Harris in 1786, but the consultantR. H. Wilson applied it extensively, and he and K. Andler are given credit for their in-depth analysis.

<span class="mw-page-title-main">Magnetic vector potential</span> Integral of the magnetic field

In classical electromagnetism, magnetic vector potential is the vector quantity defined so that its curl is equal to the magnetic field: . Together with the electric potential φ, the magnetic vector potential can be used to specify the electric field E as well. Therefore, many equations of electromagnetism can be written either in terms of the fields E and B, or equivalently in terms of the potentials φ and A. In more advanced theories such as quantum mechanics, most equations use potentials rather than fields.

Bertrand competition is a model of competition used in economics, named after Joseph Louis François Bertrand (1822–1900). It describes interactions among firms (sellers) that set prices and their customers (buyers) that choose quantities at the prices set. The model was formulated in 1883 by Bertrand in a review of Antoine Augustin Cournot's book Recherches sur les Principes Mathématiques de la Théorie des Richesses (1838) in which Cournot had put forward the Cournot model. Cournot's model argued that each firm should maximise its profit by selecting a quantity level and then adjusting price level to sell that quantity. The outcome of the model equilibrium involved firms pricing above marginal cost; hence, the competitive price. In his review, Bertrand argued that each firm should instead maximise its profits by selecting a price level that undercuts its competitors' prices, when their prices exceed marginal cost. The model was not formalized by Bertrand; however, the idea was developed into a mathematical model by Francis Ysidro Edgeworth in 1889.

Cournot competition is an economic model used to describe an industry structure in which companies compete on the amount of output they will produce, which they decide on independently of each other and at the same time. It is named after Antoine Augustin Cournot (1801–1877) who was inspired by observing competition in a spring water duopoly. It has the following features:

In game theory, an extensive-form game is a specification of a game allowing for the explicit representation of a number of key aspects, like the sequencing of players' possible moves, their choices at every decision point, the information each player has about the other player's moves when they make a decision, and their payoffs for all possible game outcomes. Extensive-form games also allow for the representation of incomplete information in the form of chance events modeled as "moves by nature". Extensive-form representations differ from normal-form in that they provide a more complete description of the game in question, whereas normal-form simply boils down the game into a payoff matrix.

<span class="mw-page-title-main">Charge density</span> Electric charge per unit length, area or volume

In electromagnetism, charge density is the amount of electric charge per unit length, surface area, or volume. Volume charge density is the quantity of charge per unit volume, measured in the SI system in coulombs per cubic meter (C⋅m−3), at any point in a volume. Surface charge density (σ) is the quantity of charge per unit area, measured in coulombs per square meter (C⋅m−2), at any point on a surface charge distribution on a two dimensional surface. Linear charge density (λ) is the quantity of charge per unit length, measured in coulombs per meter (C⋅m−1), at any point on a line charge distribution. Charge density can be either positive or negative, since electric charge can be either positive or negative.

Hotelling's lemma is a result in microeconomics that relates the supply of a good to the maximum profit of the producer. It was first shown by Harold Hotelling, and is widely used in the theory of the firm.

The Chudnovsky algorithm is a fast method for calculating the digits of π, based on Ramanujan's π formulae. Published by the Chudnovsky brothers in 1988, it was used to calculate π to a billion decimal places.

An LC circuit can be quantized using the same methods as for the quantum harmonic oscillator. An LC circuit is a variety of resonant circuit, and consists of an inductor, represented by the letter L, and a capacitor, represented by the letter C. When connected together, an electric current can alternate between them at the circuit's resonant frequency:

The Brander–Spencer model is an economic model in international trade originally developed by James Brander and Barbara Spencer in the early 1980s. The model illustrates a situation where, under certain assumptions, a government can subsidize domestic firms to help them in their competition against foreign producers and in doing so enhances national welfare. This conclusion stands in contrast to results from most international trade models, in which government non-interference is socially optimal.

A markup rule is the pricing practice of a producer with market power, where a firm charges a fixed mark-up over its marginal cost.

In oligopoly theory, conjectural variation is the belief that one firm has an idea about the way its competitors may react if it varies its output or price. The firm forms a conjecture about the variation in the other firm's output that will accompany any change in its own output. For example, in the classic Cournot model of oligopoly, it is assumed that each firm treats the output of the other firms as given when it chooses its output. This is sometimes called the "Nash conjecture," as it underlies the standard Nash equilibrium concept. However, alternative assumptions can be made. Suppose you have two firms producing the same good, so that the industry price is determined by the combined output of the two firms. Now suppose that each firm has what is called the "Bertrand Conjecture" of −1. This means that if firm A increases its output, it conjectures that firm B will reduce its output to exactly offset firm A's increase, so that total output and hence price remains unchanged. With the Bertrand Conjecture, the firms act as if they believe that the market price is unaffected by their own output, because each firm believes that the other firm will adjust its output so that total output will be constant. At the other extreme is the Joint-Profit maximizing conjecture of +1. In this case, each firm believes that the other will imitate exactly any change in output it makes, which leads to the firms behaving like a single monopoly supplier.

References

  1. Simaan, M.; Cruz, J. B. (May 1973). "On the Stackelberg strategy in nonzero-sum games". Journal of Optimization Theory and Applications. 11 (5): 533–555. doi:10.1007/BF00935665. ISSN   0022-3239. S2CID   121400147.
  2. 1 2 Simaan, M.; Cruz, J. B. (June 1973). "Additional aspects of the Stackelberg strategy in nonzero-sum games". Journal of Optimization Theory and Applications. 11 (6): 613–626. doi:10.1007/BF00935561. ISSN   0022-3239.
  3. Brown, Gerald (2006). "Defending critical infrastructure". Interfaces. 36 (6): 530–544. doi:10.1287/inte.1060.0252. hdl: 10945/36732 . S2CID   16223037.
  4. He, Xiuli; Prasad, Ashutosh; Sethi, Suresh P.; Gutierrez, Genaro J. (December 2007). "A survey of Stackelberg differential game models in supply and marketing channels". Journal of Systems Science and Systems Engineering. 16 (4): 385–413. CiteSeerX   10.1.1.727.2952 . doi:10.1007/s11518-007-5058-2. ISSN   1004-3756. S2CID   11443159.
  5. Ghosh, Subha; De, Debashis (2021-04-28). "E²M³: energy-efficient massive MIMO–MISO 5G HetNet using Stackelberg game". The Journal of Supercomputing. 77 (11): 13549–13583. doi:10.1007/s11227-021-03809-1. ISSN   0920-8542. S2CID   235569547.
  6. Wan, Zhiyu; Vorobeychik, Yevgeniy; Xia, Weiyi; Clayton, Ellen Wright; Kantarcioglu, Murat; Malin, Bradley (2017-02-02). "Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach". The American Journal of Human Genetics. 100 (2): 316–322. doi:10.1016/j.ajhg.2016.12.002. ISSN   0002-9297. PMC   5294764 . PMID   28065469.
  7. Wan, Zhiyu; Vorobeychik, Yevgeniy; Xia, Weiyi; Liu, Yongtai; Wooders, Myrna; Guo, Jia; Yin, Zhijun; Clayton, Ellen Wright; Kantarcioglu, Murat; Malin, Bradley A. (2021). "Using game theory to thwart multistage privacy intrusions when sharing data". Science Advances. 7 (50): eabe9986. Bibcode:2021SciA....7.9986W. doi:10.1126/sciadv.abe9986. PMC   8664254 . PMID   34890225.
  8. Koh, Joewie J.; Ding, Guohui; Heckman, Christoffer; Chen, Lijun; Roncone, Alessandro (2020-10-24). "Cooperative Control of Mobile Robots with Stackelberg Learning". 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Las Vegas, NV, USA: IEEE. pp. 7985–7992. arXiv: 2008.00679 . doi:10.1109/IROS45743.2020.9341376. ISBN   978-1-7281-6212-6. S2CID   220935562.
  9. Ranjbar-Sahraei, Bijan; Stankova, Katerina; Tuyls, Karl; Weiss, Gerhard (2013-09-02). "Stackelberg-based Coverage Approach in Nonconvex Environments". Advances in Artificial Life, ECAL 2013. MIT Press: 462–469. CiteSeerX   10.1.1.650.4481 . doi:10.7551/978-0-262-31709-2-ch066. ISBN   978-0-262-31709-2. S2CID   11668402.
  10. Yoo, Jehong; Langari, Reza (2020). "A Stackelberg Game Theoretic Model of Lane-Merging". arXiv: 2003.09786 .{{cite journal}}: Cite journal requires |journal= (help)
  11. Cooper, Matt; Lee, Jun Ki; Beck, Jacob; Fishman, Joshua D.; Gillett, Michael; Papakipos, Zoë; Zhang, Aaron; Ramos, Jerome; Shah, Aansh (2019), Salichs, Miguel A.; Ge, Shuzhi Sam; Barakova, Emilia Ivanova; Cabibihan, John-John (eds.), "Stackelberg Punishment and Bully-Proofing Autonomous Vehicles", Social Robotics, vol. 11876, Cham: Springer International Publishing, pp. 368–377, arXiv: 1908.08641 , doi:10.1007/978-3-030-35888-4_34, ISBN   978-3-030-35887-7, S2CID   201645147 , retrieved 2021-05-03
  12. Qiu, Haifeng; Gu, Wei; Wang, Lu; Pan, Guangsheng; Xu, Yinliang; Wu, Zhi (June 2021). "Trilayer Stackelberg Game Approach for Robustly Power Management in Community Grids". IEEE Transactions on Industrial Informatics. 17 (6): 4073–4083. doi:10.1109/TII.2020.3015733. ISSN   1551-3203. S2CID   226558914.
  13. An, Lu; Chakrabortty, Aranya; Duel-Hallen, Alexandra (2020-12-14). "A Stackelberg Security Investment Game for Voltage Stability of Power Systems". 2020 59th IEEE Conference on Decision and Control (CDC). Jeju, Korea (South): IEEE. pp. 3359–3364. arXiv: 2006.11665 . doi:10.1109/CDC42340.2020.9304301. ISBN   978-1-7281-7447-1. S2CID   219965779.
  14. Zheng, Weiye; Hill, David J. (2021-03-01). "Incentive-based coordination mechanism for distributed operation of integrated electricity and heat systems". Applied Energy. 285: 116373. doi:10.1016/j.apenergy.2020.116373. ISSN   0306-2619. S2CID   233833095.