Price of anarchy

Last updated

The Price of Anarchy (PoA) [1] is a concept in economics and game theory that measures how the efficiency of a system degrades due to selfish behavior of its agents. It is a general notion that can be extended to diverse systems and notions of efficiency. For example, consider the system of transportation of a city and many agents trying to go from some initial location to a destination. Here, efficiency means the average time for an agent to reach the destination. In the 'centralized' solution, a central authority can tell each agent which path to take in order to minimize the average travel time. In the 'decentralized' version, each agent chooses its own path. The Price of Anarchy measures the ratio between average travel time in the two cases.

Contents

Usually the system is modeled as a game and the efficiency is some function of the outcomes (e.g. maximum delay in a network, congestion in a transportation system, social welfare in an auction, etc.). Different concepts of equilibrium can be used to model the selfish behavior of the agents, among which the most common is the Nash equilibrium. Different flavors of Nash equilibrium lead to variations of the notion of Price of Anarchy as Pure Price of Anarchy (for deterministic equilibria), Mixed Price of Anarchy (for randomized equilibria), and Bayes–Nash Price of Anarchy (for games with incomplete information). Solution concepts other than Nash equilibrium lead to variations such as the Price of Sinking. [2]

The term Price of Anarchy was first used by Elias Koutsoupias and Christos Papadimitriou, [1] [3] but the idea of measuring inefficiency of equilibrium is older. [4] The concept in its current form was designed to be the analogue of the 'approximation ratio' in an approximation algorithm or the 'competitive ratio' in an online algorithm. This is in the context of the current trend of analyzing games using algorithmic lenses (algorithmic game theory).

Mathematical definition

Consider a game , defined by a set of players , strategy sets for each player and utilities (where also called set of outcomes). We can define a measure of efficiency of each outcome which we call welfare function . Natural candidates include the sum of players utilities (utilitarian objective) minimum utility (fairness or egalitarian objective) ..., or any function that is meaningful for the particular game being analyzed and is desirable to be maximized.

We can define a subset to be the set of strategies in equilibrium (for example, the set of Nash equilibria). The Price of Anarchy is then defined as the ratio between the optimal 'centralized' solution and 'worst equilibrium':

If, instead of a 'welfare' which we want to 'maximize', the function measure efficiency is a 'cost function' which we want to 'minimize' (e.g. delay in a network) we use (following the convention in approximation algorithms):

A related notion is that of the Price of Stability (PoS) which measures the ratio between the optimal 'centralized' solution and the 'best equilibrium':

or in the case of cost functions:

We know that by the definition. It is expected that the loss in efficiency due to game-theoretical constraints is somewhere between 'PoS' and 'PoA'.

Examples

Prisoner's dilemma

Consider the 2x2 game called prisoner's dilemma, given by the following cost matrix:

CooperateDefect
Cooperate1, 17, 0
Defect0, 75, 5

and let the cost function be Now, the worst (and only) Nash Equilibrium would be when both players defect and the resulting cost is . However, the highest social welfare occurs when both cooperate, in which case the cost is . Thus the PoA of this game will be .

Since the game has a unique Nash equilibrium, the PoS is equal to the PoA and it is 5 too.

Job scheduling

A more natural example is the one of job scheduling. There are players and each of them has a job to run. They can choose one of machines to run the job. The Price of Anarchy compares the situation where the selection of machines is guided/directed centrally to the situation where each player chooses the machine that will make its job run fastest.

Each machine has a speed Each job has a weight A player picks a machine to run his or her job on. So, the strategies of each player are Define the load on machine to be:

The cost for player is i.e., the load of the machine they chose. We consider the egalitarian cost function , here called the makespan.

We consider two concepts of equilibrium: pure Nash and mixed Nash. It should be clear that mixed PoA ≥ pure PoA, because any pure Nash equilibrium is also a mixed Nash equilibrium (this inequality can be strict: e.g. when , , , and , the mixed strategies achieve an average makespan of 1.5, while any pure-strategy PoA in this setting is ). First we need to argue that there exist pure Nash equilibria.

Claim. For each job scheduling game, there exists at least one pure-strategy Nash equilibrium.

Proof. We would like to take a socially optimal action profile . This would mean simply an action profile whose makespan is minimum. However, this will not be enough. There may be several such action profiles leading to a variety of different loads distributions (all having the same maximum load). Among these, we further restrict ourselves to one that has a minimum second-largest load. Again, this results in a set of possible load distributions, and we repeat until the th-largest (i.e., smallest) load, where there can only be one distribution of loads (unique up to permutation). This would also be called the lexicographic smallest sorted load vector.

We claim that this is a pure-strategy Nash equilibrium. Reasoning by contradiction, suppose that some player could strictly improve by moving from machine to machine . This means that the increased load of machine after the move is still smaller than the load of machine before the move. As the load of machine must decrease as a result of the move and no other machine is affected, this means that the new configuration is guaranteed to have reduced the th-largest (or higher ranked) load in the distribution. This, however, violates the assumed lexicographic minimality of .Q.E.D.

Claim. For each job scheduling game, the pure PoA is at most .

Proof. It is easy to upper-bound the welfare obtained at any mixed-strategy Nash equilibrium by

Consider, for clarity of exposition, any pure-strategy action profile : clearly

Since the above holds for the social optimum as well, comparing the ratios and proves the claim. Q.E.D

Selfish Routing

Braess's paradox

Braess paradox road example.svg

Consider a road network as shown in the adjacent diagram on which 4000 drivers wish to travel from point Start to End. The travel time in minutes on the Start–A road is the number of travelers (T) divided by 100, and on Start–B is a constant 45 minutes (likewise with the roads across from them). If the dashed road does not exist (so the traffic network has 4 roads in total), the time needed to drive Start–A–End route with drivers would be . The time needed to drive the Start–B–End route with drivers would be . As there are 4000 drivers, the fact that can be used to derive the fact that when the system is at equilibrium. Therefore, each route takes minutes. If either route took less time, it would not be a Nash equilibrium: a rational driver would switch from the longer route to the shorter route.

Now suppose the dashed line A–B is a road with an extremely short travel time of approximately 0 minutes. Suppose that the road is opened and one driver tries Start–A–B–End. To his surprise he finds that his time is minutes, a saving of almost 25 minutes. Soon, more of the 4000 drivers are trying this new route. The time taken rises from 40.01 and keeps climbing. When the number of drivers trying the new route reaches 2500, with 1500 still in the Start–B–End route, their time will be minutes, which is no improvement over the original route. Meanwhile, those 1500 drivers have been slowed to minutes, a 20-minute increase. They are obliged to switch to the new route via A too, so it now takes minutes. Nobody has any incentive to travel A-End or Start-B because any driver trying them will take 85 minutes. Thus, the opening of the cross route triggers an irreversible change to it by everyone, costing everyone 80 minutes instead of the original 65. If every driver were to agree not to use the A–B path, or if that route were closed, every driver would benefit by a 15-minute reduction in travel time.

Generalized routing problem

The routing problem introduced in the Braess's paradox can be generalized to many different flows traversing the same graph at the same time.

Definition (Generalized flow). Let , and be as defined above, and suppose that we want to route the quantities through each distinct pair of nodes in . A flow is defined as an assignment of a real, nonnegative number to each path going from to , with the constraint that

The flow traversing a specific edge of is defined as

For succinctness, we write when are clear from context.

Definition (Nash-equilibrium flow). A flow is a Nash-equilibrium flow iff and from to

This definition is closely related to what we said about the support of mixed-strategy Nash equilibria in normal-form games.

Definition (Conditional welfare of a flow). Let and be two flows in associated with the same sets and . In what follows, we will drop the subscript to make the notation clearer. Assume to fix the latencies induced by on the graph: the conditional welfare of with respect to is defined as

Fact 1. Given a Nash-equilibrium flow and any other flow , .

Proof (By contradiction). Assume that . By definition,

.

Since and are associated with the same sets , we know that

Therefore, there must be a pair and two paths from to such that , , and

In other words, the flow can achieve a lower welfare than only if there are two paths from to having different costs, and if reroutes some flow of from the higher-cost path to the lower-cost path. This situation is clearly incompatible with the assumption that is a Nash-equilibrium flow.Q.E.D.

Note that Fact 1 does not assume any particular structure on the set .

Fact 2. Given any two real numbers and , .

Proof. This is another way to express the true inequality . Q.E.D.

Theorem. The pure PoA of any generalized routing problem with linear latencies is .

Proof. Note that this theorem is equivalent to saying that for each Nash-equilibrium flow , , where is any other flow. By definition,

By using Fact 2, we have that

since

We can conclude that , and prove the thesis using Fact 1. Q.E.D.

Note that in the proof we have made extensive use of the assumption that the functions in are linear. Actually, a more general fact holds.

Theorem. Given a generalized routing problem with graph and polynomial latency functions of degree with nonnegative coefficients, the pure PoA is .

Note that the PoA can grow with . Consider the example shown in the following figure, where we assume unit flow: the Nash-equilibrium flows have social welfare 1; however, the best welfare is achieved when , in which case

This quantity tends to zero when tends to infinity.

Further results

PoA upper bounds can be easily obtained if the game is shown to satisfy a so-called smoothness inequality. More precisely, a cost-minimimization game is (λ,μ)-smooth (with λ ≥ 0 and μ < 1) if the inequality [5]

holds for any outcome a and a*. In this case, the PoA is upper bounded by λ/(1 − μ). [5]

Moreover, for these (finite) games it was proven that every equilibrium which achieves the PoA bound is fragile, in the sense that the agents demonstrate a state of indifference between their equilibrium action and the action they would pursue in a system-optimal outcome. [6]

See also

Related Research Articles

In mathematics, the Lp spaces are function spaces defined using a natural generalization of the p-norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz.

<span class="mw-page-title-main">Fokker–Planck equation</span> Partial differential equation

In statistical mechanics and information theory, the Fokker–Planck equation is a partial differential equation that describes the time evolution of the probability density function of the velocity of a particle under the influence of drag forces and random forces, as in Brownian motion. The equation can be generalized to other observables as well. The Fokker-Planck equation has multiple applications in information theory, graph theory, data science, finance, economics etc.

<span class="mw-page-title-main">Jensen's inequality</span> Theorem of convex functions

In mathematics, Jensen's inequality, named after the Danish mathematician Johan Jensen, relates the value of a convex function of an integral to the integral of the convex function. It was proved by Jensen in 1906, building on an earlier proof of the same inequality for doubly-differentiable functions by Otto Hölder in 1889. Given its generality, the inequality appears in many forms depending on the context, some of which are presented below. In its simplest form the inequality states that the convex transformation of a mean is less than or equal to the mean applied after convex transformation; it is a simple corollary that the opposite is true of concave transformations.

In probability theory, a Chernoff bound is an exponentially decreasing upper bound on the tail of a random variable based on its moment generating function. The minimum of all such exponential bounds forms the Chernoff or Chernoff-Cramér bound, which may decay faster than exponential. It is especially useful for sums of independent random variables, such as sums of Bernoulli random variables.

<span class="mw-page-title-main">Dirichlet distribution</span> Probability distribution

In probability and statistics, the Dirichlet distribution (after Peter Gustav Lejeune Dirichlet), often denoted , is a family of continuous multivariate probability distributions parameterized by a vector of positive reals. It is a multivariate generalization of the beta distribution, hence its alternative name of multivariate beta distribution (MBD). Dirichlet distributions are commonly used as prior distributions in Bayesian statistics, and in fact, the Dirichlet distribution is the conjugate prior of the categorical distribution and multinomial distribution.

In mechanics, virtual work arises in the application of the principle of least action to the study of forces and movement of a mechanical system. The work of a force acting on a particle as it moves along a displacement is different for different displacements. Among all the possible displacements that a particle may follow, called virtual displacements, one will minimize the action. This displacement is therefore the displacement followed by the particle according to the principle of least action.

The work of a force on a particle along a virtual displacement is known as the virtual work.

<span class="mw-page-title-main">Lambert series</span> Mathematical term

In mathematics, a Lambert series, named for Johann Heinrich Lambert, is a series taking the form

In mathematics, we can define norms for the elements of a vector space. When the vector space in question consists of matrices, these are called matrix norms.

In mathematics, the Schur orthogonality relations, which were proven by Issai Schur through Schur's lemma, express a central fact about representations of finite groups. They admit a generalization to the case of compact groups in general, and in particular compact Lie groups, such as the rotation group SO(3).

Expected shortfall (ES) is a risk measure—a concept used in the field of financial risk measurement to evaluate the market risk or credit risk of a portfolio. The "expected shortfall at q% level" is the expected return on the portfolio in the worst of cases. ES is an alternative to value at risk that is more sensitive to the shape of the tail of the loss distribution.

In polynomial interpolation of two variables, the Padua points are the first known example of a unisolvent point set with minimal growth of their Lebesgue constant, proven to be . Their name is due to the University of Padua, where they were originally discovered.

In computational learning theory, Rademacher complexity, named after Hans Rademacher, measures richness of a class of sets with respect to a probability distribution. The concept can also be extended to real valued functions.

In mathematics and physics, the Magnus expansion, named after Wilhelm Magnus (1907–1990), provides an exponential representation of the solution of a first-order homogeneous linear differential equation for a linear operator. In particular, it furnishes the fundamental matrix of a system of linear ordinary differential equations of order n with varying coefficients. The exponent is aggregated as an infinite series, whose terms involve multiple integrals and nested commutators.

In game theory, the price of stability (PoS) of a game is the ratio between the best objective function value of one of its equilibria and that of an optimal outcome. The PoS is relevant for games in which there is some objective authority that can influence the players a bit, and maybe help them converge to a good Nash equilibrium. When measuring how efficient a Nash equilibrium is in a specific game we often also talk about the price of anarchy (PoA), which is the ratio between the worst objective function value of one of its equilibria and that of an optimal outcome.

In mathematics, low-rank approximation is a minimization problem, in which the cost function measures the fit between a given matrix and an approximating matrix, subject to a constraint that the approximating matrix has reduced rank. The problem is used for mathematical modeling and data compression. The rank constraint is related to a constraint on the complexity of a model that fits the data. In applications, often there are other constraints on the approximating matrix apart from the rank constraint, e.g., non-negativity and Hankel structure.

In mathematics, a transformation of a sequence's generating function provides a method of converting the generating function for one sequence into a generating function enumerating another. These transformations typically involve integral formulas applied to a sequence generating function or weighted sums over the higher-order derivatives of these functions.

<span class="mw-page-title-main">Price of anarchy in auctions</span>

The Price of Anarchy (PoA) is a concept in game theory and mechanism design that measures how the social welfare of a system degrades due to selfish behavior of its agents. It has been studied extensively in various contexts, particularly in auctions.

In number theory, the prime omega functions and count the number of prime factors of a natural number Thereby counts each distinct prime factor, whereas the related function counts the total number of prime factors of honoring their multiplicity. That is, if we have a prime factorization of of the form for distinct primes , then the respective prime omega functions are given by and . These prime factor counting functions have many important number theoretic relations.

The purpose of this page is to catalog new, interesting, and useful identities related to number-theoretic divisor sums, i.e., sums of an arithmetic function over the divisors of a natural number , or equivalently the Dirichlet convolution of an arithmetic function with one:

In finance, an option on realized variance is a type of variance derivatives which is the derivative securities on which the payoff depends on the annualized realized variance of the return of a specified underlying asset, such as stock index, bond, exchange rate, etc. Another liquidated security of the same type is variance swap, which is, in other words, the futures contract on realized variance.

References

  1. 1 2 Koutsoupias, Elias; Papadimitriou, Christos (May 2009). "Worst-case Equilibria". Computer Science Review. 3 (2): 65–69. doi:10.1016/j.cosrev.2009.04.003. Archived from the original on 2016-03-13. Retrieved 2010-09-12.
  2. M. Goemans, V. Mirrokni, A. Vetta, Sink equilibria and convergence , FOCS 05
  3. Chung, Christine; Ligett, Katrina; Pruhs, Kirk; Roth, Aaron (2008), Monien, Burkhard; Schroeder, Ulf-Peter (eds.), "The Price of Stochastic Anarchy", Algorithmic Game Theory, Berlin, Heidelberg: Springer Berlin Heidelberg, vol. 4997, pp. 303–314, doi:10.1007/978-3-540-79309-0_27, ISBN   978-3-540-79308-3 , retrieved 2023-12-29
  4. P. Dubey. Inefficiency of Nash equilibria. Math. Operat. Res., 11(1):1–8, 1986
  5. 1 2 Seaton, Joshua H.; Brown, Philip N. (2023). "On the Intrinsic Fragility of the Price of Anarchy". IEEE Control Systems Letters. 7: 3573–3578. doi:10.1109/LCSYS.2023.3335315. ISSN   2475-1456.
  6. Roughgarden, Tim (2015-11-02). "Intrinsic Robustness of the Price of Anarchy". Journal of the ACM. 62 (5): 1–42. doi:10.1145/2806883. ISSN   0004-5411.

Further reading