Mean-field game theory

Last updated

Mean-field game theory is the study of strategic decision making by small interacting agents in very large populations. It lies at the intersection of game theory with stochastic analysis and control theory. The use of the term "mean field" is inspired by mean-field theory in physics, which considers the behavior of systems of large numbers of particles where individual particles have negligible impacts upon the system. In other words, each agent acts according to his minimization or maximization problem taking into account other agents’ decisions and because their population is large we can assume the number of agents goes to infinity and a representative agent exists. [1]

Contents

In traditional game theory, the subject of study is usually a game with two players and discrete time space, and extends the results to more complex situations by induction. However, for games in continuous time with continuous states (differential games or stochastic differential games) this strategy cannot be used because of the complexity that the dynamic interactions generate. On the other hand with MFGs we can handle large numbers of players through the mean representative agent and at the same time describe complex state dynamics.

This class of problems was considered in the economics literature by Boyan Jovanovic and Robert W. Rosenthal, [2] in the engineering literature by Minyi Huang, Roland Malhame, and Peter E. Caines [3] [4] [5] and independently and around the same time by mathematicians Jean-Michel Lasry  [ fr ] and Pierre-Louis Lions. [6] [7]

In continuous time a mean-field game is typically composed of a Hamilton–Jacobi–Bellman equation that describes the optimal control problem of an individual and a Fokker–Planck equation that describes the dynamics of the aggregate distribution of agents. Under fairly general assumptions it can be proved that a class of mean-field games is the limit as of an N-player Nash equilibrium. [8]

A related concept to that of mean-field games is "mean-field-type control". In this case, a social planner controls the distribution of states and chooses a control strategy. The solution to a mean-field-type control problem can typically be expressed as a dual adjoint Hamilton–Jacobi–Bellman equation coupled with Kolmogorov equation. Mean-field-type game theory is the multi-agent generalization of the single-agent mean-field-type control. [9]

General Form of a Mean-field Game

The following system of equations [10] can be used to model a typical Mean-field game:

The basic dynamics of this set of Equations can be explained by an average agent's optimal control problem. In a mean-field game, an average agent can control their movement to influence the population's overall location by:

where is a parameter and is a standard Brownian motion. By controlling their movement, the agent aims to minimize their overall expected cost throughout the time period :

where is the running cost at time and is the terminal cost at time . By this definition, at time and position , the value function can be determined as:

Given the definition of the value function , it can be tracked by the Hamilton-Jacobi equation (1). The optimal action of the average players can be determined as . As all agents are relatively small and cannot single-handedly change the dynamics of the population, they will individually adapt the optimal control and the population would move in that way. This is similar to a Nash Equilibrium, in which all agents act in response to a specific set of others' strategies. The optimal control solution then leads to the Kolmogorov-Fokker-Planck equation (2).

Finite State Games

A prominent category of mean field is games with a finite number of states and a finite number of actions per player. For those games, the analog of the Hamilton-Jacobi-Bellman equation is the Bellman equation, and the discrete version of the Fokker-Planck equation is the Kolmogorov equation. Specifically, for discrete-time models, the players' strategy is the Kolmogorov equation's probability matrix. In continuous time models, players have the ability to control the transition rate matrix.

A discrete mean field game can be defined by a tuple , where is the state space, the action set, the transition rate matrices, the initial state, the cost functions and a discount factor. Furthermore, a mixed strategy is a measurable function , that associates to each state and each time a probability measure on the set of possible actions. Thus is the probability that, at time a player in state takes action , under strategy . Additionally, rate matrices define the evolution over the time of population distribution, where is the population distribution at time . [11]

Linear-quadratic Gaussian game problem

From Caines (2009), a relatively simple model of large-scale games is the linear-quadratic Gaussian model. The individual agent's dynamics are modeled as a stochastic differential equation

where is the state of the -th agent, is the control of the -th agent, and are independent Wiener processes for all . The individual agent's cost is

The coupling between agents occurs in the cost function.

General and Applied Use

The paradigm of Mean Field Games has become a major connection between distributed decision-making and stochastic modeling. Starting out in the stochastic control literature, it is gaining rapid adoption across a range of applications, including:

a. Financial market Carmona reviews applications in financial engineering and economics that can be cast and tackled within the framework of the MFG paradigm. [12] Carmona argues that models in macroeconomics, contract theory, finance, …, greatly benefit from the switch to continuous time from the more traditional discrete-time models. He considers only continuous time models in his review chapter, including systemic risk, price impact, optimal execution, models for bank runs, high-frequency trading, and cryptocurrencies.

b. Crowd motions MFG assumes that individuals are smart players which try to optimize their strategy and path with respect to certain costs (equilibrium with rational expectations approach). MFG models are useful to describe the anticipation phenomenon: the forward part describes the crowd evolution while the backward gives the process of how the anticipations are built. Additionally, compared to multi-agent microscopic model computations, MFG only requires lower computational costs for the macroscopic simulations. Some researchers have turned to MFG in order to model the interaction between populations and study the decision-making process of intelligent agents, including aversion and congestion behavior between two groups of pedestrians, [13] departure time choice of morning commuters, [14] and decision-making processes for autonomous vehicle. [15]

c. Control and mitigation of Epidemics Since the epidemic has affected society and individuals significantly, MFG and mean-field controls (MFCs) provide a perspective to study and understand the underlying population dynamics, especially in the context of the Covid-19 pandemic response. MFG has been used to extend the SIR-type dynamics with spatial effects or allowing for individuals to choose their behaviors and control their contributions to the spread of the disease. MFC is applied to design the optimal strategy to control the virus spreading within a spatial domain, [16] control individuals’ decisions to limit their social interactions, [17] and support the government’s nonpharmaceutical interventions. [18]

See also

Related Research Articles

<span class="mw-page-title-main">Kaluza–Klein theory</span> Unified field theory

In physics, Kaluza–Klein theory is a classical unified field theory of gravitation and electromagnetism built around the idea of a fifth dimension beyond the common 4D of space and time and considered an important precursor to string theory. In their setup, the vacuum has the usual 3 dimensions of space and one dimension of time but with another microscopic extra spatial dimension in the shape of a tiny circle. Gunnar Nordström had an earlier, similar idea. But in that case, a fifth component was added to the electromagnetic vector potential, representing the Newtonian gravitational potential, and writing the Maxwell equations in five dimensions.

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-12 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity, and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.

<span class="mw-page-title-main">Stress–energy tensor</span> Tensor describing energy momentum density in spacetime

The stress–energy tensor, sometimes called the stress–energy–momentum tensor or the energy–momentum tensor, is a tensor physical quantity that describes the density and flux of energy and momentum in spacetime, generalizing the stress tensor of Newtonian physics. It is an attribute of matter, radiation, and non-gravitational force fields. This density and flux of energy and momentum are the sources of the gravitational field in the Einstein field equations of general relativity, just as mass density is the source of such a field in Newtonian gravity.

<span class="mw-page-title-main">Noether's theorem</span> Statement relating differentiable symmetries to conserved quantities

Noether's theorem or Noether's first theorem states that every differentiable symmetry of the action of a physical system with conservative forces has a corresponding conservation law. The theorem was proven by mathematician Emmy Noether in 1915 and published in 1918. The action of a physical system is the integral over time of a Lagrangian function, from which the system's behavior can be determined by the principle of least action. This theorem only applies to continuous and smooth symmetries over physical space.

In probability theory and related fields, Malliavin calculus is a set of mathematical techniques and ideas that extend the mathematical field of calculus of variations from deterministic functions to stochastic processes. In particular, it allows the computation of derivatives of random variables. Malliavin calculus is also called the stochastic calculus of variations. P. Malliavin first initiated the calculus on infinite dimensional space. Then, the significant contributors such as S. Kusuoka, D. Stroock, J-M. Bismut, S. Watanabe, I. Shigekawa, and so on finally completed the foundations.

In physics, particularly in quantum field theory, configurations of a physical system that satisfy classical equations of motion are called "on the mass shell" or simply more often on shell; while those that do not are called "off the mass shell", or off shell.

In mathematics, a Markov decision process (MDP) is a discrete-time stochastic control process. It provides a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for studying optimization problems solved via dynamic programming. MDPs were known at least as early as the 1950s; a core body of research on Markov decision processes resulted from Ronald Howard's 1960 book, Dynamic Programming and Markov Processes. They are used in many disciplines, including robotics, automatic control, economics and manufacturing. The name of MDPs comes from the Russian mathematician Andrey Markov as they are an extension of Markov chains.

A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics and are used to model various behaviours of stochastic models such as stock prices, random growth models or physical systems that are subjected to thermal fluctuations.

Tensor–vector–scalar gravity (TeVeS), developed by Jacob Bekenstein in 2004, is a relativistic generalization of Mordehai Milgrom's Modified Newtonian dynamics (MOND) paradigm.

In theoretical physics, massive gravity is a theory of gravity that modifies general relativity by endowing the graviton with a nonzero mass. In the classical theory, this means that gravitational waves obey a massive wave equation and hence travel at speeds below the speed of light.

A theoretical motivation for general relativity, including the motivation for the geodesic equation and the Einstein field equation, can be obtained from special relativity by examining the dynamics of particles in circular orbits about the Earth. A key advantage in examining circular orbits is that it is possible to know the solution of the Einstein Field Equation a priori. This provides a means to inform and verify the formalism.

<span class="mw-page-title-main">Covariant formulation of classical electromagnetism</span> Ways of writing certain laws of physics

The covariant formulation of classical electromagnetism refers to ways of writing the laws of classical electromagnetism in a form that is manifestly invariant under Lorentz transformations, in the formalism of special relativity using rectilinear inertial coordinate systems. These expressions both make it simple to prove that the laws of classical electromagnetism take the same form in any inertial coordinate system, and also provide a way to translate the fields and forces from one frame to another. However, this is not as general as Maxwell's equations in curved spacetime or non-rectilinear coordinate systems.

In physics, Liouville field theory is a two-dimensional conformal field theory whose classical equation of motion is a generalization of Liouville's equation.

<span class="mw-page-title-main">Gluon field strength tensor</span> Second rank tensor in quantum chromodynamics

In theoretical particle physics, the gluon field strength tensor is a second order tensor field characterizing the gluon interaction between quarks.

In probability theory, a McKean–Vlasov process is a stochastic process described by a stochastic differential equation where the coefficients of the diffusion depend on the distribution of the solution itself. The equations are a model for Vlasov equation and were first studied by Henry McKean in 1966. It is an example of propagation of chaos, in that it can be obtained as a limit of a mean-field system of interacting particles: as the number of particles tends to infinity, the interactions between any single particle and the rest of the pool will only depend on the particle itself.

Lagrangian field theory is a formalism in classical field theory. It is the field-theoretic analogue of Lagrangian mechanics. Lagrangian mechanics is used to analyze the motion of a system of discrete particles each with a finite number of degrees of freedom. Lagrangian field theory applies to continua and fields, which have an infinite number of degrees of freedom.

Supersymmetric theory of stochastic dynamics or stochastics (STS) is an exact theory of stochastic (partial) differential equations (SDEs), the class of mathematical models with the widest applicability covering, in particular, all continuous time dynamical systems, with and without noise. The main utility of the theory from the physical point of view is a rigorous theoretical explanation of the ubiquitous spontaneous long-range dynamical behavior that manifests itself across disciplines via such phenomena as 1/f, flicker, and crackling noises and the power-law statistics, or Zipf's law, of instantonic processes like earthquakes and neuroavalanches. From the mathematical point of view, STS is interesting because it bridges the two major parts of mathematical physics – the dynamical systems theory and topological field theories. Besides these and related disciplines such as algebraic topology and supersymmetric field theories, STS is also connected with the traditional theory of stochastic differential equations and the theory of pseudo-Hermitian operators.

<span class="mw-page-title-main">Dual graviton</span> Hypothetical particle found in supergravity

In theoretical physics, the dual graviton is a hypothetical elementary particle that is a dual of the graviton under electric-magnetic duality, as an S-duality, predicted by some formulations of supergravity in eleven dimensions.

In mathematics, and especially differential geometry and mathematical physics, gauge theory is the general study of connections on vector bundles, principal bundles, and fibre bundles. Gauge theory in mathematics should not be confused with the closely related concept of a gauge theory in physics, which is a field theory which admits gauge symmetry. In mathematics theory means a mathematical theory, encapsulating the general study of a collection of concepts or phenomena, whereas in the physical sense a gauge theory is a mathematical model of some natural phenomenon.

Probabilistic numerics is an active field of study at the intersection of applied mathematics, statistics, and machine learning centering on the concept of uncertainty in computation. In probabilistic numerics, tasks in numerical analysis such as finding numerical solutions for integration, linear algebra, optimization and simulation and differential equations are seen as problems of statistical, probabilistic, or Bayesian inference.

References

  1. Vasiliadis, Athanasios (2019). "An Introduction to Mean Field Games using probabilistic methods". arXiv: 1907.01411 [math.OC].
  2. Jovanovic, Boyan; Rosenthal, Robert W. (1988). "Anonymous Sequential Games". Journal of Mathematical Economics . 17 (1): 77–87. doi:10.1016/0304-4068(88)90029-8.
  3. Huang, M. Y.; Malhame, R. P.; Caines, P. E. (2006). "Large Population Stochastic Dynamic Games: Closed-Loop McKean–Vlasov Systems and the Nash Certainty Equivalence Principle". Communications in Information and Systems. 6 (3): 221–252. doi: 10.4310/CIS.2006.v6.n3.a5 . Zbl   1136.91349.
  4. Nourian, M.; Caines, P. E. (2013). "ε–Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents". SIAM Journal on Control and Optimization. 51 (4): 3302–3331. arXiv: 1209.5684 . doi:10.1137/120889496. S2CID   36197045.
  5. Djehiche, Boualem; Tcheukam, Alain; Tembine, Hamidou (2017). "Mean-Field-Type Games in Engineering". AIMS Electronics and Electrical Engineering. 1 (1): 18–73. arXiv: 1605.03281 . doi:10.3934/ElectrEng.2017.1.18. S2CID   16055840.
  6. Lions, Pierre-Louis; Lasry, Jean-Michel (March 2007). "Large investor trading impacts on volatility". Annales de l'Institut Henri Poincaré C. 24 (2): 311–323. Bibcode:2007AIHPC..24..311L. doi: 10.1016/j.anihpc.2005.12.006 .
  7. Lasry, Jean-Michel; Lions, Pierre-Louis (28 March 2007). "Mean field games". Japanese Journal of Mathematics. 2 (1): 229–260. doi:10.1007/s11537-007-0657-8. S2CID   1963678.
  8. Cardaliaguet, Pierre (September 27, 2013). "Notes on Mean Field Games" (PDF).
  9. Bensoussan, Alain; Frehse, Jens; Yam, Phillip (2013). Mean Field Games and Mean Field Type Control Theory. Springer Briefs in Mathematics. New York: Springer-Verlag. ISBN   9781461485070.[ page needed ]
  10. Achdou, Yves (2020). Mean field games : Cetraro, Italy 2019. Pierre Cardaliaguet, F. Delarue, Alessio Porretta, Filippo Santambrogio. Cham. ISBN   978-3-030-59837-2. OCLC   1238206187.{{cite book}}: CS1 maint: location missing publisher (link)
  11. Doncel, Josu; Gast, Nicolas; Gaujal, Bruno (2019). "Discrete mean field games: Existence of equilibria and convergence". Journal of Dynamics & Games: 1–19. arXiv: 1909.01209 . doi:10.3934/jdg.2019016. S2CID   197507580.
  12. Carmona, Rene (2020). "Applications of mean field games in financial engineering and economic theory". arXiv: 2012.05237 [q-fin.GN].
  13. Lachapelle, Aimé; Wolfram, Marie-Therese (2011). "On a mean field game approach modeling congestion and aversion in pedestrian crowds". Transportation Research Part B: Methodological. 45 (10): 1572–1589. doi:10.1016/j.trb.2011.07.011. S2CID   55991774.
  14. Feinstein, Zachary; Sojmark, Andreas (2019). "A dynamic default contagion model: From Eisenberg-Noe to the mean field". arXiv: 1912.08695 [q-fin.MF].
  15. Huang, Kuang; Chen, Xu; Di, Xuan; Du, Qiang (2021). "Dynamic driving and routing games for autonomous vehicles on networks: A mean field game approach". Transportation Research Part C: Emerging Technologies. 128: 103189. arXiv: 2012.08388 . doi:10.1016/j.trc.2021.103189. S2CID   235436377.
  16. Lee, Wonjun; Liu, Siting; Tembine, Hamidou; Li, Wuchen; Osher, Stanley (2021). "Controlling propagation of epidemics via mean-field control". SIAM Journal on Applied Mathematics. 81 (1): 190–207. arXiv: 2006.01249 . doi:10.1137/20M1342690. S2CID   226299517.
  17. Aurell, Alexander; Carmona, Rene; Dayanikli, Gokce; Lauriere, Mathieu (2022). "Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach". SIAM Journal on Control and Optimization. 60 (2): S294–S322. arXiv: 2011.03105 . doi:10.1137/20M1377862. S2CID   226278147.
  18. Elie, Romuald; Hubert, Emma; Turinici, Gabriel (2020). "Contact rate epidemic control of COVID-19: an equilibrium view". Mathematical Modelling of Natural Phenomena. 15: 35. doi: 10.1051/mmnp/2020022 . S2CID   215814201.