Information geometry

Last updated
The set of all normal distributions forms a statistical manifold with hyperbolic geometry. Normal Distribution PDF.svg
The set of all normal distributions forms a statistical manifold with hyperbolic geometry.

Information geometry is an interdisciplinary field that applies the techniques of differential geometry to study probability theory and statistics. [1] It studies statistical manifolds, which are Riemannian manifolds whose points correspond to probability distributions.

Contents

Introduction

Historically, information geometry can be traced back to the work of C. R. Rao, who was the first to treat the Fisher matrix as a Riemannian metric. [2] [3] The modern theory is largely due to Shun'ichi Amari, whose work has been greatly influential on the development of the field. [4]

Classically, information geometry considered a parametrized statistical model as a Riemannian manifold. For such models, there is a natural choice of Riemannian metric, known as the Fisher information metric. In the special case that the statistical model is an exponential family, it is possible to induce the statistical manifold with a Hessian metric (i.e a Riemannian metric given by the potential of a convex function). In this case, the manifold naturally inherits two flat affine connections, as well as a canonical Bregman divergence. Historically, much of the work was devoted to studying the associated geometry of these examples. In the modern setting, information geometry applies to a much wider context, including non-exponential families, nonparametric statistics, and even abstract statistical manifolds not induced from a known statistical model. The results combine techniques from information theory, affine differential geometry, convex analysis and many other fields.

The standard references in the field are Shun’ichi Amari and Hiroshi Nagaoka's book, Methods of Information Geometry, [5] and the more recent book by Nihat Ay and others. [6] A gentle introduction is given in the survey by Frank Nielsen. [7] In 2018, the journal Information Geometry was released, which is devoted to the field.

Contributors

The history of information geometry is associated with the discoveries of at least the following people, and many others.

Applications

As an interdisciplinary field, information geometry has been used in various applications.

Here an incomplete list:

See also

Related Research Articles

<span class="mw-page-title-main">Differential topology</span> Branch of mathematics

In mathematics, differential topology is the field dealing with the topological properties and smooth properties of smooth manifolds. In this sense differential topology is distinct from the closely related field of differential geometry, which concerns the geometric properties of smooth manifolds, including notions of size, distance, and rigid shape. By comparison differential topology is concerned with coarser properties, such as the number of holes in a manifold, its homotopy type, or the structure of its diffeomorphism group. Because many of these coarser properties may be captured algebraically, differential topology has strong links to algebraic topology.

<span class="mw-page-title-main">Differential geometry</span> Branch of mathematics dealing with functions and geometric structures on differentiable manifolds

Differential geometry is a mathematical discipline that studies the geometry of smooth shapes and smooth spaces, otherwise known as smooth manifolds. It uses the techniques of differential calculus, integral calculus, linear algebra and multilinear algebra. The field has its origins in the study of spherical geometry as far back as antiquity. It also relates to astronomy, the geodesy of the Earth, and later the study of hyperbolic geometry by Lobachevsky. The simplest examples of smooth spaces are the plane and space curves and surfaces in the three-dimensional Euclidean space, and the study of these shapes formed the basis for development of modern differential geometry during the 18th and 19th centuries.

<span class="mw-page-title-main">Shing-Tung Yau</span> Chinese mathematician

Shing-Tung Yau is a Chinese-American mathematician. He is the director of the Yau Mathematical Sciences Center at Tsinghua University and Professor Emeritus at Harvard University. Until 2022, Yau was the William Caspar Graustein Professor of Mathematics at Harvard, at which point he moved to Tsinghua.

In information geometry, the Fisher information metric is a particular Riemannian metric which can be defined on a smooth statistical manifold, i.e., a smooth manifold whose points are probability measures defined on a common probability space. It can be used to calculate the informational difference between measurements.

In mathematical statistics, the Fisher information is a way of measuring the amount of information that an observable random variable X carries about an unknown parameter θ of a distribution that models X. Formally, it is the variance of the score, or the expected value of the observed information.

In mathematical statistics, the Kullback–Leibler (KL) divergence, denoted , is a type of statistical distance: a measure of how one probability distribution P is different from a second, reference probability distribution Q. Mathematically, it is defined as

A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics and are used to model various behaviours of stochastic models such as stock prices, random growth models or physical systems that are subjected to thermal fluctuations.

<span class="mw-page-title-main">Ole Barndorff-Nielsen</span> Danish statistician (1935–2022)

Ole Eiler Barndorff-Nielsen was a Danish statistician who has contributed to many areas of statistical science.

Ruppeiner geometry is thermodynamic geometry using the language of Riemannian geometry to study thermodynamics. George Ruppeiner proposed it in 1979. He claimed that thermodynamic systems can be represented by Riemannian geometry, and that statistical properties can be derived from the model.

<span class="mw-page-title-main">Jean-Michel Bismut</span> French mathematician (born 1948)

Jean-Michel Bismut is a French mathematician who has been a professor at the Université Paris-Sud since 1981. His mathematical career covers two apparently different branches of mathematics: probability theory and differential geometry. Ideas from probability play an important role in his works on geometry.

Damiano Brigo is a mathematician known for research in mathematical finance, filtering theory, stochastic analysis with differential geometry, probability theory and statistics, authoring more than 130 research publications and three monographs. From 2012 he serves as full professor with a chair in mathematical finance at the Department of Mathematics of Imperial College London, where he headed the Mathematical Finance group in 2012–2019. He is also a well known quantitative finance researcher, manager and advisor in the industry. His research has been cited and published also in mainstream industry publications, including Risk Magazine, where he has been the most cited author in the twenty years 1998–2017. He is often requested as a plenary or invited speaker both at academic and industry international events. Brigo's research has also been used in court as support for legal proceedings.

This page lists articles related to probability theory. In particular, it lists many articles corresponding to specific probability distributions. Such articles are marked here by a code of the form (X:Y), which refers to number of random variables involved and the type of the distribution. For example (2:DC) indicates a distribution with two random variables, discrete or continuous. Other codes are just abbreviations for topics. The list of codes can be found in the table of contents.

In information geometry, a divergence is a kind of statistical distance: a binary function which establishes the separation from one probability distribution to another on a statistical manifold.

Mathematics is a broad subject that is commonly divided in many areas that may be defined by their objects of study, by the used methods, or by both. For example, analytic number theory is a subarea of number theory devoted to the use of methods of analysis for the study of natural numbers.

In mathematics, a statistical manifold is a Riemannian manifold, each of whose points is a probability distribution. Statistical manifolds provide a setting for the field of information geometry. The Fisher information metric provides a metric on these manifolds. Following this definition, the log-likelihood function is a differentiable map and the score is an inclusion.

<i>Journal of Geometry and Physics</i> Academic journal

The Journal of Geometry and Physics is a scientific journal in mathematical physics. Its scope is to stimulate the interaction between geometry and physics by publishing primary research and review articles which are of common interest to practitioners in both fields. The journal is published by Elsevier since 1984.

<span class="mw-page-title-main">Stanislav Molchanov</span> Soviet American mathematician

Stanislav Alexeyevich Molchanov is a Soviet and American mathematician.

In information geometry, Chentsov's theorem states that the Fisher information metric is, up to rescaling, the unique Riemannian metric on a statistical manifold that is invariant under sufficient statistics.

Projection filters are a set of algorithms based on stochastic analysis and information geometry, or the differential geometric approach to statistics, used to find approximate solutions for filtering problems for nonlinear state-space systems. The filtering problem consists of estimating the unobserved signal of a random dynamical system from partial noisy observations of the signal. The objective is computing the probability distribution of the signal conditional on the history of the noise-perturbed observations. This distribution allows for calculations of all statistics of the signal given the history of observations. If this distribution has a density, the density satisfies specific stochastic partial differential equations (SPDEs) called Kushner-Stratonovich equation, or Zakai equation. It is known that the nonlinear filter density evolves in an infinite dimensional function space.

References

  1. Nielsen, Frank (2022). "The Many Faces of Information Geometry" (PDF). Notices of the AMS. 69 (1). American Mathematical Society: 36-45.
  2. Rao, C. R. (1945). "Information and Accuracy Attainable in the Estimation of Statistical Parameters". Bulletin of the Calcutta Mathematical Society. 37: 81–91. Reprinted in Breakthroughs in Statistics. Springer. 1992. pp. 235–247. doi:10.1007/978-1-4612-0919-5_16. S2CID   117034671.
  3. Nielsen, F. (2013). "Cramér-Rao Lower Bound and Information Geometry". In Bhatia, R.; Rajan, C. S. (eds.). Connected at Infinity II: On the Work of Indian Mathematicians. Texts and Readings in Mathematics. Vol. Special Volume of Texts and Readings in Mathematics (TRIM). Hindustan Book Agency. pp. 18–37. arXiv: 1301.3578 . doi:10.1007/978-93-86279-56-9_2. ISBN   978-93-80250-51-9. S2CID   16759683.
  4. Amari, Shun'ichi (1983). "A foundation of information geometry". Electronics and Communications in Japan. 66 (6): 1–10. doi:10.1002/ecja.4400660602.
  5. Amari, Shun'ichi; Nagaoka, Hiroshi (2000). Methods of Information Geometry. Translations of Mathematical Monographs. Vol. 191. American Mathematical Society. ISBN   0-8218-0531-2.
  6. Ay, Nihat; Jost, Jürgen; Lê, Hông Vân; Schwachhöfer, Lorenz (2017). Information Geometry. Ergebnisse der Mathematik und ihrer Grenzgebiete. Vol. 64. Springer. ISBN   978-3-319-56477-7.
  7. Nielsen, Frank (2018). "An Elementary Introduction to Information Geometry". Entropy. 22 (10).
  8. Kass, R. E.; Vos, P. W. (1997). Geometrical Foundations of Asymptotic Inference. Series in Probability and Statistics. Wiley. ISBN   0-471-82668-5.
  9. Brigo, Damiano; Hanzon, Bernard; LeGland, Francois (1998). "A differential geometric approach to nonlinear filtering: the projection filter" (PDF). IEEE Transactions on Automatic Control. 43 (2): 247–252. doi:10.1109/9.661075.
  10. van Handel, Ramon; Mabuchi, Hideo (2005). "Quantum projection filter for a highly nonlinear model in cavity QED". Journal of Optics B: Quantum and Semiclassical Optics. 7 (10): S226–S236. arXiv: quant-ph/0503222 . Bibcode:2005JOptB...7S.226V. doi:10.1088/1464-4266/7/10/005. S2CID   15292186.
  11. Zlochin, Mark; Baram, Yoram (2001). "Manifold Stochastic Dynamics for Bayesian Learning". Neural Computation. 13 (11): 2549–2572. doi:10.1162/089976601753196021.
  12. Amari, Shun'ichi (1985). Differential-Geometrical Methods in Statistics. Lecture Notes in Statistics. Berlin: Springer-Verlag. ISBN   0-387-96056-2.
  13. Murray, M.; Rice, J. (1993). Differential Geometry and Statistics. Monographs on Statistics and Applied Probability. Vol. 48. Chapman and Hall. ISBN   0-412-39860-5.
  14. Marriott, Paul; Salmon, Mark, eds. (2000). Applications of Differential Geometry to Econometrics. Cambridge University Press. ISBN   0-521-65116-6.