This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
Bayes space is a function space defined as an equivalence class of measures with the same null-sets. Two measures are defined to be equivalent if they are proportional. The basic ideas of Bayes spaces have their roots in Compositional Data Analysis and the Aitchison geometry. [1] Theoretical applications are mainly in statistics, specifically functional data analysis of density functions, aka density data analysis. [2] [3] [4] [5] Practical applications are in geochemistry [6] , COVID-19 modelling [7] , sediment analysis [8] and developmental research [9] . Alternative approaches to density analysis are based on the Wasserstein metric, often termed Wasserstein regression, have also been applied to medicine [10] .
The basic structure of the Bayes space is that of a vector space, with addition and multiplication being defined by perturbation and powering. [11] The space is formed over a -finite reference/base measure, denoted or depending on whether it is infinite or finite. Densities are considered as Radon-Nikodym derivatives of the measures with same null-sets as the base measure, and are equivalent if they are proportional. In case of finite base measures, Hilbert space structure can be achieved by defining a centered log-ratio transformation on the measures, mapping them to a subset of consisting of functions integrating to 0. [12]
Consider a finite base measure (not necessarily a probability measure) on a domain . This may be a uniform distribution on a bounded interval, or it can be a Radon-Nikodym derivative of the Lebesgue measure (the Gaussian distribution, for example). If we take two densities with respect to , they are said to be B-equivalent if there exists a s.t , denoted (the convention is used in cases where a measure is infinite). It can be shown that is an equivalence relation. The Bayes space is defined as the quotient space of all measures with the same null-sets in as under the equivalence relation .
The first challenge to analysing density functions is that is not linear space under ordinary addition and multiplication since the ordinary difference between two densities would not be non-negative everywhere. Like in the Aitchison geometry for finite dimensional data, perturbation and powering is defined for densities:
Perturbation
Powering
where are densities in and is some real number. It can be shown using the properties of multiplication and powering of real numbers that forms a vector space over the real numbers.
The definition of Bayes space does not strictly require a finite reference measure . If Bayes space is defined over an infinite reference measure , it must be -finite (like the Lebesgue measure). The finite reference measure is, however, necessary for adding Hilbert space structure to a subset of . Consider the subspace
. For , this is a linear subspace and isometrically isomorphic to the Hilbert space via the centered log-ratio (clr) transformation . The subspace of log-square integrable functions is termed the Bayes Hilbert space. It can be shown that the clr transformation is a linear isomorphism between the two spaces. Defining an inner product on as the inner product of the clr transformations will provide the Hilbert space structure for , obtaining the centered log-ratio transformation as a linear isometry.
The measure does not have to be univariate (one-dimensional), but can also be defined as a product measure on Cartesian products, characterising bivariate (two-dimensional) or multivariate densities. The geometric structure of Hilbert spaces can be used to decompose multivariate densities orthogonally into independent and interaction parts using the concept of "geometric marginals". [13] [14] [9] This decomposition has relations to copula theory. [14] The geometry in defines norms on densities that can be used to quantify "relative simplicial deviance", which is measure of how much of a bivariate distribution can be explained by the interaction part; [13] in the multivariate case the relative simplicial deviance can be generalised to the "information composition". [14]