Multivariate Pareto distribution

Last updated May 26, 2023

In statistics, a multivariate Pareto distribution is a multivariate extension of a univariate Pareto distribution.^[1]

Bivariate Pareto distributions
Bivariate Pareto distribution of the first kind
Bivariate Pareto distribution of the second kind
Multivariate Pareto distributions
Multivariate Pareto distribution of the first kind
Multivariate Pareto distribution of the second kind
Multivariate Pareto distribution of the fourth kind
Multivariate Feller–Pareto distribution
References

There are several different types of univariate Pareto distributions including Pareto Types I−IV and Feller−Pareto.^[2] Multivariate Pareto distributions have been defined for many of these types.

Bivariate Pareto distributions

Bivariate Pareto distribution of the first kind

Mardia (1962)^[3] defined a bivariate distribution with cumulative distribution function (CDF) given by

F(x_{1},x_{2})=1-\sum _{i=1}^{2}\left({\frac {x_{i}}{\theta _{i}}}\right)^{-a}+\left(\sum _{i=1}^{2}{\frac {x_{i}}{\theta _{i}}}-1\right)^{-a},\qquad x_{i}>\theta _{i}>0,i=1,2;a>0,

and joint density function

f(x_{1},x_{2})=(a+1)a(\theta _{1}\theta _{2})^{a+1}(\theta _{2}x_{1}+\theta _{1}x_{2}-\theta _{1}\theta _{2})^{-(a+2)},\qquad x_{i}\geq \theta _{i}>0,i=1,2;a>0.

The marginal distributions are Pareto Type 1 with density functions

f(x_{i})=a\theta _{i}^{a}x_{i}^{-(a+1)},\qquad x_{i}\geq \theta _{i}>0,i=1,2.

The means and variances of the marginal distributions are

E[X_{i}]={\frac {a\theta _{i}}{a-1}},a>1;\quad Var(X_{i})={\frac {a\theta _{i}^{2}}{(a-1)^{2}(a-2)}},a>2;\quad i=1,2,

and for a > 2, X₁ and X₂ are positively correlated with

\operatorname {cov} (X_{1},X_{2})={\frac {\theta _{1}\theta _{2}}{(a-1)^{2}(a-2)}},{\text{ and }}\operatorname {cor} (X_{1},X_{2})={\frac {1}{a}}.

Bivariate Pareto distribution of the second kind

Arnold^[4] suggests representing the bivariate Pareto Type I complementary CDF by

{\overline {F}}(x_{1},x_{2})=\left(1+\sum _{i=1}^{2}{\frac {x_{i}-\theta _{i}}{\theta _{i}}}\right)^{-a},\qquad x_{i}>\theta _{i},i=1,2.

If the location and scale parameter are allowed to differ, the complementary CDF is

{\overline {F}}(x_{1},x_{2})=\left(1+\sum _{i=1}^{2}{\frac {x_{i}-\mu _{i}}{\sigma _{i}}}\right)^{-a},\qquad x_{i}>\mu _{i},i=1,2,

which has Pareto Type II univariate marginal distributions. This distribution is called a multivariate Pareto distribution of type II by Arnold.^[4] (This definition is not equivalent to Mardia's bivariate Pareto distribution of the second kind.)^[3]

For a > 1, the marginal means are

E[X_{i}]=\mu _{i}+{\frac {\sigma _{i}}{a-1}},\qquad i=1,2,

while for a > 2, the variances, covariance, and correlation are the same as for multivariate Pareto of the first kind.

Multivariate Pareto distributions

Multivariate Pareto distribution of the first kind

Mardia's^[3]Multivariate Pareto distribution of the First Kind has the joint probability density function given by

f(x_{1},\dots ,x_{k})=a(a+1)\cdots (a+k-1)\left(\prod _{i=1}^{k}\theta _{i}\right)^{-1}\left(\sum _{i=1}^{k}{\frac {x_{i}}{\theta _{i}}}-k+1\right)^{-(a+k)},\qquad x_{i}>\theta _{i}>0,a>0,\qquad (1)

The marginal distributions have the same form as (1), and the one-dimensional marginal distributions have a Pareto Type I distribution. The complementary CDF is

{\overline {F}}(x_{1},\dots ,x_{k})=\left(\sum _{i=1}^{k}{\frac {x_{i}}{\theta _{i}}}-k+1\right)^{-a},\qquad x_{i}>\theta _{i}>0,i=1,\dots ,k;a>0.\quad (2)

The marginal means and variances are given by

E[X_{i}]={\frac {a\theta _{i}}{a-1}},{\text{ for }}a>1,{\text{ and }}Var(X_{i})={\frac {a\theta _{i}^{2}}{(a-1)^{2}(a-2)}},{\text{ for }}a>2.

If a > 2 the covariances and correlations are positive with

\operatorname {cov} (X_{i},X_{j})={\frac {\theta _{i}\theta _{j}}{(a-1)^{2}(a-2)}},\qquad \operatorname {cor} (X_{i},X_{j})={\frac {1}{a}},\qquad i\neq j.

Multivariate Pareto distribution of the second kind

Arnold^[4] suggests representing the multivariate Pareto Type I complementary CDF by

{\overline {F}}(x_{1},\dots ,x_{k})=\left(1+\sum _{i=1}^{k}{\frac {x_{i}-\theta _{i}}{\theta _{i}}}\right)^{-a},\qquad x_{i}>\theta _{i}>0,\quad i=1,\dots ,k.

If the location and scale parameter are allowed to differ, the complementary CDF is

{\overline {F}}(x_{1},\dots ,x_{k})=\left(1+\sum _{i=1}^{k}{\frac {x_{i}-\mu _{i}}{\sigma _{i}}}\right)^{-a},\qquad x_{i}>\mu _{i},\quad i=1,\dots ,k,\qquad (3)

which has marginal distributions of the same type (3) and Pareto Type II univariate marginal distributions. This distribution is called a multivariate Pareto distribution of type II by Arnold.^[4]

For a > 1, the marginal means are

E[X_{i}]=\mu _{i}+{\frac {\sigma _{i}}{a-1}},\qquad i=1,\dots ,k,

while for a > 2, the variances, covariances, and correlations are the same as for multivariate Pareto of the first kind.

Multivariate Pareto distribution of the fourth kind

A random vector X has a k-dimensional multivariate Pareto distribution of the Fourth Kind^[4] if its joint survival function is

{\overline {F}}(x_{1},\dots ,x_{k})=\left(1+\sum _{i=1}^{k}\left({\frac {x_{i}-\mu _{i}}{\sigma _{i}}}\right)^{1/\gamma _{i}}\right)^{-a},\qquad x_{i}>\mu _{i},\sigma _{i}>0,i=1,\dots ,k;a>0.\qquad (4)

The k₁-dimensional marginal distributions (k₁<k) are of the same type as (4), and the one-dimensional marginal distributions are Pareto Type IV.

Multivariate Feller–Pareto distribution

A random vector X has a k-dimensional Feller–Pareto distribution if

X_{i}=\mu _{i}+(W_{i}/Z)^{\gamma _{i}},\qquad i=1,\dots ,k,\qquad (5)

where

W_{i}\sim \Gamma (\beta _{i},1),\quad i=1,\dots ,k,\qquad Z\sim \Gamma (\alpha ,1),

are independent gamma variables.^[4] The marginal distributions and conditional distributions are of the same type (5); that is, they are multivariate Feller–Pareto distributions. The one–dimensional marginal distributions are of Feller−Pareto type.

Related Research Articles

In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is

<span class="mw-page-title-main">Multivariate normal distribution</span> Generalization of the one-dimensional normal distribution to higher dimensions

In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One definition is that a random vector is said to be k-variate normally distributed if every linear combination of its k components has a univariate normal distribution. Its importance derives mainly from the multivariate central limit theorem. The multivariate normal distribution is often used to describe, at least approximately, any set of (possibly) correlated real-valued random variables each of which clusters around a mean value.

In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statistical model, the observed data is most probable. The point in the parameter space that maximizes the likelihood function is called the maximum likelihood estimate. The logic of maximum likelihood is both intuitive and flexible, and as such the method has become a dominant means of statistical inference.

<span class="mw-page-title-main">Law of large numbers</span> Averages of repeated trials converge to the expected value

In probability theory, the law of large numbers (LLN) is a theorem that describes the result of performing the same experiment a large number of times. According to the law, the average of the results obtained from a large number of trials should be close to the expected value and tends to become closer to the expected value as more trials are performed.

In mathematical analysis, Hölder's inequality, named after Otto Hölder, is a fundamental inequality between integrals and an indispensable tool for the study of $L p$ spaces.

Hotellings <i>T</i>-squared distribution

In statistics, particularly in hypothesis testing, the Hotelling's T-squared distribution (T²), proposed by Harold Hotelling, is a multivariate probability distribution that is tightly related to the F-distribution and is most notable for arising as the distribution of a set of sample statistics that are natural generalizations of the statistics underlying the Student's t-distribution. The Hotelling's t-squared statistic (t²) is a generalization of Student's t-statistic that is used in multivariate hypothesis testing.

Directional statistics is the subdiscipline of statistics that deals with directions, axes or rotations in Rⁿ. More generally, directional statistics deals with observations on compact Riemannian manifolds including the Stiefel manifold.

In probability and statistics, a circular distribution or polar distribution is a probability distribution of a random variable whose values are angles, usually taken to be in the range [0, 2π). A circular distribution is often a continuous probability distribution, and hence has a probability density, but such distributions can also be discrete, in which case they are called circular lattice distributions. Circular distributions can be used even when the variables concerned are not explicitly angles: the main consideration is that there is not usually any real distinction between events occurring at the lower or upper end of the range, and the division of the range could notionally be made at any point.

In Bayesian statistics, a maximum a posteriori probability (MAP) estimate is an estimate of an unknown quantity, that equals the mode of the posterior distribution. The MAP can be used to obtain a point estimate of an unobserved quantity on the basis of empirical data. It is closely related to the method of maximum likelihood (ML) estimation, but employs an augmented optimization objective which incorporates a prior distribution over the quantity one wants to estimate. MAP estimation can therefore be seen as a regularization of maximum likelihood estimation.

In probability theory and directional statistics, the von Mises distribution is a continuous probability distribution on the circle. It is a close approximation to the wrapped normal distribution, which is the circular analogue of the normal distribution. A freely diffusing angle $on a circle is a wrapped normally distributed random variable with an unwrapped variance that grows linearly in time. On the other hand, the von Mises distribution is the stationary distribution of a drift and diffusion process on the circle in a harmonic potential, i.e. with a preferred orientation. The von Mises distribution is the maximum entropy distribution for circular data when the real and imaginary parts of the first circular moment are specified. The von Mises distribution is a special case of the von Mises-Fisher distribution on the N -dimensional sphere.$

In estimation theory and decision theory, a Bayes estimator or a Bayes action is an estimator or decision rule that minimizes the posterior expected value of a loss function. Equivalently, it maximizes the posterior expectation of a utility function. An alternative way of formulating an estimator within Bayesian statistics is maximum a posteriori estimation.

Covariance matrix adaptation evolution strategy (CMA-ES) is a particular kind of strategy for numerical optimization. Evolution strategies (ES) are stochastic, derivative-free methods for numerical optimization of non-linear or non-convex continuous optimization problems. They belong to the class of evolutionary algorithms and evolutionary computation. An evolutionary algorithm is broadly based on the principle of biological evolution, namely the repeated interplay of variation and selection: in each generation (iteration) new individuals are generated by variation, usually in a stochastic way, of the current parental individuals. Then, some individuals are selected to become the parents in the next generation based on their fitness or objective function value $. Like this, over the generation sequence, individuals with better and better -values are generated.$

In statistics, the bias of an estimator is the difference between this estimator's expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called unbiased. In statistics, "bias" is an objective property of an estimator. Bias is a distinct concept from consistency: consistent estimators converge in probability to the true value of the parameter, but may be biased or unbiased; see bias versus consistency for more.

A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.

In probability and statistics, the class of exponential dispersion models (EDM) is a set of probability distributions that represents a generalisation of the natural exponential family. Exponential dispersion models play an important role in statistical theory, in particular in generalized linear models because they have a special structure which enables deductions to be made about appropriate statistical inference.

<span class="mw-page-title-main">Shifted log-logistic distribution</span>

The shifted log-logistic distribution is a probability distribution also known as the generalized log-logistic or the three-parameter log-logistic distribution. It has also been called the generalized logistic distribution, but this conflicts with other uses of the term: see generalized logistic distribution.

<span class="mw-page-title-main">Normal-inverse-gamma distribution</span>

In probability theory and statistics, the normal-inverse-gamma distribution is a four-parameter family of multivariate continuous probability distributions. It is the conjugate prior of a normal distribution with unknown mean and variance.

<span class="mw-page-title-main">Wrapped normal distribution</span>

In probability theory and directional statistics, a wrapped normal distribution is a wrapped probability distribution that results from the "wrapping" of the normal distribution around the unit circle. It finds application in the theory of Brownian motion and is a solution to the heat equation for periodic boundary conditions. It is closely approximated by the von Mises distribution, which, due to its mathematical simplicity and tractability, is the most commonly used distribution in directional statistics.

<span class="mw-page-title-main">Wrapped Cauchy distribution</span>

In probability theory and directional statistics, a wrapped Cauchy distribution is a wrapped probability distribution that results from the "wrapping" of the Cauchy distribution around the unit circle. The Cauchy distribution is sometimes known as a Lorentzian distribution, and the wrapped Cauchy distribution may sometimes be referred to as a wrapped Lorentzian distribution.

In probability and statistics, the generalized beta distribution is a continuous probability distribution with four shape parameters, including more than thirty named distributions as limiting or special cases. It has been used in the modeling of income distribution, stock returns, as well as in regression analysis. The exponential generalized beta (EGB) distribution follows directly from the GB and generalizes other common distributions.

References

↑ S. Kotz; N. Balakrishnan; N. L. Johnson (2000). "52". Continuous Multivariate Distributions. Vol. 1 (second ed.). ISBN 0-471-18387-3.
↑ Barry C. Arnold (1983). Pareto Distributions. International Co-operative Publishing House. ISBN 0-89974-012-X. Chapter 3.
1 2 3 Mardia, K. V. (1962). "Multivariate Pareto distributions". Annals of Mathematical Statistics. 33 (3): 1008–1015. doi: 10.1214/aoms/1177704468 .
1 2 3 4 5 6 Barry C. Arnold (1983). Pareto Distributions. International Co-operative Publishing House. ISBN 0-89974-012-X. Chapter 6.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[CMD1-1] S. Kotz; N. Balakrishnan; N. L. Johnson (2000). "52". Continuous Multivariate Distributions. Vol. 1 (second ed.). ISBN 0-471-18387-3.

[arnold3-2] Barry C. Arnold (1983). Pareto Distributions. International Co-operative Publishing House. ISBN 0-89974-012-X. Chapter 3.

[Mardia62-3] 1 2 3 Mardia, K. V. (1962). "Multivariate Pareto distributions". Annals of Mathematical Statistics. 33 (3): 1008–1015. doi: 10.1214/aoms/1177704468 .

[arnold6-4] 1 2 3 4 5 6 Barry C. Arnold (1983). Pareto Distributions. International Co-operative Publishing House. ISBN 0-89974-012-X. Chapter 6.

[1]

[2]

[3]

[4]