Kumaraswamy distribution

Last updated
Kumaraswamy
Probability density function
KumaraswamyT pdf.svg
Cumulative distribution function
Kumaraswamy cdf.svg
Parameters (real)
(real)
Support
PDF
CDF
Quantile
Mean
Median
Mode for
Variance (complicated-see text)
Skewness (complicated-see text)
Excess kurtosis (complicated-see text)
Entropy

In probability and statistics, the Kumaraswamy's double bounded distribution is a family of continuous probability distributions defined on the interval (0,1). It is similar to the beta distribution, but much simpler to use especially in simulation studies since its probability density function, cumulative distribution function and quantile functions can be expressed in closed form. This distribution was originally proposed by Poondi Kumaraswamy [1] for variables that are lower and upper bounded with a zero-inflation. In this first article of the distribution, the natural lower bound of zero for rainfall was modelled using a discrete probability, as rainfall in many places, especially in tropics, has significant nonzero probability. This discrete probability is now called zero-inflation. This was extended to inflations at both extremes [0,1] in the work of Fletcher and Ponnambalam. [2] . A good example for inflations at extremes are the probabilities of full and empty reservoirs and are important for reservoir design.

Contents

Characterization

Probability density function

The probability density function of the Kumaraswamy distribution without considering any inflation is

and where a and b are non-negative shape parameters.

Cumulative distribution function

The cumulative distribution function is

Quantile function

The inverse cumulative distribution function (quantile function) is

Generalizing to arbitrary interval support

In its simplest form, the distribution has a support of (0,1). In a more general form, the normalized variable x is replaced with the unshifted and unscaled variable z where:

Properties

The raw moments of the Kumaraswamy distribution are given by: [3] [4]

where B is the Beta function and Γ(.) denotes the Gamma function. The variance, skewness, and excess kurtosis can be calculated from these raw moments. For example, the variance is:

The Shannon entropy (in nats) of the distribution is: [5]

where is the harmonic number function.

Relation to the Beta distribution

The Kumaraswamy distribution is closely related to Beta distribution. [6] Assume that Xa,b is a Kumaraswamy distributed random variable with parameters a and b. Then Xa,b is the a-th root of a suitably defined Beta distributed random variable. More formally, Let Y1,b denote a Beta distributed random variable with parameters and . One has the following relation between Xa,b and Y1,b.

with equality in distribution.

One may introduce generalised Kumaraswamy distributions by considering random variables of the form , with and where denotes a Beta distributed random variable with parameters and . The raw moments of this generalized Kumaraswamy distribution are given by:

Note that we can re-obtain the original moments setting , and . However, in general, the cumulative distribution function does not have a closed form solution.

Example

An example of the use of the Kumaraswamy distribution is the storage volume of a reservoir of capacity z whose upper bound is zmax and lower bound is 0, which is also a natural example for having two inflations as many reservoirs have nonzero probabilities for both empty and full reservoir states. [2]

Related Research Articles

<span class="mw-page-title-main">Cauchy distribution</span> Probability distribution

The Cauchy distribution, named after Augustin-Louis Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution, Cauchy–Lorentz distribution, Lorentz(ian) function, or Breit–Wigner distribution. The Cauchy distribution is the distribution of the x-intercept of a ray issuing from with a uniformly distributed angle. It is also the distribution of the ratio of two independent normally distributed random variables with mean zero.

<span class="mw-page-title-main">Exponential distribution</span> Probability distribution

In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average rate; the distance parameter could be any meaningful mono-dimensional measure of the process, such as time between production errors, or length along a roll of fabric in the weaving manufacturing process. It is a particular case of the gamma distribution. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless. In addition to being used for the analysis of Poisson point processes it is found in various other contexts.

<span class="mw-page-title-main">Chi-squared distribution</span> Probability distribution and special case of gamma distribution

In probability theory and statistics, the chi-squared distribution with degrees of freedom is the distribution of a sum of the squares of independent standard normal random variables.

<span class="mw-page-title-main">Beta distribution</span> Probability distribution

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] or in terms of two positive parameters, denoted by alpha (α) and beta (β), that appear as exponents of the variable and its complement to 1, respectively, and control the shape of the distribution.

<span class="mw-page-title-main">Gumbel distribution</span> Particular case of the generalized extreme value distribution

In probability theory and statistics, the Gumbel distribution is used to model the distribution of the maximum of a number of samples of various distributions.

<i>F</i>-distribution Continuous probability distribution

In probability theory and statistics, the F-distribution or F-ratio, also known as Snedecor's F distribution or the Fisher–Snedecor distribution, is a continuous probability distribution that arises frequently as the null distribution of a test statistic, most notably in the analysis of variance (ANOVA) and other F-tests.

<span class="mw-page-title-main">Laplace distribution</span> Probability distribution

In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponential distributions spliced together along the abscissa, although the term is also sometimes used to refer to the Gumbel distribution. The difference between two independent identically distributed exponential random variables is governed by a Laplace distribution, as is a Brownian motion evaluated at an exponentially distributed random time. Increments of Laplace motion or a variance gamma process evaluated over the time scale also have a Laplace distribution.

<span class="mw-page-title-main">Dirichlet distribution</span> Probability distribution

In probability and statistics, the Dirichlet distribution, often denoted , is a family of continuous multivariate probability distributions parameterized by a vector of positive reals. It is a multivariate generalization of the beta distribution, hence its alternative name of multivariate beta distribution (MBD). Dirichlet distributions are commonly used as prior distributions in Bayesian statistics, and in fact, the Dirichlet distribution is the conjugate prior of the categorical distribution and multinomial distribution.

<span class="mw-page-title-main">Stable distribution</span> Distribution of variables which satisfies a stability property under linear combinations

In probability theory, a distribution is said to be stable if a linear combination of two independent random variables with this distribution has the same distribution, up to location and scale parameters. A random variable is said to be stable if its distribution is stable. The stable distribution family is also sometimes referred to as the Lévy alpha-stable distribution, after Paul Lévy, the first mathematician to have studied it.

In probability theory and statistics, the generalized extreme value (GEV) distribution is a family of continuous probability distributions developed within extreme value theory to combine the Gumbel, Fréchet and Weibull families also known as type I, II and III extreme value distributions. By the extreme value theorem the GEV distribution is the only possible limit distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. that a limit distribution needs to exist, which requires regularity conditions on the tail of the distribution. Despite this, the GEV distribution is often used as an approximation to model the maxima of long (finite) sequences of random variables.

<span class="mw-page-title-main">Inverse-gamma distribution</span> Two-parameter family of continuous probability distributions

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the reciprocal of a variable distributed according to the gamma distribution.

<span class="mw-page-title-main">Beta prime distribution</span> Probability distribution

In probability theory and statistics, the beta prime distribution is an absolutely continuous probability distribution. If has a beta distribution, then the odds has a beta prime distribution.

<span class="mw-page-title-main">Beta-binomial distribution</span> Discrete probability distribution

In probability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of Bernoulli trials is either unknown or random. The beta-binomial distribution is the binomial distribution in which the probability of success at each of n trials is not fixed but randomly drawn from a beta distribution. It is frequently used in Bayesian statistics, empirical Bayes methods and classical statistics to capture overdispersion in binomial type distributed data.

A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.

<span class="mw-page-title-main">Fréchet distribution</span> Continuous probability distribution

The Fréchet distribution, also known as inverse Weibull distribution, is a special case of the generalized extreme value distribution. It has the cumulative distribution function

The term generalized logistic distribution is used as the name for several different families of probability distributions. For example, Johnson et al. list four forms, which are listed below.

A product distribution is a probability distribution constructed as the distribution of the product of random variables having two other known distributions. Given two statistically independent random variables X and Y, the distribution of the random variable Z that is formed as the product is a product distribution.

In probability theory, a beta negative binomial distribution is the probability distribution of a discrete random variable  equal to the number of failures needed to get successes in a sequence of independent Bernoulli trials. The probability of success on each trial stays constant within any given experiment but varies across different experiments following a beta distribution. Thus the distribution is a compound probability distribution.

<span class="mw-page-title-main">Modified Kumaraswamy distribution</span> Concept in probability theory

In probability theory, the Modified Kumaraswamy (MK) distribution is a two-parameter continuous probability distribution defined on the interval (0,1). It serves as an alternative to the beta and Kumaraswamy distributions for modeling double-bounded random variables. The MK distribution was originally proposed by Sagrillo, Guerra, and Bayer through a transformation of the Kumaraswamy distribution. Its density exhibits an increasing-decreasing-increasing shape, which is not characteristic of the beta or Kumaraswamy distributions. The motivation for this proposal stemmed from applications in hydro-environmental problems.

References

  1. Kumaraswamy, P. (1980). "A generalized probability density function for double-bounded random processes". Journal of Hydrology. 46 (1–2): 79–88. Bibcode:1980JHyd...46...79K. doi:10.1016/0022-1694(80)90036-0. ISSN   0022-1694.
  2. 1 2 Fletcher, S.G.; Ponnambalam, K. (1996). "Estimation of reservoir yield and storage distribution using moments analysis". Journal of Hydrology. 182 (1–4): 259–275. Bibcode:1996JHyd..182..259F. doi:10.1016/0022-1694(95)02946-x. ISSN   0022-1694.
  3. Lemonte, Artur J. (2011). "Improved point estimation for the Kumaraswamy distribution". Journal of Statistical Computation and Simulation. 81 (12): 1971–1982. doi:10.1080/00949655.2010.511621. ISSN   0094-9655.
  4. CRIBARI-NETO, FRANCISCO; SANTOS, JÉSSICA (2019). "Inflated Kumaraswamy distributions" (PDF). Anais da Academia Brasileira de Ciências. 91 (2): e20180955. doi: 10.1590/0001-3765201920180955 . ISSN   1678-2690. PMID   31141016. S2CID   169034252.
  5. Michalowicz, Joseph Victor; Nichols, Jonathan M.; Bucholtz, Frank (2013). Handbook of Differential Entropy. Chapman and Hall/CRC. p. 100. ISBN   9781466583177.
  6. 1 2 Jones, M.C. (2009). "Kumaraswamy's distribution: A beta-type distribution with some tractability advantages". Statistical Methodology. 6 (1): 70–81. doi:10.1016/j.stamet.2008.04.001. ISSN   1572-3127.