Probability density function | |||
Cumulative distribution function | |||
Parameters | shape shape scale | ||
---|---|---|---|
Support | |||
CDF | |||
Quantile | |||
Mean | Contents[1] | ||
Median | |||
Mode | |||
Variance |
The Dagum distribution (or Mielke Beta-Kappa distribution) is a continuous probability distribution defined over positive real numbers. It is named after Camilo Dagum, who proposed it in a series of papers in the 1970s. [2] [3] The Dagum distribution arose from several variants of a new model on the size distribution of personal income and is mostly associated with the study of income distribution. There is both a three-parameter specification (Type I) and a four-parameter specification (Type II) of the Dagum distribution; a summary of the genesis of this distribution can be found in "A Guide to the Dagum Distributions". [4] A general source on statistical size distributions often cited in work using the Dagum distribution is Statistical Size Distributions in Economics and Actuarial Sciences. [5]
The cumulative distribution function of the Dagum distribution (Type I) is given by
The corresponding probability density function is given by
The quantile function is given by
The Dagum distribution can be derived as a special case of the generalized Beta II (GB2) distribution (a generalization of the Beta prime distribution):
There is also an intimate relationship between the Dagum and Singh–Maddala / Burr distribution.
The cumulative distribution function of the Dagum (Type II) distribution adds a point mass at the origin and then follows a Dagum (Type I) distribution over the rest of the support (i.e. over the positive halfline)
The quantile function of the Dagum (Type II) distribution is
This article needs additional citations for verification .(June 2011) |
The Dagum distribution is often used to model income and wealth distribution. The relation between the Dagum Type I and the Gini coefficient is summarized in the formula below: [6]
where is the gamma function. Note that this value is independent from the scale-parameter, .
Derivation of the Gini coefficient for the Dagum distribution |
---|
The Gini coefficient for a continuous probability distribution takes the form: where is the CDF of the distribution and is the expected value. For the Dagum distribution, assuming , the formula for the Gini coefficient becomes: Defining the substitution leads to the simpler equation: And making the substitution further changes the Gini coefficient formula to: Each of the integral components is equivalent to the standard beta function, which may be written in terms of the gamma function: Then using this definition, and the fact that , the Gini coefficient may be written as: with the last term being the Gini coefficient for the Dagum distribution. |
Although the Dagum distribution is not the only three-parameter distribution used to model income distribution, one study found it to usually be a better fit than other three-parameter models. [7]
The Dagum distribution has been extended to model net wealth distribution, accounting for the observed frequencies of negative and null net wealth. This generalized model, known as the Dagum General Model of Net Wealth Distribution, [8] is a mixture model consisting of an atomic distribution at zero (representing economic units with no wealth) with two continuous distributions for negative and positive net wealth.
The Cauchy distribution, named after Augustin-Louis Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution, Cauchy–Lorentz distribution, Lorentz(ian) function, or Breit–Wigner distribution. The Cauchy distribution is the distribution of the x-intercept of a ray issuing from with a uniformly distributed angle. It is also the distribution of the ratio of two independent normally distributed random variables with mean zero.
In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average rate; the distance parameter could be any meaningful mono-dimensional measure of the process, such as time between production errors, or length along a roll of fabric in the weaving manufacturing process. It is a particular case of the gamma distribution. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless. In addition to being used for the analysis of Poisson point processes it is found in various other contexts.
The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actuarial, and many other types of observable phenomena; the principle originally applied to describing the distribution of wealth in a society, fitting the trend that a large portion of wealth is held by a small fraction of the population. The Pareto principle or "80-20 rule" stating that 80% of outcomes are due to 20% of causes was named in honour of Pareto, but the concepts are distinct, and only Pareto distributions with shape value of log45 ≈ 1.16 precisely reflect it. Empirical observation has shown that this 80-20 distribution fits a wide range of cases, including natural phenomena and human activities.
In probability theory and statistics, the gamma distribution is a versatile two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the gamma distribution. There are two equivalent parameterizations in common use:
In vector calculus, Green's theorem relates a line integral around a simple closed curve C to a double integral over the plane region D bounded by C. It is the two-dimensional special case of Stokes' theorem. In one dimension, it is equivalent to the fundamental theorem of calculus. In three dimensions, it is equivalent to the divergence theorem.
In probability theory, a distribution is said to be stable if a linear combination of two independent random variables with this distribution has the same distribution, up to location and scale parameters. A random variable is said to be stable if its distribution is stable. The stable distribution family is also sometimes referred to as the Lévy alpha-stable distribution, after Paul Lévy, the first mathematician to have studied it.
The Duffing equation, named after Georg Duffing (1861–1944), is a non-linear second-order differential equation used to model certain damped and driven oscillators. The equation is given by where the (unknown) function is the displacement at time t, is the first derivative of with respect to time, i.e. velocity, and is the second time-derivative of i.e. acceleration. The numbers and are given constants.
In probability theory and statistics, the beta prime distribution is an absolutely continuous probability distribution. If has a beta distribution, then the odds has a beta prime distribution.
In statistics, the multivariate t-distribution is a multivariate probability distribution. It is a generalization to random vectors of the Student's t-distribution, which is a distribution applicable to univariate random variables. While the case of a random matrix could be treated within this structure, the matrix t-distribution is distinct and makes particular use of the matrix structure.
A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.
In probability and statistics, the log-logistic distribution is a continuous probability distribution for a non-negative random variable. It is used in survival analysis as a parametric model for events whose rate increases initially and decreases later, as, for example, mortality rate from cancer following diagnosis or treatment. It has also been used in hydrology to model stream flow and precipitation, in economics as a simple model of the distribution of wealth or income, and in networking to model the transmission times of data considering both the network and the software.
The Herschel–Bulkley fluid is a generalized model of a non-Newtonian fluid, in which the strain experienced by the fluid is related to the stress in a complicated, non-linear way. Three parameters characterize this relationship: the consistency k, the flow index n, and the yield shear stress . The consistency is a simple constant of proportionality, while the flow index measures the degree to which the fluid is shear-thinning or shear-thickening. Ordinary paint is one example of a shear-thinning fluid, while oobleck provides one realization of a shear-thickening fluid. Finally, the yield stress quantifies the amount of stress that the fluid may experience before it yields and begins to flow.
In mathematics, Montgomery's pair correlation conjecture is a conjecture made by Hugh Montgomery that the pair correlation between pairs of zeros of the Riemann zeta function is
The multivariate stable distribution is a multivariate probability distribution that is a multivariate generalisation of the univariate stable distribution. The multivariate stable distribution defines linear relations between stable distribution marginals. In the same way as for the univariate case, the distribution is defined in terms of its characteristic function.
In statistics, the Davis distributions are a family of continuous probability distributions. It is named after Harold T. Davis (1892–1974), who in 1941 proposed this distribution to model income sizes.. It is a generalization of the Planck's law of radiation from statistical physics.
In probability and statistics, the generalized beta distribution is a continuous probability distribution with four shape parameters, including more than thirty named distributions as limiting or special cases. A fifth parameter for scaling is sometimes included, while a sixth parameter for location is customarily left implicit and excluded from the characterization. The distribution has been used in the modeling of income distribution, stock returns, as well as in regression analysis. The exponential generalized beta (EGB) distribution follows directly from the GB and generalizes other common distributions.
The Johnson's SU-distribution is a four-parameter family of probability distributions first investigated by N. L. Johnson in 1949. Johnson proposed it as a transformation of the normal distribution:
In two-dimensional conformal field theory, Virasoro conformal blocks are special functions that serve as building blocks of correlation functions. On a given punctured Riemann surface, Virasoro conformal blocks form a particular basis of the space of solutions of the conformal Ward identities. Zero-point blocks on the torus are characters of representations of the Virasoro algebra; four-point blocks on the sphere reduce to hypergeometric functions in special cases, but are in general much more complicated. In two dimensions as in other dimensions, conformal blocks play an essential role in the conformal bootstrap approach to conformal field theory.
An additive process, in probability theory, is a cadlag, continuous in probability stochastic process with independent increments. An additive process is the generalization of a Lévy process. An example of an additive process that is not a Lévy process is a Brownian motion with a time-dependent drift. The additive process was introduced by Paul Lévy in 1937.
In statistics, the matrix F distribution is a matrix variate generalization of the F distribution which is defined on real-valued positive-definite matrices. In Bayesian statistics it can be used as the semi conjugate prior for the covariance matrix or precision matrix of multivariate normal distributions, and related distributions.