Metalog distribution

Last updated September 05, 2023

The metalog distribution is a flexible continuous probability distribution designed for ease of use in practice. Together with its transforms, the metalog family of continuous distributions is unique because it embodies all of following properties: virtually unlimited shape flexibility; a choice among unbounded, semi-bounded, and bounded distributions; ease of fitting to data with linear least squares; simple, closed-form quantile function (inverse CDF) equations that facilitate simulation; a simple, closed-form PDF; and Bayesian updating in closed form in light of new data. Moreover, like a Taylor series, metalog distributions may have any number of terms, depending on the degree of shape flexibility desired and other application needs.

Applications where metalog distributions can be useful typically involve fitting empirical data, simulated data, or expert-elicited quantiles to smooth, continuous probability distributions. Fields of application are wide-ranging, and include economics, science, engineering, and numerous other fields. The metalog distributions, also known as the Keelin distributions, were first published in 2016^[1] by Tom Keelin.^[2]

History

The history of probability distributions can be viewed, in part, as a progression of developments towards greater flexibility in shape and bounds when fitting to data. The normal distribution was first published in 1756,^[3] and Bayes’ theorem in 1763.^[4] The normal distribution laid the foundation for much of the development of classical statistics. In contrast, Bayes' theorem laid the foundation for the state-of-information, belief-based probability representations. Because belief-based probabilities can take on any shape and may have natural bounds, probability distributions flexible enough to accommodate both were needed. Moreover, many empirical and experimental data sets exhibited shapes that could not be well matched by the normal or other continuous distributions. So began the search for continuous probability distributions with flexible shapes and bounds.

Early in the 20th century, the Pearson ^[5] family of distributions, which includes the normal, beta, uniform, gamma, student-t, chi-square, F, and five others,^[6] emerged as a major advance in shape flexibility. These were followed by the Johnson ^[7]^[8] distributions. Both families can represent the first four moments of data (mean, variance, skewness, and kurtosis) with smooth continuous curves. However, they have no ability to match fifth or higher-order moments. Moreover, for a given skewness and kurtosis, there is no choice of bounds. For example, matching the first four moments of a data set may yield a distribution with a negative lower bound, even though it might be known that the quantity in question cannot be negative. Finally, their equations include intractable integrals and complex statistical functions, so that fitting to data typically requires iterative methods.

Early in the 21st century, decision analysts began working to develop continuous probability distributions that would exactly fit any specified three points on the cumulative distribution function for an uncertain quantity (e.g., expert-elicited $0.10,0.50$ , and $0.90$ quantiles). The Pearson and the Johnson family distributions were generally inadequate for this purpose. In addition, decision analysts also sought probability distributions that would be easy to parameterize with data (e.g., by using linear least squares, or equivalently, multiple linear regression). Introduced in 2011, the class of quantile-parameterized distributions (QPDs) accomplished both goals. While being a significant advance for this reason, the QPD originally used to illustrate this class of distributions, the Simple Q-Normal distribution,^[9] had less shape flexibility than the Pearson and Johnson families, and lacked the ability to represent semi-bounded and bounded distributions. Shortly thereafter, Keelin^[1] developed the family of metalog distributions, another instance of the QPD class, which is more shape-flexible than the Pearson and Johnson families, offers a choice of boundedness, has closed-form equations that can be fit to data with linear least squares, and has closed-form quantile functions, which facilitate Monte Carlo simulation.

Definition and quantile function

The metalog distribution is a generalization of the logistic distribution, where the term "metalog" is short for "metalogistic". Starting with the logistic quantile function, $x=Q(y)=\mu +s{\mbox{ }}\ln {\Bigl (}{y \over {1-y}}{\Bigr )}$ , Keelin substituted power series expansions in cumulative probability $y=F(x)$ for the $\mu$ and the $s$ parameters, which control location and scale, respectively.^[10]

\mu =a_{1}+a_{4}(y-0.5)+a_{5}(y-0.5)^{2}+a_{7}(y-0.5)^{3}+a_{9}(y-0.5)^{4}+\dots

s=a_{2}+a_{3}(y-0.5)+a_{6}(y-0.5)^{2}+a_{8}(y-0.5)^{3}+a_{10}(y-0.5)^{4}+\dots

Keelin's rationale for this substitution was fivefold.^[10] First, the resulting quantile function would have significant shape flexibility, governed by the coefficients $a_{i}$ . Second, it would have a simple closed form that is linear in these coefficients, implying that they could easily be determined from CDF data by linear least squares. Third, the resulting quantile function would be smooth, differentiable, and analytic, ensuring that a smooth, closed-form PDF would be available. Fourth, simulation would be facilitated by the resulting closed-form inverse CDF. Fifth, like a Taylor series, any number of terms $k$ could be used, depending on the degree of shape flexibility desired and other application needs.

Note that the subscripts of the $a$ -coefficients are such that $a_{1}$ and $a_{4}$ are in the $\mu$ expansion, $a_{2}$ and $a_{3}$ are in the $s$ expansion, and subscripts alternate thereafter. This ordering was chosen so that the first two terms in the resulting metalog quantile function correspond to the logistic distribution exactly; adding a third term with $a_{3}\neq 0$ adjusts skewness; adding a fourth term with $a_{4}\neq 0$ adjusts kurtosis primarily; and adding subsequent non-zero terms yields more nuanced shape refinements.^[10]^: p.252

Rewriting the logistic quantile function to incorporate the above substitutions for $\mu$ and $s$ yields the metalog quantile function, for cumulative probability $0<y<1$ .

M_{k}(y)=\left\{{\begin{array}{ll}a_{1}+a_{2}\ln {\Bigl (}{y \over {1-y}}{\Bigr )}&{\mbox{for }}k=2\\a_{1}+a_{2}\ln {\Bigl (}{y \over {1-y}}{\Bigr )}+a_{3}(y-0.5)\ln {\Bigl (}{y \over {1-y}}{\Bigr )}&{\mbox{for }}k=3\\a_{1}+a_{2}\ln {\Bigl (}{y \over {1-y}}{\Bigr )}+a_{3}(y-0.5)\ln {\Bigl (}{y \over {1-y}}{\Bigr )}+a_{4}(y-0.5)&{\mbox{for }}k=4\\M_{k-1}(y)+a_{k}(y-0.5)^{k-1 \over 2}&{\mbox{for odd }}k\geq 5\\M_{k-1}(y)+a_{k}(y-0.5)^{{k \over 2}-1}\ln {\Bigl (}{y \over {1-y}}{\Bigr )}&{\mbox{for even }}k\geq 6\\\end{array}}\right.

Equivalently, the metalog quantile function can be expressed in terms of basis functions: $M_{k}(y)=\sum _{i=1}^{k}a_{i}g_{i}(y)$ , where the metalog basis functions are $g_{1}(y)=1,{\mbox{ }}g_{2}(y)=\ln {\Bigl (}{y \over {1-y}}{\Bigr )},$ and each subsequent $g_{i}(y)$ is defined as the expression that is multiplied by $a_{i}$ in the equation for $M_{k}(y)$ above. Note that coefficient $a_{1}$ is the median, since all other terms equal zero when $y=0.5$ . Special cases of the metalog quantile function are the logistic distribution ( $k=2$ ) and the uniform distribution ( $k\geq 4,a_{1}=0.5,a_{4}=1,a_{i}=0$ otherwise).

Probability density function

Differentiating $x=M_{k}(y)$ with respect to $y$ yields the quantile density function^[11] $q(y)=dx/dy$ . The reciprocal of this quantity, $(q(y))^{-1}=dy/dx=f(Q(y))$ , is the probability density function expressed as a p-PDF,^[12]

m_{k}(y)=\left\{{\begin{array}{ll}{y(1-y) \over {a_{2}}}&{\mbox{for }}k=2\\{\Bigl (}{a_{2} \over {y(1-y)}}+a_{3}{\Bigl (}{y-0.5 \over {y(1-y)}}+\ln {y \over {1-y}}{\Bigr )}{\Bigr )}^{-1}&{\mbox{for }}k=3\\{\Bigl (}{a_{2} \over {y(1-y)}}+a_{3}{\Bigl (}{y-0.5 \over {y(1-y)}}+\ln {y \over {1-y}}{\Bigr )}+a_{4}{\Bigr )}^{-1}&{\mbox{for }}k=4\\{\Bigl (}{1 \over {m_{k-1}(y)}}+a_{k}{k-1 \over 2}(y-0.5)^{(k-3)/2}{\Bigr )}^{-1}&{\mbox{for odd }}k\geq 5\\{\Bigl (}{1 \over {m_{k-1}(y)}}+a_{k}{\Bigl (}{(y-0.5)^{k/2-1} \over {y(1-y)}}+({k \over 2}-1)(y-0.5)^{(k/2-2)}\ln {y \over {1-y}}{\Bigr )}{\Bigr )}^{-1}&{\mbox{for even }}k\geq 6,\\\end{array}}\right.

which may be equivalently expressed in terms of basis functions as

m_{k}(y)=\left(\sum _{i=1}^{k}a_{i}{{dg_{i}(y)} \over {dy}}\right)^{-1}

where

0<y<1

.

Note that this PDF is expressed as a function of cumulative probability, $y$ , rather than variable of interest, $x$ . To plot the PDF (e.g., as shown in the figures on this page), one can vary $y\in (0,1)$ parametrically, and then plot $x=M_{k}(y)$ on the horizontal axis and $m_{k}(y)$ on the vertical axis.

Based on the above equations and the following transformations that enable a choice of bounds, the family of metalog distributions is composed of unbounded, semibounded, and bounded metalogs, along with their symmetric-percentile triplet (SPT) special cases.

Unbounded, semi-bounded, and bounded metalog distributions

As defined above, the metalog distribution is unbounded, except in the unusual special case where $a_{i}=0$ for all terms that contain $\ln {\Bigl (}{y \over {1-y}}{\Bigr )}$ . However, many applications require flexible probability distributions that have a lower bound $b_{l}$ , an upper bound $b_{u}$ , or both. To meet this need, Keelin used transformations to derive semi-bounded and bounded metalog distributions.^[1] Such transformations are governed by a general property of quantile functions: for any quantile function $x=Q(y)$ and increasing function $z(x),x=z^{-1}(Q(y))$ is also a quantile function.^[13] For example, the quantile function of the normal distribution is $x=\mu +\sigma \Phi ^{-1}(y)$ ; since the natural logarithm, $z(x)=\ln(x-b_{l})$ , is an increasing function, $x=b_{l}+e^{\mu +\sigma \Phi ^{-1}(y)}$ is the quantile function of the lognormal distribution. Analogously, applying this property to the metalog quantile function $M_{k}(y)$ using the transformations below yields the semi-bounded and bounded members of the metalog family. By considering $z(x)$ to be metalog-distributed, all members of the metalog family meet Keelin and Powley's^[9] definition of a quantile-parameterized distribution and thus possess the properties thereof.

{\begin{array}{l|c|c|c|c|c}&&&&&{\mbox{shape}}\\&{\mbox{transformation}}&{\mbox{quantile function}}&{\mbox{PDF}}&{\mbox{where}}&{\mbox{parameters}}\\\hline {\mbox{Metalog (unbounded)}}&z(x)=x&M_{k}(y)&m_{k}(y)&0<y<1&k-2\\\hline {\mbox{Log metalog}}&z(x)=\ln(x-b_{l})&M_{k}^{\log(}y)=b_{l}+e^{M_{k}(y)}&m_{k}^{\log(}y)=m_{k}(y)e^{-M_{k}(y)}&0<y<1&k-1\\{\mbox{(bounded below)}}&&=b_{l}&=0&y=0\\\hline {\mbox{Negative-log metalog}}&z(x)=-\ln(b_{u}-x)&M_{k}^{\operatorname {nlog} }(y)=b_{u}-e^{-M_{k}(y)}&m_{k}^{\operatorname {nlog} }(y)=m_{k}(y)e^{M_{k}(y)}&0<y<1&k-1\\{\mbox{(bounded above)}}&&=b_{u}&=0&y=1\\\hline {\mbox{Logit metalog}}&z(x)=\ln {\Bigl (}{x-b_{l} \over {b_{u}-x}}{\Bigr )}&M_{k}^{\operatorname {logit} }(y)={b_{l}+b_{u}e^{M_{k}(y)} \over {1+e^{M_{k}(y)}}}&m_{k}^{\operatorname {logit} }(y)=m_{k}(y){{\bigl (}1+e^{M_{k}(y)}{\bigr )}^{2} \over {(b_{u}-b_{l})e^{M_{k}(y)}}}&0<y<1&k\\{\mbox{(bounded)}}&&=b_{l}&=0&y=0\\&&=b_{u}&=0&y=1\end{array}}

Note that the number of shape parameters in the metalog family increases linearly with the number of terms $k$ . Therefore, any of the above metalogs may have any number of shape parameters. By contrast, the Pearson and Johnson families of distributions are limited to two shape parameters.

SPT metalog distributions

The symmetric-percentile triplet (SPT) metalog distributions are a three-term $(k=3)$ special case of the unbounded, semi-bounded, and bounded metalog distributions.^[14] These are parameterized by the three $(x,y)$ points off the CDF curve, of the form $(x_{1},\alpha )$ , $(x_{2},0.5)$ , and $(x_{3},1-\alpha )$ , where $0<\alpha <0.5$ . SPT metalogs are useful when, for example, quantiles $(x_{1},x_{2},x_{3})$ corresponding to the CDF probabilities (e.g. $0.1,0.5,0.9$ ) are elicited from an expert and used to parameterize the three-term metalog distributions. As noted below, certain mathematical properties are simplified by the SPT parameterization.

Properties

The metalog family of probability distributions has the following properties.

Feasibility

A function of the form of $M_{k}(y)$ or any of its above transforms is a feasible probability distribution if and only if its PDF is greater than zero for all $y\in (0,1).$ ^[9] This implies a feasibility constraint on the set of coefficients ${\boldsymbol {a}}=(a_{1},...,a_{k})\in \mathbb {R} ^{k}$ ,

m_{k}(y)>0

for all

y\in (0,1)

.

In practical applications, feasibility must generally be checked rather than assumed. For $k=2$ , $a_{2}>0$ ensures feasibility. For $k=3$ (including SPT metalogs), the feasibility condition is $a_{2}>0$ and ${|a_{3}|/a_{2}}<1.66711$ .^[14] For $k=4$ , a similar closed form has been derived.^[15] For $k\geq 5$ , feasibility is typically checked graphically or numerically.

The unbounded metalog and its above transforms share the same set of feasible coefficients.^[16] Therefore, for a given set of coefficients, confirming that $m_{k}(y)>0$ for all $y\in (0,1)$ is sufficient regardless of the transform in use.

Convexity

The set of feasible metalog coefficients $S_{\boldsymbol {a}}=\{{\boldsymbol {a}}\in \mathbb {R} ^{k}|m_{k}(y)>0$ for all $y\in (0,1)\}$ is convex. Because convex optimization problems require convex feasible sets, this property can simplify optimization problems involving metalogs. Moreover, this property guarantees that any convex combination of the ${\boldsymbol {a}}$ vectors of feasible metalogs is feasible, which is useful, for example, when combining the opinion of multiple experts^[17] or interpolating among feasible metalogs.^[18] By implication, any probabilistic mixture of metalog distributions is itself a metalog.

Fitting to data

The coefficients ${\boldsymbol {a}}$ can be determined from data by linear least squares. Given $n$ data points $(x_{i},y_{i})$ that are intended to characterize a metalog CDF, and an $n\times k$ matrix ${\boldsymbol {Y}}$ whose elements consist of the basis functions $g_{j}(y_{i})$ , then as long as ${\boldsymbol {Y}}^{T}{\boldsymbol {Y}}$ is invertible, the column vector ${\boldsymbol {a}}$ of the coefficients $a_{i}$ is given by ${\boldsymbol {a}}=({\boldsymbol {Y}}^{T}{\boldsymbol {Y}})^{-1}{\boldsymbol {Y}}^{T}{\boldsymbol {z}}$ , where $n\geq k$ and column vector ${\boldsymbol {z}}=(z(x_{1}),\ldots ,z(x_{n}))$ . If $n=k$ , this equation reduces to ${\boldsymbol {a}}={\boldsymbol {Y}}^{-1}{\boldsymbol {z}}$ , where the resulting metalog CDF runs through all data points exactly. For SPT metalogs, it further reduces to expressions in terms of the three $(x_{i},y_{i})$ points directly.^[14]

An alternate fitting method, implemented as a linear program, determines the coefficients by minimizing the sum of absolute distances between the CDF and the data, subject to feasibility constraints.^[19]

How metalogs converge to standard normal distribution as
k
{\displaystyle k}
increases from 2 to 10 StdNormalPDF11.gif — How metalogs converge to standard normal distribution as $k$ increases from 2 to 10

Shape flexibility

According to the metalog flexibility theorem,^[17] any probability distribution with a continuous quantile function can be approximated arbitrarily closely by a metalog. Moreover, in the original paper, Keelin showed that ten-term metalog distributions parameterized by 105 CDF points from 30 traditional source distributions (including the normal, student-t, lognormal, gamma, beta, and extreme-value distributions) approximate each such source distribution within a K-S distance of 0.001 or less.^[20] Thus, metalog shape flexibility is virtually unlimited.

The animated figure on the right illustrates this for the standard normal distribution, where metalogs with various numbers of terms are parameterized by the same set of 105 points from the standard normal CDF. The metalog PDF converges to the standard normal PDF as the number of terms increases. With two terms, the metalog approximates the normal with a logistic distribution. With each increment in number of terms, the fit gets closer. With 10 terms, the metalog PDF and standard normal PDF are visually indistinguishable.

Similarly, nine-term semi-bounded metalog PDFs with $b_{l}=0$ are visually indistinguishable from a range of Weibull distributions. The six cases shown to the right correspond to Weibull shape parameters 0.5, 0.8, 1.0, 1.5, 2, and 4. In each case, the metalog is parameterized by the nine $x$ points from the Weibull CDF that correspond to the cumulative probabilities $y=(0.001,0.02,0.10,0.25,0.5,0.75,0.9,0.98,0.999)$ .

Such convergence is not unique to the normal and Weibull distributions. Keelin originally showed analogous results for a wide range of distributions^[20] and has since provided further illustrations.^[17]^[21]

Median

The median of any distribution in the metalog family has a simple closed form. Note that $y=0.5$ defines the median, and $M_{k}(0.5)=a_{1}$ (since all subsequent terms are zero for $y=0.5$ ). It follows that the medians of the unbounded metalog, log metalog, negative-log metalog, and logit metalog distributions are $a_{1}$ , $b_{l}+e^{a_{1}}$ , $b_{u}-e^{-a_{1}}$ , and ${b_{l}+b_{u}e^{a_{1}} \over {1+e^{a_{1}}}}$ , respectively.

Moments

The $m^{th}$ moment of the unbounded metalog distribution, $E[x^{m}]=\int _{y=0}^{1}{M_{k}(y)}^{m}\,dy$ , is a special case of the more general formula for QPDs.^[9] For the unbounded metalog, such integrals evaluate to closed-form moments that are $m^{th}$ order polynomials in the coefficients $a_{i}$ . The first four central moments of the four-term unbounded metalog are:

{\begin{aligned}{\text{mean}}={}&a_{1}+{a_{3} \over 2}\\[6pt]{\text{variance}}={}&\pi ^{2}{{a_{2}}^{2} \over 3}+{{a_{3}}^{2} \over {12}}+\pi ^{2}{{a_{3}}^{2} \over {36}}+a_{2}a_{4}+{{a_{4}}^{2} \over {12}}\\[6pt]{\text{skewness}}={}&\pi ^{2}{a_{2}}^{2}{a_{3}}+\pi ^{2}{{a_{3}}^{3} \over {24}}+{{{a_{2}}{a_{3}}{a_{4}}} \over {2}}+\pi ^{2}{{{a_{2}}{a_{3}}{a_{4}}} \over {6}}+{{{a_{3}}{a_{4}}^{2}} \over {8}}\\[6pt]{\text{kurtosis}}={}&7\pi ^{4}{{a_{2}}^{4} \over {15}}+3\pi ^{2}{{{a_{2}}^{2}{a_{3}}^{2}} \over {24}}+7\pi ^{4}{{{a_{2}}^{2}{a_{3}}^{2}} \over {30}}+{{{a_{3}}^{4}} \over {80}}+\pi ^{2}{{a_{3}}^{4} \over {24}}+7\pi ^{4}{{a_{3}}^{4} \over {1200}}+2\pi ^{2}{a_{2}}^{3}{a_{4}}\\[6pt]&{}+{{{a_{2}}{a_{3}}^{2}{a_{4}}} \over {2}}+2\pi ^{2}{{{a_{2}}{a_{3}}^{2}{a_{4}}} \over {3}}+2{{a_{2}}^{2}{a_{4}}^{2}}+\pi ^{2}{{{a_{2}}^{2}{a_{4}}^{2}} \over {6}}+{{{a_{3}}^{2}{a_{4}}^{2}} \over {8}}+\pi ^{2}{{{a_{3}}^{2}{a_{4}}^{2}} \over {40}}+{{{a_{2}}{a_{4}}^{3}} \over {3}}+{{a_{4}}^{4} \over {80}}\end{aligned}}

Moments for fewer terms are subsumed in these equations. For example, moments of the three-term metalog can be obtained by setting $a_{4}$ to zero. Moments for metalogs with more terms, and higher-order moments ( $m>4$ ), are also available.^[22] Moments for semi-bounded and bounded metalogs are not available in closed form.

Parameterization with moments

Three-term unbounded metalogs can be parameterized in closed form with their first three central moments. Let $m,v,$ and $s$ be the mean, variance, and skewness, and let $s_{s}$ be the standardized skewness, $s_{s}=s/v^{3/2}$ . Equivalent expressions of the moments in terms of coefficients, and coefficients in terms of moments, are as follows:

{\begin{array}{ll}m=a_{1}+{a_{3} \over 2}&&a_{1}=m-{a_{3} \over 2}\\[6pt]v=\pi ^{2}{{a_{2}}^{2} \over 3}+{{a_{3}}^{2} \over {12}}+\pi ^{2}{{a_{3}}^{2} \over {36}}&&a_{2}={1 \over {\pi }}{\Bigl [}3{\Bigl (}v-{\Bigl (}{1 \over {12}}+{{\pi }^{2} \over {36}}{\Bigr )}{a_{3}}^{2}{\Bigr )}{\Bigr ]}^{1 \over {2}}\\[6pt]s=\pi ^{2}{a_{2}}^{2}{a_{3}}+\pi ^{2}{{a_{3}}^{3} \over {24}}&&a_{3}=4{\Bigl (}{6v \over {6+\pi ^{2}}}{\Bigr )}^{1 \over {2}}\cos {\Bigl [}{1 \over {3}}{\Bigl (}\cos ^{-1}{\Bigl (}-{s_{s} \over {4}}{\Bigl (}1+{{\pi }^{2} \over {6}}{\Bigr )}^{1 \over {2}}{\Bigr )}+4\pi {\Bigr )}{\Bigr ]}\\[6pt]\end{array}}

The equivalence of these two sets of expressions can be derived by noting that the moments equations on the left determine a cubic polynomial in terms of the coefficients $a_{1},a_{2},$ and $a_{3}$ , which can be solved in closed form as functions of $m,v,$ and $s$ . Moreover, this solution is unique.^[23] In terms of moments, the feasibility condition is $|s_{s}|\leq 2.07093$ , which can be shown to be equivalent to the following feasibility condition in terms of the coefficients: $a_{2}>0$ ; and ${|a_{3}|/a_{2}}<1.66711$ .^[23]

This property can be used, for example, to represent the sum of independent, non-identically distributed random variables. Based on cumulants, it is known that for any set of independent random variables, the mean, variance, and skewness of the sum are the sums of the respective means, variances, and skewnesses. Parameterizing a three-term metalog with these central moments yields a continuous distribution that exactly preserves these three moments, and accordingly provides a reasonable approximation to the shape of the distribution of the sum of independent random variables.

Simulation

Since their quantile functions are expressed in closed form, metalogs facilitate Monte Carlo simulation. Substituting uniformly distributed random samples of $y$ into the Metalog quantile function (inverse CDF) produces random samples of $x$ in closed form, thereby eliminating the need to invert a CDF. See below for simulation applications.

Eliciting and Combining Expert Opinion

Due to their shape flexibility, metalog distributions can be an attractive choice for eliciting and representing expert opinion.^[24] Moreover, if the opinions of multiple experts are expressed as $k$ -term metalogs, the consensus opinion may be calculated as a $k$ -term metalog in closed form, where the ${\boldsymbol {a}}$ -coefficients of the consensus metalog are simply a weighted average of those of the individual experts.^[17] This result follows from Vincentization, where the consensus quantile function is a weighted average of individual quantile functions.

Bayesian Updating in Closed Form

In a classic paper, Howard (1970)^[25] shows how the beta-binomial distribution can be used to update, according to Bayes rule in closed form, uncertainty over the long-run frequency $\phi$ of a coin toss coming up "heads" in light of new coin-toss data. In contrast, if the uncertainty of interest to be updated is defined not by a scalar probability over a discrete event (like the result of a coin toss) but by a probability density function over a continuous variable, metalog Bayesian updating may be used. Under certain conditions, metalog quantile parameters and ${\boldsymbol {a}}$ -coefficients may be updated in closed form in light of new data according to Bayes rule.^[17]

Applications

For 3,474 steelhead trout caught and released on the Babine River in British Columbia during 2006-2010, empirical weight data (histogram) and 10-term log metalog PDF (blue curve) fit to this data by least squares. FishWeightMetalog.png — For 3,474 steelhead trout caught and released on the Babine River in British Columbia during 2006-2010, empirical weight data (histogram) and 10-term log metalog PDF (blue curve) fit to this data by least squares.

Due to their shape and bounds flexibility, metalogs can be used to represent empirical or other data in virtually any field of human endeavor.

Astronomy. Metalogs were applied to assess the risks of asteroid impact.^[26]
Cybersecurity. Metalogs were used in cyber security risk assessment.^[19]^[27]
Eliciting and combining expert opinion. Statistics Canada elicited expert opinions on future Canadian fertility rates from 18 experts, which included the use of spreadsheet-based real-time PDF feedback based on five-term metalogs. The individual expert opinions were then weighted and combined into an overall metalog-based forecast.^[24]
Empirical data exploration and visualization. In fish biology, a 10-term log metalog distribution (bounded below at 0) was fit to the weights of 3,474 steelhead trout caught and released on the Babine River in British Columbia during 2006–2010. The bimodality of the resulting distribution has been attributed to the presence of both first-time and second-time spawners in the river, the latter of which tend to weigh more.^[28]
Hydrology. A 10-term semi-bounded metalog was used to model the probability distribution of annual river gauge heights.^[29]
Oil field production. Semi-bounded SPT metalogs were used to analyze biases in projections of oil-field production when compared to observed production after the fact.^[30]
Portfolio management. SPT metalogs have been used to model commercial value of new products and product portfolios.^[31]
Simulation input distributions. To support a bidding decision, uncertainty about the future value of each of 259 financial assets was represented as an SPT metalog. A simulation of total portfolio value was shown to yield more realistic results than a corresponding simulation based on discrete low, median, and high values for each asset.^[32]
Simulation output distributions. Metalogs have also been used to fit output data from simulations in order to represent those outputs as closed-form continuous distributions (both CDFs and PDFs). Used in this way, they are typically more stable and smoother than histograms.^[32]
Sums of lognormals. Metalogs enable a closed-form representation of known distributions whose CDFs have no closed-form expression. Keelin et al. (2019)^[18] apply this to the sum of independent identically distributed lognormal distributions, where quantiles of the sum can be determined by a large number of simulations. Nine such quantiles are used to parameterize a semi-bounded metalog distribution that runs through each of these nine quantiles exactly. Quantile parameters are stored in a table, which can then be interpolated to yield in-between values; these values are guaranteed to be feasible by the convexity property above.

Metalog panel for steelhead weight data MetalogPanel.png — Metalog panel for steelhead weight data

Choosing number of terms

For a given application and data set, choosing the number of metalog terms $k$ depends on context and may require judgment. For expert elicitation, three to five terms is usually sufficient. For data exploration and matching other probability distributions such as the sum of lognormals, eight to 12 terms is usually sufficient. A metalog panel, which displays the metalog PDFs corresponding to differing numbers of terms $k$ for a given data set, may aid this judgment. For example, in the steelhead weight metalog panel,^[1] using less than seven terms arguably underfits the data by obscuring the data's inherent bimodality. Using more than 11 terms is unnecessary and could, in principle, overfit the data. The case with 16 terms is infeasible for this data set, as indicated by the blank cell in the metalog panel. Other tools (such as regularization, Akaike information criterion, and Bayesian information criterion) may also be useful. For example, when applied to the steelhead weight data, the AIC ranking of metalog distributions from 2-16 terms along with a wide range of classical distributions identifies the 11-term log metalog as the best fit to this data. A similar BIC ranking identifies the 10-term log metalog as the best fit. Keelin (2016)^[1] offers further perspectives on the distribution selection within the metalog family.^[33]

Related distributions

The metalog distributions belong to the group of distributions defined in terms of the quantile function, which include the quantile-parameterized distributions, the Tukey lambda distribution, its generalization, GLD,^[34] the Govindarajulu distribution^[35] and others.^[13] The following distributions are subsumed within the metalog family:

The logistic distribution is a special case of the unbounded metalog where $a_{i}=0$ for all $i>2$ .
The uniform distribution is a special case of: 1) the unbounded metalog where $k\geq 4$ , $a_{1}=0.5$ , $a_{4}=1$ and $a_{i}=0$ otherwise; and 2) the bounded metalog where $k\geq 2$ , $b_{l}=0$ , $b_{u}=1$ , $a_{2}=1$ , and $a_{i}=0$ otherwise.
The log-logistic distribution, also known as the Fisk distribution in economics, is a special case of the log metalog where $b_{l}=0$ , and $a_{i}=0$ for all $i>2$ .
The log-uniform distribution is a special case of the log metalog where $k\geq 4$ , $a_{1}=0.5$ , $a_{4}=1$ , and $a_{i}=0$ otherwise.
The logit-logistic distribution^[36] is a special case of the logit metalog where $a_{i}=0$ for all $i>2$ .

Software

Freely available software tools can be used to work with metalog distributions:

Excel workbooks. By pasting or typing in CDF data, metalogs (with choice of bounds) are instantly displayed.
- SPT metalogs workbook^[37] calculates 2–3 term metalogs determined by three $(x_{i},y_{i})$ CDF data.
- Metalogs workbook^[38] calculates 2–16 term metalogs (including metalog panel) determined by 2-10,000 $(x_{i},y_{i})$ CDF data.
- ELD (equally likely data) Metalog workbooks^[39] calculate 2–16 term metalogs determined by 2–10,000 $(x_{i})$ CDF data, where $y_{i}$ 's and metalog panel are automatically calculated.
R. rmetalog ^[40] (on Complehensive R Archive Network, CRAN).
Python. Pymetalog^[41] closely mirrors the R package. Metalogistic^[42] takes advantage of the SciPy platform.
MakeDistribution.com ^[43] facilitates experimentation with metalogs parameterized by several CDF data points. The SPT metalog calculator,^[44] metalog calculator^[45] and ELD metalog calculator^[46] are online versions of the Excel Workbooks.
SIPmath Modeler Tools^[47] support metalog distributions in an Excel add-In for simulation.
Lumina's Analytica Free 101 software^[48] for modeling and aiding difficult decisions.
BayesFusion's Metalog Builder ^[49] allows for interactive building of metalog distributions. BayesFusion's GeNIe ^[50] (academic version of the software is free for academic research and teaching) implements the metalog distributions.

Commercially available packages also support the use of metalog distributions:

FrontLine Solvers: Analytic Solver, RASON, and Solver SDK,^[51] software for optimization. Automatically fits user data to the full range of (bounded and unbounded, multi-term) metalog distributions and provides option to compare metalog distributions with classical distributions based on user-selected goodness of fit criteria.
Lone Star Analysis: TruNavigator and AnalyticsOS software ^[52] for predictive and prescriptive analytics.

Related Research Articles

<span class="mw-page-title-main">Multivariate normal distribution</span> Generalization of the one-dimensional normal distribution to higher dimensions

In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One definition is that a random vector is said to be k-variate normally distributed if every linear combination of its k components has a univariate normal distribution. Its importance derives mainly from the multivariate central limit theorem. The multivariate normal distribution is often used to describe, at least approximately, any set of (possibly) correlated real-valued random variables each of which clusters around a mean value.

<span class="mw-page-title-main">Log-normal distribution</span> Probability distribution

In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Thus, if the random variable $X$ is log-normally distributed, then $Y = ln(X)$ has a normal distribution. Equivalently, if $Y$ has a normal distribution, then the exponential function of $Y$ , $X = exp(Y)$ , has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values. It is a convenient and useful model for measurements in exact and engineering sciences, as well as medicine, economics and other topics (e.g., energies, concentrations, lengths, prices of financial instruments, and other metrics).

In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statistical model, the observed data is most probable. The point in the parameter space that maximizes the likelihood function is called the maximum likelihood estimate. The logic of maximum likelihood is both intuitive and flexible, and as such the method has become a dominant means of statistical inference.

<span class="mw-page-title-main">Gumbel distribution</span> Particular case of the generalized extreme value distribution

In probability theory and statistics, the Gumbel distribution is used to model the distribution of the maximum of a number of samples of various distributions.

In mathematics, the Gudermannian function relates a hyperbolic angle measure $to a circular angle measure called the gudermannian of and denoted . The Gudermannian function reveals a close relationship between the circular functions and hyperbolic functions. It was introduced in the 1760s by Johann Heinrich Lambert, and later named for Christoph Gudermann who also described the relationship between circular and hyperbolic functions in 1830. The gudermannian is sometimes called the hyperbolic amplitude as a limiting case of the Jacobi elliptic amplitude when parameter$

In probability and statistics, an exponential family is a parametric set of probability distributions of a certain form, specified below. This special form is chosen for mathematical convenience, including the enabling of the user to calculate expectations, covariances using differentiation based on some useful algebraic properties, as well as for generality, as exponential families are in a sense very natural sets of distributions to consider. The term exponential class is sometimes used in place of "exponential family", or the older term Koopman–Darmois family. The terms "distribution" and "family" are often used loosely: specifically, an exponential family is a set of distributions, where the specific distribution varies with the parameter; however, a parametric family of distributions is often referred to as "a distribution", and the set of all exponential families is sometimes loosely referred to as "the" exponential family. They are distinct because they possess a variety of desirable properties, most importantly the existence of a sufficient statistic.

In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a link function and by allowing the magnitude of the variance of each measurement to be a function of its predicted value.

<span class="mw-page-title-main">Dirichlet distribution</span> Probability distribution

In probability and statistics, the Dirichlet distribution (after Peter Gustav Lejeune Dirichlet), often denoted $, is a family of continuous multivariate probability distributions parameterized by a vector of positive reals. It is a multivariate generalization of the beta distribution, hence its alternative name of multivariate beta distribution (MBD) . Dirichlet distributions are commonly used as prior distributions in Bayesian statistics, and in fact, the Dirichlet distribution is the conjugate prior of the categorical distribution and multinomial distribution.$

In mathematics, specifically the theory of elliptic functions, the nome is a special function that belongs to the non-elementary functions. This function is of great importance in the description of the elliptic functions, especially in the description of the modular identity of the Jacobi theta function, the Hermite elliptic transcendents and the Weber modular functions, that are used for solving equations of higher degrees.

In probability theory and statistics, the generalized extreme value (GEV) distribution is a family of continuous probability distributions developed within extreme value theory to combine the Gumbel, Fréchet and Weibull families also known as type I, II and III extreme value distributions. By the extreme value theorem the GEV distribution is the only possible limit distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. Note that a limit distribution needs to exist, which requires regularity conditions on the tail of the distribution. Despite this, the GEV distribution is often used as an approximation to model the maxima of long (finite) sequences of random variables.

<span class="mw-page-title-main">Empirical distribution function</span> Distribution function associated with the empirical measure of a sample

In statistics, an empirical distribution function is the distribution function associated with the empirical measure of a sample. This cumulative distribution function is a step function that jumps up by $1/ n$ at each of the $n$ data points. Its value at any specified value of the measured variable is the fraction of observations of the measured variable that are less than or equal to the specified value.

In mathematics, the lemniscate elliptic functions are elliptic functions related to the arc length of the lemniscate of Bernoulli. They were first studied by Giulio Fagnano in 1718 and later by Leonhard Euler and Carl Friedrich Gauss, among others.

In statistics, binomial regression is a regression analysis technique in which the response has a binomial distribution: it is the number of successes in a series of $independent Bernoulli trials, where each trial has probability of success . In binomial regression, the probability of a success is related to explanatory variables: the corresponding concept in ordinary regression is to relate the mean value of the unobserved response to explanatory variables.$

In directional statistics, the von Mises–Fisher distribution, is a probability distribution on the $-sphere in . If the distribution reduces to the von Mises distribution on the circle.$

In the theory of probability and statistics, the Dvoretzky–Kiefer–Wolfowitz–Massart inequality bounds how close an empirically determined distribution function will be to the distribution function from which the empirical samples are drawn. It is named after Aryeh Dvoretzky, Jack Kiefer, and Jacob Wolfowitz, who in 1956 proved the inequality

Expected shortfall (ES) is a risk measure—a concept used in the field of financial risk measurement to evaluate the market risk or credit risk of a portfolio. The "expected shortfall at q% level" is the expected return on the portfolio in the worst $of cases. ES is an alternative to value at risk that is more sensitive to the shape of the tail of the loss distribution.$

In probability and statistics, the quantile function outputs the value of a random variable such that its probability is less than or equal to an input probability value. Intuitively, the quantile function associates with a range at and below a probability input the likelihood that a random variable is realized in that range for some probability distribution. It is also called the percentile function, percent-point function or inverse cumulative distribution function.

In fluid dynamics and electrostatics, slender-body theory is a methodology that can be used to take advantage of the slenderness of a body to obtain an approximation to a field surrounding it and/or the net effect of the field on the body. Principal applications are to Stokes flow — at very low Reynolds numbers — and in electrostatics.

In financial mathematics, tail value at risk (TVaR), also known as tail conditional expectation (TCE) or conditional tail expectation (CTE), is a risk measure associated with the more general value at risk. It quantifies the expected value of the loss given that an event outside a given probability level has occurred.

A quantile-parameterized distribution (QPD) is a probability distributions that is directly parameterized by data. They were created to meet the need for easy-to-use continuous probability distributions flexible enough to represent a wide range of uncertainties, such as those commonly encountered in business, economics, engineering, and science. Because QPDs are directly parameterized by data, they have the practical advantage of avoiding the intermediate step of parameter estimation, a time-consuming process that typically requires non-linear iterative methods to estimate probability-distribution parameters from data. Some QPDs have virtually unlimited shape flexibility and closed-form moments as well.

References

1 2 3 4 5 Keelin, Thomas W. (2016). "The Metalog Distributions" (PDF). Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338. ISSN 1545-8490. Archived from the original on 2016-11-28.
↑ "About the Author". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ De Moivre, A. (1756). The doctrine of chances: or, A method of calculating the probabilities of events in play (Vol. 1). Chelsea Publishing Company.
↑ Bayes, T. (1763). LII. An essay towards solving a problem in the doctrine of chances. By the late Rev. Mr. Bayes, FRS communicated by Mr. Price, in a letter to John Canton, AMFR S. Philosophical transactions of the Royal Society of London, (53), pp. 370–418.
↑ Johnson NL, Kotz S, Balakrishnan N. Continuous univariate distributions, Vol 1, Second Edition, John Wiley & Sons, Ltd, 1994, pp. 15–25.
↑ Ord, J.K., 1972. Families of frequency distributions. Charles Griffin & Co, Ltd, London. Table 1.1, p 6.
↑ Johnson, N. L. (1949). "Systems of Frequency Curves Generated by Methods of Translation". Biometrika. 36 (1/2): 149–176. doi:10.2307/2332539. JSTOR 2332539. PMID 18132090.
↑ Tadikamalla, Pandu R.; Johnson, Norman L. (1982). "Systems of Frequency Curves Generated by Transformations of Logistic Variables". Biometrika. 69 (2): 461–465. doi:10.1093/biomet/69.2.461. JSTOR 2335422.
1 2 3 4 Keelin, Thomas W.; Powley, Bradford W. (2011-08-04). "Quantile-Parameterized Distributions" (PDF). Decision Analysis. 8 (3): 206–219. doi:10.1287/deca.1110.0213. ISSN 1545-8490. Archived from the original on 2011-09-01.
1 2 3 Keelin TW (2016). "The Metalog Distributions." Decision Analysis. 13 (4): 243–277.
↑ Parzen, E., 1979, Nonparametric statistical data modeling, Journal of the American Statistical Association,7, 105–131
↑ p-PDF: A probability density function expressed as a function of cumulative probability $y$ rather than variable of interest $x$ ; equivalently, a "density quantile function" as defined by Parzen, E., 1979, Nonparametric statistical data modeling, Journal of the American Statistical Association,7, 105–131
1 2 Gilchrist, Warren (2000-05-15). Statistical Modelling with Quantile Functions. Chapman and Hall/CRC. doi:10.1201/9781420035919. ISBN 978-0-429-11920-0.
1 2 3 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.
↑ "The Metalog Distributions Feasibility". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ Powley, B.W. (2013). “Quantile Function Methods For Decision Analysis”. Corollary 12, p 30. PhD Dissertation, Stanford University
1 2 3 4 5 Keelin, Thomas W., and Ronald A. Howard. (2021). “The Metalog Distributions: Virtually Unlimited Shape Flexibility, Combining Expert Opinion in Closed Form, and Bayesian Updating in Closed Form.” OSF Preprints. doi:10.31219/osf.io/xdg5e.
1 2 Keelin, T.W., Chrisman, L. and Savage, S.L. (2019). “The metalog distributions and extremely accurate sums of lognormals in closed form.” WSC '19: Proceedings of the Winter Simulation Conference. 3074–3085.
1 2 Faber, I.J. (2019). Cyber Risk Management: AI-generated Warnings of Threats (Doctoral dissertation, Stanford University).
1 2 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.
↑ "Metalog Distributions Shape Flexibility". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ "The Metalog Distribution Moments". www.metalogdistributions.com. Retrieved 2021-02-13.
1 2 "Equivalence of coefficients' and moments' parameterizations of three-term metalog". www.metalogdistributions.com. Retrieved 2021-03-28.
1 2 Dion, P., Galbraith, N., Sirag, E. (2020). “Using expert elicitation to build long-term projection assumptions.” In Developments in Demographic Forecasting, Chapter 3, pp. 43–62. Springer
↑ Howard, Ronald A. (1970). "Decision analysis: Perspectives on inference, decision, and experimentation". Proceedings of the IEEE. 58 (5): 632–643. doi:10.1109/PROC.1970.7719.
↑ Reinhardt, Jason C.; Chen, Xi; Liu, Wenhao; Manchev, Petar; Paté-Cornell, M. Elisabeth (2016). "Asteroid Risk Assessment: A Probabilistic Approach". Risk Analysis. 36 (2): 244–261. Bibcode:2016RiskA..36..244R. doi:10.1111/risa.12453. PMID 26215051. S2CID 23308354.
↑ Wang, Jiali; Neil, Martin; Fenton, Norman (2020). "A Bayesian network approach for cybersecurity risk assessment implementing and extending the FAIR model". Computers & Security. 89: 101659. doi:10.1016/j.cose.2019.101659. S2CID 209099797.
↑ Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.
↑ Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.
↑ Bratvold, Reidar B.; Mohus, Erlend; Petutschnig, David; Bickel, Eric (2020). "Production Forecasting: Optimistic and Overconfident—Over and over Again". Spe Reservoir Evaluation & Engineering. 23 (3): 0799–0810. doi:10.2118/195914-PA. S2CID 219661316.
↑ "SmartOrg Portfolio Manager". www.smartorg.com. Retrieved 2021-02-13.
1 2 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.
↑ Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4). Section 6.3, pp. 274–275. doi:10.1287/deca.2016.0338.
↑ Ramberg, John S.; Schmeiser, Bruce W. (1974-02-01). "An approximate method for generating asymmetric random variables". Communications of the ACM. 17 (2): 78–82. doi: 10.1145/360827.360840 . ISSN 0001-0782. S2CID 2640548.
↑ Nair, N. Unnikrishnan; Sankaran, P. G.; Vineshkumar, B. (2012-12-15). "The Govindarajulu Distribution: Some Properties and Applications". Communications in Statistics - Theory and Methods. 41 (24): 4391–4406. doi:10.1080/03610926.2011.573168. ISSN 0361-0926. S2CID 121096603.
↑ Wang, Mingliang; Rennolls, Keith (2005). "Tree diameter distribution modelling: Introducing the logit–logistic distribution". Canadian Journal of Forest Research. 35 (6): 1305–1313. doi:10.1139/x05-057.
↑ "SPT metalogs Excel workbook". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ "Metalogs workbook". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ "ELD metalogs workbook". www.metalogdistributions.com. Retrieved 2021-02-13.
↑ rmetalog R package
↑ Pymetalog Python package
↑ Metalogistic Python package
↑ MakeDistribution.com website supporting metalog experimentation.
↑ Online SPT metalog calculator.
↑ Online metalog calculator.
↑ Online ELD metalog calculator.
↑ SIPmath Modeler Tools Excel add-in
↑ Analytica Free 101 software aids modeling of difficult decisions.
↑ BayesFusion's Metalog Builder for interactive building of metalog distributions
↑ BayesFusion's GeNIe
↑ FrontLine Solvers: Analytic Solver, RASON, and Solver SDK software for optimization.
↑ Lone Star Analysis: TruNavigator and AnalyticsOS for predictive and prescriptive analytics.

External links

The Metalog Distributions website, www.metalogs.org
The Metalog Distributions YouTube channel, educational videos

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:0-1] 1 2 3 4 5 Keelin, Thomas W. (2016). "The Metalog Distributions" (PDF). Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338. ISSN 1545-8490. Archived from the original on 2016-11-28.

[2] "About the Author". www.metalogdistributions.com. Retrieved 2021-02-13.

[3] De Moivre, A. (1756). The doctrine of chances: or, A method of calculating the probabilities of events in play (Vol. 1). Chelsea Publishing Company.

[4] Bayes, T. (1763). LII. An essay towards solving a problem in the doctrine of chances. By the late Rev. Mr. Bayes, FRS communicated by Mr. Price, in a letter to John Canton, AMFR S. Philosophical transactions of the Royal Society of London, (53), pp. 370–418.

[5] Johnson NL, Kotz S, Balakrishnan N. Continuous univariate distributions, Vol 1, Second Edition, John Wiley & Sons, Ltd, 1994, pp. 15–25.

[6] Ord, J.K., 1972. Families of frequency distributions. Charles Griffin & Co, Ltd, London. Table 1.1, p 6.

[7] Johnson, N. L. (1949). "Systems of Frequency Curves Generated by Methods of Translation". Biometrika. 36 (1/2): 149–176. doi:10.2307/2332539. JSTOR 2332539. PMID 18132090.

[8] Tadikamalla, Pandu R.; Johnson, Norman L. (1982). "Systems of Frequency Curves Generated by Transformations of Logistic Variables". Biometrika. 69 (2): 461–465. doi:10.1093/biomet/69.2.461. JSTOR 2335422.

[KeelinPowley-9] 1 2 3 4 Keelin, Thomas W.; Powley, Bradford W. (2011-08-04). "Quantile-Parameterized Distributions" (PDF). Decision Analysis. 8 (3): 206–219. doi:10.1287/deca.1110.0213. ISSN 1545-8490. Archived from the original on 2011-09-01.

[Kee-10] 1 2 3 Keelin TW (2016). "The Metalog Distributions." Decision Analysis. 13 (4): 243–277.

[11] Parzen, E., 1979, Nonparametric statistical data modeling, Journal of the American Statistical Association,7, 105–131

[:1-12] -PDF: A probability density function expressed as a function of cumulative probability $y$ rather than variable of interest $x$ ; equivalently, a "density quantile function" as defined by Parzen, E., 1979, Nonparametric statistical data modeling, Journal of the American Statistical Association,7, 105–131

[:2-13] 1 2 Gilchrist, Warren (2000-05-15). Statistical Modelling with Quantile Functions. Chapman and Hall/CRC. doi:10.1201/9781420035919. ISBN 978-0-429-11920-0.

[SPT-14] 1 2 3 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.

[15] "The Metalog Distributions Feasibility". www.metalogdistributions.com. Retrieved 2021-02-13.

[16] Powley, B.W. (2013). “Quantile Function Methods For Decision Analysis”. Corollary 12, p 30. PhD Dissertation, Stanford University

[KeelinHoward-17] 1 2 3 4 5 Keelin, Thomas W., and Ronald A. Howard. (2021). “The Metalog Distributions: Virtually Unlimited Shape Flexibility, Combining Expert Opinion in Closed Form, and Bayesian Updating in Closed Form.” OSF Preprints. doi:10.31219/osf.io/xdg5e.

[sumLog-18] 1 2 Keelin, T.W., Chrisman, L. and Savage, S.L. (2019). “The metalog distributions and extremely accurate sums of lognormals in closed form.” WSC '19: Proceedings of the Winter Simulation Conference. 3074–3085.

[Faber-19] 1 2 Faber, I.J. (2019). Cyber Risk Management: AI-generated Warnings of Threats (Doctoral dissertation, Stanford University).

[conv-20] 1 2 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.

[21] "Metalog Distributions Shape Flexibility". www.metalogdistributions.com. Retrieved 2021-02-13.

[22] "The Metalog Distribution Moments". www.metalogdistributions.com. Retrieved 2021-02-13.

[equiv-23] 1 2 "Equivalence of coefficients' and moments' parameterizations of three-term metalog". www.metalogdistributions.com. Retrieved 2021-03-28.

[statCan-24] 1 2 Dion, P., Galbraith, N., Sirag, E. (2020). “Using expert elicitation to build long-term projection assumptions.” In Developments in Demographic Forecasting, Chapter 3, pp. 43–62. Springer

[25] Howard, Ronald A. (1970). "Decision analysis: Perspectives on inference, decision, and experimentation". Proceedings of the IEEE. 58 (5): 632–643. doi:10.1109/PROC.1970.7719.

[26] Reinhardt, Jason C.; Chen, Xi; Liu, Wenhao; Manchev, Petar; Paté-Cornell, M. Elisabeth (2016). "Asteroid Risk Assessment: A Probabilistic Approach". Risk Analysis. 36 (2): 244–261. Bibcode:2016RiskA..36..244R. doi:10.1111/risa.12453. PMID 26215051. S2CID 23308354.

[27] Wang, Jiali; Neil, Martin; Fenton, Norman (2020). "A Bayesian network approach for cybersecurity risk assessment implementing and extending the FAIR model". Computers & Security. 89: 101659. doi:10.1016/j.cose.2019.101659. S2CID 209099797.

[28] Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.

[29] Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.

[30] Bratvold, Reidar B.; Mohus, Erlend; Petutschnig, David; Bickel, Eric (2020). "Production Forecasting: Optimistic and Overconfident—Over and over Again". Spe Reservoir Evaluation & Engineering. 23 (3): 0799–0810. doi:10.2118/195914-PA. S2CID 219661316.

[31] "SmartOrg Portfolio Manager". www.smartorg.com. Retrieved 2021-02-13.

[sim-32] 1 2 Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4): 243–277. doi:10.1287/deca.2016.0338.

[33] Keelin, Thomas W. (2016). "The Metalog Distributions". Decision Analysis. 13 (4). Section 6.3, pp. 274–275. doi:10.1287/deca.2016.0338.

[34] Ramberg, John S.; Schmeiser, Bruce W. (1974-02-01). "An approximate method for generating asymmetric random variables". Communications of the ACM. 17 (2): 78–82. doi: 10.1145/360827.360840 . ISSN 0001-0782. S2CID 2640548.

[35] Nair, N. Unnikrishnan; Sankaran, P. G.; Vineshkumar, B. (2012-12-15). "The Govindarajulu Distribution: Some Properties and Applications". Communications in Statistics - Theory and Methods. 41 (24): 4391–4406. doi:10.1080/03610926.2011.573168. ISSN 0361-0926. S2CID 121096603.

[36] Wang, Mingliang; Rennolls, Keith (2005). "Tree diameter distribution modelling: Introducing the logit–logistic distribution". Canadian Journal of Forest Research. 35 (6): 1305–1313. doi:10.1139/x05-057.

[37] "SPT metalogs Excel workbook". www.metalogdistributions.com. Retrieved 2021-02-13.

[38] "Metalogs workbook". www.metalogdistributions.com. Retrieved 2021-02-13.

[39] "ELD metalogs workbook". www.metalogdistributions.com. Retrieved 2021-02-13.

[40] rmetalog R package

[41] Pymetalog Python package

[42] Metalogistic Python package

[43] MakeDistribution.com website supporting metalog experimentation.

[44] Online SPT metalog calculator.

[45] Online metalog calculator.

[46] Online ELD metalog calculator.

[47] SIPmath Modeler Tools Excel add-in

[48] Analytica Free 101 software aids modeling of difficult decisions.

[49] BayesFusion's Metalog Builder for interactive building of metalog distributions

[50] BayesFusion's GeNIe

[51] FrontLine Solvers: Analytic Solver, RASON, and Solver SDK software for optimization.

[52] Lone Star Analysis: TruNavigator and AnalyticsOS for predictive and prescriptive analytics.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]