A diversity index is a method of measuring how many different types (e.g. species) there are in a dataset (e.g. a community). Some more sophisticated indices also account for the phylogenetic relatedness among the types. [1] Diversity indices are statistical representations of different aspects of biodiversity (e.g. richness, evenness, and dominance), which are useful simplifications for comparing different communities or sites.
When diversity indices are used in ecology, the types of interest are usually species, but they can also be other categories, such as genera, families, functional types, or haplotypes. The entities of interest are usually individual organisms (e.g. plants or animals), and the measure of abundance can be, for example, number of individuals, biomass or coverage. In demography, the entities of interest can be people, and the types of interest various demographic groups. In information science, the entities can be characters and the types can be the different letters of the alphabet. The most commonly used diversity indices are simple transformations of the effective number of types (also known as 'true diversity'), but each diversity index can also be interpreted in its own right as a measure corresponding to some real phenomenon (but a different one for each diversity index). [2] [3] [4] [5]
Many indices account only for categorical diversity between subjects or entities. Such indices, however, do not capture the total variation (diversity) between subjects or entities, which is obtained only when both categorical and qualitative diversity are calculated.
True diversity, or the effective number of types, refers to the number of equally abundant types needed for the average proportional abundance of the types to equal that observed in the dataset of interest (where all types may not be equally abundant). The true diversity in a dataset is calculated by first taking the weighted generalized mean Mq−1 of the proportional abundances of the types in the dataset, and then taking the reciprocal of this. The equation is: [4] [5]
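{}^{q}\!D = \frac{1}{M_{q-1}} = \frac{1}{\sqrt[q-1]{\sum_{i=1}^{R} p_i\, p_i^{\,q-1}}} = \left( \sum_{i=1}^{R} p_i^{\,q} \right)^{1/(1-q)}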
The denominator Mq−1 equals the average proportional abundance of the types in the dataset as calculated with the weighted generalized mean with exponent q − 1. In the equation, R is richness (the total number of types in the dataset), and the proportional abundance of the ith type is pi. The proportional abundances themselves are used as the nominal weights. The numbers are called Hill numbers of order q or effective number of species. [6]
When q = 1, the above equation is undefined. However, the mathematical limit as q approaches 1 is well defined and the corresponding diversity is calculated with the following equation:
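{}^{1}\!D = \frac{1}{\exp\!\left( \sum_{i=1}^{R} p_i \ln p_i \right)} = \exp\!\left( -\sum_{i=1}^{R} p_i \ln p_i \right)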
which is the exponential of the Shannon entropy calculated with natural logarithms (see below). In other domains, this statistic is also known as the perplexity.
The general equation of diversity is often written in the form [2] [3]
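{}^{q}\!D = \left( \sum_{i=1}^{R} p_i^{\,q} \right)^{1/(1-q)}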
and the term inside the parentheses is called the basic sum. Some popular diversity indices correspond to the basic sum as calculated with different values of q. [3]
The value of q is often referred to as the order of the diversity. It defines the sensitivity of the true diversity to rare vs. abundant species by modifying how the weighted mean of the species' proportional abundances is calculated. With some values of the parameter q, the value of the generalized mean Mq−1 assumes familiar kinds of weighted means as special cases. In particular,
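q = 0 corresponds to the weighted harmonic mean of the pi values (exponent q − 1 = −1), q = 1 to their weighted geometric mean, and q = 2 to their weighted arithmetic mean; as q approaches infinity, Mq−1 approaches the maximum pi value.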
Generally, increasing the value of q increases the effective weight given to the most abundant species. This leads to a larger Mq−1 value and a smaller true diversity (qD) value with increasing q.
When q = 1, the weighted geometric mean of the pi values is used, and each species is weighted exactly by its proportional abundance (in the weighted geometric mean, the weights are the exponents). When q > 1, the weight given to abundant species is exaggerated, and when q < 1, the weight given to rare species is exaggerated. At q = 0, the species weights exactly cancel out the species proportional abundances, such that the weighted mean of the pi values equals 1 / R even when all species are not equally abundant. At q = 0, the effective number of species, 0D, hence equals the actual number of species R. In the context of diversity, q is generally limited to non-negative values. This is because negative values of q would give rare species so much more weight than abundant ones that qD would exceed R. [4] [5]
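As an illustration, the following minimal Python sketch (not taken from the cited sources; the function name hill_number and the abundance counts are hypothetical) computes qD for a small community at several orders q:

import math

def hill_number(abundances, q):
    # Effective number of types (qD) for a list of abundance counts.
    total = sum(abundances)
    p = [n / total for n in abundances if n > 0]  # proportional abundances pi
    if q == 1:
        # limit case q = 1: exponential of the Shannon entropy
        return math.exp(-sum(pi * math.log(pi) for pi in p))
    return sum(pi ** q for pi in p) ** (1 / (1 - q))

community = [50, 30, 15, 4, 1]  # hypothetical abundance counts
for q in (0, 1, 2):
    print(q, round(hill_number(community, q), 3))
# prints 5.0 at q = 0 (the richness), then successively smaller values at q = 1 and q = 2

The successive decrease illustrates how larger q gives more weight to the abundant types and hence yields a smaller effective number of types.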
Richness R simply quantifies how many different types the dataset of interest contains. For example, species richness (usually noted S) is simply the number of species, e.g. at a particular site. Richness is a simple measure, so it has been a popular diversity index in ecology, where abundance data are often not available. [7] If true diversity is calculated with q = 0, the effective number of types (0D) equals the actual number of types, which is identical to richness (R). [3] [5]
The Shannon index has been a popular diversity index in the ecological literature, where it is also known as Shannon's diversity index, Shannon–Wiener index, and (erroneously) Shannon–Weaver index. [8] The measure was originally proposed by Claude Shannon in 1948 to quantify the entropy (hence Shannon entropy, related to Shannon information content) in strings of text. [9] The idea is that the more letters there are, and the closer their proportional abundances in the string of interest, the more difficult it is to correctly predict which letter will be the next one in the string. The Shannon entropy quantifies the uncertainty (entropy or degree of surprise) associated with this prediction. It is most often calculated as follows:
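H' = -\sum_{i=1}^{R} p_i \ln p_i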
where pi is the proportion of characters belonging to the ith type of letter in the string of interest. In ecology, pi is often the proportion of individuals belonging to the ith species in the dataset of interest. Then the Shannon entropy quantifies the uncertainty in predicting the species identity of an individual that is taken at random from the dataset.
Although the equation is here written with natural logarithms, the base of the logarithm used when calculating the Shannon entropy can be chosen freely. Shannon himself discussed logarithm bases 2, 10 and e, and these have since become the most popular bases in applications that use the Shannon entropy. Each log base corresponds to a different measurement unit, which has been called binary digits (bits), decimal digits (decits), and natural digits (nats) for the bases 2, 10 and e, respectively. Comparing Shannon entropy values that were originally calculated with different log bases requires converting them to the same log base: change from the base a to base b is obtained with multiplication by log_b(a). [9]
The Shannon index (H') is related to the weighted geometric mean of the proportional abundances of the types. Specifically, it equals the logarithm of true diversity as calculated with q = 1: [4]
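H' = -\sum_{i=1}^{R} p_i \ln p_i = \ln\!\left( {}^{1}\!D \right)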
This can also be written
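H' = -\left( p_1 \ln p_1 + p_2 \ln p_2 + \cdots + p_R \ln p_R \right)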
which equals
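H' = \ln\!\left( \frac{1}{p_1^{\,p_1}\, p_2^{\,p_2} \cdots p_R^{\,p_R}} \right)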
Since the sum of the pi values equals 1 by definition, the denominator equals the weighted geometric mean of the pi values, with the pi values themselves being used as the weights (exponents in the equation). The term within the parentheses hence equals true diversity 1D, and H' equals ln(1D). [2] [4] [5]
When all types in the dataset of interest are equally common, all pi values equal 1 / R, and the Shannon index hence takes the value ln(R). The more unequal the abundances of the types, the larger the weighted geometric mean of the pi values, and the smaller the corresponding Shannon entropy. If practically all abundance is concentrated to one type, and the other types are very rare (even if there are many of them), Shannon entropy approaches zero. When there is only one type in the dataset, Shannon entropy exactly equals zero (there is no uncertainty in predicting the type of the next randomly chosen entity).
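For example, in a hypothetical dataset with four types: if all four are equally common (each pi = 0.25), then H' = ln 4 ≈ 1.39, whereas if one type accounts for 97% of the entities and the other three for 1% each, H' ≈ 0.17.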
In machine learning, the Shannon index is also known as information gain.
The Rényi entropy is a generalization of the Shannon entropy to other values of q than 1. It can be expressed:
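{}^{q}\!H = \frac{1}{1-q}\, \ln\!\left( \sum_{i=1}^{R} p_i^{\,q} \right)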
which equals
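{}^{q}\!H = \ln\!\left( \frac{1}{\sqrt[q-1]{\sum_{i=1}^{R} p_i\, p_i^{\,q-1}}} \right) = \ln\!\left( {}^{q}\!D \right)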
This means that taking the logarithm of true diversity based on any value of q gives the Rényi entropy corresponding to the same value of q.
The Simpson index was introduced in 1949 by Edward H. Simpson to measure the degree of concentration when individuals are classified into types. [10] The same index was rediscovered by Orris C. Herfindahl in 1950. [11] The square root of the index had already been introduced in 1945 by the economist Albert O. Hirschman. [12] As a result, the same measure is usually known as the Simpson index in ecology, and as the Herfindahl index or the Herfindahl–Hirschman index (HHI) in economics.
The measure equals the probability that two entities taken at random from the dataset of interest represent the same type. [10] It equals:
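\lambda = \sum_{i=1}^{R} p_i^{\,2}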
where R is richness (the total number of types in the dataset). This equation is also equal to the weighted arithmetic mean of the proportional abundances pi of the types of interest, with the proportional abundances themselves being used as the weights. [2] Proportional abundances are by definition constrained to values between zero and one, but their weighted arithmetic mean, and hence λ, can never be smaller than 1/R, a value that is reached when all types are equally abundant.
By comparing the equation used to calculate λ with the equations used to calculate true diversity, it can be seen that 1/λ equals 2D, i.e., true diversity as calculated with q = 2. The original Simpson's index hence equals the corresponding basic sum. [3]
The interpretation of λ as the probability that two entities taken at random from the dataset of interest represent the same type assumes that the first entity is replaced to the dataset before taking the second entity. If the dataset is very large, sampling without replacement gives approximately the same result, but in small datasets, the difference can be substantial. If the dataset is small, and sampling without replacement is assumed, the probability of obtaining the same type with both random draws is:
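\ell = \frac{\sum_{i=1}^{R} n_i (n_i - 1)}{N (N - 1)}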
where ni is the number of entities belonging to the ith type and N is the total number of entities in the dataset. [10] This form of the Simpson index is also known as the Hunter–Gaston index in microbiology. [13]
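A minimal Python sketch (the counts are hypothetical, chosen only for illustration) comparing the with-replacement and without-replacement forms:

counts = [8, 1, 1]  # hypothetical counts of three types, N = 10
N = sum(counts)
lam = sum((n / N) ** 2 for n in counts)                 # with replacement: sum of pi squared
ell = sum(n * (n - 1) for n in counts) / (N * (N - 1))  # without replacement
print(lam, ell)  # 0.66 vs 0.6222...; the difference shrinks as N grows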
Since the mean proportional abundance of the types increases with decreasing number of types and increasing abundance of the most abundant type, λ obtains small values in datasets of high diversity and large values in datasets of low diversity. This is counterintuitive behavior for a diversity index, so transformations of λ that increase with increasing diversity have often been used instead. The most popular of such indices have been the inverse Simpson index (1/λ) and the Gini–Simpson index (1 − λ). [2] [3] Both of these have also been called the Simpson index in the ecological literature, so care is needed to avoid accidentally comparing the different indices as if they were the same.
The inverse Simpson index equals:
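\frac{1}{\lambda} = \frac{1}{\sum_{i=1}^{R} p_i^{\,2}} = {}^{2}\!D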
This simply equals true diversity of order 2, i.e. the effective number of types that is obtained when the weighted arithmetic mean is used to quantify average proportional abundance of types in the dataset of interest.
The index is also used as a measure of the effective number of parties.
The Gini–Simpson index is also called Gini impurity, or Gini's diversity index, [14] in the field of machine learning. The original Simpson index λ equals the probability that two entities taken at random from the dataset of interest (with replacement) represent the same type. Its transformation 1 − λ, therefore, equals the probability that the two entities represent different types. This measure is also known in ecology as the probability of interspecific encounter (PIE) [15] and the Gini–Simpson index. [3] It can be expressed as a transformation of the true diversity of order 2:
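1 - \lambda = 1 - \sum_{i=1}^{R} p_i^{\,2} = 1 - \frac{1}{{}^{2}\!D}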
The Gibbs–Martin index of sociology, psychology, and management studies, [16] which is also known as the Blau index, is the same measure as the Gini–Simpson index.
The quantity 1 − λ is also known as the expected heterozygosity in population genetics.
The Berger–Parker index, named after Wolfgang H. Berger and Frances Lawrence Parker, [17] equals the maximum pi value in the dataset, i.e., the proportional abundance of the most abundant type. This corresponds to the weighted generalized mean of the pi values when q approaches infinity, and hence equals the inverse of the true diversity of order infinity (1/∞D).
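Equivalently, the true diversity of order infinity is {}^{\infty}\!D = 1 / \max_i p_i, and the Berger–Parker index is its reciprocal.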
In mathematics, the geometric mean is a mean or average which indicates a central tendency of a finite set of positive real numbers by using the product of their values. The geometric mean is defined as the nth root of the product of n numbers, i.e., for a set of numbers a1, a2, ..., an, the geometric mean is defined as
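\sqrt[n]{a_1 a_2 \cdots a_n} = \left( \prod_{i=1}^{n} a_i \right)^{1/n}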
In information theory, the entropy of a random variable is the average level of "information", "surprise", or "uncertainty" inherent to the variable's possible outcomes. Given a discrete random variable X, which takes values in the set 𝒳 and is distributed according to p : 𝒳 → [0, 1], the entropy is H(X) = −∑_{x ∈ 𝒳} p(x) log p(x), where ∑ denotes the sum over the variable's possible values. The choice of base for log, the logarithm, varies for different applications. Base 2 gives the unit of bits, while base e gives "natural units" nat, and base 10 gives units of "dits", "bans", or "hartleys". An equivalent definition of entropy is the expected value of the self-information of a variable.
In probability theory and statistics, the Weibull distribution is a continuous probability distribution. It models a broad range of random variables, largely in the nature of a time to failure or time between events. Examples are maximum one-day rainfalls and the time a user spends on a web page.
In physics, a partition function describes the statistical properties of a system in thermodynamic equilibrium. Partition functions are functions of the thermodynamic state variables, such as the temperature and volume. Most of the aggregate thermodynamic variables of the system, such as the total energy, free energy, entropy, and pressure, can be expressed in terms of the partition function or its derivatives. The partition function is dimensionless.
Species diversity is the number of different species that are represented in a given community. The effective number of species refers to the number of equally abundant species needed to obtain the same mean proportional species abundance as that observed in the dataset of interest. Meanings of species diversity may include species richness, taxonomic or phylogenetic diversity, and/or species evenness. Species richness is a simple count of species. Taxonomic or phylogenetic diversity is the genetic relationship between different groups of species. Species evenness quantifies how equal the abundances of the species are.
In mathematical statistics, the Kullback–Leibler (KL) divergence, denoted D_KL(P ∥ Q), is a type of statistical distance: a measure of how one probability distribution P is different from a second, reference probability distribution Q. Mathematically, for discrete distributions it is defined as
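D_{\mathrm{KL}}(P \parallel Q) = \sum_{x \in \mathcal{X}} P(x)\, \log\!\left( \frac{P(x)}{Q(x)} \right)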
Variational Bayesian methods are a family of techniques for approximating intractable integrals arising in Bayesian inference and machine learning. They are typically used in complex statistical models consisting of observed variables as well as unknown parameters and latent variables, with various sorts of relationships among the three types of random variables, as might be described by a graphical model. As typical in Bayesian inference, the parameters and latent variables are grouped together as "unobserved variables". Variational Bayesian methods are primarily used for two purposes: to provide an analytical approximation to the posterior probability of the unobserved variables, in order to do statistical inference over these variables, and to derive a lower bound for the marginal likelihood (the "evidence") of the observed data.
In information theory, the Rényi entropy is a quantity that generalizes various notions of entropy, including Hartley entropy, Shannon entropy, collision entropy, and min-entropy. The Rényi entropy is named after Alfréd Rényi, who looked for the most general way to quantify information while preserving additivity for independent events. In the context of fractal dimension estimation, the Rényi entropy forms the basis of the concept of generalized dimensions.
In information theory, the cross-entropy between two probability distributions p and q, over the same underlying set of events, measures the average number of bits needed to identify an event drawn from the set when the coding scheme used for the set is optimized for an estimated probability distribution q, rather than the true distribution p.
In statistics and information theory, a maximum entropy probability distribution has entropy that is at least as great as that of all other members of a specified class of probability distributions. According to the principle of maximum entropy, if nothing is known about a distribution except that it belongs to a certain class, then the distribution with the largest entropy should be chosen as the least-informative default. The motivation is twofold: first, maximizing entropy minimizes the amount of prior information built into the distribution; second, many physical systems tend to move towards maximal entropy configurations over time.
In information theory, Gibbs' inequality is a statement about the information entropy of a discrete probability distribution. Several other bounds on the entropy of probability distributions are derived from Gibbs' inequality, including Fano's inequality. It was first presented by J. Willard Gibbs in the 19th century.
The principle of detailed balance can be used in kinetic systems which are decomposed into elementary processes. It states that at equilibrium, each elementary process is in equilibrium with its reverse process.
The Theil index is a statistic primarily used to measure economic inequality and other economic phenomena, though it has also been used to measure racial segregation. The Theil index T_T is the same as redundancy in information theory, which is the maximum possible entropy of the data minus the observed entropy. It is a special case of the generalized entropy index. It can be viewed as a measure of redundancy, lack of diversity, isolation, segregation, inequality, non-randomness, and compressibility. It was proposed by the Dutch econometrician Henri Theil (1924–2000) at the Erasmus University Rotterdam.
Differential entropy is a concept in information theory that began as an attempt by Claude Shannon to extend the idea of (Shannon) entropy of a random variable, to continuous probability distributions. Unfortunately, Shannon did not derive this formula, and rather just assumed it was the correct continuous analogue of discrete entropy, but it is not. The actual continuous version of discrete entropy is the limiting density of discrete points (LDDP). Differential entropy is commonly encountered in the literature, but it is a limiting case of the LDDP, and one that loses its fundamental association with discrete entropy.
In ecology, alpha diversity (α-diversity) is the mean species diversity in a site at a local scale. The term was introduced by R. H. Whittaker together with the terms beta diversity (β-diversity) and gamma diversity (γ-diversity). Whittaker's idea was that the total species diversity in a landscape is determined by two different things, the mean species diversity in sites at a more local scale and the differentiation among those sites.
The concept entropy was first developed by German physicist Rudolf Clausius in the mid-nineteenth century as a thermodynamic property that predicts that certain spontaneous processes are irreversible or impossible. In statistical mechanics, entropy is formulated as a statistical property using probability theory. The statistical entropy perspective was introduced in 1870 by Austrian physicist Ludwig Boltzmann, who established a new field of physics that provided the descriptive linkage between the macroscopic observation of nature and the microscopic view based on the rigorous treatment of large ensembles of microscopic states that constitute thermodynamic systems.
In ecology, gamma diversity (γ-diversity) is the total species diversity in a landscape. The term was introduced by R. H. Whittaker together with the terms alpha diversity (α-diversity) and beta diversity (β-diversity). Whittaker's idea was that the total species diversity in a landscape (γ) is determined by two different things, the mean species diversity in sites at a more local scale (α) and the differentiation among those sites (β). According to this reasoning, alpha diversity and beta diversity constitute independent components of gamma diversity:
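\gamma = \alpha \times \beta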
An index of qualitative variation (IQV) is a measure of statistical dispersion in nominal distributions. Examples include the variation ratio or the information entropy.
In mathematics, a function defined on a region of the complex plane is said to be of bounded type if it is equal to the ratio of two analytic functions bounded in that region. But more generally, a function f is of bounded type in a region Ω if and only if f is analytic on Ω and log⁺|f| has a harmonic majorant on Ω, where log⁺(x) = max(0, log x). Being the ratio of two bounded analytic functions is a sufficient condition for a function to be of bounded type, and if Ω is simply connected the condition is also necessary.
The Gibbs rotational ensemble represents the possible states of a mechanical system in thermal and rotational equilibrium at temperature T and angular velocity ω. The Jaynes procedure can be used to obtain this ensemble. An ensemble is the set of microstates corresponding to a given macrostate.