Species discovery curve

Last updated

In ecology, the species discovery curve (also known as a species accumulation curve or collector's curve [1] ) is a graph recording the cumulative number of species of living things recorded in a particular environment as a function of the cumulative effort expended searching for them (usually measured in person-hours). It is related to, but not identical with, the species-area curve.

The species discovery curve will necessarily be increasing, and will normally be negatively accelerated (that is, its rate of increase will slow down). Plotting the curve gives a way of estimating the number of additional species that will be discovered with further effort. This is usually done by fitting some kind of functional form to the curve, either by eye or by using non-linear regression techniques. Commonly used functional forms include the logarithmic function and the negative exponential function. The advantage of the negative exponential function is that it tends to an asymptote which equals the number of species that would be discovered if infinite effort is expended. However, some theoretical approaches imply that the logarithmic curve may be more appropriate,[ citation needed ] implying that though species discovery will slow down with increasing effort, it will never entirely cease, so there is no asymptote, and if infinite effort was expended, an infinite number of species would be discovered.[ citation needed ] An example in which one would not expect the function to asymptote is in the study of genetic sequences where new mutations and sequencing errors may lead to infinite variants.

The first theoretical investigation of the species-discovery process was in a classic paper by Fisher, Corbet and Williams (1943), which was based on a large collection of butterflies made in Malaya. Theoretical statistical work on the problem continues, see for example the recent paper by Chao and Shen (2004). The theory is linked to that of Zipf's law.

The same approach is used in many other fields. For example, in ethology, it can be applied to the number of distinct fixed action patterns that will be discovered as a function of cumulative effort studying the behaviour of a species of animal; in molecular genetics it is now being applied to the number of distinct genes that are discovered; and in literary studies, it can be used to estimate the total vocabulary of a writer from the given sample of his or her recorded works (see Efron & Thisted, 1976).

Related Research Articles

<span class="mw-page-title-main">Power law</span> Functional relationship between two quantities

In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a relative change in the other quantity proportional to a power of the change, independent of the initial size of those quantities: one quantity varies as a power of another. For instance, considering the area of a square in terms of the length of its side, if the length is doubled, the area is multiplied by a factor of four.

<span class="mw-page-title-main">Logistic function</span> S-shaped curve

A logistic function or logistic curve is a common S-shaped curve with equation

A learning curve is a graphical representation of the relationship between how proficient people are at a task and the amount of experience they have. Proficiency usually increases with increased experience, that is to say, the more someone, groups, companies or industries perform a task, the better their performance at the task.

In analytical geometry, a transcendental curve is a curve that is not an algebraic curve. Here for a curve, C, what matters is the point set underlying C, not a given parametrisation. For example, the unit circle is an algebraic curve ; the usual parametrisation by trigonometric functions may involve those transcendental functions, but certainly the unit circle is defined by a polynomial equation.

Species diversity is the number of different species that are represented in a given community. The effective number of species refers to the number of equally abundant species needed to obtain the same mean proportional species abundance as that observed in the dataset of interest. Meanings of species diversity may include species richness, taxonomic or phylogenetic diversity, and/or species evenness. Species richness is a simple count of species. Taxonomic or phylogenetic diversity is the genetic relationship between different groups of species. Species evenness quantifies how equal the abundances of the species are.

<span class="mw-page-title-main">Logarithmic distribution</span>

In probability and statistics, the logarithmic distribution is a discrete probability distribution derived from the Maclaurin series expansion

In convex analysis, a non-negative function f : RnR+ is logarithmically concave if its domain is a convex set, and if it satisfies the inequality

<span class="mw-page-title-main">Species richness</span> Variety of species in an ecological community, landscape or region

Species richness is the number of different species represented in an ecological community, landscape or region. Species richness is simply a count of species, and it does not take into account the abundances of the species or their relative abundance distributions. Species richness is sometimes considered synonymous with species diversity, but the formal metric species diversity takes into account both species richness and species evenness.

<span class="mw-page-title-main">Species–area relationship</span>

The species–area relationship or species–area curve describes the relationship between the area of a habitat, or of part of a habitat, and the number of species found within that area. Larger areas tend to contain larger numbers of species, and empirically, the relative numbers seem to follow systematic mathematical relationships. The species–area relationship is usually constructed for a single type of organism, such as all vascular plants or all species of a specific trophic level within a particular site. It is rarely if ever, constructed for all types of organisms if simply because of the prodigious data requirements. It is related but not identical to the species discovery curve.

In statistical modeling, polynomial functions and rational functions are sometimes used as an empirical technique for curve fitting.

<span class="mw-page-title-main">Rarefaction (ecology)</span> Technique to assess species richness

In ecology, rarefaction is a technique to assess species richness from the results of sampling. Rarefaction allows the calculation of species richness for a given number of individual samples, based on the construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the number of samples. Rarefaction curves generally grow rapidly at first, as the most common species are found, but the curves plateau as only the rarest species remain to be sampled.

Relative species abundance is a component of biodiversity and is a measure of how common or rare a species is relative to other species in a defined location or community. Relative abundance is the percent composition of an organism of a particular kind relative to the total number of organisms in the area. Relative species abundances tend to conform to specific patterns that are among the best-known and most-studied patterns in macroecology. Different populations in a community exist in relative proportions; this idea is known as relative abundance.

The survival function is a function that gives the probability that a patient, device, or other object of interest will survive past a certain time. The survival function is also known as the survivor function or reliability function. The term reliability function is common in engineering while the term survival function is used in a broader range of applications, including human mortality. The survival function is the complementary cumulative distribution function of the lifetime. Sometimes complementary cumulative distribution functions are called survival functions in general.

<span class="mw-page-title-main">CumFreq</span> Software tool for data analysis and statistics

In statistics and data analysis the application software CumFreq is a tool for cumulative frequency analysis of a single variable and for probability distribution fitting.

In exponentiation, the base is the number b in an expression of the form bn.

Taylor's power law is an empirical law in ecology that relates the variance of the number of individuals of a species per unit area of habitat to the corresponding mean by a power law relationship. It is named after the ecologist who first proposed it in 1961, Lionel Roy Taylor (1924–2007). Taylor's original name for this relationship was the law of the mean. The name Taylor's law was coined by Southwood in 1966.

Probability distribution fitting or simply distribution fitting is the fitting of a probability distribution to a series of data concerning the repeated measurement of a variable phenomenon. The aim of distribution fitting is to predict the probability or to forecast the frequency of occurrence of the magnitude of the phenomenon in a certain interval.

The unseen species problem is commonly referred to in ecology and deals with the estimation of the number of species represented in an ecosystem that were not observed by samples. It more specifically relates to how many new species would be discovered if more samples were taken in an ecosystem. The study of the unseen species problem was started in the early 1940s by Alexander Steven Corbet. He spent 2 years in British Malaya trapping butterflies and was curious how many new species he would discover if he spent another 2 years trapping. Many different estimation methods have been developed to determine how many new species would be discovered given more samples. The unseen species problem also applies more broadly, as the estimators can be used to estimate any new elements of a set not previously found in samples. An example of this is determining how many words William Shakespeare knew based on all of his written works.

References

  1. Colwell, R. K.; Coddington, J. A. (1994-07-29). "Estimating terrestrial biodiversity through extrapolation". Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences. 345 (1311): 101–118. doi:10.1098/rstb.1994.0091. ISSN   0962-8436. PMID   7972351.