Relative species abundance

Last updated

Relative species abundance is a component of biodiversity and is a measure of how common or rare a species is relative to other species in a defined location or community. [1] Relative abundance is the percent composition of an organism of a particular kind relative to the total number of organisms in the area.[ citation needed ] Relative species abundances tend to conform to specific patterns that are among the best-known and most-studied patterns in macroecology. Different populations in a community exist in relative proportions; this idea is known as relative abundance.

Contents

Introduction

Relative species abundance

Figure 1. Relative species abundance of beetles sampled from the river Thames showing the universal "hollow curve". (derived from data presented in Magurran (2004) and collected by C.B. Williams (1964) ) Hollow curve beetles.png
Figure 1. Relative species abundance of beetles sampled from the river Thames showing the universal "hollow curve". (derived from data presented in Magurran (2004) and collected by C.B. Williams (1964) )

Relative species abundance and species richness describe key elements of biodiversity. [1] Relative species abundance refers to how common or rare a species is relative to other species in a given location or community. [1] [4]

Usually relative species abundances are described for a single trophic level. Because such species occupy the same trophic level they will potentially or actually compete for similar resources. [1] For example, relative species abundances might describe all terrestrial birds in a forest community or all planktonic copepods in a particular marine environment.

Relative species abundances follow very similar patterns over a wide range of ecological communities. When plotted as a histogram of the number of species represented by 1, 2, 3, ..., n individuals usually fit a hollow curve, such that most species are rare, (represented by a single individual in a community sample) and relatively few species are abundant (represented by a large number of individuals in a community sample)(Figure 1). [4] This pattern has been long-recognized and can be broadly summarized with the statement that "most species are rare". [5] For example, Charles Darwin noted in 1859 in The Origin of Species that "... rarity is the attribute of vast numbers of species in all classes...." [6]

Species abundance patterns can be best visualized in the form of relative abundance distribution plots. The consistency of relative species abundance patterns suggests that some common macroecological "rule" or process determines the distribution of individuals among species within a trophic level.

Distribution plots

Figure 2. Preston plot of beetles sampled from the river Thames showing a strong right-skew. PrestonPlot beetles.png
Figure 2. Preston plot of beetles sampled from the river Thames showing a strong right-skew.
Figure 3. Whittaker plot of beetles sampled from the river Thames showing a slight "s"-shape. Rank abundance beetles.png
Figure 3. Whittaker plot of beetles sampled from the river Thames showing a slight "s"-shape.

Relative species abundance distributions are usually graphed as frequency histograms ("Preston plots"; Figure 2) [7] or rank-abundance diagrams ("Whittaker Plots"; Figure 3). [8]

Frequency histogram (Preston plot):

x-axis: logarithm of abundance bins (historically log2 as a rough approximation to the natural logarithm)
y-axis: number of species at given abundance

Rank-abundance diagram (Whittaker plot):

x-axis: species list, ranked in order of descending abundance (i.e. from common to rare)
y-axis: logarithm of % relative abundance

When plotted in these ways, relative species abundances from wildly different data sets show similar patterns: frequency histograms tend to be right-skewed (e.g. Figure 2) and rank-abundance diagrams tend to conform to the curves illustrated in Figure 4.

Understanding relative species abundance patterns

Researchers attempting to understand relative species abundance patterns usually approach them in a descriptive or mechanistic way. Using a descriptive approach biologists attempt to fit a mathematical model to real data sets and infer the underlying biological principles at work from the model parameters. By contrast, mechanistic approaches create a mathematical model based on biological principles and then test how well these models fit real data sets. [9]

Descriptive approaches

Geometric series (Motomura 1932)

Figure 4. Generic Rank-abundance diagram of three common mathematical models used to fit species abundance distributions: Motomura's geometric series, Fisher's logseries, and Preston's log-normal series (modified from Magurran 1988) Common descriptiveWhittaker.jpg
Figure 4. Generic Rank-abundance diagram of three common mathematical models used to fit species abundance distributions: Motomura's geometric series, Fisher's logseries, and Preston's log-normal series (modified from Magurran 1988)
Figure 5. Plant succession in abandoned fields within Brookhaven National Laboratory, NY. Species abundances conform to the geometric series during early succession but approach lognormal as the community ages. (modified from Whittaker 1972 ) Succession.jpg
Figure 5. Plant succession in abandoned fields within Brookhaven National Laboratory, NY. Species abundances conform to the geometric series during early succession but approach lognormal as the community ages. (modified from Whittaker 1972 )

I. Motomura developed the geometric series model based on benthic community data in a lake. [12] Within the geometric series each species' level of abundance is a sequential, constant proportion (k) of the total number of individuals in the community. Thus if k is 0.5, the most common species would represent half of individuals in the community (50%), the second most common species would represent half of the remaining half (25%), the third, half of the remaining quarter (12.5%) and so forth.

Although Motomura originally developed the model as a statistical (descriptive) means to plot observed abundances, the "discovery" of his paper by Western researchers in 1965 led to the model being used as a niche apportionment model the "niche-preemption model". [8] In a mechanistic model k represents the proportion of the resource base acquired by a given species.

The geometric series rank-abundance diagram is linear with a slope of –k, and reflects a rapid decrease in species abundances by rank (Figure 4). [12] The geometric series does not explicitly assume that species colonize an area sequentially, however, the model fits the concept of niche preemption, where species sequentially colonize a region and the first species to arrive receives the majority of resources. [13] The geometric series model fits observed species abundances in highly uneven communities with low diversity. [13] This is expected to occur in terrestrial plant communities (as these assemblages often show strong dominance) as well as communities at early successional stages and those in harsh or isolated environments (Figure 5). [8]

Logseries (Fisher et al 1943)

where:

S = the number of species in the sampled community
N = the number of individuals sampled
= a constant derived from the sample data set

The logseries was developed by Ronald Fisher to fit two different abundance data sets: British moth species (collected by Carrington Williams) and Malaya butterflies (collected by Alexander Steven Corbet). [14] The logic behind the derivation of the logseries is varied [15] however Fisher proposed that sampled species abundances would follow a negative binomial from which the zero abundance class (species too rare to be sampled) was eliminated. [1] He also assumed that the total number of species in a community was infinite. Together, this produced the logseries distribution (Figure 4). The logseries predicts the number of species at different levels of abundance (n individuals) with the formula:

where:

S = the number of species with an abundance of n
x = a positive constant (0 < x < 1) which is derived from the sample data set and generally approaches 1 in value

The number of species with 1, 2, 3, ..., n individuals are therefore:

Fisher’s constants

The constants α and x can be estimated through iteration from a given species data set using the values S and N. [2] Fisher's dimensionless α is often used as a measure of biodiversity, and indeed has recently been found to represent the fundamental biodiversity parameter θ from neutral theory (see below).

Log normal (Preston 1948)

Figure 6. An example of Preston's veil. Fish species abundances sampled using repeated trawling over a one-month (blue bars), two month (gold bars) and one-year period (yellow). One year of sampling indicates that the fish community is log-normally distributed. (derived from Magurran 2004 ) Prestonsveil copy.jpg
Figure 6. An example of Preston's veil. Fish species abundances sampled using repeated trawling over a one-month (blue bars), two month (gold bars) and one-year period (yellow). One year of sampling indicates that the fish community is log-normally distributed. (derived from Magurran 2004 )

Using several data sets (including breeding bird surveys from New York and Pennsylvania and moth collections from Maine, Alberta and Saskatchewan) Frank W. Preston (1948) argued that species abundances (when binned logarithmically in a Preston plot) follow a normal (Gaussian) distribution, partly as a result of the central limit theorem (Figure 4). [7] This means that the abundance distribution is lognormal. According to his argument, the right-skew observed in species abundance frequency histograms (including those described by Fisher et al. (1943) [14] ) was, in fact, a sampling artifact. Given that species toward the left side of the x-axis are increasingly rare, they may be missed in a random species sample. As the sample size increases however, the likelihood of collecting rare species in a way that accurately represents their abundance also increases, and more of the normal distribution becomes visible. [7] The point at which rare species cease to be sampled has been termed Preston's veil line. As the sample size increases Preston's veil is pushed farther to the left and more of the normal curve becomes visible [2] [10] (Figure 6). Williams' moth data, originally used by Fisher to develop the logseries distribution, became increasingly lognormal as more years of sampling were completed. [1] [3]

Calculating theoretical species richness

Preston's theory has an application: if a community is truly lognormal yet under-sampled, the lognormal distribution can be used to estimate the true species richness of a community. Assuming the shape of the total distribution can be confidently predicted from the collected data, the normal curve can be fit via statistical software or by completing the Gaussian formula: [7]

where:

n0 is the number of species in the modal bin (the peak of the curve)
n is the number of species in bins R distant from the modal bin
a is a constant derived from the data

It is then possible to predict how many species are in the community by calculating the total area under the curve (N):

The number of species missing from the data set (the missing area to the left of the veil line) is simply N minus the number of species sampled. [2] Preston did this for two lepidopteran data sets, predicting that, even after 22 years of collection, only 72% and 88% of the species present had been sampled. [7]

Yule model (Nee 2003)

The Yule model is based on a much earlier, GaltonWatson model which was used to describe the distribution of species among genera. [16] The Yule model assumes random branching of species trees, with each species (branch tip) having the equivalent probability of giving rise to new species or becoming extinct. As the number of species within a genus, within a clade, has a similar distribution to the number of individuals within a species, within a community (i.e. the "hollow curve"), Sean Nee (2003) used the model to describe relative species abundances. [4] [17] In many ways this model is similar to niche apportionment models, however, Nee intentionally did not propose a biological mechanism for the model behavior, arguing that any distribution can be produced by a variety of mechanisms. [17]

Mechanistic approaches: niche apportionment

Note: This section provides a general summary of niche apportionment theory, more information can be found under niche apportionment models.

Most mechanistic approaches to species abundance distributions use niche-space, i.e. available resources, as the mechanism driving abundances. If species in the same trophic level consume the same resources (such as nutrients or sunlight in plant communities, prey in carnivore communities, nesting locations or food in bird communities) and these resources are limited, how the resource "pie" is divided among species determines how many individuals of each species can exist in the community. Species with access to abundant resources will have higher carrying capacities than those with little access. Mutsunori Tokeshi [18] later elaborated niche apportionment theory to include niche filling in unexploited resource space. [9] Thus, a species may survive in the community by carving out a portion of another species' niche (slicing up the pie into smaller pieces) or by moving into a vacant niche (essentially making the pie larger, for example, by being the first to arrive in a newly available location or through the development of a novel trait that allows access previously unavailable resources). Numerous niche apportionment models have been developed. Each make different assumptions about how species carve up niche-space.

Unified neutral theory (Hubbell 1979/2001)

The Unified Neutral Theory of Biodiversity and Biogeography (UNTB) is a special form of mechanistic model that takes an entirely different approach to community composition than the niche apportionment models. [1] Instead of species populations reaching equilibrium within a community, the UNTB model is dynamic, allowing for continuing changes in relative species abundances through drift.

A community in the UNTB model can be best visualized as a grid with a certain number of spaces, each occupied with individuals of different species. The model is zero-sum as there are a limited number of spaces that can be occupied: an increase in the number of individuals of one species in the grid must result in corresponding decrease in the number of individuals of other species in the grid. The model then uses birth, death, immigration, extinction and speciation to modify community composition over time.

Hubbell's theta

The UNTB model produces a dimensionless "fundamental biodiversity" number, θ, which is derived using the formula:

θ = 2Jmv

where:

Jm is the size of the metacommunity (the outside source of immigrants to the local community)
v is the speciation rate in the model

Relative species abundances in the UNTB model follow a zero-sum multinomial distribution. [19] The shape of this distribution is a function of the immigration rate, the size of the sampled community (grid), and θ. [19] When the value of θ is small, the relative species abundance distribution is similar to the geometric series (high dominance). As θ gets larger, the distribution becomes increasingly s-shaped (log-normal) and, as it approaches infinity, the curve becomes flat (the community has infinite diversity and species abundances of one). Finally, when θ = 0 the community described consists of only one species (extreme dominance). [1]

Fisher's alpha and Hubbell's theta – an interesting convergence

An unexpected result of the UNTB is that at very large sample sizes, predicted relative species abundance curves describe the metacommunity and become identical to Fisher's logseries. At this point θ also becomes identical to Fisher's for the equivalent distribution and Fisher's constant x is equal to the ratio of birthrate : deathrate. Thus, the UNTB unintentionally offers a mechanistic explanation of the logseries 50 years after Fisher first developed his descriptive model. [1]

Related Research Articles

A histogram is a visual representation of the distribution of quantitative data. To construct a histogram, the first step is to "bin" the range of values— divide the entire range of values into a series of intervals—and then count how many values fall into each interval. The bins are usually specified as consecutive, non-overlapping intervals of a variable. The bins (intervals) are adjacent and are typically of equal size.

A likelihood function measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the joint probability distribution of the random variable that (presumably) generated the observations. When evaluated on the actual data points, it becomes a function solely of the model parameters.

In statistics, sufficiency is a property of a statistic computed on a sample dataset in relation to a parametric model of the dataset. A sufficient statistic contains all of the information that the dataset provides about the model parameters. It is closely related to the concepts of an ancillary statistic which contains no information about the model parameters, and of a complete statistic which only contains information about the parameters and no ancillary information.

<span class="mw-page-title-main">Beta distribution</span> Probability distribution

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] or in terms of two positive parameters, denoted by alpha (α) and beta (β), that appear as exponents of the variable and its complement to 1, respectively, and control the shape of the distribution.

<span class="mw-page-title-main">Gamma distribution</span> Probability distribution

In probability theory and statistics, the gamma distribution is a versatile two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the gamma distribution. There are two equivalent parameterizations in common use:

  1. With a shape parameter α and a scale parameter θ
  2. With a shape parameter and a rate parameter
<span class="mw-page-title-main">Unified neutral theory of biodiversity</span> Theory of evolutionary biology

The unified neutral theory of biodiversity and biogeography is a theory and the title of a monograph by ecologist Stephen P. Hubbell. It aims to explain the diversity and relative abundance of species in ecological communities. Like other neutral theories of ecology, Hubbell assumes that the differences between members of an ecological community of trophically similar species are "neutral", or irrelevant to their success. This implies that niche differences do not influence abundance and the abundance of each species follows a random walk. The theory has sparked controversy, and some authors consider it a more complex version of other null models that fit the data better.

Empirical Bayes methods are procedures for statistical inference in which the prior probability distribution is estimated from the data. This approach stands in contrast to standard Bayesian methods, for which the prior distribution is fixed before any data are observed. Despite this difference in perspective, empirical Bayes may be viewed as an approximation to a fully Bayesian treatment of a hierarchical model wherein the parameters at the highest level of the hierarchy are set to their most likely values, instead of being integrated out. Empirical Bayes, also known as maximum marginal likelihood, represents a convenient approach for setting hyperparameters, but has been mostly supplanted by fully Bayesian hierarchical analyses since the 2000s with the increasing availability of well-performing computation techniques. It is still commonly used, however, for variational methods in Deep Learning, such as variational autoencoders, where latent variable spaces are high-dimensional.

In mathematical statistics, the Kullback–Leibler (KL) divergence, denoted , is a type of statistical distance: a measure of how much a model probability distribution Q is different from a true probability distribution P. Mathematically, it is defined as

<span class="mw-page-title-main">Species richness</span> Variety of species in an ecological community, landscape or region

Species richness is the number of different species represented in an ecological community, landscape or region. Species richness is simply a count of species, and it does not take into account the abundances of the species or their relative abundance distributions. Species richness is sometimes considered synonymous with species diversity, but the formal metric species diversity takes into account both species richness and species evenness.

Spatial ecology studies the ultimate distributional or spatial unit occupied by a species. In a particular habitat shared by several species, each of the species is usually confined to its own microhabitat or spatial niche because two species in the same general territory cannot usually occupy the same ecological niche for any significant length of time.

<span class="mw-page-title-main">Species–area relationship</span> Relationship between the size of an area or habitat and the number of species it can support

The species–area relationship or species–area curve describes the relationship between the area of a habitat, or of part of a habitat, and the number of species found within that area. Larger areas tend to contain larger numbers of species, and empirically, the relative numbers seem to follow systematic mathematical relationships. The species–area relationship is usually constructed for a single type of organism, such as all vascular plants or all species of a specific trophic level within a particular site. It is rarely if ever, constructed for all types of organisms if simply because of the prodigious data requirements. It is related but not identical to the species discovery curve.

In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant. Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. Customer 1 sits at the first table. The next customer either sits at the same table as customer 1, or the next table. This continues, with each customer choosing to either sit at an occupied table with a probability proportional to the number of customers already there, or an unoccupied table. At time n, the n customers have been partitioned among m ≤ n tables. The results of this process are exchangeable, meaning the order in which the customers sit does not affect the probability of the final distribution. This property greatly simplifies a number of problems in population genetics, linguistic analysis, and image recognition.

<span class="mw-page-title-main">Multifractal system</span> System with multiple fractal dimensions

A multifractal system is a generalization of a fractal system in which a single exponent is not enough to describe its dynamics; instead, a continuous spectrum of exponents is needed.

<span class="mw-page-title-main">Interspecific competition</span> Form of competition

Interspecific competition, in ecology, is a form of competition in which individuals of different species compete for the same resources in an ecosystem. This can be contrasted with mutualism, a type of symbiosis. Competition between members of the same species is called intraspecific competition.

<span class="mw-page-title-main">Abundance (ecology)</span> Relative representation of a species in anr ecosystem

In ecology, local abundance is the relative representation of a species in a particular ecosystem. It is usually measured as the number of individuals found per sample. The ratio of abundance of one species to one or multiple other species living in an ecosystem is referred to as relative species abundances. Both indicators are relevant for computing biodiversity.

In ecology, the occupancy–abundance (O–A) relationship is the relationship between the abundance of species and the size of their ranges within a region. This relationship is perhaps one of the most well-documented relationships in macroecology, and applies both intra- and interspecifically. In most cases, the O–A relationship is a positive relationship. Although an O–A relationship would be expected, given that a species colonizing a region must pass through the origin and could reach some theoretical maximum abundance and distribution, the relationship described here is somewhat more substantial, in that observed changes in range are associated with greater-than-proportional changes in abundance. Although this relationship appears to be pervasive, and has important implications for the conservation of endangered species, the mechanism(s) underlying it remain poorly understood.

In probability and statistics, the Tweedie distributions are a family of probability distributions which include the purely continuous normal, gamma and inverse Gaussian distributions, the purely discrete scaled Poisson distribution, and the class of compound Poisson–gamma distributions which have positive mass at zero, but are otherwise continuous. Tweedie distributions are a special case of exponential dispersion models and are often used as distributions for generalized linear models.

In ecology the relative abundance distribution (RAD) or species abundance distribution species abundance distribution (SAD) describes the relationship between the number of species observed in a field study as a function of their observed abundance. The SAD is one of ecology's oldest and most universal laws – every community shows a hollow curve or hyperbolic shape on a histogram with many rare species and just a few common species. When plotted as a histogram of number of species on the y-axis vs. abundance on an arithmetic x-axis, the classic hyperbolic J-curve or hollow curve is produced, indicating a few very abundant species and many rare species. The SAD is central prediction of the Unified neutral theory of biodiversity.

Mechanistic models for niche apportionment are biological models used to explain relative species abundance distributions. These niche apportionment models describe how species break up resource pool in multi-dimensional space, determining the distribution of abundances of individuals among species. The relative abundances of species are usually expressed as a Whittaker plot, or rank abundance plot, where species are ranked by number of individuals on the x-axis, plotted against the log relative abundance of each species on the y-axis. The relative abundance can be measured as the relative number of individuals within species or the relative biomass of individuals within species.

Taylor's power law is an empirical law in ecology that relates the variance of the number of individuals of a species per unit area of habitat to the corresponding mean by a power law relationship. It is named after the ecologist who first proposed it in 1961, Lionel Roy Taylor (1924–2007). Taylor's original name for this relationship was the law of the mean. The name Taylor's law was coined by Southwood in 1966.

References

  1. 1 2 3 4 5 6 7 8 9 Hubbell, S. P. 2001. The unified neutral theory of biodiversity and biogeography. Princeton University Press, Princeton, N.J.
  2. 1 2 3 4 5 6 7 Magurran, A. E. 2004. Measuring biological diversity. Blackwell Scientific, Oxford.
  3. 1 2 3 4 Williams, C.B. 1964. Patterns in the balance of nature and related problems in quantitative ecology. Academic Press, London.
  4. 1 2 3 McGill, B. J., Etienne R. S., Gray J. S., Alonso D., Anderson M. J., Benecha H. K., Dornelas M., Enquist B. J., Green J. L., He F., Hurlbert A. H., Magurran A. E., Marquet P. A., Maurer B. A., Ostling A., Soykan C. U., Ugland K. I., White E. P. 2007. "Species abundance distributions: moving beyond single prediction theories to integration within an ecological framework". Ecology Letters 10: 995–1015
  5. Andrewartha, H. G.; Birch L. C. 1954. The Distribution and Abundance of Animals. The University of Chicago Press, Chicago, Illinois.
  6. Darwin, C. 2004 (1859). The Origin of Species by means of natural selection or the preservation of favoured races in the struggle for life. Castle Books, New Jersey.
  7. 1 2 3 4 5 Preston, F. W. (1948). "The Commonness, and Rarity, of Species" (PDF). Ecology. 29 (3): 254–283. Bibcode:1948Ecol...29..254P. doi:10.2307/1930989. JSTOR   1930989.
  8. 1 2 3 Whittaker, R. H. 1965. "Dominance and diversity in land plant communities", Science 147: 250–260
  9. 1 2 Tokeshi, M. 1999. Species coexistence: ecological and evolutionary perspectives. Blackwell Scientific, Oxford
  10. 1 2 Magurran, A. E. 1988. Ecological Diversity and Its Measurement. Princeton Univ. Press
  11. Whittaker, R. H. 1972. "Evolution and measurement of species diversity". Taxon 21:213251
  12. 1 2 Motomura, I. 1932. "A statistical treatment of associations", Japanese Journal of Zoology. 44: 379–383 (in Japanese)
  13. 1 2 He, F., Tang D. 2008. "Estimating the niche preemption parameter of the geometric series" Acta Oecologica 33 (1):105107
  14. 1 2 Fisher, R. A; Corbet, A. S.; Williams, C. B. 1943. "The relation between the number of species and the number of individuals in a random sample of an animal population". Journal of Animal Ecology 12: 4258.
  15. Johnson, J. L.; Adrienne, W. K.; Kotz, S. (2005). Univariate and Discrete Distributions, 3rd edn. John Wiley and Sons, New York
  16. Yule, G. U. 1924. "A mathematical theory of evolution based on the conclusions of Dr. J. C. Willis, FRS". Philosophical Transactions of the Royal Society B 213: 21–87
  17. 1 2 Nee, S. 2003. "The unified phenomenological theory of biodiversity". In T. Blackburn and K. Gaston, editors, Macroecology: concepts and consequences. Blackwell Scientific, Oxford.
  18. Tokeshi, M. 1990. "Niche apportionment or random assortment: species abundance patterns revisited". Journal of Animal Ecology 59: 11291146
  19. 1 2 Hubbell, S. P.; Lake J. 2003. "The neutral theory of biogeography and biodiversity: and beyond". In T. Blackburn and K. Gaston, editors, Macroecology: concepts and consequences. Blackwell Scientific, Oxford.