Species distribution modelling

Example of simple correlative species distribution modelling using rainfall, altitude, and current species observations to create a model of possible existence for a certain species.

Species distribution modelling (SDM), also known as environmental (or ecological) niche modelling (ENM), habitat modelling, predictive habitat distribution modelling, and range mapping, [1] uses computer algorithms to predict the distribution of a species across geographic space and time using environmental data. The environmental data are most often climate data (e.g. temperature, precipitation), but can include other variables such as soil type, water depth, and land cover. SDMs are used in several research areas in conservation biology, ecology and evolution. These models can be used to understand how environmental conditions influence the occurrence or abundance of a species, and for predictive purposes (ecological forecasting). Predictions from an SDM may be of a species’ future distribution under climate change, a species’ past distribution in order to assess evolutionary relationships, or the potential future distribution of an invasive species. Predictions of current and/or future habitat suitability can be useful for management applications (e.g. reintroduction or translocation of vulnerable species, reserve placement in anticipation of climate change).

There are two main types of SDMs. Correlative SDMs, also known as climate envelope models, bioclimatic models, or resource selection function models, model the observed distribution of a species as a function of environmental conditions. [1] Mechanistic SDMs, also known as process-based models or biophysical models, use independently derived information about a species' physiology to develop a model of the environmental conditions under which the species can exist. [2]

The extent to which such modelled data reflect real-world species distributions will depend on a number of factors, including the nature, complexity, and accuracy of the models used and the quality of the available environmental data layers; the availability of sufficient and reliable species distribution data as model input; and the influence of various factors such as barriers to dispersal, geologic history, or biotic interactions, that increase the difference between the realized niche and the fundamental niche. Environmental niche modelling may be considered a part of the discipline of biodiversity informatics.

History

A. F. W. Schimper used geographical and environmental factors to explain plant distributions in his 1898 Pflanzengeographie auf physiologischer Grundlage (Plant Geography Upon a Physiological Basis) and his 1908 work of the same name. [3] Andrew Murray used the environment to explain the distribution of mammals in his 1866 The Geographical Distribution of Mammals. [4] Robert Whittaker's work with plants and Robert MacArthur's work with birds strongly established the role the environment plays in species distributions. [1] Elgene O. Box constructed environmental envelope models to predict the range of tree species. [5] His computer simulations were among the earliest uses of species distribution modelling. [1]

The adoption of generalised linear models (GLMs) made it possible to create more sophisticated and realistic species distribution models. The expansion of remote sensing and the development of GIS-based environmental modelling increased the amount of environmental information available for model-building and made it easier to use. [1]

Correlative vs mechanistic models

Correlative SDMs

SDMs originated as correlative models. Correlative SDMs model the observed distribution of a species as a function of geographically referenced climatic predictor variables using multiple regression approaches. Given a set of georeferenced observed presences of a species and a set of climate maps, an algorithm finds the most likely environmental ranges within which the species lives. Correlative SDMs assume that species are at equilibrium with their environment and that the relevant environmental variables have been adequately sampled. The models allow for interpolation between a limited number of species occurrences.
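As a minimal sketch of how environmental ranges can be derived from presence records, the following Python example fits a rectangular "envelope" (the minimum and maximum of each variable over the presence sites) and classifies map cells against it. The variable names, the presence records, and the grid cells are hypothetical values chosen for illustration; real envelope methods typically use percentiles and many more variables.

```python
from typing import Dict, List, Tuple

Record = Dict[str, float]  # one site's environmental values


def fit_envelope(presences: List[Record]) -> Dict[str, Tuple[float, float]]:
    """Return the (min, max) range of each variable over the presence sites."""
    variables = presences[0].keys()
    return {
        v: (min(p[v] for p in presences), max(p[v] for p in presences))
        for v in variables
    }


def inside_envelope(site: Record, envelope: Dict[str, Tuple[float, float]]) -> bool:
    """A site is predicted suitable only if every variable is within range."""
    return all(lo <= site[v] <= hi for v, (lo, hi) in envelope.items())


# Hypothetical presence records (rainfall in mm/yr, altitude in m).
presences = [
    {"rain": 800.0, "alt": 300.0},
    {"rain": 950.0, "alt": 450.0},
    {"rain": 1100.0, "alt": 250.0},
]
envelope = fit_envelope(presences)

# Hypothetical map cells to classify.
grid = {
    "cell_a": {"rain": 900.0, "alt": 400.0},   # inside both ranges
    "cell_b": {"rain": 500.0, "alt": 350.0},   # drier than any presence site
    "cell_c": {"rain": 1000.0, "alt": 900.0},  # higher than any presence site
}
prediction = {name: inside_envelope(env, envelope) for name, env in grid.items()}
print(prediction)
```

Because the envelope is built only from observed presences, it can interpolate between known sites but says nothing reliable about conditions outside the sampled ranges.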

For these algorithms to be effective, observations are needed not only of species presences but also of absences, that is, of locations where the species does not live. Records of species absences are typically not as common as records of presences, so "random background" or "pseudo-absence" data are often used to fit these models. If records of species occurrences are incomplete, pseudo-absences can introduce bias. Since correlative SDMs are models of a species’ observed distribution, they are models of the realized niche (the environments where a species is found), as opposed to the fundamental niche (the environments where a species can be found, or where the abiotic environment is appropriate for its survival). For a given species, the realized and fundamental niches might be the same, but if a species is geographically confined due to dispersal limitation or species interactions, the realized niche will be smaller than the fundamental niche.
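The use of background points in place of true absences can be sketched in a few lines of Python. This is an illustrative example under stated assumptions, not a reference implementation: the study area, the presence temperatures, the number of background points, and the helper names (`sample_background`, `fit_logistic`, `suitability`) are all hypothetical, and the logistic regression is fitted with plain gradient descent to keep the example free of external libraries.

```python
import math
import random
import statistics
from typing import List, Tuple


def sample_background(cells: List[float], n: int, rng: random.Random) -> List[float]:
    """Draw n background (pseudo-absence) cells at random from the study area."""
    return [rng.choice(cells) for _ in range(n)]


def fit_logistic(x: List[float], y: List[int],
                 lr: float = 0.5, steps: int = 2000) -> Tuple[float, float]:
    """Fit p = 1 / (1 + exp(-(b0 + b1 * x))) by batch gradient descent."""
    b0, b1 = 0.0, 0.0
    n = len(x)
    for _ in range(steps):
        g0 = g1 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            g0 += p - yi          # gradient of the log-loss w.r.t. b0
            g1 += (p - yi) * xi   # gradient of the log-loss w.r.t. b1
        b0 -= lr * g0 / n
        b1 -= lr * g1 / n
    return b0, b1


rng = random.Random(42)                        # fixed seed for repeatability
study_area = [float(t) for t in range(30)]     # hypothetical cell temperatures (deg C)
presence_temps = [18.0, 20.0, 21.0, 22.0, 23.0, 24.0, 25.0, 26.0]
background_temps = sample_background(study_area, 50, rng)

temps = presence_temps + background_temps
labels = [1] * len(presence_temps) + [0] * len(background_temps)

# Standardise the predictor so gradient descent behaves well.
mean_t = statistics.mean(temps)
sd_t = statistics.pstdev(temps)
z = [(t - mean_t) / sd_t for t in temps]
b0, b1 = fit_logistic(z, labels)


def suitability(temp: float) -> float:
    """Relative suitability of a cell with the given temperature."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * (temp - mean_t) / sd_t)))


print(round(suitability(5.0), 3), round(suitability(25.0), 3))
```

Since the background points are not confirmed absences, the fitted values are best read as relative suitability rather than as a probability of occurrence.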

Correlative SDMs are easier and faster to implement than mechanistic SDMs, and can make ready use of available data. Because they are correlative, however, they do not provide much information about causal mechanisms and are not good for extrapolation. They will also be inaccurate if the observed species range is not at equilibrium (e.g. if a species has been recently introduced and is actively expanding its range).

Mechanistic SDMs

Mechanistic SDMs are more recently developed. In contrast to correlative models, mechanistic SDMs use physiological information about a species (taken from controlled field or laboratory studies) to determine the range of environmental conditions within which the species can persist. [2] These models aim to directly characterize the fundamental niche, and to project it onto the landscape. A simple model may only identify threshold values outside of which a species cannot survive. A more complex model may consist of several sub-models, e.g. micro-climate conditions given macro-climate conditions, body temperature given micro-climate conditions, fitness or other biological rates (e.g. survival, fecundity) given body temperature (thermal performance curves), resource or energy requirements, and population dynamics. Geographically referenced environmental data are used as model inputs. Because the species distribution predictions are independent of the species’ known range, these models are especially useful for species whose range is actively shifting and not at equilibrium, such as invasive species.
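The chain of sub-models described above can be illustrated with a toy Python sketch. Everything in it is an assumption made for illustration: the critical temperatures, the Gaussian shape of the thermal performance curve, the `microclimate` rule (open ground up to 5 degrees warmer than the air), and the three grid cells. No occurrence records are used at any point, which is the defining feature of the mechanistic approach.

```python
import math
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class Physiology:
    ct_min: float  # lower critical temperature (deg C), hypothetical lab value
    ct_max: float  # upper critical temperature (deg C), hypothetical lab value
    t_opt: float   # temperature of peak performance
    width: float   # breadth of the performance curve


def microclimate(air_temp: float, shade: float) -> float:
    """Toy sub-model: body temperature from air temperature and shade (0..1)."""
    return air_temp + 5.0 * (1.0 - shade)


def can_survive(body_temp: float, phys: Physiology) -> bool:
    """Threshold model: survival only between the critical temperatures."""
    return phys.ct_min <= body_temp <= phys.ct_max


def performance(body_temp: float, phys: Physiology) -> float:
    """Relative performance in [0, 1]: Gaussian around t_opt, zero outside limits."""
    if not can_survive(body_temp, phys):
        return 0.0
    return math.exp(-((body_temp - phys.t_opt) / phys.width) ** 2)


phys = Physiology(ct_min=10.0, ct_max=40.0, t_opt=26.0, width=8.0)

# Hypothetical grid cells: (macro-climate air temperature, shade fraction).
cells: Dict[str, Tuple[float, float]] = {
    "north": (8.0, 0.5),
    "centre": (23.5, 0.5),
    "south": (38.0, 0.0),
}

suitability = {
    name: performance(microclimate(air, shade), phys)
    for name, (air, shade) in cells.items()
}
print(suitability)
```

Here the southern cell is excluded because its micro-climate exceeds the upper critical temperature, even though its air temperature alone does not.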

Mechanistic SDMs incorporate causal mechanisms and are better for extrapolation and non-equilibrium situations. However, they are more labor-intensive to create than correlative models and require the collection and validation of large amounts of physiological data, which may not be readily available. The models require many assumptions and parameter estimates, and they can become very complicated.

Dispersal, biotic interactions, and evolutionary processes present challenges, as they are not usually incorporated into either correlative or mechanistic models.

Correlative and mechanistic models can be used in combination to gain additional insights. For example, a mechanistic model could be used to identify areas that are clearly outside the species’ fundamental niche, and these areas can be marked as absences or excluded from analysis. Morin and Thuiller give a comparison between mechanistic and correlative models. [6]
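One way to picture this combination is the following Python sketch, in which a physiological threshold rule labels cells as absences before a correlative model is fitted. The cell temperatures, the presence set, the critical temperatures, and the function names are hypothetical; cells that are neither observed presences nor physiologically excluded are simply left out of the training set.

```python
from typing import Dict, List, Set, Tuple


def outside_fundamental_niche(temp: float, ct_min: float, ct_max: float) -> bool:
    """Mechanistic rule: the species cannot persist beyond its thermal limits."""
    return temp < ct_min or temp > ct_max


def build_training_set(cells: Dict[str, float], presences: Set[str],
                       ct_min: float, ct_max: float) -> List[Tuple[str, float, int]]:
    """Label presences as 1 and physiologically impossible cells as 0."""
    rows = []
    for cell_id, temp in sorted(cells.items()):
        if cell_id in presences:
            rows.append((cell_id, temp, 1))
        elif outside_fundamental_niche(temp, ct_min, ct_max):
            rows.append((cell_id, temp, 0))
        # Other cells are unlabelled: the species may simply not have been recorded.
    return rows


# Hypothetical cell temperatures (deg C) and observed presences.
cells = {"c1": 2.0, "c2": 15.0, "c3": 22.0, "c4": 45.0, "c5": 18.0}
observed_presences = {"c3", "c5"}

training = build_training_set(cells, observed_presences, ct_min=10.0, ct_max=40.0)
print(training)
```

The resulting rows could then be passed to any correlative fitting routine in place of randomly drawn pseudo-absences.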

Niche modelling algorithms (correlative)

There are a variety of mathematical methods that can be used for fitting, selecting, and evaluating correlative SDMs. Algorithms include "profile" methods such as BIOCLIM [7] [8] and DOMAIN, which are simple statistical techniques that use, for example, environmental distance to known sites of occurrence; "regression" methods (e.g. forms of generalized linear models); and "machine learning" methods such as maximum entropy (MAXENT). Ten machine learning algorithms used in SDM are presented in Effrosynidis et al. [9] An incomplete list of algorithms that have been used for niche modelling includes:

Profile techniques

Regression-based techniques

Machine learning techniques

Furthermore, ensemble models can be built from several model outputs so that the result captures components of each. Often the mean or median value across several models is used as an ensemble. Similarly, consensus models are models that fall closest to some measure of central tendency of all models; they can be individual model runs or ensembles of several models.
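A minimal Python sketch of both ideas follows. The three sets of per-cell suitability scores and their labels are hypothetical, and the choice of Euclidean distance to the median ensemble as the measure of "closest" is an assumption made for this example; other distance measures or central tendencies could equally be used.

```python
import math
import statistics
from typing import Callable, Dict, List, Sequence

# Hypothetical suitability scores from three models for the same four cells.
model_outputs: Dict[str, List[float]] = {
    "envelope": [0.9, 0.2, 0.6, 0.1],
    "glm": [0.8, 0.25, 0.55, 0.2],
    "maxent": [0.7, 0.1, 0.9, 0.3],
}


def ensemble(outputs: Dict[str, List[float]],
             how: Callable[[Sequence[float]], float]) -> List[float]:
    """Combine the models cell by cell with the given summary function."""
    return [how(values) for values in zip(*outputs.values())]


def consensus_model(outputs: Dict[str, List[float]], centre: List[float]) -> str:
    """Name of the single model closest (Euclidean distance) to the centre."""
    def distance(name: str) -> float:
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(outputs[name], centre)))
    return min(outputs, key=distance)


mean_ensemble = ensemble(model_outputs, statistics.mean)
median_ensemble = ensemble(model_outputs, statistics.median)
consensus = consensus_model(model_outputs, median_ensemble)

print(median_ensemble)
print(consensus)
```

The median is less sensitive than the mean to a single model that disagrees strongly with the others, as the third cell in this example shows.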

Niche modelling software (correlative)

SPACES is an online environmental niche modelling platform that allows users to design and run dozens of the most prominent algorithms in a high-performance, multi-platform, browser-based environment.

MaxEnt is the most widely used method and software. It uses presence-only data and performs well when few presence records are available.

ModEco implements various algorithms.

DIVA-GIS has an easy-to-use implementation of BIOCLIM that is well suited to educational use.

The Biodiversity and Climate Change Virtual Laboratory (BCCVL) is a "one stop modelling shop" that simplifies the process of biodiversity and climate impact modelling. It connects the research community to Australia's national computational infrastructure by integrating a suite of tools in a coherent online environment. Users can access global climate and environmental datasets or upload their own data, perform data analysis across six different experiment types with a suite of 17 different algorithms, and easily visualise, interpret and evaluate the results of the models. Experiment types include: Species Distribution Model, Multispecies Distribution Model, Species Trait Model (currently under development), Climate Change Projection, Biodiverse Analysis and Ensemble Analysis.

Another example is Ecocrop, which is used to determine the suitability of a crop to a specific environment. [11] This database system can also project crop yields and evaluate the impact of environmental factors such as climate change on plant growth and suitability. [12]

Most niche modelling algorithms are available in the R packages 'dismo', 'biomod2' and 'mopa'.

Software developers may want to build on the openModeller project.

The Collaboratory for Adaptation to Climate Change (adapt.nd.edu, archived 2012-08-06 at the Wayback Machine) has implemented an online version of openModeller that allows users to design and run openModeller in a high-performance, browser-based environment to allow for multiple parallel experiments without the limitations of local processor power.

References

  1. Elith, Jane; Leathwick, John R. (2009-02-06). "Species Distribution Models: Ecological Explanation and Prediction Across Space and Time". Annual Review of Ecology, Evolution, and Systematics. 40 (1): 677–697. doi:10.1146/annurev.ecolsys.110308.120159. ISSN 1543-592X. S2CID 86460963.
  2. Kearney, Michael; Porter, Warren (2009). "Mechanistic niche modelling: combining physiological and spatial data to predict species' ranges". Ecology Letters. 12 (4): 334–350. doi:10.1111/j.1461-0248.2008.01277.x. ISSN 1461-0248. PMID 19292794.
  3. Schimper, Andreas Franz Wilhelm (1908). Pflanzen-geographie auf physiologischer Grundlage (in German) (2nd ed.). Jena: Gustav Fischer. doi:10.5962/BHL.TITLE.46243. OCLC 12120623. OL 24353101M. Wikidata Q117084350.
  4. Murray, Andrew (1866). The geographical distribution of mammals. London: Day and Son, limited. doi:10.5962/BHL.TITLE.15762. LCCN 04035567. OCLC 8680065. OL 16272962M. Wikidata Q51421963.
  5. Box, Elgene O. (1981-05-01). "Predicting physiognomic vegetation types with climate variables". Vegetatio. 45 (2): 127–139. doi:10.1007/BF00119222. ISSN 1573-5052. S2CID 25941018.
  6. Morin, X.; Thuiller (2009). "Comparing niche- and process-based models to reduce prediction uncertainty in species range shifts under climate change". Ecology. 90 (5): 1301–13. doi:10.1890/08-0134.1. PMID 19537550.
  7. Nix HA (1986). "BIOCLIM — a Bioclimatic Analysis and Prediction System". Research Report, CSIRO Division of Water and Land Resources. 1983–1985: 59–60.
  8. Nix HA (1986). "A biogeographic analysis of Australian elapid snakes". In Longmore (ed.). Atlas of Elapid Snakes of Australia. Australian Flora and Fauna Series 7. Bureau of Flora and Fauna, Canberra. pp. 4–15.
  9. Effrosynidis, Dimitrios; Tsikliras, Athanassios; Arampatzis, Avi; Sylaios, Georgios (2020-12-13). "Species Distribution Modelling via Feature Engineering and Machine Learning for Pelagic Fishes in the Mediterranean Sea". Applied Sciences. 10 (24): 8900. doi:10.3390/app10248900.
  10. Real, Raimundo; Barbosa, A. Márcia; Vargas, J. Mario (2006). "Obtaining Environmental Favourability Functions from Logistic Regression". Environmental and Ecological Statistics. 13 (2): 237–245. doi:10.1007/s10651-005-0003-3. hdl:10174/20244. S2CID 34887643.
  11. "FAO Ecocrop". ECHOcommunity. Retrieved 2019-08-19.
  12. Rosenstock, Todd S.; Nowak, Andreea; Girvetz, Evan (2018). The Climate-Smart Agriculture Papers: Investigating the Business of a Productive, Resilient and Low Emission Future. Cham, Switzerland: Springer. p. 41. ISBN 9783319927978.
