Push of the past

Last updated • 5 min readFrom Wikipedia, The Free Encyclopedia

The push of the past [1] [2] is a type of survivorship bias associated with evolutionary diversification when extinction is possible. Groups that survive a long time are likely to have “got off to a flying start”, [1] and this statistical bias creates an illusion of a true slow-down of diversification rate through time.

Contents

Birth–Death modelling in evolutionary studies

The evolutionary processes of speciation and extinction can be modelled with a stochasticbirth–death model” (BDM), which is an important component in the study of macroevolution. A BDM assigns each species a certain probability of splitting () or going extinct () per interval of time. [3] This gives rise to an exponential distribution, with the number of species in a particular clade N at any time t given by

,

although this expression only gives the expected value when and are large (see below).

In the special case of there being no extinction, this simplifies to the so-called "Yule process".

Lineage-through-time plots

A different type of plot of diversity through time, called a “lineage through time” (LTT) plot, retrospectively reconstructs the number of lineages that led to the living species of a group. This is equivalent to constructing a dated phylogeny and then counting how many branches are present at each time interval. As we know retrospectively that all such lineages survived until the present, it follows that no extinction is possible along them. It can be shown that the rate of production of new lineages through time is given by . [2]

Survivorship bias in diversification

Rather than considering the distribution of all possible stochastic outcomes for given values of and it is also possible to consider what happens when certain conditions of survivorship are imposed on the possible outcomes.

Push of the past

If a BDM is forward-modelled, i.e. if the fate of an original single species is modelled through time, then a wide range of possible outcomes can occur, as the process is stochastic. With significant extinction rates, any particular clade is likely to be short-lived. However, we know that relatively long-lived clades such as the plants or animals by definition did not go extinct. As a result, their patterns of diversification will be a sub-set of all the possible outcomes for diversifications with their particular values of and - all patterns with early extinction will be excluded. Imposing the condition of survival on a clade implies that rates of early diversification will be higher than expected. It can be shown that for a long-lived clade, the expected initial short-term rate of diversification is approximately , [2] [4] as opposed to the long-term rate of . However, the wide confidence intervals on this value mean that values of initial diversification of up to fall within the 95% range. Long-lived clades should thus show a characteristic early burst of diversification that quickly declines to the long-term rate, an effect called the "push of the past".

Pull of the present

For a normal-sized clade, the push of the past is only observed in the raw count of species through time (e.g. that reconstructed from the fossil record), but the rate of lineage increase is affected as the present is approached. This is because recently created sub-clades within a particular group have an expected lifetime, and as the present is approached, these sub-clades will not have had time to go extinct. Thus, the rate of creation of reconstructed lineages should increase in the near past from to in the present - living species by definition have an observed zero extinction rate. This theoretical apparent increase in the rate of lineage production has been termed the "pull of the present".

Log plots of species numbers for both raw species (blue lines) and lineages through time (red line) when diversification is conditioned on survival to the past. In A, the rapid early diversification rate in the blue line is the push of past, and the late increase along the red line is the pull of the present. Shading gives 95% confidence intervals. In B and C, the observed slowdown in rates of diversification through time and relative to diversity are plotted. From Budd and Mann (2018). Push of the past.jpg
Log plots of species numbers for both raw species (blue lines) and lineages through time (red line) when diversification is conditioned on survival to the past. In A, the rapid early diversification rate in the blue line is the push of past, and the late increase along the red line is the pull of the present. Shading gives 95% confidence intervals. In B and C, the observed slowdown in rates of diversification through time and relative to diversity are plotted. From Budd and Mann (2018).

In reality, the “pull of the present” has proven difficult to demonstrate: rates of lineage production in reconstructed phylogenies often show a slow-down or even decrease as the present is approached. This conundrum has been much discussed, and two major solutions have been proposed: first, that diversification is diversity dependent, [5] so that as the carrying capacity of the environment is reached the rate of lineage production slows; secondly, that our modern species concept does not properly capture the “lineages” of BDM, and that speciation as we recognize it is only the end point of a drawn-out process of splitting of subpopulations through time, each of which could be considered to be a lineage in itself. [6]

Turnover and survivorship bias

For a given diversification rate of , it is possible to consider high turnover (λ and μ high) and low turnover (λ and μ low) scenarios. [2] As the push of the past and pull of the present depend on the stochastic absence of extinction, it follows that both these effects are greatest when m is high, i.e. in high turnover situations. For example, if λ is 0.6 and μ 0.55 (both measured in rates per species per million years), the initial rate of species production would be 1.2 (2λ); but if they were 0.15 and 0.1 respectively, the initial rate would only be 0.3, even though the overall diversification rate () is the same in both cases, 0.05. it can be seen that the initial rate of diversification in the push of the past can be much greater than the background rate; in the first case here, 24 times higher. Such high rates have often been observed at the origin of major groups such as the animals and angiosperms. It is possible that such striking diversifications are thus simply an effect of survivorship bias, and that if overall rates could be measured at their time of origin (including those of groups that quickly went extinct) no unusual rates would be observed. Consideration of the null hypothesis of survivorship bias is thus important when assigning causes to apparent cases of early rapid diversification,

Crown group origins

The effect of the push of the past appears to be the reason that crown groups tend to emerge early within the history of a group as a whole: groups that diversify readily tend to create early new lineages.

Mass extinctions and the push of the past

The push of the past is an expected effect whenever a small group is diversifying and its future survival is known to have occurred. It should thus also be seen in groups that were heavily affected by mass extinctions and went on to rediversify. [2]

Related Research Articles

Queueing theory Mathematical study of waiting lines, or queues

Queueing theory is the mathematical study of waiting lines, or queues. A queueing model is constructed so that queue lengths and waiting time can be predicted. Queueing theory is generally considered a branch of operations research because the results are often used when making business decisions about the resources needed to provide a service.

The laser diode rate equations model the electrical and optical performance of a laser diode. This system of ordinary differential equations relates the number or density of photons and charge carriers (electrons) in the device to the injection current and to device and material parameters such as carrier lifetime, photon lifetime, and the optical gain.

The birth–death process is a special case of continuous-time Markov process where the state transitions are of only two types: "births", which increase the state variable by one and "deaths", which decrease the state by one. The model's name comes from a common application, the use of such models to represent the current size of a population where the transitions are literal births and deaths. Birth–death processes have many applications in demography, queueing theory, performance engineering, epidemiology, biology and other areas. They may be used, for example, to study the evolution of bacteria, the number of people with a disease within a population, or the number of customers in line at the supermarket.

In mathematics and probability theory, a gamma process, also known as (Moran-)Gamma subordinator, is a random process with independent gamma distributed increments. Often written as , it is a pure-jump increasing Lévy process with intensity measure for positive . Thus jumps whose size lies in the interval occur as a Poisson process with intensity The parameter controls the rate of jump arrivals and the scaling parameter inversely controls the jump size. It is assumed that the process starts from a value 0 at t = 0.

Inverse Gaussian distribution Family of continuous probability distributions

In probability theory, the inverse Gaussian distribution is a two-parameter family of continuous probability distributions with support on (0,∞).

In probability theory relating to stochastic processes, a Feller process is a particular kind of Markov process.

The maximal ergodic theorem is a theorem in ergodic theory, a discipline within mathematics.

The linear attenuation coefficient, attenuation coefficient, or narrow-beam attenuation coefficient characterizes how easily a volume of material can be penetrated by a beam of light, sound, particles, or other energy or matter. A coefficient value that is large represents a beam becoming 'attenuated' as it passes through a given medium, while a small value represents that the medium had little effect on loss. The SI unit of attenuation coefficient is the reciprocal metre (m−1). Extinction coefficient is another term for this quantity, often used in meteorology and climatology. Most commonly, the quantity measures the exponential decay of intensity, that is, the value of downward e-folding distance of the original intensity as the energy of the intensity passes through a unit thickness of material, so that an attenuation coefficient of 1 m−1 means that after passing through 1 metre, the radiation will be reduced by a factor of e, and for material with a coefficient of 2 m−1, it will be reduced twice by e, or e2. Other measures may use a different factor than e, such as the decadic attenuation coefficient below. The broad-beam attenuation coefficient counts forward-scattered radiation as transmitted rather than attenuated, and is more applicable to radiation shielding.

In queueing theory, a discipline within the mathematical theory of probability, the Pollaczek–Khinchine formula states a relationship between the queue length and service time distribution Laplace transforms for an M/G/1 queue. The term is also used to refer to the relationships between the mean queue length and mean waiting/service time in such a model.

M/M/1 queue Queue with Markov (Poisson) arrival process, exponential service time distribution and one server

In queueing theory, a discipline within the mathematical theory of probability, an M/M/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times have an exponential distribution. The model name is written in Kendall's notation. The model is the most elementary of queueing models and an attractive object of study as closed-form expressions can be obtained for many metrics of interest in this model. An extension of this model with more than one server is the M/M/c queue.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/c queue is a multi-server queueing model. In Kendall's notation it describes a system where arrivals form a single queue and are governed by a Poisson process, there are c servers, and job service times are exponentially distributed. It is a generalisation of the M/M/1 queue which considers only a single server. The model with infinitely many servers is the M/M/∞ queue.

In probability theory, a Laplace functional refers to one of two possible mathematical functions of functions or, more precisely, functionals that serve as mathematical tools for studying either point processes or concentration of measure properties of metric spaces. One type of Laplace functional, also known as a characteristic functional is defined in relation to a point process, which can be interpreted as random counting measures, and has applications in characterizing and deriving results on point processes. Its definition is analogous to a characteristic function for a random variable.

In queueing theory, a discipline within the mathematical theory of probability, the backpressure routing algorithm is a method for directing traffic around a queueing network that achieves maximum network throughput, which is established using concepts of Lyapunov drift. Backpressure routing considers the situation where each job can visit multiple service nodes in the network. It is an extension of max-weight scheduling where each job visits only a single service node.

In queueing theory, a discipline within the mathematical theory of probability, a fluid queue is a mathematical model used to describe the fluid level in a reservoir subject to randomly determined periods of filling and emptying. The term dam theory was used in earlier literature for these models. The model has been used to approximate discrete models, model the spread of wildfires, in ruin theory and to model high speed data networks. The model applies the leaky bucket algorithm to a stochastic source.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/∞ queue is a multi-server queueing model where every arrival experiences immediate service and does not wait. In Kendall's notation it describes a system where arrivals are governed by a Poisson process, there are infinitely many servers, so jobs do not need to wait for a server. Each job has an exponentially distributed service time. It is a limit of the M/M/c queue model where the number of servers c becomes very large.

In queueing theory, a discipline within the mathematical theory of probability, an M/D/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times are fixed (deterministic). The model name is written in Kendall's notation. Agner Krarup Erlang first published on this model in 1909, starting the subject of queueing theory. An extension of this model with more than one server is the M/D/c queue.

In probability theory, a transition rate matrix is an array of numbers describing the instantaneous rate at which a continuous time Markov chain transitions between states.

Biogeography-based optimization (BBO) is an evolutionary algorithm (EA) that optimizes a function by stochastically and iteratively improving candidate solutions with regard to a given measure of quality, or fitness function. BBO belongs to the class of metaheuristics since it includes many variations, and since it does not make any assumptions about the problem and can therefore be applied to a wide class of problems.

Diversification rates are the rates at which new species form and living species go extinct. Diversification rates can be estimated from fossils, data on the species diversity of clades and their ages, or phylogenetic trees. Diversification rates are typically reported on a per-lineage basis, and refer to the diversification dynamics expected under a birth–death process.

A mixed Poisson distribution is a univariate discrete probability distribution in stochastics. It results from assuming that a random variable is Poisson distributed, where the rate parameter itself is considered as a random variable. Hence it is a special case of a compound probability distribution. Mixed Poisson distributions can be found in actuarial mathematics as a general approach for the distribution of the number of claims and is also examined as an epidemiological model. It should not be confused with compound Poisson distribution or compound Poisson process.

References

  1. 1 2 Nee, S; May, R. M.; Harvey, P. H. (1994). "The reconstructed evolutionary process". Phil. Trans. R. Soc. B. 344 (1309): 305–311. doi:10.1098/rstb.1994.0068. PMID   7938201.
  2. 1 2 3 4 5 6 Budd, G. E.; Mann, R. P. (2018). "History is written by the victors: the effect of the push of the past on the fossil record". Evolution. 72 (11): 2276–2291. doi:10.1111/evo.13593. PMC   6282550 . PMID   30257040.
  3. Nee, Sean (2006). "Birth-Death models in Macroevolution" (PDF). Annu. Rev. Ecol. Evol. Syst. 37: 1–17. doi:10.1146/annurev.ecolsys.37.091305.110035 . Retrieved 27 October 2018.
  4. Stadler, T; Steel, M (2012). "Distribution of branch lengths and phylogenetic diversity under homogeneous speciation models". J. Theor. Biol. 297: 33–40. arXiv: 1107.5926 . Bibcode:2012JThBi.297...33S. doi:10.1016/j.jtbi.2011.11.019. PMID   22142624. S2CID   17461488.
  5. Rabosky, D. L. (2013). "Diversity-dependence, ecological speciation, and the role of competition in macroevolution". Annu. Rev. Ecol. Evol. Syst. 44: 481–502. doi:10.1146/annurev-ecolsys-110512-135800.
  6. Etienne, R. S.; Rosindell, J. (2012). "Prolonging the past counteracts the pull of the present: protracted speciation can explain observed slowdowns in diversification". Syst. Biol. 61 (2): 204–213. doi: 10.1093/sysbio/syr091 . PMC   3280041 . PMID   21873376.