Moran process

Last updated March 26, 2024

A Moran process or Moran model is a simple stochastic process used in biology to describe finite populations. The process is named after Patrick Moran, who first proposed the model in 1958.^[1] It can be used to model variety-increasing processes such as mutation as well as variety-reducing effects such as genetic drift and natural selection. The process can describe the probabilistic dynamics in a finite population of constant size N in which two alleles A and B are competing for dominance. The two alleles are considered to be true replicators (i.e. entities that make copies of themselves).

In each time step a random individual (which is of either type A or B) is chosen for reproduction and a random individual is chosen for death; thus ensuring that the population size remains constant. To model selection, one type has to have a higher fitness and is thus more likely to be chosen for reproduction. The same individual can be chosen for death and for reproduction in the same step.
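The update rule described above can be sketched as a short simulation. This is a minimal illustration rather than a reference implementation: the function names `moran_step` and `run_to_absorption` are hypothetical, and selection is assumed to enter through a single constant relative fitness `r` of type A, with type B fixed at fitness 1.

```python
import random


def moran_step(i, N, r, rng):
    """Advance the number of A individuals, i, by one time step.

    r is the (assumed constant) fitness of A relative to B; r = 1 is neutral.
    """
    # Reproduction: pick an individual with probability proportional to fitness.
    birth_is_a = rng.random() < (r * i) / (r * i + (N - i))
    # Death: pick an individual uniformly at random (may be the same one).
    death_is_a = rng.random() < i / N
    return i + int(birth_is_a) - int(death_is_a)


def run_to_absorption(i0, N, r=1.0, seed=0):
    """Iterate until A is lost (state 0) or fixed (state N)."""
    rng = random.Random(seed)
    i, steps = i0, 0
    while 0 < i < N:
        i = moran_step(i, N, r, rng)
        steps += 1
    return i, steps


if __name__ == "__main__":
    final_state, steps = run_to_absorption(i0=5, N=10)
    print(final_state, steps)
```

Only the count of A individuals is tracked, which is sufficient because individuals of the same type are interchangeable in this model.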

Neutral drift

Neutral drift is the idea that a neutral mutation can spread throughout a population, so that eventually the original allele is lost. A neutral mutation does not bring any fitness advantage or disadvantage to its bearer. The simple case of the Moran process can describe this phenomenon.

The Moran process is defined on the state space $i = 0, ..., N$, which counts the number of A individuals. Since the number of A individuals can change by at most one at each time step, a transition exists only between state $i$ and the states $i - 1$, $i$ and $i + 1$. Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are

{\begin{aligned}P_{i,i-1}&={\frac {N-i}{N}}{\frac {i}{N}}\\P_{i,i}&=1-P_{i,i-1}-P_{i,i+1}\\P_{i,i+1}&={\frac {i}{N}}{\frac {N-i}{N}}\\\end{aligned}}

The entry $P_{i,j}$ denotes the probability of going from state i to state j. To understand the formulas for the transition probabilities, recall that by definition one individual is always chosen for reproduction and one is chosen for death. Once the A individuals have died out, they will never be reintroduced into the population, since the process does not model mutations, and thus $P_{0,0}=1$. For the same reason, the number of A individuals will always stay N once they have taken over the population, and thus $P_{N,N}=1$. The states 0 and N are called absorbing, while the states $1, ..., N - 1$ are called transient. The intermediate transition probabilities can be explained by considering the first factor to be the probability of choosing for reproduction the type whose abundance will increase by one, and the second factor the probability of choosing the other type for death. If the same type is chosen for reproduction and for death, the abundance of each type does not change.
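As an illustration, the neutral transition matrix can be built explicitly and checked for the properties just described. The sketch below uses exact rational arithmetic; the function names are hypothetical and chosen only for this example.

```python
from fractions import Fraction


def neutral_transition_row(i, N):
    """Return (P[i][i-1], P[i][i], P[i][i+1]) for the neutral Moran process."""
    p = Fraction(i, N)
    down = (1 - p) * p   # B reproduces, A dies
    up = p * (1 - p)     # A reproduces, B dies
    stay = 1 - down - up
    return down, stay, up


def neutral_transition_matrix(N):
    """Build the full (N+1) x (N+1) tri-diagonal matrix as nested lists."""
    P = [[Fraction(0)] * (N + 1) for _ in range(N + 1)]
    for i in range(N + 1):
        down, stay, up = neutral_transition_row(i, N)
        if i > 0:
            P[i][i - 1] = down
        P[i][i] = stay
        if i < N:
            P[i][i + 1] = up
    return P


if __name__ == "__main__":
    for row in neutral_transition_matrix(4):
        print([str(x) for x in row])
```

Every row sums to one, and the first and last rows place all probability on the diagonal, which reflects the absorbing states 0 and N.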

Eventually the population will reach one of the absorbing states and then stay there forever. In the transient states, random fluctuations will occur but eventually the population of A will either go extinct or reach fixation. This is one of the most important differences to deterministic processes which cannot model random events. The expected value and the variance of the number of A individuals $X (t)$ at timepoint t can be computed when an initial state $X (0) = i$ is given:

{\begin{aligned}\operatorname {E} [X(t)\mid X(0)=i]&=i\\\operatorname {Var} (X(t)\mid X(0)=i)&={\tfrac {2i}{N}}\left(1-{\tfrac {i}{N}}\right){\frac {1-\left(1-{\frac {2}{N^{2}}}\right)^{t}}{\frac {2}{N^{2}}}}\end{aligned}}
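These two expressions can be checked numerically by propagating the exact probability distribution of $X(t)$ through the neutral transition probabilities. The following is a sketch under that approach, with hypothetical function names; it is a sanity check, not a derivation.

```python
def neutral_moments(N, i0, t):
    """Exact mean and variance of X(t) given X(0) = i0, by iterating the chain."""
    dist = [0.0] * (N + 1)
    dist[i0] = 1.0
    for _ in range(t):
        new = [0.0] * (N + 1)
        for i, mass in enumerate(dist):
            if mass == 0.0:
                continue
            q = (i / N) * (1 - i / N)  # P[i][i-1] = P[i][i+1]
            new[i] += mass * (1 - 2 * q)
            if q > 0.0:
                new[i - 1] += mass * q
                new[i + 1] += mass * q
        dist = new
    mean = sum(i * m for i, m in enumerate(dist))
    var = sum((i - mean) ** 2 * m for i, m in enumerate(dist))
    return mean, var


def variance_closed_form(N, i0, t):
    """Closed-form variance from the text."""
    c = 2 / N**2
    return (2 * i0 / N) * (1 - i0 / N) * (1 - (1 - c) ** t) / c


if __name__ == "__main__":
    print(neutral_moments(10, 3, 50), variance_closed_form(10, 3, 50))
```

The mean stays at the initial value while the variance grows towards its limiting value as probability mass accumulates in the two absorbing states.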


For the expected value the calculation runs as follows. Writing $p = i/N,$

{\begin{aligned}\operatorname {E} [X(t)\mid X(t-1)=i]&=(i-1)P_{i,i-1}+iP_{i,i}+(i+1)P_{i,i+1}\\&=2ip(1-p)+i(p^{2}+(1-p)^{2})\\&=i.\end{aligned}}

Writing $Y=X(t)$ and $Z=X(t-1)$ , and applying the law of total expectation, $\operatorname {E} [Y]=\operatorname {E} [\operatorname {E} [Y\mid Z]]=\operatorname {E} [Z].$ Applying the argument repeatedly gives $\operatorname {E} [X(t)]=\operatorname {E} [X(0)],$ or $\operatorname {E} [X(t)\mid X(0)=i]=i.$

For the variance the calculation runs as follows. Writing $V_{t}=\operatorname {Var} (X(t)\mid X(0)=i),$ we have

{\begin{aligned}V_{1}&=E\left[X(1)^{2}\mid X(0)=i\right]-\operatorname {E} [X(1)\mid X(0)=i]^{2}\\&=(i-1)^{2}p(1-p)+i^{2}\left(p^{2}+(1-p)^{2}\right)+(i+1)^{2}p(1-p)-i^{2}\\&=2p(1-p)\end{aligned}}

For all $t$ , $(X(t)\mid X(t-1)=i)$ and $(X(1)\mid X(0)=i)$ are identically distributed, so their variances are equal. Writing as before $Y=X(t)$ and $Z=X(t-1)$ , and applying the law of total variance,

{\begin{aligned}\operatorname {Var} (Y)&=\operatorname {E} [\operatorname {Var} (Y\mid Z)]+\operatorname {Var} (\operatorname {E} [Y\mid Z])\\&=E\left[\left({\frac {2Z}{N}}\right)\left(1-{\frac {Z}{N}}\right)\right]+\operatorname {Var} (Z)\\&=\left({\frac {2\operatorname {E} [Z]}{N}}\right)\left(1-{\frac {\operatorname {E} [Z]}{N}}\right)+\left(1-{\frac {2}{N^{2}}}\right)\operatorname {Var} (Z).\end{aligned}}

If $X(0)=i$ , we obtain

V_{t}=V_{1}+\left(1-{\frac {2}{N^{2}}}\right)V_{t-1}.

Rewriting this equation as

V_{t}-{\frac {V_{1}}{\frac {2}{N^{2}}}}=\left(1-{\frac {2}{N^{2}}}\right)\left(V_{t-1}-{\frac {V_{1}}{\frac {2}{N^{2}}}}\right)=\left(1-{\frac {2}{N^{2}}}\right)^{t-1}\left(V_{1}-{\frac {V_{1}}{\frac {2}{N^{2}}}}\right)

yields

V_{t}=V_{1}{\frac {1-\left(1-{\frac {2}{N^{2}}}\right)^{t}}{\frac {2}{N^{2}}}}

as desired.

The probability that A reaches fixation is called the fixation probability. For the simple Moran process this probability is $x_{i} = i/N$.

Since all individuals have the same fitness, they also have the same chance of becoming the ancestor of the whole population; this probability is $1/N$, and thus the sum over all i individuals of type A is just $i/N$. The mean time to absorption starting in state i is given by

k_{i}=N\left[\sum _{j=1}^{i}{\frac {N-i}{N-j}}+\sum _{j=i+1}^{N-1}{\frac {i}{j}}\right]
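The closed form can be checked against the first-step equation $k_{i}=1+\sum _{j}P_{i,j}k_{j}$ with $k_{0}=k_{N}=0$, which any mean absorption time must satisfy. The sketch below does this in exact arithmetic; the function names are hypothetical, and the absorbing states are handled separately because the formula is only meaningful for transient states.

```python
from fractions import Fraction


def mean_absorption_time(i, N):
    """Closed-form k_i for the neutral Moran process (0 <= i <= N)."""
    if i == 0 or i == N:
        return Fraction(0)  # already absorbed
    first = sum(Fraction(N - i, N - j) for j in range(1, i + 1))
    second = sum(Fraction(i, j) for j in range(i + 1, N))
    return N * (first + second)


def satisfies_first_step_equation(N):
    """Check k_i = 1 + sum_j P[i][j] * k_j for every transient state i."""
    k = [mean_absorption_time(i, N) for i in range(N + 1)]
    for i in range(1, N):
        q = Fraction(i, N) * Fraction(N - i, N)  # P[i][i-1] = P[i][i+1]
        if k[i] != 1 + q * k[i - 1] + (1 - 2 * q) * k[i] + q * k[i + 1]:
            return False
    return True


if __name__ == "__main__":
    print([str(mean_absorption_time(i, 4)) for i in range(5)])
    print(satisfies_first_step_equation(4))
```

For example, with N = 4 the mean absorption time from state 2 is 28/3 time steps.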


The mean time spent in state j when starting in state i is given by

k_{i}^{j}=\delta _{ij}+P_{i,i-1}k_{i-1}^{j}+P_{i,i}k_{i}^{j}+P_{i,i+1}k_{i+1}^{j}

Here $\delta _{ij}$ denotes the Kronecker delta. This recursive equation can be solved by introducing a new variable $q_{i}$ such that $P_{i,i-1}=P_{i,i+1}=q_{i}$, and thus $P_{i,i}=1-2q_{i}$, and rewriting it as

k_{i+1}^{j}=2k_{i}^{j}-k_{i-1}^{j}-{\frac {\delta _{ij}}{q_{i}}}

The variable $y_{i}^{j}=k_{i}^{j}-k_{i-1}^{j}$ is used and the equation becomes

{\begin{aligned}y_{i+1}^{j}&=y_{i}^{j}-{\frac {\delta _{ij}}{q_{i}}}\\\\\sum _{i=1}^{m}y_{i}^{j}&=(k_{1}^{j}-k_{0}^{j})+(k_{2}^{j}-k_{1}^{j})+\cdots +(k_{m-1}^{j}-k_{m-2}^{j})+(k_{m}^{j}-k_{m-1}^{j})\\&=k_{m}^{j}-k_{0}^{j}\\\sum _{i=1}^{m}y_{i}^{j}&=k_{m}^{j}\\\\y_{1}^{j}&=(k_{1}^{j}-k_{0}^{j})=k_{1}^{j}\\y_{2}^{j}&=y_{1}^{j}-{\frac {\delta _{1j}}{q_{1}}}=k_{1}^{j}-{\frac {\delta _{1j}}{q_{1}}}\\y_{3}^{j}&=k_{1}^{j}-{\frac {\delta _{1j}}{q_{1}}}-{\frac {\delta _{2j}}{q_{2}}}\\&\vdots \\y_{i}^{j}&=k_{1}^{j}-\sum _{r=1}^{i-1}{\frac {\delta _{rj}}{q_{r}}}={\begin{cases}k_{1}^{j}&j\geq i\\k_{1}^{j}-{\frac {1}{q_{j}}}&j<i\end{cases}}\\\\k_{i}^{j}&=\sum _{m=1}^{i}y_{m}^{j}={\begin{cases}i\cdot k_{1}^{j}&j\geq i\\i\cdot k_{1}^{j}-{\frac {i-j}{q_{j}}}&j\leq i\end{cases}}\end{aligned}}

Knowing that $k_{N}^{j}=0$ and

q_{j}=P_{j,j+1}={\frac {j}{N}}{\frac {N-j}{N}}

we can calculate $k_{1}^{j}$ :

{\begin{aligned}k_{N}^{j}=\sum _{i=1}^{N}y_{i}^{j}=N\cdot k_{1}^{j}&-{\frac {N-j}{q_{j}}}=0\\k_{1}^{j}&={\frac {N}{j}}\end{aligned}}

Therefore

k_{i}^{j}={\begin{cases}{\frac {i}{j}}\cdot k_{j}^{j}&j\geq i\\{\frac {N-i}{N-j}}\cdot k_{j}^{j}&j\leq i\end{cases}}

with $k_{j}^{j}=N$. Now $k_{i}$, the mean time until absorption starting from state i, can be calculated:

{\begin{aligned}k_{i}=\sum _{j=1}^{N-1}k_{i}^{j}&=\sum _{j=1}^{i}k_{i}^{j}+\sum _{j=i+1}^{N-1}k_{i}^{j}\\&=\sum _{j=1}^{i}N{\frac {N-i}{N-j}}+\sum _{j=i+1}^{N-1}N{\frac {i}{j}}\end{aligned}}

For large N the approximation

k_{i}\approx -N^{2}\left[(1-x_{i})\ln(1-x_{i})+x_{i}\ln(x_{i})\right]

holds, where $x_{i}=i/N$.
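The quality of this approximation can be examined by comparing it with the exact sum for increasing N. The sketch below assumes that $x_{i}$ in the approximation is the initial frequency $i/N$ (which equals the neutral fixation probability); the function names are hypothetical.

```python
import math


def mean_absorption_time(i, N):
    """Exact k_i from the closed-form sum (floating point), 0 < i < N."""
    first = sum((N - i) / (N - j) for j in range(1, i + 1))
    second = sum(i / j for j in range(i + 1, N))
    return N * (first + second)


def large_n_approximation(i, N):
    """-N^2 [(1 - x) ln(1 - x) + x ln x] with x = i / N (assumed meaning of x_i)."""
    x = i / N
    return -N**2 * ((1 - x) * math.log(1 - x) + x * math.log(x))


if __name__ == "__main__":
    for N in (10, 100, 1000):
        i = N // 2
        exact = mean_absorption_time(i, N)
        approx = large_n_approximation(i, N)
        print(N, round(exact, 1), round(approx, 1), round(exact / approx, 4))
```

The approximation replaces the two harmonic-type sums by logarithms, so the ratio of exact to approximate values should approach one as N grows.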

Selection

If one allele has a fitness advantage over the other allele, it will be more likely to be chosen for reproduction. This can be incorporated into the model if individuals with allele A have fitness $f_{i}>0$ and individuals with allele B have fitness $g_{i}>0$ where $i$ is the number of individuals of type A; thus describing a general birth-death process. The transition matrix of the stochastic process is tri-diagonal in shape. Let $r_{i}:=f_{i}/g_{i}$ , then the transition probabilities are

{\begin{aligned}P_{i,i-1}&={\frac {g_{i}\cdot (N-i)}{f_{i}\cdot i+g_{i}\cdot (N-i)}}\cdot {\frac {i}{N}}={\frac {1}{r_{i}\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}}\cdot {\frac {N-i}{N}}\cdot {\frac {i}{N}}\\P_{i,i}&=1-P_{i,i-1}-P_{i,i+1}\\P_{i,i+1}&={\frac {f_{i}\cdot i}{f_{i}\cdot i+g_{i}\cdot (N-i)}}\cdot {\frac {N-i}{N}}={\frac {r_{i}}{r_{i}\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}}\cdot {\frac {i}{N}}\cdot {\frac {N-i}{N}}\\\end{aligned}}

The entry $P_{i,j}$ denotes the probability of going from state i to state j. The difference from the neutral case above is that, when the number of individuals with allele A is exactly i, the probability $i/N$ that an individual with allele A is chosen for reproduction is multiplied by the factor

{\frac {f_{i}/g_{i}}{{\frac {f_{i}}{g_{i}}}\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}},

and the probability $(N-i)/N$ that an individual with allele B is chosen for reproduction is multiplied by the factor

{\frac {1}{{\frac {f_{i}}{g_{i}}}\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}}.

In this case too, the fixation probability $x_{i}$ when starting in state i is defined by a recurrence, where $\alpha _{i}=P_{i,i+1}$ and $\beta _{i}=P_{i,i-1}$:

x_{i}={\begin{cases}0&i=0\\\beta _{i}x_{i-1}+(1-\alpha _{i}-\beta _{i})x_{i}+\alpha _{i}x_{i+1}&1\leq i\leq N-1\\1&i=N\end{cases}}

And the closed form is given by

x_{i}={\frac {\displaystyle 1+\sum _{j=1}^{i-1}\prod _{k=1}^{j}\gamma _{k}}{\displaystyle 1+\sum _{j=1}^{N-1}\prod _{k=1}^{j}\gamma _{k}}}\qquad {\text{(1)}}

where $\gamma _{i}=P_{i,i-1}/P_{i,i+1}$ by definition, which is just $g_{i}/f_{i}$ in the general case.
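Equation (1) translates directly into a running sum of running products. The following sketch evaluates it in exact arithmetic for an arbitrary sequence of ratios $\gamma _{1},\ldots ,\gamma _{N-1}$; the function name and the list-based interface are assumptions made for the example.

```python
from fractions import Fraction


def fixation_probabilities(gammas):
    """Return [x_0, ..., x_N] from equation (1).

    gammas[k - 1] holds gamma_k = P[k][k-1] / P[k][k+1] for k = 1, ..., N - 1.
    """
    N = len(gammas) + 1
    # partial[j] = 1 + sum_{m=1}^{j} prod_{k=1}^{m} gamma_k
    partial = [Fraction(1)]
    product = Fraction(1)
    for g in gammas:
        product *= g
        partial.append(partial[-1] + product)
    denominator = partial[N - 1]
    # x_0 = 0; x_i uses the sum up to j = i - 1.
    return [Fraction(0)] + [partial[i - 1] / denominator for i in range(1, N + 1)]


if __name__ == "__main__":
    # Neutral case: every gamma equals 1, so x_i = i / N.
    print([str(x) for x in fixation_probabilities([Fraction(1)] * 4)])
```

With all ratios equal to one the function reproduces the neutral result $x_{i}=i/N$, which is a convenient consistency check.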


Also in this case, fixation probabilities can be computed, but the transition probabilities are not symmetric. The notation $P_{i,i+1}=\alpha _{i},P_{i,i-1}=\beta _{i},P_{i,i}=1-\alpha _{i}-\beta _{i}$ and $\gamma _{i}=\beta _{i}/\alpha _{i}$ is used. The fixation probability can be defined recursively and a new variable $y_{i}=x_{i}-x_{i-1}$ is introduced.

{\begin{aligned}x_{i}&=\beta _{i}x_{i-1}+(1-\alpha _{i}-\beta _{i})x_{i}+\alpha _{i}x_{i+1}\\\beta _{i}(x_{i}-x_{i-1})&=\alpha _{i}(x_{i+1}-x_{i})\\\gamma _{i}\cdot y_{i}&=y_{i+1}\end{aligned}}

Now two properties from the definition of the variable $y_{i}$ can be used to find a closed form solution for the fixation probabilities:

{\begin{aligned}\sum _{i=1}^{m}y_{i}&=x_{m}&&1\\y_{k}&=x_{1}\cdot \prod _{l=1}^{k-1}\gamma _{l}&&2\\\Rightarrow \sum _{m=1}^{i}y_{m}&=x_{1}+x_{1}\sum _{j=1}^{i-1}\prod _{k=1}^{j}\gamma _{k}=x_{i}&&3\end{aligned}}

Combining (3) and $x_{N}=1$ :

x_{1}\left(1+\sum _{j=1}^{N-1}\prod _{k=1}^{j}\gamma _{k}\right)=x_{N}=1.

which implies:

x_{1}={\frac {1}{1+\sum _{j=1}^{N-1}\prod _{k=1}^{j}\gamma _{k}}}

This in turn gives us:

x_{i}={\frac {\displaystyle 1+\sum _{j=1}^{i-1}\prod _{k=1}^{j}\gamma _{k}}{\displaystyle 1+\sum _{j=1}^{N-1}\prod _{k=1}^{j}\gamma _{k}}}

This general case where the fitness of A and B depends on the abundance of each type is studied in evolutionary game theory.

Less complex results are obtained if a constant fitness ratio $r=1/\gamma _{i}$ , for all i, is assumed. Individuals of type A reproduce with a constant rate r and individuals with allele B reproduce with rate 1. Thus if A has a fitness advantage over B, r will be larger than one, otherwise it will be smaller than one. Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are

{\begin{aligned}P_{0,0}&=1\\P_{i,i-1}&={\frac {N-i}{r\cdot i+N-i}}\cdot {\frac {i}{N}}={\frac {1}{r\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}}\cdot {\frac {N-i}{N}}\cdot {\frac {i}{N}}\\P_{i,i}&=1-P_{i,i-1}-P_{i,i+1}\\P_{i,i+1}&={\frac {r\cdot i}{r\cdot i+N-i}}\cdot {\frac {N-i}{N}}={\frac {r}{r\cdot {\frac {i}{N}}+{\frac {N-i}{N}}}}\cdot {\frac {i}{N}}\cdot {\frac {N-i}{N}}\\P_{N,N}&=1.\end{aligned}}

In this case $\gamma _{i}=1/r$ is a constant factor for each composition of the population and thus the fixation probability from equation (1) simplifies to

x_{i}={\frac {1-r^{-i}}{1-r^{-N}}}\quad \Rightarrow \quad x_{1}=\rho ={\frac {1-r^{-1}}{1-r^{-N}}}\qquad {\text{(2)}}

where the fixation probability of a single mutant A in a population of otherwise all B is often of interest and is denoted by $\rho$ .
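Equation (2) can be verified against the recurrence for $x_{i}$ by substituting the constant-fitness transition probabilities. The sketch below does so in exact arithmetic for $r\neq 1$; the function names are hypothetical.

```python
from fractions import Fraction


def fixation_probability(i, N, r):
    """Equation (2): x_i = (1 - r**-i) / (1 - r**-N), for r != 1."""
    r = Fraction(r)
    return (1 - r**-i) / (1 - r**-N)


def transition_up_down(i, N, r):
    """Return (P[i][i+1], P[i][i-1]) for constant relative fitness r."""
    r = Fraction(r)
    total = r * i + (N - i)
    up = (r * i / total) * Fraction(N - i, N)
    down = (Fraction(N - i) / total) * Fraction(i, N)
    return up, down


def recurrence_holds(N, r):
    """Check beta_i (x_i - x_{i-1}) == alpha_i (x_{i+1} - x_i) for 1 <= i < N."""
    x = [fixation_probability(i, N, r) for i in range(N + 1)]
    for i in range(1, N):
        alpha, beta = transition_up_down(i, N, r)
        if beta * (x[i] - x[i - 1]) != alpha * (x[i + 1] - x[i]):
            return False
    return True


if __name__ == "__main__":
    print(fixation_probability(1, 3, 2), recurrence_holds(8, Fraction(3, 2)))
```

The check uses the rearranged form of the recurrence, which avoids computing $P_{i,i}$ explicitly.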

Also in the case of selection, the expected value and the variance of the number of A individuals may be computed

{\begin{aligned}\operatorname {E} [X(t)\mid X(t-1)=i]&=ps{\dfrac {1-p}{ps+1}}+i\\\operatorname {Var} (X(t+1)\mid X(t)=i)&=p(1-p){\dfrac {(s+1)+(ps+1)^{2}}{(ps+1)^{2}}}\end{aligned}}

where $p = i / N,$ and $r = 1 + s$ .
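Both one-step expressions can be cross-checked by computing the moments directly from $P_{i,i-1}$ and $P_{i,i+1}$. The following sketch, with hypothetical function names, compares the two routes in exact arithmetic and assumes $s>-1$ so that the fitness $r=1+s$ is positive.

```python
from fractions import Fraction


def one_step_moments(i, N, s):
    """Mean and variance of X(t+1) given X(t) = i, from the transition probabilities."""
    r = 1 + Fraction(s)
    total = r * i + (N - i)
    up = (r * i / total) * Fraction(N - i, N)          # P[i][i+1]
    down = (Fraction(N - i) / total) * Fraction(i, N)  # P[i][i-1]
    mean_change = up - down
    variance = up + down - mean_change**2
    return i + mean_change, variance


def one_step_moments_closed_form(i, N, s):
    """The same two quantities from the closed-form expressions in the text."""
    s = Fraction(s)
    p = Fraction(i, N)
    mean = p * s * (1 - p) / (p * s + 1) + i
    variance = p * (1 - p) * ((s + 1) + (p * s + 1) ** 2) / (p * s + 1) ** 2
    return mean, variance


if __name__ == "__main__":
    print(one_step_moments(3, 10, Fraction(1, 2)))
    print(one_step_moments_closed_form(3, 10, Fraction(1, 2)))
```

Setting $s=0$ recovers the neutral one-step variance $2p(1-p)$ and leaves the mean unchanged.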


For the expected value the calculation runs as follows

{\begin{aligned}\operatorname {E} [\Delta (1)\mid X(0)=i]&=(i-1-i)\cdot P_{i,i-1}+(i-i)\cdot P_{i,i}+(i+1-i)\cdot P_{i,i+1}\\&=-{\frac {N-i}{ri+N-i}}{\frac {i}{N}}+{\frac {ri}{ri+N-i}}{\frac {N-i}{N}}\\&=-{\frac {(N-i)i}{(ri+N-i)N}}+{\frac {i(N-i)}{(ri+N-i)N}}+{\frac {si(N-i)}{(ri+N-i)N}}\\&=ps{\dfrac {1-p}{ps+1}}\\\operatorname {E} [X(t)\mid X(t-1)=i]&=ps{\dfrac {1-p}{ps+1}}+i\end{aligned}}

For the variance the calculation runs as follows, using the variance of a single step

{\begin{aligned}\operatorname {Var} (X(t+1)\mid X(t)=i)&=\operatorname {Var} (X(t)\mid X(t)=i)+\operatorname {Var} (\Delta (t+1)\mid X(t)=i)\\&=0+\operatorname {E} \left[\Delta (t+1)^{2}\mid X(t)=i\right]-\operatorname {E} [\Delta (t+1)\mid X(t)=i]^{2}\\&=(i-1-i)^{2}\cdot P_{i,i-1}+(i-i)^{2}\cdot P_{i,i}+(i+1-i)^{2}\cdot P_{i,i+1}-\operatorname {E} [\Delta (t+1)\mid X(t)=i]^{2}\\&=P_{i,i-1}+P_{i,i+1}-\operatorname {E} [\Delta (t+1)\mid X(t)=i]^{2}\\&={\frac {(N-i)i}{(ri+N-i)N}}+{\frac {(N-i)i(1+s)}{(ri+N-i)N}}-\operatorname {E} [\Delta (t+1)\mid X(t)=i]^{2}\\&=i(N-i){\frac {2+s}{(ri+N-i)N}}-\operatorname {E} [\Delta (t+1)\mid X(t)=i]^{2}\\&=i(N-i){\frac {2+s}{(ri+N-i)N}}-\left(ps{\dfrac {1-p}{ps+1}}\right)^{2}\\&=p(1-p){\frac {(2+s)(ps+1)}{(ps+1)^{2}}}-p(1-p){\frac {ps^{2}(1-p)}{(ps+1)^{2}}}\\&=p(1-p){\dfrac {2+2ps+s+p^{2}s^{2}}{(ps+1)^{2}}}\end{aligned}}

Rate of evolution

In a population of all B individuals, a single mutant A will take over the whole population with the probability

\rho ={\frac {1-r^{-1}}{1-r^{-N}}}.\qquad {\text{(2)}}

If the mutation rate (to go from the B to the A allele) in the population is u then the rate with which one member of the population will mutate to A is given by $N \times u$ and the rate with which the whole population goes from all B to all A is the rate that a single mutant A arises times the probability that it will take over the population (fixation probability):

R=N\cdot u\cdot \rho =u\quad {\text{if}}\quad \rho ={\frac {1}{N}}.

Thus if the mutation is neutral (i.e. the fixation probability is just 1/N) then the rate with which an allele arises and takes over a population is independent of the population size and is equal to the mutation rate. This important result is the basis of the neutral theory of evolution and suggests that the number of observed point mutations in the genomes of two different species would simply be given by the mutation rate multiplied by two times the time since divergence. Thus the neutral theory of evolution provides a molecular clock, given that the assumptions are fulfilled which may not be the case in reality.
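The independence from population size in the neutral case can be illustrated by evaluating $R=N\cdot u\cdot \rho$ for several values of N. In the sketch below the function names and the example mutation rate are assumptions made for illustration, and the neutral case $r=1$ is handled separately using the value $\rho =1/N$ stated above.

```python
from fractions import Fraction


def fixation_probability_single_mutant(N, r):
    """rho from equation (2); the neutral case r = 1 is taken as 1/N."""
    r = Fraction(r)
    if r == 1:
        return Fraction(1, N)
    return (1 - r**-1) / (1 - r**-N)


def substitution_rate(N, u, r=1):
    """R = N * u * rho."""
    return N * Fraction(u) * fixation_probability_single_mutant(N, r)


if __name__ == "__main__":
    u = Fraction(1, 10**8)  # illustrative mutation rate, not taken from the text
    for N in (10, 1000, 10**6):
        print(N, substitution_rate(N, u) == u)
```

For an advantageous mutant ($r>1$) the same function gives a rate above u, and for a deleterious one ($r<1$) a rate below u, so the cancellation of N is specific to neutrality.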


References

[1] Moran, P. A. P. (1958). "Random processes in genetics". Mathematical Proceedings of the Cambridge Philosophical Society. 54 (1): 60–71. doi:10.1017/S0305004100033193.

External links

"Evolutionary Dynamics on Graphs".

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.


