Models of DNA evolution

Last updated

A number of different Markov models of DNA sequence evolution have been proposed. [1] These substitution models differ in terms of the parameters used to describe the rates at which one nucleotide replaces another during evolution. These models are frequently used in molecular phylogenetic analyses. In particular, they are used during the calculation of likelihood of a tree (in Bayesian and maximum likelihood approaches to tree estimation) and they are used to estimate the evolutionary distance between sequences from the observed differences between the sequences.

Contents

Introduction

These models are phenomenological descriptions of the evolution of DNA as a string of four discrete states. These Markov models do not explicitly depict the mechanism of mutation nor the action of natural selection. Rather they describe the relative rates of different changes. For example, mutational biases and purifying selection favoring conservative changes are probably both responsible for the relatively high rate of transitions compared to transversions in evolving sequences. However, the Kimura (K80) model described below only attempts to capture the effect of both forces in a parameter that reflects the relative rate of transitions to transversions.

Evolutionary analyses of sequences are conducted on a wide variety of time scales. Thus, it is convenient to express these models in terms of the instantaneous rates of change between different states (the Q matrices below). If we are given a starting (ancestral) state at one position, the model's Q matrix and a branch length expressing the expected number of changes to have occurred since the ancestor, then we can derive the probability of the descendant sequence having each of the four states. The mathematical details of this transformation from rate-matrix to probability matrix are described in the mathematics of substitution models section of the substitution model page. By expressing models in terms of the instantaneous rates of change we can avoid estimating a large numbers of parameters for each branch on a phylogenetic tree (or each comparison if the analysis involves many pairwise sequence comparisons).

The models described on this page describe the evolution of a single site within a set of sequences. They are often used for analyzing the evolution of an entire locus by making the simplifying assumption that different sites evolve independently and are identically distributed. This assumption may be justifiable if the sites can be assumed to be evolving neutrally. If the primary effect of natural selection on the evolution of the sequences is to constrain some sites, then models of among-site rate-heterogeneity can be used. This approach allows one to estimate only one matrix of relative rates of substitution, and another set of parameters describing the variance in the total rate of substitution across sites.

DNA evolution as a continuous-time Markov chain

Continuous-time Markov chains

Continuous-time Markov chains have the usual transition matrices which are, in addition, parameterized by time, . Specifically, if are the states, then the transition matrix

where each individual entry, refers to the probability that state will change to state in time .

Example: We would like to model the substitution process in DNA sequences (i.e. Jukes–Cantor, Kimura, etc.) in a continuous-time fashion. The corresponding transition matrices will look like:

where the top-left and bottom-right 2 × 2 blocks correspond to transition probabilities and the top-right and bottom-left 2 × 2 blocks corresponds to transversion probabilities.

Assumption: If at some time , the Markov chain is in state , then the probability that at time , it will be in state depends only upon , and . This then allows us to write that probability as .

Theorem: Continuous-time transition matrices satisfy:

Note: There is here a possible confusion between two meanings of the word transition. (i) In the context of Markov chains, transition is the general term for the change between two states. (ii) In the context of nucleotide changes in DNA sequences, transition is a specific term for the exchange between either the two purines (A ↔ G) or the two pyrimidines (C ↔ T) (for additional details, see the article about transitions in genetics). By contrast, an exchange between one purine and one pyrimidine is called a transversion.

Deriving the dynamics of substitution

Consider a DNA sequence of fixed length m evolving in time by base replacement. Assume that the processes followed by the m sites are Markovian independent, identically distributed and that the process is constant over time. For a particular site, let

be the set of possible states for the site, and

their respective probabilities at time . For two distinct , let be the transition rate from state to state . Similarly, for any , let the total rate of change from be

The changes in the probability distribution for small increments of time are given by

In other words, (in frequentist language), the frequency of 's at time is equal to the frequency at time minus the frequency of the lost's plus the frequency of the newly created's.

Similarly for the probabilities , and . These equations can be written compactly as

where

is known as the rate matrix. Note that, by definition, the sum of the entries in each row of is equal to zero. It follows that

For a stationary process, where does not depend on time t, this differential equation can be solved. First,

where denotes the exponential of the matrix . As a result,

Ergodicity

If the Markov chain is irreducible, i.e. if it is always possible to go from a state to a state (possibly in several steps), then it is also ergodic. As a result, it has a unique stationary distribution, where corresponds to the proportion of time spent in state after the Markov chain has run for an infinite amount of time. In DNA evolution, under the assumption of a common process for each site, the stationary frequencies correspond to equilibrium base compositions. Indeed, note that since the stationary distribution satisfies , we see that when the current distribution is the stationary distribution we have

In other words, the frequencies of do not change.

Time reversibility

Definition: A stationary Markov process is time reversible if (in the steady state) the amount of change from state to is equal to the amount of change from to , (although the two states may occur with different frequencies). This means that:

Not all stationary processes are reversible, however, most commonly used DNA evolution models assume time reversibility, which is considered to be a reasonable assumption.

Under the time reversibility assumption, let , then it is easy to see that:

Definition The symmetric term is called the exchangeability between states and . In other words, is the fraction of the frequency of state that is the result of transitions from state to state .

Corollary The 12 off-diagonal entries of the rate matrix, (note the off-diagonal entries determine the diagonal entries, since the rows of sum to zero) can be completely determined by 9 numbers; these are: 6 exchangeability terms and 3 stationary frequencies , (since the stationary frequencies sum to 1).

Scaling of branch lengths

By comparing extant sequences, one can determine the amount of sequence divergence. This raw measurement of divergence provides information about the number of changes that have occurred along the path separating the sequences. The simple count of differences (the Hamming distance) between sequences will often underestimate the number of substitution because of multiple hits (see homoplasy). Trying to estimate the exact number of changes that have occurred is difficult, and usually not necessary. Instead, branch lengths (and path lengths) in phylogenetic analyses are usually expressed in the expected number of changes per site. The path length is the product of the duration of the path in time and the mean rate of substitutions. While their product can be estimated, the rate and time are not identifiable from sequence divergence.

The descriptions of rate matrices on this page accurately reflect the relative magnitude of different substitutions, but these rate matrices are not scaled such that a branch length of 1 yields one expected change. This scaling can be accomplished by multiplying every element of the matrix by the same factor, or simply by scaling the branch lengths. If we use the β to denote the scaling factor, and ν to denote the branch length measured in the expected number of substitutions per site then βν is used in the transition probability formulae below in place of μt. Note that ν is a parameter to be estimated from data, and is referred to as the branch length, while β is simply a number that can be calculated from the rate matrix (it is not a separate free parameter).

The value of β can be found by forcing the expected rate of flux of states to 1. The diagonal entries of the rate-matrix (the Q matrix) represent -1 times the rate of leaving each state. For time-reversible models, we know the equilibrium state frequencies (these are simply the πi parameter value for state i). Thus we can find the expected rate of change by calculating the sum of flux out of each state weighted by the proportion of sites that are expected to be in that class. Setting β to be the reciprocal of this sum will guarantee that scaled process has an expected flux of 1:

For example, in the Jukes–Cantor, the scaling factor would be 4/(3μ) because the rate of leaving each state is 3μ/4.

Most common models of DNA evolution

JC69 model (Jukes and Cantor 1969)

JC69, the Jukes and Cantor 1969 model, [2] is the simplest substitution model. There are several assumptions. It assumes equal base frequencies and equal mutation rates. The only parameter of this model is therefore , the overall substitution rate. As previously mentioned, this variable becomes a constant when we normalize the mean-rate to 1.

Probability
P
i
j
{\displaystyle P_{ij}}
of changing from initial state
i
{\displaystyle i}
to final state
j
{\displaystyle j}
as a function of the branch length (
n
{\displaystyle \nu }
) for JC69. Red curve: nucleotide states
i
{\displaystyle i}
and
j
{\displaystyle j}
are different. Blue curve: initial and final states are the same. After a long time, probabilities tend to the nucleotide equilibrium frequencies (0.25: dashed line). Jukes and Cantor (1969) JC69 Pij Pii.jpg
Probability of changing from initial state to final state as a function of the branch length () for JC69. Red curve: nucleotide states and are different. Blue curve: initial and final states are the same. After a long time, probabilities tend to the nucleotide equilibrium frequencies (0.25: dashed line).

When branch length, , is measured in the expected number of changes per site then:

It is worth noticing that what stands for sum of any column (or row) of matrix multiplied by time and thus means expected number of substitutions in time (branch duration) for each particular site (per site) when the rate of substitution equals .

Given the proportion of sites that differ between the two sequences the Jukes–Cantor estimate of the evolutionary distance (in terms of the expected number of changes) between two sequences is given by

The in this formula is frequently referred to as the -distance. It is a sufficient statistic for calculating the Jukes–Cantor distance correction, but is not sufficient for the calculation of the evolutionary distance under the more complex models that follow (also note that used in subsequent formulae is not identical to the "-distance").

K80 model (Kimura 1980)

K80, the Kimura 1980 model, [3] often referred to as Kimura's two parameter model (or the K2P model), distinguishes between transitions (, i.e. from purine to purine, or , i.e. from pyrimidine to pyrimidine) and transversions (from purine to pyrimidine or vice versa). In Kimura's original description of the model the α and β were used to denote the rates of these types of substitutions, but it is now more common to set the rate of transversions to 1 and use κ to denote the transition/transversion rate ratio (as is done below). The K80 model assumes that all of the bases are equally frequent ().

Rate matrix with columns corresponding to , , , and , respectively.

The Kimura two-parameter distance is given by:

where p is the proportion of sites that show transitional differences and q is the proportion of sites that show transversional differences.

K81 model (Kimura 1981)

K81, the Kimura 1981 model, [4] often called Kimura's three parameter model (K3P model) or the Kimura three substitution type (K3ST) model, has distinct rates for transitions and two distinct types of transversions. The two transversion types are those that conserve the weak/strong properties of the nucleotides (i.e., and , denoted by symbol [4] ) and those that conserve the amino/keto properties of the nucleotides (i.e., and , denoted by symbol [4] ). The K81 model assumes that all equilibrium base frequencies are equal (i.e., ).

Rate matrix with columns corresponding to , , , and , respectively.

The K81 model is used much less often than the K80 (K2P) model for distance estimation and it is seldom the best-fitting model in maximum likelihood phylogenetics. Despite these facts, the K81 model has continued to be studied in the context of mathematical phylogenetics. [5] [6] [7] One important property is the ability to perform a Hadamard transform assuming the site patterns were generated on a tree with nucleotides evolving under the K81 model. [8] [9] [10]

When used in the context of phylogenetics the Hadamard transform provides an elegant and fully invertible means to calculate expected site pattern frequencies given a set of branch lengths (or vice versa). Unlike many maximum likelihood calculations, the relative values for , , and can vary across branches and the Hadamard transform can even provide evidence that the data do not fit a tree. The Hadamard transform can also be combined with a wide variety of methods to accommodate among-sites rate heterogeneity, [11] using continuous distributions rather than the discrete approximations typically used in maximum likelihood phylogenetics [12] (although one must sacrifice the invertibility of the Hadamard transform to use certain among-sites rate heterogeneity distributions [11] ).

F81 model (Felsenstein 1981)

F81, the Felsenstein's 1981 model, [13] is an extension of the JC69 model in which base frequencies are allowed to vary from 0.25 ()

Rate matrix:

When branch length, ν, is measured in the expected number of changes per site then:

HKY85 model (Hasegawa, Kishino and Yano 1985)

HKY85, the Hasegawa, Kishino and Yano 1985 model, [14] can be thought of as combining the extensions made in the Kimura80 and Felsenstein81 models. Namely, it distinguishes between the rate of transitions and transversions (using the κ parameter), and it allows unequal base frequencies (). [ Felsenstein described a similar (but not equivalent) model in 1984 using a different parameterization; [15] that latter model is referred to as the F84 model. [16] ]

Rate matrix

If we express the branch length, ν in terms of the expected number of changes per site then:

and formula for the other combinations of states can be obtained by substituting in the appropriate base frequencies.

T92 model (Tamura 1992)

T92, the Tamura 1992 model, [17] is a mathematical method developed to estimate the number of nucleotide substitutions per site between two DNA sequences, by extending Kimura's (1980) two-parameter method to the case where a G+C content bias exists. This method will be useful when there are strong transition-transversion and G+C-content biases, as in the case of Drosophila mitochondrial DNA. [17]

T92 involves a single, compound base frequency parameter (also noted )

As T92 echoes the Chargaff's second parity rule — pairing nucleotides do have the same frequency on a single DNA strand, G and C on the one hand, and A and T on the other hand — it follows that the four base frequences can be expressed as a function of

and

Rate matrix

The evolutionary distance between two DNA sequences according to this model is given by

where and is the G+C content ().

TN93 model (Tamura and Nei 1993)

TN93, the Tamura and Nei 1993 model, [18] distinguishes between the two different types of transition; i.e. () is allowed to have a different rate to (). Transversions are all assumed to occur at the same rate, but that rate is allowed to be different from both of the rates for transitions.

TN93 also allows unequal base frequencies ().

Rate matrix

GTR model (Tavaré 1986)

GTR, the Generalised time-reversible model of Tavaré 1986, [19] is the most general neutral, independent, finite-sites, time-reversible model possible. It was first described in a general form by Simon Tavaré in 1986. [19]

GTR parameters consist of an equilibrium base frequency vector, , giving the frequency at which each base occurs at each site, and the rate matrix

Where

are the transition rate parameters.

Therefore, GTR (for four characters, as is often the case in phylogenetics) requires 6 substitution rate parameters, as well as 4 equilibrium base frequency parameters. However, this is usually eliminated down to 9 parameters plus , the overall number of substitutions per unit time. When measuring time in substitutions (=1) only 8 free parameters remain.

In general, to compute the number of parameters, one must count the number of entries above the diagonal in the matrix, i.e. for n trait values per site , and then add n for the equilibrium base frequencies, and subtract 1 because is fixed. One gets

For example, for an amino acid sequence (there are 20 "standard" amino acids that make up proteins), one would find there are 209 parameters. However, when studying coding regions of the genome, it is more common to work with a codon substitution model (a codon is three bases and codes for one amino acid in a protein). There are codons, but the rates for transitions between codons which differ by more than one base is assumed to be zero. Hence, there are parameters.

See also

Related Research Articles

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-12 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity, and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.

In mathematics, particularly in linear algebra, tensor analysis, and differential geometry, the Levi-Civita symbol or Levi-Civita epsilon represents a collection of numbers; defined from the sign of a permutation of the natural numbers 1, 2, ..., n, for some positive integer n. It is named after the Italian mathematician and physicist Tullio Levi-Civita. Other names include the permutation symbol, antisymmetric symbol, or alternating symbol, which refer to its antisymmetric property and definition in terms of permutations.

In probability theory and statistics, the cumulantsκn of a probability distribution are a set of quantities that provide an alternative to the moments of the distribution. Any two probability distributions whose moments are identical will have identical cumulants as well, and vice versa.

In the general theory of relativity, the Einstein field equations relate the geometry of spacetime to the distribution of matter within it.

The Einstein–Hilbert action in general relativity is the action that yields the Einstein field equations through the stationary-action principle. With the (− + + +) metric signature, the gravitational part of the action is given as

<span class="mw-page-title-main">Chiral model</span> Model of mesons in the massless quark limit

In nuclear physics, the chiral model, introduced by Feza Gürsey in 1960, is a phenomenological model describing effective interactions of mesons in the chiral limit (where the masses of the quarks go to zero), but without necessarily mentioning quarks at all. It is a nonlinear sigma model with the principal homogeneous space of a Lie group as its target manifold. When the model was originally introduced, this Lie group was the SU(N), where N is the number of quark flavors. The Riemannian metric of the target manifold is given by a positive constant multiplied by the Killing form acting upon the Maurer–Cartan form of SU(N).

String cosmology is a relatively new field that tries to apply equations of string theory to solve the questions of early cosmology. A related area of study is brane cosmology.

In theoretical physics, a source field is a background field coupled to the original field as

A theoretical motivation for general relativity, including the motivation for the geodesic equation and the Einstein field equation, can be obtained from special relativity by examining the dynamics of particles in circular orbits about the Earth. A key advantage in examining circular orbits is that it is possible to know the solution of the Einstein Field Equation a priori. This provides a means to inform and verify the formalism.

<span class="mw-page-title-main">Covariant formulation of classical electromagnetism</span> Ways of writing certain laws of physics

The covariant formulation of classical electromagnetism refers to ways of writing the laws of classical electromagnetism in a form that is manifestly invariant under Lorentz transformations, in the formalism of special relativity using rectilinear inertial coordinate systems. These expressions both make it simple to prove that the laws of classical electromagnetism take the same form in any inertial coordinate system, and also provide a way to translate the fields and forces from one frame to another. However, this is not as general as Maxwell's equations in curved spacetime or non-rectilinear coordinate systems.

In directional statistics, the von Mises–Fisher distribution, is a probability distribution on the -sphere in . If the distribution reduces to the von Mises distribution on the circle.

The Newman–Penrose (NP) formalism is a set of notation developed by Ezra T. Newman and Roger Penrose for general relativity (GR). Their notation is an effort to treat general relativity in terms of spinor notation, which introduces complex forms of the usual variables used in GR. The NP formalism is itself a special case of the tetrad formalism, where the tensors of the theory are projected onto a complete vector basis at each point in spacetime. Usually this vector basis is chosen to reflect some symmetry of the spacetime, leading to simplified expressions for physical observables. In the case of the NP formalism, the vector basis chosen is a null tetrad: a set of four null vectors—two real, and a complex-conjugate pair. The two real members often asymptotically point radially inward and radially outward, and the formalism is well adapted to treatment of the propagation of radiation in curved spacetime. The Weyl scalars, derived from the Weyl tensor, are often used. In particular, it can be shown that one of these scalars— in the appropriate frame—encodes the outgoing gravitational radiation of an asymptotically flat system.

In mathematics, the Jack function is a generalization of the Jack polynomial, introduced by Henry Jack. The Jack polynomial is a homogeneous, symmetric polynomial which generalizes the Schur and zonal polynomials, and is in turn generalized by the Heckman–Opdam polynomials and Macdonald polynomials.

In quantum mechanics, the Pauli equation or Schrödinger–Pauli equation is the formulation of the Schrödinger equation for spin-½ particles, which takes into account the interaction of the particle's spin with an external electromagnetic field. It is the non-relativistic limit of the Dirac equation and can be used where particles are moving at speeds much less than the speed of light, so that relativistic effects can be neglected. It was formulated by Wolfgang Pauli in 1927.

<span class="mw-page-title-main">Spinodal decomposition</span> Mechanism of spontaneous phase separation

Spinodal decomposition is a mechanism by which a single thermodynamic phase spontaneously separates into two phases. Decomposition occurs when there is no thermodynamic barrier to phase separation. As a result, phase separation via decomposition does not require the nucleation events resulting from thermodynamic fluctuations, which normally trigger phase separation.

<span class="mw-page-title-main">Flamant solution</span>

The Flamant solution provides expressions for the stresses and displacements in a linear elastic wedge loaded by point forces at its sharp end. This solution was developed by A. Flamant in 1892 by modifying the three-dimensional solution of Boussinesq.

f(R) is a type of modified gravity theory which generalizes Einstein's general relativity. f(R) gravity is actually a family of theories, each one defined by a different function, f, of the Ricci scalar, R. The simplest case is just the function being equal to the scalar; this is general relativity. As a consequence of introducing an arbitrary function, there may be freedom to explain the accelerated expansion and structure formation of the Universe without adding unknown forms of dark energy or dark matter. Some functional forms may be inspired by corrections arising from a quantum theory of gravity. f(R) gravity was first proposed in 1970 by Hans Adolph Buchdahl. It has become an active field of research following work by Starobinsky on cosmic inflation. A wide range of phenomena can be produced from this theory by adopting different functions; however, many functional forms can now be ruled out on observational grounds, or because of pathological theoretical problems.

The Grey atmosphere is a useful set of approximations made for radiative transfer applications in studies of stellar atmospheres based on the simplified notion that the absorption coefficient of matter within a star's atmosphere is constant—that is, unchanging—for all frequencies of the star's incident radiation.

The table of chords, created by the Greek astronomer, geometer, and geographer Ptolemy in Egypt during the 2nd century AD, is a trigonometric table in Book I, chapter 11 of Ptolemy's Almagest, a treatise on mathematical astronomy. It is essentially equivalent to a table of values of the sine function. It was the earliest trigonometric table extensive enough for many practical purposes, including those of astronomy. Since the 8th and 9th centuries, the sine and other trigonometric functions have been used in Islamic mathematics and astronomy, reforming the production of sine tables. Khwarizmi and Habash al-Hasib later produced a set of trigonometric tables.

Stochastic portfolio theory (SPT) is a mathematical theory for analyzing stock market structure and portfolio behavior introduced by E. Robert Fernholz in 2002. It is descriptive as opposed to normative, and is consistent with the observed behavior of actual markets. Normative assumptions, which serve as a basis for earlier theories like modern portfolio theory (MPT) and the capital asset pricing model (CAPM), are absent from SPT.

References

  1. Arenas, Miguel (2015). "Trends in substitution models of molecular evolution". Frontiers in Genetics. 6: 319. doi: 10.3389/fgene.2015.00319 . ISSN   1664-8021. PMC   4620419 . PMID   26579193.
  2. Jukes TH, Cantor CR (1969). Evolution of Protein Molecules. New York: Academic Press. pp. 21–132.
  3. Kimura M (December 1980). "A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences". Journal of Molecular Evolution. 16 (2): 111–20. Bibcode:1980JMolE..16..111K. doi:10.1007/BF01731581. PMID   7463489. S2CID   19528200.
  4. 1 2 3 Kimura M (January 1981). "Estimation of evolutionary distances between homologous nucleotide sequences". Proceedings of the National Academy of Sciences of the United States of America. 78 (1): 454–8. Bibcode:1981PNAS...78..454K. doi: 10.1073/pnas.78.1.454 . PMC   319072 . PMID   6165991.
  5. Bashford JD, Jarvis PD, Sumner JG, Steel MA (2004-02-25). "U (1) × U (1) × U (1) symmetry of the Kimura 3ST model and phylogenetic branching processes". Journal of Physics A: Mathematical and General. 37 (8): L81–L89. arXiv: q-bio/0310037 . doi:10.1088/0305-4470/37/8/L01. S2CID   7845860.
  6. Sumner JG, Charleston MA, Jermiin LS, Jarvis PD (August 2008). "Markov invariants, plethysms, and phylogenetics". Journal of Theoretical Biology. 253 (3): 601–15. arXiv: 0711.3503 . Bibcode:2008JThBi.253..601S. doi:10.1016/j.jtbi.2008.04.001. PMID   18513747. S2CID   6851591.
  7. Sumner JG, Jarvis PD, Holland BR (December 2014). "A tensorial approach to the inversion of group-based phylogenetic models". BMC Evolutionary Biology. 14 (1): 236. doi: 10.1186/s12862-014-0236-6 . PMC   4268818 . PMID   25472897.
  8. Hendy MD, Penny D, Steel MA (April 1994). "A discrete Fourier analysis for evolutionary trees". Proceedings of the National Academy of Sciences of the United States of America. 91 (8): 3339–43. Bibcode:1994PNAS...91.3339H. doi: 10.1073/pnas.91.8.3339 . PMC   43572 . PMID   8159749.
  9. Hendy MD (2005). "Hadamard conjugation: an analytic tool for phylogenetics". In Gascuel O (ed.). Mathematics of Evolution and Phylogeny. Oxford University Press. pp. 143–177. ISBN   978-0198566106.
  10. Hendy MD, Snir S (July 2008). "Hadamard conjugation for the Kimura 3ST model: combinatorial proof using path sets". IEEE/ACM Transactions on Computational Biology and Bioinformatics. 5 (3): 461–71. arXiv: q-bio/0505055 . doi:10.1109/TCBB.2007.70227. PMID   18670048. S2CID   20633916.
  11. 1 2 Waddell PJ, Penny D, Moore T (August 1997). "Hadamard conjugations and modeling sequence evolution with unequal rates across sites". Molecular Phylogenetics and Evolution. 8 (1): 33–50. doi:10.1006/mpev.1997.0405. PMID   9242594.
  12. Yang Z (September 1994). "Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods". Journal of Molecular Evolution. 39 (3): 306–14. Bibcode:1994JMolE..39..306Y. CiteSeerX   10.1.1.305.951 . doi:10.1007/BF00160154. PMID   7932792. S2CID   17911050.
  13. Felsenstein J (1981). "Evolutionary trees from DNA sequences: a maximum likelihood approach". Journal of Molecular Evolution. 17 (6): 368–76. Bibcode:1981JMolE..17..368F. doi:10.1007/BF01734359. PMID   7288891. S2CID   8024924.
  14. Hasegawa M, Kishino H, Yano T (1985). "Dating of the human-ape splitting by a molecular clock of mitochondrial DNA". Journal of Molecular Evolution. 22 (2): 160–74. Bibcode:1985JMolE..22..160H. doi:10.1007/BF02101694. PMID   3934395. S2CID   25554168.
  15. Kishino H, Hasegawa M (August 1989). "Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea". Journal of Molecular Evolution. 29 (2): 170–9. Bibcode:1989JMolE..29..170K. doi:10.1007/BF02100115. PMID   2509717. S2CID   8045061.
  16. Felsenstein J, Churchill GA (January 1996). "A Hidden Markov Model approach to variation among sites in rate of evolution". Molecular Biology and Evolution. 13 (1): 93–104. doi: 10.1093/oxfordjournals.molbev.a025575 . hdl: 1813/31897 . PMID   8583911.
  17. 1 2 Tamura K (July 1992). "Estimation of the number of nucleotide substitutions when there are strong transition-transversion and G+C-content biases". Molecular Biology and Evolution. 9 (4): 678–87. doi: 10.1093/oxfordjournals.molbev.a040752 . PMID   1630306.
  18. Tamura K, Nei M (May 1993). "Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees". Molecular Biology and Evolution. 10 (3): 512–26. doi: 10.1093/oxfordjournals.molbev.a040023 . PMID   8336541.
  19. 1 2 Tavaré S (1986). "Some Probabilistic and Statistical Problems in the Analysis of DNA Sequences" (PDF). Lectures on Mathematics in the Life Sciences. 17: 57–86.

Further reading