QST (genetics)

In quantitative genetics, QST is a statistic intended to measure the degree of genetic differentiation among populations with regard to a quantitative trait. It was developed by Ken Spitze in 1993. [1] Its name reflects that QST was intended to be analogous to the fixation index for a single genetic locus (FST). [2] [3] QST is often compared with FST of neutral loci to test if variation in a quantitative trait is a result of divergent selection or genetic drift, an analysis known as QST–FST comparisons.

Calculation of QST

Equations

QST represents the proportion of variance in a quantitative trait that lies among subpopulations, and its calculation is analogous to that of FST, developed by Sewall Wright. [4] However, instead of using genetic differentiation at marker loci, QST is calculated from the variance of a quantitative trait within and among subpopulations, and for the total population. [1] For a diploid trait with additive genetic effects, the variance of a quantitative trait among populations ($\sigma^2_{GB}$) is described as:

$$\sigma^2_{GB} = 2 Q_{ST} \sigma^2_T$$

And the variance of a quantitative trait within populations ($\sigma^2_{GW}$) is described as:

$$\sigma^2_{GW} = (1 - Q_{ST}) \sigma^2_T$$

Where $\sigma^2_T$ is the total genetic variance in all populations. Therefore, QST can be calculated with the following equation:

$$Q_{ST} = \frac{\sigma^2_{GB}}{\sigma^2_{GB} + 2\sigma^2_{GW}}$$
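As an illustration, the calculation can be sketched in Python. This is a minimal sketch that assumes the commonly used diploid, additive form QST = σ²GB / (σ²GB + 2σ²GW); the function name `qst` and the numeric variance components are hypothetical and are not taken from any cited study.

```python
def qst(var_between: float, var_within: float) -> float:
    """Return Q_ST from additive genetic variance components.

    Assumes the diploid, additive form:
        Q_ST = var_between / (var_between + 2 * var_within)
    """
    if var_between < 0 or var_within < 0:
        raise ValueError("variance components must be non-negative")
    denominator = var_between + 2.0 * var_within
    if denominator == 0:
        raise ValueError("at least one variance component must be positive")
    return var_between / denominator


# Hypothetical variance components, chosen only for illustration.
among_population_variance = 0.5
within_population_variance = 1.0
print(qst(among_population_variance, within_population_variance))  # 0.5 / 2.5 = 0.2
```

In practice the variance components are themselves estimates (for example from a common garden design), so the resulting QST carries their sampling error.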

Assumptions

Calculation of QST rests on several assumptions: populations are in Hardy–Weinberg equilibrium, observed variation is due to additive genetic effects only, selection and linkage disequilibrium are absent, [5] and the subpopulations follow an island model. [6]

QST–FST comparisons

QST–FST analyses often involve raising organisms under uniform environmental conditions, known as common garden experiments, [7] and comparing the phenotypic variance to genetic variance. If QST exceeds FST, this is interpreted as evidence of divergent selection, because it indicates more differentiation in the trait than could be produced by genetic drift alone. If QST is less than FST, balancing selection is inferred. If the values of QST and FST are equivalent, the observed trait differentiation could be due to genetic drift alone. [6]
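The three outcomes described above can be sketched as a simple decision rule. This is a minimal sketch, not a published test: it assumes that neutral FST is summarized by a lower and an upper bound (for example from a bootstrap over loci), and the function name `interpret_qst_fst`, the bounds, and the QST values are all hypothetical.

```python
def interpret_qst_fst(qst: float, fst_lower: float, fst_upper: float) -> str:
    """Classify a Q_ST value against an assumed interval for neutral F_ST."""
    if not (0.0 <= fst_lower <= fst_upper <= 1.0):
        raise ValueError("expected 0 <= fst_lower <= fst_upper <= 1")
    if qst > fst_upper:
        # More trait differentiation than drift alone is expected to produce.
        return "divergent selection"
    if qst < fst_lower:
        # Less trait differentiation than expected under drift.
        return "balancing selection"
    return "consistent with genetic drift"


# Hypothetical values, for illustration only.
for q in (0.45, 0.02, 0.12):
    print(q, interpret_qst_fst(q, fst_lower=0.08, fst_upper=0.15))
```

Using an interval rather than a single FST value is a design choice of this sketch: it avoids treating small differences between two noisy estimates as evidence of selection.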

Suitable comparison of QST and FST is subject to multiple ecological and evolutionary assumptions, [8] [9] [10] and since the development of QST, multiple studies have examined the limitations and constraints of QST–FST analyses. Leinonen et al. note that FST must be calculated from neutral loci; however, over-filtering of non-neutral loci can artificially reduce FST values. [7] Cubry et al. found that QST is reduced in the presence of dominance, resulting in conservative estimates of divergent selection when QST is high, and inconclusive results for balancing selection when QST is low. [5] Additionally, population structure can significantly affect QST–FST comparisons. Stepping stone models, which can generate more evolutionary noise than island models, are more prone to type I errors. [6] If a subset of populations acts as sources, such as during invasion, weighting the genetic contributions of each population can increase detection of adaptation. [11] To improve the precision of QST analyses, a larger number of populations (more than 20) should be included. [12]
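The effect of the number of populations on precision can be illustrated with a toy simulation. This is a sketch under strong simplifying assumptions, not the method of any cited study: population means are drawn from a normal distribution, the within-population variance is treated as known, and the diploid, additive form QST = σ²GB / (σ²GB + 2σ²GW) is used. All names and parameter values are hypothetical.

```python
import random
import statistics


def simulated_qst_estimates(n_pops, var_between=0.5, var_within=1.0,
                            n_reps=1000, seed=1):
    """Simulate Q_ST estimates when only the among-population variance is estimated."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_reps):
        # Draw one true mean per population around a common value of zero.
        means = [rng.gauss(0.0, var_between ** 0.5) for _ in range(n_pops)]
        # Unbiased sample variance of the population means.
        vb_hat = statistics.variance(means)
        estimates.append(vb_hat / (vb_hat + 2.0 * var_within))
    return estimates


def interquartile_range(values):
    """Spread of the estimates, measured as Q3 - Q1."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 - q1


for n_pops in (5, 20, 40):
    spread = interquartile_range(simulated_qst_estimates(n_pops))
    print(f"{n_pops} populations: IQR of simulated QST = {spread:.3f}")
```

Under these assumptions the spread of the simulated estimates shrinks as populations are added, which is the qualitative pattern behind the recommendation to sample many populations.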

QST applications in literature

Multiple studies have used QST to separate the effects of natural selection and genetic drift, and QST is often observed to exceed FST, indicating local adaptation. [13] In an ecological restoration study, Bower and Aitken used QST to evaluate suitable populations for seed transfer of whitebark pine. They found high QST values in many populations, suggesting local adaptation for cold-adapted characteristics. [14] In an assessment of the invasive species Brachypodium sylvaticum, Marchini et al. found divergence between native and invasive populations during initial establishment in the invaded range, but minimal divergence during range expansion. [11] In an examination of the common snapdragon (Antirrhinum majus) along an elevation gradient, QST–FST analyses revealed different adaptation trends between two subspecies (A. m. pseudomajus and A. m. striatum). While both subspecies occur at all elevations, A. m. striatum had high QST values for traits associated with altitude adaptation: plant height, number of branches, and internode length. A. m. pseudomajus had lower QST than FST values for germination time. [15]

References

  1. Spitze K (October 1993). "Population structure in Daphnia obtusa: quantitative genetic and allozymic variation". Genetics. 135 (2): 367–374. doi:10.1093/genetics/135.2.367. PMC 1205642. PMID 8244001.
  2. Whitlock MC (April 2008). "Evolutionary inference from QST". Molecular Ecology. 17 (8): 1885–1896. doi:10.1111/j.1365-294X.2008.03712.x. PMID 18363667.
  3. McKay JK, Latta RG (June 2002). "Adaptive population divergence: markers, QTL and traits". Trends in Ecology & Evolution. 17 (6): 285–291. doi:10.1016/S0169-5347(02)02478-3.
  4. Wright S (1949). "The Genetic Structure of Populations". Annals of Eugenics. 15 (4): 323–354. doi:10.1111/j.1469-1809.1949.tb02451.x. PMID 24540312.
  5. Cubry P, Scotti I, Oddou-Muratorio S, Lefèvre F (November 2017). "Generalization of the QST framework in hierarchically structured populations: Impacts of inbreeding and dominance". Molecular Ecology Resources. 17 (6): e76–e83. doi:10.1111/1755-0998.12693. PMID 28681534. S2CID 206947951.
  6. de Villemereuil P, Gaggiotti OE, Goudet J (2022). "Common garden experiments to study local adaptation need to account for population structure". Journal of Ecology. 110 (5): 1005–1009. doi:10.1111/1365-2745.13528. ISSN 0022-0477. S2CID 225136876.
  7. Leinonen T, McCairns RJ, O'Hara RB, Merilä J (March 2013). "QST–FST comparisons: evolutionary and ecological insights from genomic heterogeneity". Nature Reviews Genetics. 14 (3): 179–190. doi:10.1038/nrg3395. PMID 23381120. S2CID 6312222.
  8. Pujol B, Wilson AJ, Ross RI, Pannell JR (November 2008). "Are QST–FST comparisons for natural populations meaningful?". Molecular Ecology. 17 (22): 4782–4785. doi:10.1111/j.1365-294X.2008.03958.x. PMID 19140971. S2CID 11707577.
  9. Leinonen T, O'Hara RB, Cano JM, Merilä J (January 2008). "Comparative studies of quantitative trait and neutral marker divergence: a meta-analysis". Journal of Evolutionary Biology. 21 (1): 1–17. doi:10.1111/j.1420-9101.2007.01445.x. PMID 18028355. S2CID 1037769.
  10. Miller JR, Wood BP, Hamilton MB (October 2008). "FST and QST under neutrality". Genetics. 180 (2): 1023–1037. doi:10.1534/genetics.108.092031. PMC 2567353. PMID 18780742.
  11. Marchini GL, Arredondo TM, Cruzan MB (November 2018). "Selective differentiation during the colonization and establishment of a newly invasive species". Journal of Evolutionary Biology. 31 (11): 1689–1703. doi:10.1111/jeb.13369. PMID 30120791. S2CID 52031406.
  12. O'Hara RB, Merilä J (November 2005). "Bias and precision in QST estimates: problems and some solutions". Genetics. 171 (3): 1331–1339. doi:10.1534/genetics.105.044545. PMC 1456852. PMID 16085700.
  13. Merilä J, Crnokrak P (2001). "Comparison of genetic differentiation at marker loci and quantitative traits: Natural selection and genetic differentiation". Journal of Evolutionary Biology. 14 (6): 892–903. doi:10.1046/j.1420-9101.2001.00348.x. S2CID 83979407.
  14. Bower AD, Aitken SN (2008). "Ecological genetics and seed transfer guidelines for Pinus albicaulis (Pinaceae)". American Journal of Botany. 95 (1): 66–76. doi:10.3732/ajb.95.1.66. PMID 21632316.
  15. Marin S, Gibert A, Archambeau J, Bonhomme V, Lascoste M, Pujol B (2020). "Potential adaptive divergence between subspecies and populations of snapdragon plants inferred from QST–FST comparisons". Molecular Ecology. 29 (16): 3010–3021. doi:10.1111/mec.15546. ISSN 0962-1083. PMC 7540467. PMID 32652730.