Human evolutionary genetics studies how one human genome differs from another human genome, the evolutionary past that gave rise to the human genome, and its current effects. Differences between genomes have anthropological, medical, historical and forensic implications and applications. Genetic data can provide important insights into human evolution.
−10 — – −9 — – −8 — – −7 — – −6 — – −5 — – −4 — – −3 — – −2 — – −1 — – 0 — |
| |||||||||||||||||||||||||||||
Biologists classify humans, along with only a few other species, as great apes (species in the family Hominidae). The living Hominidae include two distinct species of chimpanzee (the bonobo, Pan paniscus , and the chimpanzee, Pan troglodytes ), two species of gorilla (the western gorilla, Gorilla gorilla , and the eastern gorilla, Gorilla graueri), and two species of orangutan (the Bornean orangutan, Pongo pygmaeus , and the Sumatran orangutan, Pongo abelii). The great apes with the family Hylobatidae of gibbons form the superfamily Hominoidea of apes.
Apes, in turn, belong to the primate order (>400 species), along with the Old World monkeys, the New World monkeys, and others. Data from both mitochondrial DNA (mtDNA) and nuclear DNA (nDNA) indicate that primates belong to the group of Euarchontoglires, together with Rodentia, Lagomorpha, Dermoptera, and Scandentia. [1] This is further supported by Alu-like short interspersed nuclear elements (SINEs) which have been found only in members of the Euarchontoglires. [2]
A phylogenetic tree is usually derived from DNA or protein sequences from populations. Often, mitochondrial DNA or Y chromosome sequences are used to study ancient human demographics. These single-locus sources of DNA do not recombine and are almost always inherited from a single parent, with only one known exception in mtDNA. [3] Individuals from closer geographic regions generally tend to be more similar than individuals from regions farther away. Distance on a phylogenetic tree can be used approximately to indicate:
The separation of humans from their closest relatives, the non-human African apes (chimpanzees and gorillas), has been studied extensively for more than a century. Five major questions have been addressed:
As discussed before, different parts of the genome show different sequence divergence between different hominoids. It has also been shown that the sequence divergence between DNA from humans and chimpanzees varies greatly. For example, the sequence divergence varies between 0% to 2.66% between non-coding, non-repetitive genomic regions of humans and chimpanzees. [8] The percentage of nucleotides in the human genome (hg38) that had one-to-one exact matches in the chimpanzee genome (pantro6) was 84.38%. Additionally gene trees, generated by comparative analysis of DNA segments, do not always fit the species tree. Summing up:
The divergence time of humans from other apes is of great interest. One of the first molecular studies, published in 1967 measured immunological distances (IDs) between different primates. [10] Basically the study measured the strength of immunological response that an antigen from one species (human albumin) induces in the immune system of another species (human, chimpanzee, gorilla and Old World monkeys). Closely related species should have similar antigens and therefore weaker immunological response to each other's antigens. The immunological response of a species to its own antigens (e.g. human to human) was set to be 1.
The ID between humans and gorillas was determined to be 1.09, that between humans and chimpanzees was determined as 1.14. However the distance to six different Old World monkeys was on average 2.46, indicating that the African apes are more closely related to humans than to monkeys. The authors consider the divergence time between Old World monkeys and hominoids to be 30 million years ago (MYA), based on fossil data, and the immunological distance was considered to grow at a constant rate. They concluded that divergence time of humans and the African apes to be roughly ~5 MYA. That was a surprising result. Most scientists at that time thought that humans and great apes diverged much earlier (>15 MYA).
The gorilla was, in ID terms, closer to human than to chimpanzees; however, the difference was so slight that the trichotomy could not be resolved with certainty. Later studies based on molecular genetics were able to resolve the trichotomy: chimpanzees are phylogenetically closer to humans than to gorillas. However, some divergence times estimated later (using much more sophisticated methods in molecular genetics) do not substantially differ from the very first estimate in 1967, but a recent paper [11] puts it at 11–14 MYA.
Current methods to determine divergence times use DNA sequence alignments and molecular clocks. Usually the molecular clock is calibrated assuming that the orangutan split from the African apes (including humans) 12-16 MYA. Some studies also include some old world monkeys and set the divergence time of them from hominoids to 25-30 MYA. Both calibration points are based on very little fossil data and have been criticized. [12]
If these dates are revised, the divergence times estimated from molecular data will change as well. However, the relative divergence times are unlikely to change. Even if we can't tell absolute divergence times exactly, we can be pretty sure that the divergence time between chimpanzees and humans is about sixfold shorter than between chimpanzees (or humans) and monkeys.
One study (Takahata et al., 1995) used 15 DNA sequences from different regions of the genome from human and chimpanzee and 7 DNA sequences from human, chimpanzee and gorilla. [13] They determined that chimpanzees are more closely related to humans than gorillas. Using various statistical methods, they estimated the divergence time human-chimp to be 4.7 MYA and the divergence time between gorillas and humans (and chimps) to be 7.2 MYA.
Additionally they estimated the effective population size of the common ancestor of humans and chimpanzees to be ~100,000. This was somewhat surprising since the present day effective population size of humans is estimated to be only ~10,000. If true that means that the human lineage would have experienced an immense decrease of its effective population size (and thus genetic diversity) in its evolution. (see Toba catastrophe theory)
Another study (Chen & Li, 2001) sequenced 53 non-repetitive, intergenic DNA segments from human, chimpanzee, gorilla and orangutan. [8] When the DNA sequences were concatenated to a single long sequence, the generated neighbor-joining tree supported the Homo-Pan clade with 100% bootstrap (that is that humans and chimpanzees are the closest related species of the four). When three species are fairly closely related to each other (like human, chimpanzee and gorilla), the trees obtained from DNA sequence data may not be congruent with the tree that represents the speciation (species tree).
The shorter internodal time span (TIN) the more common are incongruent gene trees. The effective population size (Ne) of the internodal population determines how long genetic lineages are preserved in the population. A higher effective population size causes more incongruent gene trees. Therefore, if the internodal time span is known, the ancestral effective population size of the common ancestor of humans and chimpanzees can be calculated.
When each segment was analyzed individually, 31 supported the Homo-Pan clade, 10 supported the Homo-Gorilla clade, and 12 supported the Pan-Gorilla clade. Using the molecular clock the authors estimated that gorillas split up first 6.2-8.4 MYA and chimpanzees and humans split up 1.6-2.2 million years later (internodal time span) 4.6-6.2 MYA. The internodal time span is useful to estimate the ancestral effective population size of the common ancestor of humans and chimpanzees.
A parsimonious analysis revealed that 24 loci supported the Homo-Pan clade, 7 supported the Homo-Gorilla clade, 2 supported the Pan-Gorilla clade and 20 gave no resolution. Additionally they took 35 protein coding loci from databases. Of these 12 supported the Homo-Pan clade, 3 the Homo-Gorilla clade, 4 the Pan-Gorilla clade and 16 gave no resolution. Therefore, only ~70% of the 52 loci that gave a resolution (33 intergenic, 19 protein coding) support the 'correct' species tree. From the fraction of loci which did not support the species tree and the internodal time span they estimated previously, the effective population of the common ancestor of humans and chimpanzees was estimated to be ~52 000 to 96 000. This value is not as high as that from the first study (Takahata), but still much higher than present day effective population size of humans.
A third study (Yang, 2002) used the same dataset that Chen and Li used but estimated the ancestral effective population of 'only' ~12,000 to 21,000, using a different statistical method. [14]
Humans and chimpanzees are 99.1% identical at the codingl level, with 99.4% similarity at the nonsynonymous level and 98.4% at the synonymous level. [15] The alignable sequences within genomes of humans and chimpanzees differ by about 35 million single-nucleotide substitutions. Additionally about 3% of the complete genomes differ by deletions, insertions and duplications. [16]
Since mutation rate is relatively constant, roughly one half of these changes occurred in the human lineage. Only a very tiny fraction of those fixed differences gave rise to the different phenotypes of humans and chimpanzees and finding those is a great challenge. The vast majority of the differences are neutral and do not affect the phenotype.[ citation needed ]
Molecular evolution may act in different ways, through protein evolution, gene loss, differential gene regulation and RNA evolution. All are thought to have played some part in human evolution.
Many different mutations can inactivate a gene, but few will change its function in a specific way. Inactivation mutations will therefore be readily available for selection to act on. Gene loss could thus be a common mechanism of evolutionary adaptation (the "less-is-more" hypothesis). [17]
80 genes were lost in the human lineage after separation from the last common ancestor with the chimpanzee. 36 of those were for olfactory receptors. Genes involved in chemoreception and immune response are overrepresented. [18] Another study estimated that 86 genes had been lost. [19]
A gene for type I hair keratin was lost in the human lineage. Keratins are a major component of hairs. Humans still have nine functional type I hair keratin genes, but the loss of that particular gene may have caused the thinning of human body hair. Based on the assumption of a constant molecular clock, the study predicts the gene loss occurred relatively recently in human evolution—less than 240 000 years ago, but both the Vindija Neandertal and the high-coverage Denisovan sequence contain the same premature stop codons as modern humans and hence dating should be greater than 750 000 years ago. [20]
Stedman et al. (2004) stated that the loss of the sarcomeric myosin gene MYH16 in the human lineage led to smaller masticatory muscles. They estimated that the mutation that led to the inactivation (a two base pair deletion) occurred 2.4 million years ago, predating the appearance of Homo ergaster/erectus in Africa. The period that followed was marked by a strong increase in cranial capacity, promoting speculation that the loss of the gene may have removed an evolutionary constraint on brain size in the genus Homo. [21]
Another estimate for the loss of the MYH16 gene is 5.3 million years ago, long before Homo appeared. [22]
Segmental duplications (SDs or LCRs) have had roles in creating new primate genes and shaping human genetic variation.
When the human genome was compared to the genomes of five comparison primate species, including the chimpanzee, gorilla, orangutan, gibbon, and macaque, it was found that there are approximately 20,000 human-specific insertions believed to be regulatory. While most insertions appear to be fitness neutral, a small amount have been identified in positively selected genes showing associations to neural phenotypes and some relating to dental and sensory perception-related phenotypes. These findings hint at the seemingly important role of human-specific insertions in the recent evolution of humans. [23]
Human accelerated regions are areas of the genome that differ between humans and chimpanzees to a greater extent than can be explained by genetic drift over the time since the two species shared a common ancestor. These regions show signs of being subject to natural selection, leading to the evolution of distinctly human traits. Two examples are HAR1F, which is believed to be related to brain development and HAR2 (a.k.a. HACNS1) that may have played a role in the development of the opposable thumb.
It has also been hypothesized that much of the difference between humans and chimpanzees is attributable to the regulation of gene expression rather than differences in the genes themselves. Analyses of conserved non-coding sequences, which often contain functional and thus positively selected regulatory regions, address this possibility. [24]
When the draft sequence of the common chimpanzee (Pan troglodytes) genome was published in the summer 2005, 2400 million bases (of ~3160 million bases) were sequenced and assembled well enough to be compared to the human genome. [16] 1.23% of this sequenced differed by single-base substitutions. Of this, 1.06% or less was thought to represent fixed differences between the species, with the rest being variant sites in humans or chimpanzees. Another type of difference, called indels (insertions/deletions) accounted for many fewer differences (15% as many), but contributed ~1.5% of unique sequence to each genome, since each insertion or deletion can involve anywhere from one base to millions of bases. [16]
A companion paper examined segmental duplications in the two genomes, [25] whose insertion and deletion into the genome account for much of the indel sequence. They found that a total of 2.7% of euchromatic sequence had been differentially duplicated in one or the other lineage.
Locus | Human-Chimp | Human-Gorilla | Human-Orangutan |
---|---|---|---|
Alu elements | 2 | - | - |
Non-coding (Chr. Y) | 1.68 ± 0.19 | 2.33 ± 0.2 | 5.63 ± 0.35 |
Pseudogenes (autosomal) | 1.64 ± 0.10 | 1.87 ± 0.11 | - |
Pseudogenes (Chr. X) | 1.47 ± 0.17 | - | - |
Noncoding (autosomal) | 1.24 ± 0.07 | 1.62 ± 0.08 | 3.08 ± 0.11 |
Genes (Ks) | 1.11 | 1.48 | 2.98 |
Introns | 0.93 ± 0.08 | 1.23 ± 0.09 | - |
Xq13.3 | 0.92 ± 0.10 | 1.42 ± 0.12 | 3.00 ± 0.18 |
Subtotal for X chromosome | 1.16 ± 0.07 | 1.47 ± 0.08 | - |
Genes (Ka) | 0.8 | 0.93 | 1.96 |
The sequence divergence has generally the following pattern: Human-Chimp < Human-Gorilla << Human-Orangutan, highlighting the close kinship between humans and the African apes. Alu elements diverge quickly due to their high frequency of CpG dinucleotides which mutate roughly 10 times more often than the average nucleotide in the genome. The mutation rate is higher in the male germ line, therefore the divergence in the Y chromosome—which is inherited solely from the father—is higher than in autosomes. The X chromosome is inherited twice as often through the female germ line as through the male germ line and therefore shows slightly lower sequence divergence. The sequence divergence of the Xq13.3 region is surprisingly low between humans and chimpanzees. [26]
Mutations altering the amino acid sequence of proteins (Ka) are the least common. In fact ~29% of all orthologous proteins are identical between human and chimpanzee. The typical protein differs by only two amino acids. [16] The measures of sequence divergence shown in the table only take the substitutional differences, for example from an A (adenine) to a G (guanine), into account. DNA sequences may however also differ by insertions and deletions (indels) of bases. These are usually stripped from the alignments before the calculation of sequence divergence is performed.
An international group of scientists completed a draft sequence of the Neanderthal genome in May 2010. The results indicate some breeding between modern humans (Homo sapiens) and Neanderthals (Homo neanderthalensis), as the genomes of non-African humans have 1–4% more in common with Neanderthals than do the genomes of subsaharan Africans. Neanderthals and most modern humans share a lactose-intolerant variant of the lactase gene that encodes an enzyme that is unable to break down lactose in milk after weaning. Modern humans and Neanderthals also share the FOXP2 gene variant associated with brain development and with speech in modern humans, indicating that Neanderthals may have been able to speak. Chimps have two amino acid differences in FOXP2 compared with human and Neanderthal FOXP2. [27] [28] [29]
Homo sapiens is thought to have emerged about 300,000 years ago. It dispersed throughout Africa, and after 70,000 years ago throughout Eurasia and Oceania. A 2009 study identified 14 "ancestral population clusters", the most remote being the San people of Southern Africa. [30] [31]
With their rapid expansion throughout different climate zones, and especially with the availability of new food sources with the domestication of cattle and the development of agriculture, human populations have been exposed to significant selective pressures since their dispersal. For example, the ancestors of East Asians are thought to have undergone processess of selection for a number of alleles, including variants of the EDAR, ADH1B, ABCC1, and ALDH2 genes.
The East Asian types of ADH1B in particular are associated with rice domestication and would thus have arisen after the development of rice cultivation roughly 10,000 years ago. [32] Several phenotypical traits of characteristic of East Asians are due to a single mutation of the EDAR gene, dated to c. 35,000 years ago. [33]
As of 2017 [update] , the Single Nucleotide Polymorphism Database (dbSNP), which lists SNP and other variants, listed a total of 324 million variants found in sequenced human genomes. [34] Nucleotide diversity, the average proportion of nucleotides that differ between two individuals, is estimated at between 0.1% and 0.4% for contemporary humans (compared to 2% between humans and chimpanzees). [35] [36] This corresponds to genome differences at a few million sites; the 1000 Genomes Project similarly found that "a typical [individual] genome differs from the reference human genome at 4.1 million to 5.0 million sites … affecting 20 million bases of sequence." [37]
In February 2019, scientists discovered evidence, based on genetics studies using artificial intelligence (AI), that suggest the existence of an unknown human ancestor species, not Neanderthal, Denisovan or human hybrid (like Denny (hybrid hominin)), in the genome of modern humans. [38] [39]
In March 2019, Chinese scientists reported inserting the human brain-related MCPH1 gene into laboratory rhesus monkeys, resulting in the transgenic monkeys performing better and answering faster on "short-term memory tests involving matching colors and shapes", compared to control non-transgenic monkeys, according to the researchers. [40] [41]
In May 2023, scientists reported, based on genetic studies, a more complicated pathway of human evolution than previously understood. According to the studies, humans evolved from different places and times in Africa, instead of from a single location and period of time. [42] [43]
On 31 August 2023, researchers reported, based on genetic studies, that a human ancestor population bottleneck occurred "around 930,000 and 813,000 years ago ... lasted for about 117,000 years and brought human ancestors close to extinction." [44] [45]
Human evolution is the evolutionary process within the history of primates that led to the emergence of Homo sapiens as a distinct species of the hominid family that includes all the great apes. This process involved the gradual development of traits such as human bipedalism, dexterity, and complex language, as well as interbreeding with other hominins, indicating that human evolution was not linear but weblike. The study of the origins of humans involves several scientific disciplines, including physical and evolutionary anthropology, paleontology, and genetics; the field is also known by the terms anthropogeny, anthropogenesis, and anthropogony.
Homininae, also called "African hominids" or "African apes", is a subfamily of Hominidae. It includes two tribes, with their extant as well as extinct species: 1) the tribe Hominini ―and 2) the tribe Gorillini (gorillas). Alternatively, the genus Pan is sometimes considered to belong to its own third tribe, Panini. Homininae comprises all hominids that arose after orangutans split from the line of great apes. The Homininae cladogram has three main branches, which lead to gorillas and to humans and chimpanzees. There are two living species of Panina and two living species of gorillas, but only one extant human species. Traces of extinct Homo species, including Homo floresiensis have been found with dates as recent as 40,000 years ago. Organisms in this subfamily are described as hominine or hominines.
Primates is an order of mammals, which is further divided into the strepsirrhines, which include lemurs, galagos, and lorisids; and the haplorhines, which include tarsiers; and the simians, which include monkeys and apes. Primates arose 85–55 million years ago first from small terrestrial mammals, which adapted for life in tropical forests: many primate characteristics represent adaptations to the challenging environment among tree tops, including large brain sizes, binocular vision, color vision, vocalizations, shoulder girdles allowing a large degree of movement in the upper limbs, and opposable thumbs that enable better grasping and dexterity. Primates range in size from Madame Berthe's mouse lemur, which weighs 30 g (1 oz), to the eastern gorilla, weighing over 200 kg (440 lb). There are 376–524 species of living primates, depending on which classification is used. New primate species continue to be discovered: over 25 species were described in the 2000s, 36 in the 2010s, and six in the 2020s.
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.
In human genetics, the Mitochondrial Eve is the matrilineal most recent common ancestor (MRCA) of all living humans. In other words, she is defined as the most recent woman from whom all living humans descend in an unbroken line purely through their mothers and through the mothers of those mothers, back until all lines converge on one woman.
Gibbons are apes in the family Hylobatidae. The family historically contained one genus, but now is split into four extant genera and 20 species. Gibbons live in subtropical and tropical forests from eastern Bangladesh to Northeast India to southern China and Indonesia.
Apes are a clade of Old World simians native to sub-Saharan Africa and Southeast Asia, which together with its sister group Cercopithecidae form the catarrhine clade, cladistically making them monkeys. Apes do not have tails due to a mutation of the TBXT gene. In traditional and non-scientific use, the term ape can include tailless primates taxonomically considered Cercopithecidae, and is thus not equivalent to the scientific taxon Hominoidea. There are two extant branches of the superfamily Hominoidea: the gibbons, or lesser apes; and the hominids, or great apes.
In human genetics, the Y-chromosomal most recent common ancestor is the patrilineal most recent common ancestor (MRCA) from whom all currently living humans are descended. He is the most recent male from whom all living humans are descended through an unbroken line of their male ancestors. The term Y-MRCA reflects the fact that the Y chromosomes of all currently living human males are directly derived from the Y chromosome of this remote ancestor. The analogous concept of the matrilineal most recent common ancestor is known as "Mitochondrial Eve", the most recent woman from whom all living humans are descended matrilineally. As with "Mitochondrial Eve", the title of "Y-chromosomal Adam" is not permanently fixed to a single individual, but can advance over the course of human history as paternal lineages become extinct.
The humanzee is a hypothetical hybrid of chimpanzee and human, thus a form of human–animal hybrid. Serious attempts to create such a hybrid were made by Soviet biologist Ilya Ivanovich Ivanov in the 1920s, and possibly by researchers in China in the 1960s, though neither succeeded.
The timeline of human evolution outlines the major events in the evolutionary lineage of the modern human species, Homo sapiens, throughout the history of life, beginning some 4 billion years ago down to recent evolution within H. sapiens during and since the Last Glacial Period.
The Hominini (hominins) form a taxonomic tribe of the subfamily Homininae (hominines). They comprise two extant genera: Homo (humans) and Pan, but in standard usage exclude the genus Gorilla (gorillas), which is grouped separately within subfamily Homininae.
The Chimpanzee Genome Project was an effort to determine the DNA sequence of the chimpanzee genome. Sequencing began in 2005 and by 2013 twenty-four individual chimpanzees had been sequenced. This project was folded into the Great Ape Genome Project.
Molecular anthropology, also known as genetic anthropology, is the study of how molecular biology has contributed to the understanding of human evolution. This field of anthropology examines evolutionary links between ancient and modern human populations, as well as between contemporary species. Generally, comparisons are made between sequences, either DNA or protein sequences; however, early studies used comparative serology.
The Hominidae, whose members are known as the great apes or hominids, are a taxonomic family of primates that includes eight extant species in four genera: Pongo ; Gorilla ; Pan ; and Homo, of which only modern humans remain.
The chimpanzee–human last common ancestor (CHLCA) is the last common ancestor shared by the extant Homo (human) and Pan genera of Hominini. Estimates of the divergence date vary widely from thirteen to five million years ago.
The human mitochondrial molecular clock is the rate at which mutations have been accumulating in the mitochondrial genome of hominids during the course of human evolution. The archeological record of human activity from early periods in human prehistory is relatively limited and its interpretation has been controversial. Because of the uncertainties from the archeological record, scientists have turned to molecular dating techniques in order to refine the timeline of human evolution. A major goal of scientists in the field is to develop an accurate hominid mitochondrial molecular clock which could then be used to confidently date events that occurred during the course of human evolution.
Incomplete lineage sorting, also termed hemiplasy, deep coalescence, retention of ancestral polymorphism, or trans-species polymorphism, describes a phenomenon in population genetics when ancestral gene copies fail to coalesce into a common ancestral copy until deeper than previous speciation events. It is caused by lineage sorting of genetic polymorphisms that were retained across successive nodes in the species tree. In other words, the tree produced by a single gene differs from the population or species level tree, producing a discordant tree. Whatever the mechanism, the result is that a generated species level tree may differ depending on the selected genes used for assessment. This is in contrast to complete lineage sorting, where the tree produced by the gene is the same as the population or species level tree. Both are common results in phylogenetic analysis, although it depends on the gene, organism, and sampling technique.
The myth of the one percent refers to the 1975 study done by Wilson and King that asserted that human-chimpanzee divergence is about 1%. Humans share a common ancestor with chimpanzees, and the rapid evolution of chimpanzees and humans, along with gorillas and bonobos, has led to difficulties in creating an accurate lineage or tree topology. Chimpanzees and humans were found to be a monophyletic clade, leading to the question of how closely related the two are.
The phylogenetic split of Hominidae into the subfamilies Homininae and Ponginae is dated to the middle Miocene, roughly 18 to 14 million years ago. This split is also referenced as the "orangutan–human last common ancestor" by Jeffrey H. Schwartz, professor of anthropology at the University of Pittsburgh School of Arts and Sciences, and John Grehan, director of science at the Buffalo Museum.
{{cite journal}}
: CS1 maint: numeric names: authors list (link)