An evolutionary lineage is a temporal series of populations, organisms, cells, or genes connected by a continuous line of descent from ancestor to descendant. [1] [2] Lineages are subsets of the evolutionary tree of life. Lineages are often determined by the techniques of molecular systematics.
Lineages are typically visualized as subsets of a phylogenetic tree. A lineage is a single line of descent or linear chain within the tree, while a clade is a (usually branched) monophyletic group, containing a single ancestor and all its descendants. Phylogenetic trees are typically created from DNA, RNA or protein sequence data. Apart from this, morphological differences and similarities have been, and still are used to create phylogenetic trees. Sequences from different individuals are collected and their similarity is quantified. Mathematical procedures are used to cluster individuals by similarity. [3]
Members of a species are considered to evolve as a single unit (or lineage) when they repeatedly share the same genes. The nodes would represent a split in lineage due to a breaking of genetic connections: when a single lineage is divided into two subsets, with the individuals not exchanging genes, they will accumulate differences in genes. If they do not fuse back again, it will create a new distinct descendant clade. [3]
Just as a map is a scaled approximation of true geography, a phylogenetic tree is an approximation of the true complete evolutionary relationships. For example, in a full tree of life, the entire clade of animals can be collapsed to a single branch of the tree. However, this is merely a limitation of rendering space. In theory, a true and complete tree for all living organisms or for any DNA sequence could be generated. [3]
Nevertheless, phylogenies can sometimes appear in a non-treelike form. Branches on the tree of life may grow together, a phenomenon called reticulation, which occurs due to different biological processes. Another process, introgression, occurs when hybrids between distinct lineages transfer novel genetic material through subsequent crossing. In other cases, hybrid speciation takes place when lineages hybridize to form a new, distinct lineage. Horizontal gene transfer, involving the introgression of very few genes, usually appears as a treelike population history with some genes having a discordant history. Thus, the tree-like representation would be proper as long as introgression and hybrid speciation are rare or limited to closely related tips (of lineages). In some cases, evolutionary relantionships should be depicted better in the form of a network. [3]
Common descent is a concept in evolutionary biology applicable when one species is the ancestor of two or more species later in time. According to modern evolutionary biology, all living beings could be descendants of a unique ancestor commonly referred to as the last universal common ancestor (LUCA) of all life on Earth.
Cladistics is an approach to biological classification in which organisms are categorized in groups ("clades") based on hypotheses of most recent common ancestry. The evidence for hypothesized relationships is typically shared derived characteristics (synapomorphies) that are not present in more distant groups and ancestors. However, from an empirical perspective, common ancestors are inferences based on a cladistic hypothesis of relationships of taxa whose character states can be observed. Theoretically, a last common ancestor and all its descendants constitute a (minimal) clade. Importantly, all descendants stay in their overarching ancestral clade. For example, if the terms worms or fishes were used within a strict cladistic framework, these terms would include humans. Many of these terms are normally used paraphyletically, outside of cladistics, e.g. as a 'grade', which are fruitless to precisely delineate, especially when including extinct species. Radiation results in the generation of new subclades by bifurcation, but in practice sexual hybridization may blur very closely related groupings.
In biological phylogenetics, a clade, also known as a monophyletic group or natural group, is a grouping of organisms that are monophyletic – that is, composed of a common ancestor and all its lineal descendants – on a phylogenetic tree. In the taxonomical literature, sometimes the Latin form cladus is used rather than the English form. Clades are the fundamental unit of cladistics, a modern approach to taxonomy adopted by most biological fields.
A cladogram is a diagram used in cladistics to show relations among organisms. A cladogram is not, however, an evolutionary tree because it does not show how ancestors are related to descendants, nor does it show how much they have changed, so many differing evolutionary trees can be consistent with the same cladogram. A cladogram uses lines that branch off in different directions ending at a clade, a group of organisms with a last common ancestor. There are many shapes of cladograms but they all have lines that branch off from other lines. The lines can be traced back to where they branch off. These branching off points represent a hypothetical ancestor which can be inferred to exhibit the traits shared among the terminal taxa above it. This hypothetical ancestor might then provide clues about the order of evolution of various features, adaptation, and other evolutionary narratives about ancestors. Although traditionally such cladograms were generated largely on the basis of morphological characters, DNA and RNA sequencing data and computational phylogenetics are now very commonly used in the generation of cladograms, either on their own or in combination with morphology.
A phylogenetic tree, phylogeny or evolutionary tree is a graphical representation which shows the evolutionary history between a set of species or taxa during a specific time. In other words, it is a branching diagram or a tree showing the evolutionary relationships among various biological species or other entities based upon similarities and differences in their physical or genetic characteristics. In evolutionary biology, all life on Earth is theoretically part of a single phylogenetic tree, indicating common ancestry. Phylogenetics is the study of phylogenetic trees. The main challenge is to find a phylogenetic tree representing optimal evolutionary ancestry between a set of species or taxa. Computational phylogenetics focuses on the algorithms involved in finding optimal phylogenetic tree in the phylogenetic landscape.
Molecular phylogenetics is the branch of phylogeny that analyzes genetic, hereditary molecular differences, predominantly in DNA sequences, to gain information on an organism's evolutionary relationships. From these analyses, it is possible to determine the processes by which diversity among species has been achieved. The result of a molecular phylogenetic analysis is expressed in a phylogenetic tree. Molecular phylogenetics is one aspect of molecular systematics, a broader term that also includes the use of molecular data in taxonomy and biogeography.
A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used. Because nucleic acids are normally linear (unbranched) polymers, specifying the sequence is equivalent to defining the covalent structure of the entire molecule. For this reason, the nucleic acid sequence is also termed the primary structure.
Anagenesis is the gradual evolution of a species that continues to exist as an interbreeding population. This contrasts with cladogenesis, which occurs when there is branching or splitting, leading to two or more lineages and resulting in separate species. Anagenesis does not always lead to the formation of a new species from an ancestral species. When speciation does occur as different lineages branch off and cease to interbreed, a core group may continue to be defined as the original species. The evolution of this group, without extinction or species selection, is anagenesis.
In biology and genetic genealogy, the most recent common ancestor (MRCA), also known as the last common ancestor (LCA), of a set of organisms is the most recent individual from which all the organisms of the set are descended. The term is also used in reference to the ancestry of groups of genes (haplotypes) rather than organisms.
The last universal common ancestor (LUCA) is the hypothesized common ancestral cell from which the three domains of life, the Bacteria, the Archaea, and the Eukarya originated. It is suggested to have been a "cellular organism that had a lipid bilayer and used DNA, RNA, and protein". The LUCA has also been defined as "a hypothetical organism ancestral to all three domains". The LUCA is the point or stage at which the three domains of life diverged from preexisting forms of life. The nature of this point or stage of divergence remains a topic of research.
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a speciation event (orthologs), or a duplication event (paralogs), or else a horizontal gene transfer event (xenologs).
Evidence of common descent of living organisms has been discovered by scientists researching in a variety of disciplines over many decades, demonstrating that all life on Earth comes from a single ancestor. This forms an important part of the evidence on which evolutionary theory rests, demonstrates that evolution does occur, and illustrates the processes that created Earth's biodiversity. It supports the modern evolutionary synthesis—the current scientific theory that explains how and why life changes over time. Evolutionary biologists document evidence of common descent, all the way back to the last universal common ancestor, by developing testable predictions, testing hypotheses, and constructing theories that illustrate and describe its causes.
An internal node of a phylogenetic tree is described as a polytomy or multifurcation if (i) it is in a rooted tree and is linked to three or more child subtrees or (ii) it is in an unrooted tree and is attached to four or more branches. A tree that contains any multifurcations can be described as a multifurcating tree.
Computational phylogenetics, phylogeny inference, or phylogenetic inference focuses on computational and optimization algorithms, heuristics, and approaches involved in phylogenetic analyses. The goal is to find a phylogenetic tree representing optimal evolutionary ancestry between a set of genes, species, or taxa. Maximum likelihood, parsimony, Bayesian, and minimum evolution are typical optimality criteria used to assess how well a phylogenetic tree topology describes the sequence data. Nearest Neighbour Interchange (NNI), Subtree Prune and Regraft (SPR), and Tree Bisection and Reconnection (TBR), known as tree rearrangements, are deterministic algorithms to search for optimal or the best phylogenetic tree. The space and the landscape of searching for the optimal phylogenetic tree is known as phylogeny search space.
Human evolutionary genetics studies how one human genome differs from another human genome, the evolutionary past that gave rise to the human genome, and its current effects. Differences between genomes have anthropological, medical, historical and forensic implications and applications. Genetic data can provide important insights into human evolution.
Incomplete lineage sorting, also termed hemiplasy, deep coalescence, retention of ancestral polymorphism, or trans-species polymorphism, describes a phenomenon in population genetics when ancestral gene copies fail to coalesce into a common ancestral copy until deeper than previous speciation events. It is caused by lineage sorting of genetic polymorphisms that were retained across successive nodes in the species tree. In other words, the tree produced by a single gene differs from the population or species level tree, producing a discordant tree. Whatever the mechanism, the result is that a generated species level tree may differ depending on the selected genes used for assessment. This is in contrast to complete lineage sorting, where the tree produced by the gene is the same as the population or species level tree. Both are common results in phylogenetic analysis, although it depends on the gene, organism, and sampling technique.
The following outline is provided as an overview of and topical guide to evolution:
Reticulate evolution, or network evolution is the origination of a lineage through the partial merging of two ancestor lineages, leading to relationships better described by a phylogenetic network than a bifurcating tree. Reticulate patterns can be found in the phylogenetic reconstructions of biodiversity lineages obtained by comparing the characteristics of organisms. Reticulation processes can potentially be convergent and divergent at the same time. Reticulate evolution indicates the lack of independence between two evolutionary lineages. Reticulation affects survival, fitness and speciation rates of species.
Character evolution is the process by which a character or trait evolves along the branches of an evolutionary tree. Character evolution usually refers to single changes within a lineage that make this lineage unique from others. These changes are called character state changes and they are often used in the study of evolution to provide a record of common ancestry. Character state changes can be phenotypic changes, nucleotide substitutions, or amino acid substitutions. These small changes in a species can be identifying features of when exactly a new lineage diverged from an old one.
This glossary of genetics and evolutionary biology is a list of definitions of terms and concepts used in the study of genetics and evolutionary biology, as well as sub-disciplines and related fields, with an emphasis on classical genetics, quantitative genetics, population biology, phylogenetics, speciation, and systematics. Overlapping and related terms can be found in Glossary of cellular and molecular biology, Glossary of ecology, and Glossary of biology.