An allele [1] (or allelomorph) is a variant of the sequence of nucleotides at a particular location, or locus, on a DNA molecule. [2]
Alleles can differ at a single position through single nucleotide polymorphisms (SNP), [3] but they can also have insertions and deletions of up to several thousand base pairs. [4]
Most alleles observed result in little or no change in the function of the gene product it codes for. However, sometimes different alleles can result in different observable phenotypic traits, such as different pigmentation. A notable example of this is Gregor Mendel's discovery that the white and purple flower colors in pea plants were the result of a single gene with two alleles.
Nearly all multicellular organisms have two sets of chromosomes at some point in their biological life cycle; that is, they are diploid. For a given locus, if the two chromosomes contain the same allele, they, and the organism, are homozygous with respect to that allele. If the alleles are different, they, and the organism, are heterozygous with respect to those alleles.
Popular definitions of 'allele' typically refer only to different alleles within genes. For example, the ABO blood grouping is controlled by the ABO gene, which has six common alleles (variants). In population genetics, nearly every living human's phenotype for the ABO gene is some combination of just these six alleles. [5] [6]
The word "allele" is a short form of "allelomorph" ("other form", a word coined by British geneticists William Bateson and Edith Rebecca Saunders) in the 1900s, [7] [8] which was used in the early days of genetics to describe variant forms of a gene detected in different phenotypes and identified to cause the differences between them. It derives from the Greek prefix ἀλληλο-, allelo-, meaning "mutual", "reciprocal", or "each other", which itself is related to the Greek adjective ἄλλος, allos (cognate with Latin alius), meaning "other".
In many cases, genotypic interactions between the two alleles at a locus can be described as dominant or recessive, according to which of the two homozygous phenotypes the heterozygote most resembles. Where the heterozygote is indistinguishable from one of the homozygotes, the allele expressed is the one that leads to the "dominant" phenotype, [9] [10] and the other allele is said to be "recessive". The degree and pattern of dominance varies among loci. This type of interaction was first formally-described by Gregor Mendel. However, many traits defy this simple categorization and the phenotypes are modelled by co-dominance and polygenic inheritance. [11]
The term "wild type" allele is sometimes used to describe an allele that is thought to contribute to the typical phenotypic character as seen in "wild" populations of organisms, such as fruit flies ( Drosophila melanogaster ). Such a "wild type" allele was historically regarded as leading to a dominant (overpowering – always expressed), common, and normal phenotype, in contrast to "mutant" alleles that lead to recessive, rare, and frequently deleterious phenotypes. It was formerly thought that most individuals were homozygous for the "wild type" allele at most gene loci, and that any alternative "mutant" allele was found in homozygous form in a small minority of "affected" individuals, often as genetic diseases, and more frequently in heterozygous form in "carriers" for the mutant allele. It is now appreciated that most or all gene loci are highly polymorphic, with multiple alleles, whose frequencies vary from population to population, and that a great deal of genetic variation is hidden in the form of alleles that do not produce obvious phenotypic differences. Wild type alleles are often denoted by a superscript plus sign (i.e., p+ for an allele p). [12]
A population or species of organisms typically includes multiple alleles at each locus among various individuals. Allelic variation at a locus is measurable as the number of alleles (polymorphism) present, or the proportion of heterozygotes in the population. A null allele is a gene variant that lacks the gene's normal function because it either is not expressed, or the expressed protein is inactive.
For example, at the gene locus for the ABO blood type carbohydrate antigens in humans, [13] classical genetics recognizes three alleles, IA, IB, and i, which determine compatibility of blood transfusions. Any individual has one of six possible genotypes (IAIA, IAi, IBIB, IBi, IAIB, and ii) which produce one of four possible phenotypes: "Type A" (produced by IAIA homozygous and IAi heterozygous genotypes), "Type B" (produced by IBIB homozygous and IBi heterozygous genotypes), "Type AB" produced by IAIB heterozygous genotype, and "Type O" produced by ii homozygous genotype. (It is now known that each of the A, B, and O alleles is actually a class of multiple alleles with different DNA sequences that produce proteins with identical properties: more than 70 alleles are known at the ABO locus. [14] Hence an individual with "Type A" blood may be an AO heterozygote, an AA homozygote, or an AA heterozygote with two different "A" alleles.)
The frequency of alleles in a diploid population can be used to predict the frequencies of the corresponding genotypes (see Hardy–Weinberg principle). For a simple model, with two alleles;
where p is the frequency of one allele and q is the frequency of the alternative allele, which necessarily sum to unity. Then, p2 is the fraction of the population homozygous for the first allele, 2pq is the fraction of heterozygotes, and q2 is the fraction homozygous for the alternative allele. If the first allele is dominant to the second then the fraction of the population that will show the dominant phenotype is p2 + 2pq, and the fraction with the recessive phenotype is q2.
With three alleles:
In the case of multiple alleles at a diploid locus, the number of possible genotypes (G) with a number of alleles (a) is given by the expression:
A number of genetic disorders are caused when an individual inherits two recessive alleles for a single-gene trait. Recessive genetic disorders include albinism, cystic fibrosis, galactosemia, phenylketonuria (PKU), and Tay–Sachs disease. Other disorders are also due to recessive alleles, but because the gene locus is located on the X chromosome, so that males have only one copy (that is, they are hemizygous), they are more frequent in males than in females. Examples include red–green color blindness and fragile X syndrome.
Other disorders, such as Huntington's disease, occur when an individual inherits only one dominant allele.
While heritable traits are typically studied in terms of genetic alleles, epigenetic marks such as DNA methylation can be inherited at specific genomic regions in certain species, a process termed transgenerational epigenetic inheritance. The term epiallele is used to distinguish these heritable marks from traditional alleles, which are defined by nucleotide sequence. [15] A specific class of epiallele, the metastable epialleles, has been discovered in mice and in humans which is characterized by stochastic (probabilistic) establishment of epigenetic state that can be mitotically inherited. [16] [17]
The term "idiomorph", from Greek 'morphos' (form) and 'idio' (singular, unique), was introduced in 1990 in place of "allele" to denote sequences at the same locus in different strains that have no sequence similarity and probably do not share a common phylogenetic relationship. It is used mainly in the genetic research of mycology. [18] [19]
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a specific gene depends on the number of copies of each chromosome found in that species, also referred to as ploidy. In diploid species like humans, two full sets of chromosomes are present, meaning each individual has two alleles for any given gene. If both alleles are the same, the genotype is referred to as homozygous. If the alleles are different, the genotype is referred to as heterozygous.
Mendelian inheritance is a type of biological inheritance following the principles originally proposed by Gregor Mendel in 1865 and 1866, re-discovered in 1900 by Hugo de Vries and Carl Correns, and later popularized by William Bateson. These principles were initially controversial. When Mendel's theories were integrated with the Boveri–Sutton chromosome theory of inheritance by Thomas Hunt Morgan in 1915, they became the core of classical genetics. Ronald Fisher combined these ideas with the theory of natural selection in his 1930 book The Genetical Theory of Natural Selection, putting evolution onto a mathematical footing and forming the basis for population genetics within the modern evolutionary synthesis.
In genetics, dominance is the phenomenon of one variant (allele) of a gene on a chromosome masking or overriding the effect of a different variant of the same gene on the other copy of the chromosome. The first variant is termed dominant and the second is called recessive. This state of having two different variants of the same gene on each chromosome is originally caused by a mutation in one of the genes, either new or inherited. The terms autosomal dominant or autosomal recessive are used to describe gene variants on non-sex chromosomes (autosomes) and their associated traits, while those on sex chromosomes (allosomes) are termed X-linked dominant, X-linked recessive or Y-linked; these have an inheritance and presentation pattern that depends on the sex of both the parent and the child. Since there is only one copy of the Y chromosome, Y-linked traits cannot be dominant or recessive. Additionally, there are other forms of dominance, such as incomplete dominance, in which a gene variant has a partial effect compared to when it is present on both chromosomes, and co-dominance, in which different variants on each chromosome both show their associated traits.
In population genetics, the Hardy–Weinberg principle, also known as the Hardy–Weinberg equilibrium, model, theorem, or law, states that allele and genotype frequencies in a population will remain constant from generation to generation in the absence of other evolutionary influences. These influences include genetic drift, mate choice, assortative mating, natural selection, sexual selection, mutation, gene flow, meiotic drive, genetic hitchhiking, population bottleneck, founder effect,inbreeding and outbreeding depression.
Allele frequency, or gene frequency, is the relative frequency of an allele at a particular locus in a population, expressed as a fraction or percentage. Specifically, it is the fraction of all chromosomes in the population that carry that allele over the total population or sample size. Microevolution is the change in allele frequencies that occurs over time within a population.
In biology, polymorphism is the occurrence of two or more clearly different morphs or forms, also referred to as alternative phenotypes, in the population of a species. To be classified as such, morphs must occupy the same habitat at the same time and belong to a panmictic population.
The Punnett square is a square diagram that is used to predict the genotypes of a particular cross or breeding experiment. It is named after Reginald C. Punnett, who devised the approach in 1905. The diagram is used by biologists to determine the probability of an offspring having a particular genotype. The Punnett square is a tabular summary of possible combinations of maternal alleles with paternal alleles. These tables can be used to examine the genotypical outcome probabilities of the offspring of a single trait (allele), or when crossing multiple traits from the parents.
Disassortative mating is a mating pattern in which individuals with dissimilar phenotypes mate with one another more frequently than would be expected under random mating. Disassortative mating reduces the mean genetic similarities within the population and produces a greater number of heterozygotes. The pattern is character specific, but does not affect allele frequencies. This nonrandom mating pattern will result in deviation from the Hardy-Weinberg principle.
Balancing selection refers to a number of selective processes by which multiple alleles are actively maintained in the gene pool of a population at frequencies larger than expected from genetic drift alone. Balancing selection is rare compared to purifying selection. It can occur by various mechanisms, in particular, when the heterozygotes for the alleles under consideration have a higher fitness than the homozygote. In this way genetic polymorphism is conserved.
A heterozygote advantage describes the case in which the heterozygous genotype has a higher relative fitness than either the homozygous dominant or homozygous recessive genotype. Loci exhibiting heterozygote advantage are a small minority of loci. The specific case of heterozygote advantage due to a single locus is known as overdominance. Overdominance is a rare condition in genetics where the phenotype of the heterozygote lies outside of the phenotypical range of both homozygote parents, and heterozygous individuals have a higher fitness than homozygous individuals.
Overdominance is a phenomenon in genetics where the phenotype of the heterozygote lies outside the phenotypical range of both homozygous parents. Overdominance can also be described as heterozygote advantage regulated by a single genomic locus, wherein heterozygous individuals have a higher fitness than homozygous individuals. However, not all cases of the heterozygote advantage are considered overdominance, as they may be regulated by multiple genomic regions. Overdominance has been hypothesized as an underlying cause for heterosis.
Equine coat color genetics determine a horse's coat color. Many colors are possible, but all variations are produced by changes in only a few genes. Bay is the most common color of horse, followed by black and chestnut. A change at the agouti locus is capable of turning bay to black, while a mutation at the extension locus can turn bay or black to chestnut.
Genetic variation in populations can be analyzed and quantified by the frequency of alleles. Two fundamental calculations are central to population genetics: allele frequencies and genotype frequencies. Genotype frequency in a population is the number of individuals with a given genotype divided by the total number of individuals in the population. In population genetics, the genotype frequency is the frequency or proportion of genotypes in a population.
A null allele is a nonfunctional allele caused by a genetic mutation. Such mutations can cause a complete lack of production of the associated gene product or a product that does not function properly; in either case, the allele may be considered nonfunctional. A null allele cannot be distinguished from deletion of the entire locus solely from phenotypic observation.
Under the law of dominance in genetics, an individual expressing a dominant phenotype could contain either two copies of the dominant allele or one copy of each dominant and recessive allele. By performing a test cross, one can determine whether the individual is heterozygous or homozygous dominant.
Polar overdominance is a unique form of inheritance originally described in livestock, with relevant examples in humans and mice being discovered shortly after. The term polar is used to describe this type of overdominance because the phenotype of the heterozygote is more prevalent than the other genotypes. This polarity is shown as differential phenotype is only present in one of the heterozygote configurations when the recessive allele is inherited in a parent of origin type fashion. Polar overdominance differs from regular overdominance where both heterozygote genotypes display a phenotype that has increased fitness regardless of the parent of origin. Studying this type of inheritance could have practical applications in preventative medicine for humans as well as a variety of other agricultural applications.
In medical genetics, compound heterozygosity is the condition of having two or more heterogeneous recessive alleles at a particular locus that can cause genetic disease in a heterozygous state; that is, an organism is a compound heterozygote when it has two recessive alleles for the same gene, but with those two alleles being different from each other. Compound heterozygosity reflects the diversity of the mutation base for many autosomal recessive genetic disorders; mutations in most disease-causing genes have arisen many times. This means that many cases of disease arise in individuals who have two unrelated alleles, who technically are heterozygotes, but both the alleles are defective.
Lethal alleles are alleles that cause the death of the organism that carries them. They are usually a result of mutations in genes that are essential for growth or development. Lethal alleles can be recessive, dominant, conditional, perinatal, or postnatal after an extended period of apparently normal development depending on the gene or genes involved.
Marker assisted selection or marker aided selection (MAS) is an indirect selection process where a trait of interest is selected based on a marker linked to a trait of interest, rather than on the trait itself. This process has been extensively researched and proposed for plant- and animal- breeding.
Zygosity is the degree to which both copies of a chromosome or gene have the same genetic sequence. In other words, it is the degree of similarity of the alleles in an organism.