Coefficient of relationship

The coefficient of relationship is a measure of the degree of consanguinity (or biological relationship) between two individuals. The term coefficient of relationship was defined by Sewall Wright in 1922, and was derived from his definition of the coefficient of inbreeding of 1921. The measure is most commonly used in genetics and genealogy. A coefficient of inbreeding can be calculated for an individual, and is typically one-half the coefficient of relationship between the parents.

In general, the higher the level of inbreeding, the closer the coefficient of relationship between the parents approaches a value of 1 (often expressed as a percentage), [lower-alpha 1] whereas it approaches 0 for individuals with arbitrarily remote common ancestors.

Coefficient of relationship

The coefficient of relationship ($r_{BC}$) between two individuals B and C is obtained by a summation of coefficients calculated for every line by which they are connected to their common ancestors. Each such line connects the two individuals via a common ancestor, passing through no individual which is not a common ancestor more than once. A path coefficient between an ancestor A and an offspring O separated by $n$ generations is given as:

$$p_{AO} = 2^{-n} \sqrt{\frac{1 + f_A}{1 + f_O}}$$

where $f_A$ and $f_O$ are the coefficients of inbreeding for A and O, respectively.

The coefficient of relationship $r_{BC}$ is now obtained by summing over all path coefficients:

$$r_{BC} = \sum p_{AB} \cdot p_{AC}$$

By assuming that the pedigree can be traced back to a sufficiently remote population of perfectly random-bred stock ($f_A = 0$ for all A in the sum), the definition of $r$ may be simplified to

$$r_{BC} = \sum_{p} 2^{-L(p)}$$

where $p$ enumerates all paths connecting B and C with unique common ancestors (i.e. all paths terminate at a common ancestor and may not pass through a common ancestor to a common ancestor's ancestor), and $L(p)$ is the length of the path $p$.
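The simplified sum over paths can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a reference implementation: the pedigree is a hypothetical dictionary mapping each individual to a tuple of known parents, nobody in it is inbred (so the simplified formula applies), and the helper names `ancestor_paths` and `relationship` are invented for this example.

```python
from fractions import Fraction

def ancestor_paths(pedigree, start):
    """Yield every upward chain [start, parent, grandparent, ...]."""
    stack = [[start]]
    while stack:
        path = stack.pop()
        yield path
        for parent in pedigree.get(path[-1], ()):
            stack.append(path + [parent])

def relationship(pedigree, b, c):
    """Sum 2**-L(p) over all paths joining b and c through a common ancestor.

    Assumes a non-inbred pedigree (all inbreeding coefficients are 0).
    """
    total = Fraction(0)
    for up_b in ancestor_paths(pedigree, b):
        for up_c in ancestor_paths(pedigree, c):
            if up_b[-1] != up_c[-1]:
                continue  # the two chains do not end at the same ancestor
            if set(up_b[:-1]) & set(up_c[:-1]):
                continue  # chains share someone below the apex: not a valid path
            links = (len(up_b) - 1) + (len(up_c) - 1)  # path length L(p)
            total += Fraction(1, 2 ** links)
    return total

# Hypothetical pedigree: B and C are first cousins via grandparents G1 and G2.
cousins = {
    "P1": ("G1", "G2"), "P2": ("G1", "G2"),
    "B": ("P1", "S1"), "C": ("P2", "S2"),
}
print(relationship(cousins, "P1", "P2"))  # full siblings: 1/2
print(relationship(cousins, "B", "C"))    # first cousins: 1/8
```

Exact fractions are used so that the results can be compared directly with the usual tabulated values. Enumerating every chain is exponential in pedigree depth, so the sketch is only suitable for small family trees.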

To give an (artificial) example: assuming that two individuals share the same 32 ancestors of $n = 5$ generations ago, but do not have any common ancestors at four or fewer generations ago, their coefficient of relationship would be

$$r = 2^{n} \cdot 2^{-2n} = 2^{-n},$$

which for $n = 5$ is $2^{-5} = 1/32$, or approximately 0.0313 (about 3%).

Individuals for which the same situation applies for their 1024 ancestors of ten generations ago would have a coefficient of $r = 2^{-10} \approx 0.1\%$. It follows that the value of r can be given to an accuracy of a few percent if the family tree of both individuals is known for a depth of five generations, and to an accuracy of a tenth of a percent if the known depth is at least ten generations. The contribution to r from common ancestors of 20 generations ago (corresponding to roughly 500 years in human genealogy, or the contribution from common descent from a medieval population) falls below one part per million.
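The arithmetic behind these figures can be checked with a short sketch. It assumes the artificial scenario described above (all 2^n ancestors of generation n are shared, and there are no closer common ancestors); the function name `r_from_shared_generation` is made up for this illustration.

```python
from fractions import Fraction

def r_from_shared_generation(n):
    """r when all 2**n ancestors of generation n are shared.

    Each shared ancestor contributes one path of length 2n.
    """
    paths = 2 ** n
    return paths * Fraction(1, 2 ** (2 * n))

for n in (5, 10, 20):
    r = r_from_shared_generation(n)
    print(f"n = {n:2d}: r = {r} (about {float(r):.2e})")
```

For n = 5 this gives 1/32, for n = 10 it gives 1/1024, and for n = 20 it gives 1/1048576, which is just under one part per million.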

Human relationships

Diagram of common family relationships, where the area of each colored circle is scaled according to the coefficient of relatedness. All relatives of the same relatedness are included together in one of the gray ellipses. Legal degrees of relationship can be found by counting the number of solid-line connections between the self and a relative.

The coefficient of relationship is sometimes used to express degrees of kinship in numeric terms in human genealogy.

In human relationships, the value of the coefficient of relationship is usually calculated based on the knowledge of a full family tree extending to a comparatively small number of generations, perhaps of the order of three or four. As explained above, the value for the coefficient of relationship so calculated is thus a lower bound, with an actual value that may be up to a few percent higher. The value is accurate to within 1% if the full family tree of both individuals is known to a depth of seven generations. [lower-alpha 3]

Most incest laws concern relationships where r = 25% or higher, although many ignore the rare case of double first cousins. Some jurisdictions also prohibit sexual relations or marriage between cousins of various degrees, or between individuals related only through adoption or affinity. Whether there is any likelihood of conception is generally considered irrelevant.

Kinship coefficient

The kinship coefficient is a simple measure of relatedness, defined as the probability that a pair of randomly sampled homologous alleles are identical by descent. [1] More simply, it is the probability that an allele selected randomly from an individual, i, and an allele selected at the same autosomal locus from another individual, j, are identical and from the same ancestor.

Relationship                                            Kinship coefficient
Individual-self                                         1/2
full sister / full brother                              1/4
mother / father / daughter / son                        1/4
grandmother / grandfather / granddaughter / grandson    1/8
aunt / uncle / niece / nephew                           1/8
first cousin                                            1/16
half-sister / half-brother                              1/8

Several of the most common family relationships and their corresponding kinship coefficients.

The coefficient of relatedness is equal to twice the kinship coefficient. [2]
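As a quick illustration of this factor of two, the tabulated kinship coefficients can be doubled to recover the familiar relatedness values. The dictionary `KINSHIP` and the function `relatedness` are names chosen for this sketch, and the values assume non-inbred individuals.

```python
from fractions import Fraction

# Kinship coefficients for common relationships (non-inbred individuals).
KINSHIP = {
    "self": Fraction(1, 2),
    "full sibling": Fraction(1, 4),
    "parent / child": Fraction(1, 4),
    "grandparent / grandchild": Fraction(1, 8),
    "aunt / uncle / niece / nephew": Fraction(1, 8),
    "first cousin": Fraction(1, 16),
    "half-sibling": Fraction(1, 8),
}

def relatedness(kinship):
    """Coefficient of relatedness as twice the kinship coefficient."""
    return 2 * kinship

for name, phi in KINSHIP.items():
    print(f"{name}: kinship {phi}, relatedness {relatedness(phi)}")
```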

Calculation

The kinship coefficient between two individuals, i and j, is represented as $\Phi_{ij}$. The kinship coefficient between a non-inbred individual and itself, $\Phi_{ii}$, is equal to 1/2. This is because humans are diploid, meaning the only way for the randomly chosen alleles to be identical by descent is if the same allele is chosen twice (probability 1/2). Similarly, the kinship between a parent and a child is found from the chance that the randomly picked allele in the child is from the parent (probability 1/2) and the probability that the allele picked from the parent is the same one passed to the child (probability 1/2). Since these two events are independent of each other, their probabilities are multiplied: $\Phi_{ij} = 1/2 \times 1/2 = 1/4$. [3] [4]
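The same reasoning extends to arbitrary pedigrees through a commonly used recursion, which is not spelled out in the text above: the kinship of an individual with itself is (1 + kinship of its parents) / 2, and the kinship of two different individuals is the average of the kinships between one individual's parents and the other individual, always expanding the individual that sits deeper in the pedigree. The sketch below assumes a hypothetical pedigree dictionary in which every non-founder has exactly two known parents and founders are unrelated and non-inbred; `make_kinship` is an invented helper name.

```python
from fractions import Fraction
from functools import lru_cache

def make_kinship(pedigree):
    """Return a function phi(i, j) for a pedigree {child: (parent1, parent2)}."""

    @lru_cache(maxsize=None)
    def depth(x):
        # Founders (no recorded parents) have depth 0.
        parents = pedigree.get(x)
        return 0 if not parents else 1 + max(depth(p) for p in parents)

    @lru_cache(maxsize=None)
    def phi(i, j):
        if i == j:
            parents = pedigree.get(i)
            if not parents:
                return Fraction(1, 2)  # non-inbred founder
            return Fraction(1, 2) * (1 + phi(*parents))
        if depth(i) < depth(j):
            i, j = j, i  # expand the deeper individual, never an ancestor of j
        parents = pedigree.get(i)
        if not parents:
            return Fraction(0)  # two distinct founders are assumed unrelated
        return Fraction(1, 2) * (phi(parents[0], j) + phi(parents[1], j))

    return phi

# Hypothetical pedigree: B and C are first cousins.
family = {
    "P1": ("G1", "G2"), "P2": ("G1", "G2"),
    "B": ("P1", "S1"), "C": ("P2", "S2"),
}
phi = make_kinship(family)
print(phi("B", "B"))    # self: 1/2
print(phi("P1", "B"))   # parent and child: 1/4
print(phi("P1", "P2"))  # full siblings: 1/4
print(phi("B", "C"))    # first cousins: 1/16
```

Expanding the deeper individual guarantees that the recursion never steps from an ancestor to its own descendant, which is the condition under which the averaging rule holds.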

Notes

  1. Strictly speaking, r = 1 for clones and identical twins, but since the definition of r is usually intended to estimate the suitability of two individuals for breeding, they are typically taken to be of opposite sex.
  2. For instance, one's sibling connects to one's parent, which connects to one's self (2 lines) while one's aunt/uncle connects to one's grandparent, which connects to one's parent, which connects to one's self (3 lines).
  3. A full family tree of seven generations (128 paths to ancestors of the 7th degree) is unreasonable even for members of high nobility. For example, the family tree of Queen Elizabeth II is fully known for a depth of six generations, but becomes difficult to trace in the seventh generation.

References

  1. Lange, Kenneth (2003). Mathematical and statistical methods for genetic analysis. Springer. p. 81. ISBN 978-0-387-21750-5.
  2. Wright, Sewall (1921). "Systems of Mating" (PDF). Genetics. 6 (2): 111–178. doi:10.1093/genetics/6.2.111. PMC 1200510. PMID 17245958.
  3. Lange, Kenneth (2003). Mathematical and statistical methods for genetic analysis. Springer. pp. 81–83.
  4. Jacquard, Albert (1974). The genetic structure of populations. Springer-Verlag. ISBN 978-3-642-88415-3.
