Identity by descent

Last updated

A DNA segment is identical by state (IBS) in two or more individuals if they have identical nucleotide sequences in this segment. An IBS segment is identical by descent (IBD) in two or more individuals if they have inherited it from a common ancestor without recombination, that is, the segment has the same ancestral origin in these individuals. DNA segments that are IBD are IBS per definition, but segments that are not IBD can still be IBS due to the same mutations in different individuals or recombinations that do not alter the segment.[ citation needed ]

Contents

The origin of IBD segments is depicted via a pedigree. Pedigree, recombination and resulting IBD segments, schematic representation.png
The origin of IBD segments is depicted via a pedigree.
A colorblind-friendly version of this image. Pedigree, recombination and resulting IBD segments, schematic representation modified.png
A colorblind-friendly version of this image.

Theory

All individuals in a finite population are related if traced back long enough and will, therefore, share segments of their genomes IBD. During meiosis segments of IBD are broken up by recombination. Therefore, the expected length of an IBD segment depends on the number of generations since the most recent common ancestor at the locus of the segment. The length of IBD segments that result from a common ancestor n generations in the past (therefore involving 2n meiosis) is exponentially distributed with mean 1/(2n) Morgans (M). [1] The expected number of IBD segments decreases with the number of generations since the common ancestor at this locus. For a specific DNA segment, the probability of being IBD decreases as 2−2n since in each meiosis the probability of transmitting this segment is 1/2. [2]

Applications

Identified IBD segments can be used for a wide range of purposes. As noted above the amount (length and number) of IBD sharing depends on the familial relationships between the tested individuals. Therefore, one application of IBD segment detection is to quantify relatedness. [3] [4] [5] [6] Measurement of relatedness can be used in forensic genetics, [7] but can also increase information in genetic linkage mapping [3] [8] and help to decrease bias by undocumented relationships in standard association studies. [6] [9] Another application of IBD is genotype imputation and haplotype phase inference. [10] [11] [12] Long shared segments of IBD, which are broken up by short regions may be indicative for phasing errors. [5] [13] :SI

IBD mapping

IBD mapping [3] is similar to linkage analysis, but can be performed without a known pedigree on a cohort of unrelated individuals. IBD mapping can be seen as a new form of association analysis that increases the power to map genes or genomic regions containing multiple rare disease susceptibility variants. [6] [14]

Using simulated data, Browning and Thompson showed that IBD mapping has higher power than association testing when multiple rare variants within a gene contribute to disease susceptibility. [14] Via IBD mapping, genome-wide significant regions in isolated populations as well as outbred populations were found while standard association tests failed. [11] [15] Houwen et al. used IBD sharing to identify the chromosomal location of a gene responsible for benign recurrent intrahepatic cholestasis in an isolated fishing population. [16] Kenny et al. also used an isolated population to fine-map a signal found by a genome-wide association study (GWAS) of plasma plant sterol (PPS) levels, a surrogate measure of cholesterol absorption from the intestine. [17] Francks et al. was able to identify a potential susceptibility locus for schizophrenia and bipolar disorder with genotype data of case-control samples. [18] Lin et al. found a genome-wide significant linkage signal in a dataset of multiple sclerosis patients. [19] Letouzé et al. used IBD mapping to look for founder mutations in cancer samples. [20]

An IBD segment identified by HapFABIA in Asian genomes. Rare single nucleotide variants (SNVs) that tag the IBD segment are coloured purple. Below the turquoise bar, the IBD segment in ancient genomes is displayed. IBD segment detected by HapFABIA in 1000Genomes.png
An IBD segment identified by HapFABIA in Asian genomes. Rare single nucleotide variants (SNVs) that tag the IBD segment are coloured purple. Below the turquoise bar, the IBD segment in ancient genomes is displayed.

IBD in population genetics

Detection of natural selection in the human genome is also possible via detected IBD segments. Selection will usually tend to increase the number of IBD segments among individuals in a population. By scanning for regions with excess IBD sharing, regions in the human genome that have been under strong, very recent selection can be identified. [21] [22]

In addition to that, IBD segments can be useful for measuring and identifying other influences on population structure. [6] [23] [24] [25] [26] Gusev et al. showed that IBD segments can be used with additional modeling to estimate demographic history including bottlenecks and admixture. [24] Using similar models Palamara et al. and Carmi et al. reconstructed the demographic history of Ashkenazi Jewish and Kenyan Maasai individuals. [25] [26] [27] Botigué et al. investigated differences in African ancestry among European populations. [28] Ralph and Coop used IBD detection to quantify the common ancestry of different European populations [29] and Gravel et al. similarly tried to draw conclusions of the genetic history of populations in the Americas. [30] Ringbauer et al. utilized geographic structure of IBD segments to estimate dispersal within Eastern Europe during the last centuries. [31] Using the 1000 Genomes data Hochreiter found differences in IBD sharing between African, Asian and European populations as well as IBD segments that are shared with ancient genomes like the Neanderthal or Denisova. [13]

Methods and software

Programs for the detection of IBD segments in unrelated individuals:

See also

Related Research Articles

A microsatellite is a tract of repetitive DNA in which certain DNA motifs are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. They have a higher mutation rate than other areas of DNA leading to high genetic diversity. Microsatellites are often referred to as short tandem repeats (STRs) by forensic geneticists and in genetic genealogy, or as simple sequence repeats (SSRs) by plant geneticists.

Genetic linkage is the tendency of DNA sequences that are close together on a chromosome to be inherited together during the meiosis phase of sexual reproduction. Two genetic markers that are physically near to each other are unlikely to be separated onto different chromatids during chromosomal crossover, and are therefore said to be more linked than markers that are far apart. In other words, the nearer two genes are on a chromosome, the lower the chance of recombination between them, and the more likely they are to be inherited together. Markers on different chromosomes are perfectly unlinked, although the penetrance of potentially deleterious alleles may be influenced by the presence of other alleles, and these other alleles may be located on other chromosomes than that on which a particular potentially deleterious allele is located.

<span class="mw-page-title-main">Single-nucleotide polymorphism</span> Single nucleotide in genomic DNA at which different sequence alternatives exist

In genetics and bioinformatics, a single-nucleotide polymorphism is a germline substitution of a single nucleotide at a specific position in the genome that is present in a sufficiently large fraction of considered population.

In population genetics, linkage disequilibrium (LD) is the non-random association of alleles at different loci in a given population. Loci are said to be in linkage disequilibrium when the frequency of association of their different alleles is higher or lower than expected if the loci were independent and associated randomly.

<span class="mw-page-title-main">Gene mapping</span> Process of locating specific genes

Gene mapping or genome mapping describes the methods used to identify the location of a gene on a chromosome and the distances between genes. Gene mapping can also describe the distances between different sites within a gene.

Haploview is a commonly used bioinformatics software which is designed to analyze and visualize patterns of linkage disequilibrium (LD) in genetic data. Haploview can also perform association studies, choosing tagSNPs and estimating haplotype frequencies. Haploview is developed and maintained by Dr. Mark Daly's lab at the MIT/Harvard Broad Institute.

<span class="mw-page-title-main">Genome-wide association study</span> Study of genetic variants in different individuals

In genomics, a genome-wide association study, is an observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWA studies typically focus on associations between single-nucleotide polymorphisms (SNPs) and traits like major human diseases, but can equally be applied to any other genetic variants and any other organisms.

Expression quantitative trait loci (eQTLs) are genomic loci that explain variation in expression levels of mRNAs.

In genetics, association mapping, also known as "linkage disequilibrium mapping", is a method of mapping quantitative trait loci (QTLs) that takes advantage of historic linkage disequilibrium to link phenotypes to genotypes, uncovering genetic associations.

Genetic studies of Jews are part of the population genetics discipline and are used to analyze the chronology of Jewish migration accompanied by research in other fields, such as history, linguistics, archaeology, and paleontology. These studies investigate the origins of various Jewish ethnic divisions. In particular, they examine whether there is a common genetic heritage among them. The medical genetics of Jews are studied for population-specific diseases.

Disease gene identification is a process by which scientists identify the mutant genotypes responsible for an inherited genetic disorder. Mutations in these genes can include single nucleotide substitutions, single nucleotide additions/deletions, deletion of the entire gene, and other genetic abnormalities.

Quantitative trait loci mapping or QTL mapping is the process of identifying genomic regions that potentially contain genes responsible for important economic, health or environmental characters. Mapping QTLs is an important activity that plant breeders and geneticists routinely use to associate potential causal genes with phenotypes of interest. Family-based QTL mapping is a variant of QTL mapping where multiple-families are used.

A sequence related amplified polymorphism (SRAP) is a molecular technique, developed by G. Li and C. F. Quiros in 2001, for detecting genetic variation in the open reading frames (ORFs) of genomes of plants and related organisms.

In genetics, haplotype estimation refers to the process of statistical estimation of haplotypes from genotype data. The most common situation arises when genotypes are collected at a set of polymorphic sites from a group of individuals. For example in human genetics, genome-wide association studies collect genotypes in thousands of individuals at between 200,000-5,000,000 SNPs using microarrays. Haplotype estimation methods are used in the analysis of these datasets and allow genotype imputation of alleles from reference databases such as the HapMap Project and the 1000 Genomes Project.

Imputation in genetics refers to the statistical inference of unobserved genotypes. It is achieved by using known haplotypes in a population, for instance from the HapMap or the 1000 Genomes Project in humans, thereby allowing to test for association between a trait of interest and experimentally untyped genetic variants, but whose genotypes have been statistically inferred ("imputed"). Genotype imputation is usually performed on SNPs, the most common kind of genetic variation.

SNV calling from NGS data is any of a range of methods for identifying the existence of single nucleotide variants (SNVs) from the results of next generation sequencing (NGS) experiments. These are computational techniques, and are in contrast to special experimental methods based on known population-wide single nucleotide polymorphisms. Due to the increasing abundance of NGS data, these techniques are becoming increasingly popular for performing SNP genotyping, with a wide variety of algorithms designed for specific experimental designs and applications. In addition to the usual application domain of SNP genotyping, these techniques have been successfully adapted to identify rare SNPs within a population, as well as detecting somatic SNVs within an individual using multiple tissue samples.

Mega2 allows the applied statistical geneticist to convert one's data from several input formats to a large number output formats suitable for analysis by commonly used software packages. In a typical human genetics study, the analyst often needs to use a variety of different software programs to analyze the data, and these programs usually require that the data be formatted to their precise input specifications. Conversion of one's data into these multiple different formats can be tedious, time-consuming, and error-prone. Mega2, by providing validated conversion pipelines, can accelerate the analyses while reducing errors.

PLINK is a free, commonly used, open-source whole-genome association analysis toolset designed by Shaun Purcell. The software is designed flexibly to perform a wide range of basic, large-scale genetic analyses.

In genetics, a haplotype block is a region of an organism's genome in which there is little evidence of a history of genetic recombination, and which contain only a small number of distinct haplotypes. According to the haplotype-block model, such blocks should show high levels of linkage disequilibrium and be separated from one another by numerous recombination events. The boundaries of haplotype blocks cannot be directly observed; they must instead be inferred indirectly through the use of algorithms. However, some evidence suggests that different algorithms for identifying haplotype blocks give very different results when used on the same data, though another study suggests that their results are generally consistent. The National Institutes of Health funded the HapMap project to catalog haplotype blocks throughout the human genome.

Sharon Ruth Browning is a statistical geneticist at the University of Washington, and a research professor with its Department of Biostatistics. Her research has various implications for the field of biogenetics.

References

  1. Browning, S. R. (2008). "Estimation of Pairwise Identity by Descent from Dense Genetic Marker Data in a Population Sample of Haplotypes". Genetics. 178 (4): 2123–2132. doi:10.1534/genetics.107.084624. PMC   2323802 . PMID   18430938.
  2. Thompson, E. A. (2008). "The IBD process along four chromosomes". Theoretical Population Biology. 73 (3): 369–373. doi:10.1016/j.tpb.2007.11.011. PMC   2518088 . PMID   18282591.
  3. 1 2 3 4 Albrechtsen, A.; Sand Korneliussen, T.; Moltke, I.; Van Overseem Hansen, T.; Nielsen, F. C.; Nielsen, R. (2009). "Relatedness mapping and tracts of relatedness for genome-wide data in the presence of linkage disequilibrium". Genetic Epidemiology. 33 (3): 266–274. doi:10.1002/gepi.20378. PMID   19025785. S2CID   12029712.
  4. Browning, S. R.; Browning, B. L. (2010). "High-Resolution Detection of Identity by Descent in Unrelated Individuals". The American Journal of Human Genetics. 86 (4): 526–539. doi:10.1016/j.ajhg.2010.02.021. PMC   2850444 . PMID   20303063.
  5. 1 2 3 Gusev, A.; Lowe, J. K.; Stoffel, M.; Daly, M. J.; Altshuler, D.; Breslow, J. L.; Friedman, J. M.; Pe'Er, I. (2008). "Whole population, genome-wide mapping of hidden relatedness". Genome Research. 19 (2): 318–326. doi:10.1101/gr.081398.108. PMC   2652213 . PMID   18971310.
  6. 1 2 3 4 5 Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M. A. R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P. I. W.; Daly, M. J.; Sham, P. C. (2007). "PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses". The American Journal of Human Genetics. 81 (3): 559–575. doi:10.1086/519795. PMC   1950838 . PMID   17701901.
  7. Ian W. Evett; Bruce S. Weir (January 1998). Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. Sinauer Associates, Incorporated. ISBN   978-0-87893-155-2.
  8. Leutenegger, A.; Prum, B.; Genin, E.; Verny, C.; Lemainque, A.; Clergetdarpoux, F.; Thompson, E. (2003). "Estimation of the Inbreeding Coefficient through Use of Genomic Data". The American Journal of Human Genetics. 73 (3): 516–523. doi:10.1086/378207. PMC   1180677 . PMID   12900793.
  9. Voight, B. F.; Pritchard, J. K. (2005). "Confounding from Cryptic Relatedness in Case-Control Association Studies". PLOS Genetics. 1 (3): e32. doi: 10.1371/journal.pgen.0010032 . PMC   1200427 . PMID   16151517.
  10. Kong, A.; Masson, G.; Frigge, M. L.; Gylfason, A.; Zusmanovich, P.; Thorleifsson, G.; Olason, P. I.; Ingason, A.; Steinberg, S.; Rafnar, T.; Sulem, P.; Mouy, M.; Jonsson, F.; Thorsteinsdottir, U.; Gudbjartsson, D. F.; Stefansson, H.; Stefansson, K. (2008). "Detection of sharing by descent, long-range phasing and haplotype imputation". Nature Genetics. 40 (9): 1068–1075. doi:10.1038/ng.216. PMC   4540081 . PMID   19165921.
  11. 1 2 Gusev, A.; Shah, M. J.; Kenny, E. E.; Ramachandran, A.; Lowe, J. K.; Salit, J.; Lee, C. C.; Levandowsky, E. C.; Weaver, T. N.; Doan, Q. C.; Peckham, H. E.; McLaughlin, S. F.; Lyons, M. R.; Sheth, V. N.; Stoffel, M.; De La Vega, F. M.; Friedman, J. M.; Breslow, J. L.; Pe'Er, I. (2011). "Low-Pass Genome-Wide Sequencing and Variant Inference Using Identity-by-Descent in an Isolated Human Population". Genetics. 190 (2): 679–689. doi:10.1534/genetics.111.134874. PMC   3276614 . PMID   22135348.
  12. Browning, B. L.; Browning, S. R. (2009). "A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals". The American Journal of Human Genetics. 84 (2): 210–223. doi:10.1016/j.ajhg.2009.01.005. PMC   2668004 . PMID   19200528.
  13. 1 2 3 Hochreiter, S. (2013). "HapFABIA: Identification of very short segments of identity by descent characterized by rare variants in large sequencing data". Nucleic Acids Research. 41 (22): e202. doi:10.1093/nar/gkt1013. PMC   3905877 . PMID   24174545.
  14. 1 2 Browning, S. R.; Thompson, E. A. (2012). "Detecting Rare Variant Associations by Identity-by-Descent Mapping in Case-Control Studies". Genetics. 190 (4): 1521–1531. doi:10.1534/genetics.111.136937. PMC   3316661 . PMID   22267498.
  15. 1 2 Gusev, A.; Kenny, E. E.; Lowe, J. K.; Salit, J.; Saxena, R.; Kathiresan, S.; Altshuler, D. M.; Friedman, J. M.; Breslow, J. L.; Pe'Er, I. (2011). "DASH: A Method for Identical-by-Descent Haplotype Mapping Uncovers Association with Recent Variation". The American Journal of Human Genetics. 88 (6): 706–717. doi:10.1016/j.ajhg.2011.04.023. PMC   3113343 . PMID   21620352.
  16. Houwen, R. H. J.; Baharloo, S.; Blankenship, K.; Raeymaekers, P.; Juyn, J.; Sandkuijl, L. A.; Freimer, N. B. (1994). "Genome screening by searching for shared segments: Mapping a gene for benign recurrent intrahepatic cholestasis". Nature Genetics. 8 (4): 380–386. doi:10.1038/ng1294-380. hdl: 1765/55192 . PMID   7894490. S2CID   8131209.
  17. Kenny, E. E.; Gusev, A.; Riegel, K.; Lutjohann, D.; Lowe, J. K.; Salit, J.; Maller, J. B.; Stoffel, M.; Daly, M. J.; Altshuler, D. M.; Friedman, J. M.; Breslow, J. L.; Pe'Er, I.; Sehayek, E. (2009). "Systematic haplotype analysis resolves a complex plasma plant sterol locus on the Micronesian Island of Kosrae". Proceedings of the National Academy of Sciences. 106 (33): 13886–13891. Bibcode:2009PNAS..10613886K. doi: 10.1073/pnas.0907336106 . PMC   2728990 . PMID   19667188.
  18. Francks, C.; Tozzi, F.; Farmer, A.; Vincent, J. B.; Rujescu, D.; St Clair, D.; Muglia, P. (2008). "Population-based linkage analysis of schizophrenia and bipolar case–control cohorts identifies a potential susceptibility locus on 19q13". Molecular Psychiatry. 15 (3): 319–325. doi: 10.1038/mp.2008.100 . hdl: 11858/00-001M-0000-0012-C935-9 . PMID   18794890.
  19. Lin, R.; Charlesworth, J.; Stankovich, J.; Perreau, V. M.; Brown, M. A.; Anzgene, B. V.; Taylor, B. V. (2013). Toland, Amanda Ewart (ed.). "Identity-by-Descent Mapping to Detect Rare Variants Conferring Susceptibility to Multiple Sclerosis". PLOS ONE. 8 (3): e56379. Bibcode:2013PLoSO...856379L. doi: 10.1371/journal.pone.0056379 . PMC   3589405 . PMID   23472070.
  20. Letouzé, E.; Sow, A.; Petel, F.; Rosati, R.; Figueiredo, B. C.; Burnichon, N.; Gimenez-Roqueplo, A. P.; Lalli, E.; De Reyniès, A. L. (2012). Mailund, Thomas (ed.). "Identity by Descent Mapping of Founder Mutations in Cancer Using High-Resolution Tumor SNP Data". PLOS ONE. 7 (5): e35897. Bibcode:2012PLoSO...735897L. doi: 10.1371/journal.pone.0035897 . PMC   3342326 . PMID   22567117.
  21. Albrechtsen, A.; Moltke, I.; Nielsen, R. (2010). "Natural Selection and the Distribution of Identity-by-Descent in the Human Genome". Genetics. 186 (1): 295–308. doi:10.1534/genetics.110.113977. PMC   2940294 . PMID   20592267.
  22. Han, L.; Abney, M. (2011). "Identity by descent estimation with dense genome-wide genotype data". Genetic Epidemiology. 35 (6): 557–567. doi:10.1002/gepi.20606. PMC   3587128 . PMID   21769932.
  23. Cockerham, C. C.; Weir, B. S. (1983). "Variance of actual inbreeding". Theoretical Population Biology. 23 (1): 85–109. doi:10.1016/0040-5809(83)90006-0. PMID   6857551.
  24. 1 2 Gusev, A.; Palamara, P. F.; Aponte, G.; Zhuang, Z.; Darvasi, A.; Gregersen, P.; Pe'Er, I. (2011). "The Architecture of Long-Range Haplotypes Shared within and across Populations". Molecular Biology and Evolution. 29 (2): 473–486. doi:10.1093/molbev/msr133. PMC   3350316 . PMID   21984068.
  25. 1 2 Palamara, P. F.; Lencz, T.; Darvasi, A.; Pe’Er, I. (2012). "Length Distributions of Identity by Descent Reveal Fine-Scale Demographic History". The American Journal of Human Genetics. 91 (5): 809–822. doi:10.1016/j.ajhg.2012.08.030. PMC   3487132 . PMID   23103233.
  26. 1 2 Palamara, P. F.; Pe'Er, I. (2013). "Inference of historical migration rates via haplotype sharing". Bioinformatics. 29 (13): i180–i188. doi:10.1093/bioinformatics/btt239. PMC   3694674 . PMID   23812983.
  27. Carmi, S.; Palamara, P. F.; Vacic, V.; Lencz, T.; Darvasi, A.; Pe'Er, I. (2013). "The Variance of Identity-by-Descent Sharing in the Wright-Fisher Model". Genetics. 193 (3): 911–928. arXiv: 1206.4745 . doi:10.1534/genetics.112.147215. PMC   3584006 . PMID   23267057.
  28. Botigue, L. R.; Henn, B. M.; Gravel, S.; Maples, B. K.; Gignoux, C. R.; Corona, E.; Atzmon, G.; Burns, E.; Ostrer, H.; Flores, C.; Bertranpetit, J.; Comas, D.; Bustamante, C. D. (2013). "Gene flow from North Africa contributes to differential human genetic diversity in southern Europe". Proceedings of the National Academy of Sciences. 110 (29): 11791–11796. Bibcode:2013PNAS..11011791B. doi: 10.1073/pnas.1306223110 . PMC   3718088 . PMID   23733930.
  29. Ralph, P.; Coop, G. (2013). Tyler-Smith, Chris (ed.). "The Geography of Recent Genetic Ancestry across Europe". PLOS Biology. 11 (5): e1001555. doi: 10.1371/journal.pbio.1001555 . PMC   3646727 . PMID   23667324.
  30. Gravel, S.; Zakharia, F.; Moreno-Estrada, A.; Byrnes, J. K.; Muzzio, M.; Rodriguez-Flores, J. L.; Kenny, E. E.; Gignoux, C. R.; Maples, B. K.; Guiblet, W.; Dutil, J.; Via, M.; Sandoval, K.; Bedoya, G.; 1000 Genomes, T. K.; Oleksyk, A.; Ruiz-Linares, E. G.; Burchard, J. C.; Martinez-Cruzado, C. D.; Bustamante, C. D. (2013). Williams, Scott M (ed.). "Reconstructing Native American Migrations from Whole-Genome and Whole-Exome Data". PLOS Genetics. 9 (12): e1004023. arXiv: 1306.4021 . doi: 10.1371/journal.pgen.1004023 . PMC   3873240 . PMID   24385924.{{cite journal}}: CS1 maint: numeric names: authors list (link)
  31. Ringbauer, Harald; Coop, Graham; Barton, Nicholas H. (2017-03-01). "Inferring Recent Demography from Isolation by Distance of Long Shared Sequence Blocks". Genetics. 205 (3): 1335–1351. doi:10.1534/genetics.116.196220. ISSN   0016-6731. PMC   5340342 . PMID   28108588.
  32. Naseri A, Liu X, Zhang S, Zhi D. Ultra-fast Identity by Descent Detection in Biobank-Scale Cohorts using Positional Burrows–Wheeler Transform BioRxiv 2017.
  33. Rodriguez JM, Batzoglou S, Bercovici S. An accurate method for inferring relatedness in large datasets of unphased genotypes via an embedded likelihood-ratio test. RECOMB 2013, LNBI 7821:212-229.
  34. Browning, B. L.; Browning, S. R. (2011). "A Fast, Powerful Method for Detecting Identity by Descent". The American Journal of Human Genetics. 88 (2): 173–182. doi:10.1016/j.ajhg.2011.01.010. PMC   3035716 . PMID   21310274.
  35. Browning, B. L.; Browning, S. R. (2013). "Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data". Genetics. 194 (2): 459–471. doi:10.1534/genetics.113.150029. PMC   3664855 . PMID   23535385.
  36. Browning, B. L.; Browning, S. R. (2013). "Detecting Identity by Descent and Estimating Genotype Error Rates in Sequence Data". The American Journal of Human Genetics. 93 (5): 840–851. doi:10.1016/j.ajhg.2013.09.014. PMC   3824133 . PMID   24207118.
  37. Moltke, I.; Albrechtsen, A.; Hansen, T. V. O.; Nielsen, F. C.; Nielsen, R. (2011). "A method for detecting IBD regions simultaneously in multiple individuals--with applications to disease genetics". Genome Research. 21 (7): 1168–1180. doi:10.1101/gr.115360.110. PMC   3129259 . PMID   21493780.
  38. He, D. (2013). "IBD-Groupon: An efficient method for detecting group-wise identity-by-descent regions simultaneously in multiple individuals based on pairwise IBD relationships". Bioinformatics. 29 (13): i162–i170. doi:10.1093/bioinformatics/btt237. PMC   3694672 . PMID   23812980.