Sorenson Molecular Genealogy Foundation

Last updated

The Sorenson Molecular Genealogy Foundation (SMGF) was an independent DNA and genealogical research institution with the goal of demonstrating how the peoples of the world are related. SMGF collected DNA samples and genealogical information from individuals across the globe to establish these connections.

Contents

History

SMGF originated in 1999 and was inspired by several conversations between inventor and philanthropist James LeVoy Sorenson and Brigham Young University professor Dr. Scott Woodward. [1] Mr. Sorenson envisioned the development of a genetic-genealogical blueprint of all humankind. This blueprint, as developed by Dr. Woodward, would show how closely humans are related to one another, demonstrating the familial relationships between unique individuals. The collection of DNA samples and associated family pedigrees began in 2000, beginning first with Brigham Young University students, and quickly branching outside of Utah, then to the rest of the world.

In 2003, SMGF moved its operations from the Brigham Young University campus to Salt Lake City. SMGF also outsourced all of its laboratory work to Sorenson Genomics, [2] freeing researchers to create a publicly available online repository. This repository was known as the Sorenson Database.

SMGF collected more than 100,000 DNA samples and familial pedigrees from donors around the world, all of which were available in the Sorenson Database. This database was acquired by Ancestry.com prior to its launch of AncestryDNA when they purchased GeneTree in 2012. [3] Genetree was SMGF's commercial arm. [4]

Sorenson Database

The Sorenson Database contained more than 100,000 DNA samples and familial pedigrees, encompassing 2.8 million genealogical records and 2.4 million genotypes.

The Sorenson Database matched genetic information with familial surnames. A person searching through the database was able to find both genetic and genealogical matches, but could search using either DNA haplotypes or familial surname. The average family pedigree contained six generations of pedigree information (more than 150 years) with over 6 million genetically linked ancestors.

There were only two areas for individuals to search within the Sorenson Database, a Y chromosome database and a mitochondrial DNA database. The Y-DNA database allowed males to search their direct paternal line; the mtDNA database allowed both males and females search their direct maternal line. According to ISOGG.org, Sorenson "also had large X chromosome and autosomal STR databases, but these databases were never made publicly available." [5]

Y-chromosome Database:

The Y chromosome is passed on from father to son with little changes to Y-DNA. Surnames also tend to generally be passed on from father to son. The Sorenson Database used Y-chromosome genetic markers and surnames to determine common ancestors. Users could share DNA results and pedigree charts obtained from other sources. They could search for ancestors using their surname, genetic marker values obtained through their DNA analysis, or a combination of the two. The database also had genetic marker default values if users did not have their own genetic information.

A Y-DNA test analyzes similarities in Y-chromosome DNA markers, corresponding to specific locations on the Y chromosome. Markers have a Short tandem repeat (STR) that involve repeats of 2–5 DNA base pairs. At Sorenson labs, the number of STRs in a 36-marker haplotype was compared between the sample given and the database to determine relatedness. The Single-nucleotide polymorphism (SNP) was also examined. Y-chromosome SNPs are rare and can be used to separate populations of men or haplogroups and can be used in population studies. [6]

Mitochondrial Database:

Both males and females inherit their mitochondrial DNA (mtDNA) from their mother, which allows mtDNA to determine a maternal line. The Sorenson Database compared mitochondrial DNA from over 76, 000 people in their mitochondrial database to find cousins connected across generations. Users could share their mtDNA results from other commercial labs.

The mtDNA test examines three regions of the D-loop of mtDNA: HVR1, HVR2 and HVR3 which is called the Hypervariable region. Mutations in these areas of mtDNA are extremely rare and have likely not changed between maternal-line cousins within the last 500 years. Therefore, looking at mutations in these areas can determine maternal lineages. Samples of mtDNA submitted to the Sorenson Molecular Genealogy Foundation were compared to the Cambridge Reference Sequence (CRS) of mtDNA then to each other for relatedness.

Participation

One of the perhaps overly ambitious missions of SMGF was to connect any two individuals in the world, showing to both how they are related. In order to accomplish this goal, SMGF collected DNA samples and correlated familial pedigrees from participants around the world. [7] After reaching their 100,000 sample goal, SMGF modified their goals to focus less on collecting new samples and more on analysis of previous samples.

New participants were still accepted through SMGF's subsidiary, GeneTree for many years. Also, previous SMGF participants could buy their DNA report through GeneTree.com. [8]

Another goal of SMGF was to advance research in the scientific field of molecular genealogy. SMGF conducted research to identify additional DNA markers which link an individual to their genealogical family, however direct or distant. SMGF researchers published a number of research papers, [9] many of which involved collaborations with other researchers.

DNA collection for the project initially involved drawing blood samples; since August 2002, the SMGF used a mouthwash-based collection method (GenetiRinse), which has the advantage that it can be mailed to participants living in remote areas.

Controversy and Shutdown

In May 2015, Ancestry.com took down the SMGF website, citing that "the site has been used for purposes other than what was intended, forcing [Ancestry.com] to cease operations on the site." [10] The action by Ancestry.com was taken in response to news revelations claiming that the SMGF database had been used by Ancestry.com and local law enforcement, without any court oversight [e.g., a warrant] and in violation of SMGF's privacy policy, to try to tie criminal suspects to crime scenes. [11] Although the actions in the case by all parties, particularly Ancestry.com and the local law enforcement, were eventually recognized to have followed proper constitutional procedures of privacy protection and court oversight in the case, the scandal significantly impacted Ancestry.com's reputation and led to the decision to deactivate the website and mothball the SMGF database.

Related Research Articles

<span class="mw-page-title-main">Haplotype</span> Group of genes from one parent

A haplotype is a group of alleles in an organism that are inherited together from a single parent.

In biology and genetic genealogy, the most recent common ancestor (MRCA), also known as the last common ancestor (LCA) or concestor, of a set of organisms is the most recent individual from which all the organisms of the set are descended. The term is also used in reference to the ancestry of groups of genes (haplotypes) rather than organisms.

Genetic genealogy is the use of genealogical DNA tests, i.e., DNA profiling and DNA testing, in combination with traditional genealogical methods, to infer genetic relationships between individuals. This application of genetics came to be used by family historians in the 21st century, as DNA tests became affordable. The tests have been promoted by amateur groups, such as surname study groups or regional genealogical groups, as well as research projects such as the Genographic Project.

The Genographic Project, launched on 13 April 2005 by the National Geographic Society and IBM, was a genetic anthropological study that aimed to map historical human migrations patterns by collecting and analyzing DNA samples. The final phase of the project was Geno 2.0 Next Generation. Upon retirement of the site, 1,006,542 participants in over 140 countries had joined the project.

A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation that can be observed. A genetic marker may be a short DNA sequence, such as a sequence surrounding a single base-pair change, or a long one, like minisatellites.

A genealogical DNA test is a DNA-based test used in genetic genealogy that looks at specific locations of a person's genome in order to find or verify ancestral genealogical relationships, or to estimate the ethnic mixture of an individual. Since different testing companies use different ethnic reference groups and different matching algorithms, ethnicity estimates for an individual vary between tests, sometimes dramatically.

A surname DNA project is a genetic genealogy project which uses genealogical DNA tests to trace male lineage.

In genetic genealogy, a unique-event polymorphism (UEP) is a genetic marker that corresponds to a mutation that is likely to occur so infrequently that it is believed overwhelmingly probable that all the individuals who share the marker, worldwide, will have inherited it from the same common ancestor, and the same single mutation event.

<span class="mw-page-title-main">Family Tree DNA</span> Commercial genetic testing company

FamilyTreeDNA is a division of Gene by Gene, a commercial genetic testing company based in Houston, Texas. FamilyTreeDNA offers analysis of autosomal DNA, Y-DNA, and mitochondrial DNA to individuals for genealogical purpose. With a database of more than two million records, it is the most popular company worldwide for Y-DNA and mitochondrial DNA, and the fourth most popular for autosomal DNA. In Europe, it is the most common also for autosomal DNA. FamilyTreeDNA as a division of Gene by Gene were acquired by MYDNA, Inc., an Australian company in January 2021.

The genetic history of the British Isles is the subject of research within the larger field of human population genetics. It has developed in parallel with DNA testing technologies capable of identifying genetic similarities and differences between both modern and ancient populations. The conclusions of population genetics regarding the British Isles in turn draw upon and contribute to the larger field of understanding the history of the human occupation of the area, complementing work in linguistics, archaeology, history and genealogy.

Haplogroup G-FGC7535, also known as Haplogroup G2a1, is a Y-chromosome haplogroup. It is an immediate descendant of G2a (G-P15), which is a primary branch of haplogroup G2 (P287).

In human genetics, Haplogroup G-P303 is a Y-chromosome haplogroup. It is a branch of haplogroup G (Y-DNA) (M201). In descending order, G-P303 is additionally a branch of G2 (P287), G2a (P15), G2a2, G2a2b, G2a2b2, and finally G2a2b2a. This haplogroup represents the majority of haplogroup G men in most areas of Europe west of Russia and the Black Sea. To the east, G-P303 is found among G persons across the Middle East, Iran, the southern Caucasus area, China, and India. G-P303 exhibits its highest diversity in the Levant.

GeneTree was a family history website focused on using DNA testing to trace ancestry. A website account was free, and within their account users could order DNA tests, enter results from other testing companies, search the DNA database, create an online family tree, and correspond with family members – including sharing pictures.

<span class="mw-page-title-main">International Society of Genetic Genealogy</span>

The International Society of Genetic Genealogy (ISOGG) is an independent non-commercial nonprofit organization of genetic genealogists run by volunteers. It was founded by a group of surname DNA project administrators in 2005 to promote DNA testing for genealogy. It advocates the use of genetics in genealogical research, provides educational resources for genealogists interested in DNA testing, and facilitates networking among genetic genealogists. As of June 2013, it comprises over 8,000 members in 70 countries. As of July 2013, regional meetings are coordinated by 20 volunteer regional coordinators located in the United States, Australia, Brazil, Canada, England, Egypt, Ireland and Russia.

Population genetics is a scientific discipline which contributes to the examination of the human evolutionary and historical migrations. Particularly useful information is provided by the research of two uniparental markers within our genome, the Y-chromosome (Y-DNA) and mitochondrial DNA (mtDNA), as well as autosomal DNA. The data from Y-DNA and autosomal DNA suggests that the Croats mostly are descendants of the Slavs from medieval migration period, according to mtDNA have genetic diversity which fits within a broader European maternal genetic landscape, and overall have a uniformity with other South Slavs from the territory of former Yugoslavia.

As with all modern European nations, a large degree of 'biological continuity' exists between Bosnians and Bosniaks and their ancient predecessors with Y chromosomal lineages testifying to predominantly Paleolithic European ancestry. Studies based on bi-allelic markers of the NRY have shown the three main ethnic groups of Bosnia and Herzegovina to share, in spite of some quantitative differences, a large fraction of the same ancient gene pool distinct for the region. Analysis of autosomal STRs have moreover revealed no significant difference between the population of Bosnia and Herzegovina and neighbouring populations.

The rape and murder of Angie Dodge was a cold case in Idaho Falls, Idaho, that remained open from the crime's occurrence on June 13, 1996, until May 2019. For 20 years, a falsely accused Chris Tapp served time in prison for Dodge's rape and murder while authorities searched for suspects that matched DNA left at the crime scene. In 2017, Tapp's rape conviction was repealed and he was released from prison. Over the course of Tapp's sentence, authorities continued searching for remaining suspects, partially at the insistence of the murdered girl's mother, and in 2014 accused Michael Usry Jr. of rape and murder. Authorities identified Usry as a suspect through Y-chromosome familial searching, a process by which partial DNA matches to relatives are used to identify an individual. After conducting a DNA test, authorities discovered that Usry did not match the DNA found at the crime scene. In 2019, authorities used autosomal familial searching to find and convict Brian Leigh Dripps Sr., who was found to have a full genetic match and confessed to the crime after interrogation.

References

  1. Hart, John L. (April 24, 2004). "Unlocking DNA". Church News . Retrieved July 26, 2011.
  2. "About Us". Sorenson Genomics.
  3. Moore, CeCe. "Ancestry.com Buys GeneTree (Another Competitor) and Launches Their New Autosomal DNA Product to Subscribers". Your Genetic Genealogist.
  4. "Ancestry.com Launches AncestryDNA". Ancestry Insider.
  5. "Sorenson Molecular Genealogy Foundation". ISOGG Wiki. International Society of Genetic Genealogy.
  6. "Simple Fool's Guide to Population Genomics". Stanford University – Palumbi Lab. Archived from the original on February 24, 2020.
  7. Boyle, Alan (July 3, 2011). "Africans visit their American cousins". MSNBC . Retrieved May 19, 2017.
  8. "Sorenson Molecular Genealogy Foundation's Genetic Genealogy Database Crosses Historic Milestone with 100,000 DNA Samples, Aided by Multi-Million Dollar Gift from Founder" (Press release). Sorenson Molecular Genealogy Foundation. April 28, 2010. Retrieved May 19, 2017.
  9. "New World Had at Least 15 'Founding Mothers' Says Latest Study from Sorenson Molecular Genealogy Foundation and University of Pavia, Italy". Business Wire (Press release). Retrieved May 19, 2017.
  10. Estes, Roberta (May 15, 2015). "RIP Sorenson – A crushing loss". DNAeXplained. Retrieved May 4, 2016.
  11. Lynch, Jennifer (May 1, 2015). "How Private DNA Data Led Idaho Cops on a Wild Goose Chase and Linked an Innocent Man to a 20-year-old Murder Case". Electronic Frontier Foundation . Retrieved May 4, 2016.

Official sites

News articles