The Personal Genome Project (PGP) is a long term, large cohort study which aims to sequence and publicize the complete genomes and medical records of 100,000 volunteers, in order to enable research into personal genomics and personalized medicine. It was initiated by Harvard University's George M. Church in 2005. [1] [2] [3] As of November 2017, more than 10,000 volunteers had joined the project. Volunteers were accepted initially if they were permanent residents of the US and were able to submit tissue and/or genetic samples. Later the project was expanded to other countries.[ citation needed ]
The Project was initially launched in the US in 2005 [1] and later extended to Canada (2012), [4] United Kingdom (2013), [5] Austria (2014), [6] Korea (2015) [7] and China (2017). [8]
The project allowed participants to publish the genotype (the full DNA sequence of all 46 chromosomes) of the volunteers, along with extensive information about their phenotype: medical records, various measurements, MRI images, etc. All data were placed within the public domain and made available over the Internet so that researchers could test various hypotheses about the relationships among genotype, environment and phenotype. Participants could decide what data they are comfortable to publish publicly and could choose to upload additional data or remove existing data at their own convenience.[ citation needed ]
An important part of the project was the exploration of the resulting risks to the participants, such as possible discrimination by insurers and employers if the genome shows a predisposition for certain diseases.[ citation needed ]
The PGP is establishing an international network of sites, including the United States (Harvard PGP), Canada (University of Toronto / Hospital for Sick Kids), and other countries that adhere to certain "conforming implementation" criteria such as no promise of anonymity and data return. [9] The Harvard Medical School Institutional Review Board requested that the first set of volunteers include the principal investigator George Church and other diverse stakeholders in the scientific, medical, and social implications of personal genomes, because they were well positioned to give highly informed consent. As sequencing technology becomes cheaper, and the societal issues mentioned above are worked out, it was hoped that a large number of volunteers from all walks of life would participate. The long-term goal was that every person have access to his or her genotype to be used for personalized medical decisions.[ citation needed ]
The first ten volunteers were referred to as the "PGP-10". These volunteers were:
In order to enroll, each participant must pass a series of short online tests to ensure that they are providing informed consent. [11] By 2012, 2000 participants had enrolled [12] and by November 2017 10,000 had joined the project. [8]
In July 2014, at the 'Genetics, Genomics and Global Health—Inequalities, Identities and Insecurities' conference, Stephan Beck, the head of the UK arm of this project indicated that they had over 1000 volunteers, and had temporarily paused collection data due to lack of funding. As of November 2016, the pause was still in effect. [13]
Since 2016, participants of the PGP could choose to obtain their whole-genome sequenced performed for $999. [14] In the same year Complete Genomics contributed over 184 phased human genomes to the project. [15]
In February 2018, the results were published of the first 56 Canadian participants who had their whole genome analyzed. [16] Several DNA mutations that would have been expected by expert consensus to affect health of the participants had not done so, indicating that getting health data from the human genome was difficult. [17]
On March 9, 2017, producers of the popular online brain-training program Lumosity announced they would collaborate with Harvard researchers to investigate the relationship between genetics and memory, attention, and reaction speed. [18] [19]
Scientists at the Wyss Institute for Biologically Inspired Engineering and the Harvard Medical School Personal Genome Project (PGP) planned to recruit 10,000 members from the PGP, to perform a set of cognitive tests from Lumos Labs’ NeuroCognitive Performance Test, a brief, repeatable, online assessment to evaluate participants’ memory functions, including object recall, object pattern memorization, and response times. The researchers would then correlate extremely high performance scores with naturally occurring variations in the participants’ genomes. To validate their findings, the team would sequence, edit, and visualize DNA, model neuronal development in 3-D brain organoids ex vivo, and finally test emerging hypotheses in experimental models of neurodegeneration.[ citation needed ]
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.
Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dimensional structural configuration. In contrast to genetics, which refers to the study of individual genes and their roles in inheritance, genomics aims at the collective characterization and quantification of all of an organism's genes, their interrelations and influence on the organism. Genes may direct the production of proteins with the assistance of enzymes and messenger molecules. In turn, proteins make up body structures such as organs and tissues as well as control chemical reactions and carry signals between cells. Genomics also involves the sequencing and analysis of genomes through uses of high throughput DNA sequencing and bioinformatics to assemble and analyze the function and structure of entire genomes. Advances in genomics have triggered a revolution in discovery-based research and systems biology to facilitate understanding of even the most complex biological systems such as the brain.
Genetic testing, also known as DNA testing, is used to identify changes in DNA sequence or chromosome structure. Genetic testing can also include measuring the results of genetic changes, such as RNA analysis as an output of gene expression, or through biochemical analysis to measure specific protein output. In a medical setting, genetic testing can be used to diagnose or rule out suspected genetic disorders, predict risks for specific conditions, or gain information that can be used to customize medical treatments based on an individual's genetic makeup. Genetic testing can also be used to determine biological relatives, such as a child's biological parentage through DNA paternity testing, or be used to broadly predict an individual's ancestry. Genetic testing of plants and animals can be used for similar reasons as in humans, to gain information used for selective breeding, or for efforts to boost genetic diversity in endangered populations.
In genetics and bioinformatics, a single-nucleotide polymorphism is a germline substitution of a single nucleotide at a specific position in the genome that is present in a sufficiently large fraction of considered population.
George McDonald Church is an American geneticist, molecular engineer, chemist, serial entrepreneur, and pioneer in personal genomics and synthetic biology. He is the Robert Winthrop Professor of Genetics at Harvard Medical School, Professor of Health Sciences and Technology at Harvard University and Massachusetts Institute of Technology, and a founding member of the Wyss Institute for Biologically Inspired Engineering at Harvard. Through his Harvard lab Church has co-founded around 50 biotech companies pushing the boundaries of innovation in the world of life sciences and making his lab as a hotbed of biotech startup activity in Boston. In 2018, the Church lab at Harvard made a record by spinning off 16 biotech companies in one year. The Church lab works on research projects that are distributed in diverse areas of modern biology like developmental biology, neurobiology, info processing, medical genetics, genomics, gene therapy, diagnostics, chemistry & bioengineering, space biology & space genetics, and ecosystem. Research and technology developments at the Church lab have impacted or made direct contributions to nearly all "next-generation sequencing (NGS)" methods and companies. In 2017, Time magazine listed him in Time 100, the list of 100 most influential people in the world. In 2022, he was featured among the most influential people in biopharma by Fierce Pharma, and was listed among the top 8 famous geneticists of all time in human history. As of January 2023, Church serves as a member of the Bulletin of the Atomic Scientists' Board of Sponsors, established by Albert Einstein.
Personal genomics or consumer genetics is the branch of genomics concerned with the sequencing, analysis and interpretation of the genome of an individual. The genotyping stage employs different techniques, including single-nucleotide polymorphism (SNP) analysis chips, or partial or full genome sequencing. Once the genotypes are known, the individual's variations can be compared with the published literature to determine likelihood of trait expression, ancestry inference and disease risk.
The 1000 Genomes Project, launched in January 2008, was an international research effort to establish by far the most detailed catalogue of human genetic variation. Scientists planned to sequence the genomes of at least one thousand anonymous participants from a number of different ethnic groups within the following three years, using newly developed technologies which were faster and less expensive. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature. In 2012, the sequencing of 1092 genomes was announced in a Nature publication. In 2015, two papers in Nature reported results and the completion of the project and opportunities for future research.
Population genomics is the large-scale comparison of DNA sequences of populations. Population genomics is a neologism that is associated with population genetics. Population genomics studies genome-wide effects to improve our understanding of microevolution so that we may learn the phylogenetic history and demography of a population.
Whole genome sequencing (WGS), also known as full genome sequencing, complete genome sequencing, or entire genome sequencing, is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.
Complete Genomics is a life sciences company that has developed and commercialized a DNA sequencing platform for human genome sequencing and analysis. This solution combines the company's proprietary human genome sequencing technology with its informatics and data management software to provide finished variant reports and assemblies at Complete Genomics’ commercial genome center in Mountain View, California.
Exome sequencing, also known as whole exome sequencing (WES), is a genomic technique for sequencing all of the protein-coding regions of genes in a genome. It consists of two steps: the first step is to select only the subset of DNA that encodes proteins. These regions are known as exons—humans have about 180,000 exons, constituting about 1% of the human genome, or approximately 30 million base pairs. The second step is to sequence the exonic DNA using any high-throughput DNA sequencing technology.
The $1,000 genome refers to an era of predictive and personalized medicine during which the cost of fully sequencing an individual's genome (WGS) is roughly one thousand USD. It is also the title of a book by British science writer and founding editor of Nature Genetics, Kevin Davies. By late 2015, the cost to generate a high-quality "draft" whole human genome sequence was just below $1,500.
The genomic epidemiological database for global identification of microorganisms or global microbial identifier is a platform for storing whole genome sequencing data of microorganisms, for the identification of relevant genes and for the comparison of genomes to detect and track-and-trace infectious disease outbreaks and emerging pathogens. The database holds two types of information: 1) genomic information of microorganisms, linked to, 2) metadata of those microorganism such as epidemiological details. The database includes all genera of microorganisms: bacteria, viruses, parasites and fungi.
Personalized medicine involves medical treatments based on the characteristics of individual patients, including their medical history, family history, and genetics. Although personal genetic information is becoming increasingly important in healthcare, there is a lack of sufficient education in medical genetics among physicians and the general public. For example, pharmacogenomics is practiced worldwide by only a limited number of pharmacists, although most pharmacy colleges in the United States now include it in their curriculum. It is also increasingly common for genetic testing to be offered directly to consumers, who subsequently seek out educational materials and bring their results to their doctors. Issues involving genetic testing also invariably lead to ethical and legal concerns, such as the potential for inadvertent effects on family members, increased insurance rates, or increased psychological stress.
openSNP is an open source website where users can share their genetic information. Users upload their genes, including gender, age, eye color, medical history, Fitbit data. With a focus on user patient-led research (PLR), there is potential to redefine the way health research is conducted.
"It promises to be a vital supplement to standard research: it can focus on conditions that are neglected by standard research, such as rare diseases or side effects, and can draw on a broader range of data and deliver outcomes more rapidly. It can also be a way of realising valuable forms of social interaction and support in cases where members of a community conduct PLR together, for example, patients suffering from the same illness."
Genetic privacy involves the concept of personal privacy concerning the storing, repurposing, provision to third parties, and displaying of information pertaining to one's genetic information. This concept also encompasses privacy regarding the ability to identify specific individuals by their genetic sequence, and the potential to gain information on specific characteristics about that person via portions of their genetic information, such as their propensity for specific diseases or their immediate or distant ancestry.
Elective genetic and genomic testing are DNA tests performed for an individual who does not have an indication for testing. An elective genetic test analyzes selected sites in the human genome while an elective genomic test analyzes the entire human genome. Some elective genetic and genomic tests require a physician to order the test to ensure that individuals understand the risks and benefits of testing as well as the results. Other DNA-based tests, such as a genealogical DNA test do not require a physician's order. Elective testing is generally not paid for by health insurance companies. With the advent of personalized medicine, also called precision medicine, an increasing number of individuals are undertaking elective genetic and genomic testing.
Nebula Genomics is a personal genomics company based in San Francisco, California. It offers a whole-genome sequencing service.
The Tohoku Medical Megabank Project is a national project in Japan. The mission of the Tohoku Medical Megabank (TMM) project is to carry out a long-term health survey in the Miyagi and Iwate prefectures, which were affected by the Great East Japan Earthquake, and provide the research infrastructure for the development of personalized medicine by establishing a biobank and conducting cohort studies. It started in 2012.
Personalized genomics is the human genetics-derived study of analyzing and interpreting individualized genetic information by genome sequencing to identify genetic variations compared to the library of known sequences. International genetics communities have spared no effort from the past and have gradually cooperated to prosecute research projects to determine DNA sequences of the human genome using DNA sequencing techniques. The methods that are the most commonly used are whole exome sequencing and whole genome sequencing. Both approaches are used to identify genetic variations. Genome sequencing became more cost-effective over time, and made it applicable in the medical field, allowing scientists to understand which genes are attributed to specific diseases.