Complete Genomics

Last updated
Complete Genomics
Company type Private
Industry Biotechnology
Founded2006
Headquarters,
Area served
Worldwide
Parent MGI
Website www.completegenomics.com OOjs UI icon edit-ltr-progressive.svg

Complete Genomics is a life sciences company that has developed and commercialized a DNA sequencing platform for human genome sequencing and analysis. The company is a wholly-owned subsidiary of MGI.

Contents

History

Complete Genomics was founded in June 2005 by Clifford Reid, Radoje (Rade) Drmanac, and John Curson. Clifford Reid was the chairman, president and chief executive officer of Complete Genomics before leaving in 2015 to set up Genos, a spinoff of Complete Genomics' consumer division. [1] [2]

In February 2009, Complete Genomics announced that it had sequenced its first human genome and submitted the resulting variant data to the National Center for Biotechnology Information database. In November 2009, Complete Genomics published sequence data for three human genomes in the journal Science . [3]

The resulting data has supported research in diverse areas such as screening of embryos, [4] detection of genetic relationships, [5] [6] neurology, [7] aging, [8] a novel Mendelian disease with neuromuscular and cardiac involvement, [9] eating disorders, [10] Prader-Willi syndrome and autism, [11] ophthalmology, [12] and oncology. [13] [14] [15] [16] [17] In 2014, a collaboration among Radboud University (The Netherlands), Maastricht University Medical Centre (The Netherlands), Central South University (China) and Complete Genomics identified major causes of intellectual disability using whole genome sequencing. [18]

In 2016, Complete Genomics contributed over 184 phased human genomes to George Church's Personal Genome Project. [19] In 2019, they published their new single-tube long fragment read (stLFR) technology, enabling the construction of long DNA molecules from short reads using a combinatorial process of DNA barcoding. It enables phasing, SV detection, scaffolding, and cost-effective diploid de novo genome assembly from second-generation sequencing technology. [20]

In March 2013, Complete Genomics was acquired by BGI Group, a genomics services company in Shenzhen, Guangdong, China. [21] After the acquisition, Complete Genomics moved to San Jose and in June 2018 became part of MGI. [22] MGI was a subsidiary of BGI Group before it was spun out and listed on the Shanghai Stock Exchange in 2022. [23]

Related Research Articles

<span class="mw-page-title-main">Bioinformatics</span> Computational analysis of large, complex sets of biological data

Bioinformatics is an interdisciplinary field of science that develops methods and software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, chemistry, physics, computer science, computer programming, information engineering, mathematics and statistics to analyze and interpret biological data. The subsequent process of analyzing and interpreting data is referred to as computational biology.

In genetics, shotgun sequencing is a method used for sequencing random DNA strands. It is named by analogy with the rapidly expanding, quasi-random shot grouping of a shotgun.

<span class="mw-page-title-main">Human genome</span> Complete set of nucleic acid sequences for humans

The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.

<span class="mw-page-title-main">Genomics</span> Discipline in genetics

Genomics is an interdisciplinary field of molecular biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dimensional structural configuration. In contrast to genetics, which refers to the study of individual genes and their roles in inheritance, genomics aims at the collective characterization and quantification of all of an organism's genes, their interrelations and influence on the organism. Genes may direct the production of proteins with the assistance of enzymes and messenger molecules. In turn, proteins make up body structures such as organs and tissues as well as control chemical reactions and carry signals between cells. Genomics also involves the sequencing and analysis of genomes through uses of high throughput DNA sequencing and bioinformatics to assemble and analyze the function and structure of entire genomes. Advances in genomics have triggered a revolution in discovery-based research and systems biology to facilitate understanding of even the most complex biological systems such as the brain.

<span class="mw-page-title-main">DNA sequencer</span> A scientific instrument used to automate the DNA sequencing process

A DNA sequencer is a scientific instrument used to automate the DNA sequencing process. Given a sample of DNA, a DNA sequencer is used to determine the order of the four bases: G (guanine), C (cytosine), A (adenine) and T (thymine). This is then reported as a text string, called a read. Some DNA sequencers can be also considered optical instruments as they analyze light signals originating from fluorochromes attached to nucleotides.

<span class="mw-page-title-main">BGI Group</span> Chinese genome sequencing company

BGI Group, formerly Beijing Genomics Institute, is a Chinese genomics company with headquarters in Yantian, Shenzhen. The company was originally formed in 1999 as a genetics research center to participate in the Human Genome Project. It also sequences the genomes of other animals, plants and microorganisms.

<span class="mw-page-title-main">Functional genomics</span> Field of molecular biology

Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "candidate-gene" approach.

<span class="mw-page-title-main">DNA sequencing</span> Process of determining the nucleic acid sequence

DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.

<span class="mw-page-title-main">Human Genome Project</span> International scientific research project

The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with only 0.3% of the bases covered by potential issues. The final gapless assembly was finished in January 2022.

Sequencing by hybridization is a class of methods for determining the order in which nucleotides occur on a strand of DNA. Typically used for looking for small changes relative to a known DNA sequence. The binding of one strand of DNA to its complementary strand in the DNA double-helix is sensitive to even single-base mismatches when the hybrid region is short or if specialized mismatch detection proteins are present. This is exploited in a variety of ways, most notably via DNA chips or microarrays with thousands to billions of synthetic oligonucleotides found in a genome of interest plus many known variations or even all possible single-base variations.

<span class="mw-page-title-main">Oncogenomics</span> Sub-field of genomics

Oncogenomics is a sub-field of genomics that characterizes cancer-associated genes. It focuses on genomic, epigenomic and transcript alterations in cancer.

<span class="mw-page-title-main">1000 Genomes Project</span> International research effort on genetic variation

The 1000 Genomes Project (1KGP), taken place from January 2008 to 2015, was an international research effort to establish the most detailed catalogue of human genetic variation at the time. Scientists planned to sequence the genomes of at least one thousand anonymous healthy participants from a number of different ethnic groups within the following three years, using advancements in newly developed technologies. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature. In 2012, the sequencing of 1092 genomes was announced in a Nature publication. In 2015, two papers in Nature reported results and the completion of the project and opportunities for future research.

<span class="mw-page-title-main">Whole genome sequencing</span> Determining nearly the entirety of the DNA sequence of an organisms genome at a single time

Whole genome sequencing (WGS) is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.

<span class="mw-page-title-main">RNA-Seq</span> Lab technique in cellular biology

RNA-Seq is a technique that uses next-generation sequencing to reveal the presence and quantity of RNA molecules in a biological sample, providing a snapshot of gene expression in the sample, also known as transcriptome.

The exome is composed of all of the exons within the genome, the sequences which, when transcribed, remain within the mature RNA after introns are removed by RNA splicing. This includes untranslated regions of messenger RNA (mRNA), and coding regions. Exome sequencing has proven to be an efficient method of determining the genetic basis of more than two dozen Mendelian or single gene disorders.

Cancer genome sequencing is the whole genome sequencing of a single, homogeneous or heterogeneous group of cancer cells. It is a biochemical laboratory method for the characterization and identification of the DNA or RNA sequences of cancer cell(s).

<span class="mw-page-title-main">Exome sequencing</span> Sequencing of all the exons of a genome

Exome sequencing, also known as whole exome sequencing (WES), is a genomic technique for sequencing all of the protein-coding regions of genes in a genome. It consists of two steps: the first step is to select only the subset of DNA that encodes proteins. These regions are known as exons—humans have about 180,000 exons, constituting about 1% of the human genome, or approximately 30 million base pairs. The second step is to sequence the exonic DNA using any high-throughput DNA sequencing technology.

<span class="mw-page-title-main">DNA nanoball sequencing</span> DNA sequencing technology

DNA nanoball sequencing is a high throughput sequencing technology that is used to determine the entire genomic sequence of an organism. The method uses rolling circle replication to amplify small fragments of genomic DNA into DNA nanoballs. Fluorescent nucleotides bind to complementary nucleotides and are then polymerized to anchor sequences bound to known sequences on the DNA template. The base order is determined via the fluorescence of the bound nucleotides This DNA sequencing method allows large numbers of DNA nanoballs to be sequenced per run at lower reagent costs compared to other next generation sequencing platforms. However, a limitation of this method is that it generates only short sequences of DNA, which presents challenges to mapping its reads to a reference genome. After purchasing Complete Genomics, the Beijing Genomics Institute (BGI) refined DNA nanoball sequencing to sequence nucleotide samples on their own platform.

Personalized genomics is the human genetics-derived study of analyzing and interpreting individualized genetic information by genome sequencing to identify genetic variations compared to the library of known sequences. International genetics communities have spared no effort from the past and have gradually cooperated to prosecute research projects to determine DNA sequences of the human genome using DNA sequencing techniques. The methods that are the most commonly used are whole exome sequencing and whole genome sequencing. Both approaches are used to identify genetic variations. Genome sequencing became more cost-effective over time, and made it applicable in the medical field, allowing scientists to understand which genes are attributed to specific diseases.

<span class="mw-page-title-main">MGI (company)</span> Chinese biotechnology company

MGI or MGI Tech is a Chinese biotechnology company, which provides a line of products and technologies that serves the genetic sequencing, genotyping and gene expression, and proteomics markets. Its headquarters are located in Shenzhen, Guangdong, China.

References

  1. "Consumer Genomics Startup Genos Research Plans to Let Customers Explore, Share Their Data". GenomeWeb. 13 June 2016. Retrieved 2019-06-15.
  2. "Complete Genomics Announces Updated Mission and New Partnerships on 18th Anniversary". News-Medical. 15 June 2023. Retrieved 2023-06-15.
  3. Drmanac R; Sparks, A. B.; Callow, M. J.; Halpern, A. L.; Burns, N. L.; Kermani, B. G.; Carnevali, P.; Nazarenko, I.; Nilsen, G. B.; Yeung, G.; Dahl, F.; Fernandez, A.; Staker, B.; Pant, K. P.; Baccash, J.; Borcherding, A. P.; Brownley, A.; Cedeno, R.; Chen, L.; Chernikoff, D.; Cheung, A.; Chirita, R.; Curson, B.; Ebert, J. C.; Hacker, C. R.; Hartlage, R.; Hauser, B.; Huang, S.; Jiang, Y.; et al. (November 2009). "Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays". Science. 327 (5961): 78–81. Bibcode:2010Sci...327...78D. doi: 10.1126/science.1181498 . PMID   19892942. S2CID   17309571.
  4. Winard R; et al. (2014). "In vitro screening of embryos by whole-genome sequencing: now, in the future or never?". Hum Reprod. 29 (4): 842–851. doi: 10.1093/humrep/deu005 . PMID   24491297.
  5. Li H; et al. (2014). "Relationship estimation from whole-genome sequence data". PLOS Genet. 10 (1): e1004144. doi: 10.1371/journal.pgen.1004144 . PMC   3907355 . PMID   24497848.
  6. Su S-Y; et al. (2012). "Detection of identity by descent using next-generation whole genome sequencing data". BMC Bioinformatics. 13: 121. doi: 10.1186/1471-2105-13-121 . PMC   3403908 . PMID   22672699.
  7. Bundo M (2014). "Increased L1 retrotransposition in the neuronal genome in schizophrenia". Neuron. 81 (2): 306–313. doi: 10.1016/j.neuron.2013.10.053 . PMID   24389010.
  8. Kai Y; et al. (2013). "Aging as accelerated accumulation of somatic variants: whole-genome sequencing of centenarian and middle-aged monozygotic twin pairs". Twin Research and Human Genetics. 16 (6): 1026–1032. doi: 10.1017/thg.2013.73 . PMID   24182360.
  9. Wang K; et al. (2013). "Whole-genome DNA/RNA sequencing identifies truncating mutations in RBCK1 in a novel Mendelian disease with neuromuscular and cardiac involvement". Genome Medicine. 5 (7): 67. doi: 10.1186/gm471 . PMC   3971341 . PMID   23889995.
  10. Cui H; et al. (2013). "Eating disorder predisposition is associated with ESRRA and HDAC4 mutations". J Clin Invest. 123 (11): 4706–4713. doi:10.1172/jci71400. PMC   3809805 . PMID   24216484.
  11. Schaaf CP; et al. (2013). "Truncating mutations of MAGEL2 cause Prader-Willi phenotypes and autism". Nature Genetics. 45 (11): 1405–1408. doi:10.1038/ng.2776. PMC   3819162 . PMID   24076603.
  12. Nishiguchi KM; et al. (2012). "Genes associated with retinitis pigmentosa and allied diseases are frequently mutated in the general population". PLOS ONE. 7 (7): e41902. Bibcode:2012PLoSO...741902N. doi: 10.1371/journal.pone.0041902 . PMC   3407128 . PMID   22848652.
  13. Ma Y; et al. (2012). "Developmental timing of mutations revealed by whole-genome sequencing of twins with acute lymphoblastic leukemia". Proc Natl Acad Sci USA. 110 (18): 7429–7433. Bibcode:2013PNAS..110.7429M. doi: 10.1073/pnas.1221099110 . PMC   3645544 . PMID   23569245.
  14. Kiel MJ; et al. (2012). "Whole-genome sequencing identifies recurrent somatic NOTCH2 mutations in splenic marginal zone lymphoma". The Journal of Experimental Medicine. 209 (9): 1553–1565. doi:10.1084/jem.20120910. PMC   3428949 . PMID   22891276.
  15. Molenaar JJ; et al. (2012). "Sequencing of neuroblastoma identifies chromothripsis and defects in neuritogenesis genes". Nature. 483 (7391): 589–593. Bibcode:2012Natur.483..589M. doi: 10.1038/nature10910 . hdl: 1765/57581 . PMID   22367537.
  16. Turajlic S; et al. (2011). "Whole genome sequencing of matched primary and metastatic acral melanomas". Genome Res. 22 (2): 196–207. doi:10.1101/gr.125591.111. PMC   3266028 . PMID   22183965.
  17. Yokoyama S; et al. (2011). "GA novel recurrent mutation in MITF predisposes to familial and sporadic melanoma". Nature. 480 (7375): 99–103. Bibcode:2011Natur.480...99Y. doi:10.1038/nature10630. PMC   3266855 . PMID   22080950.
  18. Gilissen C; et al. (2014). "Genome sequencing identifies major causes of severe intellectual disability". Nature. 511 (7509): 344–347. Bibcode:2014Natur.511..344G. doi:10.1038/nature13394. hdl: 2066/138095 . PMID   24896178. S2CID   205238886.
  19. Peters, Brock A.; Drmanac, Radoje; Church, George M.; Estep, Preston W.; Zaranek, Alexander Wait; Vandewege, Ward; Connelly, Abram; Clegg, Tom; Agarwal, Misha R. (2016-12-01). "The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes". GigaScience. 5 (1): 42. doi: 10.1186/s13742-016-0148-z . PMC   5057367 . PMID   27724973.
  20. Peters, Brock A.; Drmanac, Radoje; Xu, Xun; Kristiansen, Karsten; Wang, Jian; Yang, Huanming; Alexeev, Andrei; Zhang, Wenwei; Dong, Yuliang (2019-05-01). "Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly". Genome Research. 29 (5): 798–808. doi:10.1101/gr.245126.118. ISSN   1088-9051. PMC   6499310 . PMID   30940689.
  21. Specter, Michael (6 January 2014) The Gene Factory The New Yorker, Retrieved 28 October 2014
  22. brandonvd. "About Us". Complete Genomics. Retrieved 2019-06-15.
  23. Smyth, Jamie; Sevastopulo, Demetri (18 April 2023). "Chinese genetics company targets US despite political tensions". FinancialTimes . Retrieved 2023-04-18.