Private, Public | |
Industry | Genome sequencing Biotechnology |
Founded | September 9, 1999 (Beijing) |
Founder | Wang Jian Yu Jun Yang Huanming Liu Siqi |
Headquarters | Shenzhen, Guangdong, China |
Number of locations | |
Area served | Worldwide |
Key people | Wang Jian (President, Chairman) |
Products | BGISEQ, MGISEQ |
Revenue | |
Owner | Wang Jian |
Number of employees | ~ 5,000 (worldwide) |
Divisions | BGI China (Mainland) BGI Asia Pacific BGI Americas BGI Europe (Europe and Africa) |
Subsidiaries | List of subsidiaries:
|
Website | www.bgi.com/global/ www.genomics.cn |
BGI (Chinese :华大基因; pinyin :Huádà Jīyīn), known as the Beijing Genomics Institute prior to 2008 and as the BGI Group, is a genome sequencing center, headquartered in Shenzhen, Guangdong, China. [2] It was formed in 1999 to participate in the Human Genome Project, and is the world's largest genetics research center. [3] It is considered to be world's leader in gene-sequencing services, sequencing genome also of other animals, plants and microorganisms. [4] In 2013 it bought Complete Genomics in Mountain View, California for US$118 million, and develops, manufactures and markets genome sequencing technology. [4]
Chinese is a group of related, but in many cases not mutually intelligible, language varieties, forming the Sinitic branch of the Sino-Tibetan language family. Chinese is spoken by the Han majority and many minority ethnic groups in China. About 1.2 billion people speak some form of Chinese as their first language.
Hanyu Pinyin, often abbreviated to pinyin, is the official romanization system for Standard Chinese in mainland China and to some extent in Taiwan. It is often used to teach Standard Mandarin Chinese, which is normally written using Chinese characters. The system includes four diacritics denoting tones. Pinyin without tone marks is used to spell Chinese names and words in languages written with the Latin alphabet, and also in certain computer input methods to enter Chinese characters.
Whole genome sequencing is ostensibly the process of determining the complete DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast. In practice, genome sequences that are nearly complete are also called whole genome sequences.
Wang Jian, Yu Jun, Yang Huanming and Liu Siqi created BGI in November 1999, [4] in Beijing, China as a non-governmental independent research institute in order to participate in the Human Genome Project as China's representative. [5] [6] After the project was completed, funding dried up. So BGI moved to Hangzhou in exchange for funding from the Hangzhou Municipal Government. In 2002, BGI sequenced the rice genome which was a cover story in the journal Science. In 2003 BGI decoded the SARS virus genome and created a kit for detection of the virus. In 2003, BGI Hangzhou and the Zhejiang University founded a new research institute, the James D. Watson Institute of Genome Sciences, Zhejiang University. The Watson Institute was intended to become a major center for research and education in East Asia modeled after the Cold Spring Harbor Laboratory in the US.
Yang Huanming, also known as Henry Yang, is one of China's leading genetics researchers. He is Chairman and co-founder of the Beijing Genomics Institute, formerly of the Chinese Academy of Sciences. Yang received his B. Sc from Hangzhou University in 1978, followed by a master's degree in biology in 1982 from the Nanjing Railroad Medical Institute. Yang earned his Ph.D in 1988 in the Institute of Medical Genetics University of Copenhagen, Denmark. He then completed his post-doctoral training in Europe and the United States. He was elected as member of the Chinese Academy of Sciences in 2007, a foreign academician of Indian National Science Academy in 2009, a member of the German National Academy of Sciences in 2012, and foreign associate of the US National Academy of Science in 2014.
The Human Genome Project (HGP) was an international scientific research project with the goal of determining the sequence of nucleotide base pairs that make up human DNA, and of identifying and mapping all of the genes of the human genome from both a physical and a functional standpoint. It remains the world's largest collaborative biological project. After the idea was picked up in 1984 by the US government when the planning started, the project formally launched in 1990 and was declared complete on April 14, 2003. Funding came from the US government through the National Institutes of Health (NIH) as well as numerous other groups from around the world. A parallel project was conducted outside government by the Celera Corporation, or Celera Genomics, which was formally launched in 1998. Most of the government-sponsored sequencing was performed in twenty universities and research centers in the United States, the United Kingdom, Japan, France, Germany and China.
Hangzhou formerly romanized as Hangchow, is the capital and most populous city of Zhejiang Province in East China. It sits at the head of Hangzhou Bay, which separates Shanghai and Ningbo. Hangzhou grew to prominence as the southern terminus of the Grand Canal and has been one of the most renowned and prosperous cities in China for much of the last millennium. The city's West Lake, a UNESCO World Heritage site immediately west of the city, is among its best-known attractions. A study conducted by PwC and China Development Research Foundation saw Hangzhou ranked first among "Chinese Cities of Opportunity". Hangzhou is also considered a World City with a "Beta+" classification according to GaWC.
In 2007 BGI's headquarters relocated to Shenzhen as "the first citizen-managed, non-profit research institution in China". Yu Jun left BGI at this time purportedly selling his stake to the other 3 founders for a nominal sum. [4] In 2008, BGI-Shenzhen was officially recognized as a non-profit organization by Shenzhen government. [7] In 2008, BGI published the first human genome of an Asian individual. [5] [8]
Shenzhen is a major city in Guangdong Province, China; it forms part of the Pearl River Delta megalopolis, bordering Hong Kong to the south, Huizhou to the northeast, and Dongguan to the northwest. It holds sub-provincial administrative status, with powers slightly less than those of a province.
In 2010 BGI Shenzhen was certified as meeting the requirements of ISO9001:2008 standard for the design and provision of high-throughput sequencing services, [9] The same year BGI bought 128 Illumina's HiSeq 2000 gene-sequencing machines, [3] [5] which was backed by US$1.5 billion in "collaborative funds" over the next 10 years from the China Development Bank. [4] [10] [11] By the end of the year, they reportedly had a budget of $30 million. [12] In 2010, BGI Americas was established with its main office in Cambridge, Massachusetts, USA, and BGI Europe was established in Copenhagen, Denmark. [13] [14] By 2018, they were expanded with offices and laboratories in Seattle and San Jose in USA, [3] and London in the UK, as well were founded BGI Asia Pacific with offices in Hong Kong, Kobe (Japan), Bangkok (Thailand), Laos, Singapore, Brisbane (Australia) and many others. [15] [16]
The China Development Bank (CDB) is a financial institution in the People's Republic of China (PRC), led by a cabinet minister at the Governor level, under the direct jurisdiction of the State Council. As one of three policy banks in China, it is responsible for raising funds for large-scale infrastructure projects, including the Three Gorges Dam and the Shanghai Pudong International Airport. Established by the Policy Banks Law of 1994, the bank is described as the engine that powers the national government’s economic development policies.
Cambridge is a city in Middlesex County, Massachusetts, and part of the Boston metropolitan area.
In 2011 BGI reported it employed 4,000 scientists and technicians, [2] and had a $192 million in revenue. [4] BGI did the genome sequencing for the deadly 2011 Germany E. coli O104:H4 outbreak in three days under an open license. [17] Since 2012 it started to commercialize its services, having investments from China Life Insurance Company, Citic's Goldstone Investment, Jack Ma's Yunfeng Capital, and SoftBank China Capital. [18] In 2013 BGI reported it had relationships with 17 out of the top 20 global pharmaceutical companies, [4] [19] and advertised that it provided commercial science, health, agricultural, and informatics services to global pharmaceutical companies. [20] That year it bought Complete Genomics of Mountain View, California, a major supplier of DNA sequencing technology, for US$118 million. [4] [17] In the same year, the BGI was roughly valued at $820 million. [4]
A novel strain of Escherichia coli O104:H4 bacteria caused a serious outbreak of foodborne illness focused in northern Germany in May through June 2011. The illness was characterized by bloody diarrhea, with a high frequency of serious complications, including hemolytic-uremic syndrome (HUS), a condition that requires urgent treatment. The outbreak was originally thought to have been caused by an enterohemorrhagic (EHEC) strain of E. coli, but it was later shown to have been caused by an enteroaggregative E. coli (EAEC) strain that had acquired the genes to produce Shiga toxins, present in organic fenugreek sprouts.
China Life Insurance Company Limited is a Beijing-based China-incorporated company that provides life insurance and annuity products. China Life is ranked No. 94 on Fortune 2015 Global 500 Company list. China Life, which is 70% state-owned, is the biggest life insurer in China, but is coming off a few rocky years. China's insurance market attracted dozens of new competitors after the Chinese government liberalized it, and China Life's market share has fallen by almost half since 2007, from 50% to around 26%, according to Morningstar. The company is completing a major restructuring, and the government assigned it a new CEO in 2014. A new sales push early this year led to a big bump in net income in the first quarter of 2015. China Life is also ranked on Fortune China: 2015 Top 500 Chinese Enterprises at No.13.
Jack Ma is a Chinese business magnate, investor, and philanthropist. He is the co-founder and executive chairman of the Alibaba Group, a multinational technology conglomerate. as of January 2019, he is one of China's richest men with a net worth of US$36.2 billion, as well as one of the wealthiest people in the world. Ma's primary ideology was for an open and market-driven economy.
In 2015, they signed a collaboration with the Zhongshan Hospital' Center for Clinical Precision Medicine in Shanghai, opened in May 2015 with a budget of ¥100 million. They are reportedly being involved as a sequencing institution in China's US$9.2-billion research project for medical care which will last for 15 years. [3] [21] In May 2017, was announced formation of West Coast Innovation Center, co-located in Seattle and San Jose, on the first location planned to work on precision medicine and feature collaborations with University of Washington, the Allen Institute for Brain Science, the Bill & Melinda Gates Foundation, and Washington State University, while on the second's already existing laboratory with 100 employees to develop the next-generation sequencing technologies. [3] In May 2018, was reached an agreement with Mount Sinai Hospital (Toronto), Canada, for first installation of BGISEQ platforms in North America. [22]
Zhongshan Hospital, formerly Chung Shan Memorial Hospital, is a major teaching hospital in Shanghai, China, affiliated with the Shanghai Medical College of Fudan University.
The University of Washington is a public research university in Seattle, Washington.
The Allen Institute for Brain Science is a Seattle-based independent, nonprofit medical research organization. Founded in 2003, it is dedicated to accelerating the understanding of how the human brain works. With the intent of catalyzing brain research in different areas, the Allen Institute provides free data and tools to scientists.
The BGI Group subsidiary, BGI Genomics, had made initial public offering in July 2017 at Shenzhen Stock Exchange, raising ¥547 million ($80.7 million), with company's first-day valuation set at over $1.15 billion. [1] [23] In 2018, the BGI was reportedly 85.3% owned by Wang Jian, and the group owns 42.4% of its main unit BGI Genomics. The reported market value for BGI Genomics in July 2018 was around $5 billion, as is of another subsidiary, MGI Tech, specialized in developing and manufacturing technology, which IPO of a stake of about 20% for $1 billion is scheduled for 2019 in Hong Kong. [18]
An international project launched in 1990 and declared complete in 2003. They joined in 1999 and provided 1% of the workload. [42]
An international project launched in 2002 and declared complete in 2009. They provided 10% of the workload. [42]
Started in 2007 and named after two Emperors believed to have founded China’s dominant ethnic group, [45] BGI planned in this project, to sequence at least 100 Chinese individuals to produce a high-resolution map of Chinese genetic polymorphisms. [46] [47] The first genome data was published in October 2007. [48] An anonymous Chinese billionaire donated $10 million RMB (about US$1.4 million) to the project and his genome was sequenced at the beginning of the project. [46] [47]
An international project launched in 2008 and declared complete in 2015. [42]
An international project launched in 2008. [42]
An international projected jointly initiated with Children's Hospital of Philadelphia in 2011. [42] With it were discovered genes and mutations associated with rare diseases, which was reported in more than 20 scientific publications. They also "co-developed a clinical whole exome diagnostic test offered through CHOP pathology since 2012". They again collaborated in 2017 when CHOP's Children’s Brain Tumor Tissue Consortium was joined by BGI's China National GeneBank. [49]
A Chinese project focused on the research of the genetic basis of intelligence launched in August 2012. American physicist Steve Hsu joined as a scientific adviser and one of the project's leaders. [50] It was done on 2,200 samples mostly from the United States, [42] out of which 1,600 were of individuals who participated in the Study of Mathematically Precocious Youth and reportedly have IQs over 160, collected by American psychologist and geneticist Robert Plomin. [50] [51]
An international project launched in 2008 and declared complete in 2015. In 2010, BGI has announced it will contribute US$100 million to large-scale sequencing projects of plants and animals. [11] [52]
In 2009 BGI-Shenzhen announced the launch of three genome projects that focus on animals living in extreme environments. The three selected genomes are those of two polar animals: the polar bear and emperor penguin, and one altiplano animal: the Tibetan antelope. [53]
In 2010, BGI, Beijing University, Heilongjiang Manchurian tiger forestry zoo, Kunming Institute of Zoology, San Diego Zoo Institute for Conservation Research in California, and others announced they would sequence the Amur tiger, South China tiger, Bengal tiger, Asiatic lion, African lion, clouded leopard, snow leopard, and other felines. BGI would also sequence the genomes and epigenoms of a liger and tigon. Since the two reciprocal hybrids have different phenotypes, despite being genetically identical, it was expected that the epigenome might reveal the basis of such differences. [54] The project aim was to significantly advance conservation research and was auspiciously announced for the Chinese year of the Tiger. [55] Results were reported in 2013 for the genomes of the Anur tiger, the white Bengal tiger, African lion, white African lion and snow leopard. [56]
A jointly funded project announced on 19 March 2010, BGI will collaborate with Sidney K. Pierce of University of South Florida and Charles Delwiche of the University of Maryland at College Park to sequence the genomes of the sea slug, Elysia chlorotica , and its algal food Vaucheria litorea . The sea slug uses genes from the algae to synthesize chlorophyll, the first interspecies of gene transfer discovered. Sequencing their genomes could elucidate the mechanism of that transfer. [57]
An international project initiated with the Smithsonian Institution and other partners in 2018, to sequence DNA of the 1.5 million known eukaryotic species on the planet. [49]
The project was started on 1 August 2009, with the mission to sequence 10,000 microbes within 3 years. It includes sequencing industrial, agricultural, medical microorganism and many others. It is done in collaboration with many institutes, universities and enterprises, including Biotechnology Research Institute of the Chinese Academy of Agricultural Sciences and the Tianjin Institute of Industrial Biotechnology of Chinese Academy of Sciences. [58]
In 2010, the institute 500-node supercomputer processed 10 terabytes of raw sequencing data every 24 hours from its current 30 or so Genome Analyzers from Illumina. The annual budget for the computer center was US$9 million. [12] In the same year, BGI's computational biologists developed the first successful algorithm, based on graph theory, for aligning billions of 25 to 75-base pair strings produced by next-generation sequencers, specifically Illumina’s Genome Analyzer, during de novo sequencing. The algorithm, called SOAPdenovo, can assemble a genome in two days and has been used to sequence an array of plant and animal genomes. [29]
SOAPdenovo is part of "Short Oligonucleotide Analysis Package" (SOAP), a suite of tools developed by BGI for de novo assembly of human-sized genomes, alignment, SNP detection, resequencing, indel finding, and structural variation analysis. Built for the Illumina sequencers' short reads, SOAPdenovo has been used to assemble multiple human genomes [25] [26] [27] (identifying an eight kilobase insertion not detected by mapping to the human reference genome [59] ) and animals, like the giant panda. [24]
Until 2015, BGI had released BGISEQ-100, based on Thermo Fisher Scientific's Ion Torrent device, and BGISEQ-1000, based on similar technology by Complete Genomics, for both of which received an approval from the CFDA for a NIFTY noninvasive prenatal test. [60] [61] In October 2015, BGI launched BGISEQ-500, [62] a larger desktop sequencing system, which received an approved registration as a medical device a year later by the CFDA. It reportedly received more than 500 orders for the system and run over 112,000 tests until late 2016. [61] The China National GeneBank, opened by BGI and Chinese Government in September 2016, [63] has 150 instruments of the system. [61] The BGISEQ-500 was developed as a sequencing platform capable of competing with Illumina's platforms with its quality and reduced price. [60] In November 2016, BGI launched BGISEQ-50, a miniature version of desktop sequencer. [64] In 2017, BGI began offering WGS for $600. [3] In October 2017, MGI Tech, a subsidiary of BGI, launched two new sequencers MGISEQ-2000 and MGISEQ-200, [65] while a year later MGISEQ-T7. [66]
In the fields of molecular biology and genetics, a genome is the genetic material of an organism. It consists of DNA. The genome includes both the genes and the noncoding DNA, as well as mitochondrial DNA and chloroplast DNA. The study of the genome is called genomics.
Genome projects are scientific endeavours that ultimately aim to determine the complete genome sequence of an organism and to annotate protein-coding genes and other important genome-encoded features. The genome sequence of an organism includes the collective DNA sequences of each chromosome in the organism. For a bacterium containing a single chromosome, a genome project will aim to map the sequence of that chromosome. For the human species, whose genome includes 22 pairs of autosomes and 2 sex chromosomes, a complete genome sequence will involve 46 separate chromosome sequences.
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.
Metagenomics is the study of genetic material recovered directly from environmental samples. The broad field may also be referred to as environmental genomics, ecogenomics or community genomics. While traditional microbiology and microbial genome sequencing and genomics rely upon cultivated clonal cultures, early environmental gene sequencing cloned specific genes to produce a profile of diversity in a natural sample. Such work revealed that the vast majority of microbial biodiversity had been missed by cultivation-based methods. Recent studies use either "shotgun" or PCR directed sequencing to get largely unbiased samples of all genes from all the members of the sampled communities. Because of its ability to reveal the previously hidden diversity of microscopic life, metagenomics offers a powerful lens for viewing the microbial world that has the potential to revolutionize understanding of the entire living world. As the price of DNA sequencing continues to fall, metagenomics now allows microbial ecology to be investigated at a much greater scale and detail than before.
George McDonald Church is an American geneticist, molecular engineer, and chemist. As of 2015, he is Robert Winthrop Professor of Genetics at Harvard Medical School and Professor of Health Sciences and Technology at Harvard and MIT, and was a founding member of the Wyss Institute for Biologically Inspired Engineering at Harvard. As of March 2017, Church serves as a member of the Bulletin of the Atomic Scientists' Board of Sponsors.
Illumina, Inc. is an American company incorporated in April 1998 that develops, manufactures and markets integrated systems for the analysis of genetic variation and biological function. The company provides a line of products and services that serve the sequencing, genotyping and gene expression and proteomics markets. Its headquarters are located in San Diego, California.
The 1000 Genomes Project, launched in January 2008, was an international research effort to establish by far the most detailed catalogue of human genetic variation. Scientists planned to sequence the genomes of at least one thousand anonymous participants from a number of different ethnic groups within the following three years, using newly developed technologies which were faster and less expensive. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature. In 2012, the sequencing of 1092 genomes was announced in a Nature publication. In 2015, two papers in Nature reported results and the completion of the project and opportunities for future research. Many rare variations, restricted to closely related groups, were identified, and eight structural-variation classes were analyzed.
Complete Genomics is a life sciences company that has developed and commercialized a DNA sequencing platform for human genome sequencing and analysis. This solution combines the company’s proprietary human genome sequencing technology with its informatics and data management software to provide finished variant reports and assemblies at Complete Genomics’ own commercial genome center in Mountain View, California. In March 2013 Complete Genomics was acquired by BGI-Shenzhen, a genomics services company in Shenzhen, Guangdong, China.
Announced in 2008, shortly after the human 1000 Genomes Project, the 1000 Plant Genomes Project (1KP) is a similar large-scale genomics using the high speed and efficiency of next-generation DNA sequencing. Headed by Dr. Gane Ka-Shu Wong and Dr. Michael Deyholos of the University of Alberta, the project successfully sequenced the transcriptomes of 1000 different plant species by 2014.
Exome sequencing, also known as whole exome sequencing (WES), is a genomic technique for sequencing all of the protein-coding region of genes in a genome. It consists of two steps: the first step is to select only the subset of DNA that encodes proteins. These regions are known as exons – humans have about 180,000 exons, constituting about 1% of the human genome, or approximately 30 million base pairs. The second step is to sequence the exonic DNA using any high-throughput DNA sequencing technology.
DNA nanoball sequencing is a high throughput sequencing technology that is used to determine the entire genomic sequence of an organism. The method uses rolling circle replication to amplify small fragments of genomic DNA into DNA nanoballs. Fluorescent nucleotides bind to complementary nucleotides and are then polymerized to anchor sequences bound to known sequences on the DNA template. The base order is determined via the fluorescence of the bound nucleotides This DNA sequencing method allows large numbers of DNA nanoballs to be sequenced per run at lower reagent costs compared to other next generation sequencing platforms. However, a limitation of this method is that it generates only short sequences of DNA, which presents challenges to mapping its reads to a reference genome. After purchasing Complete Genomics, the Beijing Genomics Institute (BGI) refined DNA nanoball sequencing to sequence nucleotide samples on their own platform.
The $1,000 genome refers to an era of predictive and personalized medicine during which the cost of fully sequencing an individual's genome (WGS) is roughly USD $1,000. It is also the title of a book by British science writer and founding editor of Nature Genetics, Kevin Davies. By late 2015, the cost to generate a high-quality 'draft' whole human genome sequence was just below $1,500.
Illumina dye sequencing is a technique used to determine the series of base pairs in DNA, also known as DNA sequencing. The reversible terminated chemistry concept was invented by Bruno Canard and Simon Sarfati at the Pasteur Institute in Paris. It was developed by Shankar Balasubramanian and David Klenerman of Cambridge University, who subsequently founded Solexa, a company later acquired by Illumina. This sequencing method is based on reversible dye-terminators that enable the identification of single bases as they are introduced into DNA strands. It can also be used for whole-genome and region sequencing, transcriptome analysis, metagenomics, small RNA discovery, methylation profiling, and genome-wide protein-nucleic acid interaction analysis.
Wang Jun is a Chinese scientist, founder and CEO of iCarbonX, and former CEO of the Beijing Genomics Institute.
Transcriptomics technologies are the techniques used to study an organism’s transcriptome, the sum of all of its RNA transcripts. The information content of an organism is recorded in the DNA of its genome and expressed through transcription. Here, mRNA serves as a transient intermediary molecule in the information network, whilst non-coding RNAs perform additional diverse functions. A transcriptome captures a snapshot in time of the total transcripts present in a cell. Transcriptomics technologies provide a broad account of which cellular processes are active and which are dormant. A major challenge in molecular biology lies in understanding how the same genome can give rise to different cell types and how gene expression is regulated.
The Earth BioGenome Project (EBP) is an initiative that aims to sequence and catalog the genomes of all of Earth's currently described eukaryotic species over a period of ten years. The initiative would produce an open DNA database of biological information that provides a platform for scientific research and supports environmental and conservation initiatives. A scientific paper presenting the vision for the project was published in PNAS in April 2018, and the project officially launched November 1, 2018.
A plant genome assembly represents the complete genomic sequence of a plant species, which is assembled into chromosomes and other organelles by using DNA fragments that are obtained from different types of sequencing technology.