Pyrosequencing

Pyrosequencing is a method of DNA sequencing (determining the order of nucleotides in DNA) based on the "sequencing by synthesis" principle, in which the sequence is read by detecting each nucleotide as it is incorporated by a DNA polymerase. The method relies on detecting light produced by an enzymatic chain reaction that is triggered when pyrophosphate is released, hence the name pyrosequencing.

The principle of pyrosequencing was first described in 1993 [1] by Bertil Pettersson, Mathias Uhlen and Pål Nyren, who combined the solid-phase sequencing method [2] using streptavidin-coated magnetic beads with a recombinant DNA polymerase lacking 3´ to 5´ exonuclease (proofreading) activity and luminescence detection using the firefly luciferase enzyme. [3] A mixture of three enzymes (DNA polymerase, ATP sulfurylase and firefly luciferase) and a nucleotide (dNTP) is added to the single-stranded DNA to be sequenced, and the incorporation of the nucleotide is followed by measuring the light emitted. The intensity of the light indicates whether zero, one or more nucleotides have been incorporated, thus showing how many complementary nucleotides are present on the template strand. The nucleotide mixture is removed before the next nucleotide mixture is added. This process is repeated with each of the four nucleotides until the DNA sequence of the single-stranded template is determined.

A second, solution-based method for pyrosequencing was described in 1998 [4] by Mostafa Ronaghi, Mathias Uhlen and Pål Nyren. In this alternative method, an additional enzyme, apyrase, is introduced to remove nucleotides that are not incorporated by the DNA polymerase. This allowed the enzyme mixture, including the DNA polymerase, the luciferase and the apyrase, to be added at the start and kept throughout the procedure, providing a simple set-up suitable for automation. An automated instrument based on this principle was introduced to the market the following year by the company Pyrosequencing AB.

A third, microfluidic variant of the pyrosequencing method was described in 2005 [5] by Jonathan Rothberg and co-workers at the company 454 Life Sciences. This approach returned to the original principle of attaching the DNA to be sequenced to a solid support, and the authors showed that sequencing could be performed in a highly parallel manner using a microfabricated microarray. This allowed high-throughput DNA sequencing, and an automated instrument was introduced to the market. It became the first next-generation sequencing instrument, starting a new era in genomics research in which rapidly falling prices for DNA sequencing made whole-genome sequencing affordable.

Procedure

Figure: How pyrosequencing works.

"Sequencing by synthesis" involves taking a single strand of the DNA to be sequenced and then synthesizing its complementary strand enzymatically. The pyrosequencing method is based on detecting the activity of DNA polymerase (a DNA-synthesizing enzyme) with another, chemiluminescent enzyme. Essentially, the method allows a single strand of DNA to be sequenced by synthesizing the complementary strand along it, one base at a time, and detecting which base was actually added at each step. The template DNA is immobilized, and solutions of A, C, G, and T nucleotides are sequentially added to and removed from the reaction. Light is produced only when the nucleotide in solution complements the first unpaired base of the template. The order of the solutions that produce chemiluminescent signals reveals the sequence of the template. [6]
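
The dispensation cycle described above can be illustrated with a minimal Python sketch. It is a toy model under stated assumptions, not the software of any instrument: the function name `simulate_pyrogram`, the fixed A, C, G, T dispensation order, and the convention that the template string is written in the order the polymerase reads it are all choices made here for illustration. The number of nucleotides incorporated in each dispensation stands in for the light intensity.

```python
# Toy model of sequential nucleotide dispensation (illustrative assumptions only).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def simulate_pyrogram(template, dispensation_order="ACGT"):
    """Return a list of (nucleotide, incorporations) for each dispensation.

    `template` is assumed to be written in the order the polymerase reads it.
    A dispensed nucleotide is incorporated once for every consecutive template
    base it complements; otherwise the count (and the light signal) is zero.
    """
    position = 0
    flows = []
    while position < len(template):
        for nucleotide in dispensation_order:
            count = 0
            # Extend through a run of identical template bases.
            while (position < len(template)
                   and COMPLEMENT[template[position]] == nucleotide):
                count += 1
                position += 1
            flows.append((nucleotide, count))
            if position == len(template):
                break
    return flows


if __name__ == "__main__":
    for nucleotide, count in simulate_pyrogram("TGGCA"):
        print(nucleotide, count)
```

In this sketch the template `TGGCA` gives one incorporation for A, two for C (matching the two consecutive G bases), and one each for G and T. A dispensation that does not complement the next unpaired base is recorded with a count of zero, corresponding to no light.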

For the solution-based version of pyrosequencing, the single-strand DNA (ssDNA) template is hybridized to a sequencing primer and incubated with the enzymes DNA polymerase, ATP sulfurylase, luciferase and apyrase, and with the substrates adenosine 5´ phosphosulfate (APS) and luciferin.

  1. One of the four deoxynucleotide triphosphates (dNTPs) is added to the reaction. dATPαS, which is not a substrate for luciferase, is used instead of dATP to avoid background signal. DNA polymerase incorporates the dNTP into the growing strand if it is complementary to the next template base, and each incorporation releases pyrophosphate (PPi).
  2. ATP sulfurylase converts PPi to ATP in the presence of adenosine 5´ phosphosulfate. This ATP acts as a substrate for the luciferase-mediated conversion of luciferin to oxyluciferin, which generates visible light in amounts proportional to the amount of ATP. The light produced in the luciferase-catalyzed reaction is detected by a camera and analyzed by software.
  3. Unincorporated nucleotides and ATP are degraded by the apyrase, and the reaction can restart with another nucleotide.
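
Because the light produced is proportional to the amount of ATP, and therefore to the number of nucleotides incorporated, a recorded series of intensities can in principle be converted back into the synthesized sequence. The following Python sketch shows the idea under idealized assumptions: the function name `call_bases`, the `unit_signal` parameter (the intensity taken to represent one incorporation), and the example intensities are hypothetical, and the sketch ignores noise handling and any signal correction an actual instrument may apply.

```python
# Idealized base calling from light intensities (illustrative assumptions only).
def call_bases(flows, unit_signal=1.0):
    """Convert (nucleotide, intensity) pairs into the synthesized sequence.

    Each intensity is divided by `unit_signal`, the signal assumed to
    correspond to a single incorporation, and rounded to the nearest
    whole number of nucleotides.
    """
    called = []
    for nucleotide, intensity in flows:
        copies = max(round(intensity / unit_signal), 0)
        called.append(nucleotide * copies)
    return "".join(called)


if __name__ == "__main__":
    # Hypothetical intensities for four successive dispensations.
    example = [("A", 1.02), ("C", 1.95), ("G", 0.05), ("T", 0.97)]
    print(call_bases(example))
```

With the hypothetical intensities above, the C dispensation is called as two nucleotides and the G dispensation as none. The called string is the newly synthesized strand, so the template is its complement.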

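The ATP-generating and light-producing reactions can be represented by the following equations:

  PPi + APS → ATP + sulfate (catalyzed by ATP sulfurylase)
  ATP + luciferin + O2 → AMP + PPi + oxyluciferin + CO2 + hν (catalyzed by luciferase)

where:

  PPi is pyrophosphate
  APS is adenosine 5´ phosphosulfate
  ATP is adenosine triphosphate
  O2 is molecular oxygen
  AMP is adenosine monophosphate
  CO2 is carbon dioxide
  hν is light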

Limitations

A limitation of the method is that individual reads of DNA sequence are only about 300-500 nucleotides long, shorter than the 800-1000 nucleotides obtainable with chain termination methods (e.g. Sanger sequencing). This can make genome assembly more difficult, particularly for sequences containing a large amount of repetitive DNA. The lack of proofreading activity in the polymerase also limits the accuracy of the method.

Commercialization

The company Pyrosequencing AB in Uppsala, Sweden was founded with venture capital provided by HealthCap in order to commercialize machinery and reagents for sequencing short stretches of DNA using the pyrosequencing technique. Pyrosequencing AB was listed on the Stockholm Stock Exchange in 1999. It was renamed to Biotage in 2003. [7] The pyrosequencing business line was acquired by Qiagen in 2008. Pyrosequencing technology was further licensed to 454 Life Sciences. 454 developed an array-based pyrosequencing technology which emerged as a platform for large-scale DNA sequencing, including genome sequencing and metagenomics.

Roche announced the discontinuation of the 454 sequencing platform in 2013. [8]

References

  1. Nyren, Pettersson and Uhlen (1993) “Solid Phase DNA Minisequencing by an Enzymatic Luminometric Inorganic Pyrophosphate Detection Assay” Analytical Biochemistry 208 (1), 171-175, https://doi.org/10.1006/abio.1993.1024
  2. Uhlen (1989) “Magnetic separation of DNA” Nature 340: 733-4, https://doi.org/10.1038/340733a0
  3. Nyren and Lundin (1985) “Enzymatic method for continuous monitoring of inorganic pyrophosphate synthesis” Analytical Biochemistry 151 (2): 504-509. https://doi.org/10.1016/0003-2697(85)90211-8
  4. Ronaghi, Mostafa; Uhlén, Mathias; Nyrén, Pål (1998-07-17). "A Sequencing Method Based on Real-Time Pyrophosphate". Science. 281 (5375): 363–365. doi:10.1126/science.281.5375.363. PMID 9705713. S2CID 26331871.
  5. Margulies et al (2005) “Genome sequencing in microfabricated high-density picolitre reactors” Nature 437, 376-380. https://doi.org/10.1038/nature03959
  6. QIAGEN. "Pyrosequencing Technology and Platform Overview". Retrieved 4 August 2017.
  7. Biotage. "Biotage History". www.biotage.com. Retrieved 2022-09-19.
  8. Hollmer, Mark (October 17, 2013). "Roche to close 454 Life Sciences as it reduces gene sequencing focus". Fierce Biotech.

Further reading