Multiple displacement amplification (MDA) is a DNA amplification technique. This method can rapidly amplify minute amounts of DNA samples to a reasonable quantity for genomic analysis. The reaction starts by annealing random hexamer primers to the template: DNA synthesis is carried out by a high fidelity enzyme, preferentially Φ29 DNA polymerase. Compared with conventional PCR amplification techniques, MDA does not employ sequence-specific primers but amplifies all DNA, generates larger-sized products with a lower error frequency, and works at a constant temperature. MDA has been actively used in whole genome amplification (WGA) and is a promising method for application to single cell genome sequencing and sequencing-based genetic studies.
Many biological and forensic cases involving genetic analysis require sequencing of DNA from minute amounts of sample, such as DNA from uncultured single cells or trace amounts of tissue collected from crime scenes. Conventional Polymerase Chain Reaction (PCR)-based DNA amplification methods require sequence-specific oligonucleotide primers and heat-stable (usually Taq) polymerase, and can be used to generate significant amounts of DNA from minute amounts of DNA. However, this is not sufficient for modern techniques which use sequencing-based DNA analysis. Therefore, a more efficient non-sequence-specific method to amplify minute amounts of DNA is necessary, especially in single-cell genomic studies.
Bacteriophage Φ29 DNA polymerase is a high-processivity enzyme that can produce DNA amplicons greater than 70 kilobase pairs. [1] Its high fidelity and 3’–5' proofreading activity reduces the amplification error rate to 1 in 106−107 bases compared to conventional Taq polymerase with a reported error rate of 1 in 9,000. [2] The reaction can be carried out at a moderate isothermal condition of 30 °C and therefore does not require a thermocycler. It has been actively used in cell-free cloning, which is the enzymatic method of amplifying DNA in vitro without cell culturing and DNA extraction. The large fragment of Bst DNA polymerase is also used in MDA, but Ф29 is generally preferred due to its sufficient product yield and proofreading activity. [3]
Hexamer primers are sequences composed of six random nucleotides. For MDA applications, these primers are usually thiophosphate-modified at their 3’ end to convey resistance to the 3’–5’ exonuclease activity of Ф29 DNA polymerase. MDA reactions start with the annealing of such primers to the DNA template followed by polymerase-mediated chain elongation. Increasing numbers of primer annealing events happen along the amplification reaction.
The amplification reaction initiates when multiple primer hexamers anneal to the template. When DNA synthesis proceeds to the next starting site, the polymerase displaces the newly produced DNA strand and continues its strand elongation. The strand displacement generates a newly synthesized single-stranded DNA template for more primers to anneal. Further primer annealing and strand displacement on the newly synthesized template results in a hyper-branched DNA network. The sequence debranching during amplification results in a high yield of the products. To separate the DNA branching network, S1 nucleases are used to cleave the fragments at displacement sites. The nicks on the resulting DNA fragments are repaired by DNA polymerase I.
MDA can generate 1–2 μg of DNA from single cell with genome coverage of up to 99%. [4] Products also have lower error rate and larger sizes compared to PCR based Taq amplification. [4] [5]
General work flow of MDA: [6]
MDA generates sufficient yield of DNA products. It is a powerful tool of amplifying DNA molecules from samples, such as uncultured microorganism or single cells to the amount that would be sufficient for sequencing studies. The large size of MDA-amplified DNA products also provides desirable sample quality for identifying the size of polymorphic repeat alleles. Its high fidelity also makes it reliable to be used in the single-nucleotide polymorphism (SNP) allele detection. Due to its strand displacement during amplification, the amplified DNA has sufficient coverage of the source DNA molecules, which provides a high-quality product for genomic analysis. The products of displaced strands can be subsequently cloned into vectors to construct library for subsequent sequencing reactions.
ADO is defined as the random non-amplification of one of the alleles present in a heterozygous sample. Some studies have reported the ADO rate of the MDA products to be 0–60%. [7] This drawback decreases the accuracy of genotyping of single sample and misdiagnosis in other MDA involved applications. ADO appears to be independent of the fragment sizes and has been reported to have a similar rate in other single-cell techniques. Possible solutions are the use of different lysis conditions or to carry out multiple rounds of amplifications from the diluted MDA products since PCR mediated amplification from cultured cells has been reported to give lower ADO rates.
'Preferential amplification' is over-amplification of one of the alleles in comparison to the other. Most studies on MDA have reported this issue. The amplification bias is currently observed to be random. It might affect the analysis of small stretches of genomic DNA in identifying Short Tandem Repeats (STR) alleles.
Endogenous template-independent primer-primer interaction is due to the random design of hexamer primers. One possible solution is to design constrained-randomized hexanucleotide primers that do not cross-hybridize.
Single cells of uncultured bacteria, archaea and protists, as well as individual viral particles and single fungal spores have been sequenced with the help of MDA. [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18]
The ability to sequence individual cells is also useful in combating human disease. Genomes from single human embryonic cells have been successfully amplified for sequencing using MDA, allowing preimplantation genetic diagnosis (PGD): screening for genetic health issues in an early-stage embryo before implantation. [19] Diseases with heterogeneous properties, such as cancer, also benefit from MDA-based genome sequencing's ability to study mutations in individual cells.
The MDA products from a single cell have also been successfully used in array-comparative genomic hybridization experiments, which usually require a relatively large amount of amplified DNA.
Chromatin Immunoprecipitation results in production of complex mixtures of relatively short DNA fragments, which is challenging to amplify with MDA without causing a bias in the fragment representation. A method to circumvent this problem was proposed, which is based on conversion of these mixtures to circular concatemers using ligation, followed by Φ29 DNA polymerase-mediated MDA. [20]
The trace amount of samples collected from crime scenes can be amplified by MDA to the quantity that is enough for forensic DNA analysis, which is commonly used in identifying victims and suspects.
The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA sufficiently to enable detailed study. PCR was invented in 1983 by American biochemist Kary Mullis at Cetus Corporation. Mullis and biochemist Michael Smith, who had developed other essential ways of manipulating DNA, were jointly awarded the Nobel Prize in Chemistry in 1993.
A primer is a short, single-stranded nucleic acid used by all living organisms in the initiation of DNA synthesis. A synthetic primer may also be referred to as an oligo, short for oligonucleotide. DNA polymerase enzymes are only capable of adding nucleotides to the 3’-end of an existing nucleic acid, requiring a primer be bound to the template before DNA polymerase can begin a complementary strand. DNA polymerase adds nucleotides after binding to the RNA primer and synthesizes the whole strand. Later, the RNA strands must be removed accurately and replace them with DNA nucleotides forming a gap region known as a nick that is filled in using an enzyme called ligase. The removal process of the RNA primer requires several enzymes, such as Fen1, Lig1, and others that work in coordination with DNA polymerase, to ensure the removal of the RNA nucleotides and the addition of DNA nucleotides. Living organisms use solely RNA primers, while laboratory techniques in biochemistry and molecular biology that require in vitro DNA synthesis usually use DNA primers, since they are more temperature stable. Primers can be designed in laboratory for specific reactions such as polymerase chain reaction (PCR). When designing PCR primers, there are specific measures that must be taken into consideration, like the melting temperature of the primers and the annealing temperature of the reaction itself. Moreover, the DNA binding sequence of the primer in vitro has to be specifically chosen, which is done using a method called basic local alignment search tool (BLAST) that scans the DNA and finds specific and unique regions for the primer to bind.
In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule.
In molecular biology, an amplicon is a piece of DNA or RNA that is the source and/or product of amplification or replication events. It can be formed artificially, using various methods including polymerase chain reactions (PCR) or ligase chain reactions (LCR), or naturally through gene duplication. In this context, amplification refers to the production of one or more copies of a genetic fragment or target sequence, specifically the amplicon. As it refers to the product of an amplification reaction, amplicon is used interchangeably with common laboratory terms, such as "PCR product."
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.
Sanger sequencing is a method of DNA sequencing that involves electrophoresis and is based on the random incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. After first being developed by Frederick Sanger and colleagues in 1977, it became the most widely used sequencing method for approximately 40 years. An automated instrument using slab gel electrophoresis and fluorescent labels was first commercialized by Applied Biosystems in March 1987. Later, automated slab gels were replaced with automated capillary array electrophoresis. More recently, higher volume Sanger sequencing has been replaced by next generation sequencing methods, especially for large-scale, automated genome analyses. However, the Sanger method remains in wide use for smaller-scale projects and for validation of deep sequencing results. It still has the advantage over short-read sequencing technologies in that it can produce DNA sequence reads of > 500 nucleotides and maintains a very low error rate with accuracies around 99.99%. Sanger sequencing is still actively being used in efforts for public health initiatives such as sequencing the spike protein from SARS-CoV-2 as well as for the surveillance of norovirus outbreaks through the Center for Disease Control and Prevention's (CDC) CaliciNet surveillance network.
Taq polymerase is a thermostable DNA polymerase I named after the thermophilic eubacterial microorganism Thermus aquaticus, from which it was originally isolated by Chien et al. in 1976. Its name is often abbreviated to Taq or Taq pol. It is frequently used in the polymerase chain reaction (PCR), a method for greatly amplifying the quantity of short segments of DNA.
Rolling circle replication (RCR) is a process of unidirectional nucleic acid replication that can rapidly synthesize multiple copies of circular molecules of DNA or RNA, such as plasmids, the genomes of bacteriophages, and the circular RNA genome of viroids. Some eukaryotic viruses also replicate their DNA or RNA via the rolling circle mechanism.
SNP genotyping is the measurement of genetic variations of single nucleotide polymorphisms (SNPs) between members of a species. It is a form of genotyping, which is the measurement of more general genetic variation. SNPs are one of the most common types of genetic variation. An SNP is a single base pair mutation at a specific locus, usually consisting of two alleles. SNPs are found to be involved in the etiology of many human diseases and are becoming of particular interest in pharmacogenetics. Because SNPs are conserved during evolution, they have been proposed as markers for use in quantitative trait loci (QTL) analysis and in association studies in place of microsatellites. The use of SNPs is being extended in the HapMap project, which aims to provide the minimal set of SNPs needed to genotype the human genome. SNPs can also provide a genetic fingerprint for use in identity testing. The increase of interest in SNPs has been reflected by the furious development of a diverse range of SNP genotyping methods.
Bisulfitesequencing (also known as bisulphite sequencing) is the use of bisulfite treatment of DNA before routine sequencing to determine the pattern of methylation. DNA methylation was the first discovered epigenetic mark, and remains the most studied. In animals it predominantly involves the addition of a methyl group to the carbon-5 position of cytosine residues of the dinucleotide CpG, and is implicated in repression of transcriptional activity.
The polymerase chain reaction (PCR) is a commonly used molecular biology tool for amplifying DNA, and various techniques for PCR optimization which have been developed by molecular biologists to improve PCR performance and minimize failure.
A nucleic acid test (NAT) is a technique used to detect a particular nucleic acid sequence and thus usually to detect and identify a particular species or subspecies of organism, often a virus or bacterium that acts as a pathogen in blood, tissue, urine, etc. NATs differ from other tests in that they detect genetic materials rather than antigens or antibodies. Detection of genetic materials allows an early diagnosis of a disease because the detection of antigens and/or antibodies requires time for them to start appearing in the bloodstream. Since the amount of a certain genetic material is usually very small, many NATs include a step that amplifies the genetic material—that is, makes many copies of it. Such NATs are called nucleic acid amplification tests (NAATs). There are several ways of amplification, including polymerase chain reaction (PCR), strand displacement assay (SDA), transcription mediated assay (TMA), and loop-mediated isothermal amplification (LAMP).
The versatility of polymerase chain reaction (PCR) has led to modifications of the basic protocol being used in a large number of variant techniques designed for various purposes. This article summarizes many of the most common variations currently or formerly used in molecular biology laboratories; familiarity with the fundamental premise by which PCR works and corresponding terms and concepts is necessary for understanding these variant techniques.
Φ29 DNA polymerase is an enzyme from the bacteriophage Φ29. It is being increasingly used in molecular biology for multiple displacement DNA amplification procedures, and has a number of features that make it particularly suitable for this application. It was discovered and characterized by Spanish scientists Luis Blanco and Margarita Salas.
Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation sequencing. Some of these technologies emerged between 1993 and 1998 and have been commercially available since 2005. These technologies use miniaturized and parallelized platforms for sequencing of 1 million to 43 billion short reads per instrument run.
Illumina dye sequencing is a technique used to determine the series of base pairs in DNA, also known as DNA sequencing. The reversible terminated chemistry concept was invented by Bruno Canard and Simon Sarfati at the Pasteur Institute in Paris. It was developed by Shankar Balasubramanian and David Klenerman of Cambridge University, who subsequently founded Solexa, a company later acquired by Illumina. This sequencing method is based on reversible dye-terminators that enable the identification of single nucleotides as they are washed over DNA strands. It can also be used for whole-genome and region sequencing, transcriptome analysis, metagenomics, small RNA discovery, methylation profiling, and genome-wide protein-nucleic acid interaction analysis.
Multiple Annealing and Looping Based Amplification Cycles (MALBAC) is a quasilinear whole genome amplification method. Unlike conventional DNA amplification methods that are non-linear or exponential, MALBAC utilizes special primers that allow amplicons to have complementary ends and therefore to loop, preventing DNA from being copied exponentially. This results in amplification of only the original genomic DNA and therefore reduces amplification bias. MALBAC is “used to create overlapped shotgun amplicons covering most of the genome”. For next generation sequencing, MALBAC is followed by regular PCR which is used to further amplify amplicons.
Single-cell sequencing examines the nucleic acid sequence information from individual cells with optimized next-generation sequencing technologies, providing a higher resolution of cellular differences and a better understanding of the function of an individual cell in the context of its microenvironment. For example, in cancer, sequencing the DNA of individual cells can give information about mutations carried by small populations of cells. In development, sequencing the RNAs expressed by individual cells can give insight into the existence and behavior of different cell types. In microbial systems, a population of the same species can appear genetically clonal. Still, single-cell sequencing of RNA or epigenetic modifications can reveal cell-to-cell variability that may help populations rapidly adapt to survive in changing environments.
Surveyor nuclease assay is an enzyme mismatch cleavage assay used to detect single base mismatches or small insertions or deletions (indels).
G&T-seq is a novel form of single cell sequencing technique allowing one to simultaneously obtain both transcriptomic and genomic data from single cells, allowing for direct comparison of gene expression data to its corresponding genomic data in the same cell...
{{cite journal}}
: CS1 maint: multiple names: authors list (link)