In molecular biology, an amplicon is a piece of DNA or RNA that is the source and/or product of amplification or replication events. It can be formed artificially, using various methods including polymerase chain reactions (PCR) or ligase chain reactions (LCR), or naturally through gene duplication. In this context, amplification refers to the production of one or more copies of a genetic fragment or target sequence, specifically the amplicon. As it refers to the product of an amplification reaction, amplicon is used interchangeably with common laboratory terms, such as "PCR product."
Artificial amplification is used in research, [1] forensics, [2] and medicine [1] for purposes that include detection and quantification of infectious agents, [3] identification of human remains, [4] and extracting genotypes from human hair. [2]
Natural gene duplication plays a major role in evolution. It is also implicated in several forms of human cancer including primary mediastinal B cell lymphoma and Hodgkin's lymphoma. [5] In this context the term amplicon can refer both to a section of chromosomal DNA that has been excised, amplified, and reinserted elsewhere in the genome, and to a fragment of extrachromosomal DNA known as a double minute, each of which can be composed of one or more genes. Amplification of the genes encoded by these amplicons generally increases transcription of those genes and ultimately the volume of associated proteins. [6]
Amplicons in general are direct repeat (head-to-tail) or inverted repeat (head-to-head or tail-to-tail) genetic sequences, and can be either linear or circular in structure. [7] Circular amplicons consist of imperfect inverted duplications annealed into a circle [8] and are thought to arise from precursor linear amplicons. [9]
During artificial amplification, amplicon length is dictated by the experimental goals. [10]
Analysis of amplicons has been made possible by the development of amplification methods such as PCR, and increasingly by cheaper and more high-throughput technologies for DNA sequencing or next-generation sequencing, such as ion semiconductor sequencing, popularly referred to as the brand of the developer, Ion Torrent. [11]
DNA sequencing technologies such as next-generation sequencing have made it possible to study amplicons in genome biology and genetics, including cancer genetics research, [12] phylogenetic research, and human genetics. [13] For example, using the 16S rRNA gene, which is part of every bacterial and archaeal genome and is highly conserved, bacteria can be taxonomically classified by comparison of the amplicon sequence to known sequences. This works similarly in the fungal domain with the 18S rRNA gene as well as the ITS1 non-coding region. [14]
Irrespective of the approach used to amplify the amplicons, some technique must be used to quantitate the amplified product. [15] Generally, these techniques incorporate a capture step and a detection step, although how these steps are incorporated depends on the individual assay.
Examples include the Amplicor HIV-1 Monitor Assay (RT-PCR), which has the capacity to recognize HIV in plasma; the HIV-1 QT (NASBA), which is used to measure plasma viral load by amplifying a segment of the HIV RNA; and transcription mediated amplification, which employs a hybridization protection assay to distinguish Chlamydia trachomatis infections. [15] Various detection and capture steps are involved in each approach to assess the amplification product, or amplicon. With amplicon sequencing the high number of different amplicons resulting from amplification of a usual sample are concatenated and sequenced. After quality control classification is done by different methods, the counts of identical taxa representing their relative abundance in the sample.
PCR can be used to determine sex from a human DNA sample. [16] The loci of Alu element insertion is selected, amplified and evaluated in terms of size of the fragment. The sex assay utilizes AluSTXa for the X chromosome, AluSTYa for the Y chromosome, or both AluSTXa and AluSTYa, to reduce the possibility of error to a negligible quantity. The inserted chromosome yields a large fragment when the homologous region is amplified. The males are distinguished as having two DNA amplicons present, while females have only a single amplicon. The kit adapted for carrying out the method includes a pair of primers to amplify the locus and optionally polymerase chain reaction reagents. [17]
LCR can be used to diagnose tuberculosis. [18] The sequence containing protein antigen B is targeted by four oligonucleotide primers—two for the sense strand, and two for the antisense strand. The primers bind adjacent to one another, forming a segment of double stranded DNA that once separated, can serve as a target for future rounds of replication. In this instance, the product can be detected via the microparticle enzyme immunoassay (MEIA).
In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to express a specific protein in a cell that does not normally express that protein, or to sequence or quantify mRNA molecules using DNA based methods. cDNA that codes for a specific protein can be transferred to a recipient cell for expression, often bacterial or yeast expression systems. cDNA is also generated to analyze transcriptomic profiles in bulk tissue, single cells, or single nuclei in assays such as microarrays, qPCR, and RNA-seq.
The polymerase chain reaction (PCR) is a method widely used to rapidly make millions to billions of copies of a specific DNA sample, allowing scientists to take a very small sample of DNA and amplify it to a large enough amount to study in detail. PCR was invented in 1983 by the American biochemist Kary Mullis at Cetus Corporation; Mullis and biochemist Michael Smith, who had developed other essential ways of manipulating DNA, were jointly awarded the Nobel Prize in Chemistry in 1993.
A microsatellite is a tract of repetitive DNA in which certain DNA motifs are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. They have a higher mutation rate than other areas of DNA leading to high genetic diversity. Microsatellites are often referred to as short tandem repeats (STRs) by forensic geneticists and in genetic genealogy, or as simple sequence repeats (SSRs) by plant geneticists.
Reverse transcription polymerase chain reaction (RT-PCR) is a laboratory technique combining reverse transcription of RNA into DNA and amplification of specific DNA targets using polymerase chain reaction (PCR). It is primarily used to measure the amount of a specific RNA. This is achieved by monitoring the amplification reaction using fluorescence, a technique called real-time PCR or quantitative PCR (qPCR). Combined RT-PCR and qPCR are routinely used for analysis of gene expression and quantification of viral RNA in research and clinical settings.
This is a list of topics in molecular biology. See also index of biochemistry articles.
Gene amplification refers to a number of natural and artificial processes by which the number of copies of a gene is increased "without a proportional increase in other genes".
A real-time polymerase chain reaction is a laboratory technique of molecular biology based on the polymerase chain reaction (PCR). It monitors the amplification of a targeted DNA molecule during the PCR, not at its end, as in conventional PCR. Real-time PCR can be used quantitatively and semi-quantitatively.
Rolling circle replication (RCR) is a process of unidirectional nucleic acid replication that can rapidly synthesize multiple copies of circular molecules of DNA or RNA, such as plasmids, the genomes of bacteriophages, and the circular RNA genome of viroids. Some eukaryotic viruses also replicate their DNA or RNA via the rolling circle mechanism.
SNP genotyping is the measurement of genetic variations of single nucleotide polymorphisms (SNPs) between members of a species. It is a form of genotyping, which is the measurement of more general genetic variation. SNPs are one of the most common types of genetic variation. An SNP is a single base pair mutation at a specific locus, usually consisting of two alleles. SNPs are found to be involved in the etiology of many human diseases and are becoming of particular interest in pharmacogenetics. Because SNPs are conserved during evolution, they have been proposed as markers for use in quantitative trait loci (QTL) analysis and in association studies in place of microsatellites. The use of SNPs is being extended in the HapMap project, which aims to provide the minimal set of SNPs needed to genotype the human genome. SNPs can also provide a genetic fingerprint for use in identity testing. The increase of interest in SNPs has been reflected by the furious development of a diverse range of SNP genotyping methods.
Multiplex ligation-dependent probe amplification (MLPA) is a variation of the multiplex polymerase chain reaction that permits amplification of multiple targets with only a single primer pair. It detects copy number changes at the molecular level, and software programs are used for analysis. Identification of deletions or duplications can indicate pathogenic mutations, thus MLPA is an important diagnostic tool used in clinical pathology laboratories worldwide.
Bisulfitesequencing (also known as bisulphite sequencing) is the use of bisulfite treatment of DNA before routine sequencing to determine the pattern of methylation. DNA methylation was the first discovered epigenetic mark, and remains the most studied. In animals it predominantly involves the addition of a methyl group to the carbon-5 position of cytosine residues of the dinucleotide CpG, and is implicated in repression of transcriptional activity.
In the fields of bioinformatics and computational biology, Genome survey sequences (GSS) are nucleotide sequences similar to expressed sequence tags (ESTs) that the only difference is that most of them are genomic in origin, rather than mRNA.
Digital polymerase chain reaction is a biotechnological refinement of conventional polymerase chain reaction methods that can be used to directly quantify and clonally amplify nucleic acids strands including DNA, cDNA, or RNA. The key difference between dPCR and traditional PCR lies in the method of measuring nucleic acids amounts, with the former being a more precise method than PCR, though also more prone to error in the hands of inexperienced users. A "digital" measurement quantitatively and discretely measures a certain variable, whereas an “analog” measurement extrapolates certain measurements based on measured patterns. PCR carries out one reaction per single sample. dPCR also carries out a single reaction within a sample, however the sample is separated into a large number of partitions and the reaction is carried out in each partition individually. This separation allows a more reliable collection and sensitive measurement of nucleic acid amounts. The method has been demonstrated as useful for studying variations in gene sequences — such as copy number variants and point mutations — and it is routinely used for clonal amplification of samples for next-generation sequencing.
The versatility of polymerase chain reaction (PCR) has led to a large number of variants of PCR.
OLIGO Primer Analysis Software was the first publicly available software for DNA primer design. The first papers describing this software were published in 1989 and 1990, and consecutive upgrades in the 1990s and 2000s, all have been cited together over 600 times in scientific journals and over 500 times in patents. The program is a comprehensive real time PCR primer and probe search and analysis tool, and also does other tasks such as siRNA and molecular beacon searches, open reading frame, restriction enzyme analysis, etc. It has been created and maintained by Wojciech Rychlik and Piotr Rychlik. The OLIGO has been reviewed several times in scientific journals, for the first time in 1991 in a review in Critical Reviews in Biochemistry and Molecular Biology, and for its next upgrades.
In molecular biology, and more importantly high-throughput DNA sequencing, a chimera is a single DNA sequence originating when multiple transcripts or DNA sequences get joined. It can occur in various contexts. Chimeric reads are generally considered artifacts in sequencing applications and are filtered out from the data during processing to prevent spurious inferences of biological variation. In a different context, the deliberate creation of artificial chimeras can also be a useful tool in the molecular biology. For example, in protein engineering, "chimeragenesis " is one of the "two major techniques used to manipulate cDNA sequences". For gene fusions that occur through natural processes, see chimeric genes and fusion genes.
Diversity Arrays Technology (DArT) is a high-throughput genetic marker technique that can detect allelic variations to provides comprehensive genome coverage without any DNA sequence information for genotyping and other genetic analysis. The general steps involve reducing the complexity of the genomic DNA with specific restriction enzymes, choosing diverse fragments to serve as representations for the parent genomes, amplify via polymerase chain reaction (PCR), insert fragments into a vector to be placed as probes within a microarray, then fluorescent targets from a reference sequence will be allowed to hybridize with probes and put through an imaging system. The objective is to identify and quantify various forms of DNA polymorphism within genomic DNA of sampled species.
Multiplex polymerase chain reaction refers to the use of polymerase chain reaction to amplify several different DNA sequences simultaneously. This process amplifies DNA in samples using multiple primers and a temperature-mediated DNA polymerase in a thermal cycler. The primer design for all primers pairs has to be optimized so that all primer pairs can work at the same annealing temperature during PCR.
Multiple Annealing and Looping Based Amplification Cycles (MALBAC) is a quasilinear whole genome amplification method. Unlike conventional DNA amplification methods that are non-linear or exponential, MALBAC utilizes special primers that allow amplicons to have complementary ends and therefore to loop, preventing DNA from being copied exponentially. This results in amplification of only the original genomic DNA and therefore reduces amplification bias. MALBAC is “used to create overlapped shotgun amplicons covering most of the genome”. For next generation sequencing, MALBAC is followed by regular PCR which is used to further amplify amplicons.
Molecular diagnostics is a collection of techniques used to analyze biological markers in the genome and proteome, and how their cells express their genes as proteins, applying molecular biology to medical testing. In medicine the technique is used to diagnose and monitor disease, detect risk, and decide which therapies will work best for individual patients, and in agricultural biosecurity similarly to monitor crop- and livestock disease, estimate risk, and decide what quarantine measures must be taken.
"What is an amplicon? See examples of the different applications". YouTube video.