In silico PCR

In silico PCR [1] refers to computational tools used to calculate theoretical polymerase chain reaction (PCR) results using a given set of primers (probes) to amplify DNA sequences from a sequenced genome or transcriptome. [2] [3] [4] [5]

These tools are used to optimize the design of primers for target DNA or cDNA sequences. Primer optimization has two goals: efficiency and selectivity. Efficiency depends on factors such as GC content, binding efficiency, complementarity, secondary structure, and annealing and melting temperature (Tm). Selectivity requires that the primer pairs neither bind fortuitously to sites other than the target of interest nor bind to regions conserved across a gene family. If selectivity is poor, a set of primers will amplify multiple products in addition to the target of interest. [6]
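Two of the efficiency factors listed above, GC content and melting temperature, are simple to estimate. The sketch below is illustrative only: it uses the Wallace rule (2 °C per A/T base, 4 °C per G/C base), which is a rough approximation for short oligonucleotides and not necessarily the model any particular tool uses. The function names and the example primer are hypothetical.

```python
def gc_content(primer: str) -> float:
    """Return the fraction of G and C bases in the primer (0.0 to 1.0)."""
    seq = primer.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer: str) -> int:
    """Estimate Tm in degrees Celsius with the Wallace rule.

    Assumption: 2 degrees per A/T and 4 degrees per G/C. This is a crude
    estimate for short primers; real tools typically use more detailed
    thermodynamic models.
    """
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical 12-base primer with three of each base.
example_primer = "AGCTTGACCGTA"
print(gc_content(example_primer))  # 0.5
print(wallace_tm(example_primer))  # 36
```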

In silico PCR example result with jPCR software.

The design of appropriate short or long primer pairs is only one goal of PCR product prediction. Other information provided by in silico PCR tools may include the location and orientation of each primer, the length of each amplicon, simulated electrophoretic mobility, identified open reading frames, and links to other web resources. [7] [8] [9]
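The core of such a prediction (primer location, orientation, and amplicon length) can be sketched in a few lines. This is a minimal illustration assuming exact primer matches on a linear template; the names `predict_amplicons` and `Amplicon`, the `max_length` cutoff, and the example sequences are all hypothetical and do not come from any of the tools cited here.

```python
from typing import List, NamedTuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


class Amplicon(NamedTuple):
    start: int     # 0-based start of the forward primer on the plus strand
    end: int       # end (exclusive) of the reverse primer site on the plus strand
    length: int    # amplicon length in bases, primers included
    sequence: str  # plus-strand sequence of the predicted product


def find_all(text: str, pattern: str) -> List[int]:
    """Return every start position of pattern in text, overlaps included."""
    positions = []
    i = text.find(pattern)
    while i != -1:
        positions.append(i)
        i = text.find(pattern, i + 1)
    return positions


def predict_amplicons(template: str, forward: str, reverse: str,
                      max_length: int = 3000) -> List[Amplicon]:
    """Predict products for one primer pair, assuming exact matches only.

    The forward primer is searched on the plus strand. The reverse primer
    anneals to the plus strand, so its site there reads as the reverse
    complement of the primer.
    """
    template = template.upper()
    rev_site = reverse_complement(reverse)
    amplicons = []
    for f in find_all(template, forward.upper()):
        for r in find_all(template, rev_site):
            end = r + len(rev_site)
            length = end - f
            # Primers must face each other: forward site upstream of reverse site.
            if r >= f and length <= max_length:
                amplicons.append(Amplicon(f, end, length, template[f:end]))
    return amplicons


# Hypothetical template: 4 flanking bases, forward site, 6-base insert,
# reverse site, 4 flanking bases.
template = "TTTTATGCCGTAGGGGGGTTCAGGACAAAA"
for amp in predict_amplicons(template, "ATGCCGTA", "GTCCTGAA"):
    print(amp.start, amp.end, amp.length, amp.sequence)
```

A fuller tool would also search the reverse strand for the forward primer, tolerate mismatches, and rank products, but the position and length bookkeeping would stay much the same.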

Many software packages are available, offering differing balances of feature set, ease of use, efficiency, and cost. [10] [11] [12] [13] [14] Primer-BLAST is widely used and freely accessible from the National Center for Biotechnology Information (NCBI) website.

FastPCR, [10] a commercial application, allows simultaneous testing of a single primer or a set of primers designed for multiplex target sequences. It performs a fast, gapless alignment to test the complementarity of the primers to the target sequences. Probable PCR products can be found for linear and circular templates using standard or inverse PCR, as well as for multiplex PCR.

Dicey [15] is free software that outputs in silico PCR products from primer sets provided in a FASTA file. It is fast (through use of the genome's FM-index) and can account for primer melting temperature and tolerated edit distances between primers and hit locations on the genome. VPCR [3] runs a dynamic simulation of multiplex PCR, allowing an estimate of quantitative competition effects between multiple amplicons in one reaction. The UCSC Genome Browser offers isPCR, which provides graphical as well as text-file output to view PCR products on more than 100 sequenced genomes.
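Handling circular templates mainly means letting a primer site, and the product itself, wrap around the origin of the sequence. The following is a sketch of that idea only, assuming exact matches; it is not how FastPCR or any other tool named above is implemented, and the function names and the example plasmid are hypothetical.

```python
from typing import List, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_sites(template: str, site: str, circular: bool = False) -> List[int]:
    """Return start positions of exact matches of site in template.

    For a circular template, the first len(site) - 1 bases are appended to
    the end so that sites spanning the origin are also found.
    """
    template, site = template.upper(), site.upper()
    text = (template + template[: len(site) - 1]) if circular else template
    hits = []
    i = text.find(site)
    while i != -1:
        hits.append(i)
        i = text.find(site, i + 1)
    return hits


def circular_products(template: str, forward: str,
                      reverse: str) -> List[Tuple[int, int, str]]:
    """Return (forward start, length, sequence) for products on a circle.

    Simplification: every forward/reverse site pair is reported, reading
    clockwise from the forward site to the end of the reverse site.
    """
    template = template.upper()
    n = len(template)
    rev_site = reverse_complement(reverse)
    products = []
    for f in find_sites(template, forward, circular=True):
        for r in find_sites(template, rev_site, circular=True):
            length = (r - f) % n + len(rev_site)
            # Modular indexing lets the product run past the origin.
            seq = "".join(template[(f + k) % n] for k in range(length))
            products.append((f, length, seq))
    return products


# Hypothetical 30-base plasmid whose reverse primer site spans the origin.
plasmid = "GGACAAAATTTTATGCCGTAGGGGGGTTCA"
print(find_sites(plasmid, "TTCAGGAC"))                 # linear search
print(find_sites(plasmid, "TTCAGGAC", circular=True))  # circular search
print(circular_products(plasmid, "ATGCCGTA", "GTCCTGAA"))
```

In this example the linear search misses the reverse primer site because it is split across the two ends of the stored sequence, while the circular search recovers it.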

A primer may bind to many predicted sequences, but only sequences with no or few mismatches (1 or 2, depending on location and nucleotide) at the 3' end of the primer can be used for polymerase extension. The last 10–12 bases at the 3' end of a primer are critical for the initiation of polymerase extension and for general primer stability on the template binding site. The effect of a single mismatch within this 3'-terminal region depends on its position and local structure, and it can reduce primer binding, selectivity, and PCR efficiency. [7] [9]
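One way to express this rule in code is to count mismatches over the whole primer and separately over its 3'-terminal window, and to accept a site only if both counts stay under a threshold. The sketch below assumes a gapless comparison on a single strand; the window size and both thresholds are illustrative assumptions rather than values taken from a specific tool, and the primer and template are hypothetical.

```python
from typing import List, Tuple


def count_mismatches(a: str, b: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def extendable_sites(template: str, primer: str,
                     three_prime_window: int = 10,
                     max_three_prime_mismatches: int = 1,
                     max_total_mismatches: int = 4) -> List[Tuple[int, int, int]]:
    """Return (position, total mismatches, 3'-window mismatches) per accepted site.

    The primer is written 5'->3', so its 3' end is the right-hand end. Only
    the strand that reads like the primer is scanned; the reverse strand is
    ignored in this sketch. All thresholds are assumed defaults.
    """
    template, primer = template.upper(), primer.upper()
    k = len(primer)
    hits = []
    for i in range(len(template) - k + 1):
        site = template[i:i + k]
        total = count_mismatches(primer, site)
        tail = count_mismatches(primer[-three_prime_window:],
                                site[-three_prime_window:])
        if total <= max_total_mismatches and tail <= max_three_prime_mismatches:
            hits.append((i, total, tail))
    return hits


primer = "GATTACAGGCTCAAGT"
template = (
    "CC" + "GATTACAGGCTCAAGT"    # perfect match at position 2
    + "TT" + "CATTACAGGCTCAAGT"  # one 5' mismatch at position 20
    + "AA" + "GATTACAGGCTCAACA"  # two 3' mismatches at position 38
    + "GG"
)
for position, total, tail in extendable_sites(template, primer):
    print(position, total, tail)
```

With these defaults, the perfect site and the site with a single 5' mismatch are accepted, while the site at position 38 is rejected because both of its mismatches fall within the 3' window.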

References

  1. Synonyms: digital PCR, virtual PCR, electronic PCR, e-PCR
  2. Schuler, G. D. (1997). "Sequence mapping by electronic PCR". Genome Research. 7 (5): 541–550. doi:10.1101/gr.7.5.541. PMC 310656. PMID 9149949.
  3. Lexa, M.; Horak, J.; Brzobohaty, B. (2001). "Virtual PCR". Bioinformatics. 17 (2): 192–193. doi:10.1093/bioinformatics/17.2.192. PMID 11238077.
  4. Rotmistrovsky, K.; Jang, W.; Schuler, G. D. (2004). "A web server for performing electronic PCR". Nucleic Acids Research. 32 (Web Server issue): W108–W112. doi:10.1093/nar/gkh450. PMC 441588. PMID 15215361.
  5. Bikandi, J.; Millan, R. S.; Rementeria, A.; Garaizar, J. (2004). "In silico analysis of complete bacterial genomes: PCR, AFLP-PCR and endonuclease restriction". Bioinformatics. 20 (5): 798–799. doi:10.1093/bioinformatics/btg491. PMID 14752001.
  6. Boutros, P. C.; Okey, A. B. (2004). "PUNS: Transcriptomic- and genomic-in silico PCR for enhanced primer design". Bioinformatics. 20 (15): 2399–2400. doi:10.1093/bioinformatics/bth257. PMID 15073008.
  7. Kalendar, R.; Lee, D.; Schulman, A. H. (2011). "Java web tools for PCR, in silico PCR, and oligonucleotide assembly and analysis". Genomics. 98 (2): 137–144. doi:10.1016/j.ygeno.2011.04.009. PMID 21569836.
  8. Kalendar, R.; Lee, D.; Schulman, A. H. (2014). "FastPCR Software for PCR, in Silico PCR, and Oligonucleotide Assembly and Analysis". DNA Cloning and Assembly Methods. Methods in Molecular Biology. Vol. 1116. pp. 271–302. CiteSeerX 10.1.1.700.4632. doi:10.1007/978-1-62703-764-8_18. ISBN 978-1-62703-763-1. PMID 24395370.
  9. Yu, B.; Zhang, C. (2011). "In Silico PCR Analysis". In Silico Tools for Gene Discovery. Methods in Molecular Biology. Vol. 760. pp. 91–107. doi:10.1007/978-1-61779-176-5_6. ISBN 978-1-61779-175-8. PMID 21779992.
  10. "FastPCR". PrimerDigital Ltd.
  11. "Oligomer Web Tools". Oligomer Oy, Finland. Archived from the original on 2014-03-27. Retrieved 2014-03-27.
  12. "Electronic PCR". NCBI - National Center for Biotechnology Information.
  13. "UCSC Genome Bioinformatics". UCSC Genome Bioinformatics Group.
  14. Gulvik, C. A.; Effler, T. C.; Wilhelm, S. W.; Buchan, A. (2012). "De-MetaST-BLAST: A Tool for the Validation of Degenerate Primer Sets and Data Mining of Publicly Available Metagenomes". PLOS ONE. 7 (1): e50362. Bibcode:2012PLoSO...750362G. doi:10.1371/journal.pone.0050362. PMC 3506598. PMID 23189198.
  15. Rausch, Tobias. "Dicey". GitHub. Retrieved 27 February 2024.