The Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines are a set of protocols for conducting and reporting quantitative real-time PCR experiments and data, as devised by Bustin et al. in 2009. [1] They were devised after a paper was published in 2002 that claimed to detect measles virus in children with autism through the use of RT-qPCR, but the results proved to be completely unreproducible by other scientists. [2] The authors themselves also did not try to reproduce them and the raw data was found to have a large amount of errors and basic mistakes in analysis. This incident prompted Stephen Bustin to create the MIQE guidelines to provide a baseline level of quality for qPCR data published in scientific literature. [2]
The MIQE guidelines were created due to the low quality of qPCR data submitted to academic journals at the time, which was only becoming more common as Next Generation Sequencing machinery allowed for such experiments to be run for a cheaper cost. Because the technique is utilized across all of science in multiple fields, the instruments, methods, and designs of how qPCR is used differ greatly. To help improve overall quality, the MIQE guidelines were made as generalized suggestions on basic experimental procedures and forms of data that should be collected as a minimum level of reported information for other researchers to understand and use when reading the published material. Setting up a recognized and largely agreed upon set of guidelines such as these were deemed important by the scientific community especially due to the ever increasing amount of scientific work coming from developing countries with many different languages and protocols. [3]
In 2009, Stephen Bustin led an international group of scientists including Mikael Kubista to put together a set of guidelines on how to perform qPCR and what forms of data should be collected and published in the process. [1] This also allowed editors and reviewers of scientific journals to employ the guidelines when looking over a submitted paper that included qPCR data. Thus, the guidelines were set up as a sort of checklist for each step of the procedure with certain items being marked as essential (E) when submitting data for publication and others marked as just desirable (D). [4]
An additional version of the guidelines was published in September 2010 for use with fluorescence-based quantitative real-time PCR. It also acted as a précis for the broader form of the guidelines. [5] Other researchers have been creating further versions for specific forms of qPCR that may require a supplementary or different set of items to check, including single-cell qPCR [6] and digital PCR (dPCR). [7] Appropriate adherence to the existing MIQE guidelines has also been overviewed in other scientific areas, including photobiomodulation [8] and clinical biomarkers. [9]
It was noted by Bustin in 2014 (and again by him in 2017) that there was some amount of uptake and usage of the MIQE guidelines within the scientific community, but there were still far too many published papers with qPCR experiments that lacked even the most basic of data presentation and proper confirmation of effectiveness for said data. These studies retained major reproducibility issues, where the conclusions of their evidence could not be replicated by other researchers, throwing the initial results into doubt. All of this was despite many papers directly citing Bustin's original MIQE publication, but not following through on the guideline checklist of material in their own experiments. [2] [10] However, some researchers have pointed out at least some success, with a number of papers being rejected by academic journals for publication due to failing to pass MIQE checklists. Other studies have been retracted after the fact once their lack of proper data to pass the MIQE guidelines was noted and publicly pointed out to the journal editors. [11]
When setting up their new comparative qPCR systems titled "Dots in Boxes" in 2017, New England Biolabs stated that they had designed the data collection portion around the MIQE guidelines so that the data fit all the minimum parameter checklists in the protocols. [12] Other scientific instrument companies have assisted in guideline compliance by purposefully tailoring their devices for them, including Bio-Rad creating a mobile app that allows for active marking of the MIQE checklist as each step is completed. [13]
An overview of the 10th anniversary since the publication of the MIQE guidelines was conducted in June 2020 and discussed the scientific studies that had produced better and more organized results when following the guidelines. [14] In August 2020, an updated version of the guidelines for the digital PCR method was published to account for improvement in machinery, technologies, and techniques since the original 2013 release. Additional guideline steps were added for data analysis, while also providing a more simplified checklist table for researchers to use. [15] An RT-qPCR targeting assay was developed alongside Stephen Bustin using the MIQE guidelines for clinical biomarkers in December 2020 in order to identify the clinical presence of COVID-19 viral particles during the COVID-19 pandemic. [16]
The MIQE guidelines are split up into 9 different sections that make up the checklist. These include not only considerations for doing the qPCR itself, but also how the resulting data is collected, analyzed, and presented. An important part of the latter is including information relating to the analysis software used and also submitting the raw data to the relevant databases. [1]
Large portions of the guidelines include basic actions that would normally be included in experiments and publications regardless, such as an item for describing the experimental and control group differences. Other such information includes how many individual units are used in each group in the experiment. These two pieces are defined as essential for any study. This section also includes two desirable points, which are pointing out whether the author's laboratory itself or a core laboratory of the university or organization conducted the qPCR assay and an acknowledgement of any other individuals that contributed to the work. [1]
The essential requirements that samples and sample material must meet includes a description of the sample, what form of dissection was used, what processing method was done, whether the samples were frozen or fixed and how long did it take, and what sample conditions were used. It is also desirable to know the volume or mass of the sample that was processed for the qPCR. [1]
For the process of extracting the DNA/RNA, there are a number of essential guidelines. This includes a description of the extraction process done, a statement on what DNA extraction kit was used and any changes made to the directions, details on whether any DNase or RNase treatment was used, a statement on whether any contamination was assessed, a quantification of the amount of genetic material extracted, a description of the instruments used for the extraction, the methods used to retain RNA integrity, a statement on the RNA integrity number and quality indicator and the quantification cycle (Cq) reached, and lastly what testing was done to determine the presence or absence of inhibitors. Four desired pieces of information are where the reagents used were obtained from, what level of genetic purity was obtained, what yield was obtained, and an electrophoresis gel image for confirmation. [1]
The primary essential parts for this phase include detailing the reaction conditions in full, giving both the amount of RNA used and the total volume of the reaction, give information on the oligonucleotide used as a primer and its concentration, the concentration and type of reverse transcriptase used, and lastly the temperature and amount of time done for the reaction. It is also desirable to have the catalog numbers of reagents used and their manufacturers, the standard deviation for the Cq with and without the transcriptase being involved, and how the cDNA was stored. [1]
All of the basic information regarding the target is necessary here, including the gene symbol, the accession database number for the sequence in question, the length of the sequence being amplified, information about the specificity screen used such as BLAST, what splicing variants exist for the sequence, and where the exon or intron for each primer is. There are several desired, but not required information pieces for this section, such as the location of the amplicon, whether any pseudogenes or homologs exist, whether a sequence alignment was done and the data obtained from it, and any data on the secondary structure of the amplified sequence. [1]
Creation of the oligonucleotides requires only two pieces of essential information: the primer sequences used and the location and details of any modifications made to the sequence. But there are several desirable pieces of data, including the identification number from the RTPrimerDB database, the sequences from the probes, the manufacturer used to make the oligos, and how they were purified. [1]
As one of the primary segments of the guidelines, there are several essential parts on the checklist for the qPCR process itself. This includes the full set of conditions used for the reaction, the volume of both the reaction and the cDNA, the concentrations for the probes, magnesium ions, and dNTPs, what kind of polymerase was used and its concentration, what kit was used and its manufacturer, what additives to the reaction were used, who manufactured the qPCR machine, and what parameters were set for the thermocycling process. The only additional desired pieces of information are the chemical composition of the buffer used, who manufactured the plates and tubes used and what their catalog number is, and whether the reaction was set up manually or by a machine. [1] [17]
In order to confirm the effectiveness and quality of the qPCR process that was performed, there are several actions and subsequent data that must be presented. This includes explaining the specific method of checking that the process functioned, such as using a gel, direct sequencing of the genetic material, showing a melt profile, or from digestion by restriction enzyme. If SYBR Green I was used, then the Cq of the control group with no template DNA must be given. Further essential data includes the calibration of the machine curves with the slope and y intercept noted, the efficiency of the PCR process as determined from the aforementioned slope, the correlation coefficients (r squared) for the calibration curves, the dynamic range of the linear curves, the Cq found at the lowest concentration where 95% of the results were still positive (LOD) along with the evidence for the LOD itself, and lastly if a multiplex is used, then the efficiency and LOD must be given for each assay done. [1] [17]
The extra desired information includes evidence given that qPCR optimization occurred by the use of gradients, the confidence intervals to show efficiency of the qPCR, and the confidence intervals for the entire range tested. [1]
The final section of the guidelines involves information on how the analysis of the qPCR data was done. The essential parts of that include the program and program version used for the analysis, the method for how the Cq was determined, figuring out the outlier points in the data and how they are used or excluded and why, what results were found for the controls with no template genetic material, an explanation for why the reference genes used were chosen and why the number of them was chosen, the method used to normalize the data, how many technical replicates were included, how repeatable was the data within the assays, what methods were used to determine significance of the results, and what software was used for this part of the qualitative analysis. [1]
It is also desired to include information on the number of biological replicates and whether they matched the results from the technical replicates, the reproducibility data for the concentration variants, data on the power analysis, and lastly for the researchers to submit the raw data in the RDML file format. [1] [18]
Biostatistics is a branch of statistics that applies statistical methods to a wide range of topics in biology. It encompasses the design of biological experiments, the collection and analysis of data from those experiments and the interpretation of the results.
Molecular biology is a branch of biology that seeks to understand the molecular basis of biological activity in and between cells, including biomolecular synthesis, modification, mechanisms, and interactions.
The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA sufficiently to enable detailed study. PCR was invented in 1983 by American biochemist Kary Mullis at Cetus Corporation. Mullis and biochemist Michael Smith, who had developed other essential ways of manipulating DNA, were jointly awarded the Nobel Prize in Chemistry in 1993.
Reverse transcription polymerase chain reaction (RT-PCR) is a laboratory technique combining reverse transcription of RNA into DNA and amplification of specific DNA targets using polymerase chain reaction (PCR). It is primarily used to measure the amount of a specific RNA. This is achieved by monitoring the amplification reaction using fluorescence, a technique called real-time PCR or quantitative PCR (qPCR). Confusion can arise because some authors use the acronym RT-PCR to denote real-time PCR. In this article, RT-PCR will denote Reverse Transcription PCR. Combined RT-PCR and qPCR are routinely used for analysis of gene expression and quantification of viral RNA in research and clinical settings.
In a blind or blinded experiment, information which may influence the participants of the experiment is withheld until after the experiment is complete. Good blinding can reduce or eliminate experimental biases that arise from a participants' expectations, observer's effect on the participants, observer bias, confirmation bias, and other sources. A blind can be imposed on any participant of an experiment, including subjects, researchers, technicians, data analysts, and evaluators. In some cases, while blinding would be useful, it is impossible or unethical. For example, it is not possible to blind a patient to their treatment in a physical therapy intervention. A good clinical protocol ensures that blinding is as effective as possible within ethical and practical constraints.
Antibiotic sensitivity testing or antibiotic susceptibility testing is the measurement of the susceptibility of bacteria to antibiotics. It is used because bacteria may have resistance to some antibiotics. Sensitivity testing results can allow a clinician to change the choice of antibiotics from empiric therapy, which is when an antibiotic is selected based on clinical suspicion about the site of an infection and common causative bacteria, to directed therapy, in which the choice of antibiotic is based on knowledge of the organism and its sensitivities.
A real-time polymerase chain reaction is a laboratory technique of molecular biology based on the polymerase chain reaction (PCR). It monitors the amplification of a targeted DNA molecule during the PCR, not at its end, as in conventional PCR. Real-time PCR can be used quantitatively and semi-quantitatively.
An RNA spike-in is an RNA transcript of known sequence and quantity used to calibrate measurements in RNA hybridization assays, such as DNA microarray experiments, RT-qPCR, and RNA-Seq.
TaqMan probes are hydrolysis probes that are designed to increase the specificity of quantitative PCR. The method was first reported in 1991 by researcher Kary Mullis at Cetus Corporation, and the technology was subsequently developed by Hoffmann-La Roche for diagnostic assays and by Applied Biosystems for research applications.
Digital polymerase chain reaction is a biotechnological refinement of conventional polymerase chain reaction methods that can be used to directly quantify and clonally amplify nucleic acids strands including DNA, cDNA, or RNA. The key difference between dPCR and qPCR lies in the method of measuring nucleic acids amounts, with the former being a more precise method than PCR, though also more prone to error in the hands of inexperienced users. PCR carries out one reaction per single sample. dPCR also carries out a single reaction within a sample, however the sample is separated into a large number of partitions and the reaction is carried out in each partition individually. This separation allows a more reliable collection and sensitive measurement of nucleic acid amounts. The method has been demonstrated as useful for studying variations in gene sequences — such as copy number variants and point mutations.
The RNA integrity number (RIN) is an algorithm for assigning integrity values to RNA measurements.
A nucleic acid test (NAT) is a technique used to detect a particular nucleic acid sequence and thus usually to detect and identify a particular species or subspecies of organism, often a virus or bacterium that acts as a pathogen in blood, tissue, urine, etc. NATs differ from other tests in that they detect genetic materials rather than antigens or antibodies. Detection of genetic materials allows an early diagnosis of a disease because the detection of antigens and/or antibodies requires time for them to start appearing in the bloodstream. Since the amount of a certain genetic material is usually very small, many NATs include a step that amplifies the genetic material—that is, makes many copies of it. Such NATs are called nucleic acid amplification tests (NAATs). There are several ways of amplification, including polymerase chain reaction (PCR), strand displacement assay (SDA), or transcription mediated assay (TMA).
Multiplex polymerase chain reaction refers to the use of polymerase chain reaction to amplify several different DNA sequences simultaneously. This process amplifies DNA in samples using multiple primers and a temperature-mediated DNA polymerase in a thermal cycler. The primer design for all primers pairs has to be optimized so that all primer pairs can work at the same annealing temperature during PCR.
Virus quantification is counting or calculating the number of virus particles (virions) in a sample to determine the virus concentration. It is used in both research and development (R&D) in academic and commercial laboratories as well as in production situations where the quantity of virus at various steps is an important variable that must be monitored. For example, the production of virus-based vaccines, recombinant proteins using viral vectors, and viral antigens all require virus quantification to continually monitor and/or modify the process in order to optimize product quality and production yields and to respond to ever changing demands and applications. Other examples of specific instances where viruses need to be quantified include clone screening, multiplicity of infection (MOI) optimization, and adaptation of methods to cell culture.
COLD-PCR is a modified polymerase chain reaction (PCR) protocol that enriches variant alleles from a mixture of wildtype and mutation-containing DNA. The ability to preferentially amplify and identify minority alleles and low-level somatic DNA mutations in the presence of excess wildtype alleles is useful for the detection of mutations. Detection of mutations is important in the case of early cancer detection from tissue biopsies and body fluids such as blood plasma or serum, assessment of residual disease after surgery or chemotherapy, disease staging and molecular profiling for prognosis or tailoring therapy to individual patients, and monitoring of therapy outcome and cancer remission or relapse. Common PCR will amplify both the major (wildtype) and minor (mutant) alleles with the same efficiency, occluding the ability to easily detect the presence of low-level mutations. The capacity to detect a mutation in a mixture of variant/wildtype DNA is valuable because this mixture of variant DNAs can occur when provided with a heterogeneous sample – as is often the case with cancer biopsies. Currently, traditional PCR is used in tandem with a number of different downstream assays for genotyping or the detection of somatic mutations. These can include the use of amplified DNA for RFLP analysis, MALDI-TOF genotyping, or direct sequencing for detection of mutations by Sanger sequencing or pyrosequencing. Replacing traditional PCR with COLD-PCR for these downstream assays will increase the reliability in detecting mutations from mixed samples, including tumors and body fluids.
A quantitative PCR instrument is a machine that amplifies and detects DNA. It combines the functions of a thermal cycler and a fluorimeter, enabling the process of quantitative PCR.
Stephen Andrew Bustin is a British scientist, former professor of molecular sciences at Queen Mary University of London from 2004 to 2012, as well as visiting professor at Middlesex University, beginning in 2006. In 2012 he was appointed Professor of Allied Health and Medicine at Anglia Ruskin University. He is known for his research into polymerase chain reaction, and has written a book on the topic, entitled A-Z of Quantitative PCR. This book has been called "the bible of qPCR."
Reverse transcription loop-mediated isothermal amplification (RT-LAMP) is a one step nucleic acid amplification method to multiply specific sequences of RNA. It is used to diagnose infectious disease caused by RNA viruses.
Minimum information standards are sets of guidelines and formats for reporting data derived by specific high-throughput methods. Their purpose is to ensure the data generated by these methods can be easily verified, analysed and interpreted by the wider scientific community. Ultimately, they facilitate the transfer of data from journal articles into databases in a form that enables data to be mined across multiple data sets. Minimal information standards are available for a vast variety of experiment types including microarray (MIAME), RNAseq (MINSEQE), metabolomics (MSI) and proteomics (MIAPE).
COVID-19 testing involves analyzing samples to assess the current or past presence of SARS-CoV-2, the virus that cases COVID-19 and is responsible for the COVID-19 pandemic. The two main types of tests detect either the presence of the virus or antibodies produced in response to infection. Molecular tests for viral presence through its molecular components are used to diagnose individual cases and to allow public health authorities to trace and contain outbreaks. Antibody tests instead show whether someone once had the disease. They are less useful for diagnosing current infections because antibodies may not develop for weeks after infection. It is used to assess disease prevalence, which aids the estimation of the infection fatality rate.