Primer walking

Last updated

Primer walking is a technique used to clone a gene (e.g., disease gene) from its known closest markers (e.g., known gene). As a result, it is employed in cloning and sequencing efforts in plants, fungi, and mammals with minor alterations. This technique, also known as "directed sequencing," employs a series of Sanger sequencing reactions to either confirm the reference sequence of a known plasmid or PCR product based on the reference sequence (sequence confirmation service) or to discover the unknown sequence of a full plasmid or PCR product by designing primers to sequence overlapping sections (sequence discovery service). [1]

Contents

Primer walking: a DNA sequencing method

Primer walking is a method to determine the sequence of DNA up to the 1.3–7.0 kb range whereas chromosome walking is used to produce the clones of already known sequences of the gene. [2] Too long fragments cannot be sequenced in a single sequence read using the chain termination method. This method works by dividing the long sequence into several consecutive short ones. The DNA of interest may be a plasmid insert, a PCR product or a fragment representing a gap when sequencing a genome. The term "primer walking" is used where the main aim is to sequence the genome. The term "chromosome walking" is used instead when the sequence is known but there is no clone of a gene. For example, the gene for a disease may be located near a specific marker such as an RFLP on the sequence. [3] Chromosome walking is a technique used to clone a gene (e.g., disease gene) from its known closest markers (e.g., known gene) and hence is used in moderate modifications in cloning and sequencing projects in plants, fungi, and animals. To put it another way, it's utilized to find, isolate, and clone a specific sequence existing near the gene to be mapped. Libraries of large fragments, mainly bacterial artificial chromosome libraries, are mostly used in genomic projects. To identify the desired colony and to select a particular clone the library is screened first with a desired probe. After screening, the clone is overlapped with the probe and overlapping fragments are mapped. These fragments are then used as a new probe (short DNA fragments obtained from the 3′ or 5′ ends of clones) to identify other clones. A library approximately consists of 96 clones and each clone contains a different insert. Probe one identifies λ1 and λ2 as it overlaps them . Probe two derived from λ2 clones is used to identify λ3, and so on. Orientation of the clones is determined by restriction mapping of the clones. Thus, new chromosomal regions present in the vicinity of a gene could be identified. Chromosome walking is time-consuming, and chromosome landing is the method of choice for gene identification. This method necessitates the discovery of a marker that is firmly related to the mutant locus. [4]

The fragment is first sequenced as if it were a shorter fragment. Sequencing is performed from each end using either universal primers or specifically designed ones. This should identify the first 1000 or so bases. In order to completely sequence the region of interest, design and synthesis of new primers (complementary to the final 20 bases of the known sequence) is necessary to obtain contiguous sequence information. [5]

Primer walking versus shotgun sequencing

Primer walking is an example of directed sequencing because the primer is designed from a known region of DNA to guide the sequencing in a specific direction. In contrast to directed sequencing, shotgun sequencing of DNA is a more rapid sequencing strategy. [6]

There is a technique from the "old time" of genome sequencing. The underlying method for sequencing is the Sanger chain termination method which can have read lengths between 100 and 1000 basepairs (depending on the instruments used). This means you have to break down longer DNA molecules, clone and subsequently sequence them. There are two methods possible. [7]

The first is called chromosome (or primer) walking and starts with sequencing the first piece. The next (contiguous) piece of the sequence is then sequenced using a primer which is complementary to the end of the first sequence read and so on. This technique doesn't require much assembling, but you need a lot of primers and it is relatively slow. [8]

To overcome this problem the shotgun sequencing method was developed. Here the DNA is broken into different pieces (not all broken at the same place), cloned and sequenced with primers specific for the vector used for cloning. This leads to overlapping sequences which then have to be assembled into one sequence on the computer. This method allows for the parallelization of the sequencing (you can prepare a lot of sequencing reactions at the same time and run them) which makes the process much faster and also avoids the need for sequence specific primers. The challenge is to organize sequences into their order, as overlaps are not as clear here. To resolve this problem, a first draft is made and then critical regions are resequenced using other techniques such as primer walking. [9]

Process

The overall process is as follows: A primer that matches the beginning of the DNA to sequence is used to synthesize a short DNA strand adjacent to the unknown sequence, starting with the primer (see PCR). The new short DNA strand is sequenced using the chain termination method. The end of the sequenced strand is used as a primer for the next part of the long DNA sequence, hence the term "walking".

The method can be used to sequence entire chromosomes (hence "chromosome walking"). [10] Primer walking was also the basis for the development of shotgun sequencing, which uses random primers instead of specifically chosen ones.

See also

Related Research Articles

<span class="mw-page-title-main">Polymerase chain reaction</span> Laboratory technique to multiply a DNA sample for study

The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists amplify a very small sample of DNA sufficiently to enable detailed study. PCR was invented in 1983 by American biochemist Kary Mullis at Cetus Corporation; Mullis and biochemist Michael Smith, who had developed other essential ways of manipulating DNA, were jointly awarded the Nobel Prize in Chemistry in 1993.

In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule.

In genetics, shotgun sequencing is a method used for sequencing random DNA strands. It is named by analogy with the rapidly expanding, quasi-random shot grouping of a shotgun.

A contig is a set of overlapping DNA segments that together represent a consensus region of DNA. In bottom-up sequencing projects, a contig refers to overlapping sequence data (reads); in top-down sequencing projects, contig refers to the overlapping clones that form a physical map of the genome that is used to guide sequencing and assembly. Contigs can thus refer both to overlapping DNA sequences and to overlapping physical segments (fragments) contained in clones depending on the context.

<span class="mw-page-title-main">Cloning vector</span> Small piece of maintainable DNA

A cloning vector is a small piece of DNA that can be stably maintained in an organism, and into which a foreign DNA fragment can be inserted for cloning purposes. The cloning vector may be DNA taken from a virus, the cell of a higher organism, or it may be the plasmid of a bacterium. The vector contains features that allow for the convenient insertion of a DNA fragment into the vector or its removal from the vector, for example through the presence of restriction sites. The vector and the foreign DNA may be treated with a restriction enzyme that cuts the DNA, and DNA fragments thus generated contain either blunt ends or overhangs known as sticky ends, and vector DNA and foreign DNA with compatible ends can then be joined by molecular ligation. After a DNA fragment has been cloned into a cloning vector, it may be further subcloned into another vector designed for more specific use.

Site-directed mutagenesis is a molecular biology method that is used to make specific and intentional mutating changes to the DNA sequence of a gene and any gene products. Also called site-specific mutagenesis or oligonucleotide-directed mutagenesis, it is used for investigating the structure and biological activity of DNA, RNA, and protein molecules, and for protein engineering.

Chromosome jumping is a tool of molecular biology that is used in the physical mapping of genomes. It is related to several other tools used for the same purpose, including chromosome walking.

This is a list of topics in molecular biology. See also index of biochemistry articles.

A DNA construct is an artificially-designed segment of DNA borne on a vector that can be used to incorporate genetic material into a target tissue or cell. A DNA construct contains a DNA insert, called a transgene, delivered via a transformation vector which allows the insert sequence to be replicated and/or expressed in the target cell. This gene can be cloned from a naturally occurring gene, or synthetically constructed. The vector can be delivered using physical, chemical or viral methods. Typically, the vectors used in DNA constructs contain an origin of replication, a multiple cloning site, and a selectable marker. Certain vectors can carry additional regulatory elements based on the expression system involved.

<span class="mw-page-title-main">Gene mapping</span> Process of locating specific genes

Gene mapping or genome mapping describes the methods used to identify the location of a gene on a chromosome and the distances between genes. Gene mapping can also describe the distances between different sites within a gene.

A genomic library is a collection of overlapping DNA fragments that together make up the total genomic DNA of a single organism. The DNA is stored in a population of identical vectors, each containing a different insert of DNA. In order to construct a genomic library, the organism's DNA is extracted from cells and then digested with a restriction enzyme to cut the DNA into fragments of a specific size. The fragments are then inserted into the vector using DNA ligase. Next, the vector DNA can be taken up by a host organism - commonly a population of Escherichia coli or yeast - with each cell containing only one vector molecule. Using a host cell to carry the vector allows for easy amplification and retrieval of specific clones from the library for analysis.

In the fields of bioinformatics and computational biology, Genome survey sequences (GSS) are nucleotide sequences similar to expressed sequence tags (ESTs) that the only difference is that most of them are genomic in origin, rather than mRNA.

<span class="mw-page-title-main">Artificial gene synthesis</span> Group of methods in synthetic biology

Artificial gene synthesis, or simply gene synthesis, refers to a group of methods that are used in synthetic biology to construct and assemble genes from nucleotides de novo. Unlike DNA synthesis in living cells, artificial gene synthesis does not require template DNA, allowing virtually any DNA sequence to be synthesized in the laboratory. It comprises two main steps, the first of which is solid-phase DNA synthesis, sometimes known as DNA printing. This produces oligonucleotide fragments that are generally under 200 base pairs. The second step then involves connecting these oligonucleotide fragments using various DNA assembly methods. Because artificial gene synthesis does not require template DNA, it is theoretically possible to make a completely synthetic DNA molecule with no limits on the nucleotide sequence or size.

<span class="mw-page-title-main">Functional cloning</span>

Functional cloning is a molecular cloning technique that relies on prior knowledge of the encoded protein’s sequence or function for gene identification. In this assay, a genomic or cDNA library is screened to identify the genetic sequence of a protein of interest. Expression cDNA libraries may be screened with antibodies specific for the protein of interest or may rely on selection via the protein function. Historically, the amino acid sequence of a protein was used to prepare degenerate oligonucleotides which were then probed against the library to identify the gene encoding the protein of interest. Once candidate clones carrying the gene of interest are identified, they are sequenced and their identity is confirmed. This method of cloning allows researchers to screen entire genomes without prior knowledge of the location of the gene or the genetic sequence.

The versatility of polymerase chain reaction (PCR) has led to modifications of the basic protocol being used in a large number of variant techniques designed for various purposes. This article summarizes many of the most common variations currently or formerly used in molecular biology laboratories; familiarity with the fundamental premise by which PCR works and corresponding terms and concepts is necessary for understanding these variant techniques.

Diversity Arrays Technology (DArT) is a high-throughput genetic marker technique that can detect allelic variations to provides comprehensive genome coverage without any DNA sequence information for genotyping and other genetic analysis. The general steps involve reducing the complexity of the genomic DNA with specific restriction enzymes, choosing diverse fragments to serve as representations for the parent genomes, amplify via polymerase chain reaction (PCR), insert fragments into a vector to be placed as probes within a microarray, then fluorescent targets from a reference sequence will be allowed to hybridize with probes and put through an imaging system. The objective is to identify and quantify various forms of DNA polymorphism within genomic DNA of sampled species.

<span class="mw-page-title-main">Jumping library</span>

Jumping libraries or junction-fragment libraries are collections of genomic DNA fragments generated by chromosome jumping. These libraries allow the analysis of large areas of the genome and overcome distance limitations in common cloning techniques. A jumping library clone is composed of two stretches of DNA that are usually located many kilobases away from each other. The stretch of DNA located between these two "ends" is deleted by a series of biochemical manipulations carried out at the start of this cloning technique.

No-SCAR genome editing is an editing method that is able to manipulate the Escherichia coli genome. The system relies on recombineering whereby DNA sequences are combined and manipulated through homologous recombination. No-SCAR is able to manipulate the E. coli genome without the use of the chromosomal markers detailed in previous recombineering methods. Instead, the λ-Red recombination system facilitates donor DNA integration while Cas9 cleaves double-stranded DNA to counter-select against wild-type cells. Although λ-Red and Cas9 genome editing are widely used technologies, the no-SCAR method is novel in combining the two functions; this technique is able to establish point mutations, gene deletions, and short sequence insertions in several genomic loci with increased efficiency and time sensitivity.

Physical map is a technique used in molecular biology to find the order and physical distance between DNA base pairs by DNA markers. It is one of the gene mapping techniques which can determine the sequence of DNA base pairs with high accuracy. Genetic mapping, another approach of gene mapping, can provide markers needed for the physical mapping. However, as the former deduces the relative gene position by recombination frequencies, it is less accurate than the latter.

<span class="mw-page-title-main">Vectorette PCR</span>

Vectorette PCR is a variation of polymerase chain reaction (PCR) designed in 1988. The original PCR was created and also patented during the 1980s. Vectorette PCR was first noted and described in an article in 1990 by John H. Riley and his team. Since then, multiple variants of PCR have been created. Vectorette PCR focuses on amplifying a specific sequence obtained from an internal sequence that is originally known until the fragment end. Multiple researches have taken this method as an opportunity to conduct experiments in order to uncover the potential uses that can be derived from Vectorette PCR.

References

  1. Company, Azenta Life Sciences. "Primer Walking FAQ". web.genewiz.com. Retrieved 2021-11-20.
  2. DNA viruses : a practical approach. Alan Cann. Oxford: New York. 2000. ISBN   0-585-48411-2. OCLC   53956473.{{cite book}}: CS1 maint: others (link)
  3. Cann, Alan (1999). DNA Viruses: A Practical Approach. Oxford University Press. ISBN   978-0-19-963718-8.
  4. Modern Applications of Plant Biotechnology in Pharmaceutical Sciences. 2015. doi:10.1016/c2014-0-02123-5. ISBN   9780128022214.
  5. Sterky, Fredrik; Lundeberg, Joakim (2000). "Sequence analysis of genes and genomes" (PDF). Journal of Biotechnology. 76 (1): 1–31. doi:10.1016/s0168-1656(99)00176-5. PMID   10784293.
  6. "ScienceDirect.com | Science, health and medical journals, full text articles and books". www.sciencedirect.com. Retrieved 2021-11-20.
  7. "genetics - Why do we use DNA sequencing methods such as shotgun?". Biology Stack Exchange. Retrieved 2021-11-20.
  8. "genetics - Why do we use DNA sequencing methods such as shotgun?". Biology Stack Exchange. Retrieved 2021-11-20.
  9. "genetics - Why do we use DNA sequencing methods such as shotgun?". Biology Stack Exchange. Retrieved 2021-11-20.
  10. Chinault, A. Craig; John Carbon (Feb 1979). "Overlap hybridization screening: Isolation and characterization of overlapping DNA fragments surrounding the leu2 gene on yeast chromosome III". Gene. 5 (2): 111–126. doi:10.1016/0378-1119(79)90097-0. PMID   376402.