CDNA library

Last updated

A cDNA library is a combination of cloned cDNA (complementary DNA) fragments inserted into a collection of host cells, which constitute some portion of the transcriptome of the organism and are stored as a "library". cDNA is produced from fully transcribed mRNA found in the nucleus and therefore contains only the expressed genes of an organism. Similarly, tissue-specific cDNA libraries can be produced. In eukaryotic cells the mature mRNA is already spliced, hence the cDNA produced lacks introns and can be readily expressed in a bacterial cell. While information in cDNA libraries is a powerful and useful tool since gene products are easily identified, the libraries lack information about enhancers, introns, and other regulatory elements found in a genomic DNA library. [1]

Contents

cDNA Library Construction

Formation of a cDNA library. Formation of a cDNA Library.jpg
Formation of a cDNA library.

cDNA is created from a mature mRNA from a eukaryotic cell with the use of reverse transcriptase. In eukaryotes, a poly-(A) tail (consisting of a long sequence of adenine nucleotides) distinguishes mRNA from tRNA and rRNA and can therefore be used as a primer site for reverse transcription. This has the problem that not all transcripts, such as those for the histone, encode a poly-A tail. [2]

mRNA extraction

Firstly, mRNA template needs to be isolated for the creation of cDNA libraries. Since mRNA only contains exons, the integrity of the isolated mRNA should be considered so that the protein encoded can still be produced. Isolated mRNA should range from 500 bp to 8 kb. [3] Several methods exist for purifying RNA such as trizol extraction and column purification. Column purification can be done using oligomeric dT nucleotide coated resins, and features of mRNA such as having a poly-A tail can be exploited where only mRNA sequences containing said feature will bind. The desired mRNA bound to the column is then eluted.

cDNA construction

Once mRNA is purified, an oligo-dT primer (a short sequence of deoxy-thymidine nucleotides) is bound to the poly-A tail of the RNA. The primer is required to initiate DNA synthesis by the enzyme reverse transcriptase. This results in the creation of RNA-DNA hybrids where a single strand of complementary DNA is bound to a strand of mRNA. [4] To remove the mRNA, the RNAse H enzyme is used to cleave the backbone of the mRNA and generate free 3'-OH groups, which is important for the replacement of mRNA with DNA. [3] DNA polymerase I is then added, the cleaved RNA acts as a primer the DNA polymerase I can identify and initiate replacement of RNA nucleotides with those of DNA. [3] This is provided by the sscDNA itself by coiling on itself at the 3' end, generating a hairpin loop . The polymerase extends the 3'-OH end, and later the loop at 3' end is opened by the scissoring action of S1 nuclease. Restriction endonucleases and DNA ligase are then used to clone the sequences into bacterial plasmids.

The cloned bacteria are then selected, commonly through the use of antibiotic selection. Once selected, stocks of the bacteria are created which can later be grown and sequenced to compile the cDNA library.

cDNA Library uses

cDNA libraries are commonly used when reproducing eukaryotic genomes, as the amount of information is reduced to remove the large numbers of non-coding regions from the library. cDNA libraries are used to express eukaryotic genes in prokaryotes. Prokaryotes do not have introns in their DNA and therefore do not possess any enzymes that can cut it out during transcription process. cDNA does not have introns and therefore can be expressed in prokaryotic cells. cDNA libraries are most useful in reverse genetics where the additional genomic information is of less use. Additionally, cDNA libraries are frequently used in functional cloning to identify genes based on the encoded protein's function. When studying eukaryotic DNA, expression libraries are constructed using complementary DNA (cDNA) to help ensure the insert is truly a gene. [4]

cDNA Library vs. Genomic DNA Library

cDNA library lacks the non-coding and regulatory elements found in genomic DNA. Genomic DNA libraries provide more detailed information about the organism, but are more resource-intensive to generate and keep.

Cloning of cDNA

cDNA molecules can be cloned by using restriction site linkers. Linkers are short, double stranded pieces of DNA (oligodeoxyribonucleotide) about 8 to 12 nucleotide pairs long that include a restriction endonuclease cleavage site e.g. BamHI. Both the cDNA and the linker have blunt ends which can be ligated together using a high concentration of T4 DNA ligase. Then sticky ends are produced in the cDNA molecule by cleaving the cDNA ends (which now have linkers with an incorporated site) with the appropriate endonuclease. A cloning vector (plasmid) is then also cleaved with the appropriate endonuclease. Following "sticky end" ligation of the insert into the vector the resulting recombinant DNA molecule is transferred into E. coli host cell for cloning.

See also

Related Research Articles

<span class="mw-page-title-main">Complementary DNA</span> DNA reverse transcribed from RNA

In genetics, complementary DNA (cDNA) is DNA that was reverse transcribed from an RNA. cDNA exists in both single-stranded and double-stranded forms and in both natural and engineered forms.

<span class="mw-page-title-main">Messenger RNA</span> RNA that is read by the ribosome to produce a protein

In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.

<span class="mw-page-title-main">Primer (molecular biology)</span> Short strand of RNA or DNA that serves as a starting point for DNA synthesis

A primer is a short single-stranded nucleic acid used by all living organisms in the initiation of DNA synthesis. A synthetic primer may also be referred to as an oligo, short for oligonucleotide. DNA polymerase enzymes are only capable of adding nucleotides to the 3’-end of an existing nucleic acid, requiring a primer be bound to the template before DNA polymerase can begin a complementary strand. DNA polymerase adds nucleotides after binding to the RNA primer and synthesizes the whole strand. Later, the RNA strands must be removed accurately and replace them with DNA nucleotides forming a gap region known as a nick that is filled in using an enzyme called ligase. The removal process of the RNA primer requires several enzymes, such as Fen1, Lig1, and others that work in coordination with DNA polymerase, to ensure the removal of the RNA nucleotides and the addition of DNA nucleotides. Living organisms use solely RNA primers, while laboratory techniques in biochemistry and molecular biology that require in vitro DNA synthesis usually use DNA primers, since they are more temperature stable. Primers can be designed in laboratory for specific reactions such as polymerase chain reaction (PCR). When designing PCR primers, there are specific measures that must be taken into consideration, like the melting temperature of the primers and the annealing temperature of the reaction itself. Moreover, the DNA binding sequence of the primer in vitro has to be specifically chosen, which is done using a method called basic local alignment search tool (BLAST) that scans the DNA and finds specific and unique regions for the primer to bind.

<span class="mw-page-title-main">Protein biosynthesis</span> Assembly of proteins inside biological cells

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

<span class="mw-page-title-main">Reverse transcriptase</span> Enzyme which generates DNA

A reverse transcriptase (RT) is an enzyme used to generate complementary DNA (cDNA) from an RNA template, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, by retrotransposon mobile genetic elements to proliferate within the host genome, and by eukaryotic cells to extend the telomeres at the ends of their linear chromosomes. Contrary to a widely held belief, the process does not violate the flows of genetic information as described by the classical central dogma, as transfers of information from RNA to DNA are explicitly held possible.

<span class="mw-page-title-main">DNA synthesis</span>

DNA synthesis is the natural or artificial creation of deoxyribonucleic acid (DNA) molecules. DNA is a macromolecule made up of nucleotide units, which are linked by covalent bonds and hydrogen bonds, in a repeating structure. DNA synthesis occurs when these nucleotide units are joined to form DNA; this can occur artificially or naturally. Nucleotide units are made up of a nitrogenous base, pentose sugar (deoxyribose) and phosphate group. Each unit is joined when a covalent bond forms between its phosphate group and the pentose sugar of the next nucleotide, forming a sugar-phosphate backbone. DNA is a complementary, double stranded structure as specific base pairing occurs naturally when hydrogen bonds form between the nucleotide bases.

<span class="mw-page-title-main">Retrotransposon</span> Type of genetic component

Retrotransposons are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through the reverse transcription process using an RNA transposition intermediate.

<span class="mw-page-title-main">Primary transcript</span> RNA produced by transcription

A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs are modified in preparation for translation. For example, a precursor mRNA (pre-mRNA) is a type of primary transcript that becomes a messenger RNA (mRNA) after processing.

Rapid amplification of cDNA ends (RACE) is a technique used in molecular biology to obtain the full length sequence of an RNA transcript found within a cell. RACE results in the production of a cDNA copy of the RNA sequence of interest, produced through reverse transcription, followed by PCR amplification of the cDNA copies. The amplified cDNA copies are then sequenced and, if long enough, should map to a unique genomic region. RACE is commonly followed up by cloning before sequencing of what was originally individual RNA molecules. A more high-throughput alternative which is useful for identification of novel transcript structures, is to sequence the RACE-products by next generation sequencing technologies.

This is a list of topics in molecular biology. See also index of biochemistry articles.

<span class="mw-page-title-main">Serial analysis of gene expression</span> Molecular biology technique

Serial Analysis of Gene Expression (SAGE) is a transcriptomic technique used by molecular biologists to produce a snapshot of the messenger RNA population in a sample of interest in the form of small tags that correspond to fragments of those transcripts. Several variants have been developed since, most notably a more robust version, LongSAGE, RL-SAGE and the most recent SuperSAGE. Many of these have improved the technique with the capture of longer tags, enabling more confident identification of a source gene.

Exon shuffling is a molecular mechanism for the formation of new genes. It is a process through which two or more exons from different genes can be brought together ectopically, or the same exon can be duplicated, to create a new exon-intron structure. There are different mechanisms through which exon shuffling occurs: transposon mediated exon shuffling, crossover during sexual recombination of parental genomes and illegitimate recombination.

The versatility of polymerase chain reaction (PCR) has led to modifications of the basic protocol being used in a large number of variant techniques designed for various purposes. This article summarizes many of the most common variations currently or formerly used in molecular biology laboratories; familiarity with the fundamental premise by which PCR works and corresponding terms and concepts is necessary for understanding these variant techniques.

Numerous key discoveries in biology have emerged from studies of RNA, including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution and structural biology. As of 2010, 30 scientists have been awarded Nobel Prizes for experimental work that includes studies of RNA. Specific discoveries of high biological significance are discussed in this article.

<span class="mw-page-title-main">Illumina dye sequencing</span> DNA sequencing method

Illumina dye sequencing is a technique used to determine the series of base pairs in DNA, also known as DNA sequencing. The reversible terminated chemistry concept was invented by Bruno Canard and Simon Sarfati at the Pasteur Institute in Paris. It was developed by Shankar Balasubramanian and David Klenerman of Cambridge University, who subsequently founded Solexa, a company later acquired by Illumina. This sequencing method is based on reversible dye-terminators that enable the identification of single nucleotides as they are washed over DNA strands. It can also be used for whole-genome and region sequencing, transcriptome analysis, metagenomics, small RNA discovery, methylation profiling, and genome-wide protein-nucleic acid interaction analysis.

<span class="mw-page-title-main">In vitro recombination</span> Process of isolation and amplification of DNA segments

Recombinant DNA (rDNA), or molecular cloning, is the process by which a single gene, or segment of DNA, is isolated and amplified. Recombinant DNA is also known as in vitro recombination. A cloning vector is a DNA molecule that carries foreign DNA into a host cell, where it replicates, producing many copies of itself along with the foreign DNA. There are many types of cloning vectors such as plasmids and phages. In order to carry out recombination between vector and the foreign DNA, it is necessary the vector and DNA to be cloned by digestion, ligase the foreign DNA into the vector with the enzyme DNA ligase. And DNA is inserted by introducing the DNA into bacteria cells by transformation.

The extension Poly(A) Test (ePAT) describes a method to determine the poly(A) tail lengths of mRNA molecules. It was developed and described by A. Jänicke et al. in 2012.

<span class="mw-page-title-main">Retroviral ribonuclease H</span>

The retroviral ribonuclease H is a catalytic domain of the retroviral reverse transcriptase (RT) enzyme. The RT enzyme is used to generate complementary DNA (cDNA) from the retroviral RNA genome. This process is called reverse transcription. To complete this complex process, the retroviral RT enzymes need to adopt a multifunctional nature. They therefore possess 3 of the following biochemical activities: RNA-dependent DNA polymerase, ribonuclease H, and DNA-dependent DNA polymerase activities. Like all RNase H enzymes, the retroviral RNase H domain cleaves DNA/RNA duplexes and will not degrade DNA or unhybridized RNA.

G&T-seq is a novel form of single cell sequencing technique allowing one to simultaneously obtain both transcriptomic and genomic data from single cells, allowing for direct comparison of gene expression data to its corresponding genomic data in the same cell...

This glossary of cellular and molecular biology is a list of definitions of terms and concepts commonly used in the study of cell biology, molecular biology, and related disciplines, including genetics, biochemistry, and microbiology. It is split across two articles:

References

  1. "cDNA Libraries - an overview | ScienceDirect Topics". www.sciencedirect.com. Retrieved 2024-02-19.
  2. cDNA library || How cDNA library is constructed? || What are DNA libraries used for? , retrieved 2024-02-19
  3. 1 2 3 Ying, Shao-Yao (2004). "Complementary DNA Libraries: An Overview". Molecular Biotechnology. 27 (3): 245–252. doi:10.1385/MB:27:3:245. ISSN   1073-6085. PMID   15247497. S2CID   25600775.
  4. 1 2 P., Clark, David (2009). Biotechnology : applying the genetic revolution. Pazdernik, Nanette Jean. Amsterdam: Academic Press/Elsevier. ISBN   9780121755522. OCLC   226038060.{{cite book}}: CS1 maint: multiple names: authors list (link)