Saturation mutagenesis

Last updated
Saturation mutagenesis of a single position in a theoretical 10-residue protein. The wild type version of the protein is shown at the top, with M representing the first amino acid methionine, and * representing the termination of translation. All 19 mutants at position 5 are shown below. Site saturation mutagenesis.svg
Saturation mutagenesis of a single position in a theoretical 10-residue protein. The wild type version of the protein is shown at the top, with M representing the first amino acid methionine, and * representing the termination of translation. All 19 mutants at position 5 are shown below.

Site saturation mutagenesis (SSM), or simply site saturation, is a random mutagenesis technique used in protein engineering, in which a single codon or set of codons is substituted with all possible amino acids at the position. [1] There are many variants of the site saturation technique, from paired site saturation (saturating two positions in every mutant in the library) to scanning site saturation (performing a site saturation at every site in the protein, resulting in a library of size [20 x (number of residues in the protein)] that contains every possible point mutant of the protein).

Contents

Method

Depiction of one common way to clone a site-directed mutagenesis library (i.e., using degenerate oligos). The gene of interest is PCRed with oligos that contain a region that is perfectly complementary to the template (blue), and one that differs from the template by one or more nucleotides (red). Many such primers containing degeneracy in the non-complementary region are pooled into the same PCR, resulting in many different PCR products with different mutations in that region (individual mutants shown with different colors below). Site-directed mutagenesis library cloning steps.pdf
Depiction of one common way to clone a site-directed mutagenesis library (i.e., using degenerate oligos). The gene of interest is PCRed with oligos that contain a region that is perfectly complementary to the template (blue), and one that differs from the template by one or more nucleotides (red). Many such primers containing degeneracy in the non-complementary region are pooled into the same PCR, resulting in many different PCR products with different mutations in that region (individual mutants shown with different colors below).

Saturation mutagenesis is commonly achieved by site-directed mutagenesis PCR with a randomised codon in the primers (e.g. SeSaM) [2] or by artificial gene synthesis, with a mixture of synthesis nucleotides used at the codons to be randomised. [3]

Different degenerate codons can be used to encode sets of amino acids. [1] Because some amino acids are encoded by more codons than others, the exact ratio of amino acids cannot be equal. Additionally, it is usual to use degenerate codons that minimise stop codons (which are generally not desired). Consequently, the fully randomised 'NNN' is not ideal, and alternative, more restricted degenerate codons are used. 'NNK' and 'NNS' have the benefit of encoding all 20 amino acids, but still encode a stop codon 3% of the time. Alternative codons such as 'NDT', 'DBK' avoid stop codons entirely, and encode a minimal set of amino acids that still encompass all the main biophysical types (anionic, cationic, aliphatic hydrophobic, aromatic hydrophobic, hydrophilic, small). [1] In the case there is no restriction to use a single degenerate codon only, it is possible to reduce the bias considerably. [4] [5] Several computational tools were developed to allow high level of control over the degenerate codons and their corresponding amino acids. [6] [7] [8]

Degenerate codon No. of codonsNo. of amino acidsNo. of stopsAmino acids encoded
NNN64203All 20
NNK / NNS32201All 20
NDT12120RNDCGHILFSYV
DBK18120ARCGILMFSTWV
NRT880RNDCGHSY

Applications

Saturation mutagenesis is commonly used to generate variants for directed evolution. [9] [10]

See also

Related Research Articles

Genetic code Rules by which information encoded within genetic material is translated into proteins.

The genetic code is the set of rules used by living cells to translate information encoded within genetic material into proteins. Translation is accomplished by the ribosome, which links proteinogenic amino acids in an order specified by messenger RNA (mRNA), using transfer RNA (tRNA) molecules to carry amino acids and to read the mRNA three nucleotides at a time. The genetic code is highly similar among all organisms and can be expressed in a simple table with 64 entries.

Selenocysteine Chemical compound

Selenocysteine is the 21st proteinogenic amino acid. Selenoproteins contain selenocysteine residues. Selenocysteine is an analogue of the more common cysteine with selenium in place of the sulfur.

Pyrrolysine Chemical compound

Pyrrolysine is an α-amino acid that is used in the biosynthesis of proteins in some methanogenic archaea and bacteria; it is not present in humans. It contains an α-amino group, a carboxylic acid group. Its pyrroline side-chain is similar to that of lysine in being basic and positively charged at neutral pH.

Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles. It is also a product and services market, with an estimated value of $168 billion by 2017.

Codon usage bias A genetic bias towards the preferential use of one of the redundant codons that encode the same amino acid over the others

Codon usage bias refers to differences in the frequency of occurrence of synonymous codons in coding DNA. A codon is a series of three nucleotides that encodes a specific amino acid residue in a polypeptide chain or for the termination of translation.

Site-directed mutagenesis is a molecular biology method that is used to make specific and intentional changes to the DNA sequence of a gene and any gene products. Also called site-specific mutagenesis or oligonucleotide-directed mutagenesis, it is used for investigating the structure and biological activity of DNA, RNA, and protein molecules, and for protein engineering.

Nirenberg and Matthaei experiment

The Nirenberg and Matthaei experiment was a scientific experiment performed in May 1961 by Marshall W. Nirenberg and his post-doctoral fellow, J. Heinrich Matthaei, at the National Institutes of Health (NIH). The experiment deciphered the first of the 64 triplet codons in the genetic code by using nucleic acid homopolymers to translate specific amino acids.

Xenobiology (XB) is a subfield of synthetic biology, the study of synthesizing and manipulating biological devices and systems. The name "xenobiology" derives from the Greek word xenos, which means "stranger, alien". Xenobiology is a form of biology that is not (yet) familiar to science and is not found in nature. In practice, it describes novel biological systems and biochemistries that differ from the canonical DNA–RNA-20 amino acid system. For example, instead of DNA or RNA, XB explores nucleic acid analogues, termed xeno nucleic acid (XNA) as information carriers. It also focuses on an expanded genetic code and the incorporation of non-proteinogenic amino acids into proteins.

Chemical biology is a scientific discipline spanning the fields of chemistry and biology. The discipline involves the application of chemical techniques, analysis, and often small molecules produced through synthetic chemistry, to the study and manipulation of biological systems. In contrast to biochemistry, which involves the study of the chemistry of biomolecules and regulation of biochemical pathways within and between cells, chemical biology deals with chemistry applied to biology.

A synonymous substitution is the evolutionary substitution of one base for another in an exon of a gene coding for a protein, such that the produced amino acid sequence is not modified. This is possible because the genetic code is "degenerate", meaning that some amino acids are coded for by more than one three-base-pair codon; since some of the codons for a given amino acid differ by just one base pair from others coding for the same amino acid, a mutation that replaces the "normal" base by one of the alternatives will result in incorporation of the same amino acid into the growing polypeptide chain when the gene is translated. Synonymous substitutions and mutations affecting noncoding DNA are often considered silent mutations; however, it is not always the case that the mutation is silent.

Start codon First codon of a messenger RNA transcript translated by a ribosome

The start codon is the first codon of a messenger RNA (mRNA) transcript translated by a ribosome. The start codon always codes for methionine in eukaryotes and Archaea and a N-formylmethionine (fMet) in bacteria, mitochondria and plastids. The most common start codon is AUG.

Directed evolution Method used in protein engineering that mimics the process of natural selection to steer proteins or nucleic acids toward a user-defined goal

Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to steer proteins or nucleic acids toward a user-defined goal. It consists of subjecting a gene to iterative rounds of mutagenesis, selection and amplification. It can be performed in vivo, or in vitro. Directed evolution is used both for protein engineering as an alternative to rationally designing modified proteins, as well as for experimental evolution studies of fundamental evolutionary principles in a controlled, laboratory environment.

A nonsense suppressor is a factor which can inhibit the effect of the nonsense mutation. Nonsense suppressors can be generally divided into two classes: a) a mutated tRNA which can bind with a termination codon on mRNA; b) a mutation on ribosomes decreasing the effect of a termination codon. It's believed that nonsense suppressors keep a low concentration in the cell and do not disrupt normal translation most of the time. In addition, many genes do not have only one termination codon, and cells commonly use ochre codons as the termination signal, whose nonsense suppressors are usually inefficient.

Expanded genetic code Modified genetic code

An expanded genetic code is an artificially modified genetic code in which one or more specific codons have been re-allocated to encode an amino acid that is not among the 22 common naturally-encoded proteinogenic amino acids.

Mutagenesis (molecular biology technique)

In molecular biology, mutagenesis is an important laboratory technique whereby DNA mutations are deliberately engineered to produce libraries of mutant genes, proteins, strains of bacteria, or other genetically modified organisms. The various constituents of a gene, as well as its regulatory elements and its gene products, may be mutated so that the functioning of a genetic locus, process, or product can be examined in detail. The mutation may produce mutant proteins with interesting properties or enhanced or novel functions that may be of commercial use. Mutant strains may also be produced that have practical application or allow the molecular basis of a particular cell function to be investigated.

Degeneracy or redundancy of codons is the redundancy of the genetic code, exhibited as the multiplicity of three-base pair codon combinations that specify an amino acid. The degeneracy of the genetic code is what accounts for the existence of synonymous mutations.

Cassette mutagenesis is a type of site-directed mutagenesis that uses a short, double-stranded oligonucleotide sequence to replace a fragment of target DNA. It uses complementary restriction enzyme digest ends on the target DNA and gene cassette to achieve specificity. It is different from methods that use single oligonucleotide in that a single gene cassette can contain multiple mutations. Unlike many site directed mutagenesis methods, cassette mutagenesis also does not involve primer extension by DNA polymerase.

Sequence saturation mutagenesis (SeSaM) is a chemo-enzymatic random mutagenesis method applied for the directed evolution of proteins and enzymes. It is one of the most common saturation mutagenesis techniques. In four PCR-based reaction steps, phosphorothioate nucleotides are inserted in the gene sequence, cleaved and the resulting fragments elongated by universal or degenerate nucleotides. These nucleotides are then replaced by standard nucleotides, allowing for a broad distribution of nucleic acid mutations spread over the gene sequence with a preference to transversions and with a unique focus on consecutive point mutations, both difficult to generate by other mutagenesis techniques. The technique was developed by Professor Ulrich Schwaneberg at Jacobs University Bremen and RWTH Aachen University.

SeSaM-Biotech GmbH

SeSaM-Biotech GmbH is a biotechnology service company founded in 2008 in Bremen and localized in Aachen today.

Genetic saturation is the result of multiple substitutions at the same site in a sequence, or identical substitutions in different sequences, such that the apparent sequence divergence rate is lower than the actual divergence that has occurred. When comparing two or more genetic sequences consisting of single nucleotides, differences in sequence observed are only differences in the final state of the nucleotide sequence. Single nucleotides that undergoing genetic saturation change multiple times, sometimes back to their original nucleotide or to a nucleotide common to the compared genetic sequence. Without genetic information from intermediate taxa, it is difficult to know how much, or if any saturation has occurred on an observed sequence. Genetic saturation occurs most rapidly on fast-evolving sequences, such as the hypervariable region of mitochondrial DNA, or in short tandem repeats such as on the Y-chromosome.

References

  1. 1 2 3 Reetz, M. T.; Carballeira J. D. (2007). "Iterative saturation mutagenesis (ISM) for rapid directed evolution of functional enzymes". Nature Protocols. 2 (4): 891–903. doi:10.1038/nprot.2007.72. PMID   17446890. S2CID   37361631.
  2. Zheng, Lei; Baumann, Ulrich; Reymond, Jean-Louis (2004-07-15). "An efficient one-step site-directed and site-saturation mutagenesis protocol". Nucleic Acids Research. 32 (14): e115. doi:10.1093/nar/gnh110. ISSN   0305-1048. PMC   514394 . PMID   15304544.
  3. Reetz, Manfred T.; Prasad, Shreenath; Carballeira, José D.; Gumulya, Yosephine; Bocola, Marco (2010-07-07). "Iterative Saturation Mutagenesis Accelerates Laboratory Evolution of Enzyme Stereoselectivity: Rigorous Comparison with Traditional Methods". Journal of the American Chemical Society. 132 (26): 9144–9152. doi:10.1021/ja1030479. ISSN   0002-7863. PMID   20536132.
  4. Kille, Sabrina; Acevedo-Rocha, Carlos G.; Parra, Loreto P.; Zhang, Zhi-Gang; Opperman, Diederik J.; Reetz, Manfred T.; Acevedo, Juan Pablo (2013-02-15). "Reducing Codon Redundancy and Screening Effort of Combinatorial Protein Libraries Created by Saturation Mutagenesis". ACS Synthetic Biology. 2 (2): 83–92. doi:10.1021/sb300037w. PMID   23656371.
  5. Tang, Lixia; Wang, Xiong; Ru, Beibei; Sun, Hengfei; Huang, Jian; Gao, Hui (June 2014). "MDC-Analyzer: a novel degenerate primer design tool for the construction of intelligent mutagenesis libraries with contiguous sites". BioTechniques. 56 (6): 301–302, 304, 306–308, passim. doi: 10.2144/000114177 . ISSN   1940-9818. PMID   24924390.
  6. Halweg-Edwards, Andrea L.; Pines, Gur; Winkler, James D.; Pines, Assaf; Gill, Ryan T. (September 16, 2016). "A Web Interface for Codon Compression". ACS Synthetic Biology. 5 (9): 1021–1023. doi:10.1021/acssynbio.6b00026. ISSN   2161-5063. PMID   27169595.
  7. Engqvist, Martin K. M.; Nielsen, Jens (2015-04-30). "ANT: Software for Generating and Evaluating Degenerate Codons for Natural and Expanded Genetic Codes". ACS Synthetic Biology. 4 (8): 935–938. doi:10.1021/acssynbio.5b00018. PMID   25901796.
  8. Kell, Douglas B.; Day, Philip J.; Breitling, Rainer; Green, Lucy; Currin, Andrew; Swainston, Neil (2017-07-10). "CodonGenie: optimised ambiguous codon design tools". PeerJ Computer Science. 3: e120. doi: 10.7717/peerj-cs.120 . ISSN   2376-5992.
  9. Chica, Robert A.; et al. (2005). "Semi-rational approaches to engineering enzyme activity: combining the benefits of directed evolution and rational design". Current Opinion in Biotechnology. 16 (4): 378–384. doi:10.1016/j.copbio.2005.06.004. PMID   15994074.
  10. Shivange, Amol V; Marienhagen, Jan; Mundhada, Hemanshu; Schenk, Alexander; Schwaneberg, Ulrich (2009-02-01). "Advances in generating functional diversity for directed protein evolution". Current Opinion in Chemical Biology. Biocatalysis and Biotransformation/Bioinorganic Chemistry. 13 (1): 19–25. doi:10.1016/j.cbpa.2009.01.019. PMID   19261539.