Native chemical ligation

Last updated

Native Chemical Ligation (NCL) is an important extension of the chemical ligation concept for constructing a larger polypeptide chain by the covalent condensation of two or more unprotected peptides segments. [1] Native chemical ligation is the most effective method for synthesizing native or modified proteins of typical size (i.e., proteins< ~300 AA). [2]

Contents

Reaction

In native chemical ligation, the ionized thiol group of an N-terminal cysteine residue of an unprotected peptide attacks the C-terminal thioester of a second unprotected peptide, in an aqueous buffer at pH 7.0 and room temperature. This transthioesterification step is reversible in the presence of an aryl thiol catalyst, rendering the reaction both chemoselective and regioselective, and leads to formation of a thioester-linked intermediate. The intermediate rapidly and spontaneously rearranges by an intramolecular S,N-acyl shift that results in the formation of a native amide ('peptide') bond at the ligation site (scheme 1).

Scheme 1: The two-step mechanism of native chemical ligation. NCL mechanism.pdf
Scheme 1: The two-step mechanism of native chemical ligation.

Remarks :

The initial transthioesterification step of the native chemical ligation reaction is catalyzed by thiol additives. The most effective and commonly used thiol catalyst is 4-mercaptophenylacetic acid (MPAA), (ref).

The key feature of native chemical ligation of unprotected peptides is the reversibility of the first step, the thiol(ate)–thioester exchange reaction. Native chemical ligation is exquisitely regioselective because that thiol(ate)–thioester exchange step is freely reversible in the presence of an added arylthiol catalyst. The high yields of final ligation product obtained, even in the presence of internal Cys residues in either/both segments, is the result of the irreversibility of the second (S-to-N acyl shift) amide-forming step under the reaction conditions used.

No side-products are formed from reaction with the other functional groups present in either peptide segment (e.g. Asp, Glu side chain carboxylic acids; Lys epsilon amino group; Tyr phenolic hydroxyl; Ser, Thr hydroxyls, etc.).

Historical context

In 1992, Stephen Kent and Martina Schnölzer at The Scripps Research Institute developed the "Chemical Ligation" concept, the first practical method to covalently condense unprotected peptide segments; the key feature of chemical ligation is formation of an unnatural bond at the ligation site. Just two years later in 1994, Philip Dawson, Tom Muir and Stephen Kent reported "Native Chemical Ligation", an extension of the chemical ligation concept to the formation of a native amide ('peptide') bond after initial nucleophilic condensation formed a thioester-linked condensation product designed to spontaneously rearrange to the native amide bond at the ligation site.

Theodor Wieland and coworkers had reported the S-to-N acyl shift as early as 1953, when the reaction of valine-thioester and cysteine amino acid in aqueous buffer was shown to yield the dipeptide valine-cysteine. [3] The reaction proceeded through the intermediacy of a thioester containing the sulfur of the cysteine residue. However, Wieland's work did NOT lead to the development of the native chemical ligation reaction. Rather, the study of amino acid thioester reactions led Wieland and others to develop the 'active ester' method for the synthesis of protected peptide segments by conventional chemical methods carried out in organic solvents.

Features

Native chemical ligation forms the basis of modern chemical protein synthesis, and has been used to prepare numerous proteins and enzymes by total chemical synthesis. The payoff in the native chemical ligation method is that coupling long peptides by this technique is typically near quantitative and provides synthetic access to large peptides and proteins otherwise impossible to make, due to their large size, decoration by post-translational modification, and containing non-coded amino acid or other chemical building blocks.

Native chemical ligation is inherently 'Green' in its atom economy and its use of benign solvents. It involves the reaction of an unprotected peptide thioester with a second, unprotected peptide that has an N-terminal cysteine residue. It is carried out in aqueous solution at neutral pH, usually in 6 M guanidine.hydrochloride, in the presence of an arylthiol catalyst and typically gives near-quantitative yields of the desired ligation product.

Peptide-thioesters can be directly prepared by Boc chemistry SPPS; however, thioester-containing peptides are not stable to treatment with a nucleophilic base, thus preventing direct synthesis of peptide thioesters by Fmoc chemistry SPPS. Fmoc chemistry solid phase peptide synthesis techniques for generating peptide-thioesters are based on the synthesis of peptide hydrazides that are converted to peptide thioesters post-synthetically.

Polypeptide C-terminal thioesters can also be produced in situ, using so-called N,S-acyl shift systems. Bis(2-sulfanylethyl)amido group, also called SEA group, belongs to this family. Polypeptide C-terminal bis(2-sulfanylethyl)amides (SEA peptide segments) react with Cys peptide to give a native peptide bond as in NCL. This reaction, which is called SEA Native Peptide Ligation, is a useful variant of native chemical ligation. [4]

In making peptide segments that contain an N-terminal cysteine residue, exposure to ketones should be avoided since these may cap the N-terminal cysteine. Do not use protecting groups that release aldehydes or ketones. For the same reason, the use of acetone should be avoided, particularly in washing glassware used for lyophilization.

A feature of the native chemical ligation technique is that the product polypeptide chain contains cysteine at the site of ligation. The cysteine at the ligation site can be desulfurized to alanine, thus extending the range of possible ligation sites to include alanine residues. Other beta-thiol containing amino acids can be used for native chemical ligation, followed by desulfurization. Alternatively, thiol-containing ligation auxiliaries can be used that mimic an N-terminal cysteine for the ligation reaction, but which can be removed after synthesis. The use of thiol-containing auxiliaries may not be as effective as ligation at a Cys residue. Native chemical ligation can also be performed with an N-terminal selenocysteine residue. [5]

Polypeptide C-terminal thioesters produced by recombinant DNA techniques can be reacted with an N-terminal Cys containing polypeptide by the same native ligation chemistry to provide very large semi-synthetic proteins. Native chemical ligation of this kind using a recombinant polypeptide segment is known as Expressed Protein Ligation. Similarly, a recombinant protein containing an N-terminal Cys can be reacted with a synthetic polypeptide thioester. Thus, native chemical ligation can be used to introduce chemically synthesized segments into recombinant proteins, regardless of size.

See also

Related Research Articles

<span class="mw-page-title-main">Protein primary structure</span> Linear sequence of amino acids in a peptide or protein

Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.

<span class="mw-page-title-main">Cysteine</span> Proteinogenic amino acid

Cysteine is a semiessential proteinogenic amino acid with the formula HOOC−CH(−NH2)−CH2−SH. The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. Cysteine is chiral, with only L-cysteine being found in nature.

<span class="mw-page-title-main">Post-translational modification</span> Biological processes

Post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes translating mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.

Nonribosomal peptides (NRP) are a class of peptide secondary metabolites, usually produced by microorganisms like bacteria and fungi. Nonribosomal peptides are also found in higher organisms, such as nudibranchs, but are thought to be made by bacteria inside these organisms. While there exist a wide range of peptides that are not synthesized by ribosomes, the term nonribosomal peptide typically refers to a very specific set of these as discussed in this article.

<span class="mw-page-title-main">Peptide synthesis</span> Production of peptides

In organic chemistry, peptide synthesis is the production of peptides, compounds where multiple amino acids are linked via amide bonds, also known as peptide bonds. Peptides are chemically synthesized by the condensation reaction of the carboxyl group of one amino acid to the amino group of another. Protecting group strategies are usually necessary to prevent undesirable side reactions with the various amino acid side chains. Chemical peptide synthesis most commonly starts at the carboxyl end of the peptide (C-terminus), and proceeds toward the amino-terminus (N-terminus). Protein biosynthesis in living organisms occurs in the opposite direction.

<span class="mw-page-title-main">Carboxypeptidase</span>

A carboxypeptidase is a protease enzyme that hydrolyzes (cleaves) a peptide bond at the carboxy-terminal (C-terminal) end of a protein or peptide. This is in contrast to an aminopeptidases, which cleave peptide bonds at the N-terminus of proteins. Humans, animals, bacteria and plants contain several types of carboxypeptidases that have diverse functions ranging from catabolism to protein maturation. At least two mechanisms have been discussed.

<span class="mw-page-title-main">Cyclic peptide</span> Peptide chains which contain a circular sequence of bonds

Cyclic peptides are polypeptide chains which contain a circular sequence of bonds. This can be through a connection between the amino and carboxyl ends of the peptide, for example in cyclosporin; a connection between the amino end and a side chain, for example in bacitracin; the carboxyl end and a side chain, for example in colistin; or two side chains or more complicated arrangements, for example in alpha-amanitin. Many cyclic peptides have been discovered in nature and many others have been synthesized in the laboratory. Their length ranges from just two amino acid residues to hundreds. In nature they are frequently antimicrobial or toxic; in medicine they have various applications, for example as antibiotics and immunosuppressive agents. Thin-Layer Chromatography (TLC) is a convenient method to detect cyclic peptides in crude extract from bio-mass.

Chemical ligation is the chemoselective condensation of unprotected peptide segments enabled by the formation of a non-native bond at the ligation site.

A peptide library is a tool for studying proteins. Peptide libraries typically contain a large number of peptides that have a systematic combination of amino acids. Usually, the peptide library is synthesized on a solid phase, mostly on resin, which can be made as a flat surface or beads. The peptide library is a popular tool for drug design, protein–protein interactions, and other biochemical and pharmaceutical applications.

<span class="mw-page-title-main">Isopeptide bond</span>

An isopeptide bond is a type of amide bond formed between a carboxyl group of one amino acid and an amino group of another. An isopeptide bond is the linkage between the side chain amino or carboxyl group of one amino acid to the α-carboxyl, α-amino group, or the side chain of another amino acid. In a typical peptide bond, also known as eupeptide bond, the amide bond always forms between the α-carboxyl group of one amino acid and the α-amino group of the second amino acid. Isopeptide bonds are rarer than regular peptide bonds. Isopeptide bonds lead to branching in the primary sequence of a protein. Proteins formed from normal peptide bonds typically have a linear primary sequence.

<span class="mw-page-title-main">Pseudoproline</span>

Pseudoproline derivatives are artificially created dipeptides to minimize aggregation during Fmoc solid-phase synthesis of peptides.

Bioconjugation is a chemical strategy to form a stable covalent link between two molecules, at least one of which is a biomolecule.

The Bergmann degradation is a series of chemical reactions designed to remove a single amino acid from the carboxylic acid (C-terminal) end of a peptide. First demonstrated by Max Bergmann in 1934, it is a rarely used method for sequencing peptides. The later developed Edman degradation is an improvement upon the Bergmann degradation, instead cleaving the N-terminal amino acid of peptides to produce a hydantoin containing the desired amino acid.

<span class="mw-page-title-main">Racemic crystallography</span>

Racemic crystallography is a technique used in structural biology where crystals of a protein molecule are developed from an equimolar mixture of an L-protein molecule of natural chirality and its D-protein mirror image. L-protein molecules consist of 'left-handed' L-amino acids and the achiral amino acid glycine, whereas the mirror image D-protein molecules consist of 'right-handed' D-amino acids and glycine. Typically, both the L-protein and the D-protein are prepared by total chemical synthesis.

Glycopeptides are peptides that contain carbohydrate moieties (glycans) covalently attached to the side chains of the amino acid residues that constitute the peptide.

Protein chemical synthesis by native peptide ligation of unprotected peptide segments is an interesting complement and potential alternative to the use of living systems for producing proteins. The synthesis of proteins requires efficient native peptide ligation methods, which enable the chemoselective formation of a native peptide bond in aqueous solution between unprotected peptide segments. The most frequently used technique for synthesizing proteins is Native chemical ligation (NCL). However, alternatives are emerging, one of which is SEA Native Peptide Ligation.

Ribosomally synthesized and post-translationally modified peptides (RiPPs), also known as ribosomal natural products, are a diverse class of natural products of ribosomal origin. Consisting of more than 20 sub-classes, RiPPs are produced by a variety of organisms, including prokaryotes, eukaryotes, and archaea, and they possess a wide range of biological functions.

<span class="mw-page-title-main">Nosiheptide</span> Chemical compound

Nosiheptide is a thiopeptide antibiotic produced by the bacterium Streptomyces actuosus.

The α-Ketoacid-Hydroxylamine (KAHA) Amide-Forming Ligation is a chemical reaction that is used to join two unprotected fragments in peptide synthesis. It is an alternative to the Native Chemical Ligation (NCL). KAHA Ligation was developed by Jeffrey W. Bode group at ETH Zürich.

Bradley Lether Pentelute is currently a professor of chemistry at the Massachusetts Institute of Technology (MIT). His research program lies at the intersection of chemistry and biology and develops bioconjugation strategies, cytosolic delivery platforms, and rapid flow synthesis technologies to optimize the production, achieve site-specific modification, enhance stability, and modulate function of a variety of bioactive agents. His laboratory successfully modified proteins via cysteine-containing “pi-clamps” made up of a short sequence of amino acids, and delivered large biomolecules, such as various proteins and drugs, into cells via the anthrax delivery vehicle. Pentelute has also made several key contributions to automated synthesis technologies in flow. These advances includes the invention of the world's fastest polypeptide synthesizer. This system is able to form amide bonds at a more efficient rate than standard commercial equipment and has helped in the process of understanding protein folding and its mechanisms. This automated flow technology was recently used to achieve total chemical synthesis of protein chains up to 164 amino acids in length that retained the structure and function of native variants obtained by recombinant expression. The primary goal of his endeavor is to use these processes to create designer biologics that can be used to treat diseases and solve the manufacturing problem for on-demand personalized therapies, such as cancer vaccines.

References

  1. Dawson, P. E.; Muir, T. W.; Clark-Lewis, I.; Kent, S. B. (1994). "Synthesis of proteins by native chemical ligation". Science. 266 (5186): 776–778. Bibcode:1994Sci...266..776D. doi:10.1126/science.7973629. PMID   7973629.
  2. Agouridas V, El Mahdi O, Diemer V, Cargoët M, Monbaliu JM, Melnyk O (June 2019). "Native Chemical Ligation and Extended Methods: Mechanisms, Catalysis, Scope, and Limitations". Chemical Reviews. 119 (12): 7328–7443. doi:10.1021/acs.chemrev.8b00712. PMID   31050890. S2CID   145023266.
  3. Wieland, T.; Bokelmann, E.; Bauer, L.; Lang, H. U.; Lau, H (1953). "Über Peptidsynthesen. 8. Mitteilung Bildung von S-haltigen Peptiden durch intramolekulare Wanderung von Aminoacylresten". Liebigs Ann. Chem. 583: 129–149. doi:10.1002/jlac.19535830110.
  4. Ollivier, N. Dheur, J. Mhidia, R. Blanpain, A. Melnyk, O. (2010). "Bis(2-sulfanylethyl)amino Native Peptide Ligation". Organic Letters. 12 (22): 5238–41. doi:10.1021/ol102273u. PMID   20964289.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  5. McGrath, N. A.; Raines, R. T. (2011). "Chemoselectivity in chemical biology: Acyl transfer reactions with sulfur and selenium". Acc. Chem. Res. 44 (9): 752–761. doi:10.1021/ar200081s. PMC   3242736 . PMID   21639109.

Further reading