Glycopeptide

Last updated

Glycopeptides are peptides that contain carbohydrate moieties (glycans) covalently attached to the side chains of the amino acid residues that constitute the peptide.

Contents

Over the past few decades it has been recognised that glycans on cell surface (attached to membrane proteins or lipids) and those bound to proteins (glycoproteins) play a critical role in biology. For example, these constructs have been shown to play important roles in fertilization, [1] the immune system, [2] brain development, [3] the endocrine system, [3] and inflammation. [3] [4] [5]

The synthesis of glycopeptides provides biological probes for researchers to elucidate glycan function in nature and products that have useful therapeutic and biotechnological applications.[ clarification needed ][ citation needed ]

Glycopeptide linkage variety

N-Linked glycans

N-Linked glycans derive their name from the fact that the glycan is attached to an asparagine (Asn, N) residue, and are amongst the most common linkages found in nature. Although the majority of N-linked glycans take the form GlcNAc-β-Asn [6] other less common structural linkages such as GlcNac-α-Asn [7] and Glc-Asn [8] have been observed. In addition to their function in protein folding and cellular attachment, the N-liked glycans of a protein can modulate the protein's function, in some cases acting as an on-off switch. [5]

GlcNAc-b-Asn GlcNac1.gif
GlcNAc-β-Asn

O-Linked glycans

O-Linked glycans are formed by a linkage between an amino acid hydroxyl side chain (usually from serine or threonine) with the glycan. The majority of O-linked glycans take the form GlcNac-β-Ser/Thr or GalNac-α-Ser/Thr. [6]

GlcNac-b-Ser GalNacSer.gif
GlcNac-β-Ser

C-Linked glycans

Of the three linkages the least common and least understood are C-linked glycans. The C-linkage refers to the covalent attachment of mannose to a tryptophan residue. An example of a C-linked glycan is α-mannosyl tryptophan. [9] [10]

Glycopeptide synthesis

Several methods have been reported in the literature for the synthesis of glycopeptides. Of these methods the most common strategies are listed below.

Solid phase peptide synthesis

Within solid phase peptide synthesis (SPPS) there exist two strategies for the synthesis of glycopeptides, linear and convergent assembly. Linear assembly relies on the synthesis of building blocks and then the use of SPPS to attach the building block together. An outline of this approach is illustrated below.

Scheme 1. Overview of the Linear Assembly Strategy GlycanSPPS.gif
Scheme 1. Overview of the Linear Assembly Strategy

Several methods exist for the synthesis of monosaccharide amino acid building block as illustrated below.

Scheme 2. a) Preparation of amino acid monosaccharide building block on resin b) Preparation of free amino acid monosaccharide building block Glycanbb.gif
Scheme 2. a) Preparation of amino acid monosaccharide building block on resin b) Preparation of free amino acid monosaccharide building block

Provided the monosaccharide amino acid building block is stable to peptide coupling conditions, amine deprotection conditions and resin cleavage. Linear assembly remains a popular strategy for the synthesis of glycopeptides with many examples in the literature. [13] [14] [15]

In the convergent assembly strategy a peptide chain and glycan residue are first synthesis separately. Then the glycan is glycosylated onto a specific residue of the peptide chain. This approach is not as popular as the linear strategy due to the poor reaction yields in the glycosylation step. [16]

Another strategy to produce glycopeptide libraries is using Glyco-SPOT synthesis technique. [17] The technique extends the existing method of SPOT synthesis. [18] In this method, libraries of glycopeptides are produced on a cellulose surface (e.g. filter paper) which acts as the solid phase. The glycopeptides are produced by spotting FMOC protected amino acids allowing the synthesis to be performed at microgram (nanomole) scale using very small amounts of glycoamino acids. The scale of this technique can be an advantage for creating libraries for screening by using less amounts of glycoamino acids per peptide. However to produce larger quantities of glycopeptides traditional resin-based solid phase techniques would be better.

Native chemical ligation

Native chemical ligation (NCL) is a convergent synthetic strategy based on the linear coupling of glycopeptide fragments. This technique makes use of the chemoselective reaction between a N-terminal cysteine residue on one peptide fragment with a thio-ester on the C-terminus of the other peptide fragment [19] as illustrated below.

Scheme 3 Mechanism of native chemical ligation Sulfur1.gif
Scheme 3 Mechanism of native chemical ligation

Unlike standard SPPS (which is limited to 50 amino acid residue) NCL allows the construction of large glycopeptides. However the strategy is limited by the fact that it requires a cysteine residue at N-terminus, an amino acid residue that is rare in nature. [19] However this problem has partly been address by the selective desulfurization of the cysteine residue to an alanine. [20]

See also

Related Research Articles

Peptidoglycan or murein is a polysaccharide consisting of amino acids that forms a mesh-like peptidoglycan layer outside the plasma membrane of most bacteria, forming the cell wall. The sugar component consists of alternating residues of β-(1,4) linked N-acetylglucosamine (NAG) and N-acetylmuramic acid (NAM). Attached to the N-acetylmuramic acid is a peptide chain of three to five amino acids. The peptide chain can be cross-linked to the peptide chain of another strand forming the 3D mesh-like layer. Peptidoglycan serves a structural role in the bacterial cell wall, giving structural strength, as well as counteracting the osmotic pressure of the cytoplasm. Peptidoglycan is also involved in binary fission during bacterial cell reproduction. L-form bacteria and mycoplasmas, both lacking peptidoglycan cell walls, do not proliferate by binary fission, but by a budding mechanism.

<span class="mw-page-title-main">Post-translational modification</span> Biological processes

Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosomes translating mRNA into polypeptide chains, which may then undergo PTM to form the mature protein product. PTMs are important components in cell signaling, as for example when prohormones are converted to hormones.

<span class="mw-page-title-main">Glycoprotein</span> Protein with oligosaccharide modifications

Glycoproteins are proteins which contain oligosaccharide chains (glycans) covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.

Glycosylation is the reaction in which a carbohydrate, i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule in order to form a glycoconjugate. In biology, glycosylation usually refers to an enzyme-catalysed reaction, whereas glycation may refer to a non-enzymatic reaction. Glycosylation is a form of co-translational and post-translational modification. Glycans serve a variety of structural and functional roles in membrane and secreted proteins. The majority of proteins synthesized in the rough endoplasmic reticulum undergo glycosylation. Glycosylation is also present in the cytoplasm and nucleus as the O-GlcNAc modification. Aglycosylation is a feature of engineered antibodies to bypass glycosylation. Five classes of glycans are produced:

Native chemical ligation is an important extension of the chemical ligation concept for constructing a larger polypeptide chain by the covalent condensation of two or more unprotected peptides segments. Native chemical ligation is the most effective method for synthesizing native or modified proteins of typical size.

<span class="mw-page-title-main">Teicoplanin</span>

Teicoplanin is an antibiotic used in the prophylaxis and treatment of serious infections caused by Gram-positive bacteria, including methicillin-resistant Staphylococcus aureus and Enterococcus faecalis. It is a semisynthetic glycopeptide antibiotic with a spectrum of activity similar to vancomycin. Its mechanism of action is to inhibit bacterial cell wall synthesis.

<span class="mw-page-title-main">Peptide synthesis</span> Production of peptides

In organic chemistry, peptide synthesis is the production of peptides, compounds where multiple amino acids are linked via amide bonds, also known as peptide bonds. Peptides are chemically synthesized by the condensation reaction of the carboxyl group of one amino acid to the amino group of another. Protecting group strategies are usually necessary to prevent undesirable side reactions with the various amino acid side chains. Chemical peptide synthesis most commonly starts at the carboxyl end of the peptide (C-terminus), and proceeds toward the amino-terminus (N-terminus). Protein biosynthesis in living organisms occurs in the opposite direction.

The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.

Bioconjugation is a chemical strategy to form a stable covalent link between two molecules, at least one of which is a biomolecule.

<span class="mw-page-title-main">Tunicamycin</span> Chemical compound

Tunicamycin is a mixture of homologous nucleoside antibiotics that inhibits the UDP-HexNAc: polyprenol-P HexNAc-1-P family of enzymes. In eukaryotes, this includes the enzyme GlcNAc phosphotransferase (GPT), which catalyzes the transfer of N-acetylglucosamine-1-phosphate from UDP-N-acetylglucosamine to dolichol phosphate in the first step of glycoprotein synthesis. Tunicamycin blocks N-linked glycosylation (N-glycans) and treatment of cultured human cells with tunicamycin causes cell cycle arrest in G1 phase. It is used as an experimental tool in biology, e.g. to induce unfolded protein response. Tunicamycin is produced by several bacteria, including Streptomyces clavuligerus and Streptomyces lysosuperificus.

<span class="mw-page-title-main">UDP-glucose 4-epimerase</span> Class of enzymes

The enzyme UDP-glucose 4-epimerase, also known as UDP-galactose 4-epimerase or GALE, is a homodimeric epimerase found in bacterial, fungal, plant, and mammalian cells. This enzyme performs the final step in the Leloir pathway of galactose metabolism, catalyzing the reversible conversion of UDP-galactose to UDP-glucose. GALE tightly binds nicotinamide adenine dinucleotide (NAD+), a co-factor required for catalytic activity.

<span class="mw-page-title-main">DPAGT1</span>

UDP-N-acetylglucosamine—dolichyl-phosphate N-acetylglucosaminephosphotransferase is an enzyme that in humans is encoded by the DPAGT1 gene.

<i>N</i>-linked glycosylation

N-linked glycosylation, is the attachment of an oligosaccharide, a carbohydrate consisting of several sugar molecules, sometimes also referred to as glycan, to a nitrogen atom, in a process called N-glycosylation, studied in biochemistry. This type of linkage is important for both the structure and function of many eukaryotic proteins. The N-linked glycosylation process occurs in eukaryotes and widely in archaea, but very rarely in bacteria. The nature of N-linked glycans attached to a glycoprotein is determined by the protein and the cell in which it is expressed. It also varies across species. Different species synthesize different types of N-linked glycan.

O-linked glycosylation is the attachment of a sugar molecule to the oxygen atom of serine (Ser) or threonine (Thr) residues in a protein. O-glycosylation is a post-translational modification that occurs after the protein has been synthesised. In eukaryotes, it occurs in the endoplasmic reticulum, Golgi apparatus and occasionally in the cytoplasm; in prokaryotes, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell metabolism and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including cancer, diabetes and Alzheimer's. O-glycosylation occurs in all domains of life, including eukaryotes, archaea and a number of pathogenic bacteria including Burkholderia cenocepacia, Neisseria gonorrhoeae and Acinetobacter baumannii.

Protein <i>O</i>-GlcNAc transferase Protein-coding gene in the species Homo sapiens

Protein O-GlcNAc transferase also known as OGT or O-linked N-acetylglucosaminyltransferase is an enzyme that in humans is encoded by the OGT gene. OGT catalyzes the addition of the O-GlcNAc post-translational modification to proteins.

Mannosyl-oligosaccharide glucosidase (MOGS) (EC 3.2.1.106, processing alpha-glucosidase I,Glc3Man9NAc2 oligosaccharide glucosidase, trimming glucosidase I, GCS1) is an enzyme with systematic name mannosyl-oligosaccharide glucohydrolase. MOGS is a transmembrane protein found in the membrane of the endoplasmic reticulum of eukaryotic cells. Biologically, it functions within the N-glycosylation pathway.

<i>O</i>-GlcNAc

O-GlcNAc is a reversible enzymatic post-translational modification that is found on serine and threonine residues of nucleocytoplasmic proteins. The modification is characterized by a β-glycosidic bond between the hydroxyl group of serine or threonine side chains and N-acetylglucosamine (GlcNAc). O-GlcNAc differs from other forms of protein glycosylation: (i) O-GlcNAc is not elongated or modified to form more complex glycan structures, (ii) O-GlcNAc is almost exclusively found on nuclear and cytoplasmic proteins rather than membrane proteins and secretory proteins, and (iii) O-GlcNAc is a highly dynamic modification that turns over more rapidly than the proteins which it modifies. O-GlcNAc is conserved across metazoans.

The aldehyde tag is a short peptide tag that can be further modified to add fluorophores, glycans, PEG chains or reactive groups for further synthesis. A short, genetically-encoded peptide, with a consensus sequence LCxPxR, is introduced into fusion proteins, and by subsequent treatment with the formylglycine-generating enzyme (FGE), the cysteine of the tag is converted to a reactive aldehyde group. This electrophilic group can be targeted by an array of aldehyde-specific reagents, such as aminooxy- or hydrazide-functionalized compounds.

<span class="mw-page-title-main">PNGase F</span>

Peptide:N-glycosidase F, commonly referred to as PNGase F, is an amidase of the peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase class. PNGase F works by cleaving between the innermost GlcNAc and asparagine residues of high mannose, hybrid, and complex oligosaccharides from N-linked glycoproteins and glycopeptides. This results in a deaminated protein or peptide and a free glycan.

<span class="mw-page-title-main">Chloroeremomycin</span> Chemical compound

Chloroeremomycin is a member of the glycopeptide family of antibiotics, such as vancomycin. The molecule is a non-ribosomal polypeptide that has been glycosylated. It is composed of seven amino acids and three saccharide units. Although chloroeremomycin has never been in clinical phases, oritavancin, a semi-synthetic derivative of chloroeremomycin, has been investigated.

References

  1. Talbot P.; Shur B. D.; Myles D. G. (2003). "Cell adhesion and fertilization: Steps in oocyte transport, sperm-zona pellucida interactions, and sperm-egg fusion". Biology of Reproduction. 68 (1): 1–9. doi:10.1095/biolreprod.102.007856. PMID   12493688. S2CID   10166894.
  2. Rudd P. M.; Elliott T.; Cresswell P.; Wilson I. A.; Dwek R. A. (2001). "Glycosylation and the immune system". Science. 291 (5512): 2370–2376. Bibcode:2001Sci...291.2370R. doi:10.1126/science.291.5512.2370. PMID   11269318.
  3. 1 2 3 Varki A (1993). "Biological Roles of Oligosaccharides - All of the Theories Are Correct". Glycobiology. 3 (2): 97–130. doi:10.1093/glycob/3.2.97. PMC   7108619 . PMID   8490246.
  4. Bertozzi C. R.; Kiessling L. L. (2001). "Chemical glycobiology". Science. 291 (5512): 2357–2364. Bibcode:2001Sci...291.2357B. doi:10.1126/science.1059820. PMID   11269316. S2CID   9585674.
  5. 1 2 Maverakis E, Kim K, Shimoda M, Gershwin M, Patel F, Wilken R, Raychaudhuri S, Ruhaak LR, Lebrilla CB (2015). "Glycans in the immune system and The Altered Glycan Theory of Autoimmunity". J Autoimmun. 57 (6): 1–13. doi:10.1016/j.jaut.2014.12.002. PMC   4340844 . PMID   25578468.
  6. 1 2 Vliegenthart J. F. G.; Casset F. (1998). "Novel forms of protein glycosylation". Current Opinion in Structural Biology. 8 (5): 565–571. doi:10.1016/s0959-440x(98)80145-0. hdl: 1874/5477 . PMID   9818259. S2CID   9360182.
  7. Shibata S.; Takeda T.; Natori Y. (1988). "The Structure of Nephritogenoside - a Nephritogenic Glycopeptide with Alpha-N-Glycosidic Linkage". Journal of Biological Chemistry. 263 (25): 12483–12485. doi: 10.1016/S0021-9258(18)37780-9 . PMID   3410849.
  8. Wieland F.; Heitzer R.; Schaefer W. (1983). "Asparaginylglucose - Novel Type of Carbohydrate Linkage". Proceedings of the National Academy of Sciences of the United States of America. 80 (18): 5470–5474. Bibcode:1983PNAS...80.5470W. doi: 10.1073/pnas.80.18.5470 . PMC   384279 . PMID   16593364.
  9. Debeer T.; Vliegenthart J. F. G.; Loffler A.; Hofsteenge J. (1995). "The Hexopyranosyl Residue That Is C-Glycosidically Linked to the Side-Chain of Tryptophan-7 in Human Rnase U-S Is Alpha-Marmopyranose". Biochemistry. 34 (37): 11785–11789. doi:10.1021/bi00037a016. hdl: 1874/5760 . PMID   7547911. S2CID   22324479.
  10. Ihara, Yoshito; Inai, Yoko; Ikezaki, Midori; Matsui, In-Sook L.; Manabe, Shino; Ito, Yukishige (2014). "C-Mannosylation: A Modification on Tryptophan in Cellular Proteins". Glycoscience: Biology and Medicine: 1–8. doi:10.1007/978-4-431-54836-2_67-1. ISBN   978-4-431-54836-2. S2CID   82050024.
  11. Jansson A. M.; Meldal M.; Bock K. (1990). "The Active Ester N-Fmoc-3-O-[Ac4-Alpha-D-Manp-(1-]2)-Ac3-Alpha-D-Manp-1-]-Threonine-O-Pfp as a Building Block in Solid-Phase Synthesis of an O-Linked Dimannosyl Glycopeptide". Tetrahedron Letters. 31 (48): 6991–6994. doi:10.1016/s0040-4039(00)97224-1.
  12. Elofsson M.; Walse B.; Kihlberg J. (1991). "Building-Blocks for Glycopeptide Synthesis – Glycosylation of 3-Mercaptopropionic Acid and Fmoc Amino-Acids with Unprotected Carboxyl Groups". Tetrahedron Letters. 32 (51): 7613–7616. doi:10.1016/0040-4039(91)80548-k.
  13. Li H. G.; Li B.; Song H. J.; Breydo L.; Baskakov I. V.; Wang L. X. (2005). "Chemoenzymatic synthesis of HIV-1V3 glycopeptides carrying two N-glycans and effects of glycosylation on the peptide domain". Journal of Organic Chemistry. 70 (24): 9990–9996. doi:10.1021/jo051729z. PMID   16292832.
  14. Yamamoto N.; Takayanagi Y.; Yoshino A.; Sakakibara T.; Kajihara Y. (2007). "An approach for a synthesis of asparagine-linked sialylglycopeptides having intact and homogeneous complex-type undecadisialyloligosaccharides". Chemistry: A European Journal. 13 (2): 613–625. doi:10.1002/chem.200600179. PMID   16977655.
  15. Shao N.; Xue J.; Guo Z. W. (2003). "Chemical synthesis of CD52 glycopeptides containing the acid-labile fucosyl linkage". Journal of Organic Chemistry. 68 (23): 9003–9011. doi:10.1021/jo034773s. PMID   14604374.
  16. Gamblin D. P.; Scanlan E. M.; Davis B. G. (2009). "Glycoprotein Synthesis: An Update". Chemical Reviews. 109 (1): 131–163. doi:10.1021/cr078291i. PMID   19093879.
  17. Mehta, AY; Veeraiah, RKH; Dutta, S; Goth, CK; Hanes, MS; Gao, C; Stavenhagen, K; Kardish, R; Matsumoto, Y; Heimburg-Molinaro, J; Boyce, M; Pohl, NLB; Cummings, RD (29 June 2020). "Parallel Glyco-SPOT Synthesis of Glycopeptide Libraries". Cell Chemical Biology. 27 (9): 1207–1219.e9. doi:10.1016/j.chembiol.2020.06.007. PMC   7556346 . PMID   32610041.
  18. Hilpert, K; Winkler, DF; Hancock, RE (2007). "Peptide arrays on cellulose support: SPOT synthesis, a time and cost efficient method for synthesis of large numbers of peptides in a parallel and addressable fashion". Nature Protocols. 2 (6): 1333–49. doi:10.1038/nprot.2007.160. PMID   17545971. S2CID   32143600.
  19. 1 2 Nilsson B. L.; Soellner M. B.; Raines R. T. (2005). "Chemical synthesis of proteins". Annual Review of Biophysics and Biomolecular Structure. 34: 91–118. doi:10.1146/annurev.biophys.34.040204.144700. PMC   2845543 . PMID   15869385.
  20. Wan Q.; Danishefsky S. J. (2007). "Free Radical Based, Specific Desulfurization of Cysteine: A Powerful Advance in the Synthesis of Polypeptides and Glycopolypeptides". Angew. Chem. 119 (48): 9408–9412. Bibcode:2007AngCh.119.9408W. doi:10.1002/ange.200704195. PMID   18046687.

Further reading