Glycopeptides are peptides that contain carbohydrate moieties (glycans) covalently attached to the side chains of the amino acid residues that constitute the peptide.
Over the past few decades it has been recognised that glycans on cell surface (attached to membrane proteins or lipids) and those bound to proteins (glycoproteins) play a critical role in biology. For example, these constructs have been shown to play important roles in fertilization, [1] the immune system, [2] brain development, [3] the endocrine system, [3] and inflammation. [3] [4] [5]
The synthesis of glycopeptides provides biological probes for researchers to elucidate glycan function in nature and products that have useful therapeutic and biotechnological applications.[ clarification needed ][ citation needed ]
N-Linked glycans derive their name from the fact that the glycan is attached to an asparagine (Asn, N) residue, and are amongst the most common linkages found in nature. Although the majority of N-linked glycans take the form GlcNAc-β-Asn [6] other less common structural linkages such as GlcNac-α-Asn [7] and Glc-Asn [8] have been observed. In addition to their function in protein folding and cellular attachment, the N-liked glycans of a protein can modulate the protein's function, in some cases acting as an on-off switch. [5]
O-Linked glycans are formed by a linkage between an amino acid hydroxyl side chain (usually from serine or threonine) with the glycan. The majority of O-linked glycans take the form GlcNac-β-Ser/Thr or GalNac-α-Ser/Thr. [6]
Of the three linkages the least common and least understood are C-linked glycans. The C-linkage refers to the covalent attachment of mannose to a tryptophan residue. An example of a C-linked glycan is α-mannosyl tryptophan. [9] [10]
Several methods have been reported in the literature for the synthesis of glycopeptides. Of these methods the most common strategies are listed below.
Within solid phase peptide synthesis (SPPS) there exist two strategies for the synthesis of glycopeptides, linear and convergent assembly. Linear assembly relies on the synthesis of building blocks and then the use of SPPS to attach the building block together. An outline of this approach is illustrated below.
Several methods exist for the synthesis of monosaccharide amino acid building block as illustrated below.
Provided the monosaccharide amino acid building block is stable to peptide coupling conditions, amine deprotection conditions and resin cleavage. Linear assembly remains a popular strategy for the synthesis of glycopeptides with many examples in the literature. [13] [14] [15]
In the convergent assembly strategy a peptide chain and glycan residue are first synthesis separately. Then the glycan is glycosylated onto a specific residue of the peptide chain. This approach is not as popular as the linear strategy due to the poor reaction yields in the glycosylation step. [16]
Another strategy to produce glycopeptide libraries is using Glyco-SPOT synthesis technique. [17] The technique extends the existing method of SPOT synthesis. [18] In this method, libraries of glycopeptides are produced on a cellulose surface (e.g. filter paper) which acts as the solid phase. The glycopeptides are produced by spotting FMOC protected amino acids allowing the synthesis to be performed at microgram (nanomole) scale using very small amounts of glycoamino acids. The scale of this technique can be an advantage for creating libraries for screening by using less amounts of glycoamino acids per peptide. However to produce larger quantities of glycopeptides traditional resin-based solid phase techniques would be better.
Native chemical ligation (NCL) is a convergent synthetic strategy based on the linear coupling of glycopeptide fragments. This technique makes use of the chemoselective reaction between a N-terminal cysteine residue on one peptide fragment with a thio-ester on the C-terminus of the other peptide fragment [19] as illustrated below.
Unlike standard SPPS (which is limited to 50 amino acid residue) NCL allows the construction of large glycopeptides. However the strategy is limited by the fact that it requires a cysteine residue at N-terminus, an amino acid residue that is rare in nature. [19] However this problem has partly been address by the selective desulfurization of the cysteine residue to an alanine. [20]
Peptidoglycan or murein is a polysaccharide consisting of amino acids that forms a mesh-like peptidoglycan layer outside the plasma membrane of most bacteria, forming the cell wall. The sugar component consists of alternating residues of β-(1,4) linked N-acetylglucosamine (NAG) and N-acetylmuramic acid (NAM). Attached to the N-acetylmuramic acid is a peptide chain of three to five amino acids. The peptide chain can be cross-linked to the peptide chain of another strand forming the 3D mesh-like layer. Peptidoglycan serves a structural role in the bacterial cell wall, giving structural strength, as well as counteracting the osmotic pressure of the cytoplasm. Peptidoglycan is also involved in binary fission during bacterial cell reproduction. L-form bacteria and mycoplasmas, both lacking peptidoglycan cell walls, do not proliferate by binary fission, but by a budding mechanism.
Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosomes translating mRNA into polypeptide chains, which may then undergo PTM to form the mature protein product. PTMs are important components in cell signaling, as for example when prohormones are converted to hormones.
Glycoproteins are proteins which contain oligosaccharide chains (glycans) covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.
Glycosylation is the reaction in which a carbohydrate, i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule in order to form a glycoconjugate. In biology, glycosylation usually refers to an enzyme-catalysed reaction, whereas glycation may refer to a non-enzymatic reaction. Glycosylation is a form of co-translational and post-translational modification. Glycans serve a variety of structural and functional roles in membrane and secreted proteins. The majority of proteins synthesized in the rough endoplasmic reticulum undergo glycosylation. Glycosylation is also present in the cytoplasm and nucleus as the O-GlcNAc modification. Aglycosylation is a feature of engineered antibodies to bypass glycosylation. Five classes of glycans are produced:
Native chemical ligation is an important extension of the chemical ligation concept for constructing a larger polypeptide chain by the covalent condensation of two or more unprotected peptides segments. Native chemical ligation is the most effective method for synthesizing native or modified proteins of typical size.
Teicoplanin is an antibiotic used in the prophylaxis and treatment of serious infections caused by Gram-positive bacteria, including methicillin-resistant Staphylococcus aureus and Enterococcus faecalis. It is a semisynthetic glycopeptide antibiotic with a spectrum of activity similar to vancomycin. Its mechanism of action is to inhibit bacterial cell wall synthesis.
In organic chemistry, peptide synthesis is the production of peptides, compounds where multiple amino acids are linked via amide bonds, also known as peptide bonds. Peptides are chemically synthesized by the condensation reaction of the carboxyl group of one amino acid to the amino group of another. Protecting group strategies are usually necessary to prevent undesirable side reactions with the various amino acid side chains. Chemical peptide synthesis most commonly starts at the carboxyl end of the peptide (C-terminus), and proceeds toward the amino-terminus (N-terminus). Protein biosynthesis in living organisms occurs in the opposite direction.
The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.
Bioconjugation is a chemical strategy to form a stable covalent link between two molecules, at least one of which is a biomolecule.
Tunicamycin is a mixture of homologous nucleoside antibiotics that inhibits the UDP-HexNAc: polyprenol-P HexNAc-1-P family of enzymes. In eukaryotes, this includes the enzyme GlcNAc phosphotransferase (GPT), which catalyzes the transfer of N-acetylglucosamine-1-phosphate from UDP-N-acetylglucosamine to dolichol phosphate in the first step of glycoprotein synthesis. Tunicamycin blocks N-linked glycosylation (N-glycans) and treatment of cultured human cells with tunicamycin causes cell cycle arrest in G1 phase. It is used as an experimental tool in biology, e.g. to induce unfolded protein response. Tunicamycin is produced by several bacteria, including Streptomyces clavuligerus and Streptomyces lysosuperificus.
The enzyme UDP-glucose 4-epimerase, also known as UDP-galactose 4-epimerase or GALE, is a homodimeric epimerase found in bacterial, fungal, plant, and mammalian cells. This enzyme performs the final step in the Leloir pathway of galactose metabolism, catalyzing the reversible conversion of UDP-galactose to UDP-glucose. GALE tightly binds nicotinamide adenine dinucleotide (NAD+), a co-factor required for catalytic activity.
UDP-N-acetylglucosamine—dolichyl-phosphate N-acetylglucosaminephosphotransferase is an enzyme that in humans is encoded by the DPAGT1 gene.
N-linked glycosylation, is the attachment of an oligosaccharide, a carbohydrate consisting of several sugar molecules, sometimes also referred to as glycan, to a nitrogen atom, in a process called N-glycosylation, studied in biochemistry. This type of linkage is important for both the structure and function of many eukaryotic proteins. The N-linked glycosylation process occurs in eukaryotes and widely in archaea, but very rarely in bacteria. The nature of N-linked glycans attached to a glycoprotein is determined by the protein and the cell in which it is expressed. It also varies across species. Different species synthesize different types of N-linked glycan.
O-linked glycosylation is the attachment of a sugar molecule to the oxygen atom of serine (Ser) or threonine (Thr) residues in a protein. O-glycosylation is a post-translational modification that occurs after the protein has been synthesised. In eukaryotes, it occurs in the endoplasmic reticulum, Golgi apparatus and occasionally in the cytoplasm; in prokaryotes, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell metabolism and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including cancer, diabetes and Alzheimer's. O-glycosylation occurs in all domains of life, including eukaryotes, archaea and a number of pathogenic bacteria including Burkholderia cenocepacia, Neisseria gonorrhoeae and Acinetobacter baumannii.
Protein O-GlcNAc transferase also known as OGT or O-linked N-acetylglucosaminyltransferase is an enzyme that in humans is encoded by the OGT gene. OGT catalyzes the addition of the O-GlcNAc post-translational modification to proteins.
Mannosyl-oligosaccharide glucosidase (MOGS) (EC 3.2.1.106, processing alpha-glucosidase I,Glc3Man9NAc2 oligosaccharide glucosidase, trimming glucosidase I, GCS1) is an enzyme with systematic name mannosyl-oligosaccharide glucohydrolase. MOGS is a transmembrane protein found in the membrane of the endoplasmic reticulum of eukaryotic cells. Biologically, it functions within the N-glycosylation pathway.
O-GlcNAc is a reversible enzymatic post-translational modification that is found on serine and threonine residues of nucleocytoplasmic proteins. The modification is characterized by a β-glycosidic bond between the hydroxyl group of serine or threonine side chains and N-acetylglucosamine (GlcNAc). O-GlcNAc differs from other forms of protein glycosylation: (i) O-GlcNAc is not elongated or modified to form more complex glycan structures, (ii) O-GlcNAc is almost exclusively found on nuclear and cytoplasmic proteins rather than membrane proteins and secretory proteins, and (iii) O-GlcNAc is a highly dynamic modification that turns over more rapidly than the proteins which it modifies. O-GlcNAc is conserved across metazoans.
The aldehyde tag is a short peptide tag that can be further modified to add fluorophores, glycans, PEG chains or reactive groups for further synthesis. A short, genetically-encoded peptide, with a consensus sequence LCxPxR, is introduced into fusion proteins, and by subsequent treatment with the formylglycine-generating enzyme (FGE), the cysteine of the tag is converted to a reactive aldehyde group. This electrophilic group can be targeted by an array of aldehyde-specific reagents, such as aminooxy- or hydrazide-functionalized compounds.
Peptide:N-glycosidase F, commonly referred to as PNGase F, is an amidase of the peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase class. PNGase F works by cleaving between the innermost GlcNAc and asparagine residues of high mannose, hybrid, and complex oligosaccharides from N-linked glycoproteins and glycopeptides. This results in a deaminated protein or peptide and a free glycan.
Chloroeremomycin is a member of the glycopeptide family of antibiotics, such as vancomycin. The molecule is a non-ribosomal polypeptide that has been glycosylated. It is composed of seven amino acids and three saccharide units. Although chloroeremomycin has never been in clinical phases, oritavancin, a semi-synthetic derivative of chloroeremomycin, has been investigated.