N-linked glycosylation is the attachment of an oligosaccharide, a carbohydrate consisting of several sugar molecules, sometimes also referred to as glycan, to a nitrogen atom (the amide nitrogen of an asparagine (Asn) residue of a protein), in a process called N-glycosylation, studied in biochemistry. [1] The resulting protein is called an N-linked glycan, or simply an N-glycan.
This type of linkage is important for both the structure [2] and function [3] of many eukaryotic proteins. The N-linked glycosylation process occurs in eukaryotes and widely in archaea, but very rarely in bacteria. The nature of N-linked glycans attached to a glycoprotein is determined by the protein and the cell in which it is expressed. [4] It also varies across species. Different species synthesize different types of N-linked glycan.
There are two types of bonds involved in a glycoprotein: bonds between the saccharides residues in the glycan and the linkage between the glycan chain and the protein molecule.
The sugar moieties are linked to one another in the glycan chain via glycosidic bonds. These bonds are typically formed between carbons 1 and 4 of the sugar molecules. The formation of glycosidic bond is energetically unfavourable, therefore the reaction is coupled to the hydrolysis of two ATP molecules. [4]
On the other hand, the attachment of a glycan residue to a protein requires the recognition of a consensus sequence. N-linked glycans are almost always attached to the nitrogen atom of an asparagine (Asn) side chain that is present as a part of Asn–X–Ser/Thr consensus sequence, where X is any amino acid except proline (Pro). [4]
In animal cells, the glycan attached to the asparagine is almost inevitably N-acetylglucosamine (GlcNAc) in the β-configuration. [4] This β-linkage is similar to glycosidic bond between the sugar moieties in the glycan structure as described above. Instead of being attached to a sugar hydroxyl group, the anomeric carbon atom is attached to an amide nitrogen. The energy required for this linkage comes from the hydrolysis of a pyrophosphate molecule. [4]
The biosynthesis of N-linked glycans occurs via three major steps: [4]
Synthesis, en bloc transfer and initial trimming of precursor oligosaccharide occurs in the endoplasmic reticulum (ER). Subsequent processing and modification of the oligosaccharide chain are carried out in the Golgi apparatus.[ citation needed ]
The synthesis of glycoproteins is thus spatially separated in different cellular compartments. Therefore, the type of N-glycan synthesized, depends on its accessibility to the different enzymes present within these cellular compartments.[ citation needed ]
However, in spite of the diversity, all N-glycans are synthesized through a common pathway with a common core glycan structure. [4] The core glycan structure is essentially made up of two N-acetyl glucosamine and three mannose residues. This core glycan is then elaborated and modified further, resulting in a diverse range of N-glycan structures. [4]
The process of N-linked glycosylation starts with the formation of dolichol-linked GlcNAc sugar. Dolichol is a lipid molecule composed of repeating isoprene units. This molecule is found attached to the membrane of the ER. Sugar molecules are attached to the dolichol through a pyrophosphate linkage [4] (one phosphate was originally linked to dolichol, and the second phosphate came from the nucleotide sugar). The oligosaccharide chain is then extended through the addition of various sugar molecules in a stepwise manner to form a precursor oligosaccharide.[ citation needed ]
The assembly of this precursor oligosaccharide occurs in two phases: Phase I and II. [4] Phase I takes place on the cytoplasmic side of the ER and Phase II takes place on the luminal side of the ER.[ citation needed ]
The precursor molecule, ready to be transferred to a protein, consists of two GlcNAc, nine mannose, and three glucose molecules.
Phase I | |
---|---|
Steps | Location |
| Cytoplasmic side of ER |
At this point, the lipid-linked glycan is translocated across the membrane making it accessible to enzymes in the endoplasmic reticulum lumen. This translocation process is still poorly understood, but it is suggested to be performed by an enzyme known as flippase. | |
Phase II | |
| Luminal side of ER |
Once the precursor oligosaccharide is formed, the completed glycan is then transferred to the nascent polypeptide in the lumen of the ER membrane. This reaction is driven by the energy released from the cleavage of the pyrophosphate bond between the dolichol-glycan molecule. There are three conditions to fulfill before a glycan is transferred to a nascent polypeptide: [4]
Oligosaccharyltransferase is the enzyme responsible for the recognition of the consensus sequence and the transfer of the precursor glycan to a polypeptide acceptor which is being translated in the endoplasmic reticulum lumen. N-linked glycosylation is, therefore, a co-translational event.[ citation needed ]
N-glycan processing is carried out in endoplasmic reticulum and the Golgi body. Initial trimming of the precursor molecule occurs in the ER and the subsequent processing occurs in the Golgi.
Upon transferring the completed glycan onto the nascent polypeptide, two glucose residues are removed from the structure. Enzymes known as glycosidases remove some sugar residues. These enzymes can break glycosidic linkages by using a water molecule. These enzymes are exoglycosidases as they only work on monosaccharide residues located at the non-reducing end of the glycan. [4] This initial trimming step is thought to act as a quality control step in the ER to monitor protein folding.
Once the protein is folded correctly, two glucose residues are removed by glucosidase I and II. The removal of the final third glucose residue signals that the glycoprotein is ready for transit from the ER to the cis-Golgi. [4] ER mannosidase catalyses the removal of this final glucose. However, if the protein is not folded properly, the glucose residues are not removed and thus the glycoprotein can't leave the endoplasmic reticulum. A chaperone protein (calnexin/calreticulin) binds to the unfolded or partially folded protein to assist protein folding.
The next step involves further addition and removal of sugar residues in the cis-Golgi. These modifications are catalyzed by glycosyltransferases and glycosidases respectively. In the cis-Golgi, a series of mannosidases remove some or all of the four mannose residues in α-1,2 linkages. [4] Whereas in the medial portion of the Golgi, glycosyltransferases add sugar residues to the core glycan structure, giving rise to the three main types of glycans: high mannose, hybrid and complex glycans.
The order of addition of sugars to the growing glycan chains is determined by the substrate specificities of the enzymes and their access to the substrate as they move through secretory pathway. Thus, the organization of this machinery within a cell plays an important role in determining which glycans are made.[ citation needed ]
Golgi enzymes play a key role in determining the synthesis of the various types of glycans. The order of action of the enzymes is reflected in their position in the Golgi stack:
Enzymes | Location within Golgi |
---|---|
Mannosidase I | cis-Golgi |
GlcNAc transferases | medial Golgi |
Galactosyltransferase and Sialyltransferase | trans-Golgi |
Similar N-glycan biosynthesis pathway have been found in prokaryotes and Archaea. [6] However, compared to eukaryotes, the final glycan structure in eubacteria and archaea does not seem to differ much from the initial precursor made in the endoplasmic reticulum. In eukaryotes, the original precursor oligosaccharide is extensively modified en route to the cell surface. [4]
N-linked glycans have intrinsic and extrinsic functions. [4] [7]
Within the immune system, the N-linked glycans on an immune cell's surface will help dictate that migration pattern of the cell, e.g. immune cells that migrate to the skin have specific glycosylations that favor homing to that site. [8] The glycosylation patterns on the various immunoglobulins including IgE, IgM, IgD, IgA, and IgG bestow them with unique effector functions by altering their affinities for Fc and other immune receptors. [8] Glycans may also be involved in "self" and "non self" discrimination, which may be relevant to the pathophysiology of various autoimmune diseases. [8]
Intrinsic | |
Extrinsic |
|
In some cases, interaction between the N-glycan and the protein stabilizes the protein through complex electronic effects. [11]
Changes in N-linked glycosylation has been associated with different diseases including rheumatoid arthritis, [12] type 1 diabetes, [13] Crohn's disease, [14] and cancers. [15] [16]
Mutations in eighteen genes involved in N-linked glycosylation result in a variety of diseases, most of which involve the nervous system. [3] [16]
Many therapeutic proteins in the market are antibodies, which are N-linked glycoproteins. For example, Etanercept, Infliximab and Rituximab are N-glycosylated therapeutic proteins.
The importance of N-linked glycosylation is becoming increasingly evident in the field of pharmaceuticals. [17] Although bacterial or yeast protein production systems have significant potential advantages such as high yield and low cost, problems arise when the protein of interest is a glycoprotein. Most prokaryotic expression systems such as E. coli cannot carry out post-translational modifications. On the other hand, eukaryotic expression hosts such as yeast and animal cells, have different glycosylation patterns. The proteins produced in these expression hosts are often not identical to human protein and thus, cause immunogenic reactions in patients. For example, S.cerevisiae (yeast) often produce high-mannose glycans which are immunogenic.
Non-human mammalian expression systems such as CHO or NS0 cells have the machinery required to add complex, human-type glycans. However, glycans produced in these systems can differ from glycans produced in humans, as they can be capped with both N-glycolylneuraminic acid (Neu5Gc) and N-acetylneuraminic acid (Neu5Ac), whereas human cells only produce glycoproteins containing N-acetylneuraminic acid. Furthermore, animal cells can also produce glycoproteins containing the galactose-alpha-1,3-galactose epitope, which can induce serious allergenic reactions, including anaphylactic shock, in people who have Alpha-gal allergy.
These drawbacks have been addressed by several approaches such as eliminating the pathways that produce these glycan structures through genetic knockouts. Furthermore, other expression systems have been genetically engineered to produce therapeutic glycoproteins with human-like N-linked glycans. These include yeasts such as Pichia pastoris , [18] insect cell lines, green plants, [19] and even bacteria.
Glycoproteins are proteins which contain oligosaccharide (sugar) chains covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.
A congenital disorder of glycosylation is one of several rare inborn errors of metabolism in which glycosylation of a variety of tissue proteins and/or lipids is deficient or defective. Congenital disorders of glycosylation are sometimes known as CDG syndromes. They often cause serious, sometimes fatal, malfunction of several different organ systems in affected infants. The most common sub-type is PMM2-CDG where the genetic defect leads to the loss of phosphomannomutase 2 (PMM2), the enzyme responsible for the conversion of mannose-6-phosphate into mannose-1-phosphate.
A cisterna is a flattened membrane vesicle found in the endoplasmic reticulum and Golgi apparatus. Cisternae are an integral part of the packaging and modification processes of proteins occurring in the Golgi.
Mannose is a sugar with the formula HOCH2(CHOH)4CHO. It is one of the monomers of the aldohexose series of carbohydrates. It is a C-2 epimer of glucose. Mannose is important in human metabolism, especially in the glycosylation of certain proteins. Several congenital disorders of glycosylation are associated with mutations in enzymes involved in mannose metabolism.
Glycosylation is the reaction in which a carbohydrate, i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule in order to form a glycoconjugate. In biology, glycosylation usually refers to an enzyme-catalysed reaction, whereas glycation may refer to a non-enzymatic reaction.
An oligosaccharide is a saccharide polymer containing a small number of monosaccharides. Oligosaccharides can have many functions including cell recognition and cell adhesion.
Dolichol refers to any of a group of long-chain mostly unsaturated organic compounds that are made up of varying numbers of isoprene units terminating in an α-saturated isoprenoid group, containing an alcohol functional group.
The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.
Oligosaccharyltransferase or OST (EC 2.4.1.119) is a membrane protein complex that transfers a 14-sugar oligosaccharide from dolichol to nascent protein. It is a type of glycosyltransferase. The sugar Glc3Man9GlcNAc2 (where Glc=Glucose, Man=Mannose, and GlcNAc=N-acetylglucosamine) is attached to an asparagine (Asn) residue in the sequence Asn-X-Ser or Asn-X-Thr where X is any amino acid except proline. This sequence is called a glycosylation sequon. The reaction catalyzed by OST is the central step in the N-linked glycosylation pathway.
The enzyme mannosyl-glycoprotein endo-β-N-acetylglucosaminidase (endoglycosidase H) (EC 3.2.1.96) has systematic name glycopeptide-D-mannosyl-N4-(N-acetyl-D-glucosaminyl)2-asparagine 1,4-N-acetyl-β-glucosaminohydrolase. It is a highly specific endoglycosidase which cleaves asparagine-linked mannose rich oligosaccharides, but not highly processed complex oligosaccharides from glycoproteins. It is used for research purposes to deglycosylate glycoproteins and to monitor intracellular protein trafficking through the secretory pathway.
Ribophorins are dome shaped transmembrane glycoproteins which are located in the membrane of the rough endoplasmic reticulum, but are absent in the membrane of the smooth endoplasmic reticulum. There are two types of ribophorines: ribophorin I and II. These act in the protein complex oligosaccharyltransferase (OST) as two different subunits of the named complex. Ribophorin I and II are only present in eukaryote cells.
UDP-N-acetylglucosamine—dolichyl-phosphate N-acetylglucosaminephosphotransferase is an enzyme that in humans is encoded by the DPAGT1 gene.
Endoplasmic reticulum mannosyl-oligosaccharide 1,2-alpha-mannosidase is an enzyme that in humans is encoded by the MAN1B1 gene.
Mannosyl-oligosaccharide 1,2-alpha-mannosidase IA is an enzyme that in humans is encoded by the MAN1A1 gene.
O-linked glycosylation is the attachment of a sugar molecule to the oxygen atom of serine (Ser) or threonine (Thr) residues in a protein. O-glycosylation is a post-translational modification that occurs after the protein has been synthesised. In eukaryotes, it occurs in the endoplasmic reticulum, Golgi apparatus and occasionally in the cytoplasm; in prokaryotes, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell metabolism and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including cancer, diabetes and Alzheimer's. O-glycosylation occurs in all domains of life, including eukaryotes, archaea and a number of pathogenic bacteria including Burkholderia cenocepacia, Neisseria gonorrhoeae and Acinetobacter baumannii.
Mannosyl-oligosaccharide glucosidase (MOGS) (EC 3.2.1.106, processing α-glucosidase I,Glc3Man9NAc2 oligosaccharide glucosidase, trimming glucosidase I, GCS1) is an enzyme with systematic name mannosyl-oligosaccharide glucohydrolase. MOGS is a transmembrane protein found in the membrane of the endoplasmic reticulum of eukaryotic cells. Biologically, it functions within the N-glycosylation pathway.
Mannosyl-oligosaccharide 1,3-1,6-α-mannosidase, also known as Golgi α-mannosidase II, is an enzyme with systematic name (1→3)-(1→6)-mannosyl-oligosaccharide α-D-mannohydrolase. It catalyses the hydrolysis of the terminal (1→3)- and (1→6)-linked α-D-mannose residues in the mannosyl-oligosaccharide Man5(GlcNAc)3.
Peptide:N-glycosidase F, commonly referred to as PNGase F, is an amidase of the peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase class. PNGase F works by cleaving between the innermost GlcNAc and asparagine residues of high mannose, hybrid, and complex oligosaccharides from N-linked glycoproteins and glycopeptides. This results in a deaminated protein or peptide and a free glycan.
William Joseph Lennarz was a biochemist at Stony Brook University. He focused the majority of his research on biochemical processes in cells.
GlycoRNAs are small non-coding RNAs with sialylated glycans.