Proteoglycans are proteins [1] that are heavily glycosylated. The basic proteoglycan unit consists of a "core protein" with one or more covalently attached glycosaminoglycan (GAG) chain(s). [2] The point of attachment is a serine (Ser) residue to which the glycosaminoglycan is joined through a tetrasaccharide bridge (e.g. chondroitin sulfate-GlcA-Gal-Gal-Xyl-PROTEIN). The Ser residue is generally in the sequence -Ser-Gly-X-Gly- (where X can be any amino acid residue but proline), although not every protein with this sequence has an attached glycosaminoglycan. The chains are long, linear carbohydrate polymers that are negatively charged under physiological conditions due to the occurrence of sulfate and uronic acid groups. Proteoglycans occur in connective tissue.
Proteoglycans are categorized by their relative size (large and small) and the nature of their glycosaminoglycan chains. [3] Types include:
Type | Glycosaminoglycans (GAGs) | Small proteoglycans | Large proteoglycans |
---|---|---|---|
chondroitin sulfate/dermatan sulfate | decorin, 36 kDa biglycan, 38 kDa | aggrecan, 220 kDa, the major proteoglycan in cartilage | |
Heparan sulfate proteoglycan (HSPGs) | heparan sulfate/chondroitin sulfate | testican, 44 kDa | perlecan, 400–470 kDa betaglycan, >300 kDa agrin, >500 kDa |
Chondroitin sulfate proteoglycan (CSPGs) | chondroitin sulfate | bikunin, 25 kDa | neurocan, 136 kDa |
Keratan sulfate proteoglycan | keratan sulfate | fibromodulin, 42 kDa lumican, 38 kDa |
Certain members are considered members of the "small leucine-rich proteoglycan family" (SLRP). [4] These include decorin, biglycan, fibromodulin and lumican.
Proteoglycans are a major component of the animal extracellular matrix, the "filler" substance existing between cells in an organism. Here they form large complexes, both to other proteoglycans, to hyaluronan, and to fibrous matrix proteins, such as collagen. The combination of proteoglycans and collagen form cartilage, a sturdy tissue that is usually heavily hydrated (mostly due to the negatively charged sulfates in the glycosaminoglycan chains of the proteoglycans). [5] They are also involved in binding cations (such as sodium, potassium and calcium) and water, and also regulating the movement of molecules through the matrix. Evidence also shows they can affect the activity and stability of proteins and signalling molecules within the matrix. [6] [7] Individual functions of proteoglycans can be attributed to either the protein core or the attached GAG chain. They can also serve as lubricants, by creating a hydrating gel that helps withstand high pressure.
The protein component of proteoglycans is synthesized by ribosomes and translocated into the lumen of the rough endoplasmic reticulum. Glycosylation of the proteoglycan occurs in the Golgi apparatus in multiple enzymatic steps. First, a special link tetrasaccharide is attached to a serine side chain on the core protein to serve as a primer for polysaccharide growth. Then sugars are added one at a time by glycosyl transferase. The completed proteoglycan is then exported in secretory vesicles to the extracellular matrix of the tissue.
An inability to break down the proteoglycans is characteristic of a group of genetic disorders, called mucopolysaccharidoses. The inactivity of specific lysosomal enzymes that normally degrade glycosaminoglycans leads to the accumulation of proteoglycans within cells. This leads to a variety of disease symptoms, depending upon the type of proteoglycan that is not degraded. Mutations in the gene encoding the galactosyltransferase B4GALT7 result in a reduced substitution of the proteoglycans decorin and biglycan with glycosaminoglycan chains, and cause a spondylodysplastic form of Ehlers–Danlos syndrome. [8]
Quoting from recommendations for IUPAC: [9]
A glycoprotein is a compound containing carbohydrate (or glycan) covalently linked to protein. The carbohydrate may be in the form of a monosaccharide, disaccharide(s), oligosaccharide(s), polysaccharide(s), or their derivatives (e.g. sulfo- or phospho-substituted). One, a few, or many carbohydrate units may be present. Proteoglycans are a subclass of glycoproteins in which the carbohydrate units are polysaccharides that contain amino sugars. Such polysaccharides are also known as glycosaminoglycans.
Glycoproteins are proteins which contain oligosaccharide chains covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.
In biology, the extracellular matrix (ECM), is a network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide structural and biochemical support to surrounding cells. Because multicellularity evolved independently in different multicellular lineages, the composition of ECM varies between multicellular structures; however, cell adhesion, cell-to-cell communication and differentiation are common functions of the ECM.
Glycosylation is the reaction in which a carbohydrate, i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule in order to form a glycoconjugate. In biology, glycosylation usually refers to an enzyme-catalysed reaction, whereas glycation may refer to a non-enzymatic reaction.
An oligosaccharide is a saccharide polymer containing a small number of monosaccharides. Oligosaccharides can have many functions including cell recognition and cell adhesion.
In biology, matrix is the material in between an eukaryotic organism's cells.
Glycosaminoglycans (GAGs) or mucopolysaccharides are long, linear polysaccharides consisting of repeating disaccharide units. The repeating two-sugar unit consists of a uronic sugar and an amino sugar, except in the case of the sulfated glycosaminoglycan keratan, where, in place of the uronic sugar there is a galactose unit. GAGs are found in vertebrates, invertebrates and bacteria. Because GAGs are highly polar molecules and attract water; the body uses them as lubricants or shock absorbers.
The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.
Chondroblasts, or perichondrial cells, is the name given to mesenchymal progenitor cells in situ which, from endochondral ossification, will form chondrocytes in the growing cartilage matrix. Another name for them is subchondral cortico-spongious progenitors. They have euchromatic nuclei and stain by basic dyes.
Ground substance is an amorphous gel-like substance in the extracellular space of animals that contains all components of the extracellular matrix (ECM) except for fibrous materials such as collagen and elastin. Ground substance is active in the development, movement, and proliferation of tissues, as well as their metabolism. Additionally, cells use it for support, water storage, binding, and a medium for intercellular exchange. Ground substance provides lubrication for collagen fibers.
Versican is a large extracellular matrix proteoglycan that is present in a variety of human tissues. It is encoded by the VCAN gene.
Heparan sulfate (HS) is a linear polysaccharide found in all animal tissues. It occurs as a proteoglycan in which two or three HS chains are attached in close proximity to cell surface or extracellular matrix proteins. In this form, HS binds to a variety of protein ligands, including Wnt, and regulates a wide range of biological activities, including developmental processes, angiogenesis, blood coagulation, abolishing detachment activity by GrB, and tumour metastasis. HS has also been shown to serve as cellular receptor for a number of viruses, including the respiratory syncytial virus. One study suggests that cellular heparan sulfate has a role in SARS-CoV-2 Infection, particularly when the virus attaches with ACE2.
Decorin is a protein that in humans is encoded by the DCN gene.
Biglycan is a small leucine-rich repeat proteoglycan (SLRP) which is found in a variety of extracellular matrix tissues, including bone, cartilage and tendon. In humans, biglycan is encoded by the BGN gene which is located on the X chromosome.
Syndecan 1 is a protein which in humans is encoded by the SDC1 gene. The protein is a transmembrane heparan sulfate proteoglycan and is a member of the syndecan proteoglycan family. The syndecan-1 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Syndecan-1 is a sponge for growth factors and chemokines, with binding largely via heparan sulfate chains. The syndecans mediate cell binding, cell signaling, and cytoskeletal organization and syndecan receptors are required for internalization of the HIV-1 tat protein.
Syndecans are single transmembrane domain proteins that are thought to act as coreceptors, especially for G protein-coupled receptors. More specifically, these core proteins carry three to five heparan sulfate and chondroitin sulfate chains, i.e. they are proteoglycans, which allow for interaction with a large variety of ligands including fibroblast growth factors, vascular endothelial growth factor, transforming growth factor-beta, fibronectin and antithrombin-1. Interactions between fibronectin and some syndecans can be modulated by the extracellular matrix protein tenascin C.
Lumican, also known as LUM, is an extracellular matrix protein that, in humans, is encoded by the LUM gene on chromosome 12.
Beta-1,4-galactosyltransferase 7 also known as galactosyltransferase I is an enzyme that in humans is encoded by the B4GALT7 gene. Galactosyltransferase I catalyzes the synthesis of the glycosaminoglycan-protein linkage in proteoglycans. Proteoglycans in turn are structural components of the extracellular matrix that is found between cells in connective tissues.
In biochemistry, carbohydrate sulfotransferases are enzymes within the class of sulfotransferases which catalyze the transfer of the sulfate functional group to carbohydrate groups in glycoproteins and glycolipids. Carbohydrates are used by cells for a wide range of functions from structural purposes to extracellular communication. Carbohydrates are suitable for such a wide variety of functions due to the diversity in structure generated from monosaccharide composition, glycosidic linkage positions, chain branching, and covalent modification. Possible covalent modifications include acetylation, methylation, phosphorylation, and sulfation. Sulfation, performed by carbohydrate sulfotransferases, generates carbohydrate sulfate esters. These sulfate esters are only located extracellularly, whether through excretion into the extracellular matrix (ECM) or by presentation on the cell surface. As extracellular compounds, sulfated carbohydrates are mediators of intercellular communication, cellular adhesion, and ECM maintenance.
O-linked glycosylation is the attachment of a sugar molecule to the oxygen atom of serine (Ser) or threonine (Thr) residues in a protein. O-glycosylation is a post-translational modification that occurs after the protein has been synthesised. In eukaryotes, it occurs in the endoplasmic reticulum, Golgi apparatus and occasionally in the cytoplasm; in prokaryotes, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell metabolism and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including cancer, diabetes and Alzheimer's. O-glycosylation occurs in all domains of life, including eukaryotes, archaea and a number of pathogenic bacteria including Burkholderia cenocepacia, Neisseria gonorrhoeae and Acinetobacter baumannii.
Lecticans, also known as hyalectans, are a family of proteoglycans that are components of the extracellular matrix. There are four members of the lectican family: aggrecan, brevican, neurocan, and versican. Lecticans interact with hyaluronic acid and tenascin-R to form a ternary complex.