Proteoglycan

Last updated
Aggrecan, the major proteoglycan in cartilage, has 2316 amino acids PBB Protein ACAN image.jpg
Aggrecan, the major proteoglycan in cartilage, has 2316 amino acids

Proteoglycans are proteins [1] that are heavily glycosylated. The basic proteoglycan unit consists of a "core protein" with one or more covalently attached glycosaminoglycan (GAG) chain(s). [2] The point of attachment is a serine (Ser) residue to which the glycosaminoglycan is joined through a tetrasaccharide bridge (e.g. chondroitin sulfate-GlcA-Gal-Gal-Xyl-PROTEIN). The Ser residue is generally in the sequence -Ser-Gly-X-Gly- (where X can be any amino acid residue but proline), although not every protein with this sequence has an attached glycosaminoglycan. The chains are long, linear carbohydrate polymers that are negatively charged under physiological conditions due to the occurrence of sulfate and uronic acid groups. Proteoglycans occur in connective tissue.

Contents

Types

Proteoglycans are categorized by their relative size (large and small) and the nature of their glycosaminoglycan chains. [3] Types include:

TypeGlycosaminoglycans (GAGs)Small proteoglycansLarge proteoglycans
chondroitin sulfate/dermatan sulfate decorin, 36 kDa
biglycan, 38 kDa
aggrecan, 220 kDa, the major proteoglycan in cartilage
Heparan sulfate proteoglycan
(HSPGs)
heparan sulfate/chondroitin sulfate testican, 44 kDa perlecan, 400–470 kDa
betaglycan, >300 kDa
agrin, >500 kDa
Chondroitin sulfate proteoglycan
(CSPGs)
chondroitin sulfate bikunin, 25 kDa

neurocan, 136 kDa
versican, 260–370 kDa, present in many adult tissues including blood vessels and skin
brevican, 145kDa

Keratan sulfate proteoglycan keratan sulfate fibromodulin, 42 kDa
lumican, 38 kDa

Certain members are considered members of the "small leucine-rich proteoglycan family" (SLRP). [4] These include decorin, biglycan, fibromodulin and lumican.

Function

Proteoglycans are a major component of the animal extracellular matrix, the "filler" substance existing between cells in an organism. Here they form large complexes, both to other proteoglycans, to hyaluronan, and to fibrous matrix proteins, such as collagen. The combination of proteoglycans and collagen form cartilage, a sturdy tissue that is usually heavily hydrated (mostly due to the negatively charged sulfates in the glycosaminoglycan chains of the proteoglycans). [5] They are also involved in binding cations (such as sodium, potassium and calcium) and water, and also regulating the movement of molecules through the matrix. Evidence also shows they can affect the activity and stability of proteins and signalling molecules within the matrix. [6] [7] Individual functions of proteoglycans can be attributed to either the protein core or the attached GAG chain. They can also serve as lubricants, by creating a hydrating gel that helps withstand high pressure.

Synthesis

The protein component of proteoglycans is synthesized by ribosomes and translocated into the lumen of the rough endoplasmic reticulum. Glycosylation of the proteoglycan occurs in the Golgi apparatus in multiple enzymatic steps. First, a special link tetrasaccharide is attached to a serine side chain on the core protein to serve as a primer for polysaccharide growth. Then sugars are added one at a time by glycosyl transferase. The completed proteoglycan is then exported in secretory vesicles to the extracellular matrix of the tissue.

Clinical significance

An inability to break down the proteoglycans is characteristic of a group of genetic disorders, called mucopolysaccharidoses. The inactivity of specific lysosomal enzymes that normally degrade glycosaminoglycans leads to the accumulation of proteoglycans within cells. This leads to a variety of disease symptoms, depending upon the type of proteoglycan that is not degraded. Mutations in the gene encoding the galactosyltransferase B4GALT7 result in a reduced substitution of the proteoglycans decorin and biglycan with glycosaminoglycan chains, and cause a spondylodysplastic form of Ehlers–Danlos syndrome. [8]

Distinction between proteoglycans and glycoproteins

Quoting from recommendations for IUPAC: [9]

A glycoprotein is a compound containing carbohydrate (or glycan) covalently linked to protein. The carbohydrate may be in the form of a monosaccharide, disaccharide(s), oligosaccharide(s), polysaccharide(s), or their derivatives (e.g. sulfo- or phospho-substituted). One, a few, or many carbohydrate units may be present. Proteoglycans are a subclass of glycoproteins in which the carbohydrate units are polysaccharides that contain amino sugars. Such polysaccharides are also known as glycosaminoglycans.

Related Research Articles

<span class="mw-page-title-main">Glycoprotein</span> Protein with oligosaccharide modifications

Glycoproteins are proteins which contain oligosaccharide chains covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.

<span class="mw-page-title-main">Extracellular matrix</span> Network of proteins and molecules outside cells that provides structural support for cells

In biology, the extracellular matrix (ECM), is a network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide structural and biochemical support to surrounding cells. Because multicellularity evolved independently in different multicellular lineages, the composition of ECM varies between multicellular structures; however, cell adhesion, cell-to-cell communication and differentiation are common functions of the ECM.

Glycosylation is the reaction in which a carbohydrate, i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule in order to form a glycoconjugate. In biology, glycosylation usually refers to an enzyme-catalysed reaction, whereas glycation may refer to a non-enzymatic reaction.

An oligosaccharide is a saccharide polymer containing a small number of monosaccharides. Oligosaccharides can have many functions including cell recognition and cell adhesion.

In biology, matrix is the material in between an eukaryotic organism's cells.

<span class="mw-page-title-main">Glycosaminoglycan</span> Polysaccharides found in animal tissue

Glycosaminoglycans (GAGs) or mucopolysaccharides are long, linear polysaccharides consisting of repeating disaccharide units. The repeating two-sugar unit consists of a uronic sugar and an amino sugar, except in the case of the sulfated glycosaminoglycan keratan, where, in place of the uronic sugar there is a galactose unit. GAGs are found in vertebrates, invertebrates and bacteria. Because GAGs are highly polar molecules and attract water; the body uses them as lubricants or shock absorbers.

The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.

<span class="mw-page-title-main">Chondroblast</span> Mesenchymal progenitor cell that forms a chondrocyte

Chondroblasts, or perichondrial cells, is the name given to mesenchymal progenitor cells in situ which, from endochondral ossification, will form chondrocytes in the growing cartilage matrix. Another name for them is subchondral cortico-spongious progenitors. They have euchromatic nuclei and stain by basic dyes.

Ground substance is an amorphous gel-like substance in the extracellular space of animals that contains all components of the extracellular matrix (ECM) except for fibrous materials such as collagen and elastin. Ground substance is active in the development, movement, and proliferation of tissues, as well as their metabolism. Additionally, cells use it for support, water storage, binding, and a medium for intercellular exchange. Ground substance provides lubrication for collagen fibers.

<span class="mw-page-title-main">Versican</span> Protein-coding gene in the species Homo sapiens

Versican is a large extracellular matrix proteoglycan that is present in a variety of human tissues. It is encoded by the VCAN gene.

<span class="mw-page-title-main">Heparan sulfate</span> Macromolecule

Heparan sulfate (HS) is a linear polysaccharide found in all animal tissues. It occurs as a proteoglycan in which two or three HS chains are attached in close proximity to cell surface or extracellular matrix proteins. In this form, HS binds to a variety of protein ligands, including Wnt, and regulates a wide range of biological activities, including developmental processes, angiogenesis, blood coagulation, abolishing detachment activity by GrB, and tumour metastasis. HS has also been shown to serve as cellular receptor for a number of viruses, including the respiratory syncytial virus. One study suggests that cellular heparan sulfate has a role in SARS-CoV-2 Infection, particularly when the virus attaches with ACE2.

<span class="mw-page-title-main">Decorin</span> Protein-coding gene in humans

Decorin is a protein that in humans is encoded by the DCN gene.

<span class="mw-page-title-main">Biglycan</span> Protein-coding gene in humans

Biglycan is a small leucine-rich repeat proteoglycan (SLRP) which is found in a variety of extracellular matrix tissues, including bone, cartilage and tendon. In humans, biglycan is encoded by the BGN gene which is located on the X chromosome.

<span class="mw-page-title-main">Syndecan 1</span> Protein which in humans is encoded by the SDC1 gene

Syndecan 1 is a protein which in humans is encoded by the SDC1 gene. The protein is a transmembrane heparan sulfate proteoglycan and is a member of the syndecan proteoglycan family. The syndecan-1 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Syndecan-1 is a sponge for growth factors and chemokines, with binding largely via heparan sulfate chains. The syndecans mediate cell binding, cell signaling, and cytoskeletal organization and syndecan receptors are required for internalization of the HIV-1 tat protein.

<span class="mw-page-title-main">Syndecan</span>

Syndecans are single transmembrane domain proteins that are thought to act as coreceptors, especially for G protein-coupled receptors. More specifically, these core proteins carry three to five heparan sulfate and chondroitin sulfate chains, i.e. they are proteoglycans, which allow for interaction with a large variety of ligands including fibroblast growth factors, vascular endothelial growth factor, transforming growth factor-beta, fibronectin and antithrombin-1. Interactions between fibronectin and some syndecans can be modulated by the extracellular matrix protein tenascin C.

<span class="mw-page-title-main">Lumican</span>

Lumican, also known as LUM, is an extracellular matrix protein that, in humans, is encoded by the LUM gene on chromosome 12.

<span class="mw-page-title-main">B4GALT7</span> Protein-coding gene in the species Homo sapiens

Beta-1,4-galactosyltransferase 7 also known as galactosyltransferase I is an enzyme that in humans is encoded by the B4GALT7 gene. Galactosyltransferase I catalyzes the synthesis of the glycosaminoglycan-protein linkage in proteoglycans. Proteoglycans in turn are structural components of the extracellular matrix that is found between cells in connective tissues.

<span class="mw-page-title-main">Carbohydrate sulfotransferase</span> Class of enzymes which transfer an –SO3 group to glycoproteins and lipids

In biochemistry, carbohydrate sulfotransferases are enzymes within the class of sulfotransferases which catalyze the transfer of the sulfate functional group to carbohydrate groups in glycoproteins and glycolipids. Carbohydrates are used by cells for a wide range of functions from structural purposes to extracellular communication. Carbohydrates are suitable for such a wide variety of functions due to the diversity in structure generated from monosaccharide composition, glycosidic linkage positions, chain branching, and covalent modification. Possible covalent modifications include acetylation, methylation, phosphorylation, and sulfation. Sulfation, performed by carbohydrate sulfotransferases, generates carbohydrate sulfate esters. These sulfate esters are only located extracellularly, whether through excretion into the extracellular matrix (ECM) or by presentation on the cell surface. As extracellular compounds, sulfated carbohydrates are mediators of intercellular communication, cellular adhesion, and ECM maintenance.

O-linked glycosylation is the attachment of a sugar molecule to the oxygen atom of serine (Ser) or threonine (Thr) residues in a protein. O-glycosylation is a post-translational modification that occurs after the protein has been synthesised. In eukaryotes, it occurs in the endoplasmic reticulum, Golgi apparatus and occasionally in the cytoplasm; in prokaryotes, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell metabolism and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including cancer, diabetes and Alzheimer's. O-glycosylation occurs in all domains of life, including eukaryotes, archaea and a number of pathogenic bacteria including Burkholderia cenocepacia, Neisseria gonorrhoeae and Acinetobacter baumannii.

Lecticans, also known as hyalectans, are a family of proteoglycans that are components of the extracellular matrix. There are four members of the lectican family: aggrecan, brevican, neurocan, and versican. Lecticans interact with hyaluronic acid and tenascin-R to form a ternary complex.

References

  1. Proteoglycans at the U.S. National Library of Medicine Medical Subject Headings (MeSH)
  2. Gerhard Meisenberg; William H. Simmons (2006). Principles of medical biochemistry. Elsevier Health Sciences. pp. 243–. ISBN   978-0-323-02942-1 . Retrieved 6 February 2011.
  3. Iozzo, RV; Schaefer, L (March 2015). "Proteoglycan form and function: A comprehensive nomenclature of proteoglycans". Matrix Biology. 42: 11–55. doi:10.1016/j.matbio.2015.02.003. PMC   4859157 . PMID   25701227.
  4. Hans-Joachim Gabius; Sigrun Gabius (February 2002). Glycosciences: Status and Perspectives. John Wiley and Sons. pp. 209–. ISBN   978-3-527-30888-0 . Retrieved 6 February 2011.
  5. Voet, Donald; Voet, Judith; Pratt, Charlotte (2016). Fundamentals of Biochemistry: Life at the Molecular Level. Hoboken, New Jersey: John Wiley & Sons. p. 235. ISBN   978-1-118-91840-1.
  6. Ibrahim, Sherif (2017). "Syndecan-1 is a novel molecular marker for triple negative inflammatory breast cancer and modulates the cancer stem cell phenotype via the IL-6/STAT3, Notch and EGFR signaling pathways". Molecular Cancer. 16 (1): 57. doi: 10.1186/s12943-017-0621-z . PMC   5341174 . PMID   28270211.
  7. Ibrahim, Sherif (2013). "Syndecan-1 (CD138) modulates triple-negative breast cancer stem cell properties via regulation of LRP-6 and IL-6-mediated STAT3 signaling". PLOS ONE. 8 (12): e85737. Bibcode:2013PLoSO...885737I. doi: 10.1371/journal.pone.0085737 . PMC   3877388 . PMID   24392029.
  8. Seidler, Daniela (2006). "Defective glycosylation of decorin and biglycan, altered collagen structure, and abnormal phenotype of the skin fibroblasts of an Ehlers-Danlos syndrome patient carrying the novel Arg270Cys substitution in galactosyltransferase I (beta4GalT-7)". Journal of Molecular Medicine. 84 (7): 583–94. doi:10.1007/s00109-006-0046-4. PMID   16583246. S2CID   10165577.
  9. "Nomenclature of glycoproteins, glycopeptides and peptidoglycans, Recommendations 1985". www.qmul.ac.uk. Retrieved 16 March 2021.