Versican is a large extracellular matrix proteoglycan that is present in a variety of human tissues. It is encoded by the VCAN gene. [5] [6]
Versican is a large chondroitin sulfate proteoglycan with an apparent molecular mass of more than 1000kDa. In 1989, Zimmermann and Ruoslahti cloned and sequenced the core protein of fibroblast chondroitin sulfate proteoglycan. [7] They designated it versican in recognition of its versatile modular structure.
Versican belongs to the lectican protein family, with aggrecan (abundant in cartilage), brevican and neurocan (nervous system proteoglycans) as other members. Versican is also known as chondroitin sulfate proteoglycan core protein 2 or chondroitin sulfate proteoglycan 2 (CSPG2), and PG-M.
These proteoglycans share a homologous globular N-terminal, C-terminal, and glycosaminoglycan (GAG) binding regions.
The N-terminal (G1) globular domain consists of Ig-like loop and two link modules, and has Hyaluronan (HA) binding properties.
Versican occurs in 5 isoforms : V0, V1, V2, V3, V4. [8] The central domain of versican V0 contains both the GAG-α and GAG-β domains. V1 isoforms has the GAG-β domain, V2 has the GAG-α domain, V3 is void of any GAG attachment domains and V4 has a portion of the GAG-β domain. The GAGs, being composed of repeating disaccharide units, contribute to the negative charge and many other properties of proteoglycans.
The C-terminal (G3) globular domain consists of one or two Epidermal growth factor (EGF) repeats, a C-type lectin domain and complement regulatory protein (CRP)-like domain. The C-terminal domain binds a variety of ligands in ECM which contribute significantly to the functions of lecticans.
The role of versican in cell adhesion, migration, and proliferation has been extensively studied. Versican is often considered an anti-adhesion molecule. Considering the large size (>1000 kDa) and hydration capability of versican, it is possible that the interaction of integrins (large family of cell adhesion molecules) with their cell surface receptors is sterically hindered.
Expression of versican is observed in various adult tissues such as blood vessels, skin, and developing heart. Smooth muscle cells of blood vessels, epithelial cells of skin, and the cells of central and peripheral nervous system are a few examples of cell types that express versican physiologically. Versican is involved in development, guiding embryonic cell migration important in the formation of the heart and outlining the path for neural crest cell migration.
The N-terminal of versican has an important role in maintaining the integrity of the ECM by interacting with hyaluronan. Its interactions with link protein has also been studied.
The central domain of Versican is decorated with glycosaminoglycans. The structural and functional diversity of Versican is increased by variations in GAG sulfation patterns and the type of GAG chains bound to the core protein. There is a single versican gene, however alternative splicing of its mRNA produces 4 distinct versican isoforms that differ in their potential number of GAG chains. All isoforms have homologous N-terminal (HA binding) and C-terminal (lectin-like) domains. The central domain of versican V0 contains both the GAG-α and GAG-β domains. V1 isoforms has the GAG-β domain, V2 has the GAG-α domain, and V3 is void of any GAG attachment domains, and only consists of the N-terminal and C-terminal globular domains. It is known that the isoforms are differentially expressed in different tissue types. The biological significance of alternative splicing is yet to be determined.
Because of their negatively charged sulfates or carboxyl groups, chondroitin sulfate chains are attracted to various positively charged molecules such as certain growth factors, cytokines, and chemokines. This interaction in the extracellular matrix or on the cell surface is important in the formation of immobilized gradients of these factors, their protection from proteolytic cleavage, and their presentation to specific cell-surface receptors. The binding of versican with leukocyte adhesion molecules L-selectin, P-selectin, and CD44 is also mediated by the interaction of CS chains of versican with the carbohydrate-binding domain of these molecules. Both CD44 and L-selectin have been implicated in leukocyte trafficking. The ability of versican to bind a large panel of chemokines and the biological consequences of such binding has also been examined. Versican can bind specific chemokines through its CS chains and this interaction down-regulates the chemokines function. Recently, in light of results that V1 and V2 isoforms of versican have opposite effects on cell proliferation, glycosaminoglycan domain GAG-β has been implicated in versican-enhanced cell proliferation and versican-induced reduction of cell apoptosis.
The C-terminal of Versican interacts with a variety of molecules in the matrix. One important family of ligands is the tenascin family. [9] For example, The C-lectin domain of versican interacts with tenascin R through its fibronectin type III (FnIII) repeat 3-5 domain in a calcium dependent manner, in vivo. Different tenascin domains interact with a wide range of cellular receptors, including integrins, cell adhesion molecules and members of the syndecan and glypican proteoglycan families. Versican’s C-terminal domain interacts with fibulin-2, a protein whose expression is associated with that of versican in the developing heart. The EGF domain of the C-terminal of versican also binds the EGF-receptor molecule in vivo.
Versican is a key factor in inflammation through interactions with adhesion molecules on the surfaces of inflammatory leukocytes and interactions with chemokines that are involved in recruiting inflammatory cells.
In the adult central nervous system, versican is found in perineuronal nets, where it may stabilize synaptic connections. Versican can also inhibit nervous system regeneration and axonal growth following an injury to the central nervous system.
Increased versican expression is often observed in tumor growth in tissues such as breast, brain, [10] ovary, gastrointestinal tract, prostate, and melanoma, sarcoma, and peritoneal mesothelioma. A fifth isoform of versican, V4, that is similar to V1 but with a shortened beta-GAG region, is present and upregulated in human breast cancer. [8]
Versican is required for Lewis lung carcinoma in mice to metastasize to lung, liver and adrenal glands, acting via TLR2 to activate myeloid cells and produce TNF-alpha. [11]
Versican is increased in the changing tissue extracellular matrix in inflammatory lung disorders such as chronic obstructive pulmonary disease (COPD), asthma and bronchiolitis obliterans syndrome (BOS). [12] Cells (myofibroblasts, macrophages and other inflammatory cells) can migrate more easily through extracellular matrix that has a higher versican content.
Deposits of versican are not present in normal skin but are found in the reticular dermis during keloid scarring, a condition where scar formation becomes uncontrolled and overgrowth of skin tissue occurs at the site of the wound. [13]
Versican has been shown to interact with hyaluronan and a link protein (hyaluronan and proteoglycan link protein 1; HAPLN1). [14]
Proteoglycans are proteins that are heavily glycosylated. The basic proteoglycan unit consists of a "core protein" with one or more covalently attached glycosaminoglycan (GAG) chain(s). The point of attachment is a serine (Ser) residue to which the glycosaminoglycan is joined through a tetrasaccharide bridge. The Ser residue is generally in the sequence -Ser-Gly-X-Gly-, although not every protein with this sequence has an attached glycosaminoglycan. The chains are long, linear carbohydrate polymers that are negatively charged under physiological conditions due to the occurrence of sulfate and uronic acid groups. Proteoglycans occur in connective tissue.
Glycosaminoglycans (GAGs) or mucopolysaccharides are long, linear polysaccharides consisting of repeating disaccharide units. The repeating two-sugar unit consists of a uronic sugar and an amino sugar, except in the case of the sulfated glycosaminoglycan keratan, where, in place of the uronic sugar there is a galactose unit. GAGs are found in vertebrates, invertebrates and bacteria. Because GAGs are highly polar molecules and attract water; the body uses them as lubricants or shock absorbers.
Ground substance is an amorphous gel-like substance in the extracellular space of animals that contains all components of the extracellular matrix (ECM) except for fibrous materials such as collagen and elastin. Ground substance is active in the development, movement, and proliferation of tissues, as well as their metabolism. Additionally, cells use it for support, water storage, binding, and a medium for intercellular exchange. Ground substance provides lubrication for collagen fibers.
Keratan sulfate (KS), also called keratosulfate, is any of several sulfated glycosaminoglycans that have been found especially in the cornea, cartilage, and bone. It is also synthesized in the central nervous system where it participates both in development and in the glial scar formation following an injury. Keratan sulfates are large, highly hydrated molecules which in joints can act as a cushion to absorb mechanical shock.
Perlecan (PLC) also known as basement membrane-specific heparan sulfate proteoglycan core protein (HSPG) or heparan sulfate proteoglycan 2 (HSPG2), is a protein that in humans is encoded by the HSPG2 gene. The HSPG2 gene codes for a 4,391 amino acid protein with a molecular weight of 468,829. It is one of the largest known proteins. The name perlecan comes from its appearance as a "string of pearls" in rotary shadowed images.
Heparan sulfate (HS) is a linear polysaccharide found in all animal tissues. It occurs as a proteoglycan in which two or three HS chains are attached in close proximity to cell surface or extracellular matrix proteins. In this form, HS binds to a variety of protein ligands, including Wnt, and regulates a wide range of biological activities, including developmental processes, angiogenesis, blood coagulation, abolishing detachment activity by GrB, and tumour metastasis. HS has also been shown to serve as cellular receptor for a number of viruses, including the respiratory syncytial virus. One study suggests that cellular heparan sulfate has a role in SARS-CoV-2 Infection, particularly when the virus attaches with ACE2.
Biglycan is a small leucine-rich repeat proteoglycan (SLRP) which is found in a variety of extracellular matrix tissues, including bone, cartilage and tendon. In humans, biglycan is encoded by the BGN gene which is located on the X chromosome.
Aggrecan (ACAN), also known as cartilage-specific proteoglycan core protein (CSPCP) or chondroitin sulfate proteoglycan 1, is a protein that in humans is encoded by the ACAN gene. This gene is a member of the lectican (chondroitin sulfate proteoglycan) family. The encoded protein is an integral part of the extracellular matrix in cartilagenous tissue and it withstands compression in cartilage.
Syndecan 1 is a protein which in humans is encoded by the SDC1 gene. The protein is a transmembrane heparan sulfate proteoglycan and is a member of the syndecan proteoglycan family. The syndecan-1 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Syndecan-1 is a sponge for growth factors and chemokines, with binding largely via heparan sulfate chains. The syndecans mediate cell binding, cell signaling, and cytoskeletal organization and syndecan receptors are required for internalization of the HIV-1 tat protein.
Syndecans are single transmembrane domain proteins that are thought to act as coreceptors, especially for G protein-coupled receptors. More specifically, these core proteins carry three to five heparan sulfate and chondroitin sulfate chains, i.e. they are proteoglycans, which allow for interaction with a large variety of ligands including fibroblast growth factors, vascular endothelial growth factor, transforming growth factor-beta, fibronectin and antithrombin-1. Interactions between fibronectin and some syndecans can be modulated by the extracellular matrix protein tenascin C.
A disintegrin and metalloproteinase with thrombospondin motifs 4 is an enzyme that in humans is encoded by the ADAMTS4 gene.
Perineuronal nets (PNNs) are specialized extracellular matrix structures responsible for synaptic stabilization in the adult brain. PNNs are found around certain neuron cell bodies and proximal neurites in the central nervous system. PNNs play a critical role in the closure of the childhood critical period, and their digestion can cause restored critical period-like synaptic plasticity in the adult brain. They are largely negatively charged and composed of chondroitin sulfate proteoglycans, molecules that play a key role in development and plasticity during postnatal development and in the adult.
Laminin subunit alpha-1 is a protein that in humans is encoded by the LAMA1 gene.
Xylosyltransferase 1 is an enzyme that in humans is encoded by the XYLT1 gene.
Neurocan core protein is a protein that in humans is encoded by the NCAN gene.
Chondroitin sulfate proteoglycans (CSPGs) are proteoglycans consisting of a protein core and a chondroitin sulfate side chain. They are known to be structural components of a variety of human tissues, including cartilage, and also play key roles in neural development and glial scar formation. They are known to be involved in certain cell processes, such as cell adhesion, cell growth, receptor binding, cell migration, and interaction with other extracellular matrix constituents. They are also known to interact with laminin, fibronectin, tenascin, and collagen. CSPGs are generally secreted from cells.
In biochemistry, carbohydrate sulfotransferases are enzymes within the class of sulfotransferases which catalyze the transfer of the sulfate functional group to carbohydrate groups in glycoproteins and glycolipids. Carbohydrates are used by cells for a wide range of functions from structural purposes to extracellular communication. Carbohydrates are suitable for such a wide variety of functions due to the diversity in structure generated from monosaccharide composition, glycosidic linkage positions, chain branching, and covalent modification. Possible covalent modifications include acetylation, methylation, phosphorylation, and sulfation. Sulfation, performed by carbohydrate sulfotransferases, generates carbohydrate sulfate esters. These sulfate esters are only located extracellularly, whether through excretion into the extracellular matrix (ECM) or by presentation on the cell surface. As extracellular compounds, sulfated carbohydrates are mediators of intercellular communication, cellular adhesion, and ECM maintenance.
Neural/glial antigen 2, or NG2, is a rat integral membrane proteoglycan found in the plasma membrane of many diverse cell types. Homologous proteins in other species include human CSPG4, also known as melanoma-associated chondroitin sulfate proteoglycan (MCSP), Mouse AN2, and Sea urchin ECM3. This single-pass transmembrane molecule may be plasma membrane-bound or secreted and associated with the extracellular matrix. It is believed to play a role in functions such as cell adhesion, cell-cell and cell-ECM communication, migration and metastasis, proliferation, and axonal growth, guidance and regeneration. NG2-positive cells include oligodendrocyte progenitor cells (OPCs) and other progenitor cell populations, such as chondroblasts, myoblasts, and pericytes, as well as several different tumors including glioblastoma multiforme and melanoma.
A Link domain or Link module, also known as Xlink domain, is a protein domain that binds to hyaluronic acid. It is important in blood cell migration and apoptosis. The link domain is found in some extracellular proteins in vertebrates such as the hyalectans. It appears to be involved in extracellular matrix assembly and stability, cell adhesion, and migration.
Lecticans, also known as hyalectans, are a family of proteoglycans that are components of the extracellular matrix. There are four members of the lectican family: aggrecan, brevican, neurocan, and versican. Lecticans interact with hyaluronic acid and tenascin-R to form a ternary complex.
Versican identified as a direct miR-126a-5p target that induces telomere shortening, senescence, and nuclear factor κB (NF-κB) pathway activator