Sialic acid binding Ig-like lectin family | |
---|---|
Identifiers | |
Symbol | SIGLEC |
Membranome | 210 |
Siglecs(Sialic acid-binding immunoglobulin-type lectins) are cell surface proteins that bind sialic acid. They are found primarily on the surface of immune cells and are a subset of the I-type lectins. There are 14 different mammalian Siglecs, providing an array of different functions based on cell surface receptor-ligand interactions. [1]
The first described candidate Siglec was Sialoadhesin (Siglec-1/CD169) a lectin-like adhesion protein on macrophages. [2] Parallel studies by Ajit Varki and colleagues on the previously cloned CD22 (a B cell surface protein involved in adhesion and activation) showed direct evidence for sialic acid recognition. The subsequent cloning of Sialoadhesin by Crocker revealed homology to CD22 (Siglec-2), CD33 (Siglec-3) and myelin-associated glycoprotein (MAG/Siglec-4), leading to the proposal for a family of "Sialoadhesins". Varki then suggested the term Siglec as a better alternative and as a subset of I-type (Ig-type) lectins. This nomenclature was agreed upon and has been adopted by almost all investigators working on these molecules (by convention, Siglecs are always capitalised.) Several additional Siglecs (Siglecs 5–12) have been identified in humans that are highly similar in structure to CD33 and so are collectively referred to as "CD33-related Siglecs". [3] Further Siglecs have been identified including Siglec-14 and Siglec-15. Siglecs have been characterized into two distinct groups: the first and highly conserved-across-mammals group composed of Sialoadhesins, CD22, MAG, and Siglec-15, and a second group comprising Siglecs closely related to CD33. [4] [5] Others such as Siglec-8 and Siglec-9 have homologues in mice and rats (Siglec-F and Siglec-E respectively in both). Humans have a higher number of Siglecs than mice and so the numbering system was based on the human proteins. [6]
Siglecs are Type I transmembrane proteins where the NH3+-terminus is in the extracellular space and the COO−-terminus is cytosolic. [7] Each Siglec contains an N-terminal V-type immunoglobulin domain (Ig domain) which acts as the binding receptor for sialic acid. These lectins are placed into the group of I-type lectins because the lectin domain is an immunoglobulin fold. All Siglecs are extended from the cell surface by C2-type Ig domains which have no binding activity. Siglecs differ in the number of these C2-type domains. [6] As these proteins contain Ig domains, they are members of the Immunoglobulin superfamily (IgSF).
Most Siglecs, such as CD22 and the CD33-related family, contain ITIMs (Immunoreceptor tyrosine-based inhibitory motifs) in their cytosolic region. [7] These act to down-regulate signaling pathways involving phosphorylation, such as those induced by ITAMs (Immunoreceptor tyrosine-based activation motifs). [8] Some, however, like Siglec-14, contain positive amino acid residues that help dock ITAM-containing adaptor proteins such as DAP12. [1]
Due to the acidic nature of sialic acid, Siglec active sites contain a conserved arginine residue which is positively charged at physiological pH. This amino acid forms salt bridges with the carboxyl group of the sugar residue. [6] This is best seen in Sialoadhesin, where arginine at position 97 forms salt bridges with the COO− group of the sialic acid, producing a stable interaction. [9] Each lectin domain is specific for the linkage that connects sialic acid to the glycan. Sialic acid contains numerous hydroxyl groups which can be involved in the formation of glycosidic linkages, which are observed at carbons number 2, 3, 6, and 8 of the sugar backbone. The binding specificity of each Siglec is due to different chemical interactions between the sugar ligand and the Siglec amino acids. The position in space of the individual groups on the sugar and the protein amino acids affects the sialic acid linkage to which each Siglec binds. For example, Sialoadhesin preferentially binds α2,3 linkages over α2,6 linkages. [9]
The primary function of Siglecs is to bind glycans containing sialic acids. These receptor-glycan interactions can be used in cell adhesion, cell signalling and others. The function of Siglecs is limited to their cellular distribution. For example, MAG is found only on oligodendrocytes and schwann cells whereas Sialoadhesin is localised to macrophages.
Most Siglecs are short and do not extend far from the cell surface. This prevents most Siglecs from binding to other cells as mammalian cells are covered in sialic acid-containing glycans. This means that the majority of Siglecs only bind ligands on the surface of the same cell, so called cis -ligands, as they are "swamped" by glycans on the same cell. One exception is Sialoadhesin which contains 16 C2-Ig domains, producing a long, extended protein allowing it to bind trans-ligands, i.e. ligands found on other cells. Others, such as MAG, have also been shown to bind trans-ligands.
The members of the siglec family are paired receptors with opposing intracellular signaling functions. [10] [11] Due to their ITIM-containing cytoplasmic regions, most Siglecs interfere with cellular signalling, inhibiting immune cell activation. Once bound to their ligands, Siglecs recruit inhibitory proteins such as SHP phosphatases via their ITIM domains. [12] The tyrosine contained within the ITIM is phosphorylated after ligand binding and acts as a docking site for SH2 domain-containing proteins like SHP phosphatases. This leads to de-phosphorylation of cellular proteins, down-regulating activating signalling pathways.
Examples of negative signalling:
Siglec-14 contains an arginine residue in its transmembrane region. [15] This binds to the ITAM-containing DAP10 and DAP12 proteins. When bound to its ligand, Siglec-14 leads to activation of cellular signalling pathways via the DAP10 and DAP12 proteins. [7] These proteins up-regulate phosphorylation cascades involving numerous cellular proteins, leading to cellular activation. Siglec-14 appears to co-localise with Siglec-5, and as this protein inhibits cellular signalling pathways, co-ordinate opposing functions within immune cells. [15]
Siglecs that can bind trans-ligands, such as Sialoadhesin, allow cell-cell interactions to take place. These glycan-Siglec interactions allow cells to bind one another, allowing signalling in some cases, or in the case of Sialoadhesin, pathogen uptake. Sialoadhesin's function was originally thought to be important in binding to red blood cells. Sialoadhesin lacks a cytosolic ITIM or a positive residue to bind ITAM-containing adaptors and so is thought not to influence signalling. Studies show that this protein is involved in phagocytosis of bacteria that contain highly sialylated glycan structures such as the lipopolysaccharide of Neisseria meningitidis . [16] Binding to these structures allows the macrophage to phagocytose these bacteria, clearing the system of pathogens.
Siglec-7 is also used in binding to pathogens such as Campylobacter jejuni . This occurs in a sialic acid-dependent manner and brings NK cells and monocytes, on which Siglec-7 is expressed, into contact with these bacteria. [17] The NK cell is then able to kill these foreign pathogens.
Knock-out studies are often used to uncover the function proteins have within a cell. Mice are often used as they express orthologous proteins of ours, or extremely similar homologues.
Some examples of knock-out Siglecs include:
Name | Cellular distribution [7] | Sialic acid linkage specificity [6] | No. of C2-Ig domains [7] | ITIM or positive residue [7] |
---|---|---|---|---|
Siglec-1 (Sialoadhesin) | Macrophages | α2,3>α2,6 | 16 | None |
Siglec-2 (CD22) | B cells | α2,6 | 6 | ITIM |
Siglec-3 (CD33) | Myeloid progenitors, Monocytes | α2,6>α2,3 [20] | 1 | ITIM |
Siglec-4 (MAG) | Myelin | α2,3>α2,6 | 4 | None |
Siglec-5 (CD170) | Neutrophils, Monocytes | α2,3 | 3 | ITIM |
Siglec-6 (CD327) | Trophoblasts, Mast cells | α2,6 | 2 | ITIM |
Siglec-7 (CD328) | NK cells | α2,8>α2,6>α2,3 | 2 | ITIM |
Siglec-8 | Eosinophils, Mast cells | α2,3>α2,6 | 2 | ITIM |
Siglec-9 (CD329) | Monocytes, Neutrophils, Dendritic cells | α2,3=α2,6 (prefers sulfated residues) | 2 | ITIM |
Siglec-10 | B cells | α2,3=α2,6 | 4 | ITIM |
Siglec-11 | B cells | α2,8 | 4 | ITIM |
Siglec-12 [21] | Macrophages | No binding [6] | 2 | ITIM |
Siglec-13 [22] | Chimpanzee monocytes | |||
Siglec-14 | Unknown | α2,6 [15] | 2 | Arginine [15] |
Siglec-15 [4] | Osteoclasts, Macrophages, DCs Siglec-15 [4] [23] [24] | α2,6 [25] | 1 | Lysine [25] |
Siglec-16 [26] | Tissue macrophages | |||
Siglec-17 [22] | NK cells | |||
This table briefly summarises the cellular distribution of each human/primate Siglec; the linkage specificity each has for sialic acid binding; the number of C2-Ig domains it contains; and whether it contains an ITIM or a positive residue to bind ITAM-containing adaptor proteins. References in the column headings correspond to all information displayed in that column, unless other references are shown. Siglec-12 information is referenced by [21] only, excluding the linkage specificity.
Many pathologies have been linked to the spontaneous interactions between sialic acid and the immunosuppressive sialic acid-binding immunoglobulin-like lectin (Siglec) receptors on immune cells such as cancer, [27] HIV-1 [28] and Group B Strep Infection. [29] [30] The sialic acid family branches from glycans, sugar chains comprising various monosaccharides that cover the membrane of every living cell and display a staggering structural diversity. Sialic acids function in protein folding, neural development, cellular interactions, among many other physiological processes. As sialic acids are abundantly expressed in vertebrates and not in microorganisms, they are considered self-antigens or self-structures that play major role in inhibiting harmful immune system activity by regulating neutrophils and B cell tolerance. [31]
Within the immune system, Siglecs, especially those related to CD33, sialic acid and Siglec-binding pathogens are subjected to the runaway Red Queen co-evolution phenomenon by a selection pressure that maintains innate immune system's capacity for self-recognition and ensures prevention of autoimmunity diseases. [32] [33] This evolutionary chain and incessant mutations have made Siglecs one of the most rapidly evolving gene, evidenced by both intra- and inter-species differences. [33] The polymorphism of human-unique Siglec-12, -14 and -16 suggests that the selection pressure is ongoing. [32]
As Siglecs feature distinct binding preferences for the sialic acid and its modifications, several attempts have been made to chemically modify natural sialic acid ligands and eventually led to the creation of sialic acid mimetics (SAMs) with enhanced binding capacity and selectivity towards Siglecs. [34]
SAMs can be used to target Siglecs and modulate Siglec-expressing cells by modifying the sialic acid backbone at various positions, from C-2 to C-9. [34] [35] [36] The carboxylic acid, however, must be left intact. [34] The first attempts were made to develop high-affinity sialic acid mimetics for Siglec-2, which led to the discovery that increased binding affinity came hydrogen bonding and lipophilic interactions between SAMs and Siglec-2. [34] Several separate modifications have been made at the C-2, C-5 and C-9 positions, leading Mesch et al. to hypothesizing that the simultaneous modification at all three positions could lead to optimization of binding. [37]
Success in drastically enhanced binding of SAMs to Siglec 2 suggests that a similar approach can work on other members of the family. Some modifications have included an additional simultaneous modification at the C-4 position on the sialic acid backbone. [34] The development of (copper) I-catalyzed azide alkyne cycloaddition (CuAAC) click chemistry has expedited the identification of new SAMs and allowed for the creation of novel SAMs with high binding to Siglec-3, -5, -6, -7 and -10. [38] As of 2017, SAMs for most Siglecs have been reported, except for Siglec -6, -8, -11, -14, -15 and -16. [34]
Clustering of receptors and high-avidity binding, collectively known as multivalent binding, can enhance the effectiveness of SAMs in human body. Currently, advancements in glycoengineering have made use of SAM-decorated nanoparticles, SAM-decorated polymers and on-cell synthesis of SAMs to present SAMs to Siglecs. [34] Liposomes crosslinked with SAMs also have been shown to aid in presenting antigens to antigen-presenting cells via the Siglec-1 or -7 pathways. [39] Moreover, human cells, engineered with sialic acids carrying Ac5NeuNPoc incorporated into its sialoglycans and 3-bromo-benzyl azide, showed hyperactivity towards Siglec-2. [34] [40]
Pattern recognition receptors (PRRs) play a crucial role in the proper function of the innate immune system. PRRs are germline-encoded host sensors, which detect molecules typical for the pathogens. They are proteins expressed mainly by cells of the innate immune system, such as dendritic cells, macrophages, monocytes, neutrophils, as well as by epithelial cells, to identify two classes of molecules: pathogen-associated molecular patterns (PAMPs), which are associated with microbial pathogens, and damage-associated molecular patterns (DAMPs), which are associated with components of host's cells that are released during cell damage or death. They are also called primitive pattern recognition receptors because they evolved before other parts of the immune system, particularly before adaptive immunity. PRRs also mediate the initiation of antigen-specific adaptive immune response and release of inflammatory cytokines.
CD22, or cluster of differentiation-22, is a molecule belonging to the SIGLEC family of lectins. It is found on the surface of mature B cells and to a lesser extent on some immature B cells. Generally speaking, CD22 is a regulatory molecule that prevents the overactivation of the immune system and the development of autoimmune diseases.
Galectins are a class of proteins that bind specifically to β-galactoside sugars, such as N-acetyllactosamine, which can be bound to proteins by either N-linked or O-linked glycosylation. They are also termed S-type lectins due to their dependency on disulphide bonds for stability and carbohydrate binding. There have been about 15 galectins discovered in mammals, encoded by the LGALS genes, which are numbered in a consecutive manner. Only galectin-1, -2, -3, -4, -7, -7B, -8, -9, -9B, 9C, -10, -12, -13, -14, and -16 have been identified in humans. Galectin-5 and -6 are found in rodents, whereas galectin-11 and -15 are uniquely found in sheep and goats. Members of the galectin family have also been discovered in other mammals, birds, amphibians, fish, nematodes, sponges, and some fungi. Unlike the majority of lectins they are not membrane bound, but soluble proteins with both intra- and extracellular functions. They have distinct but overlapping distributions but found primarily in the cytosol, nucleus, extracellular matrix or in circulation. Although many galectins must be secreted, they do not have a typical signal peptide required for classical secretion. The mechanism and reason for this non-classical secretion pathway is unknown.
Sialoadhesin (SIGLEC-1) is a cell adhesion molecule found on the surface of macrophages. It is found in especially high amounts on macrophages of the spleen, liver, lymph node, bone marrow, colon, and lungs.
CD33 or Siglec-3 is a transmembrane receptor expressed on cells of myeloid lineage. It is usually considered myeloid-specific, but it can also be found on some lymphoid cells.
An immunoreceptor tyrosine-based inhibitory motif (ITIM), is a conserved sequence of amino acids that is found intracellularly in the cytoplasmic domains of many inhibitory receptors of the non-catalytic tyrosine-phosphorylated receptor family found on immune cells. These immune cells include T cells, B cells, NK cells, dendritic cells, macrophages and mast cells. ITIMs have similar structures of S/I/V/LxYxxI/V/L, where x is any amino acid, Y is a tyrosine residue that can be phosphorylated, S is the amino acid serine, I is the amino acid isoleucine, and V is the amino acid valine. ITIMs recruit SH2 domain-containing phosphatases, which inhibit cellular activation. ITIM-containing receptors often serve to target immunoreceptor tyrosine-based activation motif (ITAM)-containing receptors, resulting in an innate inhibition mechanism within cells. ITIM bearing receptors have important role in regulation of immune system allowing negative regulation at different levels of the immune response.
The mannose receptor is a C-type lectin primarily present on the surface of macrophages, immature dendritic cells and liver sinusoidal endothelial cells, but is also expressed on the surface of skin cells such as human dermal fibroblasts and keratinocytes. It is the first member of a family of endocytic receptors that includes Endo180 (CD280), M-type PLA2R, and DEC-205 (CD205).
Sialic acid-binding Ig-like lectin 7 is a protein that in humans is encoded by the SIGLEC7 gene. SIGLEC7 has also been designated as CD328.
Sialic acid-binding Ig-like lectin 12, or Siglec-XII, is a protein that in humans, is encoded by the SIGLEC12 gene.
Sialic acid-binding Ig-like lectin 5 is a protein that in humans is encoded by the SIGLEC5 gene. SIGLEC5 has also been designated CD170.
Sialic acid-binding Ig-like lectin 9 is a protein that in humans is encoded by the SIGLEC9 gene.
Sialic acid-binding Ig-like lectin 8 is a protein that in humans is encoded by the SIGLEC8 gene. This gene is located on chromosome 19q13.4, about 330 kb downstream of the SIGLEC9 gene. Within the siglec family of transmembrane proteins, Siglec-8 belongs to the CD33-related siglec subfamily, a subfamily that has undergone rapid evolution.
Sialic acid-binding Ig-like lectin 10 is a protein that in humans is encoded by the SIGLEC10 gene. Siglec-G is often referred to as the murine paralog of human Siglec-10
Ajit Varki is a physician-scientist who is distinguished professor of medicine and cellular and molecular medicine, founding co-director of the Glycobiology Research and Training Center at the University of California, San Diego (UCSD), and founding co-director of the UCSD/Salk Center for Academic Research and Training in Anthropogeny (CARTA). He is also executive editor of the textbook Essentials of Glycobiology and distinguished visiting professor at the Indian Institute of Technology in Madras and the National Center for Biological Sciences in Bangalore. He is a specialist advisor to the Human Gene Nomenclature Committee.
The word, sialome, is a junction of the Greek word for saliva (sialos) and the suffix used in molecular biology to reference a totality of some sort, -ome. The name relating to its role in biochemistry.
C-type lectin domain family 10 member A (CLEC10A) also designated as CD301 is a protein that in humans is encoded by the CLEC10A gene. CLEC10A is part of the C-type lectin superfamily and binds to N-Acetylgalactosamine (GalNAc). It is mainly expressed on myeloid cells and also on oocytes and very early stages of embryogenesis. CLEC10A is used as a marker of the CD1c+ dendritic cell subgroup, also called cDC2. The actions of CLEC10A are diverse, depending on the ligand and environment.
Killer Activation Receptors (KARs) are receptors expressed on the plasma membrane of Natural Killer cells. KARs work together with Killer Inhibitory Receptors, which inactivate KARs in order to regulate the NK cells functions on hosted or transformed cells.These receptors have a broad binding specificity and are able to broadcast opposite signals. It is the balance between these competing signals that determines if the cytotoxic activity of the NK cell and apoptosis of distressed cell occurs.
Paired receptors are pairs or clusters of receptor proteins that bind to extracellular ligands but have opposing activating and inhibitory signaling effects. Traditionally, paired receptors are defined as homologous pairs with similar extracellular domains and different cytoplasmic regions, whose genes are located together in the genome as part of the same gene cluster and which evolved through gene duplication. Homologous paired receptors often, but not always, have a shared ligand in common. More broadly, pairs of receptors have been identified that exhibit paired functional behavior - responding to a shared ligand with opposing intracellular signals - but are not closely homologous or co-located in the genome. Paired receptors are highly expressed in the cells of the immune system, especially natural killer (NK) and myeloid cells, and are involved in immune regulation.
Sialic acid-binding Ig-like lectin 6 is a protein that in humans is encoded by the SIGLEC6 gene. The gene was originally named CD33L (CD33-like) due to similarities between these genes but later became known as OB-BP1 due to its ability to bind to this factor and, finally, SIGLEC6 as the sixth member of the SIGLEC family of receptors to be identified. The protein has also been given the CD designation CD327.
GlycoRNAs are small non-coding RNAs with sialylated glycans.