The Symbol Nomenclature For Glycans (SNFG) [1] is a community-curated standard for the depiction of simple monosaccharides and complex carbohydrates (glycans) using various colored-coded, geometric shapes, along with defined text additions. [2] [3] It is hosted by the National Center for Biotechnology Information at the NCBI-Glycans Page. [4] It is curated by an international groups of researchers in the field that are collectively called the SNFG Discussion Group. The overall goal of the SNFG is to:
The SNFG consists of a table that provides color coded symbols for various monosaccharides that are commonly found in nature. It also includes a set of footnotes that describe rules for rendering glycans, including guidelines on how to modify the base set of symbols depicted in the table. These footnotes are organized into 10 themes that provide streamlined recommendations for: i. general usage of the SNFG; ii. CMYK / RGB color codes; iii. symbol colors and shapes; iv. ring configurations; v. bond linkage presentation; vi. sialic acids; vii. glycan modifications; viii. amino substitutions; ix. handling ambiguous or partially defined glycans; and x. depicting non-glycan entities using SNFG renderings. More details are available at the main SNFG webpage, [1] which is periodically updated with additional directions.
The monosaccharides can be linked together to describe complex carbohydrate structures or glycans. More exhaustive cases for mammalian species, other eukaryotes, plants and microbes are considered at the main SNFG page. [1]
Several software tools have been developed to support SNFG implementation by the community including:
The SNFG nomenclature has also been adopted as a standard by major databases and journals in the Biomedical Sciences.
In 1978, Stuart Kornfeld and colleagues at the Washington University School of Medicine presented a system for symbolic representation of vertebrate glycans. [10] This system gained popularity when it was implemented as a core method for glycan representation in the NCBI text book Essentials of Glycobiology edited by Ajit Varki (University of California, San Diego) and colleagues. [11] While the first edition of this text published in 1999 used black-and-white symbols similar to the Kornfeld system, color was introduced in the second edition of the text (2009). The advantage of color is that different monosaccharide stereoisomers could now be depicted using the same shape, only with different colors. The system of carbohydrate representation was adopted and widely disseminated by many including the NIGMS-funded Consortium for Functional Glycomics, and thus was often referred to as "CFG Nomenclature". This color representation was vastly expanded in the third edition of the text to include 49 new monosaccharides that appear mostly in non-vertebrates, microbes and plants. Inputs and recommendations from a number of scientists beyond the editors of the Essentials textbook was included in this implementation, and the release of the expanded glycan symbol system was coordinated with the IUPAC Carbohydrate Nomenclature committee. For long-term development of this symbol nomenclature and standardization of glycan representation in the Glycosciences, in 2015, the Essentials editors suggested that the representation be formally called SNFG ('Symbol Nomenclature For Glycans'), and future development be entrusted to a global community of scientists. To aid this development, each of the SNFG monosaccharide symbols was linked to PubChem entries at NCBI/NLM and a dedicated website at NCBI was established for future SNFG updates. [1] Thus, the development of the SNFG is currently undertaken by an international community of scientist that are called the SNFG Discussion Group.
Glycomics is the comprehensive study of glycomes, including genetic, physiologic, pathologic, and other aspects. Glycomics "is the systematic study of all glycan structures of a given cell type or organism" and is a subset of glycobiology. The term glycomics is derived from the chemical prefix for sweetness or a sugar, "glyco-", and was formed to follow the omics naming convention established by genomics and proteomics.
The glycome is the entire complement of sugars, whether free or present in more complex molecules, of an organism. An alternative definition is the entirety of carbohydrates in a cell. The glycome may in fact be one of the most complex entities in nature. "Glycomics, analogous to genomics and proteomics, is the systematic study of all glycan structures of a given cell type or organism" and is a subset of glycobiology.
The Consortium for Functional Glycomics (CFG) is a large research initiative funded in 2001 by a glue grant from the National Institute of General Medical Sciences (NIGMS) to “define paradigms by which protein-carbohydrate interactions mediate cell communication”. To achieve this goal, the CFG studies the functions of:
Defined in the narrowest sense, glycobiology is the study of the structure, biosynthesis, and biology of saccharides that are widely distributed in nature. Sugars or saccharides are essential components of all living things and aspects of the various roles they play in biology are researched in various medical, biochemical and biotechnological fields.
The terms glycan and polysaccharide are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate portion of a glycoconjugate, such as a glycoprotein, glycolipid, or a proteoglycan, even if the carbohydrate is only an oligosaccharide. Glycans usually consist solely of O-glycosidic linkages of monosaccharides. For example, cellulose is a glycan composed of β-1,4-linked D-glucose, and chitin is a glycan composed of β-1,4-linked N-acetyl-D-glucosamine. Glycans can be homo- or heteropolymers of monosaccharide residues, and can be linear or branched.
KEGG is a collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics, metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug development.
Siglecs(Sialic acid-binding immunoglobulin-type lectins) are cell surface proteins that bind sialic acid. They are found primarily on the surface of immune cells and are a subset of the I-type lectins. There are 14 different mammalian Siglecs, providing an array of different functions based on cell surface receptor-ligand interactions.
Glycoinformatics is a field of bioinformatics that pertains to the study of carbohydrates involved in protein post-translational modification. It broadly includes database, software, and algorithm development for the study of carbohydrate structures, glycoconjugates, enzymatic carbohydrate synthesis and degradation, as well as carbohydrate interactions. Conventional usage of the term does not currently include the treatment of carbohydrates from the better-known nutritive aspect.
Ajit Varki is a physician-scientist who is distinguished professor of medicine and cellular and molecular medicine, co-director of the Glycobiology Research and Training Center at the University of California, San Diego (UCSD), and co-director of the UCSD/Salk Center for Academic Research and Training in Anthropogeny (CARTA). He is also executive editor of the textbook Essentials of Glycobiology and Distinguished Visiting Professor at the Indian Institute of Technology in Madras and the National Center for Biological Sciences in Bangalore. He is a specialist advisor to the Human Gene Nomenclature Committee.
Anne Dell is an Australian biochemist specialising in the study of glycomics and the carbohydrate structures that modify proteins. Anne's work could be used to figure out how pathogens such as HIV are able to evade termination by the immune system which could be applied toward understanding how this occurs in fetuses. Her research has also led to the development of higher sensitivity mass spectroscopy techniques which have allowed for the better studying of the structure of carbohydrates. Anne also established GlycoTRIC at Imperial College London, a research center that allows for glycobiology to be better understood in biomedical applications. She is currently Professor of Carbohydrate Biochemistry and Head of the Department of Life Sciences at Imperial College London. Dell's other contributions to the study of Glycobiology are the additions she has made to the textbook "Essentials of Glycobiology" Dell was appointed Commander of the Order of the British Empire (CBE) in the 2009 Birthday Honours.
Glycopeptides are peptides that contain carbohydrate moieties (glycans) covalently attached to the side chains of the amino acid residues that constitute the peptide.
Macromolecular structure validation is the process of evaluating reliability for 3-dimensional atomic models of large biological molecules such as proteins and nucleic acids. These models, which provide 3D coordinates for each atom in the molecule, come from structural biology experiments such as x-ray crystallography or nuclear magnetic resonance (NMR). The validation has three aspects: 1) checking on the validity of the thousands to millions of measurements in the experiment; 2) checking how consistent the atomic model is with those experimental data; and 3) checking consistency of the model with known physical and chemical properties.
Translational glycobiology or applied glycobiology is the branch of glycobiology and glycochemistry that focuses on developing new pharmaceuticals through glycomics and glycoengineering. Research in the field aims to use therapeutic glycoconjugates for preventing transplant rejection, treating various bone diseases, and developing therapeutic cancer vaccines and other targeted therapies.
Carbohydrate Structure Database (CSDB) is a free curated database and service platform in glycoinformatics, launched in 2005 by a group of Russian scientists from N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences. CSDB stores published structural, taxonomical, bibliographic and NMR-spectroscopic data on natural carbohydrates and carbohydrate-related molecules.
The National Center for Functional Glycomics is an organization that is focused on the development of technology development in glycosciences. They are specifically focused on glycan analysis and molecular mechanisms of glycan recognition by proteins important in human biology and disease. The center was established at Emory University in 2013 with $5.5 million funding by National Institutes of Health under the leadership of Richard D. Cummings. The center moved to Harvard University in September 2015 and is currently located at Beth Israel Deaconess Medical Center in Boston Massachusetts. The center is affiliated with the Consortium for Functional Glycomics.
The Minimum Information Required About a Glycomics Experiment (MIRAGE) initiative is part of the Minimum Information Standards and specifically applies to guidelines for reporting on a glycomics experiment. The initiative is supported by the Beilstein Institute for the Advancement of Chemical Sciences. The MIRAGE project focuses on the development of publication guidelines for interaction and structural glycomics data as well as the development of data exchange formats. The project was launched in 2011 in Seattle and set off with the description of the aims of the MIRAGE project.
Glycan arrays, like that offered by the Consortium for Functional Glycomics (CFG), National Center for Functional Glycomics (NCFG) and Z Biotech, LLC, contain carbohydrate compounds that can be screened with lectins, antibodies or cell receptors to define carbohydrate specificity and identify ligands. Glycan array screening works in much the same way as other microarray that is used for instance to study gene expression DNA microarrays or protein interaction Protein microarrays.
Harry Schachter FRSC is a Canadian biochemist and glycobiologist, and professor emeritus at the University of Toronto and the Hospital For Sick Children in Toronto, Canada.
Ten Feizi is a British molecular biologist who is Professor and Director of the Glycosciences Laboratory at Imperial College London. Her research considers the structure and function of glycans. She was awarded the Society for Glycobiology Rosalind Kornfeld award in 2014.
Glycan-Protein interactions represent a class of biomolecular interactions that occur between free or protein-bound glycans and their cognate binding partners. Intramolecular glycan-protein (protein-glycan) interactions occur between glycans and proteins that they are covalently attached to. Together with protein-protein interactions, they form a mechanistic basis for many essential cell processes, especially for cell-cell interactions and host-cell interactions. For instance, SARS-CoV-2, the causative agent of COVID-19, employs its extensively glycosylated spike (S) protein to bind to the ACE2 receptor, allowing it to enter host cells. The spike protein is a trimeric structure, with each subunit containing 22 N-glycosylation sites, making it an attractive target for vaccine search.