The Unique Ingredient Identifier (UNII) is an alphanumeric identifier linked to a substance's molecular structure or descriptive information and is generated by the Global Substance Registration System (GSRS) of the Food and Drug Administration (FDA). It classifies substances as chemical, protein, nucleic acid, polymer, structurally diverse, or mixture [1] [2] according to the standards outlined by the International Organization for Standardization in ISO 11238 [3] and ISO DTS 19844. [4] UNIIs are non-proprietary, unique, unambiguous, and free to generate and use. [2] A UNII can be generated for substances at any level of complexity, being broad enough to include "any substance, from an atom to an organism." [1]
The GSRS is used to generate permanent, unique identifiers for substances in regulated products, such as ingredients in drug and biological products. The GSRS uses molecular structure, protein and nucleic sequences and descriptive information to generate the UNII. The preferred means for defining a chemical substance is by its two-dimensional molecular structure since it is pertinent to a substance's identity and information regarding a substance's stereochemistry is readily available. [5] Nucleic acids are defined by their sequences and by any modifications that may be present. In the case of proteins only end-group modifications will be uniquely identified, along with any other modifications that are essential for activity. This is because of the inherently heterogenous nature of proteins. Therefore, two different protein substances can share the same UNII and yet have no biosimilarity or therapeutic equivalence. [5] Polymers are defined by their structural repeating units and physical properties such as molecular weight or properties related to molecular weight (e.g. viscosity). Structurally diverse materials are inherently heterogenous preparations from natural materials such as plant extract and vaccines. [2]
The GSRS is a freely distributable software system provided through a collaboration between the FDA, the National Center for Advancing Translational Sciences (NCATS) and the European Medicines Agency (EMA). [1] The GSRS was developed to implement the ISO 11238 standard which is one of the core ISO Identification of Medicinal Product (IDMP) standards. The GSRS Board which governs the GSRS includes experts from FDA, European Regulatory Agencies, and the United States Pharmacopoeia (USP). [1]
Preferred Term | UNII |
---|---|
Methadone | UC6VBE7V1Z |
Methadone hydrochloride | 229809935B |
Oxygen | S88TT14065 |
Hydrogen | 7YNJ3PO35Z |
Water | 059QF0KO0R |
Biopolymers are natural polymers produced by the cells of living organisms. Like other polymers, biopolymers consist of monomeric units that are covalently bonded in chains to form larger molecules. There are three main classes of biopolymers, classified according to the monomers used and the structure of the biopolymer formed: polynucleotides, polypeptides, and polysaccharides. The Polynucleotides, RNA and DNA, are long polymers of nucleotides. Polypeptides include proteins and shorter polymers of amino acids; some major examples include collagen, actin, and fibrin. Polysaccharides are linear or branched chains of sugar carbohydrates; examples include starch, cellulose, and alginate. Other examples of biopolymers include natural rubbers, suberin and lignin, cutin and cutan, melanin, and polyhydroxyalkanoates (PHAs).
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is deoxyribose, a variant of ribose, the polymer is DNA.
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.
A macromolecule is a very large molecule important to biological processes, such as a protein or nucleic acid. It is composed of thousands of covalently bonded atoms. Many macromolecules are polymers of smaller molecules called monomers. The most common macromolecules in biochemistry are biopolymers and large non-polymeric molecules such as lipids, nanogels and macrocycles. Synthetic fibers and experimental materials such as carbon nanotubes are also examples of macromolecules.
Polyethylene glycol (PEG; ) is a polyether compound derived from petroleum with many applications, from industrial manufacturing to medicine. PEG is also known as polyethylene oxide (PEO) or polyoxyethylene (POE), depending on its molecular weight. The structure of PEG is commonly expressed as H−(O−CH2−CH2)n−OH.
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers – specifically polypeptides – formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid monomer may also be called a residue, which indicates a repeating unit of a polymer. Proteins form by amino acids undergoing condensation reactions, in which the amino acids lose one water molecule per reaction in order to attach to one another with a peptide bond. By convention, a chain under 30 amino acids is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations driven by a number of non-covalent interactions, such as hydrogen bonding, ionic interactions, Van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, cryo-electron microscopy (cryo-EM) and dual polarisation interferometry, to determine the structure of proteins.
A peptidomimetic is a small protein-like chain designed to mimic a peptide. They typically arise either from modification of an existing peptide, or by designing similar systems that mimic peptides, such as peptoids and β-peptides. Irrespective of the approach, the altered chemical structure is designed to advantageously adjust the molecular properties such as stability or biological activity. This can have a role in the development of drug-like compounds from existing peptides. Peptidomimetics can be prepared by cyclization of linear peptides or coupling of stable unnatural amino acids. These modifications involve changes to the peptide that will not occur naturally. Unnatural amino acids can be generated from their native analogs via modifications such as amine alkylation, side chain substitution, structural bond extension cyclization, and isosteric replacements within the amino acid backbone. Based on their similarity with the precursor peptide, peptidomimetics can be grouped into four classes where A features the most and D the least similarities. Classes A and B involve peptide-like scaffolds, while classes C and D include small molecules.
Aptamers are oligomers of artificial ssDNA, RNA, XNA, or peptide that bind a specific target molecule, or family of target molecules. They exhibit a range of affinities, with variable levels of off-target binding and are sometimes classified as chemical antibodies. Aptamers and antibodies can be used in many of the same applications, but the nucleic acid-based structure of aptamers, which are mostly oligonucleotides, is very different from the amino acid-based structure of antibodies, which are proteins. This difference can make aptamers a better choice than antibodies for some purposes.
KEGG is a collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics, metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug development.
The history of molecular biology begins in the 1930s with the convergence of various, previously distinct biological and physical disciplines: biochemistry, genetics, microbiology, virology and physics. With the hope of understanding life at its most fundamental level, numerous physicists and chemists also took an interest in what would become molecular biology.
Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a molecule of protein, DNA, or RNA, and that is important to its function. The structure of these molecules may be considered at any of several length scales ranging from the level of individual atoms to the relationships among entire protein subunits. This useful distinction among scales is often expressed as a decomposition of molecular structure into four levels: primary, secondary, tertiary, and quaternary. The scaffold for this multiscale organization of the molecule arises at the secondary level, where the fundamental structural elements are the molecule's various hydrogen bonds. This leads to several recognizable domains of protein structure and nucleic acid structure, including such secondary-structure features as alpha helixes and beta sheets for proteins, and hairpin loops, bulges, and internal loops for nucleic acids. The terms primary, secondary, tertiary, and quaternary structure were introduced by Kaj Ulrik Linderstrøm-Lang in his 1951 Lane Medical Lectures at Stanford University.
In molecular biology and genetics, the sense of a nucleic acid molecule, particularly of a strand of DNA or RNA, refers to the nature of the roles of the strand and its complement in specifying a sequence of amino acids. Depending on the context, sense may have slightly different meanings. For example, the negative-sense strand of DNA is equivalent to the template strand, whereas the positive-sense strand is the non-template strand whose nucleotide sequence is equivalent to the sequence of the mRNA transcript.
Analyte-specific reagents (ASRs) are a class of biological molecules which can be used to identify and measure the amount of an individual chemical substance in biological specimens.
In chemistry, residue is whatever remains or acts as a contaminant after a given class of events. Residue may be the material remaining after a process of preparation, separation, or purification, such as distillation, evaporation, or filtration. It may also denote the undesired by-products of a chemical reaction.
Numerous key discoveries in biology have emerged from studies of RNA, including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution, and structural biology. As of 2010, 30 scientists have been awarded Nobel Prizes for experimental work that includes studies of RNA. Specific discoveries of high biological significance are discussed in this article.
This glossary of biology terms is a list of definitions of fundamental terms and concepts used in biology, the study of life and of living organisms. It is intended as introductory material for novices; for more specific and technical definitions from sub-disciplines and related fields, see Glossary of cell biology, Glossary of genetics, Glossary of evolutionary biology, Glossary of ecology, Glossary of environmental science and Glossary of scientific naming, or any of the organism-specific glossaries in Category:Glossaries of biology.
In host–guest chemistry, macromolecular cages are a type of macromolecule structurally consisting of a three-dimensional chamber surrounded by a molecular framework. Macromolecular cage architectures come in various sizes ranging from 1-50 nm and have varying topologies as well as functions. They can be synthesized through covalent bonding or self-assembly through non-covalent interactions. Most macromolecular cages that are formed through self-assembly are sensitive to pH, temperature, and solvent polarity.
A majority of the human genome is made up of non-protein coding DNA. It infers that such sequences are not commonly employed to encode for a protein. However, even though these regions do not code for protein, they have other functions and carry necessary regulatory information.They can be classified based on the size of the ncRNA. Small noncoding RNA is usually categorized as being under 200 bp in length, whereas long noncoding RNA is greater than 200bp. In addition, they can be categorized by their function within the cell; Infrastructural and Regulatory ncRNAs. Infrastructural ncRNAs seem to have a housekeeping role in translation and splicing and include species such as rRNA, tRNA, snRNA.Regulatory ncRNAs are involved in the modification of other RNAs.
This glossary of cellular and molecular biology is a list of definitions of terms and concepts commonly used in the study of cell biology, molecular biology, and related disciplines, including molecular genetics, biochemistry, and microbiology. It is split across two articles:
This glossary of cellular and molecular biology is a list of definitions of terms and concepts commonly used in the study of cell biology, molecular biology, and related disciplines, including genetics, biochemistry, and microbiology. It is split across two articles: