CAS Registry Number

Last updated

Screenshot of the CAS Common Chemistry database with information about caffeine (58-08-2). Cascommonchemistry2022.png
Screenshot of the CAS Common Chemistry database with information about caffeine (58-08-2).

A CAS Registry Number [1] (also referred to as CAS RN [2] or informally CAS Number) is a unique identification number, assigned by the Chemical Abstracts Service (CAS) in the US to every chemical substance described in the open scientific literature, in order to index the substance in the CAS Registry. This registry includes all substances described since 1957, plus some substances from as far back as the early 1800s; [3] it is a chemical database that includes organic and inorganic compounds, minerals, isotopes, alloys, mixtures, and nonstructurable materials (UVCBs, substances of unknown or variable composition, complex reaction products, or biological origin). [4] CAS RNs are generally serial numbers (with a check digit), so they do not contain any information about the structures themselves the way SMILES and InChI strings do.

Contents

The CAS Registry is an authoritative collection of disclosed chemical substance information. It identifies more than 204 million unique organic and inorganic substances and 69 million protein and DNA sequences, [3] plus additional information about each substance. It is updated with around 15,000 additional new substances daily. [5] A collection of almost 500 thousand CAS registry numbers are made available under a CC BY-NC license at ACS Commons Chemistry. [6]

Use

Historically, chemicals have been identified by a wide variety of synonyms. Frequently these are arcane and constructed according to regional naming conventions relating to chemical formulae, structures or origins. Well-known chemicals may additionally be known via multiple generic, historical, commercial, and/or (black)-market names.

CAS Registry Numbers (CAS RN) are simple and regular, convenient for database searches. They offer a reliable, common and international link to every specific substance across the various nomenclatures and disciplines used by branches of science, industry, and regulatory bodies. Almost all molecule databases today allow searching by CAS Registry Number and is used as a global standard.

Format

A CAS Registry Number has no inherent meaning, but is assigned in sequential, increasing order when the substance is identified by CAS scientists for inclusion in the CAS Registry database.

A CAS RN is separated by hyphens into three parts, the first consisting from two up to seven digits, [7] the second consisting of two digits, and the third consisting of a single digit serving as a check digit. This format gives CAS a maximum capacity of 1,000,000,000 unique numbers.

The check digit is found by taking the last digit times 1, the preceding digit times 2, the preceding digit times 3 etc., adding all these up and computing the sum modulo 10. For example, the CAS number of water is 7732-18-5: the checksum 5 is calculated as (8×1 + 1×2 + 2×3 + 3×4 + 7×5 + 7×6) = 105; 105 mod 10 = 5.

Granularity

Search engines

See also

Related Research Articles

<i>Merck Index</i> Index of chemicals

The Merck Index is an encyclopedia of chemicals, drugs and biologicals with over 10,000 monographs on single substances or groups of related compounds published online by the Royal Society of Chemistry.

A chemical database is a database specifically designed to store chemical information. This information is about chemical and crystal structures, spectra, reactions and syntheses, and thermophysical data.

<span class="mw-page-title-main">Chemical Abstracts Service</span> Division of the American Chemical Society

Chemical Abstracts Service (CAS) is a division of the American Chemical Society. It is a source of chemical information and is located in Columbus, Ohio, United States.

<span class="mw-page-title-main">Tautomer</span> Structural isomers of chemical compounds that readily interconvert

Tautomers are structural isomers of chemical compounds that readily interconvert. The chemical reaction interconverting the two is called tautomerization. This conversion commonly results from the relocation of a hydrogen atom within the compound. The phenomenon of tautomerization is called tautomerism, also called desmotropism. Tautomerism is for example relevant to the behavior of amino acids and nucleic acids, two of the fundamental building blocks of life.

<span class="mw-page-title-main">Mercury(II) oxide</span> Chemical compound

Mercury(II) oxide, also called mercuric oxide or simply mercury oxide, is the inorganic compound with the formula HgO. It has a red or orange color. Mercury(II) oxide is a solid at room temperature and pressure. The mineral form montroydite is very rarely found.

A chemical file format is a type of data file which is used specifically for depicting molecular data. One of the most widely used is the chemical table file format, which is similar to Structure Data Format (SDF) files. They are text files that represent multiple chemical structure records and associated data fields. The XYZ file format is a simple format that usually gives the number of atoms in the first line, a comment on the second, followed by a number of lines with atomic symbols and cartesian coordinates. The Protein Data Bank Format is commonly used for proteins but is also used for other types of molecules. There are many other types which are detailed below. Various software systems are available to convert from one format to another.

Registry of Toxic Effects of Chemical Substances (RTECS) is a database of toxicity information compiled from the open scientific literature without reference to the validity or usefulness of the studies reported. Until 2001 it was maintained by US National Institute for Occupational Safety and Health (NIOSH) as a freely available publication. It is now maintained by the private company BIOVIA or from several value-added resellers and is available only for a fee or by subscription.

The International Chemical Identifier is a textual identifier for chemical substances, designed to provide a standard way to encode molecular information and to facilitate the search for such information in databases and on the web. Initially developed by the International Union of Pure and Applied Chemistry (IUPAC) and National Institute of Standards and Technology (NIST) from 2000 to 2005, the format and algorithms are non-proprietary. Since May 2009, it has been developed by the InChI Trust, a nonprofit charity from the United Kingdom which works to implement and promote the use of InChI.

The Beilstein database is the largest database in the field of organic chemistry, in which compounds are uniquely identified by their Beilstein Registry Number. The database covers the scientific literature from 1771 to the present and contains experimentally validated information on millions of chemical reactions and substances from original scientific publications. The electronic database was created from Handbuch der Organischen Chemie, founded by Friedrich Konrad Beilstein in 1881, but has appeared online under a number of different names, including Crossfire Beilstein. Since 2009, the content has been maintained and distributed by Elsevier Information Systems in Frankfurt under the product name "Reaxys".

PubChem is a database of chemical molecules and their activities against biological assays. The system is maintained by the National Center for Biotechnology Information (NCBI), a component of the National Library of Medicine, which is part of the United States National Institutes of Health (NIH). PubChem can be accessed for free through a web user interface. Millions of compound structures and descriptive datasets can be freely downloaded via FTP. PubChem contains multiple substance descriptions and small molecules with fewer than 100 atoms and 1,000 bonds. More than 80 database vendors contribute to the growing PubChem database.

<span class="mw-page-title-main">European Community number</span> Identifier for substance regulated within EU

The European Community number is a unique seven-digit identifier that was assigned to substances for regulatory purposes within the European Union by the European Commission. The EC Inventory comprises three individual inventories, EINECS, ELINCS and the NLP list.

ChemSpider is a freely accessible online database of chemicals owned by the Royal Society of Chemistry. It contains information on more than 100 million molecules from over 270 data sources, each of them receiving a unique identifier called ChemSpider Identifier.

<span class="mw-page-title-main">Dichlorodiphenyldichloroethane</span> Chemical compound

Dichlorodiphenyldichloroethane (DDD) is an organochlorine insecticide that is slightly irritating to the skin. DDD is a metabolite of DDT. DDD is colorless and crystalline; it is closely related chemically and is similar in properties to DDT, but it is considered to be less toxic to animals than DDT. The molecular formula for DDD is (ClC6H4)2CHCHCl2 or C14H10Cl4, whereas the formula for DDT is (ClC6H4)2CHCCl3 or C14H9Cl5.

The Hazardous Substances Data Bank (HSDB) was a toxicology database on the U.S. National Library of Medicine's (NLM) Toxicology Data Network (TOXNET). It focused on the toxicology of potentially hazardous chemicals, and included information on human exposure, industrial hygiene, emergency handling procedures, environmental fate, regulatory requirements, and related areas. All data were referenced and derived from a core set of books, government documents, technical reports, and selected primary journal literature. Prior to 2020, all entries were peer-reviewed by a Scientific Review Panel (SRP), members of which represented a spectrum of professions and interests. Last Chairs of the SRP are Dr. Marcel J. Cassavant, MD, Toxicology Group, and Dr. Roland Everett Langford, PhD, Environmental Fate Group. The SRP was terminated due to budget cuts and realignment of the NLM.

Reaxys is a web-based tool for the retrieval of information about chemical compounds and data from published literature, including journals and patents. The information includes chemical compounds, chemical reactions, chemical properties, related bibliographic data, substance data with synthesis planning information, as well as experimental procedures from selected journals and patents. It is licensed by Elsevier.

<span class="mw-page-title-main">Segesterone</span> Chemical compound

Segesterone, also known as 17α-hydroxy-16-methylene-19-norprogesterone or as 17α-deacetylnestorone, is a steroidal progestin of the 19-norprogesterone group that was never marketed. An acetate ester, segesterone acetate, better known as nestorone or elcometrine, is marketed for clinical use. Segesterone acetate produces segesterone as a metabolite.

<span class="mw-page-title-main">CompTox Chemicals Dashboard</span> Chemical database

The CompTox Chemicals Dashboard is a freely accessible online database created and maintained by the U.S. Environmental Protection Agency (EPA). The database provides access to multiple types of data including physicochemical properties, environmental fate and transport, exposure, usage, in vivo toxicity, and in vitro bioassay. EPA and other scientists use the data and models contained within the dashboard to help identify chemicals that require further testing and reduce the use of animals in chemical testing. The Dashboard is also used to provide public access to information from EPA Action Plans, e.g. around perfluorinated alkylated substances.

Poly(ethyl methacrylate) (PEMA) is a hydrophobic synthetic acrylate polymer. It has properties similar to the more common PMMA, however it produces less heat during polymerization, has a lower modulus of elasticity and has an overall softer texture. It may be vulcanized using lead oxide as a catalyst and it can be softened using ethanol.

Erbium phosphide is a binary inorganic compound of erbium and phosphorus with the chemical formula ErP.

References

  1. CAS registry description Archived 2008-07-25 at the Wayback Machine , by Chemical Abstracts Service
  2. "CAS Registry Number Verified Partner Program". CAS. Retrieved 4 August 2021.
  3. 1 2 "CAS Content: Substances". www.cas.org. Retrieved 10 August 2020.
  4. American Chemical Society. "CAS Registry and CASRNs". Archived from the original on 25 July 2008. Retrieved 25 July 2009.
  5. Chemical Substances – CAS REGISTRY
  6. "CAS Common Chemistry expands collection of publicly available chemical information". CAS. Retrieved 17 March 2021.
  7. 2014-06-18, https://www.cas.org/content/chemical-substances/faqs
  8. Canadian Centre for Occupational Health and Safety. "CHEMINDEX Search" . Retrieved 13 July 2009.
  9. United States National Library of Medicine. "ChemIDplus Advanced" . Retrieved 28 June 2021.
  10. American Chemical Society. "Substance Search" . Retrieved 8 July 2009.
  11. Jacobs, Andrea; Williams, Dustin; Hickey, Katherine; Patrick, Nathan; Williams, Antony J.; Chalk, Stuart; McEwen, Leah; Willighagen, Egon; Walker, Martin; Bolton, Evan; Sinclair, Gabriel; Sanford, Adam (13 May 2022). "CAS Common Chemistry in 2021: Expanding Access to Trusted Chemical Information for the Scientific Community". Journal of Chemical Information and Modeling. 62 (11): 2737–2743. doi:10.1021/acs.jcim.2c00268. PMC   9199008 . PMID   35559614. S2CID   248778162.
  12. National Industrial Chemicals Notification and Assessment Scheme. "AICS Detailed Help / Guidance Notes". Archived from the original on 9 July 2009. Retrieved 8 July 2009.
  13. European Commission Joint research Centre. "ESIS : European chemical Substances Information System". Archived from the original on 1 January 2014. Retrieved 11 July 2009.
  14. Library & Information Centre. "Finding a CAS Registry Number" . Retrieved 11 July 2009.
  15. Environmental Risk Management Authority. "HSNO Chemical Classification Information Database". Archived from the original on 11 July 2009. Retrieved 14 July 2009.
  16. National Induscctrial Chemicals Notification and Assessment Scheme. "AICS Search Tool". Archived from the original on 14 May 2009. Retrieved 11 July 2009.
  17. "CompTox Chemicals Dashboard | Home". comptox.epa.gov. Retrieved 14 October 2021.

To find the CAS number of a compound given its name, formula or structure, the following free resources can be used: