CAS Registry Number

Last updated

A CAS Registry Number, [1] also referred to as CASRN or CAS Number, is a unique numerical identifier assigned by the Chemical Abstracts Service (CAS) to every chemical substance described in the open scientific literature (currently including all substances described from 1957 through the present, plus some substances from the early or mid 1900s), including organic and inorganic compounds, minerals, isotopes, alloys and nonstructurable materials (UVCBs, substances of unknown or variable composition, complex reaction products, or biological origin). [2] CASRNs are generally serial numbers (with a check digit), so they do not contain any information about the structures themselves the way SMILES and InChI strings do.

Contents

The registry maintained by CAS is an authoritative collection of disclosed chemical substance information. It currently identifies more than 164 million unique organic and inorganic substances and 68 million protein and DNA sequences, [3] plus additional information about each substance. It is updated with around 15,000 additional new substances daily. [4]

Use

Historically, chemicals have been identified by a wide variety of synonyms. Frequently these are arcane and constructed according to regional naming conventions relating to chemical formulae, structures or origins. Well-known chemicals may additionally be known via multiple generic, historical, commercial, and/or (black)-market names.

CAS Registry Numbers (CASRN) are simple and regular, convenient for database searches. They offer a reliable, common and international link to every specific substance across the various nomenclatures and disciplines used by branches of science, industry, and regulatory bodies. Almost all molecule databases today allow searching by CAS Registry Number.

Format

A CAS Registry Number has no inherent meaning but is assigned in sequential, increasing order when the substance is identified by CAS scientists for inclusion in the CAS REGISTRY database.

A CASRN is separated by hyphens into three parts, the first consisting from two up to seven digits, [5] the second consisting of two digits, and the third consisting of a single digit serving as a check digit. This current format gives CAS a maximum capacity of 1,000,000,000 unique identifiers.

The check digit is found by taking the last digit times 1, the preceding digit times 2, the preceding digit times 3 etc., adding all these up and computing the sum modulo 10. For example, the CAS number of water is 7732-18-5: the checksum 5 is calculated as (8×1 + 1×2 + 2×3 + 3×4 + 7×5 + 7×6) = 105; 105 mod 10 = 5.

Granularity

Search engines

See also

Notes

  1. CAS registry description Archived 25 July 2008 at the Wayback Machine , by Chemical Abstracts Service
  2. American Chemical Society. "CAS Registry and CASRNs". Archived from the original on 25 July 2008. Retrieved 25 July 2009.
  3. "CAS Content: Substances". www.cas.org. Retrieved 10 August 2020.
  4. Chemical Substances - CAS REGISTRY
  5. 2014-06-18, https://www.cas.org/content/chemical-substances/faqs
  6. Canadian Centre for Occupational Health and Safety. "CHEMINDEX Search" . Retrieved 13 July 2009.
  7. United States National Library of Medicine. "Advanced" . Retrieved 1 December 2009.
  8. American Chemical Society. "Substance Search" . Retrieved 8 July 2009.
  9. National Industrial Chemicals Notification and Assessment Scheme. "AICS Detailed Help / Guidance Notes". Archived from the original on 9 July 2009. Retrieved 8 July 2009.
  10. European Commission Joint research Centre. "ESIS : European chemical Substances Information System". Archived from the original on 1 January 2014. Retrieved 11 July 2009.
  11. Library & Information Centre. "Finding a CAS Registry Number" . Retrieved 11 July 2009.
  12. Environmental Risk Management Authority. "HSNO Chemical Classification Information Database". Archived from the original on 11 July 2009. Retrieved 14 July 2009.
  13. National Industrial Chemicals Notification and Assessment Scheme. "AICS Search Tool". Archived from the original on 14 May 2009. Retrieved 11 July 2009.

To find the CAS number of a compound given its name, formula or structure, the following free resources can be used:

Related Research Articles

UN numbers are four-digit numbers that identify hazardous materials, and articles in the framework of international transport. Some hazardous substances have their own UN numbers, while sometimes groups of chemicals or products with similar properties receive a common UN number. A chemical in its solid state may receive a different UN number than the liquid phase if their hazardous properties differ significantly; substances with different levels of purity may also receive different UN numbers.

Entrez cross-database search engine, or web portal

The Entrez Global Query Cross-Database Search System is a federated search engine, or web portal that allows users to search many discrete health sciences databases at the National Center for Biotechnology Information (NCBI) website. The NCBI is a part of the National Library of Medicine (NLM), which is itself a department of the National Institutes of Health (NIH), which in turn is a part of the United States Department of Health and Human Services. The name "Entrez" was chosen to reflect the spirit of welcoming the public to search the content available from the NLM.

Chemical Abstracts Service Division of the American Chemical Society

Chemical Abstracts Service (CAS) is a division of the American Chemical Society. It is a source of chemical information. CAS is located in Columbus, Ohio, United States.

Mercury(II) oxide chemical compound

Mercury(II) oxide, also called mercuric oxide or simply mercury oxide, has a formula of HgO. It has a red or orange color. Mercury(II) oxide is a solid at room temperature and pressure. The mineral form montroydite is very rarely found.

This article discusses some common molecular file formats, including usage and converting between them.

The IUPAC International Chemical Identifier is a textual identifier for chemical substances, designed to provide a standard way to encode molecular information and to facilitate the search for such information in databases and on the web. Initially developed by IUPAC and NIST from 2000 to 2005, the format and algorithms are non-proprietary.

The Beilstein database is the largest database in the field of organic chemistry, in which compounds are uniquely identified by their Beilstein Registry Number. The database covers the scientific literature from 1771 to the present and contains experimentally validated information on millions of chemical reactions and substances from original scientific publications. The electronic database was created from Beilstein's Handbook of Organic Chemistry, founded by Friedrich Konrad Beilstein in 1881, but has appeared online under a number of different names, including Crossfire Beilstein. Since 2009, the content has been maintained and distributed by Elsevier Information Systems in Frankfurt under the product name "Reaxys".

PubChem is a database of chemical molecules and their activities against biological assays. The system is maintained by the National Center for Biotechnology Information (NCBI), a component of the National Library of Medicine, which is part of the United States National Institutes of Health (NIH). PubChem can be accessed for free through a web user interface. Millions of compound structures and descriptive datasets can be freely downloaded via FTP. PubChem contains substance descriptions and small molecules with fewer than 1000 atoms and 1000 bonds. More than 80 database vendors contribute to the growing PubChem database.

European Community number unique seven-digit identifier

The European Community number is a unique seven-digit identifier that was assigned to substances for regulatory purposes within the European Union by the European Commission. The EC Inventory comprises three individual inventories, EINECS, ELINCS and the NLP list.

Chemical substance Matter of constant composition best characterized by the entities (molecules, formula units, atoms) it is composed of

A chemical substance is a form of matter having constant chemical composition and characteristic properties. Some references add that chemical substance cannot be separated into its constituent elements by physical separation methods, i.e., without breaking chemical bonds. Chemical substances can be simple substances, chemical compounds, or alloys. Chemical elements may or may not be included in the definition, depending on expert viewpoint.

ChemSpider database of chemicals owned by the Royal Society of Chemistry; see P661

ChemSpider is a database of chemicals. ChemSpider is owned by the Royal Society of Chemistry.

Dichlorodiphenyldichloroethane chemical compound

Dichlorodiphenyldichloroethane (DDD) is an organochlorine insecticide that is slightly irritating to the skin. DDD is a metabolite of DDT. DDD is colorless and crystalline; it is closely related chemically and is similar in properties to DDT, but it is considered to be less toxic to animals than DDT. The molecular formula for DDD is (ClC6H4)2CHCHCl2 or C14H10Cl4, whereas the formula for DDT is (ClC6H4)2CHCCl3 or C14H9Cl5.

Chemical compound Substance composed of multiple elements

A chemical compound is a chemical substance composed of many identical molecules composed of atoms from more than one element held together by chemical bonds. A molecule consisting of atoms of only one element is therefore not a compound.

2-Fluoroethanol chemical compound

2-Fluoroethanol is the chemical compound with the formula CH2FCH2OH and the simplest fluorohydrin. This colorless liquid is one of the simplest stable fluorinated alcohols. It was developed for use as a rodenticide, insecticide, and acaricide. Owing to its easy oxidation to fluoroacetic acid, fluoroethanol is highly toxic (LD50 = 10 mg/kg). The related difluoro- and trifluoroethanols are far less dangerous.

The Hazardous Substances Data Bank (HSDB) is a toxicology database on the U.S. National Library of Medicine's (NLM) Toxicology Data Network (TOXNET). It focuses on the toxicology of potentially hazardous chemicals, and includes information on human exposure, industrial hygiene, emergency handling procedures, environmental fate, regulatory requirements, and related areas. All data are referenced and derived from a core set of books, government documents, technical reports, and selected primary journal literature. All entries are peer-reviewed by a Scientific Review Panel (SRP), members of which represent a spectrum of professions and interests. Current Chairs of the SRP are Dr. Marcel J. Cassavant, MD, Toxicology Group, and Dr. Roland Everett Langford, PhD, Environmental Fate Group.

Chemisches Zentralblatt is the first and oldest abstracts journal published in the field of chemistry. It covers the chemical literature from 1830 to 1969 and describes therefore the "birth" of chemistry as science, in contrast to alchemy. The information contained in this German journal is comparable with the content of the leading source of chemical information Chemical Abstracts Service (CAS), which started publishing abstracts in English in 1907.

CompTox Chemicals Dashboard chemical database

The CompTox Chemicals Dashboard is a freely accessible online database created and maintained by the U.S. Environmental Protection Agency (EPA). The database provides access to multiple types of data including physicochemical properties, environmental fate and transport, exposure, usage, in vivo toxicity, and in vitro bioassay. EPA and other scientists use the data and models contained within the dashboard to help identify chemicals that require further testing and reduce the use of animals in chemical testing. The Dashboard is also used to provide public access to information from EPA Action Plans, e.g. around perfluorinated alkylated substances.,

Poly(ethyl methacrylate) (PEMA) is a hydrophobic synthetic acrylate polymer. It has properties similar to the more common PMMA, however it produces less heat during polymerization, has a lower modulus of elasticity and an overall softer texture. It may be vulcanized using lead oxide as a catalyst and it can be softened using ethanol.