Substituent

Last updated

In organic chemistry, a substituent is one or a group of atoms that replaces (one or more) atoms, thereby becoming a moiety in the resultant (new) molecule. [1] (In organic chemistry and biochemistry, the terms substituent and functional group , as well as side chain and pendant group , are used almost interchangeably to describe those branches from the parent structure, [2] though certain distinctions are made in polymer chemistry. [3] In polymers, side chains extend from the backbone structure. In proteins, side chains are attached to the alpha carbon atoms of the amino acid backbone.)

Contents

The suffix -yl is used when naming organic compounds that contain a single bond replacing one hydrogen; -ylidene and -ylidyne are used with double bonds and triple bonds, respectively. In addition, when naming hydrocarbons that contain a substituent, positional numbers are used to indicate which carbon atom the substituent attaches to when such information is needed to distinguish between isomers. Substituents can be a combination of the inductive effect and the mesomeric effect. Such effects are also described as electron-rich and electron withdrawing. Additional steric effects result from the volume occupied by a substituent.

The phrases most-substituted and least-substituted are frequently used to describe or compare molecules that are products of a chemical reaction. In this terminology, methane is used as a reference of comparison. Using methane as a reference, for each hydrogen atom that is replaced or "substituted" by something else, the molecule can be said to be more highly substituted. For example:

Nomenclature

The suffix -yl is used in organic chemistry to form names of radicals, either separate species (called free radicals) or chemically bonded parts of molecules (called moieties ). It can be traced back to the old name of methanol, "methylene" (from Ancient Greek : μέθυméthu, 'wine' and ὕληhúlē, [4] 'wood', 'forest'), which became shortened to "methyl" in compound names, from which -yl was extracted. Several reforms of chemical nomenclature eventually generalized the use of the suffix to other organic substituents.

The use of the suffix is determined by the number of hydrogen atoms that the substituent replaces on a parent compound (and also, usually, on the substituent). According to the 1993 IUPAC recommendations: [5]

The suffix -ylidine is encountered sporadically, and appears to be a variant spelling of "-ylidene"; [6] it is not mentioned in the IUPAC guidelines.

For multiple bonds of the same type, which link the substituent to the parent group, the infixes -di-, -tri-, -tetra-, etc., are used: -diyl (two single bonds), -triyl (three single bonds), -tetrayl (four single bonds), -diylidene (two double bonds).

For multiple bonds of different types, multiple suffixes are concatenated: -ylylidene (one single and one double), -ylylidyne (one single and one triple), -diylylidene (two single and one double).

The parent compound name can be altered in two ways:

Note that some popular terms such as "vinyl" (when used to mean "polyvinyl") represent only a portion of the full chemical name.

Methane substituents

According to the above rules, a carbon atom in a molecule, considered as a substituent, has the following names depending on the number of hydrogens bound to it, and the type of bonds formed with the remainder of the molecule:

CH
4
methane no bonds
CH
3
methyl group or methanylone single bond to a non-hydrogen atom
=CH
2
methylene group or methanylidene or methylideneone double bond
CH
2
methylene bridge or methanediyl or methdiyltwo single bonds
≡CH methanylidyne group or methylidyneone triple bond
=CH− methine group or methanylylidene or methylylideneone single bond and one double bond
>CH− methanetriyl group or methtriylthree single bonds
≡C− methanylylidyne group or methylylidyneone triple bond and one single bond
=C= methanediylidene group or methdiylidenetwo double bonds
>C= methanediylylidene group or methdiylylidenetwo single bonds and one double bond
>C< methanetetrayl group or methtetraylfour single bonds

Notation

In a chemical structural formula, an organic substituent such as methyl, ethyl, or aryl can be written as R (or R1, R2, etc.) It is a generic placeholder, the R derived from radical or rest , which may replace any portion of the formula as the author finds convenient. The first to use this symbol was Charles Frédéric Gerhardt in 1844. [8]

The symbol X is often used to denote electronegative substituents such as the halides. [9] [10]

Statistical distribution

One cheminformatics study identified 849,574 unique substituents up to 12 non-hydrogen atoms large and containing only carbon, hydrogen, nitrogen, oxygen, sulfur, phosphorus, selenium, and the halogens in a set of 3,043,941 molecules. Fifty substituents can be considered common as they are found in more than 1% of this set, and 438 are found in more than 0.1%. 64% of the substituents are found in only one molecule. The top 5 most common are the methyl, phenyl, chlorine, methoxy, and hydroxyl substituents. The total number of organic substituents in organic chemistry is estimated at 3.1 million, creating a total of 6.7×1023 molecules. [11] An infinite number of substituents can be obtained simply by increasing carbon chain length. For instance, the substituents methyl (-CH3) and pentyl (-C5H11).

See also

Related Research Articles

<span class="mw-page-title-main">Alkane</span> Type of saturated hydrocarbon compound

In organic chemistry, an alkane, or paraffin, is an acyclic saturated hydrocarbon. In other words, an alkane consists of hydrogen and carbon atoms arranged in a tree structure in which all the carbon–carbon bonds are single. Alkanes have the general chemical formula CnH2n+2. The alkanes range in complexity from the simplest case of methane, where n = 1, to arbitrarily large and complex molecules, like pentacontane or 6-ethyl-2-methyl-5-(1-methylethyl) octane, an isomer of tetradecane.

<span class="mw-page-title-main">Aromatic compound</span> Compound containing rings with delocalized pi electrons

Aromatic compounds, also known as "mono- and polycyclic aromatic hydrocarbons", are organic compounds containing one or more aromatic rings. The word "aromatic" originates from the past grouping of molecules based on odor, before their general chemical properties were understood. The current definition of aromatic compounds does not have any relation with their odor.

<span class="mw-page-title-main">Alkene</span> Hydrocarbon compound containing one or more C=C bonds

In organic chemistry, an alkene is a hydrocarbon containing a carbon–carbon double bond. The double bond may be internal or in the terminal position. Terminal alkenes are also known as α-olefins.

<span class="mw-page-title-main">Alkyne</span> Hydrocarbon compound containing one or more C≡C bonds

In organic chemistry, an alkyne is an unsaturated hydrocarbon containing at least one carbon—carbon triple bond. The simplest acyclic alkynes with only one triple bond and no other functional groups form a homologous series with the general chemical formula CnH2n−2. Alkynes are traditionally known as acetylenes, although the name acetylene also refers specifically to C2H2, known formally as ethyne using IUPAC nomenclature. Like other hydrocarbons, alkynes are generally hydrophobic.

<span class="mw-page-title-main">Carboxylic acid</span> Organic compound containing a –C(=O)OH group

In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group attached to an R-group. The general formula of a carboxylic acid is R−COOH or R−CO2H, with R referring to the alkyl, alkenyl, aryl, or other group. Carboxylic acids occur widely. Important examples include the amino acids and fatty acids. Deprotonation of a carboxylic acid gives a carboxylate anion.

<span class="mw-page-title-main">Ether</span> Organic compounds made of alkyl/aryl groups bound to oxygen (R–O–R)

In organic chemistry, ethers are a class of compounds that contain an ether group—an oxygen atom connected to two alkyl or aryl groups. They have the general formula R−O−R′, where R and R′ represent the alkyl or aryl groups. Ethers can again be classified into two varieties: if the alkyl or aryl groups are the same on both sides of the oxygen atom, then it is a simple or symmetrical ether, whereas if they are different, the ethers are called mixed or unsymmetrical ethers. A typical example of the first group is the solvent and anaesthetic diethyl ether, commonly referred to simply as "ether". Ethers are common in organic chemistry and even more prevalent in biochemistry, as they are common linkages in carbohydrates and lignin.

<span class="mw-page-title-main">Functional group</span> Set of atoms in a molecule which augment its chemical and/or physical properties

In organic chemistry, a functional group is a substituent or moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions regardless of the rest of the molecule's composition. This enables systematic prediction of chemical reactions and behavior of chemical compounds and the design of chemical synthesis. The reactivity of a functional group can be modified by other functional groups nearby. Functional group interconversion can be used in retrosynthetic analysis to plan organic synthesis.

<span class="mw-page-title-main">Ketone</span> Organic compounds of the form >C=O

In organic chemistry, a ketone is a functional group with the structure R−C(=O)−R', where R and R' can be a variety of carbon-containing substituents. Ketones contain a carbonyl group −C(=O)−. The simplest ketone is acetone, with the formula (CH3)2CO. Many ketones are of great importance in biology and in industry. Examples include many sugars (ketoses), many steroids, and the solvent acetone.

<span class="mw-page-title-main">Organic chemistry</span> Subdiscipline of chemistry, focusing on carbon compounds

Organic chemistry is a subdiscipline within chemistry involving the scientific study of the structure, properties, and reactions of organic compounds and organic materials, i.e., matter in its various forms that contain carbon atoms. Study of structure determines their structural formula. Study of properties includes physical and chemical properties, and evaluation of chemical reactivity to understand their behavior. The study of organic reactions includes the chemical synthesis of natural products, drugs, and polymers, and study of individual organic molecules in the laboratory and via theoretical study.

In chemistry, a structural isomer of a compound is another compound whose molecule has the same number of atoms of each element, but with logically distinct bonds between them. The term metamer was formerly used for the same concept.

<span class="mw-page-title-main">Haloalkane</span> Group of chemical compounds derived from alkanes containing one or more halogens

The haloalkanes are alkanes containing one or more halogen substituents. They are a subset of the general class of halocarbons, although the distinction is not often made. Haloalkanes are widely used commercially. They are used as flame retardants, fire extinguishants, refrigerants, propellants, solvents, and pharmaceuticals. Subsequent to the widespread use in commerce, many halocarbons have also been shown to be serious pollutants and toxins. For example, the chlorofluorocarbons have been shown to lead to ozone depletion. Methyl bromide is a controversial fumigant. Only haloalkanes that contain chlorine, bromine, and iodine are a threat to the ozone layer, but fluorinated volatile haloalkanes in theory may have activity as greenhouse gases. Methyl iodide, a naturally occurring substance, however, does not have ozone-depleting properties and the United States Environmental Protection Agency has designated the compound a non-ozone layer depleter. For more information, see Halomethane. Haloalkane or alkyl halides are the compounds which have the general formula "RX" where R is an alkyl or substituted alkyl group and X is a halogen.

<span class="mw-page-title-main">Cycloalkane</span> Saturated alicyclic hydrocarbon

In organic chemistry, the cycloalkanes are the monocyclic saturated hydrocarbons. In other words, a cycloalkane consists only of hydrogen and carbon atoms arranged in a structure containing a single ring, and all of the carbon-carbon bonds are single. The larger cycloalkanes, with more than 20 carbon atoms are typically called cycloparaffins. All cycloalkanes are isomers of alkenes.

In organic chemistry, an alkyl group is an alkane missing one hydrogen. The term alkyl is intentionally unspecific to include many possible substitutions. An acyclic alkyl has the general formula of −CnH2n+1. A cycloalkyl group is derived from a cycloalkane by removal of a hydrogen atom from a ring and has the general formula −CnH2n−1. Typically an alkyl is a part of a larger molecule. In structural formulae, the symbol R is used to designate a generic (unspecified) alkyl group. The smallest alkyl group is methyl, with the formula −CH3.

<span class="mw-page-title-main">Aryl group</span> Molecular groups or substituents derived from an aromatic ring

In organic chemistry, an aryl is any functional group or substituent derived from an aromatic ring, usually an aromatic hydrocarbon, such as phenyl and naphthyl. "Aryl" is used for the sake of abbreviation or generalization, and "Ar" is used as a placeholder for the aryl group in chemical structure diagrams, analogous to “R” used for any organic substituent. “Ar” is not to be confused with the elemental symbol for argon.

In chemical nomenclature, the IUPAC nomenclature of organic chemistry is a method of naming organic chemical compounds as recommended by the International Union of Pure and Applied Chemistry (IUPAC). It is published in the Nomenclature of Organic Chemistry. Ideally, every possible organic compound should have a name from which an unambiguous structural formula can be created. There is also an IUPAC nomenclature of inorganic chemistry.

<span class="mw-page-title-main">Catenation</span> Bonding of atoms of the same element into chains or rings

In chemistry, catenation is the bonding of atoms of the same element into a series, called a chain. A chain or a ring shape may be open if its ends are not bonded to each other, or closed if they are bonded in a ring. The words to catenate and catenation reflect the Latin root catena, "chain".

<span class="mw-page-title-main">Skeletal formula</span> Representation method in chemistry

The skeletal formula, line-angle formula, or shorthand formula of an organic compound is a type of molecular structural formula that serves as a shorthand representation of a molecule's bonding and some details of its molecular geometry. A skeletal formula shows the skeletal structure or skeleton of a molecule, which is composed of the skeletal atoms that make up the molecule. It is represented in two dimensions, as on a piece of paper. It employs certain conventions to represent carbon and hydrogen atoms, which are the most common in organic chemistry.

In organic chemistry, butyl is a four-carbon alkyl radical or substituent group with general chemical formula −C4H9, derived from either of the two isomers (n-butane and isobutane) of butane.

<span class="mw-page-title-main">Neopentane</span> Chemical compound

Neopentane, also called 2,2-dimethylpropane, is a double-branched-chain alkane with five carbon atoms. Neopentane is a flammable gas at room temperature and pressure which can condense into a highly volatile liquid on a cold day, in an ice bath, or when compressed to a higher pressure.

<span class="mw-page-title-main">Cyclic compound</span> Molecule with a ring of bonded atoms

A cyclic compound is a term for a compound in the field of chemistry in which one or more series of atoms in the compound is connected to form a ring. Rings may vary in size from three to many atoms, and include examples where all the atoms are carbon, none of the atoms are carbon, or where both carbon and non-carbon atoms are present. Depending on the ring size, the bond order of the individual links between ring atoms, and their arrangements within the rings, carbocyclic and heterocyclic compounds may be aromatic or non-aromatic; in the latter case, they may vary from being fully saturated to having varying numbers of multiple bonds between the ring atoms. Because of the tremendous diversity allowed, in combination, by the valences of common atoms and their ability to form rings, the number of possible cyclic structures, even of small size numbers in the many billions.

References

  1. "Definition of SUBSTITUENT". www.merriam-webster.com. Retrieved 4 June 2022.
  2. D.R. Bloch (2006). Organic Chemistry Demystified. ISBN   978-0-07-145920-4.
  3. Jenkins, A. D.; Kratochvíl, P.; Stepto, R. F. T.; Suter, U. W. (1996). "PAC, 1996, 68, 2287. Glossary of basic terms in polymer science (IUPAC Recommendations 1996)". Pure and Applied Chemistry. 68 (12): 2287–2311. doi: 10.1351/pac199668122287 . This distinguishes a pendant group as neither oligomeric nor polymeric, whereas a pendant chain must be oligomeric or polymeric.
  4. This name came through a Greek language error: ὕλη (hȳlē) means "wood" ("forest"), ξυλο- (xylo-) means "wood" (the substance)
  5. IUPAC (1997) [1993]. "R-2.5 Substituent Prefix Names Derived from Parent Hydrides". A Guide to IUPAC Nomenclature of Organic Compounds (Recommendations 1993). Blackwell Scientific Publications; Advanced Chemistry Development, Inc.
  6. The PubChem database lists 740,110 results for -ylidene, of which 14 have synonyms where the suffix is replaced by -ylidine. Another 4 results contain -ylidine without listing -ylidene as a synonym.
  7. Nomenclature of Organic Chemistry. IUPAC Recommendations and Preferred Names 2013. Favre, Henri A.,, Powell, Warren H., 1934–, International Union of Pure and Applied Chemistry. Cambridge, UK: Royal Society of Chemistry. 2013. ISBN   9781849733069. OCLC   865143943.{{cite book}}: CS1 maint: others (link)
  8. See:
    • Charles Gerhardt, Précis de chimie organique (Summary of organic chemistry), vol. 1 (Paris, France: Fortin et Masson, 1844), page 29. From page 29: "En désignant, par conséquent, les éléments combustibles par R, sans tenir comptes des proportions atomiques de carbone et d'hydrogène, on peut exprimer d'une manière générale: Par R. — Les hydrogènes carbonés." (Consequently, by designating combustible components by R, without considering the atomic proportions of carbon and hydrogen, one can express in a general way: By R — hydrocarbons.)
    • William B. Jensen (2010) "Ask the Historian: Why is R Used for Hydrocarbon Substituents?," Journal of Chemical Education, 87: 360–361. Available at: University of Cincinnati.
  9. Jensen, W. B. (2010). "Why Is "R" Used To Symbolize Hydrocarbon Substituents?". Journal of Chemical Education. 87 (4): 360–361. Bibcode:2010JChEd..87..360J. doi:10.1021/ed800139p.
  10. The first use of the letter X to denote univalent electronegative groups appeared in:
  11. Ertl, P. (2003). "Cheminformatics Analysis of Organic Substituents: Identification of the Most Common Substituents, Calculation of Substituent Properties, and Automatic Identification of Drug-like Bioisosteric Groups". Journal of Chemical Information and Modeling. 43 (2): 374–380. doi:10.1021/ci0255782. PMID   12653499.