Names | |||
---|---|---|---|
IUPAC name Proline | |||
Systematic IUPAC name Pyrrolidine-2-carboxylic acid [1] | |||
Identifiers | |||
3D model (JSmol) | |||
80812 | |||
ChEBI |
| ||
ChEMBL |
| ||
ChemSpider | |||
DrugBank |
| ||
ECHA InfoCard | 100.009.264 | ||
EC Number |
| ||
26927 | |||
KEGG |
| ||
MeSH | Proline | ||
PubChem CID | |||
RTECS number |
| ||
UNII |
| ||
CompTox Dashboard (EPA) | |||
| |||
| |||
Properties | |||
C5H9NO2 | |||
Molar mass | 115.132 g·mol−1 | ||
Appearance | Transparent crystals | ||
Melting point | 205 to 228 °C (401 to 442 °F; 478 to 501 K) (decomposes) | ||
Solubility | 1.5g/100g ethanol 19 degC [2] | ||
log P | -0.06 | ||
Acidity (pKa) | 1.99 (carboxyl), 10.96 (amino) [3] | ||
Supplementary data page | |||
Proline (data page) | |||
Except where otherwise noted, data are given for materials in their standard state (at 25 °C [77 °F], 100 kPa). |
Proline (symbol Pro or P) [4] is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group -NH
2 but is rather a secondary amine. The secondary amine nitrogen is in the protonated form (NH2+) under biological conditions, while the carboxyl group is in the deprotonated −COO− form. The "side chain" from the α carbon connects to the nitrogen forming a pyrrolidine loop, classifying it as a aliphatic amino acid. It is non-essential in humans, meaning the body can synthesize it from the non-essential amino acid L-glutamate. It is encoded by all the codons starting with CC (CCU, CCC, CCA, and CCG).
Proline is the only proteinogenic amino acid which is a secondary amine, as the nitrogen atom is attached both to the α-carbon and to a chain of three carbons that together form a five-membered ring.
Proline was first isolated in 1900 by Richard Willstätter who obtained the amino acid while studying N-methylproline, and synthesized proline by the reaction of sodium salt of diethyl malonate with 1,3-dibromopropane. The next year, Emil Fischer isolated proline from casein and the decomposition products of γ-phthalimido-propylmalonic ester, [5] and published the synthesis of proline from phthalimide propylmalonic ester. [6]
The name proline comes from pyrrolidine, one of its constituents. [7]
Proline is biosynthetically derived from the amino acid L-glutamate. Glutamate-5-semialdehyde is first formed by glutamate 5-kinase (ATP-dependent) and glutamate-5-semialdehyde dehydrogenase (which requires NADH or NADPH). This can then either spontaneously cyclize to form 1-pyrroline-5-carboxylic acid, which is reduced to proline by pyrroline-5-carboxylate reductase (using NADH or NADPH), or turned into ornithine by ornithine aminotransferase, followed by cyclisation by ornithine cyclodeaminase to form proline. [8]
L-Proline has been found to act as a weak agonist of the glycine receptor and of both NMDA and non-NMDA (AMPA/kainate) ionotropic glutamate receptors. [9] [10] [11] It has been proposed to be a potential endogenous excitotoxin. [9] [10] [11] In plants, proline accumulation is a common physiological response to various stresses but is also part of the developmental program in generative tissues (e.g. pollen). [12] [13] [14] [15]
A diet rich in proline was linked to an increased risk of depression in humans in a study from 2022 that was tested on a limited pre-clinical trial on humans and primarily in other organisms. Results were significant in the other organisms. [16]
The distinctive cyclic structure of proline's side chain gives proline an exceptional conformational rigidity compared to other amino acids. It also affects the rate of peptide bond formation between proline and other amino acids. When proline is bound as an amide in a peptide bond, its nitrogen is not bound to any hydrogen, meaning it cannot act as a hydrogen bond donor, but can be a hydrogen bond acceptor.
Peptide bond formation with incoming Pro-tRNAPro in the ribosome is considerably slower than with any other tRNAs, which is a general feature of N-alkylamino acids. [17] Peptide bond formation is also slow between an incoming tRNA and a chain ending in proline; with the creation of proline-proline bonds slowest of all. [18]
The exceptional conformational rigidity of proline affects the secondary structure of proteins near a proline residue and may account for proline's higher prevalence in the proteins of thermophilic organisms. Protein secondary structure can be described in terms of the dihedral angles φ, ψ and ω of the protein backbone. The cyclic structure of proline's side chain locks the angle φ at approximately −65°. [19]
Proline acts as a structural disruptor in the middle of regular secondary structure elements such as alpha helices and beta sheets; however, proline is commonly found as the first residue of an alpha helix and also in the edge strands of beta sheets. Proline is also commonly found in turns (another kind of secondary structure), and aids in the formation of beta turns. This may account for the curious fact that proline is usually solvent-exposed, despite having a completely aliphatic side chain.
Multiple prolines and/or hydroxyprolines in a row can create a polyproline helix, the predominant secondary structure in collagen. The hydroxylation of proline by prolyl hydroxylase (or other additions of electron-withdrawing substituents such as fluorine) increases the conformational stability of collagen significantly. [20] Hence, the hydroxylation of proline is a critical biochemical process for maintaining the connective tissue of higher organisms. Severe diseases such as scurvy can result from defects in this hydroxylation, e.g., mutations in the enzyme prolyl hydroxylase or lack of the necessary ascorbate (vitamin C) cofactor.
Peptide bonds to proline, and to other N-substituted amino acids (such as sarcosine), are able to populate both the cis and trans isomers. Most peptide bonds overwhelmingly adopt the trans isomer (typically 99.9% under unstrained conditions), chiefly because the amide hydrogen (trans isomer) offers less steric repulsion to the preceding Cα atom than does the following Cα atom (cis isomer). By contrast, the cis and trans isomers of the X-Pro peptide bond (where X represents any amino acid) both experience steric clashes with the neighboring substitution and have a much lower energy difference. Hence, the fraction of X-Pro peptide bonds in the cis isomer under unstrained conditions is significantly elevated, with cis fractions typically in the range of 3-10%. [21] However, these values depend on the preceding amino acid, with Gly [22] and aromatic [23] residues yielding increased fractions of the cis isomer. Cis fractions up to 40% have been identified for aromatic–proline peptide bonds. [24]
From a kinetic standpoint, cis–trans proline isomerization is a very slow process that can impede the progress of protein folding by trapping one or more proline residues crucial for folding in the non-native isomer, especially when the native protein requires the cis isomer. This is because proline residues are exclusively synthesized in the ribosome as the trans isomer form. All organisms possess prolyl isomerase enzymes to catalyze this isomerization, and some bacteria have specialized prolyl isomerases associated with the ribosome. However, not all prolines are essential for folding, and protein folding may proceed at a normal rate despite having non-native conformers of many X–Pro peptide bonds.
Proline and its derivatives are often used as asymmetric catalysts in proline organocatalysis reactions. The CBS reduction and proline catalysed aldol condensation are prominent examples.
In brewing, proteins rich in proline combine with polyphenols to produce haze (turbidity). [25]
L-Proline is an osmoprotectant and therefore is used in many pharmaceutical and biotechnological applications.
The growth medium used in plant tissue culture may be supplemented with proline. This can increase growth, perhaps because it helps the plant tolerate the stresses of tissue culture. [26] [ better source needed ] For proline's role in the stress response of plants, see § Biological activity.
Proline is one of the two amino acids that do not follow along with the typical Ramachandran plot, along with glycine. Due to the ring formation connected to the beta carbon, the ψ and φ angles about the peptide bond have fewer allowable degrees of rotation. As a result, it is often found in "turns" of proteins as its free entropy (ΔS) is not as comparatively large to other amino acids and thus in a folded form vs. unfolded form, the change in entropy is smaller. Furthermore, proline is rarely found in α and β structures as it would reduce the stability of such structures, because its side chain α-nitrogen can only form one nitrogen bond.
Additionally, proline is the only amino acid that does not form a red-purple colour when developed by spraying with ninhydrin for uses in chromatography. Proline, instead, produces an orange-yellow colour.
Racemic proline can be synthesized from diethyl malonate and acrylonitrile: [27]
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 appear in the genetic code of life.
An alpha helix is a sequence of amino acids in a protein that are twisted into a coil.
The beta sheet is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet. A β-strand is a stretch of polypeptide chain typically 3 to 10 amino acids long with backbone in an extended conformation. The supramolecular association of β-sheets has been implicated in the formation of the fibrils and protein aggregates observed in amyloidosis, Alzheimer's disease and other proteinopathies.
In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 of one alpha-amino acid and N2 of another, along a peptide or protein chain.
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common secondary structural elements are alpha helices and beta sheets, though beta turns and omega loops occur as well. Secondary structure elements typically spontaneously form as an intermediate before the protein folds into its three dimensional tertiary structure.
(2S,4R)-4-Hydroxyproline, or L-hydroxyproline (C5H9O3N), is an amino acid, abbreviated as Hyp or O, e.g., in Protein Data Bank.
Enoyl-CoA-(∆) isomerase (EC 5.3.3.8, also known as dodecenoyl-CoA- isomerase, 3,2-trans-enoyl-CoA isomerase, ∆3 ,∆2 -enoyl-CoA isomerase, or acetylene-allene isomerase, is an enzyme that catalyzes the conversion of cis- or trans-double bonds of coenzyme A bound fatty acids at gamma-carbon to trans double bonds at beta-carbon as below:
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers – specifically polypeptides – formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid monomer may also be called a residue, which indicates a repeating unit of a polymer. Proteins form by amino acids undergoing condensation reactions, in which the amino acids lose one water molecule per reaction in order to attach to one another with a peptide bond. By convention, a chain under 30 amino acids is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations driven by a number of non-covalent interactions, such as hydrogen bonding, ionic interactions, Van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, cryo-electron microscopy (cryo-EM) and dual polarisation interferometry, to determine the structure of proteins.
A catalytic triad is a set of three coordinated amino acid residues that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes. An acid-base-nucleophile triad is a common motif for generating a nucleophilic residue for covalent catalysis. The residues form a charge-relay network to polarise and activate the nucleophile, which attacks the substrate, forming a covalent intermediate which is then hydrolysed to release the product and regenerate free enzyme. The nucleophile is most commonly a serine or cysteine, but occasionally threonine or even selenocysteine. The 3D structure of the enzyme brings together the triad residues in a precise orientation, even though they may be far apart in the sequence.
A turn is an element of secondary structure in proteins where the polypeptide chain reverses its overall direction.
Bovine pancreatic ribonuclease, also often referred to as bovine pancreatic ribonuclease A or simply RNase A, is a pancreatic ribonuclease enzyme that cleaves single-stranded RNA. Bovine pancreatic ribonuclease is one of the classic model systems of protein science. Two Nobel Prizes in Chemistry have been awarded in recognition of work on bovine pancreatic ribonuclease: in 1972, the Prize was awarded to Christian Anfinsen for his work on protein folding and to Stanford Moore and William Stein for their work on the relationship between the protein's structure and its chemical mechanism; in 1984, the Prize was awarded to Robert Bruce Merrifield for development of chemical synthesis of proteins.
A polyproline helix is a type of protein secondary structure which occurs in proteins comprising repeating proline residues. A left-handed polyproline II helix is formed when sequential residues all adopt (φ,ψ) backbone dihedral angles of roughly and have trans isomers of their peptide bonds. This PPII conformation is also common in proteins and polypeptides with other amino acids apart from proline. Similarly, a more compact right-handed polyproline I helix is formed when sequential residues all adopt (φ,ψ) backbone dihedral angles of roughly and have cis isomers of their peptide bonds. Of the twenty common naturally occurring amino acids, only proline is likely to adopt the cis isomer of the peptide bond, specifically the X-Pro peptide bond; steric and electronic factors heavily favor the trans isomer in most other peptide bonds. However, peptide bonds that replace proline with another N-substituted amino acid are also likely to adopt the cis isomer.
A carboxypeptidase is a protease enzyme that hydrolyzes (cleaves) a peptide bond at the carboxy-terminal (C-terminal) end of a protein or peptide. This is in contrast to an aminopeptidases, which cleave peptide bonds at the N-terminus of proteins. Humans, animals, bacteria and plants contain several types of carboxypeptidases that have diverse functions ranging from catabolism to protein maturation. At least two mechanisms have been discussed.
In molecular biology, immunophilins are endogenous cytosolic peptidyl-prolyl isomerases (PPI) that catalyze the interconversion between the cis and trans isomers of peptide bonds containing the amino acid proline (Pro). They are chaperone molecules that generally assist in the proper folding of diverse "client" proteins. Immunophilins are traditionally classified into two families that differ in sequence and biochemical characteristics. These two families are: "cyclosporin-binding cyclophilins (CyPs)" and "FK506-binding proteins (FKBPs)". In 2005, a group of dual-family immunophilins (DFI) has been discovered, mostly in unicellular organisms; these DFIs are natural chimera of CyP and FKBPs, fused in either order.
The beta hairpin is a simple protein structural motif involving two beta strands that look like a hairpin. The motif consists of two strands that are adjacent in primary structure, oriented in an antiparallel direction, and linked by a short loop of two to five amino acids. Beta hairpins can occur in isolation or as part of a series of hydrogen bonded strands that collectively comprise a beta sheet.
Prolyl isomerase is an enzyme found in both prokaryotes and eukaryotes that interconverts the cis and trans isomers of peptide bonds with the amino acid proline. Proline has an unusually conformationally restrained peptide bond due to its cyclic structure with its side chain bonded to its secondary amine nitrogen. Most amino acids have a strong energetic preference for the trans peptide bond conformation due to steric hindrance, but proline's unusual structure stabilizes the cis form so that both isomers are populated under biologically relevant conditions. Proteins with prolyl isomerase activity include cyclophilin, FKBPs, and parvulin, although larger proteins can also contain prolyl isomerase domains.
Ribonuclease T1 (EC 4.6.1.24, guanyloribonuclease, Aspergillus oryzae ribonuclease, RNase N1, RNase N2, ribonuclease N3, ribonuclease U1, ribonuclease F1, ribonuclease Ch, ribonuclease PP1, ribonuclease SA, RNase F1, ribonuclease C2, binase, RNase Sa, guanyl-specific RNase, RNase G, RNase T1, ribonuclease guaninenucleotido-2'-transferase (cyclizing), ribonuclease N3, ribonuclease N1) is a fungal endonuclease that cleaves single-stranded RNA after guanine residues, i.e., on their 3' end; the most commonly studied form of this enzyme is the version found in the mold Aspergillus oryzae. Owing to its specificity for guanine, RNase T1 is often used to digest denatured RNA prior to sequencing. Similar to other ribonucleases such as barnase and RNase A, ribonuclease T1 has been popular for folding studies.
Antamanide is a cyclic decapeptide isolated from a fungus, the death cap: Amanita phalloides. It was being studied in 1995 as a potential anti-toxin against the effects of phalloidin and for its potential for treating edema. It contains 1 valine residue, 4 proline residues, 1 alanine residue, and 4 phenylalanine residues with a structure of c(Val-Pro-Pro-Ala-Phe-Phe-Pro-Pro-Phe-Phe). It was isolated by determining the source of the anti-phalloidin activity from a lipophillic extraction from the organism. It has been shown that antamanide can react to form alkali metal ion complexes. These include complexes with sodium and calcium ions. When these complexes are formed, the cyclopeptide structure undergoes a conformational change.
In epigenetics, proline isomerization is the effect that cis-trans isomerization of the amino acid proline has on the regulation of gene expression. Similar to aspartic acid, the amino acid proline has the rare property of being able to occupy both cis and trans isomers of its prolyl peptide bonds with ease. Peptidyl-prolyl isomerase, or PPIase, is an enzyme very commonly associated with proline isomerization due to their ability to catalyze the isomerization of prolines. PPIases are present in three types: cyclophilins, FK507-binding proteins, and the parvulins. PPIase enzymes catalyze the transition of proline between cis and trans isomers and are essential to the numerous biological functions controlled and affected by prolyl isomerization Without PPIases, prolyl peptide bonds will slowly switch between cis and trans isomers, a process that can lock proteins in a nonnative structure that can affect render the protein temporarily ineffective. Although this switch can occur on its own, PPIases are responsible for most isomerization of prolyl peptide bonds. The specific amino acid that precedes the prolyl peptide bond also can have an effect on which conformation the bond assumes. For instance, when an aromatic amino acid is bonded to a proline the bond is more favorable to the cis conformation. Cyclophilin A uses an "electrostatic handle" to pull proline into cis and trans formations. Most of these biological functions are affected by the isomerization of proline when one isomer interacts differently than the other, commonly causing an activation/deactivation relationship. As an amino acid, proline is present in many proteins. This aids in the multitude of effects that isomerization of proline can have in different biological mechanisms and functions.