Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called proteases, but may also occur by intra-molecular digestion.
Proteolysis in organisms serves many purposes; for example, digestive enzymes break down proteins in food to provide amino acids for the organism, while proteolytic processing of a polypeptide chain after its synthesis may be necessary for the production of an active protein. It is also important in the regulation of some physiological and cellular processes including apoptosis, as well as preventing the accumulation of unwanted or misfolded proteins in cells. Consequently, abnormality in the regulation of proteolysis can cause disease.
Proteolysis can also be used as an analytical tool for studying proteins in the laboratory, and it may also be used in industry, for example in food processing and stain removal.
Limited proteolysis of a polypeptide during or after translation in protein synthesis often occurs for many proteins. This may involve removal of the N-terminal methionine, signal peptide, and/or the conversion of an inactive or non-functional protein to an active one. The precursor to the final functional form of protein is termed proprotein, and these proproteins may be first synthesized as preproprotein. For example, albumin is first synthesized as preproalbumin and contains an uncleaved signal peptide. This forms the proalbumin after the signal peptide is cleaved, and a further processing to remove the N-terminal 6-residue propeptide yields the mature form of the protein. [1]
The initiating methionine (and, in bacteria, fMet) may be removed during translation of the nascent protein. For E. coli , fMet is efficiently removed if the second residue is small and uncharged, but not if the second residue is bulky and charged. [2] In both prokaryotes and eukaryotes, the exposed N-terminal residue may determine the half-life of the protein according to the N-end rule.
Proteins that are to be targeted to a particular organelle or for secretion have an N-terminal signal peptide that directs the protein to its final destination. This signal peptide is removed by proteolysis after their transport through a membrane.
Some proteins and most eukaryotic polypeptide hormones are synthesized as a large precursor polypeptide known as a polyprotein that requires proteolytic cleavage into individual smaller polypeptide chains. The polyprotein pro-opiomelanocortin (POMC) contains many polypeptide hormones. The cleavage pattern of POMC, however, may vary between different tissues, yielding different sets of polypeptide hormones from the same polyprotein.
Many viruses also produce their proteins initially as a single polypeptide chain that were translated from a polycistronic mRNA. This polypeptide is subsequently cleaved into individual polypeptide chains. [1] Common names for the polyprotein include gag (group-specific antigen) in retroviruses and ORF1ab in Nidovirales. The latter name refers to the fact that a slippery sequence in the mRNA that codes for the polypeptide causes ribosomal frameshifting, leading to two different lengths of peptidic chains (a and ab) at an approximately fixed ratio.
Many proteins and hormones are synthesized in the form of their precursors - zymogens, proenzymes, and prehormones. These proteins are cleaved to form their final active structures. Insulin, for example, is synthesized as preproinsulin, which yields proinsulin after the signal peptide has been cleaved. The proinsulin is then cleaved at two positions to yield two polypeptide chains linked by two disulfide bonds. Removal of two C-terminal residues from the B-chain then yields the mature insulin. Protein folding occurs in the single-chain proinsulin form which facilitates formation of the ultimate inter-peptide disulfide bonds, and the ultimate intra-peptide disulfide bond, found in the native structure of insulin.
Proteases in particular are synthesized in the inactive form so that they may be safely stored in cells, and ready for release in sufficient quantity when required. This is to ensure that the protease is activated only in the correct location or context, as inappropriate activation of these proteases can be very destructive for an organism. Proteolysis of the zymogen yields an active protein; for example, when trypsinogen is cleaved to form trypsin, a slight rearrangement of the protein structure that completes the active site of the protease occurs, thereby activating the protein.
Proteolysis can, therefore, be a method of regulating biological processes by turning inactive proteins into active ones. A good example is the blood clotting cascade whereby an initial event triggers a cascade of sequential proteolytic activation of many specific proteases, resulting in blood coagulation. The complement system of the immune response also involves a complex sequential proteolytic activation and interaction that result in an attack on invading pathogens.
Protein degradation may take place intracellularly or extracellularly. In digestion of food, digestive enzymes may be released into the environment for extracellular digestion whereby proteolytic cleavage breaks proteins into smaller peptides and amino acids so that they may be absorbed and used. In animals the food may be processed extracellularly in specialized organs or guts, but in many bacteria the food may be internalized via phagocytosis. Microbial degradation of protein in the environment can be regulated by nutrient availability. For example, limitation for major elements in proteins (carbon, nitrogen, and sulfur) induces proteolytic activity in the fungus Neurospora crassa [3] as well as in of soil organism communities. [4]
Proteins in cells are broken into amino acids. This intracellular degradation of protein serves multiple functions: It removes damaged and abnormal proteins and prevents their accumulation. It also serves to regulate cellular processes by removing enzymes and regulatory proteins that are no longer needed. The amino acids may then be reused for protein synthesis.
The intracellular degradation of protein may be achieved in two ways—proteolysis in lysosome, or a ubiquitin-dependent process that targets unwanted proteins to proteasome. The autophagy-lysosomal pathway is normally a non-selective process, but it may become selective upon starvation whereby proteins with peptide sequence KFERQ or similar are selectively broken down. The lysosome contains a large number of proteases such as cathepsins.
The ubiquitin-mediated process is selective. Proteins marked for degradation are covalently linked to ubiquitin. Many molecules of ubiquitin may be linked in tandem to a protein destined for degradation. The polyubiquinated protein is targeted to an ATP-dependent protease complex, the proteasome. The ubiquitin is released and reused, while the targeted protein is degraded.
Different proteins are degraded at different rates. Abnormal proteins are quickly degraded, whereas the rate of degradation of normal proteins may vary widely depending on their functions. Enzymes at important metabolic control points may be degraded much faster than those enzymes whose activity is largely constant under all physiological conditions. One of the most rapidly degraded proteins is ornithine decarboxylase, which has a half-life of 11 minutes. In contrast, other proteins like actin and myosin have a half-life of a month or more, while, in essence, haemoglobin lasts for the entire life-time of an erythrocyte. [5]
The N-end rule may partially determine the half-life of a protein, and proteins with segments rich in proline, glutamic acid, serine, and threonine (the so-called PEST proteins) have short half-life. [6] Other factors suspected to affect degradation rate include the rate deamination of glutamine and asparagine and oxidation of cystein, histidine, and methionine, the absence of stabilizing ligands, the presence of attached carbohydrate or phosphate groups, the presence of free α-amino group, the negative charge of protein, and the flexibility and stability of the protein. [5] Proteins with larger degrees of intrinsic disorder also tend to have short cellular half-life, [7] with disordered segments having been proposed to facilitate efficient initiation of degradation by the proteasome. [8] [9]
The rate of proteolysis may also depend on the physiological state of the organism, such as its hormonal state as well as nutritional status. In time of starvation, the rate of protein degradation increases.
In human digestion, proteins in food are broken down into smaller peptide chains by digestive enzymes such as pepsin, trypsin, chymotrypsin, and elastase, and into amino acids by various enzymes such as carboxypeptidase, aminopeptidase, and dipeptidase. It is necessary to break down proteins into small peptides (tripeptides and dipeptides) and amino acids so they can be absorbed by the intestines, and the absorbed tripeptides and dipeptides are also further broken into amino acids intracellularly before they enter the bloodstream. [10] Different enzymes have different specificity for their substrate; trypsin, for example, cleaves the peptide bond after a positively charged residue (arginine and lysine); chymotrypsin cleaves the bond after an aromatic residue (phenylalanine, tyrosine, and tryptophan); elastase cleaves the bond after a small non-polar residue such as alanine or glycine.
In order to prevent inappropriate or premature activation of the digestive enzymes (they may, for example, trigger pancreatic self-digestion causing pancreatitis), these enzymes are secreted as inactive zymogen. The precursor of pepsin, pepsinogen, is secreted by the stomach, and is activated only in the acidic environment found in stomach. The pancreas secretes the precursors of a number of proteases such as trypsin and chymotrypsin. The zymogen of trypsin is trypsinogen, which is activated by a very specific protease, enterokinase, secreted by the mucosa of the duodenum. The trypsin, once activated, can also cleave other trypsinogens as well as the precursors of other proteases such as chymotrypsin and carboxypeptidase to activate them.
In bacteria, a similar strategy of employing an inactive zymogen or prezymogen is used. Subtilisin, which is produced by Bacillus subtilis , is produced as preprosubtilisin, and is released only if the signal peptide is cleaved and autocatalytic proteolytic activation has occurred.
Proteolysis is also involved in the regulation of many cellular processes by activating or deactivating enzymes, transcription factors, and receptors, for example in the biosynthesis of cholesterol, [11] or the mediation of thrombin signalling through protease-activated receptors. [12]
Some enzymes at important metabolic control points such as ornithine decarboxylase is regulated entirely by its rate of synthesis and its rate of degradation. Other rapidly degraded proteins include the protein products of proto-oncogenes, which play central roles in the regulation of cell growth.
Cyclins are a group of proteins that activate kinases involved in cell division. The degradation of cyclins is the key step that governs the exit from mitosis and progress into the next cell cycle. [13] Cyclins accumulate in the course the cell cycle, then abruptly disappear just before the anaphase of mitosis. The cyclins are removed via a ubiquitin-mediated proteolytic pathway.
Caspases are an important group of proteases involved in apoptosis or programmed cell death. The precursors of caspase, procaspase, may be activated by proteolysis through its association with a protein complex that forms apoptosome, or by granzyme B, or via the death receptor pathways.
Autoproteolysis takes place in some proteins, whereby the peptide bond is cleaved in a self-catalyzed intramolecular reaction. Unlike zymogens, these autoproteolytic proteins participate in a "single turnover" reaction and do not catalyze further reactions post-cleavage. Examples include cleavage of the Asp-Pro bond in a subset of von Willebrand factor type D (VWD) domains [14] [15] and Neisseria meningitidis FrpC self-processing domain, [16] cleavage of the Asn-Pro bond in Salmonella FlhB protein, [17] Yersinia YscU protein, [18] as well as cleavage of the Gly-Ser bond in a subset of sea urchin sperm protein, enterokinase, and agrin (SEA) domains. [19] In some cases, the autoproteolytic cleavage is promoted by conformational strain of the peptide bond. [19]
Abnormal proteolytic activity is associated with many diseases. [20] In pancreatitis, leakage of proteases and their premature activation in the pancreas results in the self-digestion of the pancreas. People with diabetes mellitus may have increased lysosomal activity and the degradation of some proteins can increase significantly. Chronic inflammatory diseases such as rheumatoid arthritis may involve the release of lysosomal enzymes into extracellular space that break down surrounding tissues. Abnormal proteolysis may result in many age-related neurological diseases such as Alzheimer's due to the generation and ineffective removal of peptides that aggregate in cells. [21]
Proteases may be regulated by antiproteases or protease inhibitors, and imbalance between proteases and antiproteases can result in diseases, for example, in the destruction of lung tissues in emphysema brought on by smoking tobacco. Smoking is thought to increase the neutrophils and macrophages in the lung which release excessive amount of proteolytic enzymes such as elastase, such that they can no longer be inhibited by serpins such as α1-antitrypsin, thereby resulting in the breaking down of connective tissues in the lung. Other proteases and their inhibitors may also be involved in this disease, for example matrix metalloproteinases (MMPs) and tissue inhibitors of metalloproteinases (TIMPs). [22]
Other diseases linked to aberrant proteolysis include muscular dystrophy, degenerative skin disorders, respiratory and gastrointestinal diseases, and malignancy.
Protein backbones are very stable in water at neutral pH and room temperature, although the rate of hydrolysis of different peptide bonds can vary. The half life of a peptide bond under normal conditions can range from 7 years to 350 years, even higher for peptides protected by modified terminus or within the protein interior. [23] [24] [25] The rate of hydrolysis however can be significantly increased by extremes of pH and heat. Spontaneous cleavage of proteins may also involve catalysis by zinc on serine and threonine. [26]
Strong mineral acids can readily hydrolyse the peptide bonds in a protein (acid hydrolysis). The standard way to hydrolyze a protein or peptide into its constituent amino acids for analysis is to heat it to 105 °C for around 24 hours in 6M hydrochloric acid. [27] However, some proteins are resistant to acid hydrolysis. One well-known example is ribonuclease A, which can be purified by treating crude extracts with hot sulfuric acid so that other proteins become degraded while ribonuclease A is left intact. [28]
Certain chemicals cause proteolysis only after specific residues, and these can be used to selectively break down a protein into smaller polypeptides for laboratory analysis. [29] For example, cyanogen bromide cleaves the peptide bond after a methionine. Similar methods may be used to specifically cleave tryptophanyl, aspartyl, cysteinyl, and asparaginyl peptide bonds. Acids such as trifluoroacetic acid and formic acid may be used for cleavage.
Like other biomolecules, proteins can also be broken down by high heat alone. At 250 °C, the peptide bond may be easily hydrolyzed, with its half-life dropping to about a minute. [27] [30] Protein may also be broken down without hydrolysis through pyrolysis; small heterocyclic compounds may start to form upon degradation. Above 500 °C, polycyclic aromatic hydrocarbons may also form, [31] [32] which is of interest in the study of generation of carcinogens in tobacco smoke and cooking at high heat. [33] [34]
Proteolysis is also used in research and diagnostic applications:
Proteases may be classified according to the catalytic group involved in its active site. [39]
Certain types of venom, such as those produced by venomous snakes, can also cause proteolysis. These venoms are, in fact, complex digestive fluids that begin their work outside of the body. Proteolytic venoms cause a wide range of toxic effects, [40] including effects that are:
Chymotrypsin (EC 3.4.21.1, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenum, where it performs proteolysis, the breakdown of proteins and polypeptides. Chymotrypsin preferentially cleaves peptide amide bonds where the side chain of the amino acid N-terminal to the scissile amide bond (the P1 position) is a large hydrophobic amino acid (tyrosine, tryptophan, and phenylalanine). These amino acids contain an aromatic ring in their side chain that fits into a hydrophobic pocket (the S1 position) of the enzyme. It is activated in the presence of trypsin. The hydrophobic and shape complementarity between the peptide substrate P1 side chain and the enzyme S1 binding cavity accounts for the substrate specificity of this enzyme. Chymotrypsin also hydrolyzes other amide bonds in peptides at slower rates, particularly those containing leucine at the P1 position.
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the digestive system of many vertebrates, where it hydrolyzes proteins. Trypsin is formed in the small intestine when its proenzyme form, the trypsinogen produced by the pancreas, is activated. Trypsin cuts peptide chains mainly at the carboxyl side of the amino acids lysine or arginine. It is used for numerous biotechnological processes. The process is commonly referred to as trypsinogen proteolysis or trypsinization, and proteins that have been digested/treated with trypsin are said to have been trypsinized.
A protease is an enzyme that catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products. They do this by cleaving the peptide bonds within proteins by hydrolysis, a reaction where water breaks bonds. Proteases are involved in numerous biological pathways, including digestion of ingested proteins, protein catabolism, and cell signaling.
In biochemistry, a zymogen, also called a proenzyme, is an inactive precursor of an enzyme. A zymogen requires a biochemical change for it to become an active enzyme. The biochemical change usually occurs in Golgi bodies, where a specific part of the precursor enzyme is cleaved in order to activate it. The inactivating piece which is cleaved off can be a peptide unit, or can be independently-folding domains comprising more than 100 residues. Although they limit the enzyme's ability, these N-terminal extensions of the enzyme or a "prosegment" often aid in the stabilization and folding of the enzyme they inhibit.
Serine proteases are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serine proteases fall into two broad categories based on their structure: chymotrypsin-like (trypsin-like) or subtilisin-like.
Protein sequencing is the practical process of determining the amino acid sequence of all or part of a protein or peptide. This may serve to identify the protein or characterize its post-translational modifications. Typically, partial sequencing of a protein provides sufficient information to identify it with reference to databases of protein sequences derived from the conceptual translation of genes.
Trypsinogen is the precursor form of trypsin, a digestive enzyme. It is produced by the pancreas and found in pancreatic juice, along with amylase, lipase, and chymotrypsinogen. It is cleaved to its active form, trypsin, by enteropeptidase, which is found in the intestinal mucosa. Once activated, the trypsin can cleave more trypsinogen into trypsin, a process called autoactivation. Trypsin cleaves the peptide bond on the carboxyl side of basic amino acids such as arginine and lysine.
Enteropeptidase is an enzyme produced by cells of the duodenum and is involved in digestion in humans and other animals. Enteropeptidase converts trypsinogen into its active form trypsin, resulting in the subsequent activation of pancreatic digestive enzymes. Absence of enteropeptidase results in intestinal digestion impairment.
Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad.
Gelatinases are enzymes capable of degrading gelatin through hydrolysis, playing a major role in degradation of extracellular matrix and tissue remodeling. Gelatinases are a type of matrix metalloproteinase (MMP), a family of enzymes that depend on zinc as a cofactor and can break down parts of the extracellular matrix. MMPs have multiple subgroups, including gelatinase A and gelatinase B. Gelatinases are assigned a variety of Enzyme Commission numbers: gelatinase A uses 3.4.24.24, and gelatinase B uses 3.4.24.35, in which the first three numbers are same. The first digit, 3, is the class. Class 3 enzymes are hydrolases, enzymes that catalyze hydrolysis reactions, that is, they cleave bonds in presence of water. The next digit represents sub-class 4, or proteases, which are enzymes who hydrolyze peptide bonds in proteins. The next number is the sub-subclass of 24, which consists of metalloendopeptidases which contain metal ions in their active sites, in this case zinc, which help in cleaving peptide bonds. The last part of the EC number is the serial number, identifying specific enzymes within a sub-subclass. 24 represents gelatinase A, which is a metalloproteinase that breaks down gelatin and collagen, while 35 represents gelatinase B, which hydrolyzes peptide bonds.
A carboxypeptidase is a protease enzyme that hydrolyzes (cleaves) a peptide bond at the carboxy-terminal (C-terminal) end of a protein or peptide. This is in contrast to an aminopeptidases, which cleave peptide bonds at the N-terminus of proteins. Humans, animals, bacteria and plants contain several types of carboxypeptidases that have diverse functions ranging from catabolism to protein maturation. At least two mechanisms have been discussed.
Aminopeptidases are enzymes that catalyze the cleavage of amino acids from the N-terminus (beginning), of proteins or peptides. They are found in many organisms; in the cell, they are found in many organelles, in the cytosol, and as membrane proteins. Aminopeptidases are used in essential cellular functions, and are often zinc metalloenzymes, containing a zinc cofactor.
In molecular biology, the Signal Peptide Peptidase (SPP) is a type of protein that specifically cleaves parts of other proteins. It is an intramembrane aspartyl protease with the conserved active site motifs 'YD' and 'GxGD' in adjacent transmembrane domains (TMDs). Its sequences is highly conserved in different vertebrate species. SPP cleaves remnant signal peptides left behind in membrane by the action of signal peptidase and also plays key roles in immune surveillance and the maturation of certain viral proteins.
Chemical modification refers to a number of various processes involving the alteration of the chemical constitution or structure of molecules.
Protein metabolism denotes the various biochemical processes responsible for the synthesis of proteins and amino acids (anabolism), and the breakdown of proteins by catabolism.
An Oligopeptidase is an enzyme that cleaves peptides but not proteins. This property is due to its structure: the active site of this enzyme is located at the end of a narrow cavity which can only be reached by peptides.
Zingibain, zingipain, or ginger protease is a cysteine protease enzyme found in ginger rhizomes. It catalyses the preferential cleavage of peptides with a proline residue at the P2 position. It has two distinct forms, ginger protease I (GP-I) and ginger protease II (GP-II).
Aureolysin is an extracellular metalloprotease expressed by Staphylococcus aureus. This protease is a major contributor to the bacterium's virulence, or ability to cause disease, by cleaving host factors of the innate immune system as well as regulating S. aureus secreted toxins and cell wall proteins. To catalyze its enzymatic activities, aureolysin requires zinc and calcium which it obtains from the extracellular environment within the host.
Asparagine peptide lyase are one of the seven groups in which proteases, also termed proteolytic enzymes, peptidases, or proteinases, are classified according to their catalytic residue. The catalytic mechanism of the asparagine peptide lyases involves an asparagine residue acting as nucleophile to perform a nucleophilic elimination reaction, rather than hydrolysis, to catalyse the breaking of a peptide bond.
{{cite journal}}
: CS1 maint: numeric names: authors list (link)