Signal peptide

Last updated
Identifiers
SymbolN/A
OPM superfamily 256
OPM protein 1skh

A signal peptide (sometimes referred to as signal sequence, targeting signal, localization signal, localization sequence, transit peptide, leader sequence or leader peptide) is a short peptide (usually 16-30 amino acids long) [1] present at the N-terminus (or occasionally nonclassically at the C-terminus [2] or internally) of most newly synthesized proteins that are destined toward the secretory pathway. [3] These proteins include those that reside either inside certain organelles (the endoplasmic reticulum, Golgi or endosomes), secreted from the cell, or inserted into most cellular membranes. Although most type I membrane-bound proteins have signal peptides, most type II and multi-spanning membrane-bound proteins are targeted to the secretory pathway by their first transmembrane domain, which biochemically resembles a signal sequence except that it is not cleaved. They are a kind of target peptide.

Contents

Function (translocation)

Signal peptides function to prompt a cell to translocate the protein, usually to the cellular membrane. In prokaryotes, signal peptides direct the newly synthesized protein to the SecYEG protein-conducting channel, which is present in the plasma membrane. A homologous system exists in eukaryotes, where the signal peptide directs the newly synthesized protein to the Sec61 channel, which shares structural and sequence homology with SecYEG, but is present in the endoplasmic reticulum. [4] Both the SecYEG and Sec61 channels are commonly referred to as the translocon, and transit through this channel is known as translocation. While secreted proteins are threaded through the channel, transmembrane domains may diffuse across a lateral gate in the translocon to partition into the surrounding membrane.

Structure

The core of the signal peptide contains a long stretch of hydrophobic amino acids (about 5–16 residues long) [5] that has a tendency to form a single alpha-helix and is also referred to as the "h-region". In addition, many signal peptides begin with a short positively charged stretch of amino acids, which may help to enforce proper topology of the polypeptide during translocation by what is known as the positive-inside rule. [6] Because of its close location to the N-terminus it is called the "n-region". At the end of the signal peptide there is typically a stretch of amino acids that is recognized and cleaved by signal peptidase and therefore named cleavage site. This cleavage site is absent from transmembrane-domains that serve as signal peptides, which are sometimes referred to as signal anchor sequences. Signal peptidase may cleave either during or after completion of translocation to generate a free signal peptide and a mature protein. The free signal peptides are then digested by specific proteases. Moreover, different target locations are aimed by different types of signal peptides. For example, the structure of a target peptide aiming for the mitochondrial environment differs in terms of length and shows an alternating pattern of small positively charged and hydrophobic stretches. Nucleus aiming signal peptides can be found at both the N-terminus and the C-terminus of a protein and are in most cases retained in the mature protein.

It is possible to determine the amino acid sequence of the N-terminal signal peptide by Edman degradation, a cyclic procedure that cleaves off the amino acids one at a time. [7] [8]

Co-translational versus post-translational translocation

In both prokaryotes and eukaryotes signal sequences may act co-translationally or post-translationally.

The co-translational pathway is initiated when the signal peptide emerges from the ribosome and is recognized by the signal-recognition particle (SRP). [9] SRP then halts further translation (translational arrest only occurs in Eukaryotes) and directs the signal sequence-ribosome-mRNA complex to the SRP receptor, which is present on the surface of either the plasma membrane (in prokaryotes) or the ER (in eukaryotes). [10] Once membrane-targeting is completed, the signal sequence is inserted into the translocon. Ribosomes are then physically docked onto the cytoplasmic face of the translocon and protein synthesis resumes. [11]

The post-translational pathway is initiated after protein synthesis is completed. In prokaryotes, the signal sequence of post-translational substrates is recognized by the SecB chaperone protein that transfers the protein to the SecA ATPase, which in turn pumps the protein through the translocon. Although post-translational translocation is known to occur in eukaryotes, it is poorly understood. It is known that in yeast post-translational translocation requires the translocon and two additional membrane-bound proteins, Sec62 and Sec63. [12]

Secretion efficiency determination

Signal peptides are extremely heterogeneous, many prokaryotic and eukaryotic ones are functionally interchangeable within or between species and all determine protein secretion efficiency. [13] [14] [15]

Nucleotide level features

In vertebrates, the region of the mRNA that codes for the signal peptide (i.e. the signal sequence coding region, or SSCR) can function as an RNA element with specific activities. SSCRs promote nuclear mRNA export and the proper localization to the surface of the endoplasmic reticulum. In addition SSCRs have specific sequence features: they have low adenine-content, are enriched in certain motifs, and tend to be present in the first exon at a frequency that is higher than expected. [16] [17]

Alternate secretion mechanisms

Proteins without signal peptides can also be secreted by unconventional mechanisms. E.g. Interleukin, Galectin. [18] The process by which such secretory proteins gain access to the cell exterior is termed unconventional protein secretion (UPS). In plants, even 50% of secreted proteins can be UPS dependent. [19]

Nonclassical sequences

Signal peptides are usually located at the N-terminus of proteins. Some have C-terminal or internal signal peptides (examples: peroxisomal targeting signal and nuclear localisation signal). The structure of these nonclassical signal peptides differs vastly from the N-terminal signal peptides. [2]

Nomenclature

Signal peptides are not to be confused with the leader peptides sometimes encoded by leader mRNA, although both are sometimes ambiguously referred to as "leader peptides." These other leader peptides are short polypeptides that do not function in protein localization, but instead may regulate transcription or translation of the main protein, and are not part of the final protein sequence. This type of leader peptide primarily refers to a form of gene regulation found in bacteria, although a similar mechanism is used to regulate eukaryotic genes, which is referred to as uORFs (upstream open reading frames).

See also

Related Research Articles

<span class="mw-page-title-main">Endoplasmic reticulum</span> Cell organelle that synthesizes, folds and processes proteins

The endoplasmic reticulum (ER) is, in essence, the transportation system of the eukaryotic cell, and has many other important functions such as protein folding. It is a type of organelle made up of two subunits – rough endoplasmic reticulum (RER), and smooth endoplasmic reticulum (SER). The endoplasmic reticulum is found in most eukaryotic cells and forms an interconnected network of flattened, membrane-enclosed sacs known as cisternae, and tubular structures in the SER. The membranes of the ER are continuous with the outer nuclear membrane. The endoplasmic reticulum is not found in red blood cells, or spermatozoa.

<span class="mw-page-title-main">Protein biosynthesis</span> Assembly of proteins inside biological cells

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

Protein targeting or protein sorting is the biological mechanism by which proteins are transported to their appropriate destinations within or outside the cell. Proteins can be targeted to the inner space of an organelle, different intracellular membranes, the plasma membrane, or to the exterior of the cell via secretion. Information contained in the protein itself directs this delivery process. Correct sorting is crucial for the cell; errors or dysfunction in sorting have been linked to multiple diseases.

A transmembrane domain (TMD) is a membrane-spanning protein domain. TMDs may consist of one or several alpha-helices or a transmembrane beta barrel. Because the interior of the lipid bilayer is hydrophobic, the amino acid residues in TMDs are often hydrophobic, although proteins such as membrane pumps and ion channels can contain polar residues. TMDs vary greatly in size and hydrophobicity; they may adopt organelle-specific properties.

<span class="mw-page-title-main">Translation (biology)</span> Cellular process of protein synthesis

In biology, translation is the process in living cells in which proteins are produced using RNA molecules as templates. The generated protein is a sequence of amino acids. This sequence is determined by the sequence of nucleotides in the RNA. The nucleotides are considered three at a time. Each such triple results in addition of one specific amino acid to the protein being generated. The matching from nucleotide triple to amino acid is called the genetic code. The translation is performed by a large complex of functional RNA and proteins called ribosomes. The entire process is called gene expression.

The signal recognition particle (SRP) is an abundant, cytosolic, universally conserved ribonucleoprotein that recognizes and targets specific proteins to the endoplasmic reticulum in eukaryotes and the plasma membrane in prokaryotes.

The translocon is a complex of proteins associated with the translocation of polypeptides across membranes. In eukaryotes the term translocon most commonly refers to the complex that transports nascent polypeptides with a targeting signal sequence into the interior space of the endoplasmic reticulum (ER) from the cytosol. This translocation process requires the protein to cross a hydrophobic lipid bilayer. The same complex is also used to integrate nascent proteins into the membrane itself. In prokaryotes, a similar protein complex transports polypeptides across the (inner) plasma membrane or integrates membrane proteins. In either case, the protein complex are formed from Sec proteins, with the heterotrimeric Sec61 being the channel. In prokaryotes, the homologous channel complex is known as SecYEG.

<span class="mw-page-title-main">Secretion</span> Controlled release of substances by cells or tissues

Secretion is the movement of material from one point to another, such as a secreted chemical substance from a cell or gland. In contrast, excretion is the removal of certain substances or waste products from a cell or organism. The classical mechanism of cell secretion is via secretory portals at the plasma membrane called porosomes. Porosomes are permanent cup-shaped lipoprotein structures embedded in the cell membrane, where secretory vesicles transiently dock and fuse to release intra-vesicular contents from the cell.

The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the amine group is bonded to the carboxylic group of another amino acid, making it a chain. That leaves a free carboxylic group at one end of the peptide, called the C-terminus, and a free amine group on the other end called the N-terminus. By convention, peptide sequences are written N-terminus to C-terminus, left to right (in LTR writing systems). This correlates the translation direction to the text direction, because when a protein is translated from messenger RNA, it is created from the N-terminus to the C-terminus, as amino acids are added to the carboxyl end of the protein.

In cell biology, microsomes are heterogeneous vesicle-like artifacts re-formed from pieces of the endoplasmic reticulum (ER) when eukaryotic cells are broken-up in the laboratory; microsomes are not present in healthy, living cells.

<span class="mw-page-title-main">Endoplasm</span> Also known as entoplasm

Endoplasm generally refers to the inner, dense part of a cell's cytoplasm. This is opposed to the ectoplasm which is the outer (non-granulated) layer of the cytoplasm, which is typically watery and immediately adjacent to the plasma membrane. The nucleus is separated from the endoplasm by the nuclear envelope. The different makeups/viscosities of the endoplasm and ectoplasm contribute to the amoeba's locomotion through the formation of a pseudopod. However, other types of cells have cytoplasm divided into endo- and ectoplasm. The endoplasm, along with its granules, contains water, nucleic acids, amino acids, carbohydrates, inorganic ions, lipids, enzymes, and other molecular compounds. It is the site of most cellular processes as it houses the organelles that make up the endomembrane system, as well as those that stand alone. The endoplasm is necessary for most metabolic activities, including cell division.

Sec61, termed SecYEG in prokaryotes, is a membrane protein complex found in all domains of life. As the core component of the translocon, it transports proteins to the endoplasmic reticulum in eukaryotes and out of the cell in prokaryotes. It is a doughnut-shaped pore through the membrane with 3 different subunits (heterotrimeric), SecY (α), SecE (γ), and SecG (β). It has a region called the plug that blocks transport into or out of the ER. This plug is displaced when the hydrophobic region of a nascent polypeptide interacts with another region of Sec61 called the seam, allowing translocation of the polypeptide into the ER lumen.

A secretory protein is any protein, whether it be endocrine or exocrine, which is secreted by a cell. Secretory proteins include many hormones, enzymes, toxins, and antimicrobial peptides. Secretory proteins are synthesized in the endoplasmic reticulum.

<span class="mw-page-title-main">Signal recognition particle receptor</span>

Signal recognition particle (SRP) receptor, also called the docking protein, is a dimer composed of 2 different subunits that are associated exclusively with the rough ER in mammalian cells. Its main function is to identify the SRP units. SRP is a molecule that helps the ribosome-mRNA-polypeptide complexes to settle down on the membrane of the endoplasmic reticulum.

<span class="mw-page-title-main">Signal recognition particle RNA</span>

The signal recognition particle RNA, is part of the signal recognition particle (SRP) ribonucleoprotein complex. SRP recognizes the signal peptide and binds to the ribosome, halting protein synthesis. SRP-receptor is a protein that is embedded in a membrane, and which contains a transmembrane pore. When the SRP-ribosome complex binds to SRP-receptor, SRP releases the ribosome and drifts away. The ribosome resumes protein synthesis, but now the protein is moving through the SRP-receptor transmembrane pore.

<span class="mw-page-title-main">Ribophorin</span>

Ribophorins are dome shaped transmembrane glycoproteins which are located in the membrane of the rough endoplasmic reticulum, but are absent in the membrane of the smooth endoplasmic reticulum. There are two types of ribophorines: ribophorin I and II. These act in the protein complex oligosaccharyltransferase (OST) as two different subunits of the named complex. Ribophorin I and II are only present in eukaryote cells.

A target peptide is a short peptide chain that directs the transport of a protein to a specific region in the cell, including the nucleus, mitochondria, endoplasmic reticulum (ER), chloroplast, apoplast, peroxisome and plasma membrane. Some target peptides are cleaved from the protein by signal peptidases after the proteins are transported.

David Domingo Sabatini is an Argentine-American cell biologist and the Frederick L. Ehrman Professor Emeritus of Cell Biology in the Department of Cell Biology at New York University School of Medicine, which he chaired from 1972 to 2011. Sabatini's major research interests have been on the mechanisms responsible for the structural complexity of the eukaryotic cell. Throughout his career, Sabatini has been recognized for his efforts in promoting science in Latin America.

<span class="mw-page-title-main">Bacterial secretion system</span> Protein complexes present on the cell membranes of bacteria for secretion of substances

Bacterial secretion systems are protein complexes present on the cell membranes of bacteria for secretion of substances. Specifically, they are the cellular devices used by pathogenic bacteria to secrete their virulence factors to invade the host cells. They can be classified into different types based on their specific structure, composition and activity. Generally, proteins can be secreted through two different processes. One process is a one-step mechanism in which proteins from the cytoplasm of bacteria are transported and delivered directly through the cell membrane into the host cell. Another involves a two-step activity in which the proteins are first transported out of the inner cell membrane, then deposited in the periplasm, and finally through the outer cell membrane into the host cell.

References

  1. Kapp, Katja; Schrempf, Sabrina; Lemberg, Marius K.; Dobberstein, Bernhard (2013-01-01). Post-Targeting Functions of Signal Peptides. Landes Bioscience.
  2. 1 2 Owji, Hajar; Nezafat, Navid; Negahdaripour, Manica; Hajiebrahimi, Ali; Ghasemi, Younes (August 2018). "A comprehensive review of signal peptides: Structure, roles, and applications". European Journal of Cell Biology. 97 (6): 422–441. doi:10.1016/j.ejcb.2018.06.003. PMID   29958716. S2CID   49612506.
  3. Blobel G, Dobberstein B (December 1975). "Transfer of proteins across membranes. I. Presence of proteolytically processed and unprocessed nascent immunoglobulin light chains on membrane-bound ribosomes of murine myeloma". The Journal of Cell Biology. 67 (3): 835–51. doi:10.1083/jcb.67.3.835. PMC   2111658 . PMID   811671.
  4. Rapoport TA (November 2007). "Protein translocation across the eukaryotic endoplasmic reticulum and bacterial plasma membranes". Nature. 450 (7170): 663–9. Bibcode:2007Natur.450..663R. doi:10.1038/nature06384. PMID   18046402. S2CID   2497138.
  5. Käll L, Krogh A, Sonnhammer EL (May 2004). "A combined transmembrane topology and signal peptide prediction method". Journal of Molecular Biology. 338 (5): 1027–36. doi:10.1016/j.jmb.2004.03.016. PMID   15111065.
  6. von Heijne G, Gavel Y (July 1988). "Topogenic signals in integral membrane proteins". European Journal of Biochemistry. 174 (4): 671–8. doi: 10.1111/j.1432-1033.1988.tb14150.x . PMID   3134198.
  7. "26.6 Peptide Sequencing: The Edman Degradation". Chemistry LibreTexts. 2015-08-26. Retrieved 2018-09-27.
  8. "N-terminal sequencing service - Edman degradation". www.alphalyse.com. Retrieved 2018-09-27.
  9. Walter P, Ibrahimi I, Blobel G (November 1981). "Translocation of proteins across the endoplasmic reticulum. I. Signal recognition protein (SRP) binds to in-vitro-assembled polysomes synthesizing secretory protein". The Journal of Cell Biology. 91 (2 Pt 1): 545–50. doi:10.1083/jcb.91.2.545. PMC   2111968 . PMID   7309795.
  10. Gilmore R, Blobel G, Walter P (November 1982). "Protein translocation across the endoplasmic reticulum. I. Detection in the microsomal membrane of a receptor for the signal recognition particle". The Journal of Cell Biology. 95 (2 Pt 1): 463–9. doi:10.1083/jcb.95.2.463. PMC   2112970 . PMID   6292235.
  11. Görlich D, Prehn S, Hartmann E, Kalies KU, Rapoport TA (October 1992). "A mammalian homolog of SEC61p and SECYp is associated with ribosomes and nascent polypeptides during translocation". Cell. 71 (3): 489–503. doi:10.1016/0092-8674(92)90517-G. PMID   1423609. S2CID   19078317.
  12. Panzner S, Dreier L, Hartmann E, Kostka S, Rapoport TA (May 1995). "Posttranslational protein transport in yeast reconstituted with a purified complex of Sec proteins and Kar2p". Cell. 81 (4): 561–70. doi: 10.1016/0092-8674(95)90077-2 . PMID   7758110. S2CID   14398668.
  13. Kober L, Zehe C, Bode J (April 2013). "Optimized signal peptides for the development of high expressing CHO cell lines". Biotechnology and Bioengineering. 110 (4): 1164–73. doi:10.1002/bit.24776. PMID   23124363. S2CID   449870.
  14. von Heijne G (July 1985). "Signal sequences. The limits of variation". Journal of Molecular Biology. 184 (1): 99–105. doi:10.1016/0022-2836(85)90046-4. PMID   4032478.
  15. Molino JV, de Carvalho JC, Mayfield SP (2018-02-06). "Comparison of secretory signal peptides for heterologous protein expression in microalgae: Expanding the secretion portfolio for Chlamydomonas reinhardtii". PLOS ONE. 13 (2): e0192433. Bibcode:2018PLoSO..1392433M. doi: 10.1371/journal.pone.0192433 . PMC   5800701 . PMID   29408937.
  16. Palazzo AF, Springer M, Shibata Y, Lee CS, Dias AP, Rapoport TA (December 2007). "The signal sequence coding region promotes nuclear export of mRNA". PLOS Biology. 5 (12): e322. doi: 10.1371/journal.pbio.0050322 . PMC   2100149 . PMID   18052610.
  17. Cenik C, Chua HN, Zhang H, Tarnawsky SP, Akef A, Derti A, et al. (April 2011). Snyder M (ed.). "Genome analysis reveals interplay between 5'UTR introns and nuclear mRNA export for secretory and mitochondrial genes". PLOS Genetics. 7 (4): e1001366. doi: 10.1371/journal.pgen.1001366 . PMC   3077370 . PMID   21533221.
  18. Nickel W, Seedorf M (2008). "Unconventional mechanisms of protein transport to the cell surface of eukaryotic cells". Annual Review of Cell and Developmental Biology. 24: 287–308. doi:10.1146/annurev.cellbio.24.110707.175320. PMID   18590485.
  19. Agrawal GK, Jwa NS, Lebrun MH, Job D, Rakwal R (February 2010). "Plant secretome: unlocking secrets of the secreted proteins". Proteomics. 10 (4): 799–827. doi:10.1002/pmic.200900514. PMID   19953550. S2CID   20647387.