N-terminus

Last updated
A tetrapeptide (example: Val-Gly-Ser-Ala) with green highlighted N-terminal a-amino acid (example: L-valine) and blue marked C-terminal a-amino acid (example: L-alanine). This tetrapeptide could be encoded by the mRNA sequence 5'-GUU GGU AGU GCU-3'. Tetrapeptide structural formulae v.1.png
A tetrapeptide (example: Val-Gly-Ser-Ala) with green highlighted N-terminal α-amino acid (example: L-valine) and blue marked C-terminal α-amino acid (example: L-alanine). This tetrapeptide could be encoded by the mRNA sequence 5'-GUU GGU AGU GCU-3'.

The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the amine group is bonded to the carboxylic group of another amino acid, making it a chain. That leaves a free carboxylic group at one end of the peptide, called the C-terminus, and a free amine group on the other end called the N-terminus. By convention, peptide sequences are written N-terminus to C-terminus, left to right (in LTR writing systems). [1] This correlates the translation direction to the text direction, because when a protein is translated from messenger RNA, it is created from the N-terminus to the C-terminus, as amino acids are added to the carboxyl end of the protein.

Contents

Chemistry

Each amino acid has an amine group and a carboxylic group. Amino acids link to one another by peptide bonds which form through a dehydration reaction that joins the carboxyl group of one amino acid to the amine group of the next in a head-to-tail manner to form a polypeptide chain. The chain has two ends – an amine group, the N-terminus, and an unbound carboxyl group, the C-terminus. [2]

When a protein is translated from messenger RNA, it is created from N-terminus to C-terminus. The amino end of an amino acid (on a charged tRNA) during the elongation stage of translation, attaches to the carboxyl end of the growing chain. Since the start codon of the genetic code codes for the amino acid methionine, most protein sequences start with a methionine (or, in bacteria, mitochondria and chloroplasts, the modified version N-formylmethionine, fMet). However, some proteins are modified posttranslationally, for example, by cleavage from a protein precursor, and therefore may have different amino acids at their N-terminus.

Function

N-terminal targeting signals

The N-terminus is the first part of the protein that exits the ribosome during protein biosynthesis. It often contains signal peptide sequences, "intracellular postal codes" that direct delivery of the protein to the proper organelle. The signal peptide is typically removed at the destination by a signal peptidase. The N-terminal amino acid of a protein is an important determinant of its half-life (likelihood of being degraded). This is called the N-end rule.

Signal peptide

The N-terminal signal peptide is recognized by the signal recognition particle (SRP) and results in the targeting of the protein to the secretory pathway. In eukaryotic cells, these proteins are synthesized at the rough endoplasmic reticulum. In prokaryotic cells, the proteins are exported across the cell membrane. In chloroplasts, signal peptides target proteins to the thylakoids.

Mitochondrial targeting peptide

The N-terminal mitochondrial targeting peptide (mtTP) allows the protein to be imported into the mitochondrion.

Chloroplast targeting peptide

The N-terminal chloroplast targeting peptide (cpTP) allows for the protein to be imported into the chloroplast.

N-terminal modifications

Protein N-termini can be modified co - or post-translationally. Modifications include the removal of initiator methionine (iMet) by aminopeptidases, attachment of small chemical groups such as acetyl, propionyl and methyl, and the addition of membrane anchors, such as palmitoyl and myristoyl groups [3]

N-terminal acetylation

N-terminal acetylation is a form of protein modification that can occur in both prokaryotes and eukaryotes. It has been suggested that N-terminal acetylation can prevent a protein from following a secretory pathway. [4]

N-Myristoylation

The N-terminus can be modified by the addition of a myristoyl anchor. Proteins that are modified this way contain a consensus motif at their N-terminus as a modification signal.

N-Acylation

The N-terminus can also be modified by the addition of a fatty acid anchor to form N-acetylated proteins. The most common form of such modification is the addition of a palmitoyl group.

See also

Related Research Articles

<span class="mw-page-title-main">Amino acid</span> Organic compounds containing amine and carboxylic groups

Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 appear in the genetic code of life.

<span class="mw-page-title-main">Peptide</span> Short chains of 2–50 amino acids

Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides.

<span class="mw-page-title-main">Protein primary structure</span> Linear sequence of amino acids in a peptide or protein

Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.

<span class="mw-page-title-main">Protein biosynthesis</span> Assembly of proteins inside biological cells

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

<span class="mw-page-title-main">Proteolysis</span> Breakdown of proteins into smaller polypeptides or amino acids

Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called proteases, but may also occur by intra-molecular digestion.

Protein targeting or protein sorting is the biological mechanism by which proteins are transported to their appropriate destinations within or outside the cell. Proteins can be targeted to the inner space of an organelle, different intracellular membranes, the plasma membrane, or to the exterior of the cell via secretion. Information contained in the protein itself directs this delivery process. Correct sorting is crucial for the cell; errors or dysfunction in sorting have been linked to multiple diseases.

<span class="mw-page-title-main">Post-translational modification</span> Chemical changes in proteins following their translation from mRNA

In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.

The C-terminus is the end of an amino acid chain, terminated by a free carboxyl group (-COOH). When the protein is translated from messenger RNA, it is created from N-terminus to C-terminus. The convention for writing peptide sequences is to put the C-terminal end on the right and write the sequence from N- to C-terminus.

<span class="mw-page-title-main">Dipeptide</span> Shortest peptide molecule, containing two amino acids joined by a single peptide bond

A dipeptide is an organic compound derived from two amino acids. The constituent amino acids can be the same or different. When different, two isomers of the dipeptide are possible, depending on the sequence. Several dipeptides are physiologically important, and some are both physiologically and commercially significant. A well known dipeptide is aspartame, an artificial sweetener.

<span class="mw-page-title-main">Protein sequencing</span> Sequencing of amino acid arrangement in a protein

Protein sequencing is the practical process of determining the amino acid sequence of all or part of a protein or peptide. This may serve to identify the protein or characterize its post-translational modifications. Typically, partial sequencing of a protein provides sufficient information to identify it with reference to databases of protein sequences derived from the conceptual translation of genes.

In molecular biology, biosynthesis is a multi-step, enzyme-catalyzed process where substrates are converted into more complex products in living organisms. In biosynthesis, simple compounds are modified, converted into other compounds, or joined to form macromolecules. This process often consists of metabolic pathways. Some of these biosynthetic pathways are located within a single cellular organelle, while others involve enzymes that are located within multiple cellular organelles. Examples of these biosynthetic pathways include the production of lipid membrane components and nucleotides. Biosynthesis is usually synonymous with anabolism.

<span class="mw-page-title-main">Peptide synthesis</span> Production of peptides

In organic chemistry, peptide synthesis is the production of peptides, compounds where multiple amino acids are linked via amide bonds, also known as peptide bonds. Peptides are chemically synthesized by the condensation reaction of the carboxyl group of one amino acid to the amino group of another. Protecting group strategies are usually necessary to prevent undesirable side reactions with the various amino acid side chains. Chemical peptide synthesis most commonly starts at the carboxyl end of the peptide (C-terminus), and proceeds toward the amino-terminus (N-terminus). Protein biosynthesis in living organisms occurs in the opposite direction.

This is a list of topics in molecular biology. See also index of biochemistry articles.

The peptidyl transferase is an aminoacyltransferase as well as the primary enzymatic function of the ribosome, which forms peptide bonds between adjacent amino acids using tRNAs during the translation process of protein biosynthesis. The substrates for the peptidyl transferase reaction are two tRNA molecules, one bearing the growing peptide chain and the other bearing the amino acid that will be added to the chain. The peptidyl chain and the amino acids are attached to their respective tRNAs via ester bonds to the O atom at the CCA-3' ends of these tRNAs. Peptidyl transferase is an enzyme that catalyzes the addition of an amino acid residue in order to grow the polypeptide chain in protein synthesis. It is located in the large ribosomal subunit, where it catalyzes the peptide bond formation. It is composed entirely of RNA. The alignment between the CCA ends of the ribosome-bound peptidyl tRNA and aminoacyl tRNA in the peptidyl transferase center contribute to its ability to catalyze these reactions. This reaction occurs via nucleophilic displacement. The amino group of the aminoacyl tRNA attacks the terminal carboxyl group of the peptidyl tRNA. Peptidyl transferase activity is carried out by the ribosome. Peptidyl transferase activity is not mediated by any ribosomal proteins but by ribosomal RNA (rRNA), a ribozyme. Ribozymes are the only enzymes which are not made up of proteins, but ribonucleotides. All other enzymes are made up of proteins. This RNA relic is the most significant piece of evidence supporting the RNA World hypothesis.

<span class="mw-page-title-main">Myristoylation</span>

Myristoylation is a lipidation modification where a myristoyl group, derived from myristic acid, is covalently attached by an amide bond to the alpha-amino group of an N-terminal glycine residue. Myristic acid is a 14-carbon saturated fatty acid (14:0) with the systematic name of n-tetradecanoic acid. This modification can be added either co-translationally or post-translationally. N-myristoyltransferase (NMT) catalyzes the myristic acid addition reaction in the cytoplasm of cells. This lipidation event is the most common type of fatty acylation and is present in many organisms, including animals, plants, fungi, protozoans and viruses. Myristoylation allows for weak protein–protein and protein–lipid interactions and plays an essential role in membrane targeting, protein–protein interactions and functions widely in a variety of signal transduction pathways.

<i>N</i>-Formylmethionine Chemical compound

N-Formylmethionine is a derivative of the amino acid methionine in which a formyl group has been added to the amino group. It is specifically used for initiation of protein synthesis from bacterial and organellar genes, and may be removed post-translationally.

Protein metabolism denotes the various biochemical processes responsible for the synthesis of proteins and amino acids (anabolism), and the breakdown of proteins by catabolism.

<span class="mw-page-title-main">Chloroplast DNA</span> DNA located in cellular organelles called chloroplasts

Chloroplast DNA (cpDNA) is the DNA located in chloroplasts, which are photosynthetic organelles located within the cells of some eukaryotic organisms. Chloroplasts, like other types of plastid, contain a genome separate from that in the cell nucleus. The existence of chloroplast DNA was identified biochemically in 1959, and confirmed by electron microscopy in 1962. The discoveries that the chloroplast contains ribosomes and performs protein synthesis revealed that the chloroplast is genetically semi-autonomous. The first complete chloroplast genome sequences were published in 1986, Nicotiana tabacum (tobacco) by Sugiura and colleagues and Marchantia polymorpha (liverwort) by Ozeki et al. Since then, a great number of chloroplast DNAs from various species have been sequenced.

<span class="mw-page-title-main">Non-proteinogenic amino acids</span> Are not naturally encoded in the genome

In biochemistry, non-coded or non-proteinogenic amino acids are distinct from the 22 proteinogenic amino acids, which are naturally encoded in the genome of organisms for the assembly of proteins. However, over 140 non-proteinogenic amino acids occur naturally in proteins and thousands more may occur in nature or be synthesized in the laboratory. Chemically synthesized amino acids can be called unnatural amino acids. Unnatural amino acids can be synthetically prepared from their native analogs via modifications such as amine alkylation, side chain substitution, structural bond extension cyclization, and isosteric replacements within the amino acid backbone. Many non-proteinogenic amino acids are important:

<span class="mw-page-title-main">Arginylation</span> Arginylation Post-translational modification

Arginylation is a post-translational modification in which proteins are modified by the addition of arginine (Arg) at the N-terminal amino group or side chains of reactive amino acids by the enzyme, arginyltransferase (ATE1). Recent studies have also revealed that hundreds of proteins in vivo are arginylated, proteins which are essential for many biological pathways. While still poorly understood in a biological setting, the ATE1 enzyme is highly conserved which suggests that arginylation is an important biological post-translational modification.

References

  1. Reusch, William (5 May 2013). "Peptides & Proteins". Michigan State University Department of Chemistry.
  2. Voet, Donald; Voet, Judith G.; Pratt, Charlotte W. (2013). Fundamentals of Biochemistry: Life at the Molecular Level (4th ed.). Hoboken, NJ: Wiley. ISBN   978-0470547847.
  3. Varland (April 21, 2015). "N-terminal modifications of cellular proteins:The enzymes involved, their substrate specificities and biological effects". Proteomics. 15 (14): 2385–401. doi:10.1002/pmic.201400619. PMC   4692089 . PMID   25914051.
  4. Arnesen, Thomas (May 31, 2011). "Towards a Functional Understanding of Protein N-Terminal Acetylation". PLOS Biology. 9 (5): e1001074. doi: 10.1371/journal.pbio.1001074 . PMC   3104970 . PMID   21655309.