C-terminus

Last updated
A tetrapeptide (example: Val-Gly-Ser-Ala) with green highlighted N-terminal a-amino acid (example: L-valine) and blue marked C-terminal a-amino acid (example: L-alanine). Tetrapeptide structural formulae v.1.png
A tetrapeptide (example: Val-Gly-Ser-Ala) with green highlighted N-terminal α-amino acid (example: L-valine) and blue marked C-terminal α-amino acid (example: L-alanine).

The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, carboxy tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH). When the protein is translated from messenger RNA, it is created from N-terminus to C-terminus. The convention for writing peptide sequences is to put the C-terminal end on the right and write the sequence from N- to C-terminus.

Contents

Chemistry

Each amino acid has a carboxyl group and an amine group. Amino acids link to one another to form a chain by a dehydration reaction which joins the amine group of one amino acid to the carboxyl group of the next. Thus polypeptide chains have an end with an unbound carboxyl group, the C-terminus, and an end with an unbound amine group, the N-terminus. Proteins are naturally synthesized starting from the N-terminus and ending at the C-terminus.[ citation needed ]

Function

C-terminal retention signals

While the N-terminus of a protein often contains targeting signals, the C-terminus can contain retention signals for protein sorting. The most common ER retention signal is the amino acid sequence -KDEL (Lys-Asp-Glu-Leu) or -HDEL (His-Asp-Glu-Leu) at the C-terminus. This keeps the protein in the endoplasmic reticulum and prevents it from entering the secretory pathway.

Peroxisomal targeting signal

The sequence -SKL (Ser-Lys-Leu) or similar near C-terminus serves as peroxisomal targeting signal 1, directing the protein into peroxisome.[ citation needed ]

C-terminal modifications

The C-terminus of proteins can be modified posttranslationally, most commonly by the addition of a lipid anchor to the C-terminus that allows the protein to be inserted into a membrane without having a transmembrane domain.

Prenylation

One form of C-terminal modification is prenylation. During prenylation, a farnesyl- or geranylgeranyl-isoprenoid membrane anchor is added to a cysteine residue near the C-terminus. Small, membrane-bound G proteins are often modified this way.[ citation needed ]

GPI anchors

Another form of C-terminal modification is the addition of a phosphoglycan, glycosylphosphatidylinositol (GPI), as a membrane anchor. The GPI anchor is attached to the C-terminus after proteolytic cleavage of a C-terminal propeptide. The most prominent example for this type of modification is the prion protein.

Methylation

C-terminal leucine is methylated at carboxyl group by enzyme leucine carboxyl methyltransferase 1 in vertebrates, forming methyl ester. [1]

C-terminal domain

RNA POL II in action. ARN pol II en accion.jpg
RNA POL II in action.

The C-terminal domain of some proteins has specialized functions. In humans, the CTD of RNA polymerase II typically consists of up to 52 repeats of the sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser. [2] This allows other proteins to bind to the C-terminal domain of RNA polymerase in order to activate polymerase activity. These domains are then involved in the initiation of DNA transcription, the capping of the RNA transcript, and attachment to the spliceosome for RNA splicing. [3]

See also

Related Research Articles

<span class="mw-page-title-main">Amino acid</span> Organic compounds containing amine and carboxylic groups

Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 appear in the genetic code of life.

<span class="mw-page-title-main">Protein primary structure</span> Linear sequence of amino acids in a peptide or protein

Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.

Protein targeting or protein sorting is the biological mechanism by which proteins are transported to their appropriate destinations within or outside the cell. Proteins can be targeted to the inner space of an organelle, different intracellular membranes, the plasma membrane, or to the exterior of the cell via secretion. Information contained in the protein itself directs this delivery process. Correct sorting is crucial for the cell; errors or dysfunction in sorting have been linked to multiple diseases.

<span class="mw-page-title-main">Post-translational modification</span> Chemical changes in proteins following their translation from mRNA

In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.

<span class="mw-page-title-main">Lipid-anchored protein</span> Membrane protein

Lipid-anchored proteins are proteins located on the surface of the cell membrane that are covalently attached to lipids embedded within the cell membrane. These proteins insert and assume a place in the bilayer structure of the membrane alongside the similar fatty acid tails. The lipid-anchored protein can be located on either side of the cell membrane. Thus, the lipid serves to anchor the protein to the cell membrane. They are a type of proteolipids.

<span class="mw-page-title-main">Index of biochemistry articles</span>

Biochemistry is the study of the chemical processes in living organisms. It deals with the structure and function of cellular components such as proteins, carbohydrates, lipids, nucleic acids and other biomolecules.

<span class="mw-page-title-main">AMPA receptor</span> Transmembrane protein family

The α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (also known as AMPA receptor, AMPAR, or quisqualate receptor) is an ionotropic transmembrane receptor for glutamate (iGluR) and predominantly Na+ ion channel that mediates fast synaptic transmission in the central nervous system (CNS). It has been traditionally classified as a non-NMDA-type receptor, along with the kainate receptor. Its name is derived from its ability to be activated by the artificial glutamate analog AMPA. The receptor was first named the "quisqualate receptor" by Watkins and colleagues after a naturally occurring agonist quisqualate and was only later given the label "AMPA receptor" after the selective agonist developed by Tage Honore and colleagues at the Royal Danish School of Pharmacy in Copenhagen. The GRIA2-encoded AMPA receptor ligand binding core (GluA2 LBD) was the first glutamate receptor ion channel domain to be crystallized.

A signal peptide is a short peptide present at the N-terminus of most newly synthesized proteins that are destined toward the secretory pathway. These proteins include those that reside either inside certain organelles, secreted from the cell, or inserted into most cellular membranes. Although most type I membrane-bound proteins have signal peptides, most type II and multi-spanning membrane-bound proteins are targeted to the secretory pathway by their first transmembrane domain, which biochemically resembles a signal sequence except that it is not cleaved. They are a kind of target peptide.

Glycosylphosphatidylinositol or glycophosphatidylinositol (GPI) is a phosphoglyceride that can be attached to the C-terminus of a protein during posttranslational modification. The resulting GPI-anchored proteins play key roles in a wide variety of biological processes. GPI is composed of a phosphatidylinositol group linked through a carbohydrate-containing linker and via an ethanolamine phosphate (EtNP) bridge to the C-terminal amino acid of a mature protein. The two fatty acids within the hydrophobic phosphatidyl-inositol group anchor the protein to the cell membrane.

The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the amine group is bonded to the carboxylic group of another amino acid, making it a chain. That leaves a free carboxylic group at one end of the peptide, called the C-terminus, and a free amine group on the other end called the N-terminus. By convention, peptide sequences are written N-terminus to C-terminus, left to right (in LTR writing systems). This correlates the translation direction to the text direction, because when a protein is translated from messenger RNA, it is created from the N-terminus to the C-terminus, as amino acids are added to the carboxyl end of the protein.

<span class="mw-page-title-main">Prenylation</span> Addition of hydrophobic moieties to proteins or other biomolecules

Prenylation is the addition of hydrophobic molecules to a protein or a biomolecule. It is usually assumed that prenyl groups (3-methylbut-2-en-1-yl) facilitate attachment to cell membranes, similar to lipid anchors like the GPI anchor, though direct evidence of this has not been observed. Prenyl groups have been shown to be important for protein–protein binding through specialized prenyl-binding domains.

This is a list of topics in molecular biology. See also index of biochemistry articles.

<span class="mw-page-title-main">SV40 large T antigen</span> Proto-oncogene derived from polyomavirus SV40

SV40 large T antigen is a hexamer protein that is a dominant-acting oncoprotein derived from the polyomavirus SV40. TAg is capable of inducing malignant transformation of a variety of cell types. The transforming activity of TAg is due in large part to its perturbation of the retinoblastoma (pRb) and p53 tumor suppressor proteins. In addition, TAg binds to several other cellular factors, including the transcriptional co-activators p300 and CBP, which may contribute to its transformation function. Similar proteins from related viruses are known as large tumor antigen in general.

Geranylgeranylation is a form of prenylation, which is a post-translational modification of proteins that involves the attachment of one or two 20-carbon lipophilic geranylgeranyl isoprene units from geranylgeranyl diphosphate to one or two cysteine residue(s) at the C-terminus of specific proteins. Prenylation is thought to function, at least in part, as a membrane anchor for proteins.

RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells. It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.

<span class="mw-page-title-main">Chloroplast DNA</span> DNA located in cellular organelles called chloroplasts

Chloroplast DNA (cpDNA) is the DNA located in chloroplasts, which are photosynthetic organelles located within the cells of some eukaryotic organisms. Chloroplasts, like other types of plastid, contain a genome separate from that in the cell nucleus. The existence of chloroplast DNA was identified biochemically in 1959, and confirmed by electron microscopy in 1962. The discoveries that the chloroplast contains ribosomes and performs protein synthesis revealed that the chloroplast is genetically semi-autonomous. The first complete chloroplast genome sequences were published in 1986, Nicotiana tabacum (tobacco) by Sugiura and colleagues and Marchantia polymorpha (liverwort) by Ozeki et al. Since then, a great number of chloroplast DNAs from various species have been sequenced.

Glypiation is the addition by covalent bonding of a glycosylphosphatidylinositol (GPI) anchor and is a common post-translational modification that localizes proteins to cell membranes. This special kind of glycosylation is widely detected on surface glycoproteins in eukaryotes and some Archaea.

<span class="mw-page-title-main">FAM98A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 98, member A, or FAM98A, is a gene that in the human genome encodes the FAM98A protein. FAM98A has two paralogs in humans, FAM98B and FAM98C. All three are characterized by DUF2465, a conserved domain shown to bind to RNA. FAM98A is also characterized by a glycine-rich C-terminal domain. FAM98A also has homologs in vertebrates and invertebrates and has distant homologs in choanoflagellates and green algae.

Protein methylation is a type of post-translational modification featuring the addition of methyl groups to proteins. It can occur on the nitrogen-containing side-chains of arginine and lysine, but also at the amino- and carboxy-termini of a number of different proteins. In biology, methyltransferases catalyze the methylation process, activated primarily by S-adenosylmethionine. Protein methylation has been most studied in histones, where the transfer of methyl groups from S-adenosyl methionine is catalyzed by histone methyltransferases. Histones that are methylated on certain residues can act epigenetically to repress or activate gene expression.

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

References

  1. "RHEA:48544". Swiss Institute of Bioinformatics.
  2. Meinhart A, Cramer P (July 2004). "Recognition of RNA polymerase II carboxy-terminal domain by 3'-RNA-processing factors". Nature. 430 (6996): 223–6. Bibcode:2004Natur.430..223M. doi:10.1038/nature02679. hdl: 11858/00-001M-0000-0015-8512-8 . PMID   15241417. S2CID   4418258.
  3. Brickey WJ, Greenleaf AL (June 1995). "Functional studies of the carboxy-terminal repeat domain of Drosophila RNA polymerase II in vivo". Genetics. 140 (2): 599–613. doi:10.1093/genetics/140.2.599. PMC   1206638 . PMID   7498740.