In molecular biology, SUMO (Small Ubiquitin-like Modifier) proteins are a family of small proteins that are covalently attached to and detached from other proteins in cells to modify their function. This process is called SUMOylation (sometimes written sumoylation). SUMOylation is a post-translational modification involved in various cellular processes, such as nuclear-cytosolic transport, transcriptional regulation, apoptosis, protein stability, response to stress, and progression through the cell cycle. [1]
SUMO proteins are similar to ubiquitin and are considered members of the ubiquitin-like protein family. SUMOylation is directed by an enzymatic cascade analogous to that involved in ubiquitination. In contrast to ubiquitin, SUMO is not used to tag proteins for degradation. Mature SUMO is produced when the last four amino acids of the C-terminus have been cleaved off to allow formation of an isopeptide bond between the C-terminal glycine residue of SUMO and an acceptor lysine on the target protein.
SUMO family members often have dissimilar names; the SUMO homologue in yeast, for example, is called SMT3 (suppressor of mif two 3). Several pseudogenes have been reported for SUMO genes in the human genome.
SUMO modification of proteins has many functions. Among the most frequent and best studied are protein stability, nuclear-cytosolic transport, and transcriptional regulation. Typically, only a small fraction of a given protein is SUMOylated and this modification is rapidly reversed by the action of deSUMOylating enzymes. SUMOylation of target proteins has been shown to cause a number of different outcomes including altered localization and binding partners. The SUMO-1 modification of RanGAP1 (the first identified SUMO substrate) leads to its trafficking from cytosol to nuclear pore complex. [2] [3] The SUMO modification of ninein leads to its movement from the centrosome to the nucleus. [4] In many cases, SUMO modification of transcriptional regulators correlates with inhibition of transcription. [5] One can refer to the GeneRIFs of the SUMO proteins, e.g. human SUMO-1, [6] to find out more.
There are 4 confirmed SUMO isoforms in humans; SUMO-1, SUMO-2, SUMO-3 and SUMO-4. At the amino acid level, SUMO1 is about 50% identical to SUMO2.[ citation needed ] SUMO-2/3 show a high degree of similarity to each other and are distinct from SUMO-1. SUMO-4 shows similarity to SUMO-2/3 but differs in having a Proline instead of Glutamine at position 90. As a result, SUMO-4 isn't processed and conjugated under normal conditions, but is used for modification of proteins under stress-conditions like starvation. [7] During mitosis, SUMO-2/3 localize to centromeres and condensed chromosomes, whereas SUMO-1 localizes to the mitotic spindle and spindle midzone, indicating that SUMO paralogs regulate distinct mitotic processes in mammalian cells. [8] One of the major SUMO conjugation products associated with mitotic chromosomes arose from SUMO-2/3 conjugation of topoisomerase II, which is modified exclusively by SUMO-2/3 during mitosis. [9] SUMO-2/3 modifications seem to be involved specifically in the stress response. [10] SUMO-1 and SUMO-2/3 can form mixed chains, however, because SUMO-1 does not contain the internal SUMO consensus sites found in SUMO-2/3, it is thought to terminate these poly-SUMO chains. [11] Serine 2 of SUMO-1 is phosphorylated, raising the concept of a 'modified modifier'. [12]
Cellular DNA is regularly exposed to DNA damaging agents. A DNA damage response (DDR) that is well regulated and intricate is usually employed to deal with the potential deleterious effects of the damage. When DNA damage occurs, SUMO protein has been shown to act as a molecular glue to facilitate the assembly of large protein complexes in repair foci. [13] Also, SUMOylation can alter a protein's biochemical activities and interactions. SUMOylation plays a role in the major DNA repair pathways of base excision repair, nucleotide excision repair, non-homologous end joining and homologous recombinational repair. [13] SUMOylation also facilitates error prone translation synthesis.
SUMO proteins are small; most are around 100 amino acids in length and 12 kDa in mass. The exact length and mass varies between SUMO family members and depends on which organism the protein comes from. Although SUMO has very little sequence identity with ubiquitin (less than 20%) at the amino acid level, it has a nearly identical structural fold. SUMO protein has a unique N-terminal extension of 10-25 amino acids which other ubiquitin-like proteins do not have. This N-terminal is found related to the formation of SUMO chains. [14]
The structure of human SUMO1 is depicted on the right. It shows SUMO1 as a globular protein with both ends of the amino acid chain (shown in red and blue) sticking out of the protein's centre. The spherical core consists of an alpha helix and a beta sheet. The diagrams shown are based on an NMR analysis of the protein in solution.
Most SUMO-modified proteins contain the tetrapeptide consensus motif Ψ-K-x-D/E where Ψ is a hydrophobic residue, K is the lysine conjugated to SUMO, x is any amino acid (aa), D or E is an acidic residue. Substrate specificity appears to be derived directly from Ubc9 and the respective substrate motif. Currently available prediction programs are:
SUMO attachment to its target is similar to that of ubiquitin (as it is for the other ubiquitin-like proteins such as NEDD 8). The SUMO precursor has some extra amino acids that need to be removed, therefore a C-terminal peptide is cleaved from the SUMO precursor by a protease (in human these are the SENP proteases or Ulp1 in yeast) to reveal a di-glycine motif. The obtained SUMO then becomes bound to an E1 enzyme (SUMO Activating Enzyme (SAE)) which is a heterodimer (subunits SAE1 and SAE2). It is then passed to an E2, which is a conjugating enzyme (Ubc9). Finally, one of a small number of E3 ligating proteins attaches it to the protein. In yeast, there are four SUMO E3 proteins, Cst9, [19] Mms21, Siz1 and Siz2. While in ubiquitination an E3 is essential to add ubiquitin to its target, evidence suggests that the E2 is sufficient in SUMOylation as long as the consensus sequence is present. It is thought that the E3 ligase promotes the efficiency of SUMOylation and in some cases has been shown to direct SUMO conjugation onto non-consensus motifs. E3 enzymes can be largely classed into PIAS proteins, such as Mms21 (a member of the Smc5/6 complex) and Pias-gamma and HECT proteins. On Chromosome 17 of the human genome, SUMO2 is near SUMO1+E1/E2 and SUMO2+E1/E2, among various others. Some E3's, such as RanBP2, however, are neither. [20] Recent evidence has shown that PIAS-gamma is required for the SUMOylation of the transcription factor yy1 but it is independent of the zinc-RING finger (identified as the functional domain of the E3 ligases). SUMOylation is reversible and is removed from targets by specific SUMO proteases. In budding yeast, the Ulp1 SUMO protease is found bound at the nuclear pore, whereas Ulp2 is nucleoplasmic. The distinct subnuclear localisation of deSUMOylating enzymes is conserved in higher eukaryotes. [21]
SUMO can be removed from its substrate, which is called deSUMOylation. Specific proteases mediate this procedure (SENP in human or Ulp1 and Ulp2 in yeast). [14]
Recombinant proteins expressed in E. coli may fail to fold properly, instead forming aggregates and precipitating as inclusion bodies. [22] This insolubility may be due to the presence of codons read inefficiently by E. coli, differences in eukaryotic and prokaryotic ribosomes, or lack of appropriate molecular chaperones for proper protein folding. [23] In order to purify such proteins it may be necessary to fuse the protein of interest with a solubility tag such as SUMO or MBP (maltose-binding protein) to increase the protein's solubility. [23] SUMO can later be cleaved from the protein of interest using a SUMO-specific protease such as Ulp1 peptidase. [23]
Ubiquitin is a small regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ubiquitously. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Four genes in the human genome code for ubiquitin: UBB, UBC, UBA52 and RPS27A.
Post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes translating mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.
A ubiquitin ligase is a protein that recruits an E2 ubiquitin-conjugating enzyme that has been loaded with ubiquitin, recognizes a protein substrate, and assists or directly catalyzes the transfer of ubiquitin from the E2 to the protein substrate. In simple and more general terms, the ligase enables movement of ubiquitin from a ubiquitin carrier to another thing by some mechanism. The ubiquitin, once it reaches its destination, ends up being attached by an isopeptide bond to a lysine residue, which is part of the target protein. E3 ligases interact with both the target protein and the E2 enzyme, and so impart substrate specificity to the E2. Commonly, E3s polyubiquitinate their substrate with Lys48-linked chains of ubiquitin, targeting the substrate for destruction by the proteasome. However, many other types of linkages are possible and alter a protein's activity, interactions, or localization. Ubiquitination by E3 ligases regulates diverse areas such as cell trafficking, DNA repair, and signaling and is of profound importance in cell biology. E3 ligases are also key players in cell cycle control, mediating the degradation of cyclins, as well as cyclin dependent kinase inhibitor proteins. The human genome encodes over 600 putative E3 ligases, allowing for tremendous diversity in substrates.
Deubiquitinating enzymes (DUBs), also known as deubiquitinating peptidases, deubiquitinating isopeptidases, deubiquitinases, ubiquitin proteases, ubiquitin hydrolases, or ubiquitin isopeptidases, are a large group of proteases that cleave ubiquitin from proteins. Ubiquitin is attached to proteins in order to regulate the degradation of proteins via the proteasome and lysosome; coordinate the cellular localisation of proteins; activate and inactivate proteins; and modulate protein-protein interactions. DUBs can reverse these effects by cleaving the peptide or isopeptide bond between ubiquitin and its substrate protein. In humans there are nearly 100 DUB genes, which can be classified into two main classes: cysteine proteases and metalloproteases. The cysteine proteases comprise ubiquitin-specific proteases (USPs), ubiquitin C-terminal hydrolases (UCHs), Machado-Josephin domain proteases (MJDs) and ovarian tumour proteases (OTU). The metalloprotease group contains only the Jab1/Mov34/Mpr1 Pad1 N-terminal+ (MPN+) (JAMM) domain proteases.
Ubiquitin-conjugating enzymes, also known as E2 enzymes and more rarely as ubiquitin-carrier enzymes, perform the second step in the ubiquitination reaction that targets a protein for degradation via the proteasome. The ubiquitination process covalently attaches ubiquitin, a short protein of 76 amino acids, to a lysine residue on the target protein. Once a protein has been tagged with one ubiquitin molecule, additional rounds of ubiquitination form a polyubiquitin chain that is recognized by the proteasome's 19S regulatory particle, triggering the ATP-dependent unfolding of the target protein that allows passage into the proteasome's 20S core particle, where proteases degrade the target into short peptide fragments for recycling by the cell.
Protein inhibitor of activated STAT (PIAS), also known as E3 SUMO-protein ligase PIAS, is a protein that regulates transcription in mammals. PIAS proteins act as transcriptional co-regulators with at least 60 different proteins in order to either activate or repress transcription. The transcription factors STAT, NF-κB, p73, and p53 are among the many proteins that PIAS interacts with.
Abgent is a global biotechnology company based in San Diego, California, US with offices in Maidenhead, UK and Suzhou, China and distributors around the world. Abgent develops antibodies and related agents to study proteins involved in cellular function and disease. Abgent's antibodies target key areas of research including autophagy, neuroscience, cancer, stem cells and more. Abgent was acquired in 2011 by WuXi AppTec, a global pharmaceutical, biopharmaceutical, and medical device outsourcing company with operations in China and the United States.
Histone-modifying enzymes are enzymes involved in the modification of histone substrates after protein translation and affect cellular processes including gene expression. To safely store the eukaryotic genome, DNA is wrapped around four core histone proteins, which then join to form nucleosomes. These nucleosomes further fold together into highly condensed chromatin, which renders the organism's genetic material far less accessible to the factors required for gene transcription, DNA replication, recombination and repair. Subsequently, eukaryotic organisms have developed intricate mechanisms to overcome this repressive barrier imposed by the chromatin through histone modification, a type of post-translational modification which typically involves covalently attaching certain groups to histone residues. Once added to the histone, these groups elicit either a loose and open histone conformation, euchromatin, or a tight and closed histone conformation, heterochromatin. Euchromatin marks active transcription and gene expression, as the light packing of histones in this way allows entry for proteins involved in the transcription process. As such, the tightly packed heterochromatin marks the absence of current gene expression.
Small ubiquitin-related modifier 1 is a protein that in humans is encoded by the SUMO1 gene.
SUMO-conjugating enzyme UBC9 is an enzyme that in humans is encoded by the UBE2I gene. It is also sometimes referred to as "ubiquitin conjugating enzyme E2I" or "ubiquitin carrier protein 9", even though these names do not accurately describe its function.
E3 SUMO-protein ligase PIAS1 is an enzyme that in humans is encoded by the PIAS1 gene.
Small ubiquitin-related modifier 3 is a protein that in humans is encoded by the SUMO3 gene.
SUMO1/sentrin/SMT3 specific peptidase 3, also known as SENP3, is a protein which in humans is encoded by the SENP3 gene.
Sentrin-specific protease 1 is an enzyme that in humans is encoded by the SENP1 gene.
Sentrin-specific protease 6 is an enzyme that in humans is encoded by the SENP6 gene.
E3 ubiquitin-protein ligase RNF128 is an enzyme that in humans is encoded by the RNF128 gene.
Ubiquitin-like 1-activating enzyme E1B (UBLE1B) also known as SUMO-activating enzyme subunit 2 (SAE2) is an enzyme that in humans is encoded by the UBA2 gene.
Ubiquitin-like proteins (UBLs) are a family of small proteins involved in post-translational modification of other proteins in a cell, usually with a regulatory function. The UBL protein family derives its name from the first member of the class to be discovered, ubiquitin (Ub), best known for its role in regulating protein degradation through covalent modification of other proteins. Following the discovery of ubiquitin, many additional evolutionarily related members of the group were described, involving parallel regulatory processes and similar chemistry. UBLs are involved in a widely varying array of cellular functions including autophagy, protein trafficking, inflammation and immune responses, transcription, DNA repair, RNA splicing, and cellular differentiation.
C8orf34 is a protein that, in Homo sapiens, is encoded by the C8orf34 gene. Aliases for C8orf34 include vestibule-1 or VEST-1. Within the cell, C8orf34 is localized to the nucleus and nucleoli where it may play a role in the regulation of gene expression as well as the cell cycle.
Arabidopsis SUMO-conjugation enzyme (AtSCE1) is an enzyme that is a member of the small ubiquitin-like modifier (SUMO) post-translational modification pathway. This process, and the SCE1 enzyme with it, is highly conserved across eukaryotes yet absent in prokaryotes. In short, this pathway results in the attachment of a small polypeptide through an isopeptide bond between modifying enzyme and the ε-amino group of a lysine residue in the substrate. In plants, the 160 amino acid SCE1 enzyme was first characterized in 2003. One functional gene copy, SCE1a, was found on chromosomes 3.
Programs for prediction SUMOylation: