SET domain

Last updated
SET
PDB 1n3j EBI.jpg
structure and substrate of a histone H3 lysine methyltransferase from Paramecium bursaria chlorella virus 1
Identifiers
SymbolSET
Pfam PF00856
InterPro IPR001214
SMART SM0468
SCOP2 1ml9 / SCOPe / SUPFAM

The SET domain is a protein domain that typically has methyltransferase activity. It was originally identified as part of a larger conserved region present in the Drosophila Trithorax protein and was subsequently identified in the Drosophila Su(var)3-9 and 'Enhancer of zeste' proteins, from which the acronym SET is derived [Su(var)3-9, Enhancer-of-zeste and Trithorax].

Contents

Structure

The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions:

The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate.

Function

It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control. [2] The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure.

Histone lysine methylation is part of the histone code that regulates chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception, [3] the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain. [4] [5] Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities. [6]

Associated domains

The N-terminal pre-SET domain (InterPro :  IPR007728 ), as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils.

The C-terminal region including the post-SET domain (InterPro :  IPR003616 ) is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site. [7] The structured post-SET region brings in the C-terminal residues that participate in S-adenosyl-L-methionine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity. [8] [9]

Examples

Human genes encoding proteins containing this domain include:

Related Research Articles

Histone Family proteins package and order the DNA into structural units called nucleosomes.

In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; however, when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.

In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These terms are commonly used in chemistry, biochemistry, soil science, and the biological sciences.

Histone methyltransferase Histone-modifying enzymes

Histone methyltransferases (HMT) are histone-modifying enzymes, that catalyze the transfer of one, two, or three methyl groups to lysine and arginine residues of histone proteins. The attachment of methyl groups occurs predominantly at specific lysine or arginine residues on histones H3 and H4. Two major types of histone methyltranferases exist, lysine-specific and arginine-specific. In both types of histone methyltransferases, S-Adenosyl methionine (SAM) serves as a cofactor and methyl donor group.
The genomic DNA of eukaryotes associates with histones to form chromatin. The level of chromatin compaction depends heavily on histone methylation and other post-translational modifications of histones. Histone methylation is a principal epigenetic modification of chromatin that determines gene expression, genomic stability, stem cell maturation, cell lineage development, genetic imprinting, DNA methylation, and cell mitosis.

Histone acetyltransferase Enzymes that catalyze acyl group transfer from acetyl-CoA to histones

Histone acetyltransferases (HATs) are enzymes that acetylate conserved lysine amino acids on histone proteins by transferring an acetyl group from acetyl-CoA to form ε-N-acetyllysine. DNA is wrapped around histones, and, by transferring an acetyl group to the histones, genes can be turned on and off. In general, histone acetylation increases gene expression.

Histone methylation is a process by which methyl groups are transferred to amino acids of histone proteins that make up nucleosomes, which the DNA double helix wraps around to form chromosomes. Methylation of histones can either increase or decrease transcription of genes, depending on which amino acids in the histones are methylated, and how many methyl groups are attached. Methylation events that weaken chemical attractions between histone tails and DNA increase transcription because they enable the DNA to uncoil from nucleosomes so that transcription factor proteins and RNA polymerase can access the DNA. This process is critical for the regulation of gene expression that allows different cells to express different genes.

Methyltransferase Group of methylating enzymes

Methyltransferases are a large group of enzymes that all methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which contain a Rossmann fold for binding S-Adenosyl methionine (SAM). Class II methyltransferases contain a SET domain, which are exemplified by SET domain histone methyltransferases, and class III methyltransferases, which are membrane associated. Methyltransferases can also be grouped as different types utilizing different substrates in methyl transfer reactions. These types include protein methyltransferases, DNA/RNA methyltransferases, natural product methyltransferases, and non-SAM dependent methyltransferases. SAM is the classical methyl donor for methyltrasferases, however, examples of other methyl donors are seen in nature. The general mechanism for methyl transfer is a SN2-like nucleophilic attack where the methionine sulfur serves as the leaving group and the methyl group attached to it acts as the electrophile that transfers the methyl group to the enzyme substrate. SAM is converted to S-Adenosyl homocysteine (SAH) during this process. The breaking of the SAM-methyl bond and the formation of the substrate-methyl bond happen nearly simultaneously. These enzymatic reactions are found in many pathways and are implicated in genetic diseases, cancer, and metabolic diseases. Another type of methyl transfer is the radical S-Adenosyl methionine (SAM) which is the methylation of unactivated carbon atoms in primary metabolites, proteins, lipids, and RNA.

PHD finger

The PHD finger was discovered in 1993 as a Cys4-His-Cys3 motif in the plant homeodomain proteins HAT3.1 in Arabidopsis and maize ZmHox1a. The PHD finger motif resembles the metal binding RING domain (Cys3-His-Cys4) and FYVE domain. It occurs as a single finger, but often in clusters of two or three, and it also occurs together with other domains, such as the chromodomain and the bromodomain.

Chromodomain

A chromodomain is a protein structural domain of about 40–50 amino acid residues commonly found in proteins associated with the remodeling and manipulation of chromatin. The domain is highly conserved among both plants and animals, and is represented in a large number of different proteins in many genomes, such as that of the mouse. Some chromodomain-containing genes have multiple alternative splicing isoforms that omit the chromodomain entirely. In mammals, chromodomain-containing proteins are responsible for aspects of gene regulation related to chromatin remodeling and formation of heterochromatin regions. Chromodomain-containing proteins also bind methylated histones and appear in the RNA-induced transcriptional silencing complex. In histone modifications, chromodomains are very conserved. They function by identifying and binding to methylated lysine residues that exist on the surface of chromatin proteins and thereby regulate gene transcription.

EZH2

Enhancer of zeste homolog 2 (EZH2) is a histone-lysine N-methyltransferase enzyme encoded by EZH2 gene, that participates in histone methylation and, ultimately, transcriptional repression. EZH2 catalyzes the addition of methyl groups to histone H3 at lysine 27, by using the cofactor S-adenosyl-L-methionine. Methylation activity of EZH2 facilitates heterochromatin formation thereby silences gene function. Remodeling of chromosomal heterochromatin by EZH2 is also required during cell mitosis.

SUV39H1

Histone-lysine N-methyltransferase SUV39H1 is an enzyme that in humans is encoded by the SUV39H1 gene.

KMT2A Protein-coding gene in the species Homo sapiens

Histone-lysine N-methyltransferase 2A also known as acute lymphoblastic leukemia 1 (ALL-1), myeloid/lymphoid or mixed-lineage leukemia1 (MLL1), or zinc finger protein HRX (HRX) is an enzyme that in humans is encoded by the KMT2A gene.

KMT2C

Lysine N-methyltransferase 2C (KMT2C) also known as myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3) is an enzyme that in humans is encoded by the KMT2C gene.

SETD7

Histone-lysine N-methyltransferase SETD7 is an enzyme that in humans is encoded by the SETD7 gene.

CBX5 (gene)

Chromobox protein homolog 5 is a protein that in humans is encoded by the CBX5 gene. It is a highly conserved, non-histone protein part of the heterochromatin family. The protein itself is more commonly called HP1α. Heterochromatin protein-1 (HP1) has an N-terminal domain that acts on methylated lysines residues leading to epigenetic repression. The C-terminal of this protein has a chromo shadow-domain (CSD) that is responsible for homodimerizing, as well as interacting with a variety of chromatin-associated, non-histone proteins.

SMYD3

SET and MYND (myeloid-Nervy-DEAF-1) domain-containing protein 3 is a protein that in humans is encoded by the SMYD3 gene.

EHMT1

Euchromatic histone-lysine N-methyltransferase 1, also known as G9a-like protein (GLP), is a protein that in humans is encoded by the EHMT1 gene.

SETD6

SET domain containing 6 is a protein in humans that is encoded by the SETD6 gene.

Protein methylation is a type of post-translational modification featuring the addition of methyl groups to proteins. It can occur on the nitrogen-containing side-chains of arginine and lysine, but also at the amino- and carboxy-termini of a number of different proteins. In biology, methyltransferases catalyze the methylation process, activated primarily by S-adenosylmethionine. Protein methylation has been most studied in histones, where the transfer of methyl groups from S-adenosyl methionine is catalyzed by histone methyltransferases. Histones that are methylated on certain residues can act epigenetically to repress or activate gene expression.

Thomas Jenuwein German scientist

Thomas Jenuwein is a German scientist working in the fields of epigenetics, chromatin biology, gene regulation and genome function.

In epigenetics, proline isomerization is the effect that cis-trans isomerization of the amino acid proline has on the regulation of gene expression. Similar to aspartic acid, the amino acid proline has the rare property of being able to occupy both cis and trans isomers of its prolyl peptide bonds with ease. Peptidyl-prolyl isomerase, or PPIase, is an enzyme very commonly associated with proline isomerization due to their ability to catalyze the isomerization of prolines. PPIases are present in three types: cyclophilins, FK507-binding proteins, and the parvulins. PPIase enzymes catalyze the transition of proline between cis and trans isomers and are essential to the numerous biological functions controlled and affected by prolyl isomerization Without PPIases, prolyl peptide bonds will slowly switch between cis and trans isomers, a process that can lock proteins in a nonnative structure that can affect render the protein temporarily ineffective. Although this switch can occur on its own, PPIases are responsible for most isomerization of prolyl peptide bonds. The specific amino acid that precedes the prolyl peptide bond also can have an effect on which conformation the bond assumes. For instance, when an aromatic amino acid is bonded to a proline the bond is more favorable to the cis conformation. Cyclophilin A uses an "electrostatic handle" to pull proline into cis and trans formations. Most of these biological functions are affected by the isomerization of proline when one isomer interacts differently than the other, commonly causing an activation/deactivation relationship. As an amino acid, proline is present in many proteins. This aids in the multitude of effects that isomerization of proline can have in different biological mechanisms and functions.

References

  1. Helin K, Dhanak D (October 2013). "Chromatin proteins and modifications as drug targets" (PDF). Nature. 502 (7472): 480–8. Bibcode:2013Natur.502..480H. doi:10.1038/nature12751. PMID   24153301. S2CID   4450386.
  2. Cui X, De Vivo I, Slany R, Miyamoto A, Firestein R, Cleary ML (April 1998). "Association of SET domain and myotubularin-related proteins modulates growth control". Nature Genetics. 18 (4): 331–7. doi:10.1038/ng0498-331. PMID   9537414. S2CID   25209204.
  3. Feng Q, Wang H, Ng HH, Erdjument-Bromage H, Tempst P, Struhl K, Zhang Y (June 2002). "Methylation of H3-lysine 79 is mediated by a new family of HMTases without a SET domain". Current Biology. 12 (12): 1052–8. doi:10.1016/S0960-9822(02)00901-6. PMID   12123582. S2CID   17263035.
  4. Baumbusch LO, Thorstensen T, Krauss V, Fischer A, Naumann K, Assalkhou R, Schulz I, Reuter G, Aalen RB (November 2001). "The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes". Nucleic Acids Research. 29 (21): 4319–33. doi:10.1093/nar/29.21.4319. PMC   60187 . PMID   11691919.
  5. Kouzarides T (April 2002). "Histone methylation in transcriptional control". Current Opinion in Genetics & Development. 12 (2): 198–209. doi:10.1016/S0959-437X(02)00287-3. PMID   11893494.
  6. Xiao B, Jing C, Wilson JR, Walker PA, Vasisht N, Kelly G, Howell S, Taylor IA, Blackburn GM, Gamblin SJ (February 2003). "Structure and catalytic mechanism of the human histone methyltransferase SET7/9" (PDF). Nature. 421 (6923): 652–6. Bibcode:2003Natur.421..652X. doi:10.1038/nature01378. PMID   12540855. S2CID   4423407.
  7. Zhang X, Yang Z, Khan SI, Horton JR, Tamaru H, Selker EU, Cheng X (July 2003). "Structural basis for the product specificity of histone lysine methyltransferases". Molecular Cell. 12 (1): 177–85. doi:10.1016/S1097-2765(03)00224-7. PMC   2713655 . PMID   12887903.
  8. Zhang X, Tamaru H, Khan SI, Horton JR, Keefe LJ, Selker EU, Cheng X (October 2002). "Structure of the Neurospora SET domain protein DIM-5, a histone H3 lysine methyltransferase". Cell. 111 (1): 117–27. doi:10.1016/S0092-8674(02)00999-6. PMC   2713760 . PMID   12372305.
  9. Min J, Zhang X, Cheng X, Grewal SI, Xu RM (November 2002). "Structure of the SET domain histone lysine methyltransferase Clr4". Nature Structural Biology. 9 (11): 828–32. doi:10.1038/nsb860. PMID   12389037. S2CID   22432819.
This article incorporates text from the public domain Pfam and InterPro: IPR001214