PHD finger

Last updated
PHD-finger
PDB 1f62 EBI.jpg
PHD zinc finger. Zinc atoms shown in grey
Identifiers
SymbolPHD
Pfam PF00628
Pfam clan CL0390
InterPro IPR019787
PROSITE PS50016
SCOP2 1f62 / SCOPe / SUPFAM
OPM superfamily 59
OPM protein 1vfy

The PHD finger was discovered in 1993 as a Cys 4-His-Cys3 motif in the plant homeodomain (hence PHD) proteins HAT3.1 in Arabidopsis and maize ZmHox1a. [1] The PHD zinc finger motif resembles the metal binding RING domain (Cys3-His-Cys4) and FYVE domain. It occurs as a single finger, but often in clusters of two or three, and it also occurs together with other domains, such as the chromodomain and the bromodomain.

Contents

Role in epigenetics

The PHD finger, approximately 50-80 amino acids in length, is found in more than 100 human proteins. Several of the proteins it occurs in are found in the nucleus, and are involved in chromatin-mediated gene regulation. The PHD finger occurs in proteins such as the transcriptional co-activators p300 and CBP, Polycomb-like protein (Pcl), Trithorax-group proteins like ASH1L, ASH2L and MLL, the autoimmune regulator (AIRE), Mi-2 complex (part of histone deacetylase complex), the co-repressor TIF1, the JARID1-family of demethylases and many more.

Structure

The NMR structure of the PHD finger from human WSTF (Williams Syndrome Transcription Factor) shows that the conserved cysteines and histidine coordinate two Zn2+ ions. In general, the PHD finger adopts a globular fold, consisting of a two-stranded beta-sheet and an alpha-helix. The region consisting of these secondary structures and the residues involved in coordinating the zinc-ions are very conserved among species. The loop regions I and II are variable and could contribute functional specificity to the different PHD fingers.

Function

The PHD fingers of some proteins, including ING2, YNG1 and NURF, have been reported to bind to histone H3 tri-methylated on lysine 4 (H3K4me3), while other PHD fingers have tested negative in such assays. A protein called KDM5C has a PHD finger, which has been reported to bind histone H3 tri-methylated lysine 9 (H3K9me3). [2] Based on these publications, binding to tri-methylated lysines on histones may therefore be a property widespread among PHD fingers. Domains that bind to modified histones, are called epigenetic readers as they specifically recognize the modified version of the residue and binds to it. The modification H3K4me3 is associated with the transcription start site of active genes, while H3K9me3 is associated with inactive genes. The modifications of the histone lysines are dynamic, as there are methylases that add methyl groups to the lysines, and there are demethylases that remove methyl groups. KDM5C is a histone H3 lysine 4 demethylase, which means it is an enzyme that can remove the methyl groups of lysine 4 on histone 3 (making it H3K4me2 or H3K4me1). One can only speculate if the H3K9me3-binding of KDM5C PHD domain provides a crosstalk between trimethylation of H3K9 and the demethylation of H3K4me3. Such crosstalks have been suggested earlier with other domains involved in chromatin regulation, and may provide a strictly coordinated regulation.

Another example is the PHD finger of the BHC80/PHF21A protein, which is a component of the LSD1 complex. In this complex, LSD1 specifically demethylates H3K4me2 to H3K4me0, and BHC80 binds H3K4me0 through its PHD finger to stabilize the complex at its target promoters, presumably to prevent further re-methylation. This is the first example of a PHD finger recognizing lysine methyl-zero status.

Related Research Articles

Histone methyltransferase Histone-modifying enzymes

Histone methyltransferases (HMT) are histone-modifying enzymes, that catalyze the transfer of one, two, or three methyl groups to lysine and arginine residues of histone proteins. The attachment of methyl groups occurs predominantly at specific lysine or arginine residues on histones H3 and H4. Two major types of histone methyltranferases exist, lysine-specific and arginine-specific. In both types of histone methyltransferases, S-Adenosyl methionine (SAM) serves as a cofactor and methyl donor group.
The genomic DNA of eukaryotes associates with histones to form chromatin. The level of chromatin compaction depends heavily on histone methylation and other post-translational modifications of histones. Histone methylation is a principal epigenetic modification of chromatin that determines gene expression, genomic stability, stem cell maturation, cell lineage development, genetic imprinting, DNA methylation, and cell mitosis.

Histone methylation is a process by which methyl groups are transferred to amino acids of histone proteins that make up nucleosomes, which the DNA double helix wraps around to form chromosomes. Methylation of histones can either increase or decrease transcription of genes, depending on which amino acids in the histones are methylated, and how many methyl groups are attached. Methylation events that weaken chemical attractions between histone tails and DNA increase transcription because they enable the DNA to uncoil from nucleosomes so that transcription factor proteins and RNA polymerase can access the DNA. This process is critical for the regulation of gene expression that allows different cells to express different genes.

Methyltransferase Group of methylating enzymes

Methyltransferases are a large group of enzymes that all methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which contain a Rossmann fold for binding S-Adenosyl methionine (SAM). Class II methyltransferases contain a SET domain, which are exemplified by SET domain histone methyltransferases, and class III methyltransferases, which are membrane associated. Methyltransferases can also be grouped as different types utilizing different substrates in methyl transfer reactions. These types include protein methyltransferases, DNA/RNA methyltransferases, natural product methyltransferases, and non-SAM dependent methyltransferases. SAM is the classical methyl donor for methyltransferases, however, examples of other methyl donors are seen in nature. The general mechanism for methyl transfer is a SN2-like nucleophilic attack where the methionine sulfur serves as the leaving group and the methyl group attached to it acts as the electrophile that transfers the methyl group to the enzyme substrate. SAM is converted to S-Adenosyl homocysteine (SAH) during this process. The breaking of the SAM-methyl bond and the formation of the substrate-methyl bond happen nearly simultaneously. These enzymatic reactions are found in many pathways and are implicated in genetic diseases, cancer, and metabolic diseases. Another type of methyl transfer is the radical S-Adenosyl methionine (SAM) which is the methylation of unactivated carbon atoms in primary metabolites, proteins, lipids, and RNA.

Demethylases are enzymes that remove methyl (CH3-) groups from nucleic acids, proteins (in particular histones), and other molecules. Demethylase enzymes are important in epigenetic modification mechanisms. The demethylase proteins alter transcriptional regulation of the genome by controlling the methylation levels that occur on DNA and histones and, in turn, regulate the chromatin state at specific gene loci within organisms.

Chromodomain

A chromodomain is a protein structural domain of about 40–50 amino acid residues commonly found in proteins associated with the remodeling and manipulation of chromatin. The domain is highly conserved among both plants and animals, and is represented in a large number of different proteins in many genomes, such as that of the mouse. Some chromodomain-containing genes have multiple alternative splicing isoforms that omit the chromodomain entirely. In mammals, chromodomain-containing proteins are responsible for aspects of gene regulation related to chromatin remodeling and formation of heterochromatin regions. Chromodomain-containing proteins also bind methylated histones and appear in the RNA-induced transcriptional silencing complex. In histone modifications, chromodomains are very conserved. They function by identifying and binding to methylated lysine residues that exist on the surface of chromatin proteins and thereby regulate gene transcription.

KDM1A

Lysine-specific histone demethylase 1A (LSD1) also known as lysine (K)-specific demethylase 1A (KDM1A) is a protein in humans that is encoded by the KDM1A gene. LSD1 is a flavin-dependent monoamine oxidase, which can demethylate mono- and di-methylated lysines, specifically histone 3, lysines 4 and 9. This enzyme can have roles critical in embryogenesis and tissue-specific differentiation, as well as oocyte growth. KDM1A was the first histone demethylase to be discovered though more than 30 have been described.

KDM5A Protein-coding gene in the species Homo sapiens

Lysine-specific demethylase 5A is an enzyme that in humans is encoded by the KDM5A gene.

KDM4A Lysine-specific demethylase 4A is an enzyme that in humans is encoded by the Kdm4a gene

Lysine-specific demethylase 4A is an enzyme that in humans is encoded by the KDM4A gene.

KDM5C Protein-coding gene in the species Homo sapiens

Lysine-specific demethylase 5C is an enzyme that in humans is encoded by the KDM5C gene. KDM5C belongs to the alpha-ketoglutarate-dependent hydroxylase superfamily.

JADE1 Protein-coding gene in the species Homo sapiens

JADE1 is a protein that in humans is encoded by the JADE1 gene.

JARID1B Protein-coding gene in the species Homo sapiens

Lysine-specific demethylase 5B also known as histone demethylase JARID1B is a demethylase enzyme that in humans is encoded by the KDM5B gene. JARID1B belongs to the alpha-ketoglutarate-dependent hydroxylase superfamily.

X-linked intellectual disability refers to medical disorders associated with X-linked recessive inheritance that result in intellectual disability.

Methyl-CpG-binding domain

The Methyl-CpG-binding domain (MBD) in molecular biology binds to DNA that contains one or more symmetrically methylated CpGs. MBD has negligible non-specific affinity for unmethylated DNA. In vitro foot-printing with the chromosomal protein MeCP2 showed that the MBD could protect a 12 nucleotide region surrounding a methyl CpG pair.

EHMT1 Protein-coding gene in the species Homo sapiens

Euchromatic histone-lysine N-methyltransferase 1, also known as G9a-like protein (GLP), is a protein that in humans is encoded by the EHMT1 gene.

KDM2B

The human KDM2B gene encodes the protein lysine (K)-specific demethylase 2B.

Protein methylation is a type of post-translational modification featuring the addition of methyl groups to proteins. It can occur on the nitrogen-containing side-chains of arginine and lysine, but also at the amino- and carboxy-termini of a number of different proteins. In biology, methyltransferases catalyze the methylation process, activated primarily by S-adenosylmethionine. Protein methylation has been most studied in histones, where the transfer of methyl groups from S-adenosyl methionine is catalyzed by histone methyltransferases. Histones that are methylated on certain residues can act epigenetically to repress or activate gene expression.

KDM1B

Lysine (K)-specific demethylase 1B is a protein that in humans is encoded by the KDM1B gene.

KDM3A

Lysine demethylase 3A is a protein that in humans is encoded by the KDM3A gene.

H3K4me3 is an epigenetic modification to the DNA packaging protein Histone H3 that indicates tri-methylation at the 4th lysine residue of the histone H3 protein and is often involved in the regulation of gene expression. The name denotes the addition of three methyl groups (trimethylation) to the lysine 4 on the histone H3 protein.

H3K9me2 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the di-methylation at the 9th lysine residue of the histone H3 protein. H3K9me2 is strongly associated with transcriptional repression. H3K9me2 levels are higher at silent compared to active genes in a 10kb region surrounding the transcriptional start site. H3K9me2 represses gene expression both passively, by prohibiting acetylation and therefore binding of RNA polymerase or its regulatory factors, and actively, by recruiting transcriptional repressors. H3K9me2 has also been found in megabase blocks, termed Large Organised Chromatin K9 domains (LOCKS), which are primarily located within gene-sparse regions but also encompass genic and intergenic intervals. Its synthesis is catalyzed by G9a, G9a-like protein, and PRDM2. H3K9me2 can be removed by a wide range of histone lysine demethylases (KDMs) including KDM1, KDM3, KDM4 and KDM7 family members. H3K9me2 is important for various biological processes including cell lineage commitment, the reprogramming of somatic cells to induced pluripotent stem cells, regulation of the inflammatory response, and addiction to drug use.

References

  1. Schindler U, Beckmann H, Cashmore AR (July 1993). "HAT3.1, a novel Arabidopsis homeodomain protein containing a conserved cysteine-rich region". The Plant Journal. 4 (1): 137–50. doi: 10.1046/j.1365-313x.1993.04010137.x . PMID   8106082.
  2. Iwase S, Lan F, Bayliss P, de la Torre-Ubieta L, Huarte M, Qi HH, et al. (March 2007). "The X-linked mental retardation gene SMCX/JARID1C defines a family of histone H3 lysine 4 demethylases". Cell. 128 (6): 1077–88. doi: 10.1016/j.cell.2007.02.017 . PMID   17320160. S2CID   14729302.

Further reading