HERC2 is a giant E3 ubiquitin protein ligase, implicated in DNA repair regulation, pigmentation and neurological disorders. It is encoded by a gene of the same name belonging to the HERC family, which typically encodes large protein products with C-terminal HECT domains and one or more RCC1-like (RLD) domains. [1] [2]
HERC2, previously referred to as the rjs gene locus, was first identified in 1990 as the gene responsible for two phenotypes in mice: the runty, jerky, sterile (rjs) phenotype and the juvenile development and fertility-2 (Jdf2) phenotype. Mutant alleles are known to cause hypo-pigmentation and pink eye phenotypes, as well reduced growth, jerky gait, male sterility, female semi-sterility, and maternal behaviour defects in mice. [3] [4] [5]
The full HERC2 gene is located at 15q13, encoded by 93 exons and its transcription is under the control of a CpG rich promoter. This region on chromosome 15 is susceptible to breaks during chromosomal rearrangement and there are at least 12 partial duplicates of HERC2 between 15q11–15q13. [6]
At least 15 HERC2 SNPs have been identified and they are strongly associated with human iris colour variability, functioning to repress expression of OCA2's product. [7]
HERC2 encodes a 4834-amino acid protein with a theoretical size of 528 kDa. While a full structure has not yet been elucidated, potentially due to its large size, partial structures of its domains have been captured. [8]
It has an N-terminal bilobed HECT domain, conferring E3 ligase functionality, as well as 3 RLD domains with seven-bladed β-propeller folds. In addition to these HERC family hallmarks, it has several other motifs; a cytochrome-b5-like domain, several potential phosphorylation sites, and a ZZ-type zinc finger motif. [1] This is likely involved in protein binding, and has recently been identified as a SUMOylation target following DNA damage. [9]
Expression of HERC2 is ubiquitous, though particularly high in the brain and testes. Cellular localisation is predominantly to the nucleus and cytoplasm. [1]
SNPs of HERC2 are strongly associated with iris colour variability in humans. In particular, the rs916977 and rs12913832 SNPs have been reported as good predictors of this trait, and the latter is also significantly associated with skin and hair colour. The ancestral allele is linked to darker pigmentation and dominant over the lighter pigment recessive allele. [10] [11] The rs12913832 SNP, located in intron 86 of the HERC2 gene contains a silencing sequence that can inhibit the expression of OCA2 and, if both recessive alleles are present, can homozygously cause blue eyes. [12] This genotype is present in almost all people with blue eyes and is hypothesised as being the founder mutation of blue eyes in humans. [13] [14] [15]
The rs916977 SNP is most common in Europe; particularly in the north and east, where it nears fixation. The variant is also found at high frequencies in North Africa, the Near East, Oceania and the Americas. [16]
HERC2 is a component of the replication fork and essential for DNA damage repair pathways. Regulating DNA repair pathways is necessary, as unchecked they can target and excise undamaged DNA, potentially leading to mutation. [17]
It is involved in coordinating the Chk1-directed DNA damage/cell cycle checkpoint response by regulating the stability of the deubiquitination enzyme USP20. Under normal conditions HERC2 associates with USP20 and ubiquitinates it for degradation. Under replication stress, for example a DNA polymerase mismatch error, USP20 disassociates from HERC2 and deubiquitinates claspin, stabilising it to then bind and activate Chk1. This allows for DNA replication to be paused and the error corrected. [18] [19] [20]
At the site of doubles stranded breaks, HERC2 facilitates the binding of RNF8, a RING finger ubiquitin ligase to the E2 ubiquitin-conjugating enzyme UBC13. This association is required for RNF8 mediated Lys-63 poly-ubiquitination signalling, which both recruits and retains repair factors at the site of DNA damage to commence homologous recombination repair. [21]
HERC2 is also involved in regulating nucleotide excision repair by ubiquitinating the XPA repair protein for proteolysis. XPA is involved in recognising DNA damage and provides a scaffold for other repair factors to bind at the damage site. [22] [23]
HERC2 has been implicated in regulating stable centrosome architecture in conjunction with NEURL4 other ubiquitinated binding partners. Its absence is associated with aberrant centrosome morphology. [24]
HERC2 has recently been associated with regulating iron metabolism through ubiquitinating the F-box and leucine-rich repeat protein 5 (FBXL5) for proteasomal degradation. FBXL5 regulates the stability of the iron regulatory protein (IR2), which in turn controls the stability of proteins overlooking cellular iron homeostasis. Depletion of HERC2 results in decreased cellular iron levels. Iron is an essential nutrient in cells, but high levels can be cytotoxic, so maintaining cellular levels is important. [25]
HERC2 helps to regulate p53 signalling by facilitating the oligomerization of p53, which is necessary for its transcriptional activity. Silencing of HERC2 reportedly inhibits the expression of genes regulated by p53 and also results in increased cellular growth. [26]
The 15q11-q13 locus of HERC2 is also associated with Angelman syndrome (AS), specifically when a region of this locus is deleted. Similar to the rjs phenotype attributed to HERC2 in mice, AS is associated with seizures, developmental delay, intellectual disability and jerky movements. While a variety of disturbances to this locus can cause AS, all known mechanisms affect the functioning and expression of the E6AP E3 ligase, which also sits at this locus. HER2 is an allosteric activator of E6AP, and lies at the most commonly deleted region in AS. [27] Its deletion could result in the inactivation of E6AP and consequently the development of AS. [28]
In Old Order Amish families, a homozygous proline to leucine missense mutation within the first RLD domain has been implicated in a neurodevelopmental disorder with autism and features resembling AS. [29] In addition, a homozygous deletion of both OCA2 and HERC2 genes was recently reported as presenting with severe developmental abnormalities. [30] These phenotypes are suggestive of a role for HERC2 in normal neurodevelopment.
Certain alleles of HERC2 has recently been implicated in increasing the risk of iris cancer. Due its role in pigment determination, three HERC2 SNPs have been highlighted as associated with uveal melanoma. [31] HERC2 frameshift mutations have also been described in colorectal cancers. [32]
In accordance to its role in facilitating p53 oligomerization, HERC2 may be causally related to Li-Fraumeni syndrome and Li-Fraumeni-like syndromes, which occur in the absence of sufficient p53 oligomerization. [26]
HERC2 is known to interact with the following:
The HERC2 variation for blue eyes first appears around 14,000 years ago in Italy and the Caucasus. [35]
p53, also known as Tumor protein P53, cellular tumor antigen p53, or transformation-related protein 53 (TRP53) is a regulatory protein that is often mutated in human cancers. The p53 proteins are crucial in vertebrates, where they prevent cancer formation. As such, p53 has been described as "the guardian of the genome" because of its role in conserving stability by preventing genome mutation. Hence TP53 is classified as a tumor suppressor gene.
Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ubiquitously. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Four genes in the human genome code for ubiquitin: UBB, UBC, UBA52 and RPS27A.
A ubiquitin ligase is a protein that recruits an E2 ubiquitin-conjugating enzyme that has been loaded with ubiquitin, recognizes a protein substrate, and assists or directly catalyzes the transfer of ubiquitin from the E2 to the protein substrate. In simple and more general terms, the ligase enables movement of ubiquitin from a ubiquitin carrier to another protein by some mechanism. The ubiquitin, once it reaches its destination, ends up being attached by an isopeptide bond to a lysine residue, which is part of the target protein. E3 ligases interact with both the target protein and the E2 enzyme, and so impart substrate specificity to the E2. Commonly, E3s polyubiquitinate their substrate with Lys48-linked chains of ubiquitin, targeting the substrate for destruction by the proteasome. However, many other types of linkages are possible and alter a protein's activity, interactions, or localization. Ubiquitination by E3 ligases regulates diverse areas such as cell trafficking, DNA repair, and signaling and is of profound importance in cell biology. E3 ligases are also key players in cell cycle control, mediating the degradation of cyclins, as well as cyclin dependent kinase inhibitor proteins. The human genome encodes over 600 putative E3 ligases, allowing for tremendous diversity in substrates.
Parkin is a 465-amino acid residue E3 ubiquitin ligase, a protein that in humans and mice is encoded by the PARK2 gene. Parkin plays a critical role in ubiquitination – the process whereby molecules are covalently labelled with ubiquitin (Ub) and directed towards degradation in proteasomes or lysosomes. Ubiquitination involves the sequential action of three enzymes. First, an E1 ubiquitin-activating enzyme binds to inactive Ub in eukaryotic cells via a thioester bond and mobilises it in an ATP-dependent process. Ub is then transferred to an E2 ubiquitin-conjugating enzyme before being conjugated to the target protein via an E3 ubiquitin ligase. There exists a multitude of E3 ligases, which differ in structure and substrate specificity to allow selective targeting of proteins to intracellular degradation.
Nucleotide excision repair is a DNA repair mechanism. DNA damage occurs constantly because of chemicals, radiation and other mutagens. Three excision repair pathways exist to repair single stranded DNA damage: Nucleotide excision repair (NER), base excision repair (BER), and DNA mismatch repair (MMR). While the BER pathway can recognize specific non-bulky lesions in DNA, it can correct only damaged bases that are removed by specific glycosylases. Similarly, the MMR pathway only targets mismatched Watson-Crick base pairs.
Ubiquitin-protein ligase E3A (UBE3A) also known as E6AP ubiquitin-protein ligase (E6AP) is an enzyme that in humans is encoded by the UBE3A gene. This enzyme is involved in targeting proteins for degradation within cells.
Mouse double minute 2 homolog (MDM2) also known as E3 ubiquitin-protein ligase Mdm2 is a protein that in humans is encoded by the MDM2 gene. Mdm2 is an important negative regulator of the p53 tumor suppressor. Mdm2 protein functions both as an E3 ubiquitin ligase that recognizes the N-terminal trans-activation domain (TAD) of the p53 tumor suppressor and as an inhibitor of p53 transcriptional activation.
Topotecan, sold under the brand name Hycamtin among others, is a chemotherapeutic agent medication that is a topoisomerase inhibitor. It is a synthetic, water-soluble analog of the natural chemical compound camptothecin. It is used in the form of its hydrochloride salt to treat ovarian cancer, lung cancer and other cancer types.
Lethal alleles are alleles that cause the death of the organism that carries them. They are usually a result of mutations in genes that are essential for growth or development. Lethal alleles may be recessive, dominant, or conditional depending on the gene or genes involved.
DNA damage-binding protein 2 is a protein that in humans is encoded by the DDB2 gene.
DNA damage-binding protein 1 is a protein that in humans is encoded by the DDB1 gene.
Tyrosinase-related protein 1, also known as TYRP1, is an intermembrane enzyme which in humans is encoded by the TYRP1 gene.
P protein, also known as melanocyte-specific transporter protein or pink-eyed dilution protein homolog, is a protein that in humans is encoded by the oculocutaneous albinism II (OCA2) gene. The P protein is believed to be an integral membrane protein involved in small molecule transport, specifically of tyrosine—a precursor of melanin. Certain mutations in OCA2 result in type 2 oculocutaneous albinism. OCA2 encodes the human homologue of the mouse p gene.
RAD52 homolog , also known as RAD52, is a protein which in humans is encoded by the RAD52 gene.
Mediator of DNA damage checkpoint protein 1 is a 2080 amino acid long protein that in humans is encoded by the MDC1 gene located on the short arm (p) of chromosome 6. MDC1 protein is a regulator of the Intra-S phase and the G2/M cell cycle checkpoints and recruits repair proteins to the site of DNA damage. It is involved in determining cell survival fate in association with tumor suppressor protein p53. This protein also goes by the name Nuclear Factor with BRCT Domain 1 (NFBD1).
Ubiquitin-conjugating enzyme E2 D1 is a protein that in humans is encoded by the UBE2D1 gene.
Cullin-4B is a protein that in humans is encoded by the CUL4B gene which is located on the X chromosome. CUL4B has high sequence similarity with CUL4A, with which it shares certain E3 ubiquitin ligase functions. CUL4B is largely expressed in the nucleus and regulates several key functions including: cell cycle progression, chromatin remodeling and neurological and placental development in mice. In humans, CUL4B has been implicated in X-linked intellectual disability and is frequently mutated in pancreatic adenocarcinomas and a small percentage of various lung cancers. Viruses such as HIV can also co-opt CUL4B-based complexes to promote viral pathogenesis. CUL4B complexes containing Cereblon are also targeted by the teratogenic drug thalidomide.
G2/mitotic-specific cyclin-F is a protein that in humans is encoded by the CCNF gene.
WRAP53 is a gene implicated in cancer development. The name was coined in 2009 to describe the dual role of this gene, encoding both an antisense RNA that regulates the p53 tumor suppressor and a protein involved in DNA repair, telomere elongation and maintenance of nuclear organelles Cajal bodies.
Ubiquitin-Protein Ligase E3B (UBE3B) is an enzyme encoded by UBE3B gene in humans. UBE3B has an N-terminal IQ motif, which mediates calcium-independent calmodulin binding and a large C-terminal catalytic HECT domain.