The aryl hydrocarbon receptor (also known as AhR, AHR, ahr, ahR, AH receptor, or as the dioxin receptor) is a protein that in humans is encoded by the AHR gene. The aryl hydrocarbon receptor is a transcription factor that regulates gene expression. It was originally thought to function primarily as a sensor of xenobiotic chemicals and also as the regulator of enzymes such as cytochrome P450s that metabolize these chemicals. The most notable of these xenobiotic chemicals are aromatic (aryl) hydrocarbons from which the receptor derives its name.
More recently, it has been discovered that AhR is activated (or deactivated) by a number of endogenous indole derivatives such as kynurenine. In addition to regulating metabolism enzymes, the AhR has roles in regulating immune cells, stem cell maintenance, and cellular differentiation. [5] [6] [7]
The aryl hydrocarbon receptor is a member of the family of basic helix-loop-helix transcription factors. AhR binds several exogenous ligands such as natural plant flavonoids, polyphenols and indoles, as well as synthetic polycyclic aromatic hydrocarbons and dioxin-like compounds. AhR is a cytosolic transcription factor that is normally inactive, bound to several co-chaperones. Upon ligand binding to chemicals such as 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), the chaperones dissociate resulting in AhR translocating into the nucleus and dimerizing with ARNT (AhR nuclear translocator), leading to changes in gene transcription.
The AhR protein contains several domains critical for function and is classified as a member of the basic helix-loop-helix/Per-Arnt-Sim (bHLH/PAS) family of transcription factors. [8] [9] The bHLH motif is located in the N-terminal of the protein and is a common entity in a variety of transcription factors. [10] Members of the bHLH superfamily have two functionally distinctive and highly conserved domains. The first is the basic-region (b), which is involved in the binding of the transcription factor to DNA. The second is the helix-loop-helix (HLH) region, which facilitates protein-protein interactions. Also contained with the AhR are two PAS domains, PAS-A and PAS-B, which are stretches of 200–350 amino acids that exhibit a high sequence homology to the protein domains that were originally found in the Drosophila genes period (Per) and single-minded (Sim) and in AhR's dimerization partner the aryl hydrocarbon receptor nuclear translocator (ARNT). [11] The PAS domains support specific secondary interactions with other PAS domain containing proteins, as is the case with AhR and ARNT, so that dimeric and heteromeric protein complexes can form. The ligand binding site of AhR is contained within the PAS-B domain [12] and contains several conserved residues critical for ligand binding. [13] Finally, a glutamine-rich (Q-rich) domain is located in the C-terminal region of the protein and is involved in co-activator recruitment and transactivation. [14]
AhR ligands have been generally classified into two categories, synthetic or naturally occurring. The first ligands to be discovered were synthetic aromatic hydrocarbons such as the polychlorinated dibenzodioxins, dibenzofurans, biphenyls and the non-chlorinated naphthoflavones alongside the naturally occurring polycyclic aromatic hydrocarbons (3-methylcholanthrene, benzo[a]pyrene and benzanthracene). [15] [16] A range of synthetic ligands have been designed as potential breast cancer treatments. [17]
Research has focused on other naturally occurring compounds with the hope of identifying an endogenous ligand. Naturally occurring compounds that have been identified as ligands of Ahr include derivatives of tryptophan such as indigo dye and indirubin, [18] tetrapyrroles such as bilirubin, [19] the arachidonic acid metabolites lipoxin A4 and prostaglandin G, [20] modified low-density lipoprotein [21] and several dietary carotenoids. [16] One assumption made in the search for an endogenous ligand is that the ligand will be a receptor agonist. However, work by Savouret et al. has shown this may not be the case since their findings demonstrate that 7-ketocholesterol competitively inhibits Ahr signal transduction. [22]
Carbidopa is a selective aryl hydrocarbon receptor modulator (SAhRM). [23] Other SAhRMs include microbial-derived 1,4-dihydroxy-2-naphthoic acid [24] and plant-derived 3,3'-diindolylmethane. [25]
Indolocarbazole [ clarification needed ] (ICZ) is one of the strongest non-halogenated agonists for AhR in vitro reported. [26]
Ligand-independent AhR activity can be seen in mammalian AhR. The mammalian AhR needs no exogenous ligand-dependent activation to be functional, and this appears to be the case for its role in the regulation of the expression of some transforming growth factor-beta (TGF-b) isoforms. This is not to say that ligand-dependent AhR activation is not needed for the AhR to function in those cases, but that, if a ligand is needed, it is provided endogenously by the cells or tissues in question and its identity is unknown. [27]
Non-ligand bound AhR is retained in the cytoplasm as an inactive protein complex consisting of a dimer of Hsp90, [28] [29] prostaglandin E synthase 3 (PTGES3, p23) [30] [31] [32] [33] and a single molecule of the immunophilin-like AH receptor-interacting protein, also known as hepatitis B virus X-associated protein 2 (XAP2), [34] AhR interacting protein (AIP), [35] [36] and AhR-activated 9 (ARA9). [37] The dimer of Hsp90, along with PTGES3 (p23), has a multifunctional role in the protection of the receptor from proteolysis, constraining the receptor in a conformation receptive to ligand binding and preventing the premature binding of ARNT. [12] [31] [33] [38] [39] [40] AIP interacts with carboxyl-terminal of Hsp90 and binds to the AhR nuclear localization sequence (NLS) preventing the inappropriate trafficking of the receptor into the nucleus. [41] [42] [43]
TGF-β cytokines are members of a signaling protein family that includes activin, Nodal subfamily, bone morphogenetic proteins, growth and differentiation factors, and Müllerian inhibitor subfamily. TGF-β signaling plays an important role in cell physiology and development by inhibiting cell proliferation, promoting apoptosis, inducing differentiation, and determining developmental fate in vertebrates and invertebrates. [44] TGF-β activators include proteases such as plasmin, cathepsins, and calpains. Thrombospondin 1, a glycoprotein that inhibits angiogenesis, and matrix metalloproteinase 2 (MMP-2). The extracellular matrix itself appears to play an important regulatory role in TGF-β signaling. [45] [46]
Upon ligand binding to AhR, AIP is released resulting in exposure of the NLS, which is located in the bHLH region, [47] leading to import into the nucleus. [48] It is presumed that once in the nucleus, Hsp90 dissociates exposing the two PAS domains allowing the binding of ARNT. [40] [49] [50] [51] The activated AhR/ARNT heterodimer complex is then capable of either directly or indirectly interacting with DNA by binding to recognition sequences located in the 5’- regulatory region of dioxin-responsive genes. [40] [50] [52]
The classical recognition motif of the AhR/ARNT complex, referred to as either the AhR-, dioxin- or xenobiotic- responsive element (AHRE, DRE or XRE), contains the core sequence 5'-GCGTG-3' [53] within the consensus sequence 5'-T/GNGCGTGA/CG/CA-3' [54] [55] in the promoter region of AhR responsive genes. The AhR/ARNT heterodimer directly binds the AHRE/DRE/XRE core sequence in an asymmetric manner such that ARNT binds to 5'-GTG-3' and AhR binding 5'-TC/TGC-3'. [56] [57] [58] Recent research suggests that a second type of element termed AHRE-II, 5'-CATG(N6)C[T/A]TG-3', is capable of indirectly acting with the AhR/ARNT complex. [59] [60] Regardless of the response element, the result is a variety of differential changes in gene expression.
In terms of evolution, the oldest physiological role of AhR is in development. AhR is presumed to have evolved from invertebrates where it served a ligand-independent role in normal development processes. [61] The AhR homolog in Drosophila, spineless (ss) is necessary for development of the distal segments of the antenna and leg. [62] [63] Ss dimerizes with tango (tgo), which is the homolog to the mammalian Arnt, to initiate gene transcription. Evolution of the receptor in vertebrates resulted in the ability to bind ligands and might have helped humans evolve to tolerate smoky fires. In developing vertebrates, AhR seemingly plays a role in cellular proliferation and differentiation. [64] Despite lacking a clear endogenous ligand, AhR appears to play a role in the differentiation of many developmental pathways, including hematopoiesis, [65] lymphoid systems, [66] [67] T-cells, [68] neurons, [69] and hepatocytes. [70] AhR has also been found to have an important function in hematopoietic stem cells: AhR antagonism promotes their self-renewal and ex-vivo expansion [71] and is involved in megakaryocyte differentiation. [72] In adulthood, signaling is associated with the stress response and mutations in AhR are associated with major depressive disorder. [73]
The adaptive response is manifested as the induction of xenobiotic metabolizing enzymes. Evidence of this response was first observed from the induction of cytochrome P450, family 1, subfamily A, polypeptide 1 (Cyp1a1) resultant from TCDD exposure, which was determined to be directly related to activation of the AhR signaling pathway. [74] [75] [76] The search for other metabolizing genes induced by AhR ligands, due to the presence of DREs, has led to the identification of an "AhR gene battery" of Phase I and Phase II metabolizing enzymes consisting of CYP1A1, CYP1A2, CYP1B1, NQO1, ALDH3A1, UGT1A2 and GSTA1. [77] Presumably, vertebrates have this function to be able to detect a wide range of chemicals, indicated by the wide range of substrates AhR is able to bind and facilitate their biotransformation and elimination. The AhR may also signal the presence of toxic chemicals in food and cause aversion of such foods. [78]
AhR activation seems to be also important for immunological responses and inhibiting inflammation [67] through upregulation of interleukin 22 [79] and downregulation of Th17 response. [80] The Knockdown of AHR mostly downregulates the expression of innate immunity genes in THP-1 cells. [81]
Extensions of the adaptive response are the toxic responses elicited by AhR activation. Toxicity results from two different ways of AhR signaling. The first is a side effect of the adaptive response in which the induction of metabolizing enzymes results in the production of toxic metabolites. For example, the polycyclic aromatic hydrocarbon benzo[a]pyrene (BaP), a ligand for AhR, induces its own metabolism and bioactivation to a toxic metabolite via the induction of CYP1A1 and CYP1B1 in several tissues. [82] The second approach to toxicity is the result of aberrant changes in global gene transcription beyond those observed in the "AhR gene battery." These global changes in gene expression lead to adverse changes in cellular processes and function. [83] Microarray analysis has proved most beneficial in understanding and characterizing this response. [64] [84] [85] [86]
Xenobiotic metabolizing enzymes help with the metabolic process by transforming and the excretion of chemicals. The most potent inducer of CYP1A1 is 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). In addition, TCDD induces a broad spectrum of biochemical and toxic effects, such as teratogenesis, immunosuppression and tumor promotion. Most, if not all, of the effects caused by TCDD and other PAHs are known to be mediated by AhR which has a high binding affinity to TCDD. [44]
In addition to the protein interactions mentioned above, AhR has also been shown to interact with the following:
The androgen receptor (AR), also known as NR3C4, is a type of nuclear receptor that is activated by binding any of the androgenic hormones, including testosterone and dihydrotestosterone, in the cytoplasm and then translocating into the nucleus. The androgen receptor is most closely related to the progesterone receptor, and progestins in higher dosages can block the androgen receptor.
The glucocorticoid receptor also known as NR3C1 is the receptor to which cortisol and other glucocorticoids bind.
Transcription factor Sp1, also known as specificity protein 1* is a protein that in humans is encoded by the SP1 gene.
Estrogen receptor alpha (ERα), also known as NR3A1, is one of two main types of estrogen receptor, a nuclear receptor that is activated by the sex hormone estrogen. In humans, ERα is encoded by the gene ESR1.
The constitutive androstane receptor (CAR) also known as nuclear receptor subfamily 1, group I, member 3 is a protein that in humans is encoded by the NR1I3 gene. CAR is a member of the nuclear receptor superfamily and along with pregnane X receptor (PXR) functions as a sensor of endobiotic and xenobiotic substances. In response, expression of proteins responsible for the metabolism and excretion of these substances is upregulated. Hence, CAR and PXR play a major role in the detoxification of foreign substances such as drugs.
The ARNT gene encodes the aryl hydrocarbon receptor nuclear translocator protein that forms a complex with ligand-bound aryl hydrocarbon receptor (AhR), and is required for receptor function. The encoded protein has also been identified as the beta subunit of a heterodimeric transcription factor, hypoxia-inducible factor 1 (HIF1). A t(1;12)(q21;p13) translocation, which results in a TEL–ARNT fusion protein, is associated with acute myeloblastic leukemia. Three alternatively spliced variants encoding different isoforms have been described for this gene.
The aryl-hydrocarbon receptor repressor also known as AHRR is a human gene.
The nuclear receptor coactivator 2 also known as NCoA-2 is a protein that in humans is encoded by the NCOA2 gene. NCoA-2 is also frequently called glucocorticoid receptor-interacting protein 1 (GRIP1), steroid receptor coactivator-2 (SRC-2), or transcriptional mediators/intermediary factor 2 (TIF2).
The nuclear receptor 4A1 also known as Nur77, TR3, and NGFI-B is a protein that in humans is encoded by the NR4A1 gene.
Transcription factor p65 also known as nuclear factor NF-kappa-B p65 subunit is a protein that in humans is encoded by the RELA gene.
Heat shock protein HSP 90-alpha is a protein that in humans is encoded by the HSP90AA1 gene.
Receptor tyrosine-protein kinase erbB-4 is an enzyme that in humans is encoded by the ERBB4 gene. Alternatively spliced variants that encode different protein isoforms have been described; however, not all variants have been fully characterized.
Heat shock factor 1 is a protein that in humans is encoded by the HSF1 gene. HSF1 is highly conserved in eukaryotes and is the primary mediator of transcriptional responses to proteotoxic stress with important roles in non-stress regulation such as development and metabolism.
Hsp90 co-chaperone Cdc37 is a protein that in humans is encoded by the CDC37 gene. This protein is highly similar to Cdc 37, a cell division cycle control protein of Saccharomyces cerevisiae. This protein is a HSP90 Co-chaperone with specific function in cell signal transduction. It has been shown to form complex with Hsp90 and a variety of protein kinases including CDK4, CDK6, SRC, RAF1, MOK, as well as eIF-2 alpha kinases. It is thought to play a critical role in directing Hsp90 to its target kinases.
Interferon regulatory factor 7, also known as IRF7, is a member of the interferon regulatory factor family of transcription factors.
COUP-TF1 also known as NR2F1 is a protein that in humans is encoded by the NR2F1 gene. This protein is a member of nuclear hormone receptor family of steroid hormone receptors.
Liver X receptor beta (LXR-β) is a member of the nuclear receptor family of transcription factors. LXR-β is encoded by the NR1H2 gene.
AH receptor-interacting protein (AIP) also known as aryl hydrocarbon receptor-interacting protein, immunophilin homolog ARA9, or HBV X-associated protein 2 (XAP-2) is a protein that in humans is encoded by the AIP gene. The protein is a member of the FKBP family.
Single-minded homolog 1, also known as class E basic helix-loop-helix protein 14 (bHLHe14), is a protein that in humans is encoded by the SIM1 gene.
6-Formylindolo[3,2-b]carbazole (FICZ) is a chemical compound with the molecular formula C19H12N2O. It is a nitrogen heterocycle, having an extremely high affinity (Kd = 7 x 10−11M) for binding to the aryl hydrocarbon receptor (AHR).