This article may be too technical for most readers to understand.(May 2017) |
For molecular biology in mammals, DNA demethylation causes replacement of 5-methylcytosine (5mC) in a DNA sequence by cytosine (C) (see figure of 5mC and C). DNA demethylation can occur by an active process at the site of a 5mC in a DNA sequence or, in replicating cells, by preventing addition of methyl groups to DNA so that the replicated DNA will largely have cytosine in the DNA sequence (5mC will be diluted out).
Methylated cytosine is frequently present in the linear DNA sequence where a cytosine is followed by a guanine in a 5' → 3' direction (a CpG site). In mammals, DNA methyltransferases (which add methyl groups to DNA bases) exhibit a strong sequence preference for cytosines at CpG sites. [1] There appear to be more than 20 million CpG dinucleotides in the human genome (see genomic distribution). In mammals, on average, 70% to 80% of CpG cytosines are methylated, [2] though the level of methylation varies with different tissues. Methylated cytosines often occur in groups or CpG islands within the promoter regions of genes, where such methylation may reduce or silence gene expression (see gene expression). Methylated cytosines in the gene body, however, are positively correlated with expression. [3]
Almost 100% DNA demethylation occurs by a combination of passive dilution and active enzymatic removal during the reprogramming that occurs in early embryogenesis and in gametogenesis. Another large demethylation, of about 3% of all genes, can occur by active demethylation in neurons during formation of a strong memory. [4] After surgery, demethylations are found in peripheral blood mononuclear cells at sites annotated to immune system genes. [5] Demethylations also occur during the formation of cancers. [6] During global DNA hypomethylation of tumor genomes, there is a minor to moderate reduction of the number of methylated cytosines (5mC) amounting to a loss of about 5% to 20% on average of the 5mC bases. [7]
The mouse sperm genome is 80–90% methylated at its CpG sites in DNA, amounting to about 20 million methylated sites.[ citation needed ] After fertilization, the paternal chromosome is almost completely demethylated in six hours by an active process, before DNA replication (blue line in Figure).
Demethylation of the maternal genome occurs by a different process. In the mature oocyte, about 40% of its CpG sites in DNA are methylated. While somatic cells of mammals have three main DNA methyltransferases (which add methyl groups to cytosines at CpG sites), DNMT1, DNMT3A, and DNMT3B, in the pre-implantation embryo up to the blastocyst stage (see Figure), the only methyltransferase present is an isoform of DNMT1 designated DNMT1o. [8] DNMT1o has an alternative oocyte-specific promoter and first exon (exon 1o) located 5' of the somatic and spermatocyte promoters. As reviewed by Howell et al., [8] DNMT1o is sequestered in the cytoplasm of mature oocytes and in 2-cell and 4-cell embryos, but at the 8-cell stage is only present in the nucleus. At the 16 cell stage (the morula) DNMT1o is again found only in the cytoplasm. It appears that demethylation of the maternal chromosomes largely takes place by blockage of the methylating enzyme DNMT1o from entering the nucleus except briefly at the 8 cell stage. The maternal-origin DNA thus undergoes passive demethylation by dilution of the methylated maternal DNA during replication (red line in Figure). The morula (at the 16 cell stage), has only a small amount of DNA methylation (black line in Figure).
DNMT3b begins to be expressed in the blastocyst. [9] Methylation begins to increase at 3.5 days after fertilization in the blastocyst, and a large wave of methylation then occurs on days 4.5 to 5.5 in the epiblast, going from 12% to 62% methylation, and reaching maximum level after implantation in the uterus. [10] By day seven after fertilization, the newly formed primordial germ cells (PGC) in the implanted embryo segregate from the remaining somatic cells. At this point the PGCs have about the same level of methylation as the somatic cells.
The newly formed primordial germ cells (PGC) in the implanted embryo devolve from the somatic cells. At this point the PGCs have high levels of methylation. These cells migrate from the epiblast toward the gonadal ridge. As reviewed by Messerschmidt et al., [11] the majority of PGCs are arrested in the G2 phase of the cell cycle, while they migrate toward the hindgut during embryo days 7.5 to 8.5. Then demethylation of the PGCs takes place in two waves. [11] At day 9.5 the primordial germ cells begin to rapidly replicate going from about 200 PGCs at embryo day 9.5 to about 10,000 PGCs at day 12.5. [12] During days 9.5 to 12.5 DNMT3a and DNMT3b are repressed and DNMT1 is present in the nucleus at a high level. But DNMT1 is unable to methylate cytosines during days 9.5 to 12.5 because the UHRF1 gene (also known as NP95) is repressed and UHRF1 is an essential protein needed to recruit DNMT1 to replication foci where maintenance DNA methylation takes place. [12] This is a passive, dilution form of demethylation.
In addition, from embryo day 9.5 to 13.5 there is an active form of demethylation. As indicated below in "Molecular stages of active reprogramming," two enzymes are central to active demethylation. These are a ten-eleven translocation methylcytosine dioxygenase (TET) and thymine-DNA glycosylase (TDG). One particular TET enzyme, TET1, and TDG are present at high levels from embryo day 9.5 to 13.5, [12] and are employed in active demethylation during gametogenesis. [11] PGC genomes display the lowest levels of DNA methylation of any cells in the entire life cycle of the mouse at embryonic day 13.5. [13]
Learning and memory have levels of permanence, differing from other mental processes such as thought, language, and consciousness, which are temporary in nature. Learning and memory can be either accumulated slowly (multiplication tables) or rapidly (touching a hot stove), but once attained, can be recalled into conscious use for a long time. Rats subjected to one instance of contextual fear conditioning create an especially strong long-term memory. At 24 hours after training, 9.17% of the genes in the genomes of rat hippocampus neurons were found to be differentially methylated. This included more than 2,000 differentially methylated genes at 24 hours after training, with over 500 genes being demethylated. [4] Similar results to that in the rat hippocampus were also obtained in mice with contextual fear conditioning. [14]
The hippocampus region of the brain is where contextual fear memories are first stored (see figure of the brain, this section), but this storage is transient and does not remain in the hippocampus. In rats contextual fear conditioning is abolished when the hippocampus is subjected to hippocampectomy just one day after conditioning, but rats retain a considerable amount of contextual fear when hippocampectomy is delayed by four weeks. [15] In mice, examined at 4 weeks after conditioning, the hippocampus methylations and demethylations were reversed (the hippocampus is needed to form memories but memories are not stored there) while substantial differential CpG methylation and demethylation occurred in cortical neurons during memory maintenance. There were 1,223 differentially methylated genes in the anterior cingulate cortex of mice four weeks after contextual fear conditioning. Thus, while there were many methylations in the hippocampus shortly after memory was formed, all these hippocampus methylations were demethylated as soon as four weeks later.
The human genome contains about 28 million CpG sites, and roughly 60% of the CpG sites are methylated at the 5 position of the cytosine. [16] During formation of a cancer there is an average reduction of the number of methylated cytosines of about 5% to 20%, [17] or about 840,00 to 3.4 million demethylations of CpG sites.
DNMT1 methylates CpGs on hemi-methylated DNA during DNA replication. Thus, when a DNA strand has a methylated CpG, and the newly replicated strand during semi-conservative replication lacks a methyl group on the complementary CpG, DNMT1 is normally recruited to the hemimethylated site and adds a methyl group to cytosine in the newly synthesized CpG. However, recruitment of DNMT1 to hemimethylated CpG sites during DNA replication depends on the UHRF1 protein. If UHRF1 does not bind to a hemimethylated CpG site, then DNMT1 is not recruited and cannot methylate the newly synthesized CpG site. The arginine methyltransferase PRMT6 regulates DNA methylation by methylating the arginine at position 2 of histone 3 (H3R2me2a). [18] (See Protein methylation#Arginine.) In the presence of H3R2me2a UHRF1 can not bind to a hemimethylated CpG site, and then DNMT1 is not recruited to the site, and the site remains hemimethylated. Upon further rounds of replication the methylated CpG is passively diluted out. PRMT6 is frequently overexpressed in many types of cancer cells. [19] The overexpression of PRMT6 may be a source of DNA demethylation in cancer.
Three molecular stages are required for actively, enzymatically reprogramming the DNA methylome. Stage 1: Recruitment. The enzymes needed for reprogramming are recruited to genome sites that require demethylation or methylation. Stage 2: Implementation. The initial enzymatic reactions take place. In the case of methylation, this is a short step that results in the methylation of cytosine to 5-methylcytosine. Stage 3: Base excision DNA repair. The intermediate products of demethylation are catalysed by specific enzymes of the base excision DNA repair pathway that finally restore cystosine in the DNA sequence.
Demethylation of 5-methylcytosine to generate 5-hydroxymethylcytosine (5hmC) very often initially involves oxidation of 5mC (see Figure in this section) by ten-eleven translocation methylcytosine dioxygenases (TET enzymes). [21] The molecular steps of this initial demethylation are shown in detail in TET enzymes. In successive steps (see Figure) TET enzymes further hydroxylate 5hmC to generate 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Thymine-DNA glycosylase (TDG) recognizes the intermediate bases 5fC and 5caC and excises the glycosidic bond resulting in an apyrimidinic site (AP site). This is followed by base excision repair (stage 3). In an alternative oxidative deamination pathway, 5hmC can be oxidatively deaminated by APOBEC (AID/APOBEC) deaminases to form 5-hydroxymethyluracil (5hmU). Also, 5mC can be converted to thymine (Thy). 5hmU can be cleaved by TDG, MBD4, NEIL1 or SMUG1. AP sites and T:G mismatches are then repaired by base excision repair (BER) enzymes to yield cytosine (Cyt). The TET family of dioxygenases are employed in the most frequent type of demethylation reactions. [21]
TET dioxygenase isoforms include at least two isoforms of TET1, one of TET2 and three isoforms of TET3. [22] [23] The full-length canonical TET1 isoform appears virtually restricted to early embryos, embryonic stem cells and primordial germ cells (PGCs). The dominant TET1 isoform in most somatic tissues, at least in the mouse, arises from alternative promoter usage which gives rise to a short transcript and a truncated protein designated TET1s. The isoforms of TET3 are the full length form TET3FL, a short form splice variant TET3s, and a form that occurs in oocytes and neurons designated TET3o. TET3o is created by alternative promoter use and contains an additional first N-terminal exon coding for 11 amino acids. TET3o only occurs in oocytes and neurons and is not expressed in embryonic stem cells or in any other cell type or adult mouse tissue tested. Whereas TET1 expression can barely be detected in oocytes and zygotes, and TET2 is only moderately expressed, the TET3 variant TET3o shows extremely high levels of expression in oocytes and zygotes, but is nearly absent at the 2-cell stage. It is possible that TET3o, high in neurons, oocytes and zygotes at the one cell stage, is the major TET enzyme utilized when very large scale rapid demethylations occur in these cells.
The TET enzymes do not specifically bind to 5-methylcytosine except when recruited. Without recruitment or targeting, TET1 predominantly binds to high CG promoters and CpG islands (CGIs) genome-wide by its CXXC domain that can recognize un-methylated CGIs. [24] TET2 does not have an affinity for 5-methylcytosine in DNA. [25] The CXXC domain of the full-length TET3, which is the predominant form expressed in neurons, binds most strongly to CpGs where the C was converted to 5-carboxycytosine (5caC). However, it also binds to un-methylated CpGs. [23]
For a TET enzyme to initiate demethylation it must first be recruited to a methylated CpG site in DNA. Two of the proteins shown to recruit a TET enzyme to a methylated cytosine in DNA are OGG1 (see figure Initiation of DNA demethylation at a CpG site) [26] and EGR1. [27]
Oxoguanine glycosylase (OGG1) catalyses the first step in base excision repair of the oxidatively damaged base 8-OHdG. OGG1 finds 8-OHdG by sliding along the linear DNA at 1,000 base pairs of DNA in 0.1 seconds. [28] OGG1 very rapidly finds 8-OHdG. OGG1 proteins bind to oxidatively damaged DNA with a half maximum time of about 6 seconds. [29] When OGG1 finds 8-OHdG it changes conformation and complexes with 8-OHdG in its binding pocket. [30] OGG1 does not immediately act to remove the 8-OHdG. Half maximum removal of 8-OHdG takes about 30 minutes in HeLa cells in vitro, [31] or about 11 minutes in the livers of irradiated mice. [32] DNA oxidation by reactive oxygen species preferentially occurs at a guanine in a methylated CpG site, because of a lowered ionization potential of guanine bases adjacent to 5-methylcytosine. [33] TET1 binds (is recruited to) the OGG1 bound to 8-OHdG (see figure). [26] This likely allows TET1 to demethylate an adjacent methylated cytosine. When human mammary epithelial cells (MCF-10A) were treated with H2O2, 8-OHdG increased in DNA by 3.5-fold and this caused about 80% demethylation of the 5-methylcytosines in the MCF-10A genome. [26]
The gene early growth response protein 1 ( EGR1 ) is an immediate early gene (IEG). EGR1 can rapidly be induced by neuronal activity. [34] The defining characteristic of IEGs is the rapid and transient up-regulation—within minutes—of their mRNA levels independent of protein synthesis. [35] In adulthood, EGR1 is expressed widely throughout the brain, maintaining baseline expression levels in several key areas of the brain including the medial prefrontal cortex, striatum, hippocampus and amygdala. [35] This expression is linked to control of cognition, emotional response, social behavior and sensitivity to reward. [35] EGR1 binds to DNA at sites with the motifs 5′-GCGTGGGCG-3′ and 5'-GCGGGGGCGG-3′ and these motifs occur primarily in promoter regions of genes. [34] The short isoform TET1s is expressed in the brain. EGR1 and TET1s form a complex mediated by the C-terminal regions of both proteins, independently of association with DNA. [34] EGR1 recruits TET1s to genomic regions flanking EGR1 binding sites. [34] In the presence of EGR1, TET1s is capable of locus-specific demethylation and activation of the expression of downstream genes regulated by EGR1. [34]
As indicated in the Figure above, captioned "Demethylation of 5-methylcytosine," the first step in active demethylation is a TET oxidation of 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC). The demethylation process, in some tissues and at some genome locations, may stop at that point. As reviewed by Uribe-Lewis et al., [36] in addition to being an intermediate in active DNA demethylation, 5hmC is often a stable DNA modification. Within the genome, 5hmC is located at transcriptionally active genes, regulatory elements and chromatin associated complexes. In particular, 5hmC is dynamically changed and positively correlated with active gene transcription during cell lineage specification, and high levels of 5hmC are found in embryonic stem cells and in the central nervous system. [37] In humans, defective 5-hydroxymethylating activity is associated with a phenotype of lymphoproliferation, immunodeficiency and autoimmunity. [38]
The third stage of DNA demethylation is removal of the intermediate products of demethylation generated by a TET enzyme by base excision repair. As indicated above in Stage 2, after 5mC is first oxidized by a TET to form 5hmC, further oxidation of 5hmC by TET yields 5fC and oxidation of 5fC by TET yields 5caC. Both 5fC and 5caC are recognized by a DNA glycosylase, TDG, a base excision repair enzyme, as an abnormal base. As shown in the Figure in this section, TDG removes the abnormal base (e.g. 5fC) while leaving the sugar-phosphate backbone intact, creating an apurinic/apyrimidinic site, commonly referred to as an AP site. In this Figure, the 8-OHdG is left in the DNA, since it may have been present when OGG1 attracted TET1 to the CpG site with a methylated cytosine. After an AP site is formed, AP endonuclease creates a nick in the phosphodiester backbone of the AP site that was formed when the TDG DNA glycosylase removed the 5fC or 5caC. The human AP endonuclease incises DNA 5′ to the AP site by a hydrolytic mechanism, leaving a 3′-hydroxyl and a 5′-deoxyribose phosphate (5' dRP) residue. [39] This is followed by either short patch or long patch repair. In short patch repair, 5′ dRP lyase trims the 5′ dRP end to form a phosphorylated 5′ end. This is followed by DNA polymerase β (pol β) adding a single cytosine to pair with the pre-existing guanine in the complementary strand and then DNA ligase to seal the cut strand. In long patch repair, DNA synthesis is thought to be mediated by polymerase δ and polymerase ε performing displacement synthesis to form a flap. Pol β can also perform long-patch displacement synthesis. Long-patch synthesis typically inserts 2–10 new nucleotides. Then flap endonuclease removes the flap, and this is followed by DNA ligase to seal the strand. At this point there has been a complete replacement of the 5-methylcytosine by cytosine (demethylation) in the DNA sequence.
Physical exercise has well established beneficial effects on learning and memory (see Neurobiological effects of physical exercise). BDNF is a particularly important regulator of learning and memory. [40] As reviewed by Fernandes et al., [41] in rats, exercise enhances the hippocampus expression of the gene Bdnf , which has an essential role in memory formation. Enhanced expression of Bdnf occurs through demethylation of its CpG island promoter at exon IV [41] and this demethylation depends on steps illustrated in the two figures. [20]
In a panel of healthy adults, negative associations were found between total DNA methylation and exposure to traffic related air pollution. DNA methylation levels were associated both with recent and chronic exposure to Black Carbon as well as benzene. [42]
After injury, neurons in the adult peripheral nervous system can switch from a dormant state with little axonal growth to robust axon regeneration. DNA demethylation in mature mammalian neurons removes barriers to axonal regeneration. [43] This demethylation, in regenerating mouse peripheral neurons, depends upon TET3 to generate 5-hydroxymethylcytosine (5hmC) in DNA. [43] [44] 5hmC was altered in a large set of regeneration-associated genes (RAGs), including well-known RAGs such as Atf3 , Bdnf , and Smad1 , that regulate the axon growth potential of neurons. [44]
In biology, epigenetics is the study of heritable traits, or a stable change of cell function, that happen without changes to the DNA sequence. The Greek prefix epi- in epigenetics implies features that are "on top of" or "in addition to" the traditional genetic mechanism of inheritance. Epigenetics usually involves a change that is not erased by cell division, and affects the regulation of gene expression. Such effects on cellular and physiological phenotypic traits may result from environmental factors, or be part of normal development. Epigenetic factors can also lead to cancer.
5-Methylcytosine is a methylated form of the DNA base cytosine (C) that regulates gene transcription and takes several other biological roles. When cytosine is methylated, the DNA maintains the same sequence, but the expression of methylated genes can be altered. 5-Methylcytosine is incorporated in the nucleoside 5-methylcytidine.
Transcription is the process of copying a segment of DNA into RNA. Some segments of DNA are transcribed into RNA molecules that can encode proteins, called messenger RNA (mRNA). Other segments of DNA are transcribed into RNA molecules called non-coding RNAs (ncRNAs).
The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites occur with high frequency in genomic regions called CpG islands.
A regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism. Regulation of gene expression is an essential feature of all living organisms and viruses.
In biochemistry, the DNA methyltransferase family of enzymes catalyze the transfer of a methyl group to DNA. DNA methylation serves a wide variety of biological functions. All the known DNA methyltransferases use S-adenosyl methionine (SAM) as the methyl donor.
In biology and genetics, the germline is the population of a multicellular organism's cells that develop into germ cells. In other words, they are the cells that form gametes, which can come together to form a zygote. They differentiate in the gonads from primordial germ cells into gametogonia, which develop into gametocytes, which develop into the final gametes. This process is known as gametogenesis.
In molecular biology and genetics, transcriptional regulation is the means by which a cell regulates the conversion of DNA to RNA (transcription), thereby orchestrating gene activity. A single gene can be regulated in a range of ways, from altering the number of copies of RNA that are transcribed, to the temporal control of when the gene is transcribed. This control allows the cell or organism to respond to a variety of intra- and extracellular signals and thus mount a response. Some examples of this include producing the mRNA that encode enzymes to adapt to a change in a food source, producing the gene products involved in cell cycle specific activities, and producing the gene products responsible for cellular differentiation in multicellular eukaryotes, as studied in evolutionary developmental biology.
Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products. Sophisticated programs of gene expression are widely observed in biology, for example to trigger developmental pathways, respond to environmental stimuli, or adapt to new food sources. Virtually any step of gene expression can be modulated, from transcriptional initiation, to RNA processing, and to the post-translational modification of a protein. Often, one gene regulator controls another, and so on, in a gene regulatory network.
DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts to repress gene transcription. In mammals, DNA methylation is essential for normal development and is associated with a number of key processes including genomic imprinting, X-chromosome inactivation, repression of transposable elements, aging, and carcinogenesis.
In biology, reprogramming refers to erasure and remodeling of epigenetic marks, such as DNA methylation, during mammalian development or in cell culture. Such control is also often associated with alternative covalent modifications of histones.
DNA oxidation is the process of oxidative damage of deoxyribonucleic acid. As described in detail by Burrows et al., 8-oxo-2'-deoxyguanosine (8-oxo-dG) is the most common oxidative lesion observed in duplex DNA because guanine has a lower one-electron reduction potential than the other nucleosides in DNA. The one electron reduction potentials of the nucleosides are guanine 1.29, adenine 1.42, cytosine 1.6 and thymine 1.7. About 1 in 40,000 guanines in the genome are present as 8-oxo-dG under normal conditions. This means that >30,000 8-oxo-dGs may exist at any given time in the genome of a human cell. Another product of DNA oxidation is 8-oxo-dA. 8-oxo-dA occurs at about 1/10 the frequency of 8-oxo-dG. The reduction potential of guanine may be reduced by as much as 50%, depending on the particular neighboring nucleosides stacked next to it within DNA.
DNA (cytosine-5)-methyltransferase 1(Dnmt1) is an enzyme that catalyzes the transfer of methyl groups to specific CpG sites in DNA, a process called DNA methylation. In humans, it is encoded by the DNMT1 gene. Dnmt1 forms part of the family of DNA methyltransferase enzymes, which consists primarily of DNMT1, DNMT3A, and DNMT3B.
5-Hydroxymethylcytosine (5hmC) is a DNA pyrimidine nitrogen base derived from cytosine. It is potentially important in epigenetics, because the hydroxymethyl group on the cytosine can possibly switch a gene on and off. It was first seen in bacteriophages in 1952. However, in 2009 it was found to be abundant in human and mouse brains, as well as in embryonic stem cells. In mammals, it can be generated by oxidation of 5-methylcytosine, a reaction mediated by TET enzymes.
8-Oxo-2'-deoxyguanosine (8-oxo-dG) is an oxidized derivative of deoxyguanosine. 8-Oxo-dG is one of the major products of DNA oxidation. Concentrations of 8-oxo-dG within a cell are a measurement of oxidative stress.
While the cellular and molecular mechanisms of learning and memory have long been a central focus of neuroscience, it is only in recent years that attention has turned to the epigenetic mechanisms behind the dynamic changes in gene transcription responsible for memory formation and maintenance. Epigenetic gene regulation often involves the physical marking of DNA or associated proteins to cause or allow long-lasting changes in gene activity. Epigenetic mechanisms such as DNA methylation and histone modifications have been shown to play an important role in learning and memory.
Tet methylcytosine dioxygenase 2 (TET2) is a human gene. It resides at chromosome 4q24, in a region showing recurrent microdeletions and copy-neutral loss of heterozygosity (CN-LOH) in patients with diverse myeloid malignancies.
DNA damage is an alteration in the chemical structure of DNA, such as a break in a strand of DNA, a nucleobase missing from the backbone of DNA, or a chemically changed base such as 8-OHdG. DNA damage can occur naturally or via environmental factors, but is distinctly different from mutation, although both are types of error in DNA. DNA damage is an abnormal chemical structure in DNA, while a mutation is a change in the sequence of base pairs. DNA damages cause changes in the structure of the genetic material and prevents the replication mechanism from functioning and performing properly. The DNA damage response (DDR) is a complex signal transduction pathway which recognizes when DNA is damaged and initiates the cellular response to the damage.
Tet methylcytosine dioxygenase 3 is a protein that in humans is encoded by the TET3 gene.
The TET enzymes are a family of ten-eleven translocation (TET) methylcytosine dioxygenases. They are instrumental in DNA demethylation. 5-Methylcytosine is a methylated form of the DNA base cytosine (C) that often regulates gene transcription and has several other functions in the genome.