Epigenomics

Last updated

Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome. The field is analogous to genomics and proteomics, which are the study of the genome and proteome of a cell. [1] [2] Epigenetic modifications are reversible modifications on a cell's DNA or histones that affect gene expression without altering the DNA sequence. [3] Epigenomic maintenance is a continuous process and plays an important role in stability of eukaryotic genomes by taking part in crucial biological mechanisms like DNA repair. [4] [5] Plant flavones are said to be inhibiting epigenomic marks that cause cancers. [6] Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as in differentiation/development [7] and tumorigenesis. [8] The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays. [9] [7]

Contents

Epigenetics

Genomic modifications that alter gene expression that cannot be attributed to modification of the primary DNA sequence and that are heritable mitotically and meiotically are classified as epigenetic modifications. DNA methylation and histone modification are among the best characterized epigenetic processes. [3]

DNA methylation

The first epigenetic modification to be characterized in depth was DNA methylation. As its name implies, DNA methylation is the process by which a methyl group is added to DNA. The enzymes responsible for catalyzing this reaction are the DNA methyltransferases (DNMTs). While DNA methylation is stable and heritable, it can be reversed by an antagonistic group of enzymes known as DNA de-methylases. In eukaryotes, methylation is most commonly found on the carbon 5 position of cytosine residues (5mC) adjacent to guanine, termed CpG dinucleotides. [9] [10]

DNA methylation patterns vary greatly between species and even within the same organism. The usage of methylation among animals is quite different; with vertebrates exhibiting the highest levels of 5mC and invertebrates more moderate levels of 5mC. Some organisms like Caenorhabditis elegans have not been demonstrated to have 5mC nor a conventional DNA methyltransferase; this would suggest that other mechanisms other than DNA methylation are also involved. [11]

Within an organism, DNA methylation levels can also vary throughout development and by region. For example, in mouse primordial germ cells, a genome wide de-methylation even occurs; by implantation stage, methylation levels return to their previous somatic values. [11] When DNA methylation occurs at promoter regions, the sites of transcription initiation, it has the effect of repressing gene expression. This is in contrast to unmethylated promoter regions which are associated with actively expressed genes. [9]

The mechanism by which DNA methylation represses gene expression is a multi-step process. The distinction between methylated and unmethylated cytosine residues is carried out by specific DNA-binding proteins. Binding of these proteins recruit histone deacetylases (HDACs) enzyme which initiate chromatin remodeling such that the DNA becoming less accessible to transcriptional machinery, such as RNA polymerase, effectively repressing gene expression. [12]

Histone modification

In eukaryotes, genomic DNA is coiled into protein-DNA complexes called chromatin. Histones, which are the most prevalent type of protein found in chromatin, function to condense the DNA; the net positive charge on histones facilitates their bonding with DNA, which is negatively charged. The basic and repeating units of chromatin, nucleosomes, consist of an octamer of histone proteins (H2A, H2B, H3 and H4) and a 146 bp length of DNA wrapped around it. Nucleosomes and the DNA connecting form a 10 nm diameter chromatin fiber, which can be further condensed. [13] [14]

Chromatin packaging of DNA varies depending on the cell cycle stage and by local DNA region. [15] The degree to which chromatin is condensed is associated with a certain transcriptional state. Unpackaged or loose chromatin is more transcriptionally active than tightly packaged chromatin because it is more accessible to transcriptional machinery. By remodeling chromatin structure and changing the density of DNA packaging, gene expression can thus be modulated. [14]

Chromatin remodeling occurs via post-translational modifications of the N-terminal tails of core histone proteins. [16] The collective set of histone modifications in a given cell is known as the histone code. Many different types of histone modification are known, including: acetylation, methylation, phosphorylation, ubiquitination, SUMOylation, ADP-ribosylation, deamination and proline isomerization; acetylation, methylation, phosphorylation and ubiquitination have been implicated in gene activation whereas methylation, ubiquitination, SUMOylation, deimination and proline isomerization have been implicated in gene repression. Note that several modification types including methylation, phosphorylation and ubiquitination can be associated with different transcriptional states depending on the specific amino acid on the histone being modified. Furthermore, the DNA region where histone modification occurs can also elicit different effects; an example being methylation of the 3rd core histone at lysine residue 36 (H3K36). When H3K36 occurs in the coding sections of a gene, it is associated with gene activation but the opposite is found when it is within the promoter region. [14]

Histone modifications regulate gene expression by two mechanisms: by disruption of the contact between nucleosomes and by recruiting chromatin remodeling ATPases. An example of the first mechanism occurs during the acetylation of lysine terminal tail amino acids, which is catalyzed by histone acetyltransferases (HATs). HATs are part of a multiprotein complex that is recruited to chromatin when activators bind to DNA binding sites. Acetylation effectively neutralizes the basic charge on lysine, which was involved in stabilizing chromatin through its affinity for negatively charged DNA. Acetylated histones therefore favor the dissociation of nucleosomes and thus unwinding of chromatin can occur. Under a loose chromatin state, DNA is more accessible to transcriptional machinery and thus expression is activated. The process can be reversed through removal of histone acetyl groups by deacetylases. [14] [16]

The second process involves the recruitment of chromatin remodeling complexes by the binding of activator molecules to corresponding enhancer regions. The nucleosome remodeling complexes reposition nucleosomes by several mechanisms, enabling or disabling accessibility of transcriptional machinery to DNA. The SWI/SNF protein complex in yeast is one example of a chromatin remodeling complex that regulates the expression of many genes through chromatin remodeling. [14] [17]

Relation to other genomic fields

Epigenomics shares many commonalities with other genomics fields, in both methodology and in its abstract purpose. Epigenomics seeks to identify and characterize epigenetic modifications on a global level, similar to the study of the complete set of DNA in genomics or the complete set of proteins in a cell in proteomics. [1] [2] The logic behind performing epigenetic analysis on a global level is that inferences can be made about epigenetic modifications, which might not otherwise be possible through analysis of specific loci. [13] [1] As in the other genomics fields, epigenomics relies heavily on bioinformatics, which combines the disciplines of biology, mathematics and computer science. [18] However while epigenetic modifications had been known and studied for decades, it is through these advancements in bioinformatics technology that have allowed analyses on a global scale. Many current techniques still draw on older methods, often adapting them to genomic assays as is described in the next section.

Methods

Histone modification assays

The cellular processes of transcription, DNA replication and DNA repair involve the interaction between genomic DNA and nuclear proteins. It had been known that certain regions within chromatin were extremely susceptible to DNAse I digestion, which cleaves DNA in a low sequence specificity manner. Such hypersensitive sites were thought to be transcriptionally active regions, as evidenced by their association with RNA polymerase and topoisomerases I and II. [19]

It is now known that sensitivity to DNAse I regions correspond to regions of chromatin with loose DNA-histone association. Hypersensitive sites most often represent promoters regions, which require for DNA to be accessible for DNA binding transcriptional machinery to function. [20]

ChIP-Chip and ChIP-Seq

Histone modification was first detected on a genome wide level through the coupling of chromatin immunoprecipitation (ChIP) technology with DNA microarrays, termed ChIP-Chip. [13] However instead of isolating a DNA-binding transcription factor or enhancer protein through chromatin immunoprecipitation, the proteins of interest are the modified histones themselves. First, histones are cross-linked to DNA in vivo through light chemical treatment (e.g., formaldehyde). The cells are next lysed, allowing for the chromatin to be extracted and fragmented, either by sonication or treatment with a non-specific restriction enzyme (e.g., micrococcal nuclease). Modification-specific antibodies in turn, are used to immunoprecipitate the DNA-histone complexes. [14] Following immunoprecipitation, the DNA is purified from the histones, amplified via PCR and labeled with a fluorescent tag (e.g., Cy5, Cy3). The final step involves hybridization of labeled DNA, both immunoprecipitated DNA and non-immunoprecipitated onto a microarray containing immobilized gDNA. Analysis of the relative signal intensity allows the sites of histone modification to be determined. [21] [22]

ChIP-chip was used extensively to characterize the global histone modification patterns of yeast. From these studies, inferences on the function of histone modifications were made; that transcriptional activation or repression was associated with certain histone modifications and by region. While this method was effective providing near full coverage of the yeast epigenome, its use in larger genomes such as humans is limited. [13] [14]

In order to study histone modifications on a truly genome level, other high-throughput methods were coupled with the chromatin immunoprecipitation, namely: SAGE: serial analysis of gene expression (ChIP-SAGE), PET: paired end ditag sequencing (ChIP-PET) and more recently, next-generation sequencing (ChIP-Seq). ChIP-seq follows the same protocol for chromatin immunoprecipitation but instead of amplification of purified DNA and hybridization to a microarray, the DNA fragments are directly sequenced using next generation parallel re-sequencing. It has proven to be an effective method for analyzing the global histone modification patterns and protein target sites, providing higher resolution than previous methods. [13] [21]

DNA methylation assays

Techniques for characterizing primary DNA sequences could not be directly applied to methylation assays. For example, when DNA was amplified in PCR or bacterial cloning techniques, the methylation pattern was not copied and thus the information lost. The DNA hybridization technique used in DNA assays, in which radioactive probes were used to map and identify DNA sequences, could not be used to distinguish between methylated and non-methylated DNA. [23] [9]

Restriction endonuclease based methods

Non genome-wide approaches

The earliest methylation detection assays used methylation modification sensitive restriction endonucleases. Genomic DNA was digested with both methylation-sensitive and insensitive restriction enzymes recognizing the same restriction site. The idea being that whenever the site was methylated, only the methylation insensitive enzyme could cleave at that position. By comparing restriction fragment sizes generated from the methylation-sensitive enzyme to those of the methylation-insensitive enzyme, it was possible to determine the methylation pattern of the region. This analysis step was done by amplifying the restriction fragments via PCR, separating them through gel electrophoresis and analyzing them via southern blot with probes for the region of interest. [23] [9]

This technique was used to compare the DNA methylation modification patterns in the human adult and hemoglobin gene loci. Different regions of the gene (gamma delta beta globin) were known to be expressed at different stages of development. [24] Consistent with a role of DNA methylation in gene repression, regions that were associated with high levels of DNA methylation were not actively expressed. [25]

This method was limited not suitable for studies on the global methylation pattern, or ‘methylome’. Even within specific loci it was not fully representative of the true methylation pattern as only those restriction sites with corresponding methylation sensitive and insensitive restriction assays could provide useful information. Further complications could arise when incomplete digestion of DNA by restriction enzymes generated false negative results. [9]

Genome wide approaches

DNA methylation profiling on a large scale was first made possible through the Restriction Landmark Genome Scanning (RLGS) technique. Like the locus-specific DNA methylation assay, the technique identified methylated DNA via its digestion methylation sensitive enzymes. However it was the use of two-dimensional gel electrophoresis that allowed be characterized on a broader scale. [9]

However it was not until the advent of microarray and next generation sequencing technology when truly high resolution and genome-wide DNA methylation became possible. [26] As with RLGS, the endonuclease component is retained in the method but it is coupled to new technologies. One such approach is the differential methylation hybridization (DMH), in which one set of genomic DNA is digested with methylation-sensitive restriction enzymes and a parallel set of DNA is not digested. Both sets of DNA are subsequently amplified and each labelled with fluorescent dyes and used in two-colour array hybridization. The level of DNA methylation at a given loci is determined by the relative intensity ratios of the two dyes. Adaptation of next generation sequencing to DNA methylation assay provides several advantages over array hybridization. Sequence-based technology provides higher resolution to allele specific DNA methylation, can be performed on larger genomes, and does not require creation of DNA microarrays which require adjustments based on CpG density to properly function. [9]

Bisulfite sequencing

Bisulfite sequencing relies on chemical conversion of unmethylated cytosines exclusively, such that they can be identified through standard DNA sequencing techniques. Sodium bisulfate and alkaline treatment does this by converting unmethylated cytosine residues into uracil while leaving methylated cytosine unaltered. Subsequent amplification and sequencing of untreated DNA and sodium bisulphite treated DNA allows for methylated sites to be identified. Bisulfite sequencing, like the traditional restriction based methods, was historically limited to methylation patterns of specific gene loci, until whole genome sequencing technologies became available. However, unlike traditional restriction based methods, bisulfite sequencing provided resolution on a nucleotide level. [23] [9]

Limitations of the bisulfite technique include the incomplete conversion of cytosine to uracil, which is a source of false positives. Further, bisulfite treatment also causes DNA degradation and requires an additional purification step to remove the sodium bisulfite. [9]

Next-generation sequencing is well suited in complementing bisulfite sequencing in genome-wide methylation analysis. While this now allows for methylation pattern to be determined on the highest resolution possible, on the single nucleotide level, challenges still remain in the assembly step because of reduced sequence complexity in bisulphite treated DNA. Increases in read length seek to address this challenge, allowing for whole genome shotgun bisulphite sequencing (WGBS) to be performed. The WGBS approach using an Illumina Genome Analyzer platform and has already been implemented in Arabidopsis thaliana . [9] Reduced representation genomic methods based on bisulfite sequencing exist as well, [27] [28] and they are particularly suitable for species with large genome sizes. [29]

Chromatin accessibility assays

Chromatin accessibility is the measure of how "accessible" or "open" a region of genome is to transcription or binding of transcription factors. The regions which are inaccessible (i.e. because they're bound by nucleosomes) are not actively transcribed by the cell while open and accessible regions are actively transcribed. [30] Changes in chromatin accessibility are important epigenetic regulatory processes that govern cell- or context-specific expression of genes. [31] Assays such as MNase-seq, DNase-seq, ATAC-seq or FAIRE-seq are routinely used to understand the accessible chromatin landscape of cells. The main feature of all these methods is that they're able to selectively isolate either the DNA sequences that are bounded to the histones, or those that are not. These sequences are then compared to a reference genome that allows to identify their relative position. [32]

MNase-seq and DNase-seq both follow the same principles, as they employ lytic enzymes that target nucleic acids to cut the DNA strands unbounded by nucleosomes or other proteic factors, while the bounded pieces are sheltered, and can be retrieved and analysed. Since active, unbound regions are destroyed, their detection can only be indirect, by sequencing with a Next Generation Sequencing technique and comparison with a reference. MNase-seq utilises a micrococcal nuclease that produces a single strand cleavage on the opposite strand of the target sequence. [33] DNase-seq employs DNase I, a non-specific double strand-cleaving endonuclease. This technique has been used to such an extent that nucleosome-free regions have been labelled as DHSs, DNase I hypersensitive sites, [34] and has been ENCODE consortium's election method for genome wide chromatin accessibility analyses. [35] The main issue of this technique is that the cleavage distribution can be biased, [36] lowering the quality of the results.

FAIRE-seq (Formaldehyde-Assisted Isolation of Regulatory Elements) requires as its first step crosslinking of the DNA with nucleosomes, then DNA shearing by sonication. The free and linked fragments are separated with a traditional phenol-chloroform extraction, since the proteic fraction is stuck in the interphase while the unlinked DNA shifts to the aqueous phase and can be analysed with various methods. [37] Sonication produces random breaks, and therefore is not subject to any kind of bias, and is also the bigger length of the fragments (200-700 nt) makes this technique suitable for wider regions, while it's unable to resolve the single nucleosome. [32] Unlike the nuclease-based methods, FAIRE-seq allows the direct identification of the transcriptionally active sites, and a less laborious sample preparation. [38]

ATAC-seq is based on the activity of Tn5 transposase. The transposase is used to insert tags in the genome, with higher frequency on regions not covered by proteic factors. The tags are then used as adapters for PRC or other analytical tools. [39]

Direct detection

Polymerase sensitivity in single-molecule real-time sequencing made it possible for scientists to directly detect epigenetic marks such as methylation as the polymerase moves along the DNA molecule being sequenced. [40] Several projects have demonstrated the ability to collect genome-wide epigenetic data in bacteria. [41] [42] [43] [44]

Nanopore sequencing is based on changes of electrolytic current signals according to base modifications (e.g. Methylation). A polymerase mediates the entrance of ssDNA in the pore: the ion-current variation is modulated by a section of the pore and the consequently generated difference is recorded revealing the position of CpG. Discrimination between hydroxymethylation and methylation is possible thanks to solid-state nanopores even if the current while passing through the high-field region of the pore may be slightly influenced in it. [45] As a reference amplified DNA is used which will not present copied methylationed sites after the PCR process. [46] The Oxford Nanopore Technologies MinION sequencer is a technology where, according to a hidden Markov model, it is possible to distinguish unmethylated cytosine from the methylated one even without chemical treatment that acts to enhance the signal of that modification. The data are registered commonly in picoamperes during established time. Other devices are the Nanopolish and the SignaAlign: the former expresses the frequency of a methylation in a read while the latter gives a probability of it derived from the sum of all the reads. [47]

Single-molecule real-time sequencing (SMRT) is a single-molecule DNA sequencing method. Single-molecule real-time sequencing utilizes a zero-mode waveguide (ZMW). A single DNA polymerase enzyme is bound to the bottom of a ZMW with a single molecule of DNA as a template. Each of the four DNA bases is attached to one of four different fluorescent dyes. When a nucleotide is incorporated by the DNA polymerase, the fluorescent tag is cleaved off and the detector detects the fluorescent signal of the nucleotide incorporation. As the sequencing occurs, the polymerase enzyme kinetics shift when it encounters a region of methylation or any other base modification. When the enzyme encounters chemically modified bases, it will slow down or speed up in a uniquely identifiable way. Fluorescence pulses in SMRT sequencing are characterized not only by their emission spectra but also by their duration and by the interval between successive pulses. These metrics, defined as pulse width and interpulse duration (IPD), add valuable information about DNA polymerase kinetics. Pulse width is a function of all kinetic steps after nucleotide binding and up to fluorophore release, and IPD is determined by the kinetics of nucleotide binding and polymerase translocation.

In 2010 a team of scientists demonstrated the use of single-molecule real-time sequencing for direct detection of modified nucleotide in the DNA template including N6-methyladenosine, 5-methylcytosine and 5-hydroxylcytosine. These various modifications affect polymerase kinetics differently, allowing discrimination between them. [48]

In 2017, another team proposed a combined bisulfite conversion with third-generation single-molecule real-time sequencing, it is called single-molecule real-time bisulfite sequencing (SMRT-BS), which is an accurate targeted CpG methylation analysis method capable of a high degree of multiplying and long read lengths (1.5 kb) without the need for PCR amplicon sub-cloning. [49]

Theoretical modeling approaches

First mathematical models for different nucleosome states affecting gene expression were introduced in 1980s [ref]. Later, this idea was almost forgotten, until the experimental evidence has indicated a possible role of covalent histone modifications as an epigenetic code. [50] In the next several years, high-throughput data have indeed uncovered the abundance of epigenetic modifications and their relation to chromatin functioning which has motivated new theoretical models for the appearance, maintaining and changing these patterns,. [51] [52] These models are usually formulated in the frame of one-dimensional lattice approaches. [53]

See also

Notes

  1. 1 2 3 Russell 2010, p. 217.
  2. 1 2 Russell 2010, p. 230.
  3. 1 2 Russell 2010, p. 475.
  4. Alabert C, Groth A (February 2012). "Chromatin replication and epigenome maintenance" (PDF). Nature Reviews. Molecular Cell Biology. 13 (3): 153–67. doi:10.1038/nrm3288. PMID   22358331. S2CID   10911203.
  5. Ghosh S, Sinha JK, Raghunath M (September 2016). "Epigenomic maintenance through dietary intervention can facilitate DNA repair process to slow down the progress of premature aging". IUBMB Life. 68 (9): 717–21. doi: 10.1002/iub.1532 . PMID   27364681.
  6. "The Potential Epigenetic and Anticancer Power of Dietary Flavones". 2016-10-11.
  7. 1 2 Zhu J, Adli M, Zou JY, Verstappen G, Coyne M, Zhang X, et al. (January 2013). "Genome-wide chromatin state transitions associated with developmental and environmental cues". Cell. 152 (3): 642–54. doi:10.1016/j.cell.2012.12.033. PMC   3563935 . PMID   23333102.
  8. Russell 2010, p. 597.
  9. 1 2 3 4 5 6 7 8 9 10 11 Laird PW (March 2010). "Principles and challenges of genomewide DNA methylation analysis". Nature Reviews. Genetics. 11 (3): 191–203. doi:10.1038/nrg2732. PMID   20125086. S2CID   6780101.
  10. Russell 2010, pp. 531–2.
  11. 1 2 Bird A (January 2002). "DNA methylation patterns and epigenetic memory". Genes & Development. 16 (1): 6–21. doi: 10.1101/gad.947102 . PMID   11782440.
  12. Russell 2010, pp. 532–3.
  13. 1 2 3 4 5 Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, et al. (May 2007). "High-resolution profiling of histone methylations in the human genome". Cell. 129 (4): 823–37. doi: 10.1016/j.cell.2007.05.009 . PMID   17512414. S2CID   6326093.
  14. 1 2 3 4 5 6 7 Kouzarides T (February 2007). "Chromatin modifications and their function". Cell. 128 (4): 693–705. doi: 10.1016/j.cell.2007.02.005 . PMID   17320507. S2CID   11691263.
  15. Russell 2010, pp. 24–7.
  16. 1 2 Russell 2010, pp. 529–30.
  17. Russell 2010, p. 530.
  18. Russell 2010, p. 218.
  19. Gross DS, Garrard WT (1988). "Nuclease hypersensitive sites in chromatin". Annual Review of Biochemistry. 57: 159–97. doi:10.1146/annurev.bi.57.070188.001111. PMID   3052270.
  20. Russell 2010, p. 529.
  21. 1 2 Gibson & Muse 2009, pp. 229–32.
  22. Russell 2010, p. 532.
  23. 1 2 3 Eads CA, Danenberg KD, Kawakami K, Saltz LB, Blake C, Shibata D, et al. (April 2000). "MethyLight: a high-throughput assay to measure DNA methylation". Nucleic Acids Research. 28 (8): 32e–0. doi:10.1093/nar/28.8.e32. PMC   102836 . PMID   10734209.
  24. Russell 2010, pp. 552–3.
  25. van der Ploeg LH, Flavell RA (April 1980). "DNA methylation in the human gamma delta beta-globin locus in erythroid and nonerythroid tissues". Cell. 19 (4): 947–58. doi:10.1016/0092-8674(80)90086-0. PMID   6247075. S2CID   54324289.
  26. Johannes F, Colot V, Jansen RC (November 2008). "Epigenome dynamics: a quantitative genetics perspective" (PDF). Nature Reviews. Genetics. 9 (11): 883–90. doi:10.1038/nrg2467. hdl: 11370/731f6c62-6749-4e67-aa72-c2242f95527a . PMID   18927581. S2CID   7641577.
  27. Trucchi E, Mazzarella AB, Gilfillan GD, Lorenzo MT, Schönswetter P, Paun O (April 2016). "BsRADseq: screening DNA methylation in natural populations of non-model species". Molecular Ecology. 25 (8): 1697–1713. doi:10.1111/mec.13550. PMC   4949719 . PMID   26818626.
  28. van Gurp TP, Wagemaker NC, Wouters B, Vergeer P, Ouborg JN, Verhoeven KJ (April 2016). "epiGBS: reference-free reduced representation bisulfite sequencing". Nature Methods. 13 (4): 322–324. doi:10.1038/nmeth.3763. PMID   26855363. S2CID   10457615.
  29. Paun O, Verhoeven KJ, Richards CL (January 2019). "Opportunities and limitations of reduced representation bisulfite sequencing in plant ecological epigenomics". The New Phytologist. 221 (2): 738–742. doi:10.1111/nph.15388. PMC   6504643 . PMID   30121954.
  30. Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, et al. (February 2015). "Integrative analysis of 111 reference human epigenomes". Nature. 518 (7539): 317–30. Bibcode:2015Natur.518..317.. doi:10.1038/nature14248. PMC   4530010 . PMID   25693563.
  31. Pennacchio LA, Bickmore W, Dean A, Nobrega MA, Bejerano G (April 2013). "Enhancers: five essential questions". Nature Reviews. Genetics. 14 (4): 288–95. doi:10.1038/nrg3458. PMC   4445073 . PMID   23503198.
  32. 1 2 Zhang Z, Pugh BF (January 2011). "High-resolution genome-wide mapping of the primary structure of chromatin". Cell. 144 (2): 175–86. doi:10.1016/j.cell.2011.01.003. PMC   3061432 . PMID   21241889.
  33. Axel R (July 1975). "Cleavage of DNA in nuclei and chromatin with staphylococcal nuclease". Biochemistry. 14 (13): 2921–5. doi:10.1021/bi00684a020. PMID   1148185.
  34. Zhou W, Sherwood B, Ji Z, Xue Y, Du F, Bai J, et al. (October 2017). "Genome-wide prediction of DNase I hypersensitivity using gene expression". Nature Communications. 8 (1): 1038. Bibcode:2017NatCo...8.1038Z. doi:10.1038/s41467-017-01188-x. PMC   5715040 . PMID   29051481.
  35. Aldred SF, Collins PJ, Davis CA, Doyle F, Epstein CB, Frietze S, et al. (ENCODE Project Consortium) (September 2012). "An integrated encyclopedia of DNA elements in the human genome". Nature. 489 (7414): 57–74. Bibcode:2012Natur.489...57T. doi:10.1038/nature11247. PMC   3439153 . PMID   22955616.
  36. He HH, Meyer CA, Hu SS, Chen MW, Zang C, Liu Y, et al. (January 2014). "Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification". Nature Methods. 11 (1): 73–78. doi:10.1038/nmeth.2762. PMC   18771 . PMID   24317252.
  37. Tsompana M, Buck MJ (20 November 2014). "Chromatin accessibility: a window into the genome". Epigenetics & Chromatin. 7 (1): 33. doi: 10.1186/1756-8935-7-33 . PMC   4253006 . PMID   25473421.
  38. Giresi PG, Kim J, McDaniell RM, Iyer VR, Lieb JD (June 2007). "FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin". Genome Research. 17 (6): 877–85. doi:10.1101/gr.5533506. PMC   1891346 . PMID   17179217.
  39. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ (December 2013). "Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position". Nature Methods. 10 (12): 1213–8. doi:10.1038/nmeth.2688. PMC   3959825 . PMID   24097267.
  40. Schadt EE, Banerjee O, Fang G, Feng Z, Wong WH, Zhang X, et al. (January 2013). "Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases". Genome Research. 23 (1): 129–41. doi:10.1101/gr.136739.111. PMC   3530673 . PMID   23093720.
  41. Davis BM, Chao MC, Waldor MK (April 2013). "Entering the era of bacterial epigenomics with single molecule real time DNA sequencing". Current Opinion in Microbiology. 16 (2): 192–8. doi:10.1016/j.mib.2013.01.011. PMC   3646917 . PMID   23434113.
  42. Lluch-Senar M, Luong K, Lloréns-Rico V, Delgado J, Fang G, Spittle K, et al. (2013). "Comprehensive methylome characterization of Mycoplasma genitalium and Mycoplasma pneumoniae at single-base resolution". PLOS Genetics. 9 (1): e1003191. doi: 10.1371/journal.pgen.1003191 . PMC   3536716 . PMID   23300489.
  43. Murray IA, Clark TA, Morgan RD, Boitano M, Anton BP, Luong K, et al. (December 2012). "The methylomes of six bacteria". Nucleic Acids Research. 40 (22): 11450–62. doi:10.1093/nar/gks891. PMC   3526280 . PMID   23034806.
  44. Fang G, Munera D, Friedman DI, Mandlik A, Chao MC, Banerjee O, et al. (December 2012). "Genome-wide mapping of methylated adenine residues in pathogenic Escherichia coli using single-molecule real-time sequencing". Nature Biotechnology. 30 (12): 1232–9. doi:10.1038/nbt.2432. PMC   3879109 . PMID   23138224.
  45. Simpson JT, Workman RE, Zuzarte PC, David M, Dursi LJ, Timp W (April 2017). "Detecting DNA cytosine methylation using nanopore sequencing" (PDF). Nature Methods. 14 (4): 407–410. doi:10.1038/nmeth.4184. PMID   28218898. S2CID   16152628.
  46. Laszlo AH, Derrington IM, Brinkerhoff H, Langford KW, Nova IC, Samson JM, et al. (November 2013). "Detection and mapping of 5-methylcytosine and 5-hydroxymethylcytosine with nanopore MspA". Proceedings of the National Academy of Sciences of the United States of America. 110 (47): 18904–9. Bibcode:2013PNAS..11018904L. doi: 10.1073/pnas.1310240110 . PMC   3839702 . PMID   24167255.
  47. Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, et al. (April 2018). "Nanopore sequencing and assembly of a human genome with ultra-long reads". Nature Biotechnology. 36 (4): 338–345. doi:10.1038/nbt.4060. PMC   5889714 . PMID   29431738.
  48. Flusberg BA, Webster DR, Lee JH, Travers KJ, Olivares EC, Clark TA, et al. (June 2010). "Direct detection of DNA methylation during single-molecule, real-time sequencing". Nature Methods. 7 (6): 461–5. doi:10.1038/nmeth.1459. PMC   2879396 . PMID   20453866.
  49. Yang Y, Scott SA (2017). "DNA Methylation Profiling Using Long-Read Single Molecule Real-Time Bisulfite Sequencing (SMRT-BS)". Functional Genomics. Methods in Molecular Biology. Vol. 1654. pp. 125–134. doi:10.1007/978-1-4939-7231-9_8. ISBN   978-1-4939-7230-2. PMID   28986786.
  50. Strahl BD, Allis CD (January 2000). "The language of covalent histone modifications". Nature. 403 (6765): 41–5. Bibcode:2000Natur.403...41S. doi:10.1038/47412. PMID   10638745. S2CID   4418993.
  51. Sedighi M, Sengupta AM (November 2007). "Epigenetic chromatin silencing: bistability and front propagation". Physical Biology. 4 (4): 246–55. arXiv: 0710.3889 . Bibcode:2007PhBio...4..246S. doi:10.1088/1478-3975/4/4/002. PMC   2267688 . PMID   17991991.
  52. Dodd IB, Micheelsen MA, Sneppen K, Thon G (May 2007). "Theoretical analysis of epigenetic cell memory by nucleosome modification". Cell. 129 (4): 813–22. doi: 10.1016/j.cell.2007.02.053 . PMID   17512413. S2CID   16091877.
  53. Teif VB, Rippe K (October 2010). "Statistical-mechanical lattice models for protein-DNA binding in chromatin". Journal of Physics: Condensed Matter. 22 (41): 414105. arXiv: 1004.5514 . Bibcode:2010JPCM...22O4105T. doi:10.1088/0953-8984/22/41/414105. PMID   21386588. S2CID   103345.

Related Research Articles

<span class="mw-page-title-main">Epigenome</span> Biological term

In biology, the epigenome of an organism is the collection of chemical changes to its DNA and histone proteins that affects when, where, and how the DNA is expressed; these changes can be passed down to an organism's offspring via transgenerational stranded epigenetic inheritance. Changes to the epigenome can result in changes to the structure of chromatin and changes to the function of the genome.

H3K4me3 is an epigenetic modification to the DNA packaging protein Histone H3 that indicates tri-methylation at the 4th lysine residue of the histone H3 protein and is often involved in the regulation of gene expression. The name denotes the addition of three methyl groups (trimethylation) to the lysine 4 on the histone H3 protein.

H3K27ac is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates acetylation of the lysine residue at N-terminal position 27 of the histone H3 protein.

H3K9me3 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the tri-methylation at the 9th lysine residue of the histone H3 protein and is often associated with heterochromatin.

H3K4me1 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the mono-methylation at the 4th lysine residue of the histone H3 protein and often associated with gene enhancers.

H3K36me3 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the tri-methylation at the 36th lysine residue of the histone H3 protein and often associated with gene bodies.

H3K79me2 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the di-methylation at the 79th lysine residue of the histone H3 protein. H3K79me2 is detected in the transcribed regions of active genes.

H4K20me is an epigenetic modification to the DNA packaging protein Histone H4. It is a mark that indicates the mono-methylation at the 20th lysine residue of the histone H4 protein. This mark can be di- and tri-methylated. It is critical for genome integrity including DNA damage repair, DNA replication and chromatin compaction.

H4K16ac is an epigenetic modification to the DNA packaging protein Histone H4. It is a mark that indicates the acetylation at the 16th lysine residue of the histone H4 protein.

H4K5ac is an epigenetic modification to the DNA packaging protein histone H4. It is a mark that indicates the acetylation at the 5th lysine residue of the histone H4 protein. H4K5 is the closest lysine residue to the N-terminal tail of histone H4. It is enriched at the transcription start site (TSS) and along gene bodies. Acetylation of histone H4K5 and H4K12ac is enriched at centromeres.

H3K14ac is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the acetylation at the 14th lysine residue of the histone H3 protein.

H3K36ac is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the acetylation at the 36th lysine residue of the histone H3 protein.

H3K36me2 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the di-methylation at the 36th lysine residue of the histone H3 protein.

H3K36me is an epigenetic modification to the DNA packaging protein Histone H3, specifically, the mono-methylation at the 36th lysine residue of the histone H3 protein.

H3R42me is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the mono-methylation at the 42nd arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

H3R17me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 17th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

H3R26me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 26th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

H3R8me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 8th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

H3R2me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 2nd arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

H4R3me2 is an epigenetic modification to the DNA packaging protein histone H4. It is a mark that indicates the di-methylation at the 3rd arginine residue of the histone H4 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

References

Further reading