Regulatory sequence

A regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism. Regulation of gene expression is an essential feature of all living organisms and viruses.

Description

The structure of a eukaryotic protein-coding gene. Regulatory sequences control when and where expression occurs for the protein-coding region (red). Promoter and enhancer regions (yellow) regulate the transcription of the gene into a pre-mRNA, which is modified to remove introns (light grey) and add a 5' cap and poly-A tail (dark grey). The mRNA 5' and 3' untranslated regions (blue) regulate translation into the final protein product. [1]

In DNA, regulation of gene expression normally happens at the level of RNA biosynthesis (transcription). It is accomplished through the sequence-specific binding of proteins (transcription factors) that activate or inhibit transcription. Transcription factors may act as activators, repressors, or both. Repressors often act by preventing RNA polymerase from forming a productive complex with the transcriptional initiation region (promoter), while activators facilitate formation of a productive complex. Furthermore, DNA motifs have been shown to be predictive of epigenomic modifications, suggesting that transcription factors play a role in regulating the epigenome. [2]

In RNA, regulation may occur at the level of protein biosynthesis (translation), RNA cleavage, RNA splicing, or transcriptional termination. Regulatory sequences are frequently associated with messenger RNA (mRNA) molecules, where they are used to control mRNA biogenesis or translation. A variety of biological molecules may bind to the RNA to accomplish this regulation, including proteins (e.g., translational repressors and splicing factors), other RNA molecules (e.g., miRNA) and small molecules, in the case of riboswitches.

Activation and implementation

A regulatory DNA sequence does not regulate unless it is activated. Different regulatory sequences are activated and then implement their regulation by different mechanisms.

Enhancer activation and implementation

Regulation of transcription in mammals. An active enhancer regulatory sequence of DNA is enabled to interact with the promoter DNA regulatory sequence of its target gene by formation of a chromosome loop. This can initiate messenger RNA (mRNA) synthesis by RNA polymerase II (RNAP II) bound to the promoter at the transcription start site of the gene. The loop is stabilized by one architectural protein anchored to the enhancer and one anchored to the promoter and these proteins are joined to form a dimer (red zigzags). Specific regulatory transcription factors bind to DNA sequence motifs on the enhancer. General transcription factors bind to the promoter. When a transcription factor is activated by a signal (here indicated as phosphorylation shown by a small red star on a transcription factor on the enhancer) the enhancer is activated and can now activate its target promoter. The active enhancer is transcribed on each strand of DNA in opposite directions by bound RNAP IIs. Mediator (a complex consisting of about 26 proteins in an interacting structure) communicates regulatory signals from the enhancer DNA-bound transcription factors to the promoter.

Expression of genes in mammals can be upregulated when signals are transmitted to the promoters associated with the genes. Cis-regulatory DNA sequences that are located in DNA regions distant from the promoters of genes can have very large effects on gene expression, with some genes undergoing up to 100-fold increased expression due to such a cis-regulatory sequence. [3] These cis-regulatory sequences include enhancers, silencers, insulators and tethering elements. [4] Among this constellation of sequences, enhancers and their associated transcription factor proteins have a leading role in the regulation of gene expression. [5]

Enhancers are sequences of the genome that are major gene-regulatory elements. Enhancers control cell-type-specific gene expression programs, most often by looping through long distances to come into physical proximity with the promoters of their target genes. [6] In a study of brain cortical neurons, 24,937 loops were found that bring enhancers to promoters. [3] Multiple enhancers, each often tens or hundreds of thousands of nucleotides distant from their target genes, loop to their target gene promoters and coordinate with each other to control expression of their common target gene. [6]

The schematic illustration in this section shows an enhancer looping around to come into close physical proximity with the promoter of a target gene. The loop is stabilized by a dimer of a connector protein (e.g. a dimer of CTCF or YY1), with one member of the dimer anchored to its binding motif on the enhancer and the other member anchored to its binding motif on the promoter (represented by the red zigzags in the illustration). [7] Several cell-function-specific transcription factors generally bind to specific motifs on an enhancer, [9] and a small combination of these enhancer-bound transcription factors, when brought close to a promoter by a DNA loop, governs the level of transcription of the target gene. (In 2018, Lambert et al. indicated that there were about 1,600 transcription factors in a human cell. [8]) The Mediator coactivator (a complex usually consisting of about 26 proteins in an interacting structure) communicates regulatory signals from enhancer DNA-bound transcription factors directly to the RNA polymerase II (RNAP II) enzyme bound to the promoter. [10]

Enhancers, when active, are generally transcribed from both strands of DNA with RNA polymerases acting in two different directions, producing two eRNAs as illustrated in the Figure. [11] An inactive enhancer may be bound by an inactive transcription factor. Phosphorylation of the transcription factor may activate it and that activated transcription factor may then activate the enhancer to which it is bound (see small red star representing phosphorylation of a transcription factor bound to an enhancer in the illustration). [12] An activated enhancer begins transcription of its RNA before activating a promoter to initiate transcription of messenger RNA from its target gene. [13]

CpG island methylation and demethylation

A methyl group is added on the carbon at the number 5 position of the ring to form 5-methylcytosine

5-Methylcytosine (5-mC) is a methylated form of the DNA base cytosine (see figure). 5-mC is an epigenetic marker found predominantly on cytosines within CpG dinucleotides (CpG sites), in which a cytosine is followed by a guanine reading in the 5′ to 3′ direction along the DNA strand. About 28 million CpG dinucleotides occur in the human genome. [14] In most tissues of mammals, on average, 70% to 80% of CpG cytosines are methylated (forming 5-methyl-CpG, or 5-mCpG). [15] CpG sites often occur in clusters, called CpG islands. About 59% of promoter sequences have a CpG island, while only about 6% of enhancer sequences have one. [16] CpG islands act as regulatory sequences: methylation of a CpG island in the promoter of a gene can reduce or silence expression of that gene. [17]
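The definition of a CpG site and the figures quoted above can be illustrated with a minimal Python sketch. The function names and the example sequence are hypothetical and chosen only for illustration; the sketch locates each cytosine that is directly followed by a guanine on one strand, and works out the number of methylated CpGs implied by the 28 million and 70% to 80% figures.

```python
def cpg_positions(seq: str) -> list[int]:
    """Return 0-based indices of every C that is immediately followed by G (read 5' to 3')."""
    s = seq.upper()
    return [i for i in range(len(s) - 1) if s[i:i + 2] == "CG"]


def methylated_cpg_range(total_cpgs: int, low_pct: int = 70, high_pct: int = 80) -> tuple[int, int]:
    """Estimate how many CpGs are methylated, using integer percentages to avoid float rounding."""
    return total_cpgs * low_pct // 100, total_cpgs * high_pct // 100


example = "ATCGCGTTACGGCA"  # made-up sequence, not taken from any genome
print(cpg_positions(example))            # [2, 4, 9]
print(methylated_cpg_range(28_000_000))  # (19600000, 22400000)
```

Under these assumptions, roughly 19.6 to 22.4 million CpG cytosines would be methylated in a typical mammalian tissue.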

DNA methylation regulates gene expression through interaction with methyl binding domain (MBD) proteins, such as MeCP2, MBD1 and MBD2. These MBD proteins bind most strongly to highly methylated CpG islands. [18] These MBD proteins have both a methyl-CpG-binding domain and a transcriptional repression domain. [18] They bind to methylated DNA and guide or direct protein complexes with chromatin remodeling and/or histone modifying activity to methylated CpG islands. MBD proteins generally repress local chromatin by means such as catalyzing the introduction of repressive histone marks or creating an overall repressive chromatin environment through nucleosome remodeling and chromatin reorganization. [18]

Transcription factors are proteins that bind to specific DNA sequences in order to regulate the expression of a given gene. The binding sequence for a transcription factor in DNA is usually about 10 or 11 nucleotides long. There are approximately 1,400 different transcription factors encoded in the human genome and they constitute about 6% of all human protein coding genes. [19] About 94% of transcription factor binding sites that are associated with signal-responsive genes occur in enhancers while only about 6% of such sites occur in promoters. [9]
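Sequence-specific binding can be sketched as a simple motif scan. The example below is a rough illustration only: the 10-nucleotide motif and the test sequence are hypothetical and do not represent a curated binding site, and real analyses usually rely on position weight matrices rather than exact consensus matching. The scan checks both strands by also searching for the reverse complement of the motif.

```python
import re

# IUPAC nucleotide codes mapped to regular-expression character classes
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
}
COMPLEMENT = str.maketrans("ACGTRYSWKMN", "TGCAYRSWMKN")


def reverse_complement(motif: str) -> str:
    """Reverse complement of a motif written with IUPAC codes."""
    return motif.translate(COMPLEMENT)[::-1]


def find_sites(seq: str, motif: str) -> list[tuple[int, str, str]]:
    """Return (start, strand, matched text) for each motif match on either strand.

    Positions are 0-based on the given (plus) strand. Overlapping matches are
    reported; a palindromic motif is reported once per strand.
    """
    seq = seq.upper()
    hits = []
    for strand, m in (("+", motif), ("-", reverse_complement(motif))):
        pattern = re.compile("(?=(" + "".join(IUPAC[b] for b in m) + "))")
        hits.extend((mt.start(), strand, mt.group(1)) for mt in pattern.finditer(seq))
    return sorted(hits)


motif = "GCGKGGGCGT"                      # hypothetical 10-nucleotide motif
sequence = "AAGCGTGGGCGTTTACGCCCACGCAA"   # made-up sequence
print(find_sites(sequence, motif))
# [(2, '+', 'GCGTGGGCGT'), (14, '-', 'ACGCCCACGC')]
```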

EGR1 is a transcription factor important for regulation of methylation of CpG islands. An EGR1 transcription factor binding site is frequently located in enhancer or promoter sequences. [20] There are about 12,000 binding sites for EGR1 in the mammalian genome and about half of EGR1 binding sites are located in promoters and half in enhancers. [20] The binding of EGR1 to its target DNA binding site is insensitive to cytosine methylation in the DNA. [20]

While only small amounts of EGR1 protein are detectable in unstimulated cells, translation of EGR1 into protein is markedly elevated one hour after stimulation. [21] Expression of EGR1 in various types of cells can be stimulated by growth factors, neurotransmitters, hormones, stress and injury. [21] In the brain, when neurons are activated, EGR1 proteins are upregulated, and they bind to (recruit) pre-existing TET1 enzymes, which are highly expressed in neurons. TET enzymes can catalyze demethylation of 5-methylcytosine. When EGR1 transcription factors bring TET1 enzymes to EGR1 binding sites in promoters, the TET1 enzymes can demethylate the methylated CpG islands at those promoters. Upon demethylation, these promoters can then initiate transcription of their target genes. Hundreds of genes in neurons are differentially expressed after neuron activation through EGR1 recruitment of TET1 to methylated regulatory sequences in their promoters. [20]

Activation by double- or single-strand breaks

About 600 regulatory sequences in promoters and about 800 regulatory sequences in enhancers appear to depend on double-strand breaks initiated by topoisomerase 2β (TOP2B) for activation. [22] [23] The induction of particular double-strand breaks is specific with respect to the inducing signal. When neurons are activated in vitro, just 22 TOP2B-induced double-strand breaks occur in their genomes. [24] However, when contextual fear conditioning is carried out in a mouse, this conditioning causes hundreds of gene-associated double-strand breaks in the medial prefrontal cortex and hippocampus, which are important for learning and memory. [25]

Regulatory sequence in a promoter at a transcription start site with a paused RNA polymerase and a TOP2B-induced double-strand break

Such TOP2B-induced double-strand breaks are accompanied by at least four enzymes of the non-homologous end joining (NHEJ) DNA repair pathway (DNA-PKcs, KU70, KU80 and DNA ligase IV) (see figure). These enzymes repair the double-strand breaks within about 15 minutes to 2 hours. [24] [26] The double-strand breaks in the promoter are thus associated with TOP2B and at least these four repair enzymes. These proteins are present simultaneously on a single promoter nucleosome (about 147 nucleotides of DNA are wrapped around a single nucleosome) located near the transcription start site of their target gene. [26]

The double-strand break introduced by TOP2B apparently frees the part of the promoter at an RNA polymerase–bound transcription start site to physically move to its associated enhancer. This allows the enhancer, with its bound transcription factors and mediator proteins, to directly interact with the RNA polymerase that had been paused at the transcription start site to start transcription. [24] [10]

Similarly, topoisomerase I (TOP1) enzymes appear to be located at many enhancers, and those enhancers become activated when TOP1 introduces a single-strand break. [27] TOP1 causes single-strand breaks in particular enhancer DNA regulatory sequences when signaled by a specific enhancer-binding transcription factor. [27] Topoisomerase I breaks are associated with different DNA repair factors than those surrounding TOP2B breaks. In the case of TOP1, the breaks are associated most immediately with DNA repair enzymes MRE11, RAD50 and ATR. [27]

Examples

Genomes can be analyzed systematically to identify regulatory regions. [28] Conserved non-coding sequences often contain regulatory regions, and so they are often the subject of these analyses.
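One simple way to look for conserved non-coding stretches is a sliding-window identity scan over two sequences that have already been aligned. The sketch below is an illustrative assumption rather than the method of the cited analysis: the function name, window size, identity threshold and example sequences are all hypothetical, and practical pipelines use multi-species alignments and statistical conservation scores.

```python
def conserved_windows(a: str, b: str, window: int = 10,
                      min_identity: float = 0.9) -> list[tuple[int, float]]:
    """Return (start, identity) for each window whose identity meets the threshold.

    `a` and `b` must be pre-aligned sequences of equal length; gap characters
    ("-") never count as matches.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = [x == y and x != "-" for x, y in zip(a.upper(), b.upper())]
    hits = []
    for start in range(len(matches) - window + 1):
        identity = sum(matches[start:start + window]) / window
        if identity >= min_identity:
            hits.append((start, identity))
    return hits


# Two made-up aligned sequences: identical for the first 10 positions only
seq_a = "ACGTACGTACGGTTTT"
seq_b = "ACGTACGTACCAAAAA"
print(conserved_windows(seq_a, seq_b))  # [(0, 1.0), (1, 0.9)]
```

Windows that pass the threshold are only candidates; whether a conserved stretch is actually regulatory would still need experimental support.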

Insulin gene

Regulatory sequences for the insulin gene are: [29]

References

  1. Shafee T, Lowe R (2017). "Eukaryotic and prokaryotic gene structure". WikiJournal of Medicine. 4 (1). doi:10.15347/wjm/2017.002. ISSN 2002-4436.
  2. Whitaker JW, Chen Z, Wang W (2014). "Predicting the Human Epigenome from DNA Motifs". Nature Methods. doi:10.1038/nmeth.3065.
  3. Beagan JA, Pastuzyn ED, Fernandez LR, Guo MH, Feng K, Titus KR, et al. (June 2020). "Three-dimensional genome restructuring across timescales of activity-induced neuronal gene expression". Nature Neuroscience. 23 (6): 707–717. doi:10.1038/s41593-020-0634-6. PMC 7558717. PMID 32451484.
  4. Verheul TC, van Hijfte L, Perenthaler E, Barakat TS (2020). "The Why of YY1: Mechanisms of Transcriptional Regulation by Yin Yang 1". Frontiers in Cell and Developmental Biology. 8: 592164. doi:10.3389/fcell.2020.592164. PMC 7554316. PMID 33102493.
  5. Spitz F, Furlong EE (September 2012). "Transcription factors: from enhancer binding to developmental control". Nature Reviews. Genetics. 13 (9): 613–26. doi:10.1038/nrg3207. PMID 22868264. S2CID 205485256.
  6. Schoenfelder S, Fraser P (August 2019). "Long-range enhancer-promoter contacts in gene expression control". Nature Reviews. Genetics. 20 (8): 437–455. doi:10.1038/s41576-019-0128-0. PMID 31086298. S2CID 152283312.
  7. Weintraub AS, Li CH, Zamudio AV, Sigova AA, Hannett NM, Day DS, et al. (December 2017). "YY1 Is a Structural Regulator of Enhancer-Promoter Loops". Cell. 171 (7): 1573–1588.e28. doi:10.1016/j.cell.2017.11.008. PMC 5785279. PMID 29224777.
  8. Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, et al. (February 2018). "The Human Transcription Factors". Cell. 172 (4): 650–665. doi:10.1016/j.cell.2018.01.029. PMID 29425488.
  9. Grossman SR, Engreitz J, Ray JP, Nguyen TH, Hacohen N, Lander ES (July 2018). "Positional specificity of different transcription factor classes within enhancers". Proceedings of the National Academy of Sciences of the United States of America. 115 (30): E7222–E7230. doi:10.1073/pnas.1804663115. PMC 6065035. PMID 29987030.
  10. Allen BL, Taatjes DJ (March 2015). "The Mediator complex: a central integrator of transcription". Nature Reviews. Molecular Cell Biology. 16 (3): 155–66. doi:10.1038/nrm3951. PMC 4963239. PMID 25693131.
  11. Mikhaylichenko O, Bondarenko V, Harnett D, Schor IE, Males M, Viales RR, Furlong EE (January 2018). "The degree of enhancer or promoter activity is reflected by the levels and directionality of eRNA transcription". Genes & Development. 32 (1): 42–57. doi:10.1101/gad.308619.117. PMC 5828394. PMID 29378788.
  12. Li QJ, Yang SH, Maeda Y, Sladek FM, Sharrocks AD, Martins-Green M (January 2003). "MAP kinase phosphorylation-dependent activation of Elk-1 leads to activation of the co-activator p300". The EMBO Journal. 22 (2): 281–91. doi:10.1093/emboj/cdg028. PMC 140103. PMID 12514134.
  13. Carullo NV, Phillips III RA, Simon RC, Soto SA, Hinds JE, Salisbury AJ, et al. (September 2020). "Enhancer RNAs predict enhancer-gene regulatory links and are critical for enhancer function in neuronal systems". Nucleic Acids Research. 48 (17): 9550–9570. doi:10.1093/nar/gkaa671. PMC 7515708. PMID 32810208.
  14. Lövkvist C, Dodd IB, Sneppen K, Haerter JO (June 2016). "DNA methylation in human epigenomes depends on local topology of CpG sites". Nucleic Acids Research. 44 (11): 5123–32. doi:10.1093/nar/gkw124. PMC 4914085. PMID 26932361.
  15. Jabbari K, Bernardi G (May 2004). "Cytosine methylation and CpG, TpG (CpA) and TpA frequencies". Gene. 333: 143–9. doi:10.1016/j.gene.2004.02.043. PMID 15177689.
  16. Steinhaus R, Gonzalez T, Seelow D, Robinson PN (June 2020). "Pervasive and CpG-dependent promoter-like characteristics of transcribed enhancers". Nucleic Acids Research. 48 (10): 5306–5317. doi:10.1093/nar/gkaa223. PMC 7261191. PMID 32338759.
  17. Bird A (January 2002). "DNA methylation patterns and epigenetic memory". Genes & Development. 16 (1): 6–21. doi:10.1101/gad.947102. PMID 11782440.
  18. Du Q, Luu PL, Stirzaker C, Clark SJ (2015). "Methyl-CpG-binding domain proteins: readers of the epigenome". Epigenomics. 7 (6): 1051–73. doi:10.2217/epi.15.39. PMID 25927341.
  19. Vaquerizas JM, Kummerfeld SK, Teichmann SA, Luscombe NM (April 2009). "A census of human transcription factors: function, expression and evolution". Nature Reviews. Genetics. 10 (4): 252–63. doi:10.1038/nrg2538. PMID 19274049. S2CID 3207586.
  20. Sun Z, Xu X, He J, Murray A, Sun MA, Wei X, et al. (August 2019). "EGR1 recruits TET1 to shape the brain methylome during development and upon neuronal activity". Nature Communications. 10 (1): 3892. Bibcode:2019NatCo..10.3892S. doi:10.1038/s41467-019-11905-3. PMC 6715719. PMID 31467272.
  21. Kubosaki A, Tomaru Y, Tagami M, Arner E, Miura H, Suzuki T, et al. (2009). "Genome-wide investigation of in vivo EGR-1 binding sites in monocytic differentiation". Genome Biology. 10 (4): R41. doi:10.1186/gb-2009-10-4-r41. PMC 2688932. PMID 19374776.
  22. Dellino GI, Palluzzi F, Chiariello AM, Piccioni R, Bianco S, Furia L, et al. (June 2019). "Release of paused RNA polymerase II at specific loci favors DNA double-strand-break formation and promotes cancer translocations". Nature Genetics. 51 (6): 1011–1023. doi:10.1038/s41588-019-0421-z. PMID 31110352. S2CID 159041612.
  23. Singh S, Szlachta K, Manukyan A, Raimer HM, Dinda M, Bekiranov S, Wang YH (March 2020). "Pausing sites of RNA polymerase II on actively transcribed genes are enriched in DNA double-stranded breaks". J Biol Chem. 295 (12): 3990–4000. doi:10.1074/jbc.RA119.011665. PMC 7086017. PMID 32029477.
  24. Madabhushi R, Gao F, Pfenning AR, Pan L, Yamakawa S, Seo J, et al. (June 2015). "Activity-Induced DNA Breaks Govern the Expression of Neuronal Early-Response Genes". Cell. 161 (7): 1592–605. doi:10.1016/j.cell.2015.05.032. PMC 4886855. PMID 26052046.
  25. Stott RT, Kritsky O, Tsai LH (2021). "Profiling DNA break sites and transcriptional changes in response to contextual fear learning". PLOS ONE. 16 (7): e0249691. Bibcode:2021PLoSO..1649691S. doi:10.1371/journal.pone.0249691. PMC 8248687. PMID 34197463.
  26. Ju BG, Lunyak VV, Perissi V, Garcia-Bassets I, Rose DW, Glass CK, Rosenfeld MG (June 2006). "A topoisomerase IIbeta-mediated dsDNA break required for regulated transcription". Science. 312 (5781): 1798–802. Bibcode:2006Sci...312.1798J. doi:10.1126/science.1127196. PMID 16794079. S2CID 206508330.
  27. Puc J, Kozbial P, Li W, Tan Y, Liu Z, Suter T, et al. (January 2015). "Ligand-dependent enhancer activation regulated by topoisomerase-I activity". Cell. 160 (3): 367–80. doi:10.1016/j.cell.2014.12.023. PMC 4422651. PMID 25619691.
  28. Stepanova M, Tiazhelova T, Skoblov M, Baranova A (May 2005). "A comparative analysis of relative occurrence of transcription factor binding sites in vertebrate genomes and gene promoter areas". Bioinformatics. 21 (9): 1789–96. doi:10.1093/bioinformatics/bti307. PMID 15699025.
  29. Melloul D, Marshak S, Cerasi E (March 2002). "Regulation of insulin gene transcription". Diabetologia. 45 (3): 309–26. doi:10.1007/s00125-001-0728-y. PMID 11914736.
  30. Jang WG, Kim EJ, Park KG, Park YB, Choi HS, Kim HJ, et al. (January 2007). "Glucocorticoid receptor mediated repression of human insulin gene expression is regulated by PGC-1alpha". Biochemical and Biophysical Research Communications. 352 (3): 716–21. doi:10.1016/j.bbrc.2006.11.074. PMID 17150186.