Pioneer factors are transcription factors that can directly bind condensed chromatin. They can have positive and negative effects on transcription and are important in recruiting other transcription factors and histone modification enzymes as well as controlling DNA methylation. They were first discovered in 2002 as factors capable of binding to target sites on nucleosomal DNA in compacted chromatin and endowing competency for gene activity during hepatogenesis. [1] Pioneer factors are involved in initiating cell differentiation and activation of cell-specific genes. This property is observed in histone fold-domain containing transcription factors (fork head box (FOX) [2] and NF-Y [3] ) and other transcription factors that use zinc finger(s) for DNA binding (Groucho TLE, Gal4, and GATA). [2] [4]
The eukaryotic cell condenses its genome into tightly packed chromatin and nucleosomes. This ability saves space in the nucleus for only actively transcribed genes and hides unnecessary or detrimental genes from being transcribed. Access to these condensed regions is done by chromatin remodelling by either balancing histone modifications or directly with pioneer factors that can loosen the chromatin themselves or as a flag recruiting other factors. Pioneer factors are not necessarily required for assembly of the transcription apparatus and may dissociate after being replaced by other factors.
Pioneer factors can also actively affect transcription by directly opening up condensed chromatin in an ATP-independent process. [2] [3] This is a common trait of fork head box factors (which contain a winged helix DNA-binding domain that mimics the DNA-binding domain of the linker H1 histone [5] ), and NF-Y (whose NF-YB and NF-YC subunits contain histone-fold domains similar to those of the core histones H2A/H2B [6] ).
The similarity to histone H1 explains how fork head factors are able to bind chromatin by interacting with the major groove of only the one available side of DNA wrapped around a nucleosome. [5] [7] Fork head domains also have a helix that confers sequence specificity unlike linker histone. [5] [8] The C terminus is associated with higher mobility around the nucleosome than linker histone, displacing it and rearranging nucleosomal landscapes effectively. [7] This active re-arrangement of the nucleosomes allows for other transcription factors to bind the available DNA. In thyroid cell differentiation FoxE binds to compacted chromatin of the thyroid peroxidase promoter and opens it for NF1 binding. [9]
NF-Y is a heterotrimeric complex composed of NF-YA, NF-YB, and NF-YC subunits. The key structural feature of the NF-Y/DNA complex is the minor-groove interaction of its DNA binding domain-containing subunit NF-YA, which induces an ~80° bend in the DNA. NF-YB and NF-YC interact with DNA through non-specific histone-fold domain-DNA contacts. [6] NF-YA's unique DNA-binding mode and NF-YB/NF-YC's nucleosome-like properties of non-specific DNA binding impose sufficient spatial constraints to induce flanking nucleosomes to slide outward, making nearby recognition sites for other transcription factors accessible. [3]
Pioneer factors can function passively, by acting as a bookmark for the cell to recruit other transcription factors to specific genes in condensed chromatin. This can be important for priming the cell for a rapid response as the enhancer is already bound by a pioneer transcription factor giving it a head start towards assembling the transcription preinitiation complex. Hormone responses are often quickly induced in the cell using this priming method such as with the estrogen receptor. [10] Another form of priming is when an enhancer is simultaneously bound by activating and repressing pioneer factors. This balance can be tipped by dissociation of one of the factors. In hepatic cell differentiation the activating pioneer factor FOXA1 recruits a repressor, grg3, that prevents transcription until the repressor is down-regulated later on in the differentiation process. [11]
In a direct role pioneer factors can bind an enhancer and recruit activation complex that will modify the chromatin directly. The change in the chromatin changes the affinity, decreasing the affinity of the pioneer factor such that it is replaced by a transcription factor that has a higher affinity. This is a mechanism for the cell to switch a gene on was observed with glucocorticoid receptor recruiting modification factors that then modify the site to bind activated estrogen receptor which was coined as a “bait and switch” mechanism. [12]
Pioneer factors can exhibit their greatest range of effects on transcription through the modulation of epigenetic factors by recruiting activating or repressing histone modification enzymes and controlling CpG methylation by protecting specific cysteine residues. This has effects on controlling the timing of transcription during cell differentiation processes.
Histone modification is a well-studied mechanism to transiently adjust chromatin density. Pioneer factors can play a role in this by binding specific enhancers and flagging histone modification enzymes to that specific gene. Repressive pioneer factors can inhibit transcription by recruiting factors that modify histones that further tighten the chromatin. This is important to limit gene expression to specific cell types and has to be removed only when cell differentiation begins. FoxD3 has been associated as a repressor of both B-cell and melanocytic cell differentiation pathways, maintaining repressive histone modifications where bound, that have to be overcome to start differentiation. [13] [14] Pioneer factors can also be associated with recruiting transcription-activating histone modifications. Enzymes that modify H3K4 with mono and di-methylation are associated with increasing transcription and have been shown to bind pioneer factors. [10] In B cell differentiation PU.1 is necessary to signal specific histones for activating H3K4me1 modifications that differentiate hematopoietic stem cells into either the B-cell or macrophage lineage. [15] FoxA1 binding induces HSK4me2 during neuronal differentiation of pluripotent stem cells [16] as well as the loss of DNA methylation. [17] SOX9 recruits histone modification enzymes MLL3 and MLL4 to deposit H3K4me1 prior to the opening of enhancers in developing hair follicle and basal cell carcinoma. [18]
Pioneer factors can also affect transcription and differentiation through the control of DNA methylation. Pioneer factors that bind to CpG islands and cytosine residues block access to methyltransferases. Many eukaryotic cells have CpG islands in their promoters that can be modified by methylation having adverse effects on their ability to control transcription. [19] This phenomenon is also present in promoters without CpG islands where single cytosine residues are protected from methylation until further cell differentiation. An example is FoxD3 preventing methylation of a cytosine residue in Alb1 enhancer, acting as a place holder for FoxA1 later in hepatic [20] as well as in CpG islands of genes in chronic lymphocytic leukemia. [21] For stable control of methylation state the cytosine residues are covered during mitosis, unlike most other transcription factors, to prevent methylation. Studies have shown that during mitosis 15% of all interphase FoxA1 binding sites were bound. [22] The protection of cytosine methylation can be quickly removed allowing for rapid induction when a signal is present.
A well studied pioneer factor family is the Groucho-related (Gro/TLE/Grg) transcription factors that often have a negative effect on transcription. These chromatin binding domains can span up to 3-4 nucleosomes. These large domains are scaffolds for further protein interactions and also modify the chromatin for other pioneer factors such as FoxA1 which has been shown to bind to Grg3. [23] Transcription factors with zinc finger DNA binding domains, such as the GATA family and glucocorticoid receptor. [10] The zinc finger domains do not appear to bind nucleosomes well and can be displaced by FOX factors. [22]
In the skin epidermis, SOX family transcription factor, SOX9, also behaves as a pioneer factor that governs hair follicle cell fate and can reprogram epidermal stem cells to a hair follicle fate. [24]
The ability of pioneer factors to respond to extracellular signals to differentiate cell type has been studied as a potential component of hormone-dependent cancers. Hormones such as estrogen and IGFI are shown to increase pioneer factor concentration leading to a change in transcription. [25] Known pioneer factors such as FoxA1, PBX1, TLE, AP2ɣ, GATA factors 2/3/4, and PU.1 have been associated with hormone-dependent cancer . FoxA1 is necessary for estrogen and androgen mediated hepatocarcinogenesis and is a defining gene for ER+ luminal breast cancer, as is another pioneer factor GATA3. [10] [25] FOXA1 particularly is expressed in 90% of breast cancer metastases and 89% of metastatic prostate cancers. [25] [26] In the breast cancer cell line, MCF-7, it was found that FoxA1 was bound to 50% of estrogen receptor binding sites independent of estrogen presence. High expression of pioneer factors is associated with poor prognosis with the exception of breast cancer where FoxA1 is associated with a stronger outcome. [25]
The correlation between pioneer factors and cancer has led to prospective therapeutic targeting. In knockdown studies in the MCF-7 breast cancer cell line it was found that decreasing pioneer factors FoxA1 and AP2ɣ decreased ER signalling. [4] [25] Other fork head proteins have been associated with cancer, including FoxO3 and FoxM that repress the cell survival pathways Ras and PPI3K/AKT/IKK. [27] Drugs such as Paclitaxel, Imatinib, and doxorubicin which activate FoxO3a or its targets are being used. Modification to modulate related factors with pioneer activity is a topic of interest in the early stages as knocking down pioneer factors may have toxic effects through alteration of the lineage pathways of healthy cells. [25]
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; however, when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.
A nucleosome is the basic structural unit of DNA packaging in eukaryotes. The structure of a nucleosome consists of a segment of DNA wound around eight histone proteins and resembles thread wrapped around a spool. The nucleosome is the fundamental subunit of chromatin. Each nucleosome is composed of a little less than two turns of DNA wrapped around a set of eight proteins called histones, which are known as a histone octamer. Each histone octamer is composed of two copies each of the histone proteins H2A, H2B, H3, and H4.
In molecular biology, a transcription factor (TF) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The function of TFs is to regulate—turn on and off—genes in order to make sure that they are expressed in the desired cells at the right time and in the right amount throughout the life of the cell and the organism. Groups of TFs function in a coordinated fashion to direct cell division, cell growth, and cell death throughout life; cell migration and organization during embryonic development; and intermittently in response to signals from outside the cell, such as a hormone. There are approximately 1600 TFs in the human genome. Transcription factors are members of the proteome as well as regulome.
Cellular differentiation is the process in which a stem cell changes from one type to a differentiated one. Usually, the cell changes to a more specialized type. Differentiation happens multiple times during the development of a multicellular organism as it changes from a simple zygote to a complex system of tissues and cell types. Differentiation continues in adulthood as adult stem cells divide and create fully differentiated daughter cells during tissue repair and during normal cell turnover. Some differentiation occurs in response to antigen exposure. Differentiation dramatically changes a cell's size, shape, membrane potential, metabolic activity, and responsiveness to signals. These changes are largely due to highly controlled modifications in gene expression and are the study of epigenetics. With a few exceptions, cellular differentiation almost never involves a change in the DNA sequence itself. Metabolic composition, however, gets dramatically altered where stem cells are characterized by abundant metabolites with highly unsaturated structures whose levels decrease upon differentiation. Thus, different cells can have very different physical characteristics despite having the same genome.
In molecular biology and genetics, transcriptional regulation is the means by which a cell regulates the conversion of DNA to RNA (transcription), thereby orchestrating gene activity. A single gene can be regulated in a range of ways, from altering the number of copies of RNA that are transcribed, to the temporal control of when the gene is transcribed. This control allows the cell or organism to respond to a variety of intra- and extracellular signals and thus mount a response. Some examples of this include producing the mRNA that encode enzymes to adapt to a change in a food source, producing the gene products involved in cell cycle specific activities, and producing the gene products responsible for cellular differentiation in multicellular eukaryotes, as studied in evolutionary developmental biology.
Histone acetyltransferases (HATs) are enzymes that acetylate conserved lysine amino acids on histone proteins by transferring an acetyl group from acetyl-CoA to form ε-N-acetyllysine. DNA is wrapped around histones, and, by transferring an acetyl group to the histones, genes can be turned on and off. In general, histone acetylation increases gene expression.
HMGN proteins are members of the broader class of high mobility group (HMG) chromosomal proteins that are involved in regulation of transcription, replication, recombination, and DNA repair.
Histone H2B is one of the 5 main histone proteins involved in the structure of chromatin in eukaryotic cells. Featuring a main globular domain and long N-terminal and C-terminal tails, H2B is involved with the structure of the nucleosomes.
Histone methylation is a process by which methyl groups are transferred to amino acids of histone proteins that make up nucleosomes, which the DNA double helix wraps around to form chromosomes. Methylation of histones can either increase or decrease transcription of genes, depending on which amino acids in the histones are methylated, and how many methyl groups are attached. Methylation events that weaken chemical attractions between histone tails and DNA increase transcription because they enable the DNA to uncoil from nucleosomes so that transcription factor proteins and RNA polymerase can access the DNA. This process is critical for the regulation of gene expression that allows different cells to express different genes.
Methyltransferases are a large group of enzymes that all methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which contain a Rossmann fold for binding S-Adenosyl methionine (SAM). Class II methyltransferases contain a SET domain, which are exemplified by SET domain histone methyltransferases, and class III methyltransferases, which are membrane associated. Methyltransferases can also be grouped as different types utilizing different substrates in methyl transfer reactions. These types include protein methyltransferases, DNA/RNA methyltransferases, natural product methyltransferases, and non-SAM dependent methyltransferases. SAM is the classical methyl donor for methyltransferases, however, examples of other methyl donors are seen in nature. The general mechanism for methyl transfer is a SN2-like nucleophilic attack where the methionine sulfur serves as the leaving group and the methyl group attached to it acts as the electrophile that transfers the methyl group to the enzyme substrate. SAM is converted to S-Adenosyl homocysteine (SAH) during this process. The breaking of the SAM-methyl bond and the formation of the substrate-methyl bond happen nearly simultaneously. These enzymatic reactions are found in many pathways and are implicated in genetic diseases, cancer, and metabolic diseases. Another type of methyl transfer is the radical S-Adenosyl methionine (SAM) which is the methylation of unactivated carbon atoms in primary metabolites, proteins, lipids, and RNA.
Histone acetylation and deacetylation are the processes by which the lysine residues within the N-terminal tail protruding from the histone core of the nucleosome are acetylated and deacetylated as part of gene regulation.
Histone-modifying enzymes are enzymes involved in the modification of histone substrates after protein translation and affect cellular processes including gene expression. To safely store the eukaryotic genome, DNA is wrapped around four core histone proteins, which then join to form nucleosomes. These nucleosomes further fold together into highly condensed chromatin, which renders the organism's genetic material far less accessible to the factors required for gene transcription, DNA replication, recombination and repair. Subsequently, eukaryotic organisms have developed intricate mechanisms to overcome this repressive barrier imposed by the chromatin through histone modification, a type of post-translational modification which typically involves covalently attaching certain groups to histone residues. Once added to the histone, these groups elicit either a loose and open histone conformation, euchromatin, or a tight and closed histone conformation, heterochromatin. Euchromatin marks active transcription and gene expression, as the light packing of histones in this way allows entry for proteins involved in the transcription process. As such, the tightly packed heterochromatin marks the absence of current gene expression.
Chromatin remodeling is the dynamic modification of chromatin architecture to allow access of condensed genomic DNA to the regulatory transcription machinery proteins, and thereby control gene expression. Such remodeling is principally carried out by 1) covalent histone modifications by specific enzymes, e.g., histone acetyltransferases (HATs), deacetylases, methyltransferases, and kinases, and 2) ATP-dependent chromatin remodeling complexes which either move, eject or restructure nucleosomes. Besides actively regulating gene expression, dynamic remodeling of chromatin imparts an epigenetic regulatory role in several key biological processes, egg cells DNA replication and repair; apoptosis; chromosome segregation as well as development and pluripotency. Aberrations in chromatin remodeling proteins are found to be associated with human diseases, including cancer. Targeting chromatin remodeling pathways is currently evolving as a major therapeutic strategy in the treatment of several cancers.
Nuclear transcription factor Y subunit alpha is a protein that in humans is encoded by the NFYA gene.
Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome. The field is analogous to genomics and proteomics, which are the study of the genome and proteome of a cell. Epigenetic modifications are reversible modifications on a cell's DNA or histones that affect gene expression without altering the DNA sequence. Epigenomic maintenance is a continuous process and plays an important role in stability of eukaryotic genomes by taking part in crucial biological mechanisms like DNA repair. Plant flavones are said to be inhibiting epigenomic marks that cause cancers. Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as in differentiation/development and tumorigenesis. The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays.
Forkhead box protein A2 (FOXA2), also known as hepatocyte nuclear factor 3-beta (HNF-3B), is a transcription factor that plays an important role during development, in mature tissues and, when dysregulated or mutated, also in cancer.
Epigenetics of human development is the study of how epigenetics effects human development.
H3K4me3 is an epigenetic modification to the DNA packaging protein Histone H3 that indicates tri-methylation at the 4th lysine residue of the histone H3 protein and is often involved in the regulation of gene expression. The name denotes the addition of three methyl groups (trimethylation) to the lysine 4 on the histone H3 protein.
H3K27me3 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the tri-methylation of lysine 27 on histone H3 protein.
H4K16ac is an epigenetic modification to the DNA packaging protein Histone H4. It is a mark that indicates the acetylation at the 16th lysine residue of the histone H4 protein.