Heterochromatin is a tightly packed form of DNA or condensed DNA , which comes in multiple varieties. These varieties lie on a continuum between the two extremes of constitutive heterochromatin and facultative heterochromatin . Both play a role in the expression of genes. Because it is tightly packed, it was thought to be inaccessible to polymerases and therefore not transcribed; however, according to Volpe et al. (2002), [1] and many other papers since, [2] much of this DNA is in fact transcribed, but it is continuously turned over via RNA-induced transcriptional silencing (RITS). Recent studies with electron microscopy and OsO4 staining reveal that the dense packing is not due to the chromatin. [3]
Constitutive heterochromatin can affect the genes near itself (e.g. position-effect variegation). It is usually repetitive and forms structural functions such as centromeres or telomeres, in addition to acting as an attractor for other gene-expression or repression signals.
Facultative heterochromatin is the result of genes that are silenced through a mechanism such as histone deacetylation or Piwi-interacting RNA (piRNA) through RNAi. It is not repetitive and shares the compact structure of constitutive heterochromatin. However, under specific developmental or environmental signaling cues, it can lose its condensed structure and become transcriptionally active. [4]
Heterochromatin has been associated with the di- and tri -methylation of H3K9 in certain portions of the human genome. [5] H3K9me3-related methyltransferases appear to have a pivotal role in modifying heterochromatin during lineage commitment at the onset of organogenesis and in maintaining lineage fidelity. [6]
Chromatin is found in two varieties: euchromatin and heterochromatin. [7] Originally, the two forms were distinguished cytologically by how intensely they get stained – the euchromatin is less intense, while heterochromatin stains intensely, indicating tighter packing. Heterochromatin was given its name for this reason by botanist Emil Heitz who discovered that heterochromatin remained darkly stained throughout the entire cell cycle, unlike euchromatin whose stain disappeared during interphase. [8] Heterochromatin is usually localized to the periphery of the nucleus. Despite this early dichotomy, recent evidence in both animals [9] and plants [10] has suggested that there are more than two distinct heterochromatin states, and it may in fact exist in four or five 'states', each marked by different combinations of epigenetic marks.
Heterochromatin mainly consists of genetically inactive satellite sequences, [11] and many genes are repressed to various extents, although some cannot be expressed in euchromatin at all. [12] Both centromeres and telomeres are heterochromatic, as is the Barr body of the second, inactivated X-chromosome in a female.
Heterochromatin has been associated with several functions, from gene regulation to the protection of chromosome integrity; [13] some of these roles can be attributed to the dense packing of DNA, which makes it less accessible to protein factors that usually bind DNA or its associated factors. For example, naked double-stranded DNA ends would usually be interpreted by the cell as damaged or viral DNA, triggering cell cycle arrest, DNA repair or destruction of the fragment, such as by endonucleases in bacteria.
Some regions of chromatin are very densely packed with fibers that display a condition comparable to that of the chromosome at mitosis. Heterochromatin is generally clonally inherited; when a cell divides, the two daughter cells typically contain heterochromatin within the same regions of DNA, resulting in epigenetic inheritance. Variations cause heterochromatin to encroach on adjacent genes or recede from genes at the extremes of domains. Transcribable material may be repressed by being positioned (in cis) at these boundary domains. This gives rise to expression levels that vary from cell to cell, [14] which may be demonstrated by position-effect variegation. [15] Insulator sequences may act as a barrier in rare cases where constitutive heterochromatin and highly active genes are juxtaposed (e.g. the 5'HS4 insulator upstream of the chicken β-globin locus, [16] and loci in two Saccharomyces spp. [17] [18] ).
All cells of a given species package the same regions of DNA in constitutive heterochromatin, and thus in all cells, any genes contained within the constitutive heterochromatin will be poorly expressed. For example, all human chromosomes 1, 9, 16, and the Y-chromosome contain large regions of constitutive heterochromatin. In most organisms, constitutive heterochromatin occurs around the chromosome centromere and near telomeres.
The regions of DNA packaged in facultative heterochromatin will not be consistent between the cell types within a species, and thus a sequence in one cell that is packaged in facultative heterochromatin (and the genes within are poorly expressed) may be packaged in euchromatin in another cell (and the genes within are no longer silenced). However, the formation of facultative heterochromatin is regulated, and is often associated with morphogenesis or differentiation. An example of facultative heterochromatin is X chromosome inactivation in female mammals: one X chromosome is packaged as facultative heterochromatin and silenced, while the other X chromosome is packaged as euchromatin and expressed.
Among the molecular components that appear to regulate the spreading of heterochromatin are the Polycomb-group proteins and non-coding genes such as Xist. The mechanism for such spreading is still a matter of controversy. [19] The polycomb repressive complexes PRC1 and PRC2 regulate chromatin compaction and gene expression and have a fundamental role in developmental processes. PRC-mediated epigenetic aberrations are linked to genome instability and malignancy and play a role in the DNA damage response, DNA repair and in the fidelity of replication. [20]
Saccharomyces cerevisiae , or budding yeast, is a model eukaryote and its heterochromatin has been defined thoroughly. Although most of its genome can be characterized as euchromatin, S. cerevisiae has regions of DNA that are transcribed very poorly. These loci are the so-called silent mating type loci (HML and HMR), the rDNA (encoding ribosomal RNA), and the sub-telomeric regions. Fission yeast ( Schizosaccharomyces pombe ) uses another mechanism for heterochromatin formation at its centromeres. Gene silencing at this location depends on components of the RNAi pathway. Double-stranded RNA is believed to result in silencing of the region through a series of steps.
In the fission yeast Schizosaccharomyces pombe, two RNAi complexes, the RITS complex and the RNA-directed RNA polymerase complex (RDRC), are part of an RNAi machinery involved in the initiation, propagation and maintenance of heterochromatin assembly. These two complexes localize in a siRNA-dependent manner on chromosomes, at the site of heterochromatin assembly. RNA polymerase II synthesizes a transcript that serves as a platform to recruit RITS, RDRC and possibly other complexes required for heterochromatin assembly. [21] [22] Both RNAi and an exosome-dependent RNA degradation process contribute to heterochromatic gene silencing. These mechanisms of Schizosaccharomyces pombe may occur in other eukaryotes. [23] A large RNA structure called RevCen has also been implicated in the production of siRNAs to mediate heterochromatin formation in some fission yeast. [24]
Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in reinforcing the DNA during cell division, preventing DNA damage, and regulating gene expression and DNA replication. During mitosis and meiosis, chromatin facilitates proper segregation of the chromosomes in anaphase; the characteristic shapes of chromosomes visible during this stage are the result of DNA being coiled into highly condensed chromatin.
The centromere links a pair of sister chromatids together during cell division. This constricted region of chromosome connects the sister chromatids, creating a short arm (p) and a long arm (q) on the chromatids. During mitosis, spindle fibers attach to the centromere via the kinetochore.
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; however, when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.
Euchromatin is a lightly packed form of chromatin that is enriched in genes, and is often under active transcription. Euchromatin stands in contrast to heterochromatin, which is tightly packed and less accessible for transcription. 92% of the human genome is euchromatic.
Constitutive heterochromatin domains are regions of DNA found throughout the chromosomes of eukaryotes. The majority of constitutive heterochromatin is found at the pericentromeric regions of chromosomes, but is also found at the telomeres and throughout the chromosomes. In humans there is significantly more constitutive heterochromatin found on chromosomes 1, 9, 16, 19 and Y. Constitutive heterochromatin is composed mainly of high copy number tandem repeats known as satellite repeats, minisatellite and microsatellite repeats, and transposon repeats. In humans these regions account for about 200Mb or 6.5% of the total human genome, but their repeat composition makes them difficult to sequence, so only small regions have been sequenced.
Subtelomeres are segments of DNA between telomeric caps and chromatin.
Position-effect variegation (PEV) is a variegation caused by the silencing of a gene in some cells through its abnormal juxtaposition with heterochromatin via rearrangement or transposition. It is also associated with changes in chromatin conformation.
The family of heterochromatin protein 1 (HP1) consists of highly conserved proteins, which have important functions in the cell nucleus. These functions include gene repression by heterochromatin formation, transcriptional activation, regulation of binding of cohesion complexes to centromeres, sequestration of genes to the nuclear periphery, transcriptional arrest, maintenance of heterochromatin integrity, gene repression at the single nucleosome level, gene repression by heterochromatization of euchromatin, and DNA repair. HP1 proteins are fundamental units of heterochromatin packaging that are enriched at the centromeres and telomeres of nearly all eukaryotic chromosomes with the notable exception of budding yeast, in which a yeast-specific silencing complex of SIR proteins serve a similar function. Members of the HP1 family are characterized by an N-terminal chromodomain and a C-terminal chromoshadow domain, separated by a hinge region. HP1 is also found at some euchromatic sites, where its binding can correlate with either gene repression or gene activation. HP1 was originally discovered by Tharappel C James and Sarah Elgin in 1986 as a factor in the phenomenon known as position effect variegation in Drosophila melanogaster.
RNA-induced transcriptional silencing (RITS) is a form of RNA interference by which short RNA molecules – such as small interfering RNA (siRNA) – trigger the downregulation of transcription of a particular gene or genomic region. This is usually accomplished by posttranslational modification of histone tails which target the genomic region for heterochromatin formation. The protein complex that binds to siRNAs and interacts with the methylated lysine 9 residue of histones H3 (H3K9me2) is the RITS complex.
Centromere protein A, also known as CENPA, is a protein which in humans is encoded by the CENPA gene. CENPA is a histone H3 variant which is the critical factor determining the kinetochore position(s) on each chromosome in most eukaryotes including humans.
Chromobox protein homolog 5 is a protein that in humans is encoded by the CBX5 gene. It is a highly conserved, non-histone protein part of the heterochromatin family. The protein itself is more commonly called HP1α. Heterochromatin protein-1 (HP1) has an N-terminal domain that acts on methylated lysines residues leading to epigenetic repression. The C-terminal of this protein has a chromo shadow-domain (CSD) that is responsible for homodimerizing, as well as interacting with a variety of chromatin-associated, non-histone proteins.
RNA polymerase IV is an enzyme that synthesizes small interfering RNA (siRNA) in plants, which silence gene expression. RNAP IV belongs to a family of enzymes that catalyze the process of transcription known as RNA Polymerases, which synthesize RNA from DNA templates. Discovered via phylogenetic studies of land plants, genes of RNAP IV are thought to have resulted from multistep evolution processes that occurred in RNA Polymerase II phylogenies. Such an evolutionary pathway is supported by the fact that RNAP IV is composed of 12 protein subunits that are either similar or identical to RNA polymerase II, and is specific to plant genomes. Via its synthesis of siRNA, RNAP IV is involved in regulation of heterochromatin formation in a process known as RNA directed DNA Methylation (RdDM).
RevCen is a family of non-coding RNA found in Schizosaccharomyces. It is a megastructure containing several siRNA which use the RNAi pathway to regulate heterochromatin formation. The long RNA transcript forms a secondary structure with several stem-loops which are processed by dicer into siRNA. This siRNA then initiate the formation of heterochromatin at the centromeres of fission yeast. Northern blot analysis confirmed the siRNAs were produced from the large RNA structure RevCen in vivo. As with all siRNAs, the enzyme dicer is responsible for dissecting dsRNA into the 21nt stretch of double-stranded RNA. Human recombinant dicer enzyme processed the RevCen structure in vitro, though the same activity by yeast Dcr1 has not been confirmed.
Telomeric repeat–containing RNA (TERRA) is a long non-coding RNA transcribed from telomeres - repetitive nucleotide regions found on the ends of chromosomes that function to protect DNA from deterioration or fusion with neighboring chromosomes. TERRA has been shown to be ubiquitously expressed in almost all cell types containing linear chromosomes - including humans, mice, and yeasts. While the exact function of TERRA is still an active area of research, it is generally believed to play a role in regulating telomerase activity as well as maintaining the heterochromatic state at the ends of chromosomes. TERRA interaction with other associated telomeric proteins has also been shown to help regulate telomere integrity in a length-dependent manner.
Epigenetics of human development is the study of how epigenetics effects human development.
Nuclear organization refers to the spatial distribution of chromatin within a cell nucleus. There are many different levels and scales of nuclear organisation. Chromatin is a higher order structure of DNA.
Robin Campbell Allshire is Professor of Chromosome Biology at University of Edinburgh and a Wellcome Trust Principal Research Fellow. His research group at the Wellcome Trust Centre for Cell Biology focuses on the epigenetic mechanisms governing the assembly of specialised domains of chromatin and their transmission through cell division.
Thomas Jenuwein is a German scientist working in the fields of epigenetics, chromatin biology, gene regulation and genome function.
RNA-directed DNA methylation (RdDM) is a biological process in which non-coding RNA molecules direct the addition of DNA methylation to specific DNA sequences. The RdDM pathway is unique to plants, although other mechanisms of RNA-directed chromatin modification have also been described in fungi and animals. To date, the RdDM pathway is best characterized within angiosperms, and particularly within the model plant Arabidopsis thaliana. However, conserved RdDM pathway components and associated small RNAs (sRNAs) have also been found in other groups of plants, such as gymnosperms and ferns. The RdDM pathway closely resembles other sRNA pathways, particularly the highly conserved RNAi pathway found in fungi, plants, and animals. Both the RdDM and RNAi pathways produce sRNAs and involve conserved Argonaute, Dicer and RNA-dependent RNA polymerase proteins.
H3Y41P is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the phosphorylation the 41st tyrosine residue of the histone H3 protein.
An up-to-date account of the current understanding of repetitive DNA, which usually doesn't contain genetic information. If evolution makes sense only in the context of the regulatory control of genes, we propose that heterochromatin, which is the main form of chromatin in higher eukaryotes, is positioned to be a deeply effective target for evolutionary change. Future investigations into assembly, maintenance and the many other functions of heterochromatin will shed light on the processes of gene and chromosome regulation.