In molecular biology, the linker histone H1 is a protein family forming a critical component of eukaryotic chromatin. H1 histones bind to the linker DNA exiting from the nucleosome core particle, while the core histones (H2A, H2B, H3 and H4) form the octamer core of the nucleosome around which the DNA is wrapped. [1]
H1 forms a complex family of related proteins with distinct specificity for tissues, developmental stages, and organisms in which they are expressed. [2] Individual H1 proteins are often referred to as isoforms or variants.
The discovery of H1 variants in calf thymus preceded the discovery of core histone variants. [3] [4]
In human and mouse cells 11 H1 variants have been described and are encoded by single genes. Six of the variants are mainly expressed during the S phase and hence replication-dependent. They are encoded by genes within histone cluster 1 located in human cells on chromosome 6. The five further variants are expressed over the whole cell cycle and their encoding genes are scattered in the genome.
Human gene symbol | Unified phylogeny-based nomenclature [5] |
---|---|
H1 variants within histone gene cluster 1 (replication dependent) | |
HIST1H1A | H1.1 |
HIST1H1B | H1.5 |
HIST1H1C | H1.2 |
HIST1H1D | H1.3 |
HIST1H1E | H1.4 |
HIST1H1T | (TS) H1.6 |
H1 variants encoded by orphan genes (replication independent) | |
H1F0 | H1.0 |
H1FNT | (TS) H1.7 |
H1FOO | (OO) H1.8 |
HILS1 | (TS) H1.9 |
H1FX | H1.10 |
TS - testis specific, OO - oocyte specific variants
Histone H1 differs strongly from the core histones. Rather than originating from archaeal histones, it probably evolved from a bacterial protein. [6] Unlike core histones featuring a so-called histone fold, H1s typically have a short basic N-terminal domain, a globular domain and a lysine-rich C-terminal domain (the N- and C-termini are also referred to as tails). [7] H1s are also less conserved than the core histones. The mammalian H1 isoforms are paralogs, which means their encoding genes originated from gene duplication events. The corresponding H1 variants in two different species, such as human and mouse H1.4 are orthologs – they had a common ancestor gene and were separated by speciation. Within one species, the paralogous H1 variants show a high conservation of the globular core domain, while the N- and C-termini are more divergent. At the same time H1 orthologs among mammals are highly conserved across the whole protein sequence, for example human and mouse H1.4 share 93.6% sequence identity. [2]
The extent to which individual H1 variants can be redundant and what their distinct functions are isn't yet clear. The fact that many individual H1 variant knockouts in mice are viable and show compensation by other H1 variants seems to support the hypothesis of redundancy. [8] [9] [10] [11] However, many lines of evidence suggest specific functions exist for H1 variants. For example, individual H1 variant knockout mice reveal specific phenotypes and distinct effects on gene expression and chromatin structure. [9] [10] [12] [13] [14] [15] Also, different isotypes show different localization and bind to chromatin with different affinities. [16] [17] [18] [19] [20] [21]
Therefore, a model has been proposed according to which H1 variants have two distinct roles, a common and a specific one: [2] Individual H1 proteins are redundant in their ability to compact chromatin globally and to stabilize overall higher order chromatin structures. Such a common role can therefore be compensated in mutant cells by increasing the amount of other H1 variants. However, at the level of local chromatin organization, individual variants can regulate a subset of specific genes both in a negative and positive way. [2]
Multiple nomenclatures (around 12) for linker histone variants have been proposed and used in publications previously, greatly complicating comparison across studies. In 1994 Parseghian et al. have attempted to create a system in which variant designations were applied uniformly to orthologs across mammalian species, [22] however this nomenclature hasn't been taken up by other laboratories. In 2012, a diverse group of scientists from multiple institutions across the world working on different aspects of histone biology proposed a unified phylogeny-based nomenclature for histone variants, including H1 histones, with the aim of producing informative and easily searchable histone variant names. [5]
Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in reinforcing the DNA during cell division, preventing DNA damage, and regulating gene expression and DNA replication. During mitosis and meiosis, chromatin facilitates proper segregation of the chromosomes in anaphase; the characteristic shapes of chromosomes visible during this stage are the result of DNA being coiled into highly condensed chromatin.
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; however, when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.
A nucleosome is the basic structural unit of DNA packaging in eukaryotes. The structure of a nucleosome consists of a segment of DNA wound around eight histone proteins and resembles thread wrapped around a spool. The nucleosome is the fundamental subunit of chromatin. Each nucleosome is composed of a little less than two turns of DNA wrapped around a set of eight proteins called histones, which are known as a histone octamer. Each histone octamer is composed of two copies each of the histone proteins H2A, H2B, H3, and H4.
HMGN proteins are members of the broader class of high mobility group (HMG) chromosomal proteins that are involved in regulation of transcription, replication, recombination, and DNA repair.
Histone H1 is one of the five main histone protein families which are components of chromatin in eukaryotic cells. Though highly conserved, it is nevertheless the most variable histone in sequence across species.
Histone H3 is one of the five main histones involved in the structure of chromatin in eukaryotic cells. Featuring a main globular domain and a long N-terminal tail, H3 is involved with the structure of the nucleosomes of the 'beads on a string' structure. Histone proteins are highly post-translationally modified however Histone H3 is the most extensively modified of the five histones. The term "Histone H3" alone is purposely ambiguous in that it does not distinguish between sequence variants or modification state. Histone H3 is an important protein in the emerging field of epigenetics, where its sequence variants and variable modification states are thought to play a role in the dynamic and long term regulation of genes.
Histone H4 is one of the five main histone proteins involved in the structure of chromatin in eukaryotic cells. Featuring a main globular domain and a long N-terminal tail, H4 is involved with the structure of the nucleosome of the 'beads on a string' organization. Histone proteins are highly post-translationally modified. Covalently bonded modifications include acetylation and methylation of the N-terminal tails. These modifications may alter expression of genes located on DNA associated with its parent histone octamer. Histone H4 is an important protein in the structure and function of chromatin, where its sequence variants and variable modification states are thought to play a role in the dynamic and long term regulation of genes.
Histone H2A is one of the five main histone proteins involved in the structure of chromatin in eukaryotic cells.
The solenoid structure of chromatin is a model for the structure of the 30 nm fibre. It is a secondary chromatin structure which helps to package eukaryotic DNA into the nucleus.
Chromatin remodeling is the dynamic modification of chromatin architecture to allow access of condensed genomic DNA to the regulatory transcription machinery proteins, and thereby control gene expression. Such remodeling is principally carried out by 1) covalent histone modifications by specific enzymes, e.g., histone acetyltransferases (HATs), deacetylases, methyltransferases, and kinases, and 2) ATP-dependent chromatin remodeling complexes which either move, eject or restructure nucleosomes. Besides actively regulating gene expression, dynamic remodeling of chromatin imparts an epigenetic regulatory role in several key biological processes, egg cells DNA replication and repair; apoptosis; chromosome segregation as well as development and pluripotency. Aberrations in chromatin remodeling proteins are found to be associated with human diseases, including cancer. Targeting chromatin remodeling pathways is currently evolving as a major therapeutic strategy in the treatment of several cancers.
Histone H2A.Z is a protein that in humans is encoded by the H2AZ1 gene.
Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones. The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. The H2AFZ gene encodes a replication-independent member of the histone H2A family that is distinct from other members of the family. Studies in mice have shown that this particular histone is required for embryonic development and indicate that lack of functional histone H2A leads to embryonic lethality.
Histone H4 is a protein that, in humans, is encoded by the HIST1H4I gene.
Histone H1.5 is a protein that in humans is encoded by the HIST1H1B gene.
Histone H2B type 1-D is a protein that in humans is encoded by the HIST1H2BD gene.
Histone H1.1 is a protein that in humans is encoded by the HIST1H1A gene.
Histone H1.4 is a protein that in humans is encoded by the HIST1H1E gene.
Histone H1.2 is a protein that in humans is encoded by the HIST1H1C gene.
Chromodomain-helicase-DNA-binding protein 8 is an enzyme that in humans is encoded by the CHD8 gene.
Histone H1x is a protein that in humans is encoded by the H1FX gene.
Histone variants are proteins that substitute for the core canonical histones in nucleosomes in eukaryotes and often confer specific structural and functional features. The term might also include a set of linker histone (H1) variants, which lack a distinct canonical isoform. The differences between the core canonical histones and their variants can be summarized as follows: (1) canonical histones are replication-dependent and are expressed during the S-phase of cell cycle whereas histone variants are replication-independent and are expressed during the whole cell cycle; (2) in animals, the genes encoding canonical histones are typically clustered along the chromosome, are present in multiple copies and are among the most conserved proteins known, whereas histone variants are often single-copy genes and show high degree of variation among species; (3) canonical histone genes lack introns and use a stem loop structure at the 3’ end of their mRNA, whereas histone variant genes may have introns and their mRNA tail is usually polyadenylated. Complex multicellular organisms typically have a large number of histone variants providing a variety of different functions. Recent data are accumulating about the roles of diverse histone variants highlighting the functional links between variants and the delicate regulation of organism development.