Chromatin immunoprecipitation (ChIP) is a type of immunoprecipitation experimental technique used to investigate the interaction between proteins and DNA in the cell. It aims to determine whether specific proteins are associated with specific genomic regions, such as transcription factors on promoters or other DNA binding sites, and possibly define cistromes. ChIP also aims to determine the specific location in the genome that various histone modifications are associated with, indicating the target of the histone modifiers. [1] ChIP is crucial for the advancements in the field of epigenomics and learning more about epigenetic phenomena. [2]
Briefly, the conventional method is as follows:
There are two main types of ChIP, which differ primarily in the starting chromatin preparation. The first, cross-linked ChIP (XChIP), uses reversibly cross-linked chromatin sheared by sonication. The second, native ChIP (NChIP), uses native chromatin sheared by micrococcal nuclease digestion.
Cross-linked ChIP is mainly suited for mapping the DNA target of transcription factors or other chromatin-associated proteins, and uses reversibly cross-linked chromatin as starting material. The agent for reversible cross-linking could be formaldehyde [3] or UV light. [4] Then the cross-linked chromatin is usually sheared by sonication, providing fragments of 300 - 1000 base pairs (bp) in length. Mild formaldehyde crosslinking followed by nuclease digestion has been used to shear the chromatin. [5] Chromatin fragments of 400 - 500bp have proven to be suitable for ChIP assays as they cover two to three nucleosomes.
Cell debris in the sheared lysate is then cleared by sedimentation and protein–DNA complexes are selectively immunoprecipitated using specific antibodies to the protein(s) of interest. The antibodies are commonly coupled to agarose, sepharose, or magnetic beads. Alternatively, chromatin-antibody complexes can be selectively retained and eluted by inert polymer discs. [6] [7] The immunoprecipitated complexes (i.e., the bead–antibody–protein–target DNA sequence complex) are then collected and washed to remove non-specifically bound chromatin, the protein–DNA cross-link is reversed and proteins are removed by digestion with proteinase K. An epitope-tagged version of the protein of interest, or in vivo biotinylation [8] can be used instead of antibodies to the native protein of interest.
The DNA associated with the complex is then purified and identified by polymerase chain reaction (PCR), microarrays (ChIP-on-chip), molecular cloning and sequencing, or direct high-throughput sequencing (ChIP-Seq).
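When the purified DNA is quantified by quantitative PCR, enrichment is often expressed as a percentage of the input chromatin. The sketch below illustrates that widely used "percent input" convention; it assumes ideal (doubling per cycle) amplification, and the function name, parameter names, and example Ct values are hypothetical rather than taken from any specific protocol.

```python
import math

def percent_input(ct_ip: float, ct_input: float, input_fraction: float) -> float:
    """Express a ChIP qPCR signal as a percentage of the input chromatin.

    ct_ip          -- threshold cycle of the immunoprecipitated sample
    ct_input       -- threshold cycle of the saved input sample
    input_fraction -- fraction of chromatin saved as input (e.g. 0.01 for 1%)
    Assumes every PCR cycle doubles the product (100% primer efficiency).
    """
    if not 0 < input_fraction <= 1:
        raise ValueError("input_fraction must be in (0, 1]")
    # Shift the input Ct to the value expected if all chromatin had been used.
    adjusted_input_ct = ct_input - math.log2(1 / input_fraction)
    return 100 * 2 ** (adjusted_input_ct - ct_ip)

# Made-up Ct values for illustration only.
print(round(percent_input(ct_ip=25.0, ct_input=25.0, input_fraction=0.01), 2))
```

Real assays usually also include a negative-control region or a non-specific antibody so that the percentage can be compared against background.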
Native ChIP is mainly suited for mapping the DNA targets of histone modifiers. It uses native chromatin as starting material: because histones wrap around DNA to form nucleosomes, they are naturally linked to it. The chromatin is then sheared by micrococcal nuclease digestion, which cuts DNA within the linker regions, leaving nucleosomes intact and providing DNA fragments of one nucleosome (200 bp) to five nucleosomes (1000 bp) in length. Thereafter, methods similar to XChIP are used for clearing the cell debris, immunoprecipitating the protein of interest, removing protein from the immunoprecipitated complex, and purifying and analyzing the complex-associated DNA.
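The fragment sizes quoted above follow from a simple proportionality between fragment length and nucleosome count. A minimal sketch, assuming the approximate figure of 200 bp per nucleosome given in the text (the constant and function name are illustrative; the real repeat length varies between organisms and cell types):

```python
# Assumed DNA length per nucleosome (core plus linker), taken from the
# approximate figure in the text; not a measured constant.
BP_PER_NUCLEOSOME = 200

def estimate_nucleosomes(fragment_bp: int, bp_per_nucleosome: int = BP_PER_NUCLEOSOME) -> float:
    """Return the approximate number of nucleosomes spanned by a fragment."""
    if fragment_bp < 0 or bp_per_nucleosome <= 0:
        raise ValueError("fragment length must be >= 0 and repeat length > 0")
    return fragment_bp / bp_per_nucleosome

# Fragment sizes mentioned for XChIP and NChIP preparations.
for size in (200, 400, 500, 1000):
    print(size, "bp ->", estimate_nucleosomes(size), "nucleosomes")
```

Under this assumption, the 400 to 500 bp fragments preferred for ChIP assays correspond to roughly two to three nucleosomes.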
The major advantage of NChIP is antibody specificity. Most antibodies to modified histones are raised against unfixed, synthetic peptide antigens, and the epitopes they need to recognize in XChIP may be disrupted or destroyed by formaldehyde cross-linking, particularly as the cross-links are likely to involve lysine ε-amino groups in the N-termini. This likely explains the consistently low efficiency of XChIP protocols compared to NChIP.
However, XChIP and NChIP have different aims and advantages relative to each other: XChIP is for mapping target sites of transcription factors and other chromatin-associated proteins, whereas NChIP is for mapping target sites of histone modifiers (see Table 1).
Chromatin immunoprecipitation sequencing, also known as ChIP-seq, is an experimental technique used to identify transcription factor binding events throughout an entire genome. Knowing how the proteins in the human body interact with DNA to regulate gene expression is a key component of our knowledge of human diseases and biological processes. ChIP-seq is the primary technique for this task, as it has proven extremely effective in resolving how proteins and transcription factors influence phenotypic mechanisms. ChIP-seq has become a very efficient method for determining these interactions, but there is a rival method known as ChIP-on-chip.
ChIP-on-chip, also known as ChIP-chip, is an experimental technique used to isolate and identify genomic sites occupied by specific DNA-binding proteins in living cells. It was introduced in 2001 by Peggy Farnham and Michael Zhang. Its name reflects the combination of chromatin immunoprecipitation with DNA microarrays.
The two methods seek similar results, as both strive to find protein binding sites that can help identify elements in the human genome. Those elements are important for advancing knowledge of human diseases and biological processes. The main difference between ChIP-seq and ChIP-chip lies in their efficacy: ChIP-seq produces results with higher sensitivity and spatial resolution because of its wide genomic coverage. Even though ChIP-seq has proven to be more efficient than ChIP-chip, it is not always the first choice for scientists. The cost and limited accessibility of ChIP-seq are major disadvantages, which have led to the more predominant use of ChIP-chip in laboratories across the world. [2]
Table 1 Advantages and disadvantages of NChIP and XChIP
 | XChIP | NChIP |
---|---|---|
Advantages | Suitable for transcription factors or any other weakly binding chromatin-associated proteins; applicable to any organism where native protein is hard to prepare | Testable antibody specificity; better antibody specificity as the target protein is naturally intact; better chromatin and protein recovery efficiency due to better antibody specificity |
Disadvantages | Inefficient chromatin recovery due to disruption of the antibody's target epitope; may cause false-positive results due to fixation of transient proteins to chromatin; wide range of chromatin shearing sizes due to random cutting by sonication | Usually not suitable for non-histone proteins; nucleosomes may rearrange during digestion |
In 1984 John T. Lis and David Gilmour, at the time a graduate student in the Lis lab, used UV irradiation, a zero-length protein-nucleic acid crosslinking agent, to covalently cross-link proteins bound to DNA in living bacterial cells. Following lysis of cross-linked cells and immunoprecipitation of bacterial RNA polymerase, DNA associated with enriched RNA polymerase was hybridized to probes corresponding to different regions of known genes to determine the in vivo distribution and density of RNA polymerase at these genes. A year later they used the same methodology to study the distribution of eukaryotic RNA polymerase II on fruit fly heat shock genes. These reports are considered the pioneering studies in the field of chromatin immunoprecipitation. [9] [10] XChIP was further modified and developed by Alexander Varshavsky and co-workers, who examined the distribution of histone H4 on heat shock genes using formaldehyde cross-linking. [11] [12] This technique was extensively developed and refined thereafter. [13] The NChIP approach was first described by Hebbes et al., 1988, [14] and has also been developed and refined quickly. [15] The typical ChIP assay usually takes 4–5 days and requires at least 10⁶ to 10⁷ cells. Newer ChIP techniques can work with as few as 100 to 1000 cells and be completed within one day.
ChIP has also been applied to genome-wide analysis by combining it with microarray technology (ChIP-on-chip) or second-generation DNA-sequencing technology (ChIP-sequencing). ChIP can also be combined with paired-end tag sequencing in Chromatin Interaction Analysis using Paired End Tag sequencing (ChIA-PET), a technique developed for large-scale, de novo analysis of higher-order chromatin structures. [25] [26] [27]
ChIP-on-chip is a technology that combines chromatin immunoprecipitation ('ChIP') with DNA microarray ("chip"). Like regular ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA in vivo. Specifically, it allows the identification of the cistrome, the sum of binding sites, for DNA-binding proteins on a genome-wide basis. Whole-genome analysis can be performed to determine the locations of binding sites for almost any protein of interest. As the name of the technique suggests, such proteins are generally those operating in the context of chromatin. The most prominent representatives of this class are transcription factors, replication-related proteins, like origin recognition complex protein (ORC), histones, their variants, and histone modifications.
ChIP-sequencing, also known as ChIP-seq, is a method used to analyze protein interactions with DNA. ChIP-seq combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. It can be used to map global binding sites precisely for any protein of interest. Previously, ChIP-on-chip was the most common technique utilized to study these protein–DNA relations.
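The idea of locating binding sites from sequencing reads can be illustrated with a toy enrichment calculation. The sketch below is not a real peak caller: it assumes reads are already mapped to single positions on one chromosome, uses fixed windows, and compares a ChIP sample against a control by a simple fold ratio. All function names, parameters, thresholds, and read positions are hypothetical.

```python
from collections import Counter

def window_counts(read_starts, window=200):
    """Count reads per fixed-size window, keyed by window index."""
    return Counter(pos // window for pos in read_starts)

def enriched_windows(chip_starts, control_starts, window=200,
                     min_fold=4.0, pseudocount=1.0):
    """Return (window_start, fold) for windows enriched in ChIP over control."""
    chip = window_counts(chip_starts, window)
    ctrl = window_counts(control_starts, window)
    # Scale the control to the ChIP library size so read depths are comparable.
    scale = len(chip_starts) / len(control_starts) if control_starts else 1.0
    hits = []
    for idx in sorted(chip):
        # A pseudocount avoids division by zero in windows without control reads.
        fold = (chip[idx] + pseudocount) / (ctrl[idx] * scale + pseudocount)
        if fold >= min_fold:
            hits.append((idx * window, fold))
    return hits

# Made-up read start positions, for illustration only.
chip_reads = [1010, 1020, 1050, 1100, 1150, 1180, 1190, 5000, 9000]
control_reads = [100, 1100, 3000, 5000, 7000, 9000, 9500, 8000, 2500]
print(enriched_windows(chip_reads, control_reads))  # [(1000, 4.0)]
```

Production analyses rely on statistical models of background read density rather than a fixed fold threshold; the fixed cut-off here only keeps the example short.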
Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome. The field is analogous to genomics and proteomics, which are the study of the genome and proteome of a cell. Epigenetic modifications are reversible modifications on a cell's DNA or histones that affect gene expression without altering the DNA sequence. Epigenomic maintenance is a continuous process and plays an important role in the stability of eukaryotic genomes by taking part in crucial biological mechanisms like DNA repair. Plant flavones have been reported to inhibit epigenomic marks that cause cancers. Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as differentiation, development, and tumorigenesis. The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays.
ChIP-exo is a chromatin immunoprecipitation based method for mapping the locations at which a protein of interest binds to the genome. It is a modification of the ChIP-seq protocol, improving the resolution of binding sites from hundreds of base pairs to almost one base pair. It employs the use of exonucleases to degrade strands of the protein-bound DNA in the 5'-3' direction to within a small number of nucleotides of the protein binding site. The nucleotides of the exonuclease-treated ends are determined using some combination of DNA sequencing, microarrays, and PCR. These sequences are then mapped to the genome to identify the locations on the genome at which the protein binds.
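Because the exonuclease trims each strand in the 5'-3' direction until it reaches the bound protein, the 5' ends of forward-strand reads tend to pile up at one border of the binding site and those of reverse-strand reads at the other. A minimal sketch of that reasoning, assuming the 5' end coordinates have already been extracted per strand (the function name and coordinates are hypothetical, and real analyses use more robust statistics than the single most frequent position):

```python
from collections import Counter

def estimate_binding_site(forward_5p_ends, reverse_5p_ends):
    """Estimate a binding interval from strand-specific 5' end pile-ups.

    Returns (left_border, right_border, midpoint), where each border is the
    most frequent 5' end position on the corresponding strand.
    """
    if not forward_5p_ends or not reverse_5p_ends:
        raise ValueError("need 5' ends from both strands")
    left = Counter(forward_5p_ends).most_common(1)[0][0]
    right = Counter(reverse_5p_ends).most_common(1)[0][0]
    if left > right:
        raise ValueError("borders are inconsistent for a single binding site")
    return left, right, (left + right) / 2

# Made-up coordinates: forward-strand ends cluster at 500, reverse at 520.
print(estimate_binding_site([495, 500, 500, 500, 501], [519, 520, 520, 521]))
# (500, 520, 510.0)
```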
H3K27ac is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates acetylation of the lysine residue at N-terminal position 27 of the histone H3 protein.
H3K9me3 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the tri-methylation at the 9th lysine residue of the histone H3 protein and is often associated with heterochromatin.
H3K4me1 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the mono-methylation at the 4th lysine residue of the histone H3 protein and is often associated with gene enhancers.
H3K79me2 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the di-methylation at the 79th lysine residue of the histone H3 protein. H3K79me2 is detected in the transcribed regions of active genes.
H4K5ac is an epigenetic modification to the DNA packaging protein histone H4. It is a mark that indicates the acetylation at the 5th lysine residue of the histone H4 protein. H4K5 is the closest lysine residue to the N-terminal tail of histone H4. It is enriched at the transcription start site (TSS) and along gene bodies. Acetylation of histone H4 at K5 and K12 is enriched at centromeres.
H4K8ac, representing an epigenetic modification to the DNA packaging protein histone H4, is a mark indicating the acetylation at the 8th lysine residue of the histone H4 protein. It has been implicated in the prevalence of malaria.
H4K12ac is an epigenetic modification to the DNA packaging protein histone H4. It is a mark that indicates the acetylation at the 12th lysine residue of the histone H4 protein. H4K12ac is involved in learning and memory. It is possible that restoring this modification could reduce age-related decline in memory.
H3K14ac is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the acetylation at the 14th lysine residue of the histone H3 protein.
H3K9ac is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the acetylation at the 9th lysine residue of the histone H3 protein.
H3K36me2 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the di-methylation at the 36th lysine residue of the histone H3 protein.
MNase-seq, short for micrococcal nuclease digestion with deep sequencing, is a molecular biological technique that was first pioneered in 2006 to measure nucleosome occupancy in the C. elegans genome, and was subsequently applied to the human genome in 2008. The term 'MNase-seq', however, was not coined until a year later, in 2009. Briefly, this technique relies on the non-specific endo-exonuclease micrococcal nuclease, an enzyme derived from the bacterium Staphylococcus aureus, to bind and cleave protein-unbound regions of DNA on chromatin. DNA bound to histones or other chromatin-bound proteins may remain undigested. The uncut DNA is then purified from the proteins and sequenced through one or more of the various next-generation sequencing methods.
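Since the sequenced fragments are the protected, undigested stretches of DNA, occupancy can be approximated by counting how many fragments cover each base. The sketch below assumes fragments are given as half-open (start, end) intervals on a single region; the function name and the example intervals are hypothetical.

```python
def fragment_coverage(fragments, region_length):
    """Per-base coverage from half-open (start, end) fragment intervals."""
    # Difference array: +1 where a fragment starts, -1 just past its end.
    diff = [0] * (region_length + 1)
    for start, end in fragments:
        if not 0 <= start < end <= region_length:
            raise ValueError(f"fragment {(start, end)} lies outside the region")
        diff[start] += 1
        diff[end] -= 1
    # A running sum of the differences gives the depth at each base.
    coverage, depth = [], 0
    for delta in diff[:-1]:
        depth += delta
        coverage.append(depth)
    return coverage

# Made-up fragments on a 10 bp region, for illustration only.
print(fragment_coverage([(0, 5), (2, 7), (3, 5)], 10))
# [1, 1, 2, 3, 3, 1, 1, 0, 0, 0]
```

Positions with high coverage correspond to DNA that was protected from digestion, while positions with zero coverage were either cleaved or not sampled.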
H3R17me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 17th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.
H3R26me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 26th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.
H3R8me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 8th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.
H3R2me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 2nd arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.
H3T45P is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the phosphorylation of the threonine residue at position 45 of the histone H3 protein.
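The mark names listed above follow a regular pattern: histone, one-letter amino acid code, residue position, and modification. A minimal sketch of a parser for that pattern, covering only the residues and modifications that appear in this section (the function name and the returned dictionary layout are illustrative choices, and other marks would need additional table entries):

```python
import re

# One-letter amino acid codes used by the marks in this section.
RESIDUES = {"K": "lysine", "R": "arginine", "T": "threonine"}

# Modification suffixes used by the marks in this section.
MODIFICATIONS = {
    "ac": "acetylation",
    "me1": "mono-methylation",
    "me2": "di-methylation",
    "me3": "tri-methylation",
    "P": "phosphorylation",
}

# Histone name, residue letter, position, modification suffix.
MARK_PATTERN = re.compile(r"^(H(?:1|2A|2B|3|4))([KRT])(\d+)(ac|me[123]|P)$")

def parse_histone_mark(name):
    """Split a mark name such as 'H3K27ac' into its components."""
    match = MARK_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized histone mark: {name!r}")
    histone, residue, position, modification = match.groups()
    return {
        "histone": histone,
        "residue": RESIDUES[residue],
        "position": int(position),
        "modification": MODIFICATIONS[modification],
    }

print(parse_histone_mark("H3K27ac"))
print(parse_histone_mark("H3R17me2"))
```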