ChiRP-Seq

Last updated

ChiRP-Seq (Chromatin Isolation by RNA purification) is a high-throughput sequencing method to discover regions of the genome which are bound by a specific RNA (or by a ribonucleoprotein containing the RNA of interest). Recent studies have shown that a significant proportion of some genomes (including mouse and human genomes) synthesize RNA that apparently do not code for proteins. The function of most of these non-coding RNA still has to be ascertained. Various genomic methods are being developed to map the functional association of these novel RNA to distinct regions of the genome to gain a better understanding of their function. ChiRP-Seq is one of these new methods which uses the massively parallel sequencing capability of 2nd generation sequencers to catalog the binding sites of these novel RNA molecules on a genome.

Although many have believed that RNAs mainly encode for proteins a very large portion of the eukaryotic genome is composed of RNAs that do not. These RNAs were originally considered junk until new advancements lead to the realization that they may indeed have a biological purpose. Over the last few years lncRNAs have been the least explored and functionally characterized emerging regulatory molecules, especially in comparison to their short counterparts, small ncRNAs. [1] ChiRP-Seq is a new technique that has allowed us to map long RNA occupancy across the genome at a higher resolution than ever before. ChiRP-Seq works via affinity capture of a target complex of lncRNA and chromatin by tiling antisense-oligos. [2] This technique will allow scientists to generate a map of genomic binding sites of several hundred bases very accurately due to high sensitivity and low background.

Overview of method

Tens of oligonucleotide probes are designed to be complementary to the RNA of interest. These oligos are labeled with biotin. Cells are cross-linked by UV or formalin and nuclei are isolated from these treated cells. The isolated nuclei were lysed and the released chromatin was fragmented by sonication to produce approximately 100-500 bp sized fragments. These chromatin fragments were hybridized to the biotinylated probe set. Complexes containing biotin-probe + RNA of interest + DNA fragment are captured by magnetic beads labeled with streptavidin.

Overview of a next generation sequencing method to characterize RNA binding sites to chromatin. Chromatin isolation by RNA purification. ChiRP-Seq.png
Overview of a next generation sequencing method to characterize RNA binding sites to chromatin. Chromatin isolation by RNA purification.

DNA is isolated from an aliquot of the bound complex by treatment with RNAse (or proteinase followed by RNAse) to digest associated protein and RNA. RNA may also be isolated from an additional aliquot of the bound complex to detect other RNA molecules associated with the RNA of interest. The purified DNA is then used to prepare a sequencing library and the library is sequenced on a next generation DNA sequencing system. The sequencing reads are then mapped to the genome. A pile-up of reads at specific locations on the genome indicates that the RNA of interest had bound to that region of the genome. This helps delineate specific genomic regions that interact with RNA. For example, genomic targets of enhancer RNA which act at a distance from their site of synthesis can be easily evaluated by ChiRP-Seq. [3] [4]

Related Research Articles

Complementary DNA Single-stranded DNA synthesized from RNA

In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to clone eukaryotic genes in prokaryotes. When scientists want to express a specific protein in a cell that does not normally express that protein, they will transfer the cDNA that codes for the protein to the recipient cell. In molecular biology, cDNA is also generated to analyze transcriptomic profiles in bulk tissue, single cells, or single nuclei in assays such as microarrays and RNA-seq.

Functional genomics Field of molecular biology

Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "gene-by-gene" approach.

Cross-linking immunoprecipitation (CLIP) is a method used in molecular biology that combines UV cross-linking with immunoprecipitation in order to analyse protein interactions with RNA or to precisely locate RNA modifications. CLIP-based techniques can be used to map RNA binding protein binding sites or RNA modification sites of interest on a genome-wide scale, thereby increasing the understanding of post-transcriptional regulatory networks.

ChIP-on-chip Molecular biology method

ChIP-on-chip is a technology that combines chromatin immunoprecipitation ('ChIP') with DNA microarray ("chip"). Like regular ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA in vivo. Specifically, it allows the identification of the cistrome, the sum of binding sites, for DNA-binding proteins on a genome-wide basis. Whole-genome analysis can be performed to determine the locations of binding sites for almost any protein of interest. As the name of the technique suggests, such proteins are generally those operating in the context of chromatin. The most prominent representatives of this class are transcription factors, replication-related proteins, like origin recognition complex protein (ORC), histones, their variants, and histone modifications.

ChIP-sequencing, also known as ChIP-seq, is a method used to analyze protein interactions with DNA. ChIP-seq combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. It can be used to map global binding sites precisely for any protein of interest. Previously, ChIP-on-chip was the most common technique utilized to study these protein–DNA relations.

DNA adenine methyltransferase identification, often abbreviated DamID, is a molecular biology protocol used to map the binding sites of DNA- and chromatin-binding proteins in eukaryotes. DamID identifies binding sites by expressing the proposed DNA-binding protein as a fusion protein with DNA methyltransferase. Binding of the protein of interest to DNA localizes the methyltransferase in the region of the binding site. Adenine methylation does not occur naturally in eukaryotes and therefore adenine methylation in any region can be concluded to have been caused by the fusion protein, implying the region is located near a binding site. DamID is an alternate method to ChIP-on-chip or ChIP-seq.

RNA immunoprecipitation chip

RIP-chip is a molecular biology technique which combines RNA immunoprecipitation with a microarray. The purpose of this technique is to identify which RNA sequences interact with a particular RNA binding protein of interest in vivo. It can also be used to determine relative levels of gene expression, to identify subsets of RNAs which may be co-regulated, or to identify RNAs that may have related functions. This technique provides insight into the post-transcriptional gene regulation which occurs between RNA and RNA binding proteins.

Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome. The field is analogous to genomics and proteomics, which are the study of the genome and proteome of a cell. Epigenetic modifications are reversible modifications on a cell's DNA or histones that affect gene expression without altering the DNA sequence. Epigenomic maintenance is a continuous process and plays an important role in stability of eukaryotic genomes by taking part in crucial biological mechanisms like DNA repair. Plant flavones are said to be inhibiting epigenomic marks that cause cancers. Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as in differentiation/development and tumorigenesis. The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays.

Paired-end tags (PET) are the short sequences at the 5’ and 3' ends of a DNA fragment which are unique enough that they (theoretically) exist together only once in a genome, therefore making the sequence of the DNA in between them available upon search or upon further sequencing. Paired-end tags (PET) exist in PET libraries with the intervening DNA absent, that is, a PET "represents" a larger fragment of genomic or cDNA by consisting of a short 5' linker sequence, a short 5' sequence tag, a short 3' sequence tag, and a short 3' linker sequence. It was shown conceptually that 13 base pairs are sufficient to map tags uniquely. However, longer sequences are more practical for mapping reads uniquely. The endonucleases used to produce PETs give longer tags but sequences of 50–100 base pairs would be optimal for both mapping and cost efficiency. After extracting the PETs from many DNA fragments, they are linked (concatenated) together for efficient sequencing. On average, 20–30 tags could be sequenced with the Sanger method, which has a longer read length. Since the tag sequences are short, individual PETs are well suited for next-generation sequencing that has short read lengths and higher throughput. The main advantages of PET sequencing are its reduced cost by sequencing only short fragments, detection of structural variants in the genome, and increased specificity when aligning back to the genome compared to single tags, which involves only one end of the DNA fragment.

Chromatin Interaction Analysis by Paired-End Tag Sequencing is a technique that incorporates chromatin immunoprecipitation (ChIP)-based enrichment, chromatin proximity ligation, Paired-End Tags, and High-throughput sequencing to determine de novo long-range chromatin interactions genome-wide.

Chromatin immunoprecipitation

Chromatin immunoprecipitation (ChIP) is a type of immunoprecipitation experimental technique used to investigate the interaction between proteins and DNA in the cell. It aims to determine whether specific proteins are associated with specific genomic regions, such as transcription factors on promoters or other DNA binding sites, and possibly define cistromes. ChIP also aims to determine the specific location in the genome that various histone modifications are associated with, indicating the target of the histone modifiers. ChIP is crucial for the advancements in the field of epigenomics and learning more about epigenetic phenomena.

ChIP-exo

ChIP-exo is a chromatin immunoprecipitation based method for mapping the locations at which a protein of interest binds to the genome. It is a modification of the ChIP-seq protocol, improving the resolution of binding sites from hundreds of base pairs to almost one base pair. It employs the use of exonucleases to degrade strands of the protein-bound DNA in the 5'-3' direction to within a small number of nucleotides of the protein binding site. The nucleotides of the exonuclease-treated ends are determined using some combination of DNA sequencing, microarrays, and PCR. These sequences are then mapped to the genome to identify the locations on the genome at which the protein binds.

Chem-seq is a technique that is used to map genome-wide interactions between small molecules and their protein targets in the chromatin of eukaryotic cell nuclei. The method employs chemical affinity capture coupled with massively parallel DNA sequencing to identify genomic sites where small molecules interact with their target proteins or DNA. It was first described by Lars Anders et al. in the January, 2014 issue of "Nature Biotechnology".

CUT&RUN sequencing, also known as cleavage under targets and release using nuclease, is a method used to analyze protein interactions with DNA. CUT&RUN sequencing combines antibody-targeted controlled cleavage by micrococcal nuclease with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. It can be used to map global DNA binding sites precisely for any protein of interest. Currently, ChIP-Seq is the most common technique utilized to study protein–DNA relations, however, it suffers from a number of practical and economical limitations that CUT&RUN sequencing does not.

Spatial transcriptomics Range of methods designed for assigning cell types

Spatial transcriptomics is an overarching term for a range of methods designed for assigning cell types to their locations in the histological sections. This method can also be used to determine subcellular localization of mRNA molecules. The term is a variation of Spatial Genomics, first described by Doyle, et al., in 2000 and then expanded upon by Ståhl et al. in a technique developed in 2016, which has since undergone a variety of improvements and modifications.

BLESS, also known as breaks labeling, enrichment on streptavidin and next-generation sequencing, is a method used to detect genome-wide double-strand DNA damage. In contrast to chromatin immunoprecipitation (ChIP)-based methods of identifying DNA double-strand breaks (DSBs) by labeling DNA repair proteins, BLESS utilizes biotinylated DNA linkers to directly label genomic DNA in situ which allows for high-specificity enrichment of samples on streptavidin beads and the subsequent sequencing-based DSB mapping to nucleotide resolution.

CUT&Tag-sequencing, also known as cleavage under targets and tagmentation, is a method used to analyze protein interactions with DNA. CUT&Tag-sequencing combines antibody-targeted controlled cleavage by a protein A-Tn5 fusion with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. It can be used to map global DNA binding sites precisely for any protein of interest. Currently, ChIP-Seq is the most common technique utilized to study protein–DNA relations, however, it suffers from a number of practical and economical limitations that CUT&RUN and CUT&Tag sequencing do not. CUT&Tag sequencing is an improvement over CUT&RUN because it does not require cells to be lysed or chromatin to be fractionated. CUT&RUN is not suitable for single-cell platforms so CUT&Tag is advantageous for these.

ChIL sequencing (ChIL-seq), also known as Chromatin Integration Labeling sequencing, is a method used to analyze protein interactions with DNA. ChIL-sequencing combines antibody-targeted controlled cleavage by Tn5 transposase with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. It can be used to map global DNA binding sites precisely for any protein of interest. Currently, ChIP-Seq is the most common technique utilized to study protein–DNA relations, however, it suffers from a number of practical and economical limitations that ChIL-Sequencing does not. ChIL-Seq is a precise technique that reduces sample loss could be applied to single-cells.

H3R17me2 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the di-methylation at the 17th arginine residue of the histone H3 protein. In epigenetics, arginine methylation of histones H3 and H4 is associated with a more accessible chromatin structure and thus higher levels of transcription. The existence of arginine demethylases that could reverse arginine methylation is controversial.

Ribose-seq Genetic mapping technique

Ribose-seq is a mapping technique used in genetics research to determine the full profile of embedded ribonucleotides, specifically ribonucleoside monophosphates (rNMPs), in genomic DNA. Embedded ribonucleotides are thought to be the most common alteration to DNA in cells, and their presence in genomic DNA can affect genome stability. As recent studies have suggested that ribonucleotides in mouse DNA may affect disease pathology, ribonucleotide incorporation in genomic DNA has become an important target of medical genetics research. Ribose-seq allows scientists to determine the precise location and type of ribonucleotides that have been incorporated into eukaryotic or prokaryotic DNA.

References

  1. Chu, Quinn, J., & Chang, H. Y. (2012). Chromatin isolation by RNA purification (ChIRP). Journal of Visualized Experiments, 61. https://doi.org/10.3791/3912
  2. Jathar, Kumar, V., Srivastava, J., & Tripathi, V. (2017). Technological Developments in lncRNA Biology. Long Non Coding RNA Biology, 283–323. https://doi.org/10.1007/978-981-10-5203-3_10
  3. Chu, Ci; Qu, Kun; Zhong, Franklin L.; Artandi, Steven E.; Chang, Howard Y. (31 August 2011). "Genomic Maps of Long Noncoding RNA Occupancy Reveal Principles of RNA-Chromatin Interactions". Molecular Cell. 44: 667–678. doi:10.1016/j.molcel.2011.08.027. PMC   3249421 . PMID   21963238.
  4. Li, W.; Notani, D.; Ma, Q.; Tanasa, B.; Nunez, E.; et al. (2013). "Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation". Nature. 498: 516–520. doi:10.1038/nature12210. PMC   3718886 . PMID   23728302.