CRISPR-Display

Last updated

CRISPR-Display (CRISP-Disp) is a modification of the CRISPR/Cas9 (Clustered regularly interspaced short palindromic repeats) system for genome editing. The CRISPR/Cas9 system uses a short guide RNA (sgRNA) sequence to direct a Streptococcus pyogenes Cas9 nuclease, acting as a programmable DNA binding protein, to cleave DNA at a site of interest. [1] [2]

Contents

CRISPR-Display, in contrast, uses a nuclease deficient Cas9 (dCas9) and an engineered sgRNA with aptameric accessory RNA domains, ranging from 100bp to 5kb, outside of the normal complementary targeting sequence. [3] The accessory RNA domains can be functional domains, such as long non-coding RNAs (lncRNAs), protein-binding motifs, or epitope tags for immunochemistry. This allows for investigation of the functionality of certain lncRNAs, and targeting of ribonucleoprotein (RNP) complexes to genomic loci.

CRISPR-Display was first published in Nature Methods in July 2015, and developed by David M. Shechner, Ezgi Hacisuleyman, Scott T. Younger and John Rinn at Harvard University and Massachusetts Institute of Technology (MIT), USA.

Background

The CRISPR/Cas9 system is based on an adaptive immune system of prokaryotic organisms, and its use for genome editing was first proposed and developed in collaboration between Jennifer Doudna (University of California, Berkeley) and Emmanuelle Charpentier (Max Planck Institute for Infection Biology, Germany). [1] The method, and its application in editing human cells, was published in Science on August 17, 2012. In January 2013, the Feng Zhang lab at the Broad Institute at MIT published another method in Science, having further optimized the sgRNA structure and expression for use in mammalian cells. [2] By the beginning of 2014, almost 2500 studies mentioning CRISPR in their title has been published.

Non-coding RNAs (ncRNAs) are RNA transcripts that are not translated into a protein product, but instead exert their function as RNA molecules. They are involved in a range of processes, like post-transcriptional regulation of gene expression, genomic imprinting, and regulating the chromatin state, and thereby the expression, of a given locus. Many ncRNAs have been discovered, but in many cases, their function has yet to be accurately dissected due to technical challenges. ncRNA function is often not affected by introducing point mutations and premature stop codons. ncRNAs are also thought to regulate gene expression, so deletion studies have a hard time distinguishing effects of ncRNA loss from effects of gene misregulation due to the deletion. Studies of ncRNAs have also lacked the throughput necessary for discerning the RNA based functionality. To meet these challenges, the Rinn lab therefore developed a synthetic biology approach, using CRISPR/Cas9 system, with the Cas9 acting as a conduit, to target ncRNA modules to ectopic genomic locations, and investigating the ncRNAs effects on reporter genes and other genomic features at that site.

Method development

CRISPR-Disp modifies the CRISPR/Cas9 technology by using a catalytically inactive, i.e. nuclease deficient, Cas9 mutant (dCas9), and altering the RNA used for targeting Cas9 to a genomic location.

Since sgRNAs are usually expressed by RNA polymerase III, which limits the length of the RNA domain that can be inserted, CRISPR-Display incorporates RNA polymerase II to permit expression of longer transcripts (~80–250 nucleotides) to overcome this limitation. CRISPR-Display can therefore add larger RNA domains, like natural and lncRNA domains, without affecting dCas9 localization.

The sgRNA is engineered with an aptameric accessory RNA domain in the sequence outside of the targeting sequence. In the development of the technique, five model cofactors with different topology constructs were used: TOP1-4 and INT with an accessory domain (P4-P6 domain) at different positions, including the 5’ and 3’ end and internally within the sgRNA. Each domain contained a stem-loop that can be recognized by a PP7 bacteriophage coat protein. The complex was delivered into mammalian cells (HEK293FT cells) by a lentiviral vector.

Transcription activator assay used to verify sgRNA/Cas9 complex targeting activity, and the proper integrity of the added RNA module. Direct activation: The transcription activator, VP64, is fused to the Cas9 protein, so proper targeting of the complex results in reporter gene activation. Bridged activation: VP64 is fused to PP7, which recognizes and binds a sequence in the RNA module when the RNA module is properly folded. Transcription activator assay.png
Transcription activator assay used to verify sgRNA/Cas9 complex targeting activity, and the proper integrity of the added RNA module. Direct activation: The transcription activator, VP64, is fused to the Cas9 protein, so proper targeting of the complex results in reporter gene activation. Bridged activation: VP64 is fused to PP7, which recognizes and binds a sequence in the RNA module when the RNA module is properly folded.

To ensure that the attached RNA module both retains targeting functionality as well as the resulting complex drive transcriptional activation at a specific site of interest, transient reporter gene expression of luciferase and fluorescent protein was measured. Two variations of such a transcription activator assay was performed; directly with a dCas9 fused to a transcriptional activator/repressor (VP64, a factor known to enhance gene expression) (Direct activation) or indirectly where the transcriptional activator is fused to an RNA binding protein module on the sgRNA (Bridged activation). Reporter gene activation through direct activation imply the sgRNA variant binds and targets dCas9 efficiently. All the five topologies showed direct activation except TOP3 and TOP4, which showed reduced activity. Bridged activation indicates that the fused RNA accessory domain is intact in mature dCas9 complexes. Bridged activation was observed with TOP1, TOP3 and INT. The results were recapitulated at endogenous loci by targeting minimal sgRNA and selective expanded topologies (TOP1 and INT) to human ASCL1, IL1RN, NTF3 and TTN promoters. Direct and bridged activation were observed by qRT-PCR for each construct proving that CRISP-Disp allows deployment of large RNA domains to genomic loci.

The effect of internal (stem-loop) insertion size on dCas9 complex was assessed using INT-like constructs with cassettes of PP7 stem loops with a size range from 25 nt to 247 nt. Each construct induced significant activation in the reporter assays signifying that internal insertion size does not influence the dCas9 complex function. Similarly, the effect of internal insert sequence was also determined through a set of unique sgRNA variants displaying cassettes of 25 random nucleotides. Reporter assays and RIP-Seq confirmed that sequence does not govern complex efficacy.

The utility of CRISP-Disp was explored with an array of functional RNA domains such as natural protein binding motifs, artificial aptamers and small molecules with varying size. While all the complexes were functional and viable, and successfully deployed the RNA domains at endogenous loci, the efficacy changed with length and expression levels. This suggests that optimization of structure and sequence might be important required before designing the construct.

To determine if artificial lncRNA scaffolds can be used with CRISPR-Display, dCas9 complexes were assembled with artificial RNA with a size comparable to lncRNAs. The constructs were expanded to ~650nt size with an additional P4-P6 domain with hairpin loops that can be recognized by another phage coat protein, MS2. These topology constructs were called double TOP0-2 with the two domains either together at 5’ or 3’end or separately at each end. Transient reporter assays followed by confirmation with RNA Immunoprecipitation sequencing (RIP-qPCR) showed that all the three constructs retained both the domains in the complex.

This was also tested with natural lncRNA domains by building Pol II-driven TOP1 and INT constructs fused with human lncRNA domains. lncRNAs used had lengths between ~90–4800 nt, and included the NoRC-binding pRNA, three enhancer-transcribed RNAs (eRNAs) FALEC, TRERNA1 and ncRNA-a3), Xist A-repeat (RepA), and the 4,799-nt transcriptional activator HOTTIP . While all the constructs showed significant direct activation, it decreased with increasing lncRNA-sgRNA length. These lncRNA domains could regulate the reporters independent of dCas9 with pRNA and RepA repressing the GLuc reporter expression (repressors) and TRERNA1, ncRNA-a3 and HOTTIP inducing activation (activators), but were properly targeted to an ectopic location of interest by using the CRISP-Disp system. [3]

Thus, CRISP-Disp enables control of gene expression with deployment of both artificial scaffolds as well as natural lncRNA domains.

Applications

CRISPR-Display allows for previously unavailable studies of lncRNA functionality, artificial ncRNA functionalization, recruitment of endogenous and engineered proteins to genomic loci, and locus affinity tagging for cell imaging.

CRISPR-Display allows addition of RNA modules to the sgRNA for a range of functionalities, such as ectopic lncRNA localization, recruitment of endogenous or engineered RNA binding proteins for gene regulation, or affinity tagging for live cell imaging. The functions can be performed simultaneous in the same cell. CRISPR-Display applications.png
CRISPR-Display allows addition of RNA modules to the sgRNA for a range of functionalities, such as ectopic lncRNA localization, recruitment of endogenous or engineered RNA binding proteins for gene regulation, or affinity tagging for live cell imaging. The functions can be performed simultaneous in the same cell.

lncRNA domain localization

CRISPR-Display allows targeted localization of natural lncRNAs to ectopic sites for investigation of their function. Exposing various ectopic DNA loci to natural lncRNAs can help show the effects of lncRNAs on gene expression and chromatin state, and help dissect the mechanism of such effects. One of the major outstanding questions in the study of lncRNAs is whether effects on chromatin state or gene expression adjacent to a lncRNA locus is due to functional, sequence-specific mechanisms of the lncRNA itself, or due simply to the act of transcribing the lncRNA. [4] [5] Localizing lncRNA to ectopic sites with CRISPR-Display can help separate the function of the RNA itself from the effects of transcribing such RNA species. Before CRISPR-Display, such studies were challenging due to low throughput, and inability to distinguish lncRNA function from other confounding factors like cryptically encoded peptides or functional DNA elements.

Artificial ncRNA functionalization

CRISPR-Display also allows for targeted use of the wide array of artificial RNAs with specific functionality, such as RNAs for recruitment of endogenous RNA-binding proteins, antibody affinity tagging, and recruitment of tagged fusion proteins.

Affinity tagging for live cell imaging

One example of artificial ncRNA functionalization is incorporating RNA domains recognized by specific antibodies to the sgRNA. CRISPR-Display can target the sgRNA with a particular epitope sequence to various loci, and fluorescently tagged antibodies can be used to image the locus, showing its localization in the nucleus, and possible interactions with other tagged proteins or genomic loci.

Recruitment of endogenous or engineered RNA binding proteins for gene regulation

Endogenous proteins known to bind a specific RNA motif can be recruited to ectopic genomic locations by incorporating the RNA motif into the sgRNA. CRISPR-Display can also recruit fusion proteins engineered to bind specific RNA sequences. Recruiting these proteins can allow studies of specific proteins’ and protein complexes’ effects on gene regulation and chromatin states, as well as specific regulation of certain genes for investigation of gene function.

Multiplexed functional studies

Due to the modularity of the sgRNA, several different sgRNAs with distinct functional modules can be expressed in each cell at once. The different RNA modules can then work simultaneously and independently, allowing for, for example, regulation of one genomic location whilst imaging the effects of the regulation at another location.

The possible applications of CRISPR-Display will continue to increase with further development and understanding of ncRNA functionalization. It is not unreasonable to think that CRISPR-Display may one day enable complex synthetic biology systems, with distinct temporal expression of sgRNAs, and networks and circuits of gene regulation by targeting of regulatory proteins.

Advantages

Limitations

Related Research Articles

Gene knockouts are a widely used genetic engineering technique that involves the targeted removal or inactivation of a specific gene within an organism's genome. This can be done through a variety of methods, including homologous recombination, CRISPR-Cas9, and TALENs.

<span class="mw-page-title-main">Zinc finger</span> Small structural protein motif found mostly in transcriptional proteins

A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) which stabilizes the fold. It was originally coined to describe the finger-like appearance of a hypothesized structure from the African clawed frog (Xenopus laevis) transcription factor IIIA. However, it has been found to encompass a wide variety of differing protein structures in eukaryotic cells. Xenopus laevis TFIIIA was originally demonstrated to contain zinc and require the metal for function in 1983, the first such reported zinc requirement for a gene regulatory protein followed soon thereafter by the Krüppel factor in Drosophila. It often appears as a metal-binding domain in multi-domain proteins.

Gene knockdown is an experimental technique by which the expression of one or more of an organism's genes is reduced. The reduction can occur either through genetic modification or by treatment with a reagent such as a short DNA or RNA oligonucleotide that has a sequence complementary to either gene or an mRNA transcript.

<span class="mw-page-title-main">Regulation of gene expression</span> Modifying mechanisms used by cells to increase or decrease the production of specific gene products

Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products. Sophisticated programs of gene expression are widely observed in biology, for example to trigger developmental pathways, respond to environmental stimuli, or adapt to new food sources. Virtually any step of gene expression can be modulated, from transcriptional initiation, to RNA processing, and to the post-translational modification of a protein. Often, one gene regulator controls another, and so on, in a gene regulatory network.

<span class="mw-page-title-main">Functional genomics</span> Field of molecular biology

Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "candidate-gene" approach.

<span class="mw-page-title-main">CRISPR</span> Family of DNA sequence found in prokaryotic organisms

CRISPR is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. These sequences are derived from DNA fragments of bacteriophages that had previously infected the prokaryote. They are used to detect and destroy DNA from similar bacteriophages during subsequent infections. Hence these sequences play a key role in the antiviral defense system of prokaryotes and provide a form of acquired immunity. CRISPR is found in approximately 50% of sequenced bacterial genomes and nearly 90% of sequenced archaea.

Guide RNA (gRNA) or single guide RNA (sgRNA) is a short sequence of RNA that functions as a guide for the Cas9-endonuclease or other Cas-proteins that cut the double-stranded DNA and thereby can be used for gene editing. In bacteria and archaea, gRNAs are a part of the CRISPR-Cas system that serves as an adaptive immune defense that protects the organism from viruses. Here the short gRNAs serve as detectors of foreign DNA and direct the Cas-enzymes that degrades the foreign nucleic acid.

<span class="mw-page-title-main">Artificial transcription factor</span>

Artificial transcription factors (ATFs) are engineered individual or multi molecule transcription factors that either activate or repress gene transcription (biology).

<span class="mw-page-title-main">Long non-coding RNA</span> Non-protein coding transcripts longer than 200 nucleotides

Long non-coding RNAs are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), and other short RNAs. Given that some lncRNAs have been reported to have the potential to encode small proteins or micro-peptides, the latest definition of lncRNA is a class of RNA molecules of over 200 nucleotides that have no or limited coding capacity. Long intervening/intergenic noncoding RNAs (lincRNAs) are sequences of lncRNA which do not overlap protein-coding genes.

<span class="mw-page-title-main">Genome editing</span> Type of genetic engineering

Genome editing, or genome engineering, or gene editing, is a type of genetic engineering in which DNA is inserted, deleted, modified or replaced in the genome of a living organism. Unlike early genetic engineering techniques that randomly inserts genetic material into a host genome, genome editing targets the insertions to site-specific locations. The basic mechanism involved in genetic manipulations through programmable nucleases is the recognition of target genomic loci and binding of effector DNA-binding domain (DBD), double-strand breaks (DSBs) in target DNA by the restriction endonucleases, and the repair of DSBs through homology-directed recombination (HDR) or non-homologous end joining (NHEJ).

<span class="mw-page-title-main">Cas9</span> Microbial protein found in Streptococcus pyogenes M1 GAS

Cas9 is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic engineering applications. Its main function is to cut DNA and thereby alter a cell's genome. The CRISPR-Cas9 genome editing technique was a significant contributor to the Nobel Prize in Chemistry in 2020 being awarded to Emmanuelle Charpentier and Jennifer Doudna.

<span class="mw-page-title-main">CRISPR interference</span> Genetic perturbation technique

CRISPR interference (CRISPRi) is a genetic perturbation technique that allows for sequence-specific repression of gene expression in prokaryotic and eukaryotic cells. It was first developed by Stanley Qi and colleagues in the laboratories of Wendell Lim, Adam Arkin, Jonathan Weissman, and Jennifer Doudna. Sequence-specific activation of gene expression refers to CRISPR activation (CRISPRa).

<span class="mw-page-title-main">Epigenome editing</span>

Epigenome editing or epigenome engineering is a type of genetic engineering in which the epigenome is modified at specific sites using engineered molecules targeted to those sites. Whereas gene editing involves changing the actual DNA sequence itself, epigenetic editing involves modifying and presenting DNA sequences to proteins and other DNA binding factors that influence DNA function. By "editing” epigenomic features in this manner, researchers can determine the exact biological role of an epigenetic modification at the site in question.

<span class="mw-page-title-main">Cas12a</span> DNA-editing technology

Cas12a is a subtype of Cas12 proteins and an RNA-guided endonuclease that forms part of the CRISPR system in some bacteria and archaea. It originates as part of a bacterial immune mechanism, where it serves to destroy the genetic material of viruses and thus protect the cell and colony from viral infection. Cas12a and other CRISPR associated endonucleases use an RNA to target nucleic acid in a specific and programmable matter. In the organisms from which it originates, this guide RNA is a copy of a piece of foreign nucleic acid that previously infected the cell.

No-SCAR genome editing is an editing method that is able to manipulate the Escherichia coli genome. The system relies on recombineering whereby DNA sequences are combined and manipulated through homologous recombination. No-SCAR is able to manipulate the E. coli genome without the use of the chromosomal markers detailed in previous recombineering methods. Instead, the λ-Red recombination system facilitates donor DNA integration while Cas9 cleaves double-stranded DNA to counter-select against wild-type cells. Although λ-Red and Cas9 genome editing are widely used technologies, the no-SCAR method is novel in combining the two functions; this technique is able to establish point mutations, gene deletions, and short sequence insertions in several genomic loci with increased efficiency and time sensitivity.

CRISPR activation (CRISPRa) is a type of CRISPR tool that uses modified versions of CRISPR effectors without endonuclease activity, with added transcriptional activators on dCas9 or the guide RNAs (gRNAs).

Off-target genome editing refers to nonspecific and unintended genetic modifications that can arise through the use of engineered nuclease technologies such as: clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9, transcription activator-like effector nucleases (TALEN), meganucleases, and zinc finger nucleases (ZFN). These tools use different mechanisms to bind a predetermined sequence of DNA (“target”), which they cleave, creating a double-stranded chromosomal break (DSB) that summons the cell's DNA repair mechanisms and leads to site-specific modifications. If these complexes do not bind at the target, often a result of homologous sequences and/or mismatch tolerance, they will cleave off-target DSB and cause non-specific genetic modifications. Specifically, off-target effects consist of unintended point mutations, deletions, insertions inversions, and translocations.

<span class="mw-page-title-main">CRISPR gene editing</span> Gene editing method

CRISPR gene editing is a genetic engineering technique in molecular biology by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense system. By delivering the Cas9 nuclease complexed with a synthetic guide RNA (gRNA) into a cell, the cell's genome can be cut at a desired location, allowing existing genes to be removed and/or new ones added in vivo.

<span class="mw-page-title-main">Genome-wide CRISPR-Cas9 knockout screens</span> Research tool in genomics

Genome-wide CRISPR-Cas9 knockout screens aim to elucidate the relationship between genotype and phenotype by ablating gene expression on a genome-wide scale and studying the resulting phenotypic alterations. The approach utilises the CRISPR-Cas9 gene editing system, coupled with libraries of single guide RNAs (sgRNAs), which are designed to target every gene in the genome. Over recent years, the genome-wide CRISPR screen has emerged as a powerful tool for performing large-scale loss-of-function screens, with low noise, high knockout efficiency and minimal off-target effects.

Robert E. Kingston is an American biochemist who studies the functional and regulatory role nucleosomes play in gene expression, specifically during early development. After receiving his PhD (1981) and completing post-doctoral research, Kingston became an assistant professor at Massachusetts General Hospital (1985), where he started a research laboratory focused on understanding chromatin's structure with regards to transcriptional regulation. As a Harvard graduate himself, Kingston has served his alma mater through his leadership.

References

  1. 1 2 Jinek, Martin; Chylinski, Krzysztof; Fonfara, Ines; Hauer, Michael; Doudna, Jennifer A.; Charpentier, Emmanuelle (2012-08-17). "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity". Science. 337 (6096): 816–821. Bibcode:2012Sci...337..816J. doi:10.1126/science.1225829. ISSN   1095-9203. PMC   6286148 . PMID   22745249.
  2. 1 2 Cong, Le; Ran, F. Ann; Cox, David; Lin, Shuailiang; Barretto, Robert; Habib, Naomi; Hsu, Patrick D.; Wu, Xuebing; Jiang, Wenyan (2013-02-15). "Multiplex genome engineering using CRISPR/Cas systems". Science. 339 (6121): 819–823. Bibcode:2013Sci...339..819C. doi:10.1126/science.1231143. ISSN   1095-9203. PMC   3795411 . PMID   23287718.
  3. 1 2 Shechner, David M.; Hacisuleyman, Ezgi; Younger, Scott T.; Rinn, John L. (2015-07-01). "Multiplexable, locus-specific targeting of long RNAs with CRISPR-Display". Nature Methods. 12 (7): 664–670. doi:10.1038/nmeth.3433. ISSN   1548-7105. PMC   4821475 . PMID   26030444.
  4. Perez-Pinera, Pablo; Jones, Matthew F.; Lal, Ashish; Lu, Timothy K. (2015-07-16). "Putting Non-coding RNA on Display with CRISPR". Molecular Cell. 59 (2): 146–148. doi:10.1016/j.molcel.2015.07.002. ISSN   1097-4164. PMC   6314022 . PMID   26186289.
  5. Melé, Marta; Rinn, John L. (2016-06-02). ""Cat's Cradling" the 3D Genome by the Act of LncRNA Transcription". Molecular Cell. 62 (5): 657–664. doi: 10.1016/j.molcel.2016.05.011 . ISSN   1097-4164. PMID   27259198.