Part of a series on |
CRISPR |
---|
Genome editing: CRISPR-Cas |
variants: Anti-CRISPR - CIRTS - CRISPeYCRISPR-Cas10 - CRISPR-Cas13 - CRISPR-BEST CRISPR-Disp - CRISPR-Gold - CRISPRa - CRISPRi Easi-CRISPR - FACE |
Enzyme |
Cas9 - FokI - EcoRI - PstI - SmaI HaeIII - Cas12a (Cpf1) - xCas9 |
Applications |
CAMERA - ICE - Genética dirigida |
other Genome editing method: |
Prime editing - Pro-AG - RESCUE - TALEN - ZFN - LEAPER |
CRISPR interference (CRISPRi) is a genetic perturbation technique that allows for sequence-specific repression of gene expression in prokaryotic and eukaryotic cells. [1] It was first developed by Stanley Qi and colleagues in the laboratories of Wendell Lim, Adam Arkin, Jonathan Weissman, and Jennifer Doudna. [2] Sequence-specific activation of gene expression refers to CRISPR activation (CRISPRa).
Based on the bacterial genetic immune system - CRISPR (clustered regularly interspaced short palindromic repeats) pathway, [3] the technique provides a complementary approach to RNA interference. The difference between CRISPRi and RNAi, though, is that CRISPRi regulates gene expression primarily on the transcriptional level, while RNAi controls genes on the mRNA level.
Many bacteria and most archaea have an adaptive immune system which incorporates CRISPR RNA (crRNA) and CRISPR-associated (cas) genes.
The CRISPR interference (CRISPRi) technique was first reported by Lei S. Qi and researchers at the University of California at San Francisco in early 2013. [2] The technology uses a catalytically dead Cas9 (usually denoted as dCas9) protein that lacks endonuclease activity to regulate genes in an RNA-guided manner. Targeting specificity is determined by complementary base-pairing of a single guide RNA (sgRNA) to the genomic locus. sgRNA is a chimeric noncoding RNA that can be subdivided into three regions: a 20 nt base-pairing sequence, a 42 nt dCas9-binding hairpin and a 40 nt terminator (bacteria, [4] [5] [6] yeast, [7] fruit flies, [8] zebrafish, [9] mice [10] ).
When designing a synthetic sgRNA, only the 20 nt base-pairing sequence is modified. Secondary variables must also be considered: off-target effects (for which a simple BLAST run of the base-pairing sequence is required), maintenance of the dCas9-binding hairpin structure, and ensuring that no restriction sites are present in the modified sgRNA, as this may pose a problem in downstream cloning steps. Due to the simplicity of sgRNA design, this technology is amenable to genome-wide scaling. [11] CRISPRi relies on the generation of catalytically inactive Cas9. This is accomplished by introducing point mutations in the two catalytic residues (D10A and H840A) of the gene encoding Cas9. [12] In doing so, dCas9 is unable to cleave dsDNA but retains the ability to target DNA. Together, sgRNA and dCas9 constitute a minimal system for gene-specific regulation. [2]
CRISPRi can sterically repress transcription by blocking either transcriptional initiation or elongation. This is accomplished by designing sgRNA complementary to the promoter or the exonic sequences. The level of transcriptional repression with a target within the coding sequence is strand-specific. Depending on the nature of the CRISPR effector, either the template or non-template strand leads to stronger repression. [13] For dCas9 (based on a Type-2 CRISPR system), repression is stronger when the guide RNA is complementary to the non-template strand. It has been suggested that this is due to the activity of helicase, which unwinds the RNA:DNA heteroduplex ahead of RNA pol II when the sgRNA is complementary to the template strand. Unlike transcription elongation block, silencing is independent of the targeted DNA strand when targeting the transcriptional start site. In prokaryotes, this steric inhibition can repress transcription of the target gene by almost 99.9%; in archaea, more than 90% repression was achieved; [14] in human cells, up to 90% repression was observed. [2] In bacteria, it is possible to saturate the target with a high enough level of dCas9 complex. In this case, the repression strength only depends on the probability that dCas9 is ejected upon collision with the RNA polymerase, which is determined by the guide sequence. [15] Higher temperatures are also associated with higher ejection probability, thus weaker repression. [15] In eukaryotes, CRISPRi can also repress transcription via an effector domain. Fusing a repressor domain to dCas9 allows transcription to be further repressed by inducing heterochromatinization. For example, the well-studied Krüppel associated box (KRAB) domain can be fused to dCas9 to repress transcription of the target gene up to 99% in human cells. [16]
Whereas genome-editing by the catalytically active Cas9 nuclease can be accompanied by irreversible off-target genomic alterations, CRISPRi is highly specific with minimal off-target reversible effects for two distinct sgRNA sequences. [16] Nonetheless, several methods have been developed to improve the efficiency of transcriptional modulation. Identification of the transcription start site of a target gene and considering the preferences of sgRNA improves efficiency, as does the presence of accessible chromatin at the target site. [17]
Along with other improvements mentioned, factors such as the distance from the transcription start and the local chromatin state may be critical parameters in determining activation/repression efficiency. Optimization of dCas9 and sgRNA expression, stability, nuclear localization, and interaction will likely allow for further improvement of CRISPRi efficiency in mammalian cells. [2]
A significant portion of the genome (both reporter and endogenous genes) in eukaryotes has been shown to be targetable using lentiviral constructs to express dCas9 and sgRNAs, with comparable efficiency to existing techniques such as RNAi and TALE proteins. [16] In tandem or as its own system, CRISPRi could be used to achieve the same applications as in RNAi.
For bacteria, gene knockdown by CRISPRi has been fully implemented and characterized (off-target analysis, leaky repression) for both Gram-negative E. coli [4] [6] and Gram-positive B. subtilis. [5]
Not only in bacteria but also in archaea (e.g., M. acetivorans) CRISPRi-Cas9 was successfully utilized to knockdown several genes/operons that related to nitrogen fixation. [14]
Differential gene expression can be achieved by modifying the efficiency of sgRNA base-pairing to the target loci. [11] In theory, modulating this efficiency can be used to create an allelic series for any given gene, in essence creating a collection of hypo- and hypermorphs. These powerful collections can be used to probe any genetic investigation. For hypomorphs, this allows the incremental reduction of gene function as opposed to the binary nature of gene knockouts and the unpredictability of knockdowns. For hypermorphs, this is in contrast to the conventional method of cloning the gene of interest under promoters with variable strength.
Fusing a fluorescent protein to dCas9 allows for imaging of genomic loci in living human cells. [18] Compared to fluorescence in situ hybridization (FISH), the method uniquely allows for dynamic tracking of chromosome loci. This has been used to study chromatin architecture and nuclear organization dynamics in laboratory cell lines including HeLa cells.
Activation of Yamanaka factors by CRISPRa has been used to induce pluripotency in human and mouse cells providing an alternative method to iPS technology. [19] [20] In addition, large-scale activation screens could be used to identify proteins that promote induced pluripotency or, conversely, promote differentiation to a specific cell lineage. [21]
The ability to upregulate gene expression using dCas9-SunTag with a single sgRNA also opens the door to large-scale genetic screens, such as Perturb-seq, to uncover phenotypes that result from increased or decreased gene expression, which will be especially important for understanding the effects of gene regulation in cancer. [22] Furthermore, CRISPRi systems have been shown to be transferable via horizontal gene transfer mechanisms such as bacterial conjugation and specific repression of reporter genes in recipient cells has been demonstrated. CRISPRi could serve as a tool for genetic screening and potentially bacterial population control. [23]
Gene knockouts are a widely used genetic engineering technique that involves the targeted removal or inactivation of a specific gene within an organism's genome. This can be done through a variety of methods, including homologous recombination, CRISPR-Cas9, and TALENs.
Gene knockdown is an experimental technique by which the expression of one or more of an organism's genes is reduced. The reduction can occur either through genetic modification or by treatment with a reagent such as a short DNA or RNA oligonucleotide that has a sequence complementary to either gene or an mRNA transcript.
Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "candidate-gene" approach.
CRISPR is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. These sequences are derived from DNA fragments of bacteriophages that had previously infected the prokaryote. They are used to detect and destroy DNA from similar bacteriophages during subsequent infections. Hence these sequences play a key role in the antiviral defense system of prokaryotes and provide a form of acquired immunity. CRISPR is found in approximately 50% of sequenced bacterial genomes and nearly 90% of sequenced archaea.
Guide RNA (gRNA) or single guide RNA (sgRNA) is a short sequence of RNA that functions as a guide for the Cas9-endonuclease or other Cas-proteins that cut the double-stranded DNA and thereby can be used for gene editing. In bacteria and archaea, gRNAs are a part of the CRISPR-Cas system that serves as an adaptive immune defense that protects the organism from viruses. Here the short gRNAs serve as detectors of foreign DNA and direct the Cas-enzymes that degrades the foreign nucleic acid.
In Molecular biology, an insert is a piece of DNA that is inserted into a larger DNA vector by a recombinant DNA technique, such as ligation or recombination. This allows it to be multiplied, selected, further manipulated or expressed in a host organism.
Genetic engineering techniques allow the modification of animal and plant genomes. Techniques have been devised to insert, delete, and modify DNA at multiple levels, ranging from a specific base pair in a specific gene to entire genes. There are a number of steps that are followed before a genetically modified organism (GMO) is created. Genetic engineers must first choose what gene they wish to insert, modify, or delete. The gene must then be isolated and incorporated, along with other genetic elements, into a suitable vector. This vector is then used to insert the gene into the host genome, creating a transgenic or edited organism.
Cas9 is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic engineering applications. Its main function is to cut DNA and thereby alter a cell's genome. The CRISPR-Cas9 genome editing technique was a significant contributor to the Nobel Prize in Chemistry in 2020 being awarded to Emmanuelle Charpentier and Jennifer Doudna.
Epigenome editing or epigenome engineering is a type of genetic engineering in which the epigenome is modified at specific sites using engineered molecules targeted to those sites. Whereas gene editing involves changing the actual DNA sequence itself, epigenetic editing involves modifying and presenting DNA sequences to proteins and other DNA binding factors that influence DNA function. By "editing” epigenomic features in this manner, researchers can determine the exact biological role of an epigenetic modification at the site in question.
A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but is not found in the bacterial host genome and hence is not a component of the bacterial CRISPR locus. Cas9 will not successfully bind to or cleave the target DNA sequence if it is not followed by the PAM sequence. PAM is an essential targeting component which distinguishes bacterial self from non-self DNA, thereby preventing the CRISPR locus from being targeted and destroyed by the CRISPR-associated nuclease.
Cas12a is a subtype of Cas12 proteins and an RNA-guided endonuclease that forms part of the CRISPR system in some bacteria and archaea. It originates as part of a bacterial immune mechanism, where it serves to destroy the genetic material of viruses and thus protect the cell and colony from viral infection. Cas12a and other CRISPR associated endonucleases use an RNA to target nucleic acid in a specific and programmable matter. In the organisms from which it originates, this guide RNA is a copy of a piece of foreign nucleic acid that previously infected the cell.
No-SCAR genome editing is an editing method that is able to manipulate the Escherichia coli genome. The system relies on recombineering whereby DNA sequences are combined and manipulated through homologous recombination. No-SCAR is able to manipulate the E. coli genome without the use of the chromosomal markers detailed in previous recombineering methods. Instead, the λ-Red recombination system facilitates donor DNA integration while Cas9 cleaves double-stranded DNA to counter-select against wild-type cells. Although λ-Red and Cas9 genome editing are widely used technologies, the no-SCAR method is novel in combining the two functions; this technique is able to establish point mutations, gene deletions, and short sequence insertions in several genomic loci with increased efficiency and time sensitivity.
Perturb-seq refers to a high-throughput method of performing single cell RNA sequencing (scRNA-seq) on pooled genetic perturbation screens. Perturb-seq combines multiplexed CRISPR mediated gene inactivations with single cell RNA sequencing to assess comprehensive gene expression phenotypes for each perturbation. Inferring a gene’s function by applying genetic perturbations to knock down or knock out a gene and studying the resulting phenotype is known as reverse genetics. Perturb-seq is a reverse genetics approach that allows for the investigation of phenotypes at the level of the transcriptome, to elucidate gene functions in many cells, in a massively parallel fashion.
CRISPR-Display (CRISP-Disp) is a modification of the CRISPR/Cas9 system for genome editing. The CRISPR/Cas9 system uses a short guide RNA (sgRNA) sequence to direct a Streptococcus pyogenes Cas9 nuclease, acting as a programmable DNA binding protein, to cleave DNA at a site of interest.
CRISPR activation (CRISPRa) is a type of CRISPR tool that uses modified versions of CRISPR effectors without endonuclease activity, with added transcriptional activators on dCas9 or the guide RNAs (gRNAs).
Off-target genome editing refers to nonspecific and unintended genetic modifications that can arise through the use of engineered nuclease technologies such as: clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9, transcription activator-like effector nucleases (TALEN), meganucleases, and zinc finger nucleases (ZFN). These tools use different mechanisms to bind a predetermined sequence of DNA (“target”), which they cleave, creating a double-stranded chromosomal break (DSB) that summons the cell's DNA repair mechanisms and leads to site-specific modifications. If these complexes do not bind at the target, often a result of homologous sequences and/or mismatch tolerance, they will cleave off-target DSB and cause non-specific genetic modifications. Specifically, off-target effects consist of unintended point mutations, deletions, insertions inversions, and translocations.
CRISPR gene editing is a genetic engineering technique in molecular biology by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense system. By delivering the Cas9 nuclease complexed with a synthetic guide RNA (gRNA) into a cell, the cell's genome can be cut at a desired location, allowing existing genes to be removed and/or new ones added in vivo.
Anti-CRISPR is a group of proteins found in phages, that inhibit the normal activity of CRISPR-Cas, the immune system of certain bacteria. CRISPR consists of genomic sequences that can be found in prokaryotic organisms, that come from bacteriophages that infected the bacteria beforehand, and are used to defend the cell from further viral attacks. Anti-CRISPR results from an evolutionary process occurred in phages in order to avoid having their genomes destroyed by the prokaryotic cells that they will infect.
Genome-wide CRISPR-Cas9 knockout screens aim to elucidate the relationship between genotype and phenotype by ablating gene expression on a genome-wide scale and studying the resulting phenotypic alterations. The approach utilises the CRISPR-Cas9 gene editing system, coupled with libraries of single guide RNAs (sgRNAs), which are designed to target every gene in the genome. Over recent years, the genome-wide CRISPR screen has emerged as a powerful tool for performing large-scale loss-of-function screens, with low noise, high knockout efficiency and minimal off-target effects.
Genome editing of synthetic target arrays for lineage tracing (GESTALT) is a method used to determine the developmental lineages of cells in multicellular systems. GESTALT involves introducing a small DNA barcode that contains regularly spaced CRISPR/Cas9 target sites into the genomes of progenitor cells. Alongside the barcode, Cas9 and sgRNA are introduced into the cells. Mutations in the barcode accumulate during the course of cell divisions and the unique combination of mutations in a cell's barcode can be determined by DNA or RNA sequencing to link it to a developmental lineage.