CRISPR-Cas design tools are computer software platforms and bioinformatics tools used to facilitate the design of guide RNAs (gRNAs) for use with the CRISPR/Cas gene editing system.
The CRISPR/Cas (clustered regularly interspaced short palindromic repeats/CRISPR associated nucleases) system was originally discovered to be an acquired immune response mechanism used by archaea and bacteria. It has since been adopted for use as a tool in the genetic engineering of higher organisms.
Designing an appropriate gRNA is an important element of genome editing with the CRISPR/Cas system. A gRNA can and at times does have unintended interactions ("off-targets") with other locations of the genome of interest. For a given candidate gRNA, these tools report its list of potential off-targets in the genome thereby allowing the designer to evaluate its suitability prior to embarking on any experiments.
Scientists have also begun exploring the mechanics of the CRISPR/Cas system and what governs how good, or active, a gRNA is at directing the Cas nuclease to a specific location of the genome of interest. [1] [2] As a result of this work, new methods of assessing a gRNA for its 'activity' have been published, [1] [2] and it is now best practice to consider both the unintended interactions of a gRNA as well as the predicted activity of a gRNA at the design stage.
The below table lists available tools and their attributes.
Tool Name | Provider | Searches whole genome for targets | Returns all targets of genome | Seed span and location can be defined | Maximum number of mismatches supported | Predicts gRNA activity | Available Protospacer adjacent motif (PAM) sequences | Annotation is reported | gRNA suggestion or scoring | References |
---|---|---|---|---|---|---|---|---|---|---|
CRISPRon, CRISPRoff | Center for non-coding RNA in Technology and Health, University of Copenhagen | Yes | Yes | Yes | All | Yes | NGG, NGA, NAG | Yes | Yes | , [3] [4] |
Invitrogen TrueDesign Genome Editor | Thermo Fisher Scientific | Yes | Yes | No | 3 | No | NGG | Yes | Yes | [5] |
Breaking-Cas | Spanish National Center for Biotechnology | Yes (over 1000 genomes) | Yes | Yes (by weights) | 4 | No | User customizable | Yes | Yes | [6] |
Cas-OFFinder | Seoul National University | Yes | Yes | No | 0-10 | No | NGG, NRG, NNAGAAW, NNNNGMTT | No | Yes | [7] |
CASTING | Caagle | Yes | Yes | No | 3 | No | NGG and NAG | No | Yes | [8] |
CRISPy | Technical University of Denmark | Yes | Yes | No | All | No | NGG | Yes | Yes | [9] |
CCTop | University of Heidelberg | Yes | Yes | Partial | 5 (0-5) | Yes | NGG, NRG, NNGRRT, NNNNGATT, NNAGAAW, NAAAAC | Yes | Yes | [10] |
CHOPCHOP | Harvard University | Yes | Yes | Partial | 0, 2 | No | NGG, NNAGAA, NNNNGANN | No | Yes | [11] |
CHOPCHOP v2 | University of Bergen | Yes | Yes | Yes | 3 (0-3) | Yes | User customizable | Yes | Yes | [12] |
CRISPOR | University of California, Santa Cruz TEFOR | Yes (over 200 genomes) | Yes | No | 4 | Yes | NGG, NGA, NGCG, NNAGAA, NGGNG, NNGRRT, NNNRRT, NNNNGMTT, NNNNACA, TTTN | Yes | Yes | [13] |
CRISPR Design | Zhang Lab, MIT | Yes | No | No | 4 | No | NGG and NAG | mRNA exons | Yes | [14] |
CRISPRdirect | Database Center for Life Science (DBCLS) | Yes (over 200 species) | Yes | No | Any number | No | NNN | Yes | Yes | [15] |
CRISPRscan | Giraldez Lab, Yale | Yes | Yes | No | 4 | Yes | NGG, TTTV, TTTN | Yes | Yes | [16] |
CRISPRseek | Bioconductor | Yes | Yes | No | Any number | No | User customizable | mRNA exons | Yes | [17] |
DESKGEN | Desktop Genetics | Yes | Yes | Yes | Any number | Yes | Fully user customizable | Yes | Yes | [18] |
GuideScan | GuideScan | Yes | Yes | Yes | 3 on website and customizable with command line | Yes | NGG/NAG on website and customizable with command line | Yes | Yes | [19] |
GT-Scan | CSIRO & EMBL-ABR | Yes | Yes | Yes | 3 (0-3) | No | User customizable | Links to Ensembl genome browser | Yes | [20] |
Off-Spotter | Thomas Jefferson University | Yes | Yes | Yes | 0-5 | NGG, NAG, NNNNACA, NNGRRT (R is A or G) | mRNA exons, unspliced mRNA, mRNA, 5'UTR, CDS, 3'UTR, unspliced lincRNA, lincRNA | User customizable | [21] | |
sgRNA Designer | Broad Institute | No | No | No | 0 | Yes | NGG | CDS (if searching by transcript ID) | Yes | [1] |
Synthego Design Tool | Synthego | Yes (over 120,000 genomes) | No (Optimized for Knockout) | Yes | 3 | Yes | NGG | Yes (RefSeq, Ensembl, Gencode) | Yes | [22] |
TUSCAN | CSIRO | No | No | No | 0 | Yes | NGG | No | Yes | [23] |
VARSCOT | CSIRO | Yes | Yes | No | 0-8 | Yes | User customizable | No | Yes | [24] |
CRISPR Targeted Gene Designer | Horizon Discovery [ permanent dead link ] | Yes, Multiple | yes | yes | 4 | Yes | NGG, NNGRRT, YTTV, other | Yes | Yes | (21) |
GuideMaker | United States Department of Agriculture, Agricultural Research Service | Yes, any user supplied genome | Yes | Yes | 0-5 | Yes | Any PAM site and PAM orientation | Yes | Yes | [25] |
WGE | Wellcome Sanger Institute | Yes, human and mouse reference genomes | Yes | Yes | 4 (0-4) | No | NGG | Yes | Yes | [26] |
Gene knockdown is an experimental technique by which the expression of one or more of an organism's genes is reduced. The reduction can occur either through genetic modification or by treatment with a reagent such as a short DNA or RNA oligonucleotide that has a sequence complementary to either gene or an mRNA transcript.
CRISPR is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. These sequences are derived from DNA fragments of bacteriophages that had previously infected the prokaryote. They are used to detect and destroy DNA from similar bacteriophages during subsequent infections. Hence these sequences play a key role in the antiviral defense system of prokaryotes and provide a form of acquired immunity. CRISPR is found in approximately 50% of sequenced bacterial genomes and nearly 90% of sequenced archaea.
Guide RNA (gRNA) or single guide RNA (sgRNA) is a short sequence of RNA that functions as a guide for the Cas9-endonuclease or other Cas-proteins that cut the double-stranded DNA and thereby can be used for gene editing. In bacteria and archaea, gRNAs are a part of the CRISPR-Cas system that serves as an adaptive immune defense that protects the organism from viruses. Here the short gRNAs serve as detectors of foreign DNA and direct the Cas-enzymes that degrades the foreign nucleic acid.
The restriction endonuclease Fok1, naturally found in Flavobacterium okeanokoites, is a bacterial type IIS restriction endonuclease consisting of an N-terminal DNA-binding domain and a non sequence-specific DNA cleavage domain at the C-terminal. Once the protein is bound to duplex DNA via its DNA-binding domain at the 5'-GGATG-3' recognition site, the DNA cleavage domain is activated and cleaves the DNA at two locations, regardless of the nucleotide sequence at the cut site. The DNA is cut 9 nucleotides downstream of the motif on the forward strand, and 13 nucleotides downstream of the motif on the reverse strand, producing two sticky ends with 4-bp overhangs.
In Molecular biology, an insert is a piece of DNA that is inserted into a larger DNA vector by a recombinant DNA technique, such as ligation or recombination. This allows it to be multiplied, selected, further manipulated or expressed in a host organism.
Transcription activator-like effector nucleases (TALEN) are restriction enzymes that can be engineered to cut specific sequences of DNA. They are made by fusing a TAL effector DNA-binding domain to a DNA cleavage domain. Transcription activator-like effectors (TALEs) can be engineered to bind to practically any desired DNA sequence, so when combined with a nuclease, DNA can be cut at specific locations. The restriction enzymes can be introduced into cells, for use in gene editing or for genome editing in situ, a technique known as genome editing with engineered nucleases. Alongside zinc finger nucleases and CRISPR/Cas9, TALEN is a prominent tool in the field of genome editing.
Genome editing, or genome engineering, or gene editing, is a type of genetic engineering in which DNA is inserted, deleted, modified or replaced in the genome of a living organism. Unlike early genetic engineering techniques that randomly inserts genetic material into a host genome, genome editing targets the insertions to site-specific locations. The basic mechanism involved in genetic manipulations through programmable nucleases is the recognition of target genomic loci and binding of effector DNA-binding domain (DBD), double-strand breaks (DSBs) in target DNA by the restriction endonucleases, and the repair of DSBs through homology-directed recombination (HDR) or non-homologous end joining (NHEJ).
Genetic engineering techniques allow the modification of animal and plant genomes. Techniques have been devised to insert, delete, and modify DNA at multiple levels, ranging from a specific base pair in a specific gene to entire genes. There are a number of steps that are followed before a genetically modified organism (GMO) is created. Genetic engineers must first choose what gene they wish to insert, modify, or delete. The gene must then be isolated and incorporated, along with other genetic elements, into a suitable vector. This vector is then used to insert the gene into the host genome, creating a transgenic or edited organism.
Cas9 is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic engineering applications. Its main function is to cut DNA and thereby alter a cell's genome. The CRISPR-Cas9 genome editing technique was a significant contributor to the Nobel Prize in Chemistry in 2020 being awarded to Emmanuelle Charpentier and Jennifer Doudna.
Feng Zhang is a Chinese–American biochemist. Zhang currently holds the James and Patricia Poitras Professorship in Neuroscience at the McGovern Institute for Brain Research and in the departments of Brain and Cognitive Sciences and Biological Engineering at the Massachusetts Institute of Technology. He also has appointments with the Broad Institute of MIT and Harvard. He is most well known for his central role in the development of optogenetics and CRISPR technologies.
CRISPR interference (CRISPRi) is a genetic perturbation technique that allows for sequence-specific repression of gene expression in prokaryotic and eukaryotic cells. It was first developed by Stanley Qi and colleagues in the laboratories of Wendell Lim, Adam Arkin, Jonathan Weissman, and Jennifer Doudna. Sequence-specific activation of gene expression refers to CRISPR activation (CRISPRa).
Epigenome editing or epigenome engineering is a type of genetic engineering in which the epigenome is modified at specific sites using engineered molecules targeted to those sites. Whereas gene editing involves changing the actual DNA sequence itself, epigenetic editing involves modifying and presenting DNA sequences to proteins and other DNA binding factors that influence DNA function. By "editing” epigenomic features in this manner, researchers can determine the exact biological role of an epigenetic modification at the site in question.
A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but is not found in the bacterial host genome and hence is not a component of the bacterial CRISPR locus. Cas9 will not successfully bind to or cleave the target DNA sequence if it is not followed by the PAM sequence. PAM is an essential targeting component which distinguishes bacterial self from non-self DNA, thereby preventing the CRISPR locus from being targeted and destroyed by the CRISPR-associated nuclease.
Cas12a is a subtype of Cas12 proteins and an RNA-guided endonuclease that forms part of the CRISPR system in some bacteria and archaea. In CRISPR systems, Cas12a serves to destroy the genetic material of viruses and other foreign DNA, thereby protecting the cell from infection. Like other Cas enzymes, Cas12a binds to an RNA to target nucleic acid in a specific and programmable matter. In the host organism, the crRNA contains a constant region that is recognized by the Cas12a protein and a spacer region that is complementary to a piece of foreign nucleic acid that previously infected the cell.
J. Keith Joung is an American pathologist and molecular biologist who holds the Robert B. Colvin Endowed Chair in Pathology at Massachusetts General Hospital and is Professor of Pathology at Harvard Medical School. He is a leading figure in the field of genome editing and has pioneered the development of designer nucleases and sensitive off-target detection methods.
Off-target genome editing refers to nonspecific and unintended genetic modifications that can arise through the use of engineered nuclease technologies such as: clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9, transcription activator-like effector nucleases (TALEN), meganucleases, and zinc finger nucleases (ZFN). These tools use different mechanisms to bind a predetermined sequence of DNA (“target”), which they cleave, creating a double-stranded chromosomal break (DSB) that summons the cell's DNA repair mechanisms and leads to site-specific modifications. If these complexes do not bind at the target, often a result of homologous sequences and/or mismatch tolerance, they will cleave off-target DSB and cause non-specific genetic modifications. Specifically, off-target effects consist of unintended point mutations, deletions, insertions inversions, and translocations.
CRISPR gene editing is a genetic engineering technique in molecular biology by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense system. By delivering the Cas9 nuclease complexed with a synthetic guide RNA (gRNA) into a cell, the cell's genome can be cut at a desired location, allowing existing genes to be removed and/or new ones added in vivo.
GUIDE-Seq is a molecular biology technique that allows for the unbiased in vitro detection of off-target genome editing events in DNA caused by CRISPR/Cas9 as well as other RNA-guided nucleases in living cells. Similar to LAM-PCR, it employs multiple PCRs to amplify regions of interest that contain a specific insert that preferentially integrates into double-stranded breaks. As gene therapy is an emerging field, GUIDE-Seq has gained traction as a cheap method to detect the off-target effects of potential therapeutics without needing whole genome sequencing.
Genome-wide CRISPR-Cas9 knockout screens aim to elucidate the relationship between genotype and phenotype by ablating gene expression on a genome-wide scale and studying the resulting phenotypic alterations. The approach utilises the CRISPR-Cas9 gene editing system, coupled with libraries of single guide RNAs (sgRNAs), which are designed to target every gene in the genome. Over recent years, the genome-wide CRISPR screen has emerged as a powerful tool for performing large-scale loss-of-function screens, with low noise, high knockout efficiency and minimal off-target effects.
CRISPR RNA or crRNA is a RNA transcript from the CRISPR locus. CRISPR-Cas is an adaptive immune system found in bacteria and archaea to protect against mobile genetic elements, like viruses, plasmids, and transposons. The CRISPR locus contains a series of repeats interspaced with unique spacers. These unique spacers can be acquired from MGEs.