Cas12a

Last updated
CRISPR-associated protein 12a
Acidaminococcus sp. Cas12a PDB 5B43.png
Acidaminococcus sp. Cas12a PDB: 5B43
Identifiers
SymbolCas12a
InterPro IPR027620

Cas12a (CRISPR-associated protein 12a, previously known as Cpf1) is an RNA-guided endonuclease that forms an essential component of the CRISPR systems found in some bacteria and archaea. In its natural context, Cas12a targets and destroys the genetic material of viruses and other foreign mobile genetic elements, thereby protecting the host cell from infection. Like other Cas enzymes, Cas12a binds to a "guide" RNA (termed a crRNA, or CRISPR RNA) which targets it to a DNA sequence in a specific and programmable matter. In the host organism, the crRNA contains a constant region that is recognized by the Cas12a protein and a "spacer" region that is complementary to a piece of foreign nucleic acid (e.g. a portion of a phage genome) that previously infected the cell. [1]

Contents

As with Cas9 and other Cas proteins, the programmable DNA-targeting activity of Cas12a makes it a useful tool for biotechnology and biological research applications. By modifying the spacer sequence in the crRNA, researchers can target Cas12a to specific DNA sequences, allowing for highly targeted modifications of DNA. [2] Cas12a is distinguished from Cas9 by a its single RuvC endonuclease active site, its 5' protospacer adjacent motif preference, and its formation of sticky rather than blunt ends at the cut site; these and other differences may make it more suitable for certain applications[ citation needed ]. Beyond its use in basic research, CRISPR-Cas12a could have applications in the treatment of genetic illnesses and in implementing gene drives. [2]

Description

Discovery

CRISPR-Cas12a was found by searching a published database of bacterial genetic sequences for promising bits of DNA. Its identification through bioinformatics as a CRISPR system protein, its naming, and a hidden Markov model (HMM) for its detection were provided in 2012 in a release of the TIGRFAMs database of protein families. Cas12a appears in many bacterial species. The ultimate Cas12a endonuclease that was developed into a tool for genome editing was taken from one of the first 16 species known to harbor it. [3] Two candidate enzymes from Acidaminococcus and Lachnospiraceae display efficient genome-editing activity in human cells. [2]

Classification

CRISPR-Cas systems are separated into two classes: Class I, in which several Cas proteins associate with a crRNA to build a functional endonuclease, and Class II, in which a single Cas endonuclease associates with a crRNA; Class II is further divided into Type II, Type V, and Type VI systems. Cas12a is identified as a Class II, Type V CRISPR-Cas system. [4]

Naming

The acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) refers to the invariant DNA sequences found in bacteria and archaea which encode Cas proteins and their crRNAs. Cas12a was originally known as Cpf1, an abbreviation of CRISPR and two genera of bacteria where it appears, Prevotella and Francisella. It was renamed in 2015 after a broader rationalization of the names of Cas (CRISPR associated) proteins to correspond to their sequence homology. [4]

Structure

The Cas12a protein contains a mixed alpha/beta domain, a RuvC-like endonuclease domain (broken into two non-contiguous segments, RuvC-I and RuvC-II) similar to the RuvC domain of Cas9, and a zinc finger-like domain [5] . Unlike Cas9, Cas12a does not have an HNH endonuclease domain, and the N-terminal region of Cas12a does not have an alpha-helical recognition lobe as seen in Cas9. [4]

The Cas12a loci encode Cas1, Cas2 and Cas4 proteins more similar to types I and III than from type II systems. Database searches suggest the abundance of Cas12a-family proteins in many bacterial species. [4]

Also unlike Cas9, Cas12a does not require a tracrRNA (which in natural CRISPR systems must base-pair with a separate crRNA before binding to a Cas protein), instead binding a single crRNA. Both Cas12a and its guide RNA are smaller than the protein and RNA components of the Cas9 system; the crRNA of Cas12a is approximately half as long as sgRNAs used with Cas9. [6] This reduced size renders Cas12a more suitable for applications such as in vivo delivery via adeno-associated virus (AAV), which have limited DNA packaging capacity due to their small capsids.

The Cas12a-crRNA complex cleaves target DNA or RNA by identification of a protospacer adjacent motif (PAM) 5'-YTN-3' [7] (where "Y" is a pyrimidine [8] and "N" is any nucleobase), in contrast to the G-rich PAM targeted by Cas9. After identification of PAM, Cas12a introduces a sticky-end-like DNA double- stranded break of 4 or 5 nucleotides overhang. [5]

Mechanism

Cas-gRNA Mechanism.jpg

The CRISPR-Cas12a system consist of a Cas12a enzyme and a guide RNA that finds and positions the complex at the correct spot on the double helix to cleave target DNA. CRISPR-Cas12a systems activity has three stages: [3]

Cas9 vs. Cas12a

Cas12a and Cas9 nucleases and their DNA cleavage positions Cas12a vs Cas9 cleavage position.svg
Cas12a and Cas9 nucleases and their DNA cleavage positions

Cas9 requires two RNA molecules to cut DNA while Cas12a needs one. The proteins also cut DNA at different places, offering researchers more options when selecting an editing site. Cas9 cuts both strands in a DNA molecule at the same position, leaving behind blunt ends. Cas12a leaves one strand longer than the other, creating sticky ends. The sticky ends have different properties than blunt ends during non-homologous end joining or homologous repair of DNA, which confers certain advantages to Cas12a when attempting gene insertions, compared to Cas9. [3] Although the CRISPR-Cas9 system can efficiently disable genes, it is challenging to insert genes or generate a knock-in. [1] Cas12a lacks tracrRNA, utilizes a T-rich PAM and cleaves DNA via a staggered DNA DSB. [6]

In summary, important differences between Cas12a and Cas9 systems are that Cas12a: [10]

FeatureCas9Cas12a
StructureTwo RNA required (Or 1 fusion transcript (crRNA+tracrRNA=sgRNA)One crRNA required
Cutting mechanismBlunt end cutsStaggered end cuts
Cutting siteProximal to recognition siteDistal from recognition site
Target sitesG-rich PAMT-rich PAM

Origin

Cas12 endonucleases ultimately likely evolved from the TnpB endonuclease of IS200/IS605-family transposons. TnpB, not yet "domesticated" into the CRISPR immune system, are themselves able to perform RNA-guided cleavage using a OmegaRNA template system. [11]

Tools

Multiple aspects influence target efficiency and specificity when using CRISPR, including guide RNA design. Many design models and CRISPR-Cas software tools for optimal design of guide RNA have been developed. These include SgRNA designer, CRISPR MultiTargeter, SSFinder. [12] In addition, commercial antibodies are available for use to detect Cas12a protein. [13]

Intellectual property

CRISPR-Cas9 is subject to Intellectual property disputes while CRISPR-Cas12a does not have the same issues. [2]

Notes

    Related Research Articles

    Gene knockdown is an experimental technique by which the expression of one or more of an organism's genes is reduced. The reduction can occur either through genetic modification or by treatment with a reagent such as a short DNA or RNA oligonucleotide that has a sequence complementary to either gene or an mRNA transcript.

    <span class="mw-page-title-main">CRISPR</span> Family of DNA sequence found in prokaryotic organisms

    CRISPR is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. Each sequence within an individual prokaryotic cell is derived from a DNA fragment of a bacteriophage that had previously infected the prokaryote or one of its ancestors. These sequences are used to detect and destroy DNA from similar bacteriophages during subsequent infections. Hence these sequences play a key role in the antiviral defense system of prokaryotes and provide a form of heritable, acquired immunity. CRISPR is found in approximately 50% of sequenced bacterial genomes and nearly 90% of sequenced archaea.

    Guide RNA (gRNA) or single guide RNA (sgRNA) is a short sequence of RNA that functions as a guide for the Cas9-endonuclease or other Cas-proteins that cut the double-stranded DNA and thereby can be used for gene editing. In bacteria and archaea, gRNAs are a part of the CRISPR-Cas system that serves as an adaptive immune defense that protects the organism from viruses. Here the short gRNAs serve as detectors of foreign DNA and direct the Cas-enzymes that degrades the foreign nucleic acid.

    <i>Fok</i>I Restriction enzyme

    The restriction endonuclease Fok1, naturally found in Flavobacterium okeanokoites, is a bacterial type IIS restriction endonuclease consisting of an N-terminal DNA-binding domain and a non sequence-specific DNA cleavage domain at the C-terminal. Once the protein is bound to duplex DNA via its DNA-binding domain at the 5'-GGATG-3' recognition site, the DNA cleavage domain is activated and cleaves the DNA at two locations, regardless of the nucleotide sequence at the cut site. The DNA is cut 9 nucleotides downstream of the motif on the forward strand, and 13 nucleotides downstream of the motif on the reverse strand, producing two sticky ends with 4-bp overhangs.

    <span class="mw-page-title-main">Genome editing</span> Type of genetic engineering

    Genome editing, or genome engineering, or gene editing, is a type of genetic engineering in which DNA is inserted, deleted, modified or replaced in the genome of a living organism. Unlike early genetic engineering techniques that randomly inserts genetic material into a host genome, genome editing targets the insertions to site-specific locations. The basic mechanism involved in genetic manipulations through programmable nucleases is the recognition of target genomic loci and binding of effector DNA-binding domain (DBD), double-strand breaks (DSBs) in target DNA by the restriction endonucleases, and the repair of DSBs through homology-directed recombination (HDR) or non-homologous end joining (NHEJ).

    <span class="mw-page-title-main">Genetic engineering techniques</span> Methods used to change the DNA of organisms

    Genetic engineering techniques allow the modification of animal and plant genomes. Techniques have been devised to insert, delete, and modify DNA at multiple levels, ranging from a specific base pair in a specific gene to entire genes. There are a number of steps that are followed before a genetically modified organism (GMO) is created. Genetic engineers must first choose what gene they wish to insert, modify, or delete. The gene must then be isolated and incorporated, along with other genetic elements, into a suitable vector. This vector is then used to insert the gene into the host genome, creating a transgenic or edited organism.

    <span class="mw-page-title-main">Cas9</span> Microbial protein found in Streptococcus pyogenes M1 GAS

    Cas9 is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic engineering applications. Its main function is to cut DNA and thereby alter a cell's genome. The CRISPR-Cas9 genome editing technique was a significant contributor to the Nobel Prize in Chemistry in 2020 being awarded to Emmanuelle Charpentier and Jennifer Doudna.

    <span class="mw-page-title-main">CRISPR interference</span> Genetic perturbation technique

    CRISPR interference (CRISPRi) is a genetic perturbation technique that allows for sequence-specific repression of gene expression in prokaryotic and eukaryotic cells. It was first developed by Stanley Qi and colleagues in the laboratories of Wendell Lim, Adam Arkin, Jonathan Weissman, and Jennifer Doudna. Sequence-specific activation of gene expression refers to CRISPR activation (CRISPRa).

    <span class="mw-page-title-main">Epigenome editing</span>

    Epigenome editing or epigenome engineering is a type of genetic engineering in which the epigenome is modified at specific sites using engineered molecules targeted to those sites. Whereas gene editing involves changing the actual DNA sequence itself, epigenetic editing involves modifying and presenting DNA sequences to proteins and other DNA binding factors that influence DNA function. By "editing” epigenomic features in this manner, researchers can determine the exact biological role of an epigenetic modification at the site in question.

    A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but is not found in the bacterial host genome and hence is not a component of the bacterial CRISPR locus. Cas9 will not successfully bind to or cleave the target DNA sequence if it is not followed by the PAM sequence. PAM is an essential targeting component which distinguishes bacterial self from non-self DNA, thereby preventing the CRISPR locus from being targeted and destroyed by the CRISPR-associated nuclease.

    CRISPR-Cas design tools are computer software platforms and bioinformatics tools used to facilitate the design of guide RNAs (gRNAs) for use with the CRISPR/Cas gene editing system.

    No-SCAR genome editing is an editing method that is able to manipulate the Escherichia coli genome. The system relies on recombineering whereby DNA sequences are combined and manipulated through homologous recombination. No-SCAR is able to manipulate the E. coli genome without the use of the chromosomal markers detailed in previous recombineering methods. Instead, the λ-Red recombination system facilitates donor DNA integration while Cas9 cleaves double-stranded DNA to counter-select against wild-type cells. Although λ-Red and Cas9 genome editing are widely used technologies, the no-SCAR method is novel in combining the two functions; this technique is able to establish point mutations, gene deletions, and short sequence insertions in several genomic loci with increased efficiency and time sensitivity.

    CRISPR activation (CRISPRa) is a gene regulation technique that utilizes an engineered form of the CRISPR-Cas9 system to enhance the expression of specific genes without altering the underlying DNA sequence. Unlike traditional CRISPR-Cas9, which introduces double-strand breaks to edit genes, CRISPRa employs a modified, catalytically inactive Cas9 (dCas9) fused with transcriptional activators to target promoter or enhancer regions, thereby boosting gene transcription. This method allows for precise control of gene expression, making it a valuable tool for studying gene function, creating gene regulatory networks, and developing potential therapeutic interventions for a variety of diseases.

    Off-target genome editing refers to nonspecific and unintended genetic modifications that can arise through the use of engineered nuclease technologies such as: clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9, transcription activator-like effector nucleases (TALEN), meganucleases, and zinc finger nucleases (ZFN). These tools use different mechanisms to bind a predetermined sequence of DNA (“target”), which they cleave, creating a double-stranded chromosomal break (DSB) that summons the cell's DNA repair mechanisms and leads to site-specific modifications. If these complexes do not bind at the target, often a result of homologous sequences and/or mismatch tolerance, they will cleave off-target DSB and cause non-specific genetic modifications. Specifically, off-target effects consist of unintended point mutations, deletions, insertions inversions, and translocations.

    <span class="mw-page-title-main">CRISPR gene editing</span> Gene editing method

    CRISPR gene editing (CRISPR, pronounced "crisper", refers to "clustered regularly interspaced short palindromic repeats") is a genetic engineering technique in molecular biology by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense system. By delivering the Cas9 nuclease complexed with a synthetic guide RNA (gRNA) into a cell, the cell's genome can be cut at a desired location, allowing existing genes to be removed and/or new ones added in vivo.

    <span class="mw-page-title-main">Anti-CRISPR</span> Group of proteins found in phages

    Anti-CRISPR is a group of proteins found in phages, that inhibit the normal activity of CRISPR-Cas, the immune system of certain bacteria. CRISPR consists of genomic sequences that can be found in prokaryotic organisms, that come from bacteriophages that infected the bacteria beforehand, and are used to defend the cell from further viral attacks. Anti-CRISPR results from an evolutionary process occurred in phages in order to avoid having their genomes destroyed by the prokaryotic cells that they will infect.

    Prime editing is a 'search-and-replace' genome editing technology in molecular biology by which the genome of living organisms may be modified. The technology directly writes new genetic information into a targeted DNA site. It uses a fusion protein, consisting of a catalytically impaired Cas9 endonuclease fused to an engineered reverse transcriptase enzyme, and a prime editing guide RNA (pegRNA), capable of identifying the target site and providing the new genetic information to replace the target DNA nucleotides. It mediates targeted insertions, deletions, and base-to-base conversions without the need for double strand breaks (DSBs) or donor DNA templates.

    <span class="mw-page-title-main">Genome-wide CRISPR-Cas9 knockout screens</span> Research tool in genomics

    Genome-wide CRISPR-Cas9 knockout screens aim to elucidate the relationship between genotype and phenotype by ablating gene expression on a genome-wide scale and studying the resulting phenotypic alterations. The approach utilises the CRISPR-Cas9 gene editing system, coupled with libraries of single guide RNAs (sgRNAs), which are designed to target every gene in the genome. Over recent years, the genome-wide CRISPR screen has emerged as a powerful tool for performing large-scale loss-of-function screens, with low noise, high knockout efficiency and minimal off-target effects.

    <span class="mw-page-title-main">CRISPR RNA</span> RNA transcript from the CRISPR locus

    CRISPR RNA or crRNA is a RNA transcript from the CRISPR locus. CRISPR-Cas is an adaptive immune system found in bacteria and archaea to protect against mobile genetic elements, like viruses, plasmids, and transposons. The CRISPR locus contains a series of repeats interspaced with unique spacers. These unique spacers can be acquired from MGEs.

    The Fanzor (Fz) protein is an eukaryotic, RNA-guided DNA endonuclease, which means it is a type of DNA cutting enzyme that uses RNA to target genes of interest. It has been recently discovered and explored in a number of studies. In bacteria, RNA-guided DNA endonuclease systems, such as the CRISPR/Cas system, serve as an immune system to prevent infection by cutting viral genetic material. Currently, CRISPR/Cas9-mediated's DNA cleavage has extensive application in biological research, and wide-reaching medical potential in human gene editing.

    References

    1. 1 2 "CRISPR-Based Genetic Engineering Gets a Kick in the Cas". Meta Science News. 2015-09-29. Archived from the original on 2017-10-22. Retrieved 2016-05-03.
    2. 1 2 3 4 "Even CRISPR". The Economist. ISSN   0013-0613 . Retrieved 2016-05-03.
    3. 1 2 3 Ledford H (October 2015). "Bacteria yield new gene cutter". Nature. 526 (7571): 17. doi: 10.1038/nature.2015.18432 . PMID   26432219.
    4. 1 2 3 4 Makarova KS, Wolf YI, Alkhnbashi OS, Costa F, Shah SA, Saunders SJ, et al. (November 2015). "An updated evolutionary classification of CRISPR-Cas systems". Nature Reviews. Microbiology. 13 (11): 722–736. doi:10.1038/nrmicro3569. PMC   5426118 . PMID   26411297.
    5. 1 2 3 Zetsche B, Gootenberg JS, Abudayyeh OO, Slaymaker IM, Makarova KS, Essletzbichler P, et al. (October 2015). "Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system". Cell. 163 (3): 759–771. doi:10.1016/j.cell.2015.09.038. PMC   4638220 . PMID   26422227.
    6. 1 2 "Cpf1 Moves in on Cas9 for Next-Gen CRISPR Genome Editing". epigenie.com. 29 September 2015. Retrieved 2016-05-03.
    7. Fonfara I, Richter H, Bratovič M, Le Rhun A, Charpentier E (April 2016). "The CRISPR-associated DNA-cleaving enzyme Cpf1 also processes precursor CRISPR RNA". Nature. 532 (7600): 517–521. Bibcode:2016Natur.532..517F. doi:10.1038/nature17945. PMID   27096362. S2CID   2271552.
    8. "Nucleotide Codes, Amino Acid Codes, and Genetic Codes". KEGG: Kyoto Encyclopedia of Genes and Genomes. July 15, 2014. Retrieved 2016-05-25.
    9. Cofsky JC, Karandur D, Huang CJ, Witte IP, Kuriyan J, Doudna JA (June 2020). "CRISPR-Cas12a exploits R-loop asymmetry to form double-strand breaks". eLife. 9. doi: 10.7554/eLife.55143 . PMC   7286691 . PMID   32519675.
    10. Yamano T, Nishimasu H, Zetsche B, Hirano H, Slaymaker IM, Li Y, et al. (May 2016). "Crystal Structure of Cpf1 in Complex with Guide RNA and Target DNA". Cell. 165 (4): 949–962. doi:10.1016/j.cell.2016.04.003. PMC   4899970 . PMID   27114038.
    11. Altae-Tran H, Kannan S, Demircioglu FE, Oshiro R, Nety SP, McKay LJ, et al. (October 2021). "The widespread IS200/IS605 transposon family encodes diverse programmable RNA-guided endonucleases". Science. 374 (6563): 57–65. Bibcode:2021Sci...374...57A. doi:10.1126/science.abj6856. PMC   8929163 . PMID   34591643.
    12. Graham DB, Root DE (November 2015). "Resources for the design of CRISPR gene editing experiments". Genome Biology. 16: 260. doi: 10.1186/s13059-015-0823-x . PMC   4661947 . PMID   26612492.
    13. "Anti-CPF1 antibody (GTX133301) | GeneTex". www.genetex.com. Retrieved 2020-10-30.