The SpyTag/SpyCatcher system is a technology for irreversible conjugation of recombinant proteins. The peptide SpyTag (13 amino acids) spontaneously reacts with the protein SpyCatcher (12.3 kDa) to form an intermolecular isopeptide bond between the pair. [1] DNA sequence encoding either SpyTag or SpyCatcher can be recombinantly introduced into the DNA sequence encoding a protein of interest, forming a fusion protein. These fusion proteins can be covalently linked when mixed in a reaction through the SpyTag/SpyCatcher system.
Using the Tag/Catcher pair, bioconjugation can be achieved between two recombinant proteins that would otherwise be restrictive or impossible with traditional direct genetic fusion between the two proteins. For example, issues regarding protein folding, suboptimal expression host, and specialized post-translational modifications can be alleviated by separating the production of the proteins with the modularity of the Tag/Catcher system. [2]
SpyTag and SpyCatcher were formed from the splitting and engineering of the CnaB2 domain of the FbaB protein from Streptococcus pyogenes , which naturally forms an intramolecular isopeptide bond to assist colonization of the host cell. [3] [4] With the formation of the isopeptide bond, the CnaB2 domain becomes more tolerant to conformational, thermal and pH changes.
Building upon this, SpyTag was obtained from CnaB2 by extracting the C-terminal beta strand containing the reactive aspartic acid at D556 and leaving the rest of the beta strands containing the reactive lysine K470 and the catalytic glutamic acid at E516 to become SpyCatcher, after further engineering to remove some hydrophobic surface residues. The resulting SpyTag/SpyCatcher can react to form the isopeptide bond with a second-order rate constant of 1.4 ± 0.4 × 103 M−1 s−1. It is postulated [5] that the reaction mechanism proceeds by a nucleophilic attack of D556 from K470, mediated by E516. By reconstituting SpyTag:SpyCatcher, the resulting conjugated complex acquires the stability of the parent CnaB2 domain.
A second generation SpyTag/SpyCatcher called SpyTag002/SpyCatcher002 was then created through phage display that enables the peptide-protein pair to react up to 12 times faster than the original pair, at a rate constant of 2.0 ± 0.2 × 104 M−1 s−1. [6] The second generation SpyCatcher002 also has abolished self-reactivity that is present with SpyCatcher.
A third generation SpyTag/SpyCatcher called SpyTag003/SpyCatcher003 has now also been created through rational design. This reacts up to 400 fold faster than the original pair with a rate constant of 5.5 ± 0.6 × 105 M−1 s−1. [7] This version is back reactive with the two previous generations of SpyTag/SpyCatcher reagents.
SpyTag/SpyCatcher react with high specificity even when in the presence of bacterial and mammalian cell environments. [1] [6] [7]
Because of the specificity, irreversible covalent linkage and ease of use, the SpyTag/SpyCatcher conjugation system has seen many different applications. [5] [8] [9]
By fusing either SpyTag or SpyCatcher to self-assembling molecules such as virus-like particles, antigens fused to the other pair can be decorated onto the molecule via the isopeptide bond formed. [10] [11] [12] [13] This enables fast production of vaccines as the central self-assembling molecule can be stocked up beforehand, whilst the antigen can be easily produced under optimal conditions to achieve proper protein folding.
Cyclization of enzymes by fusing the N- and C-termini of the protein helps elevate the stability of the enzyme against heat. By having SpyTag and SpyCatcher together in the enzyme at the termini, the enzyme will undergo spontaneous cyclization by forming the isopeptide bond. Cyclized beta-lactamase, phytase, firefly luciferase, and xylanase (to name a few) have shown retained enzyme activity even after being subjected to heat at 100 °C. [14] [15]
Hydrogels have had a wide-range of applications in biomedical sciences. One commonly used type of hydrogel starting material are the elastin-like polypeptides. SpyTag/SpyCatcher chemistry has been used to produce tailored molecular networks (“networks of spies”) within these hydrogels that enable the encapsulation of living mammalian cells such as fibroblasts. [16] Subsequent modifications have enabled photo-responsive hydrogel formation, [17] user-defined control over cell-material interactions, [18] combined hyaluronan-elastin-like polypeptides, [19] as well creating protein scaffolds for enzyme flow biocatalysis. [20]
Before the development of SpyTag/SpyCatcher, the pair Isopeptag/Pilin-C was created from protein Spy0128 of Streptococcus pyogenes. [21] Following SpyTag/SpyCatcher, the fully orthogonal pair SnoopTag/SnoopCatcher was developed from the RrgA protein of Streptococcus pneumoniae that has no cross-reactivity with SpyTag/SpyCatcher. [22] Note that SnoopTag/SnoopCatcher forms an isopeptide bond between a Lys-Asn instead of Lys-Asp found in SpyTag/SpyCatcher. The same domain from RrgA has now been split in a different way to that used to create SnoopTag/SnoopCatcher, with the new pair called DogTag/DogCatcher. Unlike SpyTag and SnoopTag which have extended structures, the region of RrgA used to create DogTag forms a β-hairpin and so predisposed for successful insertion into protein loops. This ability has been successfully exploited to fluorescently label an internal loop of the mammalian TRPC5 membrane channel protein which cannot be modified at the protein termini, without impacting on the channel properties of TRPC5. [23] DogTag has been successful coupled to DogCatcher when inserted into soluble proteins (superfolder GFP, HaloTag, and Gre2p). [23]
The pair SdyTag/SdyCatcher was also developed in the same year from Streptococcus dysgalactiae fibronectin-binding protein CnaB domain, but since the protein has sequence similarity to the parent protein where SpyTag/SpyCatcher is derived from, this Tag/Catcher pair has cross-reactivity with the latter pair. [24]
Rather than finding homologous proteins from different species, a new Tag/Catcher pair was developed from SpyTag/SpyCatcher with minimal mutations. SpyTag I3W (AW) reacts with SpyCatcher F77V, F94A (BVA) but minimally with SpyCatcher, whereas SpyCatcher F77V, F94A can react with both SpyTag I3W and SpyTag. [25] However, the cross-reactivity of SpyCatcher F77V, F94A with both SpyTag versions may limit its utility as a new Tag/Catcher pair.
A different chemistry can be exploited for protein ligation: the discovery of an intramolecular ester bond formation in Clostridium perfringens cell-surface adhesin protein Cpe0147 led to the development of another Tag/Catcher pair with Cpe0147565–587 as the Tag and Cpe0147439–563 as the Catcher. [26] The ester bond formed between Thr-Gln is irreversible, however by mutating the Thr to Ser, the Ser-Gln ester bond is reversible with a change of pH.
Mutation of the catalytic glutamic acid residue (E77) in SpyCatcher to alanine stops isopeptide bond formation but does not prevent the initial non-covalent SpyTag/SpyCatcher association. This non-covalent SpyTag/SpyCatcher interaction has been utilized in the affinity purification of SpyTag-fused recombinant proteins. [27] In this purification strategy, termed Spy&Go, resin-immobilized SpyCatcher is used to harvest SpyTag-fused proteins from cell culture supernatants or cell lysates. Non-specifically bound proteins are removed by washing the resin with a neutral buffer and the target protein eluted at neutral pH using high imidazole concentration.
The Spy&Go affinity resin is based on SpyCatcher2.1 E77A S49C variant termed SpyDock. [27] SpyDock can be expressed in E. coli as soluble protein, purified using Ni-NTA and anion-exchange resins and immobilized to iodoacetyl-activated agarose through the unpaired cysteine introduced by the S49C substitution. In neutral buffers with physiological salt concentration SpyDock binds to SpyTag- and SpyTag002-fused proteins with affinity in the high nanomolar range (Kd = 750 ± 50 nM for SpyTag, Kd = 73 ± 13 nM for SpyTag002). [27] Affinity to SpyTag003 has not been reported, but requires harsher conditions to ensure full dissociation suggesting it binds tighter. [7] SpyDock-bound proteins are eluted by incubating the resin with 2.5 M imidazole in neutral buffer. [27] The SpyDock resin can be regenerated several times using consecutive washes with 4 M imidazole, 6 M guanidinium hydrochloride and 0.1 M NaOH. [27]
Spy&Go purification of proteins with either N-terminal, internal or C-terminal SpyTag have all been reported. [7] [27] SpyDock resin is compatible with all SpyTag generations (SpyTag, [1] SpyTag002, [6] SpyTag003 [7] ) and it does not interfere with the later covalent conjugation of the purified proteins with SpyCatcher. [27]
In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 of one alpha-amino acid and N2 of another, along a peptide or protein chain.
In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.
An epitope, also known as antigenic determinant, is the part of an antigen that is recognized by the immune system, specifically by antibodies, B cells, or T cells. The part of an antibody that binds to the epitope is called a paratope. Although epitopes are usually non-self proteins, sequences derived from the host that can be recognized are also epitopes.
A ubiquitin ligase is a protein that recruits an E2 ubiquitin-conjugating enzyme that has been loaded with ubiquitin, recognizes a protein substrate, and assists or directly catalyzes the transfer of ubiquitin from the E2 to the protein substrate. In simple and more general terms, the ligase enables movement of ubiquitin from a ubiquitin carrier to another thing by some mechanism. The ubiquitin, once it reaches its destination, ends up being attached by an isopeptide bond to a lysine residue, which is part of the target protein. E3 ligases interact with both the target protein and the E2 enzyme, and so impart substrate specificity to the E2. Commonly, E3s polyubiquitinate their substrate with Lys48-linked chains of ubiquitin, targeting the substrate for destruction by the proteasome. However, many other types of linkages are possible and alter a protein's activity, interactions, or localization. Ubiquitination by E3 ligases regulates diverse areas such as cell trafficking, DNA repair, and signaling and is of profound importance in cell biology. E3 ligases are also key players in cell cycle control, mediating the degradation of cyclins, as well as cyclin dependent kinase inhibitor proteins. The human genome encodes over 600 putative E3 ligases, allowing for tremendous diversity in substrates.
In biochemistry, biotinylation is the process of covalently attaching biotin to a protein, nucleic acid or other molecule. Biotinylation is rapid, specific and is unlikely to disturb the natural function of the molecule due to the small size of biotin. Biotin binds to streptavidin and avidin with an extremely high affinity, fast on-rate, and high specificity, and these interactions are exploited in many areas of biotechnology to isolate biotinylated molecules of interest. Biotin-binding to streptavidin and avidin is resistant to extremes of heat, pH and proteolysis, making capture of biotinylated molecules possible in a wide variety of environments. Also, multiple biotin molecules can be conjugated to a protein of interest, which allows binding of multiple streptavidin, avidin or neutravidin protein molecules and increases the sensitivity of detection of the protein of interest. There is a large number of biotinylation reagents available that exploit the wide range of possible labelling methods. Due to the strong affinity between biotin and streptavidin, the purification of biotinylated proteins has been a widely used approach to identify protein-protein interactions and post-translational events such as ubiquitylation in molecular biology.
In organic chemistry, peptide synthesis is the production of peptides, compounds where multiple amino acids are linked via amide bonds, also known as peptide bonds. Peptides are chemically synthesized by the condensation reaction of the carboxyl group of one amino acid to the amino group of another. Protecting group strategies are usually necessary to prevent undesirable side reactions with the various amino acid side chains. Chemical peptide synthesis most commonly starts at the carboxyl end of the peptide (C-terminus), and proceeds toward the amino-terminus (N-terminus). Protein biosynthesis in living organisms occurs in the opposite direction.
Affinity chromatography is a method of separating a biomolecule from a mixture, based on a highly specific macromolecular binding interaction between the biomolecule and another substance. The specific type of binding interaction depends on the biomolecule of interest; antigen and antibody, enzyme and substrate, receptor and ligand, or protein and nucleic acid binding interactions are frequently exploited for isolation of various biomolecules. Affinity chromatography is useful for its high selectivity and resolution of separation, compared to other chromatographic methods.
Streptavidin is a 52 kDa protein (tetramer) purified from the bacterium Streptomyces avidinii. Streptavidin homo-tetramers have an extraordinarily high affinity for biotin. With a dissociation constant (Kd) on the order of ≈10−14 mol/L, the binding of biotin to streptavidin is one of the strongest non-covalent interactions known in nature. Streptavidin is used extensively in molecular biology and bionanotechnology due to the streptavidin-biotin complex's resistance to organic solvents, denaturants, detergents, proteolytic enzymes, and extremes of temperature and pH.
A tetrameric protein is a protein with a quaternary structure of four subunits (tetrameric). Homotetramers have four identical subunits, and heterotetramers are complexes of different subunits. A tetramer can be assembled as dimer of dimers with two homodimer subunits, or two heterodimer subunits.
Protein tags are peptide sequences genetically grafted onto a recombinant protein. Tags are attached to proteins for various purposes. They can be added to either end of the target protein, so they are either C-terminus or N-terminus specific or are both C-terminus and N-terminus specific. Some tags are also inserted at sites within the protein of interest; they are known as internal tags.
A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes. An acid-base-nucleophile triad is a common motif for generating a nucleophilic residue for covalent catalysis. The residues form a charge-relay network to polarise and activate the nucleophile, which attacks the substrate, forming a covalent intermediate which is then hydrolysed to release the product and regenerate free enzyme. The nucleophile is most commonly a serine or cysteine amino acid, but occasionally threonine or even selenocysteine. The 3D structure of the enzyme brings together the triad residues in a precise orientation, even though they may be far apart in the sequence.
Pilin refers to a class of fibrous proteins that are found in pilus structures in bacteria. These structures can be used for the exchange of genetic material, or as a cell adhesion mechanism. Although not all bacteria have pili or fimbriae, bacterial pathogens often use their fimbriae to attach to host cells. In Gram-negative bacteria, where pili are more common, individual pilin molecules are linked by noncovalent protein-protein interactions, while Gram-positive bacteria often have polymerized LPXTG pilin.
An isopeptide bond is a type of amide bond formed between a carboxyl group of one amino acid and an amino group of another. An isopeptide bond is the linkage between the side chain amino or carboxyl group of one amino acid to the α-carboxyl, α-amino group, or the side chain of another amino acid. In a typical peptide bond, also known as eupeptide bond, the amide bond always forms between the α-carboxyl group of one amino acid and the α-amino group of the second amino acid. Isopeptide bonds are rarer than regular peptide bonds. Isopeptide bonds lead to branching in the primary sequence of a protein. Proteins formed from normal peptide bonds typically have a linear primary sequence.
TEV protease is a highly sequence-specific cysteine protease from Tobacco Etch Virus (TEV). It is a member of the PA clan of chymotrypsin-like proteases. Due to its high sequence specificity, TEV protease is frequently used for the controlled cleavage of fusion proteins in vitro and in vivo. The consensus sequence recognized by TEV protease is Glu-Asn-Leu-Tyr-Phe-Gln-|-Ser, where "|" denotes cleaved peptide bond.
Bioconjugation is a chemical strategy to form a stable covalent link between two molecules, at least one of which is a biomolecule.
Bissulfosuccinimidyl suberate (BS3) is a crosslinker used in biological research. It is a water-soluble version of disuccinimidyl suberate.
Sortase refers to a group of prokaryotic enzymes that modify surface proteins by recognizing and cleaving a carboxyl-terminal sorting signal. For most substrates of sortase enzymes, the recognition signal consists of the motif LPXTG (Leu-Pro-any-Thr-Gly), then a highly hydrophobic transmembrane sequence, followed by a cluster of basic residues such as arginine. Cleavage occurs between the Thr and Gly, with transient attachment through the Thr residue to the active site Cys residue, followed by transpeptidation that attaches the protein covalently to cell wall components. Sortases occur in almost all Gram-positive bacteria and the occasional Gram-negative bacterium or Archaea, where cell wall LPXTG-mediated decoration has not been reported. Although sortase A, the "housekeeping" sortase, typically acts on many protein targets, other forms of sortase recognize variant forms of the cleavage motif, or catalyze the assembly of pilins into pili.
Isopeptag is a 16-amino acid peptide tag (TDKDMTITFTNKKDAE) that can be genetically linked to proteins without interfering with protein folding. What makes the isopeptag different from other peptide tags is that it can bind its binding protein through a permanent and irreversible covalent bond. Other peptide tags generally bind their targets through weak non-covalent interactions, thus limiting their use in applications where molecules experience extreme forces. The isopeptag's covalent binding to its target overcomes these barriers and allows target proteins to be studied in harsher molecular environments.
Targeted covalent inhibitors (TCIs) or Targeted covalent drugs are rationally designed inhibitors that bind and then bond to their target proteins. These inhibitors possess a bond-forming functional group of low chemical reactivity that, following binding to the target protein, is positioned to react rapidly with a proximate nucleophilic residue at the target site to form a bond.
HaloTag is a self-labeling protein tag. It is a 297 residue protein derived from a bacterial enzyme, designed to covalently bind to a synthetic ligand. The bacterial enzyme can be fused to various proteins of interest. The synthetic ligand is chosen from a number of available ligands in accordance with the type of experiments to be performed. This bacterial enzyme is a haloalkane dehalogenase, which acts as a hydrolase and is designed to facilitate visualization of the subcellular localization of a protein of interest, immobilization of a protein of interest, or capture of the binding partners of a protein of interest within its biochemical environment. The HaloTag is composed of two covalently bound segments including a haloalkane dehalogenase and a synthetic ligand of choice. These synthetic ligands consist of a reactive chloroalkane linker bound to a functional group. Functional groups can either be biotin or can be chosen from five available fluorescent dyes including Coumarin, Oregon Green, Alexa Fluor 488, diAcFAM, and TMR. These fluorescent dyes can be used in the visualization of either living or chemically fixed cells.