Cyclotide family | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | Cyclotide | ||||||||
Pfam | PF03784 | ||||||||
InterPro | IPR005535 | ||||||||
PROSITE | PDOC51052 | ||||||||
SCOP2 | 1kal / SCOPe / SUPFAM | ||||||||
OPM superfamily | 112 | ||||||||
OPM protein | 1nb1 | ||||||||
|
In biochemistry, cyclotides are small, disulfide-rich peptides isolated from plants. [1] Typically containing 28-37 amino acids, they are characterized by their head-to-tail cyclised peptide backbone and the interlocking arrangement of their three disulfide bonds. These combined features have been termed the cyclic cystine knot (CCK) motif. To date, over 100 cyclotides have been isolated and characterized from species of the families Rubiaceae, Violaceae, and Cucurbitaceae. Cyclotides have also been identified in agriculturally important families such as the Fabaceae and Poaceae. [2] [3] [4]
Cyclotides have a well-defined three-dimensional structure due to their interlocking disulfide bonds and cyclic peptide backbone. Backbone loops and selected residues are labeled on the structure to help orientation. The amino acid sequence (single-letter amino acid representation) for this peptide is indicated on the sequence diagram to the right. One of the interesting features of cyclic peptides is that knowledge of the peptide sequence does not reveal the ancestral head and tail; knowledge of the gene sequence is required for this. [5] In the case of kalata B1 the indicated glycine (G) and asparagine (N) amino acids are the terminal residues that are linked in a peptide bond to cyclize the peptide.
Cyclotides have been reported to have a wide range of biological activities, including anti-HIV, insecticidal, anti-tumour, antifouling, anti-microbial, hemolytic, neurotensin antagonism, trypsin inhibition, and uterotonic activities. [7] [8] [9] An ability to induce uterine contractions was what prompted the initial discovery of kalata B1. [10]
The potent insecticidal activity of cyclotides kalata B1 and kalata B2 has prompted the belief that cyclotides act as plant host-defence agents. The observations that dozens or more cyclotides may be present in a single plant and the cyclotide architecture comprises a conserved core onto which a series of hypervariable loops is displayed suggest that cyclotides may be able to target many pests/pathogens simultaneously. [11]
Analysis of the suite of known cyclotides reveals many sequence similarities that are important for understanding their unique physico-chemical properties, bioactivities and homology.
The cyclotides fall into two main structural subfamilies. Moebius cyclotides, the less common of the two, contain a cis-proline in loop 5 that induces a local 180° backbone twist (hence likening it to a Möbius strip), whereas bracelet cyclotides do not. There is smaller variation in sequences within these subfamilies than between them. A third subfamily of cyclotides are trypsin inhibitors and are more homologous to a family of non-cyclic trypsin inhibitors from squash plants known as knottins or inhibitor cystine knots [12] than they are to the other cyclotides.
It is convenient to discuss sequences in terms of the backbone segments, or loops, between successive cysteine residues. The six cysteine residues are absolutely conserved throughout the cyclotide suite and presumably contribute to preserving the CCK motif. Although the cysteines appear essential to maintaining the overall fold, several other residues highly conserved in cyclotides are thought to provide additional stability. [13]
Throughout the known cyclotides loop 1 is the most conserved. Apart from the six cysteine residues, the glutamic acid and serine/threonine residues of loop 1 are the only residues to have 100% identity across the bracelet and Möbius subfamilies. Furthermore, the remaining residue of this loop exhibits only a conservative change i.e. glycine/alanine. This loop is believed to play an important role in stabilizing the cyclotide structure through hydrogen bonding with residues from loops 3 and 5.
Loops 2-6 also have highly conserved features, including the ubiquitous presence of just a single amino acid in loop 4 that is likely involved in sidechain-sidechain hydrogen bonding. Other conserved residues include a hydroxyl-containing residue in loop 3, a glycine residue in the final position of loop 3, a basic and a proline residue in the penultimate position in loop 5 of bracelet and Möbius cyclotides respectively, and an asparagine (or occasionally aspartic acid) residue at the putative cyclisation [5] [6] [14] point in loop 6. It is of interest to note that not only are certain residues highly conserved, but the backbone and side chain angles are as well.
With recent screening programs suggesting that the number of cyclotide sequences may soon reach the thousands, [15] a database, CyBase, has been developed that offers the opportunity for comparisons of sequences and activity data for cyclotides. Several other families of circular proteins are known in bacteria, plants and animals and are also included in CyBase. [16]
Plants are a rich source of ribosomally-synthesised and post-translationally modified cyclic peptides. Among these, the cyclotides are gene-coded products generated via processing of a larger precursor protein. [5] The gene for the first such precursor is Oak1 (Oldenlandia affinis kalata clone number 1), which was shown to be responsible for the synthesis of kalata B1. [6] The generic configuration of the precursor protein consists of an endoplasmic reticulum signal sequence, a non-conserved pro-region, a highly conserved region known as the N-terminal repeat (NTR), the mature cyclotide domain and finally a short hydrophobic C-terminal tail. The cyclotide domain may contain either one cyclotide sequence, as in the case of Oak1, or multiple copies separated by additional NTR sequences as seen for Oak2 and Oak4. In precursor proteins containing multiple cyclotide domains these can either be all identical sequences, as is the case for Oak4, or they can be different cyclotides as in Oak2 which contains sequences corresponding to kalata B3 and B6. [17]
Recently, the enzyme responsible for the backbone cyclization of cyclotides has been isolated from the medicinal plant Clitoria ternatea . This enzyme was named butelase 1 in accordance to the local name of the plant (Bunga Telang Ligase). Butelase 1 has been shown to cyclize the linear precursor of kalata B1 with >95% yield at a remarkable rate of 5.42×105 M−1 s−1. The ligase also cyclizes various bioactive peptides of animal origin, such as human antimicrobial peptide histatin, conotoxin from cone snail and insect antimicrobial peptide thanatin. [18]
The remarkable stability of cyclotides means that they have an exciting range of potential applications centred on either their intrinsic biological activities or the possibility of using the CCK motif as a scaffold for stabilizing biologically active epitopes. [19] Interest in these has recently intensified with the publications of a chemical methodology capable of synthetically producing cyclotides with high yields, [20] [21] and the amenability of the CCK framework to amino-acid substitutions. [22] But for molecules to be useful in a therapeutic setting they require useful biopharmaceutical characteristics such as resistance to proteolysis and membrane permeability. The membrane interactive surface area and moment of the cyclotides are determinants in the prediction of their biological activities. [23] A recent study on related cystine knot proteins as drug candidates showed that cystine knots do permeate well through rat small intestinal mucosa relative to non-cystine knot peptide drugs such as insulin and bacitracin. [24] Furthermore, enzymatic digestion of cystine knot peptide drugs was associated with only a few proteases and it was suggested that this limitation may be overcome by mutating out particular cleavage sites. Thus, certain cystine knot proteins satisfy the basic criteria for drug delivery and represent exciting novel candidates as scaffolds for peptide drug delivery. [24] The diverse range of intrinsic activities of cyclotides also continues to hold promise for a wide range of applications in the agricultural fields against insects and nematodes, especially those from Clitoria ternatea. [25] [26]
During a Red Cross relief mission in the Democratic Republic of Congo during the 1960s, a Norwegian doctor, Lorents Gran, noted that during labor African women used a medicinal tea made from the leaves of the plant Oldenlandia affinis to induce labor and facilitate childbirth. [28] The active ingredient was later determined to be a cyclic peptide, named kalata B1, after the traditional name for the tea, kalata-kalata. Although in vivo studies in rats confirmed the uterotonic activity of the purified peptide, it was another 20 years before the cyclic cystine knot motif and structure of the purified peptide were elucidated. [29]
DD-transpeptidase is a bacterial enzyme that catalyzes the transfer of the R-L-αα-D-alanyl moiety of R-L-αα-D-alanyl-D-alanine carbonyl donors to the γ-OH of their active-site serine and from this to a final acceptor. It is involved in bacterial cell wall biosynthesis, namely, the transpeptidation that crosslinks the peptide side chains of peptidoglycan strands.
A conotoxin is one of a group of neurotoxic peptides isolated from the venom of the marine cone snail, genus Conus.
Cyclic peptides are polypeptide chains which contain a circular sequence of bonds. This can be through a connection between the amino and carboxyl ends of the peptide, for example in cyclosporin; a connection between the amino end and a side chain, for example in bacitracin; the carboxyl end and a side chain, for example in colistin; or two side chains or more complicated arrangements, for example in alpha-amanitin. Many cyclic peptides have been discovered in nature and many others have been synthesized in the laboratory. Their length ranges from just two amino acid residues to hundreds. In nature they are frequently antimicrobial or toxic; in medicine they have various applications, for example as antibiotics and immunosuppressive agents. Thin-Layer Chromatography (TLC) is a convenient method to detect cyclic peptides in crude extract from bio-mass.
The beta hairpin is a simple protein structural motif involving two beta strands that look like a hairpin. The motif consists of two strands that are adjacent in primary structure, oriented in an antiparallel direction, and linked by a short loop of two to five amino acids. Beta hairpins can occur in isolation or as part of a series of hydrogen bonded strands that collectively comprise a beta sheet.
The trefoil knot fold is a protein fold in which the protein backbone is twisted into a trefoil knot shape. "Shallow" knots in which the tail of the polypeptide chain only passes through a loop by a few residues are uncommon, but "deep" knots in which many residues are passed through the loop are extremely rare. Deep trefoil knots have been found in the SPOUT superfamily. including methyltransferase proteins involved in posttranscriptional RNA modification in all three domains of life, including bacterium Thermus thermophilus and proteins, in archaea and in eukaryota.
A cystine knot is a protein structural motif containing three disulfide bridges. The sections of polypeptide that occur between two of them form a loop through which a third disulfide bond passes, forming a rotaxane substructure. The cystine knot motif stabilizes protein structure and is conserved in proteins across various species. There are three types of cystine knot, which differ in the topology of the disulfide bonds:
In molecular biology, alanine scanning is a site-directed mutagenesis technique used to determine the contribution of a specific residue to the stability or function of a given protein. Alanine is used because of its non-bulky, chemically inert, methyl functional group that nevertheless mimics the secondary structure preferences that many of the other amino acids possess. Sometimes bulky amino acids such as valine or leucine are used in cases where conservation of the size of mutated residues is needed.
Theta-defensins are a family of mammalian antimicrobial peptides. They are found in non-human 'Old World' primates, but not in human, gorilla, bonobo, and chimpanzee.
Hedyotis (starviolet) is a genus of flowering plants in the family Rubiaceae. Many species of this genus such as Hedyotis biflora, H. corymbosa and H. diffusa are well known medicinal plants. Hedyotis is native to tropical and subtropical Asia and to islands of the northwest Pacific. It comprises about 115 species. The type species for the genus is Hedyotis fruticosa.
An inhibitor cystine knot is a protein structural motif containing three disulfide bridges. Knottins are one of three folds in the cystine knot motif; the other closely related knots are the growth factor cystine knot (GFCK) and the cyclic cystine knot. Types include a) cyclic mobius, b) cyclic bracelet, c) acyclic inhibitor knottins. Cystine knot motifs are found frequently in nature in a plethora of plants, animals, and fungi and serve diverse functions from appetite suppression to anti-fungal activity.
Ribosomally synthesized and post-translationally modified peptides (RiPPs), also known as ribosomal natural products, are a diverse class of natural products of ribosomal origin. Consisting of more than 20 sub-classes, RiPPs are produced by a variety of organisms, including prokaryotes, eukaryotes, and archaea, and they possess a wide range of biological functions.
Split-intein circular ligation of peptides and proteins (SICLOPPS) is a biotechnology technique that permits the creation of cyclic peptides. These peptides are produced by ribosomal protein synthesis, followed by an intein-like event that splices the protein into a loop. By contrast with the nonribosomal peptide synthetases that produces some cyclic peptides like gramicidin S, SICLOPPS offers the advantage that the peptides' structure can be encoded by DNA in a simple manner according to the genetic code, but for this reason it imposes limitations on the types of amino acids incorporated that are comparable to those that apply to ordinary proteins. As implemented there is also some constraint on the peptide sequence of the cyclic sequence; for example, libraries may use the sequence SGXX..XXPL to increase the efficiency of circularization of the peptide. SICLOPPS is frequently used with a library of randomized DNA sequence that permits the simultaneous production and screening of large numbers of constructs at once, followed by the recovery of the DNA sequences responsible for the activity of the clone of interest.
Cliotides are a group of related peptides that have been isolated from the heat-stable fraction of Clitoria ternatea (Cliotides) extracts. Cliotides belong to a larger classification of peptides, the cyclotides.
GTx1-15 is a toxin from the Chilean tarantula venom that acts as both a voltage-gated calcium channel blocker and a voltage-gated sodium channel blocker.
Asparagine endopeptidase is a proteolytic enzyme from C13 peptidase family which hydrolyses a peptide bond using the thiol group of a cysteine residue as a nucleophile. It is also known as asparaginyl endopeptidase, citvac, proteinase B, hemoglobinase, PRSC1 gene product or LGMN, vicilin peptidohydrolase and bean endopeptidase. In humans it is encoded by the LGMN gene.
A proteolipid is a protein covalently linked to lipid molecules, which can be fatty acids, isoprenoids or sterols. The process of such a linkage is known as protein lipidation, and falls into the wider category of acylation and post-translational modification. Proteolipids are abundant in brain tissue, and are also present in many other animal and plant tissues. They include ghrelin, a peptide hormone associated with feeding. Many proteolipids are composed of proteins covalenently bound to fatty acid chains, often granting them an interface for interacting with biological membranes. They are not to be confused with lipoproteins, a kind of spherical assembly made up of many molecules of lipids and some apolipoproteins.
μ-THTX-Cl6a, also known as Cl6a, is a 33-residue peptide toxin extracted from the venom of the spider Cyriopagopus longipes. The toxin acts as an inhibitor of the tetrodotoxin-sensitive (TTX-S) voltage-gated sodium channel (NaV1.7), thereby causing sustained reduction of NaV1.7 currents.
U7-ctenitoxin-Pn1a (or U7-CNTX-Pn1a for short) is a neurotoxin that blocks TRPV1 channels, and can exhibit analgestic effects. It is naturally found in the venom of Phoneutria nigriventer.
Protoxin-I, also known as ProTx-I, or Beta/omega-theraphotoxin-Tp1a, is a 35-amino-acid peptide neurotoxin extracted from the venom of the tarantula Thrixopelma pruriens. Protoxin-I belongs to the inhibitory cystine knot (ICK) family of peptide toxins, which have been known to potently inhibit voltage-gated ion channels. Protoxin-I selectively blocks low voltage threshold T-type calcium channels, voltage gated sodium channels and the nociceptor cation channel TRPA1. Due to its unique ability to bind to TRPA1, Protoxin-I has been implicated as a valuable pharmacological reagent with potential applications in clinical contexts with regards to pain and inflammation
Cl6b (μ-THTX-Cl6b) is a peptide toxin from the venom of the spider Cyriopagopus longipes. It acts as a sodium channel blocker: Cl6b significantly and persistently reduces currents through the tetrodotoxin-sensitive sodium channels NaV1.2-1.4, NaV1.6, and NaV1.7.