This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
Developer(s) | Prof Liam McGuffin Dr Recep Adiyaman Dr Bajuna Salehe |
---|---|
Stable release | IntFOLD version 5.0 |
Preview release | IntFOLD version 6.0 |
Written in | Java, Python, R |
Website | https://www.reading.ac.uk/bioinf/IntFOLD/ |
IntFOLD (Integrated Fold Recognition) is fully automated, integrated pipeline for prediction of 3D structure and function from amino acid sequences. [1] The pipeline is wrapped up and deployed as a publicly-available Web Server. [2] The core of the server method is quality assessment using built-in accuracy self-estimates (ASE) which improves performance prediction of 3D model using ModFOLD. [3]
IntFOLD server provides the tertiary structure prediction at a competitive accuracy and combines the cutting edge methods including IntFOLD-TS for generation of 3D models, [1] ModFOLD for 3D model quality estimation, [3] ReFOLD for refinement of 3D models, [4] DisoCLUST for disorder prediction, [5] DomFOLD for structural domain prediction, [6] and FunFOLD for protein ligand binding site prediction. [7] The integration of the tools enables users to reach all related information in a pipeline. IntFOLD Web Server has completed over 200,000 structure predictions since January 2010. [1]
The only required input is a protein sequence for the prediction of the protein 3D structure and function. [1] The IntFOLD output is presented via a user-friendly interface for the use of life scientists. The raw data is also formatted in Critical Assessment of Methods for Protein Structure Prediction (CASP) standards with a detailed help page. [1]
The IntFOLD method was firstly benchmarked in Critical Assessment of Techniques for Protein Structure Prediction 9 (CASP9) and ranked among the top 5. [8] The IntFOLD server has consolidated its performance in the following CASP experiments [1]
Its performance is being continually evaluated in Continuous Automated Model Evaluation (CAMEO) experiment. [9]
IntFOLD was used to generate 3D models of the SARS-CoV-2 targets for the CASP Commons COVID-19 initiative [10] and elsewhere [11] accelerating the race of vaccines and other therapeutics development with regard to COVID-19 pandemic. In other aspect of chronic diseases, IntFOLD was used to model HEV PCP, an essential protein of Hepatitis E virus causing Hepatitis E disease. [12] Additionally, IntFOLD was used to model disordered region of the Bovine milk αS2-casein proteins which were implicated in the formation amyloidogenic fibrils some of which are known to be major causes of neurodegenerative diseases. [13]
IntFOLD has been used in different aspects of food security. For instance, it has been used to model effector proteins molecules that causes fungus in Barley. [14] Furthermore, it has been applied in modelling several proteins involved in the functioning of key systems in Atlantic salmon, and HaACBP1 protein, which is vital for development and growth of sunflower, a key crop plant used for production of widely used cooking oil. [15] [16] IntFOLD was used to model Chitin proteins in Podosphaera xanthii, a causal agent of fungal disease called cucurbit powdery mildew, which hamper crop productivity. [17]
IntFOLD has been used as one of the standard server-based methods in validating the performance of some of the newer methods used in prediction of the 3D-protein models. This is important in advancing the structural bioinformatics field. [18]
Critical Assessment of Structure Prediction (CASP), sometimes called Critical Assessment of Protein Structure Prediction, is a community-wide, worldwide experiment for protein structure prediction taking place every two years since 1994. CASP provides research groups with an opportunity to objectively test their structure prediction methods and delivers an independent assessment of the state of the art in protein structure modeling to the research community and software users. Even though the primary goal of CASP is to help advance the methods of identifying protein three-dimensional structure from its amino acid sequence many view the experiment more as a "world championship" in this field of science. More than 100 research groups from all over the world participate in CASP on a regular basis and it is not uncommon for entire groups to suspend their other research for months while they focus on getting their servers ready for the experiment and on performing the detailed predictions.
InterPro is a database of protein families, protein domains and functional sites in which identifiable features found in known proteins can be applied to new protein sequences in order to functionally characterise them.
Nucleic acid structure prediction is a computational method to determine secondary and tertiary nucleic acid structure from its sequence. Secondary structure can be predicted from one or several nucleic acid sequences. Tertiary structure can be predicted from the sequence, or by comparative modeling.
This is a list of notable computer programs that are used for nucleic acids simulations.
SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, structural, and evolutionary units that form proteins. Domains of common Ancestry are grouped into superfamilies. The domains and domain superfamilies are defined and described in SCOP. Superfamilies are groups of proteins which have structural evidence to support a common evolutionary ancestor but may not have detectable sequence homology.
Phyre and Phyre2 are free web-based services for protein structure prediction. Phyre is among the most popular methods for protein structure prediction having been cited over 1,500 times. Like other remote homology recognition techniques, it is able to regularly generate reliable protein models when other widely used methods such as PSI-BLAST cannot. Phyre2 has been designed to ensure a user-friendly interface for users inexpert in protein structure prediction methods. Its development is funded by the Biotechnology and Biological Sciences Research Council.
Knotted proteins are proteins whose backbones entangle themselves in a knot. One can imagine pulling a protein chain from both termini, as though pulling a string from both ends. When a knotted protein is “pulled” from both termini, it does not get disentangled. Knotted proteins are very rare, making up only about one percent of the proteins in the Protein Data Bank, and their folding mechanisms and function are not well understood. Although there are experimental and theoretical studies that hint to some answers, systematic answers to these questions have not yet been found.
David Tudor Jones is a Professor of Bioinformatics, and Head of Bioinformatics Group in the University College London. He is also the director in Bloomsbury Center for Bioinformatics, which is a joint Research Centre between UCL and Birkbeck, University of London and which also provides bioinformatics training and support services to biomedical researchers. In 2013, he is a member of editorial boards for PLoS ONE, BioData Mining, Advanced Bioinformatics, Chemical Biology & Drug Design, and Protein: Structure, Function and Bioinformatics.
Swiss-model is a structural bioinformatics web-server dedicated to homology modeling of 3D protein structures. As of 2024, homology modeling is the most accurate method to generate reliable three-dimensional protein structure models and is routinely used in many practical applications. Homology modelling methods make use of experimental protein structures (templates) to build models for evolutionary related proteins (targets).
In molecular biology, protein fold classes are broad categories of protein tertiary structure topology. They describe groups of proteins that share similar amino acid and secondary structure proportions. Each class contains multiple, independent protein superfamilies.
Computer Atlas of Surface Topography of Proteins (CASTp) aims to provide comprehensive and detailed quantitative characterization of topographic features of protein, is now updated to version 3.0. Since its release in 2006, the CASTp server has ≈45000 visits and fulfills ≈33000 calculation requests annually. CASTp has been proven as a confident tool for a wide range of researches, including investigations of signaling receptors, discoveries of cancer therapeutics, understanding of mechanism of drug actions, studies of immune disorder diseases, analysis of protein–nanoparticle interactions, inference of protein functions and development of high-throughput computational tools. This server is maintained by Jie Liang's lab in University of Illinois at Chicago.
I-TASSER is a bioinformatics method for predicting three-dimensional structure model of protein molecules from amino acid sequences. It detects structure templates from the Protein Data Bank by a technique called fold recognition. The full-length structure models are constructed by reassembling structural fragments from threading templates using replica exchange Monte Carlo simulations. I-TASSER is one of the most successful protein structure prediction methods in the community-wide CASP experiments.
The ViennaRNA Package is software, a set of standalone programs and libraries used for predicting and analysing RNA nucleic acid secondary structures. The source code for the package is released as free and open-source software and compiled binaries are available for the operating systems Linux, macOS, and Windows. The original paper has been cited over 2,000 times.
Rita Casadio is an Adjunct Professor of Biochemistry/Biophysics in the Department of Pharmacy and Biotechnology at the University of Bologna.
[[
Computational methods that use protein sequence and/ or protein structure to predict protein aggregation. The table below, shows the main features of software for prediction of protein aggregation
The Predicted Aligned Error (PAE) is a quantitative output produced by AlphaFold, a protein structure prediction system developed by DeepMind. PAE estimates the expected positional error for each residue in a predicted protein structure if it were aligned to a corresponding residue in the true protein structure. This measurement helps scientists assess the confidence in the relative positions and orientations of different parts of the predicted protein model.