Unipept

Last updated
Unipept
Original author(s) Ghent University
Developer(s) Bart Mesuere
Initial release3 February 2011;12 years ago (2011-02-03)
Stable release
4.1.1 / 21 January 2019;4 years ago (2019-01-21) [1]
Repository
Written in Ruby, JavaScript
Type Bioinformatics, Data visualization
License MIT License
Website unipept.ugent.be

Unipept is an open source research tool for biodiversity analysis of metaproteomics samples. It also contains a tool to select peptides to use as biomarker and a tool to compare the genome of organisms based on their protein content. [2] [3] The software is developed at Ghent University.

Unipept consists of a web application and a stand-alone command line tool. The web application uses interactive data visualizations to explore datasets. The command line tool contains the same functionality, but is designed for use in automated data processing pipelines. [4]

Related Research Articles

<span class="mw-page-title-main">IRAF</span> Software collection for astronomical data reduction and data analysis

IRAF is a collection of software written at the National Optical Astronomy Observatory (NOAO) geared towards the reduction of astronomical images and spectra in pixel array form. This is primarily data taken from imaging array detectors such as CCDs. It is available for all major operating systems for mainframes and desktop computers. IRAF was designed cross-platform, supporting VMS and UNIX-like operating systems. Use on Microsoft Windows was made possible by Cygwin in earlier versions, and can be today done with the Windows Subsystem for Linux. Today, it is primarily used on macOS and Linux.

BioJava is an open-source software project dedicated to provide Java tools to process biological data. BioJava is a set of library functions written in the programming language Java for manipulating sequences, protein structures, file parsers, Common Object Request Broker Architecture (CORBA) interoperability, Distributed Annotation System (DAS), access to AceDB, dynamic programming, and simple statistical routines. BioJava supports a huge range of data, starting from DNA and protein sequences to the level of 3D protein structures. The BioJava libraries are useful for automating many daily and mundane bioinformatics tasks such as to parsing a Protein Data Bank (PDB) file, interacting with Jmol and many more. This application programming interface (API) provides various file parsers, data models and algorithms to facilitate working with the standard data formats and enables rapid application development and analysis.

<span class="mw-page-title-main">Liquid chromatography–mass spectrometry</span> Analytical chemistry technique

Liquid chromatography–mass spectrometry (LC–MS) is an analytical chemistry technique that combines the physical separation capabilities of liquid chromatography with the mass analysis capabilities of mass spectrometry (MS). Coupled chromatography - MS systems are popular in chemical analysis because the individual capabilities of each technique are enhanced synergistically. While liquid chromatography separates mixtures with multiple components, mass spectrometry provides spectral information that may help to identify each separated component. MS is not only sensitive, but provides selective detection, relieving the need for complete chromatographic separation. LC-MS is also appropriate for metabolomics because of its good coverage of a wide range of chemicals. This tandem technique can be used to analyze biochemical, organic, and inorganic compounds commonly found in complex samples of environmental and biological origin. Therefore, LC-MS may be applied in a wide range of sectors including biotechnology, environment monitoring, food processing, and pharmaceutical, agrochemical, and cosmetic industries. Since the early 2000s, LC-MS has also begun to be used in clinical applications.

Mass spectrometry is a scientific technique for measuring the mass-to-charge ratio of ions. It is often coupled to chromatographic techniques such as gas- or liquid chromatography and has found widespread adoption in the fields of analytical chemistry and biochemistry where it can be used to identify and characterize small molecules and proteins (proteomics). The large volume of data produced in a typical mass spectrometry experiment requires that computers be used for data storage and processing. Over the years, different manufacturers of mass spectrometers have developed various proprietary data formats for handling such data which makes it difficult for academic scientists to directly manipulate their data. To address this limitation, several open, XML-based data formats have recently been developed by the Trans-Proteomic Pipeline at the Institute for Systems Biology to facilitate data manipulation and innovation in the public sector. These data formats are described here.

The Apple Developer Tools are a suite of software tools from Apple to aid in making software dynamic titles for the macOS and iOS platforms. The developer tools were formerly included on macOS install media, but are now exclusively distributed over the Internet. As of macOS 10.12, Xcode is available as a free download from the Mac App Store.

The Trans-Proteomic Pipeline (TPP) is an open-source data analysis software for proteomics developed at the Institute for Systems Biology (ISB) by the Ruedi Aebersold group under the Seattle Proteome Center. The TPP includes PeptideProphet, ProteinProphet, ASAPRatio, XPRESS and Libra.

Mass spectrometry imaging (MSI) is a technique used in mass spectrometry to visualize the spatial distribution of molecules, as biomarkers, metabolites, peptides or proteins by their molecular masses. After collecting a mass spectrum at one spot, the sample is moved to reach another region, and so on, until the entire sample is scanned. By choosing a peak in the resulting spectra that corresponds to the compound of interest, the MS data is used to map its distribution across the sample. This results in pictures of the spatially resolved distribution of a compound pixel by pixel. Each data set contains a veritable gallery of pictures because any peak in each spectrum can be spatially mapped. Despite the fact that MSI has been generally considered a qualitative method, the signal generated by this technique is proportional to the relative abundance of the analyte. Therefore, quantification is possible, when its challenges are overcome. Although widely used traditional methodologies like radiochemistry and immunohistochemistry achieve the same goal as MSI, they are limited in their abilities to analyze multiple samples at once, and can prove to be lacking if researchers do not have prior knowledge of the samples being studied. Most common ionization technologies in the field of MSI are DESI imaging, MALDI imaging and secondary ion mass spectrometry imaging.

<span class="mw-page-title-main">Matrix-assisted laser desorption electrospray ionization</span>

Matrix-assisted laser desorption electrospray ionization (MALDESI) was first introduced in 2006 as a novel ambient ionization technique which combines the benefits of electrospray ionization (ESI) and matrix-assisted laser desorption/ionization (MALDI). An infrared (IR) or ultraviolet (UV) laser can be utilized in MALDESI to resonantly excite an endogenous or exogenous matrix. The term ‘matrix’ refers to any molecule that is present in large excess and absorbs the energy of the laser, thus facilitating desorption of analyte molecules. The original MALDESI design was implemented using common organic matrices, similar to those used in MALDI, along with a UV laser. The current MALDESI source employs endogenous water or a thin layer of exogenously deposited ice as the energy-absorbing matrix where O-H symmetric and asymmetric stretching bonds are resonantly excited by a mid-IR laser.

<span class="mw-page-title-main">UGENE</span>

UGENE is computer software for bioinformatics. It works on personal computer operating systems such as Windows, macOS, or Linux. It is released as free and open-source software, under a GNU General Public License (GPL) version 2.

Metaproteomics is an umbrella term for experimental approaches to study all proteins in microbial communities and microbiomes from environmental sources. Metaproteomics is used to classify experiments that deal with all proteins identified and quantified from complex microbial communities. Metaproteomics approaches are comparable to gene-centric environmental genomics, or metagenomics.

A scientific workflow system is a specialized form of a workflow management system designed specifically to compose and execute a series of computational or data manipulation steps, or workflow, in a scientific application.

LabKey Server is a software suite available for scientists to integrate, analyze, and share biomedical research data. The platform provides a secure data repository that allows web-based querying, reporting, and collaborating across a range of data sources. Specific scientific applications and workflows can be added on top of the basic platform and leverage a data processing pipeline.

<span class="mw-page-title-main">DNA barcoding</span> Method of species identification using a short section of DNA

DNA barcoding is a method of species identification using a short section of DNA from a specific gene or genes. The premise of DNA barcoding is that by comparison with a reference library of such DNA sections, an individual sequence can be used to uniquely identify an organism to species, just as a supermarket scanner uses the familiar black stripes of the UPC barcode to identify an item in its stock against its reference database. These "barcodes" are sometimes used in an effort to identify unknown species or parts of an organism, simply to catalog as many taxa as possible, or to compare with traditional taxonomy in an effort to determine species boundaries.

MG-RAST is an open-source web application server that suggests automatic phylogenetic and functional analysis of metagenomes. It is also one of the biggest repositories for metagenomic data. The name is an abbreviation of Metagenomic Rapid Annotations using Subsystems Technology. The pipeline automatically produces functional assignments to the sequences that belong to the metagenome by performing sequence comparisons to databases in both nucleotide and amino-acid levels. The applications supply phylogenetic and functional assignments of the metagenome being analysed, as well as tools for comparing different metagenomes. It also provides a RESTful API for programmatic access.

Skyline is an open source software for targeted proteomics and metabolomics data analysis. It runs on Microsoft Windows and supports the raw data formats from multiple mass spectrometric vendors. It contains a graphical user interface to display chromatographic data for individual peptide or small molecule analytes.

The Ebbe Nielsen Challenge is an international science competition conducted annually from 2015 onwards by the Global Biodiversity Information Facility (GBIF), with a set of cash prizes that recognize researcher(s)' submissions in creating software or approaches that successfully address a GBIF-issued challenge in the field of biodiversity informatics. It succeeds the Ebbe Nielsen Prize, which was awarded annually by GBIF between 2002 and 2014. The name of the challenge honours the memory of prominent entomologist and biodiversity informatics proponent Ebbe Nielsen, who died of a heart attack in the U.S.A. en route to the 2001 GBIF Governing Board meeting.

<span class="mw-page-title-main">Stein Aerts</span> Belgian bio-engineer and computational biologist

Stein Aerts is a Belgian bio-engineer and computational biologist. He leads the Laboratory of Computational Biology at VIB and KU Leuven, and has received several accolades for his research into the workings of the genomic regulatory code.

References

  1. "Unipept Releases". GitHub . Retrieved April 14, 2019.
  2. Mesuere, Bart; Devreese, Bart; Debyser, Griet; Aerts, Maarten; Vandamme, Peter; Dawyndt, Peter (2012). "Unipept: Tryptic Peptide-Based Biodiversity Analysis of Metaproteome Samples". Journal of Proteome Research. 11 (12): 5773–80. doi:10.1021/pr300576s. ISSN   1535-3893. PMID   23153116.
  3. Mesuere, Bart; Debyser, Griet; Aerts, Maarten; Devreese, Bart; Vandamme, Peter; Dawyndt, Peter (2015). "The Unipept metaproteomics analysis pipeline". Proteomics. 15 (8): 1437–1442. doi:10.1002/pmic.201400361. ISSN   1615-9853. PMID   25477242. S2CID   43820480.
  4. "Unipept CLI documentation" . Retrieved August 24, 2015.