This is a list of computer software which is made for bioinformatics and released under open-source software licenses with articles in Wikipedia.
Software | Description | Platform | License | Developer |
---|---|---|---|---|
.NET Bio | Language-neutral toolkit built using the Microsoft 4.0 .NET Framework to help developers, researchers, and scientists | .NET Framework | Apache | Collaborative project |
AMPHORA | Metagenomics analysis software | Linux | GPL | ? |
Anduril | Component-based workflow framework for data analysis | Linux, macOS, Windows | GPL | University of Helsinki |
Ascalaph Designer | Computer program for general purpose molecular modelling for molecular design and simulations. | ? | GPLv2 | Agile Molecule |
AutoDock | Suite of automated docking tools | ? | GPL | ? |
Avogadro | C++ (Qt) based molecule editor and visualizer for in computational chemistry, molecular modeling, bioinformatics, materials science, and related areas. | ? | GPL | ? |
BEDtools | "Genome arithmetic" -- manipulation of coordinate sets and the extraction of sequences from a BED file. | Linux | MIT | QuinlanLab, University of Utah |
Bioclipse | Visual platform for chemo- and bioinformatics based on the Eclipse Rich Client Platform (RCP) | ? | Eclipse Public | The Bioclipse Project |
Bioconductor | R (programming language) language toolkit | Linux, macOS, Windows | Artistic 2.0 | Fred Hutchinson Cancer Research Center |
BioJava | Java library functions for manipulating sequences, protein structures, file parsers, CORBA interoperability, Distributed Annotation System (DAS), access to AceDB, dynamic programming, and simple statistical routines | Linux, macOS, Windows | LGPL v2.1 | Open Bioinformatics Foundation |
BioJS | JavaScript library of components to visualize biological data | Web browser | Apache | ? |
BioMOBY | Registry of web services | Web browser | Artistic | Open Bioinformatics Foundation |
BioPerl | Perl language toolkit | Cross-platform | Artistic, GPL | Open Bioinformatics Foundation |
BioPHP | PHP language toolkit with classes for DNA and protein sequence analysis, alignment, database parsing, and other bioinformatics tools | Cross-platform | GPL v2 | Open Bioinformatics Foundation |
Biopython | Python language toolkit | Cross-platform | Biopython [1] | Open Bioinformatics Foundation |
BioRuby | Ruby language toolkit | ? | GPL v2 or Ruby | Open Bioinformatics Foundation |
BLAST | Algorithm and program for comparing primary biological sequence information, including DNA and protein sequences. | Cross-platform | Public domain | National Center for Biotechnology Information |
CP2K | Perform atomistic simulations of solid state, liquid, molecular and biological systems, written in Fortran 2003. | ? | GPL and LGPL | Free open source GNU GPLv2 or later |
EMBOSS | Suite of packages for sequencing, searching, etc. written in C | ? | GPL and LGPL | Collaborative project |
Galaxy | Scientific workflow and data integration system | Unix-like | Academic Free | Collaborative project |
GenePattern | Scientific workflow system that provides access to hundreds of genomic analysis tools | Unix-like (public server); Linux, macOS, Windows | MIT | Broad Institute, UC San Diego |
Geworkbench | Genomic data integration platform | Linux, macOS, Windows | GeWorkbench License [2] | Columbia University |
GMOD | Toolkit to address many common challenges at biological databases | Unix-like (server), Web browser (client) | Varies depending on tool | Collaborative project |
GenGIS | Application that allows combining digital map data with information about biological sequences collected from the environment | Windows, macOS | GPL | Collaborative project |
Genomespace | Centralized web application that provides data format transformations and facilitates connections with other bioinformatics tools | Web browser | LGPL | Broad Institute, collaborative project |
GENtle | An equivalent to the proprietary Vector NTI, a tool to analyze and edit DNA sequence files | ? | GPL | Magnus Manske |
gget | Efficient querying of genomic reference databases including UniProt, National Center for Biotechnology Information, and Ensembl genome database project | Linux, macOS, Windows | BSD | Laura Luebbert and Lior Pachter |
GROMACS | Molecular dynamics package mainly designed for simulations of proteins, lipids and nucleic acids. | Linux, macOS, Windows | Common Public 1.0 | GenoViz |
Integrated Genome Browser | Java-based desktop genome browser | Linux, macOS, Windows | Common Public 1.0 | GenoViz |
InterMine | Extensive data warehouse system for the analysis and integration of biological datasets written in Java and JavaScript | Cross-platform | LGPL | University of Cambridge |
LabKey Server | Software platform, allows organizations to integrate, analyze, and share complex biomedical data | Linux, macOS, Windows | Apache | LabKey Software Foundation |
LAMMPS | Molecular dynamics program written in C++ | Linux, macOS, Windows | Apache | Sandia National Laboratories. |
mothur | Software for analysis of 16S rRNA gene amplicon sequence data | Linux, macOS, Windows | GPL | University of Michigan |
PathVisio | Desktop software for drawing, analyzing, and visualizing biological pathways | Linux, macOS, Windows | Apache 2.0 | Maastricht University |
Orange | Component-based data mining and machine learning software suite written in C++, featuring a visual programming front-end for exploratory data analysis and interactive visualization, and Python bindings and libraries for scripting | Linux, macOS, Windows | GPL | University of Ljubljana |
SAMtools | Utilities for interacting with high-throughput sequencing data and alignments in sam/bam format | Unix/Linux | MIT | Collaborative project |
SOAP Suite | Suite of tools for assembly, alignment, and analysis of short read next generation sequencing data | Unix/Linux, macOS | GPL | BGI |
Staden Package | Sequence assembly, editing, and analysis, mainly consisting of gap4, gap5, and spin. Written in C, C++, Fortran and Tcl. | Linux, macOS, Windows | BSD | Wellcome Trust Sanger Institute, Medical Research Council |
Taverna workbench | Tool to design and execute workflows | Linux, macOS, Windows | LGPL | myGrid |
T-REX (web server) | Inference, validation and visualization of phylogenetic trees and phylogenetic networks | Web browser, Windows | ? | Université du Québec à Montréal |
UGENE | Integrated bioinformatics tools, written in C++ (Qt) | Linux, macOS, Windows | GPL 2 | Unipro |
Unipept | Metaproteomics biodiversity analysis written in Ruby and JavaScript | Web browser | MIT | Ghent University |
VOTCA | A Coarse-grained modeling package for molecular dynamics, written in C++, Perl, BASH | Linux, macOS, Windows, any other Unix variety | Apache License 2.0 | Max Planck Institute for Polymer Research |
Bioinformatics is an interdisciplinary field of science that develops methods and software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, chemistry, physics, computer science, computer programming, information engineering, mathematics and statistics to analyze and interpret biological data. The subsequent process of analyzing and interpreting data is often referred to as computational biology, though the distinction between the two terms is often disputed.
Computational biology refers to the use of data analysis, mathematical modeling and computational simulations to understand biological systems and relationships. An intersection of computer science, biology, and big data, the field also has foundations in applied mathematics, chemistry, and genetics. It differs from biological computing, a subfield of computer science and engineering which uses bioengineering to build computers.
The Open Bioinformatics Foundation is a non-profit, volunteer-run organization focused on supporting open source programming in bioinformatics. The mission of the foundation is to support the development of open source toolkits for bioinformatics, organise developer-centric hackathon events and generally assist in the development and promotion of open source software development in the life sciences. The foundation also organises and runs the annual Bioinformatics Open Source Conference, a satellite meeting of the Intelligent Systems for Molecular Biology conference. The foundation participates in the Google Summer of Code, acting as an umbrella organisation for individual bioinformatics-related projects.
Visual Molecular Dynamics (VMD) is a molecular modelling and visualization computer program. VMD is developed mainly as a tool to view and analyze the results of molecular dynamics simulations. It also includes tools for working with volumetric data, sequence data, and arbitrary graphics objects. Molecular scenes can be exported to external rendering tools such as POV-Ray, RenderMan, Tachyon, Virtual Reality Modeling Language (VRML), and many others. Users can run their own Tcl and Python scripts within VMD as it includes embedded Tcl and Python interpreters. VMD runs on Unix, Apple Mac macOS, and Microsoft Windows. VMD is available to non-commercial users under a distribution-specific license which permits both use of the program and modification of its source code, at no charge.
RasMol is a computer program written for molecular graphics visualization intended and used mainly to depict and explore biological macromolecule structures, such as those found in the Protein Data Bank (PDB).
PyMOL is a source-available molecular visualization system created by Warren Lyford DeLano. It was commercialized initially by DeLano Scientific LLC, which was a private software company dedicated to creating useful tools that become universally accessible to scientific and educational communities. It is currently commercialized by Schrödinger, Inc. As the original software license was a permissive licence, they were able to remove it; new versions are no longer released under the Python license, but under a custom license, and some of the source code is no longer released. PyMOL can produce high-quality 3D images of small molecules and biological macromolecules, such as proteins. According to the original author, by 2009, almost a quarter of all published images of 3D protein structures in the scientific literature were made using PyMOL.
Bioconductor is a free, open source and open development software project for the analysis and comprehension of genomic data generated by wet lab experiments in molecular biology.
Jmol is computer software for molecular modelling chemical structures in 3-dimensions. Jmol returns a 3D representation of a molecule that may be used as a teaching tool, or for research e.g., in chemistry and biochemistry.
The Chemistry Development Kit (CDK) is computer software, a library in the programming language Java, for chemoinformatics and bioinformatics. It is available for Windows, Linux, Unix, and macOS. It is free and open-source software distributed under the GNU Lesser General Public License (LGPL) 2.0.
The Common Lisp Interface Manager (CLIM) is a Common Lisp-based programming interface for creating user interfaces, i.e., graphical user interfaces (GUIs). It provides an application programming interface (API) to user interface facilities for the programming language Lisp. It is a fully object-oriented programming user interface management system, using the Common Lisp Object System (CLOS) and is based on the mechanism of stream input and output. There are also facilities for output device independence. It is descended from the GUI system Dynamic Windows of Symbolics' Lisp machines between 1988 and 1993.
... you can check out Common Lisp Interface Manager (CLIM). A descendant of the Symbolics Lisp machines GUI framework, CLIM is powerful but complex. Although many commercial Common Lisp implementations actually support it, it doesn't seem to have seen a lot of use. But in the past couple years, an open-source implementation of CLIM, McCLIM – now hosted at Common-Lisp.net – has been picking up steam lately, so we may be on the verge of a CLIM renaissance. – From Practical Common Lisp
BALL is a C++ class framework and set of algorithms and data structures for molecular modelling and computational structural bioinformatics, a Python interface to this library, and a graphical user interface to BALL, the molecule viewer BALLView.
Physiomics is a systematic study of physiome in biology. Physiomics employs bioinformatics to construct networks of physiological features that are associated with genes, proteins and their networks. A few of the methods for determining individual relationships between the DNA sequence and physiological function include metabolic pathway engineering and RNAi analysis. The relationships derived from methods such as these are organized and processed computationally to form distinct networks. Computer models use these experimentally determined networks to develop further predictions of gene function.
Avogadro is a molecule editor and visualizer designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science, and related areas. It is extensible via a plugin architecture.
geWorkbench is an open-source software platform for integrated genomic data analysis. It is a desktop application written in the programming language Java. geWorkbench uses a component architecture. As of 2016, there are more than 70 plug-ins available, providing for the visualization and analysis of gene expression, sequence, and structure data.
Pharmaceutical bioinformatics is a research field related to bioinformatics but with the focus on studying biological and chemical processes in the pharmaceutical area; to understand how xenobiotics interact with the human body and the drug discovery process.