BRENDA

Last updated
BRENDA (The Comprehensive Enzyme Information System)
Database.png
Content
Description molecular and biochemical information on enzymes that have been classified by the IUBMB
Contact
Research center Technische Universität Braunschweig, Department of Biochemistry and Bioinformatics
Primary citation PMID   21062828
Release date1987
Access
Website http://www.brenda-enzymes.org
Download URL form
Web service URL soap

BRENDA (The Comprehensive Enzyme Information System) is an information system representing one of the most comprehensive enzyme repositories. It is an electronic resource that comprises molecular and biochemical information on enzymes that have been classified by the IUBMB. Every classified enzyme is characterized with respect to its catalyzed biochemical reaction. Kinetic properties of the corresponding reactants (that is, substrates and products) are described in detail. BRENDA contains enzyme-specific data manually extracted from primary scientific literature and additional data derived from automatic information retrieval methods such as text mining. It provides a web-based user interface that allows a convenient and sophisticated access to the data.

Contents

History

BRENDA was founded in 1987 at the former German Research Centre for Biotechnology (now the Helmholtz Centre for Infection Research) in Braunschweig and was originally published as a series of books. Its name was originally an acronym for the Braunschweig Enzyme Database. From 1996 to 2007, BRENDA was located at the University of Cologne. There, BRENDA developed into a publicly accessible enzyme information system. [1] In 2007, BRENDA returned to Braunschweig. Currently, BRENDA is maintained and further developed at the BRICS ( Braunschweig Integrated Centre of Systems Biology) at the TU Braunschweig.

In 2018 BRENDA was appointed to ELIXIR Core Data Resource with fundamental importance to biological and biomedical research and long-term preservation of biological data.

Updates

A major update of the data in BRENDA is performed twice a year. Besides the upgrade of its content, improvements of the user interface are also incorporated into the BRENDA database.

Content and features

Database:

The database contains more than 40 data fields with enzyme-specific information on more than 8300 EC numbers that are classified according to the IUBMB. The different data fields cover information on the enzyme's nomenclature, reaction and specificity, enzyme structure, isolation and preparation, enzyme stability, kinetic parameters such as Km value and turnover number, occurrence and localization, mutants and engineered enzymes, application of enzymes and ligand-related data. Currently, BRENDA contains manually annotated data from over 165,000 different scientific articles. Each enzyme entry is clearly linked to at least one literature reference, to its source organism, and, where available, to the protein sequence of the enzyme. [1] An important part of BRENDA represent the almost 260,000 enzyme ligands, which are available on their names, synonyms or via the chemical structure. The term "ligand" is used in this context to all low molecular weight compounds which interact with enzymes. These include not only metabolites of primary metabolism, co-substrates or cofactors but also enzyme inhibitors or metal ions. The origin of these molecules ranges from naturally occurring antibiotics to synthetic compounds that have been synthesized for the development of drugs or pesticides. Furthermore, cross-references to external information resources such as sequence and 3D-structure databases, as well as biomedical ontologies, are provided.

Extensions:

Since 2006, the data in BRENDA is supplemented with information extracted from the scientific literature by a co-occurrence based text mining approach. For this purpose, four text-mining repositories FRENDA (Full Reference ENzyme DAta), AMENDA (Automatic Mining of ENzyme DAta), DRENDA (Disease-Related ENzyme information DAtabase) and KENDA (Kinetic ENzyme DAta) were introduced. These text-mining results were derived from the titles and abstracts of all articles in the literature database PubMed. [1] [2] [3]

Data access:

There are several tools to obtain access to the data in BRENDA. Some of them are listed here.

Availability

The usage of BRENDA is free of charge. In addition, FRENDA and AMENDA are free for non-profit users. Commercial users are in need of a license for these databases through BIOBASE.

Other databases

BRENDA provides links to several other databases with a different focus on the enzyme, e.g., metabolic function or enzyme structure. Other links lead to ontological information on the corresponding gene of the enzyme in question. Links to the literature are established with PubMed. BRENDA links to some further databases and repositories such as:

Related Research Articles

The Protein Information Resource (PIR), located at Georgetown University Medical Center, is an integrated public bioinformatics resource to support genomic and proteomic research, and scientific studies. It contains protein sequences databases

The DrugBank database is a comprehensive, freely accessible, online database containing information on drugs and drug targets created and maintained by the University of Alberta and The Metabolomics Innovation Centre located in Alberta, Canada. As both a bioinformatics and a cheminformatics resource, DrugBank combines detailed drug data with comprehensive drug target information. DrugBank has used content from Wikipedia; Wikipedia also often links to Drugbank, posing potential circular reporting issues.

<span class="mw-page-title-main">KEGG</span> Collection of bioinformatics databases

KEGG is a collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics, metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug development.

<span class="mw-page-title-main">Amos Bairoch</span>

Amos Bairoch is a Swiss bioinformatician and Professor of Bioinformatics at the Department of Human Protein Sciences of the University of Geneva where he leads the CALIPHO group at the Swiss Institute of Bioinformatics (SIB) combining bioinformatics, curation, and experimental efforts to functionally characterize human proteins.

Chemical Entities of Biological Interest, also known as ChEBI, is a chemical database and ontology of molecular entities focused on 'small' chemical compounds, that is part of the Open Biomedical Ontologies (OBO) effort at the European Bioinformatics Institute (EBI). The term "molecular entity" refers to any "constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer, etc., identifiable as a separately distinguishable entity". The molecular entities in question are either products of nature or synthetic products which have potential bioactivity. Molecules directly encoded by the genome, such as nucleic acids, proteins and peptides derived from proteins by proteolytic cleavage, are not as a rule included in ChEBI.

Reactome is a free online database of biological pathways. There are several Reactomes that concentrate on specific organisms, the largest of these is focused on human biology, the following description concentrates on the human Reactome. It is authored by biologists, in collaboration with Reactome editorial staff. The content is cross-referenced to many bioinformatics databases. The rationale behind Reactome is to visually represent biological pathways in full mechanistic detail, while making the source data available in a computationally accessible format.

<span class="mw-page-title-main">Generic Model Organism Database</span>

The Generic Model Organism Database (GMOD) project provides biological research communities with a toolkit of open-source software components for visualizing, annotating, managing, and storing biological data. The GMOD project is funded by the United States National Institutes of Health, National Science Foundation and the USDA Agricultural Research Service.

The MetaCyc database is one of the largest metabolic pathways and enzymes databases currently available. The data in the database is manually curated from the scientific literature, and covers all domains of life. MetaCyc has extensive information about chemical compounds, reactions, metabolic pathways and enzymes. The data have been curated from more than 58,000 publications.

<span class="mw-page-title-main">MicrobesOnline</span>

MicrobesOnline is a publicly and freely accessible website that hosts multiple comparative genomic tools for comparing microbial species at the genomic, transcriptomic and functional levels. MicrobesOnline was developed by the Virtual Institute for Microbial Stress and Survival, which is based at the Lawrence Berkeley National Laboratory in Berkeley, California. The site was launched in 2005, with regular updates until 2011.

IntEnz contains data on enzymes organized by enzyme EC number and is the official version of the Enzyme Nomenclature system developed by the International Union of Biochemistry and Molecular Biology.

The BTO represents a comprehensive structured encyclopedia. It provides terms, classifications, and definitions of tissues, organs, anatomical structures, plant parts, cell cultures, cell types, and cell lines of organisms from all taxonomic groups as enzyme sources. The information is connected to the functional data in the BRENDA enzyme information system.

PDBsum is a database that provides an overview of the contents of each 3D macromolecular structure deposited in the Protein Data Bank. The original version of the database was developed around 1995 by Roman Laskowski and collaborators at University College London. As of 2014, PDBsum is maintained by Laskowski and collaborators in the laboratory of Janet Thornton at the European Bioinformatics Institute (EBI).

<span class="mw-page-title-main">SABIO-Reaction Kinetics Database</span>

SABIO-RK is a web-accessible database storing information about biochemical reactions and their kinetic properties.

<span class="mw-page-title-main">Human Metabolome Database</span> Database of human metabolites

The Human Metabolome Database (HMDB) is a comprehensive, high-quality, freely accessible, online database of small molecule metabolites found in the human body. Created by the Human Metabolome Project funded by Genome Canada. One of the first dedicated metabolomics databases, the HMDB facilitates human metabolomics research, including the identification and characterization of human metabolites using NMR spectroscopy, GC-MS spectrometry and LC/MS spectrometry. To aid in this discovery process, the HMDB contains three kinds of data: 1) chemical data, 2) clinical data, and 3) molecular biology/biochemistry data. The chemical data includes 41,514 metabolite structures with detailed descriptions along with nearly 10,000 NMR, GC-MS and LC/MS spectra.

The Small Molecule Pathway Database (SMPDB) is a comprehensive, high-quality, freely accessible, online database containing more than 600 small molecule pathways found in humans. SMPDB is designed specifically to support pathway elucidation and pathway discovery in metabolomics, transcriptomics, proteomics and systems biology. It is able to do so, in part, by providing colorful, detailed, fully searchable, hyperlinked diagrams of five types of small molecule pathways: 1) general human metabolic pathways; 2) human metabolic disease pathways; 3) human metabolite signaling pathways; 4) drug-action pathways and 5) drug metabolism pathways. SMPDB pathways may be navigated, viewed and zoomed interactively using a Google Maps-like interface. All SMPDB pathways include information on the relevant organs, subcellular compartments, protein cofactors, protein locations, metabolite locations, chemical structures and protein quaternary structures. Each small molecule in SMPDB is hyperlinked to detailed descriptions contained in the HMDB or DrugBank and each protein or enzyme complex is hyperlinked to UniProt. Additionally, all SMPDB pathways are accompanied with detailed descriptions and references, providing an overview of the pathway, condition or processes depicted in each diagram. Users can browse the SMPDB or search its contents by text searching, sequence searching, or chemical structure searching. More powerful queries are also possible including searching with lists of gene or protein names, drug names, metabolite names, GenBank IDs, Swiss-Prot IDs, Agilent or Affymetrix microarray IDs. These queries will produce lists of matching pathways and highlight the matching molecules on each of the pathway diagrams. Gene, metabolite and protein concentration data can also be visualized through SMPDB's mapping interface.

<span class="mw-page-title-main">Experimental factor ontology</span>

Experimental factor ontology, also known as EFO, is an open-access ontology of experimental variables particularly those used in molecular biology. The ontology covers variables which include aspects of disease, anatomy, cell type, cell lines, chemical compounds and assay information. EFO is developed and maintained at the EMBL-EBI as a cross-cutting resource for the purposes of curation, querying and data integration in resources such as Ensembl, ChEMBL and Expression Atlas.

<span class="mw-page-title-main">BacDive</span> Online database for bacteria

BacDive is a bacterial metadatabase that provides strain-linked information about bacterial and archaeal biodiversity.

Model organism databases (MODs) are biological databases, or knowledgebases, dedicated to the provision of in-depth biological data for intensively studied model organisms. MODs allow researchers to easily find background information on large sets of genes, plan experiments efficiently, combine their data with existing knowledge, and construct novel hypotheses. They allow users to analyse results and interpret datasets, and the data they generate are increasingly used to describe less well studied species. Where possible, MODs share common approaches to collect and represent biological information. For example, all MODs use the Gene Ontology (GO) to describe functions, processes and cellular locations of specific gene products. Projects also exist to enable software sharing for curation, visualization and querying between different MODs. Organismal diversity and varying user requirements however mean that MODs are often required to customize capture, display, and provision of data.

<span class="mw-page-title-main">Biocuration</span>

Biocuration is the field of life sciences dedicated to organizing biomedical data, information and knowledge into structured formats, such as spreadsheets, tables and knowledge graphs. The biocuration of biomedical knowledge is made possible by the cooperative work of biocurators, software developers and bioinformaticians and is at the base of the work of biological databases.

References

  1. 1 2 3 Chang A, Scheer M, Grote A, Schomburg I, Schomburg D (2008). "BRENDA, AMENDA and FRENDA the enzyme information system: new content and tools in 2009". Nucleic Acids Res. 37 (Database issue): D588-92. doi:10.1093/nar/gkn820. PMC   2686525 . PMID   18984617.
  2. Schomburg I, Chang A, Placzek S, Söhngen C, Rother M, Lang M, Munaretto C, Ulas S, Stelzer M, Grote A, Scheer M, Schomburg D: "BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA", Nucleic Acids Res., 41 (Database issue): D764-D772
  3. Barthelmes J, Ebeling C, Chang A, Schomburg I, Schomburg D (2006). "BRENDA, AMENDA and FRENDA: the enzyme information system in 2007". Nucleic Acids Res. 35 (Database issue): D511-4. doi:10.1093/nar/gkl972. PMC   1899097 . PMID   17202167.

Further reading