SADI

Last updated

Semantic Automated Discovery and Integration (SADI) [1] [2] is a lightweight set of fully standards-compliant Semantic Web service design patterns that simplify the publication of services of the type commonly found in bioinformatics and other scientific domains. SADI services utilize Semantic Web technologies at every level of the Web services "stack". Services are described in OWL-DL, where the property restrictions in OWL classes are used to define the properties expected of the input and output data. Invocation of SADI Services is achieved through HTTP POST of RDF data representing OWL Individuals ('instances') of the defined input OWL Class, and the resulting output data will be OWL Individuals of the defined output OWL Class.

SADI design patterns place a single unique constraint on the behaviour of the Service, in that the URI of the input individual, and the URI of the output individual must be identical. The consequence of this constraint is that the Service provider must connect the output to the input through a defined predicate; effectively, the output is "about" the input, and the relationship between input and output is explicit. As such, SADI services become the source of new Linked Data, relating the input and output of a service, and chains of SADI services produce uninterrupted Linked Data graphs.

SADI has been used in a number of Bioinformatics data integration case studies [3] [4] [5] and for semantic querying of relational data in Clinical Informatics settings. [6]

Software

Related Research Articles

<span class="mw-page-title-main">Semantic Web</span> Extension of the Web to facilitate data exchange

The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.

<span class="mw-page-title-main">BioPerl</span> Collection of Perl modules for bioinformatics

BioPerl is a collection of Perl modules that facilitate the development of Perl scripts for bioinformatics applications. It has played an integral role in the Human Genome Project.

Biomedical text mining refers to the methods and study of how text mining may be applied to texts and literature of the biomedical domain. As a field of research, biomedical text mining incorporates ideas from natural language processing, bioinformatics, medical informatics and computational linguistics. The strategies in this field have been applied to the biomedical literature available through services such as PubMed.

Vasant G. Honavar is an Indian born American computer scientist, and artificial intelligence, machine learning, big data, data science, causal inference, knowledge representation, bioinformatics and health informatics researcher and professor.

A semantic web service, like conventional web services, is the server end of a client–server system for machine-to-machine interaction via the World Wide Web. Semantic services are a component of the semantic web because they use markup which makes data machine-readable in a detailed and sophisticated way.

The Open Biological and Biomedical Ontologies (OBO) Foundry is a group of people dedicated to build and maintain ontologies related to the life sciences. The OBO Foundry establishes a set of principles for ontology development for creating a suite of interoperable reference ontologies in the biomedical domain. Currently, there are more than a hundred ontologies that follow the OBO Foundry principles.

BioMOBY is a registry of web services used in bioinformatics. It allows interoperability between biological data hosts and analytical services by annotating services with terms taken from standard ontologies. BioMOBY is released under the Artistic License.

<span class="mw-page-title-main">Systems Biology Ontology</span>

The Systems Biology Ontology (SBO) is a set of controlled, relational vocabularies of terms commonly used in systems biology, and in particular in computational modeling.

<span class="mw-page-title-main">Generic Model Organism Database</span>

The Generic Model Organism Database (GMOD) project provides biological research communities with a toolkit of open-source software components for visualizing, annotating, managing, and storing biological data. The GMOD project is funded by the United States National Institutes of Health, National Science Foundation and the USDA Agricultural Research Service.

<span class="mw-page-title-main">Apache Taverna</span>

Apache Taverna was an open source software tool for designing and executing workflows, initially created by the myGrid project under the name Taverna Workbench, then a project under the Apache incubator. Taverna allowed users to integrate many different software components, including WSDL SOAP or REST Web services, such as those provided by the National Center for Biotechnology Information, the European Bioinformatics Institute, the DNA Databank of Japan (DDBJ), SoapLab, BioMOBY and EMBOSS. The set of available services was not finite and users could import new service descriptions into the Taverna Workbench.

<span class="mw-page-title-main">Robert Stevens (scientist)</span>

Robert David Stevens is a professor of bio-health informatics. and former Head of Department of Computer Science at The University of Manchester

The National Centre for Text Mining (NaCTeM) is a publicly funded text mining (TM) centre. It was established to provide support, advice, and information on TM technologies and to disseminate information from the larger TM community, while also providing tailored services and tools in response to the requirements of the United Kingdom academic community.

The concept of the Social Semantic Web subsumes developments in which social interactions on the Web lead to the creation of explicit and semantically rich knowledge representations. The Social Semantic Web can be seen as a Web of collective knowledge systems, which are able to provide useful information based on human contributions and which get better as more people participate. The Social Semantic Web combines technologies, strategies and methodologies from the Semantic Web, social software and the Web 2.0.

Simple Sloppy Semantic Database (S3DB) is a distributed data management system that relies on Semantic Web concepts for management of heterogeneous data.

<span class="mw-page-title-main">Galaxy (computational biology)</span>

Galaxy is a scientific workflow, data integration, and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. Although it was initially developed for genomics research, it is largely domain agnostic and is now used as a general bioinformatics workflow management system.

<span class="mw-page-title-main">Ontology engineering</span> Field that studies the methods and methodologies for building ontologies

In computer science, information science and systems engineering, ontology engineering is a field which studies the methods and methodologies for building ontologies, which encompasses a representation, formal naming and definition of the categories, properties and relations between the concepts, data and entities. In a broader sense, this field also includes a knowledge construction of the domain using formal ontology representations such as OWL/RDF. A large-scale representation of abstract concepts such as actions, time, physical objects and beliefs would be an example of ontological engineering. Ontology engineering is one of the areas of applied ontology, and can be seen as an application of philosophical ontology. Core ideas and objectives of ontology engineering are also central in conceptual modeling.

<span class="mw-page-title-main">NeuroLex</span> Dynamic lexicon of neuroscience concepts

NeuroLex is a lexicon of neuroscience concepts supported by the Neuroscience Information Framework project. It is structured as a semantic wiki, using Semantic MediaWiki.

The BioCatalogue is a curated catalogue of Life Science Web Services. The BioCatalogue was launched in June 2009 at the Intelligent Systems for Molecular Biology Conference. The project is a collaboration between the myGrid project at the University of Manchester led by Carole Goble and the European Bioinformatics Institute led by Rodrigo Lopez. It is funded by the Biotechnology and Biological Sciences Research Council.

MG-RAST is an open-source web application server that suggests automatic phylogenetic and functional analysis of metagenomes. It is also one of the biggest repositories for metagenomic data. The name is an abbreviation of Metagenomic Rapid Annotations using Subsystems Technology. The pipeline automatically produces functional assignments to the sequences that belong to the metagenome by performing sequence comparisons to databases in both nucleotide and amino-acid levels. The applications supply phylogenetic and functional assignments of the metagenome being analysed, as well as tools for comparing different metagenomes. It also provides a RESTful API for programmatic access.

DisGeNET is a discovery platform designed to address a variety of questions concerning the genetic underpinning of human diseases. DisGeNET is one of the largest and comprehensive repositories of human gene-disease associations (GDAs) currently available. It also offers a set of bioinformatic tools to facilitate the analysis of these data by different user profiles. It is maintained by the Integrative Biomedical Informatics (IBI) Group, of the (GRIB)-IMIM/UPF, based at the Barcelona Biomedical Research Park (PRBB), Barcelona, Spain.

References

  1. Wilkinson, Mark; Vandervalk, Benjamin; McCarthy, Luke (2011). "The Semantic Automated Discovery and Integration (SADI) Web service Design-Pattern, API and Reference Implementation". J Biomed Semantics. 2 (1): 8. doi: 10.1186/2041-1480-2-8 . PMC   3212890 . PMID   22024447.
  2. "SADI framework Web site".
  3. Riazanov, Alexandre; Laurila, Jonas Bergman; Baker, Christopher JO (2011). "Deploying mutation impact text-mining software with the SADI Semantic Web Services framework". BMC Bioinformatics . 12(Supl 4) (Suppl 4): 18. doi: 10.1186/1471-2105-12-S4-S6 . PMC   3194198 . PMID   21992079.
  4. Chepelev, Leonid L; Riazanov, Alexandre; Kouznetsov, Alexandre; Low, Hong Sang; Dumontier, Michel; Baker, Christopher JO (2011). "Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics". BMC Bioinformatics. 12: 14. doi: 10.1186/1471-2105-12-303 . PMC   3163564 . PMID   21791100.
  5. Riazanov, Alexandre; Hindle, Matthew M; Goudreau, E Scott; Martyniuk, Christopher J; Baker, Christopher JO (2012). Ecotoxicology Data Federation with SADI Semantic Web Services (PDF). Semantic Web Applications and Tools for Life Sciences. p. 18.
  6. Riazanov, Alexandre; Klein, Artjom; Shaban-Nejad, Arash; Rose, Gregory W; Forster, Alan J; Buckeridge, David L; Baker, Christopher JO (2013). "Semantic Querying of Relational Data for Clinical Intelligence: A Semantic Web Services-Based Approach". J Biomed Semantics. 4 (1): 19. doi: 10.1186/2041-1480-4-9 . PMC   3698140 . PMID   23497556.
  7. "SADI framework Web site: Building Services".
  8. Ben Vandervalk. "The SHARE System. A Semantic Web Based Approach for Evaluating Queries Across Distributed Bioinformatics Databases and Software, MSc thesis" (PDF).