Apache Stanbol

Last updated
Apache Stanbol
Developer(s) Apache Software Foundation
Repository
Written in Java
License Apache License 2.0
Website stanbol.apache.org

Apache Stanbol is an open source modular software stack and reusable set of components for semantic content management. Apache Stanbol components are meant to be accessed over RESTful interfaces to provide semantic services for content management. Thus, one application is to extend traditional content management systems with (internal or external) semantic services. [1]

Contents

Additionally, Apache Stanbol lets you create new types of content management systems with semantics at their core. The current code is written in Java and based on the OSGi component framework. Applications include extending existing content management systems with (internal or external) semantic services, and creating new types of content management systems with semantics at their core.

History

In 2008, the Salzburg Research led, as entity coordinator, a consortium of seven research partners and six industrial partners to the proposal of the IKS project with the aim of receiving funding by the European institutions under the 7th Framework Programme. [2]

The consortium comprised [2]

In January 2009, the Interactive Knowledge Stack (IKS) started partly funded by the European Commission to provide an "open source technology platform for semantically enhanced content management systems". [3] IKS received €6.58m co-funding by the European Union [4] for an overall project duration of 4 years, hence setting the project's end date by the end of 2012. [5]

Apache Stanbol was founded in November 2010 by members the EU research project Interactive Knowledge Stack (IKS). It was the result of an ongoing discussion about how to ensure that the results, especially the developed software, of the IKS project would be available to vendors of content management systems (CMS) after the project’s official funding period ended in 2012. The members of the IKS project decided to initiate the Apache Stanbol project as part of the incubation program of the Apache Software Foundation (ASF).

One of the first code imports of Apache Stanbol was the so-called "Furtwangen IKS Semantic Engine" (FISE) which eventually became the Apache Stanbol Enhancer with its Enhancement Engines. Other contributions of code were the KReS (Knowledge Representation and Reasoning) and the RICK (Reference Infrastructure for Content and Knowledge) components. Later on followed the Contenthub, while KReS was split into the Apache Stanbol Ontology Manager and Reasoner components, and the RICK is today known as the Apache Stanbol Entityhub. From that moment Apache Stanbol was developed as an open source software project independent of the IKS research project.

On 15 November 2010 Apache Stanbol enters incubation. [6]

On 9 May 2012 version 0.9.0-incubating is released. [7]

On 10 July 2012 version 0.10.0-incubating is released. [8]

By the middle of 2012 Apache Stanbol had demonstrated that it has an active community and is able to produce software and releases according to the ASF standards. The board of directors of the ASF accepted the formal resolution to establish Apache Stanbol as a top-level project on 2012-09-19. [9]

On 5 March 2013 Salzburg Research announced that 8 entities, among those Sebastian Schaffert (head of the knowledge and media technologies group [10] ), Rupert Westenthaler (Stanbol initial committer and PMC) and Sergio Fernández (Stanbol committer) set up an effort to deliver Apache Stanbol and Apache Marmotta services under the Redlink brand. [11]

On 15 April 2020 the Stanbol Project was officially terminated. [12]

Main features

Apache Stanbol's main features are:

Content Enhancement

Services that add semantic information to “non-semantic” pieces of content. The Apache Stanbol Enhancer provides both a RESTful and a Java API that allows a caller to extract features from passed content. In more detail the passed content is processed by Enhancement Engines as defined by the called Enhancement Chain.

Reasoning

Services that are able to retrieve additional semantic information about the content based on the semantic information retrieved via content enhancement. The Stanbol Reasoners component provides a set of services that take advantage of automatic inference engines.

The module implements a common api for reasoning services, providing the possibility to plug different reasoners and configurations in parallel.

Actually the module includes OWLApi and Jena based abstract services, with concrete implementations for Jena RDFS, OWL, OWLMini and HermiT reasoning service.

Knowledge Models

Services that are used to define and manipulate the data models (e.g. ontologies) that are used to store the semantic information. The Apache Stanbol Ontology Manager provides a controlled environment for managing ontologies, ontology networks and user sessions for semantic data modeled after them. It provides full access to ontologies stored into the Stanbol persistence layer. Managing an ontology network means that you can activate or deactivate parts of a complex model from time to time, so that your data can be viewed and classified under different "logical lenses". This is especially useful in Reasoning operations.

Persistence

Services that store (or cache) semantic information, i.e. enhanced content, entities, facts, and make it searchable. The Apache Stanbol Contenthub is an Apache Solr based document repository which enables storage of text-based documents and customizable semantic search facilities. The Contenthub exposes an efficient Java API together with the corresponding RESTful services.

Bibliography

Related Research Articles

<span class="mw-page-title-main">Semantic Web</span> Extension of the Web to facilitate data exchange

The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.

Representational state transfer (REST) is a software architectural style that was created to guide the design and development of the architecture for the World Wide Web. REST defines a set of constraints for how the architecture of an Internet-scale distributed hypermedia system, such as the Web, should behave. The REST architectural style emphasises the scalability of interactions between components, uniform interfaces, independent deployment of components, and the creation of a layered architecture to facilitate caching of components to reduce user-perceived latency, enforce security, and encapsulate legacy systems.

Apache Jackrabbit is an open source content repository for the Java platform. The Jackrabbit project was started on August 28, 2004, when Day Software licensed an initial implementation of the Java Content Repository API (JCR). Jackrabbit was also used as the reference implementation of JSR-170, specified within the Java Community Process. The project graduated from the Apache Incubator on March 15, 2006, and is now a Top Level Project of the Apache Software Foundation.

<span class="mw-page-title-main">Apache OFBiz</span> Open-source enterprise resource planning software

Apache OFBiz is an open source enterprise resource planning (ERP) system. It provides a suite of enterprise applications that integrate and automate many of the business processes of an enterprise.

<span class="mw-page-title-main">Apache Jena</span> Open source semantic web framework for Java

Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.

<span class="mw-page-title-main">Apache Solr</span> Open-source enterprise-search platform

Solr is an open-source enterprise-search platform, written in Java. Its major features include full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features and rich document handling. Providing distributed search and index replication, Solr is designed for scalability and fault tolerance. Solr is widely used for enterprise search and analytics use cases and has an active development community and regular releases.

<span class="mw-page-title-main">Apache Sling</span> Java web framework

Apache Sling is an open source Web framework for the Java platform designed to create content-centric applications on top of a JSR-170-compliant content repository such as Apache Jackrabbit. Apache Sling allows developers to deploy their application components as OSGi bundles or as scripts and templates in the content repository. Supported scripting languages are JSP, server-side JavaScript, Ruby, Velocity. The goal of Apache Sling is to expose content in the content repository as HTTP resources, fostering a RESTful style of application architecture.

The Ubiquitous Knowledge Processing Lab is a research lab at the Department of Computer Science at the Technische Universität Darmstadt. It was founded in 2006 by Iryna Gurevych.

<span class="mw-page-title-main">API</span> Software interface between computer programs

An application programming interface (API) is a way for two or more computer programs to communicate with each other. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how to build or use such a connection or interface is called an API specification. A computer system that meets this standard is said to implement or expose an API. The term API may refer either to the specification or to the implementation.

<span class="mw-page-title-main">Apache Aries</span> Open-source implementation of Blueprint Container

Apache Aries, a Blueprint Container implementations and extensions of application-focused specifications defined by OSGi Enterprise Expert Group. The project aims to deliver a set of pluggable Java components enabling an enterprise OSGi application programming model. The Aries project Content includes the following:

<span class="mw-page-title-main">Salzburg Research</span>

Salzburg Research Forschungsgesellschaft mbH is an independent research and technology organisation (RTO), located in Salzburg, Austria. The organisation specializes in applied research and development in the field of information and communications technologies (ICT).

<span class="mw-page-title-main">Apache OODT</span>

The Apache Object Oriented Data Technology (OODT) is an open source data management system framework that is managed by the Apache Software Foundation. OODT was originally developed at NASA Jet Propulsion Laboratory to support capturing, processing and sharing of data for NASA's scientific archives.

<span class="mw-page-title-main">Apache cTAKES</span> Natural language processing system

Apache cTAKES: clinical Text Analysis and Knowledge Extraction System is an open-source Natural Language Processing (NLP) system that extracts clinical information from electronic health record unstructured text. It processes clinical notes, identifying types of clinical named entities — drugs, diseases/disorders, signs/symptoms, anatomical sites and procedures. Each named entity has attributes for the text span, the ontology mapping code, context, and negated/not negated.

<span class="mw-page-title-main">Apache Chemistry</span> Implementation of Content Management Interoperability Services

Apache Chemistry is a project of the Apache Software Foundation (ASF) which provides open source Content Management Interoperability Services (CMIS) for Python, Java, PHP and .NET.

CloudStack is open-source Infrastructure-as-a-Service cloud computing software for creating, managing, and deploying infrastructure cloud services. It uses existing hypervisor platforms for virtualization, such as KVM, VMware vSphere, including ESXi and vCenter, XenServer/XCP and XCP-ng. In addition to its own API, CloudStack also supports the Amazon Web Services (AWS) API and the Open Cloud Computing Interface from the Open Grid Forum.

<span class="mw-page-title-main">Jahia</span> Software company

Jahia is a software company offering enterprise products, services, and technical support for its open-source digital experience platform. Jahia’s platform provides content and customer data management. The company’s head optional content management system and digital experience platform is designed to support various digital enterprise initiatives, such as websites, progressive web applications, mobile apps, intranets and portals.

<span class="mw-page-title-main">Sebastian Schaffert</span>

Sebastian Schaffert is a software engineer and researcher. He was born in Trostberg, Bavaria, Germany on March 18, 1976 and obtained his doctorate in 2004.

<span class="mw-page-title-main">Apache Marmotta</span>

Apache Marmotta is a linked data platform that comprises several components. In its most basic configuration it is a Linked Data server. Marmotta is one of the reference projects early implementing the new Linked Data Platform recommendation that is being developed by W3C.

The Open Semantic Framework (OSF) is an integrated software stack using semantic technologies for knowledge management. It has a layered architecture that combines existing open source software with additional open source components developed specifically to provide a complete Web application framework. OSF is made available under the Apache 2 license.

References

  1. Apache Stanbol
  2. 1 2 "IKS FAQ". IKS. Archived from the original on 2013-08-18. Retrieved 2013-08-20.
  3. "IKS Project". IKS. Archived from the original on 2013-08-09. Retrieved 2013-08-20.
  4. "About Us". IKS Project. Retrieved 2013-08-20.[ permanent dead link ]
  5. "IKS EU Research Project". IKS Project. Archived from the original on 2013-08-14. Retrieved 2013-08-20.
  6. Delacretaz, Bertrand (15 November 2010). "Accept Stanbol for incubation". incubator-general (Mailing list). Retrieved 2013-08-20.
  7. Christ, Fabian (9 May 2012). "Apache Stanbol 0.9.0-incubating staging". org.apache.incubator.stanbol-dev (Mailing list). Retrieved 2013-08-20.
  8. Christ, Fabian (10 July 2012). "Apache Stanbol Entityhub 0.10.0-incubating released". org.apache.incubator.stanbol-dev (Mailing list). Retrieved 2013-08-20.
  9. "Graduation Resolution". Apache Stanbol. Retrieved 2013-08-20.
  10. Dr. Sebastian Schaffert at Salzburg Research Archived December 20, 2013, at the Wayback Machine
  11. Volpini, Andrea. "Eight players joined hands to commence services in April". Redlink. Retrieved 2013-08-20.[ permanent dead link ]
  12. "Terminate the Apache Stanbol Project". Apache Stanbol. Retrieved 2020-09-28.