Ontotext GraphDB

Last updated
GraphDB
Developer(s) Ontotext
Stable release
9.9.1 / September 2021 (2021-09) [1]
Operating system Cross-platform
Available inEnglish
Type Database, Triplestore, Graph databases
License GraphDB-Free is free to use. SE and EE are licensed per CPU-Core used. Perpetual and annual subscription models are available.
Website www.ontotext.com/products/graphdb

Ontotext GraphDB (previously known as BigOWLIM) is a graph database [2] [3] and knowledge discovery [4] [5] [6] tool compliant with RDF [7] and SPARQL [8] and available as a high-availability cluster. Ontotext GraphDB is used in various European research projects. [9]

Contents

As of April 2021, Graph DB is ranked as the 4th most -popular [10] RDF store [11] [12] and 6th most-popular Graph DBMS system. [13] Some categorize it as a NoSQL database. [14] In 2014 Ontotext acquired the trademark "GraphDB" from Sones.

As for a typical graph DB, ontologies are an important input for the databases. [15] The underlying idea is a semantic repository. [16]

Architecture

GraphDB is used to store and manage semantic Knowledge Graph data. It is built on top of the RDF4J architecture implemented through RDF4J's Storage and Inference Layer (SAIL). The architecture is made of three main components:

Features and Integrations

According to Ontotext, Graph DB supports:

Uses

Ontotext Graph DB is used in various scientific areas, e.g., Genetics, [26] Healthcare, [27] Data Forensics, [28] Cultural Heritage, [29] Geography, [30] Infrastructure Planning, [31] Civil Engineering, [32] Digital Historiography, [33] Oceanography. [34]

For more examples see "Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research" below.

Commercial clients include BBC Sport, [35] [36] Financial Times , [37] Springer Nature, [38] UK Parliament, [39] [40] AstraZeneca [41] as well as in the pharmaceutical and finance industries.

Some use cases focus on scalability and large data sizes. [42]

See also

Related Research Articles

The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.

Query languages, data query languages or database query languages (DQL) are computer languages used to make queries in databases and information systems. A well known example is the Structured Query Language (SQL).

RDF Schema is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

SPARQL is an RDF query language—that is, a semantic query language for databases—able to retrieve and manipulate data stored in Resource Description Framework (RDF) format. It was made a standard by the RDF Data Access Working Group (DAWG) of the World Wide Web Consortium, and is recognized as one of the key technologies of the semantic web. On 15 January 2008, SPARQL 1.0 was acknowledged by W3C as an official recommendation, and SPARQL 1.1 in March, 2013.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

An RDF query language is a computer language, specifically a query language for databases, able to retrieve and manipulate data stored in Resource Description Framework (RDF) format.

Ontotext is a software company with offices in Europe and USA. It is the semantic technology branch of Sirma Group. Its main domain of activity is the development of software based on the Semantic Web languages and standards, in particular RDF, OWL and SPARQL. Ontotext is best known for the Ontotext GraphDB semantic graph database engine. Another major business line is the development of enterprise knowledge management and analytics systems that involve big knowledge graphs. Those systems are developed on top of the Ontotext Platform that builds on top of GraphDB capabilities for text mining using big knowledge graphs.

<span class="mw-page-title-main">Apache Jena</span> Open source semantic web framework for Java

Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.

<span class="mw-page-title-main">RDF4J</span>

Eclipse RDF4J is an open-source framework for storing, querying, and analysing RDF data. It was created by the Dutch software company Aduna as part of "On-To-Knowledge", a semantic web project that ran from 1999 to 2002. It contains implementations of an in-memory triplestore and an on-disk triplestore, along with two separate Servlet packages that can be used to manage and provide access to these triplestores, on a permanent server. The RDF4J Rio package contains a simple API for Java-based RDF parsers and writers. Parsers and writers for popular RDF serialisations are distributed along with RDF4J, and users can easily extend the list by putting their parsers and writers on the Java classpath when running their application.

<span class="mw-page-title-main">DBpedia</span> Online database project

DBpedia is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web. DBpedia allows users to semantically query relationships and properties of Wikipedia resources, including links to other related datasets.

Simple Sloppy Semantic Database (S3DB) is a distributed data management system that relies on Semantic Web concepts for management of heterogeneous data.

SPARUL, or SPARQL/Update, was a declarative data manipulation language that extended the SPARQL 1.0 query language standard. SPARUL provided the ability to insert, delete and update RDF data held within a triple store or quad store. SPARUL was originally written by Hewlett-Packard and has been used as the foundation for the current W3C recommendation entitled SPARQL 1.1 Update. With the publication of SPARQL 1.1, SPARUL is superseded and should only be consulted as a source of inspiration for possible future refinements of SPARQL, but not for real-world applications.

A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject–predicate–object, like "Bob is 35" or "Bob knows Fred".

A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.

<span class="mw-page-title-main">Sones GraphDB</span>

Sones GraphDB was a graph database developed by the German company sones GmbH, available from 2010 to 2012. Its last version was released in May 2011. sones GmbH, which was based in Erfurt and Leipzig, was declared bankrupt on January 1, 2012.

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC). The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language.

Semantic queries allow for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results or to answer more fuzzy and wide open questions through pattern matching and digital reasoning.

Shapes Constraint Language (SHACL) is a World Wide Web Consortium (W3C) standard language for describing Resource Description Framework (RDF) graphs. SHACL has been designed to enhance the semantic and technical interoperability layers of ontologies expressed as RDF graphs.

<span class="mw-page-title-main">Blazegraph</span> Open source triplestore and graph database

Blazegraph is an open source triplestore and graph database, developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL.

<span class="mw-page-title-main">Knowledge graph</span> Type of knowledge base

In knowledge representation and reasoning, knowledge graph is a knowledge base that uses a graph-structured data model or topology to integrate data. Knowledge graphs are often used to store interlinked descriptions of entities – objects, events, situations or abstract concepts – while also encoding the semantics underlying the used terminology.

References

  1. "Graph Databases (Technology)" . Retrieved 2021-10-02.
  2. "Graph Databases (Technology)". Bloor Research. Retrieved 2020-12-09.
  3. "Global Graph Database Market by Type, Application, Component, Deployment Type, Industry Vertical & Region - Analysis & Forecast to 2023 - ResearchAndMarkets.com". businesswire.com. 2018-06-28. Retrieved 2020-12-09.
  4. "KMWorld AI 50: The Companies Empowering Intelligent Knowledge Management". kmworld.com. Retrieved 2020-12-09.
  5. "Global Semantic Knowledge Discovery Software Market Growth (Status and Outlook) 2019-2024 - Market Research Insights". mrinsights.biz. Retrieved 2020-12-09.
  6. Buchmann, Robert (2019). "Model-Aware Software EngineeringA Knowledge-based Approach to Model-Driven Software Engineering" (PDF). Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  7. Motik, Boris; Nenov, Yavor; Piro, Robert; Horrocks, Ian; Olteanu, Dan (2014-06-19). "Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF Systems". Proceedings of the AAAI Conference on Artificial Intelligence. 28 (1). doi: 10.1609/aaai.v28i1.8730 . ISSN   2374-3468. S2CID   5547916.
  8. "SparqlImplementations - W3C Wiki". www.w3.org. Retrieved 2021-04-15.
  9. "Google Scholar". scholar.google.com. Retrieved 2020-12-09.
  10. "DB-Engines Ranking". DB-Engines. Retrieved 2020-12-09.
  11. Guest, CIO Central. "The Hype Around Graph Databases And Why It Matters". Forbes. Retrieved 2020-12-09.
  12. ltd, Research and Markets. "Graph Database Market by Type (RDF and Property Graph), Application (Recommendation Engines, Fraud Detection, Risk and Compliance Management), Component (Tools and Services), Deployment Mode, Industry Vertical, and Region - Global Forecast to 2024". researchandmarkets.com. Retrieved 2020-12-09.
  13. "GraphDB System Properties". db-engines.com. Retrieved 2021-04-15.
  14. "GraphDB". Capterra. Retrieved 2020-12-09.
  15. Ledvinka, Martin (2015). "JOPA: Accessing Ontologies in an Object-oriented Way" (PDF). Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  16. Kiryakov, Atanas (November 2005). "OWLIM—a pragmatic semantic repository for OWL". ResearchGate. Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  17. Stoilos, Giorgos; Grau, Bernardo Cuenca; Horrocks, Ian (2010-07-05). "How Incomplete Is Your Semantic Web Reasoner?". Proceedings of the AAAI Conference on Artificial Intelligence. 24 (1): 1431–1436. doi: 10.1609/aaai.v24i1.7498 . ISSN   2374-3468. S2CID   34119609.
  18. Kiryakov, Atanas; Ognyanov, Damyan; Manov, Dimitar (2005). Dean, Mike; Guo, Yuanbo; Jun, Woochun; Kaschek, Roland; Krishnaswamy, Shonali; Pan, Zhengxiang; Sheng, Quan Z. (eds.). "OWLIM – A Pragmatic Semantic Repository for OWL". Web Information Systems Engineering – WISE 2005 Workshops. Lecture Notes in Computer Science. Berlin, Heidelberg: Springer. 3807: 182–192. doi:10.1007/11581116_19. ISBN   978-3-540-32287-0.
  19. "SQL Access over JDBC". GraphDB documentation. Ontotext. Retrieved 5 October 2022.
  20. "Kafka Sink Connector¶". GraphDB documentation. Ontotext. Retrieved 5 October 2022.
  21. "Semantic Objects: Overview". Ontotext Platform documentation. Ontotext. Retrieved 5 October 2022.
  22. "Semantic Search: Overview". Ontotext Platform documentation. Ontotext. Retrieved 5 October 2022.
  23. "GraphDB Reasoning: predefined rulesets". GraphDB documentation. Ontotext. Retrieved 5 October 2022.
  24. "Ontotext Refine: Overview and features". Ontotext Platform documentation. Ontotext. Retrieved 5 October 2022.
  25. "Semantic similarity searches". GraphDB documentation. Ontotext. Retrieved 5 October 2022.
  26. Poncheewin, Wasin; Hermes, Gerben D. A.; van Dam, Jesse C. J.; Koehorst, Jasper J.; Smidt, Hauke; Schaap, Peter J. (2020). "NG-Tax 2.0: A Semantic Framework for High-Throughput Amplicon Analysis". Frontiers in Genetics. 10: 1366. doi: 10.3389/fgene.2019.01366 . ISSN   1664-8021. PMC   6989550 . PMID   32117417.
  27. Barisevičius, Gintaras; Coste, Martin; Geleta, David; Juric, Damir; Khodadadi, Mohammad; Stoilos, Giorgos; Zaihrayeu, Ilya (2018). Vrandečić, Denny; Bontcheva, Kalina; Suárez-Figueroa, Mari Carmen; Presutti, Valentina; Celino, Irene; Sabou, Marta; Kaffee, Lucie-Aimée; Simperl, Elena (eds.). "Supporting Digital Healthcare Services Using Semantic Web Technologies". The Semantic Web – ISWC 2018. Lecture Notes in Computer Science. Cham: Springer International Publishing. 11137: 291–306. doi:10.1007/978-3-030-00668-6_18. ISBN   978-3-030-00668-6.
  28. Zhuhadar, Leyla; Ciampa, Mark (2019-03-01). "Leveraging learning innovations in cognitive computing with massive data sets: Using the offshore Panama papers leak to discover patterns". Computers in Human Behavior. 92: 507–518. doi:10.1016/j.chb.2017.12.013. ISSN   0747-5632. S2CID   59528294.
  29. Damiano, Rossana; Lombardo, Vincenzo; Lieto, Antonio; Borra, Davide (2016-07-01). "Exploring cultural heritage repositories with creative intelligence. The Labyrinth 3D system". Entertainment Computing. 16: 41–52. doi:10.1016/j.entcom.2016.05.002. hdl: 2318/1578514 . ISSN   1875-9521. S2CID   31774697.
  30. Panasiuk, Oleksandra (2019). "Representing GeoData for Tourism with Schema.org" (PDF). Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  31. Azzam, Amr; Aryan, Peb Ruswono; Cecconi, Alessio; Di Ciccio, Claudio; Ekaputra, Fajar J.; Fernandez Garcia, Javier David; Karampatakis, Sotiris; Kiesling, Elmar; Musil, Angelika (2019), Antonella Longo, Maria Fazio (ed.), The CitySPIN Platform: A CPSS Environment for City-Wide Infrastructures (PDF), Bilbao, Spain: CEUR Workshop Proceedings, pp. 57–64, retrieved 2021-04-15
  32. Nundloll, Vatsala; Lamb, Rob; Hankin, Barry; Blair, Gordon (2021-04-01). "A semantic approach to enable data integration for the domain of flood risk management". Environmental Challenges. 3: 100064. doi: 10.1016/j.envc.2021.100064 . ISSN   2667-0100.
  33. Quaresma, Paulo (2020). "Information Extraction from Historical Texts:a Case Study" (PDF). Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  34. Zárate, Marcos; Rosales, Pablo; Braun, Germán; Lewis, Mirtha; Fillottrani, Pablo Rubén; Delrieux, Claudio (2019). Villazón-Terrazas, Boris; Hidalgo-Delgado, Yusniel (eds.). "OceanGraph: Some Initial Steps Toward a Oceanographic Knowledge Graph". Knowledge Graphs and Semantic Web. Communications in Computer and Information Science. Cham: Springer International Publishing. 1029: 33–40. doi:10.1007/978-3-030-21395-4_3. ISBN   978-3-030-21395-4. S2CID   160011396.
  35. "BBC - BBC Internet Blog: Sports Refresh: Dynamic Semantic Publishing". BBC. Retrieved 2020-12-09.
  36. "BBC - BBC Internet Blog: BBC World Cup 2010 dynamic semantic publishing". BBC. Retrieved 2020-12-09.
  37. "Semantic Technology for online, broadcast and print media". videolectures.net. Retrieved 2020-12-09.
  38. "SciGraph | For Researchers". Springer Nature. Retrieved 2020-12-09.
  39. "Linked Government Data". nationalarchives.gov.uk. Retrieved 2020-12-09.{{cite web}}: CS1 maint: url-status (link)
  40. "Performance testing a graph database | Parliamentary Digital Service". pds.blog.parliament.uk. Retrieved 2021-04-15.
  41. Anadiotis, George. "Graph databases and RDF: It's a family affair". ZDNet. Retrieved 2020-12-09.
  42. Bishop, Barry (January 2011). "OWLIM: A family of scalable semantic repositories". ResearchGate. Retrieved 2021-04-15.{{cite web}}: CS1 maint: url-status (link)
  43. Alexiev, Vladimir (March 2021). "Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research" (PDF). European Data Conference on Reference Data and Semantics (ENDORSE 2021).
  44. Alexiev, Vladimir. "Diverse Uses of Ontotext GraphDB". YouTube video.
  45. Alexiev, Vladimir. "Diverse Uses of Ontotext GraphDB". GitHub project.
  46. "Ontotext-GraphDB". Zotero shared bibliography. Retrieved 5 October 2022.