OGC GeoSPARQL

Last updated

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC). [1] The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language. [2]

Contents

The Ordnance Survey Linked Data Platform uses OWL mappings for GeoSPARQL equivalent properties in its vocabulary. [3] [4] The LinkedGeoData data set is a work of the Agile Knowledge Engineering and Semantic Web (AKSW) research group at the University of Leipzig, [5] a group mostly known for DBpedia, that uses the GeoSPARQL vocabulary to represent OpenStreetMap data.

In particular, GeoSPARQL provides for:

Example

The following example SPARQL query could help model the question "What is within the bounding box defined by 38°54′49″N77°05′20″W / 38.913574°N 77.089005°W / 38.913574; -77.089005 and 38°53′11″N77°01′48″W / 38.886321°N 77.029953°W / 38.886321; -77.029953 ?" [6]

PREFIXgeo:<http://www.opengis.net/ont/geosparql#>PREFIXgeof:<http://www.opengis.net/def/function/geosparql/>SELECT?whatWHERE{?whatgeo:hasGeometry?geometry.FILTER(geof:sfWithin(?geometry,"POLYGON((-77.089005 38.913574,-77.029953 38.913574,-77.029953 38.886321,-77.089005 38.886321,-77.089005 38.913574))"^^geo:wktLiteral))}

RCC8 use in GeoSPARQL

RCC8 has been implemented in GeoSPARQL as described below:

A graphical representation of Region Connection Calculus (RCC: Randell, Cui and Cohn, 1992) and the links to the equivalent naming by the Open Geospatial Consortium (OGC) with their equivalent URIs. Region Connection Calculus 8 Relations and Open Geospatial Consortium relations.svg
A graphical representation of Region Connection Calculus (RCC: Randell, Cui and Cohn, 1992) and the links to the equivalent naming by the Open Geospatial Consortium (OGC) with their equivalent URIs.

Implementations

There are (almost) no complete implementations of GeoSPARQL; however, there are partial or vendor implementations of GeoSPARQL. Currently there are the following implementations:

Apache Marmotta
GeoSPARQL was implemented in the context of the Google Summer of Code 2015. [7] on Apache Marmotta; it uses PostGIS, and it is available just for PostgreSQL.
Apache Jena
Since version 2.11 Apache Jena has a GeoSPARQL extension. [8]
Ontop VKG
Support for GeoSPARQL was added to Ontop in version 4.2. [9]
Parliament Archived 30 April 2014 at the Wayback Machine
Parliament has an almost complete implementation of GeoSPARQL by using JENA and a modified ARQ query processor. [10]
Eclipse RDF4J
Eclipse RDF4J is an open-source Java framework for scalable RDF processing, storage, reasoning and SPARQL querying. It offers support for a large subset of GeoSPARQL functionality. [11]
Strabon Archived 20 August 2014 at the Wayback Machine
Strabon is an open-source semantic spatiotemporal RDF store that supports two popular extensions of SPARQL: stSPARQL and GeoSPARQL. Strabon is built by extending the well-known RDF store Sesame and extends Sesame's components to manage thematic, spatial and temporal data that is stored in the backend RDBMS. It has been fully tested with PostgreSQL (with PostGIS and PostgreSQL-Temporal extensions [12] ) and MonetDB (with geom [13] module).
OpenSahara uSeekM IndexingSail Sesame Sail plugin
uSeekM IndexingSail uses a PostGIS installation to deliver GeoSPARQL. They deliver partial implementation of GeoSPARQL along with some vendor prefixes. [14] [15]
Oracle Spatial and Graph
GraphDB
GraphDB is an enterprise ready Semantic Graph Database, compliant with W3C Standards. Semantic graph databases (also called RDF triplestores) provide the core infrastructure for solutions where modelling agility, data integration, relationship exploration and cross-enterprise data publishing and consumption are important.
Stardog
Stardog is an enterprise data unification platform built on smart graph technology: query, search, inference, and data virtualization.
Virtuoso Universal Server
Virtuoso Universal Server is a middleware and database engine hybrid that combines the functionality of a traditional Relational database management system (RDBMS), Object-relational database (ORDBMS), virtual database, RDF, XML, free-text, web application server and file server functionality in a single system. [16]

Performance and Compliance Benchmarking

Benchmarking GeoSPARQL 1.0 and geospatial-enabled triplestores, in general, has been conducted using several approaches. One can distinguish between performance and compliance benchmarks. The former can reveal whether a triplestore gives a timely answer to a GeoSPARQL query and may or may not check the answer for correctness. The latter checks whether a triplestore gives compliant answers with respect to the definitions of the GeoSPARQL 1.0 standard irrespective of the time the query takes for execution.

Well-known geospatial performance benchmarks include the Geographica [17] and Geographica 2 [18] benchmarks which track the performance of predefined sets of queries on synthetic and real-world datasets. They each test a subset of GeoSPARQL query functions for performance. Another performance benchmark by Huang et al. [19] assessed the performance of GeoSPARQL-enabled triple stores as part of a spatial data infrastructure.

Compliance benchmarking of OGC standards is usually conducted as part of the OGC Team Engine Test Suite [20] which allows companies to obtain certification for implementing certain OGC specifications correctly. As of 2021, however, the OGC Team Engine does not provide a set of compliance tests to test GeoSPARQL compliance. Nevertheless, in 2021, Jovanovik et al. [21] developed the first comprehensive, reproducible GeoSPARQL Compliance benchmark in which nine different triple stores were initially tested. The results of these first compliance tests along with the software [22] are available on Github. [23]

Submission

The GeoSPARQL standard was submitted to the OGC by:

Future development

With regards to future work, the GeoSPARQL standard states:

Obvious extensions are to define new conformance classes for other standard serializations of geometry data (e.g. KML, GeoJSON). In addition, significant work remains in developing vocabularies for spatial data, and expanding the GeoSPARQL vocabularies with OWL axioms to aid in logical spatial reasoning would be a valuable contribution. There are also large amounts of existing feature data represented in either a GML file (or similar serialization) or in a datastore supporting the general feature model. It would be beneficial to develop standard processes for converting (or virtually converting and exposing) this data to RDF.

In 2019, the OGC's GeoSemantics Domain Working Group [24] set out to assess the current usage of GeoSPARQL in different domains in the White Paper "OGC Benefits of Representing Spatial Data Using Semantic and Graph Technologies" [25] and collected initial feature requests to extend GeoSPARQL.

This led to the re-establishment of the GeoSPARQL Standards Working Group with a newly formed working group charter in September 2020. [26] [27] The group is working towards a new release of the GeoSPARQL standard, with non-breaking changes - GeoSPARQL 1.1 - in the summer of 2021, the development of which can be followed on Github.

At the GeoLD workshop 2021, held as part of the Extended Semantic Web Conference 2021, an outline of the additions which are likely to be present in GeoSPARQL 1.1 has been presented. [28] The changes have been further consolidated and summarized in a publication in the ISPRS International Journal of GeoInformation. [29]

See also

Related Research Articles

<span class="mw-page-title-main">PostGIS</span> Geospatial extension for the PostgreSQL Database

PostGIS is an open source software program that adds support for geographic objects to the PostgreSQL object-relational database. PostGIS follows the Simple Features for SQL specification from the Open Geospatial Consortium (OGC).

RDF Schema (Resource Description Framework Schema, variously abbreviated as RDFS, RDF(S), RDF-S, or RDF/S) is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

SPARQL is an RDF query language—that is, a semantic query language for databases—able to retrieve and manipulate data stored in Resource Description Framework (RDF) format. It was made a standard by the RDF Data Access Working Group (DAWG) of the World Wide Web Consortium, and is recognized as one of the key technologies of the semantic web. On 15 January 2008, SPARQL 1.0 was acknowledged by W3C as an official recommendation, and SPARQL 1.1 in March, 2013.

Simple Features is a set of standards that specify a common storage and access model of geographic features made of mostly two-dimensional geometries used by geographic databases and geographic information systems. It is formalized by both the Open Geospatial Consortium (OGC) and the International Organization for Standardization (ISO).

A spatial database is a general-purpose database that has been enhanced to include spatial data that represents objects defined in a geometric space, along with tools for querying and analyzing such data.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

An RDF query language is a computer language, specifically a query language for databases, able to retrieve and manipulate data stored in Resource Description Framework (RDF) format.

Ontotext is a software company with offices in Europe and USA. It is the semantic technology branch of Sirma Group. Its main domain of activity is the development of software based on the Semantic Web languages and standards, in particular RDF, OWL and SPARQL. Ontotext is best known for the Ontotext GraphDB semantic graph database engine. Another major business line is the development of enterprise knowledge management and analytics systems that involve big knowledge graphs. Those systems are developed on top of the Ontotext Platform that builds on top of GraphDB capabilities for text mining using big knowledge graphs.

<span class="mw-page-title-main">Apache Jena</span> Open source semantic web framework for Java

Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.

<span class="mw-page-title-main">RDF4J</span>

Eclipse RDF4J is an open-source framework for storing, querying, and analysing RDF data. It was created by the Dutch software company Aduna as part of "On-To-Knowledge", a semantic web project that ran from 1999 to 2002. It contains implementations of an in-memory triplestore and an on-disk triplestore, along with two separate Servlet packages that can be used to manage and provide access to these triplestores, on a permanent server. The RDF4J Rio package contains a simple API for Java-based RDF parsers and writers. Parsers and writers for popular RDF serialisations are distributed along with RDF4J, and users can easily extend the list by putting their parsers and writers on the Java classpath when running their application.

A semantic reasoner, reasoning engine, rules engine, or simply a reasoner, is a piece of software able to infer logical consequences from a set of asserted facts or axioms. The notion of a semantic reasoner generalizes that of an inference engine, by providing a richer set of mechanisms to work with. The inference rules are commonly specified by means of an ontology language, and often a description logic language. Many reasoners use first-order predicate logic to perform reasoning; inference commonly proceeds by forward chaining and backward chaining. There are also examples of probabilistic reasoners, including non-axiomatic reasoning systems, and probabilistic logic networks.

A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject–predicate–object, like "Bob is 35" or "Bob knows Fred".

AllegroGraph is a closed source triplestore which is designed to store RDF triples, a standard format for Linked Data. It also operates as a document store designed for storing, retrieving and managing document-oriented information, in JSON-LD format. AllegroGraph is currently in use in commercial projects and a US Department of Defense project. It is also the storage component for the TwitLogic project that is bringing the Semantic Web to Twitter data.

<span class="mw-page-title-main">Open Geospatial Consortium</span> Standards organization

The Open Geospatial Consortium (OGC), an international voluntary consensus standards organization for geospatial content and location-based services, sensor web and Internet of Things, GIS data processing and data sharing. It originated in 1994 and involves more than 500 commercial, governmental, nonprofit and research organizations in a consensus process encouraging development and implementation of open standards.

<span class="mw-page-title-main">Apache Marmotta</span>

Apache Marmotta is a linked data platform that comprises several components. In its most basic configuration it is a Linked Data server. Marmotta is one of the reference projects early implementing the new Linked Data Platform recommendation that is being developed by W3C.

Semantic queries allow for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results or to answer more fuzzy and wide open questions through pattern matching and digital reasoning.

NitrosBase is a Russian high-performance multi-model database system. The database system supports relational, graph and document database models.

<span class="mw-page-title-main">Blazegraph</span> Open source triplestore and graph database

Blazegraph is an open source triplestore and graph database, developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL.

<span class="mw-page-title-main">Ontotext GraphDB</span> RDF-store

Ontotext GraphDB is a graph database and knowledge discovery tool compliant with RDF and SPARQL and available as a high-availability cluster. Ontotext GraphDB is used in various European research projects.

References

  1. Battle & Kolas 2012, p. 355.
  2. Battle & Kolas 2012, p. 358.
  3. Goodwin, John (26 April 2013). "GeoSPARQL and Ordnance Survey Linked Data". johngoodwin225.wordpress.com.
  4. Gemma (3 June 2013). "New Linked Data service launches". blog.ordnancesurvey.co.uk. Archived from the original on 8 October 2013.
  5. "Imprint". linkedgeodata.org. AKSW. 2012-05-18. Archived from the original on 15 June 2021.
  6. Battle & Kolas 2012, p. 363.
  7. "Proposal to Implement GeoSPARQL in Marmotta". Marmotta Wiki. Archived from the original on 2015-06-26.
  8. "Spatial searches with SPARQL". Apache Jena.
  9. "Standards compliance: GeoSPARQL 1.0". Ontop.
  10. "Parliament". Archived from the original on 30 April 2014.
  11. "Programming with RDF4J". Eclipse rdf4j: documentation. The Eclipse Foundation. Archived from the original on 4 November 2016.
  12. jeff-davis (21 January 2021). "PostgreSQL-Temporal". GitHub .
  13. "GeoSpatial". MonetDB Docs. Archived from the original on 28 March 2012.
  14. "IndexingSail - uSeekM - Adds Meaning to the Web". Archived from the original on 2014-04-15. Retrieved 2012-12-16.
  15. "GeoReference - uSeekM - Adds Meaning to the Web". Archived from the original on 2014-04-15. Retrieved 2014-04-14.
  16. Williams, Hugh (October 29, 2018). "Virtuoso GeoSPARQL Demo Server". OpenLink Software Community Forum. Retrieved 9 February 2024.
  17. Garbis, George; Kyzirakos, Kostis; Koubarakis, Manolis (2013). "Geographica: A Benchmark for Geospatial RDF Stores (Long Version)". The Semantic Web – ISWC 2013. 12th International Semantic Web Conference. Lecture Notes in Computer Science. Vol. 8219. pp. 343–359. doi:10.1007/978-3-642-41338-4_22. ISBN   978-3-642-41338-4. S2CID   40326844.
  18. Ioannidis, Theofilos; Garbis, George; Kyzirakos, Kostis; Bereta, Konstantina; Koubarakis, Manolis (2021). "Evaluating Geospatial RDF Stores Using the Benchmark Geographica 2". Journal on Data Semantics. 10 (3–4): 189–228. arXiv: 1906.01933 . doi:10.1007/s13740-021-00118-x. S2CID   174799159.
  19. Huang, Weiming; Raza, Syed Amir; Mirzov, Oleg; Harrie, Lars (2019). "Assessment and Benchmarking of Spatially Enabled RDF Stores for the Next Generation of Spatial Data Infrastructure". ISPRS International Journal of Geo-Information. 8 (7): 310. Bibcode:2019IJGI....8..310H. doi: 10.3390/ijgi8070310 .
  20. "TEAM Engine". Open Geospatial Consortium.
  21. Jovanovik, Milos; Homburg, Timo; Spasić, Mirko (2021). "A GeoSPARQL Compliance Benchmark". ISPRS International Journal of Geo-Information. 10 (7): 487. arXiv: 2102.06139 . Bibcode:2021IJGI...10..487J. doi: 10.3390/ijgi10070487 .
  22. Jovanovik, Milos; Homburg, Timo; Spasić, Mirko (2021). "Software for the GeoSPARQL compliance benchmark". Software Impacts. 8: 100071. doi: 10.1016/j.simpa.2021.100071 .
  23. "OpenLinkSoftware: GeoSPARQLBenchmark". Github.
  24. "Geosemantics DWG". ogc.org. Archived from the original on 9 August 2020.
  25. Abhayaratna, J; van den Brink, L; Car, N; Atkinson, R; Homburg, T; Knibbe, F; McGlinn, K; Wagner, A; Bonduel, M; Holten Rasmussen, M; Thiery, F (5 October 2020). "OGC Benefits of Representing Spatial Data Using Semantic and Graph Technologies". Open Geospatial Consortium.
  26. "GeoSPARQL SWG". ogc.org. Archived from the original on 18 January 2021.
  27. "OGC GeoSPARQL SWG Charter". ogc.org. Open Geospatial Consortium. 2020.
  28. Car, Nicholas J.; Homburg, Timo (May 2021). GeoSPARQL 1.1: an almost decadal update to the most important geospatial LOD standard (PDF). GeoLD Workshop at ESWC 2021.
  29. Car, Nicholas J.; Homburg, Timo (February 2022). "GeoSPARQL 1.1: Motivations, Details and Applications of the Decadal Update to the Most Important Geospatial LOD Standard". ISPRS International Journal of Geo-Information. 11 (2): 117. Bibcode:2022IJGI...11..117C. doi: 10.3390/ijgi11020117 .