Blazegraph

Last updated
Blazegraph
Blazegraph parallelogram logo.png
Developer(s) Systap
Stable release
2.1.5 / 19 March 2019 (2019-03-19) [1]
Preview release
2.1.6rc / 3 February 2020 (2020-02-03) [1]
Repository
Written in Java
Type Graph database
License GNU GPL (version 2) [2]
Website blazegraph.com OOjs UI icon edit-ltr-progressive.svg

Blazegraph is an open source triplestore and graph database, [3] [4] [5] [6] developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL (version 2).

Contents

Amazon acquired the Blazegraph developers and the Blazegraph open source development was essentially stopped in April 2018. [7]

Early history

The system was first known as Bigdata. Since release of version 1.5 (12 February 2015), it is named Blazegraph. [8] [9]

Prominent users

Notable features

Acqui-hiring by Amazon Web Service (AWS)

It was alleged [20] that Amazon Neptune is based on Blazegraph, as evidenced by:

Related Research Articles

Query languages, data query languages or database query languages (DQLs) are computer languages used to make queries in databases and information systems. A well known example is the Structured Query Language (SQL).

RDF Schema is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

SPARQL is an RDF query language—that is, a semantic query language for databases—able to retrieve and manipulate data stored in Resource Description Framework (RDF) format. It was made a standard by the RDF Data Access Working Group (DAWG) of the World Wide Web Consortium, and is recognized as one of the key technologies of the semantic web. On 15 January 2008, SPARQL 1.0 was acknowledged by W3C as an official recommendation, and SPARQL 1.1 in March, 2013.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

An RDF query language is a computer language, specifically a query language for databases, able to retrieve and manipulate data stored in Resource Description Framework (RDF) format.

Apache Jena Open source semantic web framework for Java

Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.

DBpedia Online database project

DBpedia is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web. DBpedia allows users to semantically query relationships and properties of Wikipedia resources, including links to other related datasets.

A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred".

Freebase was a large collaborative knowledge base consisting of data composed mainly by its community members. It was an online collection of structured data harvested from many sources, including individual, user-submitted wiki contributions. Freebase aimed to create a global resource that allowed people to access common information more effectively. It was developed by the American software company Metaweb and run publicly beginning in March 2007. Metaweb was acquired by Google in a private sale announced on 16 July 2010. Google's Knowledge Graph is powered in part by Freebase.

YAGO (database) Open-source information repository

YAGO is an open source knowledge base developed at the Max Planck Institute for Computer Science in Saarbrücken. It is automatically extracted from Wikipedia and other sources.

In computing, a graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.

Named graph

Named graphs are a key concept of Semantic Web architecture in which a set of Resource Description Framework statements are identified using a URI, allowing descriptions to be made of that set of statements such as context, provenance information or other such metadata.

Wikidata Free knowledge database project

Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, can use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, and is also powered by the set of knowledge graph MediaWiki extensions known as Wikibase.

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC). The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language.

Cypher is a declarative graph query language that allows for expressive and efficient data querying in a property graph.

This is a comparison of triplestores, also known as subject-predicate-object databases. Some of these database management systems have been built as database engines from scratch, while others have been built on top of existing commercial relational database engines. Like the early development of online analytical processing (OLAP) databases, this intermediate approach allowed large and powerful database engines to be constructed for little programming effort in the initial phases of triplestore development. Long-term though it seems that native triplestores will have the advantage for performance. A difficulty with implementing triplestores over SQL is that although triples may thus be stored, implementing efficient querying of a graph-based RDF model onto SQL queries is difficult.

Semantic queries allow for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results or to answer more fuzzy and wide open questions through pattern matching and digital reasoning.

Shapes Constraint Language (SHACL) is a World Wide Web Consortium (W3C) standard language for describing Resource Description Framework (RDF) graphs. SHACL has been designed to enhance the semantic and technical interoperability layers of ontologies expressed as RDF graphs.

Amazon Neptune is a managed graph database product published by Amazon.com. It is used as a web service and is part of Amazon Web Services (AWS). It was announced on November 29, 2017. Amazon Neptune supports popular graph models property graph and W3C's RDF, and their respective query languages Apache TinkerPop's Gremlin, openCypher, and SPARQL, including other Amazon Web Services products.

NitrosBase is a Russian high-performance multi-model database system. The database system supports relational, graph and document database models.

References

  1. 1 2 "Releases - blazegraph/database" . Retrieved 8 March 2020 via GitHub.
  2. "LICENSE.txt file" . Retrieved 8 March 2020 via GitHub.
  3. Kiryakov, Atanas; Mariana, Damova (2011). "Chapter 7: Storing the Semantic Web: Repositories". In Domingue, John; Fensel, Dieter; Hendler, James A. (eds.). Handbook of Semantic Web Technologies . Springer-Verlag Berlin Heidelberg. pp.  271–272. ISBN   978-3-540-92912-3.
  4. Thompson, Bryan; Personick, Mike; Cutcher, Martin (2014). "Chapter 8: The bigdata RDF Graph Database". In Harth, Andreas; Hose, Katja; Schenkel, Ralf (eds.). Linked data management. CRC Press. pp. 193–237. ISBN   978-1-466-58240-8.
  5. Sikos, Leslie (2015). Mastering structured data on the Semantic Web. APress. p. 171. ISBN   978-1-4842-1049-9.
  6. Sakr, Sh.; Wylot, M.; Mutharaju, R.; Le Phuoc, D.; Fundulaki, I. (2018). Linked Data: Storing, Querying, and Reasoning. Springer International Publishing Imprint Springer. pp. 70–71. ISBN   978-3-319-73515-3.
  7. "Blazegraph fork?". GitHub. 12 April 2018.
  8. "Blazegraph - Semantic Web Standards". W3.org. Retrieved 2018-08-29.
  9. "Announcing Blazegraph Release 1.5.0 | Blazegraph(tm)". blog.blazegraph.com. Retrieved 2018-09-17.
  10. "[Wikidata-tech] Wikidata Query Backend Update (take two!)". lists.wikimedia.org. Retrieved 2018-08-29. (The message also contains a link to the graph databases comparison performed by Wikimedia.)
  11. "Principes de l'API". Framagit.io (in French). Retrieved 2018-08-29.
  12. Hartig, Olaf; Thompson, Bryan (2014). "Foundations of an Alternative Approach to Reification in RDF". arXiv: 1406.3399 [cs.DB].
  13. Hartig, Olaf (2014). "Reconciliation of RDF* and Property Graphs". arXiv: 1409.3288 [cs.DB].
  14. "TinkerPop-enabled Providers". tinkerpop.apache.org. Archived from the original on 2018-02-10. Retrieved 2018-08-29. (Currently, the Blazegraph logo is replaced with the Amazon Neptune logo.)
  15. "Blazegraph TinkerPop Implementation". GitHub.com. Retrieved 2018-08-29.
  16. Fu, Zhisong; Personick, Michael (2014). MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs. Proceedings of Workshop on GRAph Data Management Experiences and Systems. pp. 1–6. ISBN   978-1-4503-2982-8.
  17. "Wikidata Query Service: User manual. Extensions". MediaWiki.org. Retrieved 2018-08-29.
  18. "Query Hints". Blazegraph Wiki. Retrieved 2018-08-29.
  19. "Named Subquery". Blazegraph Wiki. Retrieved 2018-08-29.
  20. "SPARQL and Amazon Web Service's Neptune database". Bob DuCharme's weblog. Retrieved 2018-08-29.
  21. "BLAZEGRAPH Trademark of Amazon Technologies, Inc.. Serial Number: 86498414". Trademarkia.com. Retrieved 2018-08-29.
  22. "blazegraph.com". ICANN WHOIS. Retrieved 2018-08-29.
  23. "Bradley Bebee". LinkedIn.com. Retrieved 2018-08-29.