Graph Query Language

Last updated
GQL (Graph Query Language)
Paradigm Declarative
Family Query language
Developer ISO/IEC JTC 1 (Joint Technical Committee 1) / SC 32 (Subcommittee 32) / WG 3 (Working Group 3)
First appearedApril 12, 2024;8 months ago (April 12, 2024)
Stable release
ISO/IEC 39075:2024 / April 12, 2024;8 months ago (April 12, 2024)
Website www.iso.org/standard/76120.html
Influenced by
SQL, Cypher, GSQL

GQL (Graph Query Language) is a standardized query language for property graphs first described in ISO/IEC 39075, released in April 2024 by ISO/IEC.

Contents

History

The GQL project is the culmination of converging initiatives dating back to 2016, particularly a private proposal from Neo4j to other database vendors in July 2016, [1] and a proposal from Oracle technical staff within the ISO/IEC JTC 1 standards process later that year. [2]

2019 GQL project proposal

In September 2019 a proposal for a project to create a new standard graph query language (ISO/IEC 39075 Information Technology — Database Languages — GQL) [3] was approved by a vote of national standards bodies which are members of ISO/IEC Joint Technical Committee 1(ISO/IEC JTC 1). JTC 1 is responsible for international Information Technology standards. GQL is intended to be a declarative database query language, like SQL.

The 2019 GQL project proposal states:

"Using graph as a fundamental representation for data modeling is an emerging approach in data management. In this approach, the data set is modeled as a graph, representing each data entity as a vertex (also called a node) of the graph and each relationship between two entities as an edge between corresponding vertices. The graph data model has been drawing attention for its unique advantages.

Firstly, the graph model can be a natural fit for data sets that have hierarchical, complex, or even arbitrary structures. Such structures can be easily encoded into the graph model as edges. This can be more convenient than the relational model, which requires the normalization of the data set into a set of tables with fixed row types.

Secondly, the graph model enables efficient execution of expensive queries or data analytic functions that need to observe multi-hop relationships among data entities, such as reachability queries, shortest or cheapest path queries, or centrality analysis. There are two graph models in current use: the Resource Description Framework (RDF) model and the Property Graph model. The RDF model has been standardized by W3C in a number of specifications. The Property Graph model, on the other hand, has a multitude of implementations in graph databases, graph algorithms, and graph processing facilities. However, a common, standardized query language for property graphs (like SQL for relational database systems) is missing. GQL is proposed to fill this void." [4]

Official ISO standard

The GQL standard, ISO/IEC 39075:2024 Information technology – Database languages – GQL, was officially published by ISO on 12 April 2024. [5]

GQL project organisation

The GQL project is led by Stefan Plantikow (who was the first lead engineer of Neo4j's Cypher for Apache Spark project) and Stephen Cannan (Technical Corrigenda editor of SQL). They are also the editors of the initial early working drafts of the GQL specification. [6]

As originally motivated, [2] the GQL project aims to complement the work of creating an implementable normative natural-language specification with supportive community efforts that enable contributions from those who are unable or uninterested in taking part in the formal process of defining a JTC 1 International Standard. [7] [8] In July 2019 the Linked Data Benchmark Council (LDBC) agreed to become the umbrella organization for the efforts of community technical working groups. The Existing Languages and the Property Graph Schema working groups formed in late 2018 and early 2019 respectively. A working group to define formal denotational semantics for GQL was proposed at the third GQL Community Update in October 2019. [9]

ISO/IEC JTC 1/SC 32 WG3

Seven national standards bodies (those of the United States, China, Korea, the Netherlands, the United Kingdom, Denmark and Sweden) have nominated national subject-matter experts to work on the project, which is conducted by Working Group 3 (Database Languages) of ISO/IEC JTC 1's Subcommittee 32 (Data Management and Interchange), usually abbreviated as ISO/IEC JTC 1/SC 32 WG3, or just WG3 for short. WG3 (and its direct predecessor committees within JTC 1) has been responsible for the SQL standard since 1987. [10] [11]

ISO stages

ISO stages by date [12]

  1. 2019-09-10 : 10.99 New project approved
  2. 2019-09-10 : 20.00 New project registered in TC/SC work programme
  3. 2021-11-22 : 30.00 Committee draft (CD) registered
  4. 2021-11-23 : 30.20 CD study initiated
  5. 2022-02-25 : 30.60 Close of comment period
  6. 2022-08-29 : 30.92 CD referred back to Working Group
  7. 2022-08-29 : 30.00 Committee draft (CD) registered
  8. 2022-08-30 : 30.20 CD study initiated
  9. 2022-10-26 : 30.60 Close of comment period
  10. 2023-03-22 : 30.99 CD approved for registration as DIS
  11. 2023-03-24 : 40.00 DIS registered
  12. 2023-05-24 : 40.20 DIS ballot initiated: 12 weeks
  13. 2023-08-17 : 40.60 Close of voting
  14. 2023-11-28 : 40.99 Full report circulated: DIS approved for registration as FDIS
  15. 2023-12-11 : 50.00 Final text received or FDIS registered for formal approval
  16. 2024-01-26 : 50.20 Proof sent to secretariat or FDIS ballot initiated: 8 weeks
  17. 2024-03-23 : 50.60 Close of voting. Proof returned by secretariat
  18. 2024-03-23 : 60.00 International Standard under publication
  19. 2024-04-12 : 60.60 International Standard published

GQL property graph data model

GQL is a query language specifically for property graphs. A property graph closely resembles a conceptual data model, as expressed in an entity–relationship model or in a UML class diagram (although it does not include n-ary relationships linking more than two entities). Entities are modelled as nodes, and relationships as edges, in a graph. Property graphs are multigraphs: there can be many edges between the same pair of nodes. GQL graphs can be mixed: they can contain directed edges, where one of the endpoint nodes of an edge is the tail (or source) and the other node is the head (or target or destination), but they can also contain undirected (bidirectional or reflexive) edges.

Nodes and edges, collectively known as elements, have attributes. Those attributes may be data values, or labels (tags). Values of properties cannot be elements of graphs, nor can they be whole graphs: these restrictions intentionally force a clean separation between the topology of a graph, and the attributes carrying data values in the context of a graph topology. The property graph data model therefore deliberately prevents nesting of graphs, or treating nodes in one graph as edges in another. Each property graph may have a set of labels and a set of properties that are associated with the graph as a whole.

Current graph database products and projects often support a limited version of the model described here. For example, Apache Tinkerpop [13] forces each node and each edge to have a single label; Cypher allows nodes to have zero to many labels, but relationships only have a single label (called a reltype). Neo4j's database supports undocumented graph-wide properties, Tinkerpop has graph values which play the same role, and also supports "metaproperties" or properties on properties. Oracle's PGQL supports zero to many labels on nodes and on edges, whereas SQL/PGQ supports one to many labels for each kind of element. The NGSI-LD information model specified by ETSI is an attempt at formally specifying property graphs, with node and relationship (edge) types that may play the role of labels in previously mentioned models and support semantic referencing by inheriting classes defined in shared ontologies.

The GQL project will define a standard data model, which is likely to be the superset of these variants, and at least the first version of GQL is likely to permit vendors to decide on the cardinalities of labels in each implementation, as does SQL/PGQ, and to choose whether to support undirected relationships.

Additional aspects of the ERM or UML models (like generalization or subtyping, or entity or relationship cardinalities) may be captured by GQL schemas or types that describe possible instances of the general data model.

Implementations

The first in-memory graph database that can interpret GQL is available. [14] [15] Aside from the implementation, one can also find a formalization and read the syntax of the specific subset of GQL. [16]

Extending existing graph query languages

The GQL project draws on multiple sources or inputs, notably existing industrial languages and a new section of the SQL standard. In preparatory discussions within WG3 surveys of the history [17] and comparative content of some of these inputs [18] were presented. GQL is a declarative language with its own distinct syntax, playing a similar role to SQL in the building of a database application. Other graph query languages have been defined which offer direct procedural features such as branching and looping (Apache Tinkerpop's Gremlin [19] ), and GSQL, [20] making it possible to traverse a graph iteratively to perform a class of graph algorithms, but GQL will not directly incorporate such features. [21] [22] However, GQL is envisaged as a specific case of a more general class of graph languages, which share a graph type system and a calling interface for procedures that process graphs.

SQL/PGQ Property Graph Query

Prior work by WG3 and SC32 mirror bodies, particularly in INCITS Data Management (formerly INCITS DM32), has helped to define a new planned Part 16 of the SQL Standard, which allows a read-only graph query to be called inside a SQL SELECT statement, matching a graph pattern using syntax which is very close to Cypher, PGQL and G-CORE, and returning a table of data values as the result. SQL/PGQ also contains DDL to allow SQL tables to be mapped to a graph view schema object with nodes and edges associated to sets of labels and set of data properties. [23] [24] [25] The GQL project coordinates closely with the SQL/PGQ "project split" of (extension to) ISO 9075 SQL, and the technical working groups in the U.S. (INCITS DM32) and at the international level (SC32/WG3) have several expert contributors who work on both projects. [24] The GQL project proposal mandates close alignment of SQL/PGQ and GQL, indicating that GQL will in general be a superset of SQL/PGQ.

More details about the pattern matching language can be found in the paper "Graph Pattern Matching in GQL and SQL/PGQ" [26] [27]

Cypher

Cypher [28] is a language originally designed by Andrés Taylor and colleagues at Neo4j Inc., and first implemented by that company in 2011. Since 2015 it has been made available as an open source language description [29] with grammar tooling, a JVM front-end that parses Cypher queries, and a Technology Compatibility Kit (TCK) of over 2000 test scenarios, using Cucumber for implementation language portability. [30] The TCK reflects the language description and an enhancement for temporal datatypes and functions documented in a Cypher Improvement Proposal. [31]

Cypher allows creation, reading, updating and deleting of graph elements, and is a language that can therefore be used for analytics engines and transactional databases.

Querying with visual path patterns

Cypher uses compact fixed- and variable-length patterns which combine visual representations of node and relationship (edge) topologies, with label existence and property value predicates. (These patterns are usually referred to as "ASCII art" patterns, and arose originally as a way of commenting programs which used a lower-level graph API. [17] ) By matching such a pattern against graph data elements, a query can extract references to nodes, relationships and paths of interest. Those references are emitted as a "binding table" where column names are bound to a multiset of graph elements. The name of a column becomes the name of a "binding variable", whose value is a specific graph element reference for each row of the table.

For example, a pattern  MATCH (p:Person)-[:LIVES_IN]->(c:City)  will generate a two-column output table. The first column named  p  will contain references to nodes with a label  Person . The second column named  c  will contain references to nodes with a label  City , denoting the city where the person lives.

The binding variables  p  and  c  can then be dereferenced to obtain access to property values associated with the elements referred to by a variable. The example query might be terminated with a  RETURN, resulting in a complete query like this:

MATCH(p:Person)-[:LIVES_IN]->(c:City)RETURNp.first_name,p.last_name,c.name,c.state

This would result in a final four-column table listing the names of the residents of the cities stored in the graph.

Pattern-based queries are able to express joins, by combining multiple patterns which use the same binding variable to express a natural join using the  MATCH  clause:

MATCH(p:Person)-[:LIVES_IN]->(c:City),(p:Person)-[:NATIONAL_OF]->(EUCountry)RETURNp.first_name,p.last_name,c.name,c.state

This query would return the residential location only of EU nationals.

An outer join can be expressed by  MATCH ... OPTIONAL MATCH :

MATCH(p:Person)-[:LIVES_IN]->(c:City)OPTIONALMATCH(p:Person)-[:NATIONAL_OF]->(ec:EUCountry)RETURNp.first_name,p.last_name,c.name,c.state,ec.name

This query would return the city of residence of each person in the graph with residential information, and, if an EU national, which country they come from.

Queries are therefore able to first project a sub-graph of the graph input into the query, and then extract the data values associated with that subgraph. Data values can also be processed by functions, including aggregation functions, leading to the projection of computed values which render the information held in the projected graph in various ways. Following the lead of G-CORE and Morpheus, GQL aims to project the sub-graphs defined by matching patterns (and graphs then computed over those sub-graphs) as new graphs to be returned by a query.

Patterns of this kind have become pervasive in property graph query languages, and are the basis for the advanced pattern sub-language being defined in SQL/PGQ, which is likely to become a subset of the GQL language. Cypher also uses patterns for insertion and modification clauses ( CREATE  and  MERGE ), and proposals have been made in the GQL project for collecting node and edge patterns to describe graph types.

Cypher 9 and Cypher 10

The current version of Cypher (including the temporal extension) is referred to as Cypher 9. Prior to the GQL project it was planned to create a new version, Cypher 10 [REF HEADING BELOW], that would incorporate features like schema and composable graph queries and views. The first designs for Cypher 10, including graph construction and projection, were implemented in the Cypher for Apache Spark project starting in 2016. [32]

PGQL

PGQL [33] is a language designed and implemented by Oracle Inc., but made available as an open source specification, [34] along with JVM parsing software. [35] PGQL combines familiar SQL SELECT syntax including SQL expressions and result ordering and aggregation with a pattern matching language very similar to that of Cypher. It allows the specification of the graph to be queried, and includes a facility for macros to capture "pattern views", or named sub-patterns. It does not support insertion or updating operations, having been designed primarily for an analytics environment, such as Oracle's PGX product. PGQL has also been implemented in Oracle Big Data Spatial and Graph, and in a research project, PGX.D/Async. [36]

G-CORE

G-CORE is a research language designed by a group of academic and industrial researchers and language designers which draws on features of Cypher, PGQL and SPARQL. [37] [38] The project was conducted under the auspices of the Linked Data Benchmark Council (LDBC), starting with the formation of a Graph Query Language task force in late 2015, with the bulk of the work of paper writing occurring in 2017. G-CORE is a composable language which is closed over graphs: graph inputs are processed to create a graph output, using graph projections and graph set operations to construct the new graph. G-CORE queries are pure functions over graphs, having no side effects, which mean that the language does not define operations which mutate (update or delete) stored data. G-CORE introduces views (named queries). It also incorporates paths as elements in a graph ("paths as first class citizens"), which can be queried independently of projected paths (which are computed at query time over node and edge elements). G-CORE has been partially implemented in open-source research projects in the LDBC GitHub organization. [39] [40] [41]

GSQL

GSQL [20] is a language designed for TigerGraph Inc.'s proprietary graph database. Since October 2018 TigerGraph language designers have been promoting and working on the GQL project. GSQL is a Turing-complete language that incorporates procedural flow control and iteration, and a facility for gathering and modifying computed values associated with a program execution for the whole graph or for elements of a graph called accumulators. These features are designed to enable iterative graph computations to be combined with data exploration and retrieval. GSQL graphs must be described by a schema of vertexes and edges, which constrains all insertions and updates. This schema therefore has the closed world property of an SQL schema, and this aspect of GSQL (also reflected in design proposals deriving from the Morpheus project [42] ) is proposed as an important optional feature of GSQL.

Vertexes and edges are named schema objects which contain data but also define an imputed type, much as SQL tables are data containers, with an associated implicit row type. GSQL graphs are then composed from these vertex and edge sets, and multiple named graphs can include the same vertex or edge set. GSQL has developed new features since its release in September 2017, [43] most notably introducing variable-length edge pattern matching [44] using a syntax related to that seen in Cypher, PGQL and SQL/PGQ, but also close in style to the fixed-length patterns offered by Microsoft SQL/Server Graph [45]

GSQL also supports the concept of Multigraphs [46] which allow subsets of a graph to have role-based access control. Multigraphs are important for enterprise-scale graphs that need fine-grain access control for different users.

Morpheus: multiple graphs and composable graph queries in Apache Spark

The opencypher Morpheus project [32] implements Cypher for Apache Spark users. Commencing in 2016, this project originally ran alongside three related efforts, in which Morpheus designers also took part: SQL/PGQ, G-CORE and design of Cypher extensions for querying and constructing multiple graphs. [47] The Morpheus project acted as a testbed for extensions to Cypher (known as "Cypher 10") in the two areas of graph DDL and query language extensions.

Graph DDL features include [48]

  1. definition of property graph views over JDBC-connected SQL tables and Spark DataFrames [49]
  2. definition of graph schemas or types defined by assembling node type and edge type patterns, with subtyping [49]
  3. constraining the content of a graph by a closed or fixed schema
  4. creating catalog entries for multiple named graphs in a hierarchically organized catalog
  5. graph data sources to form a federated, heterogeneous catalog
  6. creating catalog entries for named queries (views)

Graph query language extensions include [48]

  1. graph union
  2. projection of graphs computed from the results of pattern matches on multiple input graphs
  3. support for tables (Spark DataFrames) as inputs to queries ("driving tables")
  4. views which accept named or projected graphs as parameters.

These features have been proposed as inputs to the standardization of property graph query languages in the GQL project.

See also

Related Research Articles

Structured Query Language (SQL) is a domain-specific language used to manage data, especially in a relational database management system (RDBMS). It is particularly useful in handling structured data, i.e., data incorporating relations among entities and variables.

<span class="mw-page-title-main">Topic map</span> Knowledge organization system

A topic map is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. Topic maps were originally developed in the late 1990s as a way to represent back-of-the-book index structures so that multiple indexes from different sources could be merged. However, the developers quickly realized that with a little additional generalization, they could create a meta-model with potentially far wider application. The ISO/IEC standard is formally known as ISO/IEC 13250:2003.

A query language, also known as data query language or database query language (DQL), is a computer language used to make queries in databases and information systems. In database systems, query languages rely on strict theory to retrieve information. A well known example is the Structured Query Language (SQL).

A temporal database stores data relating to time instances. It offers temporal data types and stores information relating to past, present and future time. Temporal databases can be uni-temporal, bi-temporal or tri-temporal.

An XML database is a data persistence software system that allows data to be specified, and sometimes stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented databases which are in turn a category of NoSQL database.

<span class="mw-page-title-main">RDFLib</span> Python library to serialize, parse and process RDF data

RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information. This library contains parsers/serializers for almost all of the known RDF serializations, such as RDF/XML, Turtle, N-Triples, & JSON-LD, many of which are now supported in their updated form. The library also contains both in-memory and persistent Graph back-ends for storing RDF information and numerous convenience functions for declaring graph namespaces, lodging SPARQL queries and so on. It is in continuous development with the most recent stable release, rdflib 6.1.1 having been released on 20 December 2021. It was originally created by Daniel Krech with the first release in November, 2002.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

SQL:2008 is the sixth revision of the ISO and ANSI standard for the SQL database query language. It was formally adopted in July 2008. The standard consists of 9 parts which are described in detail in SQL. The next iteration is SQL:2011

A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.

<span class="mw-page-title-main">Neo4j</span> Graph database implemented in Java

Neo4j is a graph database management system (GDBMS) developed by Neo4j Inc.

In computing, Open Data Protocol (OData) is an open protocol that allows the creation and consumption of queryable and interoperable Web service APIs in a standard way. Microsoft initiated OData in 2007. Versions 1.0, 2.0, and 3.0 are released under the Microsoft Open Specification Promise. Version 4.0 was standardized at OASIS, with a release in March 2014. In April 2015 OASIS submitted OData v4 and OData JSON Format v4 to ISO/IEC JTC 1 for approval as an international standard. In December 2016, ISO/IEC published OData 4.0 Core as ISO/IEC 20802-1:2016 and the OData JSON Format as ISO/IEC 20802-2:2016.

Sparksee is a high-performance and scalable graph database management system written in C++. From version 6.0, Sparksee has shifted its focus to embedded systems and mobile, becoming the first graph database specialized in mobile platforms with versions for IOS and Android.

InfiniteGraph is a distributed graph database implemented in Java and C++ and is from a class of NOSQL database technologies that focus on graph data structures. Developers use InfiniteGraph to find useful and often hidden relationships in highly connected, complex big data sets. InfiniteGraph is cross-platform, scalable, cloud-enabled, and is designed to handle very high throughput.

ISO/IEC 9075 "Information technology - Database languages - SQL" is an international standard for Structured Query Language, and is considered as specifying the minimum for what a database engine should fulfill in terms of SQL syntax, which is called Core SQL. The standard also defines a number of optional features.

Cypher is a declarative graph query language that allows for expressive and efficient data querying in a property graph.

Semantic queries allow for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results or to answer more fuzzy and wide open questions through pattern matching and digital reasoning.

SQL:2016 or ISO/IEC 9075:2016 is the eighth revision of the ISO (1987) and ANSI (1986) standard for the SQL database query language. It was formally adopted in December 2016. The standard consists of 9 parts which are described in some detail in SQL. The next version is SQL:2023.

<span class="mw-page-title-main">TypeDB</span> Open-source, strongly-typed database

TypeDB is an open-source, distributed database management system that relies on a user-defined type system to model, manage, and query data.

SQL:2023 or ISO/IEC 9075:2023 is the ninth edition of the ISO (1987) and ANSI (1986) standard for the SQL database query language. It was formally adopted in June 2023.

References

  1. Green, Alastair (July 2016). "Creating an Open Industry Standard for a Declarative Property Graph Query Language" (PDF). opencypher.org. Retrieved November 12, 2019.
  2. 1 2 Green, Alastair (July 2018). "Working towards a New Work Item for GQL, to complement SQL PGQ, ANSI INCITS DM32.2 submission DM32.2-2018-00128r1" (PDF). opencypher.org. Retrieved November 12, 2019.
  3. "ISO/IEC 39075 Information Technology — Database Languages — GQL". ISO. Retrieved January 7, 2022.
  4. "SC32 WG3 N282 "SC32 N3002 Draft NWIP Form4 Information Technology – Database Languages - GQL"". ISO. Retrieved December 9, 2019.
  5. "ISO/IEC 39075:2024 Information technology — Database languages — GQL". ISO. Retrieved 25 May 2024.
  6. Eds. Plantikow, Stefan; Cannan, Stephen (October 2019). "GQL Early Working Draft v2.2". ISO. Retrieved November 9, 2019.
  7. "GQL Standard" . Retrieved November 12, 2019.
  8. "GQL Community Updates" . Retrieved November 12, 2019.
  9. Libkin, Leonid. "Formal Semantics Working Group" . Retrieved November 12, 2019.
  10. "JTC 1/SC 32 Data Management and Interchange". ISO/IEC JTC1. Retrieved October 6, 2019.
  11. "Scope from the original standard, ISO 9075-1987, Database Language SQL". ISO/IEC JTC1. Retrieved November 9, 2019.
  12. "Iso/Iec 39075:2024".
  13. "Apache Tinkerpop". Apache Software Foundation. Retrieved November 11, 2019.
  14. "GQL Parser". GitHub . Retrieved January 18, 2021.
  15. "First GQL research implementation from Olof Morra at TU Eindhoven!". Alastair Green. Retrieved January 18, 2021.
  16. "A Semantics of GQL; a New Query Language for Property Graphs Formalized" (PDF). Olof Morra. Retrieved January 18, 2021.
  17. 1 2 Lindaaker, Tobias (May 2018). "An overview of the recent history of Graph Query Languages" (PDF). opencypher.org. Retrieved October 6, 2019.
  18. Plantikow, Stefan (May 2018). "Summary Chart of Cypher, PGQL, and G-Core" (PDF). opencypher.org. Retrieved November 3, 2019.
  19. Rodriguez, Marko A. (2015). "The Gremlin graph traversal machine and language (Invited talk)". Proceedings of the 15th Symposium on Database Programming Languages. ACM. pp. 1–10. arXiv: 1508.03843 . doi:10.1145/2815072.2815073. ISBN   9781450339025. S2CID   32623848 . Retrieved November 10, 2019.
  20. 1 2 Wu, Mingxi; Deutsch, Alin. "GSQL: An SQL-Inspired Graph Query Language" . Retrieved November 9, 2019.
  21. Wood, Peter T. (25 April 2012). "Query languages for graph databases". ACM SIGMOD Record. 41 (1). ACM: 50–60. doi:10.1145/2206869.2206879. S2CID   13537601 . Retrieved October 25, 2019.
  22. Angles, Renzo; et al. (September 2017). "Foundations of Modern Query Languages for Graph Databases". ACM Computing Surveys. 50 (5). ACM: 68:1–40. arXiv: 1610.06264 . doi:10.1145/3104031. S2CID   13526884 . Retrieved November 12, 2019.
  23. "ISO/IEC 9075-16 Information technology — Database languages SQL — Part 16: SQL Property Graph Queries (SQL/PGQ)". ISO. Retrieved January 7, 2022.
  24. 1 2 Hare, Keith; et al. (March 2019). "SQL and GQL, W3C Workshop on Web Standardization for Graph Data. Creating Bridges: RDF, Property Graph and SQL" (PDF). W3C. Retrieved October 6, 2019.
  25. Trigonakis, Vasileios (July 2019). "Property graph extensions for the SQL standard. LDBC 12th TUC" (PDF). LBDC. Retrieved January 7, 2022.
  26. Deutsch, Alin; Francis, Nadime; Green, Alastair; Hare, Keith; Li, Bei; Libkin, Leonid; Lindaaker, Tobias; Marsault, Victor; Martens, Wim; Michels, Jan; et al. (2021-12-12). "Graph Pattern Matching in GQL and SQL/PGQ". arXiv: 2112.06217 [cs.DB].
  27. Deutsch, Alin; Francis, Nadime; Green, Alastair; Hare, Keith; Li, Bei; Libkin, Leonid; Lindaaker, Tobias; Marsault, Victor; Martens, Wim; Michels, Jan; Murlak, Filip; Plantikow, Stefan; Selmer, Petra; van Rest, Oskar; Voigt, Hannes (2022-06-11). "Graph Pattern Matching in GQL and SQL/PGQ". Proceedings of the 2022 International Conference on Management of Data. SIGMOD '22. New York, NY, USA: Association for Computing Machinery. pp. 2246–2258. doi:10.1145/3514221.3526057. ISBN   978-1-4503-9249-5. S2CID   245124268.
  28. Francis, Nadime; et al. (27 May 2018). "Cypher: An Evolving Query Language for Property Graphs". Proceedings of the 2018 International Conference on Management of Data. ACM. pp. 1433–1445. doi:10.1145/3183713.3190657. ISBN   9781450347037. S2CID   13919896 . Retrieved October 25, 2019.
  29. "Cypher Query Language Reference (Version 9)" (PDF). opencypher.org. Retrieved November 10, 2019.
  30. "openCypher Resources". ACM. Retrieved November 10, 2019.
  31. "CIP2015-08-06 - Date and Time". opencypher.org. 15 May 2019. Retrieved October 25, 2019.
  32. 1 2 Rydberg, Mats; et al. (July 2016). "Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.". openCypher. Retrieved November 3, 2019.
  33. van Rest, Oskar; et al. (June 2016). "PGQL: A property graph query language". Proceedings of the Fourth International Workshop on Graph Data Management Experiences and Systems. ACM. pp. 1–6. doi:10.1145/2960414.2960421. ISBN   978-1-4503-4780-8. S2CID   6806901 . Retrieved October 25, 2019.
  34. "PGQL". pgql.org. Retrieved October 6, 2019.
  35. van Rest, Oskar; et al. (September 2015). "PGQL is an SQL-based query language for the Property Graph data model". pgql.org. Retrieved November 3, 2019.
  36. Roth, Nicholas P.; et al. (2017). "PGX.D/Async: A Scalable Distributed Graph Pattern Matching Engine". Proceedings of the Fifth International Workshop on Graph Data-management Experiences & Systems. ACM. pp. 1–6. doi:10.1145/3078447.3078454. ISBN   978-1-4503-5038-9. S2CID   26283328 . Retrieved October 29, 2019.
  37. Angles, Renzo; et al. (2018). "G-CORE: A Core for Future Graph Query Languages". Proceedings of the 2018 International Conference on Management of Data. ACM. pp. 1421–1432. doi:10.1145/3183713.3190654. ISBN   978-1-4503-4703-7. S2CID   4623760 . Retrieved November 9, 2019.
  38. Voigt, Hannes (February 2018). "G-CORE: The LDBC Graph Query Language Proposal. In archives of FOSDEM 2018" . Retrieved November 12, 2019.
  39. van Rest, Oskar (2017). "G-CORE Grammar and Parser". LDBC. Retrieved November 12, 2019.
  40. Ciocîrdel, Georgiana Diana (2018). "A G-CORE (Graph Query Language) Interpreter, Master's Thesis in Parallel and Distributed Computer Systems, CWI and Vrije Universiteit Amsterdam" (PDF). CWI. Retrieved November 12, 2019.
  41. Ciocîrdel, Georgiana Diana; Boncz, Peter (2017). "G-CORE interpreter on Spark". LDBC. Retrieved November 12, 2019.
  42. Voigt, Hannes; Selmer, Petra; Lindaaker, Tobias; Plantikow, Stefan; Green, Alastair; Furniss, Peter (December 2018). "Property Graph Schema, ANSI INCITS DM32.2 SQL Property Graph Extensions Ad Hoc submission sql-pg-2018-0056r1, Neo4j Query Languages Standards and Research Team" (PDF). openCypher.org. Retrieved November 12, 2019.
  43. "GSQL documentation Tigergraph 1.0". 2017. Retrieved November 9, 2019.
  44. "Pattern Matching, TigerGraph 2.4 Release Notes". June 2019. Retrieved November 9, 2019.
  45. "Query language extensions, Graph processing with SQL Server and Azure SQL Database". Microsoft Inc. 2017. Retrieved November 10, 2019.
  46. "Multigraphs, TigerGraph Online Documentation". June 2019. Retrieved January 7, 2022.
  47. Taylor, Andrés; Plantikow, Stefan; Selmer, Petra (2017–2018). "CIP2017-06-18 Querying and constructing multiple graphs". opencypher.org. Retrieved November 12, 2019.
  48. 1 2 Kiessling, Max (2019). "Multiple graphs and composable queries in Cypher for Apache Spark. openCypher Implementers Meeting V, Berlin" (PDF). opencypher.org. Retrieved November 9, 2019.
  49. 1 2 Johanssen, Tobias; et al. (2019). "graphddl-example-ldbc: A cypher-for-apache-spark example showing the use of SqlPropertyGraphSource and GraphDDL to provide a property graph view of a SQL dataset". GitHub . Retrieved November 9, 2019.