Amazon Neptune

Last updated

Amazon Neptune is a managed graph database product published by Amazon.com. It is used as a web service and is part of Amazon Web Services (AWS). It was announced on November 29, 2017. [1] Amazon Neptune supports popular graph models property graph and W3C's RDF, and their respective query languages Apache TinkerPop's Gremlin, [2] openCypher, [3] and SPARQL, [4] including other Amazon Web Services products.

Contents

Amazon Neptune general availability (GA) was announced on May 30, 2018 [5] [6] and is currently available in 22 AWS regions. Neptune is HIPAA eligible. [7] On December 12, 2018, it was announced that Amazon Neptune was in-scope for Payment Card Industry Data Security Standard, and ISO compliance programs. [8]

History

Amazon Neptune is based on Blazegraph. Amazon acquired the Blazegraph developers and the Blazegraph open source development was essentially stopped in April 2018. [9]

History of Amazon Neptune engine releases [10]
ReleaseChangeDate
1.0.2.2Engine version 1.0.2.2March 9, 2020
1.0.2.1Engine version 1.0.2.1.R4December 20, 2019
1.0.2.1Engine version 1.0.2.1.R3December 12, 2019
1.0.2.1Engine version 1.0.2.1.R2November 23, 2019
1.0.2.0Engine version 1.0.2.0.R2November 8, 2019
1.0.1.0Engine version 1.0.1.0.200502.0October 31, 2019
1.0.1.0Engine version 1.0.1.0.200463.0October 15, 2019
1.0.1.0Engine version 1.0.1.0.200457.0September 19, 2019
1.0.1.0Engine version 1.0.1.0.200369.0August 13, 2019
1.0.1.0Engine version 1.0.1.0.200348.0July 2, 2019
1.0.1.0Engine version 1.0.1.0.200310.0June 12, 2019
1.0.1.0Engine version 1.0.1.0.200296.0May 1, 2019
1.0.1.0Engine version 1.0.1.0.200267.0January 21, 2019
1.0.1.0Engine version 1.0.1.0.200264.0November 19, 2018
1.0.1.0Engine version 1.0.1.0.200258.0November 8, 2018
1.0.1.0Engine version 1.0.1.0.200255.0October 29, 2018
1.0.1.0Engine version 1.0.1.0.200237.0September 6, 2018
1.0.1.0Engine version 1.0.1.0.200236.0July 24, 2018
1.0.1.0Engine version 1.0.1.0.200233.0June 22, 2018
1.0.1.0Amazon Neptune initial releaseMay 30, 2018

Features

External support

Amazon Neptune supports the open source Apache TinkerPop Gremlin [2] graph traversal language, openCypher query language [3] for property graphs, and the W3C standard Resource Description Framework's (RDF) SPARQL [4] query language. All three can be used on the same Neptune instance, and allows the user to build queries to navigate highly connected data sets and provides high performance for both graph models. Neptune also uses other AWS product features such as those of Amazon S3, Amazon EC2 and Amazon CloudWatch. [11]

Security

All Amazon Neptune database clusters are created and stored in an Amazon Virtual Private Cloud (VPC), which allows the user to isolate their database in their own private network. Using Neptune's VPC configuration, the user can configure firewall settings to their needs in order to control network access to database instances. Amazon Neptune is integrated with AWS Identity and Access Management (IAM), which allows the user to create AWS IAM groups and control the actions that the groups and other AWS IAM users can do. Neptune allows the user to encrypt databases using keys created through AWS Key Management Service (KMS). A database instance running with Neptune Encryption, encrypts all of the stored data, backups, snapshots and replicas in the same cluster. Amazon Neptune allows the user to use HTTPS to encrypt data during its transfer to and from clients or Neptune service endpoints using Transport Layer Security (TLS). [12]

Storage

The data is stored in a cluster volume, a virtual volume utilizing SSD drives. These sizes of these volumes are dynamic, they increase depending how much data is stored in the database, with a maximum of 64 TB. The Amazon Neptune SLA policy is designed to offer a monthly uptime percentage greater that of 99.9%, increasing database performance and availability by integrating the engine with a virtual storage based on SSD drives, that are specially made for database workloads. Neptune maintains copies of the user's data in multiple Availability Zones. In case of failures, Neptune automatically detects any failed segments in a disk volume and repairs them. [13]

Availability

Neptune is now generally available in the US East (N. Virginia), US East (Ohio), US West (Oregon), US West (N. California), Canada (Central), [14] AWS GovCloud (US-West), [15] AWS GovCloud (US-East), [16] Europe (Ireland), Europe (London), [17] Europe (Frankfurt), [18] Europe (Stockholm), [19] Europe (Paris), [20] South America (São Paulo), Asia Pacific (Singapore), [21] Asia Pacific (Sydney), [22] Asia Pacific (Tokyo), [23] Asia Pacific (Mumbai), [24] Asia Pacific (Seoul), [25] Asia Pacific (Hong Kong), China (Ningxia), China (Beijing), Operated by NWCD, [26] and Middle-East (Bahrain) [27] AWS regions.

Pricing

Pricing is determined by AWS regions, with each region having different prices for the available services.

On-Demand Instance Pricing lets the user pay only for what instance they use by the hour instead of a fixed sum. The price also differs depending on the instance class. Similarly, the user only pays for the storage consumed by the database, with no payments in advance. The price is based on the storage rate and I/O rate, which are billed in GB per month and per million request increments respectively. This pricing model is beneficial to short-term and small-scale projects and is available in all AWS regions. The price for backup storage is also billed in per GB-month increments albeit at different rates. Data Transfer is priced in per GB increments, the amount depends on whether the data is transferred in or out of Amazon Neptune and how much data is transferred per month (TB per Month). [28]

Application

On September 12, 2018, it was announced that Neptune achieved HIPAA eligibility [7] enabling it to process data sets containing protected health information (PHI). On December 12, 2018, it was announced that Amazon Neptune was in-scope for Payment Card Industry Data Security Standard, ISO 9001, ISO 27001, ISO 27017, and ISO 27018 compliance programs. [8] The user can use Amazon Neptune in applications that are subject to PCI compliance or require ISO certification. [29] On May 17, 2019, it was announced that Neptune achieved SSAE 16 eligibility [30]

Neptune powers graph use cases such as recommendation engines, fraud detection, knowledge graphs, drug discovery, network security, and social networking. [31]

Notable users

Some notable customers of Amazon Neptune include, Samsung Electronics, Pearson, Intuit, Siemens, AstraZeneca, FINRA, LifeOmic, Blackfynn, and Amazon Alexa [32]

See also

Related Research Articles

<span class="mw-page-title-main">Amazon Web Services</span> On-demand cloud computing company

Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon that provides on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered, pay-as-you-go basis. Clients will often use this in combination with autoscaling. These cloud computing web services provide various services related to networking, compute, storage, middleware, IoT and other processing capacity, as well as software tools via AWS server farms. This frees clients from managing, scaling, and patching hardware and operating systems. One of the foundational services is Amazon Elastic Compute Cloud (EC2), which allows users to have at their disposal a virtual cluster of computers, with extremely high availability, which can be interacted with over the internet via REST APIs, a CLI or the AWS console. AWS's virtual computers emulate most of the attributes of a real computer, including hardware central processing units (CPUs) and graphics processing units (GPUs) for processing; local/RAM memory; Hard-disk(HDD)/SSD storage; a choice of operating systems; networking; and pre-loaded application software such as web servers, databases, and customer relationship management (CRM).

Amazon S3 or Amazon Simple Storage Service is a service offered by Amazon Web Services (AWS) that provides object storage through a web service interface. Amazon S3 uses the same scalable storage infrastructure that Amazon.com uses to run its e-commerce network. Amazon S3 can store any type of object, which allows uses like storage for Internet applications, backups, disaster recovery, data archives, data lakes for analytics, and hybrid cloud storage. AWS launched Amazon S3 in the United States on March 14, 2006, then in Europe in November 2007.

<span class="mw-page-title-main">Amazon Elastic Compute Cloud</span> Cloud computing platform

Amazon Elastic Compute Cloud (EC2) is a part of Amazon.com's cloud-computing platform, Amazon Web Services (AWS), that allows users to rent virtual computers on which to run their own computer applications. EC2 encourages scalable deployment of applications by providing a web service through which a user can boot an Amazon Machine Image (AMI) to configure a virtual machine, which Amazon calls an "instance", containing any software desired. A user can create, launch, and terminate server-instances as needed, paying by the second for active servers – hence the term "elastic". EC2 provides users with control over the geographical location of instances that allows for latency optimization and high levels of redundancy. In November 2010, Amazon switched its own retail website platform to EC2 and AWS.

This is a comparison of online backup services.

<span class="mw-page-title-main">Cloud computing</span> Form of shared Internet-based computing

Cloud computing is the on-demand availability of computer system resources, especially data storage and computing power, without direct active management by the user. Large clouds often have functions distributed over multiple locations, each of which is a data center. Cloud computing relies on sharing of resources to achieve coherence and typically uses a pay-as-you-go model, which can help in reducing capital expenses but may also lead to unexpected operating expenses for users.

<span class="mw-page-title-main">Amazon Virtual Private Cloud</span> Cloud-based service

Amazon Virtual Private Cloud (VPC) is a commercial cloud computing service that provides a virtual private cloud, by provisioning a logically isolated section of Amazon Web Services (AWS) Cloud. Enterprise customers can access the Amazon Elastic Compute Cloud (EC2) over an IPsec based virtual private network. Unlike traditional EC2 instances which are allocated internal and external IP numbers by Amazon, the customer can assign IP numbers of their choosing from one or more subnets.

<span class="mw-page-title-main">Virtual private cloud</span> Pool of shared resources allocated within a public cloud environment

A virtual private cloud (VPC) is an on-demand configurable pool of shared resources allocated within a public cloud environment, providing a certain level of isolation between the different organizations (denoted as users hereafter) using the resources. The isolation between one VPC user and all other users of the same cloud (other VPC users as well as other public cloud users) is achieved normally through allocation of a private IP subnet and a virtual communication construct (such as a VLAN or a set of encrypted communication channels) per user. In a VPC, the previously described mechanism, providing isolation within the cloud, is accompanied with a virtual private network (VPN) function (again, allocated per VPC user) that secures, by means of authentication and encryption, the remote access of the organization to its VPC resources. With the introduction of the described isolation levels, an organization using this service is in effect working on a 'virtually private' cloud (that is, as if the cloud infrastructure is not shared with other users), and hence the name VPC.

A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.

<span class="mw-page-title-main">Redis</span> Open-source in-memory key–value database

Redis is an open-source in-memory storage, used as a distributed, in-memory key–value database, cache and message broker, with optional durability. Because it holds all data in memory and because of its design, Redis offers low-latency reads and writes, making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database, and one of the most popular databases overall. Redis is used in companies like Twitter, Airbnb, Tinder, Yahoo, Adobe, Hulu, Amazon and OpenAI.

<span class="mw-page-title-main">Amazon Elastic Block Store</span> Cloud-based raw storage service

Amazon Elastic Block Store (EBS) provides raw block-level storage that can be attached to Amazon EC2 instances and is used by Amazon Relational Database Service (RDS). It is one of the two block-storage options offered by AWS, with the other being the EC2 Instance Store.

Amazon Relational Database Service is a distributed relational database service by Amazon Web Services (AWS). It is a web service running "in the cloud" designed to simplify the setup, operation, and scaling of a relational database for use in applications. Administration processes like patching the database software, backing up databases and enabling point-in-time recovery are managed automatically. Scaling storage and compute resources can be performed by a single API call to the AWS control plane on-demand. AWS does not offer an SSH connection to the underlying virtual machine as part of the managed service.

A cloud database is a database that typically runs on a cloud computing platform and access to the database is provided as-a-service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service, maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.

Amazon S3 Glacier is an online file storage web service that provides storage for data archiving and backup.

Backend as a service (BaaS), sometimes also referred to as mobile backend as a service (MBaaS), is a service for providing web app and mobile app developers with a way to easily build a backend to their frontend applications. Features available include user management, push notifications, and integration with social networking services. These services are provided via the use of custom software development kits (SDKs) and application programming interfaces (APIs). BaaS is a relatively recent development in cloud computing, with most BaaS startups dating from 2011 or later. Some of the most popular service providers are AWS Amplify and Firebase.

Amazon Aurora is a relational database service developed and offered by Amazon Web Services beginning in October 2014. Aurora is available as part of the Amazon Relational Database Service (RDS).

<span class="mw-page-title-main">AWS Lambda</span> Serverless computing platform

AWS Lambda is an event-driven, serverless computing platform provided by Amazon as a part of Amazon Web Services. It is designed to enable developers to run code without provisioning or managing servers. It executes code in response to events and automatically manages the computing resources required by that code. It was introduced on November 13, 2014.

This is a timeline of Amazon Web Services, which offers a suite of cloud computing services that make up an on-demand computing platform.

Amazon Elastic File System is a cloud storage service provided by Amazon Web Services (AWS) designed to provide scalable, elastic, concurrent with some restrictions, and encrypted file storage for use with both AWS cloud services and on-premises resources. Amazon EFS is built to be able to grow and shrink automatically as files are added and removed. Amazon EFS supports Network File System (NFS) versions 4.0 and 4.1 (NFSv4) protocol, and control access to files through Portable Operating System Interface (POSIX) permissions.

<span class="mw-page-title-main">Blazegraph</span> Open source triplestore and graph database

Blazegraph is an open source triplestore and graph database, developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL.

Amazon DocumentDB is a managed proprietary NoSQL database service that supports document data structures, with some compatibility with MongoDB version 3.6 and version 4.0. As a document database, Amazon DocumentDB can store, query, and index JSON data. It is available on Amazon Web Services. As of March 2023, AWS introduced some compliance with MongoDB 5.0 but lacks time series collection support.

References

  1. "Amazon Neptune – A Fully Managed Graph Database Service - Amazon Web Services". Amazon Web Services. 29 November 2017.
  2. 1 2 "Accessing the Neptune Graph with Gremlin - Amazon Neptune". docs.aws.amazon.com. Retrieved 2019-05-12.
  3. 1 2 "Accessing the Neptune Graph with openCypher". docs.aws.amazon.com. Retrieved 2022-11-02.
  4. 1 2 "Accessing the Neptune Graph with SPARQL - Amazon Neptune". docs.aws.amazon.com. Retrieved 2019-05-12.
  5. "AWS Announces General Availability of Amazon Neptune". Amazon Web Services. 30 May 2018.
  6. "Amazon Neptune is now generally available to build fast, reliable graph applications". Amazon Web Services. 30 May 2018.
  7. 1 2 "Amazon Neptune Achieves HIPAA Eligibility". Amazon Web Services. 12 September 2018.
  8. 1 2 "Amazon Neptune Now In-Scope for PCI DSS and ISO Compliance Programs". Amazon Web Services. 12 December 2018.
  9. "Blazegraph fork?". GitHub. 12 April 2018.
  10. "Engine Releases for Amazon Neptune - Amazon Neptune". docs.aws.amazon.com. Retrieved 2019-04-29.
  11. "Amazon Neptune Features". Amazon Web Services, Inc. Retrieved 2019-05-12.
  12. "Configuring Security in Amazon Neptune - Amazon Neptune". docs.aws.amazon.com. Retrieved 2019-04-29.
  13. "Amazon Neptune Storage - Amazon Neptune". docs.aws.amazon.com. Retrieved 2019-04-29.
  14. "Amazon Neptune is Now Available in the Canada (Central) Region". Amazon Web Services. 30 October 2019.
  15. "Amazon Neptune is Now Available in the AWS GovCloud (US-West) Region". Amazon Web Services. 14 August 2019.
  16. "Amazon Neptune is Now Available in the AWS GovCloud (US-East) Region". Amazon Web Services. 21 August 2019.
  17. "Amazon Neptune is Now Available in Europe (London)". Amazon Web Services. 3 October 2018.
  18. "Amazon Neptune is Now Available in Europe (Frankfurt)". Amazon Web Services. 7 November 2018.
  19. "Amazon Neptune is Now Available in Europe (Stockholm)". Amazon Web Services. 27 June 2019.
  20. "Amazon Neptune is Now Available in Europe (Paris)". Amazon Web Services. 11 March 2020.
  21. "Amazon Neptune is Now Available in Asia Pacific (Singapore)". Amazon Web Services. 13 December 2018.
  22. "Amazon Neptune is Now Available in Asia Pacific (Sydney)". Amazon Web Services, Inc. Retrieved 2019-04-29.
  23. "Amazon Neptune is Now Available in the Asia Pacific (Tokyo) Region". Amazon Web Services, Inc. Retrieved 2019-04-29.
  24. "Amazon Neptune Now Available in Asia Pacific (Mumbai)". Amazon Web Services, Inc. Retrieved 2019-05-12.
  25. "Amazon Neptune is Now Available in Asia Pacific (Seoul)". Amazon Web Services, Inc. Retrieved 2019-05-12.
  26. "Amazon Neptune is now available in AWS China (Ningxia) Region, Operated by NWCD". Amazon Web Services, Inc. Retrieved 2020-01-15.
  27. "Amazon Neptune is Now Available in the Middle East (Bahrain) Region". Amazon Web Services, Inc. Retrieved 2019-10-30.
  28. "Amazon Neptune Pricing". Amazon Web Services, Inc. Retrieved 2019-04-29.
  29. "Amazon Neptune Now In-Scope for PCI DSS and ISO Compliance Programs". Amazon Web Services, Inc. Retrieved 2019-05-12.
  30. "Amazon Neptune Is Now SOC Compliant". Amazon Web Services, Inc. Retrieved 2019-05-17.
  31. "Amazon Neptune - Fast, Reliable Graph Database built for the cloud". Amazon Web Services, Inc. Retrieved 2019-05-12.
  32. "AWS Announces General Availability of Amazon Neptune". Amazon.com, Inc. - Press Room. 30 May 2018. Retrieved 2019-04-29.

Further reading