This article may rely excessively on sources too closely associated with the subject , potentially preventing the article from being verifiable and neutral.(February 2019) |
Amazon Neptune is a managed graph database product published by Amazon.com. It is used as a web service and is part of Amazon Web Services (AWS). It was announced on November 29, 2017. [1] Amazon Neptune supports popular graph models property graph and W3C's RDF, and their respective query languages Apache TinkerPop's Gremlin, [2] openCypher, [3] and SPARQL, [4] including other Amazon Web Services products.
Amazon Neptune general availability (GA) was announced on May 30, 2018 [5] [6] and is currently available in 22 AWS regions. Neptune is HIPAA eligible. [7] On December 12, 2018, it was announced that Amazon Neptune was in-scope for Payment Card Industry Data Security Standard, and ISO compliance programs. [8]
Amazon Neptune is based on Blazegraph. Amazon acquired the Blazegraph developers and the Blazegraph open source development was essentially stopped in April 2018. [9]
Release | Change | Date |
---|---|---|
1.0.2.2 | Engine version 1.0.2.2 | March 9, 2020 |
1.0.2.1 | Engine version 1.0.2.1.R4 | December 20, 2019 |
1.0.2.1 | Engine version 1.0.2.1.R3 | December 12, 2019 |
1.0.2.1 | Engine version 1.0.2.1.R2 | November 23, 2019 |
1.0.2.0 | Engine version 1.0.2.0.R2 | November 8, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200502.0 | October 31, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200463.0 | October 15, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200457.0 | September 19, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200369.0 | August 13, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200348.0 | July 2, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200310.0 | June 12, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200296.0 | May 1, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200267.0 | January 21, 2019 |
1.0.1.0 | Engine version 1.0.1.0.200264.0 | November 19, 2018 |
1.0.1.0 | Engine version 1.0.1.0.200258.0 | November 8, 2018 |
1.0.1.0 | Engine version 1.0.1.0.200255.0 | October 29, 2018 |
1.0.1.0 | Engine version 1.0.1.0.200237.0 | September 6, 2018 |
1.0.1.0 | Engine version 1.0.1.0.200236.0 | July 24, 2018 |
1.0.1.0 | Engine version 1.0.1.0.200233.0 | June 22, 2018 |
1.0.1.0 | Amazon Neptune initial release | May 30, 2018 |
Amazon Neptune supports the open source Apache TinkerPop Gremlin [2] graph traversal language, openCypher query language [3] for property graphs, and the W3C standard Resource Description Framework's (RDF) SPARQL [4] query language. All three can be used on the same Neptune instance, and allows the user to build queries to navigate highly connected data sets and provides high performance for both graph models. Neptune also uses other AWS product features such as those of Amazon S3, Amazon EC2 and Amazon CloudWatch. [11]
All Amazon Neptune database clusters are created and stored in an Amazon Virtual Private Cloud (VPC), which allows the user to isolate their database in their own private network. Using Neptune's VPC configuration, the user can configure firewall settings to their needs in order to control network access to database instances. Amazon Neptune is integrated with AWS Identity and Access Management (IAM), which allows the user to create AWS IAM groups and control the actions that the groups and other AWS IAM users can do. Neptune allows the user to encrypt databases using keys created through AWS Key Management Service (KMS). A database instance running with Neptune Encryption, encrypts all of the stored data, backups, snapshots and replicas in the same cluster. Amazon Neptune allows the user to use HTTPS to encrypt data during its transfer to and from clients or Neptune service endpoints using Transport Layer Security (TLS). [12]
The data is stored in a cluster volume, a virtual volume utilizing SSD drives. These sizes of these volumes are dynamic, they increase depending how much data is stored in the database, with a maximum of 64 TB. The Amazon Neptune SLA policy is designed to offer a monthly uptime percentage greater that of 99.9%, increasing database performance and availability by integrating the engine with a virtual storage based on SSD drives, that are specially made for database workloads. Neptune maintains copies of the user's data in multiple Availability Zones. In case of failures, Neptune automatically detects any failed segments in a disk volume and repairs them. [13]
Neptune is now generally available in the US East (N. Virginia), US East (Ohio), US West (Oregon), US West (N. California), Canada (Central), [14] AWS GovCloud (US-West), [15] AWS GovCloud (US-East), [16] Europe (Ireland), Europe (London), [17] Europe (Frankfurt), [18] Europe (Stockholm), [19] Europe (Paris), [20] South America (São Paulo), Asia Pacific (Singapore), [21] Asia Pacific (Sydney), [22] Asia Pacific (Tokyo), [23] Asia Pacific (Mumbai), [24] Asia Pacific (Seoul), [25] Asia Pacific (Hong Kong), China (Ningxia), China (Beijing), Operated by NWCD, [26] and Middle-East (Bahrain) [27] AWS regions.
Pricing is determined by AWS regions, with each region having different prices for the available services.
On-Demand Instance Pricing lets the user pay only for what instance they use by the hour instead of a fixed sum. The price also differs depending on the instance class. Similarly, the user only pays for the storage consumed by the database, with no payments in advance. The price is based on the storage rate and I/O rate, which are billed in GB per month and per million request increments respectively. This pricing model is beneficial to short-term and small-scale projects and is available in all AWS regions. The price for backup storage is also billed in per GB-month increments albeit at different rates. Data Transfer is priced in per GB increments, the amount depends on whether the data is transferred in or out of Amazon Neptune and how much data is transferred per month (TB per Month). [28]
On September 12, 2018, it was announced that Neptune achieved HIPAA eligibility [7] enabling it to process data sets containing protected health information (PHI). On December 12, 2018, it was announced that Amazon Neptune was in-scope for Payment Card Industry Data Security Standard, ISO 9001, ISO 27001, ISO 27017, and ISO 27018 compliance programs. [8] The user can use Amazon Neptune in applications that are subject to PCI compliance or require ISO certification. [29] On May 17, 2019, it was announced that Neptune achieved SSAE 16 eligibility [30]
Neptune powers graph use cases such as recommendation engines, fraud detection, knowledge graphs, drug discovery, network security, and social networking. [31]
Some notable customers of Amazon Neptune include, Samsung Electronics, Pearson, Intuit, Siemens, AstraZeneca, FINRA, LifeOmic, Blackfynn, and Amazon Alexa [32]
Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon that provides on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered, pay-as-you-go basis. Clients will often use this in combination with autoscaling. These cloud computing web services provide various services related to networking, compute, storage, middleware, IoT and other processing capacity, as well as software tools via AWS server farms. This frees clients from managing, scaling, and patching hardware and operating systems. One of the foundational services is Amazon Elastic Compute Cloud (EC2), which allows users to have at their disposal a virtual cluster of computers, with extremely high availability, which can be interacted with over the internet via REST APIs, a CLI or the AWS console. AWS's virtual computers emulate most of the attributes of a real computer, including hardware central processing units (CPUs) and graphics processing units (GPUs) for processing; local/RAM memory; Hard-disk(HDD)/SSD storage; a choice of operating systems; networking; and pre-loaded application software such as web servers, databases, and customer relationship management (CRM).
Amazon S3 or Amazon Simple Storage Service is a service offered by Amazon Web Services (AWS) that provides object storage through a web service interface. Amazon S3 uses the same scalable storage infrastructure that Amazon.com uses to run its e-commerce network. Amazon S3 can store any type of object, which allows uses like storage for Internet applications, backups, disaster recovery, data archives, data lakes for analytics, and hybrid cloud storage. AWS launched Amazon S3 in the United States on March 14, 2006, then in Europe in November 2007.
Amazon Elastic Compute Cloud (EC2) is a part of Amazon.com's cloud-computing platform, Amazon Web Services (AWS), that allows users to rent virtual computers on which to run their own computer applications. EC2 encourages scalable deployment of applications by providing a web service through which a user can boot an Amazon Machine Image (AMI) to configure a virtual machine, which Amazon calls an "instance", containing any software desired. A user can create, launch, and terminate server-instances as needed, paying by the second for active servers – hence the term "elastic". EC2 provides users with control over the geographical location of instances that allows for latency optimization and high levels of redundancy. In November 2010, Amazon switched its own retail website platform to EC2 and AWS.
This is a comparison of online backup services.
Cloud computing is the on-demand availability of computer system resources, especially data storage and computing power, without direct active management by the user. Large clouds often have functions distributed over multiple locations, each of which is a data center. Cloud computing relies on sharing of resources to achieve coherence and typically uses a pay-as-you-go model, which can help in reducing capital expenses but may also lead to unexpected operating expenses for users.
Amazon Virtual Private Cloud (VPC) is a commercial cloud computing service that provides a virtual private cloud, by provisioning a logically isolated section of Amazon Web Services (AWS) Cloud. Enterprise customers can access the Amazon Elastic Compute Cloud (EC2) over an IPsec based virtual private network. Unlike traditional EC2 instances which are allocated internal and external IP numbers by Amazon, the customer can assign IP numbers of their choosing from one or more subnets.
A virtual private cloud (VPC) is an on-demand configurable pool of shared resources allocated within a public cloud environment, providing a certain level of isolation between the different organizations (denoted as users hereafter) using the resources. The isolation between one VPC user and all other users of the same cloud (other VPC users as well as other public cloud users) is achieved normally through allocation of a private IP subnet and a virtual communication construct (such as a VLAN or a set of encrypted communication channels) per user. In a VPC, the previously described mechanism, providing isolation within the cloud, is accompanied with a virtual private network (VPN) function (again, allocated per VPC user) that secures, by means of authentication and encryption, the remote access of the organization to its VPC resources. With the introduction of the described isolation levels, an organization using this service is in effect working on a 'virtually private' cloud (that is, as if the cloud infrastructure is not shared with other users), and hence the name VPC.
A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.
Redis is an open-source in-memory storage, used as a distributed, in-memory key–value database, cache and message broker, with optional durability. Because it holds all data in memory and because of its design, Redis offers low-latency reads and writes, making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database, and one of the most popular databases overall. Redis is used in companies like Twitter, Airbnb, Tinder, Yahoo, Adobe, Hulu, Amazon and OpenAI.
Amazon Elastic Block Store (EBS) provides raw block-level storage that can be attached to Amazon EC2 instances and is used by Amazon Relational Database Service (RDS). It is one of the two block-storage options offered by AWS, with the other being the EC2 Instance Store.
Amazon Relational Database Service is a distributed relational database service by Amazon Web Services (AWS). It is a web service running "in the cloud" designed to simplify the setup, operation, and scaling of a relational database for use in applications. Administration processes like patching the database software, backing up databases and enabling point-in-time recovery are managed automatically. Scaling storage and compute resources can be performed by a single API call to the AWS control plane on-demand. AWS does not offer an SSH connection to the underlying virtual machine as part of the managed service.
A cloud database is a database that typically runs on a cloud computing platform and access to the database is provided as-a-service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service, maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.
Amazon S3 Glacier is an online file storage web service that provides storage for data archiving and backup.
Backend as a service (BaaS), sometimes also referred to as mobile backend as a service (MBaaS), is a service for providing web app and mobile app developers with a way to easily build a backend to their frontend applications. Features available include user management, push notifications, and integration with social networking services. These services are provided via the use of custom software development kits (SDKs) and application programming interfaces (APIs). BaaS is a relatively recent development in cloud computing, with most BaaS startups dating from 2011 or later. Some of the most popular service providers are AWS Amplify and Firebase.
Amazon Aurora is a relational database service developed and offered by Amazon Web Services beginning in October 2014. Aurora is available as part of the Amazon Relational Database Service (RDS).
AWS Lambda is an event-driven, serverless computing platform provided by Amazon as a part of Amazon Web Services. It is designed to enable developers to run code without provisioning or managing servers. It executes code in response to events and automatically manages the computing resources required by that code. It was introduced on November 13, 2014.
This is a timeline of Amazon Web Services, which offers a suite of cloud computing services that make up an on-demand computing platform.
Amazon Elastic File System is a cloud storage service provided by Amazon Web Services (AWS) designed to provide scalable, elastic, concurrent with some restrictions, and encrypted file storage for use with both AWS cloud services and on-premises resources. Amazon EFS is built to be able to grow and shrink automatically as files are added and removed. Amazon EFS supports Network File System (NFS) versions 4.0 and 4.1 (NFSv4) protocol, and control access to files through Portable Operating System Interface (POSIX) permissions.
Blazegraph is an open source triplestore and graph database, developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL.
Amazon DocumentDB is a managed proprietary NoSQL database service that supports document data structures, with some compatibility with MongoDB version 3.6 and version 4.0. As a document database, Amazon DocumentDB can store, query, and index JSON data. It is available on Amazon Web Services. As of March 2023, AWS introduced some compliance with MongoDB 5.0 but lacks time series collection support.