Postgres-XL

Repository: git.postgresql.org/gitweb/?p=postgres-xl.git
Type: Distributed relational database management system
License: PostgreSQL License [1]
Website: www.postgres-xl.org

Postgres-XL is a distributed relational database management system (RDBMS) based on PostgreSQL. It aims to provide feature parity with PostgreSQL while distributing the workload over a cluster. The "XL" in the name stands for "eXtensible Lattice". [2]


Postgres-XL is based on Postgres-XC, an earlier distributed PostgreSQL system developed by NTT Data and EnterpriseDB. [3] In 2012, the cloud database startup StormDB [4] adopted Postgres-XC and developed proprietary extensions and improvements to it. [5] In 2013, StormDB was acquired by TransLattice, [5] [6] and the improved software was released as open source under the name "Postgres-XL" in 2014. [7] [8] Since 2015, 2ndQuadrant has also supported Postgres-XL development. [3]

Postgres-XL provides cluster-wide consistent transaction snapshots [9] [10] via a central Global Transaction Manager (GTM) node. Because it requires a fast interconnect between nodes, it is not suited to geographically distributed clusters. Larger queries can be split and run in parallel across multiple nodes. [10] Each table can be configured either to be fully replicated across the cluster (usually for smaller tables) or to be sharded across separate nodes (for write scalability). [11]
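
The difference between a replicated table and a sharded table can be illustrated with a small Python model. This is only a sketch under stated assumptions, not Postgres-XL's implementation: the `ToyCluster` class, the node names, the table names, and the use of CRC32 as the hash function are all hypothetical and chosen for illustration. A replicated table writes every row to every node, so any single node can answer a read; a sharded table writes each row to exactly one node chosen by hashing a key column, so writes are spread across the cluster.

```python
import zlib
from collections import defaultdict


class ToyCluster:
    """Toy model of a cluster holding replicated and hash-sharded tables."""

    def __init__(self, node_names):
        self.node_names = list(node_names)
        # storage[node][table] is the list of rows held by that node.
        self.storage = {n: defaultdict(list) for n in self.node_names}
        # Maps table name to its shard key column, or None if replicated.
        self.shard_key = {}

    def create_table(self, table, shard_key=None):
        """Register a table; omit shard_key to make it fully replicated."""
        self.shard_key[table] = shard_key

    def _node_for(self, value):
        # CRC32 is an arbitrary stable hash picked for this sketch.
        h = zlib.crc32(repr(value).encode("utf-8"))
        return self.node_names[h % len(self.node_names)]

    def insert(self, table, row):
        """Store a row and return the list of nodes that received it."""
        key = self.shard_key[table]
        if key is None:
            targets = list(self.node_names)        # replicated: every node
        else:
            targets = [self._node_for(row[key])]   # sharded: exactly one node
        for node in targets:
            self.storage[node][table].append(dict(row))
        return targets

    def nodes_to_query(self, table, key_value=None):
        """Return the nodes a read must visit."""
        key = self.shard_key[table]
        if key is None:
            return [self.node_names[0]]            # any one copy is complete
        if key_value is not None:
            return [self._node_for(key_value)]     # lookup by shard key
        return list(self.node_names)               # full scan touches all shards


cluster = ToyCluster(["dn1", "dn2", "dn3"])
cluster.create_table("countries")                  # small lookup table
cluster.create_table("orders", shard_key="order_id")

cluster.insert("countries", {"code": "NZ", "name": "New Zealand"})
for i in range(30):
    cluster.insert("orders", {"order_id": i, "total": i * 10})

for node in cluster.node_names:
    print(node,
          len(cluster.storage[node]["countries"]),
          len(cluster.storage[node]["orders"]))
```

In this model, a read of `countries` needs only one node, a lookup of a single order by `order_id` also needs only one node, and a scan of all orders must visit every node, which is the kind of query that can be split and run in parallel.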

References

  1. "Frequently Asked Questions - Postgres-XL". www.postgres-xl.org.
  2. "Frequently Asked Questions". Postgres-XL. Retrieved 31 May 2017.
  3. "A Brief History of PostgreSQL: Postgres-XL". Postgres-XL documentation. Retrieved 31 May 2017.
  4. Butler, Brandon (4 June 2014). "10 of the most useful cloud databases". Network World. Archived from the original on 26 December 2015.
  5. Clark, Jack (10 October 2013). "TransLattice gobbles StormDB upstart for Postgres threesome". The Register.
  6. Harris, Derrick (9 October 2013). "Cloud database consolidation as TransLattice buys StormDB". Gigaom.
  7. Clark, Jack (13 May 2014). "'Behold my creation: Postgres-XL' bellows TransLattice". The Register.
  8. Harris, Derrick (13 May 2014). "Database vendor open sources Postgres-XL for scale-out workloads". Gigaom.
  9. "TransLattice Unveils Postgres-XL Open Source Database for OLTP and Big Data Analytics". Database Trends and Applications. 16 May 2014.
  10. Baker, Jason (13 May 2014). "Postgres-XL released to tackle big data analytics and more". Opensource.com.
  11. Riggs, Simon; Ciolli, Gianni; Krosing, Hannu; Bartolini, Gabriele (30 April 2015). PostgreSQL 9 Administration Cookbook - Second Edition. Packt Publishing Ltd. p. 418. ISBN 9781849519076.