Amazon Redshift

Last updated
Amazon Redshift
Developer(s) Amazon.com
Initial releaseOctober 2012;11 years ago (2012-10)
Operating system Cross-platform
Available inEnglish
License Proprietary
Website aws.amazon.com/redshift/

Amazon Redshift is a data warehouse product which forms part of the larger cloud-computing platform Amazon Web Services. [1] It is built on top of technology from the massive parallel processing (MPP) data warehouse company ParAccel (later acquired by Actian), [2] to handle large scale data sets and database migrations. [3] Redshift differs from Amazon's other hosted database offering, Amazon RDS, in its ability to handle analytic workloads on big data data sets stored by a column-oriented DBMS principle. Redshift allows up to 16 petabytes of data on a cluster [4] compared to Amazon RDS Aurora's maximum size of 128 tebibytes. [5]

Contents

Amazon Redshift is based on an older version of PostgreSQL 8.0.2, and Redshift has made changes to that version. [6] [7] An initial preview beta was released in November 2012 [8] and a full release was made available on February 15, 2013. The service can handle connections from most other applications using ODBC and JDBC connections. [9] According to Cloud Data Warehouse report published by Forrester in Q4 2018, Amazon Redshift has the largest number of Cloud data warehouse deployments, with more than 6,500 deployments. [10]

Redshift uses parallel-processing and compression to decrease command execution time. [11] This allows Redshift to perform operations on billions of rows at once. [12] This also makes Redshift useful for storing and analyzing large quantities of data from logs or live feeds through a source such as Amazon Kinesis Data Firehose. [12]

Amazon has listed a number of business intelligence software proprietors as partners and tested tools in their "APN Partner" program, [13] including Actian, Actuate Corporation, Alteryx, Dundas Data Visualization, IBM Cognos, InetSoft, Infor, Logi Analytics, Looker, MicroStrategy, Pentaho, [14] [15] Qlik, SiSense, Tableau Software, and Yellowfin. Partner companies providing data integration tools include Informatica and SnapLogic. System integration and consulting partners include Accenture, Deloitte, Capgemini and DXC Technology.

The "Red" in Redshift's name alludes to Oracle, a competing computer technology company sometimes informally referred to as "Big Red" due to its red corporate color. Hence, customers choosing to move their databases from Oracle to Redshift would be "shifting" from "Red". [16]

See also

Related Research Articles

<span class="mw-page-title-main">PostgreSQL</span> Free and open-source object relational database management system

PostgreSQL, also known as Postgres, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance. PostgreSQL features transactions with atomicity, consistency, isolation, durability (ACID) properties, automatically updatable views, materialized views, triggers, foreign keys, and stored procedures. It is supported on all major operating systems, including Linux, FreeBSD, OpenBSD, macOS, and Windows, and handles a range of workloads from single machines to data warehouses or web services with many concurrent users.

<span class="mw-page-title-main">Ingres (database)</span> Database software

Ingres Database is a proprietary SQL relational database management system intended to support large commercial and government applications.

In database computing, Oracle Real Application Clusters (RAC) — an option for the Oracle Database software produced by Oracle Corporation and introduced in 2001 with Oracle9i — provides software for clustering and high availability in Oracle database environments. Oracle Corporation includes RAC with the Enterprise Edition, provided the nodes are clustered using Oracle Clusterware.

<span class="mw-page-title-main">EnterpriseDB</span> American software company

EnterpriseDB (EDB), a privately held company based in Massachusetts, provides software and services based on the open-source database PostgreSQL, and is one of the largest contributors to Postgres. EDB develops and integrates performance, security, and manageability enhancements into Postgres to support enterprise-class workloads. EDB has also developed database compatibility for Oracle to facilitate the migration of workloads from Oracle to EDB Postgres and to support the operation of many Oracle workloads on EDB Postgres.

ER/Studio is data architecture and database design software developed by IDERA, Inc. ER/Studio is compatible with multiple database platforms and is used to create and manage database designs, as well as to document and reuse data assets. In 2015, Embarcadero Technologies was acquired by database and infrastructure management software company IDERA, Inc. Since the acquisition by IDERA, Inc., ER/Studio has been renamed to ER/Studio Data Architect with updated features.

<span class="mw-page-title-main">Navicat</span> SQL database management software

Navicat is a series of graphical database management and development software produced by CyberTech Ltd. for MySQL, MariaDB, Redis, MongoDB, Oracle, SQLite, PostgreSQL and Microsoft SQL Server. It has an Explorer-like graphical user interface and supports multiple database connections for local and remote databases. Its design is made to meet the needs of a variety of audiences, from database administrators and programmers to various businesses/companies that serve clients and share information with partners.

<span class="mw-page-title-main">Netezza</span> Provider of Integrated Data Warehouse Hardware and Software

IBM Netezza is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for the most demanding analytic uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.

<span class="mw-page-title-main">Greenplum</span>

Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired by EMC Corporation in July 2010.

ParAccel, Inc. was a California-based software company.

Amazon Relational Database Service is a distributed relational database service by Amazon Web Services (AWS). It is a web service running "in the cloud" designed to simplify the setup, operation, and scaling of a relational database for use in applications. Administration processes like patching the database software, backing up databases and enabling point-in-time recovery are managed automatically. Scaling storage and compute resources can be performed by a single API call to the AWS control plane on-demand. AWS does not offer an SSH connection to the underlying virtual machine as part of the managed service.

A cloud database is a database that typically runs on a cloud computing platform and access to the database is provided as-a-service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service, maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.

<span class="mw-page-title-main">Actian</span> American software company

Actian is an American software company headquartered in Santa Clara, California that provides analytics-related software, products, and services. The company sells database software and technology, cloud engineered systems, and data integration solutions.

Amazon Aurora is a relational database service developed and offered by Amazon Web Services beginning in October 2014. Aurora is available as part of the Amazon Relational Database Service (RDS).

Presto is a distributed query engine for big data using the SQL query language. Its architecture allows users to query data sources such as Hadoop, Cassandra, Kafka, AWS S3, Alluxio, MySQL, MongoDB and Teradata, and allows use of multiple data sources within a query. Presto is community-driven open-source software released under the Apache License.

Serverless computing is a cloud computing execution model in which the cloud provider allocates machine resources on demand, taking care of the servers on behalf of their customers. "Serverless" is a misnomer in the sense that servers are still used by cloud service providers to execute code for developers. However, developers of serverless applications are not concerned with capacity planning, configuration, management, maintenance, fault tolerance, or scaling of containers, VMs, or physical servers. Serverless computing does not hold resources in volatile memory; computing is rather done in short bursts with the results persisted to storage. When an app is not in use, there are no computing resources allocated to the app. Pricing is based on the actual amount of resources consumed by an application. It can be a form of utility computing.

Kyvos is a business intelligence acceleration platform for cloud and big data platforms developed by an American privately held company named Kyvos Insights. The company, headquartered in Los Gatos, California, was founded by Praveen Kankariya, CEO of Impetus Technologies. The software provides OLAP-based multidimensional analysis on big data and cloud platforms and was launched officially in June 2015. In December the same year, the company was listed among the 10 Coolest Big Data Startups of 2015 by CRN Magazine.

Amazon DocumentDB is a managed proprietary NoSQL database service that supports document data structures, with some compatibility with MongoDB version 3.6 and version 4.0. As a document database, Amazon DocumentDB can store, query, and index JSON data. It is available on Amazon Web Services. As of March 2023, AWS introduced some compliance with MongoDB 5.0 but lacks time series collection support.

Yellowbrick Data is a US-based database company delivering massively parallel processing (MPP) data warehouse and SQL analytics products. The company is headquartered in Mountain View, California.

HammerDB is an open source database benchmarking application developed by Steve Shaw. HammerDB supports databases such as Oracle, SQL Server, Db2, MySQL and MariaDB. HammerDB is written in TCL and C, and is licensed under the GPL v3.

References

  1. "Bye-bye, Big Red? Escaping Oracle's not that easy". infoworld.com. November 4, 2013. Archived from the original on January 15, 2020. Retrieved January 15, 2020.
  2. "Amazon Redshift: ParAccel in, costly appliances out". ZDNet. Archived from the original on June 5, 2013. Retrieved July 8, 2013.
  3. "Improve data processing performance on AWS Redshift by 200%". Ardentisys.com. Archived from the original on November 15, 2019. Retrieved April 14, 2017.
  4. "Amazon Redshift FAQs - Cloud Data Warehouse - Amazon Web Services". Amazon Web Services, Inc. Archived from the original on September 17, 2020. Retrieved September 20, 2020.
  5. "Quotas and constraints for Amazon Aurora". Archived from the original on August 27, 2021. Retrieved April 1, 2022. An Aurora cluster volume can grow to a maximum size of 128 tebibytes (TiB)
  6. "Redshift and PostgreSQL". AWS. Amazon. Archived from the original on November 15, 2019. Retrieved July 26, 2015.
  7. "Unsupported PostgreSQL features". AWS. Amazon. Archived from the original on November 15, 2019. Retrieved July 14, 2016.
  8. "Amazon Debuts Low-Cost, Big Data Warehousing". Information Week. Archived from the original on May 31, 2013. Retrieved July 8, 2013.
  9. Louwers, Johan (February 1, 2014). "Amazon Redshift cloud based data warehouse service". johanlouwers.blogspot.co.uk. Archived from the original on October 13, 2017. Retrieved February 2, 2014..
  10. Little, Cinny. "The Forrester Wave™: Cloud Data Warehouse Solutions, Q4 2018" (PDF). Forrester. Retrieved December 18, 2020.[ permanent dead link ]
  11. "System and architecture overview - Amazon Redshift". docs.aws.amazon.com. Archived from the original on October 16, 2020. Retrieved October 1, 2020.
  12. 1 2 "4 Amazon Redshift Use Cases: Collect, store, analyze & share data". intermix.io. August 26, 2020. Archived from the original on October 31, 2020. Retrieved October 1, 2020.
  13. "Amazon Redshift Partners", AWS Partner Network, Amazon, February 6, 2017, archived from the original on February 6, 2017.
  14. "APN - Amazon Redshift Partners - Pentaho". AWS. Amazon. Archived from the original on November 15, 2019. Retrieved February 6, 2017. Pentaho has certified its business analytics and data integration platform to work with Amazon Redshift.
  15. "Amazon Web Services". Pentaho.com. Archived from the original on April 1, 2018. Retrieved February 6, 2017. then transformed, refined, and immediately pushed into Amazon Redshift.
  16. Krazit, Tom (January 2, 2018). "Amazon Web Services reportedly named its cloud database RedShift in order to tweak Oracle". GeekWire. Archived from the original on March 9, 2022. Retrieved March 9, 2022.