Developer(s) | Amazon.com |
---|---|
Initial release | October 2012 |
Operating system | Cross-platform |
Available in | English |
License | Proprietary |
Website | aws |
Amazon Redshift is a data warehouse product that forms part of the larger cloud-computing platform Amazon Web Services. [1] It is built on technology from the massively parallel processing (MPP) data warehouse company ParAccel (later acquired by Actian), [2] and is designed to handle large-scale data sets and database migrations. [3] Redshift differs from Amazon's other hosted database offering, Amazon RDS, in its ability to handle analytic workloads on large data sets, which it stores in a column-oriented layout. Redshift allows up to 16 petabytes of data on a cluster. [4]
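The difference between a row-oriented and a column-oriented layout can be sketched in a few lines of Python. This is a toy illustration only, not Redshift's storage format; the table contents and the helper `to_columns` are hypothetical names made up for the example.

```python
from typing import Any, Dict, List

# Row-oriented layout: every field of a record is stored together.
# (Hypothetical sample data, used only for this illustration.)
rows: List[Dict[str, Any]] = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 75.5},
    {"order_id": 3, "region": "EU", "amount": 30.0},
]


def to_columns(records: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Pivot a list of records into one list of values per column.

    Assumes every record has the same set of keys.
    """
    columns: Dict[str, List[Any]] = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


# Column-oriented layout: all values of one column are stored together.
columns = to_columns(rows)

# An analytic query such as "total amount" touches a single column here,
# whereas the row layout has to visit every field of every record.
total_from_columns = sum(columns["amount"])
total_from_rows = sum(record["amount"] for record in rows)

print(columns["region"])
print(total_from_columns, total_from_rows)
```

Both totals are the same; the point of the columnar form is that an aggregate over one attribute reads only that attribute's values, which is the access pattern analytic workloads tend to have.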
Amazon Redshift is based on PostgreSQL 8.0.2, an older release of that database, to which Amazon has made changes. [5] [6] An initial preview beta was released in November 2012 [7] and a full release was made available on February 15, 2013. The service accepts connections from most other applications over ODBC and JDBC. [8] According to the Cloud Data Warehouse report published by Forrester in Q4 2018, Amazon Redshift had the largest number of cloud data warehouse deployments, with more than 6,500 deployments. [9]
Redshift uses parallel processing and compression to decrease command execution time. [10] This allows Redshift to perform operations on billions of rows at once. [11] This also makes Redshift useful for storing and analyzing large quantities of data from logs or live feeds through a source such as Amazon Kinesis Data Firehose. [11]
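As a rough illustration of why compression and parallel processing help, the sketch below run-length encodes a repetitive column and computes a sum by splitting the work into chunks whose partial results are then combined. It is a minimal sketch under stated assumptions: run-length encoding is just one example of a column encoding, Python threads stand in for the nodes of an MPP cluster (they do not give real CPU parallelism here), and all function names are hypothetical rather than part of any Redshift API.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import groupby
from typing import List, Sequence, Tuple


def rle_encode(values: Sequence[str]) -> List[Tuple[str, int]]:
    """Run-length encode a column: consecutive repeats become (value, count)."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def rle_decode(runs: Sequence[Tuple[str, int]]) -> List[str]:
    """Expand (value, count) pairs back into the original column."""
    return [value for value, count in runs for _ in range(count)]


def partial_sum(chunk: Sequence[int]) -> int:
    """Work done by one worker: aggregate only its own slice of the data."""
    return sum(chunk)


def parallel_sum(values: Sequence[int], workers: int = 4) -> int:
    """Split the column into chunks, sum each chunk, then combine the results."""
    size = max(1, -(-len(values) // workers))  # ceiling division
    chunks = [values[i:i + size] for i in range(0, len(values), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(partial_sum, chunks))


# A repetitive column compresses to a handful of runs.
status = ["ok"] * 5 + ["error"] * 2 + ["ok"] * 3
encoded = rle_encode(status)

# Each worker sums its own slice; the partial sums are combined at the end.
amounts = list(range(1, 1001))
total = parallel_sum(amounts)

print(encoded)
print(total)
```

The ten-value `status` column shrinks to three runs, and the sum is computed from four independent partial sums. Less data to read and work that divides cleanly across workers are the two effects the paragraph above describes.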
Amazon has listed a number of business intelligence software vendors as partners and tested tools in its "APN Partner" program, [12] including Actian, Actuate Corporation, Alteryx, Dundas Data Visualization, IBM Cognos, InetSoft, Infor, Logi Analytics, Looker, MicroStrategy, Pentaho, [13] [14] Qlik, SiSense, Tableau Software, and Yellowfin. Partner companies providing data integration tools include Informatica and SnapLogic. System integration and consulting partners include Accenture, Deloitte, Capgemini and DXC Technology.
The "Red" in Redshift's name alludes to Oracle, a competing computer technology company sometimes informally referred to as "Big Red" due to its red corporate color. Hence, customers choosing to move their databases from Oracle to Redshift would be "shifting" from "Red". [15]
PostgreSQL, also known as Postgres, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance. PostgreSQL features transactions with atomicity, consistency, isolation, durability (ACID) properties, automatically updatable views, materialized views, triggers, foreign keys, and stored procedures. It is supported on all major operating systems, including Windows, Linux, macOS, FreeBSD, and OpenBSD, and handles a range of workloads from single machines to data warehouses, data lakes, or web services with many concurrent users.
Ingres Database is a proprietary SQL relational database management system intended to support large commercial and government applications.
Db2 is a family of data management products, including database servers, developed by IBM. It initially supported the relational model, but was extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB2 until 2017, when it changed to its present form.
Oracle Database is a proprietary multi-model database management system produced and marketed by Oracle Corporation.
EnterpriseDB (EDB), a privately held company based in Massachusetts, provides software and services based on the open-source database PostgreSQL, and is one of the largest contributors to Postgres. EDB develops and integrates performance, security, and manageability enhancements into Postgres to support enterprise-class workloads. EDB has also developed database compatibility for Oracle to facilitate the migration of workloads from Oracle to EDB Postgres and to support the operation of many Oracle workloads on EDB Postgres.
ER/Studio is data architecture and database design software developed by IDERA, Inc. ER/Studio is compatible with multiple database platforms and is used to create and manage database designs, as well as to document and reuse data assets. In 2015, Embarcadero Technologies was acquired by database and infrastructure management software company IDERA, Inc. Since the acquisition by IDERA, Inc., ER/Studio has been renamed to ER/Studio Data Architect with updated features.
Navicat is a series of graphical database management and development tools produced by CyberTech Ltd. for MySQL, MariaDB, Redis, MongoDB, Oracle, SQLite, PostgreSQL and Microsoft SQL Server. It has an Explorer-like graphical user interface and supports multiple database connections for local and remote databases. It is designed to meet the needs of a variety of audiences, from database administrators and programmers to businesses that serve clients and share information with partners.
IBM Netezza is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for the most demanding analytic uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.
Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created around 2005 by a company of the same name headquartered in San Mateo, California. Greenplum was acquired by EMC Corporation in July 2010.
ParAccel, Inc. was a California-based software company.
Amazon Relational Database Service is a distributed relational database service by Amazon Web Services (AWS). It is a web service running "in the cloud" designed to simplify the setup, operation, and scaling of a relational database for use in applications. Administration processes like patching the database software, backing up databases and enabling point-in-time recovery are managed automatically. Scaling storage and compute resources can be performed by a single API call to the AWS control plane on-demand. AWS does not offer an SSH connection to the underlying virtual machine as part of the managed service.
A cloud database is a database that typically runs on a cloud computing platform, with access to it provided as a service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.
Trifacta is a privately owned software company headquartered in San Francisco with offices in Bangalore, Boston, Berlin and London. The company was founded in October 2012 and primarily develops data wrangling software for data exploration and self-service data preparation on cloud and on-premises data platforms.
Actian is an American software company headquartered in Santa Clara, California that provides analytics-related software, products, and services. The company sells database software and technology, cloud engineered systems, and data integration solutions.
Kyvos is a business intelligence acceleration platform for cloud and big data platforms developed by an American privately held company named Kyvos Insights. The company, headquartered in Los Gatos, California, was founded by Praveen Kankariya, CEO of Impetus Technologies. The software provides OLAP-based multidimensional analysis on big data and cloud platforms and was launched officially in June 2015. In December the same year, the company was listed among the 10 Coolest Big Data Startups of 2015 by CRN Magazine.
Amazon DocumentDB is a managed proprietary NoSQL database service that supports document data structures, with some compatibility with MongoDB version 3.6 and version 4.0. As a document database, Amazon DocumentDB can store, query, and index JSON data. It is available on Amazon Web Services. As of March 2023, AWS has introduced some compliance with MongoDB 5.0, but the service lacks time series collection support.
Yellowbrick Data is a US-based database company delivering massively parallel processing (MPP) data warehouse and SQL analytics products. The company is headquartered in Mountain View, California.
HammerDB is an open source database benchmarking application developed by Steve Shaw. HammerDB supports databases such as Oracle, SQL Server, Db2, MySQL and MariaDB. HammerDB is written in TCL and C, and is licensed under the GPL v3.
Pentaho has certified its business analytics and data integration platform to work with Amazon Redshift.