Netezza

Last updated
Netezza
Company typeSubsidiary of IBM
Industry Data warehousing
Founded1999
Headquarters Marlborough, Massachusetts, United States
ProductsData Warehouse Appliance
Integrated Data Warehouse Hardware and Software
Professional Services
Customer Services
RevenueIncrease2.svg US$190.6 million (FY 2010)
Number of employees
469 (2010) [1]
Parent IBM
Website www.netezza.com
Netezza Massive Parallel Processing Data Warehouse Appliance Netezza.jpeg
Netezza Massive Parallel Processing Data Warehouse Appliance

IBM Netezza (pronounced ne-teez-a) is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.

Contents

Netezza was acquired by IBM on September 20, 2010 [2] IBM released 3 generations of Netezza Appliances (Twinfin, Striper, Mako) where it was later reintroduced in June 2019 as a fourth generation NPS, part of the IBM CloudPak for Data offering (Hammerhead). [3] [4]

History

Netezza was founded in 1999 by Foster Hinshaw. In 2000 Jit Saxena joined Hinshaw as co-founder. The company was incorporated in Delaware on December 30, 1999 as Intelligent Data Engines, Inc. and changed its name to Netezza Corporation in November 2000. Netezza announced the industry's first "data warehouse appliance" in 2003 [5] to meet the industry's need to make use of the rapidly increasing ability to collect consumer data. In July 2007, Netezza Corporation had its initial public offering under the ticker “NZ” on NYSE Arca. [6] [7]

Hinshaw coined the term "data warehouse appliance" to describe a product of shared nothing parallel nodes specifically targeted for high data volumes for modern data analytics. [8] [9] He left Netezza to found Dataupia in 2005. [10]

Netezza software was based on PostgreSQL 7.2, [11] but did not maintain compatibility.

Jim Baum was appointed CEO of Netezza in January, 2008 after co-founder Jit Saxena announced his retirement. Baum started at Netezza as chief operating officer in 2006. Prior to joining Netezza, Baum was president and CEO of Endeca in Boston for five years. [12] [13]

IBM and Netezza on September 20, 2010 announced they entered into a definitive agreement for IBM to acquire Netezza in a cash transaction at a price of $27 per share or at a net price of approximately $1.7 billion, after adjusting for cash. [2]

In 2020, IMB Netezza and Yellowbrick join in a partnership.[ citation needed ]

In March 2023, the U.S. Navy chose to partner with Yellowbrick Data, with their U.S. Naval Supply Systems Command (NAVSUP) to modernize and accelerate their data strategy. [14] Then in August, AWS and IBM Netezza picked up a table format from Apache Iceberg which would extend the reach of data lakes. [15]

Products

TwinFin, Netezza’s primary product, is designed for rapid analysis of data volumes scaling into petabytes. The company introduced the fourth generation of the TwinFin product in August 2009. [1] Netezza introduced a scaled-down version of this appliance under the Skimmer brand in January 2010. [16]

In February 2010, Netezza announced that it had opened up its systems to support major programming models, including Hadoop, MapReduce, Java, C++, and Python models. Netezza's partners predicted to leverage this analytic application support are Tibco Spotfire, MicroStrategy, Pursway, DemandTec and QuantiSense.

The company also markets specialized appliances for retail, spatial, complex analytics and regulatory compliance needs. Netezza sells software-based products for migrating from Oracle Exadata and for implementing data virtualization and federation (data abstraction) schemes.

The Netezza appliance was the foundation of IBM Db2 Analytics Accelerator (IDAA). [17]

In 2012 the products were re-branded as IBM PureData for Analytics. [18]

In 2017, IBM replaced Netezza with the Integrated Analytics System [19] using Power-8 processing frame and Db2 as the database engine in an offering called Db2 Warehouse. It featured both row-based and columnar storage plus high-speed flash drives. The Db2 Warehouse engine runs both on the cloud or on-prem.

In 2019, after acquiring Red Hat, IBM established Cloud Pak offerings based on OpenShift, and revived Netezza as Netezza Performance Server under Cloud Pak for Data, both of which could run on-prem or on the cloud. The offering is a 64-bit NPS with flash drives and optimized FPGAs. The revived NPS is 100 percent identical in feature compatibility to Netezza Mako, and moving to this platform required only an nzmigrate or nzbackup/restore. [20]

In 2020, the first Netezza Performance Server in the cloud was GA on Amazon Web Services. This offering uses the actual AMPP Netezza Hardware, not commodity hardware running Netezza software. Migrating to this platform also requires only an nzmigrate or nzbackup/restore through an S3 bucket. It is a direct competitor to Amazon's Red Shift database. It is also available in Azure and IBM Cloud. [20]

Technology

Netezza’s proprietary AMPP (Asymmetric Massively Parallel Processing) architecture is a two-tiered system designed to quickly handle very large queries from multiple users.

The first tier is a high-performance Linux SMP host that compiles data query tasks received from business intelligence applications, and generates query execution plans. It then divides a query into a sequence of sub-tasks, or snippets that can be executed in parallel, and distributes the snippets to the second tier for execution.

The second tier consists of one to hundreds of snippet processing blades, or S-Blades, where all the primary processing work of the appliance is executed. The S-Blades are intelligent processing nodes that make up the massively parallel processing (MPP) engine of the appliance. Each S-Blade is an independent server that contains multi-core Intel-based CPUs and Netezza’s proprietary multi-engine, high-throughput FPGAs. The S-Blade is composed of a standard blade-server combined with a special Netezza Database Accelerator card that snaps alongside the blade. Each S-Blade is, in turn, connected to multiple disk drives processing multiple data streams in parallel in TwinFin or Skimmer.

AMPP employs industry-standard interfaces (SQL, ODBC, JDBC, OLE DB) and provides load times in excess of 2 TB/hour and backup/restore data rates of more than 4 TB/hour.

In 2009, the company transitioned from PowerPC processors to Intel CPUs. [21] In August, 2009, with the introduction of the 4th generation TwinFin product, Netezza moved from proprietary blades to IBM blades.

Recognition and criticism

Netezza was added to Gartner’s Magic Quadrant for DBMS in January, 2009. [22]

Related Research Articles

<span class="mw-page-title-main">Informix</span> Database management software product family

Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) and multi-model database offerings. The Informix products were originally developed by Informix Corporation, whose Informix Software subsidiary was acquired by IBM in 2001.

<span class="mw-page-title-main">IBM Db2</span> Relational model database server

Db2 is a family of data management products, including database servers, developed by IBM. It initially supported the relational model, but was extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB2 until 2017, when it changed to its present form.

IBM Storage Protect is a data protection platform that gives enterprises a single point of control and administration for backup and recovery. It is the flagship product in the IBM Spectrum Protect family.

In IBM System z9 and successor mainframes, the System z Integrated Information Processor (zIIP) is a special purpose processor. It was initially introduced to relieve the general mainframe central processors (CPs) of specific Db2 processing loads, but currently is used to offload other z/OS workloads as described below. The idea originated with previous special purpose processors, the zAAP, which offloads Java processing, and the IFL, which runs Linux and z/VM but not other IBM operating systems such as z/OS, DOS/VSE and TPF. A System z PU is "characterized" as one of these processor types, or as a CP, or SAP. These processors do not contain microcode or hardware features that accelerate their designated workloads. Instead, by relieving the general CP of particular workloads, they often lead to a higher workload throughput at reduced license fees.

Business intelligence software is a type of application software designed to retrieve, analyze, transform and report data for business intelligence. The applications generally read data that has been previously stored, often - though not necessarily - in a data warehouse or data mart.

In database computing, Oracle Real Application Clusters (RAC) — an option for the Oracle Database software produced by Oracle Corporation and introduced in 2001 with Oracle9i — provides software for clustering and high availability in Oracle database environments. Oracle Corporation includes RAC with the Enterprise Edition, provided the nodes are clustered using Oracle Clusterware.

IBM System Management Facility (SMF) is a component of IBM's z/OS for mainframe computers, providing a standardised method for writing out records of activity to a file (or data set to use a z/OS term). SMF provides full "instrumentation" of all baseline activities running on that IBM mainframe operating system, including I/O, network activity, software usage, error conditions, processor utilization, etc.

The IBM Data Warehousing Balanced Configuration Unit is a family of data warehousing servers from IBM. IBM introduced the Balanced Configuration Unit (BCU) for AIX in 2005, and the BCU for Linux in 2006. The BCU is a "balanced" combination of computer server hardware combined with DB2 Data Warehouse Edition software to form a data warehouse "appliance like" system to compete with systems such as Greenplum, DATAllegro, Netezza Performance Server, and Teradata.

IBM Unica NetInsight was a web analytics application that utilized an Extract, transform, load methodology to populate a database that could then be queried using a browser-based interface. NetInsight is from the same family of tools as Unica NetTracker. In April 2014, IBM announced Unica NetInsight would be discontinued.

In computing, the term data warehouse appliance (DWA) was coined by Foster Hinshaw for a computer architecture for data warehouses (DW) specifically marketed for big data analysis and discovery that is simple to use and has a high performance for the workload. A DWA includes an integrated set of servers, storage, operating systems, and databases.

In computing, the SAP BW Accelerator is a computer appliance - preinstalled software on predefined hardware - which is used to speed up OLAP queries. The software was initially known as the BI Accelerator.

Dataupia was a supplier of data warehouse appliances. Dataupia focuses on data warehousing for applications running on Oracle, Microsoft SQL Server databases. Dataupia's Satori Server included server computers, storage, and software.

<span class="mw-page-title-main">Greenplum</span>

Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired by EMC Corporation in July 2010.

ParAccel, Inc. was a California-based software company.

In-database processing, sometimes referred to as in-database analytics, refers to the integration of data analytics into data warehousing functionality. Today, many large databases, such as those used for credit card fraud detection and investment bank risk management, use this technology because it provides significant performance improvements over traditional methods.

<span class="mw-page-title-main">PureSystems</span> Family of computer systems

PureSystems is an IBM product line of factory pre-configured components and servers also being referred to as an "Expert Integrated System". The centrepiece of PureSystems is the IBM Flex System Manager in tandem with the so-called "Patterns of Expertise" for the automated configuration and management of PureSystems.

HP ConvergedSystem is a portfolio of system-based products from Hewlett-Packard (HP) that integrates preconfigured IT components into systems for virtualization, cloud computing, big data, collaboration, converged management, and client virtualization. Composed of servers, storage, networking, and integrated software and services, the systems are designed to address the cost and complexity of data center operations and maintenance by pulling the IT components together into a single resource pool so they are easier to manage and faster to deploy. Where previously it would take three to six months from the time of order to get a system up and running, it now reportedly takes as few as 20 days with the HP ConvergedSystem.

<span class="mw-page-title-main">SAP HANA</span> Database management system by SAP

SAP HANA is an in-memory, column-oriented, relational database management system developed and marketed by SAP SE. Its primary function as the software running a database server is to store and retrieve data as requested by the applications. In addition, it performs advanced analytics and includes extract, transform, load (ETL) capabilities as well as an application server.

Yellowbrick Data is a US-based database company delivering massively parallel processing (MPP) data warehouse and SQL analytics products. The company is headquartered in Mountain View, California.

References

  1. 1 2 Dignan, Larry (27 Aug 2010). "Netezza's TwinFin fuels profit surge". ZDNet. Retrieved 10 Aug 2023.
  2. 1 2 "IBM to buy analytics company Netezza for $1.7 billion". Reuters. 21 September 2010. Retrieved 10 Aug 2023.
  3. "What happened to Netezza?". www.ibm.com. 2020-05-28. Retrieved 2020-10-09.
  4. "Netezza Database | How does Netezza Database work with Examples?". EDUCBA. 2022-04-29. Retrieved 2023-08-17.
  5. "Netezza Performance Server (NPS™) 8000 Series". Product web page. Netezza. Archived from the original on February 3, 2004. Retrieved August 16, 2013.
  6. "sv1". www.sec.gov.
  7. Vance, Ashlee (21 July 2007). "Netezza nets plenty of cash in IPO". www.theregister.com. Retrieved 10 August 2023.
  8. Steve Norall (May 18, 2007). "Introducing 'data warehouse appliances'". Infostor. Retrieved April 3, 2017.
  9. "Still Another Data Warehouse Appliance Is Coming!". www.tdwi.org. 23 May 2007. Retrieved 10 August 2023.
  10. Wade Roush (November 17, 2009). "Foster Hinshaw Back in Command at Dataupia; News of Company's Death Greatly Exaggerated, He Says". XConomy. Retrieved April 3, 2017.
  11. "Elephant Roads: a tour of Postgres forks". October 6, 2010.
  12. "Netezza CEO Baum guides data storage firm through downturn". Mass High Tech. 30 August 2010.
  13. "NETEZZA NAMES JIM BAUM PRESIDENT AND COO" (Press release). Netezza. August 1, 2006.
  14. "U.S. Navy Chooses Yellowbrick, Sunsets IBM Netezza". www.businesswire.com. 2023-03-22. Retrieved 2023-04-21.
  15. Clark, Lindsay. "AWS and IBM Netezza back Iceberg in table format smackdown". www.theregister.com. Retrieved 2023-10-11.
  16. Lai, Eric (January 25, 2010). "Netezza launches Skimmer data appliance, teases two more". Computerworld.
  17. "IBM - DB2 High Performance Query Accelerator - DB2 Analytics Accelerator for z/OS - Software". 01.ibm.com. Retrieved 2013-07-19.
  18. Timothy Prickett Morgan (October 10, 2012). "IBM takes on Oracle with PureData appliances: 'Watch out, Larry, here we come'". The Register. Retrieved April 3, 2017.
  19. "IAS - Overview". www.ibm.com.
  20. 1 2 "Netezza Performance Server - Overview". www.ibm.com.
  21. "Netezza Is Changing its Hardware Architecture, Slashing Prices," "Intelligent Enterprise," July 31, 2009
  22. "Gartner's 2008 data warehouse database management system Magic Quadrant is out | DBMS 2 : DataBase Management System Services".