Company type | Subsidiary of IBM |
---|---|
Industry | Data warehousing |
Founded | 1999 |
Headquarters | Marlborough, Massachusetts, United States |
Products | Data Warehouse Appliance Integrated Data Warehouse Hardware and Software Professional Services Customer Services |
Revenue | US$190.6 million (FY 2010) |
Number of employees | 469 (2010) [1] |
Parent | IBM |
Website | www |
IBM Netezza (pronounced ne-teez-a) is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for the most demanding analytic uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.
Netezza was acquired by IBM on September 20, 2010. [2] IBM released 4 generations of Netezza Appliances (Twinfin, Striper, Mako) where it was later reintroduced in June 2019 as a fourth generation NPS, Netezza Performance Server, part of the IBM CloudPak for Data offering (Hammerhead). [3] [4]
Netezza was founded in 1999 by Foster Hinshaw. [5] In 2000 Jit Saxena joined Hinshaw as co-founder. [5] The company was incorporated in Delaware on December 30, 1999 as Intelligent Data Engines, Inc. and changed its name to Netezza Corporation in November 2000. Netezza announced the industry's first "data warehouse appliance" in 2003 [6] to meet the industry's need to make use of the rapidly increasing ability to collect consumer data. In July 2007, Netezza Corporation had its initial public offering under the ticker “NZ” on NYSE Arca. [7] [8]
Hinshaw coined the term "data warehouse appliance" to describe a product of shared nothing parallel nodes specifically targeted for high data volumes for modern data analytics. [9] [10] He left Netezza to found Dataupia in 2005. [11]
Netezza software was based on PostgreSQL 7.2. [12]
Jim Baum was appointed CEO of Netezza in January 2008 [13] after co-founder Jit Saxena announced his retirement. Baum started at Netezza as chief operating officer in 2006. Prior to joining Netezza, Baum was president and CEO of Endeca in Boston for five years. [14] [15]
IBM and Netezza on September 20, 2010 announced they entered into a definitive agreement for IBM to acquire Netezza in a cash transaction at a price of $27 per share or at a net price of approximately $1.7 billion, after adjusting for cash. [2]
IBM released 4 generations of Netezza Appliances (Twinfin N1001 (in 2010), Striper N2001, Mako N3001 (in 2015)), where it was later introduced in June 2019 as a fourth generation NPS system, part of the IBM CloudPak for Data System offering (Hammerhead). [3] [4]
IBM also released Netezza as a service (SaaS) fully managed and hosted offering, in 2020, on both Microsoft Azure as well as on AWS, fully backward compatible with the on-premise appliance form factor.
In August 2023, IBM Netezza picked up a table format from Apache Iceberg which would extend the reach of Netezza capabilities into a data lake house. [16] Furthermore it's integration with IBM watsonx.data (released in 2023) allows it to become a unique, hybrid compute engine based data lake house solution, the next generation data store, extending it's strategic importance even further.
TwinFin, Netezza’s primary product, is designed for rapid analysis of data volumes scaling into petabytes. The company introduced the fourth generation of the TwinFin product in August 2009. [1] Netezza introduced a scaled-down version of this appliance under the Skimmer brand in January 2010. [17]
In February 2010, Netezza announced that it had opened up its systems to support major programming models, including Hadoop, MapReduce, Java, C++, and Python models. Netezza's partners predicted to leverage this analytic application support are Tibco Spotfire, MicroStrategy, Pursway, DemandTec and QuantiSense.[ citation needed ]
The company also markets specialized appliances for retail, spatial, complex analytics and regulatory compliance needs. Netezza sells software-based products for migrating from Oracle Exadata and for implementing data virtualization and federation (data abstraction) schemes.[ citation needed ]
The Netezza appliance was the foundation of IBM Db2 Analytics Accelerator (IDAA). [18]
In 2012 the products were re-branded as IBM PureData for Analytics. [19]
In 2017, IBM released next to Netezza, the Integrated Analytics System [20] using Power-8 processing frame and Db2 as the database engine in an offering called Db2 Warehouse. It featured both row-based and columnar storage plus high-speed flash drives. The Db2 Warehouse engine runs both on the cloud or on-prem.[ citation needed ]
In 2019, after acquiring Red Hat, IBM established Cloud Pak offerings based on OpenShift, and revived Netezza as Netezza Performance Server under Cloud Pak for Data, both of which could run on-prem or on the cloud. The offering is a 64-bit NPS with flash drives and optimized FPGAs. The modernized NPS is 100 percent identical in feature compatibility to Netezza Mako, and moving to this platform required only, either nzmigrate to clone the environment or an nzmigrate or nzbackup/restore. [21]
In 2020, the first Netezza Performance Server in the cloud was GA on Amazon Web Services. This offering uses the actual AMPP Netezza Hardware, not commodity hardware running Netezza software. Migrating to this platform also requires only an nzmigrate or nzbackup/restore through an S3 bucket. It is a direct competitor to Amazon's Red Shift database. It is also available in Azure and IBM Cloud. [21]
Netezza’s proprietary AMPP (Asymmetric Massively Parallel Processing) architecture is a two-tiered system designed to quickly handle very large queries from multiple users.[ citation needed ]
The first tier is a high-performance Linux SMP host that compiles data query tasks received from business intelligence applications, and generates query execution plans. It then divides a query into a sequence of sub-tasks, or snippets that can be executed in parallel, and distributes the snippets to the second tier for execution.[ citation needed ]
The second tier consists of one to hundreds of snippet processing blades, or S-Blades, where all the primary processing work of the appliance is executed. The S-Blades are intelligent processing nodes that make up the massively parallel processing (MPP) engine of the appliance. Each S-Blade is an independent server that contains multi-core Intel-based CPUs and Netezza’s proprietary multi-engine, high-throughput FPGAs. The S-Blade is composed of a standard blade-server combined with a special Netezza Database Accelerator card that snaps alongside the blade. Each S-Blade is, in turn, connected to multiple disk drives processing multiple data streams in parallel in TwinFin or Skimmer.[ citation needed ]
AMPP employs industry-standard interfaces (SQL, ODBC, JDBC, OLE DB) and provides load times in excess of 2 TB/hour and backup/restore data rates of more than 4 TB/hour.[ citation needed ]
In 2009, the company transitioned from PowerPC processors to Intel CPUs. [22] In August, 2009, with the introduction of the 4th generation TwinFin product, Netezza moved from proprietary blades to IBM blades.[ citation needed ]
Netezza was added to Gartner’s Magic Quadrant for DBMS in January, 2009. [23]
z/OS is a 64-bit operating system for IBM z/Architecture mainframes, introduced by IBM in October 2000. It derives from and is the successor to OS/390, which in turn was preceded by a string of MVS versions. Like OS/390, z/OS combines a number of formerly separate, related products, some of which are still optional. z/OS has the attributes of modern operating systems but also retains much of the older functionality that originated in the 1960s and is still in regular use—z/OS is designed for backward compatibility.
Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) and multi-model database offerings. The Informix products were originally developed by Informix Corporation, whose Informix Software subsidiary was acquired by IBM in 2001.
Db2 is a family of data management products, including database servers, developed by IBM. It initially supported the relational model, but was extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB2 until 2017, when it changed to its present form.
Essbase is a multidimensional database management system (MDBMS) that provides a platform upon which to build analytic applications. Essbase began as a product from Arbor Software, which merged with Hyperion Software in 1998. Oracle Corporation acquired Hyperion Solutions Corporation in 2007. Until late 2005 IBM also marketed an OEM version of Essbase as DB2 OLAP Server.
IBM Storage Protect is a data protection platform that gives enterprises a single point of control and administration for backup and recovery. It is the flagship product in the IBM Spectrum Protect family.
In IBM System z9 and successor mainframes, the System z Integrated Information Processor (zIIP) is a special purpose processor. It was initially introduced to relieve the general mainframe central processors (CPs) of specific Db2 processing loads, but currently is used to offload other z/OS workloads as described below. The idea originated with previous special purpose processors, the zAAP, which offloads Java processing, and the IFL, which runs Linux and z/VM but not other IBM operating systems such as z/OS, DOS/VSE and TPF. A System z PU is "characterized" as one of these processor types, or as a CP, or SAP. These processors do not contain microcode or hardware features that accelerate their designated workloads. Instead, by relieving the general CP of particular workloads, they often lead to a higher workload throughput at reduced license fees.
Business intelligence software is a type of application software designed to retrieve, analyze, transform and report data for business intelligence (BI). The applications generally read data that has been previously stored, often - though not necessarily - in a data warehouse or data mart.
In database computing, Oracle Real Application Clusters (RAC) — an option for the Oracle Database software produced by Oracle Corporation and introduced in 2001 with Oracle9i — provides software for clustering and high availability in Oracle database environments. Oracle Corporation includes RAC with the Enterprise Edition, provided the nodes are clustered using Oracle Clusterware.
IBM System Management Facility (SMF) is a component of IBM's z/OS for mainframe computers, providing a standardised method for writing out records of activity to a file. SMF provides full "instrumentation" of all baseline activities running on that IBM mainframe operating system, including I/O, network activity, software usage, error conditions, processor utilization, etc.
The IBM Data Warehousing Balanced Configuration Unit is a family of data warehousing servers from IBM. IBM introduced the Balanced Configuration Unit (BCU) for AIX in 2005, and the BCU for Linux in 2006. The BCU is a "balanced" combination of computer server hardware combined with DB2 Data Warehouse Edition software to form a data warehouse "appliance like" system to compete with systems such as Greenplum, DATAllegro, Netezza Performance Server, and Teradata.
IBM Unica NetInsight was a web analytics application that utilized an Extract, transform, load methodology to populate a database that could then be queried using a browser-based interface. NetInsight is from the same family of tools as Unica NetTracker. In April 2014, IBM announced Unica NetInsight would be discontinued.
In computing, the term data warehouse appliance (DWA) was coined by Foster Hinshaw for a computer architecture for data warehouses (DW) specifically marketed for big data analysis and discovery that is simple to use and has a high performance for the workload. A DWA includes an integrated set of servers, storage, operating systems, and databases.
In computing, the SAP BW Accelerator is a computer appliance - preinstalled software on predefined hardware - which is used to speed up OLAP queries. The software was initially known as the BI Accelerator.
Dataupia was a supplier of data warehouse appliances. Dataupia focuses on data warehousing for applications running on Oracle, Microsoft SQL Server databases. Dataupia's Satori Server included server computers, storage, and software.
Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired by EMC Corporation in July 2010.
In-database processing, sometimes referred to as in-database analytics, refers to the integration of data analytics into data warehousing functionality. Today, many large databases, such as those used for credit card fraud detection and investment bank risk management, use this technology because it provides significant performance improvements over traditional methods.
PureSystems is an IBM product line of factory pre-configured components and servers also being referred to as an "Expert Integrated System". The centrepiece of PureSystems is the IBM Flex System Manager in tandem with the so-called "Patterns of Expertise" for the automated configuration and management of PureSystems.
HP ConvergedSystem is a portfolio of system-based products from Hewlett-Packard (HP) that integrates preconfigured IT components into systems for virtualization, cloud computing, big data, collaboration, converged management, and client virtualization. Composed of servers, storage, networking, and integrated software and services, the systems are designed to address the cost and complexity of data center operations and maintenance by pulling the IT components together into a single resource pool so they are easier to manage and faster to deploy. Where previously it would take three to six months from the time of order to get a system up and running, it now reportedly takes as few as 20 days with the HP ConvergedSystem.
SAP HANA is an in-memory, column-oriented, relational database management system developed and marketed by SAP SE. Its primary function as the software running a database server is to store and retrieve data as requested by the applications. In addition, it performs advanced analytics and includes extract, transform, load (ETL) capabilities as well as an application server.