Cloudera

Cloudera, Inc.
Company type: Private
Industry: Software, cloud computing
Founded: June 27, 2008
Founders: Christophe Bisciglia, Amr Awadallah, Jeff Hammerbacher, Mike Olson
Headquarters: Santa Clara, California, U.S.
Key people: Charles Sansbury (CEO), Abhas (CSO), Frank O'Dowd (CRO)
Products: Analytics tools, big data tools, data engineering tools, data science tools, data warehousing tools, ETL, machine learning tools, streaming data tools
Services: Cloud data platform
Owner: Clayton, Dubilier & Rice; Kohlberg Kravis Roberts
Number of employees: 3,084 (2023)
Website: www.cloudera.com
Footnotes / references: [1] [2]

Cloudera, Inc. is an American software company providing an enterprise data management and analytics platform. The platform is cloud-native and designed to run on all major public cloud providers (AWS, Azure, and GCP) [3] as well as in on-premises private cloud environments (Red Hat OCP and open-source Kubernetes). It allows users to store and analyze data using hardware and software in cloud-based and data center operations, spanning hybrid and multi-cloud environments. Cloudera offers cloud-native analytics for data distribution, data engineering, data warehousing, transactional data, streaming data, data science, and machine learning. [4]

History

Cloudera, Inc. was formed on June 27, 2008, by Christophe Bisciglia (from Google), Amr Awadallah (from Yahoo!), Jeff Hammerbacher (from Facebook), and Mike Olson (from Oracle). [5] [6] Awadallah oversaw a business unit performing data analysis using Hadoop while at Yahoo!; [7] Hammerbacher used Hadoop to develop some of Facebook's data analytics applications; [8] and Olson formerly served as the CEO of Sleepycat Software, the company that created Berkeley DB. The four were joined in 2009 by Doug Cutting, a co-creator of Hadoop. [9]

In March 2009, Cloudera released Cloudera Distribution for Hadoop (CDH), a commercial distribution of Hadoop, [10] in conjunction with a $5 million investment led by Accel Partners. [11] This was followed by a $25 million funding round in October 2010, [12] a $40 million funding round in November 2011, [13] and a $160 million funding round in March 2014. [14] [15] [16]

In June 2013, Tom Reilly became CEO, although Olson remained as chairman of the board and chief strategist. [17] Both left the company in June 2019. [18] Rob Bearden was appointed as Cloudera's CEO in January 2020. [19]

In March 2014, Intel invested $740 million in Cloudera for an 18% stake in the company. [20] These shares were repurchased by Cloudera in December 2020 for $314 million. [21] [22]

On April 28, 2017, the company went public through an initial public offering. [23] Over the next four years, its share price declined in the wake of falling sales figures [24] and the rise of public cloud services like Amazon Web Services. [25] In October 2018, Cloudera and Hortonworks announced their merger, [26] which the two companies completed the following January. [27] In October 2021, the company went private after being acquired by KKR and Clayton, Dubilier & Rice in an all-cash transaction valued at approximately $5.3 billion. [28] [25]

Cloudera has formed partnerships with companies such as Dell, [29] IBM, [30] [31] and Oracle. [32]

Products and services

Cloudera provides the Cloudera Data Platform, a collection of products related to cloud services and data processing. [33] Some of these services are provided through public cloud servers such as Microsoft Azure or Amazon Web Services, while others are private cloud services that require a subscription. Cloudera markets these products for purposes related to machine learning and data analysis. [1]

Cloudera has adopted the marketing term "lakehouse," which derives from a combination of the terms "data lake" and "data warehouse." Cloudera's data lakehouse [34] is based on Apache Iceberg, an open-source format for very large analytics tables that supports SQL queries and allows multiple engines to work with the same tables simultaneously. [35]

Operations

Cloudera is headquartered in Santa Clara, California. It has operations in 19 countries, including Canada, Chile, Brazil, the Netherlands, Hungary, Ireland, the United Arab Emirates, the United Kingdom, Germany, France, Switzerland, India, China, Australia, Indonesia, South Korea, Singapore, and Japan. [36]

References

  1. "Cloudera, Inc. 2021 Form 10-K Annual Report". U.S. Securities and Exchange Commission.
  2. "Entity Details". Delaware.
  3. "Cloudera Data Platform Pricing". Cloudera. Retrieved 2023-04-20.
  4. Moorhead, Patrick. "Cloudera – As The World Goes Hybrid, So Must Data Management". Forbes. Retrieved 2023-04-20.
  5. Vance, Ashlee (March 16, 2009). "Bottling the Magic Behind Google and Facebook". The New York Times.
  6. Vance, Ashlee (March 17, 2009). "Hadoop, a Free Software Program, Finds Uses Beyond Search". The New York Times.
  7. Bort, Julie (January 31, 2012). "This Former Yahoo-er's Startup Is So Hot, Even the CIA Invested In It". Business Insider. Archived from the original on February 9, 2012.
  8. Vance, Ashlee (April 14, 2011). "This Tech Bubble Is Different". Bloomberg News.
  9. Handy, Alex (10 August 2009). "Hadoop creator goes to Cloudera". Software Development Times. Archived from the original on 13 March 2012. Retrieved 2011-03-22.
  10. "Cloudera Announces New Distribution for Hadoop to Bring Data Processing Power to Enterprises". Cloudera. 16 Mar 2009. Archived from the original on 27 Sep 2020.
  11. Wauters, Robin (March 16, 2009). "Cloudera Raises $5 Million Series A Round For Hadoop Commercialization". TechCrunch.
  12. "Cloudera Raises $25 Million for Hadoop Development". The New York Times. VentureBeat. October 27, 2010.
  13. "Cloudera Raises $25 Million for Hadoop Development". The New York Times. VentureBeat. October 27, 2010.
  14. Gage, Deborah (March 18, 2014). "Cloudera Raises $160 Million From T. Rowe Price, Other Public-Market Investors". The Wall Street Journal.
  15. "Startup Cloudera raises $160 mln from T Rowe, Google Ventures". Reuters. March 18, 2014.
  16. Schubarth, Cromwell (March 18, 2014). "Big bucks for Big Data: Cloudera raises $160 million". American City Business Journals.
  17. Morgan, Timothy Prickett (June 20, 2013). "Cloudera taps new CEO for inevitable IPO push or acquisition: Former CEO becomes chairman and chief strategist". The Register.
  18. Levy, Ari (June 6, 2019). "Cloudera plummets 43% after CEO abruptly departs and company cuts forecast". CNBC. Archived from the original on 2019-06-06. Retrieved January 22, 2022.
  19. Novet, Jordan (13 January 2020). "Cloudera taps former head of the company it merged with to be its new CEO". CNBC. Retrieved 23 January 2022.
  20. Randewich, Noel (March 31, 2014). "Intel invested $740 million to buy 18 percent of Cloudera". Reuters.
  21. "Cloudera Completes $500 Million Term Loan and Repurchases 26 Million Shares" (Press release). PR Newswire. December 23, 2020.
  22. Cherney, Max A. (December 23, 2020). "Cloudera Buys Back $314 Million Intel Stake. Here's What It Means for the Stock". Barron's.
  23. Balakrishnan, Anita (April 28, 2017). "Cloudera shares close more than 20% higher on Day 1". CNBC.
  24. Levy, Ari (June 6, 2019). "Cloudera plummets 43% after CEO abruptly departs and company cuts forecast". CNBC. Archived from the original on 2019-06-06. Retrieved January 22, 2022.
  25. Gottfried, Miriam (June 1, 2021). "KKR, CD&R Strike $5.3 Billion Deal to Buy Cloudera". The Wall Street Journal.
  26. Novet, Jordan (3 October 2018). "Cloudera and Hortonworks shares skyrocket as rivals merge". CNBC.
  27. Schubarth, Cromwell (3 January 2019). "Cloudera completes Hortonworks deal, but investors aren't convinced". American City Business Journals.
  28. "Cloudera Completes Agreement to Become a Private Company" (Press release). PR Newswire. October 8, 2021.
  29. Menchaca, Lionel (August 4, 2011). "Introducing the Dell Cloudera solution for Apache Hadoop: Harnessing the power of big data". Dell Technologies.
  30. "IBM, Cloudera Announce Strategic Partnership". IBM. June 21, 2019.
  31. Dignan, Larry (June 21, 2019). "IBM, Cloudera forge strategic pact". ZDNet.
  32. "Oracle Selects Cloudera to Provide Apache Hadoop Distribution and Tools for Oracle Big Data Appliance" (Press release). Cloudera. January 10, 2012.
  33. "Cloudera Data Platform (CDP)". Cloudera. Retrieved 2023-05-02.
  34. Brust, Andrew J. (2023-03-02). "GigaOm Radar for Data Lakes and Lakehouses". Gigaom. Retrieved 2023-05-02.
  35. "What Are Apache Iceberg Tables and How Are They Useful?". Snowflake. Retrieved 2024-02-26.
  36. "Locations". Cloudera. Retrieved 2023-05-02.