Revolution Analytics

Type: Subsidiary
Industry: Statistical software
Predecessor: Revolution Computing
Founded: 2007
Headquarters: Mountain View, CA, United States
Key people: David Rich, CEO
Products: Revolution R
Revenue: $8–11 million in 2009
Owner: Microsoft [1]
Parent: Microsoft
Website: revolutionanalytics.com

Revolution Analytics (formerly REvolution Computing) is a statistical software company focused on developing open source and "open-core" [2] versions of the free and open source software R for enterprise, academic and analytics customers. The company was founded in 2007 as REvolution Computing, providing support and services for R in a model similar to Red Hat's approach with Linux in the 1990s, as well as bolt-on additions for parallel processing. In 2009 the company received nine million dollars in venture capital from Intel and a private equity firm, and named Norman H. Nie as its new CEO. In 2010 the company announced the name change as well as a change in focus. Its core product, Revolution R, would be offered free to academic users, and its commercial software would focus on big data, large-scale multiprocessor (or "high performance") computing, and multi-core functionality.


Microsoft announced on January 23, 2015, that it had reached an agreement to purchase Revolution Analytics for an undisclosed amount. [3] [4] In 2021, Microsoft announced that it would retire the R distribution it had acquired from Revolution Analytics. [5] In 2023, Microsoft retired the Microsoft R Application Network, a proprietary package hosting service, similar to the Comprehensive R Archive Network, for packages acquired from Revolution Analytics (such as "ScaleR"). [6]

Founding and venture capital

REvolution Computing was founded in New Haven, Connecticut in 2007 by Richard Schultz, Martin Schultz, Steve Weston and Kirk Mettler. At the time Martin Schultz was also the Watson Professor of Computer Science at Yale University. [7] [8] Adding parallel computing to R gave the company large gains in speed for many common analytics operations, and early clients such as Pfizer used REvolution R to obtain large performance gains when running R on computing clusters. [9] While the improvements to core R were released under the GNU General Public License (GPL), REvolution provided support and services to customers of its commercial product and had considerable early success with life sciences and pharmaceutical companies. [10] [11] A year later the company opened an additional office in Seattle. [12]

In 2009 REvolution Computing accepted nine million dollars in venture capital from Intel and North Bridge Venture Partners, a private equity firm. Intel had previously supported REvolution Computing with venture capital in 2008. [13] A number of Intel employees also joined the company as staff or as advisors. [9] Concurrently, the company changed its name to Revolution Analytics and invited Norman Nie, founder of SPSS, to serve as CEO. [14] [15] This change in management corresponded with a movement toward building a more complete set of software for commercial users; prior to 2009 Revolution had focused on building parallel processing functionality into the then mostly single-threaded R. [16] David Rich replaced Norman Nie as CEO in February 2012. [17]

High performance computing, big data and the shift to analytics

Unlike analytics products offered by SAS Institute, R does not natively handle datasets larger than main memory. In 2010 Revolution Analytics introduced ScaleR, a package for Revolution R Enterprise designed to handle big data through a high-performance disk-based data store called XDF (not related to IBM's Extensible Data Format) and high performance computing across large clusters. [18] The release of ScaleR marked a push away from consulting and services alone to custom code and a la carte package pricing. [19] ScaleR also works with Apache Hadoop and other distributed file systems and Revolution Analytics has partnered with IBM to further integrate Hadoop into Revolution R. [20] [21] Packages to integrate Hadoop and MapReduce into open source R can also be found on the community package repository, CRAN. [22] [23]
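The general idea behind disk-based processing of data that does not fit in main memory can be illustrated with a minimal sketch. The Python code below is not ScaleR and does not read the XDF format; it only shows, under simplifying assumptions, how a summary statistic can be computed by streaming a file in fixed-size chunks so that only one chunk is held in memory at a time. The function names (`iter_chunks`, `chunked_mean`, `write_demo_file`) and the CSV layout are hypothetical and chosen for illustration only.

```python
import csv
import os
import tempfile
from itertools import islice


def iter_chunks(path, column, chunk_rows):
    """Yield lists of floats from one CSV column, at most chunk_rows at a time."""
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle)
        while True:
            # islice pulls the next chunk_rows rows without loading the whole file.
            chunk = [float(row[column]) for row in islice(reader, chunk_rows)]
            if not chunk:
                return
            yield chunk


def chunked_mean(path, column, chunk_rows=1000):
    """Compute a column mean while holding only one chunk in memory."""
    total = 0.0
    count = 0
    for chunk in iter_chunks(path, column, chunk_rows):
        total += sum(chunk)
        count += len(chunk)
    if count == 0:
        raise ValueError("no rows to average")
    return total / count


def write_demo_file(n_rows):
    """Write a small CSV with values 1..n_rows, standing in for a large dataset."""
    fd, path = tempfile.mkstemp(suffix=".csv")
    with os.fdopen(fd, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["value"])
        for i in range(1, n_rows + 1):
            writer.writerow([i])
    return path


if __name__ == "__main__":
    demo_path = write_demo_file(10)
    try:
        print(chunked_mean(demo_path, "value", chunk_rows=3))
    finally:
        os.remove(demo_path)
```

Only running totals are carried between chunks, so memory use depends on the chunk size rather than on the size of the file. Statistics that cannot be expressed as such running totals need different algorithms.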

Market position

In comparison to developers of similar analytics tools, Revolution Analytics is a small company; in 2010 the company had a projected revenue of $8–11 million, but it published no official revenue or profit figures. [24] According to Nie, the increased use of R within academia is helping the company to grow quickly; unlike other analytics packages, R is a fully fledged programming language. [25] [26] [27] [28] [29] Community vice president David Smith suggested that the movement away from "black box" analytics toward open source tools in general favored vendors like Revolution over solely proprietary tools. [30]

Products

Revolution Analytics' product Revolution R is available in three editions. Revolution R Open is a free and open source distribution of R with additional features for performance and reproducibility. Revolution R Plus provides technical support and open-source assurance (legal indemnification) subscriptions for Revolution R Open and other open-source components that work with R. (These two products were first announced on October 15, 2014. [31] ) Revolution R Enterprise adds proprietary components to support statistical analysis of big data, and is sold by subscription for workstations, servers, Hadoop and databases. (Single-user licenses are available free to academic users and to users competing in Kaggle data mining competitions. [32] [33] )

In January 2016 Microsoft rebranded and renewed several Revolution Analytics products and offerings for Hadoop, Teradata Database, SUSE Linux, Red Hat, and Microsoft Windows, and made several of these R-based products free of charge for developers. [34] [35] [36]


References

  1. Kniskern, Kip (6 April 2015). "Microsoft completes Revolution Analytics acquisition: bringing big data analytics "to everyone"". WinBeta.
  2. Blankenhorn, Dana. "Revolution rebooting R with name change and new strategy". ZDNet. Retrieved 14 July 2011.
  3. "Microsoft to acquire Revolution Analytics to help customers find big data value with advanced statistical analysis". Official Microsoft Blog Post. Retrieved 24 January 2015.
  4. "Revolution Analytics joins Microsoft". Official RA Announcement. Retrieved 24 January 2015.
  5. Rowland-Jones, James (2021-06-30). "Looking to the future for R in Azure SQL and SQL Server". Microsoft SQL Server Blog. Retrieved 2024-01-17.
  6. "Microsoft R Application Network retirement". Microsoft Tech Community. Retrieved 2024-01-17.
  7. Bogdon, Steve. "One-on-One with David Smith". Dashboard Insight. Retrieved 31 August 2011.
  8. Leidel, John. "Revolution Analytics Defines The Future of R-Statistics". InsideHPC. Retrieved 31 August 2011.
  9. Shankland, Stephen. "Intel open-source expert heads to start-up". cnet News. CBS Interactive. Retrieved 14 July 2011.
  10. Vance, Ashlee (8 January 2009). "R You Ready for R?". The New York Times. Retrieved 14 July 2011.
  11. Davies, Kevin (14 July 2008). "The New England Computing Revolution". Bio-IT World Magazine. Retrieved 14 July 2011.
  12. "REvolution Computing expands senior management team, opens west coast headquarters in Seattle". Revolution Analytics press release. Retrieved 31 August 2011.
  13. "Intel capital makes series a investment in REvolution Computing—investment highlights Intel capital's open source incubator program". Revolution Analytics press release. Retrieved 31 August 2011.
  14. Rao, Leena. "REvolution Computing Raises $9 Million". TechCrunch. Retrieved 14 July 2011.
  15. Higginbotham, Stacey (2 February 2011). "The Data Whisperer: Norman Nie of Revolution Analytics". The New York Times. Retrieved 14 July 2011.
  16. Prickett Morgan, Timothy. "Open source R in commercial Revolution". The Register. Retrieved 1 September 2011.
  17. "Revolution Analytics Names David Rich New CEO".
  18. Gardner, Dana. "Revolution Analytics targets R language, platform at growing need to handle 'big data' crunching challenges". ZDNet. Retrieved 14 July 2011.
  19. Morgan, Timothy Prickett (3 August 2010). "Revolution lets R to stats on big data". The Register. Retrieved 14 July 2011.
  20. Harris, Derrick (14 March 2011). "IBM Creates Big Data Frankenstein With Netezza-R Fusion". The New York Times. Retrieved 14 July 2011.
  21. Rosenberg, Dave. "Open-source 'R' gets Hadoop integration". cnet News. CBS Interactive. Retrieved 14 July 2011.
  22. Smith, David. "Hadoop ported to R (and it's trivial)". Revolutions. Revolution Analytics. Retrieved 1 September 2011.
  23. Brown, Christopher. "Package:mapReduce". CRAN. The R Project. Retrieved 1 September 2011.
  24. Xavier, Jon (15 August 2010). "Revolution Analytics wants to overthrow old statistical tools". Silicon Valley Business Journal. Retrieved 14 July 2011.
  25. Hardy, Quentin (24 May 2010). "Power in the Numbers". Forbes Magazine. Archived from the original on May 10, 2010. Retrieved 14 July 2011.
  26. McNally, Steve (10 November 2010). "Names You Need to Know in 2011: R Data Analysis Software". Forbes. Retrieved 14 July 2011.
  27. Olds, Dan. "'R' is for Revolution Analytics". The Register. Retrieved 14 July 2011.
  28. Lawson, Lorraine. "Another Tool for Analyzing Big Data". IT Business Edge. Retrieved 14 July 2011.
  29. Hardy, Quentin (1 February 2011). "Another Open Source Swipe at IBM and SAS". Forbes. Retrieved 14 July 2011.
  30. Bodkin, Ron. "Revolution Analytics - Commercializing R for Statistics". InfoQ. Retrieved 14 July 2011.
  31. "Revolution Analytics Introduces Revolution R Open and Revolution R Plus" (Press release). 15 October 2014.
  32. "Free single user subscription to Revolution R Enterprise". Revolution Analytics website. Retrieved 2 September 2011.
  33. Finley, Klint. "Revolution Analytics Offers Free Software for Kaggle Competitors". ReadWriteWeb. Archived from the original on 2 January 2012. Retrieved 14 July 2011.
  34. Robinson, Daniel. "Microsoft unveils free Microsoft R Server Developer Edition for big data analytics". V3. Retrieved 13 January 2016.
  35. Viswav, Pradeep. "Microsoft R Server Now Available For Hadoop, Linux And Teradata". Microsoft-News. Retrieved 13 January 2016.
  36. Foley, Mary Jo. "Microsoft delivers free version of its R analytics Server for developers. Microsoft is rolling out a free version of its R big-data analytics server for developers alongside the rest of the newly rebranded Revolution Analytics servers". ZDNet - All About Microsoft. Retrieved 13 January 2016.
