Paxata

Paxata
Type: Private [1]
Industry: Enterprise analytics software
Founded: January 2012
Headquarters: Redwood City, CA [2]
Area served: Worldwide
Key people:
  • Prakash Nanduri (Co-Founder & CEO) [3]
  • Dave Brewster (Co-Founder & CTO) [3]
  • Chris Maddox (Co-Founder & VP of Business Development) [3]
  • Nenshad Bardoliwalla (Co-Founder & Chief Product Officer) [3]
Products: The Paxata suite of self-service data preparation software

Paxata is a privately owned software company headquartered in Redwood City, California. It develops self-service data preparation software that prepares data for use in data analytics software. Paxata's software is intended for business analysts, as opposed to technical staff. It is used to combine data from different sources, then check it for data quality issues, such as duplicates and outliers. Algorithms and machine learning automate certain aspects of data preparation, and users work with the software through a user interface similar to Excel spreadsheets.

The company was founded in January 2012 and operated in stealth mode until October 2013. It received more than $10 million in venture funding before DataRobot announced its acquisition of the company in December 2019. [4] [5]

History

Paxata was founded in January 2012. [6] It initially raised $2 million in venture capital. [7] The company came out of stealth mode in October 2013. [6] Simultaneously with its public release, Paxata announced an $8 million funding round led by Accel Partners. [6] [8] Adoption of the software grew quickly. [6] [9] In March 2014, In-Q-Tel acquired an interest in the startup. [10] It raised an additional $18 million in funding in September 2015. [11] It also began working with Cisco to jointly develop the Cisco Data Preparation suite of software and services. [12]

Software

Paxata refers to its suite of cloud-based data quality, integration, enrichment and governance products as "Adaptive Data Preparation." [8] [13] [14] [15] The software is intended for business analysts who need to combine data from a variety of sources, then check the data for duplicates, empty fields, outliers, trends and integrity issues before conducting analysis or visualization in a third-party software tool. [15] [16] It uses algorithms and machine learning to automate certain aspects of data preparation. [15] [17] For example, it may automatically detect records that belong to the same person or address, even if the information is formatted differently in records from different data sets. [17] [18]
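The idea of matching records that are formatted differently can be illustrated with a minimal sketch. It does not reproduce Paxata's actual algorithms, which the cited sources do not describe in detail. The field names, the helper functions `normalize`, `name_key` and `likely_same`, and the 0.85 threshold are all assumptions chosen for illustration; only Python's standard library is used.

```python
import re
from difflib import SequenceMatcher


def normalize(value: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    value = re.sub(r"[^\w\s]", " ", value.lower())
    return " ".join(value.split())


def name_key(name: str) -> str:
    """Sort the name tokens so 'Smith, Jane' and 'Jane Smith' compare equal."""
    return " ".join(sorted(normalize(name).split()))


def likely_same(a: dict, b: dict, threshold: float = 0.85) -> bool:
    """Average the name and address similarity; the threshold is illustrative."""
    name_score = SequenceMatcher(None, name_key(a["name"]), name_key(b["name"])).ratio()
    addr_score = SequenceMatcher(None, normalize(a["address"]), normalize(b["address"])).ratio()
    return (name_score + addr_score) / 2 >= threshold


# Hypothetical records from two different sources.
crm_record = {"name": "Smith, Jane", "address": "12 Main St."}
billing_record = {"name": "jane smith", "address": "12 main st"}
other_record = {"name": "Robert Jones", "address": "98 Oak Avenue"}

print(likely_same(crm_record, billing_record))
print(likely_same(crm_record, other_record))
```

In this sketch the first two records differ only in case, punctuation and name order, so they normalize to identical strings and are flagged as a likely match, while the third record is not.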

The software has a spreadsheet-based user interface. [15] [18] Patterns and anomalies in the data are color-coded in the spreadsheet, and users are given instructions on how to resolve data quality issues or supplement the data with contextual information. [14] Data sets and related quality issues can also be addressed collaboratively through the "Paxata Share" feature. [18] The software runs on Apache Spark. [11] [19]

According to analyst firm Ovum, the software is made possible through advances in predictive analytics, machine learning and the NoSQL data caching methodology. [15] The software uses semantic algorithms to understand the meaning of a data table's columns and pattern recognition algorithms to find potential duplicates in a data set. [15] [7] It also uses indexing, text pattern recognition and other technologies traditionally found in social media and search software. [20] One of the software's users is dairy producer Danone, which uses the software so that business staff can create their own reports on merchandising, supply chain and product data without involving the IT department. [21]
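As a rough illustration of how text pattern recognition can suggest what a column means, the following sketch labels a column by the pattern that most of its values match. It is an assumption-laden toy example, not Paxata's method: the `PATTERNS` table, the `infer_column_type` function and the 0.8 cutoff are hypothetical.

```python
import re

# Hypothetical patterns; a real product would recognize many more kinds of data.
PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "us_zip": re.compile(r"^\d{5}(-\d{4})?$"),
    "date_iso": re.compile(r"^\d{4}-\d{2}-\d{2}$"),
}


def infer_column_type(values, min_share=0.8):
    """Return the label whose pattern matches at least min_share of the non-empty values."""
    non_empty = [v.strip() for v in values if v and v.strip()]
    if not non_empty:
        return "unknown"
    best_label, best_share = "unknown", 0.0
    for label, pattern in PATTERNS.items():
        share = sum(bool(pattern.match(v)) for v in non_empty) / len(non_empty)
        if share > best_share:
            best_label, best_share = label, share
    # Fall back to plain text when no pattern covers enough of the column.
    return best_label if best_share >= min_share else "text"


print(infer_column_type(["a@example.com", "b@example.org", "", "c@example.net"]))
print(infer_column_type(["94063", "94063-1234", "10001"]))
print(infer_column_type(["hello", "world"]))
```

Empty cells are ignored when computing the share, so a column with a few missing values can still be labeled.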

Screenshot: Paxata's spreadsheet-based user interface

Reception

In its 2014 report "Cool Vendors in Data Integration and Data Quality", Gartner praised Paxata for developing a "business-user-friendly" data quality product that does not use code. [14] Ventana Research said its spreadsheet-based user interface "should resonate well with business analysts," who are reluctant to move away from familiar Excel-like programs. [18] Gartner also said Paxata was recognized in the report for its automated, algorithm-based features and for the way it tracks any changes made to the data. [14]

Ventana Research said Paxata was in a "noisy marketplace". [18] According to Gartner, while Paxata is an early entrant into the market, many startups and large corporations are investing in similar competing products. [14] According to Gigaom and IT Business Edge, one way Paxata differs is that it automatically merges multiple data sets into a single table, which can then be easily imported into a visualization or analysis tool. [7] [22]

Gartner said Paxata would have a difficult time finding a compelling pricing model, because many of the data discovery tools that it supplements provide some similar features. [14] In contrast, Ventana said Paxata's pricing was "a pretty small amount" compared to the amount of time users can save. [18]

References

  1. "Paxata: Company Profile". Bloomberg L.P. Retrieved September 28, 2014.
  2. "Contact us". Paxata. Archived from the original on October 27, 2014. Retrieved September 28, 2014.
  3. "Paxata Leadership". Paxata. Archived from the original on August 12, 2014. Retrieved September 28, 2014.
  4. Vizard, Michael (2019-12-19). "DataRobot Acquires Paxata to Extend AI Platform". RTInsights. Retrieved 2020-03-18.
  5. "DataRobot is acquiring Paxata to add data prep to machine learning platform". TechCrunch. 12 December 2019. Retrieved 2020-03-18.
  6. Blattberg, Eric (October 28, 2013). "Paxata grabs $8M to help data scientists skip the dirty work". VentureBeat. Retrieved June 19, 2014.
  7. Harris, Derrick (October 28, 2013). "With $10M from Accel, Paxata wants to make data prep a breeze". Gigaom. Retrieved June 19, 2014.
  8. Woodie, Alex (October 28, 2013). "Paxata Debuts Data Quality Tools at Strata". Datanami. Retrieved June 19, 2014.
  9. McStravick, Alan (February 12, 2014). "Paxata: streamlining data analytics". SiliconAngle. The SiliconAngle Network. Retrieved June 19, 2014.
  10. Wait, Patience (March 7, 2014). "In-Q-Tel Invests in Data-Prep Platform Paxata". InformationWeek. UBM Tech. Retrieved June 19, 2014.
  11. Harris, Derrick (September 9, 2015). "This startup raised $18 million to make data analysis less of a chore". Fortune. Retrieved October 13, 2015.
  12. "Cisco Makes Move Into Data Preparation Space". eWeek.com. September 30, 2015. Retrieved October 13, 2015.
  13. Forrest, Conner (March 4, 2014). "Startup Paxata automates the dirty work of big data". TechRepublic. CBS Interactive. Retrieved June 26, 2014.
  14. Thoo, Eric; Friedman, Ted; Judah, Saul; Sallam, Rita L.; Edjlali, Roxane (April 24, 2014). "Cool Vendors in Data Integration and Data Quality". Gartner. Retrieved June 19, 2014.
  15. Baer, Tony (October 28, 2013). "Paxata puts a business-user face on data preparation". Ovum. Archived from the original on January 12, 2015. Retrieved June 19, 2014.
  16. Baer, Tony (December 13, 2013). "On the Radar: Paxata". Ovum. Archived from the original on January 12, 2015. Retrieved June 13, 2014.
  17. Fitzgerald, Michael (February 11, 2014). "Is Your Company Running a Data Dump?". InformationWeek. UBM Tech. Retrieved June 19, 2014.
  18. Cosentino, Tony (January 29, 2014). "Paxata Give Analysts Valuable Time Back for Analytics". Ventana Research. Archived from the original on June 20, 2014. Retrieved June 19, 2014.
  19. "Paxata Applies Data Governance Controls to Big Data". IT Business Edge. April 23, 2015. Retrieved August 20, 2015.
  20. Woodie, Alex (January 24, 2014). "Automating the Pain Out of Big Data Transformation". Datanami. Retrieved June 19, 2014.
  21. Feretic, Eileen (March 26, 2014). "Dannon Speeds Up Data Preparation and Analysis". Baseline. Bradbourne Publishing. Retrieved June 22, 2014.
  22. Vizard, Mike (November 27, 2013). "Paxata Rises to the Challenge of Big Data Preparation". IT Business Edge. QuinStreet. Retrieved June 19, 2014.