Operational analytical processing

Last updated

Operational analytical processing, or more popularly known as operational analytics, is a subset of data analytics that focuses on improving the operational nature of a business or entity.

Contents

The main characteristic that distinguishes operational analytics from other types of analytics is that it is analytics on the fly, [1] which means that signals emanating from various parts of a business are processed in real-time to feed back into instant decision-making for the business. Some people refer to this as "continuous analytics," which is another way to emphasize the continuous digital feedback loop that can exist from one part of a business to its other parts.

Overview

The rapid digital transformation of many businesses means that an increasing number of business signals are being recorded and stored in digital form. Businesses are using these signals to improve their efficiency, improve their performance and provide better experiences to their users and customers. A Forrester Report [2] details how digitization of a business is impacting its customer experiences by leveraging data. Operational analytics allows you to process various types of information from different sources and then decide what to do next: what action to take, whom to talk to, what immediate plans to make. Gartner defines this as Continuous Intelligence [3] in a research report and goes on to describe this as a design pattern in which real-time analytics are integrated within a business operation, processing current and historical data to prescribe actions in response to events. Andreessen Horowitz [4] [5] describes this as ...more and more decisions are automated away altogether—think of Amazon continually updating prices for its products throughout the day. This form of analytics has become popular with the digitization trend in almost all industry verticals, because it is digitization that furnishes the data needed for operational decision-making.

A few examples of operational analytics include... a product manager who looks at product-usage logs to determine which features of the product are liked by its users, which features slow them down, and which features are disliked by its users. The product manager can gather all these answers by querying data that records usage patterns from the product's user base; and he or she can immediately feed that information back to make the product better. Similarly, in the case of marketing analytic in the pre-digitized world, a marketing manager would organize a few focus groups, try out a few experiments based on their own creativity and then implement them. Depending on the results of experimentation, they would then decide what to do next. An experiment may take weeks or months. In the digitized world, there is the "marketing engineer," a person who is well-versed in using data systems. These marketing engineers can run multiple experiments at once, gather results from experiments in the form of data, terminate the ineffective experiments and nurture the ones that work, all through the use of data-based software systems. The more experiments they can run and the quicker the turnaround times of results, the better their effectiveness in marketing their product.

An MIT Technology Review article [6] describes how a ride-sharing application uses algorithms for real-time monitoring of traffic and trip times to balance demand and supply for ride sourcing—and to adjust fees accordingly and rapidly. The use of operations analytics is not confined to the field of information technology. Data from business intelligence, finance, science, weather, and even current events are combined and then analyze together to extract valuable insight from it, and this in turn, drives quick decision making in almost every conceivable use. A metrics collection system like Scuba [7] is an operational analytics system because it is used extensively for interactive, ad hoc, analysis queries that run in under a second over live data.

Definition of an operational analytics processing engine

The definition of an operational analytics processing engine (OPAP) [8] can be expressed in the form of the following six propositions:

  1. Complex queries: Support for queries like inner & outer joins, aggregations, sorting, relevance, etc.
  2. Low data latency: An update to any data record is visible in query results in under than a few seconds.
  3. Low query latency: A simple search query returns in under a few milliseconds.
  4. High query volume: Able to serve at least a few hundred concurrent queries per second.
  5. Live sync with data sources: Ability to keep itself in sync with various external sources without having to write external scripts. This can be done via change-data-capture of an external database, or by tailing streaming data sources.
  6. Mixed types: Allows values of different types in the same column. This is needed to be able to ingest new data without needing to manipulate them at write time.

System requirements

Operational Analytics is a subset of the broader set of processes that characterizes OLAP (online analytical processing). As such, it inherits the large data sizes and complex queries that OLAP systems typically has to handle. However, the characteristics that uniquely identify operational analytics is the requirement for quick predictions based on most recent signals. This means that the data latency and query latency are very small. For example, operational analytics applied to real time business processes specify that data latency be zero. It also means that queries should be fast and finish at interactive speeds. Because these decisions are taken at a micro-level and are very personalized [9] to each individual entity, operational analytics processing is characterized by how easy it is to deliver personalized recommendations using such a system.

Related Research Articles

Customer relationship management (CRM) is a process in which a business or other organization administers its interactions with customers, typically using data analysis to study large amounts of information.

Data warehouse

In computing, a data warehouse, also known as an enterprise data warehouse (EDW), is a system used for reporting and data analysis and is considered a core component of business intelligence. DWs are central repositories of integrated data from one or more disparate sources. They store current and historical data in one single place that are used for creating analytical reports for workers throughout the enterprise.

IBM Db2 Family Relational model database server

Db2 is a family of data management products, including database servers, developed by IBM. They initially supported the relational model, but were extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB/2, then DB2 until 2017 and finally changed to its present form.

Business intelligence (BI) comprises the strategies and technologies used by enterprises for the data analysis of business information. BI technologies provide historical, current, and predictive views of business operations. Common functions of business intelligence technologies include reporting, online analytical processing, analytics, dashboard development, data mining, process mining, complex event processing, business performance management, benchmarking, text mining, predictive analytics, and prescriptive analytics. BI technologies can handle large amounts of structured and sometimes unstructured data to help identify, develop, and otherwise create new strategic business opportunities. They aim to allow for the easy interpretation of these big data. Identifying new opportunities and implementing an effective strategy based on insights can provide businesses with a competitive market advantage and long-term stability.

Online analytical processing, or OLAP, is an approach to answer multi-dimensional analytical (MDA) queries swiftly in computing. OLAP is part of the broader category of business intelligence, which also encompasses relational databases, report writing and data mining. Typical applications of OLAP include business reporting for sales, marketing, management reporting, business process management (BPM), budgeting and forecasting, financial reporting and similar areas, with new applications emerging, such as agriculture.

Personalized marketing, also known as one-to-one marketing or individual marketing, is a marketing strategy by which companies leverage data analysis and digital technology to deliver individualized messages and product offerings to current or prospective customers. Advancements in data collection methods, analytics, digital electronics, and digital economics, have enabled marketers to deploy more effective real-time and prolonged customer experience personalization tactics.

Business performance management is a set of performance management and analytic processes that enables the management of an organization's performance to achieve one or more pre-selected goals. Gartner retired the concept of "CPM" and reclassified it as "financial planning and analysis (FP&A)," and "financial close" to reflect two concepts: increased focus on planning and the emergence of a new category of solutions supporting the management of the financial close.

An Executive information system (EIS), also known as an Executive support system (ESS), is a type of management support system that facilitates and supports senior executive information and decision-making needs. It provides easy access to internal and external information relevant to organizational goals. It is commonly considered a specialized form of decision support system (DSS).

Essbase is a multidimensional database management system (MDBMS) that provides a multidimensional database platform upon which to build analytic applications. Essbase began as a product of Arbor Software, which merged with Hyperion Software in 1998. Oracle Corporation acquired Hyperion Solutions Corporation in 2007, as of 2009 Oracle marketed Essbase as "Oracle Essbase" and more recently, Essbase is offered as part of the Oracle Analytics Cloud. Until late 2005 IBM also marketed an OEM version of Essbase as DB2 OLAP Server.

In online transaction processing (OLTP), information systems typically facilitate and manage transaction-oriented applications.

Business analytics (BA) refers to the skills, technologies, and practices for continuous iterative exploration and investigation of past business performance to gain insight and drive business planning. Business analytics focuses on developing new insights and understanding of business performance based on data and statistical methods. In contrast, business intelligence traditionally focuses on using a consistent set of metrics to both measure past performance and guide business planning. In other words, business intelligence focusses on description, while business analytics focusses on prediction and prescription.

Panopticon Software was a multi-national data visualization software company specializing in monitoring and analysis of real-time data. The firm was headquartered in Stockholm, Sweden. It partnered with several large systems integrators and infrastructure software companies, including SAP, Thomson Reuters, Kx Systems, and One Market Data (OneTick). The company's name is derived from the Greek: 'pan' for all, 'optic' for sight. The company name is derived from the word panopticon which is an architectural concept originally intended to facilitate surveillance of prisons.

The following tables compare general and technical information for a number of online analytical processing (OLAP) servers. Please see the individual products articles for further information.

SQLstream is a distributed, SQL standards-compliant plus Java stream processing platform. SQLstream, Inc. is based in San Francisco, California and was launched in 2009 by Damian Black, Edan Kabatchnik and Julian Hyde, author of the open source Mondrian Relational OLAP Server Engine.

In computer science, in-memory processing is an emerging technology for processing of data stored in an in-memory database. Older systems have been based on disk storage and relational databases using SQL query language, but these are increasingly regarded as inadequate to meet business intelligence (BI) needs. Because stored data is accessed much more quickly when it is placed in random-access memory (RAM) or flash memory, in-memory processing allows data to be analysed in real time, enabling faster reporting and decision-making in business.

Platfora, Inc. is a big data analytics company based in San Mateo, California. The firm’s software works with the open-source software framework Apache Hadoop to assist with data analysis, data visualization, and sharing.

Hybrid transaction/analytical processing (HTAP), a term created by Gartner Inc. – an information technology research and advisory company. As defined by Gartner:

Hybrid transaction/analytical processing (HTAP) is an emerging application architecture that "breaks the wall" between transaction processing and analytics. It enables more informed and "in business real time" decision making.

Azure Cosmos DB is Microsoft's proprietary globally-distributed, multi-model database service "for managing data at planet-scale" launched in May 2017. It is schema-agnostic, horizontally scalable, and generally classified as a NoSQL database.

Quantifind

Quantifind is a machine learning platform and software provider used by financial institutions and law enforcement to discover signals of risk, including fraud and money laundering.

Imply is a computer software company founded by the creators of Druid, that aims to help organizations with exploratory data analysis using Druid.

References

  1. "What is Operational Analytics? - Definition from Techopedia". Techopedia.com. Retrieved 2019-08-11.
  2. Forrester (Feb 26, 2018). "Digital Rewrites the Rules of Business" (PDF). Lantelligence.
  3. "Gartner Identifies Top 10 Data and Analytics Technology Trends for 2019". Gartner. Retrieved 2019-08-11.
  4. Naous, Jad (2019-05-16). "Everyone is an Analyst: Opportunities in Operational Analytics". Andreessen Horowitz. Retrieved 2019-08-11.
  5. Naous, Jad; Chen, Frank (2019-04-13). "Re-tooling How Organizations Make Decisions". Andreessen Horowitz. Retrieved 2019-08-11.
  6. "Data Analytics and Machine Learning: Driving Speed to Insight". MIT Technology Review. Retrieved 2019-08-11.
  7. "Scuba: Diving into Data at Facebook". Facebook Research. Retrieved 2019-08-11.
  8. "Operational Analytics: What every software engineer should know about low-latency queries on large data sets". rockset.com. Retrieved 2019-11-03.
  9. "Personalization", Wikipedia, 2019-08-07, retrieved 2019-08-11