SAP HANA

SAP HANA
Developer(s) SAP
Initial release 2010
Stable release 2.0 SPS07 [1] (April 4, 2023)
Operating system Linux
Available in English
Type Multi-model database
License Proprietary commercial software
Website www.sap.com/products/hana.html

SAP HANA (HochleistungsANalyseAnwendung or High-performance ANalytic Application) is an in-memory, column-oriented, relational database management system developed and marketed by SAP SE. [2] [3] As database server software, its primary function is to store and retrieve data as requested by applications. In addition, it performs advanced analytics (predictive analytics, spatial data processing, text analytics, text search, streaming analytics, graph data processing) and includes extract, transform, load (ETL) capabilities as well as an application server.

History

During the early development of SAP HANA, a number of technologies were developed or acquired by SAP SE. These included the TREX search engine (an in-memory, column-oriented search engine), P*TIME (an in-memory online transaction processing (OLTP) platform acquired by SAP in 2005), and MaxDB with its in-memory liveCache engine. [4] [5]

The first major demonstration of the platform was in 2011: teams from SAP SE, the Hasso Plattner Institute and Stanford University demonstrated an application architecture for real-time analytics and aggregation using the name HYRISE. [6] Former SAP SE executive Vishal Sikka referred to this architecture as "Hasso's New Architecture". [7] Before the name "HANA" stabilized, people referred to this product as "New Database". [8] The software was previously called "SAP High-Performance Analytic Appliance". [9]

A first research paper on HYRISE was published in November 2010. [10] The research engine was later released as open source in 2013, [11] and was reengineered in 2016 to become HYRISE2 in 2017. [12]

The first product shipped in late November 2010. [5] [13] By mid-2011, the technology had attracted interest but more experienced business customers considered it to be "in early days". [14] HANA support for SAP NetWeaver Business Warehouse (BW) was announced in September 2011 for availability by November. [15]

In 2012, SAP promoted aspects of cloud computing. [16] In October 2012, SAP announced a platform as a service offering called the SAP HANA Cloud Platform [17] [18] and a variant called SAP HANA One that used a smaller amount of memory. [19] [20]

In May 2013, a managed private cloud offering called the HANA Enterprise Cloud service was announced. [21] [22]

In May 2013, Business Suite on HANA became available, enabling customers to run SAP Enterprise Resource Planning functions on the HANA platform. [23] [24]

S/4HANA, released in 2015 and written specifically for the HANA platform, combines functionality for ERP, CRM, SRM and others into a single HANA system. [25] S/4HANA is intended to be a simplified business suite, replacing earlier generation ERP systems. [26] While it is likely that SAP will focus its innovations on S/4HANA, some customers using non-HANA systems have raised concerns about being locked into SAP products. Since S/4HANA requires an SAP HANA system to run, customers running SAP business suite applications on hardware not certified by SAP would need to migrate to an SAP-certified HANA database should they choose the features offered by S/4HANA. [27]

Rather than versioning, the software utilizes service packs, referred to as Support Package Stacks (SPS), for updates. Support Package Stacks are released every 6 months. [28]

In November 2016, SAP announced SAP HANA 2, which offers enhancements to multiple areas such as database management and application management and includes two new cloud services: Text Analysis and Earth Observation Analysis. [citation needed] HANA customers can upgrade to HANA 2 from SPS10 and above. Customers running SPS9 and below must first upgrade to SPS12 before upgrading to HANA 2 SPS01. [29]

Architecture

Overview

The key distinctions between HANA and previous-generation SAP systems are that it is a column-oriented, in-memory database and that it combines OLAP and OLTP operations in a single system. SAP HANA is therefore an "online transaction and analytical processing" (OLTAP) system, [30] also known as a hybrid transactional/analytical processing (HTAP) system. Storing data in main memory rather than on disk provides faster data access and, by extension, faster querying and processing. [31] While storing data in memory confers performance advantages, it is a more costly form of data storage. Data access patterns suggest that up to 85% of data in an enterprise system may be infrequently accessed; [31] it can therefore be cost-effective to store frequently accessed, or "hot", data in memory while the less frequently accessed "warm" data is stored on disk, an approach SAP began to support in 2016 and termed "Dynamic tiering". [32]

Column-oriented systems store all data for a single column in the same location, whereas row-oriented systems store all data for a single row in the same location. This can enable performance improvements for OLAP queries on large datasets and allows greater vertical compression of similar types of data in a single column. If read times for column-stored data are fast enough, consolidated views of the data can be computed on the fly, removing the need to maintain aggregate views and their associated data redundancy. [33]
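The difference between the two layouts can be illustrated with a minimal Python sketch. It is not HANA's storage format: the sample table, the helper names `to_columns` and `run_length_encode`, and the choice of run-length encoding as the compression scheme are all assumptions made for illustration.

```python
from itertools import groupby

# Hypothetical sample table: each tuple is one row (order_id, region, amount).
rows = [
    (1, "EMEA", 120.0),
    (2, "EMEA", 80.0),
    (3, "APAC", 200.0),
    (4, "APAC", 50.0),
    (5, "APAC", 75.0),
]


def to_columns(rows, names):
    """Pivot a row-oriented table into one contiguous list per column."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def run_length_encode(values):
    """Compress consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


columns = to_columns(rows, ["order_id", "region", "amount"])

# An aggregate only has to scan the single column it needs.
total_amount = sum(columns["amount"])

# Similar neighbouring values compress well within one column.
encoded_region = run_length_encode(columns["region"])
```

In this sketch the sum reads only the `amount` list and never touches the other columns, and the five `region` values collapse into two (value, count) pairs.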

Although row-oriented systems have traditionally been favored for OLTP, in-memory storage opens techniques to develop hybrid systems suitable for both OLAP and OLTP capabilities, [34] removing the need to maintain separate systems for OLTP and OLAP operations.

Indexer components

The index server performs session management, authorization, transaction management and command processing. The database has both a row store and a columnar store. Users can create tables using either store, but the columnar store has more capabilities and is most frequently used. [citation needed] The index server also manages persistence between cached memory images of database objects, log files and permanent storage files. The XS engine allows web applications to be built. [35]

SAP HANA Information Modeling (also known as SAP HANA Data Modeling) is a part of HANA application development. Modeling is the methodology used to expose operational data to the end user. Reusable virtual objects (named calculation views) are used in the modeling process.

MVCC

SAP HANA manages concurrency through the use of multiversion concurrency control (MVCC), which gives every transaction a snapshot of the database at a point in time. When an MVCC database needs to update an item of data, it will not overwrite the old data with new data, but will instead mark the old data as obsolete and add the newer version. [36] [37]
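The versioning idea described above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not HANA's implementation: the class names `Version` and `VersionedStore` are hypothetical, a single integer counter stands in for commit timestamps, and every write is treated as an immediately committed transaction.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class Version:
    value: Any
    created_at: int                    # commit timestamp that created this version
    obsolete_at: Optional[int] = None  # commit timestamp that superseded it, if any


@dataclass
class VersionedStore:
    clock: int = 0
    versions: dict = field(default_factory=dict)  # key -> list of Version

    def begin(self) -> int:
        """Return a snapshot timestamp for a new reader."""
        return self.clock

    def write(self, key, value) -> None:
        """Commit a new version; the old one is marked obsolete, not overwritten."""
        self.clock += 1
        chain = self.versions.setdefault(key, [])
        if chain:
            chain[-1].obsolete_at = self.clock
        chain.append(Version(value, created_at=self.clock))

    def read(self, key, snapshot: int):
        """Return the value visible at the given snapshot, or None."""
        for version in self.versions.get(key, []):
            visible = version.created_at <= snapshot and (
                version.obsolete_at is None or version.obsolete_at > snapshot
            )
            if visible:
                return version.value
        return None
```

A reader that calls `begin()` before a later `write()` keeps seeing the older value through `read(key, snapshot)`, while a reader that begins afterwards sees the newer one. A production system would also garbage-collect versions that no active snapshot can reach, which this sketch omits.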

Big data

In a scale-out environment, HANA can keep volumes of up to a petabyte of data in memory while returning query results in under a second. However, RAM is still much more expensive than disk space, so the scale-out approach is only feasible for certain time-critical use cases. [38]

Analytics

SAP HANA includes a number of analytic engines for various kinds of data processing. The Business Function Library provides algorithms for common business data processing tasks such as asset depreciation, rolling forecast and moving average. [39] The Predictive Analytics Library includes native algorithms for calculating common statistical measures in areas such as clustering, classification and time series analysis. [40]
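As an illustration of the kind of calculation such a library covers, the following Python sketch computes a simple moving average. It shows the general technique only and does not reproduce the Business Function Library's interface; the function name `moving_average` and its parameters are assumptions.

```python
from collections import deque


def moving_average(values, window):
    """Yield the mean of the last `window` values once enough have arrived."""
    if window <= 0:
        raise ValueError("window must be positive")
    recent = deque(maxlen=window)
    running_sum = 0.0
    for value in values:
        if len(recent) == window:
            running_sum -= recent[0]  # subtract the value about to be evicted
        recent.append(value)
        running_sum += value
        if len(recent) == window:
            yield running_sum / window
```

Keeping a running sum means each new value costs constant work regardless of the window size, which matters when the input is a long series.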

HANA incorporates the open source statistical programming language R as a supported language within stored procedures. [41]

The column-store database offers graph database capabilities. The graph engine processes the Cypher Query Language and also supports visual graph manipulation through a tool called Graph Viewer. Graph data structures are stored directly in relational tables in HANA's column store. [42] Pre-built algorithms in the graph engine include pattern matching, neighborhood search, single shortest path, and strongly connected components. Typical use cases for the graph engine include supply chain traceability, fraud detection, and logistics and route planning. [43]
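How a graph held as rows of an edge table can answer a shortest-path question is sketched below in Python. This is a generic breadth-first search, not the graph engine's own algorithm or API; the `edge_rows` table, its node names, and the function `shortest_path` are hypothetical.

```python
from collections import defaultdict, deque

# Hypothetical edge table: each tuple is one row (source, target).
edge_rows = [
    ("supplier", "factory"),
    ("factory", "warehouse"),
    ("warehouse", "store"),
    ("factory", "store"),
]


def shortest_path(edge_rows, start, goal):
    """Breadth-first search; returns the fewest-hop path as a list, or None."""
    adjacency = defaultdict(list)
    for source, target in edge_rows:
        adjacency[source].append(target)

    previous = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbour in adjacency[node]:
            if neighbour not in previous:
                previous[neighbour] = node
                queue.append(neighbour)
    return None
```

Breadth-first search finds the path with the fewest edges in an unweighted, directed graph; a weighted variant would need an algorithm such as Dijkstra's instead.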

HANA also includes a spatial database engine which implements spatial data types and SQL extensions for CRUD operations on spatial data. HANA is certified by the Open Geospatial Consortium, [44] and it integrates with ESRI's ArcGIS geographic information system. [45]

In addition to numerical and statistical algorithms, HANA can perform text analytics and enterprise text search. HANA's search capability is based on “fuzzy” fault-tolerant search, much like modern web-based search engines. Results include a statistical measure of how relevant each search result is, and search criteria can include an accuracy threshold for results. [46] Available analyses include identifying entities such as people, dates, places, organizations, requests, problems, and more. Such entity extraction can be tailored to specific use cases such as Voice of the Customer (customers' preferences and expectations), Enterprise (e.g. mergers and acquisitions, products, organizations), and Public Sector (public persons, events, organizations). [47] Custom extraction and dictionaries can also be implemented.
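The combination of a relevance score and an accuracy threshold can be illustrated with a short Python sketch. It uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure, which is an assumption and not HANA's scoring method; the function name `fuzzy_search` and the default threshold of 0.8 are likewise illustrative.

```python
from difflib import SequenceMatcher


def fuzzy_search(query, candidates, threshold=0.8):
    """Return (candidate, score) pairs at or above the threshold, best first."""
    scored = [
        (text, SequenceMatcher(None, query.lower(), text.lower()).ratio())
        for text in candidates
    ]
    hits = [(text, score) for text, score in scored if score >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

For example, searching for "walldorf" among "Walldorf", "Waldorf" and "Berlin" with the default threshold keeps the exact match and the near-miss spelling and drops the unrelated term. Raising the threshold towards 1.0 trades fault tolerance for precision.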

Application development

Besides the database and data analytics capabilities, SAP HANA also serves as a web-based application server, hosting user-facing applications tightly integrated with the database and analytics engines of HANA. The "XS Advanced Engine" (XSA) natively works with Node.js and Java EE languages and runtimes. XSA is based on the Cloud Foundry architecture and thus supports the notion of “Bring Your Own Language”, allowing developers to develop and deploy applications written in languages and runtimes other than those XSA implements natively, as well as to deploy applications as microservices. XSA also allows server-side JavaScript with SAP HANA XS JavaScript (XSJS). [48]

Supporting the application server is a suite of application lifecycle management tools for the development, deployment and monitoring of user-facing applications.

Deployment

HANA can be deployed on-premises or in the cloud from a number of cloud service providers. [49]

HANA can be deployed on-premises as a new appliance from a certified hardware vendor. [50] Alternatively, existing hardware components such as storage and network can be used as part of the implementation, an approach which SAP calls "Tailored Data Center Integration (TDI)". [51] [52] HANA is certified to run on multiple operating systems [53] including SUSE Linux Enterprise Server [54] and Red Hat Enterprise Linux. [55] Supported hardware platforms for on-premises deployment include Intel 64 [56] and POWER Systems. [57] The system is designed to support both horizontal and vertical scaling.

Multiple cloud providers offer SAP HANA on an Infrastructure as a Service basis, including Amazon Web Services, [58] Microsoft Azure, [59] Google Cloud, [60] IBM [61] and Huawei FusionSphere. [62]

SAP also offers its own cloud services in the form of SAP HANA Enterprise Cloud [63] and SAP Cloud Platform. [64]

Editions

SAP HANA licensing is primarily divided into two categories. [65]

Runtime License:

Used to run SAP applications such as SAP Business Warehouse powered by SAP HANA and SAP S/4HANA.

Full Use License:

Used to run both SAP and non-SAP applications. This licensing can be used to create custom applications. [66]

As part of the full use license, features are grouped as editions targeting various use cases.

In addition, capabilities such as streaming and ETL are licensed as additional options. [67]

As of March 9, 2017, SAP HANA is available in an Express edition, a streamlined version that can run on laptops and other resource-limited environments. The license for SAP HANA, express edition is free of charge, even for productive use, up to 32 GB of RAM. [68] Additional capacity can be purchased up to 128 GB of RAM. [69]


References

  1. "SAP HANA 2.0 SPS 07 Now Available". Retrieved July 27, 2023.
  2. Jeff Kelly (July 12, 2013). "Primer on SAP HANA". Wikibon. Retrieved October 9, 2013.
  3. SAP HANA - The Column Oriented (Based) Database on YouTube (December 8, 2012)
  4. Vey, Gereon; Krutov, Ilya (January 2012). "SAP In-Memory Computing on IBM eX5 Systems" (PDF). Archived from the original (PDF) on June 7, 2014.
  5. SAP SE (June 17, 2012). "SAP HANA Timeline". SlideShare. Retrieved October 9, 2013.
  6. Plattner, Hasso (2011). In-memory data management : an inflection point for enterprise applications. Zeier, Alexander. Berlin: Springer. ISBN 978-3-642-19363-7. OCLC 719363183.
  7. "Vishal Sikka: Timeless Software". October 22, 2008. Retrieved March 10, 2017.
  8. "What is SAP HANA Database". Gucons web site. 2011. Retrieved October 9, 2013.
  9. Jaikumar Vijayan (December 1, 2010). "SAP's HANA will speed real-time data analytics". Computerworld. Retrieved January 4, 2018.
  10. Grund, Martin; Krüger, Jens; Plattner, Hasso; Zeier, Alexander; Cudre-Mauroux, Philippe; Madden, Samuel (November 1, 2010). "HYRISE: a main memory hybrid storage engine". Proceedings of the VLDB Endowment. 4 (2): 105–116. doi:10.14778/1921071.1921077.
  11. The history of the project on GitHub shows a first commit on 4 February 2013.
  12. "HYRISE". hpi.de (in German). Retrieved November 27, 2019.
  13. Chris Kanaracus (December 1, 2010). "SAP launches HANA for in-memory analytics: The in-memory analytic appliance will compete with next-generation data-processing platforms such as Oracle's Exadata machines". Info World. Retrieved September 24, 2013.
  14. Chris Kanaracus (September 15, 2011). "SAP's HANA is hot, but still in early days". Network World. Archived from the original on October 19, 2011. Retrieved October 15, 2013.
  15. Courtney Bjorlin (November 9, 2011). "SAP Begins BW on HANA Ramp-Up, First Big Test for the HANA Database". ASUG News. Archived from the original on November 29, 2013. Retrieved October 15, 2013.
  16. Trevis Team (April 30, 2012). "SAP Headed For $71 On Cloud, Mobile And HANA Growth". Forbes. Retrieved October 9, 2013.
  17. "SAP Introduces SAP HANA Cloud, an In-Memory Cloud Platform". Database Trends and Applications. October 24, 2012. Retrieved June 18, 2016.
  18. "Overview | SAP HANA Cloud Platform". hcp.sap.com. Retrieved June 18, 2016.
  19. IBM Cloud AMM for SAP HANA One Archived November 19, 2015, at the Wayback Machine
  20. Doug Henschen (October 17, 2012). "SAP Launches Cloud Platform Built On Hana". InformationWeek. Archived from the original on October 19, 2012. Retrieved October 15, 2013.
  21. "SAP unveils HANA Enterprise Cloud service". Network World. May 7, 2013. Retrieved July 13, 2017.
  22. "SAP HANA Enterprise Cloud". hana.sap.com. Retrieved June 18, 2016.
  23. Brian McKenna (January 11, 2013). "SAP puts Business Suite on HANA, joins transactional to analytical". Computer Weekly. Retrieved October 15, 2013.
  24. "Sapphire 2013: Business Suite on HANA goes to general availability". Computer Weekly. May 15, 2013. Retrieved October 15, 2013.
  25. "SAP unwraps a new enterprise suite based on Hana". PCWorld. Retrieved July 13, 2017.
  26. "SAP Business Suite on HANA vs. S/4HANA Symmetry". Retrieved July 13, 2017.
  27. "SAP's S4/HANA master plan: The lingering questions". ZDNet. Retrieved August 1, 2017.
  28. "HANA 2 – What is it?". SAP Blogs. Retrieved July 13, 2017.
  29. "SAP HANA 2 - The Next Generation Platform". Retrieved July 13, 2017.
  30. "What is SAP HANA? Expert Insight from Symmetry". Retrieved August 1, 2017.
  31. "SAP HANA sales fly but there's more to the in-memory story". ZDNet. Retrieved July 28, 2017.
  32. "SAP Unleashes Major Hana Upgrade - InformationWeek". October 24, 2014. Retrieved July 28, 2017.
  33. "A Common Database Approach for OLTP and OLAP Using an In-Memory Column Database" (PDF). Retrieved August 1, 2017.
  34. "Compacting Transactional Data in Hybrid OLTP&OLAP Databases" (PDF). Retrieved August 1, 2017.
  35. "Monthly Archives". SAP Hana Blog. December 2012. Retrieved January 4, 2018.
  36. "Multiversion Concurrency Control (MVCC) Issues". SAP Help Portal. Retrieved January 4, 2018.
  37. "High-Performance Transaction Processing in SAP HANA" (PDF). Bulletin of the IEEE Computer Society Technical Committee on Data Engineering. n.d. Retrieved January 4, 2018.
  38. "SAP HANA and Big Data – Scale-out Options". Felix Weber Research. April 7, 2017. Retrieved April 7, 2019.
  39. "Business Function Library - Real Time Analytics with SAP HANA". Retrieved October 2, 2017.
  40. "SAPexperts An Introduction to SAP Predictive Analysis and How It Integrates with SAP HANA". June 30, 2013. Retrieved October 2, 2017.
  41. "When SAP HANA met R – What's new?". R-bloggers. February 18, 2013. Retrieved October 2, 2017.
  42. "FOSDEM 2017 - Graph Processing on SAP HANA, express edition". Retrieved October 2, 2017.
  43. "The Graph Story of the SAP HANA Database". Retrieved October 2, 2017.
  44. "SAP HANA SPS11 tackles analytics, IT and development". Retrieved October 2, 2017.
  45. "FAQ: Does the ArcGIS platform support the SAP HANA database?". Retrieved October 2, 2017.
  46. "SAP Releases Sentiment Analysis Solution - CRM Magazine". Retrieved October 2, 2017.
  47. "SAP HANA TA – Text Analysis". Retrieved October 2, 2017.
  48. "A New Development Platform for Native SAP HANA Applications". April 26, 2016. Retrieved October 2, 2017.
  49. "SAP HANA Deployment Options On Premise, Cloud, or Hybrid". Retrieved July 14, 2017.
  50. "Certified SAP HANA® Hardware Directory". global.sap.com. Retrieved June 30, 2016.
  51. "Datacenter integration is the new 'table stakes' | #SAPPHIRENOW". May 18, 2016. Retrieved June 30, 2016.
  52. "SAP HANA Tailored Data Center Integration - SAP HANA Technical Operations Manual - SAP Library". help.sap.com. Retrieved June 30, 2016.
  53. "SAP HANA Hardware and Software Requirements".
  54. "SUSE Linux Enterprise Server for SAP Applications". Retrieved July 14, 2017.
  55. "Red Hat launches Enterprise Linux for SAP HANA". ZDNet. Retrieved July 14, 2017.
  56. "SAP HANA Wrings Performance From New Intel Xeons". February 19, 2014. Retrieved July 14, 2017.
  57. "SAP HANA on Power with SUSE Linux Enterprise Server for SAP Applications". January 14, 2016.
  58. "AWS - SAP HANA". Retrieved May 12, 2017.
  59. "SAP HANA on Azure Virtual Machines - Microsoft Azure". Retrieved May 12, 2017.
  60. "Google Cloud and SAP forge partnership to develop enterprise solutions". March 7, 2017. Retrieved May 12, 2017.
  61. "SAP chooses IBM as a premier strategic provider of Cloud infrastructure services for its business critical applications". IBM.
  62. "Huawei Announces Availability of SAP HANA® Running on Huawei FusionSphere-huawei press center". huawei. Retrieved September 8, 2016.
  63. "SAP unveils HANA Enterprise Cloud service". Network World. May 7, 2013. Retrieved July 14, 2017.
  64. "What is SAP Cloud Platform? - Definition from WhatIs.com". Retrieved July 14, 2017.
  65. "Update IV: The SAP HANA FAQ - answering key SAP In-Memory questions". bluefinsolutions.com. Retrieved July 8, 2016.
  66. "SAP HANA in-memory DBMS overview". Retrieved July 8, 2016.
  67. "SAP HANA Options and Additional Capabilities – SAP Help Portal Page". help.sap.com. Retrieved July 8, 2016.
  68. "SAP Developer center - SAP HANA express edition". developers.sap.com. Retrieved January 28, 2019.
  69. "OS Licensing requirements for SAP HANA Express Edition". November 30, 2021. Retrieved December 11, 2021.