Spreadmart

Last updated

A spreadmart (spreadsheet data mart) is a business data analysis system running on spreadsheets or other desktop databases that is created and maintained by individuals or groups to perform tasks that can be done in a more structured way by a data mart or data warehouse. [1] Typically a spreadmart is created by individuals at different times using different data sources and rules for defining metrics in an organization, creating a decentralized, fractured view of the enterprise.

Contents

The concept was coined in 2002 by Wayne Eckerson at TDWI in his article Taming Spreadsheet Jockeys, [2] and intended pejoratively, as an undesirable system, which should be replaced by a data mart. However, critics such as Stephen Samild argue that spreadmarts have advantages over data marts and can be a desirable system.

Problems

Usually, spreadmarts grow where standard business intelligence (BI) reporting is too inflexible and too slow. A business analyst uses the "export to Microsoft Excel" button in the BI software and creates their own report with the exported data table. By this, the number of independently generated spreadsheets dealing with a particular group of analyses grows inside the company, and the data inside each spreadsheet is uncoupled from its source. When this happens, the data reflected in the spreadsheets is no longer verifiable and is not automatically kept up to date. Usually these spreadsheet files are distributed via email to colleagues resulting in even more copies of the data roaming through the enterprise. With Microsoft Power Pivot for Microsoft SharePoint, Excel spreadsheets can be distributed as dashboards throughout the entire company, giving even more users the tools to create spreadmarts.

The growth of spreadmarts poses tangible risks for companies, since undefined and uncoupled data can be used to draw false conclusions that lead to wrong decisions, which will cost time and money to discover and correct. Although Business Intelligence 2.0 software vendors claim to have overcome this issue, locally installed spreadsheet and graphing software continues to be easier to access and use, giving the business analyst the freedom to create the needed analysis quickly, and choose to live with the risk of data inconsistency that goes with it.

Criticism of concept

Critics like Stephen Samild argue that the definition stems from a biased view that sees a data warehouse as desirable end-result, whereas One might more accurately define data marts and data warehouses as "scaled-up systems which perform some of the tasks normally done by a spreadmart". [3] In the rest of the article Stephen Samild argues that a spreadmart fulfills a number of roles that a data warehouse cannot fulfill as easily or as cheaply due to the lack of integration with unstructured data, the lack of read-write capabilities, the long time span needed for integration of new sources in the data warehouse and the inherent 'free form' of many analytical presentations done in Word, PowerPoint or Excel.

Related Research Articles

<span class="mw-page-title-main">Gnumeric</span> Free and open-source spreadsheet software

Gnumeric is a spreadsheet program that is part of the GNOME Free Software Desktop Project. Gnumeric version 1.0 was released on 31 December 2001. Gnumeric is distributed as free software under the GNU General Public License; it is intended to replace proprietary spreadsheet programs like Microsoft Excel. Gnumeric was created and developed by Miguel de Icaza, but he has since moved on to other projects. The maintainer as of 2002 was Jody Goldberg.

<span class="mw-page-title-main">Microsoft Excel</span> Spreadsheet editor, part of Microsoft 365

Microsoft Excel is a spreadsheet editor developed by Microsoft for Windows, macOS, Android, iOS and iPadOS. It features calculation or computation capabilities, graphing tools, pivot tables, and a macro programming language called Visual Basic for Applications (VBA). Excel forms part of the Microsoft 365 suite of software.

Business intelligence comprises the strategies and technologies used by enterprises for the data analysis and management of business information. Common functions of business intelligence technologies include reporting, online analytical processing, analytics, dashboard development, data mining, process mining, complex event processing, business performance management, benchmarking, text mining, predictive analytics, and prescriptive analytics.

<span class="mw-page-title-main">Data mart</span> Data management pattern

A data mart is a structure/access pattern specific to data warehouse environments, used to retrieve client-facing data. The data mart is a subset of the data warehouse and is usually oriented to a specific business line or team. Whereas data warehouses have an enterprise-wide depth, the information in data marts pertains to a single department. In some deployments, each department or business unit is considered the owner of its data mart including all the hardware, software and data. This enables each department to isolate the use, manipulation and development of their data. In other deployments where conformed dimensions are used, this business unit owner will not hold true for shared dimensions like customer, product, etc.

<span class="mw-page-title-main">OLAP cube</span> Multidimensional data array organized for rapid analysis

An OLAP cube is a multi-dimensional array of data. Online analytical processing (OLAP) is a computer-based technique of analyzing data to look for insights. The term cube here refers to a multi-dimensional dataset, which is also sometimes called a hypercube if the number of dimensions is greater than three.

Essbase is a multidimensional database management system (MDBMS) that provides a platform upon which to build analytic applications. Essbase began as a product from Arbor Software, which merged with Hyperion Software in 1998. Oracle Corporation acquired Hyperion Solutions Corporation in 2007. Until late 2005 IBM also marketed an OEM version of Essbase as DB2 OLAP Server.

A pivot table is a table of values which are aggregations of groups of individual values from a more extensive table within one or more discrete categories. The aggregations or summaries of the groups of the individual terms might include sums, averages, counts, or other statistics. A pivot table is the outcome of the statistical processing of tabularized raw data and can be used for decision-making.

<span class="mw-page-title-main">Self-service</span> Practice of serving oneself when shopping

Self-service is the practice of serving oneself, usually when making purchases. Aside from Automated Teller Machines, which are not limited to banks, and customer-operated supermarket check-out, labor-saving which has been described as self-sourcing, there is the latter's subset, selfsourcing and a related pair: End-user development and End-user computing.

Business intelligence software is a type of application software designed to retrieve, analyze, transform and report data for business intelligence. The applications generally read data that has been previously stored, often - though not necessarily - in a data warehouse or data mart.

<span class="mw-page-title-main">Dashboard (business)</span> Aggregate business progress report

In business computer information systems, a dashboard is a type of graphical user interface which often provides at-a-glance views of key performance indicators (KPIs) relevant to a particular objective or business process. In other usage, "dashboard" is another name for "progress report" or "report" and considered a form of data visualization. In providing this overview, business owners can save time and improve their decision making by utilizing dashboards.

Microsoft Office PerformancePoint Server is a business intelligence software product released in 2007 by Microsoft. The product was generally an integration of the acquisitions from ProClarity - the Planning Server and Monitoring Server - into Microsoft's SharePoint server product line. Although discontinued in 2009, the dashboard, scorecard, and analytics capabilities of PerformancePoint Server were incorporated into SharePoint 2010 and later versions.

<span class="mw-page-title-main">Palo (OLAP database)</span>

Palo is a memory resident multidimensional database server and typically used as a business intelligence tool for controlling and budgeting purposes with spreadsheet software acting as the user interface. Beyond the multidimensional data concept, Palo enables multiple users to share one centralised data storage.

Panorama Software is a Canadian software and consulting company specializing in business intelligence. The company was founded by Rony Ross in Israel in 1993; it relocated its headquarters to Toronto, Canada in 2003. Panorama sold its online analytical processing (OLAP) technology to Microsoft in 1996, which was built into Microsoft OLAP Services and later SQL Server Analysis Services, an integrated component of Microsoft SQL Server.

F9 is a financial reporting software application that dynamically links general ledger data to Microsoft Excel through the use of financial cell-based formulas, wizards, and analysis tools to create spreadsheet reports that can be calculated, filtered, and drilled upon. The F9 software is developed, marketed, and support by an organization also called F9, a division of Infor Global Solutions (Canada) Ltd. which is headquartered in Vancouver, British Columbia.

<span class="mw-page-title-main">XLeratorDB</span>

XLeratorDB is a suite of database function libraries that enable Microsoft SQL Server to perform a wide range of additional (non-native) business intelligence and ad hoc analytics. The libraries, which are embedded and run centrally on the database, include more than 450 individual functions similar to those found in Microsoft Excel spreadsheets. The individual functions are grouped and sold as six separate libraries based on usage: finance, statistics, math, engineering, unit conversions and strings. WestClinTech, the company that developed XLeratorDB, claims it is "the first commercial function package add-in for Microsoft SQL Server."

<span class="mw-page-title-main">XLCubed</span>

XLCubed is a business intelligence software and consulting services company. Established in 2001, XLCubed develops business intelligence software and provides business intelligence and performance management consulting services. The company is privately held and based out of the United Kingdom in the Thames Valley IT corridor.

Power Pivot, formerly known as PowerPivot, is a feature of Microsoft Excel, a computer software spreadsheet. It is available as an add-in in Excel 2010, 2013 in separate downloads, and as an add-in included with the Excel 2016 program. Power Pivot extends a local instance of Microsoft Analysis Services tabular that is embedded directly into an Excel Workbook. This allows a user to build a ROLAP model in Power Pivot, and use pivot tables to explore the model once it is built. This allows Excel to act as a self-service business intelligence (BI) platform, implementing professional expression languages to query the model and calculate advanced measures.

<span class="mw-page-title-main">LibreOffice Calc</span> Spreadsheet component of LibreOffice

LibreOffice Calc is the spreadsheet component of the LibreOffice software package.

Data warehouse automation (DWA) refers to the process of accelerating and automating the data warehouse development cycles, while assuring quality and consistency. DWA is believed to provide automation of the entire lifecycle of a data warehouse, from source system analysis to testing to documentation. It helps improve productivity, reduce cost, and improve overall quality.

Microsoft Power BI is an interactive data visualization software product developed by Microsoft with a primary focus on business intelligence. It is part of the Microsoft Power Platform. Power BI is a collection of software services, apps, and connectors that work together to turn various sources of data into static and interactive data visualizations. Data may be input by reading directly from a database, webpage, PDF, or structured files such as spreadsheets, CSV, XML, JSON, XLSX, and SharePoint.

References

  1. The Data Warehousing Institute (TDWI) in a 2008 survey
  2. Eckerson, Wayne (July 2002). "Taming Spreadsheet Jockeys". TDWI Case Studies and Solutions. TDWI. Archived from the original on 2007-10-12. Retrieved 2008-06-13.
  3. Samild, Stephen (2011-09-13). "Analysis is Read-Write". Analyst First. Retrieved 2014-05-03.