Data valuation

Last updated

Data valuation is a discipline in the fields of accounting and information economics. [1] It is concerned with methods to calculate the value of data collected, stored, analyzed and traded by organizations. This valuation depends on the type, reliability and field of data.

Contents

History

In the 21st century, exponential increases in computing power and data storage capabilities (in line with Moore's law) have led to a proliferation of big data, machine learning and other data analysis techniques. Businesses increasingly adapt these techniques and technologies to pursue data-driven strategies to create new business models.[ citation needed ] Traditional accounting techniques used to value organizations were developed in an era before high-volume data capture and analysis became widespread and focused on tangible assets (machinery, equipment, capital, property, materials etc.), ignoring data assets. As a result, accounting calculations often ignore data and leave its value off organizations' balance sheets. [2] Notably, in the wake of the 9/11 attacks on the World Trade Center in 2001, a number of businesses lost significant amounts of data. They filed claims with their insurance companies for the value of information that was destroyed, but the insurance companies denied the claims, arguing that information did not count as property and therefore was not covered by their policies. [3]

A number of organizations and individuals began noticing this and then publishing on the topic of data valuation. Doug Laney, vice president and analyst at Gartner, conducted research on Wall Street valued companies, which found that companies that had become information-centric, treating data as an asset, often had market-to-book values two to three times higher than the norm. [3] [4] On the topic, Laney commented: "Even as we are in the midst of the Information Age, information simply is not valued by those in the valuation business. However, we believe that, over the next several years, those in the business of valuing corporate investments, including equity analysts, will be compelled to consider a company's wealth of information in properly valuing the company itself." [2] In the latter part of the 2010s, the list of most valuable firms in the world (a list traditionally dominated by oil and energy companies) was dominated by data firms – Microsoft, Alphabet, Apple, Amazon and Facebook. [5] [6]

Characteristics of data as an asset

A 2020 study by the Nuffield Institute at Cambridge University, UK divided the characteristics of data into two categories, economic characteristics and informational characteristics. [7]

Economic characteristics

Informational characteristics

Data value drivers

A number of drivers affect the extent to which future economic benefits can be derived from data. Some drivers relate to data quality, while others may either render the data valueless or create unique and valuable competitive advantages for data owners. [8]

The process of realizing value from data can be subdivided into a number of key stages: data assessment, where the current states and uses of data are mapped; data valuation, where data value is measured; data investment, where capital is spent to improve processes, governance and technologies underlying data; data utilization, where data is used in business initiatives; and data reflection, where the previous stages are reviewed and new ideas and improvements are suggested. [9]

Methods for valuing data

Due to the wide range of potential datasets and use cases, as well as the relative infancy of data valuation, there are no simple or universally agreed upon methods. High option value and externalities mean data value may fluctuate unpredictably, and seemingly worthless data may suddenly become extremely valuable at an unspecified future date. [7] Nonetheless, a number of methods have been proposed for calculating or estimating data value.

Information-theoretic characterization

Information theory provides quantitative mechanisms for data valuation. For instance, secure data sharing requires careful protection of individual privacy or organization intellectual property. Information-theoretic approaches and data obfuscation can be applied to sanitize data prior to its dissemination. [10] [11]

Information-theoretic measures, such as entropy, information gain, and information cost, are useful for anomaly and outlier detection. [12] In data-driven analytics, a common problem is quantifying whether larger data sizes and/or more complex data elements actually enhance, degrade, or alter the data information content and utility. The data value metric (DVM) quantifies the useful information content of large and heterogeneous datasets in terms of the tradeoffs between the size, utility, value, and energy of the data. [13] Such methods can be used to determine if appending, expanding, or augmenting an existent dataset may improve the modeling or understanding of the underlying phenomenon.

Infonomics valuation models

Doug Laney identifies six approaches for valuing data, dividing these into two categories: foundational models and financial models. Foundational models assign a relative, informational value to data, where financial models assign an absolute, economic value. [14]

Foundational models

  • Intrinsic Value of Information (IVI) measures data value drivers including correctness, completeness and exclusivity of data and assigns a value accordingly.
  • Business Value of Information (BVI) measures how fit the data is for specific business purposes (e.g., initiative X requires 80% accurate data that is updated weekly – how closely does the data match this requirement?).
  • Performance Value of Information (PVI) measures how the usage of the data effects key business drivers and KPIs, often using a control group study.

Financial models

  • Cost Value of Information (CVI) measures the cost to produce and store the data, the cost to replace it, or the impact on cash flows if it was lost.
  • Market Value of Information (MVI) measures the actual or estimated value the data would be traded for in the data marketplace.
  • Economic Value of Information (EVI) measures the expected cash flows, returns or savings from the usage of the data.

Bennett institute valuations

Research by the Bennett Institute divides approaches for estimating the value of data into market-based valuations and non-market-based valuations. [7]

Market based valuations

  • Stock market valuations measure the advantage gained by organizations that invest in data and data capability.
  • Income based valuations seek to measure the current and future income derived from data. This approach has limitations due to its inability to measure value realized in a wider business or societal ecosystem, or beyond financial transactions involving data. Where income from data is realized through trading data in a marketplace, there are further limitations, as markets fail to describe the full option value of data, and usually lack enough buyers and sellers for the market to settle on a price that truly reflects the economic value of the data.
  • Cost based valuations measure the cost to create and maintain data. This can look at the actual cost incurred, or projected costs if the data needed to be replaced.

Non-market based valuations

  • Economic value of open data examines who open or free data creates value for: organizations that host or steward the data; intermediary organizations or individuals that reuse the data to create products and services; organizations and individuals that use these products and services.
  • Value of personal data can be estimated by asking consumers questions such as how much they would be willing to pay to access a data-privacy service or would charge for access to their personal data. Values can also be estimated by examining the profits of companies that rely on personal data (In 2018 Facebook generated $10 for every active user), and by examining fines handed out to organizations that breach data privacy or other regulations.

Other approaches

Companies performing Data Valuations

Data Valuation as a Service provides:

Related Research Articles

References

  1. Allen, Beth (1990). "Information as an Economic Commodity". The American Economic Review. 80 (2): 268–273. JSTOR   2006582.
  2. 1 2 "Gartner Says Within Five Years, Organizations Will Be Valued on Their Information Portfolios".
  3. 1 2 "How Do You Value Information?". 15 September 2016.
  4. "Applied Infonomics: Why and How to Measure the Value of Your Information Assets".
  5. "The Value of Data". 22 September 2017.
  6. "Most Valuable Companies in the World – 2020".
  7. 1 2 3 "The Value of Data Summary Report" (PDF).
  8. "Putting a value on data" (PDF).
  9. "Data Valuation – What is Your Data Worth and How do You Value it?". 13 September 2019.
  10. Askari, M; Safavi-Naini, R; Barker, K (2012). "An information theoretic privacy and utility measure for data sanitization mechanisms". Proceedings of the second ACM conference on Data and Application Security and Privacy. Association for Computing Machinery. pp. 283–294. doi:10.1145/2133601.2133637. ISBN   9781450310918. S2CID   18338542.
  11. Zhou, N; Wu, Q; Wu, Z; Marino, S; Dinov, ID (2022). "DataSifterText: Partially Synthetic Text Generation for Sensitive Clinical Notes". Journal of Medical Systems. 46 (96): 96. doi:10.1007/s10916-022-01880-6. PMC  10111580. PMID   36380246.
  12. Lee , W; Xiang, D (2001). "Information-theoretic measures for anomaly detection". Proceedings 2001 IEEE Symposium on Security and Privacy. S&P 2001. IEEE. pp. 130–143. doi:10.1109/SECPRI.2001.924294. ISBN   0-7695-1046-9. S2CID   6014214.
  13. Noshad, M; Choi, J; Sun, Y; Hero, A; Dinov, ID (2021). "An information theoretic privacy and utility measure for data sanitization mechanisms". J Big Data. Springer. 8 (82): 82. doi:10.1186/s40537-021-00446-6. PMC   8550565 . PMID   34777945.
  14. "Why and How to Measure the Value of your Information Assets".
  15. "Measuring the Value of Information: An Asset Valuation Approach" (PDF).
  16. "The Valuation of Data as an Asset" (PDF).
  17. "Consumption-Based Method". 4 December 2018.
  18. "Keeping Research Data Safe Method". 4 December 2018.
  19. "Why you should be treating data as an asset". 2 March 2020.
  20. "Data Valuation – Valuing the World's Greatest Asset".