Utility assessment

Last updated

Utility assessment, [1] also called utility measurement, [2] is a process by which the utility function of individuals or groups can be estimated. There are many different methods for utility assessment. [3]

Contents

Assessing single-attribute utility functions

A single-attribute utility function maps the amount of money a person has (or gains), to a number representing the subjective satisfaction he derives from it. The motivation to define a utility function comes from the St. Petersburg paradox: the observation that people are not willing to pay much for a lottery, even if its expected monetary gain is infinite. The classical solution to this paradox, suggested by Daniel Bernoulli and Gabriel Cramer, is that most people have a utility function that is strictly concave, and they aim to maximize their expected utility, rather than their expected gain.

Power-log utility

Bernouli himself assumed that the utility is logarithmic, that is, u(x)=log(x) where x is the amount of money; this was sufficient for solving the St. Petersburg paradox. Gustav Fechner [4] also supplied psychophysical justification for the logarithmic function (known as the Weber–Fechner law). But Stanley Smith Stevens [5] showed that the relation between physical stimulus and psychological perception can be better explaind by a power function, that is, u(x)=xp, with exponent p between 0.3 to 2.

Many investigators tried to determine whether utility is better represented by logarithmic functions or by power functions. Using various methods, they showed that power functions fit utility data better. [6] [7] [8] [9] [10] As a result, power functions were incorporated into psychological decision theories, such as Cumulative prospect theory, rank-affected multiplicative (RAM) weights model, and transfer of attention exchange (TAX) model. Some economic applications still use logarithmic functions though. [11]

Wakker [12] noted that power functions can have a negative exponent, but in this case their sign should change so that they remain increasing. One way to define this generalized family of functions is:

which is increasing for any exponent r ≠ 0. Moreover, the limit of this function when r → 0 is exactly the logarithmic function: . Therefore, the family of functions ur(x) for all real p is sometimes called power-log utility. [13]

Procedures for assessing utility

Utility functions are usually assessed in experiments checking subjects' preferences over lotteries. Two general types of procedures have been used: [1]

  1. In equivalence procedures, subjects are asked to adjust a sum of money in one lottery so that it becomes equivalent - in their eyes - to another lottery. For example, subjects may be asked to consider two lotteries: (a) getting $x for sure; (b) getting $20 with probability 60% and getting $0 with probability 40%. They are asked "for what value of x, would you be indifferent between these two lotteries?". Each such question yields an equation of the form , which can be used to assess the form of the utility function. For example, if the subject answers x=$10, we get the equation . Assuming power utility, this yields , which yields .
  2. In choice procedures, subjects are shown two or more lotteries, and asked which lottery they prefer. For example, subjects may be asked to choose between two options: "(a) getting $10 for sure; (b) getting $20 with probability 60% and getting $0 with probability 40%". Each such question yields an inequality of the form , where p1,i and x1,i are the probabilites and sums in the preferred lottery, and p2,i and x2,i are the probabilities and sums in the other lottery. The advantage of choice questions is that they are easier to answer; a sequence of such questions can lead to the point of equivalence.

There are several problems with these procedures. [3]

First, they assume people weight events by their true (objective) probabilities, p1,i and p2,i. In fact, much evidence shows that people weight events by subjective probabilities . [14] In particular, people tend to overweight small probabilities and underweight medium to large probabilities (see Prospect theory). The non-linearity in the subjective probability may be confounded with the concavity of the utility function. For example, the person indifferent between the lotteries [100%: $10] and [60%:$20, 40%:0] can be modeled by a linear utility function, if we assume that he underweights the probability 60% to around 50%.

One way to avoid this confounding is using equal probabilities in all queries; this was done, for example, by Coombs and Komorita. [15] This trick works for non-configural weight theories, [16] which assume that the subjective probability is a function of the objective probability (that is, every objective probability is translated to a unique subjective probability). In this case, when all probabilities in the queries are equal, and they cancel out in the equations. The equations involve only the utilities, and we can again use them to infer the form of the utility function.

However, configural weight theories, [17] motivated by the Allais paradox, show that subjective probability may depend both on the objective probability and on the outcome. Kirby [3] presented a way to design the queries such that, for power-log utilities and negative-exponential utilities, the predictions do not depend on canceling subjective probabilities.

A second problem is that some experiments use both gains and losses. [15] [18] However, later research show that the concavity of the utility function may be different between gains and losses (see prospect theory and loss aversion). Combining gain and loss domains may yield an incorrect utility function.

A possible solution is to measure each of these two domains separately. [19] Eugene Galanter devised another solution, for both the first and the second problem. [6] [20] He conducted experiments in which no probabilities were used; instead, he asked questions such as "how much money would you need in order to feel twice as happy as $10"? If the answer is e.g. $18, then we get an equation such as , which gives information on the utility function, without and dependence on probabilities and risk attitudes. His experiments consistently showed that power functions better fit the data than log functions.

A third problem is that most experiments compare the relative fit of different utility models to the data. For example, they can show that power functions fit the data better than logarithmic functions, but cannot reject the hypothesis that power functions fit the data. Kirby [3] presented a novel experiment design, that allowed him to get point-predictions for each model separately. His experiments indicate that both power-log functions and negative-exponent functions do not fit the data. He leaves finding a better-fitting function as an open problem.

Assessing multi-attribute utility functions

A multi-attribute utility (MAU) function maps a bundle with two or more attributes (e.g. money and free time) to a number representing the subjective satisfaction from that bundle.

Assessing MAU is relevant even in conditions of certainty. For example, whereas most people prefer $12,000 for sure to $10,000 for sure, different people may have different preferences between the bundles ($10,000 salary, 8 work hours per day) and ($12,000 salary, 9 work hours per day), even when both bundles are certain. A procedure for assessing MAU in conditions of certainty is presented in Ordinal utility#assessment.

Assessing MAU in conditions of uncertainty is more complex; see multi-attribute utility#assessment for details.

In health

Assessement of MAU functions is particularly relevant in Health economics. It is often required to choose among different possible treatments, where each treatment has different attributes regarding life expectancy, life quality, safety, and cost. Following extensive surveys, MAU functions for health-related conditions were developed; see Quality-adjusted life year and EQ-5D#assessment. The most common method for assessing health-related utilities is time trade-off. To enable decision-making in the national level, a MAU function for health is constructed in the national level, as an "average" utility function of all patiens in the country.

The utility functions are usually normalized such that a utility of 1 means "full health", and a utility of 0 means "death". Negative utility functions are possible, for situations considered "worse than death". As an example, here is a description of a protocol for constructing a value-set for EQ-5D-Y (the EuroQol 5-Dimensional scale for Young people). [21] The construction is done in two steps: an online Discrete Choice Experiment (DCE) survey, and a face-to-face composite time-tradeoff interview (cTTO):

  1. The DCE step measures the relative importance of the five dimensions, but yields results on a latent scale, rather than normalized utility values. The recommended experimental design contains 10 blocks, with 15 pairs of health-states in each block (150 pairs overall). Each subject is given a single block (15 pairs). For each pair, the subject has to say which of these two states he prefers (e.g. "do you prefer 12333 or 31122?").
  2. The cTTO step is used to compute a normalized value to the anchor state, which is the state 33333: the state with the worst level (3) in all 5 dimensions. Each subject should answer 10 TTO queries. Each TTO query shows a pair of health-state+duration, and the subject has to say which of these he prefers. There are two kinds of queries: [21] :660,Fig.2
    • One kind assumes that the state is better than death. The questions look like: "what life do you prefer: 2 years in full health, or 10 years in state 33333"?.
    • The other kind assumes that the state is worse than death. The questions look like: "what life do you prefer: 2 years in full health, or 5 years in full health plus 10 years in state 33333"?.

In both steps, the subjects are adults, and they are asked to answer the queries from the point-of-view of a 10-years-old child. The reasons for asking adults, rather than children, were (1) adults are the taxpayers - they should decide how health budget is used; (2) there are queries about death, which may be inappropriate for children; (3) children may misunderstand the questions.

The above protocol was first executed in Slovenia as follows: [22]

See also

Related Research Articles

<span class="mw-page-title-main">Logarithm</span> Inverse of the exponential function

In mathematics, the logarithm is the inverse function to exponentiation. That means that the logarithm of a number x to the base b is the exponent to which b must be raised to produce x. For example, since 1000 = 103, the logarithm base 10 of 1000 is 3, or log10 (1000) = 3. The logarithm of x to base b is denoted as logb (x), or without parentheses, logbx, or even without the explicit base, log x, when no confusion is possible, or when the base does not matter such as in big O notation.

In economics, utility is a measure of the satisfaction that a certain person has from a certain state of the world. Over time, the term has been used in two different meanings.

<span class="mw-page-title-main">Prospect theory</span> Theory of behavioral economics and behavioral finance

Prospect theory is a theory of behavioral economics, judgment and decision making that was developed by Daniel Kahneman and Amos Tversky in 1979. The theory was cited in the decision to award Kahneman the 2002 Nobel Memorial Prize in Economics.

<span class="mw-page-title-main">St. Petersburg paradox</span> Paradox involving a game with repeated coin flipping

The St. Petersburg paradox or St. Petersburg lottery is a paradox involving the game of flipping a coin where the expected payoff of the theoretical lottery game approaches infinity but nevertheless seems to be worth only a very small amount to the participants. The St. Petersburg paradox is a situation where a naïve decision criterion that takes only the expected value into account predicts a course of action that presumably no actual person would be willing to take. Several resolutions to the paradox have been proposed, including the impossible amount of money a casino would need to continue the game indefinitely.

In computational complexity theory, a probabilistically checkable proof (PCP) is a type of proof that can be checked by a randomized algorithm using a bounded amount of randomness and reading a bounded number of bits of the proof. The algorithm is then required to accept correct proofs and reject incorrect proofs with very high probability. A standard proof, as used in the verifier-based definition of the complexity class NP, also satisfies these requirements, since the checking procedure deterministically reads the whole proof, always accepts correct proofs and rejects incorrect proofs. However, what makes them interesting is the existence of probabilistically checkable proofs that can be checked by reading only a few bits of the proof using randomness in an essential way.

The expected utility hypothesis is a foundational assumption in mathematical economics concerning decision making under uncertainty. It postulates that rational agents maximize utility, meaning the subjective desirability of their actions. Rational choice theory, a cornerstone of microeconomics, builds this postulate to model aggregate social behaviour.

In decision theory, subjective expected utility is the attractiveness of an economic opportunity as perceived by a decision-maker in the presence of risk. Characterizing the behavior of decision-makers as using subjective expected utility was promoted and axiomatized by L. J. Savage in 1954 following previous work by Ramsey and von Neumann. The theory of subjective expected utility combines two subjective concepts: first, a personal utility function, and second a personal probability distribution.

<span class="mw-page-title-main">Logarithmic distribution</span> Discrete probability distribution

In probability and statistics, the logarithmic distribution is a discrete probability distribution derived from the Maclaurin series expansion

<span class="mw-page-title-main">Cardinal utility</span> In contrast with ordinal utility, in economics

In economics, a cardinal utility function or scale is a utility index that preserves preference orderings uniquely up to positive affine transformations. Two utility indices are related by an affine transformation if for the value of one index u, occurring at any quantity of the goods bundle being evaluated, the corresponding value of the other index v satisfies a relationship of the form

Bayesian experimental design provides a general probability-theoretical framework from which other theories on experimental design can be derived. It is based on Bayesian inference to interpret the observations/data acquired during the experiment. This allows accounting for both any prior knowledge on the parameters to be determined as well as uncertainties in observations.

<span class="mw-page-title-main">Kelly criterion</span> Formula for bet sizing that maximizes the expected logarithmic value

In probability theory, the Kelly criterion is a formula for sizing a bet. The Kelly bet size is found by maximizing the expected value of the logarithm of wealth, which is equivalent to maximizing the expected geometric growth rate. It assumes that the expected returns are known and is optimal for a bettor who values their wealth logarithmically. J. L. Kelly Jr, a researcher at Bell Labs, described the criterion in 1956. Under the stated assumptions, the Kelly criterion leads to higher wealth than any other strategy in the long run.

The Allais paradox is a choice problem designed by Maurice Allais (1953) to show an inconsistency of actual observed choices with the predictions of expected utility theory. Rather than adhering to rationality, the Allais paradox proves that individuals rarely make rational decisions consistently when required to do so immediately. The independence axiom of expected utility theory, which requires that the preferences of an individual should not change when altering two lotteries by equal proportions, was proven to be violated by the paradox.

Stochastic dominance is a partial order between random variables. It is a form of stochastic ordering. The concept arises in decision theory and decision analysis in situations where one gamble can be ranked as superior to another gamble for a broad class of decision-makers. It is based on shared preferences regarding sets of possible outcomes and their associated probabilities. Only limited knowledge of preferences is required for determining dominance. Risk aversion is a factor only in second order stochastic dominance.

<span class="mw-page-title-main">Scoring rule</span> Measure for evaluating probabilistic forecasts

In decision theory, a scoring rule provides a summary measure for the evaluation of probabilistic predictions or forecasts. It is applicable to tasks in which predictions assign probabilities to events, i.e. one issues a probability distribution as prediction. This includes probabilistic classification of a set of mutually exclusive outcomes or classes.

<span class="mw-page-title-main">Isoelastic utility</span> Concept in economics

In economics, the isoelastic function for utility, also known as the isoelastic utility function, or power utility function, is used to express utility in terms of consumption or some other economic variable that a decision-maker is concerned with. The isoelastic utility function is a special case of hyperbolic absolute risk aversion and at the same time is the only class of utility functions with constant relative risk aversion, which is why it is also called the CRRA utility function.

In decision theory, the von Neumann–Morgenstern (VNM) utility theorem shows that, under certain axioms of rational behavior, a decision-maker faced with risky (probabilistic) outcomes of different choices will behave as if they are maximizing the expected value of some function defined over the potential outcomes at some specified point in the future. This function is known as the von Neumann–Morgenstern utility function. The theorem is the basis for expected utility theory.

In expected utility theory, a lottery is a discrete distribution of probability on a set of states of nature. The elements of a lottery correspond to the probabilities that each of the states of nature will occur, e.g.. Much of the theoretical analysis of choice under uncertainty involves characterizing the available choices in terms of lotteries.

Equitable (EQ) cake-cutting is a kind of a fair cake-cutting problem, in which the fairness criterion is equitability. It is a cake-allocation in which the subjective value of all partners is the same, i.e., each partner is equally happy with his/her share. Mathematically, that means that for all partners i and j:

In decision theory, a multi-attribute utility function is used to represent the preferences of an agent over bundles of goods either under conditions of certainty about the results of any potential choice, or under conditions of uncertainty.

In measure and probability theory in mathematics, a convex measure is a probability measure that — loosely put — does not assign more mass to any intermediate set "between" two measurable sets A and B than it does to A or B individually. There are multiple ways in which the comparison between the probabilities of A and B and the intermediate set can be made, leading to multiple definitions of convexity, such as log-concavity, harmonic convexity, and so on. The mathematician Christer Borell was a pioneer of the detailed study of convex measures on locally convex spaces in the 1970s.

References

  1. 1 2 Farquhar, Peter H. (November 1984). "State of the Art—Utility Assessment Methods". Management Science. 30 (11): 1283–1300. doi:10.1287/mnsc.30.11.1283.
  2. Alchian, Armen A. (1953). "The Meaning of Utility Measurement". The American Economic Review. 43 (1): 26–50. JSTOR   1810289.
  3. 1 2 3 4 Kirby, Kris N. (2011). "An empirical assessment of the form of utility functions". Journal of Experimental Psychology: Learning, Memory, and Cognition. 37 (2): 461–476. doi:10.1037/a0021968. PMID   21355669.
  4. Fechner, Gustav Theodor (1860). Elemente der Psychophysik (in German). Breitkopf u. Härtel.[ page needed ]
  5. Stevens, S. S. (1957). "On the psychophysical law". Psychological Review. 64 (3): 153–181. doi:10.1037/h0046162. PMID   13441853. ProQuest   614283678.
  6. 1 2 Galanter, Eugene (1962). "The Direct Measurement of Utility and Subjective Probability". The American Journal of Psychology. 75 (2): 208–220. doi:10.2307/1419604. JSTOR   1419604. PMID   13896303.
  7. Galanter, Eugene; Holman, Garvin L. (1967). "Some invariances of the isosensitivity function and their implications for the utility function of money". Journal of Experimental Psychology. 73 (3): 333–339. doi:10.1037/h0024275. PMID   6032534. ProQuest   614257044.
  8. Hamblin, Robert L; Clairmont, Donald H; Chadwick, Bruce A (March 1975). "Utility and gambling decisions: Experiments and equations". Social Science Research. 4 (1): 1–15. doi:10.1016/0049-089X(75)90016-2.
  9. Kornbrot, Diana E.; Donnelly, Michael; Galanter, Eugene (April 1981). "Estimates of utility function parameters from signal detection experiments". Journal of Experimental Psychology: Human Perception and Performance. 7 (2): 441–458. doi:10.1037/0096-1523.7.2.441. ProQuest   614275914.
  10. Parker, Scott; Stein, David; Darte, Ed; Schneider, Bruce; Popper, Richard; Needel, Steven (1981). "Utility Function for Money Determined Using Conjoint Measurement". The American Journal of Psychology. 94 (4): 563–573. doi:10.2307/1422419. JSTOR   1422419.
  11. O'Donoghue, Ted; Rabin, Matthew (November 2006). "Optimal sin taxes". Journal of Public Economics. 90 (10–11): 1825–1849. doi:10.1016/j.jpubeco.2006.03.001.
  12. Wakker, Peter P. (December 2008). "Explaining the characteristics of the power (CRRA) utility family". Health Economics. 17 (12): 1329–1344. doi:10.1002/hec.1331. PMID   18213676.
  13. Kale, Jivendra K. (2009). "Growth maximisation and downside protection using power-log utility functions for optimising portfolios with derivatives". International Journal of Computer Applications in Technology. 34 (4): 309. doi:10.1504/IJCAT.2009.024085.
  14. Preston, Malcolm G.; Baratta, Philip (1948). "An Experimental Study of the Auction-Value of an Uncertain Outcome". The American Journal of Psychology. 61 (2): 183–193. doi:10.2307/1416964. ISSN   0002-9556.
  15. 1 2 Coombs, C. H.; Komorita, S. S. (1958). "Measuring Utility of Money through Decisions". The American Journal of Psychology. 71 (2): 383–389. doi:10.2307/1420083. ISSN   0002-9556.
  16. Birnbaum, Michael H (1992-08-01). "Issues in utility measurement". Organizational Behavior and Human Decision Processes. Utility Measurement. 52 (3): 319–330. doi:10.1016/0749-5978(92)90024-2. ISSN   0749-5978.
  17. Birnbaum, Michael H. (2004-04-01). "Causes of Allais common consequence paradoxes: An experimental dissection". Journal of Mathematical Psychology. 48 (2): 87–106. doi:10.1016/j.jmp.2004.01.001. ISSN   0022-2496.
  18. Mosteller, Frederick; Nogee, Philip (1951-10-01). "An Experimental Measurement of Utility". Journal of Political Economy. 59 (5): 371–404. doi:10.1086/257106. ISSN   0022-3808.
  19. Hershey, John C.; Kunreuther, Howard C.; Schoemaker, Paul J. H. (August 1982). "Sources of Bias in Assessment Procedures for Utility Functions". Management Science. 28 (8): 936–954. doi:10.1287/mnsc.28.8.936. ISSN   0025-1909.
  20. Galanter, Eugene (1990). "Utility Functions for Nonmonetary Events". The American Journal of Psychology. 103 (4): 449–470. doi:10.2307/1423318. ISSN   0002-9556.
  21. 1 2 Ramos-Goñi, Juan M.; Oppe, Mark; Stolk, Elly; Shah, Koonal; Kreimeier, Simone; Rivero-Arias, Oliver; Devlin, Nancy (2020-07-01). "International Valuation Protocol for the EQ-5D-Y-3L". PharmacoEconomics. 38 (7): 653–663. doi: 10.1007/s40273-020-00909-3 . hdl: 11343/252049 . ISSN   1179-2027.
  22. Prevolnik Rupel, Valentina; Ogorevc, Marko; IMPACT HTA HRQoL Group (April 2021). "EQ-5D-Y Value Set for Slovenia". PharmacoEconomics. 39 (4): 463–471. doi:10.1007/s40273-020-00994-4. PMC   8009800 . PMID   33565048.
  23. Bose, Utpal; Davey, Anne M.; Olson, David L. (1997-12-01). "Multi-attribute utility methods in group decision making: Past applications and potential for inclusion in GDSS". Omega. 25 (6): 691–706. doi:10.1016/S0305-0483(97)00040-6. ISSN   0305-0483.