Stochastic transitivity

Last updated

Stochastic transitivity models [1] [2] [3] [4] are stochastic versions of the transitivity property of binary relations studied in mathematics. Several models of stochastic transitivity exist and have been used to describe the probabilities involved in experiments of paired comparisons, specifically in scenarios where transitivity is expected, however, empirical observations of the binary relation is probabilistic. For example, players' skills in a sport might be expected to be transitive, i.e. "if player A is better than B and B is better than C, then player A must be better than C"; however, in any given match, a weaker player might still end up winning with a positive probability. Tightly matched players might have a higher chance of observing this inversion while players with large differences in their skills might only see these inversions happen seldom. Stochastic transitivity models formalize such relations between the probabilities (e.g. of an outcome of a match) and the underlying transitive relation (e.g. the skills of the players).

Contents

A binary relation on a set is called transitive, in the standard non-stochastic sense, if and implies for all members of .

Stochastic versions of transitivity include:

  1. Weak Stochastic Transitivity (WST): and implies , for all ; [5] :12 [6] :43rg
  2. Strong Stochastic Transitivity (SST): and implies , for all ; [5] :12
  3. Linear Stochastic Transitivity (LST):, for all , where is some increasing and symmetric[ clarify ] function (called a comparison function), and is some mapping from the set of alternatives to the real line (called a merit function).

A toy example

The marble game - Assume two kids, Billy and Gabriela, collect marbles. Billy collects blue marbles and Gabriela green marbles. When they get together they play a game where they mix all their marbles in a bag and sample one randomly. If the sampled marble is green, then Gabriela wins and if it is blue then Billy wins. If is the number of blue marbles and is the number of green marbles in the bag, then the probability of Billy winning against Gabriela is

.

In this example, the marble game satisfies linear stochastic transitivity, where the comparison function is given by and the merit function is given by , where is the number of marbles of the player. This game happens to be an example of a Bradley–Terry model. [7]

Applications

Connections between models

Positive Results:

  1. Every model that satisfies Linear Stochastic Transitivity must also satisfy Strong Stochastic Transitivity, which in turn must satisfy Weak Stochastic Transitivity. This is represented as: LST SSTWST ;
  2. Since the Bradley-Terry models and Thurstone's Case V model [8] are LST models, they also satisfy SST and WST;
  3. Due to the convenience of more structured models[ clarify ], a few authors [1] [2] [3] [4] [20] [21] have identified axiomatic justifications[ clarify ] of linear stochastic transitivity (and other models), most notably Gérard Debreu showed that: [10] Quadruple Condition[ clarify ] + Continuity[ clarify ] LST (see also Debreu Theorems);
  4. Two LST models given by invertible comparison functions and are equivalent[ clarify ] if and only if for some [22]

Negative Results:

  1. Stochastic transitivity models are empirically unverifiable[ clarify ], [4] however, they may be falsifiable;
  2. Distinguishing[ clarify ] between LST comparison functions and can be impossible even if an infinite amount of data is provided over a finite number of points[ clarify ]; [23]
  3. The estimation problem[ clarify ] for WST, SST and LST models are in general NP-Hard, [24] however, near optimal polynomially computable estimation procedures are known for SST and LST models. [13] [14] [15]

See also

Related Research Articles

In mathematics, the Cantor set is a set of points lying on a single line segment that has a number of unintuitive properties. It was discovered in 1874 by Henry John Stephen Smith and mentioned by German mathematician Georg Cantor in 1883.

<span class="mw-page-title-main">Probability distribution</span> Mathematical function for the probability a given outcome occurs in an experiment

In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon in terms of its sample space and the probabilities of events.

<span class="mw-page-title-main">Markov property</span> Memoryless property of a stochastic process

In probability theory and statistics, the term Markov property refers to the memoryless property of a stochastic process, which means that its future evolution is independent of its history. It is named after the Russian mathematician Andrey Markov. The term strong Markov property is similar to the Markov property, except that the meaning of "present" is defined in terms of a random variable known as a stopping time.

In probability theory and related fields, Malliavin calculus is a set of mathematical techniques and ideas that extend the mathematical field of calculus of variations from deterministic functions to stochastic processes. In particular, it allows the computation of derivatives of random variables. Malliavin calculus is also called the stochastic calculus of variations. P. Malliavin first initiated the calculus on infinite dimensional space. Then, the significant contributors such as S. Kusuoka, D. Stroock, J-M. Bismut, Shinzo Watanabe, I. Shigekawa, and so on finally completed the foundations.

<span class="mw-page-title-main">Stopping time</span> Time at which a random variable stops exhibiting a behavior of interest

In probability theory, in particular in the study of stochastic processes, a stopping time is a specific type of “random time”: a random variable whose value is interpreted as the time at which a given stochastic process exhibits a certain behavior of interest. A stopping time is often defined by a stopping rule, a mechanism for deciding whether to continue or stop a process on the basis of the present position and past events, and which will almost always lead to a decision to stop at some finite time.

<span class="mw-page-title-main">Mixing (mathematics)</span> Mathematical description of mixing substances

In mathematics, mixing is an abstract concept originating from physics: the attempt to describe the irreversible thermodynamic process of mixing in the everyday world: e.g. mixing paint, mixing drinks, industrial mixing.

A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics and are used to model various behaviours of stochastic models such as stock prices, random growth models or physical systems that are subjected to thermal fluctuations.

In statistics and information theory, a maximum entropy probability distribution has entropy that is at least as great as that of all other members of a specified class of probability distributions. According to the principle of maximum entropy, if nothing is known about a distribution except that it belongs to a certain class, then the distribution with the largest entropy should be chosen as the least-informative default. The motivation is twofold: first, maximizing entropy minimizes the amount of prior information built into the distribution; second, many physical systems tend to move towards maximal entropy configurations over time.

In statistics and probability theory, a point process or point field is a collection of mathematical points randomly located on a mathematical space such as the real line or Euclidean space. Point processes can be used for spatial data analysis, which is of interest in such diverse disciplines as forestry, plant ecology, epidemiology, geography, seismology, materials science, astronomy, telecommunications, computational neuroscience, economics and others.

Stochastic dominance is a partial order between random variables. It is a form of stochastic ordering. The concept arises in decision theory and decision analysis in situations where one gamble can be ranked as superior to another gamble for a broad class of decision-makers. It is based on shared preferences regarding sets of possible outcomes and their associated probabilities. Only limited knowledge of preferences is required for determining dominance. Risk aversion is a factor only in second order stochastic dominance.

Pairwise comparison generally is any process of comparing entities in pairs to judge which of each entity is preferred, or has a greater amount of some quantitative property, or whether or not the two entities are identical. The method of pairwise comparison is used in the scientific study of preferences, attitudes, voting systems, social choice, public choice, requirements engineering and multiagent AI systems. In psychology literature, it is often referred to as paired comparison.

In probability theory, a Hunt process is a type of Markov process, named for mathematician Gilbert A. Hunt who first defined them in 1957. Hunt processes were important in the study of probabilistic potential theory until they were superseded by right processes in the 1970s.

In mathematics, the theory of optimal stopping or early stopping is concerned with the problem of choosing a time to take a particular action, in order to maximise an expected reward or minimise an expected cost. Optimal stopping problems can be found in areas of statistics, economics, and mathematical finance. A key example of an optimal stopping problem is the secretary problem. Optimal stopping problems can often be written in the form of a Bellman equation, and are therefore often solved using dynamic programming.

Robust optimization is a field of mathematical optimization theory that deals with optimization problems in which a certain measure of robustness is sought against uncertainty that can be represented as deterministic variability in the value of the parameters of the problem itself and/or its solution. It is related to, but often distinguished from, probabilistic optimization methods such as chance-constrained optimization.

In probability theory and statistics, a stochastic order quantifies the concept of one random variable being "bigger" than another. These are usually partial orders, so that one random variable may be neither stochastically greater than, less than nor equal to another random variable . Many different orders exist, which have different applications.

In mathematics and information theory, Sanov's theorem gives a bound on the probability of observing an atypical sequence of samples from a given probability distribution. In the language of large deviations theory, Sanov's theorem identifies the rate function for large deviations of the empirical measure of a sequence of i.i.d. random variables.

In stochastic analysis, a part of the mathematical theory of probability, a predictable process is a stochastic process whose value is knowable at a prior time. The predictable processes form the smallest class that is closed under taking limits of sequences and contains all adapted left-continuous processes.

In probability theory, a nonlinear expectation is a nonlinear generalization of the expectation. Nonlinear expectations are useful in utility theory as they more closely match human behavior than traditional expectations. The common use of nonlinear expectations is in assessing risks under uncertainty. Generally, nonlinear expectations are categorized into sub-linear and super-linear expectations dependent on the additive properties of the given sets. Much of the study of nonlinear expectation is attributed to work of mathematicians within the past two decades.

In probability theory, a McKean–Vlasov process is a stochastic process described by a stochastic differential equation where the coefficients of the diffusion depend on the distribution of the solution itself. The equations are a model for Vlasov equation and were first studied by Henry McKean in 1966. It is an example of propagation of chaos, in that it can be obtained as a limit of a mean-field system of interacting particles: as the number of particles tends to infinity, the interactions between any single particle and the rest of the pool will only depend on the particle itself.

In the theory of stochastic processes, a subdiscipline of probability theory, filtrations are totally ordered collections of subsets that are used to model the information that is available at a given point and therefore play an important role in the formalization of random (stochastic) processes.

References

  1. 1 2 Fishburn, Peter C. (November 1973). "Binary choice probabilities: on the varieties of stochastic transitivity". Journal of Mathematical Psychology. 10 (4): 327–352. doi:10.1016/0022-2496(73)90021-7. ISSN   0022-2496.
  2. 1 2 Clark, Stephen A. (March 1990). "A concept of stochastic transitivity for the random utility model". Journal of Mathematical Psychology. 34 (1): 95–108. doi:10.1016/0022-2496(90)90015-2.
  3. 1 2 3 Ryan, Matthew (2017-01-21). "Uncertainty and binary stochastic choice". Economic Theory. 65 (3): 629–662. doi:10.1007/s00199-017-1033-4. ISSN   0938-2259. S2CID   125420775.
  4. 1 2 3 Oliveira, I.F.D.; Zehavi, S.; Davidov, O. (August 2018). "Stochastic transitivity: Axioms and models". Journal of Mathematical Psychology. 85: 25–35. doi:10.1016/j.jmp.2018.06.002. ISSN   0022-2496.
  5. 1 2 Donald Davidson and Jacob Marschak (Jul 1958). Experimental tests of a stochastic decision theory (PDF) (Technical Report). Stanford University.
  6. Michel Regenwetter and Jason Dana and Clintin P. Davis-Stober (2011). "Transitivity of Preferences" (PDF). Psychological Review. 118 (1): 42–56. doi:10.1037/a0021150. PMID   21244185.
  7. Bradley, Ralph Allan; Terry, Milton E. (December 1952). "Rank Analysis of Incomplete Block Designs: I. The Method of Paired Comparisons". Biometrika. 39 (3/4): 324. doi:10.2307/2334029. JSTOR   2334029.
  8. 1 2 Thurstone, L. L. (1994). "A law of comparative judgment". Psychological Review. 101 (2): 266–270. doi:10.1037/0033-295X.101.2.266. ISSN   0033-295X.
  9. Luce, R. Duncan (Robert Duncan) (2005). Individual choice behavior : a theoretical analysis. Mineola, N.Y.: Dover Publications. ISBN   0486441369. OCLC   874031603.
  10. 1 2 Debreu, Gerard (July 1958). "Stochastic Choice and Cardinal Utility" (PDF). Econometrica. 26 (3): 440–444. doi:10.2307/1907622. ISSN   0012-9682. JSTOR   1907622.
  11. Regenwetter, Michel; Dana, Jason; Davis-Stober, Clintin P. (2011). "Transitivity of preferences". Psychological Review. 118 (1): 42–56. doi:10.1037/a0021150. ISSN   1939-1471. PMID   21244185.
  12. Cavagnaro, Daniel R.; Davis-Stober, Clintin P. (2014). "Transitive in our preferences, but transitive in different ways: An analysis of choice variability". Decision. 1 (2): 102–122. doi:10.1037/dec0000011. ISSN   2325-9973.
  13. 1 2 Shah, Nihar B.; Balakrishnan, Sivaraman; Guntuboyina, Adityanand; Wainwright, Martin J. (February 2017). "Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues". IEEE Transactions on Information Theory. 63 (2): 934–959. arXiv: 1510.05610 . doi: 10.1109/tit.2016.2634418 . ISSN   0018-9448.
  14. 1 2 Chatterjee, Sabyasachi; Mukherjee, Sumit (June 2019). "Estimation in Tournaments and Graphs Under Monotonicity Constraints". IEEE Transactions on Information Theory. 65 (6): 3525–3539. arXiv: 1603.04556 . doi:10.1109/tit.2019.2893911. ISSN   0018-9448. S2CID   54740089.
  15. 1 2 Oliveira, Ivo F.D.; Ailon, Nir; Davidov, Ori (2018). "A New and Flexible Approach to the Analysis of Paired Comparison Data". Journal of Machine Learning Research. 19: 1–29.
  16. Israel, Robert B. (December 1981). "Stronger Players Need not Win More Knockout Tournaments". Journal of the American Statistical Association. 76 (376): 950–951. doi:10.2307/2287594. ISSN   0162-1459. JSTOR   2287594.
  17. Chen, Robert; Hwang, F. K. (December 1988). "Stronger players win more balanced knockout tournaments". Graphs and Combinatorics. 4 (1): 95–99. doi:10.1007/bf01864157. ISSN   0911-0119. S2CID   44602228.
  18. Adler, Ilan; Cao, Yang; Karp, Richard; Peköz, Erol A.; Ross, Sheldon M. (December 2017). "Random Knockout Tournaments". Operations Research. 65 (6): 1589–1596. arXiv: 1612.04448 . doi:10.1287/opre.2017.1657. ISSN   0030-364X. S2CID   1041539.
  19. Sen, Amartya (January 1977). "Social Choice Theory: A Re-Examination". Econometrica. 45 (1): 53–89. doi:10.2307/1913287. ISSN   0012-9682. JSTOR   1913287.
  20. Blavatskyy, Pavlo R. (2007). Stochastic utility theorem. Inst. for Empirical Research in Economics. OCLC   255736997.
  21. Dagsvik, John K. (October 2015). "Stochastic models for risky choices: A comparison of different axiomatizations". Journal of Mathematical Economics. 60: 81–88. doi:10.1016/j.jmateco.2015.06.013. ISSN   0304-4068.
  22. Yellott, John I. (April 1977). "The relationship between Luce's Choice Axiom, Thurstone's Theory of Comparative Judgment, and the double exponential distribution". Journal of Mathematical Psychology. 15 (2): 109–144. doi:10.1016/0022-2496(77)90026-8. ISSN   0022-2496.
  23. Rockwell, Christina; Yellott, John I. (February 1979). "A note on equivalent Thurstone models". Journal of Mathematical Psychology. 19 (1): 65–71. doi:10.1016/0022-2496(79)90006-3. ISSN   0022-2496.
  24. deCani, John S. (December 1969). "Maximum Likelihood Paired Comparison Ranking by Linear Programming". Biometrika. 56 (3): 537–545. doi:10.2307/2334661. ISSN   0006-3444. JSTOR   2334661.