Matching law

Last updated

In operant conditioning, the matching law is a quantitative relationship that holds between the relative rates of response and the relative rates of reinforcement in concurrent schedules of reinforcement. For example, if two response alternatives A and B are offered to an organism, the ratio of response rates to A and B equals the ratio of reinforcements yielded by each response. [1] This law applies fairly well when non-human subjects are exposed to concurrent variable interval schedules (but see below); its applicability in other situations is less clear, depending on the assumptions made and the details of the experimental situation. The generality of applicability of the matching law is subject of current debate. [2]

Contents

The matching law can be applied to situations involving a single response maintained by a single schedule of reinforcement if one assumes that alternative responses are always available to an organism, maintained by uncontrolled "extraneous" reinforcers. For example, an animal pressing a lever for food might pause for a drink of water.

The matching law was first formulated by R.J. Herrnstein (1961) following an experiment with pigeons on concurrent variable interval schedules. [3] Pigeons were presented with two buttons in a Skinner box, each of which led to varying rates of food reward. The pigeons tended to peck the button that yielded the greater food reward more often than the other button, and the ratio of their rates to the two buttons matched the ratio of their rates of reward on the two buttons.

Mathematical statement

If R1 and R2 are the rate of responses on two schedules that yield obtained (as distinct from programmed) rates of reinforcement Rf1 and Rf2, the strict matching law holds that the relative response rate R1 / (R1 + R2) matches, that is, equals, the relative reinforcement rate Rf1 / (Rf1 + Rf2). That is,

This relationship can also be stated in terms of response and reinforcement ratios:

Alternatively stated, it states that there exists a constant for an individual animal, such that

for any . That is, for an individual animal, the rate of response is proportional to rate of reinforcement for any task.

Deviations from matching, and the generalized matching law

A recent review by McDowell reveals that Herrnstein's original equation fails to accurately describe concurrent-schedule data under a substantial range of conditions. Three deviations from matching have been observed: undermatching, overmatching, and bias. Undermatching means that the response proportions are less extreme than the law predicts. Undermatching can happen if subjects too often switch between the two response options, a tendency that may be strengthened by reinforcers that happen to occur just after a subject switches. A changeover delay may be used to reduce the effectiveness of such post-switch reinforcers; typically, this is a 1.5 second interval after a switch when no reinforcer is presented. Overmatching is the opposite of undermatching, and is less common. Here the subjects response proportions are more extreme than reinforcement proportions. Overmatching may occur if there is a penalty for switching. A final deviation is bias, which occurs when subjects spend more time on one alternative than the matching equation predicts. This may happen if a subject prefers a certain environment, area in a laboratory, or method of responding.

These failures of the matching law have led to the development of the "generalized matching law", which has parameters that reflect the deviations just described. The power law was first shown to fit operant choice data by Staddon (1968) [4] and was generalized by Baum (1974). [5] It has been found to fit a wide variety of matching data. The power law was shown by Mackay (1963) [6] to be derivable from input and output logarithmic function and psychophysical and other behavioral data fitting this model were described by Staddon (1975). [7]

This is more conveniently expressed in logarithmic form

The constants b and s are referred to as "bias" and "sensitivity" respectively. "Bias" reflects any tendency the subject may have to prefer one response over the other. "Sensitivity" reflects the degree to which the reinforcement ratio actually impacts the choice ratio. When this equation is plotted, the result is straight line; sensitivity changes the slope and bias changes the intercept of this line.

The generalized matching law accounts for high proportions of the variance in most experiments on concurrent variable interval schedules in non-humans. Values of b often depend on details of the experiment set up, but values of s are consistently found to be around 0.8, whereas the value required for strict matching would be 1.0. [8] [9] The concurrent VI VI choice situation involves strong negative feedbacks: the longer the subject refrains from responding to an alternative, the higher his probability of payoff: switching is encouraged. Hinson and Staddon (1983) [10] showed that power-law matching always results no matter what the balance between a transient tendency to "stay" vs. a growing tendency to switch after each reinforcement. Hence the molar law is compatible with many molecular choice processes.

Processes underlying the distribution of responses

There are three ideas on how humans and animals maximize reinforcement, molecular maximizing, molar maximizing and melioration.

Theoretical importance

The matching law is theoretically important for several reasons. First, it offers a simple quantification of behavior that can be applied to a number of situations. Secondly, offers a lawful account of choice. As Herrnstein (1970) expressed it, under an operant analysis, choice is nothing but behavior set into the context of other behavior. [11] The matching law thus challenges the idea that choice is an unpredictable outcome of free will, just as B.F. Skinner and others have argued. [12] However this challenge becomes serious only if it applies to human behavior, as well as to the behavior of pigeons and other animals. When human participants perform under concurrent schedules of reinforcement, matching has been observed in some experiments, [13] but wide deviations from matching have been found in others. [14] Finally, if nothing else, the matching law is important because it has generated a great deal of research that has widened our understanding of operant control.

Relevance to psychopathology

The matching law, and the generalized matching law, have helped behavior analysts to understand some complex human behaviors, especially the behavior of children in certain conflict situations. [15] [16] James Snyder and colleague have found that response matching predicts the use of conflict tactics by children and parents during conflict bouts. [17] This matching rate predicts future arrests. Even children's use of deviant talk appears to follow a matching pattern. [16]

Notes

  1. Poling, A., Edwards, T. L., Weeden, M., & Foster, T. (2011). The matching law. Psychological Record, 61(2), 313-322.
  2. Simon, C., & Baum, W. M. (2017). Allocation of Speech in Conversation. Journal of Experimental Analysis of Behavior, 107.
  3. Herrnstein, R.J. (1961). Relative and absolute strength of responses as a function of frequency of reinforcement. Journal of the Experimental Analysis of Behaviour, 4, 267–72.
  4. Staddon, J. E. R. (1968). Spaced responding and choice: A preliminary analysis. Journal of the Experimental Analysis of Behavior, 11, 669-682.
  5. Baum, W. M. (1974) On two types of deviation from the matching law: bias and undermatching. J. Exp. Anal. Behav. 22(1), 231-242.
  6. MacKay, D. M. Psychophysics of perceived intensity:A theoretical basis for Fechner's and Stevens' laws. Science, 1963, 139, 1213-1216.
  7. Staddon, J. E. R. (1978). Theory of behavioral power functions. Psychological Review, 85, 305-320.
  8. Baum, W.M. (1974). On two types of deviation from the matching law: Bias and undermatching. Journal of the Experimental Analysis of Behavior, 22, 231–42.
  9. Davison, M. & McCarthy, D. (1988). The matching law: A research review. Hillsdale, NJ: Erlbaum.
  10. Hinson, J. M., & Staddon, J. E. R. (1983). Matching, maximizing and hill climbing. Journal of the Experimental Analysis of Behavior, 40, 321-31.
  11. Herrnstein, R.J. (1970). On the law of effect. Journal of the Experimental Analysis of Behavior, 13, 243–66.
  12. Skinner, B. F. (1971)Beyond Freedom and Dignity, New Yori:Knopf
  13. Bradshaw, C.M.; Szabadi, E. & Bevan, P. (1976). Behavior of humans in variable-interval schedules of reinforcement Journal of the Experimental Analysis of Behavior, 26, 135–41.
  14. Horne, P.J. & Lowe, C.F. (1993). Determinants of human performance on concurrent schedules. Journal of the Experimental Analysis of Behavior, 59, 29–60. doi : 10.1901/jeab.1993.59-29
  15. Strand, P.S. (2001) Momentum, Matching, and Meaning: Toward a Fuller Exploitation of Operant Principles. The Behavior Analyst Today, 2(3), 170–84
  16. 1 2 James Snyder, Mike Stoolmiller, Gerald R. Patterson, Lynn Schrepferman, Jessica Oeser, Kassy Johnson, and Dana Soetaert (2003): The Application of Response Allocation Matching to Understanding Risk Mechanisms in Development: The Case of Young Children's Deviant Talk and Play, and Risk for Early-Onset Antisocial Behavior. The Behavior Analyst Today, 4(4), 435–45 "Archived copy" (PDF). Archived from the original (PDF) on 2011-08-24. Retrieved 2011-11-17.{{cite web}}: CS1 maint: archived copy as title (link)
  17. Snyder, J.J. & Patterson, G.R. (1995). Individualized differences in social aggression: A test of the reinforcement hypothesis in the natural environment. Behavior Therapy, 26, 371–91.

Related Research Articles

<span class="mw-page-title-main">B. F. Skinner</span> American psychologist and social philosopher (1904–1990)

Burrhus Frederic Skinner was an American psychologist, behaviorist, author, inventor, and social philosopher. He was a professor of psychology at Harvard University from 1958 until his retirement in 1974.

Operant conditioning is a type of associative learning process through which the strength of a behavior is modified by reinforcement or punishment. It is also a procedure that is used to bring about such learning.

In behavioral psychology, reinforcement is a consequence applied that will strengthen an organism's future behavior whenever that behavior is preceded by a specific antecedent stimulus. This strengthening effect may be measured as a higher frequency of behavior, longer duration, greater magnitude, or shorter latency.

Radical behaviorism is a "philosophy of the science of behavior" developed by B. F. Skinner. It refers to the philosophy behind behavior analysis, and is to be distinguished from methodological behaviorism—which has an intense emphasis on observable behaviors—by its inclusion of thinking, feeling, and other private events in the analysis of human and animal psychology. The research in behavior analysis is called the experimental analysis of behavior and the application of the field is called applied behavior analysis (ABA), which was originally termed "behavior modification."

<span class="mw-page-title-main">Richard Herrnstein</span> American psychologist

Richard Julius Herrnstein was an American psychologist at Harvard University. He was an active researcher in animal learning in the Skinnerian tradition. Herrnstein was the Edgar Pierce Professor of Psychology until his death, and previously chaired the Harvard Department of Psychology for five years. With political scientist Charles Murray, he co-wrote The Bell Curve, a controversial 1994 book on human intelligence. He was one of the founders of the Society for Quantitative Analysis of Behavior.

The experimental analysis of behavior is school of thought in psychology founded on B. F. Skinner's philosophy of radical behaviorism and defines the basic principles used in applied behavior analysis. A central principle was the inductive reasoning data-driven examination of functional relations, as opposed to the kinds of hypothetico-deductive learning theory that had grown up in the comparative psychology of the 1920–1950 period. Skinner's approach was characterized by observation of measurable behavior which could be predicted and controlled. It owed its early success to the effectiveness of Skinner's procedures of operant conditioning, both in the laboratory and in behavior therapy.

<span class="mw-page-title-main">Behaviorism</span> Systematic approach to understanding the behavior of humans and other animals

Behaviorism is a systematic approach to understanding the behavior of humans and animals. It assumes that behavior is either a reflex evoked by the pairing of certain antecedent stimuli in the environment, or a consequence of that individual's history, including especially reinforcement and punishment contingencies, together with the individual's current motivational state and controlling stimuli. Although behaviorists generally accept the important role of heredity in determining behavior, they focus primarily on environmental events.

The law of effect is a psychology principle advanced by Edward Thorndike in 1898 on the matter of behavioral conditioning which states that "responses that produce a satisfying effect in a particular situation become more likely to occur again in that situation, and responses that produce a discomforting effect become less likely to occur again in that situation."

John Eric Rayner Staddon is a British-born American psychologist. He has been a critic of Skinnerian behaviorism and proposed a theoretically-based "New Behaviorism".

In ecology, an ideal free distribution (IFD) is a theoretical way in which a population's individuals distribute themselves among several patches of resources within their environment, in order to minimize resource competition and maximize fitness. The theory states that the number of individual animals that will aggregate in various patches is proportional to the amount of resources available in each. For example, if patch A contains twice as many resources as patch B, there will be twice as many individuals foraging in patch A as in patch B.

Behavioral momentum is a theory in quantitative analysis of behavior and is a behavioral metaphor based on physical momentum. It describes the general relation between resistance to change and the rate of reinforcement obtained in a given situation.

Melioration theory in behavioral psychology is a theoretical algorithm that predicts the matching law. Melioration theory is used as an explanation for why an organism makes choices based on the rewards or reinforcers it receives. The principle of melioration states that animals will invest increasing amounts of time and/or effort into whichever alternative is better. To meliorate essentially means to "make better".

The scalar timing or scalar expectancy theory (SET) is a model of the processes that govern behavior controlled by time. The model posits an internal clock, and particular memory and decision processes. SET is one of the most important models of animal timing behavior.

Quantitative analysis of behavior is the application of mathematical models to the experimental analysis of behavior. The aim is to describe and/or predict relations between independent environmental variables and dependent behavioral variables. The parameters in the models hopefully have theoretical meaning beyond their use in fitting models to data. The field was founded by Richard Herrnstein (1961) when he introduced the matching law to quantify the behavior of organisms working on concurrent schedules of reinforcement. The field has integrated models from economics, zoology, philosophy, and other branches of psychology, especially mathematical psychology of which it is a branch. The field is represented by the Society for Quantitative Analysis of Behavior. Quantitative analysis of behavior addresses the following topics among others: behavioral economics, behavioral momentum, connectionist systems or neural networks, integration, hyperbolic discounting including the delay reduction hypothesis, foraging, hunting, errorless learning, creativity, learning, and the Rescorla-Wagner model, matching law, melioration, scalar expectancy, signal detection, neural hysteresis, and reinforcement control.

In behaviorism, rate of reinforcement is number of reinforcements per time, usually per minute. Symbol of this rate is usually Rf. Its first major exponent was B.F. Skinner (1939). It is used in the Matching Law.

In behaviorism, rate of response is a ratio between two measurements with different units. Rate of responding is the number of responses per minute, or some other time unit. It is usually written as R. Its first major exponent was B.F. Skinner (1939). It is used in the Matching Law.

In behavioral psychology, stimulus control is a phenomenon in operant conditioning that occurs when an organism behaves in one way in the presence of a given stimulus and another way in its absence. A stimulus that modifies behavior in this manner is either a discriminative stimulus (Sd) or stimulus delta (S-delta). Stimulus-based control of behavior occurs when the presence or absence of an Sd or S-delta controls the performance of a particular behavior. For example, the presence of a stop sign (S-delta) at a traffic intersection alerts the driver to stop driving and increases the probability that "braking" behavior will occur. Such behavior is said to be emitted because it does not force the behavior to occur since stimulus control is a direct result of historical reinforcement contingencies, as opposed to reflexive behavior that is said to be elicited through respondent conditioning.

<span class="mw-page-title-main">Charles Ferster</span>

Charles Bohris Ferster was an American behavioral psychologist. A pioneer of applied behavior analysis, he developed errorless learning and was a colleague of B.F. Skinner's at Harvard University, co-authoring the book Schedules of Reinforcement (1957).

The mathematical principles of reinforcement (MPR) constitute of a set of mathematical equations set forth by Peter Killeen and his colleagues attempting to describe and predict the most fundamental aspects of behavior.

Self-administration is, in its medical sense, the process of a subject administering a pharmacological substance to themself. A clinical example of this is the subcutaneous "self-injection" of insulin by a diabetic patient.

References