Melioration theory

Last updated

Melioration theory in behavioral psychology is a theoretical algorithm that predicts the matching law. [1] Melioration theory is used as an explanation for why an organism makes choices based on the rewards or reinforcers it receives. The principle of melioration states that animals will invest increasing amounts of time and/or effort into whichever alternative is better. To meliorate essentially means to "make better". [2]

Melioration theory accounts for many of the choices that organisms make when presented with two variable interval schedules. Melioration is a form of matching where the subject is constantly shifting its behavior from the poorer reinforcement schedule to the richer reinforcement schedule, until it is spending most of its time at the richest variable interval schedule. By matching, the subject is equalizing the price of the reinforcer they are working for. This is also called hyperbolic discounting. In making a choice between options, living organisms need not maximize expected payoff as classical economic theory posits. Rather than being aggregated, the options compete against one another based on differences in their local reinforcement rate. The organism continuously shifts from one alternative to the other, if one is better than the other, until the other is better than the first one, regardless of the effect on overall rate of reinforcement. Melioration is capable of accounting for behavior on both concurrent ratio and concurrent interval schedules.

Melioration Equation R1/B1 = R2/B2

If this ratio is not equal, the animal will shift its behavior to the alternative that currently has the higher response ratio. When the ratio is equal, the "cost" of each reinforcer is the same for both alternatives.

Melioration theory grew out of an impersonal anonymous interest in how the matching law comes to hold on. Richard J. Herrnstein (1961) reported that on concurrent VIVIVI reinforcement schedules, the proportion of responses to one alternative was approximately equal to the proportion of reinforcer received there. This finding is summarized in the matching law, which generated a great deal of both matching research and matching theorizing. Herrnstein (1970) suggested that matching may be a basic behavioral process, whereas Rachlin et al. (1976) suggested that matching comes about because it maximizes rate of matching reinforcement.

William Vaughan, Jr. (1976) suggested that the local rate of matching reinforcement on each reinforcement matching schedule is evaluated, and if those local rates differ, the distribution of time on a schedule is shifted from the poorer to the better schedule. On concurrent VIVIVI reinforcement schedules this process gives rise to matching, whereas on concurrent VRVRVR reinforcement schedules it gives rise to exclusive preferences for the better alternative and not the worse alternative. This rule was subsequently named Melioration (Herrnstein & Vaughan, 1980). See also Herrnstein, 1982, Vaughan, 1981; Vaughan & Herrnstein, 1987; Bland, Cowie, Podlesnik & Elliffe, 2018)

Related Research Articles

<span class="mw-page-title-main">B. F. Skinner</span> American psychologist and social philosopher (1904–1990)

Burrhus Frederic Skinner was an American psychologist, behaviorist, inventor, and social philosopher. Considered the father of Behaviorism, he was the Edgar Pierce Professor of Psychology at Harvard University from 1958 until his retirement in 1974.

Operant conditioning, also called instrumental conditioning, is a learning process where voluntary behaviors are modified by association with the addition of reward or aversive stimuli. The frequency or duration of the behavior may increase through reinforcement or decrease through punishment or extinction.

<span class="mw-page-title-main">Reinforcement</span> Consequence affecting an organisms future behavior

In behavioral psychology, reinforcement is any consequence that increases the likelihood of an organism's future behavior whenever that behavior is preceded by a particular antecedent stimulus. For example, a rat can be trained to push a lever to receive food whenever a light is turned on. In this example, the light is the antecedent stimulus, the lever pushing is the behavior, and the food is the reinforcement. Likewise, a student that receives attention and praise when answering a teacher's question will be more likely to answer future questions in class. The teacher's question is the antecedent, the student's response is the behavior, and the praise and attention are the reinforcements.

<span class="mw-page-title-main">Richard Herrnstein</span> American psychologist (1930–1994)

Richard Julius Herrnstein was an American psychologist at Harvard University. He was an active researcher in animal learning in the Skinnerian tradition. Herrnstein was the Edgar Pierce Professor of Psychology until his death, and previously chaired the Harvard Department of Psychology for five years. With political scientist Charles Murray, he co-wrote The Bell Curve, a controversial 1994 book on human intelligence. He was one of the founders of the Society for Quantitative Analysis of Behavior.

The experimental analysis of behavior is a science that studies the behavior of individuals across a variety of species. A key early scientist was B. F. Skinner who discovered operant behavior, reinforcers, secondary reinforcers, contingencies of reinforcement, stimulus control, shaping, intermittent schedules, discrimination, and generalization. A central method was the examination of functional relations between environment and behavior, as opposed to hypothetico-deductive learning theory that had grown up in the comparative psychology of the 1920–1950 period. Skinner's approach was characterized by observation of measurable behavior which could be predicted and controlled. It owed its early success to the effectiveness of Skinner's procedures of operant conditioning, both in the laboratory and in behavior therapy.

Behaviorism is a systematic approach to understand the behavior of humans and other animals. It assumes that behavior is either a reflex evoked by the pairing of certain antecedent stimuli in the environment, or a consequence of that individual's history, including especially reinforcement and punishment contingencies, together with the individual's current motivational state and controlling stimuli. Although behaviorists generally accept the important role of heredity in determining behavior, they focus primarily on environmental events. The cognitive revolution of the late 20th century largely replaced behaviorism as an explanatory theory with cognitive psychology, which unlike behaviorism examines internal mental states.

In economics, hyperbolic discounting is a time-inconsistent model of delay discounting. It is one of the cornerstones of behavioral economics and its brain-basis is actively being studied by neuroeconomics researchers.

The law of effect is a psychology principle advanced by Edward Thorndike in 1898 on the matter of behavioral conditioning which states that "responses that produce a satisfying effect in a particular situation become more likely to occur again in that situation, and responses that produce a discomforting effect become less likely to occur again in that situation."

In operant conditioning, the matching law is a quantitative relationship that holds between the relative rates of response and the relative rates of reinforcement in concurrent schedules of reinforcement. For example, if two response alternatives A and B are offered to an organism, the ratio of response rates to A and B equals the ratio of reinforcements yielded by each response. This law applies fairly well when non-human subjects are exposed to concurrent variable interval schedules ; its applicability in other situations is less clear, depending on the assumptions made and the details of the experimental situation. The generality of applicability of the matching law is subject of current debate.

In ecology, an ideal free distribution (IFD) is a theoretical way in which a population's individuals distribute themselves among several patches of resources within their environment, in order to minimize resource competition and maximize fitness. The theory states that the number of individual animals that will aggregate in various patches is proportional to the amount of resources available in each. For example, if patch A contains twice as many resources as patch B, there will be twice as many individuals foraging in patch A as in patch B.

In psychology, a social trap is a conflict of interest or perverse incentive where individuals or a group of people act to obtain short-term individual gains, which in the long run leads to a loss for the group as a whole. Social traps are the cause of countless environmental issues, including overfishing, energy "brownout" and "blackout" power outages during periods of extreme temperatures, the overgrazing of cattle on the Sahelian Desert, the destruction of the rainforest by logging interests and agriculture, and, most importantly, climate change.

Behavioral momentum is a theory in quantitative analysis of behavior and is a behavioral metaphor based on physical momentum. It describes the general relation between resistance to change and the rate of reinforcement obtained in a given situation.

The scalar timing or scalar expectancy theory (SET) is a model of the processes that govern behavior controlled by time. The model posits an internal clock, and particular memory and decision processes. SET is one of the most important models of animal timing behavior.

Allen Neuringer is an American psychologist. He is a highly published and well regarded scientist in the field of the experimental analysis of behavior, as pioneered by B.F. Skinner. His areas of research include human volition studies, the generation of randomness in organisms, self-experimentation, and many other areas. He received his B.A. at Columbia College in 1962, and his PhD from Harvard University in 1967. He served on National Institute of Health (NIH) and National Science Foundation (NSF) committees, received numerous awards and grants for his research, and has published widely. As of June 2008, Neuringer retired as a professor of psychology at Reed College.

Quantitative analysis of behavior is the application of mathematical models--conceptualized from the robust corpus of environment-behavior-consequence interactions in published behavioral science--to the experimental analysis of behavior. The aim is to describe and/or predict relations between varying levels of independent environmental variables and dependent behavioral variables. The parameters in the models hopefully have theoretical meaning beyond their use in fitting models to data. The field was founded by Richard Herrnstein (1961) when he introduced the matching law to quantify the behavior of organisms working on concurrent schedules of reinforcement.

In behaviorism, rate of reinforcement is number of reinforcements per time, usually per minute. Symbol of this rate is usually Rf. Its first major exponent was B.F. Skinner (1939). It is used in the Matching Law.

In behaviorism, rate of response is a ratio between two measurements with different units. Rate of responding is the number of responses per minute, or some other time unit. It is usually written as R. Its first major exponent was B.F. Skinner (1939). It is used in the Matching Law.

In behavioral psychology, stimulus control is a phenomenon in operant conditioning that occurs when an organism behaves in one way in the presence of a given stimulus and another way in its absence. A stimulus that modifies behavior in this manner is either a discriminative stimulus or stimulus delta. For example, the presence of a stop sign at a traffic intersection alerts the driver to stop driving and increases the probability that braking behavior occurs. Stimulus control does not force behavior to occur, as it is a direct result of historical reinforcement contingencies, as opposed to reflexive behavior elicited through classical conditioning.

Discrimination learning is defined in psychology as the ability to respond differently to different stimuli. This type of learning is used in studies regarding operant and classical conditioning. Operant conditioning involves the modification of a behavior by means of reinforcement or punishment. In this way, a discriminative stimulus will act as an indicator to when a behavior will persist and when it will not. Classical conditioning involves learning through association when two stimuli are paired together repeatedly. This conditioning demonstrates discrimination through specific micro-instances of reinforcement and non-reinforcement. This phenomenon is considered to be more advanced than learning styles such as generalization and yet simultaneously acts as a basic unit to learning as a whole. The complex and fundamental nature of discrimination learning allows for psychologists and researchers to perform more in-depth research that supports psychological advancements. Research on the basic principles underlying this learning style has their roots in neuropsychology sub-processes.

George W. Ainslie is an American psychiatrist, psychologist and behavioral economist. He is chief Psychiatrist at the Veterans Affairs Medical Center in Coatesville, Pennsylvania and Clinical Professor of Psychiatry at Temple University School of Medicine.

References

Footnotes
  1. Vaughan and Herrnstein (1980)
  2. Mazur, James E. Learning and Behavior (6th ed.) Upper Saddle River NJ: 2006 p. 332-335
Sources