The scalar timing or scalar expectancy theory (SET) is a model of the processes that govern behavior controlled by time. The model posits an internal clock, and particular memory and decision processes. [1] SET is one of the most important models of animal timing behavior. [2]
John Gibbon originally proposed SET to explain the temporally controlled behavior of non-human subjects. [1] He initially used the model to account for a pattern of behavior seen in animals that are being reinforced at fixed-intervals, for example every 2 minutes. [3] An animal that is well trained on such a fixed-interval schedule pauses after each reinforcement and then suddenly starts responding about two-thirds of the way through the new interval. (See operant conditioning) The model explains how the animal's behavior is controlled by time in this manner. [1] Gibbon and others later elaborated the model and applied it to a variety of other timing phenomena.
SET assumes that the animal has a clock, a working memory, a reference memory, and a decision process. The clock contains a discrete pacemaker that generates pulses like the ticks a mechanical clock. A stimulus that signals the start of a timed interval closes a switch, allowing pulses to enter an accumulator. The resulting accumulation of pulses represents elapsed time, and this time value is continuously sent to a working memory. When reinforcement happens at the end of the timed interval, the time value is stored in a long-term reference memory. This time-to-reinforcement in reference memory represents the expected time to reinforcement.
Key to the SET model is the decision process that controls timing behavior. While the animal is timing some interval it continually compares the current time (stored in working memory) to the expected time (stored in reference memory). Specifically, the animal continually samples from its memory of past times at which reinforcement occurred and compares this memory sample with the current time on its clock. When the two values are close to one another the animal responds; when they are far enough apart, the animal stops responding. To make this comparison, it computes the ratio of the two values; when the ratio is less than a certain value it responds, when the ratio is larger it does not respond.
By using a ratio of current time to expected time, rather than, for example, simply subtracting one from the other, SET accounts for a key observation about animal and human timing. That is, timing precision is relative to the size of the interval being timed [4] (See Accuracy and precision). This is the "scalar" property that gives the model its name. For example, when timing a 10 sec interval an animal might be precise to within 1 sec, whereas when timing a 100 sec interval the animal would be precise to only about 10 sec. Thus time perception is like the perception of lights, sounds, and other sensory events, where precision is also relative to the size (brightness, loudness, etc.) of the percept being judged. (See Weber-Fechner law.)
A number of alternative models of timing have appeared over the years. These include Killeen’s Behavioral Theory of timing (BeT) model [5] and Machado’s learning-to-time (LeT) model. [6]
Moreover, there are some evidence that this property might not be valid in all ranges of durations. [7] Additionally John Staddon argues that SET is inconsistent on explaining the location of temporal indifference point in temporal bisection procedure. [8]
In 1993, John Wearden claimed that human behavior exhibits appropriate scalar properties, as was indicated by experiments on internal production with concurrent chronometric counting. [9] However, human timing behavior is undoubtedly more varied than animal timing behavior. A major factor responsible for this variability is attentional allocation. [9] [10]
Burrhus Frederic Skinner was an American psychologist, behaviorist, inventor, and social philosopher. He was the Edgar Pierce Professor of Psychology at Harvard University from 1958 until his retirement in 1974.
Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions in a dynamic environment in order to maximize a reward signal. Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning.
Operant conditioning, also called instrumental conditioning, is a learning process where voluntary behaviors are modified by association with the addition of reward or aversive stimuli. The frequency or duration of the behavior may increase through reinforcement or decrease through punishment or extinction.
Classical conditioning is a behavioral procedure in which a biologically potent stimulus is paired with a neutral stimulus. The term classical conditioning refers to the process of an automatic, conditioned response that is paired with a specific stimulus.
In behavioral psychology, reinforcement refers to consequences that increase the likelihood of an organism's future behavior, typically in the presence of a particular antecedent stimulus. For example, a rat can be trained to push a lever to receive food whenever a light is turned on. In this example, the light is the antecedent stimulus, the lever pushing is the operant behavior, and the food is the reinforcer. Likewise, a student that receives attention and praise when answering a teacher's question will be more likely to answer future questions in class. The teacher's question is the antecedent, the student's response is the behavior, and the praise and attention are the reinforcements.
Social learning theory is a theory of social behavior that proposes that new behaviors can be acquired by observing and imitating others. It states that learning is a cognitive process that takes place in a social context and can occur purely through observation or direct instruction, even in the absence of motor reproduction or direct reinforcement. In addition to the observation of behavior, learning also occurs through the observation of rewards and punishments, a process known as vicarious reinforcement. When a particular behavior is rewarded regularly, it will most likely persist; conversely, if a particular behavior is constantly punished, it will most likely desist. The theory expands on traditional behavioral theories, in which behavior is governed solely by reinforcements, by placing emphasis on the important roles of various internal processes in the learning individual. Albert Bandura is known for studying this theory.
Animal cognition encompasses the mental capacities of non-human animals, including insect cognition. The study of animal conditioning and learning used in this field was developed from comparative psychology. It has also been strongly influenced by research in ethology, behavioral ecology, and evolutionary psychology; the alternative name cognitive ethology is sometimes used. Many behaviors associated with the term animal intelligence are also subsumed within animal cognition.
Behaviorism is a systematic approach to understand the behavior of humans and other animals. It assumes that behavior is either a reflex elicited by the pairing of certain antecedent stimuli in the environment, or a consequence of that individual's history, including especially reinforcement and punishment contingencies, together with the individual's current motivational state and controlling stimuli. Although behaviorists generally accept the important role of heredity in determining behavior, they focus primarily on environmental events. The cognitive revolution of the late 20th century largely replaced behaviorism as an explanatory theory with cognitive psychology, which unlike behaviorism views internal mental states as explanations for observable behavior.
Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods.
Animal training is the act of teaching animals specific responses to specific conditions or stimuli. Training may be for purposes such as companionship, detection, protection, and entertainment. The type of training an animal receives will vary depending on the training method used, and the purpose for training the animal. For example, a seeing eye dog will be trained to achieve a different goal than a wild animal in a circus.
Extinction is a behavioral phenomenon observed in both operantly conditioned and classically conditioned behavior, which manifests itself by fading of non-reinforced conditioned response over time. When operant behavior that has been previously reinforced no longer produces reinforcing consequences the behavior gradually stops occurring. In classical conditioning, when a conditioned stimulus is presented alone, so that it no longer predicts the coming of the unconditioned stimulus, conditioned responding gradually stops. For example, after Pavlov's dog was conditioned to salivate at the sound of a metronome, it eventually stopped salivating to the metronome after the metronome had been sounded repeatedly but no food came. Many anxiety disorders such as post traumatic stress disorder are believed to reflect, at least in part, a failure to extinguish conditioned fear.
In psychology and neuroscience, time perception or chronoception is the subjective experience, or sense, of time, which is measured by someone's own perception of the duration of the indefinite and unfolding of events. The perceived time interval between two successive events is referred to as perceived duration. Though directly experiencing or understanding another person's perception of time is not possible, perception can be objectively studied and inferred through a number of scientific experiments. Some temporal illusions help to expose the underlying neural mechanisms of time perception.
Neural coding is a neuroscience field concerned with characterising the hypothetical relationship between the stimulus and the neuronal responses, and the relationship among the electrical activities of the neurons in the ensemble. Based on the theory that sensory and other information is represented in the brain by networks of neurons, it is believed that neurons can encode both digital and analog information.
Melioration theory in behavioral psychology is a theoretical algorithm that predicts the matching law. Melioration theory is used as an explanation for why an organism makes choices based on the rewards or reinforcers it receives. The principle of melioration states that animals will invest increasing amounts of time and/or effort into whichever alternative is better. To meliorate essentially means to "make better".
Quantitative analysis of behavior is the application of mathematical models--conceptualized from the robust corpus of environment-behavior-consequence interactions in published behavioral science--to the experimental analysis of behavior. The aim is to describe and/or predict relations between varying levels of independent environmental variables and dependent behavioral variables. The parameters in the models hopefully have theoretical meaning beyond their use in fitting models to data. The field was founded by Richard Herrnstein (1961) when he introduced the matching law to quantify the behavior of organisms working on concurrent schedules of reinforcement.
In behavioral psychology, stimulus control is a phenomenon in operant conditioning that occurs when an organism behaves in one way in the presence of a given stimulus and another way in its absence. A stimulus that modifies behavior in this manner is either a discriminative stimulus or stimulus delta. For example, the presence of a stop sign at a traffic intersection alerts the driver to stop driving and increases the probability that braking behavior occurs. Stimulus control does not force behavior to occur, as it is a direct result of historical reinforcement contingencies, as opposed to reflexive behavior elicited through classical conditioning.
The mathematical principles of reinforcement (MPR) constitute of a set of mathematical equations set forth by Peter Killeen and his colleagues attempting to describe and predict the most fundamental aspects of behavior.
Peter Richard Killeen is an American psychologist who has made major contributions to a number of fields in the behavioral sciences. He has been one of the few premier contributors in quantitative analysis of behavior, and memory.
Abram Amsel was a Canadian-born American psychologist and faculty member at several universities. A member of the National Academy of Sciences and a Fellow of the American Association for the Advancement of Science, he conducted influential research into concepts of reward and nonreward in learning and behavior. Later, his research shifted from learning theory to neurobiology.
Theoretical behaviorism is a framework for psychology proposed by J. E. R. Staddon as an extension of experimental psychologist B. F. Skinner's radical behaviorism. It originated at Harvard in the early 1960s.