Bayesian inference in motor learning

Last updated

Bayesian inference is a statistical tool that can be applied to motor learning , specifically to adaptation. Adaptation is a short-term learning process involving gradual improvement in performance in response to a change in sensory information. Bayesian inference is used to describe the way the nervous system combines this sensory information with prior knowledge to estimate the position or other characteristics of something in the environment. Bayesian inference can also be used to show how information from multiple senses (e.g. visual and proprioception) can be combined for the same purpose. In either case, Bayesian inference dictates that the estimate is most influenced by whichever information is most certain.

Contents

Example: integrating prior knowledge with sensory information in tennis

Using Bayesian inference to combine prior and sensory information to estimate the position of a tennis ball Bayesian tennis court.png
Using Bayesian inference to combine prior and sensory information to estimate the position of a tennis ball

A person uses Bayesian inference to create an estimate that is a weighted combination of his current sensory information and his previous knowledge, or prior. This can be illustrated by decisions made in a tennis match. [1] If someone plays against a familiar opponent who likes to serve such that the ball strikes on the sideline, one's prior would lead one to place the racket above the sideline to return the serve. However, when one sees the ball moving, it may appear that it will land closer to the middle of the court. Rather than completely following this sensory information or completely following the prior, one would move the racket to a location between the sideline (suggested by the prior) and the point where her eyes indicate the ball will land.

Another key part of Bayesian inference is that the estimate will be closer to the physical state suggested by sensory information if the senses are more accurate and will be closer to the state of the prior if the sensory information is more uncertain than the prior. Extending this to the tennis example, a player facing an opponent for the first time would have little certainty in his/her previous knowledge of the opponent and would therefore have an estimate weighted more heavily on visual information concerning ball position. Alternatively, if one were familiar with one's opponent but were playing in foggy or dark conditions that would hamper sight, sensory information would be less certain and one's estimate would rely more heavily on previous knowledge.

Statistical overview

Bayes' theorem states

In the language of Bayesian statistics, , or probability of A given B, is called the posterior, while and are the likelihood and the prior probabilities, respectively. [2] is a constant scaling factor which allows the posterior to be between zero and one. Translating this into the language of motor learning, the prior represents previous knowledge about the physical state of the thing being observed, the likelihood is sensory information used to update the prior, and the posterior is the nervous system's estimate of the physical state. Therefore, for adaptation, Bayes' Theorem can be expressed as

estimate = (previous knowledge × sensory information)/scaling factor

The 3 terms in the equation above are all probability distributions. To find the estimate in non-probabilistic terms, a weighted sum can be used.

where is the estimate, is sensory information, is previous knowledge, and the weighting factors and are the variances of and , respectively. Variance is a measure of uncertainty in a variable, so the above equation indicates that higher uncertainty in sensory information causes previous knowledge to have more influence on the estimate and vice versa.

More rigorous mathematical Bayesian descriptions are available here and here.

Reaching

Many motor tasks exhibit adaptation to new sensory information. Bayesian inference has been most commonly studied in reaching.

Integrating a prior with current sensory information

(A) The cursor is represented by one dot with an exact location. (B) The location of the cursor is less certain, because it is somewhere within the cloud of dots. One exact data point (A), versus data drawn randomly from a normal distribution (B).jpg
(A) The cursor is represented by one dot with an exact location. (B) The location of the cursor is less certain, because it is somewhere within the cloud of dots.

Adaptation studies often involve a person reaching for a target without seeing either the target or his/her hand. Instead, the hand is represented by a cursor on a computer screen, which they must move over the target on the screen. In some cases, the cursor is shifted a small distance away from the actual hand position to test how the person responds to changes in visual feedback. [3] [4] A person learns to counteract this shifts by moving his/her hand an equal and opposite distance from the shift and still moves the cursor to the target, meaning he has developed a prior for this specific shift. When the cursor is then shifted a new, different distance from the same person's reaching hand, the person reaction is consistent with Bayesian inference; the hand moves a distance that is between the old shift (prior) and the new shift (sensory information). [3]

If, for the new shift, the cursor is a large cloud of dots instead of one dot (as shown in the figure), the person's sensory information is less clear and will have less influence on how he reacts than the prior will. This supports the Bayesian idea that sensory information with more certainty will have greater influence on a person's adaptation to shifted sensory feedback.

This form of adaptation holds true only when the shift is small compared to the distance the person has to reach to hit the target. A person reaching for a target 15 cm away would adapt to a 2 cm shift of the cursor in a Bayesian way. [4] However, if the target were only 5 cm away, a 2 cm shift cursor position (visual information) would be recognized and the person would realize the visual information does not accurately show hand position. Instead, the person would rely on proprioception and prior knowledge to move the hand to the target.

Humans also adapt to changing forces when reaching. [5] When a force field a person reaches through changes slightly, he modifies his force to maintain reaching in a straight line based partly on a prior force which had been applied earlier. He relies more on the prior if the prior shift is less variable (more certain).

Integrating information from multiple senses

Bayesian inference can also be applied to the way humans combine information about changes in their environment from multiple senses without any consideration of prior knowledge. The two senses that have the strongest influence on how humans' reaching adaptations are vision and proprioception. Typically, proprioception has more weight than vision for adapting hand position in depth – the direction moving toward or away from the person reaching – and vision has more weight in the vertical and horizontal directions. [6] However, changing conditions can alter the relative influence of these two senses. For example, the influence of vision on adapting hand depth is increased when the hand is passive, while proprioception has more influence when the hand is moving. [6] Moreover, when vision is reduced (e.g. in darkness) proprioception has more influence on determining hand position. [7] This result is consistent with Bayesian inference; when one sense becomes more uncertain, humans increase their reliance on another sense.

Posture

Additionally, Bayesian inference has been found to play a part in adaptation of postural control. In one study, for example, subjects use a Wii Balance Board to do a surfing task in which they must move a cursor representing their center of pressure (COP) on a screen. [8] The Wii surfer got visual information about his/her COP from clouds of dots similar to the one shown in the reaching section. With larger clouds, the surfers were more uncertain and less able to move the COP to the target on the screen. While this result is consistent with Bayesian inference, Bayesian mathematical models did not provide the best predictions of COP motion, perhaps because moving the COP accurately is more mechanically difficult than reaching. Therefore, the extent to which postural movement can be described by Bayesian inference is not yet clear.

Gait

Adaptation to shifted feedback also occurs during walking and running. People walking with each foot on a different treadmill belt can adapt their step length when one belt begins moving faster than the other. [9] Additionally, runners are able to alter their maximum ground reaction force and leg acceleration when they see a graph of peak leg acceleration. [10] However, to date, no studies have determined if humans adapt their gates using Bayesian inference or not.

Possible contradictions to Bayesian inference

Some adaptation studies do not support the application of Bayesian inference to motor learning. One study of reaching in a force field found that, rather than being influenced by a prior developed over hundreds of previous reaches, adaptation to subsequent reaches is only influenced by recent memories. [11] People reaching in the force field adapted to shifts in the amount of force exerted on the arm, but this adaptation was only affected by the shift in force of the immediately previous reach, not by a well-developed prior knowledge of shifts that had occurred throughout the previous trials of the experiment. This appears to conflict with the application of Bayesian inference to adaptation, but Bayesian adaptation proponents have argued that this particular study required each participant to do only 600 reaches, which is not enough to develop a prior. [5] In reaching studies that show evidence of Bayesian inference, participants typically perform 900 reaches or more. [3] [4] This indicates that, while Bayesian inference is used in adaptation, it is limited in that much previous experience is necessary to develop an influential prior.

See also

Related Research Articles

<span class="mw-page-title-main">Perception</span> Interpretation of sensory information

Perception is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous system, which in turn result from physical or chemical stimulation of the sensory system. Vision involves light striking the retina of the eye; smell is mediated by odor molecules; and hearing involves pressure waves.

<span class="mw-page-title-main">Raven paradox</span> Paradox arising from the question of what constitutes evidence for a statement

The raven paradox, also known as Hempel's paradox, Hempel's ravens, or rarely the paradox of indoor ornithology, is a paradox arising from the question of what constitutes evidence for the truth of a statement. Observing objects that are neither black nor ravens may formally increase the likelihood that all ravens are black even though, intuitively, these observations are unrelated.

Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, and especially in mathematical statistics. Bayesian updating is particularly important in the dynamic analysis of a sequence of data. Bayesian inference has found application in a wide range of activities, including science, engineering, philosophy, medicine, sport, and law. In the philosophy of decision theory, Bayesian inference is closely related to subjective probability, often called "Bayesian probability".

A Bayesian network is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bayesian networks are ideal for taking an event that occurred and predicting the likelihood that any one of several possible known causes was the contributing factor. For example, a Bayesian network could represent the probabilistic relationships between diseases and symptoms. Given symptoms, the network can be used to compute the probabilities of the presence of various diseases.

Bayesian statistics is a theory in the field of statistics based on the Bayesian interpretation of probability where probability expresses a degree of belief in an event. The degree of belief may be based on prior knowledge about the event, such as the results of previous experiments, or on personal beliefs about the event. This differs from a number of other interpretations of probability, such as the frequentist interpretation that views probability as the limit of the relative frequency of an event after many trials.

A prior probability distribution of an uncertain quantity, often simply called the prior, is its assumed probability distribution before some evidence is taken into account. For example, the prior could be the probability distribution representing the relative proportions of voters who will vote for a particular politician in a future election. The unknown quantity may be a parameter of the model or a latent variable rather than an observable variable.

<span class="mw-page-title-main">Doomsday argument</span> Doomsday scenario on human births

The Doomsday Argument (DA), or Carter catastrophe, is a probabilistic argument that claims to predict the future population of the human species based on an estimation of the number of humans born to date.

Multisensory integration, also known as multimodal integration, is the study of how information from the different sensory modalities may be integrated by the nervous system. A coherent representation of objects combining modalities enables animals to have meaningful perceptual experiences. Indeed, multisensory integration is central to adaptive behavior because it allows animals to perceive a world of coherent perceptual entities. Multisensory integration also deals with how different sensory modalities interact with one another and alter each other's processing.

The Self-Indication Assumption Doomsday argument rebuttal is an objection to the Doomsday argument by arguing that the chance of being born is not one, but is an increasing function of the number of people who will be born.

<span class="mw-page-title-main">Romberg's test</span> Test used in an exam of neurological function for balance

Romberg's test, Romberg's sign, or the Romberg maneuver is a test used in an exam of neurological function for balance. The exam is based on the premise that a person requires at least two of the three following senses to maintain balance while standing: proprioception ; vestibular function ; and vision.

In Bayesian statistics, a credible interval is an interval within which an unobserved parameter value falls with a particular probability. It is an interval in the domain of a posterior probability distribution or a predictive distribution. The generalisation to multivariate problems is the credible region.

The size–weight illusion, also known as the Charpentier illusion, is named after the French physician Augustin Charpentier because he was the first to demonstrate the illusion experimentally. It is also called De Moor's illusion, named after Belgian physician Jean Demoor (1867-1941).

Approximate Bayesian computation (ABC) constitutes a class of computational methods rooted in Bayesian statistics that can be used to estimate the posterior distributions of model parameters.

Bayesian approaches to brain function investigate the capacity of the nervous system to operate in situations of uncertainty in a fashion that is close to the optimal prescribed by Bayesian statistics. This term is used in behavioural sciences and neuroscience and studies associated with this term often strive to explain the brain's cognitive abilities based on statistical principles. It is frequently assumed that the nervous system maintains internal probabilistic models that are updated by neural processing of sensory information using methods approximating those of Bayesian probability.

<span class="mw-page-title-main">Proprioception</span> Sense of self-movement, force, and body position

Proprioception, also referred to as kinaesthesia, is the sense of self-movement, force, and body position. It is sometimes described as the "sixth sense".

<span class="mw-page-title-main">Motor program</span> Abstract representation of movement

A motor program is an abstract metaphor of the central organization of movement and control of the many degrees of freedom involved in performing an action.p. 182 Signals transmitted through efferent and afferent pathways allow the central nervous system to anticipate, plan or guide movement. Evidence for the concept of motor programs include the following:p. 182

<span class="mw-page-title-main">Bayesian inference in marketing</span>

In marketing, Bayesian inference allows for decision making and market research evaluation under uncertainty and with limited data.

The free energy principle is a mathematical principle in biophysics and cognitive science. It describes a formal account of the representational capacities of physical systems: that is, why things that exist look as if they track properties of the systems to which they are coupled. It establishes that the dynamics of physical systems minimise a quantity known as surprisal ; or equivalently, its variational upper bound, called free energy. The principle is formally related to variational Bayesian methods and was originally introduced by Karl Friston as an explanation for embodied perception-action loops in neuroscience, where it is also known as active inference.

<span class="mw-page-title-main">Domain adaptation</span> Field associated with machine learning and transfer learning

Domain adaptation is a field associated with machine learning and transfer learning. This scenario arises when we aim at learning from a source data distribution a well performing model on a different target data distribution. For instance, one of the tasks of the common spam filtering problem consists in adapting a model from one user to a new user who receives significantly different emails. Domain adaptation has also been shown to be beneficial for learning unrelated sources. Note that, when more than one source distribution is available the problem is referred to as multi-source domain adaptation.

Probabilistic numerics is an active field of study at the intersection of applied mathematics, statistics, and machine learning centering on the concept of uncertainty in computation. In probabilistic numerics, tasks in numerical analysis such as finding numerical solutions for integration, linear algebra, optimization and simulation and differential equations are seen as problems of statistical, probabilistic, or Bayesian inference.

References

  1. Körding, K. P., & Wolpert, Daniel M. (2006). Bayesian decision theory in sensorimotor control. Trends in Cognitive Sciences (Vol. 10, pp. 319–326).
  2. Lee, PM. (2004). Bayesian Statistics: An Introduction. Oxford University Press: London.
  3. 1 2 3 Körding, K. P., & Wolpert, Daniel M. (2004). Bayesian integration in sensorimotor learning. Nature 427:244-7.
  4. 1 2 3 Wei, K., Kording, K. (2009). Relevance of error: what drives motor adaptation? J Neurophysiol 101:655-64.
  5. 1 2 Kording, K.P., Ku, S., Wolpert, D.M. (2004). Bayesian Integration in force estimation. J Neurophysiol 92:3161-5.
  6. 1 2 Van Beers, R.J., Wolpert, D.M., Haggard, P. (2002). When feeling is more important than seeing in sensorimotor adaptation. Current Biology 12:834-7.
  7. Plooy, A., Tresilian, J.R., Mon-Williams, M., Wann, J.P. (1998). The contribution of vision and proprioception to judgements of finger proximity. Experimental Brain Research 118(3):415-20.
  8. Stevenson, I.H., Fernades, H.L., Vilares, I., Wei, K., Kording, K.P. (2009). Bayesian integration and non-linear feedback control in a full-body motor task. PLoS Comp Biol 5(12):1-9.
  9. Vasudevan, E.V., Bastian, A.J. (2010). Split-belt treadmill adaptation shows different functional networks for fast and slow human walking. J Neurophysiol 103:183-191.
  10. Crowell, H.P., Milner, C.E., Hamill, J., Davis, I.S. (2010). J Orthopaedic & Sports Physical Therapy 40(4):206-13.
  11. Scheidt, R.A., Dingwell, J.B., Mussa-Ivaldi, F.A. (2001). Learning to move amid uncertainty. J Neurophysiol 86:971-85.