Structural information theory

Last updated

Structural information theory (SIT) is a theory about human perception and in particular about visual perceptual organization, which is a neuro-cognitive process. It has been applied to a wide range of research topics, [1] mostly in visual form perception but also in, for instance, visual ergonomics, data visualization, and music perception.

Contents

SIT began as a quantitative model of visual pattern classification. Nowadays, it includes quantitative models of symmetry perception and amodal completion, and is theoretically sustained by a perceptually adequate formalization of visual regularity, a quantitative account of viewpoint dependencies, and a powerful form of neurocomputation. [2] SIT has been argued to be the best defined and most successful extension of Gestalt ideas. [3] It is the only Gestalt approach providing a formal calculus that generates plausible perceptual interpretations.

The simplicity principle

A simplest code is a code with minimum information load, that is, a code that enables a reconstruction of the stimulus using a minimum number of descriptive parameters. Such a code is obtained by capturing a maximum amount of visual regularity and yields a hierarchical organization of the stimulus in terms of wholes and parts.

The assumption that the visual system prefers simplest interpretations is called the simplicity principle. [4] Historically, the simplicity principle is an information-theoretical translation of the Gestalt law of Prägnanz, [5] which was inspired by the natural tendency of physical systems to settle into relatively stable states defined by a minimum of free-energy. Furthermore, just as the later-proposed minimum description length principle in algorithmic information theory (AIT), a.k.a. the theory of Kolmogorov complexity, it can be seen as a formalization of Occam's Razor, according to which the simplest interpretation of data is the best one.

Simplicity versus likelihood

Crucial to the latter finding is the distinction between, and integration of, viewpoint-independent and viewpoint-dependent factors in vision, as proposed in SIT's empirically successful model of amodal completion. [6] In the Bayesian framework, these factors correspond to prior probabilities and conditional probabilities, respectively. In SIT's model, however, both factors are quantified in terms of complexities, that is, complexities of objects and of their spatial relationships, respectively. [7] [8] [9]

Modeling principles

In SIT's formal coding model, candidate interpretations of a stimulus are represented by symbol strings, in which identical symbols refer to identical perceptual primitives (e.g., blobs or edges). Every substring of such a string represents a spatially contiguous part of an interpretation, so that the entire string can be read as a reconstruction recipe for the interpretation and, thereby, for the stimulus. These strings then are encoded (i.e., they are searched for visual regularities) to find the interpretation with the simplest code.

This encoding is performed by way of symbol manipulation, which, in psychology, has led to critical statements of the sort of "SIT assumes that the brain performs symbol manipulation". Such statements, however, fall in the same category as statements such as "physics assumes that nature applies formulas such as Einstein's E=mc2 or Newton's F=ma" and "DST models assume that dynamic systems apply differential equations".

Visual regularity

To obtain simplest codes, SIT applies coding rules that capture the kinds of regularity called iteration, symmetry, and alternation. These have been shown to be the only regularities that satisfy the formal criteria of (a) being holographic regularities that (b) allow for hierarchically transparent codes. [10]

A crucial difference with respect to the traditionally considered transformational formalization of visual regularity is that, holographically, mirror symmetry is composed of many relationships between symmetry pairs rather than one relationship between symmetry halves. Whereas the transformational characterization may be suited better for object recognition, the holographic characterization seems more consistent with the buildup of mental representations in object perception.

The perceptual relevance of the criteria of holography and transparency has been verified in the holographic approach to visual regularity. [11] It also explains that the detectability of mirror symmetries and Glass pattens in the presence of noise follows a psychophysical law that improves on Weber's law. [12]

See also

Related Research Articles

<span class="mw-page-title-main">Philosophy of perception</span> Branch of philosophy

The philosophy of perception is concerned with the nature of perceptual experience and the status of perceptual data, in particular how they relate to beliefs about, or knowledge of, the world. Any explicit account of perception requires a commitment to one of a variety of ontological or metaphysical views. Philosophers distinguish internalist accounts, which assume that perceptions of objects, and knowledge or beliefs about them, are aspects of an individual's mind, and externalist accounts, which state that they constitute real aspects of the world external to the individual. The position of naïve realism—the 'everyday' impression of physical objects constituting what is perceived—is to some extent contradicted by the occurrence of perceptual illusions and hallucinations and the relativity of perceptual experience as well as certain insights in science. Realist conceptions include phenomenalism and direct and indirect realism. Anti-realist conceptions include idealism and skepticism. Recent philosophical work have expanded on the philosophical features of perception by going beyond the single paradigm of vision.

<span class="mw-page-title-main">Perception</span> Interpretation of sensory information

Perception is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous system, which in turn result from physical or chemical stimulation of the sensory system. Vision involves light striking the retina of the eye; smell is mediated by odor molecules; and hearing involves pressure waves.

<span class="mw-page-title-main">Optical illusion</span> Visually perceived images that differ from objective reality

In visual perception, an optical illusion is an illusion caused by the visual system and characterized by a visual percept that arguably appears to differ from reality. Illusions come in a wide variety; their categorization is difficult because the underlying cause is often not clear but a classification proposed by Richard Gregory is useful as an orientation. According to that, there are three main classes: physical, physiological, and cognitive illusions, and in each class there are four kinds: Ambiguities, distortions, paradoxes, and fictions. A classical example for a physical distortion would be the apparent bending of a stick half immerged in water; an example for a physiological paradox is the motion aftereffect. An example for a physiological fiction is an afterimage. Three typical cognitive distortions are the Ponzo, Poggendorff, and Müller-Lyer illusion. Physical illusions are caused by the physical environment, e.g. by the optical properties of water. Physiological illusions arise in the eye or the visual pathway, e.g. from the effects of excessive stimulation of a specific receptor type. Cognitive visual illusions are the result of unconscious inferences and are perhaps those most widely known.

Gestalt psychology, gestaltism, or configurationism is a school of psychology and a theory of perception that emphasises the processing of entire patterns and configurations, and not merely individual components. It emerged in the early twentieth century in Austria and Germany as a rejection of basic principles of Wilhelm Wundt's and Edward Titchener's elementalist and structuralist psychology.

<span class="mw-page-title-main">Phi phenomenon</span> Optical illusion of apparent motion

The term phi phenomenon is used in a narrow sense for an apparent motion that is observed if two nearby optical stimuli are presented in alternation with a relatively high frequency. In contrast to beta movement, seen at lower frequencies, the stimuli themselves do not appear to move. Instead, a diffuse, amorphous shadowlike something seems to jump in front of the stimuli and occlude them temporarily. This shadow seems to have nearly the color of the background. Max Wertheimer first described this form of apparent movement in his habilitation thesis, published 1912, marking the birth of Gestalt psychology.

The consciousness and binding problem is the problem of how objects, background and abstract or emotional features are combined into a single experience.

<span class="mw-page-title-main">Figure–ground (perception)</span> Humans ability to separate foreground from background in visual images

Figure–ground organization is a type of perceptual grouping that is a vital necessity for recognizing objects through vision. In Gestalt psychology it is known as identifying a figure from the background. For example, black words on a printed paper are seen as the "figure", and the white sheet as the "background".

Holonomic brain theory is a branch of neuroscience investigating the idea that human consciousness is formed by quantum effects in or between brain cells. Holonomic refers to representations in a Hilbert phase space defined by both spectral and space-time coordinates. Holonomic brain theory is opposed by traditional neuroscience, which investigates the brain's behavior by looking at patterns of neurons and the surrounding chemistry.

Amodal perception is the perception of the whole of a physical structure when only parts of it affect the sensory receptors. For example, a table will be perceived as a complete volumetric structure even if only part of it—the facing surface—projects to the retina; it is perceived as possessing internal volume and hidden rear surfaces despite the fact that only the near surfaces are exposed to view. Similarly, the world around us is perceived as a surrounding plenum, even though only part of it is in view at any time. Another much quoted example is that of the "dog behind a picket fence" in which a long narrow object is partially occluded by fence-posts in front of it, but is nevertheless perceived as a single continuous object. Albert Bregman noted an auditory analogue of this phenomenon: when a melody is interrupted by bursts of white noise, it is nonetheless heard as a single melody continuing "behind" the bursts of noise.

<span class="mw-page-title-main">Ehrenstein illusion</span> Optical illusion

The Ehrenstein illusion is an optical illusion of brightness or colour perception. The visual phenomena was studied by the German psychologist Walter H. Ehrenstein (1899–1961) who originally wanted to modify the theory behind the Hermann grid illusion. In the discovery of the optical illusion, Ehrenstein found that grating patterns of straight lines that stop at a certain point appear to have a brighter centre, compared to the background.

Visual perception is the ability to interpret the surrounding environment through photopic vision, color vision, scotopic vision, and mesopic vision, using light in the visible spectrum reflected by objects in the environment. This is different from visual acuity, which refers to how clearly a person sees. A person can have problems with visual perceptual processing even if they have 20/20 vision.

Scene statistics is a discipline within the field of perception. It is concerned with the statistical regularities related to scenes. It is based on the premise that a perceptual system is designed to interpret scenes.

The principles of grouping are a set of principles in psychology, first proposed by Gestalt psychologists to account for the observation that humans naturally perceive objects as organized patterns and objects, a principle known as Prägnanz. Gestalt psychologists argued that these principles exist because the mind has an innate disposition to perceive patterns in the stimulus based on certain rules. These principles are organized into five categories: Proximity, Similarity, Continuity, Closure, and Connectedness.

<span class="mw-page-title-main">Watercolor illusion</span> Optical illusion in which a white area takes on a pale tint

The watercolor illusion, also referred to as the water-color effect, is an optical illusion in which a white area takes on a pale tint of a thin, bright, intensely colored polygon surrounding it if the coloured polygon is itself surrounded by a thin, darker border. The inner and outer borders of watercolor illusion objects often are of complementary colours. The watercolor illusion is best when the inner and outer contours have chromaticities in opposite directions in color space. The most common complementary pair is orange and purple. The watercolor illusion is dependent on the combination of luminance and color contrast of the contour lines in order to have the color spreading effect occur.

Rainer Mausfeld is a retired German professor of psychology at Kiel University. He did research on the psychology of perception, cognitive science, and the history of psychology. Since 2015, he has published on manipulation in media and politics and the transformation of representative democracy to neoliberal elite democracy.

<span class="mw-page-title-main">Amodal completion</span>

Amodal completion is the ability to see an entire object despite parts of it being covered by another object in front of it. It is one of the many functions of the visual system which aid in both seeing and understanding objects encountered on an everyday basis. This mechanism allows the world to be perceived as though it is made of coherent wholes. For example, when the sun sets over the horizon it is still perceived as a full circle, despite occlusion causing it to appear as a semi-circle. Another example of this is a cat behind a picket fence. Amodal completion allows the cats to be seen as a full animal continuing behind each picket of the fence. Essentially amodal completion allows for sensory stimulation from any parts of an occluded object we can not directly see.

<span class="mw-page-title-main">Kurt Koffka</span> German psychologist and professor (1886–1941)

Kurt Koffka was a German psychologist and professor. He was born and educated in Berlin, Germany; he died in Northampton, Massachusetts, from coronary thrombosis. He was influenced by his maternal uncle, a biologist, to pursue science. He had many interests including visual perception, brain damage, sound localization, developmental psychology, and experimental psychology. He worked alongside Max Wertheimer and Wolfgang Köhler to develop Gestalt psychology. Koffka had several publications including "The Growth of the Mind: An Introduction to Child Psychology" (1924) and "The Principles of Gestalt Psychology" (1935) which elaborated on his research.

Ensemble coding, also known as ensemble perception or summary representation, is a theory in cognitive neuroscience about the internal representation of groups of objects in the human mind. Ensemble coding proposes that such information is recorded via summary statistics, particularly the average or variance. Experimental evidence tends to support the theory for low-level visual information, such as shapes and sizes, as well as some high-level features such as face gender. Nonetheless, it remains unclear the extent to which ensemble coding applies to high-level or non-visual stimuli, and the theory remains the subject of active research.

<span class="mw-page-title-main">Michael Kubovy</span>

Michael Kubovy is an Israeli American psychologist known for his work on the psychology of perception and psychology of art.

Johan Wagemans is a Belgian experimental psychologist. He is a full professor at the KU Leuven in Leuven (Belgium). He directs a long-term Methusalem project that focuses upon the psychology and neuroscience of visual perception and most recently art perception.

References

  1. Leeuwenberg, E. L. J. & van der Helm, P. A. (2013). Structural information theory: The simplicity of visual form. Cambridge, UK: Cambridge University Press.
  2. van der Helm, P. A. (2014). Simplicity in vision: A multidisciplinary account of perceptual organization. Cambridge, UK: Cambridge University Press.
  3. Palmer, S. E. (1999). Vision science: Photons to phenomenology. Cambridge, MA: MIT Press.
  4. Hochberg, J. E., & McAlister, E. (1953). A quantitative approach to figural "goodness". Journal of Experimental Psychology, 46, 361—364.
  5. Koffka, K. (1935). Principles of gestalt psychology. London: Routledge & Kegan Paul.
  6. van Lier, R. J., van der Helm, P. A., & Leeuwenberg, E. L. J. (1994). Integrating global and local aspects of visual occlusion. Perception, 23, 883—903. doi : 10.1068/p230883.
  7. Ungerleider, L. G., & Mishkin, M. (1982). Two cortical visual systems. In D. J. Ingle, M. A. Goodale, & R. J. W. Mansfield (Eds.), Analysis of Visual Behavior (pp. 549—586). Cambridge, MA: MIT Press.
  8. von Helmholtz, H. L. F. (1962). Treatise on Physiological Optics (J. P. C. Southall, Trans.). New York: Dover. (Original work published 1909)
  9. van der Helm, P. A. (2000). Simplicity versus likelihood in visual perception: From surprisals to precisals. Psychological Bulletin, 126, 770—800. doi : 10.1037/0033-2909.126.5.770.
  10. van der Helm, P. A., & Leeuwenberg, E. L. J. (1991). Accessibility, a criterion for regularity and hierarchy in visual pattern codes. Journal of Mathematical Psychology, 35, 151—213. doi : 10.1016/0022-2496(91)90025-O.
  11. van der Helm, P. A., & Leeuwenberg, E. L. J. (1996). Goodness of visual regularities: A nontransformational approach. Psychological Review, 103, 429—456. doi : 10.1037/0033-295X.103.3.429.
  12. van der Helm, P. A. (2010). Weber-Fechner behaviour in symmetry perception? Attention, Perception, & Psychophysics, 72, 1854—1864. doi : 10.3758/APP.72.7.1854.