Visual crowding is the inability to view a target stimulus distinctly when presented in a clutter. Crowding impairs the ability to discriminate object features and contours among flankers, which in turn impairs people's ability to respond appropriately to the target stimulus. [1]
An operational definition of crowding explains what crowding is and how it is different from similar effects such as masking, lateral interaction and surround suppression; effects that make the target more challenging to see as well. There are different criteria that are used to differentiate crowding from these other effects. Firstly, crowding makes it difficult to identify an object but not detecting it among the clutter. [2] [3] [4] Crowded objects are collectively perceived to have high contrast, but they remain indistinct. The eccentricity of the target and the distance between the target and flankers influence crowding. As the distance between the target and the flankers' increases at a given eccentricity the ability to detect the target also improves as the eccentricity of a target is increased the more it pops out from the flankers and the more easily it is identified. [5]
Crowding is anisotropic, which means it has different values when measured in different directions. Radially positioned flankers make it harder to identify the target than tangentially positioned ones. [6] Crowding is stronger in the upper field of the four quadrants than the lower ones. [7] A recent study tells us that crowding is intense where the distractor and the target are in the same visual field than when they are in separate visual fields despite equal retinal distance. [8] Crowding is also asymmetrical meaning that a single flanker at an eccentric locus higher than the target makes it harder to identify the target than the single flanker at an eccentric locus closer to the fovea. [5] Crowding is not just a spatial phenomenon it happens over time as well, when a target is moving it is found to be more crowded when the flankers are leading than when they follow the target. [9]
There is much information that gets through to peoples conscious even under the circumstances of crowding these include the appearance of a feature, people can easily perceive the appearance of a feature but cannot identify or discriminate the changes in this feature. [2] [3] [4] [10] Secondly, After effects from adaptation survive crowding, adapting to a target in a crowded stimulus can help people form an orientation[8] and track the motion of the target. [11] [12] Next is combined orientation, even though people are not able to tell the individual orientation of a target stimulus in a crowded setting, they can reliably report an average combined orientation of the stimulus, which means that the orientation signals from the target stimulus are combined than lost. [13]
Some target identity information survives crowding, people can identify more correct targets from a crowded setting when they are asked to report information on both the target and the flanker. Sometimes certain information such as the target information is lost, but the people are able to make better "target “ responses in this condition. Some information on the target is preserved, but most of the times the location information is lost. [14] [15] [16]
Crowding is broken when the target and stimulus are different than when they are similar. The difference of target group from the flanker group of stimulus in shape, size, [17] [18] orientation, [3] [19] [20] polarity, [21] [17] spatial frequency, [22] depth, [17] color, [17] [13] [23] motion and order, [24] breaks crowding. Crowding happens among faces(holistic crowding) having an inverted face flanker when searching for an upright face breaks crowding. Which makes upright faces more effective flankers. [25] [26] Providing cue about the target location tends to reduce crowding. [27] [15] [28] [29] Crowding is broken when the flankers are collectively masked, but this happens only when the flankers are masked with noise or using metacontrast masks but not with substitution masks. [30] When people are adapted to look for a target stimulus in a certain spatial position, it renders the flankers perceptually invisible (adaptation induced blindness), thus releasing the effect of crowding. [31]
Neurophysiological studies have not made much progress in narrowing down the locus of the brain at which crowding occurs. Previous researches have demonstrated that crowding is “dichoptical” meaning that the target is perceived by one eye and the distractor by the other. [32] [33] Which should mean that the effect of crowding occurs in the cortex. Different researchers have claimed different sites to be the processing center for crowding e.g. (V1, [34] V2, V3, [35] [36] V4 [8] [37] [38] some researchers claim crowding happens at a later stage of visual processing [11] [26] ). So, the locus of the brain at which crowding occurs is still not clearly defined.
From the many models that try and explain the process of crowding, there is still a lack of an actual model that helps predict the entirety of how crowding works. All models have three major division that remains as its essence: masking, pooling and substitution. The pooling can be of the low-level features or the pooling of attention. One of the model that nicely predicts is the model by Wilkinson where he boils down the process of crowding to the interaction between complex cells and simple cells, where the simple cells suppress weak complex cell responses and the complex cells respond actively resulted from spatial pooling and then they suppress simple cell activity in their receptive cell area[41].
Another model that best predicts crowding process proposes a quantitative model for a spatial integration of orientation signals, As per the principles of population coding, this model satisfactorily predicts properties like critical spacing, compulsory averaging and the inner and outer asymmetry. [18]
Different studies implicitly assume that crowding is a unitary effect due to a single stage of processing. [39] [40] The other notion states that crowding happens independently at several stages of visual processing. This notion supports the view that crowding is influenced by the similarity and configuration of flankers and the target. By this notion, the effect is selectively observed between whole objects, [25] [26] object parts, [41] and features. [39] This view is also consistent with Bouma's law. This view has gained much support. Crowding in a natural setting may also occur in layers depending on the location, content and attention dependence.
Herman Bouma, a Dutch vision researcher and gerontologist stated, “For complete visual isolation of a letter presented at an eccentricity of φ deg, … no other letters should be present within (roughly) 0.5 φ distance." [5] At a later stage, he reduced the proportionality constant from 0.5 to 0.4. [19] This gave rise to the notion of “critical spacing” which is proportional to the eccentricity. Critical spacing is the sufficient distance needed for the identification of an object among its flankers in the retinotopically organized cortex. Bouma explains how the effect of crowding is dependent on the eccentricity of a subject and the distance between the flankers and the object. Many studies support the claim that critical space needed for crowding depends on the eccentricity of the subject. [2] [3] [4] [42] The proportionality constant named b, after Bouma, is dependent on how similar the flankers are to the target, the number of possible targets, and the arbitrary threshold criterion. [40] The value of Boumas proportionality constant ‘b’ is different among studies, but most of the times it is reported to be ≈0.4 - 0.5. This rule is sometimes raised to the status of a 'law' but this remains controversial.
The visual cortex of the brain is the area of the cerebral cortex that processes visual information. It is located in the occipital lobe. Sensory input originating from the eyes travels through the lateral geniculate nucleus in the thalamus and then reaches the visual cortex. The area of the visual cortex that receives the sensory input from the lateral geniculate nucleus is the primary visual cortex, also known as visual area 1 (V1), Brodmann area 17, or the striate cortex. The extrastriate areas consist of visual areas 2, 3, 4, and 5.
The visual system is the physiological basis of visual perception. The system detects, transduces and interprets information concerning light within the visible range to construct an image and build a mental model of the surrounding environment. The visual system is associated with the eye and functionally divided into the optical system and the neural system.
Amblyopia, also called lazy eye, is a disorder of sight in which the brain fails to fully process input from one eye and over time favors the other eye. It results in decreased vision in an eye that typically appears normal in other aspects. Amblyopia is the most common cause of decreased vision in a single eye among children and younger adults.
Stereopsis is the component of depth perception retrieved by means of binocular disparity through binocular vision. It is not the only contributor to depth perception, but it is a major one. Binocular vision occurs because each eye receives a different image due to their slightly different positions in one's head. These positional differences are referred to as "horizontal disparities" or, more generally, "binocular disparities". Disparities are processed in the visual cortex of the brain to yield depth perception. While binocular disparities are naturally present when viewing a real three-dimensional scene with two eyes, they can also be simulated by artificially presenting two different images separately to each eye using a method called stereoscopy. The perception of depth in such cases is also referred to as "stereoscopic depth".
Fixation disparity is a tendency of the eyes to drift in the direction of the heterophoria. While the heterophoria refers to a fusion-free vergence state, the fixation disparity refers to a small misalignment of the visual axes when both eyes are open in an observer with normal fusion and binocular vision. The misalignment may be vertical, horizontal or both. The misalignment is much smaller than that of strabismus. While strabismus prevents binocular vision, fixation disparity keeps binocular vision, however it may reduce a patient's level of stereopsis. A patient may or may not have fixation disparity and a patient may have a different fixation disparity at distance than near. Observers with a fixation disparity are more likely to report eye strain in demanding visual tasks; therefore, tests of fixation disparity belong to the diagnostic tools used by eye care professionals: remediation includes vision therapy, prism eye glasses, or visual ergonomics at the workplace.
Visual search is a type of perceptual task requiring attention that typically involves an active scan of the visual environment for a particular object or feature among other objects or features. Visual search can take place with or without eye movements. The ability to consciously locate an object or target amongst a complex array of stimuli has been extensively studied over the past 40 years. Practical examples of using visual search can be seen in everyday life, such as when one is picking out a product on a supermarket shelf, when animals are searching for food among piles of leaves, when trying to find a friend in a large crowd of people, or simply when playing visual search games such as Where's Wally?
The Stiles–Crawford effect is a property of the human eye that refers to the directional sensitivity of the cone photoreceptors.
The flash lag illusion or flash-lag effect is a visual illusion wherein a flash and a moving object that appear in the same location are perceived to be displaced from one another. Several explanations for this simple illusion have been explored in the neuroscience literature.
The Chubb illusion is an optical illusion or error in visual perception in which the apparent contrast of an object varies substantially to most viewers depending on its relative contrast to the field on which it is displayed. These visual illusions are of particular interest to researchers because they may provide valuable insights in regard to the workings of human visual systems.
Within computer technology, the gaze-contingency paradigm is a general term for techniques allowing a computer screen display to change in function depending on where the viewer is looking. Gaze-contingent techniques are part of the eye movement field of study in psychology.
In behavioral biology, the optomotor response is an innate, orienting behavior evoked by whole-field visual motion and is common to fish and insects during locomotion, such as swimming, walking and flying. The optomotor response has algorithmic properties such that the direction of the whole-field coherent motion dictates the direction of the behavioral output. For instance, when zebrafish larvae are presented with a sinusoidal black and white grating pattern, the larvae will turn and swim in the direction of the perceived motion.
Perceptual learning is learning better perception skills such as differentiating two musical tones from one another or categorizations of spatial and temporal patterns relevant to real-world expertise. Examples of this may include reading, seeing relations among chess pieces, and knowing whether or not an X-ray image shows a tumor.
Visual processing abnormalities in schizophrenia are commonly found, and contribute to poor social function.
Crowding is a perceptual phenomenon where the recognition of objects presented away from the fovea is impaired by the presence of other neighbouring objects. It has been suggested that crowding occurs due to mandatory integration of the crowded objects by a texture-processing neural mechanism, but there are several competing theories about the underlying mechanisms. It is considered a kind of grouping since it is "a form of integration over space as target features are spuriously combined with flanker features."
Surround suppression is where the relative firing rate of a neuron may under certain conditions decrease when a particular stimulus is enlarged. It has been observed in electrophysiology studies of the brain and has been noted in many sensory neurons, most notably in the early visual system. Surround suppression is defined as a reduction in the activity of a neuron in response to a stimulus outside its classical receptive field.
Cerebral diplopia or polyopia describes seeing two or more images arranged in ordered rows, columns, or diagonals after fixation on a stimulus. The polyopic images occur monocular bilaterally and binocularly, differentiating it from ocular diplopia or polyopia. The number of duplicated images can range from one to hundreds. Some patients report difficulty in distinguishing the replicated images from the real images, while others report that the false images differ in size, intensity, or color. Cerebral polyopia is sometimes confused with palinopsia, in which multiple images appear while watching an object. However, in cerebral polyopia, the duplicated images are of a stationary object which are perceived even after the object is removed from the visual field. Movement of the original object causes all of the duplicated images to move, or the polyopic images disappear during motion. In palinoptic polyopia, movement causes each polyopic image to leave an image in its wake, creating hundreds of persistent images (entomopia).
Binocular switch suppression (BSS) is a technique to suppress usually salient images from an individual's awareness, a type of experimental manipulation used in visual perception and cognitive neuroscience. In BSS, two images of differing signal strengths are repetitively switched between the left and right eye at a constant rate of 1 Hertz. During this process of switching, the image of lower contrast and signal strength is perceptually suppressed for a period of time.
In psychology and neuroscience, multiple object tracking (MOT) refers to the ability of humans and other animals to simultaneously monitor multiple objects as they move. It is also the term for certain laboratory techniques used to study this ability.
Ensemble coding, also known as ensemble perception or summary representation, is a theory in cognitive neuroscience about the internal representation of groups of objects in the human mind. Ensemble coding proposes that such information is recorded via summary statistics, particularly the average or variance. Experimental evidence tends to support the theory for low-level visual information, such as shapes and sizes, as well as some high-level features such as face gender. Nonetheless, it remains unclear the extent to which ensemble coding applies to high-level or non-visual stimuli, and the theory remains the subject of active research.
Roland William Fleming, FRSB is a British and German interdisciplinary researcher specializing in the visual perception of objects and materials. He is the Kurt Koffka Professor of Experimental Psychology at Justus Liebig University of Giessen. and the Executive Director of the Center for Mind, Brain and Behavior of the Universities of Marburg and Giessen. He is also co-Spokesperson for the Research Cluster “The Adaptive Mind”.