Crowding

Last updated
A demonstration of the crowding effect. Fixate on the "x" and attempt to identify the central (or single) letter appearing to the right. The presence of flankers should make the task more difficult. CrowdingDemo.gif
A demonstration of the crowding effect. Fixate on the "x" and attempt to identify the central (or single) letter appearing to the right. The presence of flankers should make the task more difficult.

Crowding (or visual crowding) is a perceptual phenomenon where the recognition of objects presented away from the fovea is impaired by the presence of other neighbouring objects (sometimes called "flankers"). [1] It has been suggested that crowding occurs due to mandatory integration of the crowded objects by a texture-processing neural mechanism, [2] but there are several competing theories about the underlying mechanisms. [3] [4] [5] [6] [7] It is considered a kind of grouping [8] since it is "a form of integration over space as target features are spuriously combined with flanker features." [9]

Contents

Crowding has long been thought to be predominantly a characteristic of peripheral vision. Yet, while it is indeed particularly prominent there, it is present in the whole of the visual field, with only its spatial extent varying (governed by Bouma's law; see below). [10] In particular, crowding is of utmost importance in foveal vision, [11] overriding the importance of visual acuity in pattern recognition and reading where crowding represents the bottleneck in processing. [12] [4] [10]

Crowding is prominently present in amblyopia and has been first mentioned in that context [13] [10] and studied quantitatively there. [14] Crowding deficits have further been found in neuropsychiatric disorders such as schizophrenia and autism and may have clinical implications in these disorders. [15] It is also suggested that head injuries can cause a crowding effect. [16] Normally sighted children up to the age of about eight years further have more pronounced crowding than adults, [17] and this may be the reason for larger print in children's books.

Bouma's law

The extent of crowding is mostly independent of a letter's or form's size, unlike what is the case in acuity. Instead, it depends very systematically on the distance to its neighbors. If the latter is above a critical value, crowding vanishes. In 1970, Herman Bouma has described a rule-of-thumb for that critical distance, stating that it amounts to about half the eccentricity value under which the crowded letter is seen [18] (eccentricity measured as visual angle away from the fovea's center). If, e.g., a letter is shown at 2.5 deg away from the fovea center – which is approximately at the border of the fovea – the critical distance amounts to 1.25 deg visual angle. [19] When the flankers are closer, crowding will thus occur.

Newer research suggests that the factor in Bouma's rule (originally ½) can vary quite a bit, and might often be a little smaller (e.g., 0.4). [20] Furthermore, a small constant should be added in the equation, and there are further caveats. [10] Overall, however, Bouma's rule has since proven valid over a large variety of perceptual tasks. For its robustness, it is now often considered a perceptual law, similar to other perceptual laws (like Weber's law, Riccò's law, Bloch's law). [21] [10]

History

Crowding, as we know today, is – except in a few special circumstances – the essential bottleneck [4] for human pattern recognition and can be demonstrated in the easiest of ways. It is thus striking that it has been overlooked over the centuries; the cause for degraded pattern recognition has mostly been, and still is, incorrectly ascribed to degraded visual acuity. [10]

The percepts in peripheral vision have already been described by Ibn al-Haytham in the 11th century as "confused and obscure". [22] Later, James Jurin in 1738 described the phenomenon of "indistinct vision" which, in two examples, could be seen as the result of crowding. [22] In the 19th century, the ophthalmologists Hermann Aubert and Richard Förster in Breslau/Poland described the percept of two neighboring points in indirect vision as “quite strangely undefined ["ganz eigenthümlich unbestimmt"] as something black, the form of which cannot be further specified”. [23] Note that, in none of these examples, the description is as "blurred" or "distorted", as is often (and misleadingly) seen in today's characterizations.

Crowding itself, however, i.e. the difference between singular letters and groups thereof, went unnoticed up to the 20th century. In 1924, then, the Gestalt psychologist Wilhelm Korte was the first to describe, in detail, percepts and phenomena of form perception in indirect vision (peripheral vision). [5] Probably around that time, crowding has become an issue in optometry and ophthalmology when testing amblyopic subjects with eye charts, as is apparent from a remark of the Danish ophthalmologist Holger Ehlers in 1936. [13] James A. Stuart und Hermann M. Burian in Iowa were, in 1962, the first to study crowding systematically, for amblyopic subjects. [14] In foveal vision, the related phenomenon of contour interaction was described (Merton Flom, Frank Weymouth & Daniel Kahneman, 1963).

Herman Bouma, in 1970, famously found what was later called Bouma's law, [18] yet that paper was fully neglected for many years. In the coming three decades, the phenomenon was studied in experimental psychology, under different terms. Only then, the subject of Crowding found increasingly wide attention in visual perception research (Levi et al. 1985; Strasburger et al., 1991; Toet & Levi, 1992, Pelli et al., 2004). [24] Today, it is a major topic in vision and perception and is increasingly recognized for being the major limitation of foveal and peripheral form perception. [10] [25] [26]

Related Research Articles

<span class="mw-page-title-main">Phi phenomenon</span> Optical illusion of apparent motion

The term phi phenomenon is used in a narrow sense for an apparent motion that is observed if two nearby optical stimuli are presented in alternation with a relatively high frequency. In contrast to beta movement, seen at lower frequencies, the stimuli themselves do not appear to move. Instead, a diffuse, amorphous shadowlike something seems to jump in front of the stimuli and occlude them temporarily. This shadow seems to have nearly the color of the background. Max Wertheimer first described this form of apparent movement in his habilitation thesis, published 1912, marking the birth of Gestalt psychology.

<span class="mw-page-title-main">Macula</span> Oval-shaped pigmented area near the center of the retina

The macula (/ˈmakjʊlə/) or macula lutea is an oval-shaped pigmented area in the center of the retina of the human eye and in other animals. The macula in humans has a diameter of around 5.5 mm (0.22 in) and is subdivided into the umbo, foveola, foveal avascular zone, fovea, parafovea, and perifovea areas.

<span class="mw-page-title-main">Peripheral vision</span> Area of ones field of vision outside of the point of fixation

Peripheral vision, or indirect vision, is vision as it occurs outside the point of fixation, i.e. away from the center of gaze or, when viewed at large angles, in the "corner of one's eye". The vast majority of the area in the visual field is included in the notion of peripheral vision. "Far peripheral" vision refers to the area at the edges of the visual field, "mid-peripheral" vision refers to medium eccentricities, and "near-peripheral", sometimes referred to as "para-central" vision, exists adjacent to the center of gaze.

<span class="mw-page-title-main">Field of view</span> Extent of the observable world seen at any given moment

The field of view (FOV) is the angular extent of the observable world that is seen at any given moment. In the case of optical instruments or sensors, it is a solid angle through which a detector is sensitive to electromagnetic radiation. It is further relevant in photography.

<span class="mw-page-title-main">Amblyopia</span> Failure of the brain to process input from one eye

Amblyopia, also called lazy eye, is a disorder of sight in which the brain fails to fully process input from one eye and over time favors the other eye. It results in decreased vision in an eye that typically appears normal in other aspects. Amblyopia is the most common cause of decreased vision in a single eye among children and younger adults.

<span class="mw-page-title-main">Visual acuity</span> Clarity of vision

Visual acuity (VA) commonly refers to the clarity of vision, but technically rates an animal's ability to recognize small details with precision. Visual acuity depends on optical and neural factors. Optical factors of the eye influence the sharpness of an image on its retina. Neural factors include the health and functioning of the retina, of the neural pathways to the brain, and of the interpretative faculty of the brain.

<span class="mw-page-title-main">Fovea centralis</span> Small pit in the retina of the eye responsible for all central vision

The fovea centralis is a small, central pit composed of closely packed cones in the eye. It is located in the center of the macula lutea of the retina.

The visual field is "that portion of space in which objects are visible at the same moment during steady fixation of the gaze in one direction"; in ophthalmology and neurology the emphasis is on the structure inside the visual field and it is then considered “the field of functional capacity obtained and recorded by means of perimetry”.

<span class="mw-page-title-main">Ternus illusion</span>

The Ternus illusion, also commonly referred to as the Ternus Effect, is an illusion related to human visual perception involving apparent motion. In a simplified explanation of one form of the illusion, two discs, are shown side by side as the first frame in a sequence of three frames. Next a blank frame is presented for a very short, variable duration. In the final frame, two similar discs are then shown in a shifted position. Depending on various factors including the time intervals between frames as well as spacing and layout, observers perceive either element motion, in which L appears to move to R while C remains stationary or they report experiencing group motion, in which L and C appear to move together to C and R. Both element motion and group motion can be observed in animated examples to the right in Figures 1 and 2.

<span class="mw-page-title-main">Vision span</span> Arc of accurate visual perception

Vision span or perceptual span is a controversial concept referring to the angular span, within which the human eye has sharp enough vision to perform an action accurately. The visual field of the human eye spans approximately 120 degrees of arc. However, most of that arc is peripheral vision. The human eye has much greater resolution in the macula, where there is a higher density of cone cells. The macula has a diameter of about 16 degrees of the retina. The field of view that is observed with sufficient resolution to read text typically spans about 6 degrees of arc, which is wide enough to allow a clear view of about five words in a row when printed text at ordinary size is held about 50 centimeters from the eyes. Regarding face processing, the field of view with a sufficient amount of information in order to recognise faces accurately spans about 7° which represents about 45% of a face. The brain creates the illusion of having a greater visual span by automatically and unconsciously moving the center of vision into any area of interest in the field of view.

Cortical magnification describes how many neurons in an area of the visual cortex are 'responsible' for processing a stimulus of a given size, as a function of visual field location. In the center of the visual field, corresponding to the center of the fovea of the retina, a very large number of neurons process information from a small region of the visual field. If the same stimulus is seen in the periphery of the visual field, it would be processed by a much smaller number of neurons. The reduction of the number of neurons per visual field area from foveal to peripheral representations is achieved in several steps along the visual pathway, starting already in the retina.

Visual agnosia is an impairment in recognition of visually presented objects. It is not due to a deficit in vision, language, memory, or intellect. While cortical blindness results from lesions to primary visual cortex, visual agnosia is often due to damage to more anterior cortex such as the posterior occipital and/or temporal lobe(s) in the brain.[2] There are two types of visual agnosia: apperceptive agnosia and associative agnosia.

<span class="mw-page-title-main">Retinal implant</span>

A retinal implant is a visual prosthesis for restoration of sight to patients blinded by retinal degeneration. The system is meant to partially restore useful vision to those who have lost their photoreceptors due to retinal diseases such as retinitis pigmentosa (RP) or age-related macular degeneration (AMD). Retinal implants are being developed by a number of private companies and research institutions, and three types are in clinical trials: epiretinal, subretinal, and suprachoroidal. The implants introduce visual information into the retina by electrically stimulating the surviving retinal neurons. So far, elicited percepts had rather low resolution, and may be suitable for light perception and recognition of simple objects.

Within computer technology, the gaze-contingency paradigm is a general term for techniques allowing a computer screen display to change in function depending on where the viewer is looking. Gaze-contingent techniques are part of the eye movement field of study in psychology.

Visual perception is the ability to interpret the surrounding environment through photopic vision, color vision, scotopic vision, and mesopic vision, using light in the visible spectrum reflected by objects in the environment. This is different from visual acuity, which refers to how clearly a person sees. A person can have problems with visual perceptual processing even if they have 20/20 vision.

<span class="mw-page-title-main">Stereopsis recovery</span>

Stereopsis recovery, also recovery from stereoblindness, is the phenomenon of a stereoblind person gaining partial or full ability of stereo vision (stereopsis).

<span class="mw-page-title-main">Vernier acuity</span>

Vernier acuity is a type of visual acuity – more precisely of hyperacuity – that measures the ability to discern a disalignment among two line segments or gratings. A subject's vernier acuity is the smallest visible offset between the stimuli that can be detected. Because the disalignments are often much smaller than the diameter and spacing of retinal receptors, vernier acuity requires neural processing and "pooling" to detect it. Because vernier acuity exceeds acuity by far, the phenomenon has been termed hyperacuity. Vernier acuity develops rapidly during infancy and continues to slowly develop throughout childhood. At approximately three to twelve months old, it surpasses grating acuity in foveal vision in humans. However, vernier acuity decreases more quickly than grating acuity in peripheral vision. Vernier acuity was first explained by Ewald Hering in 1899, based on earlier data by Alfred Volkmann in 1863 and results by Ernst Anton Wülfing in 1892.

Herman Bouma is a Dutch vision researcher and gerontechnologist. He is considered to be one of the founders of the field of gerontechnology. He spent the majority of his career at the Institute of Perception Research, serving as its director from the mid-1970s until 1994. He subsequently led the Institute for Gerontechnology until 1999. A perceptual law, based on a publication of his in 1970 is named Bouma's Law in his honour.

Visual crowding is the inability to view a target stimulus distinctly when presented in a clutter. Crowding impairs the ability to discriminate object features and contours among flankers, which in turn impairs people's ability to respond appropriately to the target stimulus.

Ensemble coding, also known as ensemble perception or summary representation, is a theory in cognitive neuroscience about the internal representation of groups of objects in the human mind. Ensemble coding proposes that such information is recorded via summary statistics, particularly the average or variance. Experimental evidence tends to support the theory for low-level visual information, such as shapes and sizes, as well as some high-level features such as face gender. Nonetheless, it remains unclear the extent to which ensemble coding applies to high-level or non-visual stimuli, and the theory remains the subject of active research.

References

  1. Levi D (February 2008). "Crowding - an essential bottleneck for object recognition: a mini-review". Vision Research. 48 (5): 635–654. doi:10.1016/j.visres.2007.12.009. PMC   2268888 . PMID   18226828.
  2. Parkes L; Lund J; Angelucci A; Solomon JA; Morgan M (2001). "Compulsory averaging of crowded orientation signals in human vision". Nature Neuroscience. 4 (7): 739–744. doi:10.1038/89532. PMID   11426231. S2CID   10975462.
  3. Tyler, C.W.; Likova, L.T. (2007). "Crowding: A neuroanalytic approach". Journal of Vision. 7 (2:16): 1–9. doi: 10.1167/7.2.16 . PMID   18217831.
  4. 1 2 3 Levi, Dennis M. (2008). "Crowding – an essential bottleneck for object recognition: a minireview". Vision Research. 48 (5): 635–654. doi:10.1016/j.visres.2007.12.009. PMC   2268888 . PMID   18226828.
  5. 1 2 Strasburger, H. (2014). "Dancing letters and ticks that buzz around aimlessly – On the origin of crowding". Perception. 43 (9): 963–976. doi: 10.1068/p7726 . PMID   25420335. S2CID   26136221.
  6. Rosenholtz, R. (2016). "Capabilities and limitations of peripheral vision". Annual Review of Vision Science . 2: 437–457. doi: 10.1146/annurev-vision-082114-035733 . PMID   28532349. S2CID   207728323.
  7. Strasburger, H. (2020). "On the cortical mapping function (Preprint)". bioRxiv. doi:10.1101/621458. S2CID   149700090.
  8. Herzog, M. H.; Sayim, B.; Chicherov, V.; Manassi, M. (2015). "Crowding, grouping, and object recognition: A matter of appearance". Journal of Vision. 15 (6:5): 1–18. doi:10.1167/15.6.5. PMC   4429926 . PMID   26024452.
  9. Wixted, John (2018). Stevens' Handbook of Experimental Psychology and Cognitive Neuroscience, Sensation, Perception, and Attention, Fourth Edition Volume 2. Hoboken, NJ: John Wiley & Sons. p. 833. ISBN   9781119170044.
  10. 1 2 3 4 5 6 7 Strasburger, Hans (2020). "Seven myths on crowding and peripheral vision". i-Perception. 11 (2): 1–45. doi:10.1177/2041669520913052. PMC   7238452 . PMID   32489576.
  11. Foveal vision refers to the situation where the observer fixates a visual object and, depending on the latter, uses anything between the tiny region of highest acuity (in acuity measurements) or the whole of the fovea and more (in reading) for processing the visual input.
  12. Pelli, D.G.; Tillman, K.A.; Freeman, J.; Su, M.; Berger, T.D.; Majaj, N.J. (2007). "Crowding and eccentricity determine reading rate". Journal of Vision. 7 (2:20): 1–36. doi: 10.1167/7.2.20 . PMID   18217835.
  13. 1 2 Ehlers, Holger (1936). "The movements of the eyes during reading". Acta Ophthalmologica. 14: 56–63. doi:10.1111/j.1755-3768.1936.tb07306.x. S2CID   73409510.
  14. 1 2 Stuart, J.A.; Burian, H.M. (1962). "A study of separation difficulty: its relationship to visual acuity in normal and amblyopic eyes". American Journal of Ophthalmology. 53: 471–477. doi:10.1016/0002-9394(62)94878-X. PMID   13917936.
  15. Kraehenmann, Rainer; Vollenweider FX; Seifritz E; Kometer M (2012). "Crowding deficits in the visual periphery of schizophrenia patients". PLOS ONE. 7 (9): e45884. Bibcode:2012PLoSO...745884K. doi: 10.1371/journal.pone.0045884 . PMC   3458825 . PMID   23049884.
  16. Millichap, Gordon (2009). Attention Deficit Hyperactivity Disorder Handbook: A Physician's Guide to ADHD, Second Edition. New York: Springer Science+Business Media. p. 17. ISBN   9781441913968.
  17. Atkinson, J.; Pimm-Smith, E.; Evans, C.; Harding, G.; Braddick, O. (1986). "Visual Crowding in Young Children". Detection and Measurement of Visual Impairment in Pre-Verbal Children. Documenta Ophthalmologica Proceedings Series. Vol. 45. pp. 201–213. doi:10.1007/978-94-009-4263-9_27. ISBN   978-94-010-8393-5.
  18. 1 2 Bouma, Herman (1970). "Interaction effects in parafoveal letter recognition". Nature. 226 (5241): 177–178. Bibcode:1970Natur.226..177B. doi:10.1038/226177a0. PMID   5437004. S2CID   29459715.
  19. For illustration: The thumb, held at arm's length, subtends about 2.5 deg visual angle. Two adjacent thumbs at arm's length thus correspond roughly to the size of the fovea. In a group of people, that might be the size of another person's head
  20. Pelli, Denis G.; Palomares, M.; Majaj, N.J. (2004). "Crowding is unlike ordinary masking: Distinguishing feature integration from detection". Journal of Vision. 4 (12): 1129–111136–116935. doi: 10.1167/4.12.12 . PMID   15669917.
  21. Pelli, D.G.; Tillman, Katharine A. (2008). "The uncrowded window of object recognition". Nature Neuroscience. 11 (10): 1129–1135. doi:10.1038/nn.2187. PMC   2772078 . PMID   18828191.
  22. 1 2 Strasburger, H.; Wade, N.J. (2015). "James Jurin (1684–1750): A pioneer of crowding research?". Journal of Vision. 15 (1:9): 1–7. doi: 10.1167/15.1.9 . PMID   25589293.
  23. Aubert, H.R.; Förster, C. F. R. (1857). "Beiträge zur Kenntniss des indirecten Sehens. (I). Untersuchungen über den Raumsinn der Retina". Archiv für Ophthalmologie. 3: 1–37.
  24. Fig. 19 in: Strasburger, Hans (2020). "Seven myths on crowding and peripheral vision". i-Perception. 11 (2): 1–45. doi:10.1177/2041669520913052. PMC   7238452 . PMID   32489576.
  25. Strasburger, H.; Rentschler, I.; Jüttner, M. (2011). "Peripheral vision and pattern recognition: a review". Journal of Vision. 11 (5): 1–82. doi: 10.1167/11.5.13 . PMID   22207654.
  26. Whitney, D.; Levi, D.M. (2011). "Visual crowding: A fundamental limit on conscious perception and object recognition". Trends in Cognitive Sciences. 15 (4): 160–168. doi:10.1016/j.tics.2011.02.005. PMC   3070834 . PMID   21420894.