Contrast is the difference in luminance or color that makes an object (or its representation in an image or display) visible against a background of different luminance or color. [1] The human visual system is more sensitive to contrast than to absolute luminance; thus, we can perceive the world similarly despite significant changes in illumination throughout the day or across different locations. [2]
The maximum contrast of an image is termed the contrast ratio or dynamic range. In images where the contrast ratio approaches the maximum possible for the medium, there is a conservation of contrast. In such cases, increasing contrast in certain parts of the image will necessarily result in a decrease in contrast elsewhere. Brightening an image increases contrast in darker areas but decreases it in brighter areas; conversely, darkening the image will have the opposite effect. Bleach bypass reduces contrast in the darkest and brightest parts of an image while enhancing luminance contrast in areas of intermediate brightness.
Campbell and Robson (1968) showed that the human contrast sensitivity function shows a typical band-pass filter shape peaking at around 4 cycles per degree (cpd or cyc/deg), with sensitivity dropping off either side of the peak. [3] This can be observed by changing one's viewing distance from a "sweep grating" (shown below) showing many bars of a sinusoidal grating that go from high to low contrast along the bars, and go from narrow (high spatial frequency) to wide (low spatial frequency) bars across the width of the grating.
The high-frequency cut-off represents the optical limitations of the visual system's ability to resolve detail and is typically about 60 cpd. The high-frequency cut-off is also related to the packing density of the retinal photoreceptor cells: a finer matrix can resolve finer gratings.
The low frequency drop-off is due to lateral inhibition within the retinal ganglion cells. [4] A typical retinal ganglion cell's receptive field comprises a central region in which light either excites or inhibits the cell, and a surround region in which light has the opposite effects.
One experimental phenomenon is the inhibition of blue in the periphery if blue light is displayed against a white background, leading to a yellow surrounding. The yellow is derived from the inhibition of blue on the surroundings by the center. Since white minus blue is red and green, this mixes to become yellow. [5]
For example, in the case of graphical computer displays, contrast depends on the properties of the picture source or file and the properties of the computer display, including its variable settings. For some screens the angle between the screen surface and the observer's line of sight is also important.
There are many possible definitions of contrast. Some include color; others do not. Russian scientist N. P. Travnikova laments, "Such a multiplicity of notions of contrast is extremely inconvenient. It complicates the solution of many applied problems and makes it difficult to compare the results published by different authors." [6] [7]
Various definitions of contrast are used in different situations. Here, luminance contrast is used as an example, but the formulas can also be applied to other physical quantities. In many cases, the definitions of contrast represent a ratio of the type
The rationale behind this is that a small difference is negligible if the average luminance is high, while the same small difference matters if the average luminance is low (see Weber–Fechner law). Below, some common definitions are given.
Weber contrast is defined as [6]
with and representing the luminance of the features and the background, respectively. The measure is also referred to as Weber fraction, since it is the term that is constant in Weber's Law. Weber contrast is commonly used in cases where small features are present on a large uniform background, i.e., where the average luminance is approximately equal to the background luminance.
Michelson contrast [8] (also known as the visibility) is commonly used for patterns where both bright and dark features are equivalent and take up similar fractions of the area (e.g. sine-wave gratings). The Michelson contrast is defined as [6]
with and representing the highest and lowest luminance. The denominator represents twice the average of the maximum and minimum luminances. [9]
This form of contrast is an effective way to quantify contrast for periodic functions and is also known as the modulation of a periodic signal . Modulation quantifies the relative amount by which the amplitude (or difference) of stands out from the average value (or background) .
In general, refers to the contrast of the periodic signal relative to its average value. If , then has no contrast. If two periodic functions and have the same average value, then has more contrast than if . [10]
Root mean square (RMS) contrast does not depend on the spatial frequency content or the spatial distribution of contrast in the image. RMS contrast is defined as the standard deviation of the pixel intensities: [6]
where intensities are the -th -th element of the two-dimensional image of size by . is the average intensity of all pixel values in the image. The image is assumed to have its pixel intensities normalized in the range .
Contrast sensitivity is a measure of the ability to discern different luminances in a static image. It varies with age, increasing to a maximum around 20 years at spatial frequencies of about 2–5 cpd; aging then progressively attenuates contrast sensitivity beyond this peak. Factors such as cataracts and diabetic retinopathy also reduce contrast sensitivity. [11] In the sweep grating figure below, at an ordinary viewing distance, the bars in the middle appear to be the longest due to their optimal spatial frequency. However, at a far viewing distance, the longest visible bars shift to what were originally the wide bars, now matching the spatial frequency of the middle bars at reading distance.
Visual acuity is a parameter that is frequently used to assess overall vision. However, diminished contrast sensitivity may cause decreased visual function in spite of normal visual acuity. [12] For example, some individuals with glaucoma may achieve 20/20 vision on acuity exams, yet struggle with activities of daily living, such as driving at night.
As mentioned above, contrast sensitivity describes the ability of the visual system to distinguish bright and dim components of a static image. Visual acuity can be defined as the angle with which one can resolve two points as being separate since the image is shown with 100% contrast and is projected onto the fovea of the retina. [13] Thus, when an optometrist or ophthalmologist assesses a patient's visual acuity using a Snellen chart or some other acuity chart, the target image is displayed at high contrast, e.g., black letters of decreasing size on a white background. A subsequent contrast sensitivity exam may demonstrate difficulty with decreased contrast (using, e.g., the Pelli–Robson chart, which consists of uniform-sized but increasingly pale grey letters on a white background).
To assess a patient's contrast sensitivity, one of several diagnostic exams may be used. Most charts in an ophthalmologist's or optometrist's office will show images of varying contrast and spatial frequency. Parallel bars of varying width and contrast, known as sine-wave gratings, are sequentially viewed by the patient. The width of the bars and their distance apart represent spatial frequency, measured in cycles per degree.
Studies have demonstrated that contrast sensitivity is maximum for spatial frequencies of 2-5 cpd, falling off for lower spatial frequencies and rapidly falling off for higher spatial frequencies. The upper limit for the human vision system is about 60 cpd. The correct identification of small letters requires the letter size be about 18-30 cpd. [14] Contrast threshold can be defined as the minimum contrast that can be resolved by the patient. Contrast sensitivity is typically expressed as the reciprocal of the threshold contrast for detection of a given pattern (i.e., 1 ÷ contrast threshold). [15]
Using the results of a contrast sensitivity exam, a contrast sensitivity curve can be plotted, with spatial frequency on the horizontal, and contrast threshold on the vertical axis. Also known as contrast sensitivity function (CSF), the plot demonstrates the normal range of contrast sensitivity, and will indicate diminished contrast sensitivity in patients who fall below the normal curve. Some graphs contain "contrast sensitivity acuity equivalents", with lower acuity values falling in the area under the curve. In patients with normal visual acuity and concomitant reduced contrast sensitivity, the area under the curve serves as a graphical representation of the visual deficit. It can be because of this impairment in contrast sensitivity that patients have difficulty driving at night, climbing stairs and other activities of daily living in which contrast is reduced. [16]
Recent studies have demonstrated that intermediate-frequency sinusoidal patterns are optimally-detected by the retina due to the center-surround arrangement of neuronal receptive fields. [17] In an intermediate spatial frequency, the peak (brighter bars) of the pattern is detected by the center of the receptive field, while the troughs (darker bars) are detected by the inhibitory periphery of the receptive field. For this reason, low- and high-spatial frequencies elicit excitatory and inhibitory impulses by overlapping frequency peaks and troughs in the center and periphery of the neuronal receptive field. [18] Other environmental, [19] physiological, and anatomical factors influence the neuronal transmission of sinusoidal patterns, including adaptation. [20]
Decreased contrast sensitivity arises from multiple etiologies, including retinal disorders such as age-related macular degeneration (ARMD), amblyopia, lens abnormalities, such as cataract, and by higher-order neural dysfunction, including stroke and Alzheimer's disease. [21] In light of the multitude of etiologies leading to decreased contrast sensitivity, contrast sensitivity tests are useful in the characterization and monitoring of dysfunction, and less helpful in detection of disease.
A large-scale study of luminance contrast thresholds was done in the 1940s by Blackwell, [22] using a forced-choice procedure. Discs of various sizes and luminances were presented in different positions against backgrounds at a wide range of adaptation luminances, and subjects had to indicate where they thought the disc was being shown. After statistical pooling of results (90,000 observations by seven observers), the threshold for a given target size and luminance was defined as the Weber contrast level at which there was a 50% detection level. The experiment employed a discrete set of contrast levels, resulting in discrete values of threshold contrast. Smooth curves were drawn through these, and values tabulated. The resulting data have been used extensively in areas such as lighting engineering and road safety. [24]
A separate study by Knoll et al [25] investigated thresholds for point sources by requiring subjects to vary the brightness of the source to find the level at which it was just visible. A mathematical formula for the resulting threshold curve was proposed by Hecht, [26] with separate branches for scotopic and photopic vision. Hecht's formula was used by Weaver [27] to model the naked-eye visibility of stars. The same formula was used later by Schaefer [28] to model stellar visibility through a telescope.
Crumey [23] showed that Hecht's formula fitted the data very poorly at low light levels, so was not really suitable for modelling stellar visibility. Crumey instead constructed a more accurate and general model applicable to both the Blackwell and Knoll et al data. Crumey's model covers all light levels, from zero background luminance to daylight levels, and instead of parameter-tuning is based on an underlying linearity related to Ricco's law. Crumey used it to model astronomical visibility for targets of arbitrary size, and to study the effects of light pollution.
Test images types [29]
The Weber–Fechner laws are two related scientific laws in the field of psychophysics, known as Weber's law and Fechner's law. Both relate to human perception, more specifically the relation between the actual change in a physical stimulus and the perceived change. This includes stimuli to all senses: vision, hearing, taste, touch, and smell.
A grating is any regularly spaced collection of essentially identical, parallel, elongated elements. Gratings usually consist of a single set of elongated elements, but can consist of two sets, in which case the second set is usually perpendicular to the first. When the two sets are perpendicular, this is also known as a grid or a mesh.
Visual acuity (VA) commonly refers to the clarity of vision, but technically rates an animal's ability to recognize small details with precision. Visual acuity depends on optical and neural factors. Optical factors of the eye influence the sharpness of an image on its retina. Neural factors include the health and functioning of the retina, of the neural pathways to the brain, and of the interpretative faculty of the brain.
In astronomy, limiting magnitude is the faintest apparent magnitude of a celestial body that is detectable or detected by a given instrument.
Aerial perspective, or atmospheric perspective, refers to the effect the atmosphere has on the appearance of an object as viewed from a distance. As the distance between an object and a viewer increases, the contrast between the object and its background decreases, and the contrast of any markings or details within the object also decreases. The colours of the object also become less saturated and shift toward the background colour, which is usually bluish, but may be some other colour under certain conditions.
In astronomy, surface brightness (SB) quantifies the apparent brightness or flux density per unit angular area of a spatially extended object such as a galaxy or nebula, or of the night sky background. An object's surface brightness depends on its surface luminosity density, i.e., its luminosity emitted per unit surface area. In visible and infrared astronomy, surface brightness is often quoted on a magnitude scale, in magnitudes per square arcsecond (MPSAS) in a particular filter band or photometric system.
Riccò's law, discovered by astronomer Annibale Riccò, is one of several laws that describe a human's ability to visually detect targets on a uniform background. It says that for visual targets below a certain size, threshold visibility depends on the area of the target, and hence on the total light received. The "certain size", is small in daylight conditions, larger in low light levels. The law is of special significance in visual astronomy, since it concerns the ability to distinguish between faint point sources and small, faint extended objects ("DSOs").
In the study of visual perception, scotopic vision is the vision of the eye under low-light conditions. The term comes from the Greek skotos, meaning 'darkness', and -opia, meaning 'a condition of sight'. In the human eye, cone cells are nonfunctional in low visible light. Scotopic vision is produced exclusively through rod cells, which are most sensitive to wavelengths of around 498 nm and are insensitive to wavelengths longer than about 640 nm. Under scotopic conditions, light incident on the retina is not encoded in terms of the spectral power distribution. Higher visual perception occurs under scotopic vision as it does under photopic vision.
Optical resolution describes the ability of an imaging system to resolve detail, in the object that is being imaged. An imaging system may have many individual components, including one or more lenses, and/or recording and display components. Each of these contributes to the optical resolution of the system; the environment in which the imaging is done often is a further important factor.
In mathematics, physics, and engineering, spatial frequency is a characteristic of any structure that is periodic across position in space. The spatial frequency is a measure of how often sinusoidal components of the structure repeat per unit of distance.
In imaging science, difference of Gaussians (DoG) is a feature enhancement algorithm that involves the subtraction of one Gaussian blurred version of an original image from another, less blurred version of the original. In the simple case of grayscale images, the blurred images are obtained by convolving the original grayscale images with Gaussian kernels having differing width. Blurring an image using a Gaussian kernel suppresses only high-frequency spatial information. Subtracting one image from the other preserves spatial information that lies between the range of frequencies that are preserved in the two blurred images. Thus, the DoG is a spatial band-pass filter that attenuates frequencies in the original grayscale image that are far from the band center.
Eigengrau, also called Eigenlicht, dark light, or brain gray, is the uniform dark gray background color that many people report seeing in the absence of light. The term Eigenlicht dates back to the nineteenth century, and has rarely been used in recent scientific publications. Common scientific terms for the phenomenon include "visual noise" or "background adaptation". These terms arise due to the perception of an ever-changing field of tiny black and white dots seen in the phenomenon.
The Chubb illusion is an optical illusion or error in visual perception in which the apparent contrast of an object varies substantially to most viewers depending on its relative contrast to the field on which it is displayed. These visual illusions are of particular interest to researchers because they may provide valuable insights in regard to the workings of human visual systems.
Vision is the most important sense for birds, since good eyesight is essential for safe flight. Birds have a number of adaptations which give visual acuity superior to that of other vertebrate groups; a pigeon has been described as "two eyes with wings". Birds are theropods, and the avian eye resembles that of other sauropsids, with ciliary muscles that can change the shape of the lens rapidly and to a greater extent than in the mammals. Birds have the largest eyes relative to their size in the animal kingdom, and movement is consequently limited within the eye's bony socket. In addition to the two eyelids usually found in vertebrates, bird's eyes are protected by a third transparent movable membrane. The eye's internal anatomy is similar to that of other vertebrates, but has a structure, the pecten oculi, unique to birds.
A parasol cell, sometimes called an M cell or M ganglion cell, is one type of retinal ganglion cell (RGC) located in the ganglion cell layer of the retina. These cells project to magnocellular cells in the lateral geniculate nucleus (LGN) as part of the magnocellular pathway in the visual system. They have large cell bodies as well as extensive branching dendrite networks and as such have large receptive fields. Relative to other RGCs, they have fast conduction velocities. While they do show clear center-surround antagonism, they receive no information about color. Parasol ganglion cells contribute information about the motion and depth of objects to the visual system.
Tom Norman Cornsweet was an American experimental psychologist known for his pioneering work in visual perception, especially the effect that bears his name, and in the development of ophthalmic instrumentation.
A phantom contour is a type of illusory contour. Most illusory contours are seen in still images, such as the Kanizsa triangle and the Ehrenstein illusion. A phantom contour, however, is perceived in the presence of moving or flickering images with contrast reversal. The rapid, continuous alternation between opposing, but correlated, adjacent images creates the perception of a contour that is not physically present in the still images. Quaid et al. have also authored a PhD thesis on the phantom contour illusion and its spatiotemporal limits which maps out limits and proposes mechanisms for its perception centering around magnocellularly driven visual area MT.
Vernier acuity is a type of visual acuity – more precisely of hyperacuity – that measures the ability to discern a disalignment among two line segments or gratings. A subject's vernier acuity is the smallest visible offset between the stimuli that can be detected. Because the disalignments are often much smaller than the diameter and spacing of retinal receptors, vernier acuity requires neural processing and "pooling" to detect it. Because vernier acuity exceeds acuity by far, the phenomenon has been termed hyperacuity. Vernier acuity develops rapidly during infancy and continues to slowly develop throughout childhood. At approximately three to twelve months old, it surpasses grating acuity in foveal vision in humans. However, vernier acuity decreases more quickly than grating acuity in peripheral vision. Vernier acuity was first explained by Ewald Hering in 1899, based on earlier data by Alfred Volkmann in 1863 and results by Ernst Anton Wülfing in 1892.
Russell L. De Valois was an American scientist recognized for his pioneering research on spatial and color vision.
The De Vries–Rose law or Rose–de Vries law is a principle of vision science named after Hessel de Vries and Albert Rose. De Vries discovered it in 1943 from considerations of quantum efficiency, and Rose developed the idea substantially a few years later. The law says that for visual targets seen against a background luminance , subject to certain assumptions, the threshold contrast should be inversely proportional to . In reality it holds only approximately, at luminance levels between the regimes of "dark light" and Weber's Law.