![]() | This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
Perceptual coding is a method of lossy data compression that exploits the limitations of human sensory system in order to reduce data size. It is widely applied in audio, image and video compression standards, where certain details are removed or simplified because they are unlikely to be noticed under regular listening or viewing conditions.
Perceptual coding is based on models of human hearing and vision, such as those studied in psychoacoustics and psychovisual research. By discarding or reducing components of a signal that fall below perceptual thresholds, it achieves significant reductions in bit rate while maintaining a subjectively acceptable quality.
Perceptual coding is central to many everyday technologies, including:
The principles of perceptual coding were applied in analog communication systems long before the advent of digital media.
These analog systems demonstrated the effectiveness of tailoring transmission to the characteristics of human perception, laying the groundwork for digital perceptual coding methods.
Research in the 1970s and 1980s on psychoacoustics and psychovisual modeling provided the basis for digital perceptual coding. In audio, this led to the late-1980s development of the MPEG audio formats (such as MP3), which achieved major reductions in bit rate by discarding inaudible sound components. At the same time, the MPEG standards applied similar principles to video, using techniques such as chroma subsampling and motion-adaptive coding.
During the 1990s and 2000s, perceptual coding was embedded in widely used formats including AAC, MPEG-2 video, and H.264/AVC, supporting the rise of digital media distribution on CDs, DVDs, and early internet platforms. More recent codecs, such as HEVC, AV1, and Opus, continue to refine perceptual models to balance compression efficiency with quality on modern networks and devices.
Perceptual coding draws directly on psychoacoustics (the study of auditory perception) and psychovisual research (the study of visual perception). These disciplines provide the models that determine which parts of a signal can be safely removed without affecting perceived quality.