Loudness

Last updated
The horizontal axis shows frequency in Hz Lindos1.svg
The horizontal axis shows frequency in Hz

In acoustics, loudness is the subjective perception of sound pressure. More formally, it is defined as the "attribute of auditory sensation in terms of which sounds can be ordered on a scale extending from quiet to loud". [1] The relation of physical attributes of sound to perceived loudness consists of physical, physiological and psychological components. The study of apparent loudness is included in the topic of psychoacoustics and employs methods of psychophysics.

Contents

In different industries, loudness may have different meanings and different measurement standards. Some definitions, such as ITU-R BS.1770 refer to the relative loudness of different segments of electronically reproduced sounds, such as for broadcasting and cinema. Others, such as ISO 532A (Stevens loudness, measured in sones), ISO 532B (Zwicker loudness), DIN 45631 and ASA/ANSI S3.4, have a more general scope and are often used to characterize loudness of environmental noise. More modern standards, such as Nordtest ACOU112 and ISO/AWI 532-3 (in progress) take into account other components of loudness, such as onset rate, time variation and spectral masking.

Loudness, a subjective measure, is often confused with physical measures of sound strength such as sound pressure, sound pressure level (in decibels), sound intensity or sound power. Weighting filters such as A-weighting and LKFS attempt to compensate measurements to correspond to loudness as perceived by the typical human.

Explanation

The perception of loudness is related to sound pressure level (SPL), frequency content and duration of a sound. [2] The relationship between SPL and loudness of a single tone can be approximated by Stevens's power law in which SPL has an exponent of 0.67. [lower-alpha 1] A more precise model known as the Inflected Exponential function , [3] indicates that loudness increases with a higher exponent at low and high levels and with a lower exponent at moderate levels. [4]

The sensitivity of the human ear changes as a function of frequency, as shown in the equal-loudness graph. Each line on this graph shows the SPL required for frequencies to be perceived as equally loud, and different curves pertain to different sound pressure levels. It also shows that humans with normal hearing are most sensitive to sounds around 2–4 kHz, with sensitivity declining to either side of this region. A complete model of the perception of loudness will include the integration of SPL by frequency. [5]

Historically, loudness was measured using an "ear-balance" audiometer in which the amplitude of a sine wave was adjusted by the user to equal the perceived loudness of the sound being evaluated. Contemporary standards for measurement of loudness are based on the summation of energy in critical bands. [6]

Hearing loss

When sensorineural hearing loss (damage to the cochlea or in the brain) is present, the perception of loudness is altered. Sounds at low levels (often perceived by those without hearing loss as relatively quiet) are no longer audible to the hearing impaired, but sounds at high levels often are perceived as having the same loudness as they would for an unimpaired listener. This phenomenon can be explained by two theories, called loudness recruitment and softness imperception.

Loudness recruitment posits that loudness grows more rapidly for certain listeners than normal listeners with changes in level. This theory has been accepted as the classical explanation.

Softness imperception, a term coined by Mary Florentine around 2002, [7] proposes that some listeners with sensorineural hearing loss may exhibit a normal rate of loudness growth, but instead have an elevated loudness at their threshold. That is, the softest sound that is audible to these listeners is louder than the softest sound audible to normal listeners.

Compensation

The loudness control associated with a loudness compensation feature on some consumer stereos alters the frequency response curve to correspond roughly with the equal loudness characteristic of the ear. [8] Loudness compensation is intended to make the recorded music sound more natural when played at a lower levels by boosting low frequencies, to which the ear is less sensitive at lower sound pressure levels.

Normalization

Loudness normalization is a specific type of audio normalization that equalizes perceived level such that, for instance, commercials do not sound louder than television programs. Loudness normalization schemes exist for a number of audio applications.

Broadcast

Movie and home theaters

Music playback

Measurement

Historically Sone (loudness N) and Phon (loudness level LN) units have been used to measure loudness. [10]

A-weighting follows human sensitivity to sound and describes relative perceived loudness for at quiet to moderate speech levels, around 40 phons.

Relative loudness monitoring in production is measured in accordance with ITU-R BS.1770 in units of LKFS. [11] Work began on ITU-R BS.1770 in 2001 after 0 dBFS+ level distortion in converters and lossy codecs had become evident; and the original Leq(RLB)[ clarification needed ] loudness metric was proposed by Gilbert Soulodre in 2003. [12] Based on data from subjective listening tests, Leq(RLB) compared favorably to numerous other algorithms. CBC, Dolby and TC Electronic and numerous broadcasters contributed to the listening tests. Loudness levels measured according to the Leq(RLB) specified in ITU-R BS.1770 are reported in LKFS units.

The ITU-R BS.1770 measurement system was improved for made multi-channel applications (monaural to 5.1 surround sound). To make the loudness metric cross-genre friendly, a relative measurement gate was added. This work was carried out in 2008 by the EBU. The improvements were brought back into BS.1770-2. ITU subsequently updated the true-peak metric (BS.1770-3) and added provision for even more audio channels, for instance 22.2 surround sound (BS.1770-4).

See also

Notes

  1. The relationship between loudness and energy intensity of sound can therefore be approximated by a power function with an exponent of 0.3.

Related Research Articles

<span class="mw-page-title-main">Weighting filter</span>

A weighting filter is used to emphasize or suppress some aspects of a phenomenon compared to others, for measurement or other purposes.

A noise weighting is a specific amplitude-vs.-frequency characteristic that is designed to allow subjectively valid measurement of noise. It emphasises the parts of the spectrum that are most important.

The sone is a unit of loudness, the subjective perception of sound pressure. The study of perceived loudness is included in the topic of psychoacoustics and employs methods of psychophysics. Doubling the perceived loudness doubles the sone value. Proposed by Stanley Smith Stevens in 1936, it is not an SI unit.

<span class="mw-page-title-main">Phon</span> Logarithmic unit of loudness level

The phon is a logarithmic unit of loudness level for tones and complex sounds. Loudness is measured in sones, a linear unit. Human sensitivity to sound is variable across different frequencies; therefore, although two different tones may present an identical sound pressure to a human ear, they may be psychoacoustically perceived as differing in loudness. The purpose of the phon is to provide a logarithmic measurement for perceived sound magnitude, while the primary loudness standard methods result in a linear representation. A sound with a loudness of 1 sone is judged equally loud as a 1 kHz tone with a sound pressure level of 40 decibels above 20 micropascal. The phon is psychophysically matched to a reference frequency of 1 kHz. In other words, the phon matches the sound pressure level (SPL) in decibels of a similarly perceived 1 kHz pure tone. For instance, if a sound is perceived to be equal in intensity to a 1 kHz tone with an SPL of 50 dB, then it has a loudness of 50 phons, regardless of its physical properties. The phon was proposed in DIN 45631 and ISO 532 B by Stanley Smith Stevens.

<span class="mw-page-title-main">Dynamic range compression</span> Audio signal processing operation

Dynamic range compression (DRC) or simply compression is an audio signal processing operation that reduces the volume of loud sounds or amplifies quiet sounds, thus reducing or compressing an audio signal's dynamic range. Compression is commonly used in sound recording and reproduction, broadcasting, live sound reinforcement and in some instrument amplifiers.

<span class="mw-page-title-main">Audio system measurements</span> Means of quantifying system performance

Audio system measurements are a means of quantifying system performance. These measurements are made for several purposes. Designers take measurements so that they can specify the performance of a piece of equipment. Maintenance engineers make them to ensure equipment is still working to specification, or to ensure that the cumulative defects of an audio path are within limits considered acceptable. Audio system measurements often accommodate psychoacoustic principles to measure the system in a way that relates to human hearing.

The threshold of pain or pain threshold is the point along a curve of increasing perception of a stimulus at which pain begins to be felt. It is an entirely subjective phenomenon. A distinction must be maintained between the stimulus and the person's or animal's resulting pain perception. Although an IASP document defines "pain threshold" as "the minimum intensity of a stimulus that is perceived as painful", it then goes on to say that:

Traditionally the threshold has often been defined, as we defined it formerly, as the least stimulus intensity at which a subject perceives pain. Properly defined, the threshold is really the experience of the patient, whereas the intensity measured is an external event. It has been common usage for most pain research workers to define the threshold in terms of the stimulus, and that should be avoided ... The stimulus is not pain (q.v.) and cannot be a measure of pain.

<span class="mw-page-title-main">Equal-loudness contour</span> Frequency characteristics of hearing and perceived volume

An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon and is arrived at by reference to equal-loudness contours. By definition, two sine waves of differing frequencies are said to have equal-loudness level measured in phons if they are perceived as equally loud by the average young person without significant hearing impairment.

ReplayGain is a proposed technical standard published by David Robinson in 2001 to measure and normalize the perceived loudness of audio in computer audio formats such as MP3 and Ogg Vorbis. It allows media players to normalize loudness for individual tracks or albums. This avoids the common problem of having to manually adjust volume levels between tracks when playing audio files from albums that have been mastered at different loudness levels.

A weighting curve is a graph of a set of factors, that are used to 'weight' measured values of a variable according to their importance in relation to some outcome. An important example is frequency weighting in sound level measurement where a specific set of weighting curves known as A-, B-, C- and D-weighting as defined in IEC 61672 are used. Unweighted measurements of sound pressure do not correspond to perceived loudness because the human ear is less sensitive at low and high frequencies, with the effect more pronounced at lower sound levels. The four curves are applied to the measured sound level, for example by the use of a weighting filter in a sound level meter, to arrive at readings of loudness in phons or in decibels (dB) above the threshold of hearing.

In digital and analog audio, headroom refers to the amount by which the signal-handling capabilities of an audio system can exceed a designated nominal level. Headroom can be thought of as a safety zone allowing transient audio peaks to exceed the nominal level without damaging the system or the audio signal, e.g., via clipping. Standards bodies differ in their recommendations for nominal level and headroom.

<span class="mw-page-title-main">ITU-R 468 noise weighting</span> Noise measurement standard

ITU-R 468 is a standard relating to noise measurement, widely used when measuring noise in audio systems. The standard, now referred to as ITU-R BS.468-4, defines a weighting filter curve, together with a quasi-peak rectifier having special characteristics as defined by specified tone-burst tests. It is currently maintained by the International Telecommunication Union who took it over from the CCIR.

<span class="mw-page-title-main">Weighting</span>

The process of weighting involves emphasizing the contribution of particular aspects of a phenomenon over others to an outcome or result; thereby highlighting those aspects in comparison to others in the analysis. That is, rather than each variable in the data set contributing equally to the final result, some of the data is adjusted to make a greater contribution than others. This is analogous to the practice of adding (extra) weight to one side of a pair of scales in order to favour either the buyer or seller.

Programme level refers to the signal level that an audio source is transmitted or recorded at, and is important in audio if listeners of Compact Discs (CDs), radio and television are to get the best experience, without excessive noise in quiet periods or distortion of loud sounds. Programme level is often measured using a peak programme meter or a VU meter.

<span class="mw-page-title-main">Sound level meter</span> Device for acoustic measurements

A sound level meter is used for acoustic measurements. It is commonly a hand-held instrument with a microphone. The best type of microphone for sound level meters is the condenser microphone, which combines precision with stability and reliability. The diaphragm of the microphone responds to changes in air pressure caused by sound waves. That is why the instrument is sometimes referred to as a sound pressure level meter (SPL). This movement of the diaphragm, i.e. the sound pressure, is converted into an electrical signal. While describing sound in terms of sound pressure, a logarithmic conversion is usually applied and the sound pressure level is stated instead, in decibels (dB), with 0 dB SPL equal to 20 micropascals.

Loudness monitoring of programme levels is needed in radio and television broadcasting, as well as in audio post production. Traditional methods of measuring signal levels, such as the peak programme meter and VU meter, do not give the subjectively valid measure of loudness that many would argue is needed to optimise the listening experience when changing channels or swapping disks.

Dialnorm is the metadata parameter that controls playback gain within the Dolby Laboratories Dolby Digital (AC-3) audio compression system. Dialnorm stands for dialog normalization. Dialnorm is an integer value with range 1 to 31 corresponding to a playback gain of -30 to 0 dB (unity) respectively. Higher values afford more headroom and are appropriate for dynamic material such as an action film.

<span class="mw-page-title-main">A-weighting</span> Frequency response curves used in sound pressure level measurement

A-weighting is the most commonly used of a family of curves defined in the International standard IEC 61672:2003 and various national standards relating to the measurement of sound pressure level. A-weighting is applied to instrument-measured sound levels in an effort to account for the relative loudness perceived by the human ear, as the ear is less sensitive to low audio frequencies. It is employed by arithmetically adding a table of values, listed by octave or third-octave bands, to the measured sound pressure levels in dB. The resulting octave band measurements are usually added to provide a single A-weighted value describing the sound; the units are written as dB(A). Other weighting sets of values – B, C, D and now Z – are discussed below.

EBU R 128 is a recommendation for loudness normalisation and maximum level of audio signals. It is primarily followed during audio mixing of television and radio programmes and adopted by broadcasters to measure and control programme loudness. It was first issued by the European Broadcasting Union in August 2010 and most recently revised in August 2020.

Loudness, K-weighted, relative to full scale (LKFS) is a standard loudness measurement unit used for audio normalization in broadcast television systems and other video and music streaming services.

References

  1. American National Standards Institute, "American national psychoacoustical terminology" S3.20, 1973, American Standards Association.
  2. Poulsen, Torben (1981). "Loudness of tone pulses in a free field" (PDF). The Journal of the Acoustical Society of America. 69 (6): 1786–1790. Bibcode:1981ASAJ...69.1786P. doi:10.1121/1.385915. PMID   7240592. S2CID   7190836.
  3. Goldstein, E. Bruce (2009). Encyclopedia of Perception Vol. 1. Sage. p. 147. ISBN   9781412940818.
  4. Florentine, Mary; Epstein, Michael (2006). "To honor Stevens and repeal his law". Proceedings of the International Society for Psychophysics. 22.
  5. Olson, Harry (1972). "The Measurement of Loudness". Audio Magazine.
  6. As described in IEC 532, DIN 45631 and ASA/ANSI S3.4
  7. Florentine, Mary (March 2003). "It's not recruitment-gasp!! It's softness imperception". Hearing Journal. 56 (3): 10, 12, 14, 15. doi: 10.1097/01.HJ.0000293012.17887.b4 .
  8. Lenk, John D. (1998). Circuit Troubleshooting Handbook. McGraw-Hill. p. 163. ISBN   0-07-038185-2.
  9. EBU Recommendation R 128: Loudness normalisation and permitted maximum level of audio signals (PDF). European Broadcasting Union. August 2011. Retrieved 2013-04-22.
  10. Olson, Harry F. (February 1972). "The Measurement of Loudness" (PDF). Audio: 18–22.
  11. Recommendation BS.1770. International Telecommunication Union. August 2012. Retrieved 2013-05-31.
  12. "Leq Meter" . Retrieved 2015-12-15.