Audio system measurements are a means of quantifying system performance. These measurements are made for several purposes. Designers take measurements so that they can specify the performance of a piece of equipment. Maintenance engineers make them to ensure equipment is still working to specification, or to ensure that the cumulative defects of an audio path are within limits considered acceptable. Audio system measurements often accommodate psychoacoustic principles to measure the system in a way that relates to human hearing.
Measurements based on psychoacoustics, such as the measurement of noise, often use a weighting filter. It is well established that human hearing is more sensitive to some frequencies than others, as demonstrated by equal-loudness contours, but it is not well appreciated that these contours vary depending on the type of sound. The measured curves for pure tones, for instance, are different from those for random noise. The ear also responds less well to short bursts, below 100 to 200 ms, than to continuous sounds such that a quasi-peak detector has been found to give the most representative results when noise contains click or bursts, as is often the case for noise in digital systems. For these reasons, a set of subjectively valid measurement techniques have been devised and incorporated into BS, IEC, EBU and ITU standards. These methods of audio quality measurement are used by broadcast engineers throughout most of the world, as well as by some audio professionals, though the older A-weighting standard for continuous tones is still commonly used by others.
No single measurement can assess audio quality. Instead, engineers use a series of measurements to analyze various types of degradation that can reduce fidelity. Thus, when testing an analogue tape machine it is necessary to test for wow and flutter and tape speed variations over longer periods, as well as for distortion and noise. When testing a digital system, testing for speed variations is normally considered unnecessary because of the accuracy of clocks in digital circuitry, but testing for aliasing and timing jitter is often desirable, as these have caused audible degradation in many systems.
Once subjectively valid methods have been shown to correlate well with listening tests over a wide range of conditions, then such methods are generally adopted as preferred. Standard engineering methods are not always sufficient when comparing like with like. One CD player, for example, might have higher measured noise than another CD player when measured with a RMS method, or even an A-weighted RMS method, yet sound quieter and measure lower when 468-weighting is used. This could be because it has more noise at high frequencies, or even at frequencies beyond 20kHz, both of which are less important since human ears are less sensitive to them. (See noise shaping.) This effect is how Dolby B works and why it was introduced. Cassette noise, which was predominately high frequency and unavoidable given the small size and speed of the recorded track could be made subjectively much less important. The noise sounded 10dB quieter, but failed to measure much better unless 468-weighting was used rather than A-weighting.
This measurement tells you over what frequency range output level for an audio component will remain reasonably constant (either within a specified decibel range, or no more than a certain number of dB from the amplitude at 1kHz). Some audio components such as tone controls are designed to adjust the loudness of signal content at particular frequencies, e.g., a bass control allows the attenuation or accentuation of low-frequency signal content, in which case the specification may specify the frequency response is taken with tone controls "flat" or disabled. Preamplifiers may also contain equalizers, filters for example to play LPs requiring RIAA frequency response correction, in which case the specification may describe how closely the response matches the standard. By comparison, Frequency range is a term sometimes used of loudspeakers and other transducers to indicate the frequencies that are usable, without normally specifying a decibel range. Power bandwidth is also related to frequency response - indicating the range of frequencies usable at high power (since frequency response measurements are normally taken at low signal levels, where slew rate limitations or transformer saturation would not be a problem.
A component having a 'flat' frequency response will not change the weighting (i.e., intensity) of signal content across the specified frequency range. The frequency range often specified for audio components is between 20 Hz to 20 kHz, which broadly reflects the human hearing range (the highest audible frequency for most people is less than 20 kHz, with 16 kHz being more typical ). Components with 'flat' frequency responses are often described as being linear. Most audio components are designed to be linear across their entire operating range. Well-designed solid-state amplifiers and CD players may have a frequency response that varies by only 0.2 dB between 20 Hz to 20 kHz. Loudspeakers tend to have considerably less flat frequency responses than this.
Music material contains distinct tones, and some kinds of distortion involve spurious tones at double or triple the frequencies of those tones. Such harmonically related distortion is called harmonic distortion. For high fidelity, this is usually expected to be < 1% for electronic devices; mechanical elements such as loudspeakers usually have inescapable higher levels. Low distortion is relatively easy to achieve in electronics with use of negative feedback, but the use of high levels of feedback in this manner has been the topic of much controversy among audiophiles. Essentially all loudspeakers produce more distortion than electronics, and 1–5% distortion is not unheard of at moderately loud listening levels. Human ears are less sensitive to distortion in the low frequencies, and levels are usually expected to be under 10% at loud playback. Distortion that creates only even-order harmonics for a sine wave input is sometimes considered less bothersome than odd-order distortion.
Output power for amplifiers is ideally measured and quoted as maximum Root Mean Square (RMS) power output per channel, at a specified distortion level at a particular load, which, by convention and government regulation, is considered the most meaningful measure of power available on music signals, though real, non-clipping music has a high peak-to-average ratio, and usually averages well below the maximum possible. The commonly given measurement of PMPO (peak music power out) is largely meaningless and often used in marketing literature; in the late 1960s there was much controversy over this point and the US Government (FTA) required that RMS figures be quoted for all high fidelity equipment. Music power has been making a comeback in recent years. See also Audio power.
Power specifications require the load impedance to be specified, and in some cases two figures will be given (for instance, the output power of a power amplifier for loudspeakers will be typically measured at 4 and 8 ohms). To deliver maximum power to the load, the impedance of the driver should be the complex conjugate of the impedance of the load. In the case of a purely resistive load, the resistance of the driver should be equal to the resistance of the load to achieve maximum output power. This is referred to as impedance matching.
Distortion that is not harmonically related to the signal being amplified is intermodulation distortion. It is a measure of the level of spurious signals resulting from unwanted combination of different frequency input signals. This effect results from non-linearities in the system. Sufficiently high levels of negative feedback can reduce this effect in an amplifier. Many believe it is better to design electronics in a way to minimize feedback levels, though this is difficult to achieve while meeting other high accuracy requirements. Intermodulation in loudspeaker drivers is, as with harmonic distortion, almost always larger than in most electronics. IMD increases with cone excursion. Reducing a driver's bandwidth directly reduces IMD. This is achieved by splitting the desired frequency range into separate bands and employing separate drivers for each band of frequencies, and feeding them through a crossover filter network. Steep slope crossover filters are most effective at IMD reduction, but may be too expensive to implement using high-current components and may introduce ringing distortion. Intermodulation distortion in multi-driver loudspeakers can be greatly reduced with the use of active crossover, though it significantly increases system cost and complexity.
The level of unwanted noise generated by the system itself, or by interference from external sources added to the signal. Hum usually refers to noise only at power line frequencies (as opposed to broadband white noise), which is introduced through induction of power line signals into the inputs of gain stages. Or from inadequately regulated power supplies.
The introduction of noise (from another signal channel) caused by ground currents, stray inductance or capacitance between components or lines. Crosstalk reduces, sometimes noticeably, separation between channels (e.g., in a stereo system). A crosstalk measurement yields a figure in dB relative to a nominal level of signal in the path receiving interference. Crosstalk is normally only a problem in equipment that processes multiple audio channels in the same chassis.
In balanced audio systems, there are equal and opposite signals (difference-mode) in inputs, and any interference imposed on both leads will be subtracted, canceling out that interference (i.e., the common-mode). CMRR is a measure of a system's ability to ignore such interference and especially hum at its input. It is generally only significant with long lines on an input, or when some kinds of ground loop problems exist. Unbalanced inputs do not have common mode resistance; induced noise on their inputs appears directly as noise or hum.
The difference between the maximum level a component can accommodate and the noise level it produces. Input noise is not counted in this measurement. It is measured in dB.
Dynamic range refers to the ratio of maximum to minimum loudness in a given signal source (e.g., music or programme material), and this measurement also quantifies the maximum dynamic range an audio system can carry. This is the ratio (usually expressed in dB) between the noise floor of the device with no signal and the maximum signal (usually a sine wave) that can be output at a specified (low) distortion level.
Since the early 1990s it has been recommended by several authorities including the Audio Engineering Society that measurements of dynamic range be made with an audio signal present. This avoids questionable measurements based on the use of blank media or muting circuits.
Signal-to-noise ratio (SNR), however, is the ratio between the noise floor and an arbitrary reference level or alignment level. In "professional" recording equipment, this reference level is usually +4 dBu (IEC 60268-17), though sometimes 0 dBu (UK and Europe - EBU standard Alignment level). 'Test level', 'measurement level' and 'line-up level' mean different things, often leading to confusion. In "consumer" equipment, no standard exists, though −10 dBV and −6 dBu are common.
Different media characteristically exhibit different amounts of noise and headroom. Though the values vary widely between units, a typical analogue cassette might give 60 dB, a CD almost 100 dB. Most modern quality amplifiers have >110 dB dynamic range, which approaches that of the human ear, usually taken as around 130 dB. See Programme levels.
A perfect audio component will maintain the phase coherency of a signal over the full range of frequencies. Phase distortion can be extremely difficult to reduce or eliminate. The human ear is largely insensitive to phase distortion, though it is exquisitely sensitive to relative phase relationships within heard sounds. The complex nature of our sensitivity to phase errors, coupled with the lack of a convenient test that delivers an easily understood quality rating, is the reason that it is not a part of conventional audio specifications. Multi-driver loudspeaker systems may have complex phase distortions, caused or corrected by crossovers, driver placement, and the phase behaviour of the specific driver.
A system may have low distortion for a steady-state signal, but not on sudden transients. In amplifiers, this problem can be traced to power supplies in some instances, to insufficient high-frequency performance or to excessive negative feedback. Related measurements are slew rate and rise time. Distortion in transient response can be hard to measure. Many otherwise good power amplifier designs have been found to have inadequate slew rates, by modern standards. In loudspeakers, transient response performance is affected by the mass and resonances of drivers and enclosures and by group delay and phase delay introduced by crossover filtering or inadequate time alignment of the loudspeaker's drivers. Most loudspeakers generate significant amounts of transient distortion, though some designs are less prone to this (e.g. electrostatic loudspeakers, plasma arc tweeters, ribbon tweeters and horn enclosures with multiple entry points).
A higher number is generally believed to be better. This is a measure of how well a power amplifier controls the undesired motion of a loudspeaker driver. An amplifier must be able to suppress resonances caused by mechanical motion (e.g., inertia) of a speaker cone, especially a low-frequency driver with greater mass. For conventional loudspeaker drivers, this essentially involves ensuring that the output impedance of the amplifier is close to zero and that the speaker wires are sufficiently short and have sufficiently large diameter. Damping factor is the ratio of the output impedance of an amplifier and connecting cables to the DC resistance of a voice coil, which means that long, high resistance speaker wires will reduce the damping factor. A damping factor of 20 or greater is considered adequate for live sound reinforcement systems, as the SPL of inertia-related driver movement is 26dB less than signal level and won't be heard. Negative feedback in an amplifier lowers its effective output impedance and thus increases its damping factor.
These measurements are related to physical motion in a component, largely the drive mechanism of analogue media, such as vinyl records and magnetic tape. "Wow" is slow speed (a few Hz) variation, caused by longer-term drift of the drive motor speed, whereas "flutter" is faster speed (a few tens of Hz) variations, usually caused by mechanical defects such as out-of-roundness of the capstan of a tape transport mechanism. The measurement is given in% and a lower number is better.
The measure of the low frequency (many tens of Hz) noise contributed by the turntable of an analogue playback system. It is caused by imperfect bearings, uneven motor windings, vibrations in driving bands in some turntables, room vibrations (e.g., from traffic) that is transmitted by the turntable mounting and so to the phono cartridge. A lower number is better.
Note that digital systems do not suffer from many of these effects at a signal level, though the same processes occur in the circuitry since the data being handled is symbolic. As long as the symbol survives the transfer between components, and can be perfectly regenerated (e.g., by pulse shaping techniques) the data itself is perfectly maintained. The data is typically buffered in a memory, and is clocked out by a very precise crystal oscillator. The data usually does not degenerate as it passes through many stages, because each stage regenerates new symbols for transmission.
Digital systems have their own problems. Digitizing adds noise, which is measurable and depends on the audio bit depth of the system, regardless of other quality issues. Timing errors in sampling clocks (jitter) result in non-linear distortion (FM modulation) of the signal. One quality measurement for a digital system (Bit Error Rate) relates to the probability of an error in transmission or reception. Other metrics on the quality of the system are defined by the sample rate and bit depth. In general, digital systems are much less prone to error than analogue systems; However, nearly all digital systems have analogue inputs and/or outputs, and certainly all of those that interact with the analogue world do so. These analogue components of the digital system can suffer analogue effects and potentially compromise the integrity of a well designed digital system.
A measurement of the variation in period (periodic jitter) and absolute timing (random jitter) between measured clock timing versus an ideal clock. Less jitter is generally better for sampling systems.
A specification of the rate at which measurements are taken of the analogue signal. This is measured in samples per second, or hertz. A higher sampling rate allows a greater total bandwidth or pass-band frequency response and allows less-steep anti-aliasing/anti-imaging filters to be used in the stop-band, which can in turn improve overall phase linearity in the pass-band.
Example: A 16-bit system has 216 different possibilities, from 0 – 65,535. The smallest signal without dithering is 1, so the number of different levels is one less, 216 - 1.
So for a 16-bit digital system, the Dynamic Range is 20·log(216 - 1) ≈ 96 dB.
Not as much a specification as an ability. Since independent digital audio devices are each run by their own crystal oscillator, and no two crystals are exactly the same, the sample rate will be slightly different. This will cause the devices to drift apart over time. The effects of this can vary. If one digital device is used to monitor another digital device, this will cause dropouts or distortion in the audio, as one device will be producing more or less data than the other per unit time. If two independent devices record at the same time, one will lag the other more and more over time. This effect can be circumvented with a word clock synchronization. It can also be corrected in the digital domain using a drift correction algorithm. Such an algorithm compares the relative rates of two or more devices and drops or adds samples from the streams of any devices that drift too far from the master device. Sample rate will also vary slightly over time, as crystals change in temperature, etc. See also clock recovery
Differential non-linearity and integral non-linearity are two measurements of the accuracy of an analog-to-digital converter. Basically, they measure how close the threshold levels for each bit are to the theoretical equally-spaced levels.
Automated sequence testing
Sequence testing uses a specific sequence of test signals, for frequency response, noise, distortion etc., generated and measured automatically to carry out a complete quality check on a piece of equipment or signal path. A single 32-second sequence was standardized by the EBU in 1985, incorporating 13 tones (40Hz–15kHz at −12dB) for frequency response measurement, two tones for distortion (1024Hz/60Hz at +9dB) plus crosstalk and compander tests. This sequence, which began with a 110-baudFSK signal for synchronizing purposes, also became CCITT standard O.33 in 1985.
Lindos Electronics expanded the concept, retaining the FSK concept, and inventing segmented sequence testing, which separated each test into a 'segment' starting with an identifying character transmitted as 110-baud FSK so that these could be regarded as 'building blocks' for a complete test suited to a particular situation. Regardless of the mix chosen, the FSK provides both identification and synchronization for each segment, so that sequence tests sent over networks and even satellite links are automatically responded to by measuring equipment. Thus TUND represents a sequence made up of four segments which test the alignment level, frequency response, noise and distortion in less than a minute, with many other tests, such as Wow and flutter, Headroom, and Crosstalk also available in segments as well as a whole.
The Lindos sequence test system is now a 'de facto' standard in broadcasting and many other areas of audio testing, with over 25 different segments recognized by Lindos test sets, and the EBU standard is no longer used.
Many audio components are tested for performance using objective and quantifiable measurements, e.g., THD, dynamic range and frequency response. Some take the view that objective measurements are useful and often relate well to subjective performance, i.e., the sound quality as experienced by the listener. Floyd Toole has extensively evaluated loudspeakers in acoustical engineering research. In a peer reviewedscientific journal, Toole has presented findings that subjects have a range of abilities to distinguish good loudspeakers from bad, and that blind listening tests are more reliable than sighted tests. He found that subjects can more accurately perceive differences in speaker quality during monaural playback though a single loudspeaker, whereas subjective perception of stereophonic sound is more influenced by room effects. One of Toole's papers showed that objective measurements of loudspeaker performance match subjective evaluations in listening tests.
Some argue that because human hearing and perception are not fully understood, listener experience should be valued above everything else. This tactic is often encountered in the high-endhome audio world, where it is used to sell amplifiers with poor specifications. The usefulness of blind listening tests and common objective performance measurements, e.g., THD, are questioned. For instance, crossover distortion at a given THD is much more audible than clipping distortion at the same THD, since the harmonics produced are at higher frequencies. This does not imply that the defect is somehow unquantifiable or unmeasurable; just that a single THD number is inadequate to specify it and must be interpreted with care. Taking THD measurements at different output levels would expose whether the distortion is clipping (which increases with level) or crossover (which decreases with level).
Whichever the view, some measurements have been traditionally used, despite having no objective value. For example, THD is an average of a number of harmonics equally weighted, even though research performed decades ago identifies that lower order harmonics are harder to hear at the same level, compared with higher-order ones. In addition, even-order harmonics are said to be generally harder to hear than odd order. A number of formulas that attempt to correlate THD with actual audibility have been published, however, none have gained mainstream use.
The mass market consumer magazine Stereophile promotes the claim that home audio enthusiasts prefer sighted tests than blind tests.
In electronics, the figures of merit of an amplifier are numerical measures that characterize its properties and performance. Figures of merit can be given as a list of specifications that include properties such as gain, bandwidth, noise and linearity, among others listed in this article. Figures of merit are important for determining the suitability of a particular amplifier for an intended use.
High fidelity is a term used by listeners, audiophiles and home audio enthusiasts to refer to high-quality reproduction of sound. This is in contrast to the lower quality sound produced by inexpensive audio equipment, AM radio, or the inferior quality of sound reproduction that can be heard in recordings made until the late 1940s.
Distortion is the alteration of the original shape of something. In communications and electronics it means the alteration of the waveform of an information-bearing signal, such as an audio signal representing sound or a video signal representing images, in an electronic device or communication channel.
The total harmonic distortion is a measurement of the harmonic distortion present in a signal and is defined as the ratio of the sum of the powers of all harmonic components to the power of the fundamental frequency. Distortion factor, a closely related term, is sometimes used as a synonym.
A signal generator is one of a class of electronic devices that generates electronic signals with set properties of amplitude, frequency, and wave shape. These generated signals are used as a stimulus for electronic measurements, typically used in designing, testing, troubleshooting, and repairing electronic or electroacoustic devices, though it often has artistic uses as well.
A loudspeaker is an electroacoustic transducer; a device which converts an electrical audio signal into a corresponding sound. The most widely used type of speaker is the dynamic speaker. The sound source must be amplified or strengthened with an audio power amplifier before the signal is sent to the speaker.
In electronics, a digital-to-analog converter is a system that converts a digital signal into an analog signal. An analog-to-digital converter (ADC) performs the reverse function.
Audio crossovers are a type of electronic filter circuitry used in a range of audio applications. They split up an audio signal into two or more frequency ranges, so that the signals can be sent to loudspeaker drivers that are designed to operate within different frequency ranges. Crossovers are often described as "two-way" or "three-way", which indicate, respectively, that the crossover splits a given signal into two frequency ranges or three frequency ranges. Crossovers are used in loudspeaker cabinets, power amplifiers in consumer electronics and pro audio and musical instrument amplifier products. For the latter two markets, crossovers are used in bass amplifiers, keyboard amplifiers, bass and keyboard speaker enclosures and sound reinforcement system equipment.
An audiophile is a person who is enthusiastic about high-fidelity sound reproduction. An audiophile seeks to reproduce the sound of a live musical performance, typically in a room with good acoustics. It is widely agreed that reaching this goal is very difficult and that even the best-regarded recording and playback systems rarely, if ever, achieve it.
Sound can be recorded and stored and played using either digital or analog techniques. Both techniques introduce errors and distortions in the sound, and these methods can be systematically compared. Musicians and listeners have argued over the superiority of digital versus analog sound recordings. Arguments for analog systems include the absence of fundamental error mechanisms which are present in digital audio systems, including aliasing and quantization noise. Advocates of digital point to the high levels of performance possible with digital audio, including excellent linearity in the audible band and low levels of noise and distortion.
Audio power is the electrical power transferred from an audio amplifier to a loudspeaker, measured in watts. The electrical power delivered to the loudspeaker, together with its efficiency, determines the sound power generated.
A single-ended triode (SET) is a vacuum tube electronic amplifier that uses a single triode to produce an output, in contrast to a push-pull amplifier which uses a pair of devices with antiphase inputs to generate an output with the wanted signals added and the distortion components subtracted. Single-ended amplifiers normally operate in Class A; push-pull amplifiers can also operate in Classes AB or B without excessive net distortion, due to cancellation.
Audio noise measurement is carried out to assess the quality of audio equipment, such as is used in recording studios, broadcast engineering, and in-home high fidelity.
Loudspeaker measurement is the practice of determining the behaviour of loudspeakers by measuring various aspects of performance. This measurement is especially important because loudspeakers, being transducers, have a higher level of distortion than other audio system components used in playback or sound reinforcement.
A total harmonic distortion analyzer calculates the total harmonic content of a sinewave with some distortion, expressed as total harmonic distortion (THD). A typical application is to determine the THD of an amplifier by using a very-low-distortion sinewave input and examining the output. The figure measured will include noise, and any contribution from imperfect filtering out of the fundamental frequency. Harmonic-by-harmonic measurement, without wideband noise, can be measured by a more complex wave analyser.
Technical specifications and detailed information on the valve audio amplifier, including its development history.
Audio equipment testing is the measurement of audio quality through objective and/or subjective means. The results of such tests are published in journals, magazines, whitepapers, websites, and in other media.
Spurious-free dynamic range (SFDR) is the strength ratio of the fundamental signal to the strongest spurious signal in the output. It is also defined as a measure used to specify analog-to-digital and digital-to-analog converters and radio receivers.
Tube sound is the characteristic sound associated with a vacuum tube amplifier, a vacuum tube-based audio amplifier. At first, the concept of tube sound did not exist, because practically all electronic amplification of audio signals was done with vacuum tubes and other comparable methods were not known or used. After introduction of solid state amplifiers, tube sound appeared as the logical complement of transistor sound, which had some negative connotations due to crossover distortion in early transistor amplifiers. The audible significance of tube amplification on audio signals is a subject of continuing debate among audio enthusiasts.
An Audio Analyzer is a test and measurement instrument used to objectively quantify the audio performance of electronic and electro-acoustical devices. Audio quality metrics cover a wide variety of parameters, including level, gain, noise, harmonic and intermodulation distortion, frequency response, relative phase of signals, interchannel crosstalk, and more. In addition, many manufacturers have requirements for behavior and connectivity of audio devices that require specific tests and confirmations.
↑ Moore, Brian C. J., An Introduction to the Psychology of Hearing, 2004, 5th ed. p137, Elsevier Press
↑ BBC Research Report EL17, The Assessment of Noise in Audio Frequency Circuits, 1968.