Audio analysis


Audio analysis refers to the extraction of information and meaning from audio signals for analysis, classification, storage, retrieval, synthesis, and other purposes. The observation media and interpretation methods vary: audio analysis can refer to the human ear and how people interpret an audible sound source, or to the use of technology such as an audio analyzer to evaluate other qualities of a sound source, such as amplitude, distortion, and frequency response. Once an audio source has been observed, the information revealed can then be processed for whatever logical, emotional, descriptive, or otherwise relevant interpretation the user requires.


Natural Analysis

The most prevalent form of audio analysis derives from the sense of hearing, a type of sensory perception found in much of the planet's fauna and a fundamental process for many living beings. Sounds made by the surrounding environment or by other living beings provide input to the hearing mechanism, and the listener's brain interprets that sound to decide how to respond. Functions that rely on this analysis include speech, the startle response, music listening, and more.

An inherent ability of humans, hearing is fundamental to communication across the globe, and the process of assigning meaning and value to speech is a complex but necessary function of the human body. The study of the auditory system has largely been centered on mathematics and the analysis of sinusoidal vibrations and sounds. The Fourier transform has been an essential tool in understanding how the human ear turns moving air into the audible frequency range, roughly 20 to 20,000 Hz. [1] The ear is able to take one complex waveform and separate it into distinct frequency ranges thanks to structures of the inner ear that are tuned to specific frequency bands. [2] This initial sensory input is then analyzed further up the neurological system, where the perception of sound takes place.
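The decomposition of a complex waveform into individual frequency components can be illustrated with a discrete Fourier transform. The sketch below only illustrates that frequency-domain view (it is not a model of the cochlea); the sample rate, tone frequencies, and the use of NumPy's FFT routines are choices made for the example.

    # Illustrative sketch: decomposing a complex waveform into its frequency
    # components with the discrete Fourier transform (values below are arbitrary).
    import numpy as np

    fs = 8000                                   # sample rate in Hz
    t = np.arange(0, 1.0, 1.0 / fs)             # one second of time samples
    # A "complex waveform": the sum of a 440 Hz tone and a quieter 1000 Hz tone.
    x = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 1000 * t)

    spectrum = np.fft.rfft(x)                   # frequency-domain representation
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    peaks = freqs[np.argsort(np.abs(spectrum))[-2:]]
    print(np.sort(peaks))                       # the 440 Hz and 1000 Hz components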

The auditory system also works in tandem with the neural system so that a listener can identify the direction from which a sound source originates. This ability, associated with the Haas or precedence effect, is possible because the listener has two ears, or auditory receptors: the difference in the time it takes a sound to reach each ear gives the brain the information needed to calculate the spatial position of the source. [3]
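A minimal sketch of that interaural time difference cue is shown below, assuming a simplified far-field model in which the delay is d * sin(theta) / c; the ear spacing and speed of sound used here are illustrative values, not figures from the article.

    # Interaural time difference (ITD) under a simplified far-field model:
    # delta_t = d * sin(theta) / c, where theta is the source azimuth.
    import math

    d = 0.21      # assumed distance between the ears, in metres
    c = 343.0     # speed of sound in air, in metres per second

    def itd_seconds(azimuth_deg):
        """Arrival-time difference between the ears for a source at the given
        azimuth (0 deg = straight ahead, 90 deg = directly to one side)."""
        return d * math.sin(math.radians(azimuth_deg)) / c

    for angle in (0, 30, 90):
        print(angle, "deg ->", round(itd_seconds(angle) * 1e6), "microseconds")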

Signal Analysis

[Image: Example of a frequency analyzer (spectrum analyzer display showing the noise floor)]

Audio signals can be analyzed in several different ways, depending on the kind of information desired from the signal, such as its amplitude, distortion, frequency content, or how its spectrum changes over time.

[Image: A spectrogram of the THX "Deep Note" sound]

Hardware analyzers have been the primary means of signal analysis since the earliest dedicated audio test instruments, such as Hewlett-Packard's HP 200A. Hardware analyzers are typically used in the engineering, testing, and manufacturing of professional and consumer-grade products. As computer technology progressed, integrated software found its way into these hardware systems, and eventually audio analysis tools appeared that required no hardware beyond the computer running the software. Software audio analyzers are regularly used in various stages of music production, such as live sound, mixing, and mastering. These products tend to employ fast Fourier transform (FFT) algorithms and processing to provide a visual representation of the signal being analyzed. Display and information types include the frequency spectrum, stereo field, surround field, spectrogram, and more.
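The FFT-based processing such tools employ can be sketched as follows: window a block of samples, transform it, and convert the magnitudes to decibels for display. The block size, window choice, and normalization below are assumptions made for the example, not a description of any particular product.

    # Sketch of the core of a software spectrum analyzer: one windowed FFT block
    # converted to a dB magnitude spectrum for display.
    import numpy as np

    def magnitude_spectrum_db(block, fs):
        """Return (frequencies, magnitudes in dBFS) for one block of samples."""
        window = np.hanning(len(block))
        spectrum = np.fft.rfft(block * window)
        mags = np.abs(spectrum) / (np.sum(window) / 2)    # full-scale sine -> ~0 dBFS
        mags_db = 20 * np.log10(np.maximum(mags, 1e-12))  # avoid log of zero
        freqs = np.fft.rfftfreq(len(block), d=1.0 / fs)
        return freqs, mags_db

    fs = 48000
    t = np.arange(4096) / fs
    freqs, mags_db = magnitude_spectrum_db(np.sin(2 * np.pi * 1000 * t), fs)
    peak = freqs[np.argmax(mags_db)]
    print(f"peak near {peak:.0f} Hz at {mags_db.max():.1f} dBFS")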

See also

Related Research Articles

In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression reduces bits by identifying and eliminating statistical redundancy. No information is lost in lossless compression. Lossy compression reduces bits by removing unnecessary or less important information. Typically, a device that performs data compression is referred to as an encoder, and one that performs the reversal of the process (decompression) as a decoder.
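As a small illustration of lossless coding, the sketch below uses Python's standard zlib module (chosen here purely for illustration) to compress a highly redundant byte string and then recover it exactly.

    # Lossless compression sketch: redundant data shrinks, and decompression
    # recovers the original bytes exactly, so no information is lost.
    import zlib

    original = b"audio audio audio audio audio audio audio audio"
    compressed = zlib.compress(original)
    restored = zlib.decompress(compressed)

    print(len(original), "->", len(compressed), "bytes")
    print(restored == original)   # True: the encoder/decoder pair is lossless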


A head-related transfer function (HRTF), also known as anatomical transfer function (ATF), or a head shadow, is a response that characterizes how an ear receives a sound from a point in space. As sound strikes the listener, the size and shape of the head, ears, ear canal, density of the head, size and shape of nasal and oral cavities, all transform the sound and affect how it is perceived, boosting some frequencies and attenuating others. Generally speaking, the HRTF boosts frequencies from 2–5 kHz with a primary resonance of +17 dB at 2,700 Hz. But the response curve is more complex than a single bump, affects a broad frequency spectrum, and varies significantly from person to person.
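In binaural synthesis an HRTF is typically applied by convolving a sound with a measured impulse response for each ear. The sketch below only illustrates that convolution step; the "impulse responses" are placeholder delays invented for the example, not real HRTF data.

    # Hedged sketch of HRTF-style filtering: convolve a mono signal with a
    # separate impulse response per ear. Real HRIRs are measured; these are
    # placeholders (a slight delay and attenuation for the far ear).
    import numpy as np

    fs = 44100
    mono = np.random.randn(fs)                    # one second of test signal
    hrir_left = np.zeros(64)
    hrir_left[0] = 1.0                            # placeholder: no delay
    hrir_right = np.zeros(64)
    hrir_right[20] = 0.8                          # placeholder: ~0.45 ms later, quieter

    left = np.convolve(mono, hrir_left)
    right = np.convolve(mono, hrir_right)
    binaural = np.stack([left, right], axis=1)    # 2-channel output for headphones
    print(binaural.shape)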


In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or space; this definition differs from the term's usage in statistics, which refers to a set of such values.
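A short sketch of that reduction, assuming an idealized 440 Hz sine as the continuous-time signal and an arbitrary 8 kHz sample rate:

    # Sampling sketch: the continuous signal x_c(t) = sin(2*pi*440*t) reduced to
    # the discrete sequence x[n] = x_c(n / fs).
    import numpy as np

    fs = 8000                                     # samples per second
    n = np.arange(16)                             # discrete sample indices
    samples = np.sin(2 * np.pi * 440 * n / fs)    # each value is one "sample"
    print(samples[:4])                            # the first few sample values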

In signal processing and electronics, the frequency response of a system is the quantitative measure of the magnitude and phase of its output as a function of input frequency. The frequency response is widely used in the design and analysis of systems, such as audio and control systems, where it simplifies mathematical analysis by converting the governing differential equations into algebraic equations. In an audio system, it may be used to minimize audible distortion by designing components so that the overall response is as flat (uniform) as possible across the system's bandwidth. In control systems, such as a vehicle's cruise control, it may be used to assess system stability, often through the use of Bode plots. Systems with a specific frequency response can be designed using analog and digital filters.
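As a worked example of a frequency response, the sketch below evaluates the magnitude and phase of a first-order RC low-pass filter, H(jw) = 1 / (1 + jwRC), at a few input frequencies; the component values are chosen only to place the corner near 1 kHz.

    # Frequency response of a first-order RC low-pass filter at a few frequencies.
    import cmath
    import math

    R, C = 1.0e3, 159.0e-9          # illustrative values: corner frequency ~1 kHz

    def response(f_hz):
        h = 1.0 / (1.0 + 1j * 2 * math.pi * f_hz * R * C)
        return 20 * math.log10(abs(h)), math.degrees(cmath.phase(h))

    for f in (100, 1000, 10000):
        mag_db, phase_deg = response(f)
        print(f, "Hz:", round(mag_db, 1), "dB,", round(phase_deg, 1), "deg")
    # Near the corner frequency the magnitude is about -3 dB with ~-45 deg of phase.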


A spectrum analyzer measures the magnitude of an input signal versus frequency within the full frequency range of the instrument. The primary use is to measure the power of the spectrum of known and unknown signals. The input signal that most common spectrum analyzers measure is electrical; however, spectral compositions of other signals, such as acoustic pressure waves and optical light waves, can be considered through the use of an appropriate transducer. Spectrum analyzers for other types of signals also exist, such as optical spectrum analyzers which use direct optical techniques such as a monochromator to make measurements.

Sound localization is a listener's ability to identify the location or origin of a detected sound in direction and distance.


An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon and is arrived at by reference to equal-loudness contours. By definition, two sine waves of differing frequencies are said to have equal-loudness level measured in phons if they are perceived as equally loud by the average young person without significant hearing impairment.

Binaural fusion or binaural integration is a cognitive process that involves the combination of different auditory information presented binaurally, or to each ear. In humans, this process is essential in understanding speech as one ear may pick up more information about the speech stimuli than the other.

Perceptual Evaluation of Audio Quality (PEAQ) is a standardized algorithm for objectively measuring perceived audio quality, developed in 1994-1998 by a joint venture of experts within Task Group 6Q of the International Telecommunication Union's Radiocommunication Sector (ITU-R). It was originally released as ITU-R Recommendation BS.1387 in 1998 and last updated in 2023. It utilizes software to simulate perceptual properties of the human ear and then integrates multiple model output variables into a single metric.

Computational auditory scene analysis (CASA) is the study of auditory scene analysis by computational means. In essence, CASA systems are "machine listening" systems that aim to separate mixtures of sound sources in the same way that human listeners do. CASA differs from the field of blind signal separation in that it is based on the mechanisms of the human auditory system, and thus uses no more than two microphone recordings of an acoustic environment. It is related to the cocktail party problem.

In audio signal processing, auditory masking occurs when the perception of one sound is affected by the presence of another sound.


A real-time analyzer (RTA) is a professional audio device that measures and displays the frequency spectrum of an audio signal; a spectrum analyzer that works in real time. An RTA can range from a small PDA-sized device to a rack-mounted hardware unit to software running on a laptop. It works by measuring and displaying sound input, often from an integrated microphone or with a signal from a PA system. Basic RTAs show three measurements per octave at 3 or 6 dB increments; sophisticated software solutions can show 24 or more measurements per octave as well as 0.1 dB resolution.
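The band summaries an RTA displays can be approximated in software by grouping an FFT power spectrum into fractional-octave bands. The sketch below uses third-octave bands with the usual base-2 spacing; it is an uncalibrated illustration, not a description of any particular device.

    # Group an FFT power spectrum into third-octave bands, one level per band.
    import numpy as np

    def third_octave_levels(x, fs):
        spectrum = np.abs(np.fft.rfft(x * np.hanning(len(x)))) ** 2
        freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
        centres = 1000.0 * 2.0 ** (np.arange(-10, 11) / 3.0)   # ~100 Hz to ~10 kHz
        levels = []
        for fc in centres:
            lo, hi = fc / 2 ** (1 / 6), fc * 2 ** (1 / 6)      # band edges
            band = spectrum[(freqs >= lo) & (freqs < hi)]
            levels.append(10 * np.log10(band.sum() + 1e-20))   # relative dB
        return centres, levels

    fs = 48000
    noise = np.random.randn(1 << 15)                           # white-noise test signal
    centres, levels = third_octave_levels(noise, fs)
    print(f"{centres[0]:.0f} Hz band: {levels[0]:.1f} dB (uncalibrated)")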


In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the reception of such waves and their perception by the brain. Only acoustic waves that have frequencies lying between about 20 Hz and 20 kHz, the audio frequency range, elicit an auditory percept in humans. In air at atmospheric pressure, these represent sound waves with wavelengths of 17 meters (56 ft) to 1.7 centimeters (0.67 in). Sound waves above 20 kHz are known as ultrasound and are not audible to humans. Sound waves below 20 Hz are known as infrasound. Different animal species have varying hearing ranges.
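The wavelength figures quoted above follow from lambda = c / f; a quick check with c taken as roughly 343 m/s for air at room temperature:

    # Worked check of the quoted wavelength range: lambda = c / f.
    c = 343.0                      # approximate speed of sound in air, m/s
    for f in (20.0, 20000.0):
        print(f, "Hz ->", round(c / f, 3), "m")
    # 20 Hz gives about 17 m and 20 kHz about 0.017 m (1.7 cm), matching the range above.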


Hearing, or auditory perception, is the ability to perceive sounds through an organ, such as an ear, by detecting vibrations as periodic changes in the pressure of a surrounding medium. The academic field concerned with hearing is auditory science.


In sound recording and reproduction, audio mixing is the process of optimizing and combining multitrack recordings into a final mono, stereo or surround sound product. In the process of combining the separate tracks, their relative levels are adjusted and balanced, and various processes such as equalization and compression are commonly applied to individual tracks, groups of tracks, and the overall mix. In stereo and surround sound mixing, the placement of the tracks within the stereo field is adjusted and balanced. Audio mixing techniques and approaches vary widely and have a significant influence on the final product.
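A heavily simplified sketch of the level-and-placement part of that process is given below: two mono tracks are combined into a stereo mix with per-track gain and constant-power panning. The gains, pan positions, and test tones are invented for the example, and real mixes involve far more processing.

    # Minimal mixing sketch: combine mono tracks into stereo with gain and panning.
    import numpy as np

    def mix(tracks, gains_db, pans):
        """tracks: equal-length mono arrays; pans range from -1 (left) to +1 (right)."""
        out = np.zeros((len(tracks[0]), 2))
        for track, gain_db, pan in zip(tracks, gains_db, pans):
            g = 10 ** (gain_db / 20.0)               # dB gain to linear factor
            angle = (pan + 1) * np.pi / 4            # constant-power pan law
            out[:, 0] += g * np.cos(angle) * track   # left channel
            out[:, 1] += g * np.sin(angle) * track   # right channel
        return out

    fs = 44100
    t = np.arange(fs) / fs
    vocal = np.sin(2 * np.pi * 440 * t)              # stand-in "vocal" track
    bass = np.sin(2 * np.pi * 110 * t)               # stand-in "bass" track
    stereo = mix([vocal, bass], gains_db=[-6.0, -3.0], pans=[-0.3, 0.0])
    print(stereo.shape)                              # (samples, 2 channels)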


In signal processing, sub-band coding (SBC) is any form of transform coding that breaks a signal into a number of different frequency bands, typically by using a fast Fourier transform, and encodes each one independently. This decomposition is often the first step in data compression for audio and video signals.
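A toy version of that decomposition is sketched below: the signal is split into a low and a high band (here with an FFT, for simplicity), each band is "coded" with a different quantization step, and the bands are summed to reconstruct the signal. Real codecs use filter banks and psychoacoustic models; everything here is an assumption made for illustration.

    # Toy sub-band coding sketch: split into two bands, quantize each band
    # independently (coarser in the upper band), then recombine.
    import numpy as np

    def split_bands(x, fs, cutoff_hz):
        spectrum = np.fft.rfft(x)
        freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
        low = spectrum.copy()
        low[freqs >= cutoff_hz] = 0
        high = spectrum.copy()
        high[freqs < cutoff_hz] = 0
        return np.fft.irfft(low, n=len(x)), np.fft.irfft(high, n=len(x))

    def quantize(band, step):
        return np.round(band / step) * step          # stand-in for per-band coding

    fs = 16000
    t = np.arange(fs) / fs
    x = np.sin(2 * np.pi * 300 * t) + 0.3 * np.sin(2 * np.pi * 5000 * t)
    low, high = split_bands(x, fs, cutoff_hz=2000)
    reconstructed = quantize(low, 0.01) + quantize(high, 0.05)
    print(round(float(np.max(np.abs(reconstructed - x))), 3))  # small residual error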

The neural encoding of sound is the representation of auditory sensation and perception in the nervous system. As contemporary neuroscience continues to develop, what is known of the auditory system is continually being refined. The encoding of sounds includes the transduction of sound waves into electrical impulses along auditory nerve fibers, and further processing in the brain.

Psychoacoustics is the branch of psychophysics involving the scientific study of sound perception and audiology, that is, how the human auditory system perceives various sounds. More specifically, it is the branch of science studying the psychological responses associated with sound. Psychoacoustics is an interdisciplinary field drawing on many areas, including psychology, acoustics, electronic engineering, physics, biology, physiology, and computer science.

Ernst Terhardt is a German engineer and psychoacoustician who made significant contributions in diverse areas of audio communication including pitch perception, music cognition, and Fourier transformation. He was professor in the area of acoustic communication at the Institute of Electroacoustics, Technical University of Munich, Germany.


In the physical sciences, the term spectrum was introduced first into optics by Isaac Newton in the 17th century, referring to the range of colors observed when white light was dispersed through a prism. Soon the term referred to a plot of light intensity or power as a function of frequency or wavelength, also known as a spectral density plot.

References

  1. Acton, Ciaran; Miller, Robert; Maltby, John; Fullerton, Deirdre (2009). "Analysis of Variance (ANOVA)". SPSS for Social Scientists. Macmillan Education UK. pp. 183–198. doi:10.1007/978-1-137-01390-3_9. ISBN 9780230209930.
  2. Guha, Martin (December 2006). Review of Elsevier's Dictionary of Psychological Theories, compiled by J.E. Roeckelein (Amsterdam: Elsevier, 2006). pp. 10–11. doi:10.1108/09504120610709402. ISBN 0-444-51750-2. ISSN 0950-4125.
  3. Farmer, Lesley (2011-01-18). Review of A/V A to Z: An Encyclopedic Dictionary of Media, Entertainment and Other Audiovisual Terms, by Richard W. Kroon (Jefferson, NC: McFarland, 2010). p. 50. doi:10.1108/09504121111103335. ISBN 978-0-7864-4405-2. ISSN 0950-4125.