Sonic Visualiser

Last updated
Sonic Visualiser
Developer(s) Centre for Digital Music at Queen Mary, University of London
Stable release
5.0.1 [1] / 1 October 2024;3 months ago (1 October 2024)
Repository
Written in C++
Operating system Linux, MacOS, Windows
Type Audio analysis
License GPL-2.0-or-later [2]
Website www.sonicvisualiser.org

Sonic Visualiser is an application for viewing and analysing the contents of music audio files. It is a free software distributed under the GPL-2.0-or-later licence. [3]

Contents

History

Sonic Visualiser was developed at the Queen Mary University of London's Centre for Digital Music in 2007. It was written in C++ with Qt and released under the terms of the GNU GPL. [3]

Overview

Screenshot of the spectrum of the refrain of a pop song (precisely "Piu bella cosa" by Eros Ramazzotti): basses, drums and artist's voice can clearly be identified. Sonic Visualizer - "Piu bella cosa" by Eros Ramazzotti.png
Screenshot of the spectrum of the refrain of a pop song (precisely " Più bella cosa " by Eros Ramazzotti): basses, drums and artist's voice can clearly be identified.
Sonic visualiser melodic range spectrogram example Sonic visualiser melodic range spectrogram example.jpg
Sonic visualiser melodic range spectrogram example

Sonic Visualiser represents acoustic features of the audio file either as a waveform or as a spectrogram. [4] A spectrogram is a heatmap, where the horizontal axis represents time, the vertical axis represents frequency, and the colors show presence of frequencies. The sharpness and smoothness of the spectrogram can be configured. [5] There are three types of spectrogram:

Generic spectrogram covers the full frequency range and uses linear frequency scale. Melodic-range spectrogram covers the range which usually contains musical detail. Peak-frequency spectrogram performs phase difference calculations and estimates exact frequencies at each peak cell. [3]

The interface consists of panes and layers. Panes allow to display multiple visualisations simultaneously, and they get aligned in the time axis. A pane can have multiple layers which are used for annotation. [3] The user can configure color schemes for layers, and they can be navigated by clicking the labeled tabs. [5]

There are multiple types of annotation layers which can be edited, including time instants, time-value plots, labels and images. Time instants do not have any associated value, and they can be used to annotate points (e.g. beat locations). [3] Annotations allow the user to clarify relationships between musical parameters. [4]

Sonic Visualiser supports third-party plugins in the Vamp plugin format. The plugins take audio input and parameters and return values for display. [3] There are plugins which compute spectral flux and spectral centroid. Other plugins include automatic melody extraction, beat finding, chord analysis, etc. [5]

Sonic Visualiser is available for Linux, OS X, and Windows operating systems. [3]

See also

Related Research Articles

<span class="mw-page-title-main">XMMS</span> Free and open source audio player

X Multimedia System (XMMS) is an audio player for Unix-like systems released under a free software license.

Granular synthesis is a sound synthesis method that operates on the microsound time scale.

<span class="mw-page-title-main">Ardour (software)</span> Open-source digital audio workstation

Ardour is a hard disk recorder and digital audio workstation application that runs on Linux, macOS, FreeBSD and Microsoft Windows. Its primary author is Paul Davis, who was also responsible for the JACK Audio Connection Kit. It is intended as a digital audio workstation suitable for professional use.

Flanging is an audio effect produced by mixing two identical signals together, one signal delayed by a small and (usually) gradually changing period, usually smaller than 20 milliseconds. This produces a swept comb filter effect: peaks and notches are produced in the resulting frequency spectrum, related to each other in a linear harmonic series. Varying the time delay causes these to sweep up and down the frequency spectrum. A flanger is an effects unit that creates this effect.

<span class="mw-page-title-main">Spectrogram</span> Visual representation of the spectrum of frequencies of a signal as it varies with time

A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time. When applied to an audio signal, spectrograms are sometimes called sonographs, voiceprints, or voicegrams. When the data are represented in a 3D plot they may be called waterfall displays.

foobar2000 Freeware audio player

foobar2000 is a freeware audio player for Microsoft Windows, iOS, Android, macOS, and formerly Windows Phone, developed by Peter Pawłowski. It has a modular design, which provides user flexibility in configuration and customization. Standard "skin" elements can be individually augmented or replaced with different dials and buttons, as well as visualizers such as waveform, oscilloscope, spectrum, spectrogram (waterfall), peak and smoothed VU meters, which all of them are analysis-oriented, at least for built-in visualizations. foobar2000 offers third-party user interface modifications through a software development kit (SDK).

<span class="mw-page-title-main">Audacity (audio editor)</span> Open-source digital audio editor and recording application

Audacity is a free and open-source digital audio editor and recording application software, available for Windows, macOS, Linux, and other Unix-like operating systems.

SuperCollider is an environment and audio programming language released originally in 1996 by James McCartney for real-time audio synthesis and algorithmic composition.

<span class="mw-page-title-main">Virtual Studio Technology</span> Audio plug-in software interface

Virtual Studio Technology (VST) is an audio plug-in software interface that integrates software synthesizers and effects units into digital audio workstations. VST and similar technologies use digital signal processing to simulate traditional recording studio hardware in software. Thousands of plugins exist, both commercial and freeware, and many audio applications support VST under license from its creator, Steinberg.

<span class="mw-page-title-main">Digital audio workstation</span> Electronic device or application software used for recording, editing and producing audio files

A digital audio workstation is an electronic device or application software used for recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software program on a laptop, to an integrated stand-alone unit, all the way to a highly complex configuration of numerous components controlled by a central computer. Regardless of configuration, modern DAWs have a central interface that allows the user to alter and mix multiple recordings and tracks into a final produced piece.

<span class="mw-page-title-main">Quartz Composer</span> Node-based visual programming language

Quartz Composer is a node graph system provided as part of the Xcode development environment in macOS for processing and rendering graphical data. It is capable of making sophisticated animations for keynote or presentations and creating animated screensavers.

<span class="mw-page-title-main">Transcription (music)</span> Process of creating sheet music from a recording or performance

In music, transcription is the practice of notating a piece or a sound which was previously unnotated and/or unpopular as a written music, for example, a jazz improvisation or a video game soundtrack. When a musician is tasked with creating sheet music from a recording and they write down the notes that make up the piece in music notation, it is said that they created a musical transcription of that recording. Transcription may also mean rewriting a piece of music, either solo or ensemble, for another instrument or other instruments than which it was originally intended. The Beethoven Symphonies transcribed for solo piano by Franz Liszt are an example. Transcription in this sense is sometimes called arrangement, although strictly speaking transcriptions are faithful adaptations, whereas arrangements change significant aspects of the original piece.

Computer audition (CA) or machine listening is the general field of study of algorithms and systems for audio interpretation by machines. Since the notion of what it means for a machine to "hear" is very broad and somewhat vague, computer audition attempts to bring together several disciplines that originally dealt with specific problems or had a concrete application in mind. The engineer Paris Smaragdis, interviewed in Technology Review, talks about these systems — "software that uses sound to locate people moving through rooms, monitor machinery for impending breakdowns, or activate traffic cameras to record accidents."

<span class="mw-page-title-main">LV2</span> Open standard for audio plugins

LV2 is a set of royalty-free open standards for music production plug-ins and matching host applications. It includes support for the synthesis and processing of digital audio and CV, events such as MIDI and OSC, and provides a free alternative to audio plug-in standards such as Virtual Studio Technology (VST) and Audio Units (AU).

<span class="mw-page-title-main">WaveSurfer</span> Audio editing software

WaveSurfer is an audio editor widely used for studies of acoustic phonetics. It is a simple but fairly powerful program for interactive display of sound pressure waveforms, spectral sections, spectrograms, pitch tracks and transcriptions. It can read and write a number of transcription file formats used in industrial speech research including TIMIT.

Harmonic pitch class profiles (HPCP) is a group of features that a computer program extracts from an audio signal, based on a pitch class profile—a descriptor proposed in the context of a chord recognition system. HPCP are an enhanced pitch distribution feature that are sequences of feature vectors that, to a certain extent, describe tonality, measuring the relative intensity of each of the 12 pitch classes of the equal-tempered scale within an analysis frame. Often, the twelve pitch spelling attributes are also referred to as chroma and the HPCP features are closely related to what is called chroma features or chromagrams.

An acoustic fingerprint is a condensed digital summary, a digital fingerprint, deterministically generated from an audio signal, that can be used to identify an audio sample or quickly locate similar items in a music database.

<span class="mw-page-title-main">Audio plug-in</span> Software signal processor or synthesizer module

An audio plug-in, in computer software, is a plug-in that can add or enhance audio-related functions in a computer program, typically a digital audio workstation. Such functions may include digital signal processing or sound synthesis. Audio plug-ins usually provide their own user interface, which often contains graphical user interface (GUI) widgets that can be used to control and visualize the plug-in's audio parameters.

<span class="mw-page-title-main">Spectrum (physical sciences)</span> Concept relating to waves and signals

In the physical sciences, the term spectrum was introduced first into optics by Isaac Newton in the 17th century, referring to the range of colors observed when white light was dispersed through a prism. Soon the term referred to a plot of light intensity or power as a function of frequency or wavelength, also known as a spectral density plot.

References

  1. https://github.com/sonic-visualiser/sonic-visualiser/releases/tag/sv_v5.0.1 . Retrieved 6 December 2024.{{cite web}}: Missing or empty |title= (help)
  2. "Sonic Visualiser license". Archived from the original on 2021-05-09.
  3. 1 2 3 4 5 6 7 Cannam, Chris; Landone, Christian; Sandler, Mark (October 2010). "Sonic visualiser: An open source application for viewing, analysing, and annotating music audio files" (PDF). Proceedings of the 18th ACM international conference on Multimedia. pp. 1467–1468. doi:10.1145/1873951.1874248. ISBN   978-1-60558-933-6 . Retrieved January 18, 2024.
  4. 1 2 Dolan, David; Slaboda, John; Jensen, Henrik Jeldtoft; Cruts, Bjorn; Feygelson, Eugene (2013). "The improvisatory approach to classical music performance: an empirical investigation into its characteristics and impact" (PDF). Music Performance Research. 6. Retrieved January 19, 2024.
  5. 1 2 3 Thompson, Marc R. (December 1, 2021). "Sonic Visualiser: Visualisation, Analysis, and Annotation of Music Audio Recordings". Journal of the American Musicological Society. 74 (3): 701–714. doi:10.1525/jams.2021.74.3.701 . Retrieved January 20, 2024.