SoX

Last updated
Sound eXchange
Developer(s) Chris Bagwell, et al.
Initial releaseJuly 1991;32 years ago (1991-07)
Stable release
14.4.2 / 22 February 2015;8 years ago (2015-02-22)
Repository
Written in C
Operating system Cross-platform: Windows, Linux, OS X
Type Audio editing software
License GPL-2.0-or-later
LGPL-2.1-or-later [1]
Website sourceforge.net/projects/sox/

Sound eXchange (SoX) is a cross-platform audio editing software. It has a command-line interface, and is written in standard C. It is free software, licensed under GPL-2.0-or-later, with libsox licensed under LGPL-2.1-or-later, and distributed by Chris Bagwell through SourceForge. [1]

Contents

History

SoX was created in July 1991 by Lance Norskog and posted to the Usenet group alt.sources as Aural eXchange: Sound sample translator. With the second release (in November the same year) it was renamed Sound Exchange. Norskog continued to maintain and release SoX via Usenet, File Transfer Protocol (FTP), and then the web until early 1995, at which time SoX was at version 11 (gamma). In May 1996, Chris Bagwell started to maintain and release updated versions of SoX, starting with version sox-11gamma-cb. In September 2000, Bagwell registered the project at SourceForge with project name "sox". The registration was announced on 4 September 2000[ citation needed ] and SoX 12.17 was released on 7 September 2000.

Throughout its history SoX has had many contributing authors; Guido van Rossum, best known as creator of the programming language Python, was a significant contributor in SoX's early days. [2]

Features

A SoX spectrogram Spectrogram-of-swept-triangular-wave.png
A SoX spectrogram

Some of SoX's features are:

Examples

SoX being used to process some audio:

$ sox track1.wav track1-processed.flac remix - norm -3  highpass 22 gain -3 rate 48k norm -3 dither  Input File     : 'track1.wav' Channels       : 2 Sample Rate    : 44100 Precision      : 16-bit Duration       : 00:02:54.97 = 7716324 samples = 13123 CDDA sectors Sample Encoding: 16-bit Signed Integer PCM Endian Type    : little  Output File    : 'track1-processed.flac' Channels       : 1 Sample Rate    : 48000 Precision      : 16-bit Duration       : 00:02:54.97 = 8398720 samples ~ 13123 CDDA sectors Sample Encoding: 16-bit FLAC  sox: effects chain: input      44100Hz 2 channels 16 bits (multi) sox: effects chain: remix      44100Hz 2 channels 16 bits (multi) sox: effects chain: norm       44100Hz 1 channels 16 bits sox: effects chain: highpass   44100Hz 1 channels 16 bits sox: effects chain: gain       44100Hz 1 channels 16 bits (multi) sox: effects chain: rate       44100Hz 1 channels 16 bits sox: effects chain: norm       48000Hz 1 channels 16 bits sox: effects chain: dither     48000Hz 1 channels 16 bits (multi) sox: effects chain: output     48000Hz 1 channels 16 bits (multi) 

Playing some audio files:

$ play *.ogg  01 - Summer's Cauldron.ogg:    Encoding: Vorbis   Channels: 2 @ 16-bit   Track: 01 of 15 Samplerate: 44100Hz      Album: Skylarking Album gain: -7.8dB      Artist: XTC   Duration: 00:03:19.99  Title: Summer's Cauldron  In:20.8% 00:00:41.61 [00:02:38.38] Out:1.84M [  ====|====  ]        Clip:0 

Vulnerabilities

SoX has had several vulnerabilities listed in the National Vulnerability Database since its last public release in 2015. These vulnerabilities include stack and heap overflows and denial-of-service attacks.

See also

Related Research Articles

An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data is called the audio coding format and can be uncompressed, or compressed to reduce the file size, often using lossy compression. The data can be a raw bitstream in an audio coding format, but it is usually embedded in a container format or an audio data format with defined storage layer.

<span class="mw-page-title-main">Ogg</span> Open container format maintained by the Xiph.Org Foundation

Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality digital multimedia. Its name is derived from "ogging", jargon from the computer game Netrek.

Waveform Audio File Format is an audio file format standard, developed by IBM and Microsoft, for storing an audio bitstream on personal computers. It is the main format used on Microsoft Windows systems for uncompressed audio. The usual bitstream encoding is the linear pulse-code modulation (LPCM) format.

Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. WMA consists of four distinct codecs. The original WMA codec, known simply as WMA, was conceived as a competitor to the popular MP3 and RealAudio codecs. WMA Pro, a newer and more advanced codec, supports multichannel and high resolution audio. A lossless codec, WMA Lossless, compresses audio data without loss of audio fidelity. WMA Voice, targeted at voice content, applies compression using a range of low bit rates. Microsoft has also developed a digital container format called Advanced Systems Format to store audio encoded by WMA.

Shorten (SHN) is a file format used for compressing audio data. It is a form of data compression of files and is used to losslessly compress CD-quality audio files. Shorten is no longer developed and other lossless audio codecs such as FLAC, Monkey's Audio (APE), TTA, and WavPack (WV) have become more popular. It is still in use to trade concert recordings that are already encoded as Shorten files. Shorten files use the .shn file extension.

<span class="mw-page-title-main">FLAC</span> Lossless digital audio coding format

FLAC is an audio coding format for lossless compression of digital audio, developed by the Xiph.Org Foundation, and is also the name of the free software project producing the FLAC tools, the reference software package that includes a codec implementation. Digital audio compressed by FLAC's algorithm can typically be reduced to between 50 and 70 percent of its original size and decompresses to an identical copy of the original audio data.

<span class="mw-page-title-main">Direct Stream Digital</span> System for digitally encoding audio signals

Direct Stream Digital (DSD) is a trademark used by Sony and Philips for their system for digitally encoding audio signals for the Super Audio CD (SACD).

<span class="mw-page-title-main">Renoise</span> Digital audio workstation

Renoise is a digital audio workstation (DAW) based upon the heritage and development of tracker software. Its primary use is the composition of music using sound samples, soft synths, and effects plug-ins. It is also able to interface with MIDI and OSC equipment. The main difference between Renoise and other music software is the characteristic vertical timeline sequencer used by tracking software.

<span class="mw-page-title-main">K3b</span> Optical disc authoring software

K3b is a CD, DVD and Blu-ray authoring application by KDE for Unix-like computer operating systems. It provides a graphical user interface to perform most CD/DVD burning tasks like creating an Audio CD from a set of audio files or copying a CD/DVD, as well as more advanced tasks such as burning eMoviX CD/DVDs. It can also perform direct disc-to-disc copies. The program has many default settings which can be customized by more experienced users. The actual disc recording in K3b is done by the command line utilities cdrecord or cdrkit, cdrdao, and growisofs. As of version 1.0, K3b features a built-in DVD ripper.

The Apple Lossless Audio Codec (ALAC), also known as Apple Lossless, or Apple Lossless Encoder (ALE), is an audio coding format, and its reference audio codec implementation, developed by Apple Inc. for lossless data compression of digital music. After initially keeping it proprietary from its inception in 2004, in late 2011 Apple made the codec available open source and royalty-free. Traditionally, Apple has referred to the codec as Apple Lossless, though more recently it has begun to use the abbreviated term ALAC when referring to the codec.

ReplayGain is a proposed technical standard published by David Robinson in 2001 to measure and normalize the perceived loudness of audio in computer audio formats such as MP3 and Ogg Vorbis. It allows media players to normalize loudness for individual tracks or albums. This avoids the common problem of having to manually adjust volume levels between tracks when playing audio files from albums that have been mastered at different loudness levels.

WavPack is a free and open-source lossless audio compression format and application implementing the format. It is unique in the way that it supports hybrid audio compression alongside normal compression which is similar to how FLAC works. It also supports compressing a wide variety of lossless formats, including various variants of PCM and also DSD as used in SACDs, together with its support for surround audio.

iAUDIO

iAUDIO is the brand name for a range of portable media players produced by Korean consumer electronics and software corporation Cowon Systems, Inc.

<span class="mw-page-title-main">Squeezebox (network music player)</span>

Squeezebox is a family of network music players. The original device was the SliMP3, introduced in 2001 by Slim Devices. It had an Ethernet interface and played MP3 music files from a media server. The first Squeezebox was released two years later and was followed by several more models. Slim Devices was acquired by Logitech in 2006.

<span class="mw-page-title-main">Tag editor</span> Software for editing the metadata of media files

A tag editor is an app that can add, edit, or remove embedded metadata on multimedia file formats. Content creators, such as musicians, photographers, podcasters, and video producers, may need to properly label and manage their creations, adding such details as title, creator, date of creation, and copyright notice.

This article describes audio APIs and components in Microsoft Windows which are now obsolete or deprecated.

<span class="mw-page-title-main">WaveLab</span> Digital audio editor and recording computer software application for Windows and macOS

WaveLab is a digital audio editor and recording computer software application for Windows and macOS, created by Steinberg. WaveLab was started in 1995 and it is mainly the work of one programmer, Philippe Goutier.

<span class="mw-page-title-main">Mpxplay</span>

Mpxplay is a 32-bit console audio player for MS-DOS and Windows. It supports a wide range of audio codecs, playlists, as well as containers for video formats. The MS-DOS version uses a 32-bit DOS extender.

<span class="mw-page-title-main">AIMP</span> Audio player software developed by Artem Izmaylov

AIMP is a freeware audio player for Windows, Android and Linux, originally developed by Russian developer Artem Izmaylov. It supports a variety of audio codecs, and includes tools to convert audio files and edit their metadata, as well as the capability of installing user-made skins and plugins.

References

  1. 1 2 "SoX licensing".
  2. van Rossum, Guido. "Guido van Rossum - Personal Home Page". Guido's Personal Home Page. Retrieved 5 November 2021. And here is a link to SOX, to which I contributed some early code.