ATRAC

Filename extensions: .aa3, .at3, .at9, .atp, .hma; in an OpenMG container: .oma, .omg
Internet media types: audio/ATRAC3, audio/ATRAC-ADVANCED-LOSSLESS, audio/ATRAC-X
Developed by: Sony Corporation
Type of format: Audio
Open format? No
Free format? No

Adaptive Transform Acoustic Coding (ATRAC) is a family of proprietary audio compression algorithms developed by Sony. MiniDisc was the first commercial product to incorporate ATRAC, in 1992. ATRAC allowed a relatively small disc like MiniDisc to have the same running time as CD while storing audio information with minimal perceptible loss in quality. Improvements to the codec in the form of ATRAC3, ATRAC3plus, and ATRAC Advanced Lossless followed in 1999, 2002, and 2006 respectively. [1]

Files in ATRAC3 format originally had the .aa3 extension; however, in most cases, the files would be stored in an OpenMG Audio container using the extension .oma. Previously, files that were encrypted with OpenMG had the .omg extension, which was replaced by .oma starting in SonicStage v2.1. Encryption is no longer compulsory as of v3.2.

Other MiniDisc manufacturers such as Sharp and Panasonic also implemented their own versions of the ATRAC codec.

History

ATRAC was developed for Sony's MiniDisc format. ATRAC was updated with version 2, then version 3, version 4, version 4.5, and Type R and Type S. [2]

The first major update was ATRAC3 (not to be confused with version 3 of original ATRAC) in 1999. [3] ATRAC3 was used on MiniDisc as well as the Network Walkman and Vaio Music Clip. ATRAC3plus launched in 2003 for Hi-MD, but was also compatible with some PlayStation, VAIO and Xplod devices. [4]

On 31 March 2008, Sony all but dropped the ATRAC-related codecs in the United States and Europe, including in its SonicStage-powered Connect Music Store (Sony's equivalent of iTunes and the iTunes Music Store). This was partly due to low adoption of the format, with a source claiming that 90% of European Walkman users did not use ATRAC. Walkman digital players outside Japan no longer worked with ATRAC after September 2007. [5]

ATRAC was the only codec available for music downloads from mora until October 1, 2012, when the service transitioned to a DRM-free model; it began offering FLAC files the following year. [6] [7]

ATRAC9 was designed for PlayStation audio and debuted with the PlayStation Vita. [4]

Bitrate quality

ATRAC's 292 kbit/s bitrate [8] used on the original MiniDiscs was designed to be near to CD audio quality. Later versions of ATRAC improved on earlier ones at similar bitrates. For comparison, CD audio is encoded at 1411.2 kbit/s, and lossless encoders can encode most CDs below 1000 kbit/s, with further bitrate reduction for easier-to-encode content such as voice.
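The CD figure quoted above follows directly from the CD audio parameters (44.1 kHz sampling, 16 bits per sample, two channels). The short Python sketch below reproduces it and derives the resulting compression ratio for 292 kbit/s ATRAC; the function name is purely illustrative.

```python
def pcm_bitrate_kbps(sample_rate_hz, bits_per_sample, channels):
    """Bitrate of uncompressed PCM audio in kbit/s."""
    return sample_rate_hz * bits_per_sample * channels / 1000


# CD audio: 44.1 kHz, 16-bit, stereo.
cd_kbps = pcm_bitrate_kbps(44_100, 16, 2)
atrac_kbps = 292  # original MiniDisc ATRAC rate quoted in the text

ratio = cd_kbps / atrac_kbps

print(f"CD audio: {cd_kbps:.1f} kbit/s")       # CD audio: 1411.2 kbit/s
print(f"Compression ratio: {ratio:.2f}:1")     # Compression ratio: 4.83:1
```

In other words, the original ATRAC stores roughly one fifth of the data of the equivalent CD audio.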

Performance

ATRAC algorithms were developed in close cooperation with LSI integrated circuit development engineers within Sony in order to deliver a product that could encode at high speeds and with minimal power consumption. [9] This contrasts with other codecs developed on computers without regard for the constraints of portable hardware. This is reflected in the design of the ATRAC codecs, which emphasize processing smaller groups of samples at a time to save memory at the cost of compression efficiency and additional multiplies. These trade-offs are logical for DSP systems, where memory was often at a premium compared to multiplier performance.

Sony Walkmans offer better battery life when playing ATRAC files than when playing MP3 files. However, because Sony only pushed ATRAC compatibility in Sony Ericsson Walkman-series phones in the Japanese market, ATRAC is not supported in phones for GSM/UMTS markets. Sony's Xplod series of car audio CD players supports ATRAC CDs. MiniDiscs with ATRAC-format songs have, in the past, been supported on Eclipse-brand car stereos.

Formats

ATRAC (1) (versions 1.0–4.5, Type R/S)

ATRAC1 was also used in Sony's own theater sound format, SDDS, in the 1990s, and in this context is a direct competitor to Dolby Digital (AC3) and DTS. SDDS uses ATRAC1 with 8-channel encoding, at a total encoding rate across all channels of 1168 kbit/s.
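As a quick check on these numbers, the sketch below divides the SDDS total by its channel count and compares the result with the per-channel share of the 292 kbit/s stereo rate quoted for MiniDisc. It assumes the total is split evenly across channels, which the text does not state explicitly.

```python
SDDS_TOTAL_KBPS = 1168       # total SDDS rate from the text
SDDS_CHANNELS = 8
MINIDISC_STEREO_KBPS = 292   # stereo MiniDisc rate from the text

# Assumption: the bit budget is shared equally between channels.
sdds_per_channel = SDDS_TOTAL_KBPS / SDDS_CHANNELS
minidisc_per_channel = MINIDISC_STEREO_KBPS / 2

print(sdds_per_channel)      # 146.0
print(minidisc_per_channel)  # 146.0
```

Under that assumption, each SDDS channel receives the same 146 kbit/s as one channel of a stereo MiniDisc recording.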

Two stacked quadrature mirror filters (QMFs) split the signal into three subbands.

Stereo material is encoded as two independent channels, with a total data rate of 292 kbit/s.

The high-frequency lowpass cutoff depends on the complexity of the material; some encodings retain content up to 22.05 kHz.

ATRAC1 can also be used in mono (one channel) mode, doubling recording time.
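The two-stage QMF split described above can be illustrated with a few lines of Python. This is only a sketch of the band layout, not of the filters themselves: it assumes a 44.1 kHz sampling rate (consistent with the 22.05 kHz upper limit mentioned above), that each QMF halves its input band, and that the second filter is applied to the lower half. The helper name `split_band` is hypothetical.

```python
def split_band(band):
    """Model one QMF stage: halve a (low, high) frequency band in kHz."""
    low, high = band
    mid = (low + high) / 2
    return (low, mid), (mid, high)


SAMPLE_RATE_KHZ = 44.1            # assumption: CD-style sampling rate
full_band = (0.0, SAMPLE_RATE_KHZ / 2)

# Stage 1 splits the full band; stage 2 (assumed) splits the lower half again.
lower_half, high_band = split_band(full_band)
low_band, mid_band = split_band(lower_half)

bands = [low_band, mid_band, high_band]
for name, (lo, hi) in zip(("low", "mid", "high"), bands):
    print(f"{name}: {lo:g} to {hi:g} kHz")
```

Under these assumptions the three parts are not equally wide: the two lower bands each cover a quarter of the spectrum and the top band covers the remaining half.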

FFmpeg has an implementation [10] of an ATRAC1 decoder.

ATRAC3 (LP2 and LP4 Modes)

Like ATRAC1 and MP3, ATRAC3 is a hybrid subband-MDCT encoder, but with several differences.

In ATRAC3, three stacked QMFs split the signal into four subbands.

The four subbands are then MDCT encoded using a fixed-length transform. Unlike nearly all modern formats, the transform length cannot be varied to optimize coding transients. Instead, a simpler transient encoding technique called gain control is used, in which the gain of different subbands is varied during a transient prior to MDCT and then restored during decoding after the inverse MDCT to try to smooth over transients. Additionally, prior to quantization, tonal components are subtracted from the signal and independently quantized. During decoding, they are separately reconstructed and added back to reform the original MDCT coefficients.
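The gain control idea can be sketched in plain Python. This is a toy model under loud assumptions, not the ATRAC3 bitstream: an orthonormal DCT stands in for the MDCT, the quantizer is a simple uniform one, the gain is a single hand-picked factor applied to the quiet samples before the attack, and all names (`codec`, `GAIN`, `STEP`) are hypothetical. The point is only that boosting the quiet passage before the transform and undoing the boost after the inverse transform shrinks the quantization noise that would otherwise spread in front of the transient.

```python
import math


def dct(x):
    """Orthonormal DCT-II (a stand-in for the MDCT of the real codec)."""
    n = len(x)
    return [
        math.sqrt((1 if k == 0 else 2) / n)
        * sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        for k in range(n)
    ]


def idct(c):
    """Inverse of dct() (orthonormal DCT-III)."""
    n = len(c)
    return [
        sum(
            math.sqrt((1 if k == 0 else 2) / n)
            * c[k] * math.cos(math.pi * (i + 0.5) * k / n)
            for k in range(n)
        )
        for i in range(n)
    ]


def quantize(coeffs, step):
    """Uniform quantizer: each coefficient moves by at most step / 2."""
    return [round(c / step) * step for c in coeffs]


def codec(block, step, gains=None):
    """Encode and decode one block, optionally with per-sample gain control."""
    gains = gains or [1.0] * len(block)
    boosted = [s * g for s, g in zip(block, gains)]   # encoder: gain before transform
    decoded = idct(quantize(dct(boosted), step))
    return [s / g for s, g in zip(decoded, gains)]    # decoder: undo gain afterwards


N, STEP, GAIN = 32, 0.1, 64.0
half = N // 2
# A quiet passage followed by a loud attack halfway through the block.
block = [0.01 * math.sin(0.3 * i) for i in range(half)] + \
        [math.sin(0.9 * i) for i in range(half, N)]
gains = [GAIN] * half + [1.0] * half

plain = codec(block, STEP)
controlled = codec(block, STEP, gains)

err_plain = max(abs(a - b) for a, b in zip(block[:half], plain[:half]))
err_controlled = max(abs(a - b) for a, b in zip(block[:half], controlled[:half]))
print("pre-attack error without gain control:", err_plain)
print("pre-attack error with gain control:   ", err_controlled)
```

Because the transform is orthonormal, no decoded sample can be off by more than `sqrt(N) * STEP / 2`; dividing the pre-attack samples by `GAIN` on decode tightens that worst-case bound by the same factor. A real encoder also has to detect the transient and transmit the gain values, which this sketch leaves out.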

Sony claims that the major advantage of ATRAC3 is its coding efficiency, which was tuned for portable DSP hardware with limited computing power and battery life. However, as ATRAC is a hybrid subband-MDCT codec that is algorithmically very similar to MP3, any advantage is probably exaggerated. Compared to newer formats such as Ogg Vorbis, which use a simple MDCT rather than a hybrid, ATRAC3 must perform an additional, computationally expensive inverse QMF. The hybrid system does, however, significantly reduce memory usage, which was likely a factor given the limited memory available when ATRAC was first developed.

LP2 Mode

LP2 mode uses a 132 kbit/s data rate, the quality of which is advertised to be similar to that of MP3 encoded at a similar bit rate. However, in an independent double-blind listening test (2004/05) against Ogg Vorbis, AAC, and LAME VBR MP3, for which the encoding parameters of each format are not given here, ATRAC3 came last. [11]

LP4 Mode

This reduces the data rate to 66 kbit/s (half that of LP2), partly by using joint stereo coding and a lowpass filter around 13.5 kHz. It allows 324 minutes to be recorded on an 80-minute MiniDisc, with the same padding required as LP2.

Notes

FFmpeg has an implementation [12] of an ATRAC3 decoder, which was converted to fixed-point arithmetic and implemented in the Rockbox series of firmware for ARM, Coldfire and MIPS processors. RealAudio8 is a high-bitrate implementation of ATRAC3 (up to 352.8 kbit/s). Atracdenc is an open-source implementation of an ATRAC3-compatible encoder, which can also use the RealAudio container.

The PlayStation 3 video game Race Driver: Grid uses 224 simultaneous streams of ATRAC3 compressed audio, with between one and eight channels per stream at sample rates between 24 and 48 kHz, each filtered using 512 frequency bands of adaptive equalisation, routed via six reverb units running on the same SPU co-processor (one of eight on the PS3's Cell chip), alongside 7.1 channel hybrid third-order Ambisonic mixing. [13]

ATRAC3plus

This codec is used in Sony Hi-MD Walkman devices (e.g., in the "Hi-LP" and "Hi-SP" modes), Network Walkman players, Memory Stick players, the VAIO Pocket, the PS3 and PSP consoles, and ATRAC CD players. It is a hybrid subband/MDCT codec based on a 16-channel QMF followed by a 128-point MDCT. Prior to MDCT coding, Generalized Harmonic Analysis (GHA), an improved version of the process used in ATRAC3, is used to extract tonal components. As in previous ATRAC versions, gain control rather than variable-sized transforms is used to control pre-echo, although different MDCT windows are apparently possible.
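To give a feel for the frequency resolution this structure implies, the sketch below works out the subband width and the number of spectral coefficients per channel. It rests on assumptions the text does not spell out: a 44.1 kHz sampling rate, sixteen equal-width subbands, and one spectral coefficient per MDCT point in each subband.

```python
SAMPLE_RATE_HZ = 44_100   # assumption: CD-style sampling rate
SUBBANDS = 16             # 16-channel QMF, from the text
MDCT_POINTS = 128         # 128-point MDCT, from the text

# Assumption: the QMF bank divides 0..Nyquist into equal-width subbands.
subband_width_hz = SAMPLE_RATE_HZ / 2 / SUBBANDS

# Assumption: each subband contributes MDCT_POINTS coefficients.
coefficients_per_channel = SUBBANDS * MDCT_POINTS
coefficient_spacing_hz = subband_width_hz / MDCT_POINTS

print(subband_width_hz)          # 1378.125
print(coefficients_per_channel)  # 2048
print(round(coefficient_spacing_hz, 2))
```

Under these assumptions each subband is about 1.4 kHz wide, noticeably finer than the coarse splits of the earlier QMF trees.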

[Image: An ATRAC3plus CD playing on a Sony car stereo.]

SonicStage version 3.4, released in February 2006, [14] introduced CD ripping at 320 and 352 kbit/s. [15] The available bitrates are 48, 64, 96, 128, 160, 192, 256, 320 and 352 kbit/s. The newer bitrates are not compatible with all older hardware decoders, although some older hardware has been found to be compatible with certain newer ATRAC3plus bitrates.

MiniDiscs recorded in this format are incompatible with older players.

In a test conducted by an independent firm, but financed by Sony, it was concluded that ATRAC3plus at 64 kbit/s is equal in subjective sound quality to an obsolete MP3 encoder at 128 kbit/s. [16] Performance against modern high quality MP3 encoders was not evaluated.

ATRAC Advanced Lossless

ATRAC Advanced Lossless (AAL) is a "scalable" lossless audio codec that records a lossy ATRAC3 or ATRAC3plus stream and supplements it with a stream of correction information, stored within the same file, that allows the original signal to be reproduced if desired. A player or decoder can extract and use just the ATRAC3 or ATRAC3plus data, or it can combine that data with the correction stream to reproduce the original audio exactly. The same file can therefore be decoded as either lossy or lossless. It is implemented in such a way that allows the file size to be smaller than uncompressed or compressed versions of the same file. The compressed file is approximately 30–80% of the size of the original. Benefits of scalable compression include backward compatibility, in that older devices that are not AAL-aware can still play the ATRAC3 stream without understanding the AAL format, and faster transfer between portable audio devices and a PC. [17]
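The core-plus-correction idea can be shown with a minimal Python sketch. It is not the AAL format: coarse uniform quantization stands in for the lossy ATRAC3 or ATRAC3plus core, the correction stream is stored as raw integers rather than being entropy coded, and every function name here is hypothetical.

```python
def lossy_encode(samples, step):
    """Stand-in lossy core: coarse uniform quantization (not real ATRAC3)."""
    return [round(s / step) for s in samples]


def lossy_decode(core, step):
    """Reconstruct an approximation of the signal from the lossy core."""
    return [q * step for q in core]


def scalable_encode(samples, step=256):
    """Produce a lossy core plus the correction needed to undo its error."""
    core = lossy_encode(samples, step)
    approx = lossy_decode(core, step)
    residual = [s - a for s, a in zip(samples, approx)]
    return core, residual


def scalable_decode(core, residual=None, step=256):
    """Decode lossy (core only) or lossless (core plus residual)."""
    approx = lossy_decode(core, step)
    if residual is None:
        return approx                      # legacy player: ignores the correction
    return [a + r for a, r in zip(approx, residual)]


pcm = [0, 1200, -3071, 32767, -32768, 517]   # 16-bit sample values
core, residual = scalable_encode(pcm)

print(scalable_decode(core))             # lossy approximation
print(scalable_decode(core, residual))   # bit-exact original
```

A decoder that only understands the core still gets playable audio, while one that also reads the residual recovers the input exactly. In this toy model the residual values stay small (at most half the quantization step), which is what makes a correction stream cheaper to store than the original samples.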

ATRAC Advanced Lossless is widely supported in older Walkman players and SonicStage version 4 or later. SonicStage 4 allows download of ATRAC Advanced Lossless to MiniDisc Players, PlayStation Portable, and PlayStation 3. Recent Walkman players do not support ATRAC Advanced Lossless/ATRAC.

AAL's use of a "core" (lossy) and "residual" (correction) stream is similar to the idea behind Opus, MPEG-4 SLS, DTS-HD Master Audio, Dolby TrueHD and Ogg Vorbis bitrate peeling. In fact, AAL was the first to be released in the commercial market with this scheme for backward compatibility.

WavPack hybrid mode and OptimFROG DualStream are in the same category, but store the correction stream in a separate file.

ATRAC9

According to Sony, ATRAC9 is a high-compression audio codec optimized for games, offering low delay (granularity) and low CPU and memory usage. It is used in the PS5, PS4 and PS Vita consoles. Audio middleware such as FMOD and Audiokinetic Wwise support it. [18] [19]

FFmpeg has an implementation [20] of an ATRAC9 decoder.

References

  1. "Siliconvalley.com". Archived from the original on 2007-09-02. Retrieved 2007-09-01.
  2. "Type R? Type S? What's the deal?". Reddit. 26 October 2020. Retrieved 2021-12-11.
  3. "Minidisc FAQ: MDLP (Long-Play) Mode Topics".
  4. "ReallyRareWares - Sony SonicStage (ATRAC)".
  5. "IFA 2007【ソニー編】新ウォークマンは「Goes OPEN」". www.watch.impress.co.jp. Archived from the original on 3 September 2007. Retrieved 15 May 2022.
  6. "音楽配信「mora」がDRMフリー+高音質化。10月1日から -AV Watch". Av.watch.impress.co.jp. 20 September 2012. Retrieved 2013-10-16.
  7. "mora、FLAC形式のハイレゾ音源配信を10月17日開始 - AV Watch". Av.watch.impress.co.jp. 26 September 2013. Retrieved 2013-10-17.
  8. Woudenberg, Eric. "MiniDisc FAQ: Audio Topics". Retrieved 2022-07-30.
  9. "Sony Global - ATRAC - Developer's Interview". www.sony.net.
  10. "Source code for ATRAC decoder". Archived from the original on July 26, 2011.
  11. "Results of Multiformat at 128kbit/s Listening Test". listening-tests.freetzi.com.
  12. "Source code for ATRAC3 decoder". Archived from the original on July 26, 2011.
  13. "Develop - Issue 86 - August 2008". Issuu. 14 August 2008.
  14. "Overseas Model Customer Support". April 7, 2006. Archived from the original on 2006-04-07. Retrieved 2021-08-30.
  15. "Overseas Model Customer Support". February 5, 2006. Archived from the original on 2006-02-05.
  16. "ITS test report" (PDF). Archived (PDF) from the original on 2006-04-05.
  17. "Sony Global - ATRAC - ATRAC Advanced Lossless". www.sony.net.
  18. "FMOD sound formats". Retrieved 26 March 2018.
  19. "include/AK/SoundEngine/Platforms/PS4/AkTypes.h File Reference". Retrieved 26 March 2018.
  20. "Source code for ATRAC9 decoder".