Filename extension | .wma |
---|---|
Internet media type | audio/x-ms-wma |
Developed by | Microsoft |
Initial release | August 17, 1999 [1] |
Type of format | Audio |
Open format? | No |
Free format? | No |
Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. WMA consists of four distinct codecs. The original WMA codec, known simply as WMA, was conceived as a competitor to the popular MP3 and RealAudio codecs. [2] [3] WMA Pro, a newer and more advanced codec, supports multichannel and high-resolution audio. [4] A lossless codec, WMA Lossless, compresses audio data without loss of audio fidelity (the regular WMA format is lossy). [4] WMA Voice, targeted at voice content, applies compression using a range of low bit rates. [4] Microsoft has also developed a digital container format called Advanced Systems Format to store audio encoded by WMA.
The first WMA codec was based on earlier work by Henrique Malvar and his team which was transferred to the Windows Media team at Microsoft. [5] Malvar was a senior researcher and manager of the Signal Processing Group at Microsoft Research, [6] whose team worked on the MSAudio project. [7] The first finalized codec was initially referred to as MSAudio 4.0. [8] [9] It was later officially released as Windows Media Audio, [1] as part of Windows Media Technologies 4.0. Microsoft claimed that WMA could produce files that were half the size of equivalent-quality MP3 files; [10] Microsoft also claimed that WMA delivered "near CD-quality" audio at 64 kbit/s. [10] The former claim however was rejected by some audiophiles [11] and both claims have been refuted through publicly-available codec listening tests. RealNetworks also challenged Microsoft's claims regarding WMA's superior audio quality compared to RealAudio. [3]
Newer versions of WMA became available: Windows Media Audio 2 in 1999, [12] Windows Media Audio 7 in 2000, [13] Windows Media Audio 8 in 2001, [14] and Windows Media Audio 9 in 2003. [4] Microsoft first announced its plans to license WMA technology to third parties in 1999. [15] Prior to Windows XP, WMA files were primarily streamed using the Windows Media Source Filter (DirectShow codec), later being removed in Windows Vista with the addition of Media Foundation. [16] Although earlier versions of Windows Media Player played WMA files, support for WMA file creation was not added until the seventh version. [17] In 2003, Microsoft released new audio codecs that were not compatible with the original WMA codec. These codecs were Windows Media Audio 9 Professional, [4] Windows Media Audio 9 Lossless, [4] and Windows Media Audio 9 Voice. [4]
All versions of WMA released since version 9.0 –namely 9.1, 9.2, and 10 –have been backwards compatible with the original v9 decoder and are therefore not considered separate codecs. The sole exception to this is the WMA 10 Professional codec whose Low Bit Rate (LBR) mode is only backwards compatible with the older WMA Professional decoders at a half sampling rate (similar to how HE-AAC is backwards compatible with AAC-LC). Full fidelity decoding of WMA 10 Professional LBR bitstreams requires a WMA version 10 or newer decoder.
A WMA file is in most circumstances contained in the Advanced Systems Format (ASF), a proprietary Microsoft container format for digital audio or digital video. [18] The ASF container format specifies how metadata about the file is to be encoded, similar to the ID3 tags used by MP3 files. Metadata may include song name, track number, artist name, and also audio normalization values. This container can optionally support digital rights management (DRM) using a combination of elliptic curve cryptography key exchange, DES block cipher, a custom block cipher, RC4 stream cipher and the SHA-1 hashing function. See Windows Media DRM for further information.
Since 2008 Microsoft has also been using WMA Professional in its Protected Interoperable File Format (PIFF) based on the ISO Base Media File Format and most commonly used for Smooth Streaming, a form of adaptive bitrate streaming over HTTP. Related industry standards such as DECE UltraViolet and MPEG-DASH have not standardized WMA as a supported audio codec, deciding in favor of the more industry-prevalent MPEG and Dolby audio codecs.
Each WMA file features a single audio track in one of the four sub-formats: WMA, WMA Pro, WMA Lossless, or WMA Voice. These formats are implemented differently from one another, such that they are technically distinct and mutually incompatible; that is to say, a device or software compatible with one sub-format does not therefore automatically support any of the other codecs. Each codec is further explained below.
Windows Media Audio (WMA) is the most common codec of the four WMA codecs. The colloquial usage of the term WMA, especially in marketing materials and device specifications, usually refers to this codec only. The first version of the codec released in 1999 is regarded as WMA 1. In the same year, the bit stream syntax, or compression algorithm, was altered in minor ways and became WMA 2. [12] Since then, newer versions of the codec have been released, but the decoding process remained the same, ensuring compatibility between codec versions. [12] WMA is a lossy audio codec based on the study of psychoacoustics. Audio signals that are deemed to be imperceptible to the human ear are encoded with reduced resolution during the compression process.
WMA can encode audio signals sampled at up to 48 kHz with up to two discrete channels (stereo). WMA 9 introduced variable bit rate (VBR) and average bit rate (ABR) coding techniques into the MS encoder although both were technically supported by the original format. [12] WMA 9.1 also added support for low-delay audio, [19] which reduces latency for encoding and decoding.
Fundamentally, WMA is a transform coder based on modified discrete cosine transform (MDCT), somewhat similar to AAC, Cook and Vorbis. The bit stream of WMA is composed of superframes, each containing 1 or more frames of 2048 samples. If the bit reservoir is not used, a frame is equal to a superframe. Each frame contains several blocks, which are 128, 256, 512, 1024, or 2048 samples long after being transformed into the frequency domain via the MDCT. In the frequency domain, masking for the transformed samples is determined, and then used to requantize the samples. Finally, the floating point samples are decomposed into coefficient and exponent parts and independently huffman coded. Stereo information is typically mid/side coded. At low bit rates, line spectral pairs (typically less than 17 kbit/s) and a form of noise coding (typically less than 33 kbit/s) can also be used to improve quality.
Like AAC and Ogg Vorbis, WMA was intended to address perceived deficiencies in the MP3 standard. Given their common design goals, the three formats ended up making similar design choices. All three are pure transform codecs. Furthermore, the MDCT implementation used in WMA is essentially a superset of those used in Ogg and AAC such that WMA iMDCT and windowing routines can be used to decode AAC and Ogg Vorbis almost unmodified. However, quantization and stereo coding is handled differently in each codec. The primary distinguishing trait of the WMA Standard format is its unique use of 5 different block sizes, compared to MP3, AAC, and Ogg Vorbis which each restrict files to just two sizes. WMA Pro extends this by adding a 6th block size used at 88.2/96 kHz sampling rate.
Certified PlaysForSure devices, as well as a large number of uncertified devices, ranging from portable hand-held music players to set-top DVD players, support the playback of WMA files. Most PlaysForSure-certified online stores distribute content using this codec only. In 2005, Nokia announced its plans to support WMA playback in future Nokia handsets. [20] In the same year, an update was made available for the PlayStation Portable (version 2.60) which allowed WMA files to be played on the device for the first time. [21]
Windows Media Audio Professional (WMA Pro) is an improved lossy codec closely related to WMA standards. It retains most of the same general coding features, but also features improved entropy coding and quantization strategies as well as more efficient stereo coding. Notably, many of the WMA standard's low bitrate features have been removed, as the core codec is designed for efficient coding at most bitrates. Its main competitors include AAC, HE-AAC, Vorbis, Dolby Digital, and DTS. It supports 16-bit and 24-bit sample bit depth, sampling rates up to 96 kHz, and up to eight discrete channels (7.1 channel surround). [22] WMA Pro also supports dynamic range compression, which reduces the volume difference between the loudest and quietest sounds in the audio track. According to Microsoft's Amir Majidimehr, WMA Pro could theoretically go beyond 7.1 surround sound and support "an unlimited number of channels"; however, Microsoft chose to limit its current capability to eight (7.1 discrete channels). [23]
The codec's bit stream syntax was frozen at the first version, WMA 9 Pro. [24] Later versions of WMA Pro introduced low-bit rate encoding, low-delay audio, [25] frequency interpolation mode, [26] and an expanded range of sampling rate and bit-depth encoding options. A WMA 10 Pro file compressed with frequency interpolation mode comprises a WMA 9 Pro track encoded at half the original sampling rate, which is then restored using a new compression algorithm. [27] In this situation, WMA 9 Pro players which have not been updated to the WMA 10 Pro codec can only decode the lower quality WMA 9 Pro stream. Starting with WMA 10 Pro, eight channel encoding starts at 128 kbit/s, and tracks can be encoded at the native audio CD resolution (44.1 kHz, 16-bit), previously the domain of WMA Standard.
Despite a growing number of supported devices and its superiority over WMA, WMA Pro still has little hardware and software support. Some notable exceptions to this are the Microsoft Zune (limited to stereo), [28] Xbox 360, [29] Windows Mobile-powered devices with Windows Media Player 10 Mobile, [30] newer Toshiba Gigabeat and Motorola devices, [31] [32] and devices running recent versions of the Rockbox alternative firmware. [33] In addition, WMA Pro is a requirement for the WMV HD certification program. [34] On the software side, Verizon utilizes WMA 10 Pro for its V CAST Music Service, [35] and Windows Media Player 11 has promoted the codec as an alternative to WMA for copying audio CD tracks. [36] WMA Pro is supported in Silverlight as of version 2 (though only in stereo mode). In the absence of the appropriate audio hardware, WMA Pro can automatically downmix multichannel audio to stereo or mono, and 24-bit resolution to 16-bit during playback.
A notable example of WMA Pro being used instead of WMA Standard is the NBC Olympics website which uses WMA 10 Pro in its low-bitrate mode at 48 kbit/s.[ citation needed ]
Windows Media Audio 9 Lossless is a lossless incarnation of Windows Media Audio, an audio codec by Microsoft, released in early 2003. It compresses an audio CD to a range of 206 to 411 MB, at bit rates of 470 to 940 kbit/s. The result is a bit-for-bit duplicate of the original audio file; in other words, the audio quality on the CD will be the same as the file when played back. WMA Lossless uses the same .WMA file extension as other Windows Media Audio formats. It supports 6 discrete channels and up to 24-bit/96 kHz lossless audio. The format has never been publicly documented, although an open-source decoder has been reverse-engineered for non-Microsoft platforms by the libav and ffmpeg projects.
Windows Media Audio Lossless (WMA Lossless) is a lossless audio codec that competes with ATRAC Advanced Lossless, Dolby TrueHD, DTS-HD Master Audio, Shorten, Monkey's Audio, FLAC, Apple Lossless, and WavPack (Since late 2011, [37] [38] [39] the last three have the advantage of being open source software and available for nearly any operating system.) Designed for archival purposes, [40] it compresses audio signals without loss of quality from the original using VBR. When decompressed, the audio signal is an exact replica of the original. The first version of the codec, WMA 9 Lossless, and its revisions support up to 96 kHz, 24-bit audio for up to 6 discrete channels (5.1 channel surround) with dynamic range compression control. The typical compression ratio for music varies between 1.7:1 and 3:1. [40] [41] [42]
Hardware support for the codec is available on the Cowon A3, [43] Cowon S9, Bang & Olufsen Serenata [44] Sony Walkman NWZ-A and NWZ-S series, Zune 4, 8, 80 30, Zune 120 (with firmware version 2.2 or later) and the Zune HD, Xbox 360, [29] Windows Mobile-powered devices with Windows Media Player 10 Mobile, [30] Windows Phone (version 8 and above), Toshiba Gigabeat S and V models, Toshiba T-400, the Meizu M3, and Best Buy's Insignia NS-DV, Pilot, and Sport music players. The Logitech Squeezebox Touch now supports the format natively despite previously only supporting it via transcoding. Like WMA Standard, WMA Lossless is being used by a few online stores to distribute music online. [45] [46] Similar to WMA Pro, the WMA Lossless decoder can perform downmixing when capable audio hardware is not present.[ citation needed ] As of 2012, the ffmpeg and libav projects have open source WMA Lossless decoders based on reverse engineering of the official decoder. Only 16-bit WMA files can be successfully decoded by ffmpeg as of June 20, 2012.
Windows Media Audio Voice (WMA Voice) is a lossy audio codec that competes with Speex (used in Microsoft's own Xbox Live online service [47] ), ACELP, and other codecs. Designed for low-bandwidth, voice playback applications, [48] it employs low-pass and high-pass filtering of sound outside the human speech frequency range to achieve higher compression efficiency than WMA. It can automatically detect sections of an audio track containing both voice and music and use the standard WMA compression algorithm instead. [24] WMA Voice supports up to 22.05 kHz for a single channel (mono) only. [48] Encoding is limited to constant bit rate (CBR) and up to 20 kbit/s. The first and only version of the codec is WMA 9 Voice.
Windows Mobile-powered devices with Windows Media Player 10 Mobile have native support for WMA 9 Voice playback. [30] In addition, BBC World Service has employed WMA Voice for its Internet radio streaming service. [49]
Microsoft claims that audio encoded with WMA sounds better than MP3 at the same bit rate; Microsoft also claims that audio encoded with WMA at lower bit rates sound better than MP3 at higher bit rates. [50] Double blind listening tests with other lossy audio codecs have shown varying results, from failure to support Microsoft's claims about its superior quality to supremacy over other codecs. One independent test conducted in May 2004 at 128 kbit/s showed that WMA was roughly equivalent to LAME MP3; inferior to AAC and Vorbis; and superior to ATRAC3 (software version). [51]
Some studies concluded:
Microsoft's claims of WMA sound quality have frequently drawn complaints. "Some audiophiles challenge Microsoft's claims regarding WMA's quality", according to a published article from EDN. [11] Another article from MP3 Developments wrote that Microsoft's claim about CD-quality audio at 64 kbit/s with WMA was "very far from the truth". [52] At the early stages of WMA's development, a representative from RealNetworks claimed that WMA was a "clear and futile effort by Microsoft to catch up with RealAudio 8". [53]
Microsoft has sometimes claimed that the sound quality of WMA at 64 kbit/s equals or exceeds that of MP3 at 128 kbit/s (both WMA and MP3 are considered near-transparent at 192 kbit/s by most listeners). In a 1999 study funded by Microsoft, National Software Testing Laboratories (NSTL) found that listeners preferred WMA at 64 kbit/s to MP3 at 128 kbit/s (as encoded by MusicMatch Jukebox). [54]
Both MP3 and WMA encoders have undergone active development and improvement for many years, so their relative quality may change over time.
Apart from Windows Media Player, most of the WMA compression formats can be played using ALLPlayer, VLC media player, [55] Media Player Classic, [56] MPlayer, RealPlayer, Winamp, Zune Software (with certain limitations—DSP plugin support and DirectSound output is disabled using the default WMA plugin),[ citation needed ] and many other software media players. The Microsoft Zune media management software supports most WMA codecs, but uses a variation of Windows Media DRM which is used by PlaysForSure.
The FFmpeg project has reverse-engineered and re-implemented the WMA codecs to allow their use on POSIX-compliant operating systems such as Linux. The Rockbox project further extended this codec to be suitable for embedded cores, allowing playback on portable MP3 players and cell phones running open source software. RealNetworks has announced plans to support playback of DRM-free WMA files in RealPlayer for Linux. [57] On the Macintosh platform, Microsoft released a PowerPC version of Windows Media Player for Mac OS X in 2003, [58] but further development of the software has ceased. Microsoft currently endorses the third-party Flip4Mac WMA, a QuickTime component that allows Macintosh users to play WMA files in any player that uses the QuickTime framework. [59] Flip4Mac, however, does not currently support the Windows Media Audio Voice codec. [60]
The core Android platform does not itself support WMA, but there is third-party WMA software for Android devices. [61]
WMA format can be played on almost all Windows Mobile and later Windows Phone devices.
There are many proprietary and open source software packages that can export audio in WMA format, including amongst many others Windows Media Player, Windows Movie Maker, Microsoft Expression Encoder, Sony Sound Forge, GOM Player, RealPlayer, [62] Adobe Premiere Pro, [63] Adobe Audition, [64] Adobe Soundbooth, and VLC media player. [65] Microsoft Office OneNote supports encoding in all WMA codecs, [66] and Windows Media Encoder supports all available bit rate and resolution options as well.
The WMA codecs are most often used with the ASF container format, which has an optional DRM facility. Windows Media DRM, which can be used in conjunction with WMA, supports time-limited music subscription services such as those offered by unlimited download services, including MTV's URGE, Napster, Rhapsody, Yahoo! Music Unlimited, and Virgin Digital. Windows Media DRM, a component of PlaysForSure and Windows Media Connect, is supported on many modern portable audio devices and streaming media clients such as Roku, SoundBridge, Xbox 360, and Wii. Players that support the WMA format but not Windows Media DRM cannot play DRM-protected files.
An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data is called the audio coding format and can be uncompressed, or compressed to reduce the file size, often using lossy compression. The data can be a raw bitstream in an audio coding format, but it is usually embedded in a container format or an audio data format with defined storage layer.
A codec is a device or computer program that encodes or decodes a data stream or signal. Codec is a portmanteau of coder/decoder.
In information technology, lossy compression or irreversible compression is the class of data compression methods that uses inexact approximations and partial data discarding to represent the content. These techniques are used to reduce data size for storing, handling, and transmitting content. The different versions of the photo of the cat on this page show how higher degrees of approximation create coarser images as more details are removed. This is opposed to lossless data compression which does not degrade the data. The amount of data reduction possible using lossy compression is much higher than using lossless techniques.
MP3 is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg, with support from other digital scientists in other countries. Originally defined as the third audio format of the MPEG-1 standard, it was retained and further extended—defining additional bit rates and support for more audio channels—as the third audio format of the subsequent MPEG-2 standard. A third version, known as MPEG-2.5—extended to better support lower bit rates—is commonly implemented but is not a recognized standard.
Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality digital multimedia. Its name is derived from "ogging", jargon from the computer game Netrek.
Windows Media Player, is the first media player and media library application that Microsoft developed to play audio and video on personal computers. It has been a component of the Microsoft Windows operating system, including Windows 9x, Windows NT, Pocket PC, and Windows Mobile. Microsoft also released editions of Windows Media Player for classic Mac OS, Mac OS X, and Solaris, but has since discontinued them.
Adaptive Transform Acoustic Coding (ATRAC) is a family of proprietary audio compression algorithms developed by Sony. MiniDisc was the first commercial product to incorporate ATRAC, in 1992. ATRAC allowed a relatively small disc like MiniDisc to have the same running time as CD while storing audio information with minimal perceptible loss in quality. Improvements to the codec in the form of ATRAC3, ATRAC3plus, and ATRAC Advanced Lossless followed in 1999, 2002, and 2006 respectively.
Monkey's Audio is an algorithm and file format for lossless audio data compression. Lossless data compression does not discard data during the process of encoding, unlike lossy compression methods such as Advanced Audio Coding, MP3, Vorbis, and Opus. Therefore, it may be decompressed to a file that is identical to the source material.
Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. It was designed to be the successor of the MP3 format and generally achieves higher sound quality than MP3 at the same bit rate.
Xiph.Org Foundation is a nonprofit organization that produces free multimedia formats and software tools. It focuses on the Ogg family of formats, the most successful of which has been Vorbis, an open and freely licensed audio format and codec designed to compete with the patented WMA, MP3 and AAC. As of 2013, development work was focused on Daala, an open and patent-free video format and codec designed to compete with VP9 and the patented High Efficiency Video Coding.
Musepack or MPC is an open source lossy audio codec, specifically optimized for transparent compression of stereo audio at bitrates of 160–180 kbit/s. It was formerly known as MPEGplus, MPEG+ or MP+.
ffdshow is an open-source unmaintained codec library that is mainly used for decoding of video in the MPEG-4 ASP and H.264/MPEG-4 AVC video formats, but it supports numerous other video and audio formats as well. It is free software released under GNU General Public License 2.0, runs on Windows, and is implemented as a Video for Windows (VFW) codec and a DirectShow filter.
High-Efficiency Advanced Audio Coding (HE-AAC) is an audio coding format for lossy data compression of digital audio defined as an MPEG-4 Audio profile in ISO/IEC 14496–3. It is an extension of Low Complexity AAC (AAC-LC) optimized for low-bitrate applications such as streaming audio. The usage profile HE-AAC v1 uses spectral band replication (SBR) to enhance the modified discrete cosine transform (MDCT) compression efficiency in the frequency domain. The usage profile HE-AAC v2 couples SBR with Parametric Stereo (PS) to further enhance the compression efficiency of stereo signals.
Gapless playback is the uninterrupted playback of consecutive audio tracks, such that relative time distances in the original audio source are preserved over track boundaries on playback. For this to be useful, other artifacts at track boundaries should not be severed either. Gapless playback is common with compact discs, gramophone records, or tapes, but is not always available with other formats that employ compressed digital audio. The absence of gapless playback is a source of annoyance to listeners of music where tracks are meant to segue into each other, such as some classical music, progressive rock, concept albums, electronic music, and live recordings with audience noise between tracks.
The following tables compare general and technical information for a variety of audio coding formats.
MPEG-1 Audio Layer III HD was an audio compression codec developed by Technicolor, formerly known as Thomson.
An audio coding format is a content representation format for storage or transmission of digital audio. Examples of audio coding formats include MP3, AAC, Vorbis, FLAC, and Opus. A specific software or hardware implementation capable of audio compression and decompression to/from a specific audio coding format is called an audio codec; an example of an audio codec is LAME, which is one of several different codecs which implements encoding and decoding audio in the MP3 audio coding format in software.
some audiophiles challenge Microsoft's claims regarding WMA's quality