This article is written like a personal reflection, personal essay, or argumentative essay that states a Wikipedia editor's personal feelings or presents an original argument about a topic.(March 2014) |
Media type |
|
---|---|
Encoding | Digital |
Read mechanism | DSD |
Write mechanism | Super Audio CD |
Standard | ISO/IEC 14496-3 |
Usage | Audio recording |
Extended to | present |
Released | 1999 |
Direct Stream Digital (DSD) is a trademark used by Sony and Philips for their system for digitally encoding audio signals for the Super Audio CD (SACD).
DSD uses delta-sigma modulation, a form of pulse-density modulation encoding, a technique to represent audio signals in digital format, a sequence of single-bit values at a sampling rate of 2.8224 MHz. This is 64 times the CD audio sampling rate of 44.1 kHz, but with 1-bit samples instead of 16-bit samples. Noise shaping of the 64-times oversampled signal provides low quantization noise and low distortion in the audible bandwidth necessary for high resolution audio.
DSD is simply a format for storing a delta-sigma signal without applying a decimation process that converts the signal to a PCM signal.
DSD technology was developed and commercialized by Sony and Philips, the designers of the audio CD. However, in 2005, Philips sold its DSD tool division to Sonic Studio. [1] [ better source needed ]
DVD-Audio was endorsed by the Warner Music Group, while the SACD format was endorsed by Sony and Universal Music Group, with an especially high-profile by UMG imprint Virgin Records. [2] [3] Despite this, in 2011, The Warner Premium Sound series of albums was released by Warner Music Group, marking the first time the label released titles in a SACD format, with recording in DSD. [4] The series grew to ten rock and pop albums, with Super Audio CD/CD hybrid discs containing both an SACD layer and a standard CD layer. [5]
Sony did not promote SACD actively in North America, with the result that DVD-Audio gained competitive traction in the market. Elsewhere, such as in Europe or Japan, SACD gained more of a foothold. [6] Examples include the German Stockfisch Records, which releases vinyl editions of albums and DSD-recordings, released as hybrid SACDs. [7]
Music companies that specialize in Super Audio CD products therefore use DSD encoding. A number of independent record labels have also worked directly with Sony to focus on DSD products or the DSD recording process.
DMP Digital Music Products was an early user of the SACD digital audio format. [8] In 1997 their release Alto by Joe Beck & Ali Ryerson was the first commercial recording captured with Sony's DSD recording technology. The label's Just Jobim by Manfredo Fest in 1998 was the first project captured with the new Meitner DSD conversion technology. In 2000, DMP released the world's first multi-channel SACD—Sacred Feast by Gaudeamus. [9]
The majority of Telarc International Corporation's releases were on (generally hybrid) SACD, and are DSD recordings. [10] [ failed verification ] Telarc often worked with early audiophile company Soundstream, and re-released many of its original Soundstream recordings in SACD format. [11] Soundstream, which made the first digital recording in the United States, recorded in 16-bit PCM at a sample rate of 50 kHz via its own proprietary digital recorder. This 50 kHz PCM format was converted to DSD for release on Telarc SACD.
The record label Mobile Fidelity had engineers who decided to adopt the SACD over the DVD-Audio disc as a favored high-resolution digital format after listening tests and technical evaluations. On the label's Hybrid SACD releases, the SACD layer is a direct DSD recording of the analog master tape, while the CD layer is a digital down conversion of the DSD, with Super Bit Mapping applied. Post-2001, CD-only releases are sourced from DSD, but omit the SACD layer. [12]
In 2007, Blue Coast Records was founded in California for the purpose of recording and releasing music recorded with the DSD format, primarily focusing on jazz and acoustic artists. [13]
On August 28, 2013, the Acoustic Sounds label launched SuperHiRez.com (now defunct), which sold mainstream albums from major record labels that were produced with Direct Stream Digital or PCM audio formats. [14] [15] On September 4, 2013, Acoustic Sounds announced an agreement with Sony Music Entertainment to provide the company's new digital download service with albums that have been produced or remastered in DSD format. [16]
The format is used on albums such as Pop, Songs & Death in 2009, [17] and the remastered The Rolling Stones album Their Satanic Majesties Request in 2002. [18] [19]
DSD differs from the PCM format used by compact disc or typical computer audio systems: while PCM uses a multi-bit value (representing a large range of amplitudes) at a low sample rate, DSD instead uses a single-bit value (representing an increase or decrease in amplitude) at a sample rate much higher than the signal's bandwidth.
The process of creating a DSD signal is the same as the front-end modulator and noise-shaper portion of a 1-bit delta-sigma analog-to-digital converter (ADC), without the back end decimator (which would typically convert the 1-bit bitstream into multi-bit PCM). The short-term average of the 1-bit DSD bitstream signal is proportional to the original signal amplitude. Because of the higher sample rate, an SACD player can use a one-bit DAC with a low-order analog filter to reconstruct and produce the analog output.
DSD uses noise shaping techniques to push quantization noise up to inaudible ultrasonic frequencies. Because of this, single rate DSD64 can deliver a dynamic range of 120 dB from 20 Hz to 20 kHz and an extended frequency response up to 100 kHz (most recent SACD players specify an upper limit of 80 to 90 kHz).[ citation needed ]
The various DSD formats use sampling rates well into the megahertz (MHz) range, typically with an oversampling ratio that is a power of two relative to the 44.1 kHz rate used for CD audio:
Name | Abbreviation | Sample rate | Oversampling ratio relative to 44.1 kHz | Notes |
---|---|---|---|---|
Single-rate DSD | DSD64 | 2.8224 MHz | 64x | The format used for SACD media. |
Double-rate DSD | DSD128 | 5.6448 MHz | 128x | Since its establishment, content creators have made DSD128 recordings available, such as the audiophile label Opus3. [20] |
Quad-rate DSD | DSD256 | 11.2896 MHz | 256x | |
Octuple-rate DSD | DSD512 | 22.5792 MHz | 512x | |
Sexdecuple-rate DSD | DSD1024 | 45.1584 MHz | 1024x |
See delta-sigma modulation § Theoretical effective number of bits for analysis of the effective number of bits that can be achieved by oversampling, especially when using a higher order modulator.
DSD music mixing and mastering for SACD or Internet download presents challenges due to the difficulty of performing digital signal processing (DSP) operations (such as equalization, balance, panning) in a one-bit environment.
Older analog recordings were processed using analog equipment and then digitized to DSD. It is also possible to avoid processing by using only the available adjustments in the studio equipment while recording to DSD.
One DSP technique available is to convert the DSD to PCM and use standard PCM equipment such as Pro Tools, useful for rock and contemporary music which rely on multitrack techniques, then digitally convert back to DSD format. Some DSD proponents dislike this technique claiming that the PCM conversion to a lower sample rate reduces the sound quality of DSD.
A format and set of tools for PCM processing of DSD has been developed under the name Digital eXtreme Definition (DXD). This is a PCM format with 24-bit resolution sampled at 352.8 kHz. Another DSP technique uses a format commonly referred to as DSD-wide, which retains the high sample rate of standard DSD, but uses 8-bit samples with noise shaping. DSD-wide is sometimes disparagingly referred to as PCM-narrow. It has the benefit of making DSP operations practical while retaining the 2.8224 MHz sampling frequency. The processed DSD-wide signal is converted to the final 1-bit DSD product at the same sample rate. Processing DSD-wide at the higher multiple-DSD sample rates is also possible. Pyramix and some SADiE digital audio workstation (DAW) systems can operate in the DSD-wide domain.[ citation needed ]
DXD was initially developed for the Merging Technologies Pyramix workstation and introduced together with their Sphynx 2, AD/DA converter in 2004. This combination meant that it was possible to record and edit directly in DXD, [21] and that the sample only converts to DSD once before publishing. This offers an advantage to the user as the noise created by converting DSD rises dramatically above 20 kHz, and more noise is added each time a signal is converted back to DSD during editing.
The Pyramix Virtual Studio Digital Audio Workstation allows for recording, editing and mastering all DSD formats up to DSD256. A 12.288 MHz DSD256 variant based on a 48 kHz reference [a] is supported. The exaSound e20 DAC was the first commercially available device capable of DSD256 playback at sampling rates of 11.2896 or 12.288 MHz. [22] [23] The Merging Technologies Horus AD/DA Converter offers sample rates up to 11.2 MHz, or four times the SACD rate.
DSD128, 5.6448 MHz, twice the SACD rate, has been supported by multiple hardware devices such as the exaSound e20 Mk II DA,C [24] Korg MR-1000 1-bit digital recorder, Amanero Combo384 DSD output adapter and the discontinued exaU2I USB to I²S interface.
Software supporting DSD includes Sonic Studio's Amarra Luxe, Audirvana, foobar2000 with SACD plugin, JRiver Media Center, Roon, [25] HQPlayer and Neutron Music Player are all able to handle DSD files of up to DSD512 rate fully natively.
Sony developed DSD for SACD, and many disk players support SACD. Since the format is digital, there are other ways to play back a DSD stream; the development of these alternatives has enabled companies to offer high-quality music downloads in DSD. [26]
Some professional audio recorders (from Korg, Tascam, and others) can record in DSD format. Transferring this signal to a recordable DVD with the appropriate tools, such as the AudioGate software bundled with Korg MR-2000S recorder, [27] renders a DSD Disc. Such discs can be played back in native DSD only on certain Sony VAIO laptops and PlayStation 3 systems. [28] Sony produced two SACD players, the SCD-XA5400ES and the SCD-XE800, that fully support the DSD-disc format. In June 2012, Pioneer launched a series of SACD players compatible with DSD-disc. The PD-30 and PD-50. In January 2013, TEAC announced a DSD-disc compatible player, the PD-501HR. [29]
An alternative to burning DSD files onto disks for eventual playback is to transfer the (non-encrypted) files from a computer to audio hardware over a digital link such as USB. The USB audio 2.0 specification defined several formats for the more common PCM approach to digital audio, but did not define a format for DSD.
In 2012, representatives from many companies and others developed a standard to represent and detect DSD audio within PCM frames; the standard, commonly known as DSD over PCM (DoP), is suitable for other digital links that use PCM. [30] The 1.1 revision added protocol support for higher DSD sample rates without requiring an increase the underlying PCM sample rate.
The definition of native DSD playback is somewhat a matter of philosophy. Generally speaking, it avoids the conversion of DSD data into multibit PCM anywhere along the reproduction chain.[ citation needed ] Many commercially available DACs now support native DSD.
There has been much controversy between proponents of DSD and PCM over which encoding system is superior. In 2000, Lipshitz and Vanderkooy stated that one-bit converters, as employed by DSD, are unsuitable for high-end applications due to their high distortion. [31] [32] In 2002, Philips published a paper arguing the contrary. [33] Lipshitz and Vanderkooy's paper was further criticized by Angus. [34] Lipshitz and Vanderkooy responded to the criticisms. [35] Stuart also defined sigma-delta modulation a "totally unsuitable choice" for high-resolution digital audio. [36]
Conventional implementation of DSD has an intrinsic high distortion. [37] Distortion can be alleviated to some degree by using multibit DACs. State-of-the-art ADCs are based around sigma-delta modulation designs. Oversampling converters are usually used in linear PCM formats, where the ADC or DAC output is subject to bandlimiting and dithering. [38] Most modern ADC and DAC converters use oversampling and a multi-bit design; in other words, while DSD is a 1-bit format, modern converters internally use a 2-bit [39] to 6-bit [40] format.
Comparisons of DSD and PCM recordings with the same origin, number of channels and similar bandwidth and noise have yielded contradictory results. A 2004 study conducted at the Erich-Thienhaus Institute in Detmold, Germany found that in double-blind tests "hardly any of the subjects could make a reproducible distinction between the two encoding systems." [41] In contrast, a 2014 study conducted at the Tokyo University of the Arts found that listeners could distinguish PCM (192 kHz/24 bits) from either DSD (2.8 MHz) or DSD (5.6 MHz) (but not between the two DSD samplings), preferring the sound of DSD over PCM: "For example, Drums stimulus of DSD (5.6 MHz) has p = 0.001 when compared against PCM (192 kHz/24 bit) in overall preference. This suggests that DSD version was statistically significantly preferred over the PCM version." [42] These findings are questionable however, because "the two formats were subject to different processing, most notably, different filtering of the low-frequency content." [43]
Even though DSD-based SACD was more successful than its direct competitor, the PCM-based DVD-Audio, DSD met with relatively little success in the consumer market. Direct manipulation of recorded DSD data is difficult due to the limited availability of appropriate software. The advent of new high-resolution PCM standards, such as DXD, further restricted DSD's market niche.
There are several ways to store DSD encoded audio as files on a computer. One option is to use DSD native file formats used specifically for this purpose. Alternatively, DSD can be stored in general purpose audio formats officially adapted to support DSD storage. Finally, DSD audio can be embedded into PCM audio streams that do not have special DSD support. However, a special decoder is needed to recover the DSD stream from these PCM files.
Filename extension | .dff |
---|---|
Internet media type | audio/x-dff |
Developed by | Philips |
Latest release | 1.5 2004-27-04 |
Type of format | DSD Audio |
Website | sonicstudio.com/pdf/dsd/DSDIFF_1.5_Spec.pdf |
DSD Interchange File Format (DSDIFF) is a native DSD file format developed by Philips between 2000 and 2004 for storage of DSD recordings. [44] The format supports DST compression of the payload as well as annotations used in Super Audio CD production. A single DSDIFF file may store an entire album as a single audio stream together with markers indicating the location of individual tracks for the album. Some parts of the audio content may be left out entirely from the resulting SACD. The embedded metadata format is intended for mastering engineers not consumers. For example, markers indicating the beginning of a new audio track have a text field for storing arbitrary text-based information. However, there is no requirement for the text to contain the title of the track. A de facto standard for including ID3 metadata in an unofficial ID3 chunk was later developed as consumers adopted the format for storing individual tracks of DSD audio. DSDIFF files typically use the .dff
file suffix. No official media type has been registered for the DSDIFF file format. [45] Freedesktop.org uses the unofficial media types audio/x-dff
. [46]
Filename extension | .wsd |
---|---|
Internet media type | audio/x-dsf |
Developed by | 1-bit Audio Consortium |
Type of format | DSD Audio |
Wideband Single-bit Data (WSD) is a native DSD file format developed by 1-bit Audio Consortium in 2002. [47] The consortium was established by Waseda University, Sharp and Pioneer a year earlier. [48] In 2012 the consortium published an updated 1.1 version of the specification. [49] The WSD header has a field for storing an absolute timestamp in samples since midnight. The timestamp makes it possible to accurately know how much time passed between two recordings made consecutively on the same physical recording device. Korg portable recorders MR-1, MR-2, MR-1000 and MR-2000s support WSD alongside other DSD based formats. [50] [51] [52]
Filename extension | .dsf |
---|---|
Internet media type | audio/x-dsf |
Developed by | Sony |
Latest release | 1.01 November 2005 |
Type of format | DSD Audio |
Website | dsd-guide.com/sonys-dsf-file-format-spec |
DSD Stream File (DSF) is a native DSD file format developed by Sony around 2005. [53] [54] The format is intended for storing individual tracks of DSD audio and has native support for ID3 metadata inclusion. The format defines a "format ID" field that could presumably be used to indicate DST compression. However, the only identifier defined in the spec is number 0 indicating "DSD raw". Files containing DSF data would typically use the.dsf
file suffix. No official media type has been registered for the DSF file format. [45] Freedesktop.org uses the unofficial media types audio/x-dsf
. [46]
DSD-compatible file formats support storing DSD-encoded audio in addition to audio encoded as PCM.
WavPack is a generic audio storage format supporting various different forms of audio. [55] Compressed DSD audio is also supported by the format. [56] The DSD support was initially introduced with the release of WavPack software suite version 5 in December 2016. [57] Files containing WavPack data would typically use the.wv
file suffix. No official media type has been registered for the WavPack file format. [45] Freedesktop.org uses the unofficial media types audio/x-wavpack
. [46] Since WavPack supports multiple formats, the suffix or mime type alone does not indicate the presence of DSD audio.
In addition to actual DSD file formats, a format called DSD Audio over PCM Frames (DoP) is used for embedding DSD into PCM audio streams. [58] DoP makes it possible to circumvent any PCM-only component in the playback chain that would prevent a DSD-capable DAC from receiving the raw DSD data required for native DSD playback. While DoP is intended to be a wire protocol used for communication between music player application and a DAC, there exists tools that can embed DoP-encoded DSD into FLAC files enabling users to use a non-DSD capable player for DSD playback with their DoP-capable DAC. [59] A DoP stream is designed to sound like low-volume noise when played back by a PCM-only DAC, while a DoP-capable DAC will detect the presence of DSD data, extract it from the PCM and play it back as DSD.
Compact Disc Digital Audio, also known as Digital Audio Compact Disc or simply as Audio CD, is the standard format for audio compact discs. The standard is defined in the Red Book technical specifications, which is why the format is also dubbed "Redbook audio" in some contexts. CDDA utilizes pulse-code modulation (PCM) and uses a 44,100 Hz sampling frequency and 16-bit resolution, and was originally specified to store up to 74 minutes of stereo audio per disc.
Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samples are taken 44,100 times per second, each with 16-bit resolution. Digital audio is also the name for the entire technology of sound recording and reproduction using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of audio engineering, record production and telecommunications in the 1990s and 2000s.
Digital Audio Tape is a signal recording and playback medium developed by Sony and introduced in 1987. In appearance it is similar to a Compact Cassette, using 3.81 mm / 0.15" magnetic tape enclosed in a protective shell, but is roughly half the size at 73 mm × 54 mm × 10.5 mm. The recording is digital rather than analog. DAT can record at sampling rates equal to, as well as higher and lower than a CD at 16 bits quantization. If a comparable digital source is copied without returning to the analogue domain, then the DAT will produce an exact clone, unlike other digital media such as Digital Compact Cassette or non-Hi-MD MiniDisc, both of which use a lossy data-reduction system.
Super Audio CD (SACD) is an optical disc format for audio storage introduced in 1999. It was developed jointly by Sony and Philips Electronics and intended to be the successor to the compact disc (CD) format.
S/PDIF is a type of digital audio interface used in consumer audio equipment to output audio over relatively short distances. The signal is transmitted over either a coaxial cable using RCA or BNC connectors, or a fibre-optic cable using TOSLINK connectors. S/PDIF interconnects components in home theaters and other digital high-fidelity systems.
Sound can be recorded and stored and played using either digital or analog techniques. Both techniques introduce errors and distortions in the sound, and these methods can be systematically compared. Musicians and listeners have argued over the superiority of digital versus analog sound recordings. Arguments for analog systems include the absence of fundamental error mechanisms which are present in digital audio systems, including aliasing and associated anti-aliasing filter implementation, jitter and quantization noise. Advocates of digital point to the high levels of performance possible with digital audio, including excellent linearity in the audible band and low levels of noise and distortion.
DVD-Audio is a digital format for delivering high-fidelity audio content on a DVD. DVD-Audio uses most of the storage on the disc for high-quality audio and is not intended to be a video delivery format.
In digital recording, an audio or video signal is converted into a stream of discrete numbers representing the changes over time in air pressure for audio, or chroma and luminance values for video. This number stream is saved to a storage device. To play back a digital recording, the numbers are retrieved and converted back into their original analog audio or video forms so that they can be heard or seen.
Noise shaping is a technique typically used in digital audio, image, and video processing, usually in combination with dithering, as part of the process of quantization or bit-depth reduction of a signal. Its purpose is to increase the apparent signal-to-noise ratio of the resultant signal. It does this by altering the spectral shape of the error that is introduced by dithering and quantization; such that the noise power is at a lower level in frequency bands at which noise is considered to be less desirable and at a correspondingly higher level in bands where it is considered to be more desirable. A popular noise shaping algorithm used in image processing is known as ‘Floyd Steinberg dithering’; and many noise shaping algorithms used in audio processing are based on an ‘Absolute threshold of hearing’ model.
A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mastering early compact discs.
Soundstream Inc. was the first United States audiophile digital audio recording company, providing commercial services for recording and computer-based editing.
The dbx Model 700 Digital Audio Processor was a professional audio ADC/DAC combination unit, which digitized a stereo analog audio input into a bitstream, which was then encoded and encapsulated in an analog composite video signal, for recording to tape using a VCR as a transport. Unlike other similar pieces of equipment like the Sony PCM-F1, the Model 700 used a technique called Companded Predictive Delta Modulation, rather than the now-common pulse-code modulation. At the time of its introduction in the mid-1980s the device was the first commercial product to use this method, although it had been proposed in the 1960s and prototyped in the late '70s.
In digital audio using pulse-code modulation (PCM), bit depth is the number of bits of information in each sample, and it directly corresponds to the resolution of each sample. Examples of bit depth include Compact Disc Digital Audio, which uses 16 bits per sample, and DVD-Audio and Blu-ray Disc, which can support up to 24 bits per sample.
Digital eXtreme Definition (DXD) is a digital audio format that originally was developed by Philips and Merging Technologies for editing high-resolution recordings recorded in Direct Stream Digital (DSD), the audio standard used on Super Audio CD (SACD). As the 1-bit DSD format used on SACD is not suitable for editing, alternative formats such as DXD or DSD-Wide must be used during the mastering stage.
Pulse-code modulation (PCM) is a method used to digitally represent analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude of the analog signal is sampled at uniform intervals, and each sample is quantized to the nearest value within a range of digital steps. Alec Reeves, Claude Shannon, Barney Oliver and John R. Pierce are credited with its invention.
In digital audio, 44,100 Hz is a common sampling frequency. Analog audio is often recorded by sampling it 44,100 times per second, and then these samples are used to reconstruct the audio signal when playing it back.
The Warner Premium Sound series is series of 2011 hybrid multichannel-stereo Super Audio CD/CD releases, which an important development in the format war between Super Audio CDs (SACDs) and DVD-Audio discs. It marks the first time that the Warner Music Group has released titles in the SACD format; previously they had endorsed the DVD-Audio format. In a parallel development in the United States, four albums by the Doors, titles strongly associated with the Warner Music Group, are also to be released on hybrid multichannel-stereo SACDs by Analogue Productions. Also Mobile Fidelity have been issuing and continue to be issuing SACDs of the Warner catalogue, but in the past these have been stereo-only discs, and for their new releases they do not specify whether a given disc might have multichannel contents also.
High-resolution audio is a term for audio files with greater than 44.1 kHz sample rate or higher than 16-bit audio bit depth. It commonly refers to 96 or 192 kHz sample rates. However, 44.1 kHz/24-bit, 48 kHz/24-bit and 88.2 kHz/24-bit recordings also exist that are labeled HD Audio.
AES50 is an Audio over Ethernet protocol for multichannel digital audio. It is defined in the AES50-2011 standard for High-resolution multi-channel audio interconnection (HRMAI).
In Europe and some other parts of the world this was done using the SACD format but not in North America, like with other reissues (such as the Depeche Mode albums).
Thanks @wizardofoz, managed to download the 'Wechseldominante - Babel DSD512\Wechseldominante - Babel (DSD512).dff' and playback on my Holo Spring DAC. Direct DSD512 playback is far less computer intensive if it doesn't do any form of conversion. I managed to playback DSD512 without any issue.
{{cite web}}
: CS1 maint: bot: original URL status unknown (link) —DSD vs PCM comparison