PCM adaptor

Last updated

Sony PCM-1630 Sony PCM-1630 20111008.jpg
Sony PCM-1630

A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mastering early compact discs.

Contents

Operation

High-quality pulse-code modulation (PCM) audio requires a significantly larger bandwidth than a regular analog audio signal. For example, a 16-bit PCM signal requires an analog bandwidth of about 1-1.5  MHz compared to about 15-20  kHz of analog bandwidth required for an analog audio signal. A standard analog audio recorder cannot meet this requirement. One solution arrived at in the early 1980s was to use a videotape recorder, which is capable of recording signals with higher bandwidths.

A means of converting digital audio into a video format was necessary. Such an audio recording system includes two devices: the PCM adaptor, which converts audio into pseudo-video, and the videocassette recorder. A PCM adaptor performs an analog-to-digital conversion producing series of binary digits, which, in turn, is coded and modulated into a black and white video signal, appearing as a vibrating checkerboard pattern, which can then be recorded as a video signal.

Most video-based PCM adaptors record audio at 14 or 16 bits per sample, with a sampling frequency of 44.1 kHz for PAL or monochrome NTSC, or 44.056 kHz for color NTSC. Some of the earlier models, such as the Sony PCM-100, recorded 16 bits per sample, but used only 14 of the bits for the audio, with the remaining 2 bits used for error correction for the case of dropouts or other anomalies being present on the videotape.

Sampling frequency

The use of video for the PCM adapter helps to explain the choice of sampling frequency for the CD, because the number of video lines, frame rate and bits per line end up dictating the sampling frequency one can achieve. A sampling frequency of 44.1 kHz was thus adopted for the compact disc, as at the time, there was no other practical way of storing digital audio than by using a PCM adaptor and videocassette recorder combination.

It is simplest if the same number of lines are used in each field, and, crucially, it was decided to adopt a sample rate that could be used on both PAL and monochrome NTSC equipment. Since monochrome NTSC has a field rate of 60 Hz, and PAL has a field rate of 50 Hz, their least common multiple is 300 Hz, and with 3 samples per line, this yields a sample rate that is a multiple of 900 Hz. For monochrome NTSC the sample rate is 5m × 60 × 3, where 5m is the number of active lines per field, which must be a multiple of 5 (the rest used for synchronization), and for PAL the sample rate is 6n × 50 × 3, where 6n is the number of active lines per field, which must be a multiple of 6. The sampling rates that satisfy these requirements – at least 40 kHz (to encode up to 20 kHz sounds), no more than 46.875 kHz (requiring no more than 3 samples per line in PAL), and a multiple of 900 Hz (to allow encoding in both NTSC and PAL), are thus 40.5, 41.4, 42.3, 43.2, 44.1, 45, 45.9, and 46.8 kHz. The lower ones are eliminated due to low-pass filters requiring a transition band, while the higher ones are eliminated due to some lines being required for vertical blanking interval; 44.1 kHz was the higher usable rate, and was eventually chosen.

The sampling frequencies of 44.1 and 44.056 kHz were thus the result of a need for compatibility with the 25-frame (PAL countries) and 30-frame black and white (NTSC countries) video formats used for audio storage at the time.

Video format

Audio samples are recorded as if they were on the lines of a raster scan of video, as follows: analog video standards represent video at a field rate of 60 Hz (NTSC, North America – or 60/1.001 Hz ≈ 59.94 Hz for color NTSC) or 50 Hz (PAL, Europe), which corresponds to a frame rate of 30 frames per second (frame/s) or 25 frame/s – each field is half the lines of an interlaced image (alternating the odd lines and the even lines). Each of these fields is in turn composed of lines – a frame of 625 lines for PAL and 525 lines for NTSC, though some of the lines are actually for synchronizing the signal, and a field comprises half the visible lines in one vertical scan. Digital audio samples are then encoded along each line, thus allowing reuse of the existing synchronization circuitry – as video, the resulting images look like lines of binary black and white (rather, gray) dots along each scan line. The line frequency (lines per second) was 15,625 Hz for PAL (625 × 50/2), 15,750 Hz for 60 Hz (monochrome) NTSC (525 × 60/2), and 15,750/1.001 Hz (approximately 15,734.26 Hz) for 59.94 (color) NTSC, and thus to record audio at the required over 40 kHz required encoding multiple samples per line, with 3 samples per line being sufficient, yielding up to 15,625 × 3 = 46,875 for PAL and 15,750 × 3 = 47,250 for NTSC. It is desirable to minimize the number of samples per line, so that each sample can have more space devoted to it, thus making it easier to have a higher bit depth (16 bits, rather than 14 or 12 bits, say) and better error tolerance, and in practice, the signal was stereo, requiring 3 × 2 = 6 samples per line. However, some of these lines are devoted to (vertical) synchronization: specifically, the lines during the vertical blanking interval (VBI) could not be used, so a maximum of 490 lines per frame (245 lines per field) could be used in NTSC, and about 588 lines per frame (294 lines per field) on PAL (Note that, in video, PAL has (up to) 575 visible lines [1] while NTSC has up to 485).

Models

A Sony PCM-501ES EIAJ LPCM Adapter on a Sony SL-HF360 VTR Sony PCM-501ES & Sony SL-HF360.jpg
A Sony PCM-501ES EIAJ LPCM Adapter on a Sony SL-HF360 VTR

The Sony PCM-1600 was the first commercial video-based 16-bit recorder. The 1600 (and its later versions, the 1610 and 1630) used special U-matic-format VCRs also furnished by Sony for transports, such as the BVU-200B (the first model of VCR optimized to work, and sold with, the PCM-1600 in 1979), [2] BVU-800DA, VO-5630DA, and the later DMR-2000 and DMR-4000, which were based on the industrial VO-5850 and the broadcast BVU-800 video machines respectively. These were all in essence modified versions of existing Sony U-Matic video recorders adapted for use with the 1600-series adaptors by way of disabling the chroma and dropout compensator circuits of the VCRs, which would hinder the proper recording of the monochrome-video-based digital audio data from the 1600-series adaptors if enabled. The BVU-200B packaged with the PCM-1600 also was modified to have its video head switching point moved to the vertical blanking interval of the digital-audio-bearing video signal being recorded to prevent errors or interference with the digital audio data. Editing was accomplished by using a 1600-series adaptor and two or more of these VCRs with a DAE-1100 or DAE-3000 editing controller. The 1600-series were the first systems used for mastering audio compact discs in the early 1980s by many major record labels, with the final U-matic 1600-format digital audio tapes being sent to CD pressing plants to be recorded to a glass master disc used for making the replicated CDs.

Several semi-professional/consumer models of PCM adaptors were also released by Sony:

Technics also made a battery-powered portable PCM adaptor, the SV-100, a hi-fi component adapter, the SV-110, and a version with a built-in VHS videocassette transport, the SV-P100.[ citation needed ] All the Technics (Panasonic) PCM adapters are limited to 14-bit resolution. Other makes and models of PCM adaptors offered on the market were the Nakamichi DMP-100, the JVC VP-100, the Sharp RX-3, the Sansui PC-X1 and the Hitachi PCM-V300. [4]

dbx, Inc. also manufactured a pseudo-video adaptor, the Model 700. It differed from the above-listed models in the fact that it did not use PCM, but rather delta-sigma modulation. This resulted in a higher quality digital recording with more dynamic range than what standard PCM modulation could offer.[ citation needed ] Like a standard PCM adaptor, the Model 700 also utilized a VCR for a transport.

Obsolescence

In 1987, a few years after the PCM adaptor's introduction, Sony introduced a new cassette-based format for digital audio recording called Digital Audio Tape (DAT). Since DAT did not rely on a separate video cassette recorder, it was a much more portable and less-cumbersome format to use than a PCM adaptor-based system. DAT recorders had their own built-in transport using a small cassette unique to the format. DAT used tape 4 millimetres (0.16 in) in width loaded into a cassette 73 mm × 54 mm × 10.5 mm (2.87 in. x 2.12 in. x 0.41 in.) in size. The audio data was recorded to the tape by using helical scan recording, the same fashion that a VCR connected to a PCM adaptor would record to a videotape. In essence, DAT was a modernized, integrated, and miniaturized version of a PCM adaptor-based system.

Like a PCM adaptor, DAT could record only two tracks of audio at a time, but the smaller size of the equipment and media, as well as being able to accept multiple sampling rates and other flexibility, [lower-alpha 1] gave DAT many advantages over PCM adaptor-based systems.

Digital recorders capable of multi-track recording [lower-alpha 2] such as Mitsubishi's ProDigi format and Sony's DASH format also became available on the professional audio market about the same time as the introduction of PCM adaptors. Other tape-based digital audio recording systems overcame problems that made typical analog recorders unable to meet the bandwidth (frequency range) demands of digital recording by a combination of higher tape speeds, narrower head gaps used in combination with metal-formulation tapes, and the spreading of data across multiple parallel tracks.

Despite obsolescence, hobbyists are still capable of using modern-day DVDs or Blu-ray discs as a transport medium for video-based encoding of digital audio streams.

Notes

  1. 44.1 kHz, 48 kHz and 32 kHz were supported, all at 16 bits per sample. A special LP recording mode using 12 bits per sample at 32 kHz gave extended recording time.
  2. As opposed to only two tracks for stereo that a PCM adaptor or DAT could record.

Related Research Articles

<span class="mw-page-title-main">NTSC</span> Analog television system

NTSC is the first American standard for analog television, published in 1941. In 1961, it was assigned the designation System M. It is also known as EIA standard 170.

<span class="mw-page-title-main">PAL</span> Colour encoding system for analogue television

Phase Alternating Line (PAL) is a colour encoding system for analog television. It was one of three major analogue colour television standards, the others being NTSC and SECAM. In most countries it was broadcast at 625 lines, 50 fields per second, and associated with CCIR analogue broadcast television systems B, D, G, H, I or K. The articles on analog broadcast television systems further describe frame rates, image resolution, and audio modulation.

<span class="mw-page-title-main">SECAM</span> French analog color television system

SECAM, also written SÉCAM, is an analog color television system that was used in France, Russia and some other countries or territories of Europe and Africa. It was one of three major analog color television standards, the others being PAL and NTSC. Like PAL, a SECAM picture is also made up of 625 interlaced lines and is displayed at a rate of 25 frames per second. However, due to the way SECAM processes color information, it is not compatible with the German PAL video format standard. This page primarily discusses the SECAM colour encoding system. The articles on broadcast television systems and analog television further describe frame rates, image resolution, and audio modulation. SECAM video is composite video because the luminance and chrominance are transmitted together as one signal.

<span class="mw-page-title-main">Video</span> Electronic moving image

Video is an electronic medium for the recording, copying, playback, broadcasting, and display of moving visual media. Video was first developed for mechanical television systems, which were quickly replaced by cathode-ray tube (CRT) systems, which, in turn, were replaced by flat-panel displays of several types.

<span class="mw-page-title-main">VHS</span> Consumer-level analog videotape recording and cassette form standard

The VHS is a standard for consumer-level analog video recording on tape cassettes, invented in 1976 by the Victor Company of Japan (JVC). It was the dominant home video format throughout the tape media period in the 1980s and 1990s.

<span class="mw-page-title-main">Digital audio</span> Technology that records, stores, and reproduces sound

Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samples are taken 44,100 times per second, each with 16-bit sample depth. Digital audio is also the name for the entire technology of sound recording and reproduction using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of audio engineering, record production and telecommunications in the 1990s and 2000s.

<span class="mw-page-title-main">Digital Audio Tape</span> Digital audio cassette format developed by Sony

Digital Audio Tape is a signal recording and playback medium developed by Sony and introduced in 1987. In appearance it is similar to a Compact Cassette, using 3.81 mm / 0.15" magnetic tape enclosed in a protective shell, but is roughly half the size at 73 mm × 54 mm × 10.5 mm. The recording is digital rather than analog. DAT can record at sampling rates equal to, as well as higher and lower than a CD at 16 bits quantization. If a comparable digital source is copied without returning to the analogue domain, then the DAT will produce an exact clone, unlike other digital media such as Digital Compact Cassette or non-Hi-MD MiniDisc, both of which use a lossy data-reduction system.

<span class="mw-page-title-main">S-VHS</span> Improved version of VHS

S-VHS (スーパー・ヴィエイチエス), the common initialism for Super VHS, is an improved version of the VHS standard for consumer-level video recording. Victor Company of Japan introduced S-VHS in Japan in April 1987, with their JVC-branded HR-S7000 VCR, and in certain overseas markets soon afterward. By the end of 1987, the first S-VHS VCR models from other competitors included Hitachi VT-2700A, Mitsubishi HS-423UR, Panasonic PV-S4764, RCA VPT-695HF, and Toshiba SV-950. It has been standardized as IEC 60774-3 and IEC 60774-4.

<span class="mw-page-title-main">Betamax</span> Consumer-level analog video tape recording and cassette form factor standard

Betamax is a consumer-level analog recording and cassette format of magnetic tape for video, commonly known as a video cassette recorder. It was developed by Sony and was released in Japan on May 10, 1975, followed by the US in November of the same year.

<span class="mw-page-title-main">Sampling (signal processing)</span> Measurement of a signal at discrete time intervals

In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or space; this definition differs from the term's usage in statistics, which refers to a set of such values.

Betacam is a family of half-inch professional videocassette products developed by Sony in 1982. In colloquial use, Betacam singly is often used to refer to a Betacam camcorder, a Betacam tape, a Betacam video recorder or the format itself.

<span class="mw-page-title-main">Digital recording</span> Audio or video represented as a stream of discrete numbers

In digital recording, an audio or video signal is converted into a stream of discrete numbers representing the changes over time in air pressure for audio, or chroma and luminance values for video. This number stream is saved to a storage device. To play back a digital recording, the numbers are retrieved and converted back into their original analog audio or video forms so that they can be heard or seen.

<span class="mw-page-title-main">8 mm video format</span> Magnetic tape-based videocassette format for camcorders

The 8mm video format refers informally to three related videocassette formats. These are the original Video8 format and its improved successor Hi8, as well as a more recent digital recording format known as Digital8. Their user base consisted mainly of amateur camcorder users, although they also saw important use in the professional television production field.

The Digital Audio Stationary Head or DASH standard is a reel-to-reel, digital audio tape format introduced by Sony in early 1982 for high-quality multitrack studio recording and mastering, as an alternative to analog recording methods. DASH is capable of recording two channels of audio on a quarter-inch tape, and 24 or 48 tracks on 12-inch-wide (13 mm) tape on open reels of up to 14 inches. The data is recorded on the tape linearly, with a stationary recording head, as opposed to the DAT format, where data is recorded helically with a rotating head, in the same manner as a VCR. The audio data is encoded as linear PCM and boasts strong cyclic redundancy check (CRC) error correction, allowing the tape to be physically edited with a razor blade as analog tape would, e.g. by cutting and splicing, and played back with no loss of signal. In a two-track DASH recorder, the digital data is recorded onto the tape across nine data tracks: eight for the digital audio data and one for the CRC data; there is also provision for two linear analog cue tracks and one additional linear analog track dedicated to recording time code.

<span class="mw-page-title-main">U-matic</span> Videocassette format; the first of its kind

U-matic or 34-inch Type E Helical Scan or SMPTE E is an analogue recording videocassette format first shown by Sony in prototype in October 1969, and introduced to the market in September 1971. It was among the first video formats to contain the videotape inside a cassette, as opposed to the various reel-to-reel or open-reel formats of the time. The videotape is 34 in (19 mm) wide, so the format is often known as "three-quarter-inch" or simply "three-quarter", compared to open reel videotape formats in use, such as 1 in (25 mm) type C videotape and 2 in (51 mm) quadruplex videotape.

Time base correction (TBC) is a technique to reduce or eliminate errors caused by mechanical instability present in analog recordings on mechanical media.

The dbx Model 700 Digital Audio Processor was a professional audio ADC/DAC combination unit, which digitized a stereo analog audio input into a bitstream, which was then encoded and encapsulated in an analog composite video signal, for recording to tape using a VCR as a transport. Unlike other similar pieces of equipment like the Sony PCM-F1, the Model 700 used a technique called Companded Predictive Delta Modulation, rather than the now-common pulse-code modulation. At the time of its introduction in the mid-1980s the device was the first commercial product to use this method, although it had been proposed in the 1960s and prototyped in the late '70s.

MUSE, commercially known as Hi-Vision was a Japanese analog high-definition television system, with design efforts going back to 1979.

Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude of the analog signal is sampled at uniform intervals, and each sample is quantized to the nearest value within a range of digital steps.

In digital audio, 44,100 Hz is a common sampling frequency. Analog audio is often recorded by sampling it 44,100 times per second, and then these samples are used to reconstruct the audio signal when playing it back.

References

  1. ITU-R BT.470-6
  2. Ned Soseman (2012-01-13). "MADI Magic". TV Technology. Retrieved 2018-12-12.
  3. Frederick J. Bashour (May 2000). "Sony PCM-F1 Digital Recording Processor". Pro Audio Review. Archived from the original on February 8, 2008.
  4. Heitarō Nakajima (1983). Digital Audio Technology. Tab Books. p. 268.