Matrix decoder

Last updated

Matrix decoding is an audio technology where a small number of discrete audio channels (e.g., 2) are decoded into a larger number of channels on play back (e.g., 5). The channels are generally, but not always, arranged for transmission or recording by an encoder, and decoded for playback by a decoder. The function is to allow multichannel audio, such as quadraphonic sound or surround sound to be encoded in a stereo signal, and thus played back as stereo on stereo equipment, and as surround on surround equipment – this is "compatible" multichannel audio.

Contents

Process

Matrix encoding does not allow one to encode several channels in fewer channels without losing information: one cannot fit 5 channels into 2 (or even 3 into 2) without losing information, as this loses dimensions: the decoded signals are not independent. The idea is rather to encode something that will both be an acceptable approximation of the surround sound when decoded, and acceptable (or even superior) stereo.

Notation

The notation for matrix encoding consists of the number of original discrete audio channels separated by a colon from the number of encoded and decoded channels. For example, four channels encoded into two discrete channels and decoded back to four-channels would be notated:

4:2:4

Some methods derive new channels from the existing ones, with no special encoding of the audio source. For example, five discrete channels decoded to six channels would be notated:

5:5:6

Such derived channel "decoders" may take advantage of the Haas effect, as well as audio cues inherent in the source channels.

Many matrix encoding methods have been developed:

Hafler circuit (2:2:4)

The earliest and simpler form of decoding is the Hafler circuit, deriving back channels out of normal stereo recording (2:2:4). It was used for decoding only (encoding sound was not considered).

Decoding matrix

Decoding matrixLeft FrontRight FrontLeft BackRight Back
Left Total1.00.01.0-1.0
Right Total0.01.0-1.01.0

Dynaquad matrix (2:2:4) / (4:2:4)

The Dynaquad matrix introduced in 1969 was based on the Hafler circuit, but also used for a specific encoding of 4 sound channels in some albums (4:2:4). [1]

Encoding matrix

MatrixLeft FrontRight FrontLeft BackRight Back
Left Total1.00.251.0-0.5
Right Total0.251.0-0.51.0

Decoding matrix

Matrix [2] Left FrontRight FrontLeft BackRight Back
Left Total1.00.00.64-0.36
Right Total0.01.0-0.360.64

Electro-Voice Stereo-4 matrix (2:2:4) / (4:2:4)

The Stereo-4 matrix was invented by Leonard Feldman and Jon Fixler, introduced in 1970, and sold by Electro-Voice and Radio Shack. This matrix was used to encode 4 sound channels on many record albums (4:2:4). [3]

Encoding matrix

MatrixLeft FrontRight FrontLeft BackRight Back
Left Total1.00.31.0-0.5
Right Total0.31.0-0.51.0

Decoding matrix

Matrix [2] Left FrontRight FrontLeft BackRight Back
Left Total1.00.21.0-0.8
Right Total0.21.0-0.81.0

SQ matrix, "Stereo Quadraphonic", CBS SQ (4:2:4)

MatrixLeft FrontRight FrontLeft BackRight Back
Left Total1.00.0k0.70.7
Right Total0.01.0-0.7j0.7

phase-shift, phase-shift

The basic SQ matrix had mono/stereo anomalies as well as encoding/decoding problems, heavily criticized by Michael Gerzon and others. [4]

An attempt to improve the system lead to the use of other encoders or sound capture techniques, yet the decoding matrix remained unchanged.

Position Encoder

An N/2 encoder that encoded every position in a 360° circle - it had 16 inputs and each could be dialed to the exact direction desired, generating an optimized encode.

Forward-Oriented encoder

MatrixLeft FrontRight FrontLeft BackRight Back
Left Total1.00.00.7k0.7
Right Total0.01.0k0.70.7

phase-shift, phase-shift

The Forward-Oriented encoder caused Center Back to be encoded as Center Front and was recommended for live broadcast use for maximum mono compatibility - it also encoded Center Left/Center Right and both diagonal splits in the optimal manner. Could be used to modify existing 2-channel stereo recordings and create 'synthesized SQ' that when played through a Full-Logic or Tate DES SQ decoder, exhibited a 180° or 270° synthesized quad effect. Many stereo FM radio stations broadcasting SQ in the 1970s used their Forward-Oriented SQ encoder for this. For SQ decoders, CBS designed a circuit that produced the 270° enhancement using the 90° phase shifters in the decoder. Sansui's QS Encoders and QS Vario-Matrix Decoders had a similar capability.

Backwards-Oriented encoder

MatrixLeft FrontRight FrontLeft BackRight Back
Left Totalk1.00.0k0.70.7
Right Total0.0j1.0-0.7j0.7

phase-shift, phase-shift

The Backwards-Oriented Encoder was the reverse of the Forward-Oriented Encoder - it allowed sounds to be placed optimally in the back half of the room, but mono-compatibility was sacrificed. When used with standard stereo recordings it created "extra wide" stereo with sounds outside the speakers.

Some encoding mixers had channel strips switchable between forward-oriented and backwards-oriented encoding.

London Box

It encoded the Center Back in such a way that it didn't cancel in mono playback, thus its output was usually mixed with that of a Position Encoder or a Forward Oriented encoder. After 1972, the vast majority of SQ Encoded albums were mixed with either the Position Encoder or the Forward-Oriented encoder.

Ghent microphone

In addition, CBS created the SQ Ghent Microphone, which was a spatial microphone system using the Neumann QM-69 mic. The signals from the QM-69 were differenced, and then phase-matrixed into 2-channel SQ. [5] With the Ghent Microphone, SQ was transformed from a Matrix into a Kernel and an additional signal could be derived to provide N:3:4 performance.

Universal SQ

In 1976, Ben Bauer integrated matrix and discrete systems into USQ, or Universal SQ. It was a hierarchical 4-4-4 discrete matrix that used the SQ matrix as the baseband for discrete quadraphonic FM broadcasts using additional difference signals called "T" and "Q". For a USQ FM broadcast, the additional "T" modulation was placed at 38 kHz in quadrature to the standard stereo difference signal and the "Q" modulation was placed on a carrier at 76 kHz. For standard 2-channel SQ Matrix broadcasts, CBS recommended that an optional pilot-tone be placed at 19 kHz in quadrature to the regular pilot-tone to indicate SQ encoded signals and activate the listeners Logic decoder.

CBS argued that the SQ system should be selected as the standard for quadraphonic FM because, in FCC listening tests of the various four channel broadcast proposals, the 4:2:4 SQ system, decoded with a CBS Paramatrix decoder, outperformed 4:3:4 (without logic) as well as all other 4:2:4 (with logic) systems tested, approaching the performance of a discrete master tape within a very slight margin. [6] At the same time, the SQ "fold" to stereo and mono was preferred to the stereo and mono "fold" of 4:4:4, 4:3:4 and all other 4:2:4 encoding systems.

Tate DES decoder

The Directional Enhancement System, also known as the Tate DES, was an advanced decoder that enhanced the directionality of the basic SQ matrix.

It first matrixed the four outputs of the SQ decoder to derive additional signals, then compared their envelopes to detect the predominant direction and degree of dominance. A processor section, implemented outside of the Tate IC chips, applied variable attack/decay timing to the control signals and determined the coefficients of the "B" (Blend) matrices needed to enhance the directionality. These were acted upon by true analog multipliers in the Matrix Multiplier IC's, to multiply the incoming matrix by the "B" matrices and produce outputs in which the directionality of all predominant sounds were enhanced.

Since the DES could recognize all three directions of the Energy Sphere[ clarification needed ] simultaneously, and enhance the separation, it had a very open and 'discrete'[ clarification needed ] sounding soundfield.

In addition, the enhancement was done with sufficient additional complexity that all non-dominant sounds were kept at their proper levels.

Dolby used the Tate DES IC's in their theater processors until around 1986, when they developed the Pro Logic system. Unfortunately, delays and problems kept the Tate DES IC's from the market until the late-1970s and only two consumer decoders were ever made that employed them, the Audionics Space & Image Composer and the Fosgate Tate II 101A. The Fosgate used a faster, updated version of the IC, called the Tate II, and additional circuitry that provided for separation enhancement around the full 360 soundfield. Unlike the earlier Full Wave-matching Logic decoders for SQ, that varied the output levels to enhance directionality, the Tate DES cancelled SQ signal crosstalk as a function of the predominant directionality, keeping non-dominant sounds and reverberation in its proper spatial locations at their correct level.

QS matrix, "Regular Matrix", "Quadraphonic Sound" (4:2:4)

MatrixLeft FrontRight FrontLeft BackRight Back
Left Total0.920.38j0.92j0.38
Right Total0.380.92k0.38k0.92

phase-shift, phase-shift

Matrix H (4:2:4)

Matrix H Matrix [7] Left FrontRight FrontLeft BackRight Back
Left Total-j0.94-l0.34+k0.94+m0.34
Right Total+l0.34+j0.94-m0.34-k0.94

j = 20° phase-shiftk = 25° phase-shiftl = 55° phase-shiftm = 115° phase-shift

Ambisonic UHJ kernel (3:2:4 or more)

MatrixW (pressure signal)X (front-back signal)Y (left-right signal)
Left Total0.470 + k0.1710.093 + j0.255+0.328
Right Total0.470 + j0.1710.093 + k0.255-0.328

phase-shift, phase-shift

Dolby Stereo and Dolby Surround (matrix) 4:2:4

Dolby Stereo and Dolby Surround are also known as Dolby MP, Dolby SVA and Pro Logic.

Dolby SVA matrix is the original name of the Dolby Stereo 4:2:4 encoding matrix.

The term "Dolby Surround" refers to both the encoding and decoding in the home environment, while in the theater it is known "Dolby Stereo", "Dolby Motion Picture matrix" or "Dolby MP". "Pro Logic" refers to the decoder used, there is no special Pro Logic encoding matrix.

The Ultra Stereo system, developed by different company, is compatible and uses similar matrixes to Dolby Stereo.

The Dolby Stereo Matrix is straightforward: the four original channels: Left (L), Center (C), Right (R), and Surround (S), are combined into two, known as Left-total (LT) and Right-total (RT) by this formula:

Dolby Stereo MixLeftRightCenterSurround
Left Total
Right Total

where j = 90° phase-shift

The center channel information is carried by both LT and RT in phase, and surround channel information by both LT and RT but out of phase. The surround channel is a single limited frequency-range (7 kHz low-pass filtered [8] ) mono rear channel, dynamically compressed and placed with a lower volume than the rest. This allows for better separation of signals.

This gives good compatibility with both mono playback, which reproduces L, C and R from the mono speaker with C at a level 3 dB higher than L or R, but surround information cancels out. It also gives good compatibility with two-channel stereo playback where C is reproduced from both left and right speakers to form a phantom center and surround is reproduced from both speakers but in a diffuse manner.

A simple 4-channel decoder could simply send the sum signal (L+R) to the center speaker, and the difference signal (L-R) to the surrounds. But such a decoder would provide poor separation between adjacent speaker channels, thus anything intended for the center speaker would also reproduce from left and right speakers only 3 dB below the level in the center speaker. Similarly anything intended for the left speaker would be reproduced from both the center and surround speakers, again only 3 dB below the level in the left speaker. There is, however, complete separation between left and right, and between center and surround channels.

To overcome this problem the cinema decoder uses so-called "logic" circuitry to improve the separation. The logic circuitry decides which speaker channel has the highest signal level and gives it priority, attenuating the signals fed to the adjacent channels. Because there already is complete separation between opposite channels there is no need to attenuate those, in effect the decoder switches between L and R priority and C and S priority. This places some limitations on mixing for Dolby Stereo and to ensure that sound mixers mixed soundtracks appropriately they would monitor the sound mix via a Dolby Stereo encoder and decoder in tandem. In addition to the logic circuitry the surround channel is also fed via a delay, adjustable up to 100 ms to suit auditoria of differing sizes, to ensure that any leakage of program material intended for left or right speakers into the surround channel is always heard first from the intended speaker. This exploits the "Precedence effect" to localize the sound to the intended direction.

Dolby Pro Logic II matrix (5:2:5)

MatrixLeftRightCenterRear LeftRear Right
Left Total
Right Total

phase-shift, phase-shift

The Pro Logic II matrix provides for stereo full frequency back channels. Normally a sub-woofer channel is driven by simply filtering and redirecting the existing bass frequencies of the original stereo track.

See also

Related Research Articles

Dolby Digital, originally synonymous with Dolby AC-3, is the name for what has now become a family of audio compression technologies developed by Dolby Laboratories. Formerly named Dolby Stereo Digital until 1995, the audio compression is lossy, based on the modified discrete cosine transform (MDCT) algorithm. The first use of Dolby Digital was to provide digital sound in cinemas from 35 mm film prints; today, it is now also used for applications such as TV broadcast, radio broadcast via satellite, digital video streaming, DVDs, Blu-ray discs and game consoles.

<span class="mw-page-title-main">Quadraphonic sound</span> Four-channel speaker audio

Quadraphonic sound – equivalent to what is now called 4.0 surround sound – uses four audio channels in which speakers are positioned at the four corners of a listening space. The system allows for the reproduction of sound signals that are independent of one another.

<span class="mw-page-title-main">Surround sound</span> System with loudspeakers that surround the listener

Surround sound is a technique for enriching the fidelity and depth of sound reproduction by using multiple audio channels from speakers that surround the listener. Its first application was in movie theaters. Prior to surround sound, theater sound systems commonly had three screen channels of sound that played from three loudspeakers located in front of the audience. Surround sound adds one or more channels from loudspeakers to the side or behind the listener that are able to create the sensation of sound coming from any horizontal direction around the listener.

Dolby Pro Logic is a surround sound processing technology developed by Dolby Laboratories, designed to decode soundtracks encoded with Dolby Surround.

<span class="mw-page-title-main">Dolby</span> American audio technology company

Dolby Laboratories, Inc. is an American company specializing in audio noise reduction, audio encoding/compression, spatial audio, and HDR imaging. Dolby licenses its technologies to consumer electronics manufacturers.

<span class="mw-page-title-main">DTS (company)</span> Series of multichannel audio technologies

DTS, Inc. is an American company that makes multichannel audio technologies for film and video. Based in Calabasas, California, the company introduced its DTS technology in 1993 as a competitor to Dolby Laboratories, incorporating DTS in the film Jurassic Park (1993). The DTS product is used in surround sound formats for both commercial/theatrical and consumer-grade applications. It was known as The Digital Experience until 1995. DTS licenses its technologies to consumer electronics manufacturers.

Dolby Stereo is a sound format made by Dolby Laboratories. It is a unified brand for two completely different basic systems: the Dolby SVA 1976 system used with optical sound tracks on 35mm film, and Dolby Stereo 70mm noise reduction on 6-channel magnetic soundtracks on 70mm prints.

Ambisonic UHJ format is a development of the Ambisonic surround sound system designed to be compatible with mono and stereo media. It is a hierarchy of systems in which the recorded soundfield will be reproduced with a degree of accuracy that varies according to the available channels. Although UHJ permits the use of up to four channels, only the 2-channel variant is in current use. In Ambisonics, UHJ is also known as "C-Format".

<span class="mw-page-title-main">Stereo Quadraphonic</span> Matrix 4-channel quadraphonic sound system

SQ Quadraphonic was a matrix 4-channel quadraphonic sound system for vinyl LP records. It was introduced by CBS Records in 1971. Many recordings using this technology were released on LP during the 1970s.

<span class="mw-page-title-main">Center channel</span> Audio channel

Center channel refers to an audio channel common to many surround sound formats. It is the channel that is mostly, or fully, dedicated to the reproduction of the dialogue of an audiovisual program. The speaker(s) connected to the center channel are placed in the center of and behind the perforated projection screen, to give the effect that sounds from the center channel are coming from the screen. In many home surround sound units, the center channel is positioned above or below the video screen.

<span class="mw-page-title-main">Home audio</span>

Home audio systems are audio electronics intended for home entertainment use, such as shelf stereos, music centres and surround sound receivers. Home audio generally does not include standard equipment such as built-in television speakers, but rather accessory equipment, which may be intended to enhance or replace standard equipment, such as standard TV speakers. Since surround sound receivers, which are primarily intended to enhance the reproduction of a movie, are the most popular home audio device, the primary field of home audio is home cinema.

<span class="mw-page-title-main">Surround channels</span>

Surround channels are audio channels in surround sound multichannel audio. They primarily serve to deliver ambience and diffuse sounds in a film or music soundtrack.

Jim Fosgate American audio engineer

James M. Fosgate is an American inventor, engineer and businessman. The self-taught son of a television and radio repairman, Fosgate invented the first car amplifier in 1973 and founded Fosgate Electronics, now called Rockford Fosgate. Since his departure from Rockford Fosgate in 1981, Fosgate has remained active in the audio world, running Fosgate Laboratories and leading the team that created Dolby Pro Logic II. Fosgate was also the developer of one of the finest quadraphonic decoders, the TATE II 101A (see Stereo Quadraphonic for details), in collaboration with Peter Scheiber and Martin Willcocks, which was superseded by his 3601 decoder.

<span class="mw-page-title-main">QS Regular Matrix</span>

Quadraphonic Sound was a matrix 4-channel quadraphonic sound system for phonograph records. The system was based on technology created by Peter Scheiber, but further developed by engineer Ryosuke Ito of Sansui in the early 1970s.

<span class="mw-page-title-main">Compatible Discrete 4</span>

Compatible Discrete 4, also known as Quadradisc or CD-4 was as a discrete four-channel quadraphonic system for phonograph records. The system was created by JVC and RCA in 1971 and introduced in May 1972. Hundreds of recordings using this technology were released on LP during the 1970s.

The Hafler circuit is a passive electronics circuit with the aim of getting derived surround sound or ambiophony from regular stereo recordings without using costly electronics. Such circuits are generally known as matrix decoders. The Dynaquad system works using similar principles.

Stereo-4, also known as EV or EV-4, was a matrix 4-channel quadraphonic sound system developed in 1970 by Leonard Feldman and Jon Fixler.

<span class="mw-page-title-main">Dynaquad</span>

Dynaquad, or DY, was a matrix decoder 4-channel quadraphonic sound system developed by Dynaco in 1969.

<span class="mw-page-title-main">UD-4</span>

UD-4 was a discrete four-channel quadraphonic sound system for phonograph records introduced by Nippon Columbia (Denon) in 1974. This system had some similarities with the more successful CD-4 process introduced by JVC and RCA in 1972.

Matrix H was developed by BBC engineers in the late 1970s to carry quadraphonic sound via FM radio in a way that would be most compatible with existing mono and stereo receivers.

References

  1. Feldman, Leonard (1973). Four Channel Sound (1 ed.). Indianapolis IN: Howard W. Sams & C. Inc. pp. 49–51. ISBN   0-672-20966-7.
  2. 1 2 Patterson, Tab. "Encoding SQ at home" . Retrieved August 13, 2018.
  3. Feldman, Leonard (1973). Four Channel Sound (1 ed.). Indianapolis IN: Howard W. Sams & C. Inc. pp. 44–49. ISBN   0-672-20966-7.
  4. Gerzon, Michael (8 December 1977). "Don't say quadsay psychoacoustics". New Scientist. pp. 634–636. The forward oriented encoder is one of at least six different encoding options offered by SQ so that producers can decide for themselves which collection of faults they prefer.
  5. Bauer, Benjamin B.; Louis A. Abbagnaro; Daniel W. Gravereaux; Trevor J. Marshall (January–February 1978). "The Ghent Microphone System for SQ Quadraphonic Recording and Broadcasting". Journal of the Audio Engineering Society. AES. 26 (1/2): 2–11.
  6. “A subjective evaluation of FM Quadraphonic reproduction systems – Listening tests” Federal Communications Commission, Office of the Chief Engineer, Laboratory Division, Laurel, Maryland. Project Number 2710-1, August 1977
  7. P.S. Gaskell; P.A. Ratliff (February 1977). "QUADRAPHONY: developments in Matrix H decoding" (PDF). Research Department, Engineering Division. The British Broadcasting Corporation. BBC RD 1977/2. Archived (PDF) from the original on October 2, 2009. Retrieved August 13, 2018.
  8. "Dolby Surround Pro Logic II Decoder Principles of Operation" (PDF). Dolby Laboratories. Archived from the original (PDF) on 2012-01-28. Retrieved 2009-12-04.