Virtual surround

Virtual surround is an audio system that attempts to create the perception that there are many more sources of sound than are actually present. To achieve this, the system must trick the human auditory system into perceiving a sound as coming from a location where no source exists. Most recent systems are designed to simulate the true (physical) surround sound experience using one, two or three loudspeakers. They are popular among consumers who want to enjoy surround sound without the large number of speakers traditionally required. [1]

Types

A virtual surround system must provide 2-dimensional imaging of sound by exploiting properties of the human auditory system. How the auditory system localises a sound source is studied in the field of psychoacoustics, so virtual surround systems use knowledge of psychoacoustics to "trick" the listener. Several approaches have been attempted.

Using HRTFs

Some methods use knowledge of the head-related transfer function (HRTF). With an appropriate HRTF, the signals required at the eardrums for the listener to perceive sound from any direction can be calculated. These signals are then recreated at the eardrums using either headphones or a crosstalk cancellation method. [2] [3] The disadvantage of this approach is that it is very difficult to make these systems work for more than one listener at a time.
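When the calculated ear signals are delivered over two loudspeakers rather than headphones, each ear also hears the opposite loudspeaker, so the loudspeaker feeds must be pre-compensated. The following is a minimal sketch of that idea at a single frequency, assuming the four speaker-to-ear paths are known complex gains. The function names and the numeric values in the transfer matrix are hypothetical and chosen only for illustration; a practical system would do this per frequency with measured HRTFs and some regularisation.

```python
import cmath


def invert_2x2(matrix):
    """Invert a 2x2 complex matrix given as [[a, b], [c, d]]."""
    (a, b), (c, d) = matrix
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("transfer matrix is singular at this frequency")
    return [[d / det, -b / det], [-c / det, a / det]]


def ear_signals(transfer, speakers):
    """Signals arriving at [left ear, right ear] for the given speaker feeds.

    transfer[i][j] is the complex gain from speaker j to ear i.
    """
    return [
        transfer[0][0] * speakers[0] + transfer[0][1] * speakers[1],
        transfer[1][0] * speakers[0] + transfer[1][1] * speakers[1],
    ]


def crosstalk_cancel(transfer, desired):
    """Speaker feeds that reproduce the desired ear signals.

    Solves transfer * speakers = desired by direct matrix inversion.
    """
    return ear_signals(invert_2x2(transfer), desired)


# Hypothetical symmetric setup: unit-gain direct paths, and attenuated,
# phase-shifted cross paths (values are illustrative assumptions).
direct = 1.0 + 0.0j
cross = 0.5 * cmath.exp(-0.3j)
transfer = [[direct, cross], [cross, direct]]

desired = [1.0 + 0.0j, 0.0 + 0.0j]  # sound at the left ear only
speakers = crosstalk_cancel(transfer, desired)
reproduced = ear_signals(transfer, speakers)
```

Here `reproduced` matches `desired` to within rounding error. The inversion is only valid for the head position assumed by `transfer`, which is one way to see why such systems are hard to make work for several listeners at once.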

Using reflections

Some virtual surround systems work by directing a strong beam of sound to reflect off the walls of a room so that the listener hears the reflection at a higher level than the sound directly from the loudspeaker. One example of this technology is the commercially available Digital Sound Projector by Cambridge Mechatronics (formerly 1 Ltd). It employs 40 micro drivers and 2 woofers as well as projection technology to control the direction of the sound. The micro drivers' sound is focused into groups of "beams" that reflect off the room's walls. The center channel's sound is projected directly to the listening position. Another example is S-Logic, marketed by the German headphone manufacturer Ultrasone. With this technology (which may also be considered a hybrid of HRTF and reflection-based methods), decentralized transducer positioning is used to spread sound over the outer ear in an attempt to mimic sound heard over speakers.
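One common way to steer a beam from an array of small drivers is delay-and-sum beamforming, in which each driver plays the same signal with a slightly different delay. The sketch below assumes a uniformly spaced line array and a far-field target. It is not a description of how any particular product works, and the function name, the spacing and the angle are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # metres per second, approximate value in room-temperature air


def steering_delays(num_drivers, spacing_m, angle_deg):
    """Per-driver delays (seconds) that steer a line array off its broadside axis.

    Driver n sits at position n * spacing_m along the array. A positive
    angle steers toward the higher-numbered drivers. Delays are shifted so
    the smallest one is zero, keeping every delay non-negative.
    """
    theta = math.radians(angle_deg)
    raw = [
        n * spacing_m * math.sin(theta) / SPEED_OF_SOUND
        for n in range(num_drivers)
    ]
    earliest = min(raw)
    return [t - earliest for t in raw]


# Hypothetical example: 8 drivers, 5 cm apart, beam aimed 30 degrees off axis.
delays = steering_delays(8, 0.05, 30.0)
```

With these values, adjacent drivers differ by roughly 73 microseconds. Aiming separate beams at the side walls for the surround channels would use a different angle, and therefore a different set of delays, for each channel.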

For virtual surround to be effective, the room should be symmetrical about the perpendicular to the line between the speakers, both physically and in the absorbing characteristics of the left and right walls. An absorptive piece of furniture close to one speaker that is not matched on the other side will cause the sound field to shift toward the "live" side of the room. The resulting "sound stage" is affected by any such asymmetry.

Creating a diffuse source

Perception of direction is greatly affected by the relative time at which a sound arrives at each ear and by any difference in the amplitude of the sound at each ear. It is possible to create a sound source whose output varies rapidly with direction and with signal frequency. Such sources create sound fields that vary rapidly around the listener's room. They are often referred to as diffuse sources because their output resembles a diffuse sound field: a sound field where sound waves are traveling in all directions with equal probability. In a diffuse field the sound at each of a listener's ears is so completely different that it is impossible for the brain to work out where the sound has come from. A diffuse source located in front of the listener will be hard to localize and can be used to carry the surround signals. [4]
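To give a sense of the scale of the arrival-time cue mentioned above, the interaural time difference can be estimated with Woodworth's spherical-head approximation, ITD = (a / c)(θ + sin θ), where a is the head radius, c the speed of sound and θ the source azimuth. This is a simplified model introduced here for illustration, not something taken from the cited sources; the default head radius is an assumed typical value, and the formula is intended for far-field sources at azimuths between -90 and 90 degrees.

```python
import math

SPEED_OF_SOUND = 343.0  # metres per second, approximate value in room-temperature air


def woodworth_itd(azimuth_deg, head_radius_m=0.0875):
    """Approximate interaural time difference in seconds.

    azimuth_deg is measured from straight ahead, positive toward one ear,
    and should lie within [-90, 90]. head_radius_m is an assumed average.
    """
    if not -90.0 <= azimuth_deg <= 90.0:
        raise ValueError("model is only intended for azimuths in [-90, 90]")
    theta = math.radians(azimuth_deg)
    return head_radius_m / SPEED_OF_SOUND * (theta + math.sin(theta))


itd_front = woodworth_itd(0.0)   # source straight ahead: no time difference
itd_side = woodworth_itd(90.0)   # source directly to one side: largest difference
```

Under these assumptions the largest difference is well under a millisecond. A diffuse source works by making such cues fluctuate rapidly and inconsistently, so that they do not add up to a stable direction.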

Notes

  1. "DTS Virtual X Vs Dolby Atmos Height Virtualization Vs Virtual Surround? Which Is Better?". homelytainment.com. 2022-06-24. Retrieved 2023-09-24.
  2. Kirkeby, Ole; Nelson, Philip A.; Hamada, Hareo (May 1998). "The 'Stereo Dipole': A Virtual Source Imaging System Using Two Closely Spaced Loudspeakers". Journal of the Audio Engineering Society. 46: 387–395.
  3. For an application of this method, see Virtual Acoustics And Audio Engineering, Institute of Sound and Vibration Research, University of Southampton.
  4. "How Virtual Surround Sound Works". HowStuffWorks. 2007-05-31. Retrieved 2023-09-24.
