Ensonido

Last updated

Ensonido is a real-time post processing algorithm that allows users to play back MP3 Surround files in standard headphones. Ensonido was developed by the Fraunhofer Society. It simulates the natural reception of surround sound by the human ear, which usually receives tones from surrounding loudspeakers and from reflections and echoes of the listening room. The out-of-head localization achieved that way increases the listening comfort noticeably in contrast to conventional stereo headphone listening with its in-head localization of all sounds. In version 3.0 of the Fraunhofer IIS MP3 Surround Player, Ensonido is replaced with newer mp3HD

MP3 Surround

MP3 Surround is an extension of MP3 for multi-channel audio support including 5.1 surround sound. It was developed by Fraunhofer IIS in collaboration with Thomson and Agere Systems, and released in December 2004.

The Fraunhofer Society is a German research organization with 72 institutes spread throughout Germany, each focusing on different fields of applied science. With some 26,600 employees, mainly scientists and engineers and with an annual research budget of about €2.6 billion it is the biggest organization for applied research and development services in Europe.

MPEG-1 Audio Layer III HD more commonly known and advertised by its abbreviation mp3HD is an audio compression codec developed by Technicolor formerly known as Thomson. It achieves lossless data compression, and is backwards compatible with the MP3 format by storing two data streams in one file.

Related Research Articles

Disc jockey Person who plays recorded music for an audience

A disc jockey, more commonly abbreviated as DJ, is a person who plays existing recorded music for a live audience. Most common types of DJs include radio DJ, club DJ who performs at a nightclub or music festival and turntablist who uses record players, usually turntables, to manipulate sounds on phonograph records. Originally, the disc in disc jockey referred to gramophone records, but now DJ is used as an all-encompassing term to describe someone who mixes recorded music from any source, including cassettes, CDs or digital audio files on a CDJ or laptop. The title 'DJ' is commonly used by DJs in front of their real names or adopted pseudonyms or stage names. In recent years it has become common for DJs to be featured as the credited artist on tracks they produced despite having a guest vocalist that performs the entire song: like for example Uptown Funk.

LAME audio encoder

LAME is a software encoder that converts audio to the MP3 file format. LAME is a free software project that was first released in 1998, and has incorporated many improvements since then, including an improved psychoacoustic model. The LAME encoder vastly outperforms early encoders like L3enc.

Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression. Vorbis is most commonly used in conjunction with the Ogg container format and it is therefore often referred to as Ogg Vorbis.

Binaural recording method of recording sound that uses two microphones

Binaural recording is a method of recording sound that uses two microphones, arranged with the intent to create a 3-D stereo sound sensation for the listener of actually being in the room with the performers or instruments. This effect is often created using a technique known as "dummy head recording", wherein a mannequin head is outfitted with a microphone in each ear. Binaural recording is intended for replay using headphones and will not translate properly over stereo speakers. This idea of a three dimensional or "internal" form of sound has also translated into useful advancement of technology in many things such as stethoscopes creating "in-head" acoustics and IMAX movies being able to create a three dimensional acoustic experience.

Head-related transfer function

A head-related transfer function (HRTF) also sometimes known as the anatomical transfer function (ATF) is a response that characterizes how an ear receives a sound from a point in space. As sound strikes the listener, the size and shape of the head, ears, ear canal, density of the head, size and shape of nasal and oral cavities, all transform the sound and affect how it is perceived, boosting some frequencies and attenuating others. Generally speaking, the HRTF boosts frequencies from 2–5 kHz with a primary resonance of +17 dB at 2,700 Hz. But the response curve is more complex than a single bump, affects a broad frequency spectrum, and varies significantly from person to person.

Headphones pair of small speakers held close to a users ears

Headphones traditionally refer to a pair of small loudspeaker drivers worn on or around the head over a user's ears. They are electroacoustic transducers, which convert an electrical signal to a corresponding sound. Headphones let a single user listen to an audio source privately, in contrast to a loudspeaker, which emits sound into the open air for anyone nearby to hear. Headphones are also known as earspeakers, earphones or, colloquially, cans. Circumaural and supra-aural headphones use a band over the top of the head to hold the speakers in place. Another type, known as earbuds or earpieces consist of individual units that plug into the user's ear canal. A third type are bone conduction headphones, which typically wrap around the back of the head and rest in front of the ear canal, leaving the ear canal open.

WinDVD

WinDVD is a commercial video player and music player software for Microsoft Windows. It enables the viewing of DVD-Video movies on the user's PC. DVD-Video backups stored on hard disk can also be played. The player can also be used to play videos and audio/music files in other formats encoded with different codecs, for instance DivX, Xvid, Windows Media Video video and MP3 and AAC audio. Newer versions also support full Blu-ray Disc and HD DVD playback with menus, with CPRM DRM support.

3D audio effects are a group of sound effects that manipulate the sound produced by stereo speakers, surround-sound speakers, speaker-arrays, or headphones. This frequently involves the virtual placement of sound sources anywhere in three-dimensional space, including behind, above or below the listener.

Portable media player Portable device capable of storing and playing digital media

A portable media player (PMP) or digital audio player (DAP) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored on a CD, DVD, BD, flash memory, microdrive, or hard drive. Most portable media players are equipped with a 3.5 mm headphone jack, which users can plug headphones into, or connect to a boombox or hifi system. In contrast, analogue portable audio players play music from non-digital media that use analogue signal storage, such as cassette tapes or vinyl records.

Fraunhofer l3enc was the first public software able to encode PCM (.wav) files to the MP3 format. The first public version was released on July 13, 1994. This commandline tool was shareware and limited to 112 kbit/s. It was available for MS-DOS, Linux, Solaris, SunOS, NeXTstep and IRIX. A licence that allowed full use cost 350 Deutsche Mark, or about 250 US$.

In acoustics, the dummy head recording is a method of recording used to generate binaural recordings. The tracks are then listened to through headphones allowing for the listener to hear from the dummy’s perspective. The dummy head is designed to record multiple sounds at the same time enabling it to be exceptional at recording music as well as in other industries where multiple sound sources are involved.

Virtual surround is an audio system that attempts to create the perception that there are many more sources of sound than are actually present. In order to achieve this, it is necessary to devise some means of tricking the human auditory system into thinking that a sound is coming from somewhere that it is not. Most recent examples of such systems are designed to simulate the true (physical) surround sound experience using one, two or three loudspeakers. Such systems are popular among consumers who want to enjoy the experience of surround sound without the large number of speakers that are traditionally required to do so.

mp3 SX is a program that allows users to upgrade mp3 stereo files to MP3 Surround files. mp3 SX analyzes the existing natural ambience of the stereo material and plays it back through the rear channels. The sound sources remain in the front channels, but are played back through the Left, Center and Right channel, providing a stable front image even for off-sweet-spot listening. mp3 SX preserves the original stereo sound stage, creating additional surround envelopment, with only 15 kB/s additional information.

The Sound Retrieval System (SRS) is a patented psychoacoustic 3D audio processing technology originally invented by Arnold Klayman in the early 1980s.. The SRS technology applies head-related transfer functions (HRTFs) to create an immersive 3D soundfield using only two speakers, widening the "sweet spot", creating a more spacious sense of ambience, and producing strong localization cues for discrete instruments within an audio mix. SRS is not a Dolby matrix surround decoder but works with normal stereo recordings.

MPEG Surround, also known as Spatial Audio Coding (SAC) is a lossy compression format for surround sound that provides a method for extending mono or stereo audio services to multi-channel audio in a backwards compatible fashion. The total bit rates used for the core and the MPEG Surround data are typically only slightly higher than the bit rates used for coding of the core. MPEG Surround adds a side-information stream to the core bit stream, containing spatial image data. Legacy stereo playback systems will ignore this side-information while players supporting MPEG Surround decoding will output the reconstructed multi-channel audio.

Ambiophonics is a method in the public domain that employs digital signal processing (DSP) and two loudspeakers directly in front of the listener in order to improve reproduction of stereophonic and 5.1 surround sound for music, movies, and games in home theaters, gaming PCs, workstations, or studio monitoring applications. First implemented using mechanical means in 1986, today a number of hardware and VST plug-in makers offer Ambiophonic DSP. Ambiophonics eliminates crosstalk inherent in the conventional “stereo triangle” speaker placement, and thereby generates a speaker-binaural soundfield that emulates headphone-binaural sound, and creates for the listener improved perception of “reality” of recorded auditory scenes. A second speaker pair can be added in back in order to enable 360° surround sound reproduction. Additional surround speakers may be used for hall ambience, including height, if desired.

InfoZoom software is a data analysis, business intelligence and data visualization software product created using in-memory analytics. The software is created and supported by humanIT and the Fraunhofer Institute FIT, the same organization that created MP3 compression technology. The software has over 100,000 licensed users and over 1000 customers worldwide.

MPEG-H 3D Audio, specified as ISO/IEC 23008-3, is an audio coding standard developed by the ISO/IEC Moving Picture Experts Group (MPEG) to support coding audio as audio channels, audio objects, or higher order ambisonics (HOA). MPEG-H 3D Audio can support up to 64 loudspeaker channels and 128 codec core channels.