Sound recognition

Sound recognition is a technology based on both traditional pattern recognition theory and audio signal analysis methods. Sound recognition systems comprise preliminary data processing, feature extraction, and classification algorithms. The classifier operates on feature vectors, which are produced by the preliminary data processing and by linear predictive coding.
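The three stages named above (preliminary processing, feature extraction by linear predictive coding, and classification) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration rather than part of any particular system: the Hamming window, the prediction order of 2, the nearest-centroid classifier, the helper names, and the synthetic noisy tones that stand in for real recordings.

```python
import math
import random


def preprocess(signal):
    """Preliminary processing: remove the mean, then apply a Hamming window."""
    n = len(signal)
    mean = sum(signal) / n
    return [
        (x - mean) * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
        for i, x in enumerate(signal)
    ]


def lpc_coefficients(frame, order):
    """Estimate predictor coefficients a[1..order] with the Levinson-Durbin
    recursion, so that frame[n] is approximated by sum(a[k] * frame[n - k])."""
    n = len(frame)
    # Autocorrelation of the frame for lags 0..order.
    r = [sum(frame[i] * frame[i + lag] for i in range(n - lag))
         for lag in range(order + 1)]
    a = [0.0] * (order + 1)
    error = r[0]
    for i in range(1, order + 1):
        if error <= 1e-12:  # silent or perfectly predictable frame
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / error
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        error *= 1.0 - k * k
    return a[1:]


def extract_features(signal, order=2):
    """Feature vector = LPC coefficients of the preprocessed signal."""
    return lpc_coefficients(preprocess(signal), order)


def train_centroids(examples):
    """examples maps label -> list of feature vectors; returns label -> mean vector."""
    return {
        label: [sum(col) / len(vectors) for col in zip(*vectors)]
        for label, vectors in examples.items()
    }


def classify(features, centroids):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


def make_tone(freq, rng, n=256, noise=0.05):
    """Hypothetical test signal: a noisy sine, freq in cycles per sample."""
    phase = rng.uniform(0, 2 * math.pi)
    return [math.sin(2 * math.pi * freq * i + phase) + rng.gauss(0, noise)
            for i in range(n)]


rng = random.Random(0)
centroids = train_centroids({
    "low_tone": [extract_features(make_tone(0.05, rng)) for _ in range(5)],
    "high_tone": [extract_features(make_tone(0.20, rng)) for _ in range(5)],
})
print(classify(extract_features(make_tone(0.20, rng)), centroids))
```

A practical recognizer would typically split the audio into short overlapping frames, use a higher prediction order, and train a stronger classifier, but the flow from signal to feature vector to label is the same.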

Sound recognition technologies are used for:

Security

In monitoring and security, sound recognition techniques can make an important contribution to alarm detection and alarm verification. In particular, these methods can help with intrusion detection in places such as offices, stores, and private homes, or with the supervision of public premises where people are exposed to aggression. In all these cases, a recognition system can report a danger or distress event. It can also identify sounds such as breaking glass, doorbells, smoke detector alarms, red alerts, human screams, and baby cries. Sometimes the alarm is triggered by other detectors (e.g. temperature or video-based), and the sound recognizer is combined with these other modalities to verify the alarm, with the aim of reducing the overall false alarm rate.
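The verification idea in the last sentence can be sketched as a simple rule: an alarm raised by a primary detector is confirmed only if the sound recognizer also reports a dangerous sound with enough confidence. The label names, the 0.7 threshold, and the assumption that the two detectors produce false alarms independently are all hypothetical choices for this illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SoundEvent:
    label: str         # hypothetical class name produced by a sound recognizer
    confidence: float  # classifier score in [0, 1]


# Hypothetical set of sound classes treated as signs of danger.
DANGER_LABELS = {"glass_break", "smoke_alarm", "scream"}


def verify_alarm(primary_triggered: bool,
                 sound_event: Optional[SoundEvent],
                 min_confidence: float = 0.7) -> bool:
    """Confirm an alarm only when the primary detector and the sound recognizer agree."""
    if not primary_triggered or sound_event is None:
        return False
    return (sound_event.label in DANGER_LABELS
            and sound_event.confidence >= min_confidence)


def combined_false_alarm_rate(p_primary: float, p_sound: float) -> float:
    """False-alarm probability of the AND rule, assuming the two detectors
    raise false alarms independently of each other."""
    return p_primary * p_sound


print(verify_alarm(True, SoundEvent("glass_break", 0.9)))
print(verify_alarm(True, SoundEvent("doorbell", 0.95)))
print(combined_false_alarm_rate(0.05, 0.1))
```

Requiring agreement lowers the false alarm rate at the cost of possibly missing real events that only one detector picks up, so the threshold is a trade-off rather than a fixed value.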

Assistance

Solutions based on sound recognition technology can assist disabled and elderly people with impaired hearing, helping them to keep or recover some independence in their daily activities. [1]

Companies

Only a handful of companies are working on sound recognition technology.

References

  1. Arslan, Yuksel; Guldogan, Burak (2015). "Impulsive sound detection and gunshot recognition". 2015 23nd Signal Processing and Communications Applications Conference (SIU). pp. 511–514. doi:10.1109/SIU.2015.7129872. ISBN 978-1-4673-7386-9. S2CID 6574487.
  2. "Audio Analytic - enabling intelligent products through sound recognition". Audio Analytic. Retrieved 2018-04-09.
  3. US 10062304, Watkins, Greyson Kendall; Baltzer, Zachary & Lamb, Nicholas, "Apparatus and method for wireless sound recognition to notify users of detected sounds", published 2018-08-28, assigned to Hz Innovations Inc.