SVOX

Type: Private
Industry: Speech Processing, Natural Language Processing
Predecessor: SVOX GmbH
Founded: 2000 (GmbH), 2001 (AG)
Founders: Volker Jantzen, Christof Traber (GmbH) [1], Bettina Hein, Thomas Benz (AG) [2]
Headquarters: Zurich, Switzerland
Area served: Worldwide
Key people: Martin Reber (CEO)
Products: Speech Recognition (ASR), Speech Output (TTS), Speech Dialog
Number of employees: 90 (2009)

SVOX was an embedded speech technology company founded in 2000 and headquartered in Zurich, Switzerland. It was acquired by Nuance Communications in 2011. The company's products included Automated Speech Recognition (ASR), Text-to-Speech (TTS) and Speech Dialog systems; its customers were mostly manufacturers and system integrators in the automotive and mobile device industries.

History

SVOX was started in 2000 by researchers at the Swiss Federal Institute of Technology in Zurich (ETH Zurich) and at first focused exclusively on Speech Output (TTS) solutions for the automotive industry.

In 2002, Siemens Mobile Acceleration (today's smac|partners GmbH) invested in SVOX. [3]

Later, as the market for Personal Navigation Devices and smartphones developed, the company started to supply those markets as well. In 2008, SVOX released Pico, a small-footprint TTS system optimized for mobile phones. [4]

In parallel, SVOX branched into Speech Recognition and Speech Dialog. As part of that process, the company acquired the Professional Speech Processing Group of Siemens AG in early 2009. [5]

In 2009, SVOX made headlines with the news that Google had chosen to include the company's Pico TTS solution in the 1.6 release of the Android platform. [6]

In June 2011, Nuance Communications acquired SVOX. [7]

Products

SVOX products included Automated Speech Recognition (ASR), Text-to-Speech (TTS) and Speech Dialog systems. Typical uses included:

The company's speech products were especially popular with German carmakers such as Audi, Porsche, BMW, Daimler, and VW, and were often found in premium cars.

References

  1. Handelsregister des Kantons Zürich. "Internet-Auszug".
  2. Handelsregister des Kantons Zürich. "Internet-Auszug".
  3. "Erste Investition in der Schweiz Siemens Mobile Acceleration investiert in den Schweizer Text-to-Speech-Spezialisten SVOX AG". na presseportal.
  4. "SVOX releases Pico: highest-quality sub-1 MB Text-to-Speech system available". PresseBox.
  5. Boretz, Adam. "SVOX Acquires Siemens' Speech Unit". Speech Technology Magazine.
  6. Conneally, Tim. "Android 'Donut' SDK released: What's new inside".
  7. Nuance Press Release. "Nuance acquires SVOX".