SpeechMagic

Last updated

SpeechMagic is an industrial speech recognition platform developed by Philips Speech Recognition Systems of Vienna, Austria. The name is also used by the company for services supporting information capture. The technology is mainly used in the healthcare sector, with applications available for the legal market and tax consultants.

On October 1, 2008, Nuance Communications, Inc. announced that it had acquired Philips Speech Recognition Systems. [1]

SpeechMagic supports 25 recognition languages and provides more than 150 ConTexts (industry-specific vocabularies). The world’s largest location using SpeechMagic is the United States with more than 60,000 authors, more than 3,000 editors and a throughput of 400 million lines per year. [2] [ full citation needed ]

Related Research Articles

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers with the main benefit of searchability. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

SpeechWorks was a company founded in Boston in 1994 by speech recognition pioneer Mike Phillips and Bill O'Farrell. The Boston-based company developed and supported speech-related computer software. Originally known as Applied Language Technologies, SpeechWorks went public in 2000 and tripled its value. ScanSoft acquired Nuance in 2003, and changed its name to Nuance Communications.

Nuance Communications, Inc. is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, that markets speech recognition and artificial intelligence software.

<span class="mw-page-title-main">MacSpeech</span> Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

<span class="mw-page-title-main">Dragon NaturallySpeaking</span> Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

A voice-user interface (VUI) makes spoken human interaction with computers possible, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows. It has now been replaced by Dragon NaturallySpeaking for Windows, and has since been acquired by Nuance Communications. Dragon Dictate for Mac 2.0 is supported only on Mac OS X 10.6. Nuance's other products for Mac include MacSpeech Scribe.

OmniPage is an optical character recognition (OCR) application available from Kofax Incorporated.

<span class="mw-page-title-main">Virtual assistant</span> Mobile software agent

An intelligent virtual assistant (IVA) or intelligent personal assistant (IPA) is a software agent that can perform tasks or services for an individual based on commands or questions. The term "chatbot" is sometimes used to refer to virtual assistants generally or specifically accessed by online chat. In some cases, online chat programs are exclusively for entertainment purposes. Some virtual assistants are able to interpret human speech and respond via synthesized voices. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal commands. A similar concept, however with differences, underlies the dialogue systems.

Voxeo Corporation was a technology company that specialized in providing development platforms for unified customer experience (self-service) and unified communications applications. Voxeo was headquartered in Orlando, Florida with main offices in Cologne, Germany; Beijing, China; London, UK and San Francisco, US.

SVOX is an embedded speech technology company founded in 2000 and headquartered in Zurich, Switzerland. SVOX was acquired by Nuance Communications in 2011. The company's products included Automated Speech Recognition (ASR), Text-to-Speech (TTS) and Speech Dialog systems, with customers mostly being manufacturers and system integrators in automotive and mobile device industries.

<span class="mw-page-title-main">Siri</span> Software based personal assistant from Apple Inc.

Siri is a virtual assistant that is part of Apple Inc.'s iOS, iPadOS, watchOS, macOS, tvOS, and audioOS operating systems. It uses voice queries, gesture based control, focus-tracking and a natural-language user interface to answer questions, make recommendations, and perform actions by delegating requests to a set of Internet services. With continued use, it adapts to users' individual language usages, searches and preferences, returning individualized results.

The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions of Microsoft text-to-speech voices. Client voices are shipped with Windows operating systems; server voices are available for download for use with server applications such as Speech Server, Lync etc. for both Windows client and server platforms, and mobile voices are often shipped with more recent versions.

Dragon Dictation started as speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications, and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.

Loquendo is a multinational computer software technology corporation, headquartered in Torino, Italy, that provides speech recognition, speech synthesis, speaker verification and identification applications. Loquendo, which was founded in 2001 under the Telecom Italia Lab, also had offices in United Kingdom, Spain, Germany, France, and the United States.

MedSLT is a medium-ranged open source spoken language translator developed by the University of Geneva. It is funded by the Swiss National Science Foundation. The system has been designed for the medical domain. It currently covers the doctor-patient diagnosis dialogues for the domains of headache, chest and abdominal pain in English, French, Japanese, Spanish, Catalan and Arabic. The vocabulary used ranges from 350 to 1000 words depending on the domain and language pair.

<span class="mw-page-title-main">Speech Processing Solutions</span> Manufacturer of speech processing devices

Speech Processing Solutions is an international electronics company headquartered in Vienna, Austria. The company designs, develops, manufactures and markets speech processing devices, such as those used in digital dictation and speech recognition. Speech Processing Solutions was formed on 1 July 2012. Philips Speech Processing was part of the Philips Consumer Lifestyle sector. Speech Processing Solutions is now an official licensee of the Philips brand. The company has subsidiaries in the US, Canada, Australia, the United Kingdom, Belgium, France and Germany, and employs around 170 people worldwide.

<span class="mw-page-title-main">Automotive Grade Linux</span>

Automotive Grade Linux (AGL) is an open source project hosted by The Linux Foundation that is building an open operating system and framework for automotive applications. AGL was launched in 2012 with founding members including Jaguar Land Rover, Nissan, Toyota, DENSO Corporation, Fujitsu, HARMAN, NVIDIA, Renesas, Samsung, and Texas Instruments (TI). Today, AGL has 150 members. Notably, IndyKite, Marelli, and Red Hat joined as members in April 2022.

<span class="mw-page-title-main">Sanbot (robot)</span> Service robot by Qihan Technology Co. Ltd

Sanbot is an intelligent, cloud-enabled service robot developed by Qihan Technology Co. Ltd., a robotics and AI company headquartered in Shenzhen, China. Qihan has launched three generations of intelligent robots under the Sanbot brand – Sanbot Elf(QIHAN Technology renamed the first generation Sanbot to Sanbot Elf), Sanbot King Kong and Sanbot Nano.

References

  1. www.nuance.com press release archived at Wayback Machine
  2. MedQuist DocQment Enterprise Platform with Integrated Speech Recognition