Developer(s) | Nuance Communications |
---|---|
Initial release | June 1997 |
Stable release | 16 / February 28, 2023 |
Operating system | Microsoft Windows |
Available in | 8 languages |
Type | Speech recognition |
License | Proprietary |
Website | www |
Dragon NaturallySpeaking (also known as Dragon for PC, or DNS) [1] is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15 (Professional Individual and Legal Individual), [2] which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016. [3] [4]
Dragon NaturallySpeaking uses a minimal user interface. For example, dictated words appear in a floating tooltip as they are spoken (an option suppresses this display to increase speed), and when the speaker pauses, the program transcribes the words into the active window at the location of the cursor. (Dragon does not support dictating to background windows.) The software has three primary areas of functionality: dictation, in which speech is transcribed as written text; recognition of spoken commands; and text-to-speech, in which the text content of a document is read aloud. Voice profiles can be accessed from different computers in a networked environment, although the audio hardware and configuration must be identical to those of the machine that generated the configuration. The Professional version allows the creation of custom commands to control programs or functions not built into NaturallySpeaking.
Dr. James Baker laid out the description of a speech understanding system called DRAGON in 1975. [5] In 1982 he and Dr. Janet M. Baker, his wife, founded Dragon Systems to release products centered around their voice recognition prototype. [6] He was President of the company and she was CEO.
DragonDictate was first released for DOS, and utilized hidden Markov models, a probabilistic method for temporal pattern recognition. At the time, the hardware was not powerful enough to address the problem of word segmentation, and DragonDictate was unable to determine the boundaries of words during continuous speech input. Users were forced to enunciate one word at a time, clearly separated by a small pause after each word. DragonDictate was based on a trigram model, and is known as a discrete utterance speech recognition engine. [7]
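The role of a trigram model in such a system can be illustrated with a minimal sketch. The code below is not DragonDictate's implementation; the function names, the `<s>` start marker, and the toy corpus are assumptions made for illustration only. It estimates the probability of a word given the two words before it and uses that estimate to rank acoustically similar candidates, which is the general idea behind using word context to choose between homophones such as "write" and "right".

```python
from collections import Counter, defaultdict


def train_trigrams(sentences):
    """Count how often each word follows each pair of preceding words."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        # Pad with start markers so the first real word also has a two-word context.
        words = ["<s>", "<s>"] + sentence.lower().split()
        for w1, w2, w3 in zip(words, words[1:], words[2:]):
            counts[(w1, w2)][w3] += 1
    return counts


def trigram_probability(counts, w1, w2, w3):
    """Maximum-likelihood estimate of P(w3 | w1, w2); 0.0 for an unseen context."""
    context = counts.get((w1, w2))
    if not context:
        return 0.0
    return context[w3] / sum(context.values())


def rank_candidates(counts, w1, w2, candidates):
    """Order candidate words by trigram probability, most likely first."""
    return sorted(
        candidates,
        key=lambda w: trigram_probability(counts, w1, w2, w),
        reverse=True,
    )


# Hypothetical toy corpus; a real system would train on a large text collection.
corpus = [
    "please write a letter",
    "please write a note",
    "turn right at the corner",
    "please write a letter today",
]
model = train_trigrams(corpus)

# After "<s> please", the model prefers "write" over the homophone "right".
ranking = rank_candidates(model, "<s>", "please", ["right", "write"])
```

A production recognizer would combine such language-model scores with acoustic scores from the hidden Markov models and would smooth the counts so that unseen word sequences do not receive zero probability; both are omitted here for brevity.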
Dragon Systems released NaturallySpeaking 1.0 as their first continuous dictation product in 1997. [8]
The company was then purchased in June 2000 by Lernout & Hauspie, a Belgium-based corporation that was subsequently found to have been perpetrating financial fraud. [9] Following the all-share deal advised by Goldman Sachs, Lernout & Hauspie declared bankruptcy in November 2000. The deal was not originally supposed to be all stock and the unavailability of the Goldman Sachs team to advise concerning the change in terms was one of the grounds of the Bakers' subsequent lawsuit. The Bakers had received stock worth hundreds of millions of US dollars, but were only able to sell a few million dollars' worth before the stock lost all its value as a result of the accounting fraud. The Bakers sued Goldman Sachs for negligence, intentional misrepresentation and breach of fiduciary duty, which in January 2013 led to a 23-day trial in Boston. The jury cleared Goldman Sachs of all charges. [10] Following the bankruptcy of Lernout & Hauspie, the rights to the Dragon product line were acquired by ScanSoft of Burlington, Massachusetts, also a Goldman Sachs client. In 2005 ScanSoft launched a de facto acquisition of Nuance Communications, and rebranded itself as Nuance. [11]
As of 2012, LG Smart TVs included a voice recognition feature powered by the same speech engine as Dragon NaturallySpeaking. [12] In 2014, following the discontinuation of DragonDictate for Mac, a product dating back to Nuance's 2010 purchase of MacSpeech Dictate, NaturallySpeaking gained Mac compatibility, though Mac support was terminated in 2018. [13]
In 2021, Microsoft announced plans to acquire Nuance, and with it Dragon NaturallySpeaking. [14] The acquisition was completed in March 2022. [15] [16]
Dragon NaturallySpeaking version | Release date | Editions | Operating systems supported |
---|---|---|---|
1.0 | April 1997 | Personal | Windows 95, NT 4.0. |
2.0 | November 1997 | Standard, Preferred, Deluxe | Windows 95, NT 4.0 |
3.0 | October 1998 | Point & Speak, Standard, Preferred, Professional (with optional Legal and Medical add-on products) | Windows 95, 98, NT 4.0. |
4.0 | August 4, 1999 | Essentials, Standard, Preferred, Professional, Legal, Medical, Mobile | Windows 95, 98, NT 4.0 SP3+. |
5.0 | August 2000 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98, Me, NT 4.0 SP6+, 2000. |
6.0 | November 15, 2001 | Essentials, Standard, Preferred, Professional, Legal, Medical | |
7.0 | March 2003 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98SE, Me, NT4 SP6+, 2000, XP. |
8.0 | November 2004 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows Me (Only Standard and Preferred editions), Windows 2000 SP4+, Windows XP SP1+. |
9.0 | July 2006 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server | Windows 2000 SP4+, XP SP1+. |
9.5 | January 2007 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server | Windows 2000 SP4+, XP SP1+, Vista (32-bit). |
10.0 | August 7, 2008 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit). Server 2003. |
10.1 | March 2009 | Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit and 64-bit), Windows 7 (32 and 64-bit). Server 2003. |
11.0 | August 2010 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
11.0 | 2011 | SDK client (DSC), SDK server (DSS) | Windows XP SP2+ (32-bit only), Vista SP1+ (32-bit and 64-bit), Windows 7 (32-bit and 64-bit), Windows Server 2003 and 2008, SP1, SP2 and R2 (32-bit and 64-bit) |
11.5 | June 2011 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
11.0 | August 2011 | Medical (Dragon Medical Practice Edition) | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
12.0 | October 2012 | Home, Premium, Professional, Legal | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
12.5 | February 2013 | Home, Premium, Professional, Legal | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
12 | June 2013 | Medical (Dragon Medical Practice Edition 2) | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
13 | August 2014 | Home, Premium, Professional, and Legal. | 7 (32 and 64-bit), 8.1 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+ |
13 | September 2015 | Medical (UK, French, German) (Dragon Medical Practice Edition 3) | 7 (32 and 64-bit), 8.1 (32 and 64-bit), 10 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+ |
14 | September 2015 | Professional (Individual and Group) | 7 (32 and 64-bit), 8.1 (32 and 64-bit), 10 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+ |
15 | August 16, 2016 | Dragon Professional Individual; Dragon Legal Individual; Dragon Professional Individual for Mac (version 6) | 7, 8.1, 10 (32- and 64-bit); Server 2008 R2, Server 2012 R2. Mac OS X 10.11, macOS 10.12 |
15 | May 1, 2017 | Dragon Professional Group (Languages: English US and German only) | 7, 8.1, and 10, 32-bit and 64-bit |
15 | January 22, 2018 | Dragon Medical Practice Edition 4 (Languages: English US) | |
16 | February 28, 2023 | Dragon Professional | Windows 10, 11, Server 2016, 2019 and 2022 |
Dragon NaturallySpeaking 12 is available in the following languages: UK English, US English, French, German, Italian, Spanish, Dutch, and Japanese (sold as "Dragon Speech 11" in Japan).
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech-to-text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.
Dictaphone was an American company founded by Alexander Graham Bell that produced dictation machines. It is now a division of Nuance Communications, based in Burlington, Massachusetts.
IBM ViaVoice was a range of language-specific continuous speech recognition software products offered by IBM. The current version is designed primarily for use in embedded devices. The latest stable version of IBM ViaVoice was 9.0, which was able to transfer text directly into Microsoft Word.
Lernout & Hauspie Speech Products (L&H) was a Belgium-based speech recognition technology company, founded by Jo Lernout and Pol Hauspie, that went bankrupt in 2001 because of a fraud engineered by the management. The company was based in Ypres, Flanders, in what was later called Flanders Language Valley.
Nuance Communications, Inc. is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, that markets speech recognition and artificial intelligence software.
MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.
A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.
DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows, and was replaced by Dragon NaturallySpeaking for Windows. It was later acquired by Nuance Communications. Dragon Dictate for Mac 2.0 is supported only on Mac OS X 10.6. Nuance's other products for Mac include MacSpeech Scribe.
The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows applications. To date, a number of versions of the API have been released, which have shipped either as part of a Speech SDK or as part of the Windows OS itself. Applications that use SAPI include Microsoft Office, Microsoft Agent and Microsoft Speech Server.
iListen, developed by MacSpeech, is a speech recognition program for the Apple Macintosh. In 2006, iListen was the only third-party voice text-input software that worked on newer Macintosh models. Its competitors were Apple's own speech recognition software; Dragon NaturallySpeaking by Nuance, running under Windows virtualization software such as Parallels Desktop for Mac or VMware Fusion; and the discontinued speech recognition program ViaVoice by Nuance/IBM.
As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.
Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks.
The Voice Navigator was the first voice recognition device for command and control of a graphical user interface. The system, developed by Articulate Systems, Inc., was originally designed for the Apple Macintosh Plus and released in 1989. Subsequent versions were created for Microsoft Windows. Articulate Systems, Inc. was acquired by Dragon Systems in 1998.
MacSpeech Scribe is speech recognition software for Mac OS X designed specifically for transcription of recorded voice dictation. It runs on Mac OS X 10.6 Snow Leopard. The software transcribes dictation recorded by an individual speaker. Typically, the speaker will record their dictation using a digital recording device such as a handheld digital recorder, mobile smartphone, or desktop or laptop computer with a suitable microphone. MacSpeech Scribe supports specific audio file formats for recorded dictation: .aif, .aiff, .wav, .mp4, .m4a, and .m4v.
Dragon Dictation started as a speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.
Tazti is a speech recognition software package developed and sold by Voice Tech Group, Inc. for Windows personal computers. The most recent package is version 3.2, which supports Windows 10, Windows 8.1, Windows 8 and Windows 7 64-bit editions. Earlier versions of Tazti supported Windows Vista and Windows XP. Tazti's primary features are playing PC video games by voice, controlling PC applications and programs by voice, and creating speech commands that trigger a browser to open web pages or trigger the Windows operating system to open files, folders or programs. Earlier versions of Tazti included a lite dictation feature that was removed from the latest version.
Speech Processing Solutions is an international electronics company headquartered in Vienna, Austria. The company designs, develops, manufactures and markets speech processing devices, such as those used in digital dictation and speech recognition. Speech Processing Solutions was formed on 1 July 2012. Philips Speech Processing was part of the Philips Consumer Lifestyle sector. Speech Processing Solutions is now an official licensee of the Philips brand. The company has subsidiaries in the US, Canada, Australia, the United Kingdom, Belgium, France and Germany, and employs around 170 people worldwide.
Braina is a virtual assistant and speech-to-text dictation application for Microsoft Windows developed by Brainasoft. Braina uses a natural language interface, speech synthesis, and speech recognition technology to interact with its users and allows them to use natural language sentences to perform various tasks on a computer. The name Braina is a short form of "Brain Artificial".
Michael Phillips is the CEO and co-founder of Sense Labs and a pioneer in machine learning, including mobile speech recognition and text-to-speech technology.