This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these template messages) |
Developer(s) | Nuance Communications |
---|---|
Initial release | June 1997 |
Stable release | 16 / February 28, 2023 |
Operating system | Microsoft Windows |
Available in | 8 languages |
Type | Speech recognition |
License | Proprietary |
Website | www |
Dragon NaturallySpeaking (also known as Dragon for PC, or DNS) [1] is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15 (Professional Individual and Legal Individual), [2] which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016. [3] [4]
Dragon NaturallySpeaking uses a minimal user interface. As an example, dictated words appear in a floating tooltip as they are spoken (though there is an option to suppress this display to increase speed), and when the speaker pauses, the program transcribes the words into the active window at the location of the cursor. (Dragon does not support dictating to background windows.) The software has three primary areas of functionality: voice recognition in dictation with speech transcribed as written text, recognition of spoken commands, and text-to-speech: speaking text content of a document. Voice profiles can be accessed by different computers in a networked environment, although the audio hardware and configuration must be identical to those of the machine generating the configuration. The Professional version allows creation of custom commands to control programs or functions not built into NaturallySpeaking.
Dr. James Baker laid out the description of a speech understanding system called DRAGON in 1975. [5] In 1982 he and Dr. Janet M. Baker, his wife, founded Dragon Systems to release products centered around their voice recognition prototype. [6] He was President of the company and she was CEO.
DragonDictate was first released for DOS, and utilized hidden Markov models, a probabilistic method for temporal pattern recognition. At the time, the hardware was not powerful enough to address the problem of word segmentation, and DragonDictate was unable to determine the boundaries of words during continuous speech input. Users were forced to enunciate one word at a time, clearly separated by a small pause after each word. DragonDictate was based on a trigram model, and is known as a discrete utterance speech recognition engine. [7]
Dragon Systems released NaturallySpeaking 1.0 as their first continuous dictation product in 1997. [8]
Joel Gould was the director of emerging technologies at Dragon Systems. Gould was the principal architect and lead engineer for the development of Dragon NaturallyOrganized (1.0), Dragon NaturallySpeaking Mobile Organizer (3.52), Dragon NaturallySpeaking (1.0 through 2.02), and DragonDictate for Windows (1.0). Gould also designed the tutorials in both DragonDictate for DOS version 2.0 and Dragon Talk.[ citation needed ]
The company was then purchased in June 2000 by Lernout & Hauspie, a Belgium-based corporation that was subsequently found to have been perpetrating financial fraud. [9] Following the all-share deal advised by Goldman Sachs, Lernout & Hauspie declared bankruptcy in November 2000. The deal was not originally supposed to be all stock and the unavailability of the Goldman Sachs team to advise concerning the change in terms was one of the grounds of the Bakers' subsequent lawsuit. The Bakers had received stock worth hundreds of millions of US dollars, but were only able to sell a few million dollars' worth before the stock lost all its value as a result of the accounting fraud. The Bakers sued Goldman Sachs for negligence, intentional misrepresentation and breach of fiduciary duty, which in January 2013 led to a 23-day trial in Boston. The jury cleared Goldman Sachs of all charges. [10] Following the bankruptcy of Lernout & Hauspie, the rights to the Dragon product line were acquired by ScanSoft of Burlington, Massachusetts, also a Goldman Sachs client. In 2005 ScanSoft launched a de facto acquisition of Nuance Communications, and rebranded itself as Nuance. [11]
As of 2012, LG Smart TVs included voice recognition feature powered by the same speech engine as Dragon NaturallySpeaking. [12] In 2014, following the discontinuation of DragonDictate for Mac, a product dating back to Nuance's 2010 purchase of MacSpeech Dictate, NaturallySpeaking gained Mac compatibility, though Mac support was later terminated in 2018. [13]
In 2021, Microsoft announced plans to acquire Nuance, and therefore Dragon NaturallySpeaking. [14] The acquisition completed in March 2022. [15] [16]
Dragon Naturally Speaking Version | Release date | Editions | Operating Systems Supported |
---|---|---|---|
1.0 | April 1997 | Personal | Windows 95, NT 4.0. |
2.0 | November 1997 | Standard, Preferred, Deluxe | Windows 95, NT 4.0 |
3.0 | October 1998 | Point & Speak, Standard, Preferred, Professional (with optional Legal and Medical add-on products) | Windows 95, 98, NT 4.0. |
4.0 | August 4, 1999 | Essentials, Standard, Preferred, Professional, Legal, Medical, Mobile | Windows 95, 98, NT 4.0 SP3+. |
5.0 | August 2000 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98, Me, NT 4.0 SP6+, 2000. |
6.0 | November 15, 2001 | Essentials, Standard, Preferred, Professional, Legal, Medical | |
7.0 | March 2003 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98SE, Me, NT4 SP6+, 2000, XP. |
8.0 | November 2004 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows Me (Only Standard and Preferred editions), Windows 2000 SP4+, Windows XP SP1+. |
9.0 | July 2006 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server, | Windows 2000 SP4+, XP SP1+. |
9.5 | January 2007 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server | Windows 2000 SP4+, XP SP1+, Vista (32-bit). |
10.0 | August 7, 2008 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit). Server 2003. |
10.1 | March 2009 | Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit and 64-bit), Windows 7 (32 and 64-bit). Server 2003. |
11.0 | August 2010 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
11.0 | 2011 | SDK client (DSC), SDK server (DSS) | Windows XP SP2+ (32-bit only), Vista SP1+ (32-bit and 64-bit), Windows 7 (32-bit and 64-bit), Windows Server 2003 and 2008, SP1, SP2 and R2 (32-bit and 64-bit) |
11.5 | June 2011 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
11.0 | August 2011 | Medical (Dragon Medical Practice Edition) | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
12.0 | October 2012 | Home, Premium, Professional, Legal | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
12.5 | February 2013 | Home, Premium, Professional, Legal | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
12 | June 2013 | Medical (Dragon Medical Practice Edition 2) | Windows XP SP3+ (32-bit), Vista SP2+ (32-bit and 64-bit), 7 (32 and 64-bit), 8 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. |
13 | August 2014 | Home, Premium, Professional, and Legal. | 7 (32 and 64-bit), 8.1 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+ |
13 | September 2015 | Medical (UK, French, German) (Dragon Medical Practice Edition 3) | 7 (32 and 64-bit), 8.1 (32 and 64-bit), 10 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+ |
14 | September 2015 | Professional (individual, and Group) | 7 (32 and 64-bit), 8.1 (32 and 64-bit), 10 (32 and 64-bit). Server 2008, Server 2008 R2, Server 2012. Mac OS X 10.6+. Server 2008, Server 2008 R2, Server 2012. |
15 | August 16, 2016 | Dragon Professional Individual; Dragon Legal Individual; Dragon Professional Individual for Mac (version 6) | 7, 8.1, 10 (32- and 64-bit); Server 2008 R2, Server 2012 R2. Mac OS X 0.11, macOS 10.12 |
15 | May 1, 2017 | Dragon Professional Group (Languages: English US and German only) | 7, 8.1, and 10, 32-bit and 64-bit |
15 | January 22, 2018 | Dragon Medical Practice Edition 4 (Languages: English US) | |
16 | February 28, 2023 | Dragon Professional | Windows 10, 11, Server 2016, 2019 and 2022 |
Dragon NaturallySpeaking 12 is available in the following languages; UK English, US English, French, German, Italian, Spanish, Dutch, and Japanese (aka "Dragon Speech 11" in Japan).
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.
Dictaphone was an American company founded by Alexander Graham Bell that produced dictation machines. It is now a division of Nuance Communications, based in Burlington, Massachusetts.
The Office Assistant is a discontinued intelligent user interface for Microsoft Office that assisted users by way of an interactive animated character which interfaced with the Office help content. It was included in Microsoft Office for Windows, in Microsoft Publisher and Microsoft Project, Microsoft FrontPage, and Microsoft Office for Mac. The Office Assistant used technology initially from Microsoft Bob and later Microsoft Agent, offering advice based on Bayesian algorithms.
IBM ViaVoice was a range of language-specific continuous speech recognition software products offered by IBM. The current version is designed primarily for use in embedded devices. The latest stable version of IBM Via Voice was 9.0 and was able to transfer text directly into Word.
BonziBuddy was a freeware desktop virtual assistant created by Joe and Jay Bonzi. Upon a user's choice, it would share jokes and facts, manage downloads, sing songs, and talk, among other functions, as it used Microsoft Agent.
Lernout & Hauspie Speech Products (L&H) was a Belgium-based speech recognition technology company, founded by Jo Lernout and Pol Hauspie, that went bankrupt in 2001 because of a fraud engineered by the management. The company was based in Ypres, Flanders, in what was later called Flanders Language Valley.
Nuance Communications, Inc. is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, that markets speech recognition and artificial intelligence software.
MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.
A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.
DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows, and was replaced by Dragon NaturallySpeaking for Windows. It was later acquired by Nuance Communications. Dragon Dictate for Mac 2.0 is supported only on Mac OS X 10.6. Nuance's other products for Mac include MacSpeech Scribe.
iListen, developed by MacSpeech, is a speech recognition program for the Apple Macintosh. In 2006, iListen was the only third-party software that allowed inputting text using one's voice that works on newer Macintosh models. Its competitors were Apple's own speech recognition software ; Dragon Naturally Speaking by Nuance, running under Windows virtualization software such as Parallels Desktop for Mac or VMware Fusion; and the discontinued speech recognition program ViaVoice by Nuance/IBM.
As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.
Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks.
Microsoft Translator is a multilingual machine translation cloud service provided by Microsoft. Microsoft Translator is a part of Microsoft Cognitive Services and integrated across multiple consumer, developer, and enterprise products, including Bing, Microsoft Office, SharePoint, Microsoft Edge, Microsoft Lync, Yammer, Skype Translator, Visual Studio, and Microsoft Translator apps for Windows, Windows Phone, iPhone and Apple Watch, and Android phone and Android Wear.
The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions of Microsoft text-to-speech voices. Client voices are shipped with Windows operating systems; server voices are available for download for use with server applications such as Speech Server, Lync etc. for both Windows client and server platforms, and mobile voices are often shipped with more recent versions.
The Voice Navigator was the first voice recognition device for command and control of a graphical user interface. The system was developed by Articulate Systems, Inc. originally designed for the Apple Macintosh Plus and released in 1989. Subsequent versions were created for Microsoft Windows. Articulate Systems, Inc. was acquired by Dragon Systems in 1998.
MacSpeech Scribe is speech recognition software for Mac OS X designed specifically for transcription of recorded voice dictation. It runs on Mac OS X 10.6 Snow Leopard. The software transcribes dictation recorded by an individual speaker. Typically, the speaker will record their dictation using a digital recording device such as a handheld digital recorder, mobile smartphone, or desktop or laptop computer with a suitable microphone. MacSpeech Scribe supports specific audio file formats for recorded dictation: .aif, .aiff, .wav, .mp4, .m4a, and .m4v.
Dragon Dictation started as speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications, and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.
Michael Phillips is the CEO and co-founder of Sense Labs and a pioneer in machine learning, including mobile speech recognition and text-to-speech technology.