DragonDictate

Last updated

DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows, and was replaced by Dragon NaturallySpeaking for Windows. It was later acquired by Nuance Communications. Dragon Dictate for Mac 2.0 (originally named MacSpeech Dictate [1] ) is supported only on Mac OS X 10.6 (Snow Leopard). Nuance's other products for Mac include MacSpeech Scribe.

Contents

Original DragonDictate

DragonDictate for Windows was the original speech recognition application from Dragon Systems and used discrete speech where the user must pause between speaking each word. The first version, 1.0 was available only through a few distribution and support partners. It included a Shure cardioid microphone headset. Later it was replaced by Dragon NaturallySpeaking, which allows continuous speech recognition and correction and training of words via the keyboard. NaturallySpeaking remains a Windows-only program, and since 2016 distributes in Version 15. DragonDictate for Windows is still available but has not been updated since Windows 98 was the current operating system.

Dragon Dictate for Mac

Dragon Dictate for Mac 2.0, an upgrade for MacSpeech Dictate, was announced on September 20, 2010 by Nuance Communications, the developer of MacSpeech products. The upgrade incorporates some of the features of NaturallySpeaking into the MacSpeech software. Dragon Dictate for Mac lacks other NaturallySpeaking features, such as training mis-recognized words by simply re-typing them using the keyboard. An early review by David Pogue notes,

I’m thrilled about the power, the control, the speed and the accuracy of Dragon Dictate. It does, however, have some room for improvement.

For example, in the dictation software world, teaching the software to know its location in your text document is a huge challenge. If you never touch the mouse, the program always knows where it is in the text — because it has deposited all that text itself.

But if you click to edit somewhere, it’s blind. It no longer knows where it is in the document.

In Windows, Nuance has used some clever tricks to overcome this problem in the most important programs, like Word and Outlook. On the Mac, however, the program has no idea what you’ve done manually, by clicking. So you can say something like “select fishmonger,” and the program correctly selects that word. But if you then say “italicize that” or “capitalize that,” the program operates on the wrong words, italicizing or capping something a mile away from the selection. Bizarre.

(This problem doesn’t happen in TextEdit or Dictate’s own included word processor.) [2]

In October 2018 Nuance announced that it was dropping Macintosh support for its products. [3]

See also

Notable users

Peter David - American writer of comic books, novels, television, movies and video games. David began using DragonDictate following his stroke in December 2012. [4] [5]

Related Research Articles

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

<span class="mw-page-title-main">Dictaphone</span> American producer of dictation machines

Dictaphone was an American company founded by Alexander Graham Bell that produced dictation machines. It is now a division of Nuance Communications, based in Burlington, Massachusetts.

IBM ViaVoice was a range of language-specific continuous speech recognition software products offered by IBM. The current version is designed primarily for use in embedded devices. The latest stable version of IBM Via Voice was 9.0 and was able to transfer text directly into Word.

PlainTalk is the collective name for several speech synthesis (MacinTalk) and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some 68k Macintoshes.

<span class="mw-page-title-main">Microsoft Office XP</span> Version of Microsoft Office suite

Microsoft Office XP is an office suite which was officially revealed in July 2000 by Microsoft for the Windows operating system. Office XP was released to manufacturing on March 5, 2001, and was later made available to retail on May 31, 2001, less than five months prior to the release of Windows XP. It is the successor to Office 2000 and the predecessor of Office 2003. A Mac OS X equivalent, Microsoft Office v. X was released on November 19, 2001.

<span class="mw-page-title-main">MacSpeech</span> Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

<span class="mw-page-title-main">Dragon NaturallySpeaking</span> Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

A voice-user interface (VUI) makes spoken human interaction with computers possible, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

iListen, developed by MacSpeech, is a speech recognition program for the Apple Macintosh. In 2006, iListen was the only third-party software that allowed inputting text using one's voice that works on newer Macintosh models. Its competitors were Apple's own speech recognition software ; Dragon Naturally Speaking by Nuance, running under Windows virtualization software such as Parallels Desktop for Mac or VMware Fusion; and the discontinued speech recognition program ViaVoice by Nuance/IBM.

As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.

A speech-to-text reporter (STTR), also known as a captioner, is a person who listens to what is being said and inputs it, word for word, as properly written texts. Many captioners use tools which commonly converts verbally communicated information into written words to be composed as a text. The reproduced text can then be read by deaf or hard-of-hearing people, language learners, or people with auditory processing disabilities.

<span class="mw-page-title-main">Windows Speech Recognition</span> Speech recognition software

Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks. Hotkeys for this action is Windows logo key+Ctrl+S

Swype was a virtual keyboard for touchscreen smartphones and tablets originally developed by Swype Inc., founded in 2002, where the user enters words by sliding a finger or stylus from the first letter of a word to its last letter, lifting only between words. It uses error-correction algorithms and a language model to guess the intended word. It also includes a predictive text system, handwriting and speech recognition support. Swype was first commercially available on the Samsung Omnia II running Windows Mobile, and was originally pre-loaded on specific devices.

MacSpeech Scribe is speech recognition software for Mac OS X designed specifically for transcription of recorded voice dictation. It runs on Mac OS X 10.6 Snow Leopard. The software transcribes dictation recorded by an individual speaker. Typically the speaker will record their dictation using a digital recording device such as a handheld digital recorder, mobile smartphone, or desktop or laptop computer with a suitable microphone. MacSpeech Scribe supports specific audio file formats for recorded dictation: .aif, .aiff, .wav, .mp4, .m4a, and .m4v.

Dragon Dictation started as speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications, and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.

<span class="mw-page-title-main">WordQ+SpeakQ</span>

WordQ® is an assistive technology software developed by Quillsoft Ltd. and Holland Bloorview Kids Rehabilitation Hospital. WordQ's main purpose is helping individuals who struggle with reading and writing. Often that struggle is due to physical disability or neurodivergence such as Dyslexia or ADHD. WordQ uses intelligent word prediction to suggest words that the user is typing in as few as 1.3 keystrokes. This helps with spelling, and reduces the amount of physical movements a user has to make when typing. It also uses high-quality Acapela text-to-speech voices to read back text that the user has entered, allowing for proofreading and editing. WordQs speech recognition feature, SpeakQ®, is included in WordQ Desktop for Windows OS. It allows users to switch between speaking and typing while writing. WordQ Desktop is available for Windows and Mac OS, iWordQ is available for iPadOS, and WordQ Chrome for the Chrome browser is slated for release in January 2022.

Inputlog is one of the most used keyloggers. It enables researchers to observe the online writing process unobtrusively. It logs every input action that is used to produce a text, viz. keystrokes, mouse movements and clicks and speech input via Dragon Naturally Speaking (Nuance). The program also provides a timestamp and detailed information about the Windows environment that is activated. Researchers can download the program from the Inputlog website for free.

Voice writing is a method used for court reporting, medical transcription, CART, and closed captioning. Using the voice writing method, a court reporter speaks directly into a stenomask or speech silencer - a hand-held mask containing one or two microphones and voice-dampening materials. As the reporter repeats the testimony into the recorder, the mask prevents the reporter from being heard during testimony.

Michael Phillips is the CEO and co-founder of Sense Labs and a pioneer in machine learning, including mobile speech recognition and text-to-speech technology.

References