DragonDictate

Last updated

DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows, and was replaced by Dragon NaturallySpeaking for Windows. It was later acquired by Nuance Communications. Dragon Dictate for Mac 2.0 (originally named MacSpeech Dictate [1] ) is supported only on Mac OS X 10.6 (Snow Leopard). Nuance's other products for Mac include MacSpeech Scribe.

Contents

Original DragonDictate

DragonDictate for Windows was the original speech recognition application from Dragon Systems and used discrete speech where the user must pause between speaking each word. The first version, 1.0 was available only through a few distribution and support partners. It included a Shure cardioid microphone headset. Later it was replaced by Dragon NaturallySpeaking, which allows continuous speech recognition and correction and training of words via the keyboard. NaturallySpeaking remains a Windows-only program, and since 2016 distributes in Version 15. DragonDictate for Windows is still available but has not been updated since Windows 98 was the current operating system.

Dragon Dictate for Mac

Dragon Dictate for Mac 2.0, an upgrade for MacSpeech Dictate, was announced on September 20, 2010 by Nuance Communications, the developer of MacSpeech products. The upgrade incorporates some of the features of NaturallySpeaking into the MacSpeech software. Dragon Dictate for Mac lacks other NaturallySpeaking features, such as training mis-recognized words by simply re-typing them using the keyboard. An early review by David Pogue notes,

I’m thrilled about the power, the control, the speed and the accuracy of Dragon Dictate. It does, however, have some room for improvement.

For example, in the dictation software world, teaching the software to know its location in your text document is a huge challenge. If you never touch the mouse, the program always knows where it is in the text — because it has deposited all that text itself.

But if you click to edit somewhere, it’s blind. It no longer knows where it is in the document.

In Windows, Nuance has used some clever tricks to overcome this problem in the most important programs, like Word and Outlook. On the Mac, however, the program has no idea what you’ve done manually, by clicking. So you can say something like “select fishmonger,” and the program correctly selects that word. But if you then say “italicize that” or “capitalize that,” the program operates on the wrong words, italicizing or capping something a mile away from the selection. Bizarre.

(This problem doesn’t happen in TextEdit or Dictate’s own included word processor.) [2]

In October 2018 Nuance announced that it was dropping Macintosh support for its products. [3]

See also

Notable users

Peter David - American writer of comic books, novels, television, movies and video games. David began using DragonDictate following his stroke in December 2012. [4] [5]

Related Research Articles

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech-to-text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

<span class="mw-page-title-main">Dictaphone</span> American producer of dictation machines

Dictaphone was an American company founded by Alexander Graham Bell that produced dictation machines. It is now a division of Nuance Communications, based in Burlington, Massachusetts.

IBM ViaVoice was a range of language-specific continuous speech recognition software products offered by IBM. The current version is designed primarily for use in embedded devices. The latest stable version of IBM Via Voice was 9.0 and was able to transfer text directly into Microsoft Word.

PlainTalk is the collective name for several speech synthesis (MacinTalk) and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some 68k Macintoshes.

<span class="mw-page-title-main">Microsoft Office XP</span> Version of Microsoft Office suite

Microsoft Office XP is an office suite which was officially revealed in July 2000 by Microsoft for the Windows operating system. Office XP was released to manufacturing on March 5, 2001, and was later made available to retail on May 31, 2001. A Mac OS X equivalent, Microsoft Office v. X was released on November 19, 2001.

<span class="mw-page-title-main">MacSpeech</span> Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

<span class="mw-page-title-main">Dragon NaturallySpeaking</span> Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

Medical transcription, also known as MT, is an allied health profession dealing with the process of transcribing voice-recorded medical reports that are dictated by physicians, nurses and other healthcare practitioners. Medical reports can be voice files, notes taken during a lecture, or other spoken material. These are dictated over the phone or uploaded digitally via the Internet or through smart phone apps.

iListen, developed by MacSpeech, is a speech recognition program for the Apple Macintosh. In 2006, iListen was the only third-party software that allowed inputting text using one's voice that works on newer Macintosh models. Its competitors were Apple's own speech recognition software ; Dragon Naturally Speaking by Nuance, running under Windows virtualization software such as Parallels Desktop for Mac or VMware Fusion; and the discontinued speech recognition program ViaVoice by Nuance/IBM.

As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.

<span class="mw-page-title-main">Windows Speech Recognition</span> Speech recognition software

Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks.

Swype was a virtual keyboard for touchscreen smartphones and tablets originally developed by Swype Inc., founded in 2002, where the user enters words by sliding a finger or stylus from the first letter of a word to its last letter, lifting only between words. It uses error-correction algorithms and a language model to guess the intended word. It also includes a predictive text system, handwriting and speech recognition support. Swype was first commercially available on the Samsung Omnia II running Windows Mobile, and was originally pre-loaded on specific devices.

The Voice Navigator was the first voice recognition device for command and control of a graphical user interface. The system was developed by Articulate Systems, Inc. originally designed for the Apple Macintosh Plus and released in 1989. Subsequent versions were created for Microsoft Windows. Articulate Systems, Inc. was acquired by Dragon Systems in 1998.

MacSpeech Scribe is speech recognition software for Mac OS X designed specifically for transcription of recorded voice dictation. It runs on Mac OS X 10.6 Snow Leopard. The software transcribes dictation recorded by an individual speaker. Typically, the speaker will record their dictation using a digital recording device such as a handheld digital recorder, mobile smartphone, or desktop or laptop computer with a suitable microphone. MacSpeech Scribe supports specific audio file formats for recorded dictation: .aif, .aiff, .wav, .mp4, .m4a, and .m4v.

Dragon Dictation started as speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications, and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.

<span class="mw-page-title-main">WordQ+SpeakQ</span>

WordQ® is an assistive technology software developed by Quillsoft Ltd. and Holland Bloorview Kids Rehabilitation Hospital. WordQ's main purpose is helping individuals who struggle with reading and writing. Often that struggle is due to physical disability or neurodivergence such as Dyslexia or ADHD. WordQ uses intelligent word prediction to suggest words that the user is typing in as few as 1.3 keystrokes. This helps with spelling, and reduces the amount of physical movements a user has to make when typing. It also uses high-quality Acapela text-to-speech voices to read back text that the user has entered, allowing for proofreading and editing. WordQs speech recognition feature, SpeakQ®, is included in WordQ Desktop for Windows OS. It allows users to switch between speaking and typing while writing. WordQ Desktop is available for Windows and Mac OS, iWordQ is available for iPadOS, and WordQ Chrome for the Chrome browser is slated for release in January 2022.

<span class="mw-page-title-main">Braina</span> Intelligent personal assistant & dictation software

Braina is a virtual assistant and speech-to-text dictation application for Microsoft Windows developed by Brainasoft. Braina uses natural language interface, speech synthesis, and speech recognition technology to interact with its users and allows them to use natural language sentences to perform various tasks on a computer. The name Braina is a short form of "Brain Artificial".

Michael Phillips is the CEO and co-founder of Sense Labs and a pioneer in machine learning, including mobile speech recognition and text-to-speech technology.

References