MacSpeech Scribe


MacSpeech Scribe is speech recognition software for Mac OS X designed specifically for transcription of recorded voice dictation. It runs on Mac OS X 10.6 Snow Leopard. The software transcribes dictation recorded by an individual speaker. Typically, the speaker will record their dictation using a digital recording device such as a handheld digital recorder, mobile smartphone (e.g. iPhone), or desktop or laptop computer with a suitable microphone. MacSpeech Scribe supports specific audio file formats for recorded dictation: .aif, .aiff, .wav, .mp4, .m4a, and .m4v.
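The format list above can be expressed as a simple pre-flight check before handing a recording to a transcription tool. The following is a minimal sketch and not part of MacSpeech Scribe: the function name `is_supported_recording` is hypothetical, and matching extensions case-insensitively is an assumption. A matching extension also does not guarantee that the audio inside the file is encoded in a way the software can read.

```python
from pathlib import Path

# Extensions listed in the article as supported for recorded dictation.
SUPPORTED_EXTENSIONS = {".aif", ".aiff", ".wav", ".mp4", ".m4a", ".m4v"}


def is_supported_recording(filename: str) -> bool:
    """Return True if the file name ends in one of the listed extensions.

    Hypothetical helper: the comparison is case-insensitive by assumption,
    and only the file name is inspected, not the audio data itself.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ["memo.WAV", "interview.m4a", "notes.mp3"]:
        print(name, is_supported_recording(name))
```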

MacSpeech Scribe was originally developed by MacSpeech, Inc. and released on February 11, 2010, at Macworld Expo in San Francisco. The product is now owned by Nuance Communications, which acquired MacSpeech on February 16, 2010. Nuance is the developer of other speech recognition products, including Dragon NaturallySpeaking for Windows, Dragon Dictate for Mac (formerly "MacSpeech Dictate"), and the Dragon Dictation apps for iOS.

Jeffery Battersby of Macworld noted in his September 2010 review [1] of MacSpeech Scribe, v1.1:

Small foibles aside, MacSpeech Scribe is a powerful and intelligent tool for transcribing your recorded speech. A simple training process and access to a wide variety of standard audio formats mean that you’ll be moving your spoken text to the printed page in a matter of minutes and with a minimum of hassle. Scribe is the best, simplest way for you to get your spoken word to the printed page.

Release History

Version | Release Date | Changes [2]
1.0 | February 2010 | Initial release.
1.0.1 | June 2010 | Minor bug fixes.
1.1 | September 2010 | Minor bug fixes, interface enhancements, volume licensing support.

Related Research Articles

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

IBM ViaVoice was a range of language-specific continuous speech recognition software products offered by IBM. The current version is designed primarily for use in embedded devices. The latest stable version of IBM ViaVoice was 9.0, which could transfer text directly into Word.

PlainTalk is the collective name for several speech synthesis (MacinTalk) and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some 68k Macintoshes.

MacSpeech: Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

Dragon NaturallySpeaking: Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

DragonDictate, Dragon Dictate, or Dragon for Mac is proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows, and was replaced by Dragon NaturallySpeaking for Windows. It was later acquired by Nuance Communications. Dragon Dictate for Mac 2.0 is supported only on Mac OS X 10.6. Nuance's other products for Mac include MacSpeech Scribe.

Medical transcription, also known as MT, is an allied health profession dealing with the process of transcribing voice-recorded medical reports that are dictated by physicians, nurses and other healthcare practitioners. Medical reports can be voice files, notes taken during a lecture, or other spoken material. These are dictated over the phone or uploaded digitally via the Internet or through smart phone apps.

iListen, developed by MacSpeech, is a speech recognition program for the Apple Macintosh. In 2006, iListen was the only third-party software for entering text by voice that worked on newer Macintosh models. Its competitors were Apple's own speech recognition software; Dragon NaturallySpeaking by Nuance, running under Windows virtualization software such as Parallels Desktop for Mac or VMware Fusion; and the discontinued speech recognition program ViaVoice by Nuance/IBM.

Dictation machine: Device for recording human speech

A dictation machine is a sound recording device most commonly used to record speech for playback or to be typed into print. The category includes digital voice recorders and tape recorders.

Jott

Jott was a web-based voice-to-text transcription service which allowed its users to call a toll-free telephone number and speak for up to 30 seconds. The speech was then transcribed to text using a combination of computerized speech recognition software and human transcribers who worked in a "sterile environment which also includes medical dictation." The message could be sent back to oneself, turned into a reminder, sent to a contact or group, or sent to a third-party "Jott link" such as LiveJournal.

As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.

Windows Speech Recognition: Speech recognition software

Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks.

The Voice Navigator was the first voice recognition device for command and control of a graphical user interface. The system, developed by Articulate Systems, Inc., was originally designed for the Apple Macintosh Plus and released in 1989. Subsequent versions were created for Microsoft Windows. Articulate Systems, Inc. was acquired by Dragon Systems in 1998.

Dragon Dictation started as a speech recognition application for Apple's iOS platforms, including iPhone, iPod Touch and iPad. The app provided automatic speech-to-text capabilities. It was developed by Nuance Communications and released in December 2009 as a free app. It is now commonly found licensed in vehicle infotainment systems and healthcare equipment.

Speech Processing Solutions: Manufacturer of speech processing devices

Speech Processing Solutions is an international electronics company headquartered in Vienna, Austria. The company designs, develops, manufactures and markets speech processing devices, such as those used in digital dictation and speech recognition. Speech Processing Solutions was formed on 1 July 2012. Philips Speech Processing was part of the Philips Consumer Lifestyle sector. Speech Processing Solutions is now an official licensee of the Philips brand. The company has subsidiaries in the US, Canada, Australia, the United Kingdom, Belgium, France and Germany, and employs around 170 people worldwide.

Voice writing is a method used for court reporting, medical transcription, CART, and closed captioning. Using the voice writing method, a court reporter speaks directly into a stenomask or speech silencer, a hand-held mask containing one or two microphones and voice-dampening materials. As the reporter repeats the testimony into the recorder, the mask prevents the reporter from being heard during testimony.

Michael Phillips is the CEO and co-founder of Sense Labs and a pioneer in machine learning, including mobile speech recognition and text-to-speech technology.

InfraWare is an American technology company that focuses on speech transcription and other technologies for machine-assisted documentation. It has many users who work in the healthcare industry. It is headquartered in Terre Haute, Indiana.

References

  1. Battersby, Jeffery (2010-09-08). "MacSpeech Scribe 1.1". Macworld.com. Retrieved 2016-07-17.
  2. "Dragon NaturallySpeaking | Nuance". Macspeech.com. Retrieved 2016-07-17.