Voice Navigator

Last updated

The Voice Navigator was the first voice recognition device for command and control of a graphical user interface (Patent no. 5377303 [1] ). The system was developed by Articulate Systems, Inc. (founded in 1986) originally designed for the Apple Macintosh Plus and released in 1989. [2] Subsequent versions were created for Microsoft Windows. Articulate Systems, Inc. was acquired by Dragon Systems in 1998.

Contents

The original system included both hardware and software. It was an elegant black base with a slick microphone. A software-only version was introduced in 1992 for computers with built-in microphone and adequate microprocessor (Mac IIsi, [3] Mac Quadra AV).

The hardware consisted of a TMS320 digital signal processor, a Rockwell fax modem and a SCSI interface as well as a headset microphone.

The software consisted of Dragon Systems [4] (acquired by Nuance) speaker dependent, discrete utterance, voice recognition driver and Articulate Systems patented voice control technology.

The software enabled voice control of any Macintosh application using context dependent synchronised grammars derived from the current processes and operating system data structures (menus, windows, controls) and events (mouse, key and AppleEvents). The system recognised spoken utterances and posted corresponding system events.

The system was designed to be extensible using a plug-in architecture (voice extensions) for custom functionality and included a software developer kit (SDK) for third-party applications.

Products

Voice Navigator - The original version with fax modem capability.

Voice Navigator II - A low cost version with desktop microphone.

Voice Navigator SW - A software-only version for Macs with built-in microphones.

Voice Navigator SDK - The software developer kit for third-party applications.

Derivatives

PowerSecretary [5] - Voice command and control with dictation capability.

Credits

The core engineering team included: Tom Firman, Tim Morgan, [6] Dave Hoch, Dan Zimmerman, David Kelts [7] and Glenn McElhoe. Peter Durlach [8] was responsible for product management and marketing. Dana Morgan managed customer service, technical support and quality control. Ivan Mimica was the visionary CEO.

Related Research Articles

The Telephony Application Programming Interface (TAPI) is a Microsoft Windows API, which provides computer telephony integration and enables PCs running Microsoft Windows to use telephone services. Different versions of TAPI are available on different versions of Windows. TAPI allows applications to control telephony functions between a computer and telephone network for data, fax, and voice calls. It includes basic functions, such as dialing, answering, and hanging up a call. It also supports supplementary functions, such as hold, transfer, conference, and call park found in PBX, ISDN, and other telephone systems.

In computing, cross-platform software is computer software that is designed to work in several computing platforms. Some cross-platform software requires a separate build for each platform, but some can be directly run on any platform without special preparation, being written in an interpreted language or compiled to portable bytecode for which the interpreters or run-time packages are common or standard components of all supported platforms.

A software development kit (SDK) is a collection of software development tools in one installable package. They facilitate the creation of applications by having a compiler, debugger and sometimes a software framework. They are normally specific to a hardware platform and operating system combination. To create applications with advanced functionalities such as advertisements, push notifications, etc; most application software developers use specific software development kits.

GeoPort is a serial data system used on some models of the Apple Macintosh that could be externally clocked to run at a 2 megabit per second data rate. GeoPort slightly modified the existing Mac serial port pins to allow the computer's internal DSP hardware or software to send data that, when passed to a digital-to-analog converter, emulated various devices such as modems and fax machines. GeoPort could be found on late-model 68K-based machines as well as many pre-USB Power Macintosh models and PiPPiN. Some later Macintosh models also included an internal GeoPort via an internal connector on the Communications Slot. Apple GeoPort technology is now obsolete, and modem support is typically offered through USB.

<span class="mw-page-title-main">A/UX</span> Early Unix-based operating system from Apple Computer

A/UX is a Unix-based operating system from Apple Computer for Macintosh computers, integrated with System 7's graphical interface and application compatibility. It is Apple's first official Unix-based operating system, launched in 1988 and discontinued in 1995 with version 3.1.1. A/UX requires select 68k-based Macintosh models with an FPU and a paged memory management unit (PMMU), including the Macintosh II, SE/30, Quadra, and Centris series.

<span class="mw-page-title-main">Virtual Studio Technology</span> Audio plug-in software interface

Virtual Studio Technology (VST) is an audio plug-in software interface that integrates software synthesizers and effects units into digital audio workstations. VST and similar technologies use digital signal processing to simulate traditional recording studio hardware in software. Thousands of plugins exist, both commercial and freeware, and many audio applications support VST under license from its creator, Steinberg.

PlainTalk is the collective name for several speech synthesis (MacinTalk) and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some 68k Macintoshes.

Conexant Systems, Inc. was an American-based software developer and fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell International, before being spun off as a public company. Conexant itself then spun off several business units, creating independent public companies which included Skyworks Solutions and Mindspeed Technologies.

<span class="mw-page-title-main">MacSpeech</span> Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

<span class="mw-page-title-main">Dragon NaturallySpeaking</span> Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

A voice-user interface (VUI) makes spoken human interaction with computers possible, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

The Apple USB Modem is a combined 56 kbit/s data modem and 14.4 kbit/s fax external USB modem introduced by Apple Inc. after the internal 56k modem was dropped on the October 12, 2005 iMac G5 revision. While it looks similar, it should not be confused with Apple's optional USB Ethernet Adapter accessory, available for its MacBook Air and MacBook Pro Retina range of laptops since 2008.

The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows applications. To date, a number of versions of the API have been released, which have shipped either as part of a Speech SDK or as part of the Windows OS itself. Applications that use SAPI include Microsoft Office, Microsoft Agent and Microsoft Speech Server.

Mobile app development is the act or process by which a mobile app is developed for one or more mobile devices, which can include personal digital assistants (PDA), enterprise digital assistants (EDA), or mobile phones. Such software applications are specifically designed to run on mobile devices, taking numerous hardware constraints into consideration. Common constraints include CPU architecture and speeds, available memory (RAM), limited data storage capacities, and considerable variation in displays and input methods. These applications can be pre-installed on phones during manufacturing or delivered as web applications, using server-side or client-side processing to provide an "application-like" experience within a web browser.

<span class="mw-page-title-main">Macintosh Quadra 840AV</span> Personal computer by Apple Computer

The Macintosh Quadra 840AV is a personal computer designed, manufactured, and sold by Apple Computer from July 1993 to July 1994. It was introduced alongside the Centris 660AV, where "AV" signifies audiovisual capabilities, such as video input and output, telecommunications, speech recognition, and enhanced audio. The 840AV has the same mini-tower form factor as the Quadra 800, with a faster Motorola 68040 processor.

<span class="mw-page-title-main">Apple Pippin</span> 1990s video game console platform

The Apple Pippin is a defunct open multimedia technology platform, designed by Apple Computer, and marketed as PiPP!N. According to Apple, Pippin was directed at the home market as "an integral part of the consumer audiovisual, stereo, and television environment."

<span class="mw-page-title-main">Modem</span> Device that modulates an analog carrier signal to encode digital information

A modulator-demodulator or modem is a computer hardware device that converts data from a digital format into a format suitable for an analog transmission medium such as telephone or radio. A modem transmits data by modulating one or more carrier wave signals to encode digital information, while the receiver demodulates the signal to recreate the original digital information. The goal is to produce a signal that can be transmitted easily and decoded reliably. Modems can be used with almost any means of transmitting analog signals, from light-emitting diodes to radio.

The MacintoshCommunications Toolbox, generally shortened to CommToolbox or CTB, was a suite of application programming interfaces, libraries and dynamically loaded code modules for the classic Mac OS that implemented a wide variety of serial and network communication protocols, as well as file transfer protocols and terminal emulations.

<span class="mw-page-title-main">Classic Mac OS</span> Original operating system of Apple Mac (1984–2001)

Mac OS is the series of operating systems developed for the Macintosh family of personal computers by Apple Computer from 1984 to 2001, starting with System 1 and ending with Mac OS 9. The Macintosh operating system is credited with having popularized the graphical user interface concept. It was included with every Macintosh that was sold during the era in which it was developed, and many updates to the system software were done in conjunction with the introduction of new Macintosh systems.

References

  1. "Voice Navigator Patent".
  2. InfoWorld Announcement. 18 July 1988.
  3. "Macintosh IIsi Compatibility".
  4. "Dragon Systems history".
  5. "PowerSecretary Announcement".
  6. "Tim Morgan". LinkedIn.
  7. "David Kelts". LinkedIn.
  8. "Peter Durlach". LinkedIn.