Predecessor | Federal Screw Works |
---|---|
Founded | 1971 Detroit, Michigan |
Fate | Merged with Vysion, Inc. (1992) |
Successors | |
Headquarters | Troy, Michigan |
Votrax International, Inc. (originally the Vocal division of Federal Screw Works), or just Votrax, was a speech synthesis company located in the Detroit, Michigan area from 1971 to 1996. [1] It began as a division of Federal Screw Works from 1971 to 1973. In 1974, it was given the Votrax name (taken from the name of its first commercial product, the model VS4 "Votrax") and moved to Troy, Michigan and, in 1980, split off of its parent company entirely and became Votrax International, Inc., which produced speech products up until 1984. [2] [3]
In 1984, the company restructured itself as a commercial phone/speech audio-response/auto-answering systems company after downsizing some of the staff. In 1987, Votrax merged with Vynet Corp., a voice-recognition prompt pioneer. [4] [5] It remained Votrax inc. until about 1992, when it was renamed to or otherwise merged with Vysion, Inc., a maker of security cameras and other related devices. [6] It remained 'Vysion Inc.' until the company declared bankruptcy in June 1994 following a court battle patent litigation loss against PATCO inc., [7] and from the remains of the old company, restructured itself as 'Maxxar' inc in 1995. [8] Maxxar was acquired by Open Solutions, LLC (then Open Solutions, Inc.), on February 24, 2004, [9] and Open Solutions, LLC was acquired by Fiserv, Inc. on January 14, 2013. [10] Maxxar owned the rights to the Votrax name, but the trademark lapsed on March 11, 2016. [11]
All the Votrax speech synthesizers owe their existence to the speech synthesizer design created in 1970 by Richard T. Gagnon. After coming up with a viable design scheme in his basement laboratory, Gagnon licensed it to Federal Screw Works, whom he was working for at the time, and they continued development of his original design. This became the "Vocal division of Federal Screw Works". [6]
In 1984, Votrax either declared bankruptcy or came close to doing so, and restructured itself as a commercial phone-interface provider, and hence produced no new consumer products. The later commercial-only products are not listed on the below list because literature about these seems to have been of limited distribution and has not yet been found. During the restructuring, much of the existing staff was downsized off, including Tim Gargagliano and Kathryn F. Gargagliano, who along with two other former Votrax employees, Art Velthoven and Dale McDaniel, started Artic Technologies in 1984. [2] [3] Tim and Kate had earlier written an article about the SC-01 for BYTE magazine. [12] In 1987, Votrax merged with Vynet Corp and the product lines of both companies were combined. [4] [5]
Votrax was responsible for designing and manufacturing several important early speech synthesizer back-ends, and several widely used integrated circuit phoneme synthesizers. Votrax produced speech backend modules and cards for various personal computers, and worked with the United States Naval Research Laboratory (NRL) to create an extensible speech frontend system. Votrax's speech technology was also used by 3rd parties in several arcade games, Gottlieb System 80 pinball machines, and talking terminals. [13] A Votrax synthesizer was used as part of the text-to-speech subsystem of the first generation Kurzweil Reading Machine for the Blind. [14]
During the 1970s, Votrax produced a series of discrete speech synthesizers, with epoxy-coated boards to thwart people copying their designs. In 1980, they designed and manufactured an integrated circuit speech synthesizer called the SC-01. This IC proved very popular in the third party market, and was produced until at least 1984.
It was succeeded by the somewhat more dynamic SC-02, also known as the SSI-263P. From the beginning of SC-02 production, Silicon Systems Inc. (now part of Texas Instruments) [13] manufactured the SC-02 chip under the product number SSI-263P, and this was apparently later adopted as the official name of the IC. Votrax continued to intermittently sell SC-01-A and SC-02 synthesis chips, and Personal Speech System text to speech units until at least October 1990. [15]
Since early in its life, Votrax specialized in making phoneme-based speech synthesizers and text-to-speech algorithms. The popular United States Naval Research Laboratory, or "NRL" text-to-phoneme algorithm was developed by a collaboration between Votrax and the NRL in 1973. This algorithm and variants of it were used on a number of text-to-speech devices, such as the Votrax Type 'N Talk, the Votrax Personal Speech System, and the General Instruments CTS256A-AL2 text-to-allophone chip. [6] A good rundown of the NRL algorithm can be found under reference. [16]
Votrax also supplied the SC-02 speech chip used in the amateur radio 'DOVE-OSCAR 17' or 'DOVE' Microsatellite. [17] [18] [19]
M. D. McIlroy used a "Votrax" branded "Federal Screw Works" synth, a single potted block, as the 'Screw Works' backend for the Unix 'speak' command on Unix V1/2/3/4 in 1972/1973. [20] Details of the algorithm were later (1974) described in his paper "Synthetic English speech by rule", Bell Telephone Laboratories Computer Science Technical Report #14, which is available on his personal site's publications page. [21]
The most typical commercial products are two boxes named "Type 'N Talk (TNT)" and "Personal Speech System (PSS)".
The TNT consists of a board with Motorola MC6802 microprocessor, a 4K ROM, some 74xx TTL chips, a Motorola 6850 (ACIA) for RS-232 communication, and an SC-01A synth chip. [22]
The PSS has 2K RAM chips and an 8K EPROM which holds "non-critical" data. Inside the epoxy-covered blackbox, there are four 74xx TTL chips, a Zilog Z80 microprocessor, two 8K EPROMs, and the synth chip. It communicates via RS-232. [23]
1971:
1972:
1973:
1973-1975:
1975:
1977:
1978
1978-1980:
1980:
1981:
1982:
1983:
1984:
1985:
1987:
1978:
1979:
1980:
1981:
1982:
1983:
1984-96:
Scott Adams, who pioneered text adventures for home computers, implemented support of Votrax speech in VIC-20 porting of some of his adventures, like Adventureland (VIC-1914) and Voodoo Castle (VIC-1918). [41]
Frequency modulation synthesis is a form of sound synthesis whereby the frequency of a waveform is changed by modulating its frequency with a modulator. The (instantaneous) frequency of an oscillator is altered in accordance with the amplitude of a modulating signal.
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech. The reverse process is speech recognition.
The Mockingboard is a sound card built by Sweet Micro Systems for the Apple II microcomputers. It improves on the Apple II's limited sound capabilities, as did other Apple II sound cards.
The Yamaha DX7 is a synthesizer manufactured by Yamaha Corporation from 1983 to 1989. It was the first successful digital synthesizer and is one of the best-selling synthesizers in history, selling more than 200,000 units.
PlainTalk is the collective name for several speech synthesis (MacinTalk) and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some 68k Macintoshes.
The Adaptive Multi-Rateaudio codec is an audio compression format optimized for speech coding. AMR is a multi-rate narrowband speech codec that encodes narrowband (200–3400 Hz) signals at variable bit rates ranging from 4.75 to 12.2 kbit/s with toll quality speech starting at 7.4 kbit/s.
The Texas Instruments LPC Speech Chips are a series of speech synthesizer digital signal processor integrated circuits created by Texas Instruments beginning in 1978. They continued to be developed and marketed for many years, though the speech department moved around several times within TI until finally dissolving in late 2001. The rights to the speech-specific subset of the MSP line, the last remaining line of TI speech products as of 2001, were sold to Sensory, Inc. in October 2001.
G.729 is a royalty-free narrow-band vocoder-based audio data compression algorithm using a frame length of 10 milliseconds. It is officially described as Coding of speech at 8 kbit/s using code-excited linear prediction speech coding (CS-ACELP), and was introduced in 1996. The wide-band extension of G.729 is called G.729.1, which equals G.729 Annex J.
Conexant Systems, Inc. was an American-based software developer and fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell International, before being spun off as a public company. Conexant itself then spun off several business units, creating independent public companies which included Skyworks Solutions and Mindspeed Technologies.
HERO is a series of several educational robots sold by Heathkit during the 1980s. The Heath Company began the HERO 1 project in October 1979, with the first release in 1982. Models include the HERO 1, HERO Jr., and HERO 2000. Heathkit supported the HERO robot line until 1995. The units were either sold as assembly kits or prebuilt by Heathkit for an additional fee. The 1980s models are considered collectors items, due to their rarity. For the most part, they cannot perform practical tasks, but are more geared toward entertainment and education above all.
DECtalk was a speech synthesizer and text-to-speech technology developed by Digital Equipment Corporation in 1983, based largely on the work of Dennis Klatt at MIT, whose source-filter algorithm was variously known as KlattTalk or MITalk.
ESS Technology Incorporated is a private manufacturer of computer multimedia products, Audio DACs and ADCs based in Fremont, California with R&D centers in Kelowna, British Columbia, Canada and Beijing, China. It was founded by Forrest Mozer in 1983. Robert L. Blair is the CEO and President of the company.
An automixer, or automatic microphone mixer, is a live sound mixing device that automatically reduces the strength of a microphone's audio signal when it is not being used.
Forrest S. Mozer is an American experimental physicist, inventor, and entrepreneur known best for his pioneering work on electric field measurements in space plasma and for development of solid state electronic speech synthesizers and speech recognizers.
Silicon Systems Inc. (SSi) was an American semiconductor company based in Tustin, California. The company manufactured mixed-signal integrated circuits and semiconductors for telecommunications and data storage.
speak was a Unix utility that used a predefined set of rules to turn a file of English text into phoneme data compatible with a Federal Screw Works model VS4 "Votrax" Speech Synthesizer. It was first included in Unix v3 and possibly later ones, with the OS-end support files and help files persisting until v6. As of late 2011, the original source code for speak, and portions of speak.m were discovered. At least three versions of the man page are known to still exist.
Loquendo was an Italian multinational computer software technology corporation, headquartered in Torino, Italy, that provides speech recognition, speech synthesis, speaker verification and identification applications. Loquendo, which was founded in 2001 under the Telecom Italia Lab, also had offices in United Kingdom, Spain, Germany, France, and the United States.
Yossi Matias is an Israeli-American computer scientist, entrepreneur and Google executive. Matias is Vice President, Engineering & Research at Google, and the founding managing director of Google's Center in Israel. He is on the leadership team of Google's Research, the global exec lead overseeing Google’s Health AI, Crisis Response and Climate AI efforts, and leads efforts in Conversational AI. For over a decade he was on the leadership team of Google’s Search, building and leading efforts including Google Trends, Google Autocomplete, Search Console, and Search experiences in weather, sports, dictionaries and more. In 2024 Matias move to Silicon Valley to head Google Research, the company’s global research activity.
Sensory, Inc. is an American company which develops software AI technologies for speech, sound and vision. It is based in Santa Clara, California.
Chipspeech is a singing vocal synthesizer software application and plugin created by Plogue that recreates the vocals of several 1980s speech synthesis chips from early home computers and video games.
{{cite web}}
: CS1 maint: archived copy as title (link){{cite web}}
: CS1 maint: unfit URL (link){{cite web}}
: CS1 maint: unfit URL (link)The first commercial mass-marketed communication aid with speech synthesis was probably the Handivoice from the Federal Screw Works and Phonic Ear (1978)
19xx VSK - on encapsulated circuit cards for personal computers (Radio Shack, Tandy Corp.) and products like Handivoice, (HC Electronics, Mill Valley CA)
{{cite web}}
: CS1 maint: unfit URL (link)