Developer(s) | Plogue Art et Technologie, Inc. |
---|---|
Initial release | 14 January 2015 [1] |
Stable release | 1.777 / 17 May 2023 [2] |
Operating system | Windows 7 or later, macOS 10.11 or later [3] |
Available in | English, Japanese |
Type | Voice synthesizer |
License | Proprietary |
Website | plogue |
Chipspeech is a singing vocal synthesizer software application and plugin created by Plogue that recreates the vocals of several 1980s speech synthesis chips from early home computers and video games.
The software is used to create vocals for music. Chipspeech is designed to produce vintage-style vocals like those of synthesizers used by the music industry in the 1980s, with a cut-off date of 1989 for the technology it emulates. [4] The vocals are therefore not meant to sound realistic and are better suited to sound experimentation. It works in a text-to-speech fashion: users type in the lyrics and receive instant playback, a capability beyond that of the original sound chips on which the vocals are based. The software is as simple to use as Vocaloid. Though English and Japanese come as standard, other languages can be produced by direct entry of syllables. Although the vocals can approach a human quality, the results always sound machine-like rather than human. [5] It supports several synthesis methods, or re-samplers. [6] In addition, version 1.032 of the software added a new "Speak and Spell" program, which introduced the circuit bending feature. [7]
Chipspeech was created as a result of research carried out for Chipsounds by Plogue in the 2000s. David Viens would often collect sound chips even when there was no need for them. This obsession eventually led to the creation of the Chipspeech software, after he had spent years hacking, building protoboards, probing, and reverse engineering the speech chips. He noted that the software's main goal was to be a singing emulator and not text-to-speech software. The source data of each vocal is 8 kHz or 10 kHz. Despite all their effort, the project came to a halt. Hubert Lamontagne joined Plogue with knowledge of phonetics and digital signal processing; he took an interest in creating a vintage-sounding synthesizer and designed it to work as more than a sound library. [5]
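The 8 kHz and 10 kHz figures can be put in context with a little arithmetic. The sketch below is purely illustrative and assumes the figures are sampling rates; the function name and constants are hypothetical and are not part of Chipspeech. It applies the Nyquist limit (a sampled signal can only represent frequencies up to half its sampling rate) and includes the compact disc rate of 44.1 kHz for comparison.

```python
# Illustrative sketch only: names here are hypothetical, not Chipspeech APIs.
# Assumption: the 8 kHz and 10 kHz figures are sampling rates.

def nyquist_limit_hz(sample_rate_hz: float) -> float:
    """Return the highest frequency representable at the given sampling rate."""
    return sample_rate_hz / 2.0

SOURCE_RATES_HZ = (8_000, 10_000)  # figures given for the source data
CD_RATE_HZ = 44_100                # compact disc rate, for comparison

for rate in (*SOURCE_RATES_HZ, CD_RATE_HZ):
    limit = nyquist_limit_hz(rate)
    print(f"{rate} Hz sampling -> content up to {limit:.0f} Hz")
```

Under that assumption, the source data carries no content above 4 kHz or 5 kHz respectively, which is consistent with the deliberately vintage character of the vocals.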
The software originally came with seven "characters" upon purchase; more vocals have been added since and continue to be added. Each character comes with its own backstory and is based on a sound synthesizer. Recreation of these voices was done with permission from their respective license holders. Plogue itself gained the rights to the speech data from three TI-99/4A games (Alpiner, Parsec and Moon Mine) and to the internal vocabulary of the TI Speech Device. The process of gaining the rights to the vocals took over 10 years, as the company did not want to disrespect the copyright holders, even when it met with issues such as a license holder having gone bankrupt. While the technology was easy to emulate, the data needed for the emulation was not. [5]
In January 2016, Plogue announced that Hubert Lamontagne had found a way to improve quality. [8] On 9 February, version 1.066 was released, fixing bugs with Deeklatt and Otto Mozer. Voice improvements to Dandy 704 and Bert Gotrax were scheduled for the next release and arrived in version 1.072. [9] [10] Some vocals, such as Dandy 704, are limited in how far they can be improved. [11] It was also announced that Chipspeech would gain the ability to talk as well as sing in its next major update. [12] Chipspeech was also exported to Japan in June 2016. [13]
Version 1.5 was released on 16 September 2016, adding talk capabilities, a growl adjustment and two new vocals, "Rotten.ST" and "CiderTalk'84", based on 16-bit era vocals.
In 2017, it was announced that the Voder and Software Automatic Mouth would be added to the software later that year. [14]
An official album was created featuring the software. The album is titled "chipspeech AUTOMATE SONGS .01" and includes a cover of the song Stakker Humanoid using Otto Mozer, whose vocal is an emulation of the same synthesizer used for the samples taken from the arcade game Berzerk. [15]
The vocals are split between a number of characters. In addition, Daisy from Alter/Ego could be imported into the software.
Reception of the software was mostly positive. It won three Computer Music awards: Editor's Choice, Performance and Innovation. The MusicRadar review described the software as a polished product and as "tons of fun to use". [30]
AskAudio, in its "Voice of the Machines" review, focused on the fact that, even with the rise of auto-tuning software, a human is always required, whereas Chipspeech allowed a nostalgic approach to vocal synthesis, with its vocals coming purely from a computer. It listed the positives of the software as "Incredibly unique, fairly easy to use, sounds excellent, affordable" but noted that its main weakness was how heavily it strained the CPU. [31]
CDM, which had been given exclusive early access to the software, also highlighted how "boring" modern synthesizers had become and focused on the "fun" that the software provided. One of the merits it highlighted was how rare some of the historical chips the software aimed to recreate had become. [5]
In August 2016, Chipspeech topped the virtual instrument top 25 rankings at Sonicwire, which is owned by Crypton Future Media, beating the company's own Vocaloid products such as Hatsune Miku, which normally dominated the rankings. [32]