Developer(s) | VirSyn |
---|---|
Initial release | July 2004 |
Stable release | 2.1 / February 6, 2007 |
Operating system | Windows XP and later, OS X 10.5 or later |
Available in | English, German |
Type | Musical synthesizer application |
Website | virsyn |
Cantor was a vocal singing synthesizer software released four months after the original release of Vocaloid by the company VirSyn, and was based on the same idea of synthesizing the human voice. VirSyn released English and German versions of this software. Cantor 2 boasted a variety of voices from near-realistic sounding ones to highly expressive vocals and robotic voices.
Cantor was not based on singing samples, and its results were reproduced by a morphing additive synthesis engine derived from VirSyn's Cube software synthesizer. It is used to generate the 39 phonemes that VirSyn used to reproduce English speech or singing. Each phoneme is created by passing an additive sound source through a formant filter, which morphs between a start and an end state. These filter responses are fully editable: Up to six peaks and three troughs in the formant filter response can be specified as morph points. Cantor 2 offered 20 ready-to-use vocals in English and German and added many new voices on top of the original Cantor software, bringing the total to 50 voices.
The sound generator used a combination of additive synthesis and noise sculpting that it used specifically for the 50 voiced sounds provided by the software as set as a complete set for the unvoiced sounds. The concept of voiced and unvoiced sounds was complicated but was used to describe how Cantor was able to master its language capabilities of human speech. For voiced sounds, the additive synth controls the pitched component of the sound (vocal cords), whereas the noise synth controls the breath component (whisper). It controlled up to 256 partials. As the user went higher into the octaves, these became grouped for control. For those who had used other VirSyn's software, Cantor was familiar grounds and bore many things in common with past synthesizers VirSyn had produced.
Because of its design, it was more like a virtual instrument than a virtual singer. It never claimed to mimic a real singer's voice and was intended purely for special effects. Although it was complex, Cantor was considered a simple design overall and relatively easy to use for its purposes.
It hosted VST, AU and RTAS capabilities. By Cantor 2's release, midi file format was fully functional. It was able to work as a standalone software or as a plugin; there were slight differences between the software for both. It worked as a standalone software or plug-in and supported ReWire. Though it was released in German and English, with adjustments of the sound output it was possible to recreate vocal languages beyond this and mimic other languages.
Cantor was released after the original Vocaloid engine and was considered a suitable software to rival Yamaha's Vocaloid engine, then only known in the western hemisphere by the Vocaloids 'Leon', 'Lola' and 'Miriam'. Cantor reached a level of vocal synthesising that had not yet been reached.
A demo of the software was released. It required purchasing an elicence dongle to download the demo, as well as the full software if it was purchased electronically. [1] The final version, Cantor 2.1 was released on February 6, 2007. Even though updates ceased, the software was never removed from sale.
The album Light + Shade by Mike Oldfield featured both the Vocaloid 'Miriam' singing alongside the Cantor software in the song "Tears of an Angel."
Despite being a rival program to Vocaloid, it was able to be purchased on Crypton Future Media's website.
For the capabilities of what the software could do, the Cantor software was dubbed "the future of music." At the time of its release, Cantor 2 was considered ground breaking technology. Despite its capabilities, one of its let-downs was considered the high price for its contents in comparison to other software. The biggest criticism toward the software itself was its unintelligible results despite being a powerful tool and though improvements were made between Cantor and Cantor 2. It still lacked clarity which put it at a disadvantage against the more realistic sound of Vocaloid. The simple design of its interface despite the complexity of its capabilities was highly praised overall by reviewers.
Cantor was able to create a playground for experimental vocal sounds and give composers a tool for high levels of vocal affects and sounds. [2] However, Cantor and Vocaloid were based on the same concepts and ideas; they shared a number of similar designs. [3] It was unable to escape comparisons between itself and Vocaloid, although some musicians used both software at the time. [4]
Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech. The reverse process is speech recognition.
Digital music technology encompasses the use of digital instruments to produce, perform or record music. These instruments vary, including computers, electronic effects units, software, and digital audio equipment. Digital music technology is used in performance, playback, recording, composition, mixing, analysis and editing of music, by professions in all parts of the music industry.
Vocaloid is a singing voice synthesizer software product. Its signal processing part was developed through a joint research project between Yamaha Corporation and the Music Technology Group in Universitat Pompeu Fabra, Barcelona. The software was ultimately developed into the commercial product "Vocaloid" that was released in 2004.
The source–filter model represents speech as a combination of a sound source, such as the vocal cords, and a linear acoustic filter, the vocal tract. While only an approximation, the model is widely used in a number of applications such as speech synthesis and speech analysis because of its relative simplicity. It is also related to linear prediction. The development of the model is due, in large part, to the early work of Gunnar Fant, although others, notably Ken Stevens, have also contributed substantially to the models underlying acoustic analysis of speech and speech synthesis. Fant built off the work of Tsutomu Chiba and Masato Kajiyama, who first showed the relationship between a vowel's acoustic properties and the shape of the vocal tract.
The Yamaha FS1R is a sound synthesizer module, manufactured by the Yamaha Corporation from 1998 to 2000. Based on Formant synthesis, it also has FM synthesis capabilities similar to the DX range. Its editing involves 2,000+ parameters in any one 'performance', prompting the creation of a number of third party freeware programming applications. These applications provide the tools needed to program the synth which were missing when it was in production by Yamaha. The synth was discontinued after two years, probably in part due to its complexity, poor front-panel controls, brief manual and limited polyphony.
eSpeak is a free and open-source, cross-platform, compact, software speech synthesizer. It uses a formant synthesis method, providing many languages in a relatively small file size. eSpeakNG is a continuation of the original developer's project with more feedback from native speakers.
Gnuspeech is an extensible text-to-speech computer software package that produces artificial speech output based on real-time articulatory speech synthesis by rules. That is, it converts text strings into phonetic descriptions, aided by a pronouncing dictionary, letter-to-sound rules, and rhythm and intonation models; transforms the phonetic descriptions into parameters for a low-level articulatory speech synthesizer; uses these to drive an articulatory model of the human vocal tract producing an output suitable for the normal sound output devices used by various computer operating systems; and does this at the same or faster rate than the speech is spoken for adult speech.
UTAU is a Japanese singing synthesizer application created by Ameya/Ayame (飴屋/菖蒲). This program is similar to the VOCALOID software, with the difference being it is shareware instead of under a third party licensing.
Internet Co., Ltd. or Internet, is a software company based in Osaka, Japan. It is best known for the music sequencer Singer Song Writer and Niconico Movie Maker for Nico Nico Douga, a video sharing website. It also develops singing synthesizers using the Vocaloid 4 engine developed by Yamaha Corporation. In 2014, they were the second leading company in sound-related software in Japan, boasting a 14.0% share of the market.
Voiceroid is a speech synthesizer application developed by AH-Software and is designed for speech. It is only available in the Japanese language. Its name comes from the singing software Vocaloid, for which AH-Software also develops voicebanks. Both AH-Software's first Vocaloids and Voiceroids went on sale on December 4, 2009.
Vocaloid is a singing voice synthesizer and the first engine released in the Vocaloid series. It was succeeded by Vocaloid 2. This version was made to be able to sing both English and Japanese.
Vocaloid 2 is a singing voice synthesizer and the successor to the Vocaloid voice synthesizer application by Yamaha. Unlike the first engine, Vocaloid 2 based its output on vocal samples, rather than voice analysis. The synthesis engine and the user interface were completely revamped, with Japanese Vocaloids possessing a Japanese interface, as opposed to the previous version, which used English for both versions. It is noteworthy for introducing the popular character Hatsune Miku. It was succeeded by Vocaloid 3.
Vocaloid 3 is a singing voice synthesizer and successor to Vocaloid 2 in the Vocaloid series. This version of the software is a much more expansive version, containing many new features, three new languages and many more vocals than past software versions combined. It was succeeded by Vocaloid 4.
Vocaloid 4 is a singing voice synthesizer and successor to Vocaloid 3 in the Vocaloid series. It was succeeded by Vocaloid 5.
Harmor is a software synthesizer created by Image-Line Software for the music production program FL Studio. It is available as a demo version within the software; however, it must be purchased separately in order to save projects that contain Harmor instances. Harmor is an upgraded and more elaborate version of its predecessor, Harmless. It was originally released with a 32-bit engine and upgraded to a 64-bit engine in 2013.
Megpoid is a Vocaloid by Internet Co., Ltd. Her voice is sampled by Megumi Nakajima. The mascot of the software is called Gumi . She is also sometimes called Megpoid GUMI, or GUMI Megpoid.
Vocaloid 5 is a singing voice synthesizer and successor to Vocaloid 4 in the Vocaloid series. It was succeeded by Vocaloid 6.
Kasane Teto is a virtual singer software created on the Japanese textboard 2channel for April Fools' Day, 2008. Although the software was initially created as a hoax and did not exist, it later was actually produced and made compatible with singing voice synthesis software, allowing it to sing. Teto was introduced as a "diva born from a hoax".
The MicroFreak is a synthesizer manufactured by French music technology company Arturia and released in 2019. Described as a "Hybrid Experimental Synthesizer", it uses 18 digital sound engines (algorithms) to synthesize raw tones. This digital oscillator is then fed into a multi-mode analog filter, giving the MicroFreak its hybrid sounds.