Cantor (music software)

Last updated
Cantor
Developer(s) VirSyn
Initial releaseJuly 2004;20 years ago (2004-07)
Stable release
2.1 / February 6, 2007;17 years ago (2007-02-06)
Operating system Windows XP and later, OS X 10.5 or later
Available inEnglish, German
Type Musical synthesizer application
Website virsyn.com/en/E_Home/e_home.html

Cantor was a vocal singing synthesizer software released four months after the original release of Vocaloid by the company VirSyn, and was based on the same idea of synthesizing the human voice. VirSyn released English and German versions of this software. Cantor 2 boasted a variety of voices from near-realistic sounding ones to highly expressive vocals and robotic voices.

Contents

Technology

Cantor was not based on singing samples, and its results were reproduced by a morphing additive synthesis engine derived from VirSyn's Cube software synthesizer. It is used to generate the 39 phonemes that VirSyn used to reproduce English speech or singing. Each phoneme is created by passing an additive sound source through a formant filter, which morphs between a start and an end state. These filter responses are fully editable: Up to six peaks and three troughs in the formant filter response can be specified as morph points. Cantor 2 offered 20 ready-to-use vocals in English and German and added many new voices on top of the original Cantor software, bringing the total to 50 voices.

The sound generator used a combination of additive synthesis and noise sculpting that it used specifically for the 50 voiced sounds provided by the software as set as a complete set for the unvoiced sounds. The concept of voiced and unvoiced sounds was complicated but was used to describe how Cantor was able to master its language capabilities of human speech. For voiced sounds, the additive synth controls the pitched component of the sound (vocal cords), whereas the noise synth controls the breath component (whisper). It controlled up to 256 partials. As the user went higher into the octaves, these became grouped for control. For those who had used other VirSyn's software, Cantor was familiar grounds and bore many things in common with past synthesizers VirSyn had produced.

Because of its design, it was more like a virtual instrument than a virtual singer. It never claimed to mimic a real singer's voice and was intended purely for special effects. Although it was complex, Cantor was considered a simple design overall and relatively easy to use for its purposes.

It hosted VST, AU and RTAS capabilities. By Cantor 2's release, midi file format was fully functional. It was able to work as a standalone software or as a plugin; there were slight differences between the software for both. It worked as a standalone software or plug-in and supported ReWire. Though it was released in German and English, with adjustments of the sound output it was possible to recreate vocal languages beyond this and mimic other languages.

History

Cantor was released after the original Vocaloid engine and was considered a suitable software to rival Yamaha's Vocaloid engine, then only known in the western hemisphere by the Vocaloids 'Leon', 'Lola' and 'Miriam'. Cantor reached a level of vocal synthesising that had not yet been reached.

A demo of the software was released. It required purchasing an elicence dongle to download the demo, as well as the full software if it was purchased electronically. [1] The final version, Cantor 2.1 was released on February 6, 2007. Even though updates ceased, the software was never removed from sale.

The album Light + Shade by Mike Oldfield featured both the Vocaloid 'Miriam' singing alongside the Cantor software in the song "Tears of an Angel."

Despite being a rival program to Vocaloid, it was able to be purchased on Crypton Future Media's website.

Reception

For the capabilities of what the software could do, the Cantor software was dubbed "the future of music." At the time of its release, Cantor 2 was considered ground breaking technology. Despite its capabilities, one of its let-downs was considered the high price for its contents in comparison to other software. The biggest criticism toward the software itself was its unintelligible results despite being a powerful tool and though improvements were made between Cantor and Cantor 2. It still lacked clarity which put it at a disadvantage against the more realistic sound of Vocaloid. The simple design of its interface despite the complexity of its capabilities was highly praised overall by reviewers.

Cantor was able to create a playground for experimental vocal sounds and give composers a tool for high levels of vocal affects and sounds. [2] However, Cantor and Vocaloid were based on the same concepts and ideas; they shared a number of similar designs. [3] It was unable to escape comparisons between itself and Vocaloid, although some musicians used both software at the time. [4]

See also

Related Research Articles

Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech. The reverse process is speech recognition.

<span class="mw-page-title-main">Music technology (electronic and digital)</span>

Digital music technology encompasses the use of digital instruments to produce, perform or record music. These instruments vary, including computers, electronic effects units, software, and digital audio equipment. Digital music technology is used in performance, playback, recording, composition, mixing, analysis and editing of music, by professions in all parts of the music industry.

<span class="mw-page-title-main">Vocaloid</span> Singing voice synthesizer software

Vocaloid is a singing voice synthesizer software product. Its signal processing part was developed through a joint research project between Yamaha Corporation and the Music Technology Group in Universitat Pompeu Fabra, Barcelona. The software was ultimately developed into the commercial product "Vocaloid" that was released in 2004.

The source–filter model represents speech as a combination of a sound source, such as the vocal cords, and a linear acoustic filter, the vocal tract. While only an approximation, the model is widely used in a number of applications such as speech synthesis and speech analysis because of its relative simplicity. It is also related to linear prediction. The development of the model is due, in large part, to the early work of Gunnar Fant, although others, notably Ken Stevens, have also contributed substantially to the models underlying acoustic analysis of speech and speech synthesis. Fant built off the work of Tsutomu Chiba and Masato Kajiyama, who first showed the relationship between a vowel's acoustic properties and the shape of the vocal tract.

The Yamaha FS1R is a sound synthesizer module, manufactured by the Yamaha Corporation from 1998 to 2000. Based on Formant synthesis, it also has FM synthesis capabilities similar to the DX range. Its editing involves 2,000+ parameters in any one 'performance', prompting the creation of a number of third party freeware programming applications. These applications provide the tools needed to program the synth which were missing when it was in production by Yamaha. The synth was discontinued after two years, probably in part due to its complexity, poor front-panel controls, brief manual and limited polyphony.

eSpeak Compact, open-source, software speech synthesizer

eSpeak is a free and open-source, cross-platform, compact, software speech synthesizer. It uses a formant synthesis method, providing many languages in a relatively small file size. eSpeakNG is a continuation of the original developer's project with more feedback from native speakers.

Gnuspeech is an extensible text-to-speech computer software package that produces artificial speech output based on real-time articulatory speech synthesis by rules. That is, it converts text strings into phonetic descriptions, aided by a pronouncing dictionary, letter-to-sound rules, and rhythm and intonation models; transforms the phonetic descriptions into parameters for a low-level articulatory speech synthesizer; uses these to drive an articulatory model of the human vocal tract producing an output suitable for the normal sound output devices used by various computer operating systems; and does this at the same or faster rate than the speech is spoken for adult speech.

<span class="mw-page-title-main">Utau</span> Japanese shareware voice synthesizer

UTAU is a Japanese singing synthesizer application created by Ameya/Ayame (飴屋/菖蒲). This program is similar to the VOCALOID software, with the difference being it is shareware instead of under a third party licensing.

<span class="mw-page-title-main">Internet Co., Ltd.</span> Software company in Osaka, Japan

Internet Co., Ltd. or Internet, is a software company based in Osaka, Japan. It is best known for the music sequencer Singer Song Writer and Niconico Movie Maker for Nico Nico Douga, a video sharing website. It also develops singing synthesizers using the Vocaloid 4 engine developed by Yamaha Corporation. In 2014, they were the second leading company in sound-related software in Japan, boasting a 14.0% share of the market.

<span class="mw-page-title-main">Voiceroid</span> Speech synthesizer application

Voiceroid is a speech synthesizer application developed by AH-Software and is designed for speech. It is only available in the Japanese language. Its name comes from the singing software Vocaloid, for which AH-Software also develops voicebanks. Both AH-Software's first Vocaloids and Voiceroids went on sale on December 4, 2009.

Vocaloid is a singing voice synthesizer and the first engine released in the Vocaloid series. It was succeeded by Vocaloid 2. This version was made to be able to sing both English and Japanese.

<span class="mw-page-title-main">Vocaloid 2</span> 2007 singing voice synthesizer

Vocaloid 2 is a singing voice synthesizer and the successor to the Vocaloid voice synthesizer application by Yamaha. Unlike the first engine, Vocaloid 2 based its output on vocal samples, rather than voice analysis. The synthesis engine and the user interface were completely revamped, with Japanese Vocaloids possessing a Japanese interface, as opposed to the previous version, which used English for both versions. It is noteworthy for introducing the popular character Hatsune Miku. It was succeeded by Vocaloid 3.

<span class="mw-page-title-main">Vocaloid 3</span> 2011 singing voice synthesizer

Vocaloid 3 is a singing voice synthesizer and successor to Vocaloid 2 in the Vocaloid series. This version of the software is a much more expansive version, containing many new features, three new languages and many more vocals than past software versions combined. It was succeeded by Vocaloid 4.

<span class="mw-page-title-main">Vocaloid 4</span> 2014 singing voice synthesizer

Vocaloid 4 is a singing voice synthesizer and successor to Vocaloid 3 in the Vocaloid series. It was succeeded by Vocaloid 5.

<span class="mw-page-title-main">Harmor</span> Software synthesizer

Harmor is a software synthesizer created by Image-Line Software for the music production program FL Studio. It is available as a demo version within the software; however, it must be purchased separately in order to save projects that contain Harmor instances. Harmor is an upgraded and more elaborate version of its predecessor, Harmless. It was originally released with a 32-bit engine and upgraded to a 64-bit engine in 2013.

<span class="mw-page-title-main">Megpoid</span> Vocaloid 3 voicebank

Megpoid is a Vocaloid by Internet Co., Ltd. Her voice is sampled by Megumi Nakajima. The mascot of the software is called Gumi . She is also sometimes called Megpoid GUMI, or GUMI Megpoid.

<span class="mw-page-title-main">Vocaloid 5</span> 2018 singing voice synthesizer

Vocaloid 5 is a singing voice synthesizer and successor to Vocaloid 4 in the Vocaloid series. It was succeeded by Vocaloid 6.

<span class="mw-page-title-main">Kasane Teto</span> Vocalbank on UTAU and SynthV

Kasane Teto is a virtual singer software created on the Japanese textboard 2channel for April Fools' Day, 2008. Although the software was initially created as a hoax and did not exist, it later was actually produced and made compatible with singing voice synthesis software, allowing it to sing. Teto was introduced as a "diva born from a hoax".

<span class="mw-page-title-main">Arturia MicroFreak</span> Synthesizer

The MicroFreak is a synthesizer manufactured by French music technology company Arturia and released in 2019. Described as a "Hybrid Experimental Synthesizer", it uses 18 digital sound engines (algorithms) to synthesize raw tones. This digital oscillator is then fed into a multi-mode analog filter, giving the MicroFreak its hybrid sounds.

References

  1. "E_CNT2Demo" . Retrieved 27 April 2016.
  2. "VIRSYN Cantor 1.02 (Mac/Win)". Archived from the original on 2 May 2005. Retrieved 27 April 2016.
  3. Virsyn Cantor Singing Synthesis Software (XP/ Mac OS X), soundonsound.com
  4. Walden, John (December 2004). "Vocaloid Miriam". Sound on Sound . Retrieved March 26, 2011.