ElevenLabs

Last updated

ElevenLabs Inc.
Company typePrivate company
Industry Artificial intelligence
Founded2022;3 years ago (2022)
Founders
  • Piotr Dąbkowski (CTO)
  • Mati Staniszewski (CEO)
Headquarters New York City, U.S.
Website elevenlabs.io

ElevenLabs is a software company that specializes in developing natural-sounding speech synthesis software using deep learning.

Contents

History

ElevenLabs was co-founded in 2022 by Piotr Dąbkowski, an ex-Google machine learning engineer and Mati Staniszewski, an ex-Palantir deployment strategist. [1] Both were raised in Poland, and their inspiration for founding ElevenLabs reportedly came from watching inadequately dubbed American films. [2] [3]

Dąbkowski and Staniszewski initially considered different funding options, including the possibility of collaborating with a startup accelerator. In January 2023 they revealed having secured a $2 million pre-seed round. The startup's specialization in AI voice intelligence, a still-emerging field in Europe, played a significant role in attracting investors. The pre-seed funding was primarily led by Credo Ventures, and joined by Concept Ventures. [4]

In January 2023, ElevenLabs publicly released its beta platform. [5]

In June 2023, ElevenLabs raised a $19 million Series A funding round at a valuation of about $100 million, [6] [7] despite the company having no office and only 15 employees. [3] [7] The funding round was co-led by the venture capital firm Andreessen Horowitz, former GitHub CEO Nat Friedman, and entrepreneur Daniel Gross. It also saw participation from prominent individuals such as SV Angel, Mike Krieger (co-founder of Instagram), Brendan Iribe (co-founder of Oculus), Mustafa Suleyman (co-founder of Deepmind), and Tim O'Reilly (founder of O'Reilly Media). It was also announced that Andreessen Horowitz would be joining ElevenLabs' board. [2]

On January 22, 2024, ElevenLabs raised an additional $80 million in Series B funding raising the total valuation of the company to $1.1 billion. The funding round was led by Andreessen Horowitz, Friedman, Gross, and Sequoia Capital. Additionally, the company announced a series of new products, including their Voice Marketplace, AI Dubbing Studio, and mobile app. [8]

Products

ElevenLabs is primarily known for its browser-based, AI-assisted text-to-speech software, Speech Synthesis, which can produce lifelike speech by synthesizing vocal emotion and intonation. [9] The company states that its models are trained to interpret the context in the text, and adjust the intonation and pacing accordingly. [10] It uses advanced algorithms to analyze the contextual aspects of text, aiming to detect emotions like anger, sadness, happiness, or alarm, which enables the system to understand the user's sentiment, [11] resulting in achieving a more realistic and human-like inflection. The startup is in the process of patenting this technology. [4] On its beta site, users can submit text and generate audio files from a selection of default voices. Paying users are given the ability to upload custom voice samples to create new vocal styles using the company's voice cloning tool. [12]

Voice Library is the company's feature for sharing unique voice profiles created using their Voice Design technology. These pre-designed voice profiles allow users to select a voice that best suits their needs, rather than creating one from scratch. [13] There are now more than 1,000 community-created voices in the library. Another tool called VoiceLab allows users to clone voices from just a few short snippets of audio and can create entirely new synthetic voices. [2]

On 20 June 2023, ElevenLabs released an AI recognition tool called the AI Speech Classifier, which it claims is the first of its kind. [2] The tool is accessible through an API and designed to determine if an uploaded audio sample originates from ElevenLabs' proprietary AI technology. [3] The company has expressed its intention to collaborate with other AI developers in creating a universal detection system that could be adopted industry-wide. [14]

In July 2023, ElevenLabs announced "Projects", a tool for creating long-form spoken content such as audiobooks and dialogue segments with contextually-aware synthetic or custom voices. [3] [15] The tool was released in September. In August, ElevenLabs expanded its voice generation capabilities to 28 languages. Using an in-house AI model, it automatically detects languages like Korean, Dutch, and Vietnamese, allowing for "emotionally rich" multilingual speech generation. The company also announced that its technology had officially exited its beta phase. [16] [17]

In October 2023, ElevenLabs presented "AI Dubbing," a tool that is able to translate speech into more than 20 languages. The feature is capable of preserving the speaker's original voice, emotions, and intonation, by employing proprietary methods to handle tasks like noise removal, speaker differentiation, transcription, and synchronization of translated speech with the original audio. [18]

In May 2024, ElevenLabs launched a text-to-music model. [19] In June 2024, ElevenLabs released the ElevenLabs Reader App on iOS and Android which allows users to listen to articles, PDFs, and ePubs with AI Voices on their phone. [20] In July 2024, ElevenLabs released "Voice Isolator" which removes background noise from audio. [21]

Reception

Following its launch in January 2023, ElevenLabs gained rapid momentum and was commended for its voice output quality, fast generation times, and a "generous free tier". It has also been praised for its ability to accurately pronounce names with unique or uncommon pronunciations, addressing a common shortcoming in similar tools that often cater primarily to Western names. [22] The company reached over one million registered users between its launch and June 2023. [2] [3] [23]

Criticism and controversy

ElevenLabs was criticized after users were able to abuse its software to generate controversial statements in the vocal style of celebrities, public officials, and other famous individuals, [24] [25] [26] [27] [28] particularly attracting attention after users on 4chan used the tool to share hateful messages. [29] [14] The software's ability to closely replicate real voices has raised ethical concerns, with critics likening it to deepfaking. [30] In response, the company said it would work on mitigating potential abuse through safeguards and identity verification. [5] The company has subsequently limited access to its voice cloning feature to paid subscribers, [31] citing the requirement to provide payment information as means for improving accountability, [32] and has implemented bans on users who repeatedly violate the terms of service.

In the leadup to the January 2024 New Hampshire democratic primary, AI-generated robocalls purportedly from Joe Biden encouraging voters to skip voting on the day of the primary were sent to thousands of residents. The New Hampshire attorney general's office launched an investigation into the incident and linked it to a company based in Texas, with audio experts concluding the call was made using ElevenLabs. In response to the incident, CEO Mati Staniszewski stated that the company was “dedicated to preventing the misuse of audio AI tools” but provided no comment on specific incidents. [33]

Additional concerns have been raised over the ethics of the source of ElevenLabs' training data, with multiple voice actors claiming ElevenLabs used samples of their voices without their consent. [34] ElevenLabs, along with other companies in its category, has thus been seen as a potential challenge to the voice acting sector. [17]

See also

References

  1. Kanetkar, Riddhi. "This startup, founded by ex-Google and Palantir staffers, uses AI to generate realistic voiceovers. Here's the 14-slide pitch deck ElevenLabs used to raise $2 million". Business Insider. Retrieved February 9, 2023.
  2. 1 2 3 4 5 "Now hear this: Voice cloning AI startup ElevenLabs nabs $19M from a16z and other heavy hitters". VentureBeat. June 20, 2023. Retrieved July 25, 2023.
  3. 1 2 3 4 5 Wiggers, Kyle (June 20, 2023). "Voice-generating platform ElevenLabs raises $19M, launches detection tool". TechCrunch. Retrieved July 25, 2023.
  4. 1 2 Kanetkar, Riddhi. "Hot AI startup ElevenLabs, founded by ex-Google and Palantir staff, is set to raise $18 million at a $100 million valuation. Check out the 14-slide pitch deck it used for its $2 million pre-seed". Business Insider. Retrieved July 25, 2023.
  5. 1 2 "A new AI voice tool is already being abused to make deepfake celebrity audio clips". Engadget. Retrieved February 3, 2023.
  6. "The trials and tribulations of AI voice tech". Financial Times. June 21, 2023. Retrieved July 25, 2023.
  7. 1 2 Hunt, Simon (June 20, 2023). "AI firm ElevenLabs achieves $100 million valuation within months of launch". Evening Standard. Retrieved July 25, 2023.
  8. "ElevenLabs Releases New Voice AI Products and Raises $80M Series B". January 22, 2024.
  9. "Generative AI comes for cinema dubbing: Audio AI startup ElevenLabs raises pre-seed". Sifted. January 23, 2023. Retrieved February 3, 2023.
  10. Ashworth, Boone (April 12, 2023). "AI Can Clone Your Favorite Podcast Host's Voice". Wired. Retrieved April 25, 2023.
  11. WIRED Staff. "This Podcast Is Not Hosted by AI Voice Clones. We Swear". Wired. ISSN   1059-1028 . Retrieved July 25, 2023.
  12. Frauenfelder, Mark (January 12, 2023). "Software lets you design new synthetic voices from scratch". Boing Boing. Retrieved February 3, 2023.
  13. "As Generative AI booms, this British startup secures $2M to imitate human voices — TFN". Tech Funding News. January 25, 2023. Retrieved February 5, 2023.
  14. 1 2 Thompson, Stuart A. (March 12, 2023). "Making Deepfakes Gets Cheaper and Easier Thanks to A.I." The New York Times. ISSN   0362-4331 . Retrieved July 25, 2023.
  15. Bonk, Lawrence. "ElevenLabs' Powerful New AI Tool Lets You Make a Full Audiobook in Minutes". Lifewire. Retrieved July 25, 2023.
  16. "ElevenLabs' AI Voice Generator Can Now Fake Your Voice in 30 Languages". Gizmodo. August 22, 2023. Retrieved September 25, 2023.
  17. 1 2 Wiggers, Kyle (August 22, 2023). "ElevenLabs' voice-generating tools launch out of beta". TechCrunch. Retrieved September 25, 2023.
  18. Sharma, Shubham (October 10, 2023). "ElevenLabs introduces AI Dubbing, translating video and audio into 20 languages". VentureBeat. Retrieved November 28, 2023.
  19. Morrison, Ryan (May 10, 2024). "ElevenLabs is launching a new AI music generator — and you have to hear these clips to appreciate it". Tom's Guide. Retrieved May 14, 2024.
  20. "ElevenLabs Launches Reader, A Text-to-Audio App". Maginative. June 25, 2024. Retrieved July 24, 2024.
  21. Sharma, Shubham (July 4, 2024). "ElevenLabs launches free AI voice isolator to take on Adobe". VentureBeat. Retrieved July 24, 2024.
  22. Desai, Saahil (July 17, 2023). "A Voicebot Just Left Me Speechless". The Atlantic. Retrieved September 25, 2023.
  23. "Your AI Clone Can Fool Family, Your Bank, But Not Your Video Meeting - Tech News Briefing - WSJ Podcasts". WSJ. Retrieved July 25, 2023.
  24. Jimenez, Jorge (January 31, 2023). "AI company promises changes after 'voice cloning' tool used to make celebrities say awful things". PC Gamer. Retrieved February 3, 2023.
  25. "People Are Still Terrible: AI Voice-Cloning Tool Misused for Deepfake Celeb Clips". PCMag Middle East. January 31, 2023. Retrieved July 25, 2023.
  26. "Internet Up in Arms as 4Chan User Uses AI Voice Simulator To Deepfake Emma Watson's Voice, Makes Her Read Hitler's Autobiography – FandomWire". fandomwire.com. February 2, 2023. Retrieved February 3, 2023.
  27. "The generative A.I. software race has begun". Fortune. Retrieved February 3, 2023.
  28. Milmo, Dan; Hern, Alex (May 20, 2023). "Elections in UK and US at risk from AI-driven disinformation, say experts". The Guardian. ISSN   0261-3077 . Retrieved July 25, 2023.
  29. Vincent, James (January 31, 2023). "4chan users embrace AI voice clone tool to generate celebrity hatespeech". The Verge. Retrieved February 3, 2023.
  30. "Seeing is believing? Global scramble to tackle deepfakes". news.yahoo.com. Retrieved February 3, 2023.
  31. @elevenlabsio (January 31, 2023). "Thank you everyone for your advice. We love what you're creating, but a set of actors use our tech for malicious purposes. We decided to take the following steps to address the issues" (Tweet). Retrieved April 25, 2023 via Twitter.
  32. @elevenlabsio (January 31, 2023). "This will keep our tools accessible while allowing us to fight potential misuse. Payment details won't always prevent abuse, but they make VoiceLab users less anonymous and force them to think twice before sharing improper content" (Tweet). Retrieved April 25, 2023 via Twitter.
  33. Knibbs, Kate. "Researchers Say the Deepfake Biden Robocall Was Likely Made With Tools From AI Startup ElevenLabs". Wired. ISSN   1059-1028 . Retrieved February 15, 2024.
  34. "Your Favorite Voice Actors Call Out AI Sites Copying Voices Without Consent". Kotaku. February 13, 2023. Retrieved December 10, 2023.