Suno AI

Last updated
Suno AI
Developer(s) Suno, Inc.
Initial releaseDecember 20, 2023;6 months ago (2023-12-20)
Stable release
v3.5 / May 24, 2024
Type Generative artificial intelligence
Website suno.com

Suno AI, or simply Suno, is a generative artificial intelligence music creation program designed to generate realistic songs that combine vocals and instrumentation, [1] or are purely instrumental. Suno has been widely available since December 20, 2023, after the launch of a web application and a partnership with Microsoft, which included Suno as a plugin in Microsoft Copilot. [2]

Contents

Example of a two-minute song generated by Suno AI; its lyrics were generated by ChatGPT. The Style of Music prompt was "Calm, psychedelic rock".

The program operates by producing songs based on text prompts provided by users. Suno does not disclose the dataset used to train its artificial intelligence but claims it has been safeguarded against plagiarism and copyright concerns. [1]

History

Suno was founded by four people: Michael Shulman, Georg Kucsko, Martin Camacho, and Keenan Freyberg. They all worked for Kensho, an AI startup, before starting their own company in Cambridge, Massachusetts. [3]

In April 2023, Suno released their open-source text-to-speech and audio model called "Bark" on GitHub and Hugging Face, under the MIT License. [4] [5] On March 21, 2024, Suno released its v3 version for all users. [6] The new version allows users to create a limited number of 2-minute songs using a free account. [7] Users can pay to subscribe monthly or annually to unlock more capabilities of Suno.

On July 1, 2024, a mobile app for Suno was released. [8]

In June 2024, a lawsuit, led by the Recording Industry Association of America, was filed against Suno and Udio alleging widespread infringement of copyrighted sound recordings. The lawsuit sought to bar the companies from training on copyrighted music, as well as damages of up to $150,000 per work from infringements that have already taken place. [9] [10]

See also

Related Research Articles

Anthropic PBC is a U.S.-based artificial intelligence (AI) startup public-benefit company, founded in 2021. It researches and develops AI to "study their safety properties at the technological frontier" and use this research to deploy safe, reliable models for the public. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's ChatGPT and Google's Gemini.

Music and artificial intelligence (AI) is the development of music software programs which use AI to generate music. As with applications in other fields, AI in music also simulates mental tasks. A prominent feature is the capability of an AI algorithm to learn based on past data, such as in computer accompaniment technology, wherein the AI is capable of listening to a human performer and performing accompaniment. Artificial intelligence also drives interactive composition technology, wherein a computer composes music in response to a live performance. There are other AI applications in music that cover not only music composition, production, and performance but also how music is marketed and consumed. Several music player programs have also been developed to use voice recognition and natural language processing technology for music voice control. Current research includes the application of AI in music composition, performance, theory and digital sound processing.

<span class="mw-page-title-main">Mustafa Suleyman</span> British entrepreneur and activist

Mustafa Suleyman is a British artificial intelligence (AI) entrepreneur. He is the CEO of Microsoft AI, and the co-founder and former head of applied AI at DeepMind, an AI company acquired by Google. After leaving DeepMind, he co-founded Inflection AI, a machine learning and generative AI company, in 2022.

OpenAI is an American artificial intelligence (AI) research organization founded in December 2015 and headquartered in San Francisco. Its mission is to develop "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As a leading organization in the ongoing AI boom, OpenAI is known for the GPT family of large language models, the DALL-E series of text-to-image models, and a text-to-video model named Sora. Its release of ChatGPT in November 2022 has been credited with catalyzing widespread interest in generative AI.

<span class="mw-page-title-main">Artificial intelligence art</span> Machine application of knowledge of human aesthetic expressions

Artificial intelligence art is visual artwork created through the use of an artificial intelligence (AI) program.

GitHub Copilot is a code completion tool developed by GitHub and OpenAI that assists users of Visual Studio Code, Visual Studio, Neovim, and JetBrains integrated development environments (IDEs) by autocompleting code. Currently available by subscription to individual developers and to businesses, the generative artificial intelligence software was first announced by GitHub on 29 June 2021, and works best for users coding in Python, JavaScript, TypeScript, Ruby, and Go. In March 2023 GitHub announced plans for "Copilot X", which will incorporate a chatbot based on GPT-4, as well as support for voice commands, into Copilot.

OpenAI Codex is an artificial intelligence model developed by OpenAI. It parses natural language and generates code in response. It powers GitHub Copilot, a programming autocompletion tool for select IDEs, like Visual Studio Code and Neovim. Codex is a descendant of OpenAI's GPT-3 model, fine-tuned for use in programming applications.

<span class="mw-page-title-main">Midjourney</span> Image-generating machine learning model

Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco–based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called prompts, similar to OpenAI's DALL-E and Stability AI's Stable Diffusion. It is one of the technologies of the AI boom.

<span class="mw-page-title-main">Stable Diffusion</span> Image-generating machine learning model

Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered to be a part of the ongoing artificial intelligence boom.

<span class="mw-page-title-main">ChatGPT</span> Chatbot and virtual assistant developed by OpenAI

ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive user prompts and replies are considered at each conversation stage as context.

Prisma Labs is a software company based in Sunnyvale, California that is known for developing Prisma and Lensa.

<span class="mw-page-title-main">Generative artificial intelligence</span> AI system capable of generating content in response to prompts

Generative artificial intelligence is artificial intelligence capable of generating text, images, videos, or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.

<span class="mw-page-title-main">AI boom</span> Ongoing period of rapid progress in artificial intelligence

The AI boom, or AI spring, is an ongoing period of rapid progress in the field of artificial intelligence (AI) that started in the late 2010s. Known examples include protein folding prediction led by Google DeepMind and generative AI led by OpenAI.

<span class="mw-page-title-main">Microsoft Copilot</span> Chatbot developed by Microsoft

Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft. Based on a large language model, it was launched in February 2023 as Microsoft's primary replacement for the discontinued Cortana.

In the 2020s, the rapid advancement of deep learning-based generative artificial intelligence models are raising questions about whether copyright infringement occurs when the generative AI is trained or used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there are several pending U.S. lawsuits challenging the use of copyrighted data to train AI models, with defendants arguing that this falls under fair use.

Inflection AI, Inc. is a technology company which has developed a machine learning and generative artificial intelligence hardware and apps, founded in 2022. The company is structured as a public benefit corporation and is headquartered in Palo Alto, California.

Runway AI, Inc. is an American company headquartered in New York City that specializes in generative artificial intelligence research and technologies. The company is primarily focused on creating products and models for generating videos, images, and various multimedia content. It is most notable for developing the commercial text-to-video and video generative AI models Gen-1, Gen-2 and Gen-3 Alpha.

Tabnine is an artificial intelligence (AI) coding assistant developed by Tabnine, which was founded by Dror Weiss and Professor Eran Yahav in Tel Aviv, Israel, in 2013. Initially established under the name Codota, the company underwent a rebranding in May 2021 following the release of the company’s first large language model based AI coding assistant, adopting the name Tabnine.

<span class="mw-page-title-main">Udio</span> Generative text-to-music model

Udio is a generative artificial intelligence model that produces music based on simple text prompts. It can generate vocals and instrumentation. Its free beta version was released publicly on April 10, 2024. Users can pay to subscribe monthly or annually to unlock more capabilities such as audio inpainting.

<span class="mw-page-title-main">BBL Drizzy</span> 2024 instrumental by Metro Boomin

"BBL Drizzy" is a "diss track beat" by American record producer Metro Boomin. It was released on May 5, 2024 dissing Drake in response to the Drake–Kendrick Lamar feud which consisted of multiple diss tracks from both sides. "BBL Drizzy" samples an artificial intelligence-generated track, released on April 14, of the same name by comedian King Willonius. It is the first notable example of AI sampling in mainstream hip-hop music, according to Billboard.

References

  1. 1 2 Ward, Abby (2023-12-21). "How to Use Microsoft Copilot's New Suno AI Music Creation Tool". Tech.co. Retrieved 2024-04-05.
  2. "Microsoft's Copilot and Suno AI team up to create a music generator extension". The Verge. Vox Media. December 19, 2023. Retrieved January 4, 2024.
  3. King, Hope (2023-12-20). "Generative AI startup Suno wants to make songwriting as easy as taking iPhone photos". Axios. Retrieved 2024-04-05.
  4. Bastian, Matthias (2023-09-17). "Suno AI's new text-to-music model generates impressive songs". The Decoder. Retrieved 2024-04-26.
  5. "Bark: The Ultimate Audio Generation Model". KDnuggets. Retrieved 2024-04-26.
  6. Hiatt, Brian (2024-03-22). "Our AI-Generated Blues Song Went Viral -- and Sparked Controversy". Rolling Stone. Retrieved 2024-04-05.
  7. Wilson, Mark (2024-03-23). "What is Suno? The viral AI song generator explained – and how to use it for free". TechRadar. Retrieved 2024-04-05.
  8. Coombes, Lloyd (July 2, 2024). "Suno launches iPhone app — now you can make AI music on the go". Tom's Guide . Future US . Retrieved July 7, 2024.
  9. Sato, Mia (2024-06-24). "Major record labels sue AI company behind "BBL Drizzy"". The Verge. Retrieved 2024-06-24.
  10. Robinson, Kristin (2024-06-24). "Major Labels Sue AI Firms Suno and Udio for Alleged Copyright Infringement". Billboard. Retrieved 2024-06-24.