Stability AI

Last updated

Stability AI is an artificial intelligence company, best known for it's text-to-image model Stable Diffusion

Contents

History and founding

Stability AI was founded in 2019[ citation needed ] by Emad Mostaque, who also initially funded the company [1] .

In August 2022 Stability AI rose to prominence with the release of it's source and weights available text-to-image model Stable Diffusion

On March 23rd 2024 Emad Mostaque stepped down from his position as CEO. The board of directors appointed COO, Shan Shan Wong, and CTO, Christian Laforte, as the interim co-CEOs of Stability AI. [2]

Funding and investors

Stability AI's early stages were significantly supported by Mostaque's own investments, alongside contributions from investment firms like Eros Investments. A notable milestone in the company's funding history was a $101 million investment round led by Coatue and Lightspeed Venture Partners, with O’Shaughnessy Ventures LLC also participating. [3] .

Product and application

Stability AI has made contributions to the field of generative AI, most notably through Stable Diffusion. This AI model has change the way images can be generated from textual descriptions. Beyond Stable Diffusion, Stability AI also develops Video, Audio, 3D, and text models.

Litigation

Stability AI has faced legal challenges, notably from Getty Images, which accused the company of misusing over 12 million photos from its collection to train Stability AI's AI image-generation system, Stable Diffusion. This lawsuit, filed in Delaware federal court, is part of a series of actions against Stability AI concerning the use of images in AI training. Getty Images alleges that Stability AI copied these images without proper licensing, using them to enhance Stable Diffusion's ability to generate accurate depictions from user prompts. This case raises significant questions about copyright and the use of digital assets in training AI systems. [4]

Reference

  1. Roose, Kevin (2022-10-21). "A Coming-Out Party for Generative A.I., Silicon Valley's New Craze". The New York Times. ISSN   0362-4331 . Retrieved 2024-06-28.
  2. https://stability.ai/news/stabilityai-announcement
  3. Wiggers, Kyle (2022-10-17). "Stability AI, the startup behind Stable Diffusion, raises $101M". TechCrunch. Retrieved 2024-02-17.
  4. Brittain, Blake (February 7, 2023). "Getty Images lawsuit says Stability AI misused photos to train AI". Reuters. Retrieved February 17, 2024.

Related Research Articles

Getty Images Holdings, Inc. is a visual media company and supplier of stock images, editorial photography, video, and music for business and consumers, with a library of over 477 million assets. It targets three markets—creative professionals, the media, and corporate.

Anthropic PBC is a U.S.-based artificial intelligence (AI) startup public-benefit company, founded in 2021. It researches and develops AI to "study their safety properties at the technological frontier" and use this research to deploy safe, reliable models for the public. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's ChatGPT and Google's Gemini.

Music and artificial intelligence (AI) is the development of music software programs which use AI to generate music. As with applications in other fields, AI in music also simulates mental tasks. A prominent feature is the capability of an AI algorithm to learn based on past data, such as in computer accompaniment technology, wherein the AI is capable of listening to a human performer and performing accompaniment. Artificial intelligence also drives interactive composition technology, wherein a computer composes music in response to a live performance. There are other AI applications in music that cover not only music composition, production, and performance but also how music is marketed and consumed. Several music player programs have also been developed to use voice recognition and natural language processing technology for music voice control. Current research includes the application of AI in music composition, performance, theory and digital sound processing.

OpenAI is an American artificial intelligence (AI) research organization founded in December 2015 and headquartered in San Francisco, California. Its mission is to develop "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As a leading organization in the ongoing AI boom, OpenAI is known for the GPT family of large language models, the DALL-E series of text-to-image models, and a text-to-video model named Sora. Its release of ChatGPT in November 2022 has been credited with catalyzing widespread interest in generative AI.

<span class="mw-page-title-main">Artificial intelligence art</span> Machine application of knowledge of human aesthetic expressions

Artificial intelligence art is visual artwork created through the use of an artificial intelligence (AI) program.

<span class="mw-page-title-main">DALL-E</span> Image-generating deep-learning model

DALL·E, DALL·E 2, and DALL·E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as "prompts".

<span class="mw-page-title-main">Midjourney</span> Image-generating machine learning model

Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco–based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called prompts, similar to OpenAI's DALL-E and Stability AI's Stable Diffusion. It is one of the technologies of the AI boom.

<span class="mw-page-title-main">Stable Diffusion</span> Image-generating machine learning model

Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered to be a part of the ongoing artificial intelligence boom.

<span class="mw-page-title-main">Text-to-image model</span> Machine learning model

A text-to-image model is a machine learning model which takes an input natural language description and produces an image matching that description.

<span class="mw-page-title-main">Emad Mostaque</span> Bangladeshi-British businessperson and former hedge fund manager

Mohammad Emad Mostaque is a British-Bangladeshi business executive, mathematician, and former hedge fund manager. He is the founder and was CEO of Stability AI until 23 March 2024, one of the companies behind Stable Diffusion.

<span class="mw-page-title-main">LAION</span> Non-profit German artificial intelligence organization

LAION is a German non-profit which makes open-sourced artificial intelligence models and datasets. It is best known for releasing a number of large datasets of images and captions scraped from the web which have been used to train a number of high-profile text-to-image models, including Stable Diffusion and Imagen.

Prisma Labs is a software company based in Sunnyvale, California that is known for developing Prisma and Lensa.

<span class="mw-page-title-main">Generative artificial intelligence</span> AI system capable of generating content in response to prompts

Generative artificial intelligence is artificial intelligence capable of generating text, images, videos, or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.

<span class="mw-page-title-main">AI boom</span> Ongoing period of rapid progress in artificial intelligence

The AI boom, or AI spring, is an ongoing period of rapid progress in the field of artificial intelligence (AI) that started in the late 2010s before gaining global prominence by 2022. Known examples include protein folding prediction led by Google DeepMind and generative AI led by OpenAI.

In the 2020s, the rapid advancement of deep learning-based generative artificial intelligence models are raising questions about whether copyright infringement occurs when the generative AI is trained or used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there are several pending U.S. lawsuits challenging the use of copyrighted data to train AI models, with defendants arguing that this falls under fair use.

Text-to-Image personalization is a task in deep learning for computer graphics that augments pre-trained text-to-image generative models. In this task, a generative model that was trained on large-scale data, is adapted such that it can generate images of novel, user-provided concepts. These concepts are typically unseen during training, and may represent specific objects or more abstract categories.

Runway AI, Inc. is an American company headquartered in New York City that specializes in generative artificial intelligence research and technologies. The company is primarily focused on creating products and models for generating videos, images, and various multimedia content. It is most notable for developing the commercial text-to-video and video generative AI models Gen-1, Gen-2 and Gen-3 Alpha.

<span class="mw-page-title-main">Udio</span> Generative text-to-music model

Udio is a generative artificial intelligence model that produces music based on simple text prompts. It can generate vocals and instrumentation. Its free beta version was released publicly on April 10, 2024. Users can pay to subscribe monthly or annually to unlock more capabilities such as audio inpainting.

<span class="mw-page-title-main">ComfyUI</span> Open source node-based generative artificial intelligence UI

ComfyUI is an open source, node-based, generative artificial intelligence computer program that allows users to generate images from a series of text prompts. It uses Stable Diffusion as the base model for its image capabilities combined with other tools such as ControlNet and LCM Low-rank adaptation with each tool being a node in the program.

AUTOMATIC1111 Stable Diffusion Web UI is an open source generative artificial intelligence computer program that allows users to generate images from a text prompt. It uses Stable Diffusion as the base model for its image capabilities together with a large set of extensions and features to customize its output.