Developer(s) | Midjourney, Inc. |
---|---|
Initial release | July 12, 2022 (open beta) |
Stable release | V6.1 / July 31, 2024 |
Website | midjourney.com |
Part of a series on |
Artificial intelligence |
---|
Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco-based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called prompts , similar to OpenAI's DALL-E and Stability AI's Stable Diffusion. [1] [2] It is one of the technologies of the AI boom.
The tool is in open beta as of August 2024, which it entered on July 12, 2022. [3] The Midjourney team is led by David Holz, who co-founded Leap Motion. [4] Holz told The Register in August 2022 that the company was already profitable. [5] Users create artwork with Midjourney using Discord bot commands or the official website. [6] [7]
Midjourney, Inc. was founded in San Francisco, California, by David Holz, [8] previously a co-founder of Leap Motion. [9] The Midjourney image generation platform entered open beta on July 12, 2022. [3] On March 14, 2022, the Midjourney Discord server launched with a request to post high-quality photographs to Twitter and Reddit for systems training.[ citation needed ]
The company has been working on improving its algorithms, releasing new model versions every few months. Version 2 of their algorithm was launched in April 2022, [10] and version 3 on July 25. [11] On November 5, 2022, the alpha iteration of version 4 was released to users. [12] [13] On March 15, 2023, the alpha iteration of version 5 was released. [14] The 5.1 model is more opinionated than version 5, applying more of its own stylization to images, while the 5.1 RAW model adds improvements while working better with more literal prompts. The version 5.2 included a new "aesthetics system", and the ability to "zoom out" by generating surroundings to an existing image. [15] On December 21, 2023, the alpha iteration of version 6 was released. The model was trained from scratch over a nine month period. Support was added for better text rendition and a more literal interpretation of prompts.
|
Midjourney is accessible through a Discord bot or by accessing their website. Users can use Midjourney through Discord either through their official Discord server, by directly messaging the bot, or by inviting the bot to a third-party server. To generate images, users use the /imagine
command and type in a prompt; [22] the bot then returns a set of four images, which users are given the option to upscale. To generate images on the website, users must first have generated at least 1,000 images through the bot. [7]
Midjourney's founder, David Holz, told The Register that artists use Midjourney for rapid prototyping of artistic concepts to show to clients before starting work themselves. [5]
The advertising industry has been quick to embrace AI tools such as Midjourney, DALL-E, and Stable Diffusion, among others. The tools that enable advertisers to create original content and brainstorm ideas quickly are providing new opportunities, such as "custom ads created for individuals, a new way to create special effects, or even making e-commerce advertising more efficient", according to Ad Age . [23] [ promotion? ]
Architects have described using the software to generate mood boards for the early stages of projects, as an alternative to searching Google Images. [24]
The program was used by the British magazine The Economist to create the front cover for an issue in June 2022. [26] [27] In Italy, the leading newspaper Corriere della Sera published a comic created with Midjourney by writer Vanni Santoni in August 2022. [28] Charlie Warzel used Midjourney to generate two images of Alex Jones for Warzel's newsletter in The Atlantic . The use of an AI-generated cover was criticised by people who felt it was taking jobs from artists. Warzel called his action a mistake in an article about his decision to use generated images. [29] Last Week Tonight with John Oliver included a 10-minute segment on Midjourney in an episode broadcast in August 2022. [30] [31]
A Midjourney image called Théâtre D'opéra Spatial won first place in the digital art competition at the 2022 Colorado State Fair. Jason Allen, who wrote the prompt that led Midjourney to generate the image, printed the image onto a canvas and entered it into the competition using the name Jason M. Allen via Midjourney. Other digital artists were upset by the news. [32] Allen was unapologetic, insisting that he followed the competition's rules. The two category judges were unaware that Midjourney used AI to generate images, although they later said that had they known this, they would have awarded Allen the top prize anyway. [33]
In December 2022, Midjourney was used to generate the images for an AI-generated children's book that was created over a weekend. Titled Alice and Sparkle , the book features a young girl who builds a robot that becomes self-aware. The creator, Ammaar Reeshi, used Midjourney to generate a large number of images, from which he chose 13 for the book. [34] Both the product and process drew criticism. One artist wrote that "the main problem... is that it was trained off of artists' work. It's our creations, our distinct styles that we created, that we did not consent to being used." [25]
In 2023, the realism of AI-based text-to-image generators, such as Midjourney, DALL-E, or Stable Diffusion, [35] [36] reached such a high level that it led to a significant wave of viral AI-generated photos. Widespread attention was gained by a Midjourney-generated photo of Pope Francis wearing a white puffer coat, [37] [38] the fictional arrest of Donald Trump, [39] and a hoax of an attack on the Pentagon, [40] as well as the usage in professional creative arts. [41] [42]
Research has suggested that the images Midjourney generates can be biased. For example, even neutral prompts in one study returned unequal results on the aspects of gender, skin color, and location. [43] A study by researchers at the nonprofit group Center for Countering Digital Hate found the tool to be easy to generate racist and conspiratorial images. [44]
In 2024, a Frontiers journal published a paper [46] which contained gibberish figures generated with Midjourney, one of which was a diagram of a rat with large testicles and a large penis towering over himself. The paper was retracted a day after the images went viral on Twitter. [45]
Prior to May 2023, Midjourney implemented a moderation mechanism predicated on a banned word system. This method prohibited the use of language associated with explicit content, such as sexual or pornographic themes, as well as extreme violence. Moreover, the system also banned certain individual words, including those of religious and political figures, such as Allah or Xi Jinping. This practice occasionally stirred controversy due to perceived instances of censorship within the Midjourney platform. [47] [48]
Commencing in May 2023, with subsequent updates post version 5, Midjourney transitioned to an AI-powered content moderation system. This advanced mechanism allowed for a more nuanced interpretation of user prompts by analyzing them in their entirety. It consequently facilitated the context-dependent use of words that had previously been prohibited. For instance, users can now prompt the AI to generate a portrait of Xi Jinping. At the same time, the system will prevent the generation of contentious images, such as depictions of global leaders, including Xi Jinping, in situations of arrest. [49]
On January 13, 2023, three artists—Sarah Andersen, Kelly McKernan, and Karla Ortiz—filed a copyright infringement lawsuit against Stability AI, Midjourney, and DeviantArt, claiming that these companies have infringed on the rights of millions of artists by training AI tools on five billion images scraped from the web, without the consent of the original artists. [50]
The legal action was initiated in San Francisco by attorney Matthew Butterick in partnership with the Joseph Saveri Law Firm, the same team challenging Microsoft, GitHub, and OpenAI (developers of ChatGPT and DALL-E) in court. In July 2023, U.S. District Judge William Orrick inclined to dismiss most of the lawsuit filed by Andersen, McKernan, and Ortiz but allowed them to file a new complaint. [51] Another lawsuit was filed in November 2023 against Midjourney, Stability AI, DeviantArt and Runway AI for using the copyrighted work of over 4,700 artists. [52]
Wacom Co., Ltd. is a Japanese company headquartered in Kazo, Saitama, Japan, that specializes in manufacturing graphics tablets and related products. As of 2012 Wacom generated sales of approximately 40.7 billion yen with 785 employees. The company's shares are listed on the Tokyo Stock Exchange.
Microsoft Bing, commonly referred to as Bing, is a search engine owned and operated by Microsoft. The service traces its roots back to Microsoft's earlier search engines, including MSN Search, Windows Live Search, and Live Search. Bing offers a broad spectrum of search services, encompassing web, video, image, and map search products, all developed using ASP.NET.
OpenAI is an American artificial intelligence (AI) research organization founded in December 2015 and headquartered in San Francisco, California. Its mission is to develop "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As a leading organization in the ongoing AI boom, OpenAI is known for the GPT family of large language models, the DALL-E series of text-to-image models, and a text-to-video model named Sora. Its release of ChatGPT in November 2022 has been credited with catalyzing widespread interest in generative AI.
Deepfakes are images, videos, or audio which are edited or generated using artificial intelligence tools, and which may depict real or non-existent people. They are a type of synthetic media.
Artificial intelligence art is visual artwork created through the use of an artificial intelligence (AI) program.
Deepfake pornography, or simply fake pornography, is a type of synthetic pornography that is created via altering already-existing pornographic material by applying deepfake technology to the faces of the actors. The use of deepfake porn has sparked controversy because it involves the making and sharing of realistic videos featuring non-consenting individuals, typically female celebrities, and is sometimes used for revenge porn. Efforts are being made to combat these ethical concerns through legislation and technology-based solutions.
DALL·E, DALL·E 2, and DALL·E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as "prompts".
Artbreeder, formerly known as Ganbreeder, is a collaborative, machine learning-based art website. Using the models StyleGAN and BigGAN, the website allows users to generate and modify images of faces, landscapes, and paintings, among other categories.
Prompt engineering is the process of structuring an instruction that can be interpreted and understood by a generative AI model. A prompt is natural language text describing the task that an AI should perform.
Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered to be a part of the ongoing artificial intelligence boom.
A text-to-image model is a machine learning model which takes an input natural language description and produces an image matching that description.
NovelAI is an online cloud-based, SaaS model, and a paid subscription service for AI-assisted storywriting and text-to-image synthesis, originally launched in beta on June 15, 2021, with the image generation feature being implemented later on October 3, 2022. NovelAI is owned and operated by Anlatan, which is headquartered in Wilmington, Delaware.
ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive user prompts and replies are considered at each conversation stage as context.
Alice and Sparkle is a 2022 illustrated children's book published by American technology product designer Ammaar Reshi. Reshi created the book using artificial intelligence programs ChatGPT and Midjourney in one weekend, which sparked controversy among artists, both in regard to the copyright status of the book and the quality of the illustration and text.
Generative artificial intelligence is artificial intelligence capable of generating text, images, videos, or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.
The AI boom, or AI spring, is an ongoing period of rapid progress in the field of artificial intelligence (AI) that started in the late 2010s before gaining international prominence in the early 2020s. Examples include protein folding prediction led by Google DeepMind and generative AI applications developed by OpenAI.
Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft. Based on a large language model, it was launched in February 2023 as Microsoft's primary replacement for the discontinued Cortana.
Théâtre D'opéra Spatial is an image created by Jason Michael Allen with the generative artificial intelligence platform Midjourney. The image won the 2022 Colorado State Fair's annual fine art competition in the photomanipulation category on September 5, becoming one of the first AI-generated images to win such a prize.
In the 2020s, the rapid advancement of deep learning-based generative artificial intelligence models are raising questions about whether copyright infringement occurs when the generative AI is trained or used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there are several pending U.S. lawsuits challenging the use of copyrighted data to train AI models, with defendants arguing that this falls under fair use.
Ideogram is a freemium generative artificial intelligence website founded in 2022. Ideogram uses as a text-to-image model based on the generated prompt given by the user. Ideogram is often considered a major competitor to Midjourney, a similar artificial intelligence tool.