Part of a series on |
Artificial intelligence |
---|
The AI boom, [1] [2] or AI spring, [3] [4] is an ongoing period of rapid progress in the field of artificial intelligence (AI) that started in the late 2010s before gaining international prominence in the early 2020s. Examples include protein folding prediction led by Google DeepMind as well as large language models and generative AI applications developed by OpenAI.
In 2012, a University of Toronto research team used artificial neural networks and deep learning techniques to lower the error rate below 25% for the first time during the ImageNet challenge for object recognition in computer vision. The event catalyzed the AI boom later that decade, when many alumni of the ImageNet challenge became leaders in the tech industry. [5] [6] In March 2016, AlphaGo beat Lee Sedol in a five-game match, marking the first time a computer Go program had beaten a 9-dan professional without handicap. This match led to significant increase in public interest in AI. [7] The generative AI race began in earnest in 2016 or 2017 following the founding of OpenAI and earlier advances made in graphical processing units (GPUs), the amount and quality of training data, generative adversarial networks, diffusion models and transformer architectures. [8] [9] In 2018, the Artificial Intelligence Index, an initiative from Stanford University, reported a global explosion of commercial and research efforts in AI. Europe published the largest number of papers in the field that year, followed by China and North America. [10] Technologies such as AlphaFold led to more accurate predictions of protein folding and improved the process of drug development. [11] Economists and lawmakers began to discuss the potential impact of AI more frequently. [12] [13] By 2022, large language models (LLMs) saw increased usage in chatbot applications; text-to-image-models could generate images that appeared to be human-made; [14] and speech synthesis software was able to replicate human speech efficiently. [15]
According to metrics from 2017 to 2021, the United States outranks the rest of the world in terms of venture capital funding, the number of startups, and patents granted in AI. [16] [17] Scientists who have immigrated to the U.S. play an outsized role in the country's development of AI technology. [18] [19] Many of them were educated in China, prompting debates about national security concerns amid worsening relations between the two countries. [20]
Experts have framed AI development as a competition for economic and geopolitical advantage between the United States and China. [21] In 2021, an analyst for the Council on Foreign Relations outlined ways that the U.S. could maintain its position amid progress made by China. [22] [23] In 2023, an analyst at the Center for Strategic and International Studies advocated for the U.S. to use its dominance in AI technology to drive its foreign policy instead of relying on trade agreements. [16]
There have been proposals to use AI to advance radical forms of human life extension. [24]
The AlphaFold 2 score of more than 90 in CASP's global distance test (GDT) is considered a significant achievement in computational biology [25] and great progress towards a decades-old grand challenge of biology. [26] Nobel Prize winner and structural biologist Venki Ramakrishnan called the result "a stunning advance on the protein folding problem", [25] adding that "It has occurred decades before many people in the field would have predicted." [27] [28]
The ability to predict protein structures accurately based on the constituent amino acid sequence is expected to accelerate drug discovery and enable a better understanding of diseases. [26] [29] [30] It went on to note that the AI algorithm could "predict the shape of proteins to within the width of an atom." [30]
Text-to-image models captured widespread public attention when OpenAI announced DALL-E, a transformer system, in January 2021. [31] A successor capable of generating complex and realistic images, DALL-E 2, was unveiled in April 2022. [32] An alternative text-to-image model, Midjourney, was released in July 2022. [33] Another alternative, open-source model Stable Diffusion, released in August 2022. [34]
Following other text-to-image models, language model-powered text-to-video platforms such as Runway, [35] OpenAI's Sora, DAMO, [36] Make-A-Video, [37] Imagen Video [38] and Phenaki [39] can generate video from text as well as image prompts. [40]
GPT-3 is a large language model that was released in 2020 by OpenAI and is capable of generating high-quality human-like text. [41] The tool has been credited with spurring and accelerating the A.I. boom following its release. [42] [43] [44] An upgraded version called GPT-3.5 was used in ChatGPT, which later garnered attention for its detailed responses and articulate answers across many domains of knowledge. [45] A new version called GPT-4 was released on March 14, 2023, and was used in the Microsoft Bing search engine. [46] [47] Other language models have been released, such as PaLM and Gemini by Google and LLaMA by Meta Platforms.
In January 2023, DeepL Write, an AI-based tool to improve monolingual texts, was released. [48] In December 2023, Gemini, the latest model by Google, was unveiled, claiming to beat previous state-of-the-art-model GPT-4 on most benchmarks. [49]
In 2016, Google DeepMind unveiled WaveNet, a deep learning network that produced English, Mandarin, and piano music. [50]
In 2020, the non-commercial freeware artificial intelligence web application 15.ai was released. [51] [52] 15.ai is credited with popularizing AI voice cloning in content creation, being the first publicly available AI vocal synthesis application and having had a significant impact in multiple Internet fandoms, most notably the My Little Pony: Friendship Is Magic and Team Fortress 2 fandoms. [53] [54]
ElevenLabs allowed users to upload voice samples and create audio that sounds similar to the samples. The company was criticized[ by whom? ] after controversial[ among whom? ] statements were generated based on the vocal styles of celebrities, public officials, and other famous individuals, [55] raising concerns[ among whom? ] that the technology could make deepfakes even more convincing. [56] An unofficial song created using the voices of musicians Drake and The Weeknd raised questions[ among whom? ] about the ethics and legality of similar software. [57]
The AI boom may have a profound cultural, philosophical, [58] religious, [59] economic, [60] and social impact, [61] as questions such as AI alignment, [62] qualia , [58] and the development of artificial general intelligence (AGI) [62] became widely prominent topics of popular discussion. [63] AI has the potential to be applied in various fields, including in education, [64] healthcare, [65] and transportation. [66]
During the AI boom, different groups emerged, ranging from the ones that want to accelerate AI development as quickly as possible to those that are more concerned about AI safety and would like to "decelerate". [67]
Big tech viewed the AI boom as both opportunity and threat; Alphabet's Google, for example, realized that ChatGPT could be an innovator's dilemma-like replacement for Google Search. The company merged DeepMind and Google Brain, a rival internal unit, to accelerate its AI research. [68]
The market capitalization of Nvidia, whose GPUs are in high demand to train and use generative AI models, rose to over US$3.3 trillion, making it the world's largest company by market capitalization as of June 19 2024. [69]
In 2023, San Francisco's population increased for the first time in years, with the boom cited as a contributing factor. [70]
Machine learning resources, hardware or software can be bought and licensed off-the-shelf or as cloud platform services. [71] This enables wide and publicly available uses, spreading AI skills. [72] Over half of businesses consider AI to be a top organizational priority and to be the most crucial technological advancement in many decades. [73]
Across industries, generative AI tools are becoming widely available through the AI boom and are increasingly used in businesses across regions. [74] A main area of use is data analytics. Seen as an incremental change, machine learning improves industry performance. [75] Businesses report AI to be most useful in increased process efficiency, improved decision-making and strengthening of existing services and products. [76] Through adoption, AI has already positively influenced revenue generation in multiple business functions. Businesses have experienced revenue increases of up to 16%, mainly in manufacturing, risk management and research and development. [77]
AI and generative AI investments have been increasing with the boom, increasing from $18 billion in 2014 to $119 billion in 2021. Most notably, the share of generative AI investments was around 30% in 2023. [78] Further, generative AI businesses have seen considerable venture capital investments even though regulatory and economic outlooks remain in question. [79]
Tech giants capture the bulk of the monetary gains from AI and act as major suppliers to or customers of private users and other businesses. [80] [81]
Inaccuracy, cybersecurity and intellectual property infringement are considered to be the main risks associated with the boom, although not many actively attempt to mitigate the risk. [82] Large language models have been criticized for reproducing biases inherited from their training data, including discriminatory biases related to ethnicity or gender. [83] As a dual-use technology, AI carries risks of misuse by malicious actors. [84] As AI becomes more sophisticated, it may eventually become cheaper and more efficient than human workers, which could cause technological unemployment and a transition period of economic turmoil. [85] [12] Public reaction to the AI boom has been mixed, with some hailing the new possibilities that AI creates, its sophistication and potential for benefiting humanity; [86] [87] while others denounced it for threatening job security [88] [89] and for giving 'uncanny' or flawed responses. [90]
The commercial AI scene is dominated by American Big Tech companies such as Alphabet Inc., Amazon, Apple Inc., Meta Platforms, and Microsoft, whose investments in this area have surpassed those from U.S.-based venture capitalists. [91] [92] [93] Some of these players already own the vast majority of existing cloud infrastructure, AI chips, and computing power from data centers, allowing them to entrench further in the marketplace. [94] [95] AI-related patents have been hoarded by the largest American tech companies as well, with IBM leading the way with 1,200. [96]
Tech companies such as Meta, OpenAI and Nvidia have been sued by artists, writers, journalists, and software developers for using their work to train AI models. [97] [98] Early generative AI chatbots, such as the GPT-1, used the BookCorpus, and books are still the best source of training data for producing high-quality language models. ChatGPT aroused suspicion that its sources included libraries of pirated content after the chatbot produced detailed summaries of every part of Sarah Silverman's The Bedwetter and verbatim excerpts of paywalled content from The New York Times . [99] [100]
The ability to generate convincing, personalized messages as well as realistic images may facilitate large-scale misinformation, manipulation, and propaganda. [101]
On April 19, 2024, as part of an ongoing feud with fellow rapper Kendrick Lamar, the artist Drake released the diss track Taylor Made Freestyle, which feature generated vocals imitating the voices of Tupac Shakur and Snoop Dogg. [102] Shakur's estate threatened to sue over the use of Shakur's likeness, [103] saying that it constituted a violation of Shakur's personality rights.
On May 20, 2024, following the release of a demo of updates to OpenAI's ChatGPT Voice Mode feature a week earlier, [104] [105] actor Scarlett Johansson issued a statement [106] [107] in relation to the "Sky" voice shown in the demo, accusing OpenAI of producing it to be very similar to her own, and her portrayal of the artificial intelligence voice assistant Samantha in the film Her (2013), despite Johansson refusing an earlier offer from the company to provide her voice for the system. The unnamed voice actress who voiced Sky has stated she was coached to sound like Johansson, and used her natural speaking voice. [108]
Several incidents involving sharing of non-consensual deepfake pornography have occurred. In late January 2024, deepfake images of American musician Taylor Swift proliferated. Several experts have warned that deepfake pornography is more quickly created and disseminated, due to the relative ease of using the technology. [109] Canada introduced federal legislation targeting sharing of non-consensual sexually explicit AI-generated photos; most provinces already had such laws. [110] In the United States, the DEFIANCE Act was introduced in March 2024. [111]
A large amount of electricity is needed to power generative AI products, [112] making it more difficult for companies to achieve net zero emissions. From 2019 to 2024, Google's greenhouse gas emissions increased by 50%. [113]
AI is expected by researchers of the Center for AI Safety to improve the "accessibility, success rate, scale, speed, stealth and potency of cyberattacks", potentially causing "significant geopolitical turbulence" if it reinforces attack more than defense. [84] [114] Concerns have been raised about the potential capability of future AI systems to engineer particularly lethal and contagious pathogens. [115] [116]
The AI boom is said to have started an arms race in which large companies are competing against each other to have the most powerful AI model on the market, with speed and profit prioritized over safety and user protection. [117] [118] [119]
Rapid progress in artificial intelligence has also sparked interest in whether some future AI systems will be sentient or otherwise worthy of moral consideration, [120] and whether they should be granted rights. [121]
Industry leaders have further warned in the statement on AI risk of extinction that humanity might irreversibly lose control over a sufficiently advanced artificial general intelligence. [122]
A chatbot is a software application or web interface designed to have textual or spoken conversations. Modern chatbots are typically online and use generative artificial intelligence systems that are capable of maintaining a conversation with a user in natural language and simulating the way a human would behave as a conversational partner. Such chatbots often use deep learning and natural language processing, but simpler chatbots have existed for decades.
Music and artificial intelligence (AI) is the development of music software programs which use AI to generate music. As with applications in other fields, AI in music also simulates mental tasks. A prominent feature is the capability of an AI algorithm to learn based on past data, such as in computer accompaniment technology, wherein the AI is capable of listening to a human performer and performing accompaniment. Artificial intelligence also drives interactive composition technology, wherein a computer composes music in response to a live performance. There are other AI applications in music that cover not only music composition, production, and performance but also how music is marketed and consumed. Several music player programs have also been developed to use voice recognition and natural language processing technology for music voice control. Current research includes the application of AI in music composition, performance, theory and digital sound processing.
Databricks, Inc. is a global data, analytics, and artificial intelligence (AI) company, founded in 2013 by the original creators of Apache Spark. The company provides a cloud-based platform to help enterprises build, scale, and govern data and AI, including generative AI and other machine learning models.
OpenAI is an American artificial intelligence (AI) research organization founded in December 2015 and headquartered in San Francisco, California. Its stated mission is to develop "safe and beneficial" artificial general intelligence (AGI), which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As a leading organization in the ongoing AI boom, OpenAI is known for the GPT family of large language models, the DALL-E series of text-to-image models, and a text-to-video model named Sora. Its release of ChatGPT in November 2022 has been credited with catalyzing widespread interest in generative AI.
Artificial intelligence art is visual artwork created or enhanced through the use of artificial intelligence (AI) programs.
Synthetic media is a catch-all term for the artificial production, manipulation, and modification of data and media by automated means, especially through the use of artificial intelligence algorithms, such as for the purpose of misleading people or changing an original meaning. Synthetic media as a field has grown rapidly since the creation of generative adversarial networks, primarily through the rise of deepfakes as well as music synthesis, text generation, human image synthesis, speech synthesis, and more. Though experts use the term "synthetic media," individual methods such as deepfakes and text synthesis are sometimes not referred to as such by the media but instead by their respective terminology Significant attention arose towards the field of synthetic media starting in 2017 when Motherboard reported on the emergence of AI altered pornographic videos to insert the faces of famous actresses. Potential hazards of synthetic media include the spread of misinformation, further loss of trust in institutions such as media and government, the mass automation of creative and journalistic jobs and a retreat into AI-generated fantasy worlds. Synthetic media is an applied form of artificial imagination.
Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020.
DALL-E, DALL-E 2, and DALL-E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as prompts.
LaMDA is a family of conversational large language models developed by Google. Originally developed and introduced as Meena in 2020, the first-generation LaMDA was announced during the 2021 Google I/O keynote, while the second generation was announced the following year.
Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco-based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called prompts, similar to OpenAI's DALL-E and Stability AI's Stable Diffusion. It is one of the technologies of the AI boom.
ChatGPT is a generative artificial intelligence chatbot developed by OpenAI and launched in 2022. It is currently based on the GPT-4o large language model (LLM). ChatGPT can generate human-like conversational responses and enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. It is credited with accelerating the AI boom, which has led to ongoing rapid investment in and public attention to the field of artificial intelligence (AI). Some observers have raised concern about the potential of ChatGPT and similar programs to displace human intelligence, enable plagiarism, or fuel misinformation.
In the field of artificial intelligence (AI), a hallucination or artificial hallucination is a response generated by AI that contains false or misleading information presented as fact. This term draws a loose analogy with human psychology, where hallucination typically involves false percepts. However, there is a key difference: AI hallucination is associated with erroneous responses rather than perceptual experiences.
A generative pre-trained transformer (GPT) is a type of large language model (LLM) and a prominent framework for generative artificial intelligence. It is an artificial neural network that is used in natural language processing by machines. It is based on the transformer deep learning architecture, pre-trained on large data sets of unlabeled text, and able to generate novel human-like content. As of 2023, most LLMs had these characteristics and are sometimes referred to broadly as GPTs.
Generative artificial intelligence is a subset of artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models learn the underlying patterns and structures of their training data and use them to produce new data based on the input, which often comes in the form of natural language prompts.
Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft. Based on the GPT-4 series of large language models, it was launched in 2023 as Microsoft's primary replacement for the discontinued Cortana.
Gemini, formerly known as Bard, is a generative artificial intelligence chatbot developed by Google. Based on the large language model (LLM) of the same name, it was launched in 2023 after being developed as a direct response to the rise of OpenAI's ChatGPT. It was previously based on PaLM, and initially the LaMDA family of large language models.
Artificial intelligence detection software aims to determine whether some content was generated using artificial intelligence (AI).
Copyleaks is a plagiarism detection platform that uses artificial intelligence (AI) to identify similar and identical content across various formats.
As artificial intelligence (AI) has become more mainstream, there is growing concern about how this will influence elections. Potential targets of AI include election processes, election offices, election officials and election vendors.
Apple Intelligence is an artificial intelligence developed by Apple Inc. Relying on a combination of on-device and server processing, it was announced on June 10, 2024, at WWDC 2024, as a built-in feature of Apple's iOS 18, iPadOS 18, and macOS Sequoia, which were announced alongside Apple Intelligence. Apple Intelligence is free for all users with supported devices. It launched for developers and testers on July 29, 2024, in U.S. English, with the iOS 18.1, macOS 15.1, and iPadOS 18.1 developer betas, released partially in October 2024, and will fully launch by 2025. UK, Australia, Canada, New Zealand, and South African localized versions of English will have support by the end of 2024, while Chinese, English (India), English (Singapore), French, German, Italian, Japanese, Korean, Portuguese, Spanish, and Vietnamese will be added over the course of 2025.
{{cite book}}
: CS1 maint: date and year (link)