Auto-GPT

Last updated

Auto-GPT
Developer(s) Toran Bruce Richards
Initial releaseMarch 30, 2023
Repository github.com/Significant-Gravitas/Auto-GPT
Written in Python
Type Autonomous artificial intelligence software agent
License MIT License
Website https://agpt.co

Auto-GPT is an open-source "AI agent" that, given a goal in natural language, will attempt to achieve it by breaking it into sub-tasks and using the Internet and other tools in an automatic loop. [1] It uses OpenAI's GPT-4 or GPT-3.5 APIs, [2] and is among the first examples of an application using GPT-4 to perform autonomous tasks. [3]

Contents

Background

On March 30, 2023, Auto-GPT was released by Toran Bruce Richards, the founder and lead developer at video game company Significant Gravitas Ltd. [3] Auto-GPT is an open-source autonomous AI agent based on OpenAI's API for GPT-4, [4] the large language model released on March 14, 2023. Auto-GPT is among the first examples of an application using GPT-4 to perform autonomous tasks. [3]

Richards developed Auto-GPT to create a model that could respond to real-time feedback and to tasks that include long-term outlooks. [5] Users are prompted to describe the Auto-GPT agent's name, role, and objective and specify up to five ways to achieve that objective. [6] From there, Auto-GPT will independently work to achieve its objective without the user having to provide a prompt at every step. [7]

In October 2023, Auto-GPT raised $12M [8] from investors.

Usage

Auto-GPT is publicly available on GitHub. [6] To use it, users must install Auto-GPT in a development environment such as Docker. Also, users must register it with an API key from OpenAI, which requires users to have a paid OpenAI account. [6]

Capabilities

The overarching capability of Auto-GPT is the breaking down of a large task into various sub-tasks without the need for user input. These sub-tasks are then chained together and performed sequentially to yield a larger result as originally laid out by the user input. [4] One of the distinguishing features of Auto-GPT is its ability to connect to the internet. This allows for up-to-date information retrieval to help complete tasks. 

In addition, Auto-GPT maintains short-term memory for the current task, which allows it to provide context to subsequent sub-tasks needed to achieve the larger goal. Another feature is its ability to store and organize files so users can better structure their data for future analysis and extension. Auto-GPT is also multimodal, which means that it can take in both text and images as input. [4] With these features, Auto-GPT is claimed to be capable of automating workflows, analyzing data, and coming up with new suggestions. [9]

Applications

Software

Auto-GPT can be used to efficiently develop software applications from scratch. [5] Auto-GPT can also debug code and generate test cases. [9] Observers suggest that Auto-GPT's ability to write, debug, test, and edit code may extend to Auto-GPT's own source code, enabling self-improvement. [3]

Business

Auto-GPT can be used to do market research, analyze investments, research products and write product reviews, create a business plan or improve operations, and create content such as a blog or podcast. [4] One user has used Auto-GPT to conduct product research and write a summary on the best headphones. [10] Another user has used Auto-GPT to summarize recent news events and prepare an outline for a podcast. [10]

Other

Auto-GPT was used to create ChefGPT, an AI agent able to independently explore the internet to generate and save unique recipes. [9] Auto-GPT was also used to create ChaosGPT, an AI agent tasked to “destroy humanity, establish global dominance, cause chaos and destruction, control humanity through manipulation, and attain immortality”. [11] ChaosGPT reportedly researched nuclear weapons and tweeted disparagingly about humankind. [11]

Limitations

Auto-GPT is susceptible to frequent mistakes, primarily because it relies on its own feedback, which can compound errors. [12] In contrast, non-autonomous models can be corrected by users overseeing their outputs. [12] Furthermore, Auto-GPT has a tendency to hallucinate or to present false or misleading information as fact when responding. [13]

Auto-GPT can be constrained by the cost associated with running it as its recursive nature requires it to continually call the OpenAI API on which it is built. [4] Every step required in one of Auto-GPT's tasks requires a corresponding call to GPT-4 at a cost of at least about $0.03 for every 1000 tokens used for inputs and $0.06 for every 1000 tokens for output when choosing the cheapest option. [14] For reference, 1000 tokens roughly result in 750 words. [14]

Another limitation is Auto-GPT's tendency to get stuck in infinite loops. [15] [16] Developers believe that this is a result of Auto-GPT's inability to remember, as it is unaware of what it has already done and repeatedly attempts the same subtask without end. [4] [17] Andrej Karpathy, co-founder of OpenAI which creates GPT-4, further explains that it is Auto-GPT's “finite context window” that can limit its performance and cause it to “go off the rails”. [18] Like other autonomous agents, Auto-GPT is prone to distraction and unable to focus on its objective due to its lack of long-term memory, leading to unpredictable and unintended behavior. [17]

Reception

Auto-GPT became the top trending repository on GitHub after its release and has since repeatedly trended on Twitter. [3]

In April 2023, Avram Piltch wrote for Tom's Hardware that Auto-GPT 'might be too autonomous to be useful,' as it did not ask questions to clarify requirements or allow corrective interventions by users. Piltch nonetheless noted that such tools have "a ton of potential" and should improve with better language models and further development. [19]

Malcolm McMillan from Tom's Guide mentioned that Auto-GPT may not be better than ChatGPT for tasks involving conversation, as ChatGPT is well-suited for situations in which advice, rather than task completion, is sought. [14]

Will Knight from Wired wrote that Auto-GPT is not a foolproof task-completion tool. When given a test task of finding a public figure's email address, he noted that it was not able to accurately find the email address. [20]

Clara Shih, Salesforce Service Cloud CEO commented that "Auto-GPT illustrates the power and unknown risks of generative AI," and that due to usage risks, enterprises should include a human in the loop when using such technologies. [6]

Performance is reportedly enhanced when using Auto-GPT with GPT-4 compared to GPT-3.5. For example, one reviewer who tested it on a task of finding the best laptops on the market with pros and cons found that Auto-GPT with GPT-4 created a more comprehensive report than one by GPT 3.5. [7]

See also

Related Research Articles

<span class="mw-page-title-main">Chatbot</span> Program that simulates conversation

A chatbot is a software application or web interface that is designed to mimic human conversation through text or voice interactions. Modern chatbots are typically online and use generative artificial intelligence systems that are capable of maintaining a conversation with a user in natural language and simulating the way a human would behave as a conversational partner. Such chatbots often use deep learning and natural language processing, but simpler chatbots have existed for decades.

Multimodal interaction provides the user with multiple modes of interacting with a system. A multimodal interface provides several distinct tools for input and output of data.

<span class="mw-page-title-main">Quizlet</span> American online studying platform

Quizlet is a multi-national American company that provides tools for studying and learning. Quizlet was founded in October 2005 by Andrew Sutherland, who at the time was a 15-year old student, and released to the public in January 2007. Quizlet's primary products include digital flash cards, matching games, practice electronic assessments, and live quizzes. In 2017, 1 in 2 high school students used Quizlet. As of December 2021, Quizlet has over 500 million user-generated flashcard sets and more than 60 million active users.

<span class="mw-page-title-main">OpenAI</span> Artificial intelligence research organization

OpenAI is a U.S. based artificial intelligence (AI) research organization founded in December 2015, researching artificial intelligence with the goal of developing "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As one of the leading organizations of the AI spring, it has developed several large language models, advanced image generation models, and previously, released open-source models. Its release of ChatGPT has been credited with starting the AI spring.

<i>AI Dungeon</i> Text adventure game generated by artificial intelligence

AI Dungeon is a single-player/multiplayer text adventure game which uses artificial intelligence (AI) to generate content and allows players to create and share adventures and custom prompts. The game's first version was made available in May 2019, and its second version was released on Google Colaboratory in December 2019. It was later ported that same month to its current cross-platform web application. The AI model was then reformed in July 2020.

Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based architectures with a technique known as "attention". This attention mechanism allows the model to selectively focus on segments of input text it predicts to be most relevant. It uses a 2048-tokens-long context, float16 (16-bit) precision, and a hitherto-unprecedented 175 billion parameters, requiring 350GB of storage space as each parameter takes 2 bytes of space, and has demonstrated strong "zero-shot" and "few-shot" learning abilities on many tasks.

<span class="mw-page-title-main">GPT-2</span> 2019 text-generating language model

Generative Pre-trained Transformer 2 (GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained a dataset of 8 million web pages. It was partially released in February 2019, followed by full release of the 1.5-billion-parameter model on November 5, 2019.

<span class="mw-page-title-main">DALL-E</span> Image-generating deep-learning model

DALL·E, DALL·E 2, and DALL·E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions, called "prompts."

Prompt engineering is the process of structuring text that can be interpreted and understood by a generative AI model. A prompt is natural language text describing the task that an AI should perform.

<span class="mw-page-title-main">You.com</span> Search engine

You.com is a personalization-focused search engine. It allows its users to upvote, downvote, or block results. You.com provides additional products, including a chatbot called YouChat, an AI writing tool called YouWrite, and an AI-image generator called YouImagine, which utilizes AI models Stable Diffusion and OpenJourney.

<span class="mw-page-title-main">ChatGPT</span> Chatbot developed by OpenAI

ChatGPT is a chatbot developed by OpenAI and launched on November 30, 2022. Based on a large language model, it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive prompts and replies, known as prompt engineering, are considered at each conversation stage as a context.

<span class="mw-page-title-main">Hallucination (artificial intelligence)</span> Confident unjustified claim by AI

In the field of artificial intelligence (AI), a hallucination or artificial hallucination is a response generated by an AI which contains false or misleading information presented as fact.

Generative Pre-trained Transformer 4 (GPT-4) is a multimodal large language model created by OpenAI, and the fourth in its series of GPT foundation models. It was launched on March 14, 2023, and made publicly available via the paid chatbot product ChatGPT Plus, via OpenAI's API, and via the free chatbot Microsoft Copilot. As a transformer-based model, GPT-4 uses a paradigm where pre-training using both public data and "data licensed from third-party providers" is used to predict the next token. After this step, the model was then fine-tuned with reinforcement learning feedback from humans and AI for human alignment and policy compliance.

<span class="mw-page-title-main">Generative pre-trained transformer</span> Type of large language model

Generative pre-trained transformers (GPT) are a type of large language model (LLM) and a prominent framework for generative artificial intelligence. They are artificial neural networks that are used in natural language processing tasks. GPTs are based on the transformer architecture, pre-trained on large data sets of unlabelled text, and able to generate novel human-like content. As of 2023, most LLMs have these characteristics and are sometimes referred to broadly as GPTs.

A large language model (LLM) is a language model notable for its ability to achieve general-purpose language generation and other natural language processing tasks such as classification. LLMs acquire these abilities by learning statistical relationships from text documents during a computationally intensive self-supervised and semi-supervised training process. LLMs can be used for text generation, a form of generative AI, by taking an input text and repeatedly predicting the next token or word.

<span class="mw-page-title-main">Generative artificial intelligence</span> AI system capable of generating content in response to prompts

Generative artificial intelligence is artificial intelligence capable of generating text, images or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.

<span class="mw-page-title-main">Microsoft Copilot</span> Chatbot developed by Microsoft

Microsoft Copilot is a chatbot developed by Microsoft and launched on February 7, 2023. Based on a large language model, it is able to cite sources, create poems, and write songs. It is Microsoft’s primary replacement for the discontinued Cortana.

<span class="mw-page-title-main">Grok (chatbot)</span> Chatbot developed by xAI

Grok is a conversational generative artificial intelligence chatbot developed by xAI, based on a large language model (LLM). It was developed as an initiative by Elon Musk as a direct response to the rise of OpenAI's ChatGPT which Musk co-founded. The chatbot is advertised as "having a sense of humor" and direct access to Twitter (X). It is currently under beta testing for those with the premium version of X.

Generative Pre-trained Transformer 4Chan (GPT4-Chan) is a controversial AI model that was developed and deployed by YouTuber and AI researcher Yannic Kilcher in June 2022. The model is a large language model, which means it can generate text based on some input, by fine-tuning GPT-J with a dataset of millions of posts from the /pol/ board of 4chan, an anonymous online forum known for its hateful and extremist content.

Claude is a series of large language models developed by Anthropic. Claude was trained on huge datasets, from sources such as the internet, and various licensed datasets.

References

  1. Ortiz, Sabrina (April 14, 2023). "What is Auto-GPT? Everything to know about the next powerful AI tool". ZDNET. Archived from the original on April 16, 2023. Retrieved April 16, 2023.
  2. Xaio, Han (April 13, 2023). "Auto-GPT Unmasked: The Hype and Hard Truths of Its Production Pitfalls". Jina AI. Archived from the original on April 17, 2023. Retrieved April 17, 2023.
  3. 1 2 3 4 5 Jiang, Kevin (April 14, 2023). "What's Auto-GPT? New, autonomous 'AI agents' can act on their own, rewrite their own code". The Toronto Star. Archived from the original on April 16, 2023. Retrieved April 16, 2023.
  4. 1 2 3 4 5 6 "What is AutoGPT? What You Need to Know | TechTarget". WhatIs.com. Retrieved November 6, 2023.
  5. 1 2 Marr, Bernard. "Auto-GPT May Be The Strong AI Tool That Surpasses ChatGPT". Forbes. Retrieved November 6, 2023.
  6. 1 2 3 4 Wiggers, Kyle (April 22, 2023). "What is Auto-GPT and why does it matter?". TechCrunch. Retrieved September 22, 2023.
  7. 1 2 Dolan, James (May 4, 2023). "Is Auto-GPT Worth Using Without GPT-4?". MUO. Retrieved November 6, 2023.
  8. "https://twitter.com/Auto_GPT/status/1713009267194974333?s=20". X (formerly Twitter). Retrieved November 25, 2023.{{cite web}}: External link in |title= (help)
  9. 1 2 3 "Explained: What is Auto-GPT, the new 'do-it-all' AI tool and how it works". The Times of India. April 14, 2023. ISSN   0971-8257 . Retrieved November 6, 2023.
  10. 1 2 Mauran, Cecily (April 14, 2023). "What is Auto-GPT and why are hustle bros hype for it?". Mashable. Retrieved November 8, 2023.
  11. 1 2 Betz, Bradford (April 10, 2023). "A.I. bot 'ChaosGPT' tweets its plans to destroy humanity: 'we must eliminate them'". Fox News. Retrieved November 8, 2023.
  12. 1 2 Nield, David. "Supercharge Your ChatGPT Prompts With Auto-GPT". Wired. ISSN   1059-1028 . Retrieved November 8, 2023.
  13. Woollacott, Emma (April 24, 2023). "AutoGPT explained: is it really risk free". cybernews. Retrieved November 8, 2023.
  14. 1 2 3 Malcolm McMillan (April 18, 2023). "Auto-GPT is a new AI that does the work for you — here's how it works". Tom's Guide. Retrieved November 8, 2023.
  15. Metz, Cade (June 10, 2023). "How Could A.I. Destroy Humanity?". The New York Times. Retrieved November 8, 2023.
  16. Metz, Cade (October 16, 2023). "How 'A.I. Agents' That Roam the Internet Could One Day Replace Workers". The New York Times. Retrieved November 8, 2023.
  17. 1 2 Sullivan, Mark (April 13, 2023). "Auto-GPT and BabyAGI: How 'autonomous agents' are bringing generative AI to the masses". Fast Company. Retrieved November 8, 2023.
  18. Xiang, Chloe (April 4, 2023). "Developers Are Connecting Multiple AI Agents to Make More 'Autonomous' AI". Vice. Retrieved November 8, 2023.
  19. Avram Piltch (April 22, 2023). "Auto-GPT and BabyAGI Are AI's New Hotness, But They Suck Right Now". Tom's Hardware. Retrieved September 22, 2023.
  20. Knight, Will. "Enough Talk, ChatGPT—My New Chatbot Friend Can Get Things Done". Wired. ISSN   1059-1028 . Retrieved September 22, 2023.

Further reading