Devin AI

Devin
Other names: Devin AI
Developer(s): Cognition Labs
Website: https://www.cognition-labs.com/introducing-devin

Devin AI is an autonomous artificial intelligence assistant tool created by Cognition Labs. Branded as an "AI software developer", [1] the demo tool is notable for its software development abilities, including plan implementation, source code generation, and benchmark unit testing. The tool has received praise, concern, and skepticism over implications surrounding the future of artificial intelligence and software development.

Background

Devin AI was created by Cognition Labs, a startup company consisting of ten members including CEO Scott Wu and chief technology officer Steven Hao, with funding from Peter Thiel's venture capital firm Founders Fund. [2] [3] Several of the members had participated in competitive coding contests before forming the company. [3] They developed the software by combining large language models akin to OpenAI's GPT-4 with elements of reinforcement learning. [3] According to a Bloomberg article, Cognition Labs claimed that Devin AI represents a "breakthrough in a computer's ability to reason." [3] Devin AI has also been considered part of a trend toward autonomous AI agents that can take direct action to solve problems. [1]

Abilities

Devin AI has been noted for its ability to perform software engineering tasks autonomously. [4] [5] It has been compared to the GitHub Copilot tool; [3] [4] the software can code, debug, plan and solve problems using machine learning techniques. [5] A user prompts Devin AI with a task in natural language, and the software responds by showing its plan while it implements the code. [3] It searches online resources during the process to learn how to complete a task. [4] The software also takes prompts from users during implementation and adjusts its plans accordingly, such as when a user notices an issue or bug. [3] [6]

One application of Devin AI is website creation. A test conducted by Bloomberg found that the tool could create a website within ten minutes and could recreate a Pong website in a similar timeframe. [3] In a demo from Cognition Labs, the tool also created a website based on the Llama 2 language model by generating a plan, source code and benchmark tests. [1] Other examples include building a project to display images from a blog post, and compiling a computer vision model from an Upwork project. [6] In a benchmark test for analyzing the performance of large language models on real-world projects, Devin was found to fix 13.86 percent of encountered issues with no human assistance, compared to an average of 1.96 percent and 4.8 percent for an unassisted and assisted model, respectively. [5] [6]
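As a rough illustration of the scale of the reported benchmark difference, the figures above can be compared as simple ratios. The sketch below uses only the three percentages cited in this section; the variable and function names are illustrative and do not come from Cognition Labs or from the benchmark itself.

```python
# Resolution rates as cited in this section (percent of issues fixed).
DEVIN_RATE = 13.86           # Devin, no human assistance
UNASSISTED_BASELINE = 1.96   # other models, unassisted
ASSISTED_BASELINE = 4.8      # other models, assisted


def times_higher(rate: float, baseline: float) -> float:
    """Return how many times larger `rate` is than `baseline`."""
    return rate / baseline


vs_unassisted = times_higher(DEVIN_RATE, UNASSISTED_BASELINE)
vs_assisted = times_higher(DEVIN_RATE, ASSISTED_BASELINE)

print(f"Versus unassisted models: about {vs_unassisted:.1f} times the rate")
print(f"Versus assisted models: about {vs_assisted:.1f} times the rate")
```

On these figures, Devin's reported rate is roughly seven times that of the unassisted models and just under three times that of the assisted models, although it still corresponds to fewer than one in seven issues being fixed.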

Reception

Devin AI has been met with praise, concern and skepticism from journalists and software engineers. [1] [7] [8] Its announcement on X led to praise from investors and software engineers while spawning various memes. [1] Along with the company, the tool has been met with optimism among AI enthusiasts and anticipation for its public availability. [3] The tool has also been noted for potentially allowing users without a technical background to create projects, and for aiding developers in solving more complex tasks. [4] The Indian Express claimed that its capabilities could streamline the software development process while avoiding human error. [5] CEO Aravind Srinivas of Perplexity.ai offered praise to Devin, claiming that it "seemed to be 'the first demo of any agent, leave alone coding, that seems to cross the threshold' of human capability." [7] After the announcement of Devin AI, investor interest in Cognition Labs grew rapidly. In early 2024, the startup raised $21 million in a deal valuing it at $350 million. It then turned down offers valuing it at $1 billion. According to the Wall Street Journal, the company has been in talks with investors for a deal that would value it at up to $2 billion. [9]

Concern about the software includes its implications for the future of AI and the software development industry. [3] [7] In the wake of layoffs within the tech industry throughout 2023 and 2024, [7] discourse about the tool has involved concerns that it may replace engineers and eliminate lower-level jobs. [4] On social media, various developers criticized the software's capabilities and its potential to prompt layoffs. [1] [7] [8] Skeptics also questioned the tool's efficiency and whether it could complete tasks with more intricate requirements or scenarios that would necessitate human creativity. [5] [7] However, the tool has also been regarded as encouraging software engineers to perform more creative work. [3] [5] Within a few weeks of Devin's debut, open-source alternatives such as OpenDevin and Devika were released. [10]

References

  1. Knight, Will (March 14, 2024). "Forget Chatbots. AI Agents Are the Future". Wired. Retrieved March 18, 2024.
  2. Vance, Ashlee (March 12, 2024). "Sport-Coders Get Serious With Advanced AI Software Assistant". Bloomberg. Retrieved March 18, 2024.
  3. Vance, Ashlee (March 12, 2024). "Gold-Medalist Coders Build an AI That Can Do Their Job for Them". Bloomberg. Retrieved March 18, 2024.
  4. Irwin, Kate (March 12, 2024). "This Software Engineer AI Can Train Other AIs, Code Websites by Itself". PC Mag. Retrieved March 18, 2024.
  5. Jose, Bijin (March 15, 2024). "Meet Devin AI, the world's 'first fully autonomous' AI software engineer". The Indian Express. Retrieved March 18, 2024.
  6. Sharma, Shubham (March 12, 2024). "Cognition emerges from stealth to launch AI software engineer Devin". VentureBeat. Retrieved March 18, 2024.
  7. Chowdhury, Hasan (March 14, 2024). "Software engineers are getting closer to finding out if AI really can make them jobless". Insider. Retrieved March 18, 2024.
  8. Sengupta, Trisha (March 14, 2024). "'World's first AI software engineer' Devin that can write code with a single prompt leaves people with mixed reactions". Hindustan Times. Retrieved March 18, 2024.
  9. Mollman, Steve (March 31, 2024). "AI startup Cognition Labs, founded in November, seeks $2B valuation amid investor frenzy, warnings of bubble". Fortune. Retrieved April 7, 2024.
  10. Sengupta, Trisha (April 2, 2024). "Who is Devika? India's 'AI coder', an alternative to Devin". Hindustan Times. Retrieved April 7, 2024.