Devin AI

Developer(s) Cognition Labs
Available in
  • Python (Frontend)
  • Language model training weights (Middleware)
License Proprietary (SaaS), using open source software (Ubuntu) in part
Website preview.devin.ai

Devin AI is an autonomous artificial intelligence assistant tool created by Cognition Labs. Branded as an "AI software developer", [1] the demo tool is designed to complete software development tasks. The tool has received praise, concern, and skepticism over implications surrounding the future of artificial intelligence and software development.

Background

Devin AI was created by Cognition Labs, a startup company consisting of ten members including CEO Scott Wu and chief technology officer Steven Hao, with funding from Peter Thiel's Founders Fund. [2] [3] Several of the members had participated in competitive coding contests before forming the company. [3] The members developed the software by combining the training of large language models akin to OpenAI's GPT-4 with aspects of reinforcement learning. [3] According to a Bloomberg article, Cognition Labs claimed that Devin AI represents a "breakthrough in a computer's ability to reason." [3] Devin AI has also been considered part of a trend toward autonomous AI agents that can take direct action to solve problems. [1]

Abilities

Devin AI has been noted for its ability to perform software engineering tasks autonomously. [4] [5] The software, which has been compared to the GitHub Copilot tool, [3] [4] can code, debug, plan, and solve problems using machine learning techniques. [5] A user prompts the software with a task in natural language, and the software responds by showing its plan while implementing the code. [3] It searches online resources during the process to learn how to complete a task. [4] The software also takes prompts from users during implementation and adjusts its plans accordingly, such as when a user notices an issue or bug. [3] [6]
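The prompt, plan, implement, and adjust cycle described above can be illustrated with a minimal Python sketch. It is not Devin's actual interface or implementation, which is proprietary: the `AgentSession` class, its methods, and the fixed three-step plan are hypothetical placeholders chosen only to show how user feedback given mid-run might be folded into the remaining plan.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class AgentSession:
    """Hypothetical stand-in for an agent session; not Devin's real API."""
    task: str
    plan: list = field(default_factory=list)
    log: list = field(default_factory=list)

    def make_plan(self) -> None:
        # A real agent would ask a language model for a plan;
        # these three steps are fixed placeholders.
        self.plan = [
            f"research: {self.task}",
            f"implement: {self.task}",
            f"test: {self.task}",
        ]

    def add_feedback(self, note: str) -> None:
        # User feedback is appended to the remaining plan as an extra step.
        self.plan.append(f"fix: {note}")

    def run(self, feedback: Optional[dict] = None) -> list:
        """Execute plan steps in order.

        `feedback` maps a step index to a user note that arrives
        right after that step has been carried out.
        """
        feedback = feedback or {}
        self.make_plan()
        step = 0
        while step < len(self.plan):
            self.log.append(self.plan[step])
            if step in feedback:
                self.add_feedback(feedback[step])
            step += 1
        return self.log


session = AgentSession("build a Pong website")
for entry in session.run({1: "paddle does not move"}):
    print(entry)
```

In this sketch the user's note arrives after the implementation step, so the agent finishes its planned test step and then performs the added fix step.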

One application of Devin AI is website creation. A test conducted by Bloomberg found that the tool could create a website within ten minutes and could recreate a Pong website in a similar timeframe. [3] In a demo from Cognition Labs, the tool also created a website based on the Llama 2 language model by generating a plan, source code, and benchmark tests. [1] Other examples include building a project to display images from a blog post, and compiling a computer vision model from an Upwork project. [6] In a benchmark test for analyzing the performance of large language models on real-world projects, Devin fixed 13.86 percent of the issues it encountered with no human assistance, compared to an average of 1.96 percent for unassisted models and 4.8 percent for assisted models. [5] [6]

Later revisions of Devin added multi-agent operation, in which one AI agent dispatches tasks to other AI agents. [7]
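The general idea of one agent dispatching tasks to others can be sketched as a manager and worker pattern. The following Python example is an illustration only and makes no claim about how Devin implements this internally; `manager_agent` and `worker_agent` are hypothetical names, and the worker simply returns a placeholder string instead of doing real work.

```python
from concurrent.futures import ThreadPoolExecutor


def worker_agent(subtask: str) -> str:
    # Placeholder for a worker agent; a real one would write and test code.
    return f"done: {subtask}"


def manager_agent(subtasks: list, max_workers: int = 2) -> list:
    """Dispatch each subtask to a worker and collect results in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map returns results in the same order as the inputs,
        # even if workers finish at different times.
        return list(pool.map(worker_agent, subtasks))


for result in manager_agent(["update docs", "fix lint errors"]):
    print(result)
```

A thread pool is used here only because it is the simplest standard-library way to run several workers at once; a real system could equally use separate processes or remote machines.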

Reception

Devin AI has been met with praise, concern and skepticism from journalists and software engineers. [1] [8] [9] Its announcement on X led to praise from investors and software engineers while spawning various memes. [1] Along with the company, the tool has drawn optimism among AI enthusiasts and anticipation for its public availability. [3] The tool has also been noted for potentially allowing users from non-technical backgrounds to create projects, and for aiding developers in solving more complex tasks. [4] The Indian Express claimed that its capabilities could streamline the software development process while avoiding human error. [5] Aravind Srinivas, CEO of Perplexity.ai, praised Devin, saying it seemed to be "the first demo of any agent, leave alone coding, that seems to cross the threshold" of human capability. [8] After the release of Devin AI, Cognition Labs experienced growth and increasing interest. In early 2024, the startup raised $21 million in a deal valuing it at $350 million. It then turned down offers valuing it at $1 billion. According to the Wall Street Journal, the company has been in talks with investors for a deal that would value it at up to $2 billion. [10]

Concerns about the software include its implications for the future of AI and the software development industry. [3] [8] In the wake of layoffs within the tech industry throughout 2023 and 2024, [8] discourse about the tool has involved concerns that it may replace engineers and eliminate lower-level jobs. [4] On social media, various developers criticized the software's capabilities and its potential to prompt layoffs. [1] [8] [9] Skeptics also questioned the tool's efficiency and whether it could complete tasks with more intricate requirements or scenarios that call for human creativity. [5] [8] Further skepticism regarding its accuracy emerged following the tool's promotional videos, such as the one showing Devin AI carrying out the Upwork project; YouTube channels such as Internet of Bugs and Computer Vision Project criticized the tool for failing to deliver on the project request, instead writing, testing, and debugging code irrelevant to the Upwork request. [11] However, the tool has also been regarded as encouraging software engineers to perform more creative work. [3] [5] Following Devin's debut, various AI software engineering models have been released, including free and open source alternatives such as OpenDevin (now called OpenHands) [12] and Devika, [13] as well as Genie by San Francisco-based startup Cosine. [14]

References

  1. Knight, Will (March 14, 2024). "Forget Chatbots. AI Agents Are the Future". Wired. Retrieved March 18, 2024.
  2. Vance, Ashlee (March 12, 2024). "Sport-Coders Get Serious With Advanced AI Software Assistant". Bloomberg. Retrieved March 18, 2024.
  3. Vance, Ashlee (March 12, 2024). "Gold-Medalist Coders Build an AI That Can Do Their Job for Them". Bloomberg. Retrieved March 18, 2024.
  4. Irwin, Kate (March 12, 2024). "This Software Engineer AI Can Train Other AIs, Code Websites by Itself". PC Mag. Retrieved March 18, 2024.
  5. Jose, Bijin (March 15, 2024). "Meet Devin AI, the world's 'first fully autonomous' AI software engineer". The Indian Express. Retrieved March 18, 2024.
  6. Sharma, Shubham (March 12, 2024). "Cognition emerges from stealth to launch AI software engineer Devin". VentureBeat. Retrieved March 18, 2024.
  7. "Let Devin create Devins with MultiDevin".
  8. Chowdhury, Hasan (March 14, 2024). "Software engineers are getting closer to finding out if AI really can make them jobless". Insider. Retrieved March 18, 2024.
  9. Sengupta, Trisha (March 14, 2024). "'World's first AI software engineer' Devin that can write code with a single prompt leaves people with mixed reactions". Hindustan Times. Retrieved March 18, 2024.
  10. Mollman, Steve (March 31, 2024). "AI startup Cognition Labs, founded in November, seeks $2B valuation amid investor frenzy, warnings of bubble". Fortune. Retrieved April 7, 2024.
  11. Levine, Gloria (April 16, 2024). "'First AI Software Engineer' Creators Are Accused of Lying". 80 Level. Retrieved May 19, 2024.
  12. Jindal, Siddarth (April 8, 2024). "Top 6 Devin Alternatives to Automate Your Coding Tasks". Analytics India Magazine. Retrieved May 19, 2024.
  13. Sengupta, Trisha (April 2, 2024). "Who is Devika? India's 'AI coder', an alternative to Devin". Hindustan Times. Retrieved April 7, 2024.
  14. Franzen, Carl (August 12, 2024). "Move over, Devin: Cosine's Genie takes the AI coding crown". VentureBeat. Retrieved September 27, 2024.