GitHub Copilot

GitHub Copilot
GitHub Copilot
	Logo
Developers	GitHub ; OpenAI
Initial release	October 2021;4 years ago
Stable release	1.7.4421
Operating system	Microsoft Windows, Linux, macOS, Web
Website	github.com/features/copilot/

Last updated November 05, 2025

GitHub Copilot is a code completion and programming AI-assistant developed by GitHub and OpenAI that assists users of Visual Studio Code, Visual Studio, Neovim, and JetBrains integrated development environments (IDEs) by autocompleting code.^[1] Currently available by subscription to individual developers and to businesses, the generative artificial intelligence software was first announced by GitHub on 29 June 2021.^[2] Users can choose the large language model used for generation.^[3]

History

On June 29, 2021, GitHub announced GitHub Copilot for technical preview in the Visual Studio Code development environment.^[1]^[4] GitHub Copilot was released as a plugin on the JetBrains marketplace on October 29, 2021.^[5] October 27, 2021, GitHub released the GitHub Copilot Neovim plugin as a public repository.^[6] GitHub announced Copilot's availability for the Visual Studio 2022 IDE on March 29, 2022.^[7] On June 21, 2022, GitHub announced that Copilot was out of "technical preview", and is available as a subscription-based service for individual developers.^[8]

GitHub Copilot is the evolution of the "Bing Code Search" plugin for Visual Studio 2013, which was a Microsoft Research project released in February 2014.^[9] This plugin integrated with various sources, including MSDN and Stack Overflow, to provide high-quality contextually relevant code snippets in response to natural language queries.^[10]

Features

When provided with a programming problem in natural language, Copilot is capable of generating solution code.^[11] It is also able to describe input code in English and translate code between programming languages.^[11]

Copilot enables developers to utilize a variety of Large Language Models (LLMs) from leading LLM providers, including various versions of OpenAI's GPT (including GPT-5 and GPT-5 Mini^[12]), Anthropic's Sonnet, and Google's Gemini.^[13]

According to its website, GitHub Copilot includes assistive features for programmers, such as the conversion of code comments to runnable code, and autocomplete for chunks of code, repetitive sections of code, and entire methods and/or functions.^[2]^[14] GitHub reports that Copilot's autocomplete feature is accurate roughly half of the time; with some Python function header code, for example, Copilot correctly autocompleted the rest of the function body code 43% of the time on the first try and 57% of the time after ten attempts.^[2]

GitHub states that Copilot's features allow programmers to navigate unfamiliar coding frameworks and languages by reducing the amount of time users spend reading documentation.^[2]

Implementation

GitHub Copilot was initially powered by the OpenAI Codex,^[15] which is a modified, production version of GPT-3.^[16] The Codex model is additionally trained on gigabytes of source code in a dozen programming languages. Copilot's OpenAI Codex was trained on a selection of the English language, public GitHub repositories, and other publicly available source code.^[2] This includes a filtered dataset of 159 gigabytes of Python code sourced from 54 million public GitHub repositories.^[17] OpenAI's GPT-3 is licensed exclusively to Microsoft, GitHub's parent company.^[18]

In November 2023, Copilot Chat was updated to use OpenAI's GPT-4 model.^[19] In 2024, Copilot began allowing users to choose between different large language models, such as GPT-4o or Claude 3.5.^[3]

On 6 February 2025, GitHub announced "agent mode", which is a more autonomous mode of operation for the Copilot. Given a programming task, it attempts to accomplish it by executing commands on a Visual Studio instance on the user's computer. The agent mode can connect to different LLMs, including GPT-4o, o1, o3-mini, Claude 3.5 Sonnet, and Gemini 2.0 Flash.^[20]

On 17 May 2025, GitHub announced "coding agent", which is a more autonomous mode of operation for the Copilot. The user would assign a task or issue to Copilot, which would then initialize a development environment in the cloud (powered by GitHub Actions) and perform the request. It would compose a draft pull request and pushes commits to the draft as it works. After accomplishing the request, it tags the user for code review.^[21] It is essentially an asynchronous version of agent mode.

Reception

Since Copilot's release, there have been concerns with its security and educational impact, as well as licensing controversy surrounding the code it produces. With the nature of large language models relying on massive datasets scraped from public sources, this makes it difficult to ensure that the data used for training is fully accurate, unbiased, and ethically sourced. Including Copilot, which is based off of large language models, is no different. Copilot will generate code derived from vast datasets that may include copyrighted or insecure examples. According to a study in December 2021, Copilot was given 89 scenarios that could replicate a MITRE CWE to auto-fill, creating a total of 1689 programs, in which 40% of code auto-filled by Copilot was deemed vulnerable. ^[22]^[11]^[23]

Licensing controversy

While GitHub CEO Nat Friedman stated in June 2021 that "training ML systems on public data is fair use",^[24] a class-action lawsuit filed in November 2022 called this "pure speculation", asserting that "no Court has considered the question of whether 'training ML systems on public data is fair use.'"^[25] The lawsuit from Joseph Saveri Law Firm, LLP challenges the legality of Copilot on several claims, ranging from breach of contract with GitHub's users, to breach of privacy under the CCPA for sharing PII.^[26]^[25]

GitHub admits that a small proportion of the tool's output may be copied verbatim, which has led to fears that the output code is insufficiently transformative to be classified as fair use and may infringe on the copyright of the original owner.^[22] In June 2022, the Software Freedom Conservancy announced it would end all uses of GitHub in its own projects,^[27] accusing Copilot of ignoring code licenses used in training data.^[28] In a customer-support message, GitHub stated that "training machine learning models on publicly available data is considered fair use across the machine learning community",^[25] but the class action lawsuit called this "false" and additionally noted that "regardless of this concept's level of acceptance in 'the machine learning community,' under Federal law, it is illegal".^[25]

Privacy concerns

The Copilot service is cloud-based and requires continuous communication with the GitHub Copilot servers.^[29] This opaque architecture has fueled concerns over telemetry and data mining of individual keystrokes.^[30]^[31]

In late 2022 GitHub Copilot has been accused of emitting Quake game source code, with no author attribution or license.^[32]

Security

Security concerns surrounding GitHub Copilot have been a key focus of both academic and industry discussion since its release, as researchers seek to understand whether AI-generated code introduces vulnerabilities or encourages insecure programming practices. Earlier analyses such as the 2021 study “Asleep at the Keyboard?” by Pearce et al. tested Copilot against 89 security-sensitive programming scenarios and found that approximately 40 percent of the code completions contained at least one vulnerability matching Common Weakness Enumeration (CWE) categories, including issues like hardcoded credentials, SQL injection, and buffer overflows.^[33]

A 2023 user study by Sandoval et al. investigated the security implications of using a large-language-model-based code assistant (specifically an instance of OpenAI Codex) in a controlled programming task. In the study, 58 computer-science students were randomly assigned either to a “control” group or an “assisted” group with access to the AI suggestions, and were asked to implement a singly-linked-list “shopping list” program in C — a language chosen because of its susceptibility to memory-safety bugs such as buffer overflows and null-pointer dereferences. The authors found that the assisted group produced slightly better functional results (more code compiled, more tasks completed) and, crucially, that the incidence rate of serious security-relevant bugs (as measured by Common Weakness Enumeration (CWE) categories) in the AI-assisted group was no more than ~10 % higher than in the control group, thereby not indicating a large additional security risk from using the assistant under these conditions. Moreover, when analysing origins of bugs, about 63 % originated from human-written portions and about 36 % from accepted AI suggestions, suggesting that the majority of vulnerabilities still stemmed from human edits rather than the AI directly. The authors caution, however, that the study is limited to a low-level C context with students rather than professional developers, and that different languages, tasks, or threat models may yield different outcomes.^[34]

Following these studies, discussion in the developer and cybersecurity communities has emphasized the trade-off between productivity and safety when using AI code assistants. While Copilot can accelerate coding and improve completeness, researchers note that AI-generated code may still contain subtle logic errors or insecure defaults. Proposals to mitigate these issues include integrating static-analysis or linting tools into Copilot’s workflow, enhancing training datasets with secure-coding examples, and educating users on the need to review AI-generated code with the same rigor as human-written code.

References

1 2 Gershgorn, Dave (29 June 2021). "GitHub and OpenAI launch a new AI tool that generates its own code". The Verge . Retrieved 6 July 2021.
1 2 3 4 5 "GitHub Copilot · Your AI pair programmer". GitHub Copilot. Retrieved 7 April 2022.
1 2 Warren, Tom (29 October 2024). "GitHub Copilot will support models from Anthropic, Google, and OpenAI". The Verge. Retrieved 28 January 2025.
↑ "Introducing GitHub Copilot: your AI pair programmer". The GitHub Blog. 29 June 2021. Retrieved 7 April 2022.
↑ "GitHub Copilot - IntelliJ IDEs Plugin | Marketplace". JetBrains Marketplace. Retrieved 7 April 2022.
↑ Copilot.vim, GitHub, 7 April 2022, retrieved 7 April 2022
↑ "GitHub Copilot now available for Visual Studio 2022". The GitHub Blog. 29 March 2022. Retrieved 7 April 2022.
↑ "GitHub Copilot is generally available to all developers". The GitHub Blog. 21 June 2022. Retrieved 21 June 2022.
↑ Lardinois, Frederic (17 February 2014). "Microsoft Launches Smart Visual Studio Add-On For Code Snippet Search". TechCrunch. Retrieved 5 September 2023.
↑ "Bing Code Search". Microsoft Research. 11 February 2014. Retrieved 5 September 2023.
1 2 3 Finnie-Ansley, James; Denny, Paul; Becker, Brett A.; Luxton-Reilly, Andrew; Prather, James (14 February 2022). "The Robots Are Coming: Exploring the Implications of OpenAI Codex on Introductory Programming". Australasian Computing Education Conference. ACE '22. New York, NY, USA: Association for Computing Machinery. pp. 10–19. doi: 10.1145/3511861.3511863 . ISBN 978-1-4503-9643-1. S2CID 246681316.
↑ "OpenAI GPT-5 and GPT-5 mini are now generally available in GitHub Copilot - GitHub Changelog". The GitHub Blog. 9 September 2025. Retrieved 10 September 2025.
↑ VibeCentral (21 May 2025). "Navigating the AI Coding Landscape: A Comparative Analysis of GitHub Copilot's LLMs for Optimal Developer Productivity". VibeCentral. Retrieved 23 May 2025.
↑ Sobania, Dominik; Schweim, Dirk; Rothlauf, Franz (2022). "A Comprehensive Survey on Program Synthesis with Evolutionary Algorithms". IEEE Transactions on Evolutionary Computation. 27: 82–97. doi:10.1109/TEVC.2022.3162324. ISSN 1941-0026. S2CID 247721793.
↑ Krill, Paul (12 August 2021). "OpenAI offers API for GitHub Copilot AI model". InfoWorld. Retrieved 7 April 2022.
↑ "OpenAI Releases GPT-3, The Largest Model So Far". Analytics India Magazine. 3 June 2020. Retrieved 7 April 2022.
↑ "OpenAI Announces 12 Billion Parameter Code-Generation AI Codex". InfoQ. Retrieved 7 April 2022.
↑ "OpenAI is giving Microsoft exclusive access to its GPT-3 language model". MIT Technology Review. Retrieved 7 April 2022.
↑ "GitHub Copilot – November 30th Update · GitHub Changelog". 30 November 2023.
↑ Dohmke, Thomas (6 February 2025). "GitHub Copilot: The agent awakens". The GitHub Blog. Retrieved 31 July 2025.
↑ Dohmke, Thomas (19 May 2025). "GitHub Copilot: Meet the new coding agent". The GitHub Blog. Retrieved 31 July 2025.
1 2 "GitHub's automatic coding tool rests on untested legal ground". The Verge. 7 July 2021. Retrieved 11 July 2021.
↑ Pearce, Hammond; Ahmad, Baleegh; Tan, Benjamin; Dolan-Gavitt, Brendan; Karri, Ramesh (16 December 2021). "Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions". arXiv: 2108.09293 [cs.CR].
↑ Nat Friedman [@natfriedman] (29 June 2021). "In general: (1) training ML systems on public data is fair use" (Tweet). Archived from the original on 30 June 2021. Retrieved 23 February 2023– via Twitter.
1 2 3 4 Butterick, Matthew (3 November 2022). "GitHub Copilot litigation" (PDF). githubcopilotlitigation.com. Joseph Saveri Law Firm. Archived from the original on 3 November 2022. Retrieved 12 February 2023. 22-cv-06823-JST
↑ Vincent, James (8 November 2022). "The lawsuit that could rewrite the rules of AI copyright". The Verge. Retrieved 7 December 2022.
↑ "Give Up GitHub: The Time Has Come!". Software Freedom Conservancy. Retrieved 8 September 2022.
↑ "If Software is My Copilot, Who Programmed My Software?". Software Freedom Conservancy. Retrieved 8 September 2022.
↑ "GitHub Copilot - Your AI pair programmer". GitHub. Retrieved 18 October 2022.
↑ "CoPilot: Privacy & DataMining". GitHub. Retrieved 18 October 2022.
↑ Stallman, Richard. "Who does that server really serve?". gnu.org. Retrieved 18 October 2022.
↑ "GitHub Copilot: The Latest in the List of AI Generative Models Facing Copyright Allegations". Analytics India Magazine. 23 October 2022. Archived from the original on 22 March 2023. Retrieved 23 March 2023.
↑ Pearce, Hammond; Ahmad, Baleegh; Tan, Benjamin; Dolan-Gavitt, Brendan; Karri, Ramesh (16 December 2021). "Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions". arXiv:2108.09293 [cs.CR].
↑ Sandoval G, Pearce H, Nys T, Karri R, Garg S, Dolan-Gavitt B. (2023). *Lost at C: A User Study on the Security Implications of Large Language Model Code Assistants.* In 32nd USENIX Security Symposium, 2205-2222. arXiv:2208.09727. https://arxiv.org/abs/2208.09727

External links

Official website

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:0-1] 1 2 Gershgorn, Dave (29 June 2021). "GitHub and OpenAI launch a new AI tool that generates its own code". The Verge . Retrieved 6 July 2021.

[:2-2] 1 2 3 4 5 "GitHub Copilot · Your AI pair programmer". GitHub Copilot. Retrieved 7 April 2022.

[:3-3] 1 2 Warren, Tom (29 October 2024). "GitHub Copilot will support models from Anthropic, Google, and OpenAI". The Verge. Retrieved 28 January 2025.

[4] "Introducing GitHub Copilot: your AI pair programmer". The GitHub Blog. 29 June 2021. Retrieved 7 April 2022.

[5] "GitHub Copilot - IntelliJ IDEs Plugin | Marketplace". JetBrains Marketplace. Retrieved 7 April 2022.

[6] Copilot.vim, GitHub, 7 April 2022, retrieved 7 April 2022

[7] "GitHub Copilot now available for Visual Studio 2022". The GitHub Blog. 29 March 2022. Retrieved 7 April 2022.

[8] "GitHub Copilot is generally available to all developers". The GitHub Blog. 21 June 2022. Retrieved 21 June 2022.

[9] Lardinois, Frederic (17 February 2014). "Microsoft Launches Smart Visual Studio Add-On For Code Snippet Search". TechCrunch. Retrieved 5 September 2023.

[10] "Bing Code Search". Microsoft Research. 11 February 2014. Retrieved 5 September 2023.

[:1-11] 1 2 3 Finnie-Ansley, James; Denny, Paul; Becker, Brett A.; Luxton-Reilly, Andrew; Prather, James (14 February 2022). "The Robots Are Coming: Exploring the Implications of OpenAI Codex on Introductory Programming". Australasian Computing Education Conference. ACE '22. New York, NY, USA: Association for Computing Machinery. pp. 10–19. doi: 10.1145/3511861.3511863 . ISBN 978-1-4503-9643-1. S2CID 246681316.

[12] "OpenAI GPT-5 and GPT-5 mini are now generally available in GitHub Copilot - GitHub Changelog". The GitHub Blog. 9 September 2025. Retrieved 10 September 2025.

[13] VibeCentral (21 May 2025). "Navigating the AI Coding Landscape: A Comparative Analysis of GitHub Copilot's LLMs for Optimal Developer Productivity". VibeCentral. Retrieved 23 May 2025.

[14] Sobania, Dominik; Schweim, Dirk; Rothlauf, Franz (2022). "A Comprehensive Survey on Program Synthesis with Evolutionary Algorithms". IEEE Transactions on Evolutionary Computation. 27: 82–97. doi:10.1109/TEVC.2022.3162324. ISSN 1941-0026. S2CID 247721793.

[15] Krill, Paul (12 August 2021). "OpenAI offers API for GitHub Copilot AI model". InfoWorld. Retrieved 7 April 2022.

[16] "OpenAI Releases GPT-3, The Largest Model So Far". Analytics India Magazine. 3 June 2020. Retrieved 7 April 2022.

[17] "OpenAI Announces 12 Billion Parameter Code-Generation AI Codex". InfoQ. Retrieved 7 April 2022.

[18] "OpenAI is giving Microsoft exclusive access to its GPT-3 language model". MIT Technology Review. Retrieved 7 April 2022.

[19] "GitHub Copilot – November 30th Update · GitHub Changelog". 30 November 2023.

[20] Dohmke, Thomas (6 February 2025). "GitHub Copilot: The agent awakens". The GitHub Blog. Retrieved 31 July 2025.

[21] Dohmke, Thomas (19 May 2025). "GitHub Copilot: Meet the new coding agent". The GitHub Blog. Retrieved 31 July 2025.

[Verge_legal-22] 1 2 "GitHub's automatic coding tool rests on untested legal ground". The Verge. 7 July 2021. Retrieved 11 July 2021.

[:4-23] Pearce, Hammond; Ahmad, Baleegh; Tan, Benjamin; Dolan-Gavitt, Brendan; Karri, Ramesh (16 December 2021). "Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions". arXiv: 2108.09293 [cs.CR].

[24] Nat Friedman [@natfriedman] (29 June 2021). "In general: (1) training ML systems on public data is fair use" (Tweet). Archived from the original on 30 June 2021. Retrieved 23 February 2023– via Twitter.

[class_action_suit-25] 1 2 3 4 Butterick, Matthew (3 November 2022). "GitHub Copilot litigation" (PDF). githubcopilotlitigation.com. Joseph Saveri Law Firm. Archived from the original on 3 November 2022. Retrieved 12 February 2023. 22-cv-06823-JST

[Verge_class_action-26] Vincent, James (8 November 2022). "The lawsuit that could rewrite the rules of AI copyright". The Verge. Retrieved 7 December 2022.

[27] "Give Up GitHub: The Time Has Come!". Software Freedom Conservancy. Retrieved 8 September 2022.

[28] "If Software is My Copilot, Who Programmed My Software?". Software Freedom Conservancy. Retrieved 8 September 2022.

[29] "GitHub Copilot - Your AI pair programmer". GitHub. Retrieved 18 October 2022.

[30] "CoPilot: Privacy & DataMining". GitHub. Retrieved 18 October 2022.

[31] Stallman, Richard. "Who does that server really serve?". gnu.org. Retrieved 18 October 2022.

[32] "GitHub Copilot: The Latest in the List of AI Generative Models Facing Copyright Allegations". Analytics India Magazine. 23 October 2022. Archived from the original on 22 March 2023. Retrieved 23 March 2023.

[Pearce2021-33] Pearce, Hammond; Ahmad, Baleegh; Tan, Benjamin; Dolan-Gavitt, Brendan; Karri, Ramesh (16 December 2021). "Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions". arXiv:2108.09293 [cs.CR].

[Sandoval2023-34] Sandoval G, Pearce H, Nys T, Karri R, Garg S, Dolan-Gavitt B. (2023). *Lost at C: A User Study on the Security Implications of Large Language Model Code Assistants.* In 32nd USENIX Security Symposium, 2205-2222. arXiv:2208.09727. https://arxiv.org/abs/2208.09727

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]