GPTZero

Last updated

GPTZero
Developer(s) Edward Tian
Alex Cui
Yazan Mimi [1]
Initial release3 January 2023;15 months ago (2023-01-03)
Written in Python
Platform Cloud computing
Website gptzero.me

GPTZero is an artificial intelligence detection software developed to identify artificially generated text, such as that produced by large language models. [2] [3] [4] [5] [6]

Contents

While GPTZero has received positive coverage for its efforts to prevent academic dishonesty, its reported outputs of false positives has been source of criticism. [7]

History

GPTZero was developed by Edward Tian, a Princeton University undergraduate student, and launched online in January 2023 in response to concerns about AI-generated usage in academic plagiarism. [8] [9] GPTZero has raised over 3.5 million dollars in seed funding. [10] [11]

In the first week of its release, the GPTZero experienced 30,000 uses, which led to a crash. It is supported by the web app company Streamlit, who allocated more server resources in response. [12]

Mechanism

GPTZero uses qualities it terms perplexity and burstiness to attempt determining if a passage was written by a AI. [13] According to the company, perplexity is how random the text in the sentence is, and whether the way the sentence is constructed is unusual or "surprising" for the application. It relies on language models, and the more such models, the higher the chance that a person did not write the text. [14] In contrast, burstiness compares sentences with each other, determining their similarity. Human text is more discontinuous, meaning humans tend to write with more sentence variation than AI. [9]

News website Ars Technica commented that humans can still write sentences in a highly regular way, leading to false positives. The writer, Benj Edwards, went on to state that the perplexity score only concerns itself with what is "surprising" for the AI, leading to instances where highly common texts, such as the US Constitution, are labeled as AI-generated. [15]

Use cases and applications

The academic community has attempted using GPTZero to tackle concerns about AI-generated content for plagiarism. [16] [14] [13] Educational institutions, including Princeton University, have discussed the use of GPTZero to combat AI-generated content in academic settings, with mixed opinions. [9] [17] In October 2023, GPTZero had partnered with the American Federation of Teachers. [18]

Efficacy

In a March 2023 a paper named "Can AI-Generated Text be Reliably Detected?", [19] computer scientists Vinu Sankar Sadasivan, Aounon Kumar, Sriram Balasubramanian, Wenxiao Wang, and Soheil Feizi from the University of Maryland demonstrate empirically and theoretically that several AI-text detectors are not reliable in practical scenarios. [20] [21]

Tech website Futurism tested the tool, and said that while the "results are impressive", based on its error rate, teachers relying the tool would end up "falsely accusing nearly 20 percent of [innocent students] of academic misconduct". [22]

The Washington Post have noted GPTZero suffers from false positives, emphasizing that "even a small “false positive” error rate means that some students could be wrongly accused [of academic misconduct]". [7]

See also

Related Research Articles

<span class="mw-page-title-main">Chatbot</span> Program that simulates conversation

A chatbot is a software application or web interface that is designed to mimic human conversation through text or voice interactions. Modern chatbots are typically online and use generative artificial intelligence systems that are capable of maintaining a conversation with a user in natural language and simulating the way a human would behave as a conversational partner. Such chatbots often use deep learning and natural language processing, but simpler chatbots have existed for decades.

<span class="mw-page-title-main">Turnitin</span> Internet-based plagiarism-prevention service

Turnitin is an Internet-based similarity detection service run by the American company Turnitin, LLC, a subsidiary of Advance Publications.

Plagiarism detection or content similarity detection is the process of locating instances of plagiarism or copyright infringement within a work or document. The widespread use of computers and the advent of the Internet have made it easier to plagiarize the work of others.

<span class="mw-page-title-main">Plagiarism</span> Using another authors work as if it was ones own original work

Plagiarism is the representation of another person's language, thoughts, ideas, or expressions as one's own original work. Although precise definitions vary depending on the institution, in many countries and cultures plagiarism is considered a violation of academic integrity and journalistic ethics, as well as social norms around learning, teaching, research, fairness, respect, and responsibility. As such, a person or entity that is determined to have committed plagiarism is often subject to various punishments or sanctions, such as suspension, expulsion from school or work, fines, imprisonment, and other penalties.

<span class="mw-page-title-main">Quizlet</span> American online studying platform

Quizlet is a multi-national American company that provides tools for studying and learning. Quizlet was founded in October 2005 by Andrew Sutherland, who at the time was a 15-year old student, and released to the public in January 2007. Quizlet's primary products include digital flash cards, matching games, practice electronic assessments, and live quizzes. In 2017, 1 in 2 high school students used Quizlet. As of December 2021, Quizlet has over 500 million user-generated flashcard sets and more than 60 million active users.

<span class="mw-page-title-main">OpenAI</span> Artificial intelligence research organization

OpenAI is a U.S.-based artificial intelligence (AI) research organization founded in December 2015, researching artificial intelligence with the goal of developing "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work". As one of the leading organizations of the AI boom, it has developed several large language models, advanced image generation models, and previously, released open-source models. Its release of ChatGPT has been credited with starting the AI boom.

Artificial intelligence is used in Wikipedia and other Wikimedia projects for the purpose of developing those projects. Human and bot interaction in Wikimedia projects is routine and iterative.

<i>AI Dungeon</i> Text adventure game generated by artificial intelligence

AI Dungeon is a single-player/multiplayer text adventure game which uses artificial intelligence (AI) to generate content and allows players to create and share adventures and custom prompts. The game's first version was made available in May 2019, and its second version was released on Google Colaboratory in December 2019. It was later ported that same month to its current cross-platform web application. The AI model was then reformed in July 2020.

Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based architectures with a technique known as "attention". This attention mechanism allows the model to selectively focus on segments of input text it predicts to be most relevant. It uses a 2048-tokens-long context, float16 (16-bit) precision, and a hitherto-unprecedented 175 billion parameters, requiring 350GB of storage space as each parameter takes 2 bytes of space, and has demonstrated strong "zero-shot" and "few-shot" learning abilities on many tasks.

<span class="mw-page-title-main">DALL-E</span> Image-generating deep-learning model

DALL·E, DALL·E 2, and DALL·E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions, called "prompts."

You.com is an AI Assistant that began as a personalization-focused search engine. While still offering web search capabilities, You.com has evolved to prioritize a chat-first AI Assistant.

<span class="mw-page-title-main">ChatGPT</span> Chatbot developed by OpenAI

ChatGPT is a chatbot developed by OpenAI and launched on November 30, 2022. Based on large language models, it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive user prompts and replies are considered at each conversation stage as context.

<span class="mw-page-title-main">Hallucination (artificial intelligence)</span> Confident unjustified claim by AI

In the field of artificial intelligence (AI), a hallucination or artificial hallucination is a response generated by AI which contains false or misleading information presented as fact. This term draws a loose analogy with human psychology, where hallucination typically involves false percepts. However, there’s a key difference: AI hallucination is associated with unjustified responses or beliefs rather than perceptual experiences.

<span class="mw-page-title-main">Generative artificial intelligence</span> AI system capable of generating content in response to prompts

Generative artificial intelligence is artificial intelligence capable of generating text, images, videos, or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.

<span class="mw-page-title-main">Microsoft Copilot</span> Chatbot developed by Microsoft

Microsoft Copilot is a chatbot developed by Microsoft and launched on February 7, 2023. Based on a large language model, it is able to cite sources, create poems, and write songs. It is Microsoft's primary replacement for the discontinued Cortana.

Writesonic is a company that develops artificial intelligence tools for content creation. It was founded by Samanyou Garg in October 2020 and is based in San Francisco. The platform uses GPT-3.5 and GPT-4 technologies.

Artificial intelligence detection software aims to determine whether some content was generated using artificial intelligence (AI).

<span class="mw-page-title-main">ChatGPT in education</span> Use of the chatbot in education

Since OpenAI's public release of ChatGPT in November 2022, the chatbot and its peers have been at the source of intense discussion within education, with many schools and universities taking hostile stances towards usage of large language models, while others have embraced the use of the tools in assignments. The usage of ChatGPT has inspired many to foresee a potential paradigm shift in education, with oral exams being proposed to assure that it cannot be used in tests.

QuillBot is a software developed in 2017 that uses artificial intelligence to rewrite and paraphrase text.

<span class="mw-page-title-main">Undetectable.ai</span> Online text analysis and obfuscation software

Undetectable AI (Undetectable.ai) is an AI content detection software that rewrites AI-generated text to make it appear more human.

References

  1. "Meet the Etobicoke-born inventor of the ChatGPT detector". Toronto Life.
  2. "GPTZERO: A New Tool to Detect AI-Generated Content in ChatGPT". The Washington Post.
  3. "How apps like GPTZero detect content written by A.I." CNBC . July 24, 2023. Retrieved September 21, 2023.
  4. "GPTZero App Seeks to Thwart AI Plagiarism in Schools, Online Media". Bloomberg.com. May 8, 2023. Retrieved October 21, 2023.
  5. "American Federation of Teachers partners with AI identification platform, GPTZero". CBS News . October 17, 2023. Retrieved October 21, 2023.
  6. "AI Detector with Sentiment Analysis, GPTZero". GPTZero . April 16, 2024. Retrieved April 16, 2024.
  7. 1 2 Fowler, Geoffrey (August 14, 2023). "What to do when you're accused of AI cheating". The Washington Post . Retrieved October 20, 2023.
  8. "How a 23-year-old college student built one of the leading AI detection tools". Business Insider . Retrieved September 21, 2023.
  9. 1 2 3 "Edward Tian's GPTZERO Software Aims to Detect AI-Generated Plagiarism". The Daily Princetonian .
  10. "This AI detection tool raised $3.5 million to check the internet for computer-generated work". Fast Company . 2023.
  11. Shrivastava, Rashi. "With Seed Funding Secured, AI Detection Tool GPTZero Launches New Browser Plugin". Forbes . Retrieved May 17, 2023.
  12. "GPTZERO: A New AI Detector Aims to Combat ChatGPT Plagiarism". NPR . January 9, 2023.
  13. 1 2 "What is GPTZERO? The ChatGPT Detection Tool Explained". Tech Learning.
  14. 1 2 "AI Detector for Educators: What is GPTZERO?". Jumpstart Magazine.
  15. Edwards, Benj (July 14, 2023). "Why AI detectors think the US Constitution was written by AI". Ars Technica . Retrieved December 14, 2023.
  16. Tribune.com.pk. "GPTZero: A ChatGPT Detection Tool". The Express Tribune.
  17. "GPTZero to help teachers deal with ChatGPT-generated student essays". The Indian Express . January 12, 2023. Retrieved September 21, 2023.
  18. "US Teachers Union Bans ChatGPT and Deploys GPTZero To Detect Cheating". MobileAppAaily. Retrieved October 21, 2023.
  19. Vinu Sankar Sadasivan; Kumar, Aounon; Balasubramanian, Sriram; Wang, Wenxiao; Feizi, Soheil (March 17, 2023). "Can AI-Generated Text be Reliably Detected?". arXiv: 2303.11156 [cs.CL].
  20. Knibbs, Kate. "Researchers Tested AI Watermarks—and Broke All of Them". Wired. ISSN   1059-1028 . Retrieved October 21, 2023.
  21. "No reliable way to detect AI-generated text, boffins sigh". The Register . March 21, 2023. Retrieved September 25, 2023.
  22. "There's a Problem With That App That Detects GPT-Written Text: It's Not Very Accurate". Futurism. Retrieved October 21, 2023.