| Type of site | Plagiarism detection |
|---|---|
| Available in | Multilingual |
| Founded | July 10, 2004 |
| Area served | Worldwide |
| Industry | Digital content |
| URL | copyscape.com |
| Commercial | Yes |
| Registration | Optional |
Copyscape is an online plagiarism detection service that checks whether similar text content appears elsewhere on the web. [1] [2] [3] It was launched in 2004 by Indigo Stream Technologies, Ltd.
Copyscape is used by content owners to detect cases of "content theft", in which content is copied without permission from one site to another. [4] [5] It is also used by content publishers to detect cases of content fraud, in which old content is repackaged and sold as new original content. [6]
Copyscape was launched in 2004 by Indigo Stream Technologies, Ltd., co-founded in 2003 by Gideon Greenspan. [7] According to an interview with Greenspan, the company originally developed an alerting service called Google Alert, and Copyscape grew out of that service as an expansion. [8]
Given the URL or text of the original content, Copyscape returns a list of web pages that contain text similar to all or parts of that content, [9] with the matching text highlighted on each page found. Copyscape banners can be placed on a web page to warn potential plagiarists against stealing its content. Two companion services extend this: Copysentry monitors the web and sends email notifications when new copies are found, while Copyscape Premium verifies the originality of content purchased by online content publishers.
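Services of this kind are typically queried over HTTP: the client submits a URL or a block of text and receives back a list of pages containing matching passages. The snippet below is only a minimal sketch of that interaction; the endpoint, parameter names, and response shape are hypothetical assumptions for illustration, not Copyscape's documented interface.

```python
# Hypothetical sketch of querying a plagiarism-detection web service.
# The endpoint, parameters, and JSON layout below are assumptions, not
# Copyscape's documented API.
import requests


def find_copies(original_url: str, api_key: str) -> list[dict]:
    """Return pages reported to contain text similar to the page at original_url."""
    response = requests.get(
        "https://example.com/plagiarism-api",  # assumed endpoint
        params={"key": api_key, "url": original_url, "format": "json"},
        timeout=30,
    )
    response.raise_for_status()
    # Assume the service returns a JSON object with a "matches" list, each
    # entry giving the matching page's URL and an overlapping text snippet.
    return response.json().get("matches", [])


if __name__ == "__main__":
    for match in find_copies("https://example.org/my-article", "MY_API_KEY"):
        print(match["url"], "-", match.get("snippet", "")[:80])
```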
Copyscape uses the Google Web API to power its searches. [10] It also applies a set of algorithms to identify copied content that has been modified from its original form.
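Those algorithms are proprietary, but detecting lightly modified copies generally relies on comparing overlapping word sequences rather than exact strings. The sketch below illustrates one standard technique in that class, word-level shingling scored with Jaccard similarity; it shows the general approach only and is not Copyscape's actual method.

```python
# Illustrative only: Copyscape's matching algorithms are proprietary.
# This shows a common technique for spotting lightly modified copies:
# word-level shingles compared with Jaccard similarity.
import re


def shingles(text: str, k: int = 5) -> set[tuple[str, ...]]:
    """Split text into overlapping k-word shingles (case- and punctuation-insensitive)."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if not words:
        return set()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def similarity(a: str, b: str, k: int = 5) -> float:
    """Jaccard similarity of the two texts' shingle sets (0.0 = disjoint, 1.0 = identical)."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


original = "Copyscape is an online plagiarism detection service that checks for copies on the web."
reworded = "Copyscape is an online plagiarism detection service which looks for copies across the web."
# Prints a score well above 0.0 despite the rewording; identical texts score 1.0.
print(f"{similarity(original, reworded, k=3):.2f}")
```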
Copyscape's use has been reported in cases involving online plagiarism.