Anna's Archive

Last updated

Anna's Archive
AnnasArchive Logo.svg
Official logo
Annasarchivehome 1.15.25.png
Anna's Archive homepage (January 15, 2025)
Type of site
Search engine, digital library, file sharing
Founder(s) Anna Archivist, Pirate Library Mirror
URL
CommercialNo
RegistrationOptional
LaunchedNovember 10, 2022;2 years ago (2022-11-10)

Anna's Archive is an open source search engine for shadow libraries that calls itself "the largest truly open library in human history", [1] and has said it aims to "catalog all the books in existence" and "track humanity's progress toward making all these books easily available in digital form". [2] The site was launched by the pseudonymous Anna shortly after law enforcement efforts to close down Z-Library in 2022. It aggregates data from several major shadow libraries, including Z-Library, Sci-Hub, and Library Genesis, as well as other sources. It claims it does not directly host copyrighted materials and that it only indexes metadata that is already publicly available. However, it has faced legal action from publishers and anti-piracy groups for violating copyright law.

Contents

Website

The code and data for Anna's Archive are fully open source, and it asks for volunteer contributions. It preserves its data in bulk using torrent files in order to remain resilient to website takedowns. [3] [a] The site itself claims not to host copyrighted materials, but it links to places where they can be downloaded. [2] [5]

The site provides file downloads via the servers of anonymous partners, [3] as well as through the IPFS protocol. [6] [7] [b] It has a two-tiered system of download options in which high-speed downloads are only available to users with a paid membership, while nonmembers must use slower options with browser verification to prevent abuse by bots. It describes itself as a nonprofit, claiming that donations and membership fees are mostly spent on infrastructure and that none are personally used by team members. [3] Memberships are awarded to some volunteers. [9]

As of 15 January 2025, Anna's Archive includes 40,369,782 books and 98,401,746 papers, [1] and its torrents total roughly one petabyte in size. [10] It lists Library Genesis, Sci-Hub, Z-Library, the Internet Archive, DuXiu, MagzDB, and Nexus/STC among its "source libraries", and Open Library and WorldCat as metadata-only sources. [11]

History

Origins

Anna's Archive was created by members of the Pirate Library Mirror (PiLiMi) project, an anonymous effort to mirror shadow libraries that completed a full copy of Z-Library in September 2022. [12] [13] [14] PiLiMi explicitly acknowledged that it "deliberately violated the copyright law in most countries" in mirroring these libraries to ensure global reach. [12] Days after US law enforcement attempted to close down Z-Library in November of that year, PiLiMi member Anna launched Anna's Archive, which initially displayed results from Z-Library and Library Genesis. [2] [12] [13]

On 3 October 2023, Anna's Archive was reported to have scraped the entirety of WorldCat, the world's largest bibliographic database, and made its proprietary data freely available, which it described as "a major milestone in mapping out all the books in the world". [15] OCLC, one of WorldCat's maintainers, responded by suing the organization on 12 January 2024, claiming the scrape was achieved through cyberattacks on its servers that incurred over $5 million in damages and seeking an injunction to curtail the site's operations. [5] [16] [17] The only named defendant in the suit denied any involvement with Anna's Archive or the WorldCat scrape. [18] Technology writer Glyn Moody criticized the action as "costly and pointless", saying it went against OCLC's stated mission of making information accessible. [19]

In July 2024, in the wake of the OCLC lawsuit, the site's .org mirror was temporarily replaced with a new .gs mirror to avoid falling under US jurisdiction; [16] however, shortly afterwards, the .gs domain was suspended and the mirror reverted to the old .org domain. [20]

In January 2024, the site was blocked by Italy's national communications agency due to a copyright complaint by the Italian Publishers Association. [21] An investigation by the country's Digital Services Directorate confirmed the presence of copyrighted material and found that some of the site's servers were likely owned by Ukrainian hosting provider Epinatura LLC, but failed to uncover the identity of its operator. [22]

In March 2024, the site was blocked by some internet service providers in the Netherlands due to a request by BREIN, an anti-piracy group. [23] [24] [25] [26]

In March 2024, a group of authors filed a lawsuit against Nvidia for allegedly training its generative AI platform NeMo on the Books3 dataset, which includes copyrighted data from several shadow libraries, among them Anna's Archive. [27] [28] [29] In the company's response, it disputed the characterization of those sites as shadow libraries, despite Anna's Archive's own use of the term. [29] [30]

In January 2025, the messaging app Telegram suspended Anna's Archive and shut down its channel for copyright infringement, despite the team reportedly taking precautions to avoid infringing posts on the app. Z-Library's Telegram account was suspended the same week, and neither was alerted of the action. [31]

Anna's Archive has consistently been one of the most targeted sites of Dutch anti-piracy service Link-Busters, which sends DMCA takedown notices to search engines like Google on behalf of major publishers. [32] [33] [34] It was among Google Search's top ten most reported domains as of June 2024. [35]

The site's domains appeared in both the 2023 and 2024 Notorious Markets List of the Office of the United States Trade Representative, which identifies online and physical markets that allegedly engage in or facilitate large-scale copyright and trademark infringement. These reports describe the site as related to Sci-Hub and Library Genesis. [36] [37] [38] In response to a request for comment by the Office on its 2023 List, the Association of American Publishers identified Anna's Archive as an infringing site, and analyzed its cryptocurrency wallets to find a total of $29,596.21 in received funds as of July 2023. [7] [39]

Notes

  1. According to a post on Anna's personal blog, they have standardized their data under the custom Anna’s Archive Containers format to allow for incremental releases. [4]
  2. According to Anna's blog, they no longer host the protocol themselves because they believe it is not yet suitable for their purposes. [8]

Related Research Articles

Topsite is a term used by the warez scene to refer to underground, highly secretive, high-speed FTP servers used by release groups and couriers for distribution, storage and archiving of warez releases. Topsites have very high-bandwidth Internet connections, commonly supporting transfer speeds of hundreds to thousands of megabits per second (Mbps); enough to transfer a full Blu-ray in seconds. Topsites also have very high storage capacity; a total of many terabytes was typical in 2006. It was common for home computers in these years to have access to broadband internet link with 1–1.5 Mbps and 80–120 GB of storage. Generally the characteristics of the link and (especially) storage can be at least two or three orders of magnitude above home appliances. Early on these warez sites were mainly distributing software such as games and applications after the release groups removed any protections. Now they are also a source of other copyright protected works such as movies and music. It is strictly prohibited for sites to charge for access to the content, due to decreased security, and sites found doing so are shunned by the topsite community.

MediaDefender, Inc. was a company that fought copyright infringement that offered services designed to prevent alleged copyright infringement using peer-to-peer distribution. They used unusual tactics such as flooding peer-to-peer networks with decoy files that tie up users' computers and bandwidth. MediaDefender was based in Los Angeles, California in the United States. As of March 2007, the company had approximately 60 employees and used 2,000 servers hosted in California with contracts for 9 Gbit/s of bandwidth.

<span class="mw-page-title-main">BREIN</span> Dutch entertainment industry interest group

The Bescherming Rechten Entertainment Industrie Nederland is an advocacy group with international links, based in the Netherlands, which represents the interests of the Dutch entertainment industry and is organised under the Dutch law through the legal form of stichting. It is notable for launching court proceedings against copyright infringement in the country and for engaging in lobbying in order to create legal precedents of global significance.

<span class="mw-page-title-main">The Pirate Bay</span> Website providing torrent files and magnet links

The Pirate Bay, commonly abbreviated as TPB, is a freely searchable online index of movies, music, video games, pornography and software. Founded in 2003 by Swedish think tank Piratbyrån, The Pirate Bay facilitates the connection among users of the peer-to-peer torrent protocol, which are able to contribute to the site through the addition of magnet links. The Pirate Bay has consistently ranked as one of the most visited torrent websites in the world.

<span class="mw-page-title-main">Legal issues with BitTorrent</span>

The use of the BitTorrent protocol for the unauthorized sharing of copyrighted content generated a variety of novel legal issues. While the technology and related platforms are legal in many jurisdictions, law enforcement and prosecutorial agencies are attempting to address this avenue of copyright infringement. Notably, the use of BitTorrent in connection with copyrighted material may make the issuers of the BitTorrent file, link or metadata liable as an infringing party under some copyright laws. Similarly, the use of BitTorrent to procure illegal materials could potentially create liability for end users as an accomplice.

<span class="mw-page-title-main">TorrentFreak</span> Blog on file sharing, copyright infringement, and digital rights

TorrentFreak (TF) is a blog dedicated to reporting the latest news and trends on the BitTorrent protocol and file sharing, as well as on copyright infringement and digital rights.

<span class="mw-page-title-main">You Wouldn't Steal a Car</span> Anti–copyright infringement campaign

"You Wouldn't Steal a Car" is the first sentence and commonly used name of a public service announcement that debuted on July 12, 2004 in cinemas, and July 27 on home media, which was part of the anti-copyright infringement campaign "Piracy. It's a crime." It was a co-production between the Federation Against Copyright Theft and the Motion Picture Association of America in cooperation with the Intellectual Property Office of Singapore, and appeared in theaters internationally from 2004 until 2008, and on many commercial DVDs during the same period as an ad preceding the main menu, as either an unskippable or skippable video.

<span class="mw-page-title-main">Library.nu</span> Popular linking website

Library.nu, previously called ebooksclub.org from 2004 to 2007 and gigapedia.com from 2007 to 2010, was a popular linking website. It was accused of copyright infringement and shut down by court order on February 15, 2012. According to the takedown notice, it hosted some 400,000 ebooks.

<span class="mw-page-title-main">KickassTorrents</span> Defunct file-sharing website

KickassTorrents was a website that provided a directory for torrent files and magnet links to facilitate peer-to-peer file sharing using the BitTorrent protocol. It was founded in 2008 and by November 2014, KAT became the most visited BitTorrent directory in the world, overtaking The Pirate Bay, according to the site's Alexa ranking. KAT went offline on 20 July 2016 when the domain was seized by the U.S. government. The site's proxy servers were shut down by its staff at the same time.

<span class="mw-page-title-main">Nyaa Torrents</span> File sharing website focused on East Asian media

Nyaa Torrents is a BitTorrent website focused on East Asian media. It is one of the largest public anime-dedicated torrent indexes.

A notorious market is a website or physical market where, according to the Office of the United States Trade Representative (USTR), large-scale intellectual property infringement takes place. Officially termed Notorious Markets for Counterfeiting and Piracy, the USTR has generated a yearly list of such notorious markets since 2006 with input from various industry groups.

<span class="mw-page-title-main">RARBG</span> BitTorrent metasearch engine

RARBG was a website that provided torrent files and magnet links to facilitate peer-to-peer file sharing using the BitTorrent protocol. From 2014 to 2023, RARBG repeatedly appeared in TorrentFreak's yearly list of most visited torrent websites. It was ranked 4th as of January 2023. The website did not allow users to upload their own torrents.

<span class="mw-page-title-main">Library Genesis</span> File-sharing website for publications

Library Genesis (LibGen) is a shadow library project for file-sharing access to scholarly journal articles, academic and general-interest books, images, comics, audiobooks, and magazines. The site enables free access to content that is otherwise paywalled or not digitized elsewhere. LibGen describes itself as a "links aggregator", providing a searchable database of items "collected from publicly available public Internet resources" as well as files uploaded "from users".

<span class="mw-page-title-main">1337x</span> File sharing website

1337x is an online website that provides a directory of torrent files and magnet links used for peer-to-peer file sharing through the BitTorrent protocol. According to the TorrentFreak news blog, 1337x is the second-most popular torrent website as of 2024. The U.S. Trade Representative flagged it as one of the most notorious pirate sites earlier in 2024. The site and its variants have been blocked in a variety of nations including Australia, and Portugal.

<span class="mw-page-title-main">YIFY</span> Peer-to-peer movies release group

YIFY Torrents or YTS was a peer-to-peer release group known for distributing large numbers of movies as free downloads through BitTorrent. YIFY releases were characterised through their small file size, which attracted many downloaders.

FMovies was a series of file streaming websites that host links and embedded videos, allowing users to stream or download movies for free. The sites have been subject to legal action in various jurisdictions on grounds of copyright infringement and piracy. In August 2024, the Alliance for Creativity and Entertainment announced that the site was shut down by Vietnamese authorities. The sites were receiving billions of views a year at its peak.

<span class="mw-page-title-main">KissAnime</span> Former anime-focused piracy file streaming site

KissAnime was an anime-focused file streaming website that hosted links and embedded videos, allowing users to stream or download movies and TV shows illegally for free. It was a sister site to a related manga viewing website, KissManga. KissAnime was described as "one of the world’s biggest streaming anime websites". TorrentFreak reported that the sites had audiences of millions and that, for a time, KissAnime was "the most visited pirate site in the world".

Shadow libraries are online databases of readily available content that is normally obscured or otherwise not readily accessible. Such content may be inaccessible for a number of reasons, including the use of paywalls, copyright controls, or other barriers to accessibility placed upon the content by its original owners. Shadow libraries usually consist of textual information as in electronic books, but may also include other digital media, including software, music, or films.

<span class="mw-page-title-main">Z-Library</span> File-sharing site for journal articles, books, and magazines

Z-Library is a shadow library project for file-sharing access to scholarly journal articles, academic texts and general-interest books. It began as a mirror of Library Genesis, but has expanded dramatically.

<span class="mw-page-title-main">Openload</span> File-sharing website

Openload was a file-sharing website that shut down in 2019 after legal action by the Alliance for Creativity and Entertainment. The site was highly-used before its shutdown, making most of its money from advertising and cryptojacking. The site was designated as a notorious market and often used for copyright infringement.

References

  1. 1 2 "Home". Anna's Archive. Retrieved 2025-01-15.
  2. 1 2 3 Manos, Leda (November 22, 2022). "Free Z-Library E-Book Download Search Engine "Anna's Archive" Launches Amid Arrests". LA Weekly . Retrieved 2024-12-29.
  3. 1 2 3 "Frequently Asked Questions (FAQ)". Anna's Archive. Retrieved 2024-08-19.
  4. "Anna's Archive Containers (AAC): standardizing releases from the world's largest shadow library". Anna's Blog. August 15, 2023. Retrieved 2025-01-17.
  5. 1 2 Van der Sar, Ernesto (February 7, 2024). "Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data". TorrentFreak. Retrieved 2024-12-30.
  6. Son, Jihun; Kim, Gyubin; Jung, Hyunwoo; Bang, Jewan; Park, Jungheum (October 1, 2023). "IF-DSS: A forensic investigation framework for decentralized storage services". Forensic Science International: Digital Investigation. 46: 301611. doi: 10.1016/j.fsidi.2023.301611 . ISSN   2666-2817.
  7. 1 2 Van der Sar, Ernesto (October 13, 2023). "Pirate Sites Exploit 'Interplanetary File System' Gateways, Publishers Warn". TorrentFreak. Retrieved 2025-01-17.
  8. "Putting 5,998,794 books on IPFS". Anna's Blog. November 19, 2022. Retrieved 2025-01-15.
  9. "Volunteering & Bounties". Anna’s Archive. Retrieved 2025-01-18.
  10. "Torrents". Anna’s Archive. Retrieved 2025-01-15.
  11. "Datasets". Anna’s Archive. Retrieved 2025-01-15.
  12. 1 2 3 Van der Sar, Ernesto (April 16, 2024). ""Anna's Archive" Opens the Door to Z-Library and Other Pirate Libraries". TorrentFreak. Retrieved 2024-08-19.
  13. 1 2 Iyer, Kavita (November 20, 2022). "Anna's Archive: eBooks Search Engine Emerges After Z-Library Shuts Down". TechWorm. Retrieved 2024-12-29.
  14. Booth, Callum (July 4, 2022). "The Pirate Library Mirror wants to preserve all human knowledge… illegally". TNW . Retrieved 2024-10-19.
  15. Van der Sar, Ernesto (October 3, 2023). "Anna's Archive Scraped WorldCat to Help Preserve 'All' Books in the World". TorrentFreak. Retrieved 2024-08-19.
  16. 1 2 Van der Sar, Ernesto (July 8, 2024). "Anna's Archive Faces Millions in Damages and a Permanent Injunction". TorrentFreak. Retrieved 2024-12-30.
  17. "OCLC Inc. v. Anna's Archive, 2:24-cv-144". Casetext . Retrieved 2024-08-19.
  18. Van der Sar, Ernesto. "Key Defendant in Anna's Archive Lawsuit Denies Any Involvement With the Site". TorrentFreak. Retrieved 2024-08-19.
  19. Moody, Glyn (August 21, 2024). "OCLC says "what is known must be shared", but sues Anna's Archive to stop it sharing knowledge". Walled Culture. Retrieved 2025-01-19.
  20. Van der Sar, Ernesto (July 18, 2024). "Anna's Archive Loses .GS Domain Name But Remains Resilient". TorrentFreak. Retrieved 2024-12-29.
  21. Stefanello, Viola (January 12, 2024). "Che fine ha fatto il movimento per il libero accesso alle pubblicazioni accademiche" [What happened to the movement for open access to academic publications?]. Il Post (in Italian). Retrieved 2025-01-19.
  22. Maxwell, Andy (January 4, 2024). "Silenzio! 'Anna's Archive' Shadow Library Blocked Following Publishers' Complaint". TorrentFreak. Retrieved 2024-12-29.
  23. Van der Sar, Ernesto (March 23, 2024). "Dutch Court Orders ISP to Block 'Anna's Archive' and 'LibGen'". TorrentFreak. Retrieved 2024-12-29.
  24. "Succesvolle toepassing Convenant Blokkeren Websites voor Library Genesis en Anna's Archive" [Successful application of Covenant Blocking Websites for Library Genesis and Anna's Archive]. Recht.nl (in Dutch). April 26, 2024. Retrieved 2025-01-18.
  25. "BREIN wil blokkering shadow libraries" [BREIN wants to block shadow libraries]. ICT Magazine (in Dutch). April 4, 2024. Retrieved 2025-01-18.
  26. "Blokkering shadow libraries bevolen" [Blocking shadow libraries ordered]. BREIN (in Dutch). March 21, 2024. Retrieved 2025-01-17.
  27. Stempel, Jonathan (March 11, 2024). "Nvidia is sued by authors over AI use of copyrighted works". Reuters . Retrieved 2025-01-19.
  28. Belanger, Ashley (March 11, 2024). "Nvidia sued over AI training data as copyright clashes continue". Ars Technica. Retrieved 2025-01-18.
  29. 1 2 Belanger, Ashley (May 28, 2024). "Nvidia denies pirate e-book sites are "shadow libraries" to shut down lawsuit". Ars Technica. Retrieved 2025-01-18.
  30. Van der Sar, Ernesto (May 27, 2024). "NVIDIA Denies Copyright Infringement Claims in Authors' AI Lawsuit". TorrentFreak. Retrieved 2025-01-18.
  31. Van der Sar, Ernesto (January 15, 2025). "Telegram Shuts Down Z-Library & Anna's Archive Channels Over Copyright Infringement". TorrentFreak. Retrieved 2025-01-16.
  32. Van der Sar, Ernesto (May 31, 2024). "Link-Busters Flagged Over 56 Million 'Pirate' URLs to Google in a Week". TorrentFreak. Retrieved 2025-01-18.
  33. Van der Sar, Ernesto (July 29, 2024). "Link-Busters Sent a Billion DMCA Takedown Requests to Google Search". TorrentFreak. Retrieved 2025-01-18.
  34. Van der Sar, Ernesto (January 17, 2025). "More Than Half of All Google Search Takedowns Now Come from Link-Busters". TorrentFreak. Retrieved 2025-01-18.
  35. Van der Sar, Ernesto (June 22, 2024). "Google Search Processed a Billion DMCA Takedowns in Four Months". TorrentFreak. Retrieved 2025-01-18.
  36. Maxwell, Andy (January 31, 2024). "World's Most Notorious Pirate Sites Listed in New USTR Report". TorrentFreak. Retrieved 2025-01-17.
  37. "USTR Releases 2023 Review of Notorious Markets for Counterfeiting and Piracy". United States Trade Representative. January 30, 2024. Retrieved 2025-01-15.
  38. "USTR Releases 2024 Review of Notorious Markets for Counterfeiting and Piracy". United States Trade Representative. January 8, 2025. Retrieved 2025-01-15.
  39. "Comment from Association of American Publishers". Regulations.gov. October 9, 2023. Retrieved 2025-01-17.