Shadow libraries are online databases that make readily available content that is normally obscured or otherwise hard to access. Such content may be inaccessible for a number of reasons, including paywalls, copyright controls, or other barriers placed upon the content by its original owners. [1] [2] Shadow libraries usually consist of textual information, such as electronic books, but may also include other digital media, including software, music, or films.
Examples of shadow libraries include Anna's Archive, Library Genesis, Sci-Hub and Z-Library, which are popular book and academic shadow libraries [1] [3] and may be the largest public libraries for books and literature.
One of the goals of shadow libraries is to more readily disseminate academic content, especially papers from academic journals. [2] Academic literature has become increasingly expensive: the cost of accessing information created by scholars, especially books, has risen dramatically in recent years. [4] The term serials crisis has emerged to describe this ongoing trend.
There has also been a concerted international movement, known as the Open Access movement, to make academic knowledge free or very inexpensive. [5] The Open Access movement strives to establish both journals that are free to access (known as open access journals) and free-to-access repositories of academic journal papers published elsewhere. However, many open access journals charge academics a fee to publish, which disincentivizes academics from publishing in them. [6]
Another reason for the establishment of shadow libraries is the tacit endorsement by many academics of such efforts. [7] Academics are rarely compensated by publishers for their work, regardless of whether it is published in an open access journal or a conventionally priced journal. Thus, there is little incentive for academics to disavow shadow libraries. Furthermore, shadow libraries greatly increase the impact of academics whose work is made available. According to one study from Cornell University, articles that are on Sci-Hub receive 1.72 times as many citations as articles from journals of similar quality that are not available on Sci-Hub. [8]
Some shadow libraries host content without the consent of the original owners of the material. This may make some shadow libraries illegal; however, as researchers are not required to disclose the means by which they access academic material, it is difficult to monitor the use of illegally accessed academic papers. Not all authors support efforts to restrict access to shadow libraries. [9]
The legality of directing individuals to shadow libraries is broadly undetermined. There is currently no consensus among legal authorities in the United States and Europe as to what extent advertising shadow libraries constitutes a criminal offense. There are currently no settled cases determining whether it is permissible for academics to directly provide links to shadow libraries, though academic publishers have threatened legal action over such references in isolated incidents. Legal action against researchers remains uncommon. [10]
Although most academics are not penalized for distributing their published works independently and freely (therefore obviating the need for shadow libraries in the first place), there are reports of academic publishers threatening such academics with legal action. [11]
Shadow libraries (or their content databases) make use of BitTorrent (mainly for database dumps), dark web and IPFS technologies to increase their resilience or distribute loads. [12] [13] [14] [2] [15] In the case of Anna's Archive, the software is developed and made accessible as open source software, enabling code development by any volunteer and mirrors or forks, with the site claiming that "if we get taken down we'll just pop right up elsewhere, since all our code and data is fully open source". [16] [17]
JSTOR is a digital library of academic journals, books, and primary sources founded in 1994. Originally containing digitized back issues of academic journals, it now encompasses books and other primary sources as well as current issues of journals in the humanities and social sciences. It provides full-text searches of almost 2,000 journals. Most access is by subscription but some of the site is public domain, and open access content is available free of charge.
BitTorrent, also referred to simply as torrent, is a communication protocol for peer-to-peer file sharing (P2P), which enables users to distribute data and electronic files over the Internet in a decentralized manner. The protocol is developed and maintained by Rainberry, Inc., and was first released in 2001.
An archivist is an information professional who assesses, collects, organizes, preserves, maintains control over, and provides access to records and archives determined to have long-term value. The records maintained by an archivist can consist of a variety of forms, including letters, diaries, logs, other personal documents, government documents, sound or picture recordings, digital files, or other physical objects.
The Bescherming Rechten Entertainment Industrie Nederland is an advocacy group with international links, based in the Netherlands, which represents the interests of the Dutch entertainment industry and is organised under the Dutch law through the legal form of stichting. It is notable for launching court proceedings against copyright infringement in the country and for engaging in lobbying in order to create legal precedents of global significance.
Per Gottfrid Svartholm Warg, alias anakata, is a Swedish computer specialist, known as the former co-owner of the web hosting company PRQ and co-founder of the BitTorrent site The Pirate Bay together with Fredrik Neij and Peter Sunde.
qBittorrent is a cross-platform free and open-source BitTorrent client written in native C++. It relies on Boost, OpenSSL, zlib, Qt 6 toolkit and the libtorrent-rasterbar library, with an optional search engine written in Python.
Tribler is an open source decentralized BitTorrent client which allows anonymous peer-to-peer file sharing by default. Tribler is based on the BitTorrent protocol and uses an overlay network for content searching. Due to this overlay network, Tribler does not require an external website or indexing service to discover content. The user interface of Tribler is very basic and focused on ease of use rather than breadth of features. Tribler is available for Linux, Windows, and OS X.
File sharing is the practice of distributing or providing access to digital media, such as computer programs, multimedia, documents or electronic books. Common methods of storage, transmission and dispersion include removable media, centralized servers on computer networks, Internet-based hyperlinked documents, and the use of distributed peer-to-peer networking.
Popcorn Time is a multi-platform, free software BitTorrent client that includes an integrated media player. The application provides a piracy-based alternative to subscription-based video streaming services such as Netflix. Popcorn Time uses sequential downloading to stream video listed by several torrent websites, and third-party trackers can also be added manually. The legality of the software depends on the jurisdiction.
Mirror sites or mirrors are replicas of other websites. The concept of mirroring applies to network services accessible through any protocol, such as HTTP or FTP. Such sites have different URLs than the original site, but host identical or near-identical content. Mirror sites are often located in a different geographic region than the original, or upstream site. The purpose of mirrors is to reduce network traffic, improve access speed, ensure availability of the original site for technical or political reasons, or provide a real-time backup of the original site. Mirror sites are particularly important in developing countries, where internet access may be slower or less reliable.
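The availability benefit of mirrors described above can be illustrated with a minimal Python sketch of client-side failover: the client tries each mirror in turn and uses the first one that responds. The function names `http_fetch` and `fetch_from_mirrors` and the `.example` URLs are hypothetical, chosen only for illustration; they do not describe how any particular site or client handles mirrors.

```python
from typing import Callable, Iterable, Tuple
from urllib.request import urlopen


def http_fetch(url: str) -> bytes:
    """Download one URL; network failures surface as OSError subclasses."""
    with urlopen(url, timeout=10) as response:
        return response.read()


def fetch_from_mirrors(
    urls: Iterable[str],
    fetch: Callable[[str], bytes] = http_fetch,
) -> Tuple[str, bytes]:
    """Return (url, content) from the first mirror that answers."""
    errors = []
    for url in urls:
        try:
            return url, fetch(url)
        except OSError as exc:  # unreachable host, timeout, HTTP error, ...
            errors.append(f"{url}: {exc}")
    # Every mirror failed, so report all of the collected errors together.
    raise ConnectionError("all mirrors failed: " + "; ".join(errors))


if __name__ == "__main__":
    # Hypothetical mirror URLs; substitute real ones to try the sketch.
    mirrors = [
        "https://mirror-a.example/file.txt",
        "https://mirror-b.example/file.txt",
    ]
    try:
        source, body = fetch_from_mirrors(mirrors)
        print(f"fetched {len(body)} bytes from {source}")
    except ConnectionError as exc:
        print(exc)
```

Passing the download function as a parameter keeps the failover logic separate from the transport, so the same loop could in principle be reused for FTP or any other protocol a mirror is served over.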
The InterPlanetary File System (IPFS) is a protocol, hypermedia and file sharing peer-to-peer network for storing and sharing data in a distributed file system. By using content addressing, IPFS uniquely identifies each file in a global namespace that connects IPFS hosts, creating a resilient system of file storage and sharing.
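The idea of content addressing can be sketched in a few lines of Python. This is a simplified illustration, not IPFS itself: it assumes a plain SHA-256 hex digest as the address and an in-memory dictionary as storage, whereas real IPFS content identifiers use a richer encoding and split large files into linked blocks. The class name `ContentStore` is hypothetical.

```python
import hashlib


class ContentStore:
    """Toy content-addressed store: a block's key is derived from its bytes."""

    def __init__(self) -> None:
        self._blocks = {}

    def put(self, data: bytes) -> str:
        # Assumption: a SHA-256 hex digest stands in for a real IPFS identifier.
        address = hashlib.sha256(data).hexdigest()
        self._blocks[address] = data
        return address

    def get(self, address: str) -> bytes:
        data = self._blocks[address]
        # Anyone holding the address can check that the bytes were not altered.
        if hashlib.sha256(data).hexdigest() != address:
            raise ValueError("stored block does not match its address")
        return data


if __name__ == "__main__":
    store = ContentStore()
    first = store.put(b"the same bytes")
    second = store.put(b"the same bytes")
    print(first == second)  # identical content always maps to one address
    print(store.get(first))
```

Because the address depends only on the content, any host that holds the same bytes can serve a request for that address, and the requester can verify the result without trusting the host.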
Library Genesis (LibGen) is a shadow library project for file-sharing access to scholarly journal articles, academic and general-interest books, images, comics, audiobooks, and magazines. The site enables free access to content that is otherwise paywalled or not digitized elsewhere. LibGen describes itself as a "links aggregator", providing a searchable database of items "collected from publicly available public Internet resources" as well as files uploaded "from users".
Sci-Hub is a shadow library website that provides free access to millions of research papers, regardless of copyright, by bypassing publishers' paywalls in various ways. Unlike Library Genesis, it does not provide access to books. Sci-Hub was founded in Kazakhstan by Alexandra Elbakyan in 2011, in response to the high cost of research papers behind paywalls. The site is extensively used worldwide. In September 2019, the site's operator(s) said that it served approximately 400,000 requests per day. In addition to its intensive use, Sci-Hub stands out among other shadow libraries because of its ease of use and reliability and because of the enormous size of its collection; a 2018 study estimated that Sci-Hub provided access to 95% of all scholarly publications with issued DOI numbers. On 15 July 2022, Sci-Hub reported that its collection comprised 88,343,822 files. Since December 2020, the site has paused uploads due to legal troubles.
#ICanHazPDF is a hashtag used on Twitter to request access to academic journal articles which are behind paywalls. It was started in 2011 by scientist Andrea Kuszewski. The name is derived from the meme I Can Has Cheezburger?
Alexandra Asanovna Elbakyan is a Kazakhstani computer programmer and creator of the website Sci-Hub, which provides free access to research papers without regard for copyright. According to a study published in 2018, Sci-Hub provides access to nearly all scholarly literature.
The Guerilla Open Access Manifesto is a document published by Aaron Swartz in 2008 that argues for transgressive approaches to achieving the goals of the open access movement through civil disobedience, willful violation of copyright and contracts that restrict redistribution of knowledge, and activities that exist in legal grey areas.
Z-Library is a shadow library project for file-sharing access to scholarly journal articles, academic texts and general-interest books. It began as a mirror of Library Genesis, but has expanded dramatically.
Anna's Archive is a search engine for shadow libraries created by the pseudonymous Anna. It was founded in direct response to law enforcement efforts to close down Z-Library in 2022. It describes itself as aiming to "catalog all the books in existence" and to "track humanity's progress toward making all these books easily available in digital form".