Open Library

[Screenshot: Open Library homepage in September 2011]
Type of site: Digital library index
Available in: English
Revenue: Donation
URL: openlibrary.org
Commercial: No
Registration: Free
Launched: 2006
Current status: Active
Content license: data: public domain [1]; source code: AGPLv3 [2]

Open Library is an online project intended to create "one web page for every book ever published". Created by Aaron Swartz, [3] [4] Brewster Kahle, [5] Alexis Rossi, [6] Anand Chitipothu, [6] and Rebecca Malamud, [6] Open Library is a project of the Internet Archive, a nonprofit organization. It has been funded in part by grants from the California State Library and the Kahle/Austin Foundation. Open Library provides online digital copies in multiple formats, created from images of many public domain, out-of-print, and in-print books.

Book database and digital lending library

Open Library's book information is collected from the Library of Congress, other libraries, and Amazon.com, as well as from user contributions through a wiki-like interface. [4] If a book is available in digital form, a button labeled "Read" appears next to its catalog listing. Digital copies of the contents of each scanned book are distributed as encrypted e-books (created from images of scanned pages), audiobooks and streaming audio (created from the page images using OCR and text-to-speech software), unencrypted images of full pages from OpenLibrary.org and Archive.org, and APIs for automated downloading of page images. [7] Links to where books can be purchased or borrowed are also provided.
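
As an illustration of programmatic access to the catalog, the following minimal sketch builds a query for catalog records by ISBN. The endpoint and its `bibkeys`, `format`, and `jscmd` parameters are not described in this article; they are assumed here from Open Library's public developer documentation and should be checked against it before use. The function names are hypothetical. The sketch covers catalog metadata only, not the page-image APIs mentioned above.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: endpoint and parameters are taken from Open Library's public
# developer documentation, not from the article text.
BOOKS_API = "https://openlibrary.org/api/books"


def build_books_api_url(isbns):
    """Return a Books API URL that asks for JSON records for the given ISBNs."""
    bibkeys = ",".join(f"ISBN:{isbn}" for isbn in isbns)
    query = urlencode({"bibkeys": bibkeys, "format": "json", "jscmd": "data"})
    return f"{BOOKS_API}?{query}"


def fetch_records(isbns, timeout=10):
    """Fetch catalog records over the network and return the decoded JSON."""
    with urlopen(build_books_api_url(isbns), timeout=timeout) as response:
        return json.load(response)
```

Calling `build_books_api_url(["0451526538"])` only constructs the URL; `fetch_records` performs the actual request and therefore needs network access.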

There are different entities in the database:

Open Library claims to have over 20 million records in its database. [8] Copies of the contents of tens of thousands of modern books have been made available from 150 libraries and publishers for ebook digital lending. [9] Other books, including in-print and in-copyright books, have been scanned from copies in library collections, library discards, and donations, and are also available for lending in digital form. [10] In total, the Open Library offers copies of over 1.4 million books for what it calls "digital lending" and what critics have called distribution of digital copies. [11]

Technical

Open Library began in 2006 with Aaron Swartz as the original engineer and leader of the Open Library's technical team. [3] [4] The project was led by George Oates from April 2009 to December 2011. [12] Oates was responsible for a complete site redesign during her tenure. [13] Giovanni Damiola continued the project in 2015, [6] followed by Brenton Cheng [6] and Mek Karpeles [6] in 2016.

The site was redesigned and relaunched in May 2010. Its codebase is on GitHub. [14] The site uses Infobase, its own database framework based on PostgreSQL, and Infogami, its own Wiki engine written in Python. [15] The source code to the site is published under the GNU Affero General Public License. [16] [2]

Book sponsorship program

In the week of October 21, 2019, the Open Library website introduced a Book Sponsorship program, which according to Cory Doctorow, "lets you direct a cash donation to pay for the purchase and scanning of any books. In return, you are first in line to check that book out when it is available, and then anyone who holds an Open Library library card can check it out." [17] The feature was developed by Mek Karpeles, Tabish Shaikh, [6] and other members of the community. [18]

Books for the blind and dyslexic

In May 2010, the website was relaunched with ADA compliance, offering over 1 million modern and older books to print-disabled readers [19] in the DAISY Digital Talking Book format. [20] Under certain provisions of United States copyright law, libraries are sometimes able to reproduce copyrighted works in formats accessible to users with disabilities. [21] [22]

Copyright violation accusations

The Open Library has justified offering the full contents of books in digital formats under the first-sale doctrine and fair use. [23] [24] It owns a physical copy of each book that it has made available, and thus argues that lending out one digital scan of the book in a controlled manner falls within the first-sale doctrine. This practice, known as controlled digital lending, is in use by multiple public and academic libraries. [24]
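
The one-loan-per-owned-copy idea described above can be sketched as a small model. This is only an illustration under stated assumptions, not the Internet Archive's implementation: the class `ControlledTitle`, its methods, and the first-come waitlist policy are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ControlledTitle:
    """One title lent under a hypothetical owned-to-loaned rule."""

    copies_owned: int
    borrowers: set = field(default_factory=set)
    waitlist: deque = field(default_factory=deque)

    def request(self, patron):
        """Lend a copy if one is free; otherwise add the patron to the waitlist."""
        if patron in self.borrowers:
            return "already borrowed"
        if len(self.borrowers) < self.copies_owned:
            self.borrowers.add(patron)
            return "borrowed"
        if patron not in self.waitlist:
            self.waitlist.append(patron)
        return "waitlisted"

    def give_back(self, patron):
        """Return a copy and hand it to the next waiting patron, if any."""
        if patron not in self.borrowers:
            return None
        self.borrowers.remove(patron)
        if self.waitlist:
            next_patron = self.waitlist.popleft()
            self.borrowers.add(next_patron)
            return next_patron
        return None
```

In this model the number of simultaneous loans can never exceed `copies_owned`, which is the constraint the first-sale argument relies on.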

Since its launch, the Open Library has been accused of mass copyright violation by numerous groups, [24] including the American Authors Guild, [25] the British Society of Authors, [26] the Australian Society of Authors, [27] the Science Fiction and Fantasy Writers of America, [28] the US National Writers Union, [29] and a coalition of 37 national and international organizations of "writers, translators, photographers, and graphic artists; unions, organizations, and federations representing the creators of works included in published books; book publishers; and reproduction rights and public lending rights organizations". [30] The UK Society of Authors threatened legal action unless the Open Library agreed to cease distribution of copyrighted works by February 1, 2019. [31]

The Open Library came under further criticism from several authors' and publishers' groups when it created the National Emergency Library in March 2020 in response to the COVID-19 pandemic. The National Emergency Library removed the waitlists for all books in the Open Library collection and allowed any number of digital copies of a book to be downloaded as an encrypted file that would become unusable after two weeks. The Open Library asserted that this unlimited borrowing was a reasonable exception under the national emergency, allowing educational functions to continue while physical libraries and bookstores were forced to close. [24] The Authors Guild, the Association of American Publishers, the National Writers Union, and others argued that this allowed unlimited copyright infringement and denied revenue from the distribution of authorized digital copies of books to authors who also needed relief during the COVID-19 national emergency. [24] Although the Open Library asserted that the copies of entire books in ebook format were still encrypted and that the unlimited borrowing was for educational purposes, the National Writers Union asserted that images of each page of each book could still be accessed on the Web without encryption or other controls. [7] [32]

Four major publishers (Hachette, Penguin Random House, John Wiley & Sons, and HarperCollins, all members of the Association of American Publishers) filed a lawsuit against the Internet Archive in the United States District Court for the Southern District of New York in June 2020, asserting that the Open Library project violated numerous copyrights. [33] In their suit, the publishers claimed "Without any license or any payment to authors or publishers, [the Internet Archive] scans print books, uploads these illegally scanned books to its servers, and distributes verbatim digital copies of the books in whole via public-facing websites. With just a few clicks, any Internet-connected user can download complete digital copies of in-copyright books from [the] defendant." [34] The publishers are represented by the law firms Davis Wright Tremaine and Oppenheim + Zebrak. [35] The Internet Archive ended the National Emergency Library on June 16, 2020, instead of the intended June 30 date, and asked the publishers to "call off their costly assault". [36]

References

  1. Bookfinch; Chitipothu, Anand; Oates, George; West, Jessamyn (2013-10-10). "Using Open Library Data § Who owns the Open Library catalog?".
  2. "openlibrary/LICENSE at master · internetarchive/openlibrary · GitHub". GitHub.com. Archived from the original on 2017-01-22. Retrieved 2015-06-26.
  3. "A library bigger than any building". BBC News. 2007-07-31. Archived from the original on 2009-11-27. Retrieved 2010-07-06.
  4. Grossman, Wendy M (2009-01-22). "Why you can't find a library book in your search engine". The Guardian. London. Archived from the original on 2014-01-14. Retrieved 2010-07-06.
  5. "Aaron Swartz: howtoget". Aaronsw.jottit.com. Archived from the original on 2015-05-23. Retrieved 2015-06-05.
  6. "The Open Library Team". Open Library. Archived from the original on 2018-07-17. Retrieved 2018-07-16.
  7. Hasbrouck, Edward (16 April 2020). "What is the Internet Archive doing with our books?". National Writers Union. Retrieved 2020-05-07.
  8. "About Us". Openlibrary.org. Archived from the original on 2015-06-27. Retrieved 2015-06-26.
  9. "Internet Archive Forums: In-Library eBook Lending Program Launched". 2011-02-22. Archived from the original on 2015-07-17. Retrieved 2015-06-26.
  10. "FAQ on Controlled Digital Lending (CDL)". 13 February 2019. Retrieved 2019-02-14.
  11. Lee, Timothy B. (2020-03-28). "Internet Archive offers 1.4 million copyrighted books for free online". Ars Technica. Archived from the original on 2020-03-28. Retrieved 2020-04-20.
  12. "George". Openlibrary.org. Archived from the original on 2017-02-22. Retrieved 2015-06-26.
  13. Oates, George (2010-03-17). "Announcing the Open Library redesign « The Open Library Blog". Blog.openlibrary.org. Archived from the original on 2015-06-27. Retrieved 2015-06-26.
  14. "internetarchive/openlibrary · GitHub". GitHub.com. Archived from the original on 2015-08-10. Retrieved 2015-06-26.
  15. "About the Technology". Openlibrary.org. Archived from the original on 2015-06-27. Retrieved 2015-06-26.
  16. "Developers / Licensing". Openlibrary.org. Archived from the original on 2015-06-27. Retrieved 2015-06-26.
  17. Doctorow, Cory (2019-10-22). "The Internet Archive's Open Library will let you sponsor a book, paying for it to be scanned". BoingBoing. Archived from the original on 2019-10-23. Retrieved 2019-10-24.
  18. El-Sabrout, Omar Rafik. "Scan On Demand: Building the World's Open Library, Together". The Open Library Blog. Archived from the original on 2019-10-24. Retrieved 2019-10-24.
  19. "Project puts 1M books online for blind, dyslexic | UTSanDiego.com". Signonsandiego.com. 2010-05-05. Archived from the original on 2011-12-17. Retrieved 2015-06-26.
  20. "Welcome to Daisy Books for the Print Disabled". Internet Archive. Archived from the original on 2013-01-04. Retrieved 2012-12-10.
  21. "NLS Factsheets: Copyright Law Amendment, 1996: PL 104-197". Library of Congress NLS Factsheets. Library of Congress. Archived from the original on 2017-05-21.
  22. Scheid, Maria. "Copyright and Accessibility". Copyright Corner. The Ohio State University Libraries. Archived from the original on 2016-06-30.
  23. Hansen, David R.; Courtney, Kyle K. (2018). A White Paper on Controlled Digital Lending of Library Books (Report). Controlled Digital Lendings by Libraries. Archived from the original on 2019-08-02. Retrieved 2020-04-02.
  24. Grady, Constance (2020-04-02). "Why authors are so angry about the Internet Archive's Emergency Library". Vox. Archived from the original on 2020-04-04. Retrieved 2020-04-02.
  25. The Authors Guild. "Open Letter to Internet Archive and Other Proponents of 'Controlled Digital Lending'". JotForm. Archived from the original on 2019-07-28. Retrieved 2019-04-04.
  26. The Society of Authors. "Open letter to Internet Archive about 'Controlled Digital Lending'". JotForm. Archived from the original on 2019-07-28. Retrieved 2019-04-04.
  27. "Open Library: copyright infringement". Australian Society of Authors. 2019-01-21. Archived from the original on 2019-08-20. Retrieved 2019-02-10.
  28. "Infringement Alert". Science Fiction and Fantasy Writers of America. 2018-01-08. Archived from the original on 2019-02-12. Retrieved 2019-02-10.
  29. Hasbrouck, Edward (2019-02-13). "NWU denounces 'Controlled Digital Lending'". National Writers Union.
  30. "Controlled Digital Lending (CDL): An appeal to readers and librarians from the victims of CDL". National Writers Union. 13 February 2019. Retrieved 2019-02-14.
  31. Flood, Alison (2019-01-22). "Internet Archive's ebook loans face UK copyright challenge". The Guardian. London. Archived from the original on 2019-02-12. Retrieved 2019-02-10.
  32. Hasbrouck, Edward (24 March 2020). "Internet Archive removes controls on "lending" of bootleg e-books". National Writers Union. Retrieved 2020-05-07.
  33. Bustillos, Maria (2020-09-10). "Publishers Are Taking the Internet to Court". The Nation.
  34. Brandom, Russell (2020-06-01). "Publishers sue Internet Archive over Open Library ebook lending". The Verge. Retrieved 2020-06-01.
  35. "Publishers File Suit Against Internet Archive for Systematic Mass Scanning and Distribution of Literary Works". AAP. 2020-06-01.
  36. Lee, Timothy (2020-06-11). "Internet Archive ends "emergency library" early to appease publishers". Ars Technica. Retrieved 2020-06-14.