Open Content Alliance

The Open Content Alliance (OCA) was a consortium of organizations contributing to a permanent, publicly accessible archive of digitized texts. Its creation was announced in October 2005 by Yahoo!, the Internet Archive, the University of California, the University of Toronto and others. [1] Scanning for the Open Content Alliance was administered by the Internet Archive, which also provided permanent storage and access through its website.

The OCA was, in part, a response to Google Book Search, which was announced in October 2004. OCA's approach to seeking permission from copyright holders differed significantly from that of Google Book Search. OCA digitized copyrighted works only after asking for and receiving permission from the copyright holder ("opt-in"). By contrast, Google Book Search digitized copyrighted works unless explicitly told not to do so ("opt-out"), and contended that digitizing for the purposes of indexing is fair use.

Microsoft had a special relationship with the Open Content Alliance until May 2008. Microsoft joined the Open Content Alliance in October 2005 as part of its Live Book Search project. [2] However, in May 2008 Microsoft announced that it would end the Live Book Search project and stop funding the scanning of books through the Internet Archive. [3] Microsoft removed any contractual restrictions on the content it had scanned and relinquished the scanning equipment to its digitization partners and libraries so that they could continue their digitization programs. [3] Between about 2006 and 2008, Microsoft sponsored the scanning of over 750,000 books, 300,000 of which are now part of the Internet Archive's online collections.

Opposition to Google Book Settlement

Brewster Kahle, a founder of the Open Content Alliance, actively opposed the proposed Google Book Settlement until its defeat in March 2011.

Contributors

The following are contributors to the OCA:

Biodiversity Heritage Library, a cooperative project of:

Related Research Articles

Internet Archive: American non-profit digital archive

The Internet Archive is an American digital library with the stated mission of "universal access to all knowledge". It provides free public access to collections of digitized materials, including websites, software applications/games, music, movies/videos, moving images, and millions of books. In addition to its archiving function, the Archive is an activist organization, advocating a free and open Internet. As of January 1, 2023, the Internet Archive holds over 36 million books and texts, 11.6 million movies, videos and TV shows and clips, 950 thousand software programs, 15 million audio files, 4.5 million images, 251 thousand concerts, and 780 billion web pages in the Wayback Machine.

Digitization: Converting information into digital form

Digitization is the process of converting information into a digital format. The result is the representation of an object, image, sound, document, or signal obtained by generating a series of numbers that describe a discrete set of points or samples. The result is called digital representation or, more specifically, a digital image, for the object, and digital form, for the signal. In modern practice, the digitized data is in the form of binary numbers, which facilitates processing by digital computers and other operations, but digitizing simply means "the conversion of analog source material into a numerical format"; the decimal or any other number system can be used instead.

Rick Prelinger: American film director

Rick Prelinger is an archivist, writer, filmmaker, and professor at the University of California, Santa Cruz. He is the founder of the Prelinger Archives, a collection of 60,000 advertising, educational, industrial, and amateur films acquired by the Library of Congress in 2002 after 20 years' operation.

Google Books: Service from Google

Google Books is a service from Google Inc. that searches the full text of books and magazines that Google has scanned, converted to text using optical character recognition (OCR), and stored in its digital database. Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives.

A universal library is a library with universal collections. This may be expressed in terms of it containing all existing information, useful information, all books, all works or even all possible works. This ideal, although unrealizable, has influenced librarians and others and remains a goal to which they aspire. Universal libraries are often assumed to have a complete set of useful features.

Live Search Books

Live Search Books was a search service for books launched in December 2006, part of Microsoft's Live Search range of services. Microsoft was working with a number of libraries, including the British Library, to digitize books and make them searchable, and in the case of out-of-copyright books, available across the web.

Google News Archive is an extension of Google News providing free access to scanned archives of newspapers and links to other newspaper archives on the web, both free and paid.

Book scanning: Process of converting physical media into digital media

Book scanning or book digitization is the process of converting physical books and magazines into digital media such as images, electronic text, or electronic books (e-books) by using an image scanner. Large-scale book scanning projects have made many books available online.

Sidney Verba was an American political scientist, librarian and library administrator. His academic interests were mainly American and comparative politics. He was the Carl H. Pforzheimer University Professor at Harvard University and also served Harvard as the director of the Harvard University Library from 1984 to 2007.

The Michigan Digitization Project is a project in partnership with Google Books to digitize the entire print collection of the University of Michigan Library. The digitized collection is available through the University of Michigan Library catalog, Mirlyn, the HathiTrust Digital Library, and Google Books. Full-text of works that are out of copyright or in the public domain are available.

Biodiversity Heritage Library: Discipline-oriented digital libraries

The Biodiversity Heritage Library (BHL) is the world's largest open access digital library for biodiversity literature and archives. BHL operates as a worldwide consortium of natural history, botanical, research, and national libraries that work together to digitize the natural history literature held in their collections and make it freely available for open access as part of a global "biodiversity community". The BHL consortium works with the international taxonomic community, publishers, bioinformaticians, and information technology professionals to develop tools and services to facilitate greater access, interoperability, and reuse of content and data. BHL provides a range of services, data exports, and APIs to allow users to download content, harvest source data files, and reuse materials for research purposes. Through taxonomic intelligence tools developed by Global Names Architecture, BHL indexes the taxonomic names throughout the collection, allowing researchers to locate publications about specific taxa. In partnership with the Internet Archive and through local digitization efforts, BHL's portal provides free access to hundreds of thousands of volumes, comprising over 59 million pages, from the 15th to the 21st centuries.

Encyclopedia of Life: Free, online collaborative encyclopedia that documents species

The Encyclopedia of Life (EOL) is a free, online encyclopedia intended to document all of the 1.9 million living species known to science. It is compiled from existing trusted databases curated by experts and with the assistance of non-experts throughout the world. It aims to build one "infinitely expandable" page for each species, including video, sound, images, graphics, as well as text. In addition, the Encyclopedia incorporates content from the Biodiversity Heritage Library, which digitizes millions of pages of printed literature from the world's major natural history libraries. The project was initially backed by a US$50 million funding commitment, led by the MacArthur Foundation and the Sloan Foundation, who provided US$20 million and US$5 million, respectively. The additional US$25 million came from five cornerstone institutions—the Field Museum, Harvard University, the Marine Biological Laboratory, the Missouri Botanical Garden, and the Smithsonian Institution. The project was initially led by Jim Edwards and the development team by David Patterson. Today, participating institutions and individual donors continue to support EOL through financial contributions.

Hawkins Electrical Guide: Book by Nehemiah Hawkins

The Hawkins Electrical Guide was a technical engineering book written by Nehemiah Hawkins, first published in 1914, intended to explain the highly complex principles of the new technology of electricity in a way that could be understood by the common man. The book is notable for the extremely high number of detailed illustrations it contains, and the small softbound size of the volumes.

HathiTrust: Digital library

HathiTrust Digital Library is a large-scale collaborative repository of digital content from research libraries including content digitized via Google Books and the Internet Archive digitization initiatives, as well as content digitized locally by libraries.

A digital library, also called an online library, an internet library, a digital repository, or a digital collection, is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital media formats, or a library accessible through the internet. Objects can consist of digitized content like print or photographs, as well as originally produced digital content like word processor files or social media posts. In addition to storing content, digital libraries provide means for organizing, searching, and retrieving the content contained in the collection. Digital libraries can vary immensely in size and scope, and can be maintained by individuals or organizations. The digital content may be stored locally, or accessed remotely via computer networks. These information retrieval systems are able to exchange information with each other through interoperability and sustainability.

The Book Rights Registry is an entity to be founded as part of a settlement of the lawsuit between the Authors Guild and Google over the Google Books scanning project. The Registry will be initially funded by $34.5 million from Google but it will be an independent, not-for-profit organization that collects and disburses revenue from third party users of content to authors, publishers and other rightsholders. According to the Settlement Agreement, the Registry will own and maintain a rights information database for all books covered by the Agreement and their authors and publishers. It will also resolve disputes between rightsholders.

Authors Guild, Inc. v. Google, Inc.: U.S. copyright law case, 2015

Authors Guild v. Google 721 F.3d 132 was a copyright case heard in the United States District Court for the Southern District of New York, and on appeal to the United States Court of Appeals for the Second Circuit between 2005 and 2015. The case concerned fair use in copyright law and the transformation of printed copyrighted books into an online searchable database through scanning and digitization. The case centered on the legality of the Google Book Search Library Partner project that had been launched in 2003.

Biodiversity Heritage Library for Europe

The Biodiversity Heritage Library for Europe (BHL-Europe) was a three-year (2009–2012) EU project aimed at coordinating the digitization of literature on biodiversity. It involved 28 major natural history museums, botanical gardens, libraries and other European institutions. BHL-Europe was founded in Berlin in May 2009 and regarded itself as a European partner project of the Biodiversity Heritage Library (BHL) project, which was founded in 2005 and initially formed by ten United States and British libraries.

American Libraries is a digital collection of ebooks and texts at the Internet Archive. This collection contains over 1,900,000 items sponsored by these partners:

The Boston Library Consortium (BLC) is a library consortium based in the Boston area with 23 member institutions across New England.

References

  1. Katie Hafner (2005-10-03). "Open Content Alliance". The New York Times. Retrieved 2013-10-23.
  2. Katie Hafner (2005-10-26). "Microsoft to Offer Online Book-Content Searches". The New York Times. Retrieved 2013-10-23.
  3. "Book search winding down". Live Search Blog. Official announcement from Microsoft. Retrieved 2008-05-23.
  4. "National Writers Union Joins Open Book Alliance". Nwuboston.org. 2009-09-04. Archived from the original on 2009-09-07. Retrieved 2013-10-23.