Type of site | Newspaper archive |
---|---|
Available in | English, German, French, Spanish, Italian, Polish, Portuguese, Chinese, Japanese, Korean, Dutch, Arabic, Hebrew, Norwegian, Czech, Hungarian, Swedish, Greek, Russian, Hindi, Telugu, Tamil, Turkish, and Malayalam. |
Dissolved | yes |
Created by | |
URL | news |
Registration | Not required |
Launched | June 6, 2006 |
Current status | Scanning project discontinued; search function "facelift" is "in the works" |
Google News Archive is an extension of Google News providing free access to scanned archives of newspapers and links to other newspaper archives on the web, both free and paid.
Some of the news archives date back to 18th century. There is a timeline view available, to select news from various years.
The archive went live on June 6, 2006, after Google acquired PaperofRecord.com, originally created by Robert J. Huggins and his team at Cold North Wind, Inc. The acquisition was not publicly announced by Cold North Wind until 2008.
While the service initially provided a simple index of other web pages, on September 8, 2008, Google News began to offer indexed content from scanned newspapers. [1] The depth of chronological coverage varies.
Newspapers were thought to have escaped copyright obligations of news articles because of Google's method of publishing the archives as searchable image files of the actual newspaper pages, rather than as pure text of articles.[ citation needed ]
In 2011, Google announced that it would no longer add content to the archive project. [2] On August 14, 2011, without notice, Google made the News Archives home page unavailable. Apparently, the service merged with Google News. [3] Carly Carlioi, an editor at the Boston Phoenix , speculated that Google discontinued the project because they found it harder than expected, for newspapers were more difficult to index than books because of layout complexities. [4] Another cause might have been that the project attracted a lesser audience than expected.
While archived newspapers [5] are still available for browsing, keyword searching is not fully functional. On December 16, 2013, Google News employee Stacie Chan wrote in the Google Product Forums that Google News is "performing a much needed facelift on our News Archive search function", and that access to archived stories would be limited for several months while "this new system" is being built. [6] This was reaffirmed on May 22 and July 30, 2014, when Chan wrote that Google is still "working on the archives to provide a better user experience", [7] and "it's in the works", [8] and again on December 18, 2014, when Chan wrote that Google "is currently working on creating a better experience on the Newspaper Archives that should be available in the near future." [9]
Some papers formerly included in the News Archive have been removed because of copyright issues. For instance, the archives of the Milwaukee Journal Sentinel disappeared on August 16, 2016, due to a contract between the paper's owner, the Gannett Company, and NewsBank. [10]
Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the oldest digital library. Most of the items in its collection are the full texts of books or individual stories in the public domain. All files can be accessed for free under an open format layout, available on almost any computer. As of 13 February 2024, Project Gutenberg had reached 70,000 items in its collection of free eBooks.
The Internet Archive is an American nonprofit digital library founded on May 10, 1996, and chaired by free information advocate Brewster Kahle. It provides free access to collections of digitized materials including websites, software applications, music, audiovisual and print materials. The Archive also advocates for a free and open Internet. As of February 4, 2024, the Internet Archive holds more than 44 million print materials, 10.6 million videos, 1 million software programs, 15 million audio files, 4.8 million images, 255,000 concerts, and over 835 billion web pages in its Wayback Machine. Its mission is committing to provide "universal access to all knowledge".
Electronic publishing includes the digital publication of e-books, digital magazines, and the development of digital libraries and catalogues. It also includes the editing of books, journals, and magazines to be posted on a screen.
Digitization is the process of converting information into a digital format. The result is the representation of an object, image, sound, document, or signal obtained by generating a series of numbers that describe a discrete set of points or samples. The result is called digital representation or, more specifically, a digital image, for the object, and digital form, for the signal. In modern practice, the digitized data is in the form of binary numbers, which facilitates processing by digital computers and other operations, but digitizing simply means "the conversion of analog source material into a numerical format"; the decimal or any other number system can be used instead.
Google News is a news aggregator service developed by Google. It presents a continuous flow of links to articles organized from thousands of publishers and magazines. Google News is available as an app on Android, iOS, and the Web.
The Milwaukee Journal Sentinel is a daily morning broadsheet printed in Milwaukee, Wisconsin, where it is the primary newspaper and also the largest newspaper in the state of Wisconsin, where it is widely read. It was purchased by the Gannett Company in 2016.
The Open Content Alliance (OCA) was a consortium of organizations contributing to a permanent, publicly accessible archive of digitized texts. Its creation was announced in October 2005 by Yahoo!, the Internet Archive, the University of California, the University of Toronto and others. Scanning for the Open Content Alliance was administered by the Internet Archive, which also provided permanent storage and access through its website.
Ancestry.com LLC is an American genealogy company based in Lehi, Utah. The largest for-profit genealogy company in the world, it operates a network of genealogical, historical records, and related genetic genealogy websites.
Google Books is a service from Google that searches the full text of books and magazines that Google has scanned, converted to text using optical character recognition (OCR), and stored in its digital database. Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives.
Live Search Books was a search service for books launched in December 2006, part of Microsoft's Live Search range of services. Microsoft was working with a number of libraries, including the British Library, to digitize books and make them searchable, and in the case of out-of-copyright books, available across the web.
In United States copyright law, transformative use or transformation is a type of fair use that builds on a copyrighted work in a different manner or for a different purpose from the original, and thus does not infringe its holder's copyright. Transformation is an important issue in deciding whether a use meets the first factor of the fair-use test, and is generally critical for determining whether a use is in fact fair, although no one factor is dispositive.
Book scanning or book digitization is the process of converting physical books and magazines into digital media such as images, electronic text, or electronic books (e-books) by using an image scanner. Large scale book scanning projects have made many books available online.
The International Music Score Library Project (IMSLP), also known as the Petrucci Music Library after publisher Ottaviano Petrucci, is a subscription-based digital library of public-domain music scores. The project uses MediaWiki software, and as of 24 November 2023 has uploaded more than 736,000 scores and 80,700 recordings by 1,900 performers of more than 226,000 works by 27,400 composers. IMSLP has both an iOS app and an Android app.
The Michigan Digitization Project is a project in partnership with Google Books to digitize the entire print collection of the University of Michigan Library. The digitized collection is available through the University of Michigan Library catalog, Mirlyn, the HathiTrust Digital Library, and Google Books. Full-text of works that are out of copyright or in the public domain are available.
A digital library is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital media formats or a library accessible through the internet. Objects can consist of digitized content like print or photographs, as well as originally produced digital content like word processor files or social media posts. In addition to storing content, digital libraries provide means for organizing, searching, and retrieving the content contained in the collection. Digital libraries can vary immensely in size and scope, and can be maintained by individuals or organizations. The digital content may be stored locally, or accessed remotely via computer networks. These information retrieval systems are able to exchange information with each other through interoperability and sustainability.
Authors Guild v. Google 804 F.3d 202 was a copyright case heard in federal court for the Southern District of New York, and then the Second Circuit Court of Appeals between 2005 and 2015. It concerned fair use in copyright law and the transformation of printed copyrighted books into an online searchable database through scanning and digitization. It centered on the legality of the Google Book Search Library Partner project that had been launched in 2003.
Google Catalogs was a shopping application for tablet computers, which was produced by Google in August 2011. Google Catalogs delivered virtual catalogs to users from merchants like Nordstrom, L.L. Bean, Macy's, Pottery Barn, and many more. Merchants were added through a process by which they submitted a form with information and a sample of their catalog, which was then reviewed by Google's editorial team. The application was noted as a "Greener Way to Shop", as the digitization of catalogs substituted for paper versions.
The British Newspaper Archive web site provides access to searchable digitized archives of British and Irish newspapers. It was launched in November 2011.
Heritage Microfilm, Inc. is a preservation microfilm and microfilm digitization business located in Cedar Rapids, Iowa.
Authors Guild v. HathiTrust, 755 F.3d 87, is a United States copyright decision finding search and accessibility uses of digitized books to be fair use.