Google News Archive

Last updated
Google News Archive
Google News icon.svg
Type of site
Newspaper archive
Available inEnglish, German, French, Spanish, Italian, Polish, Portuguese, Chinese, Japanese, Korean, Dutch, Arabic, Hebrew, Norwegian, Czech, Hungarian, Swedish, Greek, Russian, Hindi, Telugu, Tamil, Turkish, and Malayalam.
Dissolved yes
Created by Google
URL news.google.com/newspapers
RegistrationNot required
LaunchedJune 6, 2006;17 years ago (2006-06-06)
Current statusScanning project discontinued; search function "facelift" is "in the works"

Google News Archive is an extension of Google News providing free access to scanned archives of newspapers and links to other newspaper archives on the web, both free and paid.

Contents

Some of the news archives date back to 18th century. There is a timeline view available, to select news from various years.

History

The archive went live on June 6, 2006, after Google acquired PaperofRecord.com, originally created by Robert J. Huggins and his team at Cold North Wind, Inc. The acquisition was not publicly announced by Cold North Wind until 2008.

While the service initially provided a simple index of other web pages, on September 8, 2008, Google News began to offer indexed content from scanned newspapers. [1] The depth of chronological coverage varies.

Newspapers were thought to have escaped copyright obligations of news articles because of Google's method of publishing the archives as searchable image files of the actual newspaper pages, rather than as pure text of articles.[ citation needed ]

In 2011, Google announced that it would no longer add content to the archive project. [2] On August 14, 2011, without notice, Google made the News Archives home page unavailable. Apparently, the service merged with Google News. [3] Carly Carlioi, an editor at the Boston Phoenix , speculated that Google discontinued the project because they found it harder than expected, for newspapers were more difficult to index than books because of layout complexities. [4] Another cause might have been that the project attracted a lesser audience than expected.

While archived newspapers [5] are still available for browsing, keyword searching is not fully functional. On December 16, 2013, Google News employee Stacie Chan wrote in the Google Product Forums that Google News is "performing a much needed facelift on our News Archive search function", and that access to archived stories would be limited for several months while "this new system" is being built. [6] This was reaffirmed on May 22 and July 30, 2014, when Chan wrote that Google is still "working on the archives to provide a better user experience", [7] and "it's in the works", [8] and again on December 18, 2014, when Chan wrote that Google "is currently working on creating a better experience on the Newspaper Archives that should be available in the near future." [9]

Some papers formerly included in the News Archive have been removed because of copyright issues. For instance, the archives of the Milwaukee Journal Sentinel disappeared on August 16, 2016, due to a contract between the paper's owner, the Gannett Company, and NewsBank. [10]

See also

Related Research Articles

Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the oldest digital library. Most of the items in its collection are the full texts of books or individual stories in the public domain. All files can be accessed for free under an open format layout, available on almost any computer. As of 13 February 2024, Project Gutenberg had reached 70,000 items in its collection of free eBooks.

<span class="mw-page-title-main">Internet Archive</span> American nonprofit digital archive

The Internet Archive is an American nonprofit digital library founded on May 10, 1996, and chaired by free information advocate Brewster Kahle. It provides free access to collections of digitized materials including websites, software applications, music, audiovisual and print materials. The Archive also advocates for a free and open Internet. As of February 4, 2024, the Internet Archive holds more than 44 million print materials, 10.6 million videos, 1 million software programs, 15 million audio files, 4.8 million images, 255,000 concerts, and over 835 billion web pages in its Wayback Machine. Its mission is committing to provide "universal access to all knowledge".

Electronic publishing includes the digital publication of e-books, digital magazines, and the development of digital libraries and catalogues. It also includes the editing of books, journals, and magazines to be posted on a screen.

<span class="mw-page-title-main">Digitization</span> Converting information into digital form

Digitization is the process of converting information into a digital format. The result is the representation of an object, image, sound, document, or signal obtained by generating a series of numbers that describe a discrete set of points or samples. The result is called digital representation or, more specifically, a digital image, for the object, and digital form, for the signal. In modern practice, the digitized data is in the form of binary numbers, which facilitates processing by digital computers and other operations, but digitizing simply means "the conversion of analog source material into a numerical format"; the decimal or any other number system can be used instead.

<span class="mw-page-title-main">Google News</span> News aggregator website and app

Google News is a news aggregator service developed by Google. It presents a continuous flow of links to articles organized from thousands of publishers and magazines. Google News is available as an app on Android, iOS, and the Web.

<i>Milwaukee Journal Sentinel</i> Newspaper based in Milwaukee, Wisconsin

The Milwaukee Journal Sentinel is a daily morning broadsheet printed in Milwaukee, Wisconsin, where it is the primary newspaper and also the largest newspaper in the state of Wisconsin, where it is widely read. It was purchased by the Gannett Company in 2016.

The Open Content Alliance (OCA) was a consortium of organizations contributing to a permanent, publicly accessible archive of digitized texts. Its creation was announced in October 2005 by Yahoo!, the Internet Archive, the University of California, the University of Toronto and others. Scanning for the Open Content Alliance was administered by the Internet Archive, which also provided permanent storage and access through its website.

Ancestry.com LLC is an American genealogy company based in Lehi, Utah. The largest for-profit genealogy company in the world, it operates a network of genealogical, historical records, and related genetic genealogy websites.

<span class="mw-page-title-main">Google Books</span> Service from Google

Google Books is a service from Google that searches the full text of books and magazines that Google has scanned, converted to text using optical character recognition (OCR), and stored in its digital database. Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives.

<span class="mw-page-title-main">Live Search Books</span>

Live Search Books was a search service for books launched in December 2006, part of Microsoft's Live Search range of services. Microsoft was working with a number of libraries, including the British Library, to digitize books and make them searchable, and in the case of out-of-copyright books, available across the web.

In United States copyright law, transformative use or transformation is a type of fair use that builds on a copyrighted work in a different manner or for a different purpose from the original, and thus does not infringe its holder's copyright. Transformation is an important issue in deciding whether a use meets the first factor of the fair-use test, and is generally critical for determining whether a use is in fact fair, although no one factor is dispositive.

<span class="mw-page-title-main">Book scanning</span> Process of converting physical media into digital media

Book scanning or book digitization is the process of converting physical books and magazines into digital media such as images, electronic text, or electronic books (e-books) by using an image scanner. Large scale book scanning projects have made many books available online.

<span class="mw-page-title-main">International Music Score Library Project</span> Library of public-domain music scores

The International Music Score Library Project (IMSLP), also known as the Petrucci Music Library after publisher Ottaviano Petrucci, is a subscription-based digital library of public-domain music scores. The project uses MediaWiki software, and as of 24 November 2023 has uploaded more than 736,000 scores and 80,700 recordings by 1,900 performers of more than 226,000 works by 27,400 composers. IMSLP has both an iOS app and an Android app.

The Michigan Digitization Project is a project in partnership with Google Books to digitize the entire print collection of the University of Michigan Library. The digitized collection is available through the University of Michigan Library catalog, Mirlyn, the HathiTrust Digital Library, and Google Books. Full-text of works that are out of copyright or in the public domain are available.

<span class="mw-page-title-main">Digital library</span> Online database of digital objects stored in electronic media formats and accessible via computers

A digital library is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital media formats or a library accessible through the internet. Objects can consist of digitized content like print or photographs, as well as originally produced digital content like word processor files or social media posts. In addition to storing content, digital libraries provide means for organizing, searching, and retrieving the content contained in the collection. Digital libraries can vary immensely in size and scope, and can be maintained by individuals or organizations. The digital content may be stored locally, or accessed remotely via computer networks. These information retrieval systems are able to exchange information with each other through interoperability and sustainability.

<i>Authors Guild, Inc. v. Google, Inc.</i> U.S. copyright law case, 2015

Authors Guild v. Google 804 F.3d 202 was a copyright case heard in federal court for the Southern District of New York, and then the Second Circuit Court of Appeals between 2005 and 2015. It concerned fair use in copyright law and the transformation of printed copyrighted books into an online searchable database through scanning and digitization. It centered on the legality of the Google Book Search Library Partner project that had been launched in 2003.

<span class="mw-page-title-main">Google Catalogs</span> Shopping application for tablet computers

Google Catalogs was a shopping application for tablet computers, which was produced by Google in August 2011. Google Catalogs delivered virtual catalogs to users from merchants like Nordstrom, L.L. Bean, Macy's, Pottery Barn, and many more. Merchants were added through a process by which they submitted a form with information and a sample of their catalog, which was then reviewed by Google's editorial team. The application was noted as a "Greener Way to Shop", as the digitization of catalogs substituted for paper versions.

The British Newspaper Archive web site provides access to searchable digitized archives of British and Irish newspapers. It was launched in November 2011.

<span class="mw-page-title-main">Heritage Microfilm, Inc.</span> Microfilm digitization business based in Cedar Rapids, Iowa

Heritage Microfilm, Inc. is a preservation microfilm and microfilm digitization business located in Cedar Rapids, Iowa.

<i>Authors Guild, Inc. v. HathiTrust</i> American legal case

Authors Guild v. HathiTrust, 755 F.3d 87, is a United States copyright decision finding search and accessibility uses of digitized books to be fair use.

References

  1. "Bringing history online, one newspaper at a time". September 8, 2008. Retrieved 2008-09-08.
  2. Horn, Leslie (May 20, 2011). "Google Ending Newspaper Archiving Project". PC Magazine . Retrieved 2011-05-20.
  3. "Google Integrates News Archive Search Into Current News Search, Weakens Archive Search". Internet for Lawyers
  4. Jared Keller (May 20, 2011). "Google Shuts Down Newspaper Archive Project". The Atlantic .
  5. "Google News Archive Search".
  6. C., Stacie (December 16, 2013). "Google News Archive update". Google Product Forums . Retrieved 2013-12-17.
  7. C., Stacie (May 22, 2014). "Google News Archive Search update..." Google Product Forums . Retrieved 2014-05-22.
  8. C., Stacie (August 10, 2014). "Google News Archive Search update..." Google Product Forums . Retrieved 2014-08-10.
  9. C., Stacie (December 18, 2014). "Google News Archive Search update..." Google Product Forums . Retrieved 2015-06-23.
  10. Takach, Michail (August 19, 2016). "Journal Sentinel Archive Disappears". Urban Milwaukee. Retrieved 2016-08-20.