The Historic Cambridge Newspaper Collection [1] is a freely available, keyword searchable archive of four Cambridge, Massachusetts newspapers. The collection includes editions that are in the public domain. The Collection is a project of the Cambridge Room, the Archives and Special Collections of the Cambridge Public Library, and is supported by funding from the Community Preservation Act. In excess of 650,000 articles are available.
The newspapers are scanned and OCRd. Both images and OCR are made available, registered members may correct the OCR.
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo or from subtitle text superimposed on an image.
Cambridge University Library is the main research library of the University of Cambridge. It is the largest of over 100 libraries within the university. The library is a major scholarly resource for members of the University of Cambridge and external researchers. It is often referred to within the university as the UL. Thirty-three faculty and departmental libraries are associated with the University Library for the purpose of central governance and administration, forming "Cambridge University Libraries".
State Library Victoria (SLV) is the state library of Victoria, Australia. Located in Melbourne, it was established in 1854 as the Melbourne Public Library, making it Australia's oldest public library and one of the first free libraries in the world. It is also Australia's busiest library and, as of 2018, the world's fourth-most-visited library.
The Phoenix was the name of several alternative weekly periodicals published in the United States of America by Phoenix Media/Communications Group of Boston, Massachusetts, including the Portland Phoenix and the now-defunct Boston Phoenix, Providence Phoenix and Worcester Phoenix. These publications emphasized local arts and entertainment coverage as well as lifestyle and political coverage. The Portland Phoenix, although it is still publishing, is now owned by another company, New Portland Publishing.
Google Books is a service from Google that searches the full text of books and magazines that Google has scanned, converted to text using optical character recognition (OCR), and stored in its digital database. Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives.
Tesseract is an optical character recognition engine for various operating systems. It is free software, released under the Apache License. Originally developed by Hewlett-Packard as proprietary software in the 1980s, it was released as open source in 2005 and development has been sponsored by Google since 2006.
What Is to Be Done?, sometimes translated as What Then Must We Do?, is a non-fiction work by Leo Tolstoy in which he describes the social conditions of Russia in his day.
The Buffalo & Erie County Public Library is located on Lafayette Square, Buffalo, New York. The current facility, designed by Kideney Architects and built in 1964, replaced the original Cyrus Eidlitz Buffalo Public Library Building dedicated in February 1887. The first Buffalo Public Library, in turn, replaced the Erie County, New York courthouse, which occupied the parcel from 1816-1876.
The Collection Budé, or the Collection des Universités de France, is an editorial collection comprising the Greek and Latin classics up to the middle of the 6th century. It is published by Les Belles Lettres, and is sponsored by the Association Guillaume Budé.
The Cambridge Public Library in Cambridge, Massachusetts is part of the Minuteman Library Network. It consists of a main library and six branches, located throughout the city. Having developed from the Cambridge Athenaeum, the main library building was built in 1888, and was expanded and renovated in 2009. This expansion greatly increased the area of the building, more than tripling the square footage.
Henry Van Brunt FAIA was a 19th-century American architect and architectural writer.
Trove is an Australian online library database owned by the National Library of Australia in which it holds partnerships with source providers National and State Libraries Australia, an aggregator and service which includes full text documents, digital images, bibliographic and holdings data of items which are not available digitally, and a free faceted-search engine as a discovery tool.
Heritage Microfilm, Inc. is a preservation microfilm and microfilm digitization business located in Cedar Rapids, Iowa.
The California Digital Newspaper Collection (CDNC) is a freely-available, archive of digitized California newspapers; it is accessible through the project's website. The collection contains over six million pages from over forty-two million articles. The project is part of the Center for Bibliographical Studies and Research (CBSR) at the University of California Riverside.
Newspaper digitization is the process of converting old newspapers from analog form into digital images. The most common analog forms for old newspapers are paper and microfilm. Digitized images of newspaper pages are typically analyzed with OCR software in order to produce text files of the newspaper content. Newspaper digitization is a special case of digitization in general.
Francis John Henry Jenkinson was librarian of the University of Cambridge 1889–1923. He was succeeded by A. F. Scholfield.
Dmytro Ivanovych Bahalii was a Ukrainian historian and public and political figure, one of founding members of the National Academy of Sciences of Ukraine, and a full member of the Shevchenko Scientific Society since 1923. He was also a professor and rector at Kharkiv University, and mayor of Kharkiv (1914–1917).