Digital Scriptorium | |
---|---|
Curators | Educational consortium |
Funded by | Institute for Museum and Library Services |
Website | digital-scriptorium.org |
Digital Scriptorium (DS) is a non-profit, tax-exempt consortium of American libraries with collections of medieval and early modern manuscripts, that is, handwritten books made in the traditions of the world's scribal cultures. [1] [2] The DS Catalog represents these manuscript collections on a web-based platform, forming a national union catalog for teaching and scholarly research in medieval and early modern studies.
The DS Catalog is an open-access resource based on Linked Open Data technologies and practices. It enables users to study manuscripts held in academic, research, and public libraries and museums in the United States. It makes available collections that are often restricted from public access and includes not only famous masterpieces of book illumination but also understudied manuscripts that have been previously overlooked for publication or study.
DS is overseen by a board of directors and is supported by its member institutions. As an organization with national representation, DS serves the interests of a diverse community of scholars, teachers, students, hobbyists, booksellers, and collectors—anyone with an interest in premodern manuscripts.
Founded in 1997, DS began as a joint project between the Bancroft Library at the University of California, Berkeley (under Prof. Charles Faulhaber [3] ) and the Rare Book & Manuscript Library of Columbia University (under Dr. Consuelo W. Dutschke [4] ), funded at its inception by the Andrew W. Mellon Foundation. DS has also been funded by grants from the National Endowment for the Humanities, the Gladys Krieble Delmas Foundation, and the Institute for Museum and Library Services. The original goal was to digitize catalog records and selected images from the two universities' medieval European and early Renaissance manuscript collections and to make them available in an online database on the World Wide Web. [5] By providing free online access to these collections, the founders hoped to inspire more research and study of medieval manuscript culture. Moreover, because of patterns of collecting in the 19th and early 20th centuries, many manuscripts in American collections comprise partial texts or detached single leaves. [6] Cataloging as many of these fragmentary works as possible increases the chance that some manuscripts could be reconstituted, if only virtually.
Based on this model, DS attracted additional members. Between 1999 and 2005, holdings were added from the Huntington Library, the University of Texas at Austin, the New York Public Library, Harvard University's Houghton Library, Yale University's Beinecke Library, and the University of Pennsylvania. Alongside these institutions with substantial collections were libraries with few but rare works, such as the Providence Public Library, which owns an unusual 15th-century Bible (Wetmore Ms 1) in rebus format. By September 2015, the DS database included catalog records for over 8,300 manuscripts and 47,000 digitized images. All manuscript records are now preserved on the Digital Scriptorium site on the Internet Archive.
The University of California, Berkeley provided the first home to the DS database, both in terms of managing the project and devising its initial technology. [7] For an interim period (2003–2011) DS was hosted at Columbia University, but it returned to Berkeley in 2011. The technical innovations produced by the teams of both originating universities created a digital product based on a progressive, standards-based digitization policy. [8] [9] [10] Originally using Microsoft Access as a cross-institutional data collection tool, the DS database used SGML and later XML to aggregate and query the combined information. When the database returned to U.C. Berkeley in 2011, a new platform was developed using software known as WebGenDB. WebGenDB is a non-proprietary, web-based interface for the underlying control database GenDB. [11] [12] However, weaknesses in both the technical platform and the workflows for data creation and management were by this time beginning to threaten the sustainability of Digital Scriptorium. [13] High standards for data entry required staffing and expertise that many institutions did not have. Further complicating member participation was the fact that many institutions were developing their own platforms for publishing manuscript metadata and images. For example, in 2005 the University of Pennsylvania Libraries began full digitization and cataloging using MARC standards and published the MARC records in its own OPAC (Online Public Access Catalog) as well as on its open-access digital repository OPenn. [14] Consequently, Penn Libraries stopped adding records to DS because the work was duplicative and the standards of cataloging in DS and in MARC were incompatible. With the increase in digitization of rare materials, especially of medieval manuscripts, [15] [16] across institutions in the United States and around the world, institutions had less and less need for DS's image hosting services.
When, in 2018, the UC Berkeley Library determined that it would no longer support the WebGenDB software underlying the DS database, the DS Board of Directors concluded that the DS technical platform required an overhaul. DS took this opportunity to reconsider not only the technical infrastructure but also the workflows and processes of the DS organization in creating and maintaining the platform.
A planning meeting held at the Beinecke Library in February 2019 brought together stakeholders to decide the future of DS. [17] The meeting resulted in the establishment of six guiding principles for development: 1) as a national union catalog, DS 2.0's primary function will be to enable researchers to find premodern manuscripts in US collections, including non-European manuscripts; 2) DS 2.0 will require minimal standards for data entry; 3) members will manage their own manuscript metadata in their institutional formats; 4) DS 2.0 will use what members provide from their institutional record and will not correct or add to a member's metadata; 5) DS 2.0 will not host images, but will provide IIIF functionality to view images within the platform; and 6) DS 2.0 will reconcile and enhance metadata with external authorities and in-house Manuscript ID and Name Authorities and will make DS 2.0 data available for reuse.
In 2020, the Schoenberg Institute for Manuscript Studies at the University of Pennsylvania Libraries was awarded, on behalf of DS, a planning grant from the Institute for Museum and Library Services [18] [19] to develop a data model for DS 2.0 and implement a prototype using Wikibase as its technical platform. [20] DS 2.0 addresses the earlier workflow challenges by transforming structured metadata created and maintained by member institutions into Linked Open Data (LOD) and enriching it with semantic connections to external authorities and Wikidata. DS Catalog entries also link out to member institutions' websites and digital repositories, where users can discover more detailed information about, and often images of, the manuscripts held in their respective home collections.
The DS Catalog is an online data repository, a semantic portal, and a knowledge base that allows users to explore and query heterogeneous data contained in manuscript records from multiple sources in a single interface powered by LOD. [21] [22] The DS Catalog transforms member institutions' structured metadata into LOD and enriches it with semantic connections to external authorities and vocabularies, including the Getty Vocabularies, FAST, and Wikidata. DS Catalog entries also link out to member institutions' websites and digital repositories, where users can discover more detailed information about the manuscripts held in their respective home collections.
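The transformation described above can be illustrated with a minimal sketch. It is not the DS data model or its Wikibase implementation: the field names, the `example.org` URIs, and the small authority table are all hypothetical placeholders. The sketch only shows the general idea of turning a member's structured record into subject-predicate-object statements, keeping the member's own values unchanged, and adding a link to an external authority wherever a value can be reconciled.

```python
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical authority table (label -> identifier). These are placeholder
# URIs, not real Getty, FAST, or Wikidata identifiers.
AUTHORITY = {
    "book of hours": "https://example.org/authority/genre/book-of-hours",
    "latin": "https://example.org/authority/language/latin",
}


def reconcile(label: str) -> Optional[str]:
    """Look up a normalized label in the authority table."""
    return AUTHORITY.get(label.strip().lower())


def record_to_triples(record: dict, base: str = "https://example.org/ds/") -> list:
    """Turn one flat metadata record into a list of triples."""
    subject = base + record["id"]
    triples = []
    for field, value in record.items():
        if field == "id":
            continue
        predicate = base + "prop/" + field
        # The value supplied by the member institution is kept as-is.
        triples.append(Triple(subject, predicate, value))
        # If the value reconciles, add a separate statement linking to the authority.
        uri = reconcile(value)
        if uri is not None:
            triples.append(Triple(subject, predicate + "/authority", uri))
    return triples


# Hypothetical record as a member institution might supply it.
record = {
    "id": "ms-0001",
    "genre": "Book of Hours",
    "language": "Latin",
    "holding": "Example Library",
}
triples = record_to_triples(record)
for t in triples:
    print(t)
```

Keeping the original value and the authority link as separate statements is one way to enrich a record without overwriting what the member provided.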
The beta version of DS 2.0 launched in March 2023 and is known as the DS Catalog. [23]
Since 1997, Digital Scriptorium has enabled public viewing of non-circulating materials normally available only to specialists with restricted access. Special emphasis has been placed on touchstone materials such as manuscripts signed and dated by scribes, thus beginning the American contribution to the goal established in 1953 by the Comité international de paléographie latine (International Committee of Latin Paleography): to document the relatively small number of codices of certain origin that will serve stylistically to localize and date the vast quantities of unsigned manuscripts. DS publishes not only manuscripts of firm attribution but also ones that have traditionally gone unnoticed and need further scholarly attention. Because it is web-based, it also allows for updates and corrections, and as a matter of form individual records in DS acknowledge contributions from outside scholars. Because the DS consortium consists of academic, public, and rare book libraries and museums, it encourages a broad audience that both contributes to and benefits from a shared body of knowledge. While attending to the needs of a community of specialists, including medievalists, classicists, musicologists, paleographers, diplomatists, literary scholars, and art historians, DS also recognizes a public user community that values rare and unique works of historical, literary, and artistic significance. [24]
A manuscript was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has come to be understood to further include any written, typed, or word-processed copy of an author's work, as distinguished from the rendition as a printed version of the same.
An illuminated manuscript is a formally prepared document where the text is decorated with flourishes such as borders and miniature illustrations. Often used in the Roman Catholic Church for prayers and liturgical books such as psalters, as well as for courtly literature, the practice continued into secular texts from the 13th century onward, which typically include proclamations, enrolled bills, laws, charters, inventories, and deeds.
Digitization is the process of converting information into a digital format. The result is the representation of an object, image, sound, document, or signal obtained by generating a series of numbers that describe a discrete set of points or samples. The result is called digital representation or, more specifically, a digital image, for the object, and digital form, for the signal. In modern practice, the digitized data is in the form of binary numbers, which facilitates processing by digital computers and other operations, but digitizing simply means "the conversion of analog source material into a numerical format"; the decimal or any other number system can be used instead.
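As a rough illustration of this definition, the sketch below samples a continuous signal at discrete points and maps each sample to one of a fixed number of integer levels. The function name `digitize`, the sample rate, and the number of levels are illustrative choices, not part of any standard. The same integer codes are then written out in both binary and decimal form, to show that the number system is only a matter of representation.

```python
import math


def digitize(signal, duration, sample_rate, levels):
    """Sample signal(t) at regular intervals and quantize to codes 0..levels-1.

    The signal is assumed to take values in [-1.0, 1.0]; values outside
    that range are clipped.
    """
    n_samples = int(duration * sample_rate)
    codes = []
    for i in range(n_samples):
        t = i / sample_rate                    # discrete sample point
        x = max(-1.0, min(1.0, signal(t)))     # clip to the expected range
        codes.append(round((x + 1.0) / 2.0 * (levels - 1)))
    return codes


# One second of a 1 Hz sine wave, 8 samples per second, 16 levels (4 bits).
codes = digitize(lambda t: math.sin(2 * math.pi * t), duration=1.0, sample_rate=8, levels=16)

binary = [format(c, "04b") for c in codes]   # the usual machine representation
decimal = [str(c) for c in codes]            # same values, different number system

print(binary)
print(decimal)
```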
The National Digital Newspaper Program is a joint project between the National Endowment for the Humanities and the Library of Congress to create and maintain a publicly available, online digital archive of historically significant newspapers published in the United States between 1836 and 1922. Additionally, the program will make available bibliographic records and holdings information for some 140,000 newspaper titles from the 17th century to the present. Further, it will include scope notes and encyclopedia-style entries discussing the historical significance of specific newspapers. Added content will also include contextually relevant historical information. "One organization within each U.S. state or territory will receive an award to collaborate with relevant state partners in this effort."
In library and archival science, digital preservation is a formal process to ensure that digital information of continuing value remains accessible and usable in the long term. It involves planning, resource allocation, and application of preservation methods and technologies, and combines policies, strategies and actions to ensure access to reformatted and "born-digital" content, regardless of the challenges of media failure and technological change. The goal of digital preservation is the accurate rendering of authenticated content over time.
An institutional repository (IR) is an archive for collecting, preserving, and disseminating digital copies of the intellectual output of an institution, particularly a research institution. Academics also utilize their IRs for archiving published works to increase their visibility and collaboration with other academics. However, most of these outputs produced by universities are not effectively accessed and shared by researchers and other stakeholders. As a result academics should be involved in the implementation and development of an IR project so that they can learn the benefits and purpose of building an IR.
Google Books is a service from Google that searches the full text of books and magazines that Google has scanned, converted to text using optical character recognition (OCR), and stored in its digital database. Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives.
The California Digital Library (CDL) was founded by the University of California in 1997. Under the leadership of then UC President Richard C. Atkinson, the CDL's original mission was to forge a better system for scholarly information management and improved support for teaching and research. In collaboration with the ten University of California Libraries and other partners, CDL assembled one of the world's largest digital research libraries. CDL facilitates the licensing of online materials and develops shared services used throughout the UC system. Building on the foundations of the Melvyl Catalog, CDL has developed one of the largest online library catalogs in the country and works in partnership with the UC campuses to bring the treasures of California's libraries, museums, and cultural heritage organizations to the world. CDL continues to explore how services such as digital curation, scholarly publishing, archiving and preservation support research throughout the information lifecycle.
Preservation metadata is item-level information that describes the context and structure of a digital object. It provides background details pertaining to a digital object's provenance, authenticity, and environment. Preservation metadata is a specific type of metadata that works to maintain a digital object's viability while ensuring continued access by providing contextual information, usage details, and rights.
The Center for the Study of New Testament Manuscripts (CSNTM) is a 501(c)(3) non-profit organization whose mission is to digitally preserve Greek New Testament manuscripts. Toward that end, CSNTM takes digital photographs of manuscripts at institutions, libraries, museums, monasteries, universities, and archives around the world. The images produced are freely accessible on the Center's website—a searchable library of Greek New Testament manuscripts. With more than 50,000 users examining manuscripts in their digital library each year, the Center's digitization work facilitates a partnership between manuscript owners, archivists, and researchers around the world.
Metadata is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:
A digital library is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital media formats or a library accessible through the internet. Objects can consist of digitized content like print or photographs, as well as originally produced digital content like word processor files or social media posts. In addition to storing content, digital libraries provide means for organizing, searching, and retrieving the content contained in the collection. Digital libraries can vary immensely in size and scope, and can be maintained by individuals or organizations. The digital content may be stored locally, or accessed remotely via computer networks. These information retrieval systems are able to exchange information with each other through interoperability and sustainability.
Digital Maryland, formerly Maryland Digital Cultural Heritage (MDCH), is a collaborative, statewide digitization program. Headquartered at the Enoch Pratt Free Library/State Library Resource Center in Baltimore, the program partners with Maryland libraries, archives, historical societies, museums, and other institutions to digitize and provide free online access to materials relating to the state's history and culture. Materials in Digital Maryland's online digital collections include maps, manuscripts, photographs, artwork, books, and other media.
Smithsonian Libraries and Archives is an institutional archives and library system comprising 21 branch libraries serving the various Smithsonian Institution museums and research centers. The Libraries and Archives serve Smithsonian Institution staff as well as the scholarly community and general public with information and reference support. Its collections number nearly 3 million volumes including 50,000 rare books and manuscripts.
The Water Resources Collections and Archives (WRCA), formerly known as the Water Resources Center Archives, is an archive with unpublished manuscript collections and a library with published materials. It was established to collect unique, hard-to-find, technical report materials pertaining to all aspects of water resources and supply in California and the American West. Located on the campus of the University of California Riverside (UCR), it is jointly administered by the UCR College of Natural and Agricultural Sciences (CNAS) and the UCR Libraries. WRCA was part of the University of California Center for Water Resources (WRC) that was established and funded in 1957 by a special act of the California State Legislature and was designated the California Water Research Institute by a federal act in 1964.
The JISC Digitisation Programme was a series of projects to digitise the cultural heritage and scholarly materials in universities, libraries, museums, archives, and other cultural memory organizations in the United Kingdom, from 2004 to 2010. The program was managed by the UK's Joint Information Systems Committee, the body that supports United Kingdom post-16 and higher education and research in support of learning, teaching, research and administration in the context of ICT.
D-Scribe Digital Publishing is an open access electronic publishing program of the University Library System (ULS) of the University of Pittsburgh. It comprises over 100 thematic collections that together contain over 100,000 digital objects. This content, most of which is available through open access, includes both digitized versions of materials from the collections of the University of Pittsburgh and other local institutions as well as original 'born-electronic' content actively contributed by scholars worldwide. D-Scribe includes such items as photographs, maps, books, journal articles, dissertations, government documents, and technical reports, along with over 745 previously out-of-print titles published by the University of Pittsburgh Press. The digital publishing efforts of the University Library System began in 1998 and have won praise for their innovation from the leadership at the Association of Research Libraries and peer institutions.
Howard Besser is a scholar of digital preservation, digital libraries, and preservation of film and video. He is Professor of Cinema Studies and the founding director of the NYU Moving Image Archiving and Preservation Program ("MIAP"), a graduate program in the Tisch School. Besser also worked as a Senior Scientist at New York University's Digital Library Initiative. He has conducted extensive research on image databases, multimedia, digital libraries, and the social and cultural influence of information technology. Besser is a prolific writer and speaker, and has consulted with many governments, educational institutions, and arts agencies on digital preservation matters. He has also researched new technology in libraries, archives, and museums, and has contributed at the international level to the development of metadata standards in the cultural heritage community. His work has focused predominantly on image and multimedia databases; aspects of digital libraries; the cultural and societal impacts of information technology; and new teaching methods that use technology, such as web-based instruction and distance learning. Besser was closely involved in the development of the Dublin Core and the Metadata Encoding and Transmission Standard (METS), international standards within librarianship.
The International Image Interoperability Framework defines several application programming interfaces that provide a standardised method of describing and delivering images over the web, as well as "presentation based metadata" about structured sequences of images. If institutions holding artworks, books, newspapers, manuscripts, maps, scrolls, single sheet collections, and archival materials provide IIIF endpoints for their content, any IIIF-compliant viewer or application can consume and display both the images and their structural and presentation metadata.
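As a small illustration of what a standardised method of delivering images means in practice, the sketch below builds request URLs following the IIIF Image API URI pattern `{base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}`. The server address and manuscript identifier are made-up placeholders, the helper names are hypothetical, and the default size keyword `max` assumes a server that implements Image API 2.1 or later.

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="max",
                   rotation="0", quality="default", fmt="jpg"):
    """Build an IIIF Image API request URL.

    The identifier is percent-encoded (including any slashes) so that it
    occupies a single path segment.
    """
    ident = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{ident}/{region}/{size}/{rotation}/{quality}.{fmt}"


def iiif_info_url(base, identifier):
    """Build the URL of the image information document (info.json)."""
    return f"{base.rstrip('/')}/{quote(identifier, safe='')}/info.json"


# Hypothetical server and identifier, for illustration only.
BASE = "https://iiif.example.org/image/"
full_page = iiif_image_url(BASE, "ms 1/f1r")
detail = iiif_image_url(BASE, "ms 1/f1r", region="0,0,1024,1024", size="512,")
info = iiif_info_url(BASE, "ms 1/f1r")

print(full_page)
print(detail)
print(info)
```

Because every compliant server accepts the same URL pattern, a viewer can request a full page or a cropped detail from any holding institution without knowing how that institution stores its images.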
The ETH Library, serving as the central university library at ETH Zurich, has a notable collection of scientific and technical information. It is considered one of the largest public scientific and technical libraries in Switzerland. Furthermore, it also offers resources for the public and companies in research and development. Particular emphasis is placed on electronic information for university members and the development of innovative services.