Transcribe Bentham


Type of site: Crowdsourced transcription project
Available in: English, French, Latin, Greek
Owner: Transcribe Bentham team
Created by: Transcribe Bentham team
URL: http://www.ucl.ac.uk/transcribe-bentham
Commercial: No
Registration: Yes
Launched: 8 September 2010
Current status: Ongoing

Transcribe Bentham is a crowdsourced manuscript transcription project, run by University College London's Bentham Project, [1] in partnership with UCL Centre for Digital Humanities, UCL Library Services, UCL Learning and Media Services, the University of London Computer Centre, and the online community. Transcribe Bentham was launched under a twelve-month Arts and Humanities Research Council grant.


For two years from October 2012, the project was funded by a grant from the Andrew W. Mellon Foundation's 'Scholarly Communications' programme, and the project consortium has been expanded to include the British Library. [2]

Rationale

Transcribe Bentham was launched in September 2010. Through a transcription interface based on a customised MediaWiki, the project makes available high-quality digital images of UCL's vast collection of unpublished manuscripts written by the philosopher and reformer Jeremy Bentham. The collection runs to some 60,000 manuscript folios (an estimated 30,000,000 words). Under the Mellon Foundation grant, the remainder of the UCL Bentham Papers was digitised, along with all of the British Library's own collection of Bentham manuscripts, some 12,500 manuscript folios (or an estimated 6,000,000 words).

The project recruits volunteers to help transcribe the material and thereby contribute to the Bentham Project's production of the new edition of The Collected Works of Jeremy Bentham. Volunteer-produced transcripts are also uploaded to UCL's digital Bentham Papers repository, [3] in order to widen access to the collection and ensure its long-term preservation.

Transcription

Volunteers can sign up for a user account at the Transcription Desk. [4] Once registered, they are given transcriber privileges. A volunteer then selects a manuscript and is presented with the manuscript image alongside a free-text box, into which they enter their transcript (which can be saved at any time). Volunteers are also asked to add some basic formatting to their transcripts and to encode their work in Text Encoding Initiative (TEI)-compliant XML using a specially designed transcription toolbar. Using the toolbar, a volunteer can highlight a piece of text, or a position in the text, and click a button to mark a particular characteristic of the selection. These characteristics include line breaks, paragraphs, unusual spellings, and the frequent additions, deletions and marginalia present in the manuscripts.
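The toolbar behaviour described above can be illustrated with a minimal Python sketch. It is not the project's actual code: the button names, the mapping to TEI elements (`p`, `sic`, `add`, `del`, `note`, and the empty `lb` element), and the sample text are all assumptions chosen for illustration.

```python
# Hypothetical mapping from a toolbar button to the TEI element it inserts.
# The element names are standard TEI tags; the button names are invented.
WRAPPING_TAGS = {
    "paragraph": "p",
    "unusual_spelling": "sic",
    "addition": "add",
    "deletion": "del",
    "marginal_note": "note",
}


def wrap_selection(text: str, start: int, end: int, feature: str) -> str:
    """Wrap the highlighted span text[start:end] in the element for `feature`."""
    tag = WRAPPING_TAGS[feature]
    return f"{text[:start]}<{tag}>{text[start:end]}</{tag}>{text[end:]}"


def insert_line_break(text: str, position: int) -> str:
    """Insert an empty line-break element at a position in the text."""
    return f"{text[:position]}<lb/>{text[position:]}"


if __name__ == "__main__":
    # Invented sample line, not a quotation from the manuscripts.
    line = "the greatest hapiness"
    start = line.index("hapiness")
    print(wrap_selection(line, start, start + len("hapiness"), "unusual_spelling"))
```

The sketch treats highlighting a span and marking a position as two separate operations, which mirrors the distinction the toolbar makes between a piece of text and a position in the text.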

When a volunteer is happy with their transcript, they submit it to Transcribe Bentham project staff for checking. Staff make changes to the text and code if necessary, and decide whether the transcript is complete enough to be uploaded to the digital repository. If they decide that no further appreciable improvements can be made, the transcript is locked against further editing and converted to an XML file. However, if staff decide that a submitted transcript is incomplete (for example, if it is only partially transcribed, or a number of words are missing or unclear), it remains unlocked for further crowdsourcing.
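The checking step can be summarised as a simple decision rule. The following Python sketch is a loose model only: in practice the decision is an editorial judgement by staff, and the `Submission` fields and the `max_unclear_words` threshold are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    """Hypothetical summary of a submitted transcript after staff checking."""
    fully_transcribed: bool  # False if only part of the manuscript was transcribed
    unclear_words: int       # count of missing or unclear words


def review(submission: Submission, max_unclear_words: int = 0) -> str:
    """Return 'locked' if the transcript is judged complete, else 'unlocked'.

    The threshold is an assumption; the source describes a judgement call,
    not a fixed number.
    """
    if submission.fully_transcribed and submission.unclear_words <= max_unclear_words:
        return "locked"      # converted to XML and uploaded to the repository
    return "unlocked"        # stays open for further crowdsourcing


if __name__ == "__main__":
    print(review(Submission(fully_transcribed=True, unclear_words=0)))
    print(review(Submission(fully_transcribed=False, unclear_words=3)))
```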

Work to improve and modify the transcription interface is ongoing.

As of 4 January 2019, volunteers had transcribed or partially transcribed 21,307 manuscripts - around 10.5 million words - of which 94% were of the required standard to form a basis for editorial work, and to be uploaded to the digital repository. Monthly progress updates are issued via the Transcribe Bentham blog. [5]

Media coverage and prizes

The work of Transcribe Bentham has been reported on by the international media. This coverage includes a feature article in The New York Times, [6] articles in The Sunday Times [7] and The Chronicle of Higher Education, [8] and reports on Deutsche Welle World radio [9] and Austria's ORF1 radio. [10]

In September 2011, Transcribe Bentham was honoured with an Award of Distinction in the Digital Communities category of the Prix Ars Electronica, the world's foremost digital arts competition. [11] In its report, the Digital Communities jury noted that the Transcribe Bentham transcription interface has 'the potential to become a standard tool for scholarly crowdsourcing projects', and that Transcribe Bentham as a whole has the 'potential to create the legacy of participatory education and the preservation of heritage or an endangered culture'. [12]

Transcribe Bentham was also nominated for the 2011 Digital Heritage Award. [13]

In November 2012, Transcribe Bentham came second in the Knetworks 'Platforms for Networked Innovation Competition', [14] which sought to identify the 'most innovative web-based platform enabling regional innovation for public, private or research organizations'. [15]

Transcribe Bentham was featured on BBC Radio 4's PM programme [16] and the BBC News website [17] on 27 August 2013. The report discussed how volunteers transcribed a series of recipes which were collated for Bentham's proposed panopticon prison, and how one - a 'Devonshire Pie' consisting of potatoes, tripe, onions, spleen, lungs, and gooseberries - was made by the Michelin-starred St John Smithfield restaurant. The recipes were published in 2014 as Jeremy Bentham's Prison Cooking: A Collection of Utilitarian Recipes. [18]

Open-source code

The code for Transcribe Bentham's MediaWiki-based transcription interface is available for reuse and customisation, on an open source basis. [19] It has been implemented by the Public Record Office Victoria for their pilot transcription project. [20]

References

  1. UCL (24 May 2018). "Bentham Project". Bentham Project. Retrieved 22 July 2022.
  2. "Bentham Project receives grant from the Mellon Foundation | UCL Transcribe Bentham".
  3. UCL (4 September 2018). "Bentham Manuscripts". Library Services. Retrieved 22 July 2022.
  4. Transcribe Bentham Transcription Desk, http://www.transcribe-bentham.da.ulcc.ac.uk/td/Transcribe_Bentham
  5. "UCL Transcribe Bentham". blogs.ucl.ac.uk. Retrieved 22 July 2022.
  6. Cohen, Patricia (27 December 2010). "Scholars Recruit Public for Project". The New York Times. ISSN 0362-4331. Retrieved 22 July 2022.
  7. R. Kinchen, 'One Stir and I'll Discover a Galaxy', 12 September 2011, http://www.thesundaytimes.co.uk/sto/newsreview/features/article772703.ece Archived 3 December 2012 at the Wayback Machine
  8. T. Kaya, Crowdsourcing Project Hopes to Make Short Work of Transcribing Bentham, 13 September 2010, http://chronicle.com/blogs/wiredcampus/crowdsourcing-project-hopes-to-make-short-work-of-transcribing-bentham/26829
  9. R. Powell, 'Philosophy Fans Pitch in to put British thinker's manuscripts online', 4 February 2011, http://www.dw.de/dw/article/0,,14809726,00.html and http://www.dw.de/popups/popup_single_mediaplayer/0,,14808024_start_0_end_0_type_audio_struct_3126_contentId_6424149,00.html
  10. 'Create Your World', 25 July 2011, http://oe1.orf.at/programm/280040, and Matrix, 29 January 2012, http://oe1.orf.at/programm/294290
  11. "Ars Electronica Archiv".
  12. B. Achaleke, G. Harwood, A. Koblin, L. Yan, and T. Peixoto, 'Guinea pigs and apples: statement of the Digital Communities Jury', in H. Leopoldseder, C. Schöpf, and G. Stocker, Prix Ars Electronica International Compendium: CyberArts 2011, Ostfildern: Hatje Cantz, p. 206.
  13. "Rose Holley's Blog - views and news on digital libraries and archives: Digital Cultural Heritage Awards for Crowdsourcing (And thoughts on gamification)". 4 February 2012.
  14. "Competition winners | knetworks". Archived from the original on 23 May 2013. Retrieved 26 November 2012.
  15. "Welcome | knetworks". Archived from the original on 23 October 2012. Retrieved 26 November 2012.
  16. Radio 4 PM report, 27 August 2013, https://audioboo.fm/boos/1570749-how-a-recipe-intended-for-inmates-of-bentham-s-proposed-panopticon-prison-is-making-its-way-onto-a-modern-restaurant-menu
  17. "Cooking an 18th Century 'prison pie'". BBC News. Retrieved 22 July 2022.
  18. UCL (17 May 2018). "Jeremy Bentham's Prison Cooking". Bentham Project. Retrieved 17 January 2019.
  19. "Google Code Archive - Long-term storage for Google Code Project Hosting".
  20. "Category:PROV Transcription Pilot Project - Public Record Office Victoria". Archived from the original on 27 April 2012. Retrieved 13 April 2012.

Further reading