Processed Book Project

The Processed Book Project is a prototype website and service with customized software tools, launched in November 2005, to explore the evolving nature of books, journals, and other authored content published electronically as digital data, and accessible on a widely connected, often global, network.

Beginnings

Supported by funding from the Hewlett Foundation, the Project grew out of an essay entitled "The Processed Book", published online in the open-access journal First Monday in March 2003 by Joseph J. Esposito,[1] a former CEO of Encyclopædia Britannica who in 1994 launched Britannica Online, the first Internet encyclopedia.

The essay proposed that a "processed book" will become "a node in a network, with connections to other books, commentary, online library card catalogues, teachers' recommendations, and so forth"—connections linking both to and from the e-book. Esposito noted that this is very different from the "Romantic myth" of the "primal book...usually written by a single author, someone who has Something to Say".

Annotations

Annotations are one of the main tools of the Processed Book Operating System (PBOS), which allows users to connect "anything that adds lexical, semantic, or procedural value" to a specific segment of a document.

Annotations can include: text notes (which can also have their own annotations); outbound Web links that can be added to a document by someone other than the author or Webmaster; inbound links from other Web sites or e-mail to specific points inside a document, which can be disconnected by the document author without deleting the page they connect to; "BizVantage" links to a proprietary dynamically updated Net "clipping service", [2] driven by "user selected keywords, so that related, external content is discovered and connected to the Book"; and bookmarks placed by the user for quickly returning to points in the document.
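The annotation types listed above can be pictured as records attached to a span of a document. The following is only a minimal sketch under stated assumptions, not the PBOS implementation: the names `Kind`, `Annotation`, `Document`, `annotate`, `disconnect`, and `at` are hypothetical, segments are assumed to be character-offset ranges, and the "BizVantage" clipping links are left out because their mechanics are not described here.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Kind(Enum):
    """Hypothetical annotation categories, loosely following the list above."""
    NOTE = "note"
    OUTBOUND_LINK = "outbound_link"
    INBOUND_LINK = "inbound_link"
    BOOKMARK = "bookmark"


@dataclass
class Annotation:
    kind: Kind
    start: int   # offset where the annotated segment begins
    end: int     # offset just past the end of the segment
    body: str    # note text, URL, or bookmark label
    author: str
    active: bool = True
    # Notes can carry their own annotations; child offsets are assumed
    # to index into the parent's body rather than the document text.
    children: List["Annotation"] = field(default_factory=list)


@dataclass
class Document:
    text: str
    annotations: List[Annotation] = field(default_factory=list)

    def annotate(self, ann: Annotation) -> Annotation:
        """Attach an annotation to a segment that lies inside the text."""
        if not (0 <= ann.start < ann.end <= len(self.text)):
            raise ValueError("segment lies outside the document")
        self.annotations.append(ann)
        return ann

    def disconnect(self, ann: Annotation) -> None:
        """Deactivate a link without deleting the text it points at."""
        ann.active = False

    def at(self, offset: int) -> List[Annotation]:
        """Return the active annotations covering a character offset."""
        return [a for a in self.annotations
                if a.active and a.start <= offset < a.end]


doc = Document("A processed book is a node in a network.")
note = doc.annotate(Annotation(Kind.NOTE, 2, 16, "Esposito's term", "reader1"))
note.children.append(Annotation(Kind.NOTE, 0, 15, "a reply to the note", "reader2"))
link = doc.annotate(
    Annotation(Kind.INBOUND_LINK, 22, 26, "https://example.org/review", "other-site")
)
doc.disconnect(link)  # the document author cuts the inbound link
```

In this sketch, disconnecting an inbound link only flips a flag, so the annotated passage and the page itself stay untouched, mirroring the behaviour described for inbound links.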

In addition, the Project has several document-global tools, including a "dissect text" tool that "provides extraction, annotation and statistical reporting for both user supplied words/phrases, and for associated words/phrases found via an interface to the WordNet software's extensive catalog of connections among words/phrases."
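The statistical-reporting side of such a tool can be illustrated with a simple word count. This is a rough sketch, not the Project's code: the function `dissect` and its `related` parameter are hypothetical, the `related` dictionary merely stands in for a WordNet lookup, and only single words are counted, not phrases.

```python
import re
from collections import Counter
from typing import Dict, List, Optional


def dissect(text: str, terms: List[str],
            related: Optional[Dict[str, List[str]]] = None) -> Dict[str, int]:
    """Count each user-supplied term and its associated words in the text.

    `related` maps a term to associated words; here it is a hand-written
    stand-in for what a WordNet interface might return.
    """
    related = related or {}
    counts = Counter(re.findall(r"[a-z']+", text.lower()))
    report: Dict[str, int] = {}
    for term in terms:
        for word in [term] + related.get(term, []):
            report[word] = counts[word.lower()]
    return report


sample = "The book is a node. Books link to other books."
report = dissect(sample, ["book"], related={"book": ["volume", "books"]})
```

A fuller version would also record where each match occurs, so that the hits could be turned into annotations on the matching segments.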

As an operating system and not simply a vehicle for Web publication, PBOS also provides the ability to add new features or bolt on new computer processes that can be brought to bear on any text stored in the Project library. PBOS is available as open-source software on SourceForge. The Project was designed by Lynn Brock and programmed by Wayne Davison.

Notes

  1. The processed book
  2. BizVantage, proprietary Net "clipping service" from Prosaix.com

References

  1. Joseph J. Esposito, "The Processed Book," FirstMonday.org, March 2003, 8:3. The original article now has an update, dated October 23, 2005.