Dynatext

Last updated

DynaText is an SGML publishing tool. It was introduced in 1990, and was the first system to handle arbitrarily large SGML documents, and to render them according to multiple style-sheets that could be switched at will. [1]

Contents

DynaText and its Web sibling DynaWeb won multiple Seybold and other awards, [2] [3] and there are eleven US Patents related to the DynaText technology: 5,557,722; 5,644,776; 5,708,806; 5,893,109; 5,983,248; 6,055,544; 6,101,511; 6,101,512; 6,105,044; 6,167,409; and 6,546,406.

History

DynaText was developed by Electronic Book Technologies (EBT), Incorporated, of Providence, Rhode Island. EBT was founded by Louis Reynolds, Steven DeRose, [4] Jeffrey Vogel, and Andries van Dam, and was sold to Inso corporation in 1996, when it had about 150 employees.

DynaText stands in the long tradition of hypermedia at Brown University, and adopted many features pioneered by FRESS, such as unlimited document sizes, dynamically-controllable styles and views, and reader-created links and trails.

DynaText heavily influenced stylesheet technologies such as DSSSL and CSS. XML chairman Jon Bosak cites EBT chief architect Steven DeRose as one of the originators of the notion of well-formedness formalized in XML, as well as DynaText for influencing the design of Web browsers in general; Jon Bosak produced SGML versions of the complete works of Shakespeare, the KJV Old Testament and New Testament, Book of Mormon, and Quran, and released them in 1994 bundled with Dynatext. [5]

Inso corporation went out of business in 2002.

DynaText was demonstrated live by DeRose and David Sklar at "A Half-Century of Hypertext at Brown: A Symposium", held at Brown University on May 23, 2019, [6] using a variorum edition The Wife of Bath's Tale, published in DynaText by Cambridge University Press.

Technology

DynaText accepted SGML as input, and built a binary representation of the structure (similar to DOM for XML, but persistent), as well as a full-text inverted index of the text, elements, and attributes. Customers typically distributed such compiled e-books on CD-ROM or via network servers. Later versions of DynaText could also read SGML and XML on the fly, providing exactly the same interface.

Unlike many prior systems, DynaText was not limited to any particular DTD (or schema). Rather, customers could build style sheets in a simple language (also SGML-based), using properties very much like the later DSSSL, CSS, and XSL-FO. However, every property could have an expression as its value, which would be evaluated (if necessary) for each element the style applied to. Graphics, tables, formulae, and plug-ins could be included in documents.

Unlike nearly all prior SGML systems, DynaText was not limited to documents that could fit in RAM on the viewing or serving computer system. Users commonly created documents in the tens to hundreds of MB. DynaText customers included aerospace, workstation and other computer industry firms, government, literary and technical publishers, and others.

Full-text searches were based on an inverted index of words and other tokens (except for Japanese text, which was handled specially). Dynatext could report the number of "hits" for a given search, that occur within each section in the table of contents (by default, the table of contents appeared in a separate pane as an expandable outline, and clicking on any entry scrolled the full-text pane to the start of the corresponding section). Searches could also restrict hits to particular SGML element types, or sequences of types; refer to attributes; and use Boolean operators and parentheses. The "and" operator restricted its operands to occurring near each other, by default in the same paragraph or comparable element.

Related Research Articles

<span class="mw-page-title-main">HTML</span> Hypertext Markup Language

The HyperText Markup Language or HTML is the standard markup language for documents designed to be displayed in a web browser. It can be assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScript.

<span class="mw-page-title-main">Markup language</span> Modern system for annotating a document

Markuplanguage refers to a text-encoding system consisting of a set of symbols inserted in a text document to control its structure, formatting, or the relationship between its parts. Markup is often used to control the display of the document or to enrich its content to facilitating automated processing. A markup language is a set of rules governing what markup information may be included in a document and how it is combined with the content of the document in a way to facilitate use by humans and computer programs. The idea and terminology evolved from the "marking up" of paper manuscripts, which is traditionally written with a red pen or blue pencil on authors' manuscripts.

<span class="mw-page-title-main">Standard Generalized Markup Language</span> Markup language

The Standard Generalized Markup Language is a standard for defining generalized markup languages for documents. ISO 8879 Annex A.1 states that generalized markup is "based on two postulates":

<span class="mw-page-title-main">XML</span> Markup language by the W3C for encoding of data

Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. The World Wide Web Consortium's XML 1.0 Specification of 1998 and several other related specifications—all of them free open standards—define XML.

In computing, the term Extensible Stylesheet Language (XSL) is used to refer to a family of languages used to transform and render XML documents.

DocBook is a semantic markup language for technical documentation. It was originally intended for writing technical documents related to computer hardware and software, but it can be used for any other sort of documentation.

The Document Style Semantics and Specification Language (DSSSL) is an international standard developed to provide stylesheets for SGML documents.

An HTML element is a type of HTML document component, one of several types of HTML nodes. The first used version of HTML was written by Tim Berners-Lee in 1993 and there have since been many versions of HTML. The most commonly used version is HTML 4.01, which became official standard in December 1999. An HTML document is composed of a tree of simple HTML nodes, such as text nodes, and HTML elements, which add semantics and formatting to parts of document. Each element can have HTML attributes specified. Elements can also have content, including other elements and text.

This article presents a timeline of hypertext technology, including "hypermedia" and related human–computer interaction projects and developments from 1945 on. The term hypertext is credited to the author and philosopher Ted Nelson.

James Clark is a software engineer and creator of various open-source software including groff, expat and several XML specifications.

In web development, "tag soup" is a pejorative for syntactically or structurally incorrect HTML written for a web page. Because web browsers have historically treated structural or syntax errors in HTML leniently, there has been little pressure for web developers to follow published standards, and therefore there is a need for all browser implementations to provide mechanisms to cope with the appearance of "tag soup", accepting and correcting for invalid syntax and structure where possible.

<span class="mw-page-title-main">Jon Bosak</span> Creator of the XML specification

Jon Bosak led the creation of the XML specification at the W3C. From 1996–2008, he worked for Sun Microsystems.

HyTime is a markup language that is an application of SGML. HyTime defines a set of hypertext-oriented element types that, in effect, supplement SGML and allow SGML document authors to build hypertext and multimedia presentations in a standardized way.

The term CDATA, meaning character data, is used for distinct, but related, purposes in the markup languages SGML and XML. The term indicates that a certain portion of the document is general character data, rather than non-character data or character data with a more specific, limited structure.

Dan Connolly is an American computer scientist who was closely involved with the creation of the World Wide Web as a member of the World Wide Web Consortium (W3C).

<span class="mw-page-title-main">SoftQuad Software</span>

SoftQuad Software was a Canadian software company best known for HoTMetaL, the first commercial HTML editor. It is also known for Author/Editor, the first specialized SGML editor, and Panorama, the first browser plugin for SGML. Panorama demonstrated the need for standardization of SGML on the web, which eventually resulted in the development of the XML specification.

Extensible HyperText Markup Language (XHTML) is part of the family of XML markup languages. It mirrors or extends versions of the widely used HyperText Markup Language (HTML), the language in which Web pages are formulated.

Steven J DeRose is a computer scientist noted for his contributions to Computational Linguistics and to key standards related to document processing, mostly around ISO's Standard Generalized Markup Language (SGML) and W3C's Extensible Markup Language (XML).

<span class="mw-page-title-main">History of hypertext</span>

Hypertext is text displayed on a computer or other electronic device with references (hyperlinks) to other text that the reader can immediately access, usually by a mouse click or keypress sequence. Early conceptions of hypertext defined it as text that could be connected by a linking system to a range of other documents that were stored outside that text. In 1934 Belgian bibliographer, Paul Otlet, developed a blueprint for links that telescoped out from hypertext electrically to allow readers to access documents, books, photographs, and so on, stored anywhere in the world.

A document type declaration, or DOCTYPE, is an instruction that associates a particular XML or SGML document with a document type definition (DTD). In the serialized form of the document, it manifests as a short string of markup that conforms to a particular syntax.

References

  1. Smith, MacKenzie (1993). "Review: DynaText: An Electronic Publishing System". Computers and the Humanities. Springer. 27 (5/6): 415–420. ISSN   0010-4817. JSTOR   30204569. MS.
  2. "Extract from Seybold Editors' Awards 1996: Electronic Book Technologies for DynaWeb". xml.coverpages.org.
  3. "Inso Corporation Announces DynaWeb® 3.1; First Web Publishing Solution To Support Interactive Mathematical Equations". xml.coverpages.org. Retrieved 13 March 2019.
  4. "Executive Profile Steven DeRose Ph.D." Bloomberg.com. Retrieved 14 March 2019.
  5. "Curriculum Vitae: Jon Bosak". ibiblio.org. Retrieved 13 March 2019.
  6. "A Half-Century of Hypertext at Brown: A Symposium | ELMCIP". elmcip.net. Retrieved 1 December 2020.