DynaText

DynaText is an SGML publishing tool. Introduced in 1990, it was the first system to handle arbitrarily large SGML documents and to render them according to multiple style sheets that could be switched at will. [1]

DynaText and its Web sibling DynaWeb won multiple Seybold and other awards, [2] [3] and there are eleven US Patents related to the DynaText technology: 5,557,722; 5,644,776; 5,708,806; 5,893,109; 5,983,248; 6,055,544; 6,101,511; 6,101,512; 6,105,044; 6,167,409; and 6,546,406.

History

DynaText was developed by Electronic Book Technologies (EBT), Incorporated, of Providence, Rhode Island. EBT was founded by Louis Reynolds, Steven DeRose, [4] Jeffrey Vogel, and Andries van Dam. It was sold to Inso Corporation in 1996, when it had about 150 employees.

DynaText stands in the long tradition of hypermedia at Brown University, and adopted many features pioneered by FRESS, such as unlimited document sizes, dynamically-controllable styles and views, and reader-created links and trails.

DynaText heavily influenced stylesheet technologies such as DSSSL and CSS. XML chairman Jon Bosak cites EBT chief architect Steven DeRose as one of the originators of the notion of well-formedness formalized in XML, and cites DynaText as an influence on the design of Web browsers in general. Bosak also produced SGML versions of the complete works of Shakespeare, the KJV Old Testament and New Testament, the Book of Mormon, and the Quran, and released them in 1994 bundled with DynaText. [5]

Inso Corporation went out of business in 2002.

DynaText was demonstrated live by DeRose and David Sklar at "A Half-Century of Hypertext at Brown: A Symposium", held at Brown University on May 23, 2019, [6] using a variorum edition of The Wife of Bath's Tale, published in DynaText by Cambridge University Press.

Technology

DynaText accepted SGML as input, and built a binary representation of the structure (similar to DOM for XML, but persistent), as well as a full-text inverted index of the text, elements, and attributes. Customers typically distributed such compiled e-books on CD-ROM or via network servers. Later versions of DynaText could also read SGML and XML on the fly, providing exactly the same interface.
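The indexing step described above can be illustrated with a minimal Python sketch. It is not DynaText's actual format or API: it parses XML with the standard library rather than SGML, keeps everything in memory rather than in a persistent binary file, and the function name `build_index` and the three posting tables are assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_index(xml_text):
    """Return (words, elements, attributes) posting tables keyed by node id.

    Illustrative only: a real system would persist these tables on disk.
    """
    root = ET.fromstring(xml_text)
    words = defaultdict(set)       # word -> ids of elements directly containing it
    elements = defaultdict(set)    # element type -> ids of elements of that type
    attributes = defaultdict(set)  # (name, value) -> ids of elements carrying it

    # Document-order numbering gives every element a stable integer id.
    for node_id, elem in enumerate(root.iter()):
        elements[elem.tag].add(node_id)
        for name, value in elem.attrib.items():
            attributes[(name, value)].add(node_id)
        # Text owned directly by this element: its leading text plus the
        # "tail" text that follows each of its children.
        own_text = (elem.text or "") + " ".join(child.tail or "" for child in elem)
        for token in re.findall(r"\w+", own_text.lower()):
            words[token].add(node_id)
    return words, elements, attributes


sample = (
    '<book><chapter id="c1"><title>Intro</title><para>SGML is big</para></chapter>'
    '<chapter id="c2"><para>SGML again</para></chapter></book>'
)
words, elements, attributes = build_index(sample)
```

Indexing element types and attributes alongside words is what allows a query to combine text and structure, for example by intersecting `words["sgml"]` with `elements["para"]`.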

Unlike many prior systems, DynaText was not limited to any particular DTD (or schema). Rather, customers could build style sheets in a simple language (also SGML-based), using properties very much like the later DSSSL, CSS, and XSL-FO. However, every property could have an expression as its value, which would be evaluated (if necessary) for each element the style applied to. Graphics, tables, formulae, and plug-ins could be included in documents.
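The idea of expression-valued style properties can be sketched as follows. This is a hypothetical stand-in, not DynaText's stylesheet language (which was SGML-based): the `STYLES` table, the property names, and `resolve_style` are all assumptions, with a Python callable playing the role of an expression that is evaluated for each element the rule applies to.

```python
import xml.etree.ElementTree as ET

# Hypothetical stylesheet: element type -> {property: constant or callable(elem)}.
STYLES = {
    "title": {"font-size": 18, "font-weight": "bold"},
    "para": {
        "font-size": 12,
        # Expression-valued property: computed from the element's own attributes.
        "color": lambda elem: "red" if elem.get("status") == "draft" else "black",
    },
}


def resolve_style(elem, styles=STYLES):
    """Evaluate every property of the rule that matches the element's type."""
    rule = styles.get(elem.tag, {})
    return {
        name: value(elem) if callable(value) else value
        for name, value in rule.items()
    }


draft = ET.fromstring('<para status="draft">Work in progress</para>')
final = ET.fromstring("<para>Finished text</para>")
```

Because the rules are keyed by element type rather than tied to one DTD, switching style sheets in this sketch amounts to passing a different table as `styles`.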

Unlike nearly all prior SGML systems, DynaText was not limited to documents that could fit in RAM on the viewing or serving computer system. Users commonly created documents in the tens to hundreds of MB. DynaText customers included aerospace, workstation and other computer industry firms, government, literary and technical publishers, and others.

Full-text searches were based on an inverted index of words and other tokens (except for Japanese text, which was handled specially). DynaText could report the number of "hits" for a given search that occurred within each section in the table of contents. By default, the table of contents appeared in a separate pane as an expandable outline, and clicking on any entry scrolled the full-text pane to the start of the corresponding section. Searches could also restrict hits to particular SGML element types, or sequences of types; refer to attributes; and use Boolean operators and parentheses. The "and" operator required its operands to occur near each other, by default in the same paragraph or comparable element.
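The per-section hit counts and the proximity-limited "and" can be sketched as below. This is an illustration under stated assumptions, not DynaText's query engine: it scans an in-memory XML tree instead of consulting an inverted index, and the element names `section`, `title`, and `para` and the function `hits_per_section` are hypothetical.

```python
import re
import xml.etree.ElementTree as ET


def tokens(elem):
    """Lower-cased set of words found anywhere inside the element."""
    return set(re.findall(r"\w+", "".join(elem.itertext()).lower()))


def hits_per_section(root, terms, section_tag="section", unit_tag="para"):
    """For each section, count the units (paragraphs) that contain ALL terms.

    Requiring the terms to co-occur inside one unit models an "and" whose
    operands must appear in the same paragraph.
    """
    wanted = {t.lower() for t in terms}
    counts = {}
    for section in root.iter(section_tag):
        title = section.findtext("title", default="(untitled)")
        counts[title] = sum(
            1 for unit in section.iter(unit_tag) if wanted <= tokens(unit)
        )
    return counts


doc = ET.fromstring(
    "<doc>"
    "<section><title>A</title><para>style sheets in SGML</para><para>style only</para></section>"
    "<section><title>B</title><para>SGML here</para><para>style there</para></section>"
    "</doc>"
)
```

In this sample, section B contains both words but never in the same paragraph, so a query for "style" and "SGML" together counts no hits there, while section A counts one.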

References

  1. Smith, MacKenzie (1993). "Review: DynaText: An Electronic Publishing System". Computers and the Humanities. 27 (5/6). Springer: 415–420. ISSN 0010-4817. JSTOR 30204569.
  2. "Extract from Seybold Editors' Awards 1996: Electronic Book Technologies for DynaWeb". xml.coverpages.org.
  3. "Inso Corporation Announces DynaWeb® 3.1; First Web Publishing Solution To Support Interactive Mathematical Equations". xml.coverpages.org. Retrieved 13 March 2019.
  4. "Executive Profile Steven DeRose Ph.D." Bloomberg.com. Retrieved 14 March 2019.
  5. "Curriculum Vitae: Jon Bosak". ibiblio.org. Retrieved 13 March 2019.
  6. "A Half-Century of Hypertext at Brown: A Symposium | ELMCIP". elmcip.net. Retrieved 1 December 2020.