Open Document Architecture

Last updated
Open Document Architecture
X-office-document.svg
Internet media type application/ODA
Developed by ITU-T, ISO
Initial release1989;33 years ago (1989)
Type of format Document file format
Standard CCITT T.411-T.424, ISO 8613
Website ISO 8613

The Open Document Architecture (ODA) and interchange format (informally referred to as just ODA) is a free and open international standard document file format maintained by the ITU-T to replace all proprietary document file formats. ODA is detailed in the standards documents CCITT T.411-T.424, which is equivalent to ISO 8613.

Contents

Format

ODA defines a compound document format that can contain raw text, raster images and vector graphics. In the original release the difference between this standard and others like it is that the graphics structures were exclusively defined as CCITT raster image and Computer Graphics Metafile (CGM - ISO 8632). This was to limit the problem of having word processor and desktop publisher software be required to interpret all known graphics formats.

The documents have both logical and layout structures. Logically the text can be partitioned into chapters, footnotes and other subelements akin to HTML, and the layout fill a function similar to Cascading Style Sheets in the web world. The binary transport format for an ODA-conformant file is called Open Document Interchange Format (ODIF) and is based on the Standard Generalized Markup Language and Abstract Syntax Notation One (ASN.1).

One of the features of this standard could be stored or interchanged in one of three formats: Formatted, Formatted Processable, or Processable. The latter two are editable formats. The first is an uneditable format that is logically similar to Adobe Systems PDF that is in common use today.

History

In 1985, ESPRIT financed a pilot implementation of the ODA concept, involving, among others, Bull corporation, Olivetti, ICL and Siemens AG.

The intent was to have a universal storable and interchangeable document structure that would not go out of date and could be used by any word processor or desktop publisher. The rapid adoption of personal computers in the late 1970s and early 1980s by consumers and small businesses and the relative ease of writing applications for the primitive early PCs had resulted in a huge number of new word processing applications that were then duking it out around the world for market dominance. At the same time, large corporations who had purchased dedicated word processor devices in the 1970s were switching over to the new PCs that could run word processing software and much more. The result was a profusion of constantly evolving proprietary file formats. It was already clear by 1985 that this confusing and often frustrating situation would get much worse before it got better, as desktop publishing and multimedia computing were already on the horizon.

Thus, ODA was intended to solve the problem of software applications whose developers were continually updating their native file formats to accommodate new features, which frequently broke backward compatibility. Older native formats were repeatedly becoming obsolete and therefore unusable after only a few years. This led to a large financial impact on companies that were using ad hoc standard applications, such as Microsoft Word or WordPerfect, because their IT departments had to constantly assist frustrated users with transferring content between so many different formats, and also hire employees whose sole job was to import old stored documents into the latest version of applications before they became unreadable. The intended result of the ODA standard was that companies would not have to commit to an ad hoc standard for word processor or desktop publisher applications, because any application adhering to a common open standard could be used to read and edit long stored documents.[ citation needed ]

The initial round of documents that made up ISO 8613 was completed after a multi-year effort at an ISO/IEC JTC1/SC18/WG3 meeting in Paris La Defense, France, around Armistice (Nov. 11) 1987, called "Office Document Architecture" at the time. CCITT picked them up as the T.400 series of recommendations, using the term "Open Document Architecture". Work continued on additional parts for a while, for instance at an ISO working group meeting in Ottawa in February 1989. Improvements and additions were continually being made. The revised standard was finally published in 1999. However, no significant developer of document application software chose to support the format, probably because the conversion from the existing dominant word processor formats such as WordPerfect and Microsoft Word was difficult, offered little fidelity, and would only have weakened their advantage of vendor lock-in over their existing user base. There were also cultural obstacles because ODA was a predominantly European project that took a top-down design approach. It was unable to garner significant interest from the American software developer community or trade press. Finally, it took an extraordinarily long time to release the ODA format (the pilot was financed in 1985, but the final specification not published until 1999). Given a lack of products that supported the format, in part because of the excessive time used to create the specification, few users were interested in using it. Eventually interest in the format faded.

IBM's European Networking Center (ENC) in Heidelberg, Germany, developed prototype extensions to IBM OfficeVision/VM to support ODA, in particular a converter between ODA and Document Content Architecture (DCA) document formats. [1]

It would be improper to call ODA anything but a failure, but its spirit clearly influenced latter-day document formats that were successful in gaining support from many document software developers and users. These include the already-mentioned HTML and CSS as well as XML and XSL leading up to OpenDocument and Office Open XML.

See also

Related Research Articles

Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. Based on the PostScript language, each PDF file encapsulates a complete description of a fixed-layout flat document, including the text, fonts, vector graphics, raster images and other information needed to display it. PDF has its roots in "The Camelot Project" initiated by Adobe co-founder John Warnock in 1991.

Desktop publishing (DTP) is the creation of documents using page layout software on a personal ("desktop") computer. It was first used almost exclusively for print publications, but now it also assists in the creation of various forms of online content. Desktop publishing software can generate layouts and produce typographic-quality text and images comparable to traditional typography and printing. Desktop publishing is also the main reference for digital typography. This technology allows individuals, businesses, and other organizations to self-publish a wide variety of content, from menus to magazines to books, without the expense of commercial printing.

Corel Ventura Desktop publishing application

Ventura Publisher was the first popular desktop publishing package for IBM PC compatible computers running the GEM extension to the DOS operating system. The software was originally developed by Ventura Software, a small software company founded by John Meyer, Don Heiskell, and Lee Jay Lorenzen, all of whom met while working at Digital Research. It ran under an included run-time copy of Digital Research's GEM.

TIFF Series of image file formats

Tag Image File Format, abbreviated TIFF or TIF, is an image file format for storing raster graphics images, popular among graphic artists, the publishing industry, and photographers. TIFF is widely supported by scanning, faxing, word processing, optical character recognition, image manipulation, desktop publishing, and page-layout applications. The format was created by the Aldus Corporation for use in desktop publishing. It published the latest version 6.0 in 1992, subsequently updated with an Adobe Systems copyright after the latter acquired Aldus in 1994. Several Aldus or Adobe technical notes have been published with minor extensions to the format, and several specifications have been based on TIFF 6.0, including TIFF/EP, TIFF/IT, TIFF-F and TIFF-FX.

Damn Small Linux

Damn Small Linux (DSL) is a computer operating system for the x86 family of personal computers. It is free and open-source software under the terms of the GNU GPL and other free and open source licenses. It was designed to run graphical user interface applications on older PC hardware, for example, machines with 486 and early Pentium microprocessors and very little random-access memory (RAM). DSL is a Live CD with a size of 50 megabytes (MB). What originally began as an experiment to see how much software could fit in 50 MB eventually became a full Linux distribution. It can be installed on storage media with small capacities, like bootable business cards, USB flash drives, various memory cards, and Zip drives.

A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats.

The Open Document Format for Office Applications (ODF), also known as OpenDocument, is an open file format for spreadsheets, charts, presentations and word processing documents using ZIP-compressed XML files. It was developed with the aim of providing an open, XML-based file format specification for office applications. It is also the default format for documents in typical Linux distributions.

OpenOffice or open office may refer to:

Advanced Function Presentation (AFP) is a presentation architecture and family of associated printer software and hardware that provides for document and information presentation independent of specific applications and devices.

Image file formats are standardized means of organizing and storing digital images. An image file format may store data in an uncompressed format, a compressed format, or a vector format. Image files are composed of digital data in one of these formats so that the data can be rasterized for use on a computer display or printer. Rasterization converts the image data into a grid of pixels. Each pixel has a number of bits to designate its color. Rasterizing an image file for a specific device takes into account the number of bits per pixel that the device is designed to handle.

Open XML Paper Specification is an open specification for a page description language and a fixed-document format. Microsoft developed it as the XML Paper Specification (XPS). In June 2009, Ecma International adopted it as international standard ECMA-388.

This is an overview of software support for the OpenDocument format, an open document file format for saving and exchanging editable office documents.

Adobe LiveCycle

Adobe LiveCycle Enterprise Suite (ES4) is a service-oriented architecture Java EE server software product from Adobe Systems used to build applications that automate a broad range of business processes for enterprises and government agencies. LiveCycle ES4 is an enterprise document and form platform that allows capturing and processing information, delivering personalized communications, and protecting and tracking sensitive information. It is used for purposes such as account opening, services and benefits enrollment, correspondence management, request for proposal processes, and other manual based workflows. LiveCycle ES4 incorporates new features with a particular focus on mobile devices. LiveCycle applications also function in both online and offline environments. These capabilities are enabled through the use of Adobe Reader, HTML/PhoneGap and the Flash Player clients to reach desktop computers and mobile devices.

The Document Content Architecture, or DCA for short, is a standard developed by IBM for text documents in the early 1980s. DCA was used on mainframe and IBM i systems, and formed the basis of DisplayWrite's file format. DCA was later extended as MO:DCA, which added embedded data files, like graphics.

A file format is a standard way that information is encoded for storage in a computer file. It specifies how bits are used to encode information in a digital storage medium. File formats may be either proprietary or free and may be either unpublished or open.

OpenRaster is a file format proposed for the common exchange of layered images between raster graphics editors. It is meant as a replacement for later versions of the Adobe PSD format. OpenRaster is still in development and so far is supported by a few programs. The default file extension for OpenRaster files is ".ora".

The Office Open XML file formats are a set of file formats that can be used to represent electronic office documents. There are formats for word processing documents, spreadsheets and presentations as well as specific formats for material such as mathematical formulae, graphics, bibliographies etc.

References

  1. Fanderl, H.; Fischer, K.; Kmper, J. (1992). "The Open Document Architecture: From standardization to the market". IBM Systems Journal. 31 (4): 728–754. doi:10.1147/sj.314.0728. ISSN   0018-8670.

The standard itself was made available for free download on September 7, 2007 (the "missing" documents T.420 and T.423 do not exist):