Office Open XML

Last updated
Office Open XML Document
.docx icon.svg
The OOXML Document icon, as appears on the Microsoft OneDrive web service
Filename extension
.docx, .docm
Internet media type
application/vnd.
openxmlformats-officedocument.
wordprocessingml.
document [1]
Developed by Microsoft, Ecma, ISO, IEC
Initial release7 December 2006;17 years ago (2006-12-07)
Latest release
4th edition
26 October 2016;8 years ago (2016-10-26)
Type of format Document file format
Extended from XML, DOC, WordProcessingML
Standard ECMA-376, ISO/IEC 29500
Open format?Yes [2]
Website ECMA-376, ISO/IEC 29500-1:2016
Office Open XML Presentation
.pptx icon.svg
Filename extension
.pptx, .pptm
Internet media type
application/vnd.
openxmlformats-officedocument.
presentationml.
presentation [3]
Developed by Microsoft, Ecma, ISO, IEC
Initial release7 December 2006;17 years ago (2006-12-07)
Latest release
3rd edition
29 June 2011;13 years ago (2011-06-29)
Type of format Presentation
Extended from XML, PPT
Standard ECMA-376, ISO/IEC 29500
Open format?Yes
Website ECMA-376, ISO/IEC 29500:2016
Office Open XML Workbook
.xlsx icon.svg
Filename extension
.xlsx, .xlsm
Internet media type
application/vnd.
openxmlformats-officedocument.
spreadsheetml.
sheet [4]
Developed by Microsoft, Ecma, ISO, IEC
Initial release7 December 2006;17 years ago (2006-12-07) (as Microsoft Open XML)
Latest release
3rd edition
29 June 2011;13 years ago (2011-06-29)
Type of format Spreadsheet
Extended from XML, XLS, SpreadsheetML
Standard ECMA-376, ISO/IEC 29500
Open format?Yes
Website ECMA-376, ISO/IEC 29500:2016

Office Open XML (also informally known as OOXML) [5] is a zipped, XML-based file format developed by Microsoft for representing spreadsheets, charts, presentations and word processing documents. Ecma International standardized the initial version as ECMA-376. ISO and IEC standardized later versions as ISO/IEC 29500.

Contents

Microsoft Office 2010 provides read support for ECMA-376, full support for ISO/IEC 29500 Transitional, and read support for ISO/IEC 29500 Strict. [6] Microsoft Office 2013 and later fully support ISO/IEC 29500 Strict, [7] but do not use it as the default file format because of backwards compatibility concerns. [8]

Background

In 2000, Microsoft released an initial version of an XML-based format for Microsoft Excel, which was incorporated in Office XP. In 2002, a new file format for Microsoft Word followed. [9] The Excel and Word formats—known as the Microsoft Office XML formats—were later incorporated into the 2003 release of Microsoft Office.

Microsoft announced in November 2005 that it would co-sponsor standardization of the new version of their XML-based formats through Ecma International as "Office Open XML". [10] [11] The presentation was made to Ecma by Microsoft's Jean Paoli and Isabelle Valet-Harper. [12] [13]

Standardization process

Microsoft submitted initial material to Ecma International Technical Committee TC45, where it was standardized to become ECMA-376, approved in December 2006. [14]

This standard was then fast-tracked in the Joint Technical Committee 1 of ISO and IEC. After initially failing to pass, an amended version of the format received the necessary votes for approval as an ISO/IEC Standard as the result of a JTC 1 fast-tracking standardization process that concluded in April 2008. [15] The resulting four-part International Standard (designated ISO/IEC 29500:2008) was published in November 2008 [16] and can be downloaded from the ITTF. [17] A technically equivalent set of texts is published by Ecma as ECMA-376 Office Open XML File Formats—2nd edition (December 2008); they can be downloaded from their website. [18]

The ISO/IEC standardization of Office Open XML was controversial and embittered, [19] with much discussion both about the specification and about the standardization process. [20] According to InfoWorld , "OOXML was opposed by many on grounds it was unneeded, as software makers could use OpenDocument Format (ODF), a less complicated office software format that was already an international standard." [19] The same InfoWorld article reported that IBM (which supports the ODF format) threatened to leave standards bodies that it said allow dominant corporations like Microsoft to wield undue influence. The article further says that Microsoft was accused of co-opting the standardization process by leaning on countries to ensure that it got enough votes at the ISO/IEC for Office Open XML to pass, although it does not specify exactly who accused Microsoft. [19]

Licensing

Under the Ecma International code of conduct in patent matters, [21] participating and approving member organizations of ECMA are required to make their patent rights available on a reasonable and non-discriminatory (RAND) basis.

Holders of patents which concern ISO/IEC International Standards may agree to a standardized license governing the terms under which such patents may be licensed, in accord with the ISO/IEC/ITU common patent policy. [22]

Microsoft, the main contributor to the standard, provided a covenant not to sue [23] for its patent licensing. The covenant received a mixed reception, with some like the Groklaw blog criticizing it, [24] and others such as Lawrence Rosen, (an attorney and lecturer at Stanford Law School), endorsing it. [25]

Microsoft has added the format to their Open Specification Promise [26] in which

Microsoft irrevocably promises not to assert any Microsoft Necessary Claims against you for making, using, selling, offering for sale, importing or distributing any implementation to the extent it conforms to a Covered Specification [...]

This is limited to applications which do not deviate from the ISO/IEC 29500:2008 or Ecma-376 standard and to parties that do not "file, maintain or voluntarily participate in a patent infringement lawsuit against a Microsoft implementation of such Covered Specification". [27] [28] The Open Specification Promise was included in documents submitted to ISO/IEC in support of the ECMA-376 fast-track submission. [29] Ecma International asserted that, "The OSP enables both open source and commercial software to implement [the specification]". [30]

Versions

The Office Open XML specification exists in several versions.

ECMA-376 1st edition (2006)

The ECMA standard is structured in five parts to meet the needs of different audiences. [18]

Part 1. Fundamentals
  • Vocabulary, notational conventions and abbreviations
  • Summary of primary and supporting markup languages
  • Conformance conditions and interoperability guidelines
  • Constraints within the Open Packaging Conventions that apply to each document type
Part 2. Open Packaging Conventions
  • The Open Packaging Conventions (OPC), for the package model and physical package, is defined and used by various document types in various applications from multiple vendors.
  • It defines core properties, thumbnails, digital signatures, and authorizations & encryption capabilities for parts or all of the contents in the package.
  • XML schemas for the OPC are declared as XML Schema Definitions (XSD) and (non-normatively) using RELAX NG (ISO/IEC 19757-2)
Part 3. Primer
  • Informative (non-normative) introduction to WordprocessingML, SpreadsheetML, PresentationML, DrawingML, VML and Shared MLs, providing context and illustrating elements through examples and diagrams
  • Describes the custom XML data-storing facility within a package to support integration with business data
Part 4. Markup Language Reference
  • Contains the reference material for WordprocessingML, SpreadsheetML, PresentationML, DrawingML, Shared MLs and Custom XML Schema, defining every element and attribute including the element hierarchy (parent/child relationships)
  • XML schemas for the markup languages are declared as XSD and (non-normatively) using RELAX NG
  • Defines the custom XML data-storing facility
Part 5. Markup Compatibility and Extensibility
  • Describes extension facilities of OpenXML documents and specifies elements & attributes through which applications can operate across different extensions.

Later versions of the ECMA-376 standard are aligned and technically equivalent to the corresponding ISO standard.

ISO/IEC 29500:2008

The ISO/IEC standard is structured into four parts: [31] Parts 1, 2 and 3 are independent standards; for example, Part 2, specifying Open Packaging Conventions, is used by other file formats including XPS and Design Web Format. Part 4 is to be read as a modification to Part 1, which it requires.

A technically equivalent set of texts is also published by Ecma as ECMA-376 2nd edition (2008).

Part 1. Fundamentals & Markup Language Reference
Consisting of 5560 pages, this part contains:
  • Conformance definitions
  • Reference material for the XML document markup languages defined by the Standard
  • XML schemas for the document markup languages declared using XSD and (non-normatively) RELAX NG
  • Defines the foreign markup facilities
Part 2. Open Packaging Conventions
Consisting of 129 pages, this part contains:
  • A description of the Open Packaging Conventions (package model, physical package)
  • Core properties, thumbnails and digital signatures
  • XML schemas for the OPC are declared using XSD and (non-normatively) RELAX NG
Part 3. Markup Compatibility and Extensibility
Consisting of 40 pages, this part contains:
  • A description of extensions: elements & attributes which define mechanisms allowing applications to specify alternative means of negotiating content
  • Extensibility rules are expressed using NVDL
Part 4. Transitional Migration Features
Consisting of 1464 pages, this part contains:
  • Legacy material such as compatibility settings and the graphics markup language VML
  • A list of syntactic differences between this text and ECMA-376 1st Edition

The standard specifies two levels of document & application conformance, strict and transitional, for each of WordprocessingML, PresentationML and SpreadsheetML, and also specifies applications' descriptions of base and full.

Compatibility between versions

The intent of the changes from ECMA-376 1st Edition to ISO/IEC 29500:2008 was that a valid ECMA-376 document would also be a valid ISO 29500 Transitional document; [32] however, at least one change introduced at the BRM—refusing to allow further values for xsd:boolean—had the effect of breaking backwards-compatibility for most documents. [33] A fix for this had been suggested to ISO/IEC JTC 1/SC 34/WG 4, and was approved in June 2009 as a recommendation for the first revision to Office Open XML. [34]

Applications capable of reading documents compliant to ECMA-376 Edition 1 would regard ISO/IEC 29500-4 Transitional documents containing ISO 8601 dates as corrupt. [35]

Application support

Some older versions of Microsoft Word and Microsoft Office are able to read and write .docx files after installation of the free compatibility pack provided by Microsoft, [36] although some items, such as equations, are converted into images that cannot be edited. [37]

Starting with Microsoft Office 2007, the Office Open XML file formats have become the default file format [38] of Microsoft Office. [39] [40] However, due to the changes introduced in the Office Open XML standard, Office 2007 is not wholly in compliance with ISO/IEC 29500:2008. [41] Office 2010 includes support for opening documents of the ISO/IEC 29500:2008-compliant version of Office Open XML, but it can only save documents conforming to the transitional, not the strict, schemas of the specification. [42] [43] Note that the intent of the ISO/IEC is to allow the removal of the transitional variant from the ISO/IEC 29500 standard. [43] Microsoft Office 2013 and later fully support ISO/IEC 29500 Strict, [7] but do not use it as the default file format because of backwards compatibility concerns. [8]

The ability to read and write Office Open XML format is, however, not limited to Microsoft Office; other office products are also able to read & write this format:

Other office products that offer import support for the Office Open XML formats include:

See also

Related Research Articles

<span class="mw-page-title-main">XML</span> Markup language by the W3C for encoding of data

Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. The World Wide Web Consortium's XML 1.0 Specification of 1998 and several other related specifications—all of them free open standards—define XML.

Apache POI, a project run by the Apache Software Foundation, and previously a sub-project of the Jakarta Project, provides pure Java libraries for reading and writing files in Microsoft Office formats, such as Word, PowerPoint and Excel.

The Organization for the Advancement of Structured Information Standards is a nonprofit consortium that works on the development, convergence, and adoption of projects - both open standards and open source - for Computer security, blockchain, Internet of things (IoT), emergency management, cloud computing, legal data exchange, energy, content technologies, and other areas.

X3D is a set of royalty-free ISO/IEC standards for declaratively representing 3D computer graphics. X3D includes multiple graphics file formats, programming-language API definitions, and run-time specifications for both delivery and integration of interactive network-capable 3D data. X3D version 4.0 has been approved by Web3D Consortium, and is under final review by ISO/IEC as a revised International Standard (IS).

An open file format is a file format for storing digital data, defined by an openly published specification usually maintained by a standards organization, and which can be used and implemented by anyone. An open file format is licensed with an open license. For example, an open format can be implemented by both proprietary and free and open-source software, using the typical software licenses used by each. In contrast to open file formats, closed file formats are considered trade secrets.

Vector Markup Language (VML) is an obsolete XML-based file format for two-dimensional vector graphics. It was specified in Part 4 of the Office Open XML standards ISO/IEC 29500 and ECMA-376. According to the specification, VML is a deprecated format included in Office Open XML for legacy reasons only.

A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats.

The Open Document Format for Office Applications (ODF), also known as OpenDocument, standardized as ISO 26300, is an open file format for word processing documents, spreadsheets, presentations and graphics and using ZIP-compressed XML files. It was developed with the aim of providing an open, XML-based file format specification for office applications.

Ecma International is a nonprofit standards organization for information and communication systems. It acquired its current name in 1994, when the European Computer Manufacturers Association (ECMA) changed its name to reflect the organization's global reach and activities. As a consequence, the name is no longer considered an acronym and no longer uses full capitalization.

JSON is an open standard file format and data interchange format that uses human-readable text to store and transmit data objects consisting of name–value pairs and arrays. It is a commonly used data format with diverse uses in electronic data interchange, including that of web applications with servers.

Open XML Paper Specification is an open specification for a page description language and a fixed-document format. Microsoft developed it as the XML Paper Specification (XPS). In June 2009, Ecma International adopted it as international standard ECMA-388.

<span class="mw-page-title-main">Rick Jelliffe</span> Australian computer programmer

Richard (Rick) Alan Jelliffe is an Australian programmer and standards activist, particularly associated with web standards, markup languages, internationalization and schema languages. He is the founder and Chief Technical Officer of Topologi Pty. Ltd, an XML tools vendor in Sydney. He has a degree in economics from the University of Sydney.

The Open Packaging Conventions (OPC) is a container-file technology initially created by Microsoft to store a combination of XML and non-XML files that together form a single entity such as an Open XML Paper Specification (OpenXPS) document. OPC-based file formats combine the advantages of leaving the independent file entities embedded in the document intact and resulting in much smaller files compared to normal use of XML.

The Office Open XML file formats, also known as OOXML, were standardised between December 2006 and November 2008, first by the Ecma International consortium, and subsequently, after a contentious standardization process, by the ISO/IEC's Joint Technical Committee 1.

This is a comparison of the Office Open XML document file format with the OpenDocument file format.

The Office Open XML format (OOXML), is an open and free document file format for saving and exchanging editable office documents such as text documents, spreadsheets, charts, and presentations.

The Office Open XML file formats are a set of file formats that can be used to represent electronic office documents. There are formats for word processing documents, spreadsheets and presentations as well as specific formats for material such as mathematical formulas, graphics, bibliographies etc.

References

  1. "application/vnd.openxmlformats-officedocument.wordprocessingml.document". www.iana.org. IANA. 2011-02-25. Retrieved 2024-11-03.
  2. Klaus-Peter Eckert; Jan Henrik Ziesing; Ucheoma Ishionwu. "Document Interoperability: Open Document Format and Office Open XML" (PDF). Fraunhofer Verlag. p. 90.
  3. "application/vnd.openxmlformats-officedocument.presentationml.presentation". www.iana.org. IANA. 2011-03-14. Retrieved 2024-11-03.
  4. "vnd.openxmlformats-officedocument.spreadsheetml.sheet". www.iana.org. IANA. 2011-03-16. Retrieved 2024-11-03.
  5. "The Document Foundation, LibreOffice and OOXML". The Document Foundation . Retrieved 2016-03-22.
  6. "Overview of the XML file formats in Office 2010". Office 2010 Resource Kit. Microsoft. 5 August 2011.
  7. 1 2 "XML file name extension reference for Office 2013". Office 2013 Resource Kit. Microsoft. 26 December 2016.
  8. 1 2 "XLSX Strict (Office Open XML), ISO 29500-1:2008-2016". Library of Congress . Retrieved 2018-09-09.
  9. Brian Jones (2007-01-25). "History of office XML formats (1998–2006)". MSDN blogs. Retrieved 2020-12-31.
  10. "Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization". Microsoft. 2005-11-21.
  11. Casson and Ryan, Open Standards, Open Source Adoption in the Public Sector, and Their Relationship to Microsoft’s Market Dominance
  12. Microsoft hands over Office XML specs to Ecma
  13. "Slides presented by the TC45 committee to Ecma International". Archived from the original on 2011-10-21. Retrieved 2011-06-08.
  14. "Ecma International approves Office Open XML standard". Ecma International. 2006-12-07.
  15. "ISO/IEC DIS 29500 receives necessary votes for approval as an International Standard". ISO. 2008-04-02.
  16. ISO/IEC (2008-11-18). "Publication of ISO/IEC 29500:2008, Information technology—Office Open XML formats". ISO. Archived from the original on 2009-07-06. Retrieved 2008-11-19.
  17. "Freely Available Standards". ITTF (ISO/IEC). 2008-11-18. Retrieved 2023-09-26.
  18. 1 2 "Standard ECMA-376". Ecma-international.org. Retrieved 2009-05-19.
  19. 1 2 3 Kirk, Jeremy (19 November 2008). "ISO publishes Office Open XML specification". InfoWorld . Retrieved 12 June 2010.
  20. "Norwegian standards body implodes over OOXML controversy". Ars Technica. 3 October 2008.
  21. "Code of Conduct in Patent Matters". Ecma International.
  22. "ISO/IEC/ITU common patent policy".
  23. "Microsoft's Open Specification Promise Eases Web Services Patent Concerns". September 12, 2006. Retrieved 2015-04-18.
  24. "2 Escape Hatches in MS's Covenant Not to Sue". Groklaw . December 4, 2005.
  25. Berlind, David (November 28, 2005). "Top open source lawyer blesses new terms on Microsoft's XML file format". ZDNet. Retrieved 2007-01-27.
  26. "Microsoft Open Specification Promise". Microsoft. 2007-02-15. Retrieved 2015-04-18.
  27. "Ecma formal publications". Ecma International. Ecma Standards and Technical Reports are made available to all interested persons or organizations, free of charge and licensing restrictions
  28. "Microsoft Open Specification Promise". Microsoft.com.
  29. "Licensing conditions that Microsoft offers for Office Open XML". Jtc1sc34.org. 2006-12-20. Retrieved 2009-05-19.
  30. "Microsoft Word — Responses to Comments and Perceived Contradictions.doc" (PDF). Retrieved 2009-09-16.
  31. "ISO search for "29500"". International Organization for Standardization. 2009-06-05.
  32. "Re-introducing on/off-values to ST-OnOff in OOXML Part 4". Archived from the original on 2009-06-26. Retrieved 2009-09-29.
  33. "OOXML and Office 2007 Conformance: a Smoke Test". Archived from the original on 2010-04-28. Retrieved 2009-09-29.
  34. "Minutes of the Copenhagen Meeting of ISO/IEC JTC1/SC34/WG4" (PDF). 2009-06-22. Archived from the original (PDF) on 2014-05-12. Retrieved 2009-09-29. page 15
  35. "ISO/IEC 29500-4:2008/Draft Amd2:2011 - Draft - Information technology — Document description and processing languages — Office Open XML File Formats — Part 4: Transitional Migration Features - AMENDMENT 2". 2011-03-02. Archived from the original (PDF) on 2014-05-12. Retrieved 2011-04-04.
  36. "Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats (Version 3)". Microsoft. 2007-06-18. Retrieved 2018-06-23.
  37. Open a Word 2007 document in an earlier version of Word - Word - Office.com. Office.microsoft.com. Retrieved on 2013-07-17.
  38. "Microsoft Expands List of Formats Supported in Microsoft Office". Microsoft. Retrieved 2008-05-21.
  39. "Microsoft's future lies somewhere beyond the Vista by Evansville Courier & Press". Courierpress.com. Archived from the original on 2014-07-10. Retrieved 2009-05-19.
  40. "Rivals Set Their Sights on Microsoft Office: Can They Topple the Giant? - Knowledge@Wharton". Knowledge.wharton.upenn.edu. Retrieved 2009-05-19.
  41. Andy Updegrove (21 May 2008). "Microsoft Office 2007 to Support ODF — and not OOXML". ConsortiumInfo.org. Archived from the original on 2008-05-23. Retrieved 2009-05-19.
  42. "ISO OOXML convener: Microsoft's format "heading for failure"". Ars Technica. 2 April 2010.
  43. 1 2 Brown, Alex (31 March 2010). "Microsoft Fails the Standards Test". Where is an end of it?. Alex Brown's weblog. Retrieved 23 June 2018.
  44. "Collabora Online 6.4.0-released". 2 November 2020.
  45. "LibreOffice OOXML" . Retrieved 22 March 2012.
  46. "OpenOffice.org 3.0 New Features". 2008-10-13. Retrieved 2009-10-24.
  47. "OpenOffice.org 3.2 New Features". 2010. Retrieved 2010-11-13.
  48. Scott Gilbertson (13 February 2010). "OpenOffice 3.2 - now with less Microsoft envy". The Register. Retrieved 18 Feb 2013. the ability to open password-protected Word, Excel, and PowerPoint files
  49. "3.2.0 (build OOO320_m12) - Release Notes" . Retrieved 18 Feb 2013. Import of password protected Microsoft Office XML files

Further reading