Doc (computing)

Last updated
Word Document
.doc icon.svg
Filename extension
doc
Internet media type
application/msword [1]
Uniform Type Identifier (UTI) com.microsoft.word.doc [2] [3]
Developed by Microsoft
Latest release
12.0
20 August 2024;3 months ago (2024-08-20) [4]
Type of format document file format
Container for Text, Image,Table
Extended from Compound File Binary Format (since 97)
Extended to Microsoft Office XML formats, Office Open XML
Open format?Yes

.doc (an abbreviation of "document") is a filename extension used for word processing documents stored on Microsoft's proprietary Microsoft Word Binary File Format; it was the primary format for Microsoft Word until the 2007 version replaced it with Office Open XML .docx files. [4] Microsoft has used the extension since 1983.

Contents

Overview

Binary DOC files often contain more text formatting information (as well as scripts and undo information) than some other document file formats like Rich Text Format and Hypertext Markup Language, but are usually less widely compatible.

The DOC files created with Microsoft Word versions differ: Word for Windows 1.0 and 2.0 uses a format that was changed in the following Word 6.0 and 95 ("7.0") releases. DOC was changed once again into an OLE and CFBF-based format used from Word 97 ("8.0") to 2003 ("11.0"). Word for MS-DOS used its own specific DOC format. [5]

In order to allow users of Word 6.0 and Word 95 to be able to open and read documents created in the newer (97–2003) format, Microsoft released the downloadable Word 97 Import Converter. [6] [7]

In Microsoft Word 2007 and later, the binary file format was replaced as the default format by the Office Open XML (.docx) format, though Microsoft Word can still produce and open DOC files. Microsoft released the Office Compatibility Pack allowing users of Word 2003, 2002, 2000 and 97 to open the new format as well as edit and save them. [8] Modern versions of Word can also open documents in the pre-97 DOC formats, but should be manually enabled in the File Block Settings. [5]

Application support

The DOC format is native to Microsoft Word. Other word processors, such as OpenOffice.org Writer, IBM Lotus Symphony, Apple Pages and AbiWord, can also create and read DOC files, although with some limitations. Command line programs for Unix-like operating systems that can convert files from the DOC format to plain text or other standard formats include the wv library, which itself is used directly by AbiWord.

Specification

Because the DOC file format was a closed specification for many years, inconsistent handling of the format persists and may cause some loss of formatting information when handling the same file with multiple word processing programs. Some specifications for Microsoft Office 97 binary file formats were published in 1997 under a restrictive license, but these specifications were removed from online download in 1999. [9] [10] [11] [12] Specifications of later versions of Microsoft Office binary file formats were not publicly available. The DOC format specification was available from Microsoft on request [13] since 2006 [14] under restrictive RAND-Z terms until February 2008. Sun Microsystems and OpenOffice.org reverse engineered the file format. [15] On February 15, 2008, Microsoft released a .DOC format specification [4] [16] [17] under the Microsoft Open Specification Promise. [18] [19] However, this specification does not describe all of the features used by DOC format and reverse engineered work remains necessary. [20] Since 2008 the specification has been updated several times; the latest change was made in May 2022.

The format used in earlier, pre-97 ("1.0" 1989 through "7.0" 1995) versions of Word are less known, but both OpenOffice and LibreOffice contain open-source code for reading these formats. The format is probably related to the "Stream" format found in similar Excel versions. [21] Word 95 also seems to have an OLE-wrapped form.

Other .doc file formats

Some historical documentations may use the DOC filename extension for plain-text files, indicating documentation for software or hardware. The DOC filename extension was also used during the 1980s by WordPerfect for its proprietary format.[ citation needed ]

DOC is sometimes used by users of Palm OS as shorthand for PalmDoc, an unrelated format (commonly using PDB filename extension) used to encode text files such as ebooks.[ citation needed ]

See also

Related Research Articles

<span class="mw-page-title-main">Microsoft Word</span> Word processor

Microsoft Word is a word processing program developed by Microsoft. It was first released on October 25, 1983, under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platforms including: IBM PCs running DOS (1983), Apple Macintosh running the Classic Mac OS (1985), AT&T UNIX PC (1985), Atari ST (1988), OS/2 (1989), Microsoft Windows (1989), SCO Unix (1990), macOS (2001), Web browsers (2010), iOS (2014) and Android (2015).

The Rich Text Format is a proprietary document file format with published specification developed by Microsoft Corporation from 1987 until 2008 for cross-platform document interchange with Microsoft products. Prior to 2008, Microsoft published updated specifications for RTF with major revisions of Microsoft Word and Office versions.

<span class="mw-page-title-main">WPS Office</span> Office suite software by Kingsoft

WPS Office is an office suite for Microsoft Windows, macOS, Linux, iOS, Android, Fire OS and HarmonyOS developed by Chinese software developer company, Kingsoft Office Software. It also comes pre-installed on Amazon Fire tablets. WPS Office is made up of three primary components: WPS Writer, WPS Slides, and WPS Sheets. By 2022, WPS Office reached a number of more than 494 million monthly active users and over 1.2 billion installations.

A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats.

The Open Document Format for Office Applications (ODF), also known as OpenDocument, standardized as ISO 26300, is an open file format for word processing documents, spreadsheets, presentations and graphics and using ZIP-compressed XML files. It was developed with the aim of providing an open, XML-based file format specification for office applications.

<span class="mw-page-title-main">NeoOffice</span> macOS office suite

NeoOffice was an office suite for the macOS operating system developed by Planamesa Inc. It was a commercial fork of the free and open source LibreOffice office suite, including a word processor, spreadsheet, presentation program, and graphics program. It added some features not present in the macOS versions of LibreOffice and Apache OpenOffice. The last few versions were based on LibreOffice 4.4, which was released mid-2014.

Office Open XML is a zipped, XML-based file format developed by Microsoft for representing spreadsheets, charts, presentations and word processing documents. Ecma International standardized the initial version as ECMA-376. ISO and IEC standardized later versions as ISO/IEC 29500.

This article describes the technical specifications of the OpenDocument office document standard, as developed by the OASIS industry consortium. A variety of organizations developed the standard publicly and make it publicly accessible, meaning it can be implemented by anyone without restriction. The OpenDocument format aims to provide an open alternative to proprietary document formats.

In file formats, a document template is a common feature of many software applications that define a unique non-executable file format intended specifically for that particular application.

Uniform Office Format, sometimes known as Unified Office Format, is an open standard for office applications developed in China. It includes word processing, presentation, and spreadsheet modules, and is made up of GUI, API, and format specifications. The document format described uses XML contained in a compressed file container, similar to OpenDocument and Office Open XML.

The Microsoft Open Specification Promise is a promise by Microsoft, published in September 2006, to not assert its patents, in certain conditions, against implementations of a certain list of specifications.

<span class="mw-page-title-main">Microsoft Word Viewer</span> Freeware program

Microsoft Word Viewer is a discontinued freeware program for Microsoft Windows that can display and print Microsoft Word documents. Word Viewer allows text from a Word document to be copied into clipboard and pasted into a word processor. The last version was Word Viewer 2003 Service Pack 3 released in 2007.

The Office Open XML format (OOXML), is an open and free document file format for saving and exchanging editable office documents such as text documents, spreadsheets, charts, and presentations.

<span class="mw-page-title-main">Solid Converter PDF</span>

Solid Converter PDF is document reconstruction software from Solid Documents which converts PDF files to editable formats. Originally released for the Microsoft Windows operating system, a Mac OS X version was released in 2010. The current versions are Solid Converter PDF 9.0 for Windows and Solid PDF to Word for Mac 2.1. The same technology used by the product's Solid Framework SDK is licensed by Adobe for Acrobat X.

References

  1. "IME Content-Type/Subtype - application/msword". IANA. 1993-07-22. Retrieved 2012-06-20.
  2. Uniform Type Identifiers Reference (PDF), Apple, retrieved 2012-06-20
  3. "System-Declared Uniform Type Identifiers (Mac OS X v10.4)". Apple Developer Connection. Apple Inc. 2008-04-08.
  4. 1 2 3 MS-DOC: Word (.doc) Binary File Format, 2019-11-19, retrieved 2020-02-25
  5. 1 2 "Configuring the File Block Settings". www.ibm.com. Retrieved 2024-12-03.
  6. "WD: Word 97 Import Converter: How to Do an Administrative Setup". helparchive.huntertur.net. Retrieved 2024-12-04.
  7. "Q162214 - WD: How to Obtain the Word 97-2000 Import Converter". Microsoft. 2002-10-09. Archived from the original on 2002-10-09. Retrieved 2024-12-03.
  8. "Download details: Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats". Microsoft. 2006-12-06. Retrieved 2024-12-03.
  9. "Comparing ODF and OOXML" (PDF). 2006. Archived from the original (pdf) on 2011-09-28. Retrieved 2011-05-23.
  10. Beware of Geeks Bearing Gifts, 2006, retrieved 2011-05-23
  11. "A Word 8 converter for Unix" . Retrieved 2011-05-23.
  12. "Microsoft Word 97 Binary File Format" . Retrieved 2011-05-23.
  13. "Royalty-free specifications for Microsoft Office binary file formats" . Retrieved 2011-05-23.
  14. "Mapping documents in the binary format (.doc; .xls; .ppt) to the Open XML format". 2008-01-16. Retrieved 2011-05-23.
  15. "Microsoft Compound Document Format" (PDF). OpenOffice.org . 2007-08-07.
  16. Microsoft Office Binary (doc, xls, ppt) File Formats, 2008-02-15, archived from the original on 2008-02-18
  17. "Microsoft Office Word 97 - 2007 Binary File Format Specification (*.doc)" (PDF). Microsoft Corporation. 2008.
  18. "Microsoft Open Specification Promise". Microsoft Corporation. March 23, 2009.
  19. "How to extract information from Office files by using Office file formats and schemas" . Retrieved 2011-05-23.
  20. Joel Spolsky. "Why are the Microsoft Office file formats so complicated? (And some workarounds)". Archived from the original on 2013-10-14. Retrieved 2011-05-23.
  21. "LibreOffice/core". GitHub.