DOC
Dan Tobias (Talk | contribs) (→Commentary) |
(→Other links: added decalage link) |
||
Line 57: | Line 57: | ||
== Other links == | == Other links == | ||
* [http://www.forensicswiki.org/wiki/Word_Document_%28DOC%29 Forensics Wiki article] | * [http://www.forensicswiki.org/wiki/Word_Document_%28DOC%29 Forensics Wiki article] | ||
+ | * [http://decalage.info/file_formats_security/office MS Office 97-2003 legacy/binary formats security] - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries | ||
[[Category:Microsoft]] | [[Category:Microsoft]] |
Revision as of 12:25, 20 November 2014
MS Word Doc format is a family of formats used by older versions of MS Word (they now use DOCX as a default as of Office 2007).
Contents |
File Types
Wikipedia says the following four types exist
- Word for DOS
- Word 1 & Word 2 for MS Windows, and Word 4 & 5 for Mac
- Word 6 & Word 95 for MS Windows, and Word 6 for Mac
- Word 97, 2000, 2002, 2003, 2007 & 2010 for MS Windows, and Word 98, 2001, X, & 2004 for Mac
What's up, DOC?
But just because the file you stumbled onto has a DOC extension doesn't mean it is necessarily actually an MS Word file, though if it's not that old it probably is. Older files, like from the 1980s or 1990s, might be something else entirely. Several other word processors in that era used .DOC file extensions, even though their format was nothing like MS Word's. Also, it was fairly common for people to save plain text files with that extension when they were DOCumenting something, like the instructions for a program that was packed up in an ARC or ZIP file for download on a bulletin board system (BBS). But you might still try opening them with Word (as will normally happen in Windows if you have Word installed and double-click on a DOC file), since it will open plain-text files all right (even ancient ones).
Sample files
Opening Word for DOS files in a modern Microsoft Word
Word for DOS files can't be opened natively with the current versions of Microsoft Word anymore. However it is possible to import such old Word files with an additional converter for Word.
- Download the file ftp://ftp.microsoft.com/Softlib/MSLFILES/WDSUPCNV.EXE, open it (it is a self extracting zip file) and select a directory to save the files.
- Copy all the resulting *.cnv files (but most importantly
Doswrd32.cnv
) toC:\Program Files (x86)\Common Files\microsoft shared\TextConv
(For users with a 32-bit Windows it is justC:\Program Files\Common Files\microsoft shared\TextConv
) - (Re)Start Microsoft Word
- Open the old Word file via the Open-dialog within Word.
- Word will show a prompt informing you that a text converter has to be started and that this might impose a security risk which you should only do if you trust the source where you got the files from. Press OK (if you trust the source of the files).
- Word will most likely show a prompt like "Style Sheet D:/STANDARD.DFV not found". Press Ok. Now a file selector dialog will open asking you to select a style sheet (*.sty) file. If you do have a style sheet for the file then select this. Otherwise create a new empty file in the regular Windows Explorer, rename it "empty.sty" and select it in the file selector. Selecting such an empty file could cause opened file to lose some general properties like print margins etc.
- Now you should be able to see the Word for DOS file within the modern Microsoft Word.
Official specs
Other format descriptions
- Word 5.0 (for DOS) file format, with notes on earlier versions
- Another site with notes on Word for DOS file format
Software and Program Code
- Microsoft Word for Windows Version 1.1a Source Code
- Tool for finding hidden metadata in Word files, and some of the stuff it found
- Textract: extract text from various document formats including DOC
- Word for DOS 5.5
Commentary
- Why are the Microsoft Office file formats so complicated? (And some workarounds)
- Why Microsoft Word must Die
- LibreOffice import filter for legacy Mac file-formats
- Why Mac Word 6.0 was crappy, from a developer
- Article about Word 1.15
Other links
- Forensics Wiki article
- MS Office 97-2003 legacy/binary formats security - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries