WordProcessingML

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Sample file)
Line 27: Line 27:
 
* [[DataDiagrammingML]]
 
* [[DataDiagrammingML]]
  
== Sample file ==
+
== Sample files ==
 
* [http://www.dan.info/sampledata/Word_2003_XML_Document.xml Sample WordProcessingML document (saved from Word 2007)]
 
* [http://www.dan.info/sampledata/Word_2003_XML_Document.xml Sample WordProcessingML document (saved from Word 2007)]
 +
* [http://www.dan.info/sampledata/msword/testing.xml Document saved from Windows Word 2007 in XML 2006 format]
 +
* [http://www.dan.info/sampledata/msword/testing-2003.xml Document saved from Windows Word 2007 in XML 2003 format]
  
 
== Links ==
 
== Links ==

Revision as of 01:09, 19 April 2014

File Format
Name WordProcessingML
Ontology
Extension(s) .xml

WordProcessingML or Word 2003 XML Document is an XML-based format which was introduced in Microsoft Office 2003 as one of the formats which could be chosen in the "Save As" feature to save Word documents, though not the default format (which was DOC, a proprietary binary format). This is a different format from the DOCX format introduced in Office 2007, which consists of a ZIP archive of various files including XML. In contrast, WordProcessingML is a single XML file, uncompressed, and is unable to store all features which can be present in an Word document. The 2007 versions are still capable of loading and saving WordProcessingML, even if a different XML-based format is the default format.

Contents

Identification

A WordProcessingML file has the following header:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument
  xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
  xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  w:macrosPresent="no"
  w:embeddedObjPresent="no"
  w:ocxPresent="no"
  xml:space="preserve">

(which can vary somewhat depending on what version of Word it is saved from; from Word 2007 it lacks line breaks and uses different namespace URIs: http://schemas.microsoft.com/aml/2001/core, uuid:C2F41010-65B3-11d1-A29F-00AA00C14882, http://schemas.openxmlformats.org/markup-compatibility/2006)

Related formats

Sample files

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox