XLS

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Category:Microsoft Compound File)
m (Change telparia.com samples link to template)
 
(13 intermediate revisions by 6 users not shown)
Line 2: Line 2:
 
|formattype=electronic
 
|formattype=electronic
 
|subcat=Document
 
|subcat=Document
 +
|subcat2=Spreadsheet
 
|extensions={{ext|xls}}
 
|extensions={{ext|xls}}
 
|mimetypes={{mimetype|application/vnd.ms-excel}}
 
|mimetypes={{mimetype|application/vnd.ms-excel}}
 
}}
 
}}
 +
[[XLS]] is a family of spreadsheet formats native to [[Microsoft Excel]].
 +
 
==Microsoft Office EXCEL 97-2007==  
 
==Microsoft Office EXCEL 97-2007==  
  
Line 10: Line 13:
  
 
Later Excel versions use [[XLSX]] as their native format, though they still support the older format as well.
 
Later Excel versions use [[XLSX]] as their native format, though they still support the older format as well.
 +
 +
===Versions===
 +
{| class="wikitable"
 +
! Version !! BIFF Version !! Document type !! PRONOM !!Year
 +
|-
 +
|Excel 2.x || BIFF2  || Worksheet || {{PRONOM|fmt/55}} || 1987 
 +
|-
 +
|Excel 3.0 || BIFF3  || Worksheet || {{PRONOM|fmt/56}} || 1990
 +
|-
 +
|Excel 4.0 || BIFF4  || Worksheet || {{PRONOM|fmt/57}} || 1992
 +
|-
 +
|Excel 5.0 || BIFF5  || Workbook || {{PRONOM|fmt/59}} || 1993
 +
|-
 +
|Excel 7.0 || BIFF5  || Workbook || {{PRONOM|fmt/59}} || 1995
 +
|-
 +
|Excel 8.0 || BIFF8  || Workbook || {{PRONOM|fmt/61}} || 1997
 +
|-
 +
|Excel 9.0 || BIFF8  || Workbook || {{PRONOM|fmt/62}} || 1999
 +
|-
 +
|Excel 10.0 || BIFF8  || Workbook || {{PRONOM|fmt/62}} || 2001
 +
|-
 +
|Excel 11.0 || BIFF8  || Workbook || {{PRONOM|fmt/62}} || 2003
 +
|}
  
 
==Handling of date values==
 
==Handling of date values==
 
Excel stores date values as floating point numbers that represent the number of days since a given start date. According to [http://datapub.cdlib.org/2014/04/10/abandon-all-hope-ye-who-enter-dates-in-excel/ this piece], the default start dates are different for Excel for Windows (which uses January 1, 1900 and) Excel for Mac (which uses January 1, 1904). On top of this, the 1900 date system also erroneously assumes that 1900 was a leap year. This assumption was introduced on purpose in order to ensure compatibility with a bug in Lotus 1-2-3. In practice this may lead to dates that are off by 4 years and 1 day, depending on the software that is used to read/process the files.
 
Excel stores date values as floating point numbers that represent the number of days since a given start date. According to [http://datapub.cdlib.org/2014/04/10/abandon-all-hope-ye-who-enter-dates-in-excel/ this piece], the default start dates are different for Excel for Windows (which uses January 1, 1900 and) Excel for Mac (which uses January 1, 1904). On top of this, the 1900 date system also erroneously assumes that 1900 was a leap year. This assumption was introduced on purpose in order to ensure compatibility with a bug in Lotus 1-2-3. In practice this may lead to dates that are off by 4 years and 1 day, depending on the software that is used to read/process the files.
 +
 +
== Software ==
 +
* [http://libxls.sourceforge.net/ libxls] - Library for reading XLS
 +
* [http://sourceforge.net/projects/xlslib/ xlsLib] - Library for writing XLS
 +
* [http://www.lexicon.net/sjmachin/xlrd.html The xlrd Module, a Python module for extracting data from MS Excel spreadsheet files] - contains detailed information about Excel's date handling
 +
 +
== Sample files ==
 +
* [http://labs.nationalarchives.gov.uk/wordpress/index.php/2014/01/datasets/ National Archives (UK) datasets (includes some XLS files)]
 +
* {{DexvertSamples|document/xls}}
  
 
==References==
 
==References==
Line 18: Line 53:
 
* [http://www.joelonsoftware.com/items/2008/02/19.html Why are the Microsoft Office file formats so complicated? (And some workarounds)]
 
* [http://www.joelonsoftware.com/items/2008/02/19.html Why are the Microsoft Office file formats so complicated? (And some workarounds)]
 
* [http://www.advisorone.com/2013/04/16/oops-reinhart-rogoff-research-tripped-up-by-typo?t=theory-strategy Typo in Excel spreadsheet apparently led to erroneous result in economic paper that was influential on government policy]
 
* [http://www.advisorone.com/2013/04/16/oops-reinhart-rogoff-research-tripped-up-by-typo?t=theory-strategy Typo in Excel spreadsheet apparently led to erroneous result in economic paper that was influential on government policy]
* [http://datapub.cdlib.org/2014/04/10/abandon-all-hope-ye-who-enter-dates-in-excel/ Abandon all hope, ye who enter dates in Excel]
+
* [https://uc3.cdlib.org/2014/04/09/abandon-all-hope-ye-who-enter-dates-in-excel/ Abandon all hope, ye who enter dates in Excel]
 
* [http://support.microsoft.com/kb/180162 XL: The 1900 Date System vs. the 1904 Date System]
 
* [http://support.microsoft.com/kb/180162 XL: The 1900 Date System vs. the 1904 Date System]
* [http://www.lexicon.net/sjmachin/xlrd.html The xlrd Module, a Python module for extracting data from MS Excel spreadsheet files] - contains detailed information about Excel's date handling
 
 
* [http://decalage.info/file_formats_security/office MS Office 97-2003 legacy/binary formats security] - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries
 
* [http://decalage.info/file_formats_security/office MS Office 97-2003 legacy/binary formats security] - article with lots of resources on MS Office formats, including analysis techniques, tools and parsing libraries
 +
* [https://www.openoffice.org/sc/excelfileformat.pdf Open Office - Excel Versions 2, 3, 4, 5, 95, 97, 2000, XP, 2003]
 +
 +
== External links ==
 +
* [{{ForensicsWikiURL|excel_spreadsheet_%28xls%29}} ForensicsWiki entry] (need to copy over whatever relevant details are available)
 +
 
[[Category:Microsoft]]
 
[[Category:Microsoft]]
 
[[Category:Microsoft Compound File]]
 
[[Category:Microsoft Compound File]]

Latest revision as of 04:07, 28 December 2023

File Format
Name XLS
Ontology
Extension(s) .xls
MIME Type(s) application/vnd.ms-excel

XLS is a family of spreadsheet formats native to Microsoft Excel.

Contents

[edit] Microsoft Office EXCEL 97-2007

The Excel Binary File Format (.xls) Structure is the binary file format used by Microsoft Excel 97, 2000, 2002, and Office Excel 2003. It is also supported by Microsoft Office Excel 2007.

Later Excel versions use XLSX as their native format, though they still support the older format as well.

[edit] Versions

Version BIFF Version Document type PRONOM Year
Excel 2.x BIFF2 Worksheet fmt/55 1987
Excel 3.0 BIFF3 Worksheet fmt/56 1990
Excel 4.0 BIFF4 Worksheet fmt/57 1992
Excel 5.0 BIFF5 Workbook fmt/59 1993
Excel 7.0 BIFF5 Workbook fmt/59 1995
Excel 8.0 BIFF8 Workbook fmt/61 1997
Excel 9.0 BIFF8 Workbook fmt/62 1999
Excel 10.0 BIFF8 Workbook fmt/62 2001
Excel 11.0 BIFF8 Workbook fmt/62 2003

[edit] Handling of date values

Excel stores date values as floating point numbers that represent the number of days since a given start date. According to this piece, the default start dates are different for Excel for Windows (which uses January 1, 1900 and) Excel for Mac (which uses January 1, 1904). On top of this, the 1900 date system also erroneously assumes that 1900 was a leap year. This assumption was introduced on purpose in order to ensure compatibility with a bug in Lotus 1-2-3. In practice this may lead to dates that are off by 4 years and 1 day, depending on the software that is used to read/process the files.

[edit] Software

[edit] Sample files

[edit] References

[edit] External links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox