Windows encodings
From Just Solve the File Format Problem
(Difference between revisions)
(Created page with "{{FormatInfo |formattype=electronic |subcat=Character encoding }} '''Windows encodings''' refers to the various legacy character encodings used by the non-Unicode Microsof...") |
|||
| Line 15: | Line 15: | ||
* [[Windows 1257]] (Baltic Rim) | * [[Windows 1257]] (Baltic Rim) | ||
* [[Windows 1258]] (Vietnamese) | * [[Windows 1258]] (Vietnamese) | ||
| + | |||
| + | == Links == | ||
| + | * [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/ Unicode mappings] | ||
[[Category:Microsoft]] | [[Category:Microsoft]] | ||
[[Category:Windows]] | [[Category:Windows]] | ||
Revision as of 17:10, 25 January 2018
Windows encodings refers to the various legacy character encodings used by the non-Unicode Microsoft Windows API, and most non-Unicode-aware Windows applications. (GUI applications, anyway. Windows console applications default to using the MS-DOS encodings in many cases.)
In Windows jargon, these encodings are misleadingly called "ANSI", or in some cases (also somewhat misleadingly), "multi-byte" encodings.
All too many file formats use one of these encodings, with no reliable way to determine which one.
List of encodings
- Windows 1252 (ISO 8859-1 plus additional characters)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic, Farsi, Urdu)
- Windows 1257 (Baltic Rim)
- Windows 1258 (Vietnamese)