Character encoding
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) |
(ISO 8859 variants, Windows encodings, ISO 646) |
||
Line 12: | Line 12: | ||
* [[Braille]] | * [[Braille]] | ||
* [[EBCDIC]] | * [[EBCDIC]] | ||
+ | * [[ISO 646]] | ||
+ | ** [[ISO 646-CA]] (Canada / French) | ||
+ | ** [[ISO 646-CA-2]] (Canada / French) | ||
+ | ** [[ISO 646-CH]] (Switzerland) | ||
+ | ** [[ISO 646-CN]] (China / Basic Latin) | ||
+ | ** [[ISO 646-CU]] (Cuba / Spanish) | ||
+ | ** [[ISO 646-DE]] (Germany) | ||
+ | ** [[ISO 646-DK]] (Denmark) | ||
+ | ** [[ISO 646-FI]] (Finland) | ||
+ | ** [[ISO 646-FR]] (France) | ||
+ | ** [[ISO 646-GB]] (Great Britain) | ||
+ | ** [[ISO 646-HU]] (Hungary) | ||
+ | ** [[ISO 646-IRV]] (International Reference Version) | ||
+ | ** [[ISO 646-IT]] (Italy) | ||
+ | ** [[ISO 646-JP]] (Japan / Romaji) | ||
+ | ** [[ISO 646-JP OCR-B]] (Japan / Romaji) | ||
+ | ** [[ISO 646-KR]] (Korea / Latin) | ||
+ | ** [[ISO 646-MT]] (Malta) | ||
+ | ** [[ISO 646-NL]] (Netherlands) | ||
+ | ** [[ISO 646-NO]] (Norway) | ||
+ | ** [[ISO 646-NO-2]] (Norway) | ||
+ | ** [[ISO 646-PT]] (Portugal) | ||
+ | ** [[ISO 646-SE]] (Sweden) | ||
+ | ** [[ISO 646-SE-2]] (Sweden) | ||
+ | ** [[ISO 646-US]] (Same as [[ASCII]]) | ||
+ | ** [[ISO 646-YU]] (Yugoslavia) | ||
* [[ISO 8859]] | * [[ISO 8859]] | ||
+ | ** [[ISO 8859-1]] (Latin-1) | ||
+ | ** [[ISO 8859-2]] (Latin-2, Central/East European) | ||
+ | ** [[ISO 8859-3]] (Latin-3, Esperanto, Galician, Maltese, and Turkish) | ||
+ | ** [[ISO 8859-4]] (Latin-4, Scandinavian and Baltic) | ||
+ | ** [[ISO 8859-5]] (Cyrillic) | ||
+ | ** [[ISO 8859-6]] (Arabic) | ||
+ | ** [[ISO 8859-7]] (Modern Greek) | ||
+ | ** [[ISO 8859-8]] (Hebrew) | ||
+ | ** [[ISO 8859-9]] (Latin-5, Turkish) | ||
+ | ** [[ISO 8859-10]] (Latin-6, Lappish, Nordic, and Inuit) | ||
+ | ** [[ISO 8859-11]] (Thai) | ||
+ | ** [[ISO 8859-13]] (Latin-7, Baltic Rim) | ||
+ | ** [[ISO 8859-14]] (Celtic) | ||
+ | ** [[ISO 8859-15]] (Latin-9, Latin-1 with a Euro sign) | ||
+ | ** [[ISO 8859-16]] (Romanian) | ||
* [[Morse code]] | * [[Morse code]] | ||
* [[Unicode]] | * [[Unicode]] | ||
Line 20: | Line 61: | ||
** [[UTF-7]] | ** [[UTF-7]] | ||
** [[UTF-EBCDIC]] | ** [[UTF-EBCDIC]] | ||
− | + | * [[Microsoft Windows encodings]] | |
+ | ** [[Windows 1252]] (ISO 8859-1 plus additional characters) | ||
+ | ** [[Windows 1255]] (Hebrew) | ||
+ | ** [[Windows 1256]] (Arabic, Farsi, Urdu) | ||
+ | ** [[Windows 1257]] (Baltic Rim) | ||
+ | ** [[Windows 1258]] (Vietnamese) | ||
== External links == | == External links == | ||
* [http://www.transbay.net/~enf/ascii/ascii.pdf The Evolution of Character Codes, 1874–1968] | * [http://www.transbay.net/~enf/ascii/ascii.pdf The Evolution of Character Codes, 1874–1968] | ||
+ | * [http://www.kreativekorp.com/charset/ Collection of character encodings]] |
Revision as of 16:11, 2 November 2012
File Formats | > | Electronic File Formats | > | Character Encoding |
- ASCII
- PET ASCII (or PETSCII or CBM-ASCII; used by Commodore computers)
- Baudot code
- Braille
- EBCDIC
- ISO 646
- ISO 646-CA (Canada / French)
- ISO 646-CA-2 (Canada / French)
- ISO 646-CH (Switzerland)
- ISO 646-CN (China / Basic Latin)
- ISO 646-CU (Cuba / Spanish)
- ISO 646-DE (Germany)
- ISO 646-DK (Denmark)
- ISO 646-FI (Finland)
- ISO 646-FR (France)
- ISO 646-GB (Great Britain)
- ISO 646-HU (Hungary)
- ISO 646-IRV (International Reference Version)
- ISO 646-IT (Italy)
- ISO 646-JP (Japan / Romaji)
- ISO 646-JP OCR-B (Japan / Romaji)
- ISO 646-KR (Korea / Latin)
- ISO 646-MT (Malta)
- ISO 646-NL (Netherlands)
- ISO 646-NO (Norway)
- ISO 646-NO-2 (Norway)
- ISO 646-PT (Portugal)
- ISO 646-SE (Sweden)
- ISO 646-SE-2 (Sweden)
- ISO 646-US (Same as ASCII)
- ISO 646-YU (Yugoslavia)
- ISO 8859
- ISO 8859-1 (Latin-1)
- ISO 8859-2 (Latin-2, Central/East European)
- ISO 8859-3 (Latin-3, Esperanto, Galician, Maltese, and Turkish)
- ISO 8859-4 (Latin-4, Scandinavian and Baltic)
- ISO 8859-5 (Cyrillic)
- ISO 8859-6 (Arabic)
- ISO 8859-7 (Modern Greek)
- ISO 8859-8 (Hebrew)
- ISO 8859-9 (Latin-5, Turkish)
- ISO 8859-10 (Latin-6, Lappish, Nordic, and Inuit)
- ISO 8859-11 (Thai)
- ISO 8859-13 (Latin-7, Baltic Rim)
- ISO 8859-14 (Celtic)
- ISO 8859-15 (Latin-9, Latin-1 with a Euro sign)
- ISO 8859-16 (Romanian)
- Morse code
- Unicode
- Microsoft Windows encodings
- Windows 1252 (ISO 8859-1 plus additional characters)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic, Farsi, Urdu)
- Windows 1257 (Baltic Rim)
- Windows 1258 (Vietnamese)