Windows 1252
m |
Dan Tobias (Talk | contribs) |
||
(8 intermediate revisions by 2 users not shown) | |||
Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Character encoding | |subcat=Character encoding | ||
+ | |subcat2=Windows encodings | ||
+ | |wikidata={{wikidata|Q2736131}} | ||
+ | |charset=windows-1252 | ||
+ | |charsetaliases=cswindows1252 | ||
+ | |mibenum=2252 | ||
+ | |codepage=1252 | ||
+ | |cfstringencoding=1280 | ||
+ | |nsstringencoding=12 | ||
}} | }} | ||
− | '''Windows 1252''' is a character encoding used in Microsoft Windows systems, particularly English- | + | '''Windows 1252''' (CP1252, Windows-1252, Windows CP1252, Windows Latin Western, Windows Latin, Windows ANSI) is a character encoding used in Microsoft Windows systems, particularly English-language installations. It is one of the [[Windows encodings]]. It includes all the printable characters of [[ISO 8859-1]] (Latin-1) (plus the [[ASCII]] control characters of the [[C0 controls]]), as well as additional characters in the range 128-159, which in ISO 8859 is reserved for control characters of the [[C1 controls]]. It's often falsely claimed to be an ANSI standard. |
− | Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of <code>&128;</code> through <code>&159;</code>, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be [[Unicode]] code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode. | + | Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of <code>&128;</code> through <code>&159;</code>, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be [[Unicode]] code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode. (But note that [[HTML]] 5 sidesteps this issue, by not really supporting ISO-8859-1 at all. Instead, "iso-8859-1" and many other encoding names are to be treated as [https://www.w3.org/TR/encoding/#names-and-labels aliases of Windows 1252].) |
== References == | == References == | ||
* [http://msdn.microsoft.com/en-us/library/cc195054.aspx Microsoft Windows 1252 code page] | * [http://msdn.microsoft.com/en-us/library/cc195054.aspx Microsoft Windows 1252 code page] | ||
+ | * [http://www.kreativekorp.com/charset/encoding/CP1252/ Code chart at KreativeKorp] | ||
+ | * [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT Unicode mapping] | ||
[[Category:Microsoft]] | [[Category:Microsoft]] | ||
+ | [[Category:Windows]] |
Latest revision as of 17:33, 23 June 2019
Windows 1252 (CP1252, Windows-1252, Windows CP1252, Windows Latin Western, Windows Latin, Windows ANSI) is a character encoding used in Microsoft Windows systems, particularly English-language installations. It is one of the Windows encodings. It includes all the printable characters of ISO 8859-1 (Latin-1) (plus the ASCII control characters of the C0 controls), as well as additional characters in the range 128-159, which in ISO 8859 is reserved for control characters of the C1 controls. It's often falsely claimed to be an ANSI standard.
Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of &128;
through &159;
, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be Unicode code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode. (But note that HTML 5 sidesteps this issue, by not really supporting ISO-8859-1 at all. Instead, "iso-8859-1" and many other encoding names are to be treated as aliases of Windows 1252.)