Windows 1252

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
 
(10 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
{{FormatInfo
 
{{FormatInfo
 
|formattype=electronic
 
|formattype=electronic
|subcat=Character Encodings
+
|subcat=Character encoding
 +
|subcat2=Windows encodings
 +
|wikidata={{wikidata|Q2736131}}
 +
|charset=windows-1252
 +
|charsetaliases=cswindows1252
 +
|mibenum=2252
 +
|codepage=1252
 +
|cfstringencoding=1280
 +
|nsstringencoding=12
 
}}
 
}}
'''Windows 1252''' is a character encoding used in Microsoft Windows systems, particularly English-langauge installations. It includes all the printable characters of [[ISO 8859-1]] (Latin-1) (plus the [[ASCII]] control characters of the [[C0 controls]]), as well as additional characters in the range 128-159, which in ISO 8859 is reserved for control characters of the [[C1 controls]]. It's often falsely claimed to be an ANSI standard.
+
'''Windows 1252''' (CP1252, Windows-1252, Windows CP1252, Windows Latin Western, Windows Latin, Windows ANSI) is a character encoding used in Microsoft Windows systems, particularly English-language installations. It is one of the [[Windows encodings]]. It includes all the printable characters of [[ISO 8859-1]] (Latin-1) (plus the [[ASCII]] control characters of the [[C0 controls]]), as well as additional characters in the range 128-159, which in ISO 8859 is reserved for control characters of the [[C1 controls]]. It's often falsely claimed to be an ANSI standard.
  
Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of <code>&amp;128;</code> through <code>&amp;159;</code>, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be Unicode code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode.
+
Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of <code>&amp;128;</code> through <code>&amp;159;</code>, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be [[Unicode]] code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode. (But note that [[HTML]] 5 sidesteps this issue, by not really supporting ISO-8859-1 at all. Instead, "iso-8859-1" and many other encoding names are to be treated as [https://www.w3.org/TR/encoding/#names-and-labels aliases of Windows 1252].)
  
 
== References ==
 
== References ==
 
* [http://msdn.microsoft.com/en-us/library/cc195054.aspx Microsoft Windows 1252 code page]
 
* [http://msdn.microsoft.com/en-us/library/cc195054.aspx Microsoft Windows 1252 code page]
 +
* [http://www.kreativekorp.com/charset/encoding/CP1252/ Code chart at KreativeKorp]
 +
* [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT Unicode mapping]
  
 
[[Category:Microsoft]]
 
[[Category:Microsoft]]
 +
[[Category:Windows]]

Latest revision as of 17:33, 23 June 2019

File Format
Name Windows 1252
Ontology
Wikidata ID Q2736131
IANA charset windows-1252
IANA aliases cswindows1252
IANA MIBenum 2252
Code Page 1252
CFStringEncoding 1280
NSStringEncoding 12

Windows 1252 (CP1252, Windows-1252, Windows CP1252, Windows Latin Western, Windows Latin, Windows ANSI) is a character encoding used in Microsoft Windows systems, particularly English-language installations. It is one of the Windows encodings. It includes all the printable characters of ISO 8859-1 (Latin-1) (plus the ASCII control characters of the C0 controls), as well as additional characters in the range 128-159, which in ISO 8859 is reserved for control characters of the C1 controls. It's often falsely claimed to be an ANSI standard.

Websites and e-mail messages often mistakenly use headers and parameters claiming a document or message to be in ISO-8859-1 when it actually uses characters that are part of Windows 1252. Bogus numeric character references in HTML, in the range of &128; through &159;, will also sometimes turn up, usually extruded by misbehaving web-authoring programs, when such references are intended by the standards to be Unicode code positions, not codes from a system-specific encoding. The Windows 1252 characters in this range, including curly quotes and apostrophes and the ellipsis, turn up often in Web documents and have proper codings elsewhere in Unicode. (But note that HTML 5 sidesteps this issue, by not really supporting ISO-8859-1 at all. Instead, "iso-8859-1" and many other encoding names are to be treated as aliases of Windows 1252.)

[edit] References

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox