C1 controls
From Just Solve the File Format Problem
(Difference between revisions)
(New version cloned from C0, still working on it.) |
(Continuing work on basic content) |
||
Line 11: | Line 11: | ||
! title="Hexadecimal code point" | Hex | ! title="Hexadecimal code point" | Hex | ||
! title="Decimal code point" | Dec | ! title="Decimal code point" | Dec | ||
− | + | ! title="Standard abbreviation" | Abbreviation | |
− | ! title="Standard | + | |
! title="Character name" | Name | ! title="Character name" | Name | ||
! title="Description and uses" | Description and uses | ! title="Description and uses" | Description and uses | ||
|- | |- | ||
− | |80|| | + | |80||128||PAD||Null character||Marks unused space or padding (e.g., to intentionally slow down terminals or to leave space for added data in memory or storage media). Used in C-based programming languages to mark end of string. |
|- | |- | ||
− | |81|| | + | |81||129||HOP||Start of Heading||Marks the beginning of a header in a message or data structure. |
|- | |- | ||
− | |82|| | + | |82||130||BPH||Start of Text||Marks the beginning of the body text of a message, and/or the end of the header. |
|- | |- | ||
− | |83|| | + | |83||131||NBH||End of Text||Marks the end of the body text. Also used as "break character" (Control-C) to terminate a program or process. |
|- | |- | ||
− | |84|| | + | |84||132||IND||End of Transmission||In Unix-style operating systems, signals end-of-file and is used to log out of a terminal. On Apple II, this character signalled that what followed was a DOS command when it was "printed" to standard output. |
|- | |- | ||
− | |85|| | + | |85||133||NEL||Enquiry||Used in transmission protocols to request acknowledgement from the other end to make sure connection is still active. In DEC TOPS-20 mainframes, usually resulted in currently-active application outputing status information to terminal. |
|- | |- | ||
− | |86|| | + | |86||134||SSA||Acknowledge||Sent as response to ENQ message, or used to positively acknowledge receipt of data or messages (as opposed to NAK). |
|- | |- | ||
− | |87|| | + | |87||135||ESA||Bell||On some systems, this causes a bell, buzzer, or beep to sound, or flashes inverse video to alert a system operator. The Apple II had "BELL" on the front side of the "G" key to remind users that Ctrl-G caused this sound effect. |
|- | |- | ||
− | |88|| | + | |88||136||HTS||Backspace||Moves back one space. Usually deletes last character (e.g., from input string), but on some old terminals it just moved backward without deleting and allowed "overstrike" effects overlaying multiple characters. |
|- | |- | ||
− | |89|| | + | |89||137||HTJ||Horizontal Tab||The typewriter "tab key", usually moving to the next tab stop as defined in the particular software being used. |
|- | |- | ||
− | |8A|| | + | |8A||138||VTS||Line Feed||Move down one line. In Unix-style operating systems, it also moves to the beginning of the next line so that it can be used as a line break (newline) character, while in some other systems and terminals it just moves down without moving to the left, requiring the "CR LF" sequence to break a line. |
|- | |- | ||
− | |8B|| | + | |8B||139||PLD||Vertical Tab||Moves to vertical tab stops; not used nearly as often as the more-common horizontal tab. |
|- | |- | ||
− | |8C|| | + | |8C||140||PLU||Form Feed||Causes page to eject in printers, and may clear the screen in some terminal emulators. Sometimes used as a logical division of sections of a document. |
|- | |- | ||
− | |8D|| | + | |8D||141||RI||Carriage Return||Moves to the beginning of the line. In some systems (e.g., Apple II, Commodore 64, and TRS-80, and early Macintosh systems before its OS switched to a Unix-based system), also moves to the next line so that it can be used as a line break character, while in other systems it stays on the same line so that it must be accompanied by a LF character to break a line (but on some printing terminals CR with no LF was used for overstrike effects including underlining by printing underscores). Thus the three different line-break conventions (LF, CR, and CR+LF) arose, which bedevil users of text files to this day. As an input character, CR is generally mapped onto the Enter key, signaling the completion of input. |
|- | |- | ||
− | |8E|| | + | |8E||142||SS2||Shift Out||Switch to alternate character set (reversed by SI). Used in various systems and terminals to set different characters (e.g., APL or Cyrillic), or change the color or font. |
|- | |- | ||
− | |8F|| | + | |8F||143||SS3||Shift In||Return to normal character set (reverses operation of SO). |
|- | |- | ||
− | |90|| | + | |90||144||DCS||Data Link Escape||Signals the start of a sequence of raw data as opposed to normal printable or control characters. |
|- | |- | ||
− | |91|| | + | |91||145||PU1||Device Control 1||One of four device-control codes intended to be system-specific. This one (CTRL-Q, also known as XON) is often used to resume operations of a process, device, or output stream that has been paused with CTRL-S (XOFF). |
|- | |- | ||
− | |92|| | + | |92||146||PU2||Device Control 2||Another device-control code; not used as much as DC1 and DC3. |
|- | |- | ||
− | |93|| | + | |93||147||STS||Device Control 3||The third of the device-control codes; this one (CTRL-S, also known as XOFF) is often used to pause processes, devices, or output streams, with CTRL-Q (XON) resuming them (though in some cases, any keypress causes output to resume). |
|- | |- | ||
− | |94|| | + | |94||148||PCH||Device Control 4||The fourth device-control code; not used as much as DC1 or DC3. In DEC TOPS-20 mainframes, usually resulted in output of system status to terminal. |
|- | |- | ||
− | |95|| | + | |95||149||MW||Negative Acknowledge||In transmission protocols, indicates a failure requiring a re-send, or a negative response to a query of whether the process is ready to proceed. |
|- | |- | ||
− | |96|| | + | |96||150||SPA||Synchronous Idle||Signals that a correction may now be received in synchronous transmission protocols. |
|- | |- | ||
− | |97|| | + | |97||151||EPA||End of Transmission Block||Marks the end of a block of data divided into blocks for transmission. |
|- | |- | ||
− | |98|| | + | |98||152||SOS||Cancel||Cancels an operation and signals that previously-sent data can be disregarded. |
|- | |- | ||
− | |99|| | + | |99||153||SGCI||End of Medium||Marks the end of a physical medium such as a data-storage tape. |
|- | |- | ||
− | |9A|| | + | |9A||154||SCI||Substitute Character||Used to mark the spot where garbled, missing, or incomplete characters were received due to transmission errors, or various other uses involving place-holder characters. This character (Ctrl-Z) is also used by MS/PC-DOS to mark the end of a file or input stream, calling it EOF (although CTRL-D, EOT, would have been more standards-compliant and is used by Unix-style OSs for this purpose; however, some DEC operating systems used the CTRL-Z convention and this is what was followed by PC-DOS). |
|- | |- | ||
− | |9B|| | + | |9B||155||CSI||Escape||Mapped onto the ESC key on keyboards, this usually signals a user attempting to exit a menu or mode. It is also commonly used in printer and terminal control protocols to signal the beginning of a special "escape sequence" where immediately-following characters are interpreted as commands. |
|- | |- | ||
− | |9C|| | + | |9C||156||ST||File Separator||One of four separator characters intended to delimit structured data. FS is the highest-level separator, intended to separate structures which are in turn internally delimited with GS, RS, and US (in descending order). Also used as a "quit and dump core" signal in Unix shells. |
|- | |- | ||
− | |9D|| | + | |9D||157||OSC||Group Separator||The second of four separator characters, subordinate to FS, but higher-level than RS and US. |
|- | |- | ||
− | |9E|| | + | |9E||158||PM||Record Separator||The third of four separator characters, subordinate to FS and GS, but higher-level than US. |
|- | |- | ||
− | |9F|| | + | |9F||159||APC||Unit Separator||The lowest-level of the separator characters, used to divide strings of ASCII characters which are the base elements of a data structure. A sequence of such US-delimited strings can in turn be used as a higher-level data element separated by other such elements by the RS character, and this structure in turn can be delimited from other such elements by GS, and finally if a fourth level is needed the FS character separates those elements. |
|} | |} | ||
[[Category:File format details]] | [[Category:File format details]] |
Revision as of 12:49, 30 November 2012
The C1 controls are the control characters (code positions 128-159 decimal) which are part of the ISO-8859 standard. They are also part of a number of other character sets derived from ASCII. They are not often used, and in otherwise equivalent Microsoft character sets (e.g., Windows 1252) they are replaced by printing characters.
NOTE: I'm cloning this from the C0 article and saving it occasionally. There will be gross errors till I'm done. Bear with me or dive in. --Gmcgath (talk) 12:34, 30 November 2012 (UTC)
Hex | Dec | Abbreviation | Name | Description and uses |
---|---|---|---|---|
80 | 128 | PAD | Null character | Marks unused space or padding (e.g., to intentionally slow down terminals or to leave space for added data in memory or storage media). Used in C-based programming languages to mark end of string. |
81 | 129 | HOP | Start of Heading | Marks the beginning of a header in a message or data structure. |
82 | 130 | BPH | Start of Text | Marks the beginning of the body text of a message, and/or the end of the header. |
83 | 131 | NBH | End of Text | Marks the end of the body text. Also used as "break character" (Control-C) to terminate a program or process. |
84 | 132 | IND | End of Transmission | In Unix-style operating systems, signals end-of-file and is used to log out of a terminal. On Apple II, this character signalled that what followed was a DOS command when it was "printed" to standard output. |
85 | 133 | NEL | Enquiry | Used in transmission protocols to request acknowledgement from the other end to make sure connection is still active. In DEC TOPS-20 mainframes, usually resulted in currently-active application outputing status information to terminal. |
86 | 134 | SSA | Acknowledge | Sent as response to ENQ message, or used to positively acknowledge receipt of data or messages (as opposed to NAK). |
87 | 135 | ESA | Bell | On some systems, this causes a bell, buzzer, or beep to sound, or flashes inverse video to alert a system operator. The Apple II had "BELL" on the front side of the "G" key to remind users that Ctrl-G caused this sound effect. |
88 | 136 | HTS | Backspace | Moves back one space. Usually deletes last character (e.g., from input string), but on some old terminals it just moved backward without deleting and allowed "overstrike" effects overlaying multiple characters. |
89 | 137 | HTJ | Horizontal Tab | The typewriter "tab key", usually moving to the next tab stop as defined in the particular software being used. |
8A | 138 | VTS | Line Feed | Move down one line. In Unix-style operating systems, it also moves to the beginning of the next line so that it can be used as a line break (newline) character, while in some other systems and terminals it just moves down without moving to the left, requiring the "CR LF" sequence to break a line. |
8B | 139 | PLD | Vertical Tab | Moves to vertical tab stops; not used nearly as often as the more-common horizontal tab. |
8C | 140 | PLU | Form Feed | Causes page to eject in printers, and may clear the screen in some terminal emulators. Sometimes used as a logical division of sections of a document. |
8D | 141 | RI | Carriage Return | Moves to the beginning of the line. In some systems (e.g., Apple II, Commodore 64, and TRS-80, and early Macintosh systems before its OS switched to a Unix-based system), also moves to the next line so that it can be used as a line break character, while in other systems it stays on the same line so that it must be accompanied by a LF character to break a line (but on some printing terminals CR with no LF was used for overstrike effects including underlining by printing underscores). Thus the three different line-break conventions (LF, CR, and CR+LF) arose, which bedevil users of text files to this day. As an input character, CR is generally mapped onto the Enter key, signaling the completion of input. |
8E | 142 | SS2 | Shift Out | Switch to alternate character set (reversed by SI). Used in various systems and terminals to set different characters (e.g., APL or Cyrillic), or change the color or font. |
8F | 143 | SS3 | Shift In | Return to normal character set (reverses operation of SO). |
90 | 144 | DCS | Data Link Escape | Signals the start of a sequence of raw data as opposed to normal printable or control characters. |
91 | 145 | PU1 | Device Control 1 | One of four device-control codes intended to be system-specific. This one (CTRL-Q, also known as XON) is often used to resume operations of a process, device, or output stream that has been paused with CTRL-S (XOFF). |
92 | 146 | PU2 | Device Control 2 | Another device-control code; not used as much as DC1 and DC3. |
93 | 147 | STS | Device Control 3 | The third of the device-control codes; this one (CTRL-S, also known as XOFF) is often used to pause processes, devices, or output streams, with CTRL-Q (XON) resuming them (though in some cases, any keypress causes output to resume). |
94 | 148 | PCH | Device Control 4 | The fourth device-control code; not used as much as DC1 or DC3. In DEC TOPS-20 mainframes, usually resulted in output of system status to terminal. |
95 | 149 | MW | Negative Acknowledge | In transmission protocols, indicates a failure requiring a re-send, or a negative response to a query of whether the process is ready to proceed. |
96 | 150 | SPA | Synchronous Idle | Signals that a correction may now be received in synchronous transmission protocols. |
97 | 151 | EPA | End of Transmission Block | Marks the end of a block of data divided into blocks for transmission. |
98 | 152 | SOS | Cancel | Cancels an operation and signals that previously-sent data can be disregarded. |
99 | 153 | SGCI | End of Medium | Marks the end of a physical medium such as a data-storage tape. |
9A | 154 | SCI | Substitute Character | Used to mark the spot where garbled, missing, or incomplete characters were received due to transmission errors, or various other uses involving place-holder characters. This character (Ctrl-Z) is also used by MS/PC-DOS to mark the end of a file or input stream, calling it EOF (although CTRL-D, EOT, would have been more standards-compliant and is used by Unix-style OSs for this purpose; however, some DEC operating systems used the CTRL-Z convention and this is what was followed by PC-DOS). |
9B | 155 | CSI | Escape | Mapped onto the ESC key on keyboards, this usually signals a user attempting to exit a menu or mode. It is also commonly used in printer and terminal control protocols to signal the beginning of a special "escape sequence" where immediately-following characters are interpreted as commands. |
9C | 156 | ST | File Separator | One of four separator characters intended to delimit structured data. FS is the highest-level separator, intended to separate structures which are in turn internally delimited with GS, RS, and US (in descending order). Also used as a "quit and dump core" signal in Unix shells. |
9D | 157 | OSC | Group Separator | The second of four separator characters, subordinate to FS, but higher-level than RS and US. |
9E | 158 | PM | Record Separator | The third of four separator characters, subordinate to FS and GS, but higher-level than US. |
9F | 159 | APC | Unit Separator | The lowest-level of the separator characters, used to divide strings of ASCII characters which are the base elements of a data structure. A sequence of such US-delimited strings can in turn be used as a higher-level data element separated by other such elements by the RS character, and this structure in turn can be delimited from other such elements by GS, and finally if a fourth level is needed the FS character separates those elements. |