Cancel character

In telecommunication, the term cancel character has the following meanings:

  1. A control character ("CAN", "Cancel", U+0018, or ^X) used to indicate that the data with which it is associated are in error or are to be disregarded. The exact meaning depends on the protocol. For example:
    • In some journalistic text transmission formats, it signifies that the preceding word should be deleted; it is sometimes called "Kill Word" ("KW") in this context. [1]
    • In some Videotex formats, it stops any running macros. [2] [3] In others, it clears the current line after the cursor position (compare EL). [4]
  2. A control character ("CCH", "Cancel Character", U+0094, or ESC T) used to erase the previous character. It was created as an unambiguous alternative to the much more common backspace character ("BS", U+0008), whose alternative function, now mostly obsolete, is to cause the following character to be superimposed on the preceding one.
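
The two meanings above can be illustrated with a minimal Python sketch. It is not taken from any standard or real protocol implementation: the function name `apply_cancel_controls` is hypothetical, CAN is given the "Kill Word" reading from the journalistic formats, and the rule that a word is a run of non-whitespace characters is an assumption made only for this example.

```python
CAN = "\u0018"  # Cancel (C0 control character, ^X)
CCH = "\u0094"  # Cancel Character (C1 control character)


def apply_cancel_controls(text: str) -> str:
    """Apply CCH as 'erase previous character' and CAN as 'kill word'.

    Hypothetical helper for illustration only; real protocols may assign
    CAN a different meaning (for example, stopping macros in Videotex).
    """
    out: list[str] = []
    for ch in text:
        if ch == CCH:
            # Erase the previous character, if there is one.
            if out:
                out.pop()
        elif ch == CAN:
            # Assumption: a "word" is a run of non-whitespace characters.
            # Drop any whitespace directly before CAN, then the word itself.
            while out and out[-1].isspace():
                out.pop()
            while out and not out[-1].isspace():
                out.pop()
        else:
            out.append(ch)
    return "".join(out)


corrected = apply_cancel_controls("helo" + CCH + "lo")
revised = apply_cancel_controls("send the draft" + CAN + "final copy")
```

Here `corrected` is `"hello"` (CCH removes the stray `o`) and `revised` is `"send the final copy"` (CAN removes the word `draft`). Unlike backspace, CCH in this sketch never overstrikes: it only deletes.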

References