Alias(es) | ISO-IR-53 |
---|---|
Standard | ISO 5426 |
Other related encoding(s) | |
ISO 5426 ("Extension of the Latin alphabet coded character set for bibliographic information interchange") is a character set developed by ISO, [1] similar to ISO/IEC 6937. It was first published in 1983. [2]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | ¡ | „ | £ | $ | ¥ | † | § | ʹ | ‘ | “ | « | ♭ | © | ℗ | ® |
3x | ʿ | ʾ | ‚ | ‡ | · | ʺ | ’ | ” | » | ♯ | ʹ | ʺ | ¿ | |||
4x | ◌̉ | ◌̀ | ◌́ | ◌̂ | ◌̃ | ◌̄ | ◌̆ | ◌̇ | ◌̈ | ◌̈ | ◌̊ | ◌̕ | ◌̒ | ◌̋ | ◌̛ | ◌̌ |
5x | ◌̧ | ◌̨ | ◌̡ | ◌̢ | ◌̥ | ◌̮ | ◌̣ | ◌̤ | ◌̲ | ◌̳ | ◌̩ | ◌̭ | ◌︠ | ◌︡ | ◌︣ | |
6x | Æ | Đ | IJ | Ł | Ø | Œ | Þ | |||||||||
7x | æ | đ | ð | ı | ij | ł | ø | œ | ß | þ | DEL |
ISO 5426-2 ("Latin characters used in minor European languages and obsolete typography") is a second part to ISO 5426, published in 1996. [4] It specifies a set of 70 characters, some of which do not exist in Unicode. Michael Everson proposed the missing characters for Unicode 3.0, but some were postponed for further study. Later, new evidence was found, and more of them were encoded. P with belt is probably an error for P with flourish. [5]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | / 002F | ✶ 2736 | ¶ 00B6 | ☞ 261E | ⁌ 204C | ☙ 2619 | δ 03B4 | ⁊ 204A | � | ꝝ A75D | � | ꝯ A76F | ꝭ A76D | ꝰ A770 | |
3x | ´ 00B4 | ※ 203B | ⁋ 204B | ✠ 2720 | ⁍ 204D | ❧ 2767 | ℺ 213A | � | � | Ↄ 2183 | ꝫ A76B | � | ꝛ A75B | |||
4x | ◌̓ 0313 | ◌ᷣ 1DE3 | � | ◌᪰ 1AB0 | ◌᷈ 1DC8 | ◌ͣ 0363 | ◌ͤ 0364 | ◌ͦ 0366 | ◌ᷦ 1DE6 | ◌̴ 0334 | ◌̵ 0335 | ◌̸ 0338 | ◌̷ 0337 | |||
5x | ||||||||||||||||
6x | Ʒ 01B7 | Ǥ 01E4 | Ħ 0126 | Kʼ 004B 02BC | Ŋ 014A | � | Ꝓ A752 | Ꝑ A750 | Ꝗ A756 | Ʀ 01A6 | Ŧ 0166 | Ƿ 01F7 | Ȝ 021C | ꝙ A759 | ſ 017F | |
7x | ʒ 0292 | ǥ 01E5 | ħ 0127 | ĸ 0138 | ŋ 014B | ᵱ 1D71 | ꝓ A753 | ꝑ A751 | ꝗ A757 | ʀ 0280 | ŧ 0167 | ƿ 01BF | ȝ 021D | qꝫ 0071 A76B | � | DEL |
� Not in Unicode
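As a rough illustration of how the chart can be read as a byte-to-Unicode mapping, the sketch below transcribes only row 7x of the ISO 5426-2 table. It assumes the cells of that row correspond, left to right, to bytes 0x70 through 0x7D, which the chart suggests because the row carries all sixteen cells. The names `ROW_7X` and `decode_row_7x` are hypothetical, and any byte outside the transcribed row, including the position marked as not in Unicode, is rendered as U+FFFD.

```python
# Row 7x of the ISO 5426-2 chart, transcribed from the code points in the table.
# Assumption: cells map left to right onto bytes 0x70..0x7D.
ROW_7X = {
    0x70: "\u0292",   # ʒ
    0x71: "\u01E5",   # ǥ
    0x72: "\u0127",   # ħ
    0x73: "\u0138",   # ĸ
    0x74: "\u014B",   # ŋ
    0x75: "\u1D71",   # ᵱ
    0x76: "\uA753",   # ꝓ
    0x77: "\uA751",   # ꝑ
    0x78: "\uA757",   # ꝗ
    0x79: "\u0280",   # ʀ
    0x7A: "\u0167",   # ŧ
    0x7B: "\u01BF",   # ƿ
    0x7C: "\u021D",   # ȝ
    0x7D: "q\uA76B",  # qꝫ, a two-code-point sequence in the chart
}

REPLACEMENT = "\uFFFD"  # stands in for anything this sketch does not cover


def decode_row_7x(data: bytes) -> str:
    """Map each byte through ROW_7X, substituting U+FFFD for unknown bytes."""
    return "".join(ROW_7X.get(b, REPLACEMENT) for b in data)
```

Note that one byte can expand to more than one code point (0x7D here), so the decoded string may be longer than the input.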
Extended Binary Coded Decimal Interchange Code is an eight-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding six-bit binary-coded decimal code used with most of IBM's computer peripherals of the late 1950s and early 1960s. It is supported by various non-IBM platforms, such as Fujitsu-Siemens' BS2000/OSD, OS-IV, MSP, and MSP-EX, the SDS Sigma series, Unisys VS/9, Unisys MCP and ICL VME.
ISO/IEC 8859 is a joint ISO and IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC 8859-1, ISO/IEC 8859-2, etc. There are 15 parts, excluding the abandoned ISO/IEC 8859-12. The ISO working group maintaining this series of standards has been disbanded.
ISO/IEC 646 is a set of ISO/IEC standards, described as Information technology — ISO 7-bit coded character set for information interchange and developed in cooperation with ASCII at least since 1964. Since its first edition in 1967 it has specified a 7-bit character code from which several national standards are derived.
ISO/IEC 8859-10:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 10: Latin alphabet No. 6, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1992. It is informally referred to as Latin-6. It was designed to cover the Nordic languages, deemed of more use for them than ISO 8859-4.
ISO/IEC 8859-12 would have been part 12 of the ISO/IEC 8859 character encoding standard series.
ISO/IEC 8859-14:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic), is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1998. It is informally referred to as Latin-8 or Celtic. It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton.
ArmSCII or ARMSCII is a set of obsolete single-byte character encodings for the Armenian alphabet defined by Armenian national standard 166–9. ArmSCII is an acronym for Armenian Standard Code for Information Interchange, similar to ASCII for the American standard. It has been superseded by the Unicode standard.
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.
T.61 is an ITU-T Recommendation for a Teletex character set. T.61 predated Unicode, and was the primary character set in ASN.1 used in early versions of X.500 and X.509 for encoding strings containing characters used in Western European languages. It is also used by older versions of LDAP. While T.61 continues to be supported in modern versions of X.500 and X.509, it has been deprecated in favor of Unicode. It is also called Code page 1036, CP1036, or IBM 01036.
Kra is a glyph formerly used to write the Kalaallisut language of Greenland and is now only found in Inuttitut, a distinct Inuktitut dialect. It is visually similar to a Latin small capital letter K, a Greek letter Kappa: κ, or a Cyrillic small letter Ka: к.
The C0 and C1 control code or control character sets define control codes for use in text by computer systems that use ASCII and derivatives of ASCII. The codes represent additional information about the text, such as the position of a cursor, an instruction to start a new line, or a message that the text has been received.
〒 is the service mark of Japan Post and its successor, Japan Post Holdings, the postal operator in Japan. It is also used as a Japanese postal code mark since the introduction of the latter in 1968. Historically, it was used by the Ministry of Communications, which operated the postal service. The mark is a stylized katakana syllable te (テ), from the word teishin. The mark was introduced on February 8, 1887.
T.51 / ISO/IEC 6937:2001, Information technology — Coded graphic character set for text communication — Latin alphabet, is a multibyte extension of ASCII, or more precisely ISO/IEC 646-IRV. It was developed jointly with ITU-T for telematic services under the name T.51, and first became an ISO standard in 1983. Certain byte codes are used as lead bytes for letters with diacritics. The value of the lead byte indicates which diacritic the letter has, and the follow byte then holds the ASCII value of the letter that carries the diacritic.
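The lead-byte scheme described above can be sketched as a small decoder. This is an illustration under stated assumptions rather than a complete ISO/IEC 6937 implementation: the `LEAD_BYTES` table covers only five diacritics, its byte values are taken on the assumption that the non-spacing diacritics sit in the 0xC1 to 0xCF range, and the function name `decode_6937_sketch` is hypothetical. Other high bytes are rejected instead of being guessed at.

```python
import unicodedata

# Assumed subset of lead bytes -> Unicode combining marks.
LEAD_BYTES = {
    0xC1: "\u0300",  # grave
    0xC2: "\u0301",  # acute
    0xC3: "\u0302",  # circumflex
    0xC4: "\u0303",  # tilde
    0xC8: "\u0308",  # diaeresis
}


def decode_6937_sketch(data: bytes) -> str:
    """Decode ASCII plus the lead-byte/follow-byte pairs listed in LEAD_BYTES."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b in LEAD_BYTES:
            if i + 1 >= len(data):
                raise ValueError("lead byte at end of input")
            follow = data[i + 1]
            if follow >= 0x80:
                raise ValueError("follow byte must be an ASCII letter value")
            # In the encoded form the diacritic precedes the letter; Unicode
            # puts the combining mark after the base, so swap and compose.
            out.append(unicodedata.normalize("NFC", chr(follow) + LEAD_BYTES[b]))
            i += 2
        elif b < 0x80:
            out.append(chr(b))
            i += 1
        else:
            raise ValueError(f"byte 0x{b:02X} is not covered by this sketch")
    return "".join(out)
```

For example, under these assumptions `decode_6937_sketch(b"caf\xC2e")` yields the four-character string "café", because the pair 0xC2 0x65 composes to U+00E9.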
A signature mark, in traditional bookbinding, is a letter, number or combination of either or both, which is printed at the bottom of the first page, or leaf, of a section.
ISO 6438:1983, Documentation — African coded character set for bibliographic information interchange, is an ISO standard for an 8-bit character encoding for African languages. Developed separately from the African reference alphabet but apparently based on the same data sets, it has had little use; its forms are retained in Unicode. FreeDOS calls this Code Page 65504.
KPS 9566 is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.
The Universal Coded Character Set is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS), which is the basis of many character encodings and continues to grow as characters from previously unrepresented writing systems are added.
ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that develops and facilitates standards within the field of coded character sets. The international secretariat of ISO/IEC JTC 1/SC 2 is the Japanese Industrial Standards Committee (JISC), located in Japan. SC 2 is responsible for the development of the Universal Coded Character Set standard, which is the international standard corresponding to the Unicode Standard.
ISO/IEC 10367:1991 is a standard developed by ISO/IEC JTC 1/SC 2, defining graphical character sets for use in character encodings implementing levels 2 and 3 of ISO/IEC 4873.
ISO-IR-197 is an 8-bit, single-byte character encoding which was designed for the Sámi languages. It is a modification of ISO 8859-1, replacing certain punctuation and symbol characters with additional letters used in certain Sámi orthographies. FreeDOS calls it code page 59187.