ISO 5426

Alias(es)	ISO-IR-53
Standard	ISO 5426
Other related encoding(s)	ETS 300 706; ISO 6937 / ITU T.51; ITU T.61; NeXT Multinational; PostScript Standard Encoding; ITU T.101

Last updated January 29, 2025

ISO 5426 ("Extension of the Latin alphabet coded character set for bibliographic information interchange") is a character set developed by ISO,^[1] similar to ISO/IEC 6937. It was first published in 1983.^[2]

Character set

ISO 5426^[1]^[3]
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
0x	NUL	SOH	STX	ETX	EOT	ENQ	ACK	BEL	BS	HT	LF	VT	FF	CR	SO	SI
1x	DLE	DC1	DC2	DC3	DC4	NAK	SYN	ETB	CAN	EM	SUB	ESC	FS	GS	RS	US
2x	SP	¡	„	£	$	¥	†	§	ʹ	‘	“	«	♭	©	℗	®
3x	ʿ	ʾ	‚				‡	·	ʺ	’	”	»	♯	ʹ	ʺ	¿
4x	◌̉	◌̀	◌́	◌̂	◌̃	◌̄	◌̆	◌̇	◌̈	◌̈	◌̊	◌̕	◌̒	◌̋	◌̛	◌̌
5x	◌̧	◌̨	◌̡	◌̢	◌̥	◌̮	◌̣	◌̤	◌̲	◌̳	◌̩	◌̭		◌︠	◌︡	◌︣
6x		Æ	Đ				Ĳ		Ł	Ø	Œ		Þ
7x		æ	đ	ð		ı	ĳ		ł	ø	œ	ß	þ			DEL
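
How the table might be used to convert text to Unicode can be sketched as follows. This is a minimal illustration, not a complete or authoritative decoder. It assumes an 8-bit stream in which bytes 0x00–0x7F are plain ASCII and rows 2x–7x of the table sit at 0xA0–0xFF (table position plus 0x80). It also assumes that, as in ISO/IEC 6937, a non-spacing diacritic byte precedes the base letter, so it has to be moved after the letter for Unicode. The names `SPACING`, `COMBINING` and `decode_iso5426` are hypothetical, and only a handful of cells from the table are included.

```python
import unicodedata

# Partial mapping copied from the table above (table position -> Unicode).
# Assumption: only a few cells are listed; a real decoder needs all of them.
SPACING = {
    0x21: "\u00A1",  # ¡
    0x23: "\u00A3",  # £
    0x61: "\u00C6",  # Æ
    0x68: "\u0141",  # Ł
    0x69: "\u00D8",  # Ø
    0x71: "\u00E6",  # æ
    0x78: "\u0142",  # ł
    0x79: "\u00F8",  # ø
    0x7B: "\u00DF",  # ß
}

# Non-spacing diacritics from rows 4x and 5x of the table.
COMBINING = {
    0x41: "\u0300",  # grave
    0x42: "\u0301",  # acute
    0x43: "\u0302",  # circumflex
    0x44: "\u0303",  # tilde
    0x48: "\u0308",  # diaeresis
    0x4A: "\u030A",  # ring above
    0x4F: "\u030C",  # caron
    0x50: "\u0327",  # cedilla
}


def decode_iso5426(data: bytes) -> str:
    """Decode bytes under the assumptions stated above (hypothetical helper)."""
    out: list[str] = []
    pending: list[str] = []  # diacritics waiting for their base letter
    for byte in data:
        if byte < 0x80:
            ch = chr(byte)  # assumed ASCII half
        else:
            pos = byte - 0x80  # assumed offset into the table
            if pos in COMBINING:
                pending.append(COMBINING[pos])
                continue
            ch = SPACING.get(pos, "\uFFFD")  # unknown cell -> U+FFFD
        # Unicode puts combining marks after the base character.
        out.append(ch + "".join(pending))
        pending.clear()
    out.extend(pending)  # keep any dangling diacritics at end of input
    # Compose base + mark into a precomposed letter where one exists.
    return unicodedata.normalize("NFC", "".join(out))


print(decode_iso5426(b"caf\xC2e"))  # café
```

Under these assumptions the byte 0xC2 (table position 0x42, acute accent) is held back until the following `e` arrives, and NFC normalization then yields the precomposed letter.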

ISO 5426-2

ISO 5426-2 ("Latin characters used in minor European languages and obsolete typography") is a second part of ISO 5426, published in 1996.^[4] It specifies a set of 70 characters, some of which do not exist in Unicode. Michael Everson proposed the missing characters in Unicode 3.0, but some were postponed for further study. More were encoded later, once new evidence was found. "P with belt" is an error for "P with flourish", and "P with middle tilde" is an error for "P with squirrel tail".^[5] The character at 0x42 will be encoded at U+1ACF in Unicode 17.0.

ISO 5426-2^[6]^[7]
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
0x	NUL	SOH	STX	ETX	EOT	ENQ	ACK	BEL	BS	HT	LF	VT	FF	CR	SO	SI
1x	DLE	DC1	DC2	DC3	DC4	NAK	SYN	ETB	CAN	EM	SUB	ESC	FS	GS	RS	US
2x	SP	/ 002F	✶ 2736	¶ 00B6	☞ 261E	⁌ 204C	☙ 2619	δ 03B4		⁊ 204A	ꝯ A76F	ꝝ A75D	�	�	Ꝭ A76C	ꝰ A770
3x		´ 00B4	※ 203B	⁋ 204B	✠ 2720	⁍ 204D	❧ 2767	℺ 213A		⁊̴ 204A 0334	�	Ↄ 2183	ꝫ A76B	ꝭ A76D	ꝛ A75B
4x	◌̓ 0313	◌ᷣ 1DE3	�	◌᪰ 1AB0	◌᷈ 1DC8	◌ͣ 0363	◌ͤ 0364	◌ͦ 0366	◌ᷦ 1DE6	◌̴ 0334	◌̵ 0335	◌̸ 0338	◌̷ 0337
5x
6x	Ʒ 01B7	Ǥ 01E4	Ħ 0126	Kʼ 004B 02BC	Ŋ 014A	Ꝕ A754	Ꝓ A752	Ꝑ A750	Ꝗ A756	Ʀ 01A6	Ŧ 0166	Ƿ 01F7	Ȝ 021C	ꝙ A759	ſ 017F
7x	ʒ 0292	ǥ 01E5	ħ 0127	ĸ 0138	ŋ 014B	ꝕ A755	ꝓ A753	ꝑ A751	ꝗ A757	ʀ 0280	ŧ 0167	ƿ 01BF	ȝ 021D	qꝫ 0071 A76B	�	DEL

� Not in Unicode
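
The cell notation in the table above (a glyph followed by one or more hexadecimal code points, or a marker for "not in Unicode") can be read mechanically. The sketch below is only an illustration of that notation: `CELLS` and `cell_to_text` are hypothetical names, the dictionary holds just a few cells copied from the table, and `None` is an assumed way to represent a cell flagged as not in Unicode.

```python
from typing import Optional

# A few cells copied from the table above, keyed by table position.
# Each value is the code point list exactly as printed in the cell;
# None stands for a cell marked as not in Unicode.
CELLS: dict[int, Optional[str]] = {
    0x23: "00B6",       # pilcrow
    0x42: None,         # not in Unicode per the table
    0x63: "004B 02BC",  # K followed by modifier apostrophe
    0x73: "0138",       # kra
    0x7D: "0071 A76B",  # q followed by et
    0x7E: None,         # not in Unicode per the table
}


def cell_to_text(position: int) -> Optional[str]:
    """Return the Unicode string for a table position, or None if unmapped."""
    cell = CELLS[position]
    if cell is None:
        return None
    # A cell may list several code points; join them into one string.
    return "".join(chr(int(cp, 16)) for cp in cell.split())


print(cell_to_text(0x63))  # two code points: U+004B U+02BC
print(cell_to_text(0x42))  # None
```

Returning `None` rather than a substitute character leaves the caller to decide how to handle positions that have no Unicode equivalent.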


References

[1] Schneider, Wayne (2000-11-01). "ISO 5426-1980 to Unicode 3.0 mapping table". epixtech. Retrieved 2020-05-01.
[2] ISO/IEC JTC 1/SC 2 (1983). "ISO 5426:1983: Extension of the Latin alphabet coded character set for bibliographic information interchange". ISO.
[3] ISO TC 46/SC 4 (1982-06-01). Extension of the Latin alphabet coded character set for bibliographic interchange (PDF). ITSCJ/IPSJ. ISO-IR-53.
[4] ISO/IEC JTC 1/SC 2 (1996). "ISO 5426-2:1996: Information and documentation — Extension of the Latin alphabet coded character set for bibliographic information interchange — Part 2: Latin characters used in minor European languages and obsolete typography". ISO.
[5] Everson, Michael; et al. (2006-01-30). "Proposal to add medievalist characters to the UCS" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N3027; L2/06-027.
[6] National Standards Authority of Ireland (1998-07-06). "Application for Registration No.213, Supplementary minor European and obsolete typographical Latin set" (PDF). ISO/IEC JTC 1/SC 2 N 3126. Retrieved 2019-10-13.
[7] Aliprand, Joan M. (2002-05-14). "Status of Mapping between Characters of ISO 5426-2 and ISO/IEC 10646-1 (UCS)". ISO/IEC JTC 1/SC 2/WG 2 N2464. Retrieved 2019-10-13.

