Code page 1046 (CCSID 1046 and euro sign extended CCSID 9238), also known as Arabic Extended-Euro, is used by IBM platforms in Egypt, Iraq, Jordan, Saudi Arabia, and Syria for Arabic. [1] [2] [3] It is similar to the DOS code page 1127.
Code page 1046 [4] [5] | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
2x | SP | ! | " | # | $ | ٪ | & | ' | ( | ) | ٭ | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
8x | ﺈ | × | ÷ | س | ش | ص | ض | ﹱ | ■ | │ | ─ | ┐ | ┌ | └ | ┘ | |
9x | ﹹ | ﹻ | ﹽ | ﹿ | ﹷ | ﺊ | ﻰ | ﻳ | ﻲ | ﻎ | ﻏ | ﻐ | ﻶ | ﻸ | ﻺ | ﻼ |
Ax | NBSP | � | � | � | ¤ | � | ئ | ﺑ | ﺗ | ﺛ | ﺟ | ﺣ | ، | SHY | ﺧ | ﺳ |
Bx | ٠ | ١ | ٢ | ٣ | ٤ | ٥ | ٦ | ٧ | ٨ | ٩ | ﺷ | ؛ | ﺻ | ﺿ | ﻊ | ؟ |
Cx | ﻋ | ء | آ | أ | ؤ | إ | ئ | ﺍ | ب | ة | ت | ث | ج | ح | خ | د |
Dx | ذ | ر | ز | س | ش | ص | ض | ط | ظ | ع | غ | ﻌ | ﺂ | ﺄ | ﺎ | ﻓ |
Ex | ـ | ف | ق | ك | ل | م | ن | ﻫ | و | ى | ي | ◌ً | ◌ٌ | ◌ٍ | ◌َ | ◌ُ |
Fx | ◌ِ | ◌ّ | ◌ْ | ﻗ | ﻛ | ﻟ | ﹳ | ﻵ | ﻷ | ﻹ | ﻻ | ﻣ | ﻧ | ﻬ | ه | € |
Not in Unicode, mapped to private use area: second (left) halves of لآ, لأ, لإ and لا respectively, for use in conjunction with the U+FEDFﻟ at 0xF5. Notice also the pre-composed forms of this ligature at 0x9C–9F and 0xF7–FA. |
Code page 1029 is an older variant of Code page 1046.
Code page 1029 [6] | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
2x | SP | ! | " | # | $ | ٪ | & | ' | ( | ) | ٭ | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
8x | ||||||||||||||||
9x | ||||||||||||||||
Ax | NBSP | � | � | � | ¤ | ئ | � | ﺑ | ﺗ | ﺛ | ﺟ | ﺣ | ، | SHY | ﺧ | ﺳ |
Bx | ٠ | ١ | ٢ | ٣ | ٤ | ٥ | ٦ | ٧ | ٨ | ٩ | ﺷ | ؛ | ﺻ | ﺿ | ﻊ | ؟ |
Cx | ﻋ | ء | آ | أ | ؤ | إ | ئ | ﺍ | ب | ة | ت | ث | ج | ح | خ | د |
Dx | ذ | ر | ز | س | ش | ص | ض | ط | ظ | ع | غ | ﻌ | ﻎ | ﻏ | ﻐ | ﻓ |
Ex | ـ | ف | ق | ك | ل | م | ن | ه | و | ى | ي | ◌ً | ◌ٌ | ◌ٍ | ◌َ | ◌ُ |
Fx | ◌ِ | ◌ّ | ◌ْ | ﻗ | ﻛ | ﻟ | ﻣ | ﻧ | ﻫ | ﻳ | ﹷ | ﹹ | ﹻ | ﹽ | ﹿ | |
Not in Unicode, mapped to private use area: second (left) halves of لآ, لأ, لإ and لا respectively, for use in conjunction with the U+FEDFﻟ at 0xF5. Code page 1029 does not include the pre-composed forms of this ligature found in code page 1046. Differences from code page 1046. |
ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined.
Windows-1258 is a code page used in Microsoft Windows to represent Vietnamese texts. It makes use of combining diacritical marks.
Code page 855 is a code page used under DOS to write Cyrillic script.
Windows-1255 is a code page used under Microsoft Windows to write Hebrew. It is an almost compatible superset of ISO-8859-8 – most of the symbols are in the same positions, but Windows-1255 adds vowel-points and other signs in lower positions.
Windows-1256 is a code page used under Microsoft Windows to write Arabic and other languages that use Arabic script, such as Persian and Urdu.
Windows-1257 is an 8-bit, single-byte extended ASCII code page used to support the Estonian, Latvian and Lithuanian languages under Microsoft Windows. In Lithuania, it is standardised as LST 1590-3, alongside a modified variant named LST 1590-4.
Code page 857 is a code page used under DOS in Turkey to write Turkish.
Code page 950 is the code page used on Microsoft Windows for Traditional Chinese. It is Microsoft's implementation of the de facto standard Big5 character encoding. The code page is not registered with IANA, and hence, it is not a standard to communicate information over the internet, although it is usually labelled simply as big5
, including by Microsoft library functions.
Code page 720 is a code page used under DOS to write Arabic in Egypt, Iraq, Jordan, Saudi Arabia, and Syria. The Windows (ANSI) code page for Arabic is Windows-1256.
Code page 864 is a code page used to write Arabic in Egypt, Iraq, Jordan, Saudi Arabia, and Syria.
Code page 867 is a Hebrew 8-bit code page defined by IBM in 1998. It is based on Code page 862 but replaces several characters not used in Hebrew with nonprinting characters for bidirectional text support, a euro sign and a shekel sign.
Code page 856, is a code page used under DOS for Hebrew in Israel.
Code page 896, called Japan 7-Bit Katakana Extended, is IBM's code page for code-set G2 of EUC-JP, a 7-bit code page representing the Kana set of JIS X 0201 and accompanying Code page 895 which corresponds to the lower half of that standard. It encodes half-width katakana.
Code page 921 is a code page used under IBM AIX and DOS to write the Estonian, Latvian, and Lithuanian languages. It is an extension of ISO/IEC 8859-13.
Code page 922 is a code page used under IBM AIX and DOS to write the Estonian language. It is an extension and modification of ISO/IEC 8859-1, where the letters Ð/ð and Þ/þ used for Icelandic are replaced by the letters Š/š and Ž/ž respectively. This matches the encoding of these letters in Windows-1257 and ISO/IEC 8859-13.
Code page 1006, also known as ISO 8-bit Urdu, is used by IBM in its AIX operating system in Pakistan for Urdu.
Code page 1008, also known as ISO 8-bit Arabic, is used by IBM in its AIX operating system.
Code page 1042, also known as Simplified Chinese PC Data Extended, is a single byte character set (SBCS) used by IBM in its PC DOS operating system in China. This code page is intended for use with code page 928. It is an extension of Code page 903.
Code page 1115, also known as Simplified Chinese PC Data, is a single byte character set (SBCS) used by IBM in its PC DOS operating system in China.
Code page 1127, also known as Arabic / French PC Data, is used by IBM in its PC DOS operating system. It is closely related to code page 1046.