Arabic Extended-B | |
---|---|
Range | U+0870..U+089F (48 code points) |
Plane | BMP |
Scripts | Arabic (41 char.) |
Major alphabets | Bosnian Javanese Malagasy Sundanese |
Assigned | 41 code points |
Unused | 7 reserved code points |
Unicode version history | |
14.0 (2021) | 41 (+41) |
Code chart Note: [1] [2] |
Arabic Extended-B is a Unicode block encoding Qur'anic annotations and letter variants used for various non-Arabic languages. The block also includes currency symbols and an abbreviation mark. [3]
Arabic Extended-B [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+087x | ࡰ | ࡱ | ࡲ | ࡳ | ࡴ | ࡵ | ࡶ | ࡷ | ࡸ | ࡹ | ࡺ | ࡻ | ࡼ | ࡽ | ࡾ | ࡿ |
U+088x | ࢀ | ࢁ | ࢂ | ࢃ | ࢄ | ࢅ | ࢆ | ࢇ | ࢈ | ࢉ | ࢊ | ࢋ | ࢌ | ࢍ | ࢎ | |
U+089x | | | ࢘ | ࢙ | ࢚ | ࢛ | ࢜ | ࢝ | ࢞ | ࢟ | ||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic Extended-B block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
14.0 | U+0870..0888, 089D..089F | 28 | L2/19-306 | N5142 | Pournader, Roozbeh; Anderson, Deborah (2019-09-29), Arabic additions for Quranic orthographies |
L2/19-343 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-10-06), "a. Additions for Quranic orthographies", Recommendations to UTC #161 October 2019 on Script Proposals | ||||
L2/19-323 | Moore, Lisa (2019-10-01), "Consensus 161-C4", UTC #161 Minutes | ||||
L2/20-105 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-04-20), "3f. Comments on L2/19-306", Recommendations to UTC #163 April 2020 on Script Proposals | ||||
U+0889..088A | 2 | L2/19-339 | Jacquerye, Denis Moyogo (2019-10-03), Proposal to encode Bosnian Arabic characters | ||
L2/19-343 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-10-06), "d. Bosnian Arabic characters", Recommendations to UTC #161 October 2019 on Script Proposals | ||||
L2/19-323 | Moore, Lisa (2019-10-01), "C.6.5", UTC #161 Minutes | ||||
U+088B..088D | 3 | L2/19-340 | Jacquerye, Denis Moyogo (2019-10-03), Proposal to encode Javanese and Sundanese Arabic characters | ||
L2/19-323 | Moore, Lisa (2019-10-01), "C.6.6", UTC #161 Minutes | ||||
U+088E | 1 | L2/20-071R | Pournader, Roozbeh; Izadpanah, Borna (2020-05-01), Proposal to encode an Arabic tail character used for abbreviation | ||
L2/20-105 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-04-20), "3b. Arabic Tail Character", Recommendations to UTC #163 April 2020 on Script Proposals | ||||
L2/20-102 | Moore, Lisa (2020-05-06), "Consensus 163-C26", UTC #163 Minutes | ||||
U+0890..0891 | 2 | L2/20-245 | Hosny, Khaled; Pournader, Roozbeh (2020-09-09), Proposal to encode three Arabic symbols | ||
L2/20-250 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "5a. Three Symbols", Recommendations to UTC #165 October 2020 on Script Proposals | ||||
L2/20-237 | Moore, Lisa (2020-10-27), "Consensus 165-C15", UTC #165 Minutes | ||||
U+0898..089C | 5 | L2/20-089 | Syarifuddin, M. Mahali (2020-02-28), Proposal to Encode Characters from Indonesian Orthography of Quran | ||
L2/20-105 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-04-20), "3c. Indonesian Orthography of Quran", Recommendations to UTC #163 April 2020 on Script Proposals | ||||
L2/20-102 | Moore, Lisa (2020-05-06), "Consensus 163-C14", UTC #163 Minutes | ||||
|
Miscellaneous Symbols is a Unicode block (U+2600–U+26FF) containing glyphs representing concepts from a variety of categories: astrological, astronomical, chess, dice, musical notation, political symbols, recycling, religious symbols, trigrams, warning signs, and weather, among others.
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.
Miscellaneous Symbols and Arrows is a Unicode block containing arrows and geometric shapes with various fills, astrological symbols, technical symbols, intonation marks, and others.
Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 and also legacy characters from the ISO 6937 standard.
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.
Arabic is a Unicode block, containing the standard letters and the most common diacritics of the Arabic script, and the Arabic-Indic digits.
Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically for internal use.
Arabic Extended-A is a Unicode block encoding Qur'anic annotations and letter variants used for various non-Arabic languages.
Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint, ZWNBSP is also here, which is used as a byte order mark. Its block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.
Georgian is a Unicode block containing the Mkhedruli and Asomtavruli Georgian characters used to write Modern Georgian, Svan, and Mingrelian languages. Another lower case, Nuskhuri, is encoded in a separate Georgian Supplement block, which is used with the Asomtavruli to write the ecclesiastical Khutsuri Georgian script.
CJK Compatibility Ideographs is a Unicode block created to contain Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. Such encodings include the South Korean KS X 1001:1998, Taiwanese Big5, Japanese IBM 32, South Korean KS X 1001:2004, Japanese JIS X 0213, Japanese ARIB STD-B24 and the North Korean KPS 10721-2000 source standards.
Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.
General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.
Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.
Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors. Each variation selector is used to specify a specific glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS2, VS3, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.
Geometric Shapes Extended is a Unicode block containing Webdings/Wingdings symbols, mostly different weights of squares, crosses, and saltires, and different weights of variously spoked asterisks ,stars, and various color squares and circles for emoji.