Elymaic | |
---|---|
Range | U+10FE0..U+10FFF (32 code points) |
Plane | SMP |
Scripts | Elymaic |
Assigned | 23 code points |
Unused | 9 reserved code points |
Unicode version history | |
12.0 | 23 (+23) |
Note: [1] [2] |
Elymaic is a Unicode block containing characters for the Elymaic alphabet, used in the ancient state of Elymais. [3]
Elymaic [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+10FEx | 𐿠 | 𐿡 | 𐿢 | 𐿣 | 𐿤 | 𐿥 | 𐿦 | 𐿧 | 𐿨 | 𐿩 | 𐿪 | 𐿫 | 𐿬 | 𐿭 | 𐿮 | 𐿯 |
U+10FFx | 𐿰 | 𐿱 | 𐿲 | 𐿳 | 𐿴 | 𐿵 | 𐿶 | |||||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Elymaic block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
12.0 | U+10FE0..10FF6 | 23 | L2/17-055 | Pandey, Anshuman (2017-02-01), Preliminary proposal to encode the Elymaic script | |
L2/17-255 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-07-28), "5. Elymaean", Recommendations to UTC #152 July-August 2017 on Script Proposals | ||||
L2/17-384 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-10-22), "6. Elymaic", Recommendations to UTC #153 October 2017 on Script Proposals | ||||
L2/17-226R2 | N4916 | Pandey, Anshuman (2017-10-23), Proposal to encode the Elymaic script in Unicode (revised) | |||
L2/17-362 | Moore, Lisa (2018-02-02), "Consensus 153-C29", UTC #153 Minutes | ||||
N5020 (pdf, doc) | Umamaheswaran, V. S. (2019-01-11), "7.4.3", Unconfirmed minutes of WG 2 meeting 67 | ||||
|
Unicode is a information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard is maintained by the Unicode Consortium, and as of March 2020 the most recent version, Unicode 13.0, contains a repertoire of 143,924 characters covering 154 modern and historic scripts, as well as multiple symbol sets and emoji. The character repertoire of the Unicode Standard is synchronized with ISO/IEC 10646, and both are code-for-code identical.
The Standard Compression Scheme for Unicode (SCSU) is a Unicode Technical Standard for reducing the number of bytes needed to represent Unicode text, especially if that text uses mostly characters from one or a small number of per-language character blocks. It does so by dynamically mapping values in the range 128–255 to offsets within particular blocks of 128 characters. The initial conditions of the encoder mean that existing strings in ASCII and ISO-8859-1 that do not contain C0 control codes other than NULL TAB CR and LF can be treated as SCSU strings. Since most alphabets do reside in blocks of contiguous Unicode codepoints, texts that use small alphabets and either ASCII punctuation or punctuation that fits within the window for the main alphabet can be encoded at one byte per character, most other punctuation can be encoded at 2 bytes per symbol through non-locking shifts. SCSU can also switch to UTF-16 internally to handle non-alphabetic languages.
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0-25FF.
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context.
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters, shade characters, and terminal graphic characters. These can be used for filling regions of the screen and portraying drop shadows.
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. In the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 12.0, Unicode defines a total of 87,887 CJK Unified Ideographs.
The Unicode Standard assigns character properties to each code point. These properties can be used to handle "characters" in processes, like in line-breaking, script direction right-to-left or applying controls. Slightly inconsequently, some "character properties" are also defined for code points that have no character assigned, and code points that are labeled like "<not a character>". The character properties are described in Standard Annex #44.
CJK Compatibility is a Unicode block containing square symbols encoded for compatibility with east Asian character sets.
Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars.
Tirhuta is a Unicode block containing characters for Brahmi-derived Tirhuta script which was the primary writing system for Maithili in Bihar, India and Madhesh, Nepal until the 20th century.
The Elymaic alphabet is a right-to-left, non-joining abjad. It is derived from the Aramaic alphabet. Elymaic was used in the ancient state of Elymais, which was a semi-independent state of the 2nd century BCE to the early 3rd century CE, frequently a vassal under Parthian control, in the present-day region of Khuzestan, Iran (Susiana).
Egyptian Hieroglyph Format Controls is a Unicode block containing formatting characters that enable full formatting of quadrats for Egyptian hieroglyphs.
Nandinagari is a Unicode block containing characters for Nandinagari script, historically used to write Sanskrit in southern India.
Nyiakeng Puachue Hmong is a Unicode block containing characters devised in the 1980s for writing the White Hmong and Green Hmong languages.
Ottoman Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in Ottoman Turkish documents.
Tamil Supplement is a Unicode block containing Tamil historic fractions and symbols.
Wancho is a Unicode block containing the characters of the script used to write the Wancho language.
Symbols and Pictographs Extended-A is a Unicode block containing emoji characters. It extends the set of symbols included in the Supplemental Symbols and Pictographs block.
Small Kana Extension is a Unicode block containing additional small variants for the Hiragana and Katakana syllabaries, in addition to those in the Hiragana, Katakana and Katakana Phonetic Extensions blocks.
CJK Unified Ideographs Extension G is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. It is the first block to be allocated to the Tertiary Ideographic Plane.