Carian | |
---|---|
Range | U+102A0..U+102DF (64 code points) |
Plane | SMP |
Scripts | Carian |
Major alphabets | Carian |
Assigned | 49 code points |
Unused | 15 reserved code points |
Unicode version history | |
5.1 | 49 (+49) |
Note: [1] [2] |
Carian is a Unicode block containing the Masson set and four additional characters for writing the ancient Carian language in Caria and Egypt, where the Carians served as mercenaries.
A Unicode block is one of several contiguous ranges of numeric character codes of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.
The Carian language is an extinct language of the Luwian subgroup of the Anatolian branch of the Indo-European language family. The Carian language was spoken in Caria, a region of western Anatolia between the ancient regions of Lycia and Lydia, by the Carians, a name possibly first mentioned in Hittite sources. Carian is closely related to Lycian and Milyan, and both are closely related to, though not direct descendants of, Luwian. Whether the correspondences between Luwian, Carian, and Lycian are due to direct descent, or are due to dialect geography, is disputed.
Carian [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+102Ax | 𐊠 | 𐊡 | 𐊢 | 𐊣 | 𐊤 | 𐊥 | 𐊦 | 𐊧 | 𐊨 | 𐊩 | 𐊪 | 𐊫 | 𐊬 | 𐊭 | 𐊮 | 𐊯 |
U+102Bx | 𐊰 | 𐊱 | 𐊲 | 𐊳 | 𐊴 | 𐊵 | 𐊶 | 𐊷 | 𐊸 | 𐊹 | 𐊺 | 𐊻 | 𐊼 | 𐊽 | 𐊾 | 𐊿 |
U+102Cx | 𐋀 | 𐋁 | 𐋂 | 𐋃 | 𐋄 | 𐋅 | 𐋆 | 𐋇 | 𐋈 | 𐋉 | 𐋊 | 𐋋 | 𐋌 | 𐋍 | 𐋎 | 𐋏 |
U+102Dx | 𐋐 | |||||||||||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Carian block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
5.1 | U+102A0..102D0 | 49 | L2/00-128 | Bunz, Carl-Martin (2000-03-01), Scripts from the Past in Future Versions of Unicode | |
L2/00-153 | Bunz, Carl-Martin (2000-04-26), Further comments on historic scripts | ||||
L2/05-100 | N2938 | Everson, Michael (2005-04-27), Proposal for encoding the Carian script in the UCS | |||
L2/05-241 | Everson, Michael (2005-08-31), Old Anatolian scripts | ||||
L2/05-386 | N3020R | Everson, Michael (2006-01-12), Proposal to encode the Carian script in the SMP of the UCS | |||
L2/06-008R2 | Moore, Lisa (2006-02-13), "C.2", UTC #106 Minutes | ||||
N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "7.4.2", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 | ||||
N3103 (pdf, doc) | Umamaheswaran, V. S. (2006-08-25), "M48.8", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27 | ||||
|
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0-25FF.
Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorise these characters as being "letterlike".
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context.
Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization.
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters, shade characters, and terminal graphic characters. These can be used for filling regions of the screen and portraying drop shadows.
Specials is a short Unicode block allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five are assigned as of Unicode 12.0:
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages.
Hebrew is a Unicode block containing characters for writing the Hebrew, Yiddish, Ladino, and other Jewish diaspora languages.
Georgian Supplement is a Unicode block containing characters for the ecclesiastical form of the Georgian script, Nuskhuri. To write the full ecclesiastical Khutsuri orthography, the Asomtavruli capitals encoded in the Georgian block.
Devanagari is a Unicode block containing characters for writing languages such as Hindi, Marathi, Sindhi, Nepali, and Sanskrit, among others. In its original incarnation, the code points U+0900..U+0954 were a direct copy of the characters A0-F4 from the 1988 ISCII standard. The Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Hiragana is a Unicode block containing hiragana characters for the Japanese language.
Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.
Katakana Phonetic Extensions is a Unicode block containing additional small katakana characters for writing the Ainu language, in addition to characters in the Katakana block.
Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. During the unification with ISO 10646 for version 1.1, the Japanese Industrial Standard Symbol was reassigned from the code point U+32FF at the end of the block to U+3004. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.
CJK Compatibility Forms is a Unicode block containing vertical glyph variants for east Asian compatibility.
Byzantine Musical Symbols is a Unicode block containing characters for representing Byzantine-era musical notation.
Ancient Greek Musical Notation is a Unicode block containing symbols representing musical notations used in ancient Greece.
Tai Viet is a Unicode block containing characters for writing the Tai languages Tai Dam, Tai Dón, and Thai Song.
CJK Unified Ideographs Extension E is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese.