Old Sogdian (Unicode block)

Last updated
Old Sogdian
RangeU+10F00..U+10F2F
(48 code points)
Plane SMP
Scripts Old Sogdian
Assigned40 code points
Unused8 reserved code points
Unicode version history
11.0 (2018)40 (+40)
Note: [1] [2]

Old Sogdian is a Unicode block containing characters for a group of related, non-cursive Sogdian writing systems used to write historic Sogdian in the 3rd to 5th centuries CE. [3]

Contents

Block

Old Sogdian [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+10F0x𐼀𐼁𐼂𐼃𐼄𐼅𐼆𐼇𐼈𐼉𐼊𐼋𐼌𐼍𐼎𐼏
U+10F1x𐼐𐼑𐼒𐼓𐼔𐼕𐼖𐼗𐼘𐼙𐼚𐼛𐼜𐼝𐼞𐼟
U+10F2x𐼠𐼡𐼢𐼣𐼤𐼥𐼦𐼧
Notes
1. ^ As of Unicode version 13.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Old Sogdian block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
11.0U+10F00..10F2740 L2/00-128 Bunz, Carl-Martin (2000-03-01), Scripts from the Past in Future Versions of Unicode
L2/01-007 Bunz, Carl-Martin (2000-12-21), "Inscriptional Alphabets (Middle Persian, Parthian) and Sogdian vs. Aramaic", Iranianist Meeting Report: Symposium on Encoding Iranian Scripts in Unicode
L2/02-009 Bunz, Carl-Martin (2001-11-23), "Sogdian script", 2nd Iranian Meeting Report
L2/15-149 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Pandey, Anshuman; Glass, Andrew (2015-05-03), "8. Old Sogdian", Recommendations to UTC #143 May 2015 on Script Proposals
L2/15-089R Pandey, Anshuman (2015-11-03), Preliminary Proposal to Encode the Old Sogdian Script
L2/16-037 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-01-22), "10. Old Sogdian", Recommendations to UTC #146 January 2016 on Script Proposals
L2/16-312R N4814 Pandey, Anshuman (2016-12-01), Proposal to encode the Old Sogdian script
L2/17-037 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa; Liang, Hai; Ishida, Richard; Misra, Karan; McGowan, Rick (2017-01-21), "13. Old Sogdian", Recommendations to UTC #150 January 2017 on Script Proposals
L2/17-016 Moore, Lisa (2017-02-08), "D.11", UTC #150 Minutes
L2/17-362 Moore, Lisa (2018-02-02), "Consensus 153-C41", UTC #153 Minutes
  1. Proposed code points and characters names may differ from final code points and names

See also

Font

There is a Unicode font encoding Old Sogdian - Noto Sans Old Sogdian.

Related Research Articles

Unicode Character encoding standard

Unicode is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines 144,762 characters covering 159 modern and historic scripts, as well as symbols, emoji, and non-visual control and formatting codes.

Sogdian alphabet Alphabet for use with the Sogdian language of central Asia

The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia. The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet. It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.

Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles. The letters in various fonts often have specific, fixed meanings in particular areas of mathematics. By providing uniformity over numerous mathematical articles and books, these conventions help to read mathematical formulas.

The Unicode Standard assigns various properties to each Unicode character and code point.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages.

Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.

Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the last of the Basic Multilingual Plane excepting the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars.

Kana Extended-A is a Unicode block containing hentaigana characters. Additional hentaigana characters are encoded in the Kana Supplement block.

Gunjala Gondi is a Unicode block containing characters of Gunjala Gondi script used for writing the Adilabad dialect of the Gondi language.

Hanifi Rohingya is a Unicode block containing characters for Hanifi Rohingya script used for writing the Rohingya language in Myanmar and Bangladesh.

Chess Symbols is a Unicode block containing characters for chess notations beyond the basic Western chess symbols in the Miscellaneous Symbols block, as well as symbols representing game pieces for xiangqi.

Georgian Extended is a Unicode block containing Georgian Mtavruli letters that function as uppercase versions of their Mkhedruli counterparts in the Georgian block. Unlike all other casing scripts in Unicode, there is no title casing between Mkhedruli and Mtavruli letters, because Mtavruli is typically used only in all-caps text, although there have been some historical attempts at capitalization.

Indic Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in India under the Mughals by the 17th century through the middle of the 20th century.

Makasar is a Unicode block containing characters for Makasar script . The script was used historically in South Sulawesi, Indonesia for writing the Makassarese language.

Mayan Numerals is a Unicode block containing characters for the historical Mayan numeral system.

Sogdian is a Unicode block containing characters used to write the Sogdian language from the 7th to 14th centuries CE.

Dogri script Unicode character block

The Dogri script is a writing system originally used for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent. The Takri script version of Jammu is known as Dogra Akkhar

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2018-06-08.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2018-06-08.
  3. "Chapter 14: South and Central Asia-III, Ancient Scripts". The Unicode Standard, Version 11.0 (PDF). Mountain View, CA: Unicode, Inc. June 2018. ISBN   978-1-936213-19-1.