Old Uyghur (Unicode block)

Old Uyghur
Old Uyghur
Range	U+10F70..U+10FAF; (64 code points)
Plane	SMP
Scripts	Old Uyghur
Assigned	26 code points
Unused	38 reserved code points
Unicode version history
14.0 (2021)	26 (+26)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated July 27, 2024

Old Uyghur is a Unicode block containing characters of the Old Uyghur alphabet.

Old Uyghur ^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+10F7x	𐽰	𐽱	𐽲	𐽳	𐽴	𐽵	𐽶	𐽷	𐽸	𐽹	𐽺	𐽻	𐽼	𐽽	𐽾	𐽿
U+10F8x	𐾀	𐾁	𐾂	𐾃	𐾄	𐾅	𐾆	𐾇	𐾈	𐾉
U+10F9x
U+10FAx
Notes 1. ^ As of Unicode version 15.1 2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Old Uyghur block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
14.0	U+10F70..10F89	26	L2/00-128		Bunz, Carl-Martin (2000-03-01), Scripts from the Past in Future Versions of Unicode
			L2/12-066	N4226	Osman, Omarjan (2011-11-07), Proposal for encoding the Uygur script in the SMP
			L2/13-028		Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-01-28), "3. L2/12‐066", Recommendations to UTC on Script Proposals
			L2/13-071		Osman, Omarjan (2013-03-27), Proposal to Encode the Uyghur Script
			L2/13-086		Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-04-26), "16. L2/13‐071", Recommendations to UTC on Script Proposals
			L2/18-168		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Chapman, Chris; Cook, Richard (2018-04-28), "16. Old Uighur", Recommendations to UTC #155 April-May 2018 on Script Proposals
			L2/18-126		Pandey, Anshuman (2018-04-30), Preliminary proposal to encode Old Uyghur
			L2/18-335		Matsui, Dai (2018-05-02), Comments on the preliminary proposal to encode Old Uyghur in Unicode (L2/18-126)
			L2/18-333		Pandey, Anshuman (2018-11-30), Proposal to encode Old Uyghur in Unicode
			L2/19-016		Pandey, Anshuman (2019-01-07), Revised proposal to encode Old Uyghur
			L2/19-047		Anderson, Deborah; et al. (2019-01-13), "10. Old Uyghur", Recommendations to UTC #158 January 2019 on Script Proposals
			L2/20-046		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2020-01-10), "9. Old Uyghur", Recommendations to UTC #162 January 2020 on Script Proposals
			L2/20-003R		Pandey, Anshuman (2020-02-16), Revised proposal to encode Old Uyghur
			L2/20-169		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-07-21), "14. Old Uyghur", Recommendations to UTC #164 July 2020 on Script Proposals
			L2/20-199		Kontovas, Nicholas (2020-07-29), Endorsement of the Old Uyghur encoding proposal L2/20-191
			L2/20-250		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "12. Old Uyghur", Recommendations to UTC #165 October 2020 on Script Proposals
			L2/20-191	N5153	Pandey, Anshuman (2020-12-18), Final proposal to encode Old Uyghur
			L2/21-016R		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-01-14), "14 Old Uyghur", Recommendations to UTC #166 January 2021 on Script Proposals
			L2/21-009		Moore, Lisa (2021-01-27), "B.1 — 14", UTC #166 Minutes
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia. The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet. It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.

Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.

Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English Dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.

Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters, shade characters, and terminal graphic characters. These can be used for filling regions of the screen and portraying drop shadows. Its block name in Unicode 1.0 was Blocks.

Control Pictures is a Unicode block containing characters for graphically representing the C0 control codes, and other control characters. Its block name in Unicode 1.0 was Pictures for Control Codes.

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

Cyrillic Extended-B is a Unicode block containing Cyrillic characters for writing Old Cyrillic and Old Abkhazian, and combining numeric signs for Cyrillic numerals used in early Slavic or Church Slavonic texts.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically for internal use.

Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.

Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Katakana Phonetic Extensions is a Unicode block containing additional small katakana characters for writing the Ainu language, in addition to characters in the Katakana block.

Kana Supplement is a Unicode block containing one archaic katakana character and 255 hentaigana characters. Additional hentaigana characters are encoded in the Kana Extended-A block.

Byzantine Musical Symbols is a Unicode block containing characters for representing Byzantine music in ekphonetic notation.

Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Kana Extended-A is a Unicode block containing hentaigana and historic kana characters. Additional hentaigana characters are encoded in the Kana Supplement block.

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-3] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[1]

[2]

[lower-alpha 1]

Old Uyghur
Range	U+10F70..U+10FAF (64 code points)
Plane	SMP
Scripts	Old Uyghur
Assigned	26 code points
Unused	38 reserved code points
Unicode version history

14.0 (2021)	26 (+26)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]