Old Persian (Unicode block)

Old Persian
Old Persian
Range	U+103A0..U+103DF; (64 code points)
Plane	SMP
Scripts	Old Persian
Major alphabets	Old Persian
Assigned	50 code points
Unused	14 reserved code points
Unicode version history
4.1 (2005)	50 (+50)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated October 08, 2024

Old Persian is a Unicode block containing cuneiform characters for writing the Old Persian language of the Achaemenid Empire.

Old Persian ^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+103Ax	𐎠	𐎡	𐎢	𐎣	𐎤	𐎥	𐎦	𐎧	𐎨	𐎩	𐎪	𐎫	𐎬	𐎭	𐎮	𐎯
U+103Bx	𐎰	𐎱	𐎲	𐎳	𐎴	𐎵	𐎶	𐎷	𐎸	𐎹	𐎺	𐎻	𐎼	𐎽	𐎾	𐎿
U+103Cx	𐏀	𐏁	𐏂	𐏃					𐏈	𐏉	𐏊	𐏋	𐏌	𐏍	𐏎	𐏏
U+103Dx	𐏐	𐏑	𐏒	𐏓	𐏔	𐏕
Notes 1. ^ As of Unicode version 16.0 2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Old Persian block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
4.1	U+103A0..103C3, 103C8..103D5	50	L2/97-269	N1639	Everson, Michael (1997-09-18), Proposal to encode Old Persian Cuneiform, Plane 1
			L2/98-070		Aliprand, Joan; Winkler, Arnold, "3.A.4. item c. Old Persian Cuneiform", Minutes of the joint UTC and L2 meeting from the meeting in Cupertino, February 25-27, 1998
			L2/98-286	N1703	Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.19", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
			L2/99-224	N2097, N2025-2	Röllig, W. (1999-07-23), Comments on proposals for the Universal Multiple-Octed Coded Character Set
				N2133	Response to comments on the question of encoding Old Semitic scripts in the UCS (N2097), 1999-10-04
			L2/00-010	N2103	Umamaheswaran, V. S. (2000-01-05), "10.4", Minutes of WG 2 meeting 37, Copenhagen, Denmark: 1999-09-13—16
			L2/00-128		Bunz, Carl-Martin (2000-03-01), Scripts from the Past in Future Versions of Unicode
			L2/01-007		Bunz, Carl-Martin (2000-12-21), "Old Persian cuneiform", Iranianist Meeting Report: Symposium on Encoding Iranian Scripts in Unicode
			L2/02-009		Bunz, Carl-Martin (2001-11-23), "Old Persian cuneiform script", 2nd Iranian Meeting Report
			L2/03-097	N2545	Everson, Michael (2002-12-03), Revised proposal to encode the Old Persian script in the UCS
			L2/03-149R	N2583R	Everson, Michael (2003-09-18), Final proposal to encode the Old Persian script in the UCS
			L2/05-289		Whistler, Ken (2005-10-06), Cuneiform property inconsistencies
			L2/05-279		Moore, Lisa (2005-11-10), "B.14.1", UTC #105 Minutes
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.

Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English Dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.

Control Pictures is a Unicode block containing characters for graphically representing the C0 control codes, and other control characters. Its block name in Unicode 1.0 was Pictures for Control Codes.

Arabic Supplement is a Unicode block that encodes Arabic letter variants used for writing non-Arabic languages, including languages of Pakistan and Africa, and old Persian.

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically for internal use.

Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.

Syriac is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam.

Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.

Byzantine Musical Symbols is a Unicode block containing characters for representing Byzantine music in ekphonetic notation.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Inscriptional Pahlavi is a Unicode block containing monumental inscription characters for writing Middle Persian.

Avestan is a Unicode block containing characters devised for recording the Zoroastrian religious texts, Avesta, and was used to write the Middle Persian, or Pazand language.

Psalter Pahlavi is a Unicode block containing characters for writing Middle Persian. The script derives its name from the "Pahlavi Psalter", a 6th- or 7th-century translation of a Syriac book of psalms.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Kana Extended-A is a Unicode block containing hentaigana and historic kana characters. Additional hentaigana characters are encoded in the Kana Supplement block.

Old Sogdian is a Unicode block containing characters for a group of related, non-cursive Sogdian writing systems used to write historic Sogdian in the 3rd to 5th centuries CE.

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-3] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[1]

[2]

[lower-alpha 1]

Old Persian
Range	U+103A0..U+103DF (64 code points)
Plane	SMP
Scripts	Old Persian
Major alphabets	Old Persian
Assigned	50 code points
Unused	14 reserved code points
Unicode version history

4.1 (2005)	50 (+50)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]