Phags-pa | |
---|---|
Range | U+A840..U+A87F (64 code points) |
Plane | BMP |
Scripts | Phags Pa |
Major alphabets | Mongolian, Chinese |
Assigned | 56 code points |
Unused | 8 reserved code points |
Unicode version history | |
5.0 (2006) | 56 (+56) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Phags-pa is a Unicode block containing characters of the 'Phags-pa script, which was promulgated as a national script by Kublai Khan, the founder of the Yuan dynasty. The script was used primarily to write Mongolian and Chinese, although it was intended for use with all the written languages of the Mongol Empire.
Phags-pa [1] [2] Official Unicode Consortium code chart (PDF)
Code point | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
U+A84x | ꡀ | ꡁ | ꡂ | ꡃ | ꡄ | ꡅ | ꡆ | ꡇ | ꡈ | ꡉ | ꡊ | ꡋ | ꡌ | ꡍ | ꡎ | ꡏ |
U+A85x | ꡐ | ꡑ | ꡒ | ꡓ | ꡔ | ꡕ | ꡖ | ꡗ | ꡘ | ꡙ | ꡚ | ꡛ | ꡜ | ꡝ | ꡞ | ꡟ |
U+A86x | ꡠ | ꡡ | ꡢ | ꡣ | ꡤ | ꡥ | ꡦ | ꡧ | ꡨ | ꡩ | ꡪ | ꡫ | ꡬ | ꡭ | ꡮ | ꡯ |
U+A87x | ꡰ | ꡱ | ꡲ | ꡳ | ꡴ | ꡵ | ꡶ | ꡷ | | | | | | | | |
Note: the eight empty cells (U+A878..U+A87F) are reserved, unassigned code points.
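The block boundaries and the assigned/reserved split can be checked programmatically. The following is a minimal sketch, assuming Python's standard `unicodedata` module; the helper names `in_phags_pa_block` and `assigned_code_points` are hypothetical, and the count it reports reflects whichever Unicode version is bundled with the interpreter.

```python
import unicodedata

# Block boundaries as given in the infobox above.
BLOCK_START = 0xA840
BLOCK_END = 0xA87F


def in_phags_pa_block(ch):
    """Return True if the single character ch lies in the Phags-pa block."""
    return BLOCK_START <= ord(ch) <= BLOCK_END


def assigned_code_points():
    """List code points in the block that are assigned.

    General category "Cn" marks unassigned (reserved) code points.
    """
    return [
        cp
        for cp in range(BLOCK_START, BLOCK_END + 1)
        if unicodedata.category(chr(cp)) != "Cn"
    ]


if __name__ == "__main__":
    assigned = assigned_code_points()
    total = BLOCK_END - BLOCK_START + 1
    print(len(assigned), "assigned,", total - len(assigned), "reserved")
```

A range check is enough for block membership because Unicode blocks are contiguous; the category lookup is only needed to separate assigned from reserved code points.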
Six standardized variation sequences are defined for characters in this block. [3] Each consists of a base character followed by U+FE00 VARIATION SELECTOR-1 (VS01):
U+ | Character | Base code point | Base + VS01 |
---|---|---|---|
A856 | Phags‑Pa Letter Small A | ꡖ | ꡖ︀ |
A85C | Phags‑Pa Letter Ha | ꡜ | ꡜ︀ |
A85E | Phags‑Pa Letter I | ꡞ | ꡞ︀ |
A85F | Phags‑Pa Letter U | ꡟ | ꡟ︀ |
A860 | Phags‑Pa Letter E | ꡠ | ꡠ︀ |
A868 | Phags‑Pa Subjoined Letter Ya | ꡨ | ꡨ︀ |
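A variation sequence is simply the base character followed immediately by the selector. As a rough sketch (the constant and function names here are illustrative, not part of any library), the six sequences in the table can be built like this in Python:

```python
import unicodedata

# U+FE00 VARIATION SELECTOR-1, written VS01 in the table above.
VS01 = "\uFE00"

# Base code points taken from the table above.
VARIANT_BASES = [0xA856, 0xA85C, 0xA85E, 0xA85F, 0xA860, 0xA868]


def variation_sequences():
    """Return each base character followed by VS01, in table order."""
    return [chr(cp) + VS01 for cp in VARIANT_BASES]


if __name__ == "__main__":
    for seq in variation_sequences():
        base = seq[0]
        # Print the code point, the character name, and the code points
        # that make up the two-character sequence.
        print(
            f"U+{ord(base):04X}",
            unicodedata.name(base),
            " ".join(f"U+{ord(c):04X}" for c in seq),
        )
```

Whether the variant glyph is actually displayed depends on the font and rendering engine; a renderer without support is expected to show the base glyph and ignore the selector.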
Note that four vowel letters have positional variants:
U+ | Character | Orientation | Isolate | Initial | Medial | Final |
---|---|---|---|---|---|---|
U+A85E | Phags‑Pa Letter I | regular | ꡞ | ꡞ | ꡞ | ꡞ |
U+A85E | Phags‑Pa Letter I | reversed | ꡞ︀ | ꡞ︀ | ꡞ︀ | ꡞ︀ |
U+A85F | Phags‑Pa Letter U | regular | ꡟ | ꡟ | ꡟ | ꡟ |
U+A85F | Phags‑Pa Letter U | reversed | ꡟ︀ | ꡟ︀ | ꡟ︀ | ꡟ︀ |
U+A860 | Phags‑Pa Letter E | regular | ꡠ | ꡠ | ꡠ | ꡠ |
U+A860 | Phags‑Pa Letter E | reversed | ꡠ︀ | ꡠ︀ | ꡠ︀ | ꡠ︀ |
U+A861 | Phags‑Pa Letter O | regular | ꡡ | ꡡ | ꡡ | ꡡ |
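Because the reversed forms in the table differ from the regular ones only by a trailing variation selector, software that compares or searches text may want to treat the two as the same letter. The sketch below shows one way to do that by removing selectors from the Variation Selectors block (U+FE00..U+FE0F) before comparing. Ignoring variants in this way is an assumption about the application, not something the standard requires, and the function names are hypothetical.

```python
# Code points of the 16 variation selectors, U+FE00..U+FE0F.
VARIATION_SELECTORS = range(0xFE00, 0xFE10)


def strip_variation_selectors(text):
    """Return text with all U+FE00..U+FE0F characters removed."""
    return "".join(ch for ch in text if ord(ch) not in VARIATION_SELECTORS)


def same_letters(a, b):
    """Compare two strings while ignoring variation selectors."""
    return strip_variation_selectors(a) == strip_variation_selectors(b)


if __name__ == "__main__":
    regular_i = "\uA85E"         # PHAGS-PA LETTER I
    reversed_i = "\uA85E\uFE00"  # the same letter followed by VS01
    print(regular_i == reversed_i)              # False: the strings differ
    print(same_letters(regular_i, reversed_i))  # True: same base letter
```

The positional forms (isolate, initial, medial, final) are not encoded separately at all; they are chosen by the rendering engine from context, so no extra handling is needed for them when comparing text.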
The following Unicode-related documents record the purpose and process of defining specific characters in the Phags-pa block:
Version | Final code points | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
5.0 | U+A840..A877 | 56 | L2/00-055 | N2163 (pdf, doc) | Sato, T. K. (2000-01-06), Soyombo and Pagba (old Mongol scripts) |
5.0 | U+A840..A877 | 56 | L2/03-224 | | Baxter, William H. (2003-07-10), Email on 'Phags-pa |
5.0 | U+A840..A877 | 56 | L2/03-246 | | Quejingzhabu; West, Andrew (2003-08-12), Letter commenting on Phags-pa encoding |
5.0 | U+A840..A877 | 56 | L2/03-269 | | West, Andrew; Fynn, Christopher (2003-08-16), E-mail questions and answers on Phags-pa encoding |
5.0 | U+A840..A877 | 56 | L2/03-229R3 | N2622 | West, Andrew (2003-09-18), Proposal to encode the Phags-pa script |
5.0 | U+A840..A877 | 56 | L2/03-366 | N2666 | Principles on Encoding Phags-pa Script, 2003-10-20 |
5.0 | U+A840..A877 | 56 | L2/03-374 | | West, Andrew (2003-10-22), Response to N2666 (Principles on Encoding Phags-pa Script) |
5.0 | U+A840..A877 | 56 | | N2912 | Constable, Peter (2004-01-22), Open Issues on Phags-pa Encoding |
5.0 | U+A840..A877 | 56 | L2/04-085 | N2706 | Summary of Voting on ISO/IEC JTC 1/SC 2 N 3696 : Project Subdivision Proposal for ISO/IEC 10646: 2003/Amendment 1, 2004-02-03 |
5.0 | U+A840..A877 | 56 | L2/04-112 | N2719 | West, Andrew (2004-03-10), Response to Comments on Phags-pa Proposal in N2706 |
5.0 | U+A840..A877 | 56 | L2/04-134 | N2745 | HPhags-pa script encoding, 2004-04-02 |
5.0 | U+A840..A877 | 56 | L2/04-174 | N2771 | West, Andrew (2004-06-01), Comments on Chinese-Mongolian joint proposal to encode the Hphags-pa script (N2475) |
5.0 | U+A840..A877 | 56 | L2/04-275 | N2829 | Chen, Zhuang; Jia, La Sen; He, Xi Ge Du Ren; Tumurtogoo, Domi; Everson, Michael; Sekiguchi; Constable, Peter; Whistler, Ken; Freytag, Asmus (2004-06-22), Consensus on the encoding of the Phags-pa script in the PDAM code chart |
5.0 | U+A840..A877 | 56 | L2/04-414 | N2870 | Summary of the Revised User's Agreement Related to Phags-pa Script, 2004-10-25 |
5.0 | U+A840..A877 | 56 | L2/04-412 | N2869 | Proposal to Encode the Phags-Pa Script, 2004-11-17 |
5.0 | U+A840..A877 | 56 | L2/04-413 | N2869c | Cover letter to Updated proposal to encode the Phags-pa Script, 2004-11-17 |
5.0 | U+A840..A877 | 56 | L2/04-415 | N2871 | Some Problems on the Encoding of the Phags-pa Script, 2004-11-17 |
5.0 | U+A840..A877 | 56 | L2/05-036 | N2922 | Consensus on Encoding Phags-pa Script, 2005-01-25 |
5.0 | U+A840..A877 | 56 | L2/05-059 | | Whistler, Ken (2005-02-03), "1. Phags-pa", WG2 Consent Docket, Part 2: Unicode 5.0 Issues |
5.0 | U+A840..A877 | 56 | L2/05-026 | | Moore, Lisa (2005-05-16), "WG2 - Unicode 5.0 Consent Docket (B.1.16)", UTC #102 Minutes |
5.0 | U+A840..A877 | 56 | L2/05-219 | N2964 | A User's Agreement Related to Phags-pa Script, 2005-08-05 |
5.0 | U+A840..A877 | 56 | L2/05-255 | N2972 | West, Andrew (2005-08-17), Glyph Forms for PHAGS-PA LETTER YA and PHAGS-PA LETTER ALTERNATE YA |
5.0 | U+A840..A877 | 56 | L2/05-257 | N2979 | West, Andrew (2005-09-01), Phags-pa Glyphs |
5.0 | U+A840..A877 | 56 | L2/05-270 | | Whistler, Ken (2005-09-21), "F. Phags-pa Glyphs", WG2 Consent Docket (Sophia Antipolis) |
5.0 | U+A840..A877 | 56 | L2/05-279 | | Moore, Lisa (2005-11-10), "Consensus 105-C29", UTC #105 Minutes |
5.0 | U+A840..A877 | 56 | | N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "7.2.1", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 |
5.0 | U+A840..A877 | 56 | L2/12-360 | | Esfahbod, Behdad; Pournader, Roozbeh (2012-11-05), Mongolian and 'Phags-Pa Shaping |
5.0 | U+A840..A877 | 56 | L2/12-343R2 | | Moore, Lisa (2012-12-04), "B.14.5 Mongolian Shaping", UTC #133 Minutes |
5.0 | U+A840..A877 | 56 | L2/13-146 | N4435 | Suignard, Michel (2013-05-27), Presentation of vertical scripts |
5.0 | U+A840..A877 | 56 | L2/13-132 | | Moore, Lisa (2013-07-29), "B.1.7 Presentation of vertical scripts", UTC #136 Minutes |
5.0 | U+A840..A877 | 56 | | N4403 (pdf, doc) | Umamaheswaran, V. S. (2014-01-28), "11.1.2 Presentation of Vertical scripts (Mongolian and Phags-pa)", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14 |
5.0 | U+A840..A877 | 56 | L2/18-278 | | Dyi-Yuaan, Lo (2018-08-30), A Glyph Error of 'Phags-pa Alternate YA (U+A86D) |
5.0 | U+A840..A877 | 56 | L2/18-280 | N5012 | West, Andrew (2018-08-30), Discussion of 'Glyph Error of 'Phags-pa Alternate YA' |
5.0 | U+A840..A877 | 56 | L2/18-272 | | Moore, Lisa (2018-10-29), "C.5.1 A Glyph Error of 'Phags-pa Alternate YA (U+A86D)", UTC #157 Minutes |