Tangut Components | |
---|---|
Range | U+18800..U+18AFF (768 code points) |
Plane | SMP |
Scripts | Tangut |
Assigned | 768 code points |
Unused | 0 reserved code points |
Unicode version history | |
9.0 (2016) | 755 (+755) |
13.0 (2020) | 768 (+13) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Tangut Components is a Unicode block containing components and radicals used in the modern study of the Tangut script.
Tangut Components [1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1880x | 𘠀 | 𘠁 | 𘠂 | 𘠃 | 𘠄 | 𘠅 | 𘠆 | 𘠇 | 𘠈 | 𘠉 | 𘠊 | 𘠋 | 𘠌 | 𘠍 | 𘠎 | 𘠏 |
U+1881x | 𘠐 | 𘠑 | 𘠒 | 𘠓 | 𘠔 | 𘠕 | 𘠖 | 𘠗 | 𘠘 | 𘠙 | 𘠚 | 𘠛 | 𘠜 | 𘠝 | 𘠞 | 𘠟 |
U+1882x | 𘠠 | 𘠡 | 𘠢 | 𘠣 | 𘠤 | 𘠥 | 𘠦 | 𘠧 | 𘠨 | 𘠩 | 𘠪 | 𘠫 | 𘠬 | 𘠭 | 𘠮 | 𘠯 |
U+1883x | 𘠰 | 𘠱 | 𘠲 | 𘠳 | 𘠴 | 𘠵 | 𘠶 | 𘠷 | 𘠸 | 𘠹 | 𘠺 | 𘠻 | 𘠼 | 𘠽 | 𘠾 | 𘠿 |
U+1884x | 𘡀 | 𘡁 | 𘡂 | 𘡃 | 𘡄 | 𘡅 | 𘡆 | 𘡇 | 𘡈 | 𘡉 | 𘡊 | 𘡋 | 𘡌 | 𘡍 | 𘡎 | 𘡏 |
U+1885x | 𘡐 | 𘡑 | 𘡒 | 𘡓 | 𘡔 | 𘡕 | 𘡖 | 𘡗 | 𘡘 | 𘡙 | 𘡚 | 𘡛 | 𘡜 | 𘡝 | 𘡞 | 𘡟 |
U+1886x | 𘡠 | 𘡡 | 𘡢 | 𘡣 | 𘡤 | 𘡥 | 𘡦 | 𘡧 | 𘡨 | 𘡩 | 𘡪 | 𘡫 | 𘡬 | 𘡭 | 𘡮 | 𘡯 |
U+1887x | 𘡰 | 𘡱 | 𘡲 | 𘡳 | 𘡴 | 𘡵 | 𘡶 | 𘡷 | 𘡸 | 𘡹 | 𘡺 | 𘡻 | 𘡼 | 𘡽 | 𘡾 | 𘡿 |
U+1888x | 𘢀 | 𘢁 | 𘢂 | 𘢃 | 𘢄 | 𘢅 | 𘢆 | 𘢇 | 𘢈 | 𘢉 | 𘢊 | 𘢋 | 𘢌 | 𘢍 | 𘢎 | 𘢏 |
U+1889x | 𘢐 | 𘢑 | 𘢒 | 𘢓 | 𘢔 | 𘢕 | 𘢖 | 𘢗 | 𘢘 | 𘢙 | 𘢚 | 𘢛 | 𘢜 | 𘢝 | 𘢞 | 𘢟 |
U+188Ax | 𘢠 | 𘢡 | 𘢢 | 𘢣 | 𘢤 | 𘢥 | 𘢦 | 𘢧 | 𘢨 | 𘢩 | 𘢪 | 𘢫 | 𘢬 | 𘢭 | 𘢮 | 𘢯 |
U+188Bx | 𘢰 | 𘢱 | 𘢲 | 𘢳 | 𘢴 | 𘢵 | 𘢶 | 𘢷 | 𘢸 | 𘢹 | 𘢺 | 𘢻 | 𘢼 | 𘢽 | 𘢾 | 𘢿 |
U+188Cx | 𘣀 | 𘣁 | 𘣂 | 𘣃 | 𘣄 | 𘣅 | 𘣆 | 𘣇 | 𘣈 | 𘣉 | 𘣊 | 𘣋 | 𘣌 | 𘣍 | 𘣎 | 𘣏 |
U+188Dx | 𘣐 | 𘣑 | 𘣒 | 𘣓 | 𘣔 | 𘣕 | 𘣖 | 𘣗 | 𘣘 | 𘣙 | 𘣚 | 𘣛 | 𘣜 | 𘣝 | 𘣞 | 𘣟 |
U+188Ex | 𘣠 | 𘣡 | 𘣢 | 𘣣 | 𘣤 | 𘣥 | 𘣦 | 𘣧 | 𘣨 | 𘣩 | 𘣪 | 𘣫 | 𘣬 | 𘣭 | 𘣮 | 𘣯 |
U+188Fx | 𘣰 | 𘣱 | 𘣲 | 𘣳 | 𘣴 | 𘣵 | 𘣶 | 𘣷 | 𘣸 | 𘣹 | 𘣺 | 𘣻 | 𘣼 | 𘣽 | 𘣾 | 𘣿 |
U+1890x | 𘤀 | 𘤁 | 𘤂 | 𘤃 | 𘤄 | 𘤅 | 𘤆 | 𘤇 | 𘤈 | 𘤉 | 𘤊 | 𘤋 | 𘤌 | 𘤍 | 𘤎 | 𘤏 |
U+1891x | 𘤐 | 𘤑 | 𘤒 | 𘤓 | 𘤔 | 𘤕 | 𘤖 | 𘤗 | 𘤘 | 𘤙 | 𘤚 | 𘤛 | 𘤜 | 𘤝 | 𘤞 | 𘤟 |
U+1892x | 𘤠 | 𘤡 | 𘤢 | 𘤣 | 𘤤 | 𘤥 | 𘤦 | 𘤧 | 𘤨 | 𘤩 | 𘤪 | 𘤫 | 𘤬 | 𘤭 | 𘤮 | 𘤯 |
U+1893x | 𘤰 | 𘤱 | 𘤲 | 𘤳 | 𘤴 | 𘤵 | 𘤶 | 𘤷 | 𘤸 | 𘤹 | 𘤺 | 𘤻 | 𘤼 | 𘤽 | 𘤾 | 𘤿 |
U+1894x | 𘥀 | 𘥁 | 𘥂 | 𘥃 | 𘥄 | 𘥅 | 𘥆 | 𘥇 | 𘥈 | 𘥉 | 𘥊 | 𘥋 | 𘥌 | 𘥍 | 𘥎 | 𘥏 |
U+1895x | 𘥐 | 𘥑 | 𘥒 | 𘥓 | 𘥔 | 𘥕 | 𘥖 | 𘥗 | 𘥘 | 𘥙 | 𘥚 | 𘥛 | 𘥜 | 𘥝 | 𘥞 | 𘥟 |
U+1896x | 𘥠 | 𘥡 | 𘥢 | 𘥣 | 𘥤 | 𘥥 | 𘥦 | 𘥧 | 𘥨 | 𘥩 | 𘥪 | 𘥫 | 𘥬 | 𘥭 | 𘥮 | 𘥯 |
U+1897x | 𘥰 | 𘥱 | 𘥲 | 𘥳 | 𘥴 | 𘥵 | 𘥶 | 𘥷 | 𘥸 | 𘥹 | 𘥺 | 𘥻 | 𘥼 | 𘥽 | 𘥾 | 𘥿 |
U+1898x | 𘦀 | 𘦁 | 𘦂 | 𘦃 | 𘦄 | 𘦅 | 𘦆 | 𘦇 | 𘦈 | 𘦉 | 𘦊 | 𘦋 | 𘦌 | 𘦍 | 𘦎 | 𘦏 |
U+1899x | 𘦐 | 𘦑 | 𘦒 | 𘦓 | 𘦔 | 𘦕 | 𘦖 | 𘦗 | 𘦘 | 𘦙 | 𘦚 | 𘦛 | 𘦜 | 𘦝 | 𘦞 | 𘦟 |
U+189Ax | 𘦠 | 𘦡 | 𘦢 | 𘦣 | 𘦤 | 𘦥 | 𘦦 | 𘦧 | 𘦨 | 𘦩 | 𘦪 | 𘦫 | 𘦬 | 𘦭 | 𘦮 | 𘦯 |
U+189Bx | 𘦰 | 𘦱 | 𘦲 | 𘦳 | 𘦴 | 𘦵 | 𘦶 | 𘦷 | 𘦸 | 𘦹 | 𘦺 | 𘦻 | 𘦼 | 𘦽 | 𘦾 | 𘦿 |
U+189Cx | 𘧀 | 𘧁 | 𘧂 | 𘧃 | 𘧄 | 𘧅 | 𘧆 | 𘧇 | 𘧈 | 𘧉 | 𘧊 | 𘧋 | 𘧌 | 𘧍 | 𘧎 | 𘧏 |
U+189Dx | 𘧐 | 𘧑 | 𘧒 | 𘧓 | 𘧔 | 𘧕 | 𘧖 | 𘧗 | 𘧘 | 𘧙 | 𘧚 | 𘧛 | 𘧜 | 𘧝 | 𘧞 | 𘧟 |
U+189Ex | 𘧠 | 𘧡 | 𘧢 | 𘧣 | 𘧤 | 𘧥 | 𘧦 | 𘧧 | 𘧨 | 𘧩 | 𘧪 | 𘧫 | 𘧬 | 𘧭 | 𘧮 | 𘧯 |
U+189Fx | 𘧰 | 𘧱 | 𘧲 | 𘧳 | 𘧴 | 𘧵 | 𘧶 | 𘧷 | 𘧸 | 𘧹 | 𘧺 | 𘧻 | 𘧼 | 𘧽 | 𘧾 | 𘧿 |
U+18A0x | 𘨀 | 𘨁 | 𘨂 | 𘨃 | 𘨄 | 𘨅 | 𘨆 | 𘨇 | 𘨈 | 𘨉 | 𘨊 | 𘨋 | 𘨌 | 𘨍 | 𘨎 | 𘨏 |
U+18A1x | 𘨐 | 𘨑 | 𘨒 | 𘨓 | 𘨔 | 𘨕 | 𘨖 | 𘨗 | 𘨘 | 𘨙 | 𘨚 | 𘨛 | 𘨜 | 𘨝 | 𘨞 | 𘨟 |
U+18A2x | 𘨠 | 𘨡 | 𘨢 | 𘨣 | 𘨤 | 𘨥 | 𘨦 | 𘨧 | 𘨨 | 𘨩 | 𘨪 | 𘨫 | 𘨬 | 𘨭 | 𘨮 | 𘨯 |
U+18A3x | 𘨰 | 𘨱 | 𘨲 | 𘨳 | 𘨴 | 𘨵 | 𘨶 | 𘨷 | 𘨸 | 𘨹 | 𘨺 | 𘨻 | 𘨼 | 𘨽 | 𘨾 | 𘨿 |
U+18A4x | 𘩀 | 𘩁 | 𘩂 | 𘩃 | 𘩄 | 𘩅 | 𘩆 | 𘩇 | 𘩈 | 𘩉 | 𘩊 | 𘩋 | 𘩌 | 𘩍 | 𘩎 | 𘩏 |
U+18A5x | 𘩐 | 𘩑 | 𘩒 | 𘩓 | 𘩔 | 𘩕 | 𘩖 | 𘩗 | 𘩘 | 𘩙 | 𘩚 | 𘩛 | 𘩜 | 𘩝 | 𘩞 | 𘩟 |
U+18A6x | 𘩠 | 𘩡 | 𘩢 | 𘩣 | 𘩤 | 𘩥 | 𘩦 | 𘩧 | 𘩨 | 𘩩 | 𘩪 | 𘩫 | 𘩬 | 𘩭 | 𘩮 | 𘩯 |
U+18A7x | 𘩰 | 𘩱 | 𘩲 | 𘩳 | 𘩴 | 𘩵 | 𘩶 | 𘩷 | 𘩸 | 𘩹 | 𘩺 | 𘩻 | 𘩼 | 𘩽 | 𘩾 | 𘩿 |
U+18A8x | 𘪀 | 𘪁 | 𘪂 | 𘪃 | 𘪄 | 𘪅 | 𘪆 | 𘪇 | 𘪈 | 𘪉 | 𘪊 | 𘪋 | 𘪌 | 𘪍 | 𘪎 | 𘪏 |
U+18A9x | 𘪐 | 𘪑 | 𘪒 | 𘪓 | 𘪔 | 𘪕 | 𘪖 | 𘪗 | 𘪘 | 𘪙 | 𘪚 | 𘪛 | 𘪜 | 𘪝 | 𘪞 | 𘪟 |
U+18AAx | 𘪠 | 𘪡 | 𘪢 | 𘪣 | 𘪤 | 𘪥 | 𘪦 | 𘪧 | 𘪨 | 𘪩 | 𘪪 | 𘪫 | 𘪬 | 𘪭 | 𘪮 | 𘪯 |
U+18ABx | 𘪰 | 𘪱 | 𘪲 | 𘪳 | 𘪴 | 𘪵 | 𘪶 | 𘪷 | 𘪸 | 𘪹 | 𘪺 | 𘪻 | 𘪼 | 𘪽 | 𘪾 | 𘪿 |
U+18ACx | 𘫀 | 𘫁 | 𘫂 | 𘫃 | 𘫄 | 𘫅 | 𘫆 | 𘫇 | 𘫈 | 𘫉 | 𘫊 | 𘫋 | 𘫌 | 𘫍 | 𘫎 | 𘫏 |
U+18ADx | 𘫐 | 𘫑 | 𘫒 | 𘫓 | 𘫔 | 𘫕 | 𘫖 | 𘫗 | 𘫘 | 𘫙 | 𘫚 | 𘫛 | 𘫜 | 𘫝 | 𘫞 | 𘫟 |
U+18AEx | 𘫠 | 𘫡 | 𘫢 | 𘫣 | 𘫤 | 𘫥 | 𘫦 | 𘫧 | 𘫨 | 𘫩 | 𘫪 | 𘫫 | 𘫬 | 𘫭 | 𘫮 | 𘫯 |
U+18AFx | 𘫰 | 𘫱 | 𘫲 | 𘫳 | 𘫴 | 𘫵 | 𘫶 | 𘫷 | 𘫸 | 𘫹 | 𘫺 | 𘫻 | 𘫼 | 𘫽 | 𘫾 | 𘫿 |
Notes
|
The following Unicode-related documents record the purpose and process of defining specific characters in the Tangut Components block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
9.0 | U+18800..18AF2 | 755 | L2/08-335 | N3495 | Everson, Michael; West, Andrew (2008-09-01), Proposal to encode Tangut Radicals and CJK Strokes in the UCS |
L2/08-399 | Cook, Richard; Anderson, Deborah (2008-10-29), Comments on the Tangut radicals and strokes proposal (N3495 = L2/08‐335) | ||||
N4094 | Cook, Richard; Anderson, Deborah (2011-06-01), Comments on Tangut report N4033 | ||||
L2/12-314 | N4326 | West, Andrew; Zaytsev, Viacheslav; Everson, Michael (2012-10-02), Proposal to encode Tangut radicals in the UCS | |||
L2/12-315 | N4327 | Everson, Michael; West, Andrew (2012-10-02), Code chart for Tangut ideographs and Tangut radicals | |||
L2/13-241 | N4516 | Anderson, Deborah (2013-12-10), Summary of Tangut meeting (Beijing, China) | |||
L2/14-023 | N4522 | West, Andrew; Everson, Michael; Xiaomang, Han; Jia, Changye; Jing, Yongshi; Zaytsev, Viacheslav (2014-01-21), Proposal to encode the Tangut script in the UCS | |||
L2/14-246 | N4642 | Anderson, Deborah (2014-09-29), Ad Hoc Reports for Tangut and Khitan Large Script | |||
L2/14-228 | N4636 | West, Andrew; Zaytsev, Viacheslav; Sun, Bojun; Everson, Michael (2014-09-30), Proposal to encode Tangut radicals in the UCS | |||
L2/14-268R | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Iancu, Laurențiu; Glass, Andrew; Constable, Peter; Suignard, Michel (2014-10-27), "12. Tangut Radicals", Recommendations to UTC #141 October 2014 on Script Proposals | ||||
L2/15-017 | Moore, Lisa (2015-02-12), "Consensus 142-C2", UTC #142 Minutes | ||||
L2/16-052 | N4603 (pdf, doc) | Umamaheswaran, V. S. (2015-09-01), "M63.10", Unconfirmed minutes of WG 2 meeting 63 | |||
L2/15-254 | Moore, Lisa (2015-11-16), "Consensus 145-C16", UTC #145 Minutes, Remove U+186A2; add U+18817 and U+18818, and reorder the Tangut block and components block, and change glyphs based on document L2/15-265. | ||||
L2/17-313 | N4850 | West, Andrew; Zaytsev, Viacheslav (2017-09-07), Glyph Corrections for 31 Tangut ideographs and one Tangut component | |||
L2/17-367 | N4885 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa (2017-09-18), "2a. Tangut", Comments on WG2 #66 (Sept. 2017) documents | |||
L2/17-360 | N4896 | West, Andrew; Zaytsev, Viacheslav; Sun, Bojun; You, Jerry (2017-09-22), Tangut Character Additions and Glyph Corrections (replaces L2/17-313 and L2/17-314) | |||
L2/19-064 | N5031 | West, Andrew; Zaytsev, Viacheslav (2019-02-11), Investigation of Tangut unification issues | |||
L2/19-173 | Anderson, Deborah; et al. (2019-04-29), "20. Tangut", Recommendations to UTC #159 April-May 2019 on Script Proposals | ||||
L2/19-207 | N5064 | West, Andrew; Zaytsev, Viacheslav; Jia, Changye; Jing, Yongshi; Sun, Bojun (2019-05-27), Proposal to encode nine Tangut ideographs and six Tangut components | |||
13.0 | U+18AF3..18AF9 | 7 | L2/18-194 | N4957 | West, Andrew (2018-06-01), Proposal to encode seven additional Tangut components |
L2/18-183 | Moore, Lisa (2018-11-20), "C.7", UTC #156 Minutes | ||||
N5020 (pdf, doc) | Umamaheswaran, V. S. (2019-01-11), "10.3.10", Unconfirmed minutes of WG 2 meeting 67 | ||||
U+18AFA..18AFF | 6 | L2/19-064 | N5031 | West, Andrew; Zaytsev, Viacheslav (2019-02-11), Investigation of Tangut unification issues | |
L2/19-173 | Anderson, Deborah; et al. (2019-04-29), "20. Tangut", Recommendations to UTC #159 April-May 2019 on Script Proposals | ||||
L2/19-207 | N5064 | West, Andrew; Zaytsev, Viacheslav; Jia, Changye; Jing, Yongshi; Sun, Bojun (2019-05-27), Proposal to encode nine Tangut ideographs and six Tangut components | |||
N5095 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Liang, Hai; Constable, Peter; Moore, Lisa (2019-06-10), "TANGUT", Comments on WG2 #68 documents | ||||
N5122 | "M68.04", Unconfirmed minutes of WG 2 meeting 68, 2019-12-31 | ||||
L2/19-270 | Moore, Lisa (2019-10-07), "Consensus 160-C11", UTC #160 Minutes | ||||
|
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.
Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike."
Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the characters in the Number Forms block, three fractions were inherited from ISO-8859-1, which was incorporated whole as the Latin-1 Supplement block.
Miscellaneous Technical is a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language, and academic professions. For example:
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters, shade characters, and terminal graphic characters. These can be used for filling regions of the screen and portraying drop shadows. Its block name in Unicode 1.0 was Blocks.
The Tangut script is a logographic writing system, formerly used for writing the extinct Tangut language of the Western Xia dynasty. According to the latest count, 5863 Tangut characters are known, excluding variants. The Tangut characters are similar in appearance to Chinese characters, with the same type of strokes, but the methods of forming characters in the Tangut writing system are significantly different from those of forming Chinese characters. As in Chinese calligraphy, regular, running, cursive and seal scripts were used in Tangut writing.
Mathematical Operators is a Unicode block containing characters for mathematical, logical, and set notation.
Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:
The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.
Byzantine Musical Symbols is a Unicode block containing characters for representing Byzantine music in ekphonetic notation.
Box Drawing is a Unicode block containing characters for compatibility with legacy graphics standards that contained characters for making bordered charts and tables, i.e. box-drawing characters. Its block name in Unicode 1.0 was Form and Chart Components.
Ideographic Symbols and Punctuation is a Unicode block containing symbols and punctuation marks used by ideographic scripts such as Tangut and Nüshu.
Tangut is a Unicode block containing characters from the Tangut script, which was used for writing the Tangut language spoken by the Tangut people in the Western Xia Empire, and in China during the Yuan dynasty and early Ming dynasty.
Tangut Supplement is a Unicode block containing characters from the Tangut script, which was used for writing the Tangut language spoken by the Tangut people in the Western Xia Empire, and in China during the Yuan dynasty and early Ming dynasty. This block is a supplement to the main Tangut block.
Symbols for Legacy Computing is a Unicode block containing graphic characters that were used for various home computers from the 1970s and 1980s and in Teletext broadcasting standards. It includes characters from the Amstrad CPC, MSX, Mattel Aquarius, RISC OS, MouseText, Atari ST, TRS-80 Color Computer, Oric, Texas Instruments TI-99/4A, TRS-80, Minitel, Teletext, ATASCII, PETSCII, ZX80, and ZX81 character sets. Semigraphics characters are also included in the form of new block-shaped characters, line-drawing characters, and 60 "sextant" characters.