CJK Compatibility Ideographs Supplement | |
---|---|
Range | U+2F800..U+2FA1F (544 code points) |
Plane | SIP |
Scripts | Han |
Assigned | 542 code points |
Unused | 2 reserved code points |
Source standards | CNS 11643-1992 |
Unicode version history | |
3.1 (2001) | 542 (+542) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
CJK Compatibility Ideographs Supplement is a Unicode block containing Han characters used only for roundtrip compatibility mapping with planes 3, 4, 5, 6, 7, and 15 of CNS 11643-1992.
CJK Compatibility Ideographs Supplement [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+2F80x | 丽 | 丸 | 乁 | 𠄢 | 你 | 侮 | 侻 | 倂 | 偺 | 備 | 僧 | 像 | 㒞 | 𠘺 | 免 | 兔 |
U+2F81x | 兤 | 具 | 𠔜 | 㒹 | 內 | 再 | 𠕋 | 冗 | 冤 | 仌 | 冬 | 况 | 𩇟 | 凵 | 刃 | 㓟 |
U+2F82x | 刻 | 剆 | 割 | 剷 | 㔕 | 勇 | 勉 | 勤 | 勺 | 包 | 匆 | 北 | 卉 | 卑 | 博 | 即 |
U+2F83x | 卽 | 卿 | 卿 | 卿 | 𠨬 | 灰 | 及 | 叟 | 𠭣 | 叫 | 叱 | 吆 | 咞 | 吸 | 呈 | 周 |
U+2F84x | 咢 | 哶 | 唐 | 啓 | 啣 | 善 | 善 | 喙 | 喫 | 喳 | 嗂 | 圖 | 嘆 | 圗 | 噑 | 噴 |
U+2F85x | 切 | 壮 | 城 | 埴 | 堍 | 型 | 堲 | 報 | 墬 | 𡓤 | 売 | 壷 | 夆 | 多 | 夢 | 奢 |
U+2F86x | 𡚨 | 𡛪 | 姬 | 娛 | 娧 | 姘 | 婦 | 㛮 | 㛼 | 嬈 | 嬾 | 嬾 | 𡧈 | 寃 | 寘 | 寧 |
U+2F87x | 寳 | 𡬘 | 寿 | 将 | 当 | 尢 | 㞁 | 屠 | 屮 | 峀 | 岍 | 𡷤 | 嵃 | 𡷦 | 嵮 | 嵫 |
U+2F88x | 嵼 | 巡 | 巢 | 㠯 | 巽 | 帨 | 帽 | 幩 | 㡢 | 𢆃 | 㡼 | 庰 | 庳 | 庶 | 廊 | 𪎒 |
U+2F89x | 廾 | 𢌱 | 𢌱 | 舁 | 弢 | 弢 | 㣇 | 𣊸 | 𦇚 | 形 | 彫 | 㣣 | 徚 | 忍 | 志 | 忹 |
U+2F8Ax | 悁 | 㤺 | 㤜 | 悔 | 𢛔 | 惇 | 慈 | 慌 | 慎 | 慌 | 慺 | 憎 | 憲 | 憤 | 憯 | 懞 |
U+2F8Bx | 懲 | 懶 | 成 | 戛 | 扝 | 抱 | 拔 | 捐 | 𢬌 | 挽 | 拼 | 捨 | 掃 | 揤 | 𢯱 | 搢 |
U+2F8Cx | 揅 | 掩 | 㨮 | 摩 | 摾 | 撝 | 摷 | 㩬 | 敏 | 敬 | 𣀊 | 旣 | 書 | 晉 | 㬙 | 暑 |
U+2F8Dx | 㬈 | 㫤 | 冒 | 冕 | 最 | 暜 | 肭 | 䏙 | 朗 | 望 | 朡 | 杞 | 杓 | 𣏃 | 㭉 | 柺 |
U+2F8Ex | 枅 | 桒 | 梅 | 𣑭 | 梎 | 栟 | 椔 | 㮝 | 楂 | 榣 | 槪 | 檨 | 𣚣 | 櫛 | 㰘 | 次 |
U+2F8Fx | 𣢧 | 歔 | 㱎 | 歲 | 殟 | 殺 | 殻 | 𣪍 | 𡴋 | 𣫺 | 汎 | 𣲼 | 沿 | 泍 | 汧 | 洖 |
U+2F90x | 派 | 海 | 流 | 浩 | 浸 | 涅 | 𣴞 | 洴 | 港 | 湮 | 㴳 | 滋 | 滇 | 𣻑 | 淹 | 潮 |
U+2F91x | 𣽞 | 𣾎 | 濆 | 瀹 | 瀞 | 瀛 | 㶖 | 灊 | 災 | 灷 | 炭 | 𠔥 | 煅 | 𤉣 | 熜 | 𤎫 |
U+2F92x | 爨 | 爵 | 牐 | 𤘈 | 犀 | 犕 | 𤜵 | 𤠔 | 獺 | 王 | 㺬 | 玥 | 㺸 | 㺸 | 瑇 | 瑜 |
U+2F93x | 瑱 | 璅 | 瓊 | 㼛 | 甤 | 𤰶 | 甾 | 𤲒 | 異 | 𢆟 | 瘐 | 𤾡 | 𤾸 | 𥁄 | 㿼 | 䀈 |
U+2F94x | 直 | 𥃳 | 𥃲 | 𥄙 | 𥄳 | 眞 | 真 | 真 | 睊 | 䀹 | 瞋 | 䁆 | 䂖 | 𥐝 | 硎 | 碌 |
U+2F95x | 磌 | 䃣 | 𥘦 | 祖 | 𥚚 | 𥛅 | 福 | 秫 | 䄯 | 穀 | 穊 | 穏 | 𥥼 | 𥪧 | 𥪧 | 竮 |
U+2F96x | 䈂 | 𥮫 | 篆 | 築 | 䈧 | 𥲀 | 糒 | 䊠 | 糨 | 糣 | 紀 | 𥾆 | 絣 | 䌁 | 緇 | 縂 |
U+2F97x | 繅 | 䌴 | 𦈨 | 𦉇 | 䍙 | 𦋙 | 罺 | 𦌾 | 羕 | 翺 | 者 | 𦓚 | 𦔣 | 聠 | 𦖨 | 聰 |
U+2F98x | 𣍟 | 䏕 | 育 | 脃 | 䐋 | 脾 | 媵 | 𦞧 | 𦞵 | 𣎓 | 𣎜 | 舁 | 舄 | 辞 | 䑫 | 芑 |
U+2F99x | 芋 | 芝 | 劳 | 花 | 芳 | 芽 | 苦 | 𦬼 | 若 | 茝 | 荣 | 莭 | 茣 | 莽 | 菧 | 著 |
U+2F9Ax | 荓 | 菊 | 菌 | 菜 | 𦰶 | 𦵫 | 𦳕 | 䔫 | 蓱 | 蓳 | 蔖 | 𧏊 | 蕤 | 𦼬 | 䕝 | 䕡 |
U+2F9Bx | 𦾱 | 𧃒 | 䕫 | 虐 | 虜 | 虧 | 虩 | 蚩 | 蚈 | 蜎 | 蛢 | 蝹 | 蜨 | 蝫 | 螆 | 䗗 |
U+2F9Cx | 蟡 | 蠁 | 䗹 | 衠 | 衣 | 𧙧 | 裗 | 裞 | 䘵 | 裺 | 㒻 | 𧢮 | 𧥦 | 䚾 | 䛇 | 誠 |
U+2F9Dx | 諭 | 變 | 豕 | 𧲨 | 貫 | 賁 | 贛 | 起 | 𧼯 | 𠠄 | 跋 | 趼 | 跰 | 𠣞 | 軔 | 輸 |
U+2F9Ex | 𨗒 | 𨗭 | 邔 | 郱 | 鄑 | 𨜮 | 鄛 | 鈸 | 鋗 | 鋘 | 鉼 | 鏹 | 鐕 | 𨯺 | 開 | 䦕 |
U+2F9Fx | 閷 | 𨵷 | 䧦 | 雃 | 嶲 | 霣 | 𩅅 | 𩈚 | 䩮 | 䩶 | 韠 | 𩐊 | 䪲 | 𩒖 | 頋 | 頋 |
U+2FA0x | 頩 | 𩖶 | 飢 | 䬳 | 餩 | 馧 | 駂 | 駾 | 䯎 | 𩬰 | 鬒 | 鱀 | 鳽 | 䳎 | 䳭 | 鵧 |
U+2FA1x | 𪃎 | 䳸 | 𪄅 | 𪈎 | 𪊑 | 麻 | 䵖 | 黹 | 黾 | 鼅 | 鼏 | 鼖 | 鼻 | 𪘀 | ||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Compatibility Ideographs Supplement block:
Version | Final code points [a] | Count | L2 ID | WG2 ID | IRG ID | Document |
---|---|---|---|---|---|---|
3.1 | U+2F800..2FA1D | 542 | L2/00-032 | Jenkins, John (2000-02-01), Compatibility Ideographs in the Unicode Standard | ||
L2/00-005R2 | Moore, Lisa (2000-02-14), "Compatibility Characters", Minutes of UTC #82 in San Jose | |||||
L2/00-049 | N2159 | Tseng, Shih-shyeng (2000-02-15), Proposal for Compatibility Ideographs (in Plane 2) | ||||
L2/00-087 | N2159R | Tseng, Shih-shyeng (2000-03-10), Proposal for Compatibility Ideographs (in plane 2) | ||||
L2/00-099 | N2196 | Sato, T. K. (2000-03-15), CJK COMPATIBILITY IDEOGRAPH | ||||
L2/00-146 | N2222R | Sato, T. K. (2000-04-21), More information needed for COMPATIBILITY IDEOGRAPHS ? | ||||
L2/00-147 | N2223 | Sato, T. K. (2000-04-21), Beyond the request of JIS COMPATIBILITY IDEOGRAPHS | ||||
L2/00-233 | N2223R | Sato, T. K. (2000-04-21), Beyond the request of JIS COMPATIBILITY IDEOGRAPHS | ||||
L2/00-234 | N2203 (rtf, txt) | Umamaheswaran, V. S. (2000-07-21), "7.3", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24 | ||||
L2/00-270 | Suignard, Michel (2000-08-08), 10646-2 FCD: Comments on TCA CJK Compatibility | |||||
L2/00-187 | Moore, Lisa (2000-08-23), "Motion 84-M9", UTC minutes -- Boston, August 8-11, 2000 | |||||
L2/00-272 | N2237 | Suignard, Michel (2000-08-23), "T.10 Clause 11: Compatibility characters; T.12 Annex A.1", Comments accompanying the US positive vote on the CD ISO/IEC 10646-2 | ||||
N2284 | CJK Unified Ideographs Extension B for preDIS 10646-2, 2000-09-14 | |||||
L2/00-334 | N2270 | Shih-Shyeng, Tseng (2000-09-15), Updated CJK Compatibility Ideographs sets from TCA | ||||
L2/00-369 | Whistler, Ken (2000-10-06), "1.f CJK Compatibility Ideographs Supplement (Plane 2)", WG2 in Vouliagmeni (Athens) | |||||
L2/01-050 | N2253 | Umamaheswaran, V. S. (2001-01-21), "7.18 and 8.1", Minutes of the SC2/WG2 meeting in Athens, September 2000 | ||||
L2/03-399 | Fok, Anthony (2003-10-13), Unihan reported errors / changes re kHKSCS entries | |||||
L2/03-367 | N2667 | Suignard, Michel; Muller, Eric; Jenkins, John (2003-10-22), CJK Ideograph source references corrections | ||||
L2/03-398 | Nguyen, D. (2003-10-29), Unihan reported errors / changes re kCowles | |||||
L2/04-209 | N2775R | N1063 | Proposal to add 13 KP source reference to existing CJK Compatibility Characters, 2004-05-25 | |||
L2/10-100 | N3787 | Request for disunifying U+2F89F from U+5FF9, 2010-04-07 | ||||
L2/10-218 | N1666 | Error report on U+225D6 AND U+2F89F, 2010-06-24 | ||||
L2/11-243 | N4111 | Sources for Orphaned CJK Ideographs, 2011-06-14 | ||||
L2/11-254 | Constable, Peter (2011-06-20), "Update to UTR #45 U-Source Ideographs requested", UTC Liaison Report from WG2 | |||||
N4103 | "Resolution 58.05", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | |||||
L2/14-260 | N4621 | Suignard, Michel (2014-10-23), CJK chart and source references update | ||||
L2/16-052 | N4603 (pdf, doc) | Umamaheswaran, V. S. (2015-09-01), "M63.05", Unconfirmed minutes of WG 2 meeting 63 | ||||
L2/19-239 | N5080 | N2338 | TCA's feedback to IRG N2338 [Affects U+2F878, U+2F8F0, and U+2FA02], 2019-05-14 | |||
L2/19-240 | N5081 | N2337 | Disunification of U+2F83B/U+5406, 2019-05-14 | |||
L2/19-237 | N5068 | Editorial Report on Miscellaneous Issues (meeting IRG#52) [Affects U+2F83B, U+2F878, U+2F8D6, U+2F8D7, U+2F8DA, U+2F8F0, U+2F984, and U+2FA02], 2019-05-17 | ||||
L2/19-241 | N5083 | N2391 | Errata report for WG2 submission_TCA [Affects U+2F8D6, U+2F8D7, U+2F8DA, and U+2F984], 2019-05-31 | |||
L2/22-077 | N2512 | Shin, SangHyun; Cho, Sungduk; Kim, Kyongsok (2021-11-02), A revised proposal requesting a Horizontal Extension of 51 Hanja chars (previously submitted for ExtF/G/H) [Affects U+2F8B1] | ||||
L2/22-067 | Lunde, Ken (2022-04-16), "19 [Affects U+2F8B1]", CJK & Unihan Group Recommendations for UTC #171 Meeting | |||||
L2/22-061 | Constable, Peter (2022-07-27), "E.1 Section 19 [Affects U+2F8B1]", Approved Minutes of UTC Meeting 171 | |||||
|
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese.
The Ideographic Research Group (IRG), formerly called the Ideographic Rapporteur Group, is a subgroup of Working Group 2 (WG2) of ISO/IEC JTC1 Subcommittee 2 (SC2), which is the committee responsible for developing the Universal Coded Character Set. IRG is tasked with preparing and reviewing sets of CJK unified ideographs for eventual inclusion in both ISO/IEC 10646 and The Unicode Standard. The IRG is composed of representatives from national standards bodies from China, Japan, South Korea, Vietnam, and other regions that have historically used Chinese characters, as well as experts from liaison organizations such as the SAT Daizōkyō Text Database Committee (SAT), Taipei Computer Association (TCA), and the Unicode Technical Committee (UTC). The group holds two meetings every year lasting 4-5 days each, subsequently reporting its activities to its parent ISO/IEC JTC 1/SC 2 (SC2/WG2) committee.
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 16.0, Unicode defines a total of 97,680 characters.
CJK Radicals Supplement is a Unicode block containing alternative, often positional, forms of the Kangxi radicals. They are used as headers in dictionary indices and other CJK ideograph collections organized by radical-stroke.
In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 16.0, five of the planes have assigned code points (characters), and seven are named.
Kangxi Radicals is a Unicode block. In version 3.0 (1999), this separate Kangxi Radicals block was introduced which encodes the 214 radicals in sequence, at U+2F00–2FD5. These are specific code points intended to represent the radical qua radical, as opposed to the character consisting of the unaugmented radical; thus, U+2F00 represents radical 1 while U+4E00 represents the character yī meaning "one". In addition, the CJK Radicals Supplement block (2E80–2EFF) was introduced, encoding alternative forms taken by Kangxi radicals as they appear within specific characters. For example, ⺁ "CJK RADICAL CLIFF" (U+2E81) is a variant of ⼚ radical 27 (U+2F1A), itself identical in shape to the character consisting of unaugmented radical 27, 厂 "cliff" (U+5382).
A variant form is an alternate glyph for a character, encoded in Unicode through the mechanism of variation sequences: sequences in Unicode that consist of a base character followed by a variation selector character.
CJK Unified Ideographs is a Unicode block containing the most common CJK ideographs used in modern Chinese, Japanese, Korean and Vietnamese characters. When contrasted with other blocks containing CJK Unified Ideographs, it is also referred to as the Unified Repertoire and Ordering (URO).
CJK Unified Ideographs Extension B is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 1998 and 2000, plus seven gongche characters for kunqu added in Unicode 13.0, and two characters for the Macao Supplementary Character Set added in Unicode 14.0.
CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2002 and 2006, plus five "urgently needed" characters added in Unicode versions 14.0 and 15.0, some of which had previously been mistakenly unified with other characters.
CJK Unified Ideographs Extension D is a Unicode block containing uncommon CJK ideographs for Chinese, Japanese, Korean, and Vietnamese, some of which are in current use. Much smaller than most Unicode blocks for CJK unified ideographs, Extension D consists of characters which were submitted to the Ideographic Research Group as "urgently needed characters" between 2006 and 2009. Characters submitted during the same period which were needed less urgently were included in CJK Unified Ideographs Extension E instead.
CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. However, it also contains 12 unified ideographs sourced from Japanese character sets from IBM.
Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.
CJK Compatibility Forms is a Unicode block containing vertical glyph variants for east Asian compatibility. Its block name in Unicode 1.0 was CNS 11643 Compatibility, in reference to CNS 11643.
CJK Compatibility is a Unicode block containing square symbols encoded for compatibility with East Asian character sets. In Unicode 1.0, it was divided into two blocks, named CJK Squared Words (U+3300–U+337F) and CJK Squared Abbreviations (U+3380–U+33FF). The square forms can have different presentations when they are used in horizontal or vertical text. For example, the characters U+333E㌾SQUARE BORUTO and U+3327㌧SQUARE TON should look different in horizontal and in vertical right-to-left: ㌧㌾
Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.
Variation Selectors is a Unicode block containing 16 variation selectors used to specify a glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1–VS4, VS7, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.
CJK Unified Ideographs Extension E is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2006 and 2013, excluding the characters submitted as "urgently needed" between 2006 and 2009, which were included in CJK Unified Ideographs Extension D.
CJK Unified Ideographs Extension G is a Unicode block containing rare and historic CJK Unified Ideographs for Chinese, Japanese, Korean, and Vietnamese which were submitted to the Ideographic Research Group during 2015. It is the first block to be allocated to the Tertiary Ideographic Plane.
CJK Unified Ideographs Extension H is a Unicode block containing rare and historic CJK Unified Ideographs for Chinese, Japanese, Korean, Sawndip, and Vietnamese submitted to the Ideographic Research Group during 2017.