CJK Compatibility Ideographs | |
---|---|
Range | U+F900..U+FAFF (512 code points) |
Plane | BMP |
Scripts | Han |
Assigned | 472 code points |
Unused | 40 reserved code points |
Source standards | KS X 1001 Big5 IBM 32 JIS X 0213 ARIB STD-B24 KPS 10721-2000 |
Unicode version history | |
1.0.1 (1992) | 302 (+302) |
3.2 (2002) | 361 (+59) |
4.1 (2005) | 467 (+106) |
5.2 (2009) | 470 (+3) |
6.1 (2012) | 472 (+2) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] Range was initially part of the Private Use Area in Unicode 1.0.0, [3] and removed from it in Unicode 1.0.1. |
CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. However, it also contains 12 unified ideographs sourced from Japanese character sets from IBM.
The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). [4] [5] These sequences specify the desired glyph variant for a given Unicode character.
Sources for the original collection of CJK Compatibility Ideographs include:
In ensuing versions of the standard, more characters have been added to the block from:
IBM Japanese double-byte EBCDIC includes several kanji which do not exist in, or do not round-trip from, JIS X 0208. These were included as gaiji in extensions to Shift JIS and EUC-JP from IBM (e.g. code page 942), NEC, the Open Software Foundation, and Microsoft (e.g. Windows code page 932). However, they were not used as a source for the original Unified Repertoire and Ordering (URO). Instead, 32 of the IBM extension kanji, those which had not been included in the URO from other sources, were included in the CJK Compatibility Ideographs block in the range U+FA0E–U+FA2D.
Of these 32 characters:
Unified_Ideograph
property, and which do not change upon normalisation). In spite of their inclusion in the CJK Compatibility Ideographs block and their algorithmically generated character names beginning with "CJK COMPATIBILITY IDEOGRAPH
", they are not duplicates of characters in the original CJK Unified Ideographs block in any respect; [7] [8] 11 of these 12 are completely non-duplicate, while U+FA23﨣CJK COMPATIBILITY IDEOGRAPH-FA23 was later unintentionally duplicated in CJK Unified Ideographs Extension B as U+27EAF𧺯CJK UNIFIED IDEOGRAPH-27EAF. They are as follows: CJK Compatibility Ideographs [1] [2] [3] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+F90x | 豈 | 更 | 車 | 賈 | 滑 | 串 | 句 | 龜 | 龜 | 契 | 金 | 喇 | 奈 | 懶 | 癩 | 羅 |
U+F91x | 蘿 | 螺 | 裸 | 邏 | 樂 | 洛 | 烙 | 珞 | 落 | 酪 | 駱 | 亂 | 卵 | 欄 | 爛 | 蘭 |
U+F92x | 鸞 | 嵐 | 濫 | 藍 | 襤 | 拉 | 臘 | 蠟 | 廊 | 朗 | 浪 | 狼 | 郎 | 來 | 冷 | 勞 |
U+F93x | 擄 | 櫓 | 爐 | 盧 | 老 | 蘆 | 虜 | 路 | 露 | 魯 | 鷺 | 碌 | 祿 | 綠 | 菉 | 錄 |
U+F94x | 鹿 | 論 | 壟 | 弄 | 籠 | 聾 | 牢 | 磊 | 賂 | 雷 | 壘 | 屢 | 樓 | 淚 | 漏 | 累 |
U+F95x | 縷 | 陋 | 勒 | 肋 | 凜 | 凌 | 稜 | 綾 | 菱 | 陵 | 讀 | 拏 | 樂 | 諾 | 丹 | 寧 |
U+F96x | 怒 | 率 | 異 | 北 | 磻 | 便 | 復 | 不 | 泌 | 數 | 索 | 參 | 塞 | 省 | 葉 | 說 |
U+F97x | 殺 | 辰 | 沈 | 拾 | 若 | 掠 | 略 | 亮 | 兩 | 凉 | 梁 | 糧 | 良 | 諒 | 量 | 勵 |
U+F98x | 呂 | 女 | 廬 | 旅 | 濾 | 礪 | 閭 | 驪 | 麗 | 黎 | 力 | 曆 | 歷 | 轢 | 年 | 憐 |
U+F99x | 戀 | 撚 | 漣 | 煉 | 璉 | 秊 | 練 | 聯 | 輦 | 蓮 | 連 | 鍊 | 列 | 劣 | 咽 | 烈 |
U+F9Ax | 裂 | 說 | 廉 | 念 | 捻 | 殮 | 簾 | 獵 | 令 | 囹 | 寧 | 嶺 | 怜 | 玲 | 瑩 | 羚 |
U+F9Bx | 聆 | 鈴 | 零 | 靈 | 領 | 例 | 禮 | 醴 | 隸 | 惡 | 了 | 僚 | 寮 | 尿 | 料 | 樂 |
U+F9Cx | 燎 | 療 | 蓼 | 遼 | 龍 | 暈 | 阮 | 劉 | 杻 | 柳 | 流 | 溜 | 琉 | 留 | 硫 | 紐 |
U+F9Dx | 類 | 六 | 戮 | 陸 | 倫 | 崙 | 淪 | 輪 | 律 | 慄 | 栗 | 率 | 隆 | 利 | 吏 | 履 |
U+F9Ex | 易 | 李 | 梨 | 泥 | 理 | 痢 | 罹 | 裏 | 裡 | 里 | 離 | 匿 | 溺 | 吝 | 燐 | 璘 |
U+F9Fx | 藺 | 隣 | 鱗 | 麟 | 林 | 淋 | 臨 | 立 | 笠 | 粒 | 狀 | 炙 | 識 | 什 | 茶 | 刺 |
U+FA0x | 切 | 度 | 拓 | 糖 | 宅 | 洞 | 暴 | 輻 | 行 | 降 | 見 | 廓 | 兀 | 嗀 | 﨎 | 﨏 |
U+FA1x | 塚 | 﨑 | 晴 | 﨓 | 﨔 | 凞 | 猪 | 益 | 礼 | 神 | 祥 | 福 | 靖 | 精 | 羽 | 﨟 |
U+FA2x | 蘒 | 﨡 | 諸 | 﨣 | 﨤 | 逸 | 都 | 﨧 | 﨨 | 﨩 | 飯 | 飼 | 館 | 鶴 | 郞 | 隷 |
U+FA3x | 侮 | 僧 | 免 | 勉 | 勤 | 卑 | 喝 | 嘆 | 器 | 塀 | 墨 | 層 | 屮 | 悔 | 慨 | 憎 |
U+FA4x | 懲 | 敏 | 既 | 暑 | 梅 | 海 | 渚 | 漢 | 煮 | 爫 | 琢 | 碑 | 社 | 祉 | 祈 | 祐 |
U+FA5x | 祖 | 祝 | 禍 | 禎 | 穀 | 突 | 節 | 練 | 縉 | 繁 | 署 | 者 | 臭 | 艹 | 艹 | 著 |
U+FA6x | 褐 | 視 | 謁 | 謹 | 賓 | 贈 | 辶 | 逸 | 難 | 響 | 頻 | 恵 | 𤋮 | 舘 | ||
U+FA7x | 並 | 况 | 全 | 侀 | 充 | 冀 | 勇 | 勺 | 喝 | 啕 | 喙 | 嗢 | 塚 | 墳 | 奄 | 奔 |
U+FA8x | 婢 | 嬨 | 廒 | 廙 | 彩 | 徭 | 惘 | 慎 | 愈 | 憎 | 慠 | 懲 | 戴 | 揄 | 搜 | 摒 |
U+FA9x | 敖 | 晴 | 朗 | 望 | 杖 | 歹 | 殺 | 流 | 滛 | 滋 | 漢 | 瀞 | 煮 | 瞧 | 爵 | 犯 |
U+FAAx | 猪 | 瑱 | 甆 | 画 | 瘝 | 瘟 | 益 | 盛 | 直 | 睊 | 着 | 磌 | 窱 | 節 | 类 | 絛 |
U+FABx | 練 | 缾 | 者 | 荒 | 華 | 蝹 | 襁 | 覆 | 視 | 調 | 諸 | 請 | 謁 | 諾 | 諭 | 謹 |
U+FACx | 變 | 贈 | 輸 | 遲 | 醙 | 鉶 | 陼 | 難 | 靖 | 韛 | 響 | 頋 | 頻 | 鬒 | 龜 | 𢡊 |
U+FADx | 𢡄 | 𣏕 | 㮝 | 䀘 | 䀹 | 𥉉 | 𥳐 | 𧻓 | 齃 | 龎 | ||||||
U+FAEx | ||||||||||||||||
U+FAFx | ||||||||||||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Compatibility Ideographs block:
Version | Final code points [a] | Count | L2 ID | WG2 ID | IRG ID | Document |
---|---|---|---|---|---|---|
1.0.1 | U+F900..FA2D | 302 | N782 | Ksar, Mike (1991-10-12), Attachment to N 767 WG2-Paris meeting copies of working papers | ||
L2/03-399 | Fok, Anthony (2003-10-13), Unihan reported errors / changes re kHKSCS entries | |||||
L2/03-367 | N2667 | Suignard, Michel; Muller, Eric; Jenkins, John (2003-10-22), CJK Ideograph source references corrections | ||||
L2/03-398 | Nguyen, D. (2003-10-29), Unihan reported errors / changes re kCowles | |||||
L2/03-417 | Muller, Eric (2003-10-31), Variation sequences for CJK Compatibility characters | |||||
L2/06-309R | Karlsson, Kent (2006-11-07), Bug in DerivedNumericValues.txt | |||||
L2/06-324R2 | Moore, Lisa (2006-11-29), "Consensus 109-C18", UTC #109 Minutes, Add numeric values to 8 compatibility ideographs to match their canonical characters. | |||||
L2/08-238 | Cook, Richard; Lunde, Ken (2008-06-09), Recommendation For IRG To Use IVD Collections | |||||
L2/08-373 | N3525 | Lunde, Ken; Muller, Eric (2008-10-06), Handling CJK compatibility characters with variation sequences | ||||
L2/08-425 | Cook, Richard; Lunde, Ken (2008-11-18), IRG Use of IVD Collections | |||||
L2/09-003R | Moore, Lisa (2009-02-12), "WG2 — Compatibility Ideographs", UTC #118 / L2 #215 Minutes | |||||
L2/09-080 | N3590 | Muller, Eric (2009-03-11), Difficulties with compatibility ideographs | ||||
L2/09-290 | Muller, Eric (2009-08-07), Draft IVD registration for Compatibility Characters | |||||
L2/11-243 | N4111 | Sources for Orphaned CJK Ideographs, 2011-06-14 | ||||
L2/11-254 | Constable, Peter (2011-06-20), "Update to UTR #45 U-Source Ideographs requested", UTC Liaison Report from WG2 | |||||
N4103 | "Resolution 58.05", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | |||||
L2/17-090 | Chung, Jaemin (2017-04-07), Proposal to add informative notes and cross-reference to U+F92C and U+F9B8 | |||||
L2/17-103 | Moore, Lisa (2017-05-18), "B.4.1 Proposal to add informative notes and cross-reference to U+F92C and U+F9B8", UTC #151 Minutes | |||||
3.2 | U+FA30..FA6A | 59 | L2/99-016 | N1935 | Paterson, Bruce (1998-11-30), Editorial corrigenda on CJK compatibility ideographs, and other items | |
L2/99-240 | Addition of fifty six KANJIs for compatibility, 1999-07-15 | |||||
L2/99-232 | N2003 | Umamaheswaran, V. S. (1999-08-03), "7.2.2.1 Editorial corrigenda on CJK compatibility", Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15 | ||||
L2/99-311 | Addition of fifty six KANJIs for compatibility, 1999-08-23 | |||||
L2/99-313 | N2095 | Sato, T. K. (1999-09-08), Addition of CJK ideographs which are already "unified" | ||||
L2/99-316 | Whistler, Ken (1999-09-13), Comments on JCS proposal | |||||
L2/99-322 | Collins, Lee (1999-10-11), Comments on JCS compatibility characters in L2/99-310 through L2/99-313 | |||||
L2/99-365 | Moore, Lisa (1999-11-23), Comments on JCS Proposals | |||||
L2/99-383 | N2142 | N710 | The response to WG2 resolution M37.16: CJK compatibility ideographs from JIS (WG2 N2104), 1999-12-09 | |||
L2/00-010 | N2103 | Umamaheswaran, V. S. (2000-01-05), "8.8", Minutes of WG 2 meeting 37, Copenhagen, Denmark: 1999-09-13—16 | ||||
L2/99-260R | Moore, Lisa (2000-02-07), "JCS Proposals", Minutes of the UTC/L2 meeting in Mission Viejo, October 26-28, 1999 | |||||
L2/00-101 | N2197 | Sato, T. K. (2000-03-15), Update: CJK COMPATIBILITY IDEOGRAPH request | ||||
L2/00-172 | N2221 | Sato, T. K. (2000-04-20), JIS COMPATIBILITY IDEOGRAPHS (draft for ammendment-1) [sic] | ||||
N2221R | JIS COMPATIBILITY IDEOGRAPHS (draft for ammendment-1) [sic] revised, 2000-06-01 | |||||
L2/00-190 | Moore, Lisa (2000-06-22), UTC Rescinds Acceptance of Four Duplicate Radicals from JIS X 213 | |||||
L2/00-234 | N2203 (rtf, txt) | Umamaheswaran, V. S. (2000-07-21), "7.3", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24 | ||||
L2/00-337 | N2273 | JIS compatibility ideographs, 2000-09-19 | ||||
L2/00-378 | N2295 | Sato, T. K. (2000-10-26), Feedback from Japan on N2281 -- working draft on pDAM 1 -- CJK Compatibility | ||||
L2/01-420 | Whistler, Ken (2001-10-30), "1. SC2 M11-04", WG2 (Singapore) Resolution Consent Docket for UTC | |||||
L2/01-405R | Moore, Lisa (2001-12-12), "Consensus 89-C20", Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001 | |||||
L2/06-321 | Whistler, Ken (2006-10-03), UCD Bug re JIS 0213 | |||||
L2/06-324R2 | Moore, Lisa (2006-11-29), "Consensus 109-C16", UTC #109 Minutes, Give U+FA30..U+FA6A the ideographic property, and fix the wordbreak property. | |||||
4.1 | U+FA70..FAD9 | 106 | L2/01-050 | N2253 | Umamaheswaran, V. S. (2001-01-21), "7.2.4 Proposal to add the Hanja column to 10646-1", Minutes of the SC2/WG2 meeting in Athens, September 2000 | |
L2/01-350 | N2375 | Proposal to add 160 Compatibility Hanja code table of D P R of Korea into CJK Compatibility Ideographs, 2001-09-03 | ||||
L2/02-154 | N2403 | Umamaheswaran, V. S. (2002-04-22), "TC 2", Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19 | ||||
N2478 | "Korea (DPRK):T2, USA T5", Proposed Disposition of comments on SC2 N 3584 (PDAM text for Amendment 2 to ISO/IEC 10646-1:2000), 2002-05-08 | |||||
L2/02-232 | N2493 | Sato, T. K.; Kobayashi, Tatsuo; Pak, Tong Gi (2002-05-22), Proposal to add 122 compatibility Hanja code table of the D P R of Korea into the CJK Compatibility Ideographs of ISO/IEC 10646-1:2000 | ||||
N2541 | "USA T.8", Proposed disposition of comments on SC2 N 3624 (FPDAM text for Amendment 2 to ISO/IEC 10646-1:2000), 2002-12-02 | |||||
N2540 | Freytag, Asmus (2002-12-05), Corrections to CJK Compatibility Ideographs Table in FPDAM | |||||
L2/02-465 | N2566 | Collins, Lee; Freytag, Asmus (2002-12-09), Review of DPRK Compatibility Ideographs | ||||
L2/02-471 | N2572 | CJK Compatibility Ideographs (Unicode 3.2, page 399), 2002-12-18 | ||||
L2/02-472 | N2573 | Report of DPRK compatibility characters ad hoc meeting, 2002-12-11 | ||||
L2/02-468 | N2569 | Suignard, Michel (2002-12-12), "USA T.5 e, USA T.8", Proposed disposition of comments on SC2 N 3624 (FPDAM text for Amendment 2 to ISO/IEC 10646-1:2000) | ||||
L2/03-023 | N2569R | Suignard, Michel (2003-01-27), "USA T.5 e, USA T.8", Disposition of Comments Report on 10646-1/FPDAM 2 | ||||
L2/03-346 | Chang, Cora (2003-10-20), Analysis of characters in WG2 documents N2572, N2573 | |||||
L2/03-346.1 | Chang, Cora (2003-10-20), Analysis of characters in WG2 documents N2572, N2573 [spreadsheet without glyphs] | |||||
L2/04-207 | N2776 | N1062 | Proposal to add 106 Compatibility Hanjas of D P R of Korea to CJK Compatibility Ideographs, 2004-05-25 | |||
L2/04-330 | Whistler, Ken (2004-08-03), "E", WG2 Consent Docket | |||||
L2/04-316 | Moore, Lisa (2004-08-19), "100-C12", UTC #100 Minutes | |||||
L2/05-050R | N2924R | Freytag, Asmus (2005-01-28), Charts - Amendments 1 and 2 to ISO/IEC 10646:2003 | ||||
L2/10-367 | N3899 | KP1-0000, 2010-09-30 | ||||
L2/11-243 | N4111 | Sources for Orphaned CJK Ideographs, 2011-06-14 | ||||
L2/11-254 | Constable, Peter (2011-06-20), "Update to UTR #45 U-Source Ideographs requested", UTC Liaison Report from WG2 | |||||
N4103 | "Resolution 58.05", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | |||||
5.2 | U+FA6B..FA6D | 3 | N3353 (pdf, doc) | Umamaheswaran, V. S. (2007-10-10), "M51.10", Unconfirmed minutes of WG 2 meeting 51 Hanzhou, China; 2007-04-24/27 | ||
L2/07-387 | Proposal to encode six CJK Ideographs in UCS, 2007-10-17 | |||||
L2/08-184 | N3318R (pdf, appendix) | Revised proposal to encode six CJK Ideographs in UCS, 2008-03-25 | ||||
L2/08-318 | N3453 (pdf, doc) | Umamaheswaran, V. S. (2008-08-13), "M52.2k", Unconfirmed minutes of WG 2 meeting 52 | ||||
L2/08-161R2 | Moore, Lisa (2008-11-05), "Consensus 115-C14", UTC #115 Minutes | |||||
6.1 | U+FA2E..FA2F | 2 | L2/10-087 | N3747 | A solution proposed by R.O.Korea for incorrectly mapped compatibility chars, 2010-03-19 | |
L2/10-108 | Moore, Lisa (2010-05-19), "Consensus 123-C8", UTC #123 / L2 #220 Minutes | |||||
N3803 (pdf, doc) | "M56.08l", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24 | |||||
|
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese.
The CNS 11643 character set, also officially known as the Chinese Standard Interchange Code or CSIC, is officially the standard character set of Taiwan. In practice, variants of the related Big5 character set are de facto standard.
The Ideographic Research Group (IRG), formerly called the Ideographic Rapporteur Group, is a subgroup of Working Group 2 (WG2) of ISO/IEC JTC1 Subcommittee 2 (SC2), which is the committee responsible for developing the Universal Coded Character Set. IRG is tasked with preparing and reviewing sets of CJK unified ideographs for eventual inclusion in both ISO/IEC 10646 and The Unicode Standard. The IRG is composed of representatives from national standards bodies from China, Japan, South Korea, Vietnam, and other regions that have historically used Chinese characters, as well as experts from liaison organizations such as the SAT Daizōkyō Text Database Committee (SAT), Taipei Computer Association (TCA), and the Unicode Technical Committee (UTC). The group holds two meetings every year lasting 4-5 days each, subsequently reporting its activities to its parent ISO/IEC JTC 1/SC 2 (SC2/WG2) committee.
Mojikyō, also known by its full name Konjaku Mojikyō, is a character encoding scheme created to provide a complete index of characters used in the Chinese, Japanese, Korean, Vietnamese Chữ Nôm and other historical Chinese logographic writing systems. The Mojikyō Institute, which published the character set, also published computer software and TrueType computer fonts to accompany it. The Mojikyō Institute, chaired by Tadahisa Ishikawa (石川忠久), originally had its character set and related software and data redistributed on CD-ROMs sold in Kinokuniya stores.
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 16.0, Unicode defines a total of 97,680 characters.
Chinese characters may have several variant forms—visually distinct glyphs that represent the same underlying meaning and pronunciation. Variants of a given character are allographs of one another, and many are directly analogous to allographs present in the English alphabet, such as the double-storey ⟨a⟩ and single-storey ⟨ɑ⟩ variants of the letter A, with the latter more commonly appearing in handwriting. Some contexts require usage of specific variants.
Radical 213 meaning "turtle" is one of only two of the 214 Kangxi radicals that are composed of 16 strokes.
In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards. As the Unicode Glossary says:
A character that would not have been encoded except for compatibility and round-trip convertibility with other standards
KS X 1001, "Code for Information Interchange ", formerly called KS C 5601, is a South Korean coded character set standard to represent Hangul and Hanja characters on a computer.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
Kangxi Radicals is a Unicode block. In version 3.0 (1999), this separate Kangxi Radicals block was introduced which encodes the 214 radicals in sequence, at U+2F00–2FD5. These are specific code points intended to represent the radical qua radical, as opposed to the character consisting of the unaugmented radical; thus, U+2F00 represents radical 1 while U+4E00 represents the character yī meaning "one". In addition, the CJK Radicals Supplement block (2E80–2EFF) was introduced, encoding alternative forms taken by Kangxi radicals as they appear within specific characters. For example, ⺁ "CJK RADICAL CLIFF" (U+2E81) is a variant of ⼚ radical 27 (U+2F1A), itself identical in shape to the character consisting of unaugmented radical 27, 厂 "cliff" (U+5382).
A variant form is an alternate glyph for a character, encoded in Unicode through the mechanism of variation sequences: sequences in Unicode that consist of a base character followed by a variation selector character.
CJK Unified Ideographs is a Unicode block containing the most common CJK ideographs used in modern Chinese, Japanese, Korean and Vietnamese characters. When contrasted with other blocks containing CJK Unified Ideographs, it is also referred to as the Unified Repertoire and Ordering (URO).
CJK Unified Ideographs Extension B is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 1998 and 2000, plus seven gongche characters for kunqu added in Unicode 13.0, and two characters for the Macao Supplementary Character Set added in Unicode 14.0.
CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2002 and 2006, plus five "urgently needed" characters added in Unicode versions 14.0 and 15.0, some of which had previously been mistakenly unified with other characters.
CJK Unified Ideographs Extension D is a Unicode block containing uncommon CJK ideographs for Chinese, Japanese, Korean, and Vietnamese, some of which are in current use. Much smaller than most Unicode blocks for CJK unified ideographs, Extension D consists of characters which were submitted to the Ideographic Research Group as "urgently needed characters" between 2006 and 2009. Characters submitted during the same period which were needed less urgently were included in CJK Unified Ideographs Extension E instead.
CJK Compatibility is a Unicode block containing square symbols encoded for compatibility with East Asian character sets. In Unicode 1.0, it was divided into two blocks, named CJK Squared Words (U+3300–U+337F) and CJK Squared Abbreviations (U+3380–U+33FF). The square forms can have different presentations when they are used in horizontal or vertical text. For example, the characters U+333E㌾SQUARE BORUTO and U+3327㌧SQUARE TON should look different in horizontal and in vertical right-to-left: ㌧㌾
Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.
Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.
CJK Unified Ideographs Extension E is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2006 and 2013, excluding the characters submitted as "urgently needed" between 2006 and 2009, which were included in CJK Unified Ideographs Extension D.
These 12 characters are unified CJK ideographs, not compatibility ideographs, despite their names.