List of radicals in Unicode

The List of Unicode radicals comprises those Unicode characters that represent radical components of CJK characters, Tangut characters or Yi syllables. These are used primarily for indexing characters in dictionaries.

Contents

There are two CJK radicals blocks: the "Kangxi Radicals" block, which includes the 214 standard radicals used in the Kangxi Dictionary; and the "CJK Radicals Supplement" block, which includes 115 radical components used in other modern dictionaries, including simplified Chinese and Japanese radical forms. [1]

There is one "Tangut Components" block that includes 768 radicals and components that are used to index Tangut characters in dictionaries of the Tangut script or to describe the structure of Tangut characters. [2]

There is one "Yi Radicals" block that includes 55 radicals used to index Yi characters in dictionaries of the standardized Yi script used for writing the Nuosu language in Southern Sichuan and Northern Yunnan. [3]

Sets of radicals for other sinoform scripts, such as Jurchen, have also been proposed for encoding in Unicode. [4]
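The block layout described above can be sketched in a few lines of Python. This is only an illustration: the inclusive ranges are read off the row labels of the code charts in this article, the names `RADICAL_BLOCKS`, `radical_block` and `assigned_count` are made up for the example, and the counts reflect whatever Unicode version the local Python `unicodedata` module ships with.

```python
import unicodedata

# Inclusive block ranges, taken from the row labels of the code charts.
# The dictionary and function names are illustrative, not a standard API.
RADICAL_BLOCKS = {
    "Kangxi Radicals": (0x2F00, 0x2FDF),
    "CJK Radicals Supplement": (0x2E80, 0x2EFF),
    "Tangut Components": (0x18800, 0x18AFF),
    "Yi Radicals": (0xA490, 0xA4CF),
}


def radical_block(ch):
    """Return the name of the radical block containing ch, or None."""
    cp = ord(ch)
    for name, (first, last) in RADICAL_BLOCKS.items():
        if first <= cp <= last:
            return name
    return None


def assigned_count(name):
    """Count code points in the block that the local database treats as assigned."""
    first, last = RADICAL_BLOCKS[name]
    # General category "Cn" marks an unassigned code point.
    return sum(
        unicodedata.category(chr(cp)) != "Cn" for cp in range(first, last + 1)
    )


if __name__ == "__main__":
    for block in RADICAL_BLOCKS:
        print(block, assigned_count(block))
```

On a Python build with a recent Unicode database, the counts for the Kangxi, CJK supplement and Yi blocks should match the figures quoted above; the Tangut count depends on how new the bundled database is.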

Unicode tables

Kangxi Radicals [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
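U+2F0x ⼀⼁⼂⼃⼄⼅⼆⼇⼈⼉⼊⼋⼌⼍⼎⼏
U+2F1x ⼐⼑⼒⼓⼔⼕⼖⼗⼘⼙⼚⼛⼜⼝⼞⼟
U+2F2x ⼠⼡⼢⼣⼤⼥⼦⼧⼨⼩⼪⼫⼬⼭⼮⼯
U+2F3x ⼰⼱⼲⼳⼴⼵⼶⼷⼸⼹⼺⼻⼼⼽⼾⼿
U+2F4x ⽀⽁⽂⽃⽄⽅⽆⽇⽈⽉⽊⽋⽌⽍⽎⽏
U+2F5x ⽐⽑⽒⽓⽔⽕⽖⽗⽘⽙⽚⽛⽜⽝⽞⽟
U+2F6x ⽠⽡⽢⽣⽤⽥⽦⽧⽨⽩⽪⽫⽬⽭⽮⽯
U+2F7x ⽰⽱⽲⽳⽴⽵⽶⽷⽸⽹⽺⽻⽼⽽⽾⽿
U+2F8x ⾀⾁⾂⾃⾄⾅⾆⾇⾈⾉⾊⾋⾌⾍⾎⾏
U+2F9x ⾐⾑⾒⾓⾔⾕⾖⾗⾘⾙⾚⾛⾜⾝⾞⾟
U+2FAx ⾠⾡⾢⾣⾤⾥⾦⾧⾨⾩⾪⾫⾬⾭⾮⾯
U+2FBx ⾰⾱⾲⾳⾴⾵⾶⾷⾸⾹⾺⾻⾼⾽⾾⾿
U+2FCx ⿀⿁⿂⿃⿄⿅⿆⿇⿈⿉⿊⿋⿌⿍⿎⿏
U+2FDx ⿐⿑⿒⿓⿔⿕
Notes
1. ^ As of Unicode version 15.1
2. ^ Cells left empty (U+2FD6–U+2FDF) are unassigned code points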
CJK Radicals Supplement [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
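U+2E8x ⺀⺁⺂⺃⺄⺅⺆⺇⺈⺉⺊⺋⺌⺍⺎⺏
U+2E9x ⺐⺑⺒⺓⺔⺕⺖⺗⺘⺙ ⺛⺜⺝⺞⺟
U+2EAx ⺠⺡⺢⺣⺤⺥⺦⺧⺨⺩⺪⺫⺬⺭⺮⺯
U+2EBx ⺰⺱⺲⺳⺴⺵⺶⺷⺸⺹⺺⺻⺼⺽⺾⺿
U+2ECx ⻀⻁⻂⻃⻄⻅⻆⻇⻈⻉⻊⻋⻌⻍⻎⻏
U+2EDx ⻐⻑⻒⻓⻔⻕⻖⻗⻘⻙⻚⻛⻜⻝⻞⻟
U+2EEx ⻠⻡⻢⻣⻤⻥⻦⻧⻨⻩⻪⻫⻬⻭⻮⻯
U+2EFx ⻰⻱⻲⻳
Notes
1. ^ As of Unicode version 15.1
2. ^ Cells left blank (U+2E9A and U+2EF4–U+2EFF) are unassigned code points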
Tangut Components [1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1880x𘠀𘠁𘠂𘠃𘠄𘠅𘠆𘠇𘠈𘠉𘠊𘠋𘠌𘠍𘠎𘠏
U+1881x𘠐𘠑𘠒𘠓𘠔𘠕𘠖𘠗𘠘𘠙𘠚𘠛𘠜𘠝𘠞𘠟
U+1882x𘠠𘠡𘠢𘠣𘠤𘠥𘠦𘠧𘠨𘠩𘠪𘠫𘠬𘠭𘠮𘠯
U+1883x𘠰𘠱𘠲𘠳𘠴𘠵𘠶𘠷𘠸𘠹𘠺𘠻𘠼𘠽𘠾𘠿
U+1884x𘡀𘡁𘡂𘡃𘡄𘡅𘡆𘡇𘡈𘡉𘡊𘡋𘡌𘡍𘡎𘡏
U+1885x𘡐𘡑𘡒𘡓𘡔𘡕𘡖𘡗𘡘𘡙𘡚𘡛𘡜𘡝𘡞𘡟
U+1886x𘡠𘡡𘡢𘡣𘡤𘡥𘡦𘡧𘡨𘡩𘡪𘡫𘡬𘡭𘡮𘡯
U+1887x𘡰𘡱𘡲𘡳𘡴𘡵𘡶𘡷𘡸𘡹𘡺𘡻𘡼𘡽𘡾𘡿
U+1888x𘢀𘢁𘢂𘢃𘢄𘢅𘢆𘢇𘢈𘢉𘢊𘢋𘢌𘢍𘢎𘢏
U+1889x𘢐𘢑𘢒𘢓𘢔𘢕𘢖𘢗𘢘𘢙𘢚𘢛𘢜𘢝𘢞𘢟
U+188Ax𘢠𘢡𘢢𘢣𘢤𘢥𘢦𘢧𘢨𘢩𘢪𘢫𘢬𘢭𘢮𘢯
U+188Bx𘢰𘢱𘢲𘢳𘢴𘢵𘢶𘢷𘢸𘢹𘢺𘢻𘢼𘢽𘢾𘢿
U+188Cx𘣀𘣁𘣂𘣃𘣄𘣅𘣆𘣇𘣈𘣉𘣊𘣋𘣌𘣍𘣎𘣏
U+188Dx𘣐𘣑𘣒𘣓𘣔𘣕𘣖𘣗𘣘𘣙𘣚𘣛𘣜𘣝𘣞𘣟
U+188Ex𘣠𘣡𘣢𘣣𘣤𘣥𘣦𘣧𘣨𘣩𘣪𘣫𘣬𘣭𘣮𘣯
U+188Fx𘣰𘣱𘣲𘣳𘣴𘣵𘣶𘣷𘣸𘣹𘣺𘣻𘣼𘣽𘣾𘣿
U+1890x𘤀𘤁𘤂𘤃𘤄𘤅𘤆𘤇𘤈𘤉𘤊𘤋𘤌𘤍𘤎𘤏
U+1891x𘤐𘤑𘤒𘤓𘤔𘤕𘤖𘤗𘤘𘤙𘤚𘤛𘤜𘤝𘤞𘤟
U+1892x𘤠𘤡𘤢𘤣𘤤𘤥𘤦𘤧𘤨𘤩𘤪𘤫𘤬𘤭𘤮𘤯
U+1893x𘤰𘤱𘤲𘤳𘤴𘤵𘤶𘤷𘤸𘤹𘤺𘤻𘤼𘤽𘤾𘤿
U+1894x𘥀𘥁𘥂𘥃𘥄𘥅𘥆𘥇𘥈𘥉𘥊𘥋𘥌𘥍𘥎𘥏
U+1895x𘥐𘥑𘥒𘥓𘥔𘥕𘥖𘥗𘥘𘥙𘥚𘥛𘥜𘥝𘥞𘥟
U+1896x𘥠𘥡𘥢𘥣𘥤𘥥𘥦𘥧𘥨𘥩𘥪𘥫𘥬𘥭𘥮𘥯
U+1897x𘥰𘥱𘥲𘥳𘥴𘥵𘥶𘥷𘥸𘥹𘥺𘥻𘥼𘥽𘥾𘥿
U+1898x𘦀𘦁𘦂𘦃𘦄𘦅𘦆𘦇𘦈𘦉𘦊𘦋𘦌𘦍𘦎𘦏
U+1899x𘦐𘦑𘦒𘦓𘦔𘦕𘦖𘦗𘦘𘦙𘦚𘦛𘦜𘦝𘦞𘦟
U+189Ax𘦠𘦡𘦢𘦣𘦤𘦥𘦦𘦧𘦨𘦩𘦪𘦫𘦬𘦭𘦮𘦯
U+189Bx𘦰𘦱𘦲𘦳𘦴𘦵𘦶𘦷𘦸𘦹𘦺𘦻𘦼𘦽𘦾𘦿
U+189Cx𘧀𘧁𘧂𘧃𘧄𘧅𘧆𘧇𘧈𘧉𘧊𘧋𘧌𘧍𘧎𘧏
U+189Dx𘧐𘧑𘧒𘧓𘧔𘧕𘧖𘧗𘧘𘧙𘧚𘧛𘧜𘧝𘧞𘧟
U+189Ex𘧠𘧡𘧢𘧣𘧤𘧥𘧦𘧧𘧨𘧩𘧪𘧫𘧬𘧭𘧮𘧯
U+189Fx𘧰𘧱𘧲𘧳𘧴𘧵𘧶𘧷𘧸𘧹𘧺𘧻𘧼𘧽𘧾𘧿
U+18A0x𘨀𘨁𘨂𘨃𘨄𘨅𘨆𘨇𘨈𘨉𘨊𘨋𘨌𘨍𘨎𘨏
U+18A1x𘨐𘨑𘨒𘨓𘨔𘨕𘨖𘨗𘨘𘨙𘨚𘨛𘨜𘨝𘨞𘨟
U+18A2x𘨠𘨡𘨢𘨣𘨤𘨥𘨦𘨧𘨨𘨩𘨪𘨫𘨬𘨭𘨮𘨯
U+18A3x𘨰𘨱𘨲𘨳𘨴𘨵𘨶𘨷𘨸𘨹𘨺𘨻𘨼𘨽𘨾𘨿
U+18A4x𘩀𘩁𘩂𘩃𘩄𘩅𘩆𘩇𘩈𘩉𘩊𘩋𘩌𘩍𘩎𘩏
U+18A5x𘩐𘩑𘩒𘩓𘩔𘩕𘩖𘩗𘩘𘩙𘩚𘩛𘩜𘩝𘩞𘩟
U+18A6x𘩠𘩡𘩢𘩣𘩤𘩥𘩦𘩧𘩨𘩩𘩪𘩫𘩬𘩭𘩮𘩯
U+18A7x𘩰𘩱𘩲𘩳𘩴𘩵𘩶𘩷𘩸𘩹𘩺𘩻𘩼𘩽𘩾𘩿
U+18A8x𘪀𘪁𘪂𘪃𘪄𘪅𘪆𘪇𘪈𘪉𘪊𘪋𘪌𘪍𘪎𘪏
U+18A9x𘪐𘪑𘪒𘪓𘪔𘪕𘪖𘪗𘪘𘪙𘪚𘪛𘪜𘪝𘪞𘪟
U+18AAx𘪠𘪡𘪢𘪣𘪤𘪥𘪦𘪧𘪨𘪩𘪪𘪫𘪬𘪭𘪮𘪯
U+18ABx𘪰𘪱𘪲𘪳𘪴𘪵𘪶𘪷𘪸𘪹𘪺𘪻𘪼𘪽𘪾𘪿
U+18ACx𘫀𘫁𘫂𘫃𘫄𘫅𘫆𘫇𘫈𘫉𘫊𘫋𘫌𘫍𘫎𘫏
U+18ADx𘫐𘫑𘫒𘫓𘫔𘫕𘫖𘫗𘫘𘫙𘫚𘫛𘫜𘫝𘫞𘫟
U+18AEx𘫠𘫡𘫢𘫣𘫤𘫥𘫦𘫧𘫨𘫩𘫪𘫫𘫬𘫭𘫮𘫯
U+18AFx𘫰𘫱𘫲𘫳𘫴𘫵𘫶𘫷𘫸𘫹𘫺𘫻𘫼𘫽𘫾𘫿
Notes
1. ^ As of Unicode version 15.1
Yi Radicals [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+A49x ꒐꒑꒒꒓꒔꒕꒖꒗꒘꒙꒚꒛꒜꒝꒞꒟
U+A4Ax ꒠꒡꒢꒣꒤꒥꒦꒧꒨꒩꒪꒫꒬꒭꒮꒯
U+A4Bx ꒰꒱꒲꒳꒴꒵꒶꒷꒸꒹꒺꒻꒼꒽꒾꒿
U+A4Cx ꓀꓁꓂꓃꓄꓅꓆
Notes
1. ^ As of Unicode version 15.1
2. ^ Cells left empty (U+A4C7–U+A4CF) are unassigned code points
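
In each chart above, the row label supplies the leading hexadecimal digits of a code point and the column header supplies the final digit. As a rough sketch of that layout, the following Python function rebuilds chart rows for a given range. The function name `chart_rows` is hypothetical, and unassigned cells are shown as a space according to the local `unicodedata` database, so the result can differ between Python versions.

```python
import unicodedata


def chart_rows(first, last):
    """Build chart rows of 16 cells each for the inclusive range first..last."""
    rows = []
    # Align the start of the range down to a multiple of 16.
    for row_start in range(first & ~0xF, last + 1, 16):
        cells = []
        for cp in range(row_start, row_start + 16):
            ch = chr(cp)
            # "Cn" is the general category of unassigned code points.
            cells.append(ch if unicodedata.category(ch) != "Cn" else " ")
        # Dropping the last hex digit of the row start gives the row label.
        rows.append(f"U+{row_start >> 4:X}x " + "".join(cells))
    return rows


if __name__ == "__main__":
    print(" 0123456789ABCDEF")
    for row in chart_rows(0xA490, 0xA4CF):  # Yi Radicals
        print(row)
```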

Related Research Articles

Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese.

Radical (Chinese characters): Indexing component of Chinese characters

A Chinese radical or indexing component is a graphical component of a Chinese character under which the character is traditionally listed in a Chinese dictionary. This component is often a semantic indicator similar to a morpheme, though sometimes it may be a phonetic component or even an artificially extracted portion of the character. In some cases the original semantic or phonological connection has become obscure, owing to changes in character meaning or pronunciation over time.

Shuowen Jiezi: 2nd century Chinese character dictionary

Shuowen Jiezi is an ancient Chinese dictionary compiled by Xu Shen during the Eastern Han dynasty. Although not the first comprehensive Chinese character dictionary, it was the first to analyze the structure of the characters and to give the rationale behind them, as well as the first to use the principle of organization by sections with shared components called radicals.

GB 18030: Unicode character encoding mostly used for Simplified Chinese

GB 18030 is a Chinese government standard, described as Information Technology — Chinese coded character set and defines the required language and character support necessary for software in China. GB18030 is the registered Internet name for the official character set of the People's Republic of China (PRC) superseding GB2312. As a Unicode Transformation Format, GB18030 supports both simplified and traditional Chinese characters. It is also compatible with legacy encodings including GB2312, CP936, and GBK 1.0.
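
As a small illustration of the compatibility described above, Python's built-in `gb18030` codec can be compared with its `gb2312` codec. This is a sketch only: the helper name `gb18030_bytes` and the sample characters are chosen for the example, and the byte values come from whatever codec tables the local Python ships with.

```python
def gb18030_bytes(text):
    """Encode text with Python's built-in GB 18030 codec."""
    return text.encode("gb18030")


if __name__ == "__main__":
    # A common ideograph, a Kangxi radical, and a Tangut component.
    for sample in ["\u4e00", "\u2f00", "\U00018800"]:
        encoded = gb18030_bytes(sample)
        # Every sample should survive a round trip through the codec.
        assert encoded.decode("gb18030") == sample
        print(f"U+{ord(sample):04X} -> {encoded.hex()} ({len(encoded)} bytes)")
```

Characters that already exist in GB2312 keep their two-byte codes, while code points outside the Basic Multilingual Plane are encoded in four bytes.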

The CNS 11643 character set, also known as the Chinese Standard Interchange Code or CSIC, is officially the standard character set of Taiwan. In practice, variants of the related Big5 character set are the de facto standard.

Mojikyō: Character encoding scheme

Mojikyō, also known by its full name Konjaku Mojikyō, is a character encoding scheme. The Mojikyō Institute, which published the character set, also published computer software and TrueType fonts to accompany it. The Mojikyō Institute, chaired by Tadahisa Ishikawa (石川忠久), originally had its character set and related software and data redistributed on CD-ROMs sold in Kinokuniya stores.

The 214 Kangxi radicals, also known as Zihui radicals, were collated in the 18th-century Kangxi Dictionary to aid categorization of Chinese characters. They are primarily sorted by stroke count. They are the most popular system of radicals for dictionaries that order characters by radical and stroke count. They are encoded in Unicode alongside other CJK characters, under the block "Kangxi Radicals", while graphical variants are included in the "CJK Radicals Supplement" block.

The Chinese character description languages are several proposed languages to most accurately and completely describe Chinese characters and information such as their list of components, list of strokes, their order, and the location of each of them on a background empty square. They are designed to overcome the inherent lack of information within a bitmap description. This enriched information can be used to identify variants of characters that are unified into one code point by Unicode and ISO/IEC 10646, as well as to provide an alternative form of representation for rare characters that do not yet have a standardized encoding in Unicode or ISO/IEC 10646. Many aim to work for Kaishu style and Song style, as well as to provide the character's internal structure which can be used for easier look-up of a character by indexing the character's internal make-up and cross-referencing among similar characters.

A Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The vast majority of modern computer fonts use Unicode mappings, even those fonts which only include glyphs for a single writing system, or even only support the basic Latin alphabet. Fonts which support a wide range of Unicode scripts and Unicode symbols are sometimes referred to as "pan-Unicode fonts", although as the maximum number of glyphs that can be defined in a TrueType font is restricted to 65,535, it is not possible for a single font to provide individual glyphs for all defined Unicode characters. This article lists some widely used Unicode fonts that support a comparatively large number and broad range of Unicode characters.

The Chinese, Japanese and Korean (CJK) scripts share a common set of characters, collectively known as CJK characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 15.1, Unicode defines a total of 97,680 CJK Unified Ideographs.

Andrew West (linguist)

Andrew Christopher West is an English Sinologist. His first works concerned Chinese novels of the Ming and Qing dynasties. His study of Romance of the Three Kingdoms used a new approach to analyse the relationship among the various versions, extrapolating the original text of that novel.

Zihui: Chinese dictionary

The 1615 Zìhuì is a Chinese dictionary edited by the Ming Dynasty scholar Mei Yingzuo. It is renowned for introducing two lexicographical innovations that continue to be used in the present day: the 214-radical system for indexing Chinese characters, which replaced the classic Shuowen Jiezi dictionary's 540-radical system, and the radical-and-stroke sorting method.

Radical 2: Kangxi radical

Radical 2 or radical line (丨部) meaning "vertically connected" is one of six of the 214 Kangxi radicals that are composed of only one stroke.

Radical 213: Chinese character radical

Radical 213 meaning "turtle" is one of only two of the 214 Kangxi radicals that are composed of 16 strokes.

In the Unicode standard, a plane is a continuous group of 65,536 (2^16) code points. There are 17 planes, identified by the numbers 0 to 16, which correspond to the possible values 00–10 (hexadecimal) of the first two positions in six-position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 15.1, five of the planes have assigned code points (characters), and seven are named.
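
Because each plane holds 65,536 code points, the plane number is simply the code point with its low 16 bits shifted away. A minimal sketch in Python, where the function name `plane_of` is invented for the example:

```python
def plane_of(code_point):
    """Return the plane number (0 to 16) that contains the given code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    # Each plane spans 0x10000 code points, so drop the low 16 bits.
    return code_point >> 16


if __name__ == "__main__":
    # Kangxi radical (BMP), Tangut component (plane 1), last code point.
    for cp in (0x2F00, 0x18800, 0x10FFFF):
        print(f"U+{cp:04X} is in plane {plane_of(cp)}")
```

This shows why the Kangxi, CJK supplement and Yi radical blocks sit in plane 0, while the Tangut Components block at U+18800 lies in plane 1.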

Radical 3: Kangxi radical

Radical 3 or radical dot (丶部) meaning "to indicate an end" is one of six of the 214 Kangxi radicals that are composed of only one stroke.

The Table of Indexing Chinese Character Components is a lexicographic tool used to order the Chinese characters in mainland China. The specification is also known as GF 0011-2009.

Kangxi Radicals is a Unicode block, introduced in version 3.0 (1999), that encodes the 214 radicals in sequence at U+2F00–2FD5. These are specific code points intended to represent the radical qua radical, as opposed to the character consisting of the unaugmented radical; thus, U+2F00 represents radical 1 while U+4E00 represents the character meaning "one". In addition, the CJK Radicals Supplement block (U+2E80–2EFF) was introduced, encoding alternative forms taken by Kangxi radicals as they appear within specific characters. For example, ⺁ "CJK RADICAL CLIFF" (U+2E81) is a variant of ⼚ radical 27 (U+2F1A), itself identical in shape to the character consisting of unaugmented radical 27, 厂 "cliff" (U+5382).
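
The distinction between a radical code point and the corresponding ordinary character can be inspected with Python's standard `unicodedata` module. The sketch below relies on the compatibility mappings that the Unicode Character Database defines for Kangxi radicals; the helper name `unified_equivalent` is made up for the example, and this is one possible way to look up the pairing, not a method prescribed by the standard.

```python
import unicodedata


def unified_equivalent(radical):
    """Map a radical character to its compatibility equivalent via NFKC."""
    return unicodedata.normalize("NFKC", radical)


if __name__ == "__main__":
    # Radical 1 and radical 27, the two examples mentioned in the text.
    for radical in ("\u2f00", "\u2f1a"):
        target = unified_equivalent(radical)
        print(
            f"U+{ord(radical):04X} {unicodedata.name(radical)}"
            f" -> U+{ord(target):04X} {unicodedata.name(target)}"
        )
```

One consequence is that text passed through NFKC or NFKD normalization loses the radical code points, so applications that need to keep "radical as radical" should avoid compatibility normalization.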

References

  1. The Unicode Standard Version 6.2 (PDF). p. 435.
  2. The Unicode Standard Version 9.0 (PDF). p. 712.
  3. The Unicode Standard Version 6.2 (PDF). pp. 421–422.
  4. "Revised Proposal to Encode the Jurchen Characters in UCS" (PDF). 2012-01-26.