Hangul Compatibility Jamo

Last updated
Hangul Compatibility Jamo
RangeU+3130..U+318F
(96 code points)
Plane BMP
Scripts Hangul
Major alphabetsHangul
Assigned94 code points
Unused2 reserved code points
Source standards KS X 1001 (formerly KS C 5601)
Unicode version history
1.0.0 (1991)94 (+94)
Note: [1] [2]
Hangul Compatibility Jamo block in Unicode Hangul Compatibility Jamo block in Unicode.svg
Hangul Compatibility Jamo block in Unicode

Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001 (formerly KS C 5601). Its block name in Unicode 1.0 was Hangul Elements. [3]

Contents

Block

Hangul Compatibility Jamo [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+313x
U+314x
U+315x
U+316x   HF  
U+317x
U+318x
Notes
1. ^ As of Unicode version 13.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Hangul Compatibility Jamo block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
1.0.0U+3131..318E94(to be determined)
L2/07-075 N3172 Kim, Kyongsok (2006-09-27), Add annotations for existing 5 Hangul Jamo names
L2/07-247 N3257 "3", A Proposal to add new Hangul Jamo extended characters to BMP of UCS, 2007-04-23
L2/09-096 Sung, Ienup (2009-02-26), Change Proposal for Informative Alias of U+3164 HANGUL FILLER
L2/09-104 Moore, Lisa (2009-05-20), "Scripts — Informative Alias of U+3164 HANGUL FILLER", UTC #119 / L2 #216 Minutes
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

Korean language and computers Input and use of Korean on computers

The writing system of Korean, Hangul, is an alphabet organized into blocks of syllables; characters cannot just be written from left to right. Because of this, every possible syllable in Korean must either be rendered as syllable blocks by a font, or be encoded separately. Unicode uses the latter option. As an example, the syllable 하 (ha) consists of the characters ㅎ (h) and ㅏ (a), but both of them are encoded separately.

In computer programming, whitespace is any character or series of characters that represent horizontal or vertical space in typography. When rendered, a whitespace character does not correspond to a visible mark, but typically does occupy an area on a page. For example, the common whitespace symbol U+0020 SPACE represents a blank space punctuation character in text, used as a word divider in Western scripts.

Unified Hangul Code Windows character encoding for Korean

Unified Hangul Code (UHC), or Extended Wansung, also known under Microsoft Windows as Code Page 949, is the Microsoft Windows code page for the Korean language. It is an extension of Wansung Code to include all 11172 Hangul syllables present in Johab. This corresponds to the pre-composed syllables available in Unicode 2.0 and later.

New Gulim (새굴림/SaeGulRim) is a sans-serif type Unicode font designed especially for the Korean-language script, designed by HanYang System Co., Limited. It is an expanded version of Hanyang Gulrim.

Halfwidth and fullwidth forms Alternative width characters in East Asian typography

In CJK computing, graphic characters are traditionally classed into fullwidth and halfwidth characters. Unlike monospaced fonts, a halfwidth character occupies half the width of a fullwidth character, hence the name.

KPS 9566 is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.

KS X 1001, "Code for Information Interchange ", formerly called KS C 5601, is a South Korean coded character set standard to represent hangul and hanja characters on a computer.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages.

Hangul Jamo (Unicode block) Unicode character block

Hangul Jamo is a Unicode block containing positional forms of the Hangul consonant and vowel clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Hangul Jamo Extended-A Unicode character block

Hangul Jamo Extended-A is a Unicode block containing choseong forms of archaic Hangul consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Hangul Jamo Extended-B Unicode character block

Hangul Jamo Extended-B is a Unicode block containing positional forms of archaic Hangul vowel and consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Hangul Syllables is a Unicode block containing precomposed Hangul syllable blocks for modern Korean. The syllables can be directly mapped by algorithm to sequences of two or three characters in the Hangul Jamo Unicode block:

CJK Compatibility Ideographs is a Unicode block created to contain Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. Such encodings include the South Korean KS X 1001:1998, Taiwanese Big5, Japanese IBM 32, South Korean KS X 1001:2004, Japanese JIS X 0213, Japanese ARIB STD-B24 and the North Korean KPS 10721-2000 source standards.

Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the last of the Basic Multilingual Plane excepting the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Hangul, Hangul Supplementary-A, and Hangul Supplementary-B were character blocks that existed in Unicode 1.0 and 1.1, and ISO/IEC 10646-1:1993. These blocks encoded precomposed modern Hangul syllables. These three Unicode 1.x blocks were deleted and superseded by the new Hangul Syllables block (U+AC00–U+D7AF) in Unicode 2.0 and ISO/IEC 10646-1:1993 Amd. 5 (1998), and are now occupied by CJK Unified Ideographs Extension A and Yijing Hexagram Symbols. Moving or removing existing characters has been prohibited by the Unicode Stability Policy for all versions following Unicode 2.0, so the Hangul Syllables block introduced in Unicode 2.0 is immutable.

GB 12052-89, entitled Korean character coded character set for information interchange, is a Korean-language character set standard established by China. It consists of a total of 5,979 characters, and has no relationship nor compatibility with South Korea's KS X 1001 and North Korea's KPS 9566.

KS X 1002 is a South Korean character set standard that is established in order to supplement KS X 1001. It consists of a total of 7,649 characters.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.

See also