Hangul Jamo Extended-B

Last updated
Hangul Jamo Extended-B
RangeU+D7B0..U+D7FF
(80 code points)
Plane BMP
Scripts Hangul
Major alphabetsHangul
Assigned72 code points
Unused8 reserved code points
Unicode version history
5.2 (2009)72 (+72)
Code chart
Note: [1] [2]
Hangul jamo characters in Unicode Hangul jamo characters in Unicode.svg
Hangul jamo characters in Unicode

Hangul Jamo Extended-B is a Unicode block containing positional (jungseong and jongseong) forms of archaic Hangul vowel and consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Contents

Block

Hangul Jamo Extended-B [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+D7Bx
U+D7Cx
U+D7Dx
U+D7Ex
U+D7Fx
Notes
1. ^ As of Unicode version 14.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Hangul Jamo Extended-B block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
5.2U+D7B0..D7C6, D7CB..D7FB72 N3168R Kim, Kyongsok (2006-04-23), A Proposal to add new Hangul Jamo extended characters to BMP of UCS
L2/07-076 N3168Kim, Kyongsok (2006-09-27), A Proposal to add new Hangul Jamo extended characters to BMP of UCS
N3153 (pdf, doc)Umamaheswaran, V. S. (2007-02-16), "M49.23", Unconfirmed minutes of WG 2 meeting 49 AIST, Akihabara, Tokyo, Japan; 2006-09-25/29
L2/07-103 N3242 Proposed allocation of Old Hangul Jamos in the BMP, 2007-04-16
L2/07-247 N3257 "2", A Proposal to add new Hangul Jamo extended characters to BMP of UCS, 2007-04-23
L2/07-118R2 Moore, Lisa (2007-05-23), "111-C17", UTC #111 Minutes
L2/07-268 N3253 (pdf, doc)Umamaheswaran, V. S. (2007-07-26), "M50.34", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

Korean language and computers Input and use of Korean on computers

The writing system of Korean, Hangul, is an alphabet organized into blocks of syllables; characters cannot be written from left to right. Because of this, every possible syllable in Korean must either be rendered as syllable blocks by a font, or be encoded separately. Unicode uses the latter option. As an example, the syllable 하 (ha) consists of the characters ㅎ (h) and ㅏ (a), but both of them are encoded separately.

In computer programming, whitespace is any character or series of characters that represent horizontal or vertical space in typography. When rendered, a whitespace character does not correspond to a visible mark, but typically does occupy an area on a page. For example, the common whitespace symbol U+0020 SPACE represents a blank space punctuation character in text, used as a word divider in Western scripts.

Unified Hangul Code Windows character encoding for Korean

Unified Hangul Code (UHC), or Extended Wansung, also known under Microsoft Windows as Code Page 949, is the Microsoft Windows code page for the Korean language. It is an extension of Wansung Code to include all 11172 non-partial Hangul syllables present in Johab. This corresponds to the pre-composed syllables available in Unicode 2.0 and later.

New Gulim (새굴림/SaeGulRim) is a sans-serif type Unicode font designed especially for the Korean-language script, designed by HanYang System Co., Limited. It is an expanded version of Hanyang Gulrim.

Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting standard character sets, which often included similar or identical characters.

Halfwidth and fullwidth forms Alternative width characters in East Asian typography

In CJK computing, graphic characters are traditionally classed into fullwidth and halfwidth characters. Unlike monospaced fonts, a halfwidth character occupies half the width of a fullwidth character, hence the name.

In the Unicode standard, a plane is a continuous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 13.0, seven of the planes have assigned code points (characters), and five are named.

KPS 9566 is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.

KS X 1001, "Code for Information Interchange ", formerly called KS C 5601, is a South Korean coded character set standard to represent hangul and hanja characters on a computer.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Hangul Jamo (Unicode block) Unicode character block

Hangul Jamo is a Unicode block containing positional forms of the Hangul consonant and vowel clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Hangul Jamo Extended-A Unicode character block

Hangul Jamo Extended-A is a Unicode block containing choseong forms of archaic Hangul consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode, specifically syllables that are not used in standard modern Korean.

Hangul Syllables is a Unicode block containing precomposed Hangul syllable blocks for modern Korean. The syllables can be directly mapped by algorithm to sequences of two or three characters in the Hangul Jamo Unicode block:

Hangul Compatibility Jamo Unicode character block

Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001. Its block name in Unicode 1.0 was Hangul Elements.

Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

GB 12052-89, entitled Korean character coded character set for information interchange, is a Korean-language character set standard established by China. It consists of a total of 5,979 characters, and has no relationship nor compatibility with South Korea's KS X 1001 and North Korea's KPS 9566.

KS X 1002 is a South Korean character set standard that is established in order to supplement KS X 1001. It consists of a total of 7,649 characters.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.