Bopomofo Extended

Last updated
Bopomofo Extended
RangeU+31A0..U+31BF
(32 code points)
Plane BMP
Scripts Bopomofo
Major alphabetsphonetic Min Nan and Hakka
Assigned32 code points
Unused0 reserved code points
Unicode version history
3.024 (+24)
6.027 (+3)
13.032 (+5)
Note: [1] [2]

Bopomofo Extended is a Unicode block containing additional Bopomofo characters for writing phonetic Min Nan, Hakka Chinese, Cantonese, Hmu, and Ge. The basic set of Bopomofo characters can be found in the Bopomofo block.

Contents

Block

Bopomofo Extended [1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+31Ax
U+31Bx
Notes
1. ^ As of Unicode version 13.0

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Bopomofo Extended block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
3.0U+31A0..31B724L2/98-090 N1713R Proposal to add 24 extended Bopomofo and 2 modifier letters for Minnan and Hakka, 1998-03-19
L2/98-158 Aliprand, Joan; Winkler, Arnold (1998-05-26), "Bopomofo Extensions", Draft Minutes – UTC #76 & NCITS Subgroup L2 #173 joint meeting, Tredyffrin, Pennsylvania, April 20-22, 1998
L2/98-286 N1703 Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.25", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
L2/98-321 N1905 Revised text of 10646-1/FPDAM 23, AMENDMENT 23: Bopomofo Extended and other characters, 1998-10-22
L2/18-255 N4980 Tung, Bobby; Ko, But; Wei, Selena (2018-06-13), Proposal to Fix Glyphs in Bopomofo Extended Block and Encode one Bopomofo Letter
6.0U+31B8..31BA3 L2/09-049 N3570 West, Andrew (2009-01-25), Proposal to encode three Bopomofo letters for Hmu and Ge
L2/09-003R Moore, Lisa (2009-02-12), "B.15.15", UTC #118 / L2 #215 Minutes
L2/09-234 N3603 (pdf, doc)Umamaheswaran, V. S. (2009-07-08), "M54.13f", Unconfirmed minutes of WG 2 meeting 54
13.0U+31BB1 L2/18-255 N4980 Tung, Bobby; Ko, But; Wei, Selena (2018-06-13), Proposal to Fix Glyphs in Bopomofo Extended Block and Encode one Bopomofo Letter
L2/18-183 Moore, Lisa (2018-11-20), "C.21", UTC #156 Minutes
N5020 (pdf, doc)Umamaheswaran, V. S. (2019-01-11), "10.3.16", Unconfirmed minutes of WG 2 meeting 67
U+31BC..31BF4 L2/19-100 Chan, Eiso (2019-04-08), Preliminary proposal to encode four extended Bopomofo letters for Cantonese
L2/19-173 Anderson, Deborah; et al. (2019-04-29), "18", Recommendations to UTC #159 April-May 2019 on Script Proposals
L2/19-177R N5052 Yang, Ben; Chan, Eiso (2019-05-02), Proposal to encode Cantonese Bopomofo Characters
L2/19-122 Moore, Lisa (2019-05-08), "Consensus 159-C21", UTC #159 Minutes
N5122 "M68.10 and M68.19", Unconfirmed minutes of WG 2 meeting 68, 2019-12-31
  1. Proposed code points and characters names may differ from final code points and names

See also

Related Research Articles

Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0-25FF.

Miscellaneous Technical is a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language, and academic professions. For example:

Bopomofo, also called Zhuyin or Mandarin Phonetic Symbols, is the major Chinese transliteration system for Mandarin Chinese and other related languages and dialects which is nowadays most commonly used in Taiwanese Mandarin. It is also used to transcribe other varieties of Chinese, particularly other varieties of Standard Chinese and related Mandarin dialects, as well as Taiwanese Hokkien.

Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 and also legacy characters from the ISO 6937 standard.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages.

Greek Extended is a Unicode block containing the accented vowels necessary for writing polytonic Greek. The regular, unaccented Greek characters as well as the characters with tonos and diaeresis can be found in the Greek and Coptic block. Greek Extended was encoded in version 1.1 of the Unicode Standard. As an alternative to Greek Extended, combining characters can be used to represent the tones and breath marks of polytonic Greek.

Devanagari is a Unicode block containing characters for writing languages such as Hindi, Marathi, Sindhi, Nepali, and Sanskrit, among others. In its original incarnation, the code points U+0900..U+0954 were a direct copy of the characters A0-F4 from the 1988 ISCII standard. The Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Gurmukhi is a Unicode block containing characters for the Punjabi language, as it is written in India. In its original incarnation, the code points U+0A02..U+0A4C were a direct copy of the Gurmukhi characters A2-EC from the 1988 ISCII standard. The Devanagari, Bengali, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Myanmar is a Unicode block containing characters for the Burmese, Mon, Karen, Kayah, Shan, and Palaung languages of Myanmar. It is also used to write Pali and Sanskrit in Myanmar.

Vedic Extensions is a Unicode block containing characters for representing tones and other vedic symbols in Devanagari and other Indic scripts. Related symbols are defined in two other blocks: Devanagari (U+0900–U+097F) and Devanagari Extended (U+A8E0–U+A8FF).

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Bopomofo is a Unicode block containing phonetic characters for Chinese. The original set of 40 Bopomofo characters is based on the Chinese standard GB 2312. Additional Bopomofo characters can be found in the Bopomofo Extended block.

Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. During the unification with ISO 10646 for version 1.1, the Japanese Industrial Standard Symbol was reassigned from the code point U+32FF at the end of the block to U+3004. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alphanumerics: a few unit abbreviations, circled numbers from 21 to 50, and circled multiples of 10 from 10 to 80 enclosed in black squares.

Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the last of the Basic Multilingual Plane excepting the short Specials block at U+FFF0–FFFF.

Variation Selectors is a Unicode block containing 16 Variation Selector format. They are used to specify a specific glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS2, VS3, VS15 and VS16 have been defined; VS15 and VS16 are reserved to determine whether or not a character should be displayed as an emoji.

Combining Diacritical Marks Extended is a Unicode block containing diacritical marks used in German dialectology (Teuthonista).

Geometric Shapes Extended is a Unicode block containing webdings/wingdings symbols, mostly different weights of squares, crosses, and saltires, and different weights of variously spoked asterisks and stars. The block was introduced with Unicode version 7.0.

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), Sakha and Americanist usage.

Supplemental Symbols and Pictographs is a Unicode block containing emoji characters. It extends the set of symbols included in the Miscellaneous Symbols and Pictographs block.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.