Vertical Forms | |
---|---|
Range | U+FE10..U+FE1F (16 code points) |
Plane | BMP |
Scripts | Common |
Symbol sets | Vertical punctuation |
Assigned | 10 code points |
Unused | 6 reserved code points |
Source standards | GB 18030 |
Unicode version history | |
4.1 (2005) | 10 (+10) |
Vertical Forms is a Unicode block containing vertical punctuation characters encoded for compatibility with the Chinese standard GB 18030.
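In the Unicode specification, the name of U+FE18 ︘ PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET contains a typo: "BRACKET" is misspelt as "BRAKCET". [3] Because character names cannot be changed once assigned, the correct spelling is provided through a name alias rather than by renaming the character.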
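The misspelt name and its corrected form can be checked with Python's standard `unicodedata` module. This is a minimal sketch, assuming Python 3.3 or later, where `unicodedata.lookup` also accepts formal name aliases; the variable names are illustrative only.

```python
import unicodedata

ch = "\uFE18"

# The formal character name keeps the original misspelling.
official = unicodedata.name(ch)
print(official)

# The corrected spelling is expected to resolve to the same character
# through the name alias mechanism of the Unicode Character Database.
corrected = official.replace("BRAKCET", "BRACKET")
resolved = unicodedata.lookup(corrected)
print(f"U+{ord(resolved):04X}")
```

Looking the character up by either spelling should therefore give the same code point, while `unicodedata.name` continues to report the misspelt formal name.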
Vertical Forms [1] [2] (Official Unicode Consortium code chart, PDF)

| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| U+FE1x | ︐ | ︑ | ︒ | ︓ | ︔ | ︕ | ︖ | ︗ | ︘ | ︙ | | | | | | |
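The split between assigned and reserved code points in the chart can be reproduced programmatically. The sketch below is an illustration, not part of the Unicode documentation: it assumes that unassigned code points report general category `Cn` in Python's `unicodedata` module, and its results depend on the Unicode database bundled with the interpreter. It also prints the NFKC form of each assigned character, which shows the ordinary punctuation mark that each vertical form is a compatibility variant of.

```python
import unicodedata

BLOCK_START, BLOCK_END = 0xFE10, 0xFE1F  # block range as given in the infobox

assigned = []
reserved = []
for cp in range(BLOCK_START, BLOCK_END + 1):
    ch = chr(cp)
    # Unassigned (reserved) code points have general category "Cn".
    if unicodedata.category(ch) == "Cn":
        reserved.append(cp)
    else:
        assigned.append(cp)
        # NFKC applies the compatibility decomposition of the character.
        folded = unicodedata.normalize("NFKC", ch)
        print(f"U+{cp:04X} {unicodedata.name(ch)} -> {folded!r}")

print(len(assigned), "assigned,", len(reserved), "reserved")
```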
The following Unicode-related documents record the purpose and process of defining specific characters in the Vertical Forms block:
| Version | Final code points | Count | L2 ID | WG2 ID | Document |
|---|---|---|---|---|---|
| 4.1 | U+FE10..FE19 | 10 | L2/03-411 | | Goldsmith, Deborah; Muller, Eric (2003-10-31), Unencoded chars in GB 18030 & HK-SCS |
| | | | L2/04-161R | N2807 | Suignard, Michel; Muller, Eric; Jenkins, John (2004-06-17), HKSCS and GB 18030 PUA characters, background document |
| | | | L2/04-263 | N2808 | Suignard, Michel (2004-06-17), HKSCS and GB 18030 PUA characters, request for additional characters and related information |
| | | | L2/05-137 | | Freytag, Asmus (2005-05-10), Handling "defective" names |
| | | | L2/05-108R | | Moore, Lisa (2005-08-26), "Consensus 103-C7", UTC #103 Minutes, Create a "Normative Name Alias" property and file in the UCD. Populate the property with names from the sections "Typos" and "Bad or misleading names" from document L2/05-137. |
A Unicode block is one of several contiguous ranges of numeric character codes of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.
The Chinese, Japanese and Korean (CJK) scripts share a common background, and their shared characters are collectively known as CJK characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 15.1, Unicode defines a total of 97,680 such characters.
A numeral is a character that denotes a number. The decimal digits 0–9 are used widely in writing systems throughout the world; however, the graphemes representing them differ widely. Unicode therefore includes 22 different sets of graphemes for the decimal digits, as well as various decimal points, thousands separators, negative signs, etc. Unicode also includes several non-decimal numerals such as Aegean numerals, Roman numerals, counting rod numerals, Mayan numerals, Cuneiform numerals and ancient Greek numerals. There is also a large number of typographical variations of the Western Arabic numerals provided for specialized mathematical use and for compatibility with earlier character sets, such as ² or ②, and composite characters such as ½.
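The numeric value that Unicode associates with such characters can be read from the character database. The following is a small sketch using Python's `unicodedata.numeric`; the sample characters are chosen here for illustration and are not taken from any particular list.

```python
import unicodedata

# A Western digit, an Arabic-Indic digit, a superscript, a circled digit,
# a vulgar fraction and a Roman numeral.
samples = ["7", "\u0663", "\u00B2", "\u2461", "\u00BD", "\u216B"]

values = {}
for ch in samples:
    # numeric() returns the Numeric_Value property as a float.
    values[ch] = unicodedata.numeric(ch)
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}: {values[ch]}")
```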
In CJK computing, graphic characters are traditionally classed into fullwidth and halfwidth characters. Unlike in a conventional monospaced font, where every character has the same width, a halfwidth character occupies half the width of a fullwidth character, hence the name.
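Unicode records this distinction in the East Asian Width property, which Python exposes as `unicodedata.east_asian_width`. The sketch below is illustrative only; the four sample characters are an assumption made for the example.

```python
import unicodedata

samples = [
    "A",        # LATIN CAPITAL LETTER A
    "\uFF21",   # FULLWIDTH LATIN CAPITAL LETTER A
    "\u30A2",   # KATAKANA LETTER A
    "\uFF71",   # HALFWIDTH KATAKANA LETTER A
]

# Property values: "Na" narrow, "F" fullwidth, "W" wide, "H" halfwidth.
widths = {ch: unicodedata.east_asian_width(ch) for ch in samples}
for ch, width in widths.items():
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}: {width}")
```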
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block whose characters are encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase letters of the English alphabet and one further control character (DEL, U+007F).
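The one-byte property can be verified directly by encoding code points to UTF-8. This is a minimal sketch; the helper name `utf8_length` is hypothetical and introduced only for this example.

```python
def utf8_length(cp: int) -> int:
    """Return the number of bytes used to encode code point cp in UTF-8."""
    return len(chr(cp).encode("utf-8"))

# Every code point in Basic Latin (U+0000..U+007F) takes a single byte.
basic_latin_lengths = {utf8_length(cp) for cp in range(0x0000, 0x0080)}
print(basic_latin_lengths)

# The first code point after the block already needs two bytes.
print(utf8_length(0x0080))
```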
The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1, bytes 0x80 to 0xFF (U+0080 to U+00FF). The C1 controls (U+0080–U+009F) are not graphic characters. The block contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
KPS 9566 is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The 1997 edition specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added further encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.
KS X 1001, "Code for Information Interchange", formerly called KS C 5601, is a South Korean coded character set standard to represent Hangul and Hanja characters on a computer.
The Unicode Standard assigns various properties to each Unicode character and code point.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001. Its block name in Unicode 1.0 was Hangul Elements.
CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. However, it also contains 12 unified ideographs sourced from Japanese character sets from IBM.
CJK Compatibility Forms is a Unicode block containing vertical glyph variants for East Asian compatibility. Its block name in Unicode 1.0 was CNS 11643 Compatibility, in reference to CNS 11643.
CJK Compatibility is a Unicode block containing square symbols encoded for compatibility with East Asian character sets. In Unicode 1.0, it was divided into two blocks, named CJK Squared Words (U+3300–U+337F) and CJK Squared Abbreviations (U+3380–U+33FF). The square forms can have different presentations when they are used in horizontal or vertical text. For example, the characters U+333E ㌾ SQUARE BORUTO and U+3327 ㌧ SQUARE TON should look different in horizontal text and in vertical right-to-left text: ㌧㌾
General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.
Optical Character Recognition is a Unicode block containing signal characters for OCR and MICR standards.
Box Drawing is a Unicode block containing characters for compatibility with legacy graphics standards that contained characters for making bordered charts and tables, i.e. box-drawing characters. Its block name in Unicode 1.0 was Form and Chart Components.
Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.
Small Form Variants is a Unicode block containing small punctuation characters for compatibility with the Chinese National Standard CNS 11643. Its block name in Unicode 1.0 was simply Small Variants.
Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.
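The relationship between these width variants and their ordinary counterparts is recorded as compatibility decompositions. As a hedged illustration, the sketch below uses NFKC normalization from Python's `unicodedata` module to fold width variants to their ordinary forms. Note that this folding is a one-way convenience and is not the lossless round-trip conversion to legacy encodings described above; the sample strings are assumptions made for the example.

```python
import unicodedata

fullwidth = "\uFF21\uFF42\uFF43\uFF11"   # fullwidth "A", "b", "c", "1"
halfwidth_kana = "\uFF71"                # HALFWIDTH KATAKANA LETTER A

# NFKC replaces width variants with their ordinary equivalents.
folded_latin = unicodedata.normalize("NFKC", fullwidth)
folded_kana = unicodedata.normalize("NFKC", halfwidth_kana)

print(folded_latin)
print(f"U+{ord(folded_kana):04X} {unicodedata.name(folded_kana)}")
```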