Shorthand Format Controls | |
---|---|
Range | U+1BCA0..U+1BCAF (16 code points) |
Plane | SMP |
Scripts | Common |
Major alphabets | Shorthands |
Assigned | 4 code points |
Unused | 12 reserved code points |
Unicode version history | |
7.0 (2014) | 4 (+4) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Shorthand Format Controls is a Unicode block containing four formatting characters for representing shorthands in Unicode. [3]
Shorthand Format Controls [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1BCAx | | | | | ||||||||||||
Notes |
Being invisible controls, they have no visible glyph but can have a representation.
The following Unicode-related documents record the purpose and process of defining specific characters in the Shorthand Format Controls block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
7.0 | U+1BCA0..1BCA3 | 4 | L2/09-184 | Anderson, Van (2009-05-03), Proposal to include Chinook Pipa script in UCS | |
L2/09-184.1 | Anderson, Van (2009-05-03), Proposal to include Chinook Pipa script in UCS [zip format of the HTML version] | ||||
L2/09-283 | Anderson, Van (2009-07-24), Proposal to include Chinook Pipa script in UCS | ||||
L2/09-364 | Anderson, Van (2009-10-06), Proposal to Include Duployan Shorthands and Chinook Script in UCS | ||||
L2/09-364.1 | Anderson, Van (2009-10-06), Proposal to Include Duployan Shorthands and Chinook Script in UCS [zip format of the HTML version] | ||||
L2/09-364.2 | Anderson, Van (2009-10-26), Proposal to Include Duployan Shorthands and Chinook Script in UCS [fonts and data] | ||||
L2/10-027 | Anderson, Van (2010-01-25), Duployan Charts | ||||
L2/10-028 | Anderson, Van (2010-01-25), Duployan Documentation | ||||
L2/10-026 | Anderson, Van (2010-01-31), Proposal to include Duployan Shorthands and Chinook script in UCS | ||||
L2/10-159 | Anderson, Van (2010-05-03), Proposal to include Duployan Shorthands and Chinook script in UCS | ||||
L2/10-201 | Anderson, Van (2010-05-12), Guide to Duployan Shorthands and Stenographies | ||||
L2/10-202 | Anderson, Van (2010-05-12), Duployan Texts | ||||
L2/10-272R2 | N3895 | Anderson, Van (2010-08-12), Proposal to include Duployan Shorthands and Chinook script and Shorthand Format Controls in UCS | |||
L2/10-221 | Moore, Lisa (2010-08-23), "C.21", UTC #124 / L2 #221 Minutes | ||||
L2/10-347 | N3908 | Everson, Michael (2010-09-21), On ordering and the proposed Duployan script for shorthands and Chinook | |||
L2/10-352 | N3922 | Anderson, Van (2010-09-21), Response to Irish NB comments N3908 | |||
L2/10-364 | N3931 | Everson, Michael (2010-10-03), Further discussion on ordering and the proposed Duployan script | |||
L2/10-383 | N3940 | Anderson, Van (2010-10-05), Quick response to Irish NB comments N3931 | |||
L2/11-215 | N4088 | Anderson, Van; Everson, Michael (2011-05-30), Resolving chart and collation order for the Duployan script | |||
L2/11-303 | Anderson, Van (2011-07-19), Proposal to include Duployan Shorthands and Chinook script and Shorthand Format Controls in UCS, as approved by WG2 | ||||
N4103 | "11.1.5 Duployan Shorthands and Chinook script and Shorthand Format Controls in UCS", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | ||||
L2/14-134 | Davis, Mark (2014-05-05), Fix to Script=Inherited values | ||||
L2/14-100 | Moore, Lisa (2014-05-13), "B.14.3", UTC #139 Minutes | ||||
|
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese.
In typography, a dingbat is an ornament, specifically, a glyph used in typesetting, often employed to create box frames, or as a dinkus. Some of the dingbat symbols have been used as signature marks or used in bookbinding to order sections.
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.
A Unicode block is one of several contiguous ranges of numeric character codes of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.
Braille ASCII is a subset of the ASCII character set which uses 64 of the printable ASCII characters to represent all possible dot combinations in six-dot braille. It was developed around 1969 and, despite originally being known as North American Braille ASCII, it is now used internationally.
The Kamloops Wawa was a newspaper published by Father Jean-Marie-Raphaël Le Jeune, superior of the Roman Catholic Diocese of Kamloops in British Columbia, Canada, beginning May 25, 1891, and continuing into the 1900s. The contents of the Kamloops Wawa were near-entirely written using Le Jeune's adaptation of the French Duployan shorthand writing system, called "chinuk pipa" in Chinook Jargon itself. Most of the texts of the Kamloops Wawa were composed in the local variant of Chinook Jargon with some passages and articles in Nlaka'pamuxtsin, Secwepmectsin, St'at'imcets and other traditional languages. Some series of articles, however, included translations into Chinook Jargon of classical texts from Latin, such as the Seven Kings of Rome, though most content was either community news or translations of the mass or other liturgical materials.
The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set, is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate, and transmit—interchange—UCS-encoded text strings from one to another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, which can result in the same sequence of codes having multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.
In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 15.1, five of the planes have assigned code points (characters), and seven are named.
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.
The Unicode Standard assigns various properties to each Unicode character and code point.
The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country codes in a way that allows optional special treatment.
The Duployan shorthand, or Duployan stenography, was created by Father Émile Duployé in 1860 for writing French. Since then, it has been expanded and adapted for writing English, German, Spanish, Romanian, Latin, Danish, and Chinook Jargon. The Duployan stenography is classified as a geometric, alphabetic stenography and is written left-to-right in connected stenographic style. The Duployan shorthands, including Chinook writing, Pernin's Universal Phonography, Perrault's English Shorthand, the Sloan-Duployan Modern Shorthand, and Romanian stenography, were included as a single script in version 7.0 of the Unicode Standard / ISO 10646
Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.
Mongolian is a Unicode block containing characters for dialects of Mongolian, Manchu, and Sibe languages. It is traditionally written in vertical lines Top-Down, right across the page, although the Unicode code charts cite the characters rotated to horizontal orientation as this is the orientation of glyphs in a font that supports layout in vertical orientation.
Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001. Its block name in Unicode 1.0 was Hangul Elements.
Duployan is a Unicode block containing characters for various Duployan shorthands, including French Duployéan, Chinook Writing, Romanian shorthand, and the English Sloan-Duployan, Pernin, and Perrault shorthands. It is the first block of shorthand characters included in the Unicode Standard.
Egyptian Hieroglyph Format Controls is a Unicode block containing formatting characters that enable full formatting of quadrats for Egyptian hieroglyphs.