Shorthand Format Controls

Last updated
Shorthand Format Controls
RangeU+1BCA0..U+1BCAF
(16 code points)
Plane SMP
Scripts Common
Major alphabetsShorthands
Assigned4 code points
Unused12 reserved code points
Unicode version history
7.0 (2014)4 (+4)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Shorthand Format Controls is a Unicode block containing four formatting characters for representing shorthands in Unicode. [3]

Contents

Block

Shorthand Format Controls [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1BCAx 𛲠  𛲡  𛲢  𛲣 
Notes
1. ^ As of Unicode version 15.1
2. ^ Grey areas indicate non-assigned code points

Being invisible controls, they have no visible glyph but can have a representation.

Romanian affix -tsion-
Sloan contracted ending oo/o + ZWSP
Sloan contracted ending uh/au/aui + ZWSP

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Shorthand Format Controls block:

See also

Related Research Articles

Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese.

<span class="mw-page-title-main">Dingbat</span> Typographic symbol class

In typography, a dingbat is an ornament, specifically, a glyph used in typesetting, often employed to create box frames, or as a dinkus. Some of the dingbat symbols have been used as signature marks or used in bookbinding to order sections.

In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.

ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.

A Unicode block is one of several contiguous ranges of numeric character codes of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.

Braille ASCII is a subset of the ASCII character set which uses 64 of the printable ASCII characters to represent all possible dot combinations in six-dot braille. It was developed around 1969 and, despite originally being known as North American Braille ASCII, it is now used internationally.

<i>Kamloops Wawa</i> Missionary newspaper from British Columbia, Canada

The Kamloops Wawa was a newspaper published by Father Jean-Marie-Raphaël Le Jeune, superior of the Roman Catholic Diocese of Kamloops in British Columbia, Canada, beginning May 25, 1891, and continuing into the 1900s. The contents of the Kamloops Wawa were near-entirely written using Le Jeune's adaptation of the French Duployan shorthand writing system, called "chinuk pipa" in Chinook Jargon itself. Most of the texts of the Kamloops Wawa were composed in the local variant of Chinook Jargon with some passages and articles in Nlaka'pamuxtsin, Secwepmectsin, St'at'imcets and other traditional languages. Some series of articles, however, included translations into Chinook Jargon of classical texts from Latin, such as the Seven Kings of Rome, though most content was either community news or translations of the mass or other liturgical materials.

<span class="mw-page-title-main">Universal Character Set characters</span> Complete list of the characters available on most computers

The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set, is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate, and transmit—interchange—UCS-encoded text strings from one to another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, which can result in the same sequence of codes having multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.

In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 15.1, five of the planes have assigned code points (characters), and seven are named.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Unicode Standard assigns various properties to each Unicode character and code point.

The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country codes in a way that allows optional special treatment.

<span class="mw-page-title-main">Duployan shorthand</span> Shorthand writing system

The Duployan shorthand, or Duployan stenography, was created by Father Émile Duployé in 1860 for writing French. Since then, it has been expanded and adapted for writing English, German, Spanish, Romanian, Latin, Danish, and Chinook Jargon. The Duployan stenography is classified as a geometric, alphabetic stenography and is written left-to-right in connected stenographic style. The Duployan shorthands, including Chinook writing, Pernin's Universal Phonography, Perrault's English Shorthand, the Sloan-Duployan Modern Shorthand, and Romanian stenography, were included as a single script in version 7.0 of the Unicode Standard / ISO 10646

Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.

Mongolian is a Unicode block containing characters for dialects of Mongolian, Manchu, and Sibe languages. It is traditionally written in vertical lines Top-Down, right across the page, although the Unicode code charts cite the characters rotated to horizontal orientation as this is the orientation of glyphs in a font that supports layout in vertical orientation.

<span class="mw-page-title-main">Hangul Compatibility Jamo</span> Unicode character block

Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001. Its block name in Unicode 1.0 was Hangul Elements.

Duployan is a Unicode block containing characters for various Duployan shorthands, including French Duployéan, Chinook Writing, Romanian shorthand, and the English Sloan-Duployan, Pernin, and Perrault shorthands. It is the first block of shorthand characters included in the Unicode Standard.

Egyptian Hieroglyph Format Controls is a Unicode block containing formatting characters that enable full formatting of quadrats for Egyptian hieroglyphs.

References