Osage (Unicode block)

Osage
Range: U+104B0..U+104FF (80 code points)
Plane: SMP
Scripts: Osage
Assigned: 72 code points
Unused: 8 reserved code points
Unicode version history
9.0: 72 (+72)
Note: [1] [2]

Osage is a Unicode block containing characters from the Osage alphabet, which was devised in 2006 for writing the Osage language spoken by the Osage people of Oklahoma, USA. [3]

Osage [1] [2]
Official Unicode Consortium code chart (PDF)
          0 1 2 3 4 5 6 7 8 9 A B C D E F
U+104Bx   𐒰 𐒱 𐒲 𐒳 𐒴 𐒵 𐒶 𐒷 𐒸 𐒹 𐒺 𐒻 𐒼 𐒽 𐒾 𐒿
U+104Cx   𐓀 𐓁 𐓂 𐓃 𐓄 𐓅 𐓆 𐓇 𐓈 𐓉 𐓊 𐓋 𐓌 𐓍 𐓎 𐓏
U+104Dx   𐓐 𐓑 𐓒 𐓓 - - - - 𐓘 𐓙 𐓚 𐓛 𐓜 𐓝 𐓞 𐓟
U+104Ex   𐓠 𐓡 𐓢 𐓣 𐓤 𐓥 𐓦 𐓧 𐓨 𐓩 𐓪 𐓫 𐓬 𐓭 𐓮 𐓯
U+104Fx   𐓰 𐓱 𐓲 𐓳 𐓴 𐓵 𐓶 𐓷 𐓸 𐓹 𐓺 𐓻 - - - -
Notes
1. ^ As of Unicode version 13.0
2. ^ A hyphen (-) indicates an unassigned code point
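
The block boundaries and the two assigned runs shown in the chart can be checked with a few lines of code. The following is a minimal Python sketch, not part of any Unicode library: the constant and function names (`OSAGE_BLOCK`, `OSAGE_ASSIGNED`, `in_osage_block`, `is_assigned_osage`, `plane`) are hypothetical, and the ranges are hard-coded from the chart rather than read from the Unicode Character Database.

```python
# Illustrative helpers only; names are assumptions, ranges come from the chart.

# Whole block: U+104B0..U+104FF
OSAGE_BLOCK = range(0x104B0, 0x104FF + 1)

# Assigned runs: U+104B0..U+104D3 and U+104D8..U+104FB
OSAGE_ASSIGNED = (
    range(0x104B0, 0x104D3 + 1),
    range(0x104D8, 0x104FB + 1),
)


def in_osage_block(ch: str) -> bool:
    """Return True if the single character ch lies inside the Osage block."""
    return ord(ch) in OSAGE_BLOCK


def is_assigned_osage(ch: str) -> bool:
    """Return True if ch is one of the assigned Osage code points."""
    cp = ord(ch)
    return any(cp in run for run in OSAGE_ASSIGNED)


def plane(ch: str) -> int:
    """Return the Unicode plane number (0 = BMP, 1 = SMP, ...)."""
    return ord(ch) >> 16


block_size = len(OSAGE_BLOCK)                      # 80 code points
assigned = sum(len(run) for run in OSAGE_ASSIGNED)  # 36 + 36 = 72
reserved = block_size - assigned                    # 8

print(block_size, assigned, reserved)   # 80 72 8
print(plane("\U000104B0"))              # 1, i.e. the SMP
print(is_assigned_osage("\U000104D4"))  # False: inside the block but unassigned
```

Hard-coding the ranges keeps the sketch independent of the Unicode version bundled with a given Python build; a program that needs authoritative data would consult the Unicode Character Database instead.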

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Osage block:

Version: 9.0
Final code points [a]: U+104B0..104D3, 104D8..104FB
Count: 72

L2 ID | WG2 ID | Document
L2/14-068 | N4548 | Everson, Michael; Lookout, Herman Mongrain; Pratt, Cameron (2014-02-20), Preliminary proposal to encode the Osage script in the UCS
L2/14-175 | N4587 | Everson, Michael; Lookout, Herman Mongrain; Pratt, Cameron (2014-07-30), Proposal to encode Latin characters for Osage in the UCS
- | N4553 (pdf, doc) | Umamaheswaran, V. S. (2014-09-16), "10.4.2", Minutes of WG 2 meeting 62 Adobe, San Jose, CA, USA
L2/14-214 | N4619 | Everson, Michael; Lookout, Herman Mongrain; Pratt, Cameron (2014-09-21), Final proposal to encode the Osage script in the UCS
L2/14-177 | - | Moore, Lisa (2014-10-17), "Proposal to encode Latin characters for Osage (C.4.2)", UTC #140 Minutes
L2/14-268R | - | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Iancu, Laurențiu; Glass, Andrew; Constable, Peter; Suignard, Michel (2014-10-27), "10. Osage", Recommendations to UTC #141 October 2014 on Script Proposals
L2/14-250 | - | Moore, Lisa (2014-11-10), "Consensus 141-C21", UTC #141 Minutes, Accept 72 Osage letters at U+104B0..U+104FB, in block Osage U+104B0..U+104FF, with properties as documented in L2/14-214, for encoding in a future version of the standard.
L2/16-052 | N4603 (pdf, doc) | Umamaheswaran, V. S. (2015-09-01), "M63.07", Unconfirmed minutes of WG 2 meeting 63

  a. ^ Proposed code points and character names may differ from the final code points and names.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. Michael Everson; Herman Mongrain Lookout; Cameron Pratt (2014-09-21). "Final proposal to encode the Osage script in the UCS" (PDF). ISO/IEC JTC1/SC2/WG2, Document N4619. Retrieved 2015-01-10.