Glagolitic (Unicode block)

Glagolitic
Range: U+2C00..U+2C5F (96 code points)
Plane: BMP
Scripts: Glagolitic
Major alphabets: Old Slavonic
Assigned: 94 code points
Unused: 2 reserved code points
Unicode version history
4.1: 94 (+94)
Note: [1] [2]

Glagolitic is a Unicode block containing the characters of the script invented by Saint Cyril for translating scripture into Slavonic. The Glagolitic script is the precursor of Cyrillic.

In Unicode, a block is defined as one contiguous range of code points. Blocks are uniquely named and do not overlap. Each block starts at a code point of the form hhh0 and ends at a code point of the form hhhF, so its size is a multiple of 16. A block can explicitly include code points that are unassigned, as well as noncharacters. Code points that do not belong to any named block, e.g. those in the unassigned planes 3–13, have the value block="No_block".
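The block rules above can be illustrated with a minimal Python sketch. The constant and function names are hypothetical, chosen only for this example; the range itself is the U+2C00..U+2C5F range given for this block.

```python
# Bounds of the Glagolitic block as stated in this article (inclusive).
GLAGOLITIC_START = 0x2C00
GLAGOLITIC_END = 0x2C5F


def is_block_aligned(start: int, end: int) -> bool:
    """Check the hhh0 / hhhF form: start ends in hex 0, end ends in hex F."""
    return start % 16 == 0 and end % 16 == 15


def in_glagolitic_block(ch: str) -> bool:
    """True if the single character ch falls inside the block's range.

    Membership depends only on the range, so unassigned code points
    inside the range still belong to the block.
    """
    return GLAGOLITIC_START <= ord(ch) <= GLAGOLITIC_END


# Number of code points in the block, assigned or not.
block_size = GLAGOLITIC_END - GLAGOLITIC_START + 1
```

Because membership is a pure range check, a code point such as U+2C2F counts as part of the block even where the article lists it as unassigned.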

The Glagolitic script is the oldest known Slavic alphabet. It is generally agreed to have been created in the 9th century by Saint Cyril, a Byzantine monk from Thessaloniki. He and his brother, Saint Methodius, were sent by the Byzantine Emperor Michael III in 863 to Great Moravia to spread Christianity among the West Slavs in the area. The brothers decided to translate liturgical books into the Old Slavic language, which the general population could understand. Because the words of that language could not easily be written in either the Greek or the Latin alphabet, Cyril decided to invent a new script, Glagolitic, which he based on the local dialect of the Slavic tribes from the Byzantine theme of Thessalonica.

Block

Glagolitic [1] [2]
Official Unicode Consortium code chart (PDF)
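          0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
U+2C0x    Ⰰ  Ⰱ  Ⰲ  Ⰳ  Ⰴ  Ⰵ  Ⰶ  Ⰷ  Ⰸ  Ⰹ  Ⰺ  Ⰻ  Ⰼ  Ⰽ  Ⰾ  Ⰿ
U+2C1x    Ⱀ  Ⱁ  Ⱂ  Ⱃ  Ⱄ  Ⱅ  Ⱆ  Ⱇ  Ⱈ  Ⱉ  Ⱊ  Ⱋ  Ⱌ  Ⱍ  Ⱎ  Ⱏ
U+2C2x    Ⱐ  Ⱑ  Ⱒ  Ⱓ  Ⱔ  Ⱕ  Ⱖ  Ⱗ  Ⱘ  Ⱙ  Ⱚ  Ⱛ  Ⱜ  Ⱝ  Ⱞ
U+2C3x    ⰰ  ⰱ  ⰲ  ⰳ  ⰴ  ⰵ  ⰶ  ⰷ  ⰸ  ⰹ  ⰺ  ⰻ  ⰼ  ⰽ  ⰾ  ⰿ
U+2C4x    ⱀ  ⱁ  ⱂ  ⱃ  ⱄ  ⱅ  ⱆ  ⱇ  ⱈ  ⱉ  ⱊ  ⱋ  ⱌ  ⱍ  ⱎ  ⱏ
U+2C5x    ⱐ  ⱑ  ⱒ  ⱓ  ⱔ  ⱕ  ⱖ  ⱗ  ⱘ  ⱙ  ⱚ  ⱛ  ⱜ  ⱝ  ⱞ
Notes
1. As of Unicode version 12.0
2. Empty cells (U+2C2F and U+2C5F) indicate non-assigned code points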
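The layout of the chart can be reproduced with a short Python sketch. It assumes the assigned ranges listed for version 4.1 in this article's History section (U+2C00..2C2E and U+2C30..2C5E) and hard-codes them rather than querying the runtime's character database, which may reflect a different Unicode version. All names below are hypothetical.

```python
# Assigned ranges (inclusive) as listed for Unicode 4.1 in this article.
# This is an assumption taken from the text, not read from a live database.
ASSIGNED_RANGES = [(0x2C00, 0x2C2E), (0x2C30, 0x2C5E)]
BLOCK_START, BLOCK_END = 0x2C00, 0x2C5F


def is_assigned(cp: int) -> bool:
    """True if cp falls inside one of the assigned ranges above."""
    return any(lo <= cp <= hi for lo, hi in ASSIGNED_RANGES)


def chart_rows():
    """Return (label, cells) pairs, one per 16-code-point chart row.

    Each cell is the character itself, or None for a non-assigned code point.
    """
    rows = []
    for row_start in range(BLOCK_START, BLOCK_END + 1, 16):
        label = f"U+{row_start >> 4:03X}x"  # e.g. 0x2C30 -> "U+2C3x"
        cells = [
            chr(cp) if is_assigned(cp) else None
            for cp in range(row_start, row_start + 16)
        ]
        rows.append((label, cells))
    return rows


# Summary figures derived from the ranges.
assigned_count = sum(is_assigned(cp) for cp in range(BLOCK_START, BLOCK_END + 1))
unassigned = [
    f"U+{cp:04X}" for cp in range(BLOCK_START, BLOCK_END + 1) if not is_assigned(cp)
]
```

Under these assumptions the sketch yields six rows, and the derived counts agree with the 94 assigned and 2 reserved code points stated for the block.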

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Glagolitic block:

Version: 4.1
Final code points [a]: U+2C00..2C2E, 2C30..2C5E
Count: 94

Documents (L2 ID, then WG2 ID where one exists, then the document):
L2/98-190: Glagolitic coded character set for bibliographic information interchange, 1996-12-15
L2/98-023, N1659: Everson, Michael (1997-12-08), Proposal to encode Glagolitic in ISO/IEC 10646
L2/98-070: Aliprand, Joan; Winkler, Arnold, "3.A.2. item d. Glagolitic", Minutes of the joint UTC and L2 meeting from the meeting in Cupertino, February 25-27, 1998
L2/98-286, N1703: Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.9.3", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
L2/99-012, N1931: Everson, Michael (1998-11-24), Revised proposal for encoding the Glagolitic script in the UCS
L2/99-054R: Aliprand, Joan (1999-06-21), "Glagolitic Script", Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999
L2/02-190: Wissink, Cathy (2002-04-22), Glagolitic Character Set
L2/02-448, N2555: Everson, Michael; Cleminson, Ralph (2002-12-04), Revised proposal for encoding the Glagolitic script in the UCS
L2/03-282R, N2610R: Everson, Michael; Cleminson, Ralph (2003-09-04), Final proposal for encoding the Glagolitic script in the UCS
L2/04-052: Cleminson, Ralph (2004-01-16), Letter from Ralph Cleminson about Glagolitic encoding
L2/04-051: Anderson, Deborah (2004-01-29), Comments on 2610R Final Glagolitic proposal

  a. Proposed code points and character names may differ from final code points and names

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.