Glagolitic Supplement

Last updated
Glagolitic Supplement
RangeU+1E000..U+1E02F
(48 code points)
Plane SMP
Scripts Glagolitic
Major alphabetsOld Slavonic
Assigned38 code points
Unused10 reserved code points
Unicode version history
9.0 (2016)38 (+38)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Glagolitic Supplement is a Unicode block containing supplementary characters used in the Glagolitic script. [3] It currently contains 38 combining letters.

Contents

Block

Glagolitic Supplement [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1E00x𞀀𞀁𞀂𞀃𞀄𞀅𞀆𞀈𞀉𞀊𞀋𞀌𞀍𞀎𞀏
U+1E01x𞀐𞀑𞀒𞀓𞀔𞀕𞀖𞀗𞀘𞀛𞀜𞀝𞀞𞀟
U+1E02x𞀠𞀡𞀣𞀤𞀦𞀧𞀨𞀩𞀪
Notes
1. ^ As of Unicode version 15.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Glagolitic Supplement block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
9.0U+1E000..1E006, 1E008..1E018, 1E01B..1E021, 1E023..1E024, 1E026..1E02A38 L2/14-129 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh (2014-05-02), "6", Recommendations to UTC #139 May 2014 on Script Proposals
L2/14-087R N4608 Andreev, Aleksandr; Miklas, Heinz; Shardt, Yuri (2014-07-08), Proposal to Encode Combining Glagolitic Letters in Unicode
L2/14-103 Cleminson, Ralph; Birnbaum, David (2014-04-27), Expert Feedback on L2/14-087 Proposal to Encode Additional Glagolitic Characters
L2/14-165 Cleminson, Ralph; Birnbaum, David (2014-07-21), Additional Expert Feedback on L2/14-087 Proposal to Encode Additional Glagolitic Characters
L2/14-177 Moore, Lisa (2014-10-17), "Glagolitic (C.7)", UTC #140 Minutes
L2/14-259 Whistler, Ken; Anderson, Deborah (2014-10-21), WG2 Consent Docket
L2/14-250 Moore, Lisa (2014-11-10), "Consensus 141-C15, Action item 141-A41", UTC #141 Minutes
L2/15-017 Moore, Lisa (2015-02-12), "Consensus 142-C17", UTC #142 Minutes, Change the names of U+1E001 COMBINING GLAGOLITIC LETTER BUKI and U+1E00E COMBINING GLAGOLITIC LETTER LJUDIE to U+1E001 COMBINING GLAGOLITIC LETTER BUKY and U+1E00E COMBINING GLAGOLITIC LETTER LJUDIJE.
L2/16-052 N4603 (pdf, doc)Umamaheswaran, V. S. (2015-09-01), "M63.09", Unconfirmed minutes of WG 2 meeting 63
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

Combining Diacritical Marks Supplement is a Unicode block containing combining characters for the Uralic Phonetic Alphabet, Medievalist notations, and German dialectology (Teuthonista). It is an extension of the diacritic characters found in the Combining Diacritical Marks block.

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 and also legacy characters from the ISO 6937 standard.

IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.

Cyrillic Supplement is a Unicode block containing Cyrillic letters for writing several minority languages, including Abkhaz, Kurdish, Komi, Mordvin, Aleut, Azerbaijani, and Jakovlev's Chuvash orthography.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.

Greek Extended is a Unicode block containing the accented vowels necessary for writing polytonic Greek. The regular, unaccented Greek characters as well as the characters with tonos and diaeresis can be found in the Greek and Coptic block. Greek Extended was encoded in version 1.1 of the Unicode Standard. As an alternative to Greek Extended, combining characters can be used to represent the tones and breath marks of polytonic Greek.

Superscripts and Subscripts is a Unicode block containing superscript and subscript numerals, mathematical operators, and letters used in mathematics and phonetics. The use of subscripts and superscripts in Unicode allows any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX. Other superscript letters can be found in the Spacing Modifier Letters, Phonetic Extensions and Phonetic Extensions Supplement blocks, while the superscript 1, 2, and 3, inherited from ISO 8859-1, were included in the Latin-1 Supplement block.

Georgian is a Unicode block containing the Mkhedruli and Asomtavruli Georgian characters used to write Modern Georgian, Svan, and Mingrelian languages. Another lower case, Nuskhuri, is encoded in a separate Georgian Supplement block, which is used with the Asomtavruli to write the ecclesiastical Khutsuri Georgian script.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.

Variation Selectors Supplement is a Unicode block containing additional Variation Selectors beyond those found in the Variation Selectors block.

Glagolitic is a Unicode block containing the characters invented by Saint Cyril for translating scripture into Slavonic. Glagolitic script is the precursor of Cyrillic.

Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors used to specify a glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS2, VS3, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Mongolian Supplement is a Unicode block containing additional Mongolian letters not found in Mongolian block in BMP. It currently comprises nine variant forms of birga marks used to mark the start of text.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. "N4608: Proposal to Encode Combining Glagolitic Letters in Unicode" (PDF). 2014-08-20. Retrieved 2016-06-23.