Georgian (Unicode block)

Last updated
Georgian
RangeU+10A0..U+10FF
(96 code points)
Plane BMP
Scripts Georgian (87 char.)
Common (1 char.)
Major alphabets Mkhedruli
Asomtavruli
Assigned88 code points
Unused8 reserved code points
Unicode version history
1.0.0 (1991)78 (+78)
3.2 (2002)80 (+2)
4.1 (2005)83 (+3)
6.1 (2012)88 (+5)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Georgian is a Unicode block containing the Mkhedruli and Asomtavruli Georgian characters used to write Modern Georgian, Svan, and Mingrelian languages. Another lower case, Nuskhuri, is encoded in a separate Georgian Supplement block, which is used with the Asomtavruli to write the ecclesiastical Khutsuri Georgian script.

Contents

Asomtavruli capitals, known as Mtavruli, are included in a separate Georgian Extended block, but the capital letters are not used for title casing. [3]

Block

Georgian [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1. ^ As of Unicode version 15.1
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Georgian block:

Version Final code points [lower-alpha 1] Count UTC  ID L2  ID WG2  IDDocument
1.0.0U+10A0..10C5, 10D0..10F6, 10FB78(to be determined)
UTC/1999-017 Davis, Mark (1999-06-02), Data cross-checks (for Agenda)
L2/99-176R Moore, Lisa (1999-11-04), "Data Cross-Checks", Minutes from the joint UTC/L2 meeting in Seattle, June 8-10, 1999
L2/01-040 Becker, Joe (2001-01-26), Unicode 3.1 Text: Encoding Model for Georgian Script
3.2U+10F7..10F82 L2/00-404 Tarkhan-Mouravi, David (2000-10-30), Proposal for Asomtavruli, Nuskhuri, and Mkhedruli Georgian
L2/01-006 Moore, Lisa (2000-12-22), Reply to Georgian State Department of Information Technology
L2/01-046 Tarkhan-Mouravi, David (2001-01-22), Letter from the Georgian State department for Information Technology
L2/01-047 Megrelian and Svan Examples, 2001-01-22
L2/01-048 Proposal summary form for addition of 3 letters to the Georgian Mkhedruli block, 2001-01-22
L2/01-059 Everson, Michael (2001-01-24), Summary and proposed actions regarding the Georgian documents
L2/01-145 N2346R Moore, Lisa (2001-04-03), Proposal to encode 2 Georgian characters in the UCS
L2/01-166 Moore, Lisa (2001-04-16), Reply to Georgian State Department of Information Technology
L2/01-012R Moore, Lisa (2001-05-21), "Georgian", Minutes UTC #86 in Mountain View, Jan 2001
L2/01-227 Whistler, Ken (2001-05-22), "ITEM 1", WG2 Consent Docket for UTC #87
L2/01-184R Moore, Lisa (2001-06-18), "Motion 87-M16, ITEM 1", Minutes from the UTC/L2 meeting
L2/01-344 N2353 (pdf, doc)Umamaheswaran, V. S. (2001-09-09), "7.13", Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001
4.1U+10F9..10FA, 10FC3 L2/99-082 N1962 Everson, Michael (1999-02-26), Optimizing Georgian representation in the BMP of the UCS
L2/00-115R2 Moore, Lisa (2000-08-08), Minutes Of UTC Meeting #83
L2/03-230R2 N2608R2 Everson, Michael (2003-09-04), Proposal to add Georgian and other characters to the BMP of the UCS
6.1U+10C7, 10CD, 10FD..10FF5 L2/10-072 N3775 Everson, Michael (2010-03-09), Proposal for encoding Georgian and Nuskhuri letters for Ossetian and Abkhaz
L2/10-108 Moore, Lisa (2010-05-19), "Consensus 123-C7", UTC #123 / L2 #220 Minutes
N3803 (pdf, doc)"M56.08i", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

<span class="mw-page-title-main">Unicode</span> Character encoding standard

Unicode, formally The Unicode Standard, is a text encoding standard maintained by the Unicode Consortium designed to support the use of text written in all of the world's major writing systems. Version 15.1 of the standard defines 149813 characters and 161 scripts used in various ordinary, literary, academic, and technical contexts.

<span class="mw-page-title-main">Georgian scripts</span> Three related alphabets used to write Georgian

The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Although the systems differ in appearance, their letters share the same names and alphabetical order and are written horizontally from left to right. Of the three scripts, Mkhedruli, once the civilian royal script of the Kingdom of Georgia and mostly used for the royal charters, is now the standard script for modern Georgian and its related Kartvelian languages, whereas Asomtavruli and Nuskhuri are used only by the Georgian Orthodox Church, in ceremonial religious texts and iconography.

Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context. Its block name in Unicode 1.0 was Generic Diacritical Marks.

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

The Unicode Standard assigns various properties to each Unicode character and code point.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Cyrillic is a Unicode block containing the characters used to write the most widely used languages with a Cyrillic orthography. The core of the block is based on the ISO 8859-5 standard, with additions for minority languages and historic orthographies.

<span class="mw-page-title-main">Greek and Coptic</span> Unicode character block

Greek and Coptic is the Unicode block for representing modern (monotonic) Greek. It was originally also used for writing Coptic, using the similar Greek letters in addition to the uniquely Coptic additions. Beginning with version 4.1 of the Unicode Standard, a separate Coptic block has been included in Unicode, allowing for mixed Greek/Coptic text that is stylistically contrastive, as is convention in scholarly works. Writing polytonic Greek requires the use of combining characters or the precomposed vowel + tone characters in the Greek Extended character block.

Coptic is a Unicode block used with the Greek and Coptic block to write the Coptic language. Prior to version 4.1 of the Unicode Standard, the "Greek and Coptic" block was used exclusively to write Coptic text, but Greek and Coptic letter forms are contrastive in many scholarly works, necessitating their disunification. Any specifically Coptic letters in the Greek and Coptic block are not reproduced in the Coptic Unicode block.

Georgian Supplement is a Unicode block containing characters for the ecclesiastical form of the Georgian script, Nuskhuri. To write the full ecclesiastical Khutsuri orthography, the Asomtavruli capitals encoded in the Georgian block.

Myanmar is a Unicode block containing characters for the Burmese, Mon, Shan, Palaung, and the Karen languages of Myanmar, as well as the Aiton and Phake languages of Northeast India. It is also used to write Pali and Sanskrit in Myanmar.

Deseret is a Unicode block containing characters in the Deseret alphabet, which were invented by the Church of Jesus Christ of Latter-day Saints to write English. The Deseret block was derived from an earlier private use encoding in the ConScript Unicode Registry, like the Shavian and Phaistos Disc encodings. The block was added in version 3.1 of the Unicode Standard; the letters Oi and Ew, both uppercase and lowercase, were added in version 4.0.

Lisu is a Unicode block containing characters of the Fraser alphabet, which is used to write the Lisu language. This alphabet consists of glyphs resembling capital letters in the basic Latin alphabet in their standard form and turned.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Ahom is a Unicode block containing characters used for writing the Ahom alphabet, which was used to write the Ahom language spoken by the Ahom people in Assam between the 13th and the 18th centuries.

Soyombo is a Unicode block containing characters from the Soyombo alphabet, which is an abugida developed by the monk and scholar Zanabazar (1635–1723) in 1686 to write Mongolian. It can also be used to write Tibetan and Sanskrit. In addition, this block includes the Soyombo symbol on the flag of Mongolia.

Mac OS Georgian is a character encoding for Mac OS created by Michael Everson for use in his fonts. It is not an official Mac OS character set.

Georgian Extended is a Unicode block containing Georgian Mtavruli letters that function as uppercase versions of their Mkhedruli counterparts in the Georgian block. Unlike all other casing scripts in Unicode, there is no title casing between Mkhedruli and Mtavruli letters, because Mtavruli is typically used only in all-caps text, although there have been some historical attempts at capitalization.

Old Sogdian is a Unicode block containing characters for a group of related, non-cursive Sogdian writing systems used to write historic Sogdian in the 3rd to 5th centuries CE.

Sogdian is a Unicode block containing characters used to write the Sogdian language from the 7th to 14th centuries CE.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. "Unicode® 11.0.0". Unicode Consortium. June 5, 2018. Retrieved 8 June 2018.