Romanization of Georgian

Last updated
Mtskheta and Tbilisi romanized Road Sign in Latin and Georgian.jpg
Mtskheta and Tbilisi romanized

Romanization of Georgian is the process of transliterating the Georgian language from the Georgian script into the Latin script.

Georgian language official language of Georgia

Georgian is a Kartvelian language spoken by Georgians. It is the official language of Georgia.

Latin script writing system used to write most Western and Central European languages

Latin or Roman script is a set of graphic signs (script) based on the letters of the classical Latin alphabet. This is derived from a form of the Cumaean Greek version of the Greek alphabet used by the Etruscans.


Georgian national system of romanization

This system, adopted in February 2002 by the State Department of Geodesy and Cartography of Georgia and the Institute of Linguistics, Georgian National Academy of Sciences, establishes a transliteration system of the Georgian letters into Latin letters. [1] The system was already in use, since 1998, on driving licenses. It is also used by BGN and PCGN since 2009.

Georgian National Academy of Sciences

The Georgian National Academy of Sciences (GNAS) is a main learned society of the Georgia. It was named Georgian SSR Academy of Sciences until November 1990. The Academy coordinates scientific research in Georgia and develops relationship with the academies and scientific centers of foreign countries.

Transliteration is a type of conversion of a text from one script to another that involves swapping letters in predictable ways.

Unofficial system of romanization

Despite its popularity this system sometimes leads to ambiguity. The system is mostly used in social networks, forums, chat rooms, etc. The system is greatly influenced by the common case-sensitive Georgian keyboard layout that ties each key to each letter in the alphabet (seven of them: T, W, R, S, J, Z, C with the help of the shift key to make another letter).

Internet forum online discussion site

An Internet forum, or message board, is an online discussion site where people can hold conversations in the form of posted messages. They differ from chat rooms in that messages are often longer than one line of text, and are at least temporarily archived. Also, depending on the access level of a user or the forum set-up, a posted message might need to be approved by a moderator before it becomes publicly visible.

The Georgian keyboard includes several keyboard layouts for Georgian script.

ISO standard

ISO 9984:1996, "Transliteration of Georgian characters into Latin characters", was last reviewed and confirmed in 2010. [2] The guiding principles in the standard are:

Digraph (orthography) pair of characters used to write one phoneme

A digraph or digram is a pair of characters used in the orthography of a language to write either a single phoneme, or a sequence of phonemes that does not correspond to the normal values of the two characters combined.

ISO 5426 is character set developed by ISO. It was first published in 1983.

In phonetics, aspiration is the strong burst of breath that accompanies either the release or, in the case of preaspiration, the closure of some obstruents. In English, aspirated consonants are allophones in complementary distribution with their unaspirated counterparts, but in some other languages, notably most Indian and East Asian languages, the difference is contrastive, while in Arabic and Persian, all stops are aspirated.

Transliteration table

Georgian letter IPA National system
ISO 9984
Unofficial system
[lower-alpha 1] /eɪ/eyēē
/tʰ/tt't'T [lower-alpha 2] or t
[lower-alpha 1] /i/, /j/jyy
/ʒ/zhzhžžJ, [lower-alpha 2] zh or j
[lower-alpha 1] /w/ww
/pʰ/pp'p'p or f
/kʰ/kk'k'q or k
/ʁ/ghghġg, gh or R [lower-alpha 2]
/qʼ/qqqy [lower-alpha 3]
/ʃ/shshššsh or S [lower-alpha 2]
/t͡ʃ(ʰ)/chch'č̕č'ch or C [lower-alpha 2]
/t͡s(ʰ)/tsts'c'c or ts
/d͡z/dzdzjżdz or Z [lower-alpha 2]
/t͡sʼ/tsʼtsccw, c or ts
/t͡ʃʼ/chʼchččW, [lower-alpha 2] ch or tch
/χ/khkhxxx or kh (rarely)
[lower-alpha 1] /q/, /qʰ/q'
[lower-alpha 1] /oː/ōō


  1. 1 2 3 4 5 Archaic letters.
  2. 1 2 3 4 5 6 7 These are influenced by aforementioned layout, and are preferred to avoid ambiguity, as an expressions: t, j, g, ch can mean two letters.
  3. Initially, the use of y letter for ყ is most probably due to their resemblance to each other.

Related Research Articles

A diacritic – also diacritical mark, diacritical point, diacritical sign, or accent – is a glyph added to a letter, or basic glyph. The term derives from the Ancient Greek διακριτικός, from διακρίνω. Diacritic is primarily an adjective, though sometimes used as a noun, whereas diacritical is only ever an adjective. Some diacritical marks, such as the acute ( ´ ) and grave ( ` ), are often called accents. Diacritical marks may appear above or below a letter, or in some other position such as within the letter or between two letters.

Esperanto is written in a Latin-script alphabet of twenty-eight letters, with upper and lower case. This is supplemented by punctuation marks and by various logograms, such as the numerals 0–9, currency signs such as $, and mathematical symbols.

A caron, háček or haček also known as a hachek, wedge, check, inverted circumflex, inverted hat, is a diacritic commonly placed over certain letters in the orthography of some Baltic, Slavic, Finnic, Samic, Berber, and other languages to indicate a change in the related letter's pronunciation.

Š letter of the Latin alphabet

The grapheme Š, š is used in various contexts representing the sh sound usually denoting the voiceless postalveolar fricative or similar voiceless retroflex fricative /ʂ/. In the International Phonetic Alphabet this sound is denoted with ʃ or ʂ, but the lowercase š is used in the Americanist phonetic notation, as well as in the Uralic Phonetic Alphabet. It represents the same sound as the Turkic letter Ş and the Romanian letter Ș (S-comma).

There are several methods of transliteration from Devanāgarī to the Roman script which share similarities, although no single system of transliteration has emerged as the standard. This process has been termed Romanagari, a portmanteau of the words Roman and Devanagari.. The term may also be used for other languages that use Devanagari as the standard writing script, such as Marathi, Nepali or Sanskrit.

The Romanian alphabet is a variant of the Latin alphabet used by the Romanian language. It is a modification of the classical Latin alphabet and consists of 31 letters, five of which have been modified from their Latin originals for the phonetic requirements of the language:

The ISO international standard ISO 9 establishes a system for the transliteration into Latin characters of Cyrillic characters constituting the alphabets of many Slavic and non-Slavic languages.

The International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme that allows the lossless romanization of Indic scripts as employed by Sanskrit and related Indic languages. It is based on a scheme that emerged during the nineteenth century from suggestions by Charles Trevelyan, William Jones, Monier Monier-Williams and other scholars, and formalised by the Transliteration Committee of the Geneva Oriental Congress, in September 1894. IAST makes it possible for the reader to read the Indic text unambiguously, exactly as if it were in the original Indic script. It is this faithfulness to the original scripts that accounts for its continuing popularity amongst scholars.

The Greek alphabet has been used to write the Greek language since the late ninth or early eighth century BC. It is derived from the earlier Phoenician alphabet, and was the first alphabetic script to have distinct letters for vowels as well as consonants. In Archaic and early Classical times, the Greek alphabet existed in many different local variants, but, by the end of the fourth century BC, the Eucleidean alphabet, with twenty-four letters, ordered from alpha to omega, had become standard and it is this version that is still used to write Greek today. These twenty-four letters are: Α α, Β β, Γ γ, Δ δ, Ε ε, Ζ ζ, Η η, Θ θ, Ι ι, Κ κ, Λ λ, Μ μ, Ν ν, Ξ ξ, Ο ο, Π π, Ρ ρ, Σ σ/ς, Τ τ, Υ υ, Φ φ, Χ χ, Ψ ψ, and Ω ω.

The Harvard-Kyoto Convention is a system for transliterating Sanskrit and other languages that use the Devanāgarī script into ASCII. It is predominantly used informally in e-mail, and for electronic texts.

The romanization or Latinization of Ukrainian is the representation of the Ukrainian language using Latin letters. Ukrainian is natively written in its own Ukrainian alphabet, which is based on the Cyrillic script. Romanization may be employed to represent Ukrainian text or pronunciation for non-Ukrainian readers, on computer systems that cannot reproduce Cyrillic characters, or for typists who are not familiar with the Ukrainian keyboard layout. Methods of romanization include transliteration, representing written text, and transcription, representing the spoken word.

Ayin is the sixteenth letter of the Semitic abjads, including Phoenician ʿayin, Hebrew ʿayinע‬, Aramaic ʿē, Syriac ʿē ܥ, and Arabic ʿayn ع‎.

The romanization of Arabic writes written and spoken Arabic in the Latin script in one of various systematic ways. Romanized Arabic is used for a number of different purposes, among them transcription of names and titles, cataloging Arabic language works, language education when used in lieu of or alongside the Arabic script, and representation of the language in scientific publications by linguists. These formal systems, which often make use of diacritics and non-standard Latin characters and are used in academic settings or for the benefit of non-speakers, contrast with informal means of written communication used by speakers such as the Latin-based Arabic chat alphabet.

Scientific transliteration, variously called academic, linguistic, international, or scholarly transliteration, is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). This system is most often seen in linguistics publications on Slavic languages.

J̌ letter of the Latin alphabet

is a letter of the Latin alphabet, derived from J with the addition of a háček. It is used in some phonetic transcription schemes, e.g. ISO 9, to represent the sound. It is also used in the Latin scripts or in the romanization of various Iranian and Pamir languages, Armenian, Georgian, Berber/Tuareg, and Classical Mongolian. The letter was invented by Lepsius in his Standard Alphabet on the model of š and ž to avoid the confusion caused by the ambiguous pronunciation of the letter j in European languages.

Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived of an existing script, usually Latin, Greek or Cyrillic. In Unicode there is no "IPA script". Apart from IPA, extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.

There are various systems of romanization of the Armenian alphabet.

The Armenian alphabet is an alphabetic writing system used to write Armenian. It was developed around 405 AD by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. The system originally had 36 letters; eventually, three more were adopted.

Runic is a Unicode block containing runic characters. It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014). The original encoding of runes in UCS was based on the recommendations of the "ISO Runes Project" submitted in 1997.