The romanization of Ukrainian, or Latinization of Ukrainian, is the representation of the Ukrainian language in Latin letters. Ukrainian is natively written in its own Ukrainian alphabet, which is based on the Cyrillic script. Romanization may be employed to represent Ukrainian text or pronunciation for non-Ukrainian readers, on computer systems that cannot reproduce Cyrillic characters, or for typists who are not familiar with the Ukrainian keyboard layout. Methods of romanization include transliteration (representing written text) and transcription (representing the spoken word).
In contrast to romanization, there have been several historical proposals for a native Ukrainian Latin alphabet, usually based on those used by West Slavic languages, but none have caught on.
Transliteration is the letter-for-letter representation of text using another writing system. Rudnyckyj classified transliteration systems into scientific transliteration, used in academic and especially linguistic works, and practical systems, used in administration, journalism, in the postal system, in schools, etc.Scientific transliteration, also called the scholarly system, is used internationally, with very little variation, while the various practical methods of transliteration are adapted to the orthographical conventions of other languages, like English, French, German, etc.
Depending on the purpose of the transliteration it may be necessary to be able to reconstruct the original text, or it may be preferable to have a transliteration which sounds like the original language when read aloud.
Scientific transliteration , also called the academic, linguistic, international, or scholarly system, is most often seen in linguistic publications on Slavic languages. It is purely phonemic, meaning each character represents one meaningful unit of sound, and is based on the Croatian Latin alphabet.Different variations are appropriate to represent the phonology of historical Old Ukrainian (mid 11th–14th centuries) and Middle Ukrainian (15th–18th centuries).
A variation was codified in the 1898 Prussian Instructions for libraries, or Preußische Instruktionen (PI), and widely used in bibliographic cataloguing in Central Europe and Scandinavia. With further modifications it was published by the International Organization for Standardization as recommendation ISO/R 9 in 1954, revised in 1968, and again as an international standard in 1986 and 1995.
Representing all of the necessary diacritics on computers requires Unicode, Latin-2, Latin-4, or Latin-7 encoding. Other Slavic based romanizations occasionally seen are those based on the Slovak alphabet or the Polish alphabet, which include symbols for palatalized consonants.
The ALA-LC Romanization Tables were first discussed by the American Library Association in 1885,and published in 1904 and 1908, including rules for romanizing Church Slavic, the pre-reform Russian alphabet, and Serbo-Croatian. Revised tables including Ukrainian were published in 1941, and remain in use virtually unchanged according to the latest 2011 release. This system is used to represent bibliographic information by US and Canadian libraries, by the British Library since 1975, and in North American publications.
In addition to bibliographic cataloguing, simplified versions of the Library of Congress system are widely used for romanization in the text of academic and general publications. For notes or bibliographical references, some publications use a version without ligatures, which offers sufficient precision but simplifies the typesetting burden and easing readability. For specialist audiences or those familiar with Slavic languages, a version without ligatures and diacritical marks is sometimes used.For broader audiences, a "modified Library of Congress system" is employed for personal, organizational, and place names, omitting all ligatures and diacritics, ignoring the soft sign ь (ʹ), with initial Є- (I͡E-), Й- (Ĭ-), Ю- ( I͡U-), and Я- (I͡A-) represented by Ye-, Y-, Yu-, and Ya-, surnames' terminal -ий (-yĭ) and -ій (-iĭ) endings simplified to -y, and sometimes with common first names anglicized, for example, Олександр (Oleksandr) written as Alexander.
|Original Cyrillic text||–||Ярослав Рудницький|
|Strict ALA-LC||I͡Aroslav Rudnyt͡sʹkyĭ|
|Footnote or bibliography||Without ligatures||Iaroslav Rudnytsʹkyĭ|
|Academic text||Without ligatures or diacritics||Iaroslav Rudnytskyi|
|Names in general text||Modified Library of Congress||Yaroslav Rudnytsky|
Similar principles were systematically described for Russian by J. Thomas Shaw in 1969,and since widely adopted. Their application for Ukrainian and multilingual text were described in the 1984 English translation of Kubiiovych's Encyclopedia of Ukraine and in the 1997 translation of Hrushevskyi's History of Ukraine-Rusʹ , and other sources have referred to these, for example, historian Serhii Plokhy in several works. However, the details of usage vary, for example, the authors of the Historical Dictionary of Ukraine render the soft sign ь with an i, "thus Khvyliovy, not Khvylovy, as in the Encyclopedia of Ukraine".
Requires Unicode for connecting diacritics, but only plain ASCII characters for a simplified version.
British Standard 2979:1958 "Transliteration of Cyrillic and Greek Characters",from BSI, is used by the Oxford University Press. A variation is used by the British Museum and British Library, but since 1975 their new acquisitions have been catalogued using Library of Congress transliteration.
In addition to the "British" system, the standard also includes tables for the "International" system for Cyrillic, corresponding to ISO/R 9:1968 (and ISO's recommendation reciprocally has an alternate system corresponding to BSI's).It also includes tables for romanization of Greek.
BGN/PCGN romanization is a series of standards approved by the United States Board on Geographic Names and Permanent Committee on Geographical Names for British Official Use. Pronunciation is intuitive for English-speakers. For Ukrainian, the former BGN/PCGN system was adopted in 1965, but superseded there by the Ukrainian National System in 2019.A modified version is also mentioned in the Oxford Style Manual.
Requires only ASCII characters if optional separators are not used.
The Soviet Union's GOST, COMECON's SEV, and Ukraine's Derzhstandart are government standards bodies of the former Eurasian communist countries. They published a series of romanization systems for Ukrainian, which were replaced by ISO 9:1995. For details, see GOST 16876-71.
ISO 9 is a series of systems from the International Organization for Standardization. The ISO published editions of its "international system" for romanization of Cyrillic as recommendations (ISO/R 9) in 1954 and 1968, and standards (ISO 9) in 1986 and 1995. This was originally derived from scientific transliteration in 1954, and is meant to be usable by readers of most European languages. The 1968 edition also included an alternative system identical to the British Standard.
The 1995 edition supports most national Cyrillic alphabets in a single transliteration table. It is a pure transliteration system, with each Cyrillic character represented by exactly one unique Latin character, making it reliably reversible, but sacrificing readability and adaptation to individual languages. It considers only graphemes and disregards phonemic differences. So, for example, г (Ukrainian He or Russian Ge) is always represented by the transliteration g; ґ (Ukrainian letter Ge) is represented by g̀.
Representing all of the necessary diacritics on computers requires Unicode, and a few characters are rarely present in computer fonts, for example g-grave: g̀.
This is the official system of Ukraine, also employed by the United Nations and many countries' foreign services. It is currently widely used to represent Ukrainian geographic names, which were almost exclusively romanized from Russian before Ukraine's independence in 1991, and for personal names in passports. It is based on English orthography, and requires only ASCII characters with no diacritics.
Its first version was codified in Decision No. 9 of the Ukrainian Committee on Issues of Legal Terminology on April 19, 1996,stating that the system is binding for the transliteration of Ukrainian names in English in legislative and official acts.
A new official system was introduced for transliteration of Ukrainian personal names in Ukrainian passports in 2007.
An updated 2010 version became the system is used for transliterating all proper names and was approved as Resolution 55 of the Cabinet of Ministers of Ukraine, January 27, 2010.This modified earlier laws and brought together a unified system for official documents, publication of cartographic works, signs and indicators of inhabited localities, streets, stops, subway stations, etc.
It has been adopted internationally. The 27th session of the UN Group of Experts on Geographical Names (UNGEGN) held in New York 30 July and 10 August 2012 after a report by the State Agency of Land Resources of Ukraine (now known as Derzhheokadastr: Ukraine State Service of Geodesy, Cartography and Cadastre) expertsapproved the Ukrainian system of romanization. The BGN/PCGN jointly adopted the system in 2020.
Official geographic names are romanized directly from the original Ukrainian and not translated. For example, Kyivska oblast not Kyiv Oblast , Pivnichnokrymskyi kanal not North Crimean Canal .
Romanization intended for readers of other languages than English is usually transcribed phonetically into the familiar orthography. For example, y, kh, ch, sh, shch for anglophones may be transcribed j, ch, tsch, sch, schtsch for German readers (for letters й, х, ч, ш, щ), or it may be rendered in Latin letters according to the normal orthography of another Slavic language, such as Polish or Croatian (such as the established system of scientific transliteration, described above).
Czech and Slovak standard transliteration uses letters with diacritics (ž, š, č, ď, ť, ň, ě) and letters i, y, j, h, ch, c in the local meaning. Diphthong letters are transcribed as two letters (ja, je, ji, ju, šč).Czech transliteration was used, for example, on hiking signs in Transcarpathia, which was established according to the methodology of the Czech Tourists Club – the Ukrainian markers replaced that later with the English transcription. However, the fact that Ukraine itself has started to use English transliteration on its documents and boards, also influences the practice in Czech and Slovak, which is also penetrated by English transliteration of Ukrainian (similarly as of Russian).
Users of public-access computers or mobile text messaging services sometimes improvise informal romanization due to limitations in keyboard or character set. These may include both sound-alike and look-alike letter substitutions. Example: YKPAIHCbKA ABTOPKA for "УКРАЇНСЬКА АВТОРКА". See also Volapuk encoding.
This system uses the available character set.
For telegraph transmission. Each separate Ukrainian letter had a 1:1 equivalence to a Latin letter. Latin Q, W, V, and X are equivalent to Ukrainian Я (or sometimes Щ), В, Ж, Ь. Other letters are transcribed phonetically. This equivalency is used in building the KOI8-U table.
Transcription is the representation of the spoken word. Phonological, or phonemic, transcription represents the phonemes, or meaningful sounds of a language, and is useful to describe the general pronunciation of a word. Phonetic transcription represents every single sound, or phone, and can be used to compare different dialects of a language. Both methods can use the same sets of symbols, but linguists usually denote phonemic transcriptions by enclosing them in slashes / ... /, while phonetic transcriptions are enclosed in square brackets [ ... ].
The International Phonetic Alphabet precisely represents pronunciation. It requires a special Unicode font.
In many contexts, it is common to use a modified system of transliteration that strives to be read and pronounced naturally by anglophones. Such transcriptions are also used for the surnames of people of Ukrainian ancestry in English-speaking countries (personal names have often been translated to equivalent or similar English names, e.g., "Alexander" for Oleksandr, "Terry" for Taras).
Typically such a modified transliteration is based on the ALA-LC, or Library of Congress (in North America), or, less commonly, the British Standard system. Such a simplified system usually omits diacritics and ligatures (tie-bars) from, e.g., i͡e, ï or ĭ, often simplifies -yĭ and -iĭ word endings to "-y", omits romanizing the Ukrainian soft sign (ь) and apostrophe ( ' ), and may substitute ya, ye, yu, yo for ia, ie, iu, io at the beginnings of words. It may also simplify doubled letters. Unlike in the English language where an apostrophe is punctuation, in the Ukrainian language it is a letter. Therefore sometimes Rus' is translated with an apostrophe, even when the apostrophe is dropped for most other names and words.
Conventional transliterations can reflect the history of a person or place. Many well-known spellings are based on transcriptions into another Latin alphabet, such as the German or Polish. Others are transcribed from equivalent names in other languages, for example Ukrainian Pavlo ("Paul") may be called by the Russian equivalent Pavel, Ukrainian Kyiv by the Russian equivalent Kiev.
The employment of romanization systems can become complex. For example, the English translation of Kubijovyč's Ukraine: A Concise Encyclopædia uses a modified Library of Congress (ALA-LC) system as outlined above for Ukrainian and Russian names—with the exceptions for endings or doubled consonants applying variously to personal and geographic names. For technical reasons, maps in the Encyclopedia follow different conventions. Names of persons are anglicized in the encyclopedia's text, but also presented in their original form in the index. Various geographic names are presented in their anglicized, Russian, or both Ukrainian and Polish forms, and appear in several forms in the index. Scientific transliteration is used in linguistics articles. The Encyclopedia's explanation of its transliteration and naming convention occupies 2-1/2 pages.
|Cyrillic||British||BGN/PCGN 1965||ALA-LC||LC without diacritics||modified LC||Ukrainian National||German dictionary||French dictionary||Swedish Language Council|
|Г г||h||h||h||h||h||h, gh||h||h||h|
|Є є||ye||ye||i͡e||ie||ie, ye-||ie, ye-||je||ie||je|
|Ї ї||yi||yi||ï||ï||i||i, yi-||ji||ï||ji, (formerly) ï|
|Й й||ĭ||y||ĭ||i||i, y-||i, y-||j||y||j|
|С с||s||s||s||s||s||s||s, ss||s||r|
|Ю ю||yu||yu||i͡u||iu||iu, yu-||iu, yu-||ju||iou||ju, iu , u|
|Я я||ya||ya||i͡a||ia||ia, ya||ia, ya-||ja||ia||ja, ia , a|
|Cyrillic||Scientific (OUk)||Scientific (MUk)||Scientific||Prussian Instr.||ISO 1954||ISO 1968 basic||ISO 1968 Ukr. var.||ISO 1995|
|Cyrillic||GOST 1971||GOST 1986||Derzhstandart 1995||National 1996||Passport 2004||Passport 2007||National 2010|
|В в||v||v||v||v||v, w||v||v|
|Г г||g||g||gh||h, gh||h, g||g||h, gh|
|Ґ ґ||g||g||g, h||g||g|
|Є є||je||je||je||ie, ye-||ie, ye-||ie||ie, ye-|
|Ж ж||zh||ž||zh||zh||zh, j||zh||zh|
|Ї ї||ji||i||ji||yi, i-||yi, i-||i||yi, i|
|Й й||j||j||j||i, y-||i, y-||i||i, y-|
|К к||k||k||k||k||k, c||k||k|
|Ю ю||ju||ju||ju||iu, yu-||iu, yu-||iu||iu, yu-|
|Я я||ja||ja||ja||ia, ya-||ia, ya-||ia||ia, ya-|
Literary, historical, and social sciences texts adhere to the Library of Congress conventions, without ligatures.... We preserve the spelling of ï. To indicate the soft sign we use a slanted prime ... We preserve the Ukrainian apostrophe as a single curly quotation mark.... Omit primes in place-names (except Rusʹ).
The Cyrillic script is a writing system used for various languages across Eurasia and is used as the national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Eastern Europe, the Caucasus, Central Asia, North Asia, and East Asia.
Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, for representing the spoken word, and combinations of both. Transcription methods can be subdivided into phonemic transcription, which records the phonemes or units of semantic meaning in speech, and more strict phonetic transcription, which records speech sounds with precision.
The romanization of the Russian language, aside from its primary use for including Russian names and words in text written in a Latin alphabet, is also essential for computer users to input Russian text who either do not have a keyboard or word processor set up for inputting Cyrillic, or else are not capable of typing rapidly using a native Russian keyboard layout (JCUKEN). In the latter case, they would type using a system of transliteration fitted for their keyboard layout, such as for English QWERTY keyboards, and then use an automated tool to convert the text into Cyrillic.
The Royal Thai General System of Transcription (RTGS) is the official system for rendering Thai words in the Latin alphabet. It was published by the Royal Institute of Thailand.
The Hebrew language uses the Hebrew alphabet with optional vowel diacritics. The romanization of Hebrew is the use of the Latin alphabet to transliterate Hebrew words.
Romanization of Greek is the transliteration (letter-mapping) or transcription (sound-mapping) of text from the Greek alphabet into the Latin alphabet.
Romanization of Bulgarian is the practice of transliteration of text in Bulgarian from its conventional Cyrillic orthography into the Latin alphabet. Romanization can be used for various purposes, such as rendering of proper names and place names in foreign-language contexts, or for informal writing of Bulgarian in environments where Cyrillic is not easily available. Official use of romanization by Bulgarian authorities is found, for instance, in identity documents and in road signage. Several different standards of transliteration exist, one of which was chosen and made mandatory for common use by the Bulgarian authorities in a law of 2009.
The romanization of Arabic is the systematic rendering of written and spoken Arabic in the Latin script. Romanized Arabic is used for various purposes, among them transcription of names and titles, cataloging Arabic language works, language education when used instead of or alongside the Arabic script, and representation of the language in scientific publications by linguists. These formal systems, which often make use of diacritics and non-standard Latin characters and are used in academic settings or for the benefit of non-speakers, contrast with informal means of written communication used by speakers such as the Latin-based Arabic chat alphabet.
The romanization of Khmer is a representation of the Khmer (Cambodian) language using letters of the Latin alphabet. This is most commonly done with Khmer proper nouns, such as names of people and geographical names, as in a gazetteer.
Scientific transliteration, variously called academic, linguistic, international, or scholarly transliteration, is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). This system is most often seen in linguistics publications on Slavic languages.
Romanization or Latinization of Belarusian is any system for transliterating written Belarusian from Cyrillic to the Latin alphabet.
BGN/PCGN romanization system for Russian is a method for romanization of Cyrillic Russian texts, that is, their transliteration into the Latin alphabet as used in the English language.
The American Library Association and Library of Congress Romanization Tables for Russian, or the Library of Congress system, are a set of rules for the romanization of Russian-language text from Cyrillic script to Latin script.
There are many systems for the romanization of the Thai language, i.e. representing the language in Latin script. These include systems of transliteration, and transcription. The most seen system in public space is Royal Thai General System of Transcription (RTGS)—the official scheme promulgated by the Royal Thai Institute. It is based on spoken Thai, but disregards tone, vowel length and a few minor sound distinctions.
The romanization of Macedonian is the transliteration of text in Macedonian from the Macedonian Cyrillic alphabet into the Latin alphabet. Romanization can be used for various purposes, such as rendering of proper names in foreign contexts, or for informal writing of Macedonian in environments where Cyrillic is not easily available. Official use of romanization by North Macedonia's authorities is found, for instance, on road signage and in passports. Several different codified standards of transliteration currently exist and there is widespread variability in practice.
The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Cumaean Greek version of the Greek alphabet used by the Etruscans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
Romanization of Persian or Latinization of Persian is the representation of the Persian language with the Latin script. Several different romanization schemes exist, each with its own set of rules driven by its own set of ideological goals.
There are various systems of romanization of the Armenian alphabet.
Yo, Jo, or Io is a letter of the Cyrillic script. In Unicode, the letter ⟨Ё⟩ is named CYRILLIC CAPITAL/SMALL LETTER IO.
Romanization of the Burmese alphabet is representation of the Burmese language or Burmese names in the Latin alphabet.