![]() | This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
Part of a series on |
Translation |
---|
![]() |
Types |
Theory |
Technologies |
Localization |
Institutional |
Related topics |
|
Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus trans- + liter- ) in predictable ways, such as Greek ⟨ α ⟩ → ⟨ a ⟩, Cyrillic ⟨ д ⟩ → ⟨ d ⟩, Greek ⟨ χ ⟩ → the digraph ⟨ ch ⟩, Armenian ⟨ ն ⟩ → ⟨ n ⟩ or Latin ⟨ æ ⟩ → ⟨ ae ⟩. [1]
For instance, for the Greek term ⟨Ελληνική Δημοκρατία⟩, which is usually translated as 'Hellenic Republic', the usual transliteration into the Latin script (romanization) is ⟨Hellēnikḗ Dēmokratía⟩; and the Russian term ⟨Российская Республика⟩, which is usually translated as 'Russian Republic', can be transliterated either as ⟨Rossiyskaya Respublika⟩ or alternatively as ⟨Rossijskaja Respublika⟩.
Transliteration is the process of representing or intending to represent a word, phrase, or text in a different script or writing system. Transliterations are designed to convey the pronunciation of the original word in a different script, allowing readers or speakers of that script to approximate the sounds and pronunciation of the original word. Transliterations do not change the pronunciation of the word. Thus, in the Greek above example, ⟨λλ⟩ is transliterated ⟨ll⟩ though it is pronounced exactly the same way as [l], or the Greek letters, ⟨λλ⟩. ⟨Δ⟩ is transliterated ⟨D⟩ though pronounced as [ð], and ⟨η⟩ is transliterated ⟨ī⟩, though it is pronounced [i] (exactly like ⟨ι⟩) and is not long.
Transcription, conversely, seeks to capture sound, but phonetically approximate it into the new script; ⟨Ελληνική Δημοκρατία⟩ corresponds to [eliniˈciðimokraˈtia] in the International Phonetic Alphabet. While differentiation is lost in the case of [i], note the allophonic realization of /k/ as a palatalized [c] when preceding front vowels /e/ and /i/.
Angle brackets ⟨ ⟩ may be used to set off transliteration, as opposed to slashes / / for phonemic transcription and square brackets for phonetic transcription. Angle brackets may also be used to set off characters in the original script. Conventions and author preferences vary.
Systematic transliteration is a mapping from one system of writing into another, typically grapheme to grapheme. Most transliteration systems are one-to-one, so a reader who knows the system can reconstruct the original spelling.
Transliteration, which adapts written form without altering the pronunciation when spoken out, is opposed to letter transcription, which is a letter by letter conversion of one language into another writing system. Still, most systems of transliteration map the letters of the source script to letters pronounced similarly in the target script, for some specific pair of source and target language. Transliteration may be very close to letter-by-letter transcription if the relations between letters and sounds are similar in both languages.
For many script pairs, there are one or more standard transliteration systems. However, unsystematic transliteration is common, as for Burmese, for instance.
In Modern Greek, the letters ⟨η, ι, υ⟩ and the letter combinations ⟨ει, oι, υι⟩ are pronounced [i] (except when pronounced as semivowels), and a modern transcription renders them as ⟨i⟩. However, a transliteration distinguishes them; for example, by transliterating them as ⟨ē, i, y⟩ and ⟨ei, oi, yi⟩. (As the ancient pronunciation of ⟨η⟩ was [ɛː], it is often transliterated as ⟨ē⟩.) On the other hand, ⟨αυ, ευ, ηυ⟩ are pronounced /af,ef,if/, and are voiced to [av,ev,iv] when followed by a voiced consonant – a shift from Ancient Greek /au̯,eu̯,iu̯/. A transliteration would render them all as ⟨au, eu, iu⟩ no matter the environment these sounds are in, reflecting the traditional orthography of Ancient Greek, yet a transcription would distinguish them, based on their phonemic and allophonic pronunciations in Modern Greek. Furthermore, the initial letter ⟨h⟩ reflecting the historical rough breathing ⟨ ̔⟩ in words such as ⟨Hellēnikḗ⟩ would intuitively be omitted in transcription for Modern Greek, as Modern Greek no longer has the /h/ sound.
Greek word | Transliteration | Transcription | English translation |
---|---|---|---|
Ελληνική Δημοκρατία | Hellēnikḗ Dēmokratía | Elliniki Dimokratia | 'Hellenic Republic' |
Ελευθερία | Eleuthería | Eleftheria | 'Freedom, Liberty' |
Ευαγγέλιο | Euangélio | Evangelio | 'Gospel' |
των υιών | tōn hyiṓn | ton ion | 'of the sons' |
A simple example of difficulties in transliteration is the Arabic letter qāf. It is pronounced, in literary Arabic, approximately like English [k], except that the tongue makes contact not on the soft palate but on the uvula, but the pronunciation varies between different dialects of Arabic. The letter is sometimes transliterated into "g", sometimes into "q" or "'" (for in Egypt it is silent) and rarely even into "k" in English. [2] Another example is the Russian letter "Х" (kha). It is pronounced as the voiceless velar fricative /x/, like the Scottish pronunciation of ⟨ch⟩ in "loch". This sound is not present in most forms of English and is often transliterated as "kh" as in Nikita Khrushchev. Many languages have phonemic sounds, such as click consonants, which are quite unlike any phoneme in the language into which they are being transliterated.
Some languages and scripts present particular difficulties to transcribers. These are discussed on separate pages. Examples of languages and writing systems and methods of transliterating include:
The Hebrew alphabet, known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew language and other Jewish languages, most notably Yiddish, Ladino, Judeo-Arabic, and Judeo-Persian. In modern Hebrew, vowels are increasingly introduced. It is also used informally in Israel to write Levantine Arabic, especially among Druze. It is an offshoot of the Imperial Aramaic alphabet, which flourished during the Achaemenid Empire and which itself derives from the Phoenician alphabet.
A breve is the diacritic mark ◌̆, shaped like the bottom half of a circle. As used in Ancient Greek, it is also called brachy, βραχύ. It resembles the caron but is rounded, in contrast to the angular tip of the caron. In many forms of Latin, ◌̆ is used for a shorter, softer variant of a vowel, such as "Ĭ", where the sound is nearly identical to the English /i/.
In the polytonic orthography of Ancient Greek, the rough breathing character is a diacritical mark used to indicate the presence of an sound before a vowel, diphthong, or after rho. It remained in the polytonic orthography even after the Hellenistic period, when the sound disappeared from the Greek language. In the monotonic orthography of Modern Greek phonology, in use since 1982, it is not used at all.
In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, for representing the spoken word, and combinations of both. Transcription methods can be subdivided into phonemic transcription, which records the phonemes or units of semantic meaning in speech, and more strict phonetic transcription, which records speech sounds with precision.
A caronKARR-ən. or háček, is a diacritic mark placed over certain letters in the orthography of some languages, to indicate a change of the related letter's pronunciation. Typographers tend to use the term caron, while linguists prefer the Czech word háček.
A phonemic orthography is an orthography in which the graphemes correspond consistently to the language's phonemes, or more generally to the language's diaphonemes. Natural languages rarely have perfectly phonemic orthographies; a high degree of grapheme–phoneme correspondence can be expected in orthographies based on alphabetic writing systems, but they differ in how complete this correspondence is. English orthography, for example, is alphabetic but highly nonphonemic.
A digraph or digram is a pair of characters used in the orthography of a language to write either a single phoneme, or a sequence of phonemes that does not correspond to the normal values of the two characters combined.
Cyrillization or Cyrillisation is the process of rendering words of a language that normally uses a writing system other than Cyrillic script into the Cyrillic alphabet. Although such a process has often been carried out in an ad hoc fashion, the term "cyrillization" usually refers to a consistent system applied, for example, to transcribe names of German, Chinese, or English people and places for use in Russian, Ukrainian, Serbian, Macedonian or Bulgarian newspapers and books. Cyrillization is analogous to romanization, when words from a non-Latin script-using language are rendered in the Latin alphabet for use
Sha, She or Shu, alternatively transliterated Ša is a letter of the Glagolitic and Cyrillic scripts. It commonly represents the voiceless postalveolar fricative, like the pronunciation of sh in "ship". More precisely, the sound in Russian denoted by ш is commonly transcribed as a palatoalveolar fricative but is actually a voiceless retroflex fricative. It is used in every variation of the Cyrillic alphabet for Slavic and non-Slavic languages.
The romanization of Ukrainian, or Latinization of Ukrainian, is the representation of the Ukrainian language in Latin letters. Ukrainian is natively written in its own Ukrainian alphabet, which is based on the Cyrillic script. Romanization may be employed to represent Ukrainian text or pronunciation for non-Ukrainian readers, on computer systems that cannot reproduce Cyrillic characters, or for typists who are not familiar with the Ukrainian keyboard layout. Methods of romanization include transliteration and transcription.
The Hebrew language uses the Hebrew alphabet with optional vowel diacritics. The romanization of Hebrew is the use of the Latin alphabet to transliterate Hebrew words.
Ayin is the sixteenth letter of the Semitic scripts, including Arabic ʿaynع, Aramaic ʿē 𐡏, Hebrew ʿayinע, Phoenician ʿayin 𐤏, and Syriac ʿē ܥ.
Ġ is a letter of the Latin script, formed from G with the addition of a dot above the letter.
Scientific transliteration, variously called academic, linguistic, international, or scholarly transliteration, is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). This system is most often seen in linguistics publications on Slavic languages.
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and subsequently their alphabet was altered by the Ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet. It is considered a dead language.
Hindustani has been written in several different scripts. Most Hindi texts are written in the Devanagari script, which is derived from the Brāhmī script of Ancient India. Most Urdu texts are written in the Urdu alphabet, which comes from the Persian alphabet. Hindustani has been written in both scripts. In recent years, the Latin script has been used in these languages for technological or internationalization reasons. Historically, Kaithi script has also been used.
Romanisation of Bengali is the representation of written Bengali language in the Latin script. Various romanisation systems for Bengali are used, most of which do not perfectly represent Bengali pronunciation. While different standards for romanisation have been proposed for Bengali, none has been adopted with the same degree of uniformity as Japanese or Sanskrit.
Romanization or Latinization of Persian is the representation of the Persian language with the Latin script. Several different romanization schemes exist, each with its own set of rules driven by its own set of ideological goals.
The romanization of Cyrillic is the process of converting text written in the Cyrillic script into the Latin alphabetic script, or a system for such conversion.