The romanization of Kurdish language [a] is the practice of transcribing the Kurdish, traditionally written in both Arabic and Latin scripts, into a standardized Latin alphabet. The development of Kurdish romanization systems supports the need for digital communication, linguistic research, and accessibility for Kurdish speakers and Kurdish speakers in diaspora communities. [1] These systems strive to maintain phonetic precision and consistency across the diverse Kurdish dialects. [2]
Efforts to standardize Kurdish in a Latin-based script began in the 20th century, driven by a desire to create a unified writing system that could bridge dialectal differences and promote a unified Kurdish identity. The adaptation of the Latin script for Kurdish was influenced by its use in other Arabic, Turkic and Iranian languages in the region, which were also transitioning away from Arabic scripts at the time. [3] [4]
Mîr Celadet Bedirxan, a Kurdish writer, linguist and diplomat, [5] was the first to bring the Latin script to the Middle East and use it for Kurdish, alongside the Arabic script since the 1920–1930s, [6] [7] [8] this new Latin script became known as Hawar. [9]
The Kurdish Academy system is one of the most widely used romanization systems for Kurdish. It emphasizes phonetic accuracy, utilizing Latin characters with diacritics to represent unique Kurdish phonology. This system is particularly useful for linguistic and educational purposes, providing clear guidelines on how to pronounce Kurdish words accurately. [10] [3]
The following table illustrates the transcription of selected Kurdish sounds into Latin script according to the Kurdish Romanization system, showing the corresponding Arabic script, phonetic sounds, and example words for each letter.
Romanization (Hawar generally considered) | Arabic Script (Sorani Kurdish) | Sound | Example Word |
---|---|---|---|
a | ئ | /æ/ | ئاوارە (aware - Refugee) |
a [b] | ا | /ɑ/ | باران (baran - Rain) |
b | ب | /b/ | بەفر (befir - Snow) |
c | ج | /ʤ/ | جاران (caran - Times) |
ç | چ | /ʧ/ | چاو (çaw - Eye) |
d | د | /d/ | دەنگ (deng - Sound) |
'e | ع | /ʕ/ | عەشیرەت ('eşîret - Tribe) |
e [c] | ە | /e/ | ئەمە (eme - This) |
ê | ێ | /ɛ/ | گوێ (giwê - Ear) |
f | ف | /f/ | فڕین (fiřîn - To fly) |
g | گ | /ɡ/ | گەرمی (germî - Warmth) |
h | ھ | /h/ | ھەزار (hezar - Thousand) |
ĥ [d] | ح | /ħ/ | حەوت (ĥewit - Seven) |
i [e] | 'بزرۆکە' [13] | /i/ | ئاگر (agir -Fire) |
î | ی | /iː/ | چیرۆک (çîrok - Tale) |
j | ژ | /ʒ/ | ژمارە (jimare - Number) |
k | ک | /k/ | کوردی (Kurdî - Kurdish) |
ll or ł | ڵ | /ɫ/ | لێڵ (lêll or lêł - Blurry) |
l | ل | /l/ | لەیلا (leyla - Leyla) [f] |
m | م | /m/ | مەزن (mezin - Big) |
n | ن | /n/ | نەورۆز (newroz - Kurdish new year) |
o | ۆ | /o/ | خۆشی (xoşî - Happiness) |
p | پ | /p/ | پڕ (př - Full) |
q | ق | /q/ | قسە (qise - Speech) |
r | ر | /ɾ/ | کرم (kirim - Earthworm) |
s | س | /s/ | سەرکەوتن (serkewtin - Victory) |
ş | ش | /ʃ/ | شاخ (şax - Mountain) |
t | ت | /t/ | تەمەن (temen - Age) |
u | و | /ʊ/ | بەرز و نزم (beriz u nizim - High and low) |
û | وو | /uː/ | بەستوو (bestû - Icy) |
v | ڤ | /v/ | ڤەداندەر (vedander - Creator) |
w | و | /w/ | وەرگرتن (wergirtin - To take) |
x | خ | /x/ | خاک (xak - Land) |
y | ی | /j/ | یار (yar - Lover) |
z | ز | /z/ | زەرد (zerd - Yellow) |
Due to the variety of Kurdish dialects, regional adaptations of romanization exist, reflecting pronunciation differences. This has led to some differences in preferred romanized forms across regions, as various linguistic institutions seek to adapt romanization practices that best reflect their local dialect. [14]
The use of romanized Kurdish is prevalent in digital communication, especially within Kurdish diaspora communities, where Latin-based keyboards are more accessible. [14] Additionally, the UK government has developed a practical romanization system for administrative purposes, designed to simplify Kurdish transcription without the use of diacritics. [15] [16]
An alphabet is a standard set of letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from another in a given language. Not all writing systems represent language in this way: a syllabary assigns symbols to spoken syllables, while logographies assign symbols to words, morphemes, or other semantic units.
An abjad, also abgad, is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introduced in 1990 by Peter T. Daniels. Other terms for the same concept include partial phonemic script, segmentally linear defective phonographic script, consonantary, consonant writing, and consonantal alphabet.
A diacritic is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός, from διακρίνω. The word diacritic is a noun, though it is sometimes used in an attributive sense, whereas diacritical is only an adjective. Some diacritics, such as the acute ⟨ó⟩, grave ⟨ò⟩, and circumflex ⟨ô⟩, are often called accents. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters.
The Hebrew alphabet, known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew language and other Jewish languages, most notably Yiddish, Ladino, Judeo-Arabic, and Judeo-Persian. In modern Hebrew, vowels are increasingly introduced. It is also used informally in Israel to write Levantine Arabic, especially among Druze. It is an offshoot of the Imperial Aramaic alphabet, which flourished during the Achaemenid Empire and which itself derives from the Phoenician alphabet.
Kurdish is a Northwestern Iranian language or group of languages spoken by Kurds in the region of Kurdistan, namely in Turkey, northern Iraq, northwest and northeast Iran, and Syria.
Thaana, Tãnaa, Taana or Tāna is the present writing system of the Maldivian language spoken in the Maldives. Thaana has characteristics of both an abugida and a true alphabet, with consonants derived from indigenous and Arabic numerals, and vowels derived from the vowel diacritics of the Arabic abjad. Maldivian orthography in Thaana is largely phonemic.
A breve is the diacritic mark ◌̆, shaped like the bottom half of a circle. As used in Ancient Greek, it is also called brachy, βραχύ. It resembles the caron but is rounded, in contrast to the angular tip of the caron. In many forms of Latin, ◌̆ is used for a shorter, softer variant of a vowel, such as "Ĭ", where the sound is nearly identical to the English /i/.
In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, for representing the spoken word, and combinations of both. Transcription methods can be subdivided into phonemic transcription, which records the phonemes or units of semantic meaning in speech, and more strict phonetic transcription, which records speech sounds with precision.
Chechen is a Northeast Caucasian language spoken by approximately 1.8 million people, mostly in the Chechen Republic and by members of the Chechen diaspora throughout Russia and the rest of Europe, Jordan, Austria, Turkey, Azerbaijan, Ukraine, Central Asia and Georgia.
Avar, also known as Avaric, is a Northeast Caucasian language of the Avar–Andic subgroup that is spoken by Avars, primarily in Dagestan. In 2010, there were approximately one million speakers in Dagestan and elsewhere in Russia.
When used as a diacritic mark, the term dot refers to the glyphs "combining dot above", and "combining dot below" which may be combined with some letters of the extended Latin alphabets in use in a variety of languages. Similar marks are used with other scripts.
Comorian is the name given to a group of four Bantu languages spoken in the Comoro Islands, an archipelago in the southwestern Indian Ocean between Mozambique and Madagascar. It is named as one of the official languages of the Union of the Comoros in the Comorian constitution. Shimaore, one of the languages, is spoken on the disputed island of Mayotte, a French department claimed by Comoros.
Kabardian, also known as East Circassian, is a Northwest Caucasian language, that is considered to be the east dialect of Adyghe language. While some Soviet linguists have treated the two as distinct languages, the Circassians consider the eastern and western language variants to be dialects of one Circassian language.
Waw is the sixth letter of the Semitic abjads, including Arabic wāwو, Aramaic waw 𐡅, Hebrew vavו, Phoenician wāw 𐤅, and Syriac waw ܘ.
Kurdish is written using either of two alphabets: the Latin-based Bedirxan or Hawar alphabet, introduced by Celadet Alî Bedirxan in 1932 and popularized through the Hawar magazine, and the Kurdo-Arabic alphabet. The Kurdistan Region has agreed upon a standard for Central Kurdish, implemented in Unicode for computation purposes. The Hawar alphabet is primarily used in Syria and Turkey, while the Kurdo-Arabic alphabet is commonly used in Iraq and Iran. The Hawar alphabet is also used to some extent in Iraqi Kurdistan. Two additional alphabets, based on the Armenian and Cyrillic scripts, were once used by Kurds in the Soviet Union, most notably in the Armenian Soviet Socialist Republic and Kurdistansky Uyezd.
Celadet Elî Bedirxan, also known as Mîr Celadet, was a Kurdish diplomat, writer, linguist, journalist and political activist. He held a master's degree in law from Istanbul University, completed his studies in Munich, and spoke several languages including Arabic, Kurdish, Russian, German, Turkish, Persian and French. He left Turkey in 1923 when the Kemalists declared a new republic. In 1927, at a Kurdish conference held in Beirut, a committee was formed, the Xoybûn. He is known for having been the first modern linguist to compile and organise the grammar of the modern form of the Northern Kurdish language, Kurmanji, and having designed the Latin-based Hawar alphabet, which is now the formal alphabet of Kurmanji and is also sometimes used for the other dialects of the Kurdish Language, having replaced the Arabic-based, Cyrillic-based, Persian-based and Armenian-based alphabets formerly used for Kurmanji.
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and subsequently their alphabet was altered by the Ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
Romanisation of Bengali is the representation of written Bengali language in the Latin script. Various romanisation systems for Bengali are used, most of which do not perfectly represent Bengali pronunciation. While different standards for romanisation have been proposed for Bengali, none has been adopted with the same degree of uniformity as Japanese or Sanskrit.
Hawar was a Kurdish literature magazine, which was published in Damascus between 1932 and 1943. The magazine was first issued by Celadet Alî Bedirxan on 15 May 1932. The magazine was the first Kurdish literary magazine in Syria and it was intended to publish it on a monthly basis, but its publication was interrupted several times. The first 23 issues were published between 15 May 1932 and the 27 September 1935. The issues number 24 - 26 appeared between 1 April 1934 and 18 August 1935. The remainder was published between April 1941 and August 1943, and the French, who governed Syria and Lebanon at the time, supported its publication. The 57th and last issue was released on 15 August 1943. The first 23 issues of the magazine were published in both the Latin and the Arabic alphabets, but from the 24th issue onward, only the Latin alphabet was used. The Kurdish alphabet, which was formulated by the publisher Celadet Alî Bedirxan and also referred to as Hawar alphabet or the Bedirxan alphabet, was used to publish Hawar. The texts which were published emphasized the value of the Kurdish folklore for developing the Kurdish language and personal qualities. Hawar had its most subscribers in Syria and Iraq, but also Lebanon, Iran. Some educational institutions and libraries in Europe and the Middle East also subscribed to the Hawar magazine. Since 2006, the 15 May, the date of the first publication of Hawar in 1932, is celebrated as the Kurdish language day.
The Hanifi Rohingya script is a unified script for the Rohingya language. Rohingya today is written in three scripts, Hanifi, Arabic, and Latin (Rohingyalish). The Rohingya language was first written in the 19th century with a version of the Perso-Arabic script. In 1975, an orthographic Arabic script was developed and approved by the community leaders, based on the Urdu alphabet but with unique innovations to make the script suitable to Rohingya.
{{cite web}}
: CS1 maint: numeric names: authors list (link)