You can help expand this article with text translated from the corresponding article in Russian. (September 2017)Click [show] for important translation instructions.
|
Cyrillization of Arabic is the conversion of text written in Arabic script into Cyrillic script. Because the Arabic script is an abjad (a writing system without vowels), an accurate transliteration into Cyrillic, an alphabet, would still require prior knowledge of the subject language to read. Instead, systems of transcription have normally been used.
Letter | name | Scientific Transcription | Practical Transcription | Other | |||
---|---|---|---|---|---|---|---|
IPA | Cyrillic | DIA | lat. | ru. | |||
ﺍ | alif | /aː/ | а̄ ↓ | ā↓ | a, aa↓ | а,↓ я [1] | ә |
ﺀ | hamza | /ʔ/ | ’,— ↓ | ʾ↓ | ʼ, —↓ | —↓ | ъ, Ӏ |
ﺏ | bā’ | /b/ | б | b | b | б | — |
ﺕ | tā’ | /t/ | т | t | t | т | — |
ﺙ | thā’ | /θ/ | с̱ | ṯ | th, s, (t) [2] [3] | с, (т) [2] [3] | ҫ |
ﺝ | jīm | /d͡ʒ~ɡ/ | дж | ǧ | j, dj, (g) [2] [4] | дж, (г, ж) | җ, ж, ҹ, ӂ, џ |
ﺡ | ḥā’ (pharyngeal) | /ħ/ | х̣ | ḥ | h | х | ҳ, хӀ, хь |
ﺥ | khā’ | /x~χ/ | х̱,х̮ | ḫ | kh, x | х | хь |
ﺩ | dāl | /d/ | д | d | d | д | — |
ﺫ | zāl | /ð/ | з̱ | ḏ | dh, (z, d) [2] [3] | з, (д) [2] [3] | ҙ |
ﺭ | rā’ | /r/ | р | r | r | р | — |
ﺯ | zayn | /z/ | з | z | z | з | — |
ﺱ | sīn | /s/ | с | s | s | с | — |
ﺵ | shīn | /ʃ/ | ш | š | sh, ch | ш | — |
ﺹ | ṣād | /sˁ/ | с̣ | ṣ | s | с | сӀ |
ﺽ | ḍād | /dˁ/ | д̣ | ḍ | d, dh | д | дӀ |
ﻁ | ṭā’ | /tˁ/ | т̣ | ṭ | t | т | тӀ, тъ |
ﻅ | ẓā’ | /ðˁ~zˁ/ | з̣ | ẓ | z, (zh, dh) [2] [3] | з, (д) [3] | зӀ |
ﻉ | ‘ayn | /ʕ/ | ‘ | ʿ | ʻ | —, ъ, vowel doubling [5] | Ӏ, гӀ |
ﻍ | ghayn | /ɣ/ | г,г̣ | ġ | g, gh | г | ғ, ҕ, гӀ, гъ |
ﻑ | fā’ | /f/ | ф | f | f | ф | — |
ﻕ | qāf | /q/ | к̣ | q | q, k, (g, gh) [2] [6] | к, (г) [2] [6] | қ, ҡ, ҟ, къ, кх, кь, хъ |
ﻙ | kāf | /k/ | к | k | k, c | к | — |
ﻝ | lām | /l/ | л | l | l | л, ль [7] | — |
ﻡ | mīm | /m/ | м | m | m | м | — |
ﻥ | nūn | /n/ | н | n | n | н | — |
ﻩ | hā’ | /h/ | х,һ | h | h | х | һ, ҳ, хӀ, гь |
ﻭ | wāw | /w, uː/ | в,ў,ӯ | w, ū | w, u, oo, ou | в, у, ю [1] | — |
ﻱ | yā’ | /j, iː/ | й,ӣ | y, ī | y, i, ee | й, [1] и, ы [8] | ј |
Note: The following tables use the letter hamza (ء) as a carrier to illustrate the use of diacritics. It is not part of these signs.
To record short vowels after a consonant, optional signs (fatḥah, ḍammah, kasrah) are used above this consonant. To write long vowels, the same signs are used plus the corresponding consonant letter.
Letter | Title | Scientific transcription | Practical transcription | ||||
---|---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | |||
Short vowels | |||||||
ءَ | fatḥah | /a/ | а | a | a, e | а, е, э | |
ءُ | ḍammah | /u/ | у | u | u, o, ou | у, о | |
ءِ | kasrah | /i/ | и | i | i, e | и, е, э | |
Long vowels | |||||||
ءَا | fatḥah-alif | /aː/ | а̄ | ā | a, aa | а | |
ءٰ | alif-khanjariyah | /aː/ | а̄ | ā | a, aa | а | |
ءَى | fatḥah-alif-maqsurah | /aː/ | а̄ | ā | a, aa | а | |
ءَىٰ | — | /aː/ | а̄ | ā | a, aa | а | |
ءُو | ḍammah-waw | /uː/ | ӯ | ū | u, oo, ou | у | |
ءِي | kasrah-ya’ | /iː/ | ӣ | ī | i, ee | и | |
Diphthongs | |||||||
ءَو | fatḥah-waw | /au/ | аў | aw | aw, au, aou | ау | |
ءَي | fatḥah-ya’ | /ai/ | ай | ay | ay, ey, ei | ай, ей, эй | |
Combinations | |||||||
ءُوّ | — | /uːw,-uː/ | ӯв | uww | uww | ув | |
ءِيّ | — | /iːj,-iː/ | ӣй | iyy | uyy | ий [1] |
The glottal stop (in Arabic hamza ) has complex notation rules. It can be written as a single character on the line ⟨ﺀ⟩, thus not distinguishing itself from other consonants, but much more often it is written above or below three carrier letters: alif, vav, ya. At the same time, hamza, like any consonant, can be both before and after a vowel. At the beginning of words, the hamza is written exclusively above or below the alif. The combination "alif-hamza + fatha + alif" (that is, أَا) is written in a special way through alif-madda. The absence of a hamza is occasionally recorded with a wasla sign (usually used only before the initial alif). In unvoiced texts, even the complete absence of over- or signed hamza is possible (thus hamza is indistinguishable from vav and ya), which further complicates transcription. Hamza is almost never written over the alif of the definite article.
Letter | Name | Phonetic Transcription | Practical Transcription | ||||
---|---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | |||
Hamza before a vowel | |||||||
أَ | alifa-hamza-fatḥah | /ʔa/ | ’а | ʾa | ’a | а | |
أُ | alifa-hamza-ḍammah | /ʔu/ | ’у | ʾu | ’u | у | |
إِ | alif-hamza-kasrah | /ʔi/ | ’и | ʾi | ’i | и | |
آ | alif-maddah | /ʔaː/ | ’а̄ | ʾā | ’a | а | |
ؤَ | waw-hamza-fatḥah | /ʔa/ | ’а | ʾa | ’a | а | |
ؤُ | waw-hamza-ḍammah | /ʔu/ | ’у | ʾu | ’u | у | |
ؤِ | waw-hamza-kasra | /ʔi/ | ’и | ʾi | ’i | и | |
ئَ | yā’-hamza-fatḥah | /ʔa/ | ’а | ʾa | ’a | а | |
ئُ | yā’-hamza-ḍammah | /ʔu/ | ’у | ʾu | ’u | у | |
ئِ | yā’-hamza-kasrah | /ʔi/ | ’и | ʾi | ’i | и | |
Hamza after a vowel | |||||||
ءَأ | alifa-hamza-fatḥah | /aʔ/ | а’ | aʾ | a’ | а | |
ءُأ | alif-hamza-ḍammah | /uʔ/ | у’ | uʾ | u’ | у | |
ءِأ | alif-hamza-kasrah | /iʔ/ | и’ | iʾ | i’ | и | |
ءَؤ | waw-hamza-fatḥah | /aʔ/ | а’ | aʾ | a’ | а | |
ءُؤ | waw-hamza-ḍammah | /uʔ/ | у’ | uʾ | u’ | у | |
ءِؤ | waw-hamza-kasrah | /iʔ/ | и’ | iʾ | i’ | и | |
ءَئ | yā’-hamza-fatḥah | /aʔ/ | а’ | aʾ | a’ | а | |
ءُئ | yā’-hamza-ḍammah | /uʔ/ | у’ | uʾ | u’ | у | |
ءِئ | yā’-hamza-kasrah | /iʔ/ | и’ | iʾ | i’ | и | |
Absence of Hamza | |||||||
ٱ | hamzatu-l-waṣl | /∅/ | — |
As with the hamza, there are some difficulties in rendering the Arabic definite article. In neutral position, it is read and transcribed as al- ( el- ). In certain positions, the sound - l - can be assimilated with the subsequent consonant (see more at Sun and moon letters). After vowels, the initial a - ( e -) usually disappears (the so-called wallowing).
Letter | Name | Phonetic Transcription | Practical Transcription | |||
---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | ||
ال | alif-lam | /ʔal-, ʔaCː-/ | аль- | al- | al-, el- | аль-, эль- |
assimilation al before sun consonants | + | + | + | − / + | − / + |
The term ‘’nunation’’ (or ‘’tanwīn’’ ) in Arabic grammar refers to the case endings of the “indefinite state” (corresponds to the indefinite article in other languages). Sometimes, for historical reasons, after the nunation, the silent letters alif-maksura or a simple alif are additionally written.
Letter | Name | Phonetic Transcription | Practical Transcription | |||
---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | ||
ءً | tanwīn-fatḥah | /an/ | ан | an | ан | |
ءًى | — | /an/ | ан | an | ан | |
ءًا | — | /an/ | ан | an | ан | |
ءٌ | tanwīn-ḍammah | /un/ | ун | un | ун | |
ءٍ | tanwīn-kasra | /in/ | ин | in | ин |
Letter | Name | Phonetic Transcription | Practical Transcription | ||||
---|---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | |||
ءْ | sukūn | /∅/ | — | ||||
ءّ | shaddah | consonant doubling |
Letter | Name | Phonetic Transcription | Practical Transcription | |||
---|---|---|---|---|---|---|
IPA | Cyrillic | DIN | lat. | ru. | ||
ﺓ | tāʾ marbūṭah | /a, at/ | а, ат | h, t | a, ah / at | а, ат/ет |
ﻯ | alif maqsura | /aː/ | а̄ | ā | a | а |
ﻻ | lam-alif | /laː/ | ла̄ | lā | la | ля |
Arabic text with harakat [9] | Romanization | Russian Cyrillization | Translation |
---|---|---|---|
يُولَدُ جَمِيعُ النَّاسِ أحْرَاراً مُتَسَاوِينَ فِي الْكَرَامَةِ وَالحُقُوقِ. وَقَدْ وُهِبُوا عَقْلاً وَ ضَمِيراً وَ عَلَيهِمْ أنْ يُعَامِلَ بَعْضُهُمْ بَعْضاً بِرُوحِ اَلإخَاء. | Yūladu jamī‘u n-nāsi aḥrāran mutasāwīna fī l-karāmati wa-l-ḥuqūq. Wa-qad wuhibū ‘aqlan wa-ḍamīran wa-‘alayhim an yu‘āmila ba‘ḍuhum ba‘ḍan bi-rūḥi l-ikhā’. | Йӯладу джамӣ‘у н-на̄си ах̣ра̄ран мутаса̄вӣна фӣ л-кара̄мати ва-л-х̣ук̣ӯк̣. Ва-к̣ад вухибӯ ‘ак̣лан ва-д̣амӣран ва-‘алайхим ан йу‘а̄мила ба̄‘д̣ухум ба‘д̣ан би-рӯх̣и л-их̱а̄’. | All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood |
The Arabic alphabet, or the Arabic abjad, is the Arabic script as specifically codified for writing the Arabic language. It is written from right-to-left in a cursive style, and includes 28 letters, of which most have contextual letterforms. The Arabic alphabet is considered an abjad, with only consonants required to be written; due to its optional use of diacritics to notate vowels, it is considered an impure abjad.
The Arabic script has numerous diacritics, which include consonant pointing known as iʻjām (إِعْجَام), and supplementary diacritics known as tashkīl (تَشْكِيل). The latter include the vowel marks termed ḥarakāt.
Abaza is a Northwest Caucasian language spoken by Abazins in Russia. The language has gone through several different orthographies based primarily on Latin and Cyrillic letters. Its consonant-to-vowel ratio is remarkably high; making it quite similar to many other languages from the same parent chain. The language evolved in popularity in the mid to late 1800s, but has become an endangered language.
Bashkir or Bashkort is a Turkic language belonging to the Kipchak branch. It is co-official with Russian in Bashkortostan. It is spoken by 1.09 million native speakers in Russia, as well as in Ukraine, Belarus, Kazakhstan, Uzbekistan, Estonia and other neighboring post-Soviet states, and among the Bashkir diaspora. It has three dialect groups: Southern, Eastern and Northwestern.
Avar, also known as Avaric, is a Northeast Caucasian language of the Avar–Andic subgroup that is spoken by Avars, primarily in Dagestan. In 2010, there were approximately one million speakers in Dagestan and elsewhere in Russia.
When used as a diacritic mark, the term dot refers to the glyphs "combining dot above", and "combining dot below" which may be combined with some letters of the extended Latin alphabets in use in a variety of languages. Similar marks are used with other scripts.
Kalmyk Oirat, commonly known as the Kalmyk language, is a variety of the Oirat language, natively spoken by the Kalmyk people of Kalmykia, a federal subject of Russia. In Russia, it is the standard form of the Oirat language, which belongs to the Mongolic language family. The Kalmyk people of the Northwest Caspian Sea of Russia claim descent from the Oirats from Eurasia, who have also historically settled in Mongolia and Northwest China. According to UNESCO, the language is "definitely endangered". According to the Russian census of 2021, there are 110,000 speakers out of an ethnic population consisting of 178,000 people.
While many languages have numerous dialects that differ in phonology, contemporary spoken Arabic is more properly described as a continuum of varieties. This article deals primarily with Modern Standard Arabic (MSA), which is the standard variety shared by educated speakers throughout Arabic-speaking regions. MSA is used in writing in formal print media and orally in newscasts, speeches and formal declarations of numerous types.
Aljamiado or Aljamía texts are manuscripts that use the Arabic script for transcribing European languages, especially Romance languages such as Old Spanish or Aragonese. This alphabet is also called the Morisco alphabet.
Waw is the sixth letter of the Semitic abjads, including Phoenician wāw 𐤅, Aramaic waw 𐡅, Hebrew vavו, Syriac waw ܘ and Arabic wāwو.
Yodh is the tenth letter of the Semitic abjads, including Phoenician yōd 𐤉, Hebrew yudי, Aramaic yod 𐡉, Syriac yōḏ ܝ, and Arabic yāʾي. Its sound value is in all languages for which it is used; in many languages, it also serves as a long vowel, representing.
In an alphabetic writing system, a silent letter is a letter that, in a particular word, does not correspond to any sound in the word's pronunciation. In linguistics, a silent letter is often symbolised with a null sign U+2205∅EMPTY SET, which resembles the Scandinavian letter Ø. A null or zero is an unpronounced or unwritten segment.
Aleph is the first letter of the Semitic abjads, including Phoenician ʾālep 𐤀, Hebrew ʾālefא, Aramaic ʾālap 𐡀, Syriac ʾālap̄ ܐ, Arabic ʾalifا, and North Arabian 𐪑. It also appears as South Arabian 𐩱 and Ge'ez ʾälef አ.
Libyan Arabic, also called Sulaimitian Arabic by scholars, is a variety of Arabic spoken in Libya, and neighboring countries. It can be divided into two major dialect areas; the eastern centred in Benghazi and Bayda, and the western centred in Tripoli and Misrata. The Eastern variety extends beyond the borders to the east and share the same dialect with far Western Egypt, Western Egyptian Bedawi Arabic, with between 90,000 and 474,000 speakers in Egypt. A distinctive southern variety, centered on Sabha, also exists and is more akin to the western variety. Another Southern dialect is also shared along the borders with Niger with 12,900 speakers in Niger as of 2021.
Three alphabets are used to write Kazakh: the Cyrillic, Latin and Arabic scripts. The Cyrillic script is used in Kazakhstan, Russia, and Mongolia. An October 2017 Presidential Decree in Kazakhstan ordered that the transition from Cyrillic to a Latin script be completed by 2031. The Arabic script is used in Saudi Arabia, Iran, Afghanistan, and parts of China.
The hamza is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vowel, or other features, depending on the language. Derived from the letter ʿAyn, the hamza is written in initial, medial and final positions as an unlinked letter or placed above or under a carrier character. Despite its common usage as a letter in Modern Standard Arabic, it is generally not considered to be one of its letters, although some argue that it should be considered a letter.
Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the theologians Cyril and Methodius. It is the basis of alphabets used in various languages, past and present, Slavic origin, and non-Slavic languages influenced by Russian. As of 2011, around 252 million people in Eurasia use it as the official alphabet for their national languages. About half of them are in Russia. Cyrillic is one of the most-used writing systems in the world. The creator is Saint Clement of Ohrid from the Preslav literary school in the First Bulgarian Empire.
The Pashto alphabet is the right-to-left abjad-based alphabet developed from the Arabic script, used for the Pashto language in Pakistan and Afghanistan. It originated in the 16th century through the works of Pir Roshan.
The phonological system of the Hejazi Arabic consists of approximately 26 to 28 native consonant phonemes and 8 vowel phonemes:. Consonant length and vowel length are both distinctive in Hejazi.
Cham Jawi is a variant of the Jawi adaptation of the Arabic script used to write the Cham language.