Cham Jawi is a variant of the Jawi adaptation of the Arabic script used to write the Cham language. [1] [2] [3] [4]
Compared to Malay, the language of the parent script of Cham Jawi, Cham has a richer and larger family of vowels. Malay Jawi, like the Arabic script itself, is an impure Abjad, meaning that most, but not all, vowels are unwritten. In Cham Jawi, the emphasis has been to write most vowels, and to differentiate between them. This has been done by the addition of accents and dots on the three mater lectionis letters, the letters alif (ا), waw (و), and ya (ي), which act as vowel carriers.
There are some issues however, when it has come to the task of standardizing a script that accurately shows vowels. First is the lack of an enforcement or pedagogical mechanism for any single standard. This is augmented by the fact that Cham people are dispersed in Cambodia, Vietnam, and outside of the Cham homeland, in countries such as Malaysia. Second has been the slow development of Arabic keyboard for computers, specifically the slow incorporation of various accents, diacritics, and unique writing features that have long been a central part of Cham literature. Thus over the years, various parallel conventions have come to be followed by different publications and peoples.
Below table is the most comprehensive agreed-upon convention on vowels for Cham. But for example, the vowels ư and ơ, are respectively written in the table below with alif with three dots (اۛ) and alif with an inverted v (اٛ). But, as the three dots character wasn't incorporated into keyboards, they also have been commonly been written as alif with an inverted v (اٛ) and alif with a v (اٚ) respectively. Other sources may write the two vowels with a single letter alif with an inverted v (اٛ)and not differentiate between them.
Rumi | Final | Middle (CV) | Middle (CVC) | Monosyllabic (CVC) | Independent (V) | Initial (VC) | Cham script Initial | Cham script Diacritic | IPA |
---|---|---|---|---|---|---|---|---|---|
a | ◌ـا | ◌َ | ◌ـا◌ | ◌َ◌ | ا / أ | عا◌ | ꨀ | - | [ a ] |
aa | ◌ـآ | ◌ـآ | ◌ـآ◌ | - | - | - | ◌ꨩ | [ aː ] | |
ư (â) | ◌ـاۛ | ◌ـاۛ | ◌ـاۛ◌ | اۛ | عاۛ◌ | ꨀꨲ | ◌ꨲ | [ ɨ ] | |
ưư, ươ (ââ) | ◌ـاۛۤ | ◌ـاۛۤ | ◌ـاۛۤ◌ | - | عاۛۤ◌ | ꨀꨲꨩ | ◌ꨲꨩ | [ ɨə̯ ] | |
ơ (e) | ◌ـاٛ | ◌ـاٛ | ◌ـاٛ◌ | اٛ | عاٛ◌ | ꨀꨮ | ◌ꨮ | [ ə ] | |
u | ◌ـو | ◌ـو | ◌ـو◌ | او | عو◌ | ꨀꨭ | ◌ꨭ | [ u ] | |
uu | ◌ـوۤ | ◌ـوۤ | ◌ـوۤ◌ | - | عو◌ | - | ◌ꨭꨩ | [ uː ] | |
o (ao) | ◌ـوٚ | ◌ـوٚ | ◌ـوٚ◌ | اوٚ | عوٚ◌ | ꨀꨱ | ꨱ | [ ɔ ] | |
oo | ◌ـوٚۤ | ◌ـوٚۤ | ◌ـوٚۤ◌ | - | عوٚۤ◌ | - | ꨯꨩ | [ ɔː ] | |
ô | ◌ـوٛ | ◌ـوٛ | ◌ـوٛ◌ | اوٛ | عوٛ◌ | ꨅ | ꨯ | [ o ] | |
i | ◌ـي | ◌ـيـ | ◌ـیـ◌ | ايـ◌ | عيـ◌ | ꨁ | ◌ꨪ | [ i ] | |
ii | ◌ـيۤ | ◌ـيۤـ | ◌ـیۤـ◌ | - | عيۤـ◌ | - | ◌ꨫ | [ iː ] | |
e (ai) | ◌ـيٚ | ◌ـيٚـ | ◌يٚـ◌ | ايٚـ◌ | عيٚـ◌ | ꨄ | ꨰ | [ ɛ ] | |
ee (ia) | ◌ـيٚۤ | ◌ـيٚۤـ | ◌يٚۤـ◌ | - | عيٚۤـ◌ | - | ꨰꨩ | [ ɛː ] | |
ê (é) | ◌ـيٛ | ◌ـيٛـ | ◌يٛـ◌ | ايٛـ◌ | عيٛـ◌ | ꨅꨮ | ꨯꨮ | [ e ] | |
êê (éé) | ◌ـيٛۤ | ◌ـيٛۤـ | ◌يٛۤـ◌ | - | عيٛۤـ◌ | - | ꨯꨮꨩ | [ eː ] |
Below table lists vowel sequences and their representation in Cham Jawi script.
Rumi | Cham Jawi | Cham script Diacritic |
---|---|---|
au | او | ◌ꨮꨭ |
ao | اوٛ | ꨯꨱ |
ai | اي | ꨰ |
ơi | اٛي | ◌ꨬ |
ua | وا | ◌ꨶ |
uư | واۛ | ◌ꨶꨮ |
uô | ووٛ | ꨶꨯ |
ui | وي | ◌ꨬꨭ |
oa | وٚا | ◌ꨶ |
ia | يا | ◌ꨳ |
iư | ياۛ | ◌ꨳꨲ |
iơ | ياٛ | ◌ꨳꨮ |
In Cham Jawi, additional characters for nasalised stops not found in the Malay Jawi alphabet have been added:
Maintaining consistency in the use of three-dot letters for nasal stop consonants, the letter ڤ has been reassigned to the sound mb (written as pp in Latin), and the letter ف reassigned to the sound p. (The f of Malay is absent in Cham.)
Voiceless aspirate consonants are represented as digraphs using ح:
Voiced aspirate consonants are represented as digraphs using ه:
The table below illustrates the letters of Cham Jawi, followed by notes for their uses.
Arabic (Cham script) (Latin) [ IPA ] | أ / ا1 (ꨀ) (a) [ ∅ ]/[ ʔ ]/[ a ] | ب (ꨝ) (b) [ p ] | بهـ / به (ꨞ) (bh) [ pʰ ] | ڢ (ꨡ) (bb / mb) [ ɓ ] | تـ / ت2 (ꨓ / ꩅ) (t) [ t ] | تحـ / تح (ꨔ) (th) [ tʰ ] |
---|---|---|---|---|---|---|
Arabic (Cham script) (Latin) [ IPA ] | ث3 (ꨦ / ꨧ / ꩋ) (s) [ s ] | ج (ꨎ) (j) [ c ] | جهـ / جه (ꨏ) (jh) [ cʰ ] | چ (ꨌ / ꩄ) (c) [ c ] | چحـ / چح (ꨍ) (ch) [ cʰ ] | ح4 (ꨨ / ◌ꩍ) (h) [ h ]/[ ◌ʰ ] |
Arabic (Cham script) (Latin) [ IPA ] | خ3 (ꨇ) (kh) [ x ] | څ (ꨒ) (nj) [ j ] | د (ꨕ) (d) [ t ] | دهـ / ده (ꨖ) (dh) [ tʰ ] | ذ3 (ꨦ / ꨧ / ꩋ) (z) [ z ] | ڎ (ꨙ) (đ / nd) [ ɗ ] |
Arabic (Cham script) (Latin) [ IPA ] | ر5 (ꨣ / ◌ꨴ / ꩉ) (r) [ ɣ ~ r ] | ز3 (ꨦ / ꨧ / ꩋ) (z) [ z ] | س (ꨦ / ꨧ / ꩋ) (s) [ s ] | ش3 (ꨦ / ꨧ / ꩋ) (x / s) [ s ] | ص3 (ꨦ / ꨧ / ꩋ) (s) [ s ] | ض3 (ꨖ) (dh) [ tʰ ] |
Arabic (Cham script) (Latin) [ IPA ] | ط3 (ꨓ / ꩅ) (t) [ t ] | ظ3 (ꨦ / ꨧ / ꩋ) (z) [ z ] | ع6 (ꨀ) ( - ) [ ∅ ] | غ5 (ꨣ / ꩉ) (r) [ ɣ ~ r ] | ڠ (ꨊ/ꨋ/◌ꩃ/ꩂ) (ng) [ ŋ ] | ف (ꨚ / ꩇ) (p) [ p ] |
Arabic (Cham script) (Latin) [ IPA ] | فحـ / فح (ꨜ) (ph) [ pʰ ] | ڤ (ꨛ) (pp) [ p ] | ق3 (ꨆ) (q / k) [ q ~ k ] | ك7 (ꨆ / ꩀ) (k / c) [ k ] | كحـ / كح (ꨇ) (kh) [ kʰ ] | ڬ (ꨈ) (g) [ k ] |
Arabic (Cham script) (Latin) [ IPA ] | ڬهـ / ڬه (ꨉ) (gh) [ kʰ ] | ل (ꨤ / ◌ꨵ / ꩊ) (l) [ l ] | م (ꨟ / ꨠ / ◌ꩌ) (m) [ m ] | ن (ꨗ / ꨘ / ꩆ) (n) [ n ] | و8 (ꨥ / ꨥ / ꨅ) (w / o / u) [ w ]/[ o ]/[ ∅ ] | هـ / ه4 (ꨨ / ◌ꩍ) (h) [ h ]/[ ◌ʰ ] |
Arabic (Cham script) (Latin) [ IPA ] | ة2 (ꩅ) (t) [ t ] | ي9 (ꨢ / ꩈ / ꨁ) (y / i) [ j ]/[ i ]/[ ∅ ] | ء7 (ꩀ) (k) [ ʔ ] |
In Cham Jawi, Reduplication is done in a manner similar to Jawi script and Pegon script, that is with the use of the numeral "٢" right after the base word.
While suffixed, the numeral "٢" comes in between the base word and the suffix, effectively being in the middle of the word.
Below is a sample text in Cham, in Rumi, Jawi, and Cham scripts. This text is the translation of a famous Vietnamese short poetry.
English Translation | Loss of money; Sad for a few days |
---|---|
Cham Rumi Script | Lahik jiên; drut druy hadôm harei |
Cham Jawi Script | لحيء جييٛن؛ دروت دروي حدوٛم حغاٛي |
Cham Script | ꨤꨨꨪꩀ ꨎꨳꨯꨮꩆ ꨕꨴꨭꩅ ꨕꨴꨭꩈ ꨨꨕꨯꩌ ꨨꨣꨬ |
Vietnamese Original | Mất tiền; Buồn vài ngày |
The Arabic alphabet, or the Arabic abjad, is the Arabic script as specifically codified for writing the Arabic language. It is written from right-to-left in a cursive style, and includes 28 letters, of which most have contextual letterforms. The Arabic alphabet is considered an abjad, with only consonants required to be written; due to its optional use of diacritics to notate vowels, it is considered an impure abjad.
A diacritic is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός, from διακρίνω. The word diacritic is a noun, though it is sometimes used in an attributive sense, whereas diacritical is only an adjective. Some diacritics, such as the acute ⟨ó⟩, grave ⟨ò⟩, and circumflex ⟨ô⟩, are often called accents. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters.
The Arabic script has numerous diacritics, which include consonant pointing known as iʻjām (إِعْجَام), and supplementary diacritics known as tashkīl (تَشْكِيل). The latter include the vowel marks termed ḥarakāt.
Jawi is a writing system used for writing several languages of Southeast Asia, such as Acehnese, Magindanawn, Malay, Mëranaw, Minangkabau, Tausūg, and Ternate. Jawi is based on the Arabic script, consisting of all 31 original Arabic letters, six letters constructed to fit phonemes native to Malay, and one additional phoneme used in foreign loanwords, but not found in Classical Arabic, which are ca, nga, pa, ga, va, and nya.
The Azerbaijani alphabet has three versions which includes the Arabic, Latin, and Cyrillic alphabets.
Wakhi is an Indo-European language in the Eastern Iranian branch of the language family spoken today in Wakhan District, Northern Afghanistan, and also in Tajikistan, Northern Pakistan and Western China.
Aljamiado or Aljamía texts are manuscripts that use the Arabic script for transcribing European languages, especially Romance languages such as Old Spanish or Aragonese. This alphabet is also called the Morisco alphabet.
The Turkmen alphabet refers to variants of the Latin alphabet, Cyrillic alphabet, or Arabic alphabet used for writing of the Turkmen language.
Aleph is the first letter of the Semitic abjads, including Phoenician ʾālep 𐤀, Hebrew ʾālefא, Aramaic ʾālap 𐡀, Syriac ʾālap̄ ܐ, Arabic ʾalifا, and North Arabian 𐪑. It also appears as South Arabian 𐩱 and Ge'ez ʾälef አ.
Serer, often broken into differing regional dialects such as Serer-Sine and Serer saloum, is a language of the Senegambian branch of the Niger–Congo family spoken by 1.2 million people in Senegal and 30,000 in the Gambia as of 2009. It is the principal language of the Serer people, and was the language of the early modern kingdoms of Sine, Saloum, and Baol.
The Ottoman Turkish alphabet is a version of the Perso-Arabic script used to write Ottoman Turkish until 1928, when it was replaced by the Latin-based modern Turkish alphabet.
The Urdu alphabet is the right-to-left alphabet used for writing Urdu. It is a modification of the Persian alphabet, which itself is derived from the Arabic script. It has official status in the republics of Pakistan, India and South Africa. The Urdu alphabet has up to 39 or 40 distinct letters with no distinct letter cases and is typically written in the calligraphic Nastaʿlīq script, whereas Arabic is more commonly written in the Naskh style.
Wolofal is a derivation of the Arabic script for writing the Wolof language. It is basically the name of a West African Ajami script as used for that language.
The modern Malay and Indonesian alphabet consists of the 26 letters of the ISO basic Latin alphabet. It is the more common of the two alphabets used today to write the Malay language, the other being Jawi. The Latin Malay alphabet is the official Malay script in Indonesia, Malaysia and Singapore, while it is co-official with Jawi in Brunei.
The hamza is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vowel, or other features, depending on the language. Derived from the letter ʿAyn, the hamza is written in initial, medial and final positions as an unlinked letter or placed above or under a carrier character. Despite its common usage as a letter in Modern Standard Arabic, it is generally not considered to be one of its letters, although some argue that it should be considered a letter.
In orthography, a zero consonant, silent initial, or null-onset letter is a consonant letter that does not correspond to a consonant sound, but is required when a word or syllable starts with a vowel. Some abjads, abugidas, and alphabets have zero consonants, generally because they have an orthographic rule that all syllables must begin with a consonant letter, whereas the language they transcribe allows syllables to start with a vowel. In a few cases, such as Pahawh Hmong below, the lack of a consonant letter represents a specific consonant sound, so the lack of a consonant sound requires a distinct letter to disambiguate.
The Pashto alphabet is the right-to-left abjad-based alphabet developed from the Arabic script, used for the Pashto language in Pakistan and Afghanistan. It originated in the 16th century through the works of Pir Roshan.
Pegon is a modified Arabic script used to write the Javanese, Sundanese, and Madurese languages, as an alternative to the Latin script or the Javanese script and the Old Sundanese script. It was used in a variety of applications, from religion, to diplomacy, to poetry. But today particularly, it is used for religious (Islamic) writing and poetry, particularly in writing commentaries of the Qur'an. Pegon includes letters that are not present in Modern Standard Arabic. Pegon has been studied far less than its Jawi counterpart which is used for Malay, Acehnese and Minangkabau.
The Khowar alphabet is the right-to-left alphabet used for the Khowar language. It is a modification of the Urdu alphabet, which is itself a derivative of the Persian alphabet and Arabic alphabet and uses the calligraphic Nastaʿlīq script.
The Swahili Ajami script refers to the alphabet derived from the Arabic script that is used for the writing of the Swahili language.
{{cite book}}
: CS1 maint: unrecognized language (link)