Bengali language

Last updated

Bangla Script.svg
"Bangla" in Bengali script
Pronunciation [ˈbaŋla] ( Loudspeaker.svg listen )
Native to Bangladesh and India
Region Bengal
Ethnicity Bengalis
Native speakers
230 million (2011–2017) [1] [2]
L2 speakers: 37 million [1]
Early forms
Bengali alphabet
Bengali Braille
Bengali signed forms [3]
Official status
Official language in
Flag of Bangladesh.svg  Bangladesh
Flag of India.svg  India (in West Bengal, Tripura, Assam's Barak Valley, Jharkhand [4] )
Regulated by Bangla Academy
Paschimbanga Bangla Akademi
Language codes
ISO 639-1 bn
ISO 639-2 ben
ISO 639-3 ben
Glottolog beng1280 [5]
Bengalispeaking region.png
Bengali-speaking region of South Asia
This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters. For an introductory guide on IPA symbols, see Help:IPA.
Part of a series on
Montage of Bengal.jpg
A series of picture stories narrated in Bengali

Bengali ( /bɛŋˈɡɔːli/ ), [6] also known by its endonym Bangla (বাংলা [ˈbaŋla] ), is an Indo-Aryan language primarily spoken by the Bengalis in South Asia. It is the official and most widely spoken language of Bangladesh and second most widely spoken of the 22 scheduled languages of India, behind Hindi. With approximately 228 million native speakers and another 37 million as second language speakers, [1] [7] Bengali is the fifth most-spoken native language and the seventh most spoken language by total number of speakers in the world. [8] [9]


The official and de facto national language of Bangladesh is Modern Standard Bengali (Literary Bengali). [10] [11] [12] It serves as the lingua franca of the nation, with 98% of Bangladeshis being fluent in Bengali as their first language. [13] [14] Within India, Bengali is the official language of the states of West Bengal, Tripura and the Barak Valley in the state of Assam, and is the most widely spoken language in the Andaman and Nicobar Islands in the Bay of Bengal, [15] and is spoken by significant populations in other states including in Arunachal Pradesh, Delhi, Chhattisgarh, Jharkhand, Meghalaya, Mizoram, Nagaland and Uttarakhand. [16] Bengali is also spoken by the significant global Bengali diaspora (Bangladeshi diaspora and Indian Bengalis) communities in Pakistan, the United States, the United Kingdom, the Middle East. [17]

Bengali has developed over the course of more than 1,300 years. Bengali literature, with its millennium-old literary history, has extensively developed since the Bengali Renaissance and is one of the most prominent and diverse literary traditions in Asia. The Bengali language movement from 1948 to 1956 demanding Bengali to be an official language of Pakistan fostered Bengali nationalism in East Bengal leading to the emergence of Bangladesh in 1971. In 1999, UNESCO recognised 21 February as International Mother Language Day in recognition of the language movement. [18] [19] The Bengali language is the quintessential element of Bengali identity and binds together a culturally diverse region.


Silver coin with proto-Bengali script, Harikela Kingdom, circa 9th-13th century Coin - Silver - Circa 9-10th Century 13th Century CE - Harikela Kingdom - ACCN 90-C2752 - Indian Museum - Kolkata 2014-04-04 4303.JPG
Silver coin with proto-Bengali script, Harikela Kingdom, circa 9th–13th century

Ancient languages of Bengal

Sanskrit was practised by the priests in Bengal since the first millennium BCE. But, the local people were speaking in some varieties of Prakrita languages. Dr. Suniti Kumar Chatterjee coined it as "eastern variety of Magdhi Prakrita". During the Gupta Empire, Bengal was a hub of Sanskrit literature. [20] The Middle Indo-Aryan dialects were influential in Bengal in the first millennium when the region was a part of the Magadha Realm. These dialects were called Magadhi Prakrit spoken in current Bihar state of India. The Magdhi Prakrita eventually evolved into Ardha Magadhi and become more distinct from the languages of Bengal day by day. [21] [22] Ardha Magadhi began to give way to what are called Apabhraṃśa languages at the end of the first millennium. Then Bengali language evolved as a distinct language by the course of time. [23]


Along with other Eastern Indo-Aryan languages, Bengali evolved circa 1000–1200 CE from Sanskrit and Magadhi Prakrit. [24] The local Apabhraṃśa of the eastern subcontinent, Purbi Apabhraṃśa or Abahatta ("Meaningless Sounds"), eventually evolved into regional dialects, which in turn formed three groups of the Bengali–Assamese languages, the Bihari languages, and the Odia language. Some argue that the points of divergence occurred much earlier – going back to even 500, [25] but the language was not static: different varieties coexisted and authors often wrote in multiple dialects in this period. For example, Ardhamagadhi is believed to have evolved into Abahatta around the 6th century, which competed with the ancestor of Bengali for some time. [26] Proto-Bengali was the language of the Pala Empire and the Sena dynasty. [27] [28]


Silver Taka from the Sultanate of Bengal, circa 1417 Silver coin of Danujamarddana.jpg
Silver Taka from the Sultanate of Bengal, circa 1417

During the medieval period, Middle Bengali was characterised by the elision of word-final ô, the spread of compound verbs and Arabic and Persian influences. Bengali was an official court language of the Sultanate of Bengal. Muslim rulers promoted the literary development of Bengali. [29] Bengali became the most spoken vernacular language in the Sultanate. [30] This period saw borrowing of Perso-Arabic terms into Bengali vocabulary. Major texts of Middle Bengali (1400–1800) include Chandidas' Shreekrishna Kirtana .


Silchar Bhasa.jpeg
Language Martyr's Memorial at Silchar Railway Station in Assam, India.

The modern literary form of Bengali was developed during the 19th and early 20th centuries based on the dialect spoken in the Nadia region, a west-central Bengali dialect. Bengali presents a strong case of diglossia, with the literary and standard form differing greatly from the colloquial speech of the regions that identify with the language. [31] The modern Bengali vocabulary contains the vocabulary base from Magadhi Prakrit and Pali, also tatsamas and reborrowings from Sanskrit and other major borrowings from Persian, Arabic, Austroasiatic languages and other languages in contact with.

During this period, the

In 1948 the Government of Pakistan tried to impose Urdu as the sole state language in Pakistan, starting the Bengali language movement. [33] The Bengali Language Movement was a popular ethno-linguistic movement in the former East Bengal(today Bangladesh), which was a result of the strong linguistic consciousness of the Bengalis to gain and protect spoken and written Bengali's recognition as a state language of the then Dominion of Pakistan. On the day of 21 February 1952 five students and political activists were killed during protests near the campus of the University of Dhaka. In 1956 Bengali was made a state language of Pakistan. [33] The day has since been observed as Language Movement Day in Bangladesh and is also commemorated as International Mother Language Day by UNESCO every year since 2000.

In 2010, the parliament of Bangladesh and the legislative assembly of West Bengal proposed that Bengali be made an official UN language, [34] though no further action was taken on this matter.

Geographical distribution

Approximate distribution of native Bengali speakers (assuming a rounded total of 261 million) worldwide.

  Bangladesh (61.3%)
  India (37.2%)
  Other (1.5%)

The Bengali language is native to the region of Bengal, which comprises Indian states of West Bengal and the present-day nation of Bangladesh.

A Bengali sign in Brick Lane in London, which is home to a large Bengali diaspora Brick Lane - London.jpg
A Bengali sign in Brick Lane in London, which is home to a large Bengali diaspora

Besides the native region it is also spoken by the Bengalis living in Tripura, southern Assam and the Bengali population in the Indian union territory of Andaman and Nicobar Islands. Bengali is also spoken in the neighbouring states of Odisha, Bihar, and Jharkhand, and sizeable minorities of Bengali speakers reside in Indian cities outside Bengal, including Delhi, Mumbai, Varanasi, and Vrindavan. There are also significant Bengali-speaking communities in the Middle East, [35] [36] [37] the United States, [38] Singapore, [39] Malaysia, Australia, Canada, the United Kingdom, and Italy.

Official status

The 3rd article of the Constitution of Bangladesh states Bengali to be the sole official language of Bangladesh. [12] The Bengali Language Implementation Act, 1987 made it mandatory to use Bengali in all records and correspondences, laws, proceedings of court and other legal actions in all courts, government or semi-government offices, and autonomous institutions in Bangladesh. [10] It is also the de facto national language of the country.

In India, Bengali is one of the 23 official languages. [40] It is the official language of the Indian states of West Bengal, Tripura and in Barak Valley of Assam. [41] [42] Bengali is a second official language of the Indian state of Jharkhand since September 2011. It is also a recognised secondary language in the City of Karachi in Pakistan. [43] [44] [45] The Department of Bengali in the University of Karachi also offers regular programs of studies at the Bachelors and at the Masters levels for Bengali Literature. [46]

The national anthems of both Bangladesh ( Amar Sonar Bangla ) and India ( Jana Gana Mana ) were written in Bengali by the Bengali Nobel laureate Rabindranath Tagore. [47] Additionally, the first two verses of Vande Mataram , a patriotic song written in Bengali by Bankim Chandra Chatterjee, was adopted as the "national song" of India in both the colonial period and later in 1950 in independent India. Furthermore, it is believed by many that the national anthem of Sri Lanka (Sri Lanka Matha) was inspired by a Bengali poem written by Rabindranath Tagore, [48] [49] [50] [51] while some even believe the anthem was originally written in Bengali and then translated into Sinhala. [52] [53] [54] [55]

In 2009, elected representatives in both Bangladesh and West Bengal called for Bengali language to be made an official language of the United Nations. [56]


A map of Bengal (and some districts of Assam and Jharkhand) which shows the dialects of the Bengali Language.
Rarhi dialect
Bangali dialect
Manbhumi dialect
Varendri dialect
Sundarbani dialect
*Rajbanshi dialect/language
*Chittagonian dialect/language
*Sylheti dialect/language
(the name with *bold letter are considered as Bengali dialects. also sometimes as separate languages.) Bengali dialects political map.svg
A map of Bengal (and some districts of Assam and Jharkhand) which shows the dialects of the Bengali Language. (the name with *bold letter are considered as Bengali dialects. also sometimes as separate languages.)

Regional variation in spoken Bengali constitutes a dialect continuum. Linguist Suniti Kumar Chattopadhyay grouped these dialects into four large clusters – Rarh, Banga, Kamrupi and Varendra; [57] but many alternative grouping schemes have also been proposed. [58] The south-western dialects (Rarhi or Nadia dialect) form the basis of modern standard colloquial Bengali. In the dialects prevalent in much of eastern and south-eastern Bangladesh (Barisal, Chittagong, Dhaka and Sylhet Divisions of Bangladesh), many of the stops and affricates heard in West Bengal are pronounced as fricatives. Western alveolo-palatal affricates [ tɕɔ ], [ tɕʰɔ ], [ dʑɔ ] correspond to eastern [tsɔ], [tsʰɔ~sɔ], [dzɔ~zɔ]. The influence of Tibeto-Burman languages on the phonology of Eastern Bengali is seen through the lack of nasalised vowels and an alveolar articulation of what are categorised as the "cerebral" consonants (as opposed to the postalveolar articulation of West Bengal). Some variants of Bengali, particularly Chittagonian and Chakma, have contrastive tone; differences in the pitch of the speaker's voice can distinguish words. Rangpuri, Kharia Thar and Mal Paharia are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects. [59]

During the standardisation of Bengali in the 19th century and early 20th century, the cultural center of Bengal was in the city of Kolkata, founded by the British. What is accepted as the standard form today in both West Bengal and Bangladesh is based on the West-Central dialect of Nadia District, located next to the border of Bangladesh. [60] There are cases where speakers of Standard Bengali in West Bengal will use a different word from a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. For example, the word salt is নুনnun in the west which corresponds to লবণlôbôn in the east. [61]

Spoken and literary varieties

Bengali exhibits diglossia, though some scholars have proposed triglossia or even n-glossia or heteroglossia between the written and spoken forms of the language. [31] Two styles of writing have emerged, involving somewhat different vocabularies and syntax: [60] [62]

  1. Shadhu-bhasha (সাধুভাষা "uptight language") was the written language, with longer verb inflections and more of a Pali and Sanskrit-derived Tatsama vocabulary. Songs such as India's national anthem Jana Gana Mana (by Rabindranath Tagore) were composed in Shadhubhasha. However, use of Shadhubhasha in modern writing is uncommon, restricted to some official signs and documents in Bangladesh as well as for achieving particular literary effects.
  2. Cholito-bhasha (চলিতভাষা "running language"), known by linguists as Standard Colloquial Bengali, is a written Bengali style exhibiting a preponderance of colloquial idiom and shortened verb forms, and is the standard for written Bengali now. This form came into vogue towards the turn of the 19th century, promoted by the writings of Peary Chand Mitra ( Alaler Gharer Dulal , 1857), [63] Pramatha Chaudhuri (Sabujpatra, 1914) and in the later writings of Rabindranath Tagore. It is modelled on the dialect spoken in the Shantipur region in Nadia district, West Bengal. This form of Bengali is often referred to as the "Nadia standard", "Nadia dialect", "Southwestern/West-Central dialect" or "Shantipuri Bangla". [58]

Linguist Prabhat Ranjan Sarkar, categorises the language as:

While most writing is in Standard Colloquial Bengali (SCB), spoken dialects exhibit a greater variety. People in southeastern West Bengal, including Kolkata, speak in SCB. Other dialects, with minor variations from Standard Colloquial, are used in other parts of West Bengal and western Bangladesh, such as the Midnapore dialect, characterised by some unique words and constructions. However, a majority in Bangladesh speak in dialects notably different from SCB. Some dialects, particularly those of the Chittagong region, bear only a superficial resemblance to SCB. [64] The dialect in the Chittagong region is least widely understood by the general body of Bengalis. [64] The majority of Bengalis are able to communicate in more than one variety – often, speakers are fluent in Cholitobhasha (SCB) and one or more regional dialects. [32]

Even in SCB, the vocabulary may differ according to the speaker's religion: Hindus are more likely to use words derived from Sanskrit whereas Muslims are more likely to use words of Persian and Arabic origin, along with more native words respectively. [65] For example: [61]

Predominantly Hindu usagePredominantly Muslim usageTranslation
নমস্কারnômôshkarআসসালামু আলাইকুমAssalamu-Alaikumhello
দিদি didiআপু apusister / elder sister
দাদাdadaভাইbha'ibrother / elder brother [66]
মাসী mashiখালা khalamaternal aunt
কাকা kakaচাচা chachapaternal uncle
প্রার্থনা prarthonaদো'আ do'a / du'apray
প্রদীপ prodipবাতি batilight [67]


The phonemic inventory of standard Bengali consists of 29 consonants and 7 vowels, as well as 7 nasalised vowels. The inventory is set out below in the International Phonetic Alphabet (upper grapheme in each box) and romanisation (lower grapheme).

Front Central Back
Close ই~ঈ

Near-open অ্যা

Nasalized vowels
Front Central Back
Close ইঁ~ঈঁ
Close-mid এঁ

Near-open এ্যাঁ / অ্যাঁ
Open আঁ
Labial Dental/
Retroflex Palatoalveolar Velar Glottal
Nasal m n   ŋ  
Plosive voiceless unaspirated p ʈ k
aspirated pʰ~f t̪ʰʈʰtʃʰ
voiced unaspirated b ɖ ɡ
aspirated d̪ʱɖʱdʒʱɡʱ
Fricative s ʃ ɦ
Approximant ( w ) l ( j )
Rhotic r ~ ɾ ɽ~ɽʱ

Bengali is known for its wide variety of diphthongs, combinations of vowels occurring within the same syllable. [68] Two of these, /oi̯/ and /ou̯/, are the only ones with representation in script, as and respectively. /e̯ i̯ o̯ u̯/ may all form the glide part of a diphthong. The total number of diphthongs is not established, with bounds at 17 and 31. An incomplete chart is given by Sarkar (1985) of the following: [69]



In standard Bengali, stress is predominantly initial. Bengali words are virtually all trochaic; the primary stress falls on the initial syllable of the word, while secondary stress often falls on all odd-numbered syllables thereafter, giving strings such as in সহযোগিতাshô-hô-jo-gi-ta "cooperation", where the boldface represents primary and secondary stress.

Consonant clusters

Native Bengali words do not allow initial consonant clusters; [70] the maximum syllabic structure is CVC (i.e. one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this pattern, even when using Sanskrit or English borrowings, such as গেরামgeram (CV.CVC) for গ্রামgram (CCVC) "village" or ইস্কুলiskul (VC.CVC) for স্কুলskul (CCVC) "school".

Writing system

An example of handwritten Bengali. Part of a poem written in Bengali (and with its English translation below each Bengali paragraph) by Nobel Laureate Rabindranath Tagore in 1926 in Hungary Tagore handwriting Bengali.jpg
An example of handwritten Bengali. Part of a poem written in Bengali (and with its English translation below each Bengali paragraph) by Nobel Laureate Rabindranath Tagore in 1926 in Hungary

The Bengali script is an abugida, a script with letters for consonants, diacritics for vowels, and in which an inherent vowel (অ ô) is assumed for consonants if no vowel is marked. [71] The Bengali alphabet is used throughout Bangladesh and eastern India (Assam, West Bengal, Tripura). The Bengali alphabet is believed to have evolved from a modified Brahmic script around 1000 CE (or 10th–11th century). [72] Note that despite Bangladesh being majority Muslim, it uses the Bengali alphabet rather than an Arabic-based one like the Shahmukhi script used in Pakistan. However, throughout history there have been instances of the Bengali language being written in Perso-Arabic. The use of the Sylheti Nagari script also emerged in the Sylhet region of the Bengal. [11]

The Bengali script is a cursive script with eleven graphemes or signs denoting nine vowels and two diphthongs, and thirty-nine graphemes representing consonants and other modifiers. [72] There are no distinct upper and lower case letter forms. The letters run from left to right and spaces are used to separate orthographic words. Bengali script has a distinctive horizontal line running along the tops of the graphemes that links them together called মাত্রাmatra. [73]

Since the Bengali script is an abugida, its consonant graphemes usually do not represent phonetic segments, but carry an "inherent" vowel and thus are syllabic in nature. The inherent vowel is usually a back vowel, either [ɔ] as in মত[mɔt̪] "opinion" or [o], as in মন[mon] "mind", with variants like the more open [ɒ]. To emphatically represent a consonant sound without any inherent vowel attached to it, a special diacritic, called the hôsôntô (্), may be added below the basic consonant grapheme (as in ম্[m]). This diacritic, however, is not common, and is chiefly employed as a guide to pronunciation. The abugida nature of Bengali consonant graphemes is not consistent, however. Often, syllable-final consonant graphemes, though not marked by a hôsôntô, may carry no inherent vowel sound (as in the final in মন[mon] or the medial in গামলা[ɡamla]).

A consonant sound followed by some vowel sound other than the inherent [ɔ] is orthographically realised by using a variety of vowel allographs above, below, before, after, or around the consonant sign, thus forming the ubiquitous consonant-vowel typographic ligatures. These allographs, called কারkar, are diacritical vowel forms and cannot stand on their own. For example, the graph মি[mi] represents the consonant [m] followed by the vowel [i], where [i] is represented as the diacritical allographি (called ই-কারi-kar) and is placed before the default consonant sign. Similarly, the graphs মা[ma], মী[mi], মু[mu], মূ[mu], মৃ[mri], মে[me~mɛ], মৈ[moj], মো[mo] and মৌ[mow] represent the same consonant combined with seven other vowels and two diphthongs. In these consonant-vowel ligatures, the so-called "inherent" vowel [ɔ] is first expunged from the consonant before adding the vowel, but this intermediate expulsion of the inherent vowel is not indicated in any visual manner on the basic consonant sign [mɔ].

The vowel graphemes in Bengali can take two forms: the independent form found in the basic inventory of the script and the dependent, abridged, allograph form (as discussed above). To represent a vowel in isolation from any preceding or following consonant, the independent form of the vowel is used. For example, in মই[moj] "ladder" and in ইলিশ[iliʃ] "Hilsa fish", the independent form of the vowel is used (cf. the dependent formি). A vowel at the beginning of a word is always realised using its independent form.

In addition to the inherent-vowel-suppressing hôsôntô, three more diacritics are commonly used in Bengali. These are the superposed chôndrôbindu(ঁ), denoting a suprasegmental for nasalisation of vowels (as in চাঁদ[tʃãd] "moon"), the postposed ônusbar(ং) indicating the velar nasal [ŋ] (as in বাংলা[baŋla] "Bengali") and the postposed bisôrgô(ঃ) indicating the voiceless glottal fricative [h] (as in উঃ![uh] "ouch!") or the gemination of the following consonant (as in দুঃখ[dukʰːɔ] "sorrow").

The Bengali consonant clusters (যুক্তব্যঞ্জনjuktôbênjôn) are usually realised as ligatures, where the consonant which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond recognition. In the Bengali writing system, there are nearly 285 such ligatures denoting consonant clusters. Although there exist a few visual formulas to construct some of these ligatures, many of them have to be learned by rote. Recently, in a bid to lessen this burden on young learners, efforts have been made by educational institutions in the two main Bengali-speaking regions (West Bengal and Bangladesh) to address the opaque nature of many consonant clusters, and as a result, modern Bengali textbooks are beginning to contain more and more "transparent" graphical forms of consonant clusters, in which the constituent consonants of a cluster are readily apparent from the graphical form. However, since this change is not as widespread and is not being followed as uniformly in the rest of the Bengali printed literature, today's Bengali-learning children will possibly have to learn to recognise both the new "transparent" and the old "opaque" forms, which ultimately amounts to an increase in learning burden.

Bengali punctuation marks, apart from the downstroke daṛi – the Bengali equivalent of a full stop – have been adopted from western scripts and their usage is similar. [57]

Unlike in western scripts (Latin, Cyrillic, etc.) where the letter-forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called মাত্রাmatra. The presence and absence of this matra can be important. For example, the letter and the numeral "3" are distinguishable only by the presence or absence of the matra, as is the case between the consonant cluster ত্রtrô and the independent vowel e. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline).

There is yet to be a uniform standard collating sequence (sorting order of graphemes to be used in dictionaries, indices, computer sorting programs, etc.) of Bengali graphemes. Experts in both Bangladesh and India are currently working towards a common solution for this problem.

Orthographic depth

The Bengali script in general has a comparatively shallow orthography, i.e., in most cases there is a one-to-one correspondence between the sounds (phonemes) and the letters (graphemes) of Bengali. But grapheme-phoneme inconsistencies do occur in certain cases.

One kind of inconsistency is due to the presence of several letters in the script for the same sound. In spite of some modifications in the 19th century, the Bengali spelling system continues to be based on the one used for Sanskrit, [57] and thus does not take into account some sound mergers that have occurred in the spoken language. For example, there are three letters (, , and ) for the voiceless postalveolar fricative [ʃ], although the letter retains the voiceless alveolar sibilant [s] sound when used in certain consonant conjuncts as in স্খলন[skʰɔlon] "fall", স্পন্দন[spɔndon] "beat", etc. The letter also retains the voiceless retroflex sibilant [ʂ] sound when used in certain consonant conjuncts as in কষ্ট[kɔʂʈɔ] "suffering", গোষ্ঠী[ɡoʂʈʰi] "clan", etc. Similarly, there are two letters ( and ) for the voiced postalveolar affricate [dʒ]. Moreover, what was once pronounced and written as a retroflex nasal [ɳ] is now pronounced as an alveolar [n] when in conversation (the difference is heard when reading) (unless conjoined with another retroflex consonant such as , , and ), although the spelling does not reflect this change. The open-mid front unrounded vowel [ɛ] is orthographically realised by multiple means, as seen in the following examples: এত[ɛto] "so much", এ্যাকাডেমী[ɛkademi] "academy", অ্যামিবা[ɛmiba] "amoeba", দেখা[dɛkʰa] "to see", ব্যস্ত[bɛsto] "busy", ব্যাকরণ[bɛkorɔn] "grammar".

Another kind of inconsistency is concerned with the incomplete coverage of phonological information in the script. The inherent vowel attached to every consonant can be either [ɔ] or [o] depending on vowel harmony (স্বরসঙ্গতি) with the preceding or following vowel or on the context, but this phonological information is not captured by the script, creating ambiguity for the reader. Furthermore, the inherent vowel is often not pronounced at the end of a syllable, as in কম[kɔm] "less", but this omission is not generally reflected in the script, making it difficult for the new reader.

Many consonant clusters have different sounds than their constituent consonants. For example, the combination of the consonants ক্[k] and [ʂ] is graphically realised as ক্ষ and is pronounced [kkʰɔ] (as in রুক্ষ[rukkʰo] "coarse") or [kkʰo] (as in ক্ষতি[kkʰot̪i] "harm") or even [kkʰɔ] (as in ক্ষমতা[kkʰɔmot̪a] "capability"), depending on the position of the cluster in a word. The Bengali writing system is, therefore, not always a true guide to pronunciation.


The script used for Bengali, Assamese and other languages is known as Bengali script. The script is known as the Bengali alphabet for Bengali and its dialects and the Assamese alphabet for Assamese language with some minor variations. Other related languages in the nearby region also make use of the Bengali alphabet like the Meitei language in the Indian state of Manipur, where the Meitei language has been written in the Bengali alphabet for centuries, though the Meitei script has been promoted in recent times.


There are various Romanisation systems used for Bengali created in recent years which have failed to represent the true Bengali phonetic sound. The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation where the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration or IAST system (based on diacritics), [74] "Indian languages Transliteration" or ITRANS (uses upper case letters suited for ASCII keyboards), [75] and the National Library at Kolkata romanisation. [76]

In the context of Bengali romanisation, it is important to distinguish transliteration from transcription. Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced).

Although it might be desirable to use a transliteration scheme where the original Bengali orthography is recoverable from the Latin text, Bengali words are currently Romanized on Wikipedia using a phonemic transcription, where the true phonetic pronunciation of Bengali is represented with no reference to how it is written.

The most recent attempt has been by publishers Mitra and Ghosh with the launch of three popular children's books, Abol Tabol, Hasi Khusi and Sahoj Path in Roman script at the Kolkata Book Fair 2018. Published under the imprint of Benglish Books, these are based on phonetic transliteration and closely follow spellings used in social media but for using an underline to describe soft consonants.


Bengali nouns are not assigned gender, which leads to minimal changing of adjectives (inflection). However, nouns and pronouns are moderately declined (altered depending on their function in a sentence) into four cases while verbs are heavily conjugated, and the verbs do not change form depending on the gender of the nouns.

Word order

As a head-final language, Bengali follows subject–object–verb word order, although variations to this theme are common. [77] Bengali makes use of postpositions, as opposed to the prepositions used in English and other European languages. Determiners follow the noun, while numerals, adjectives, and possessors precede the noun. [78]

Yes-no questions do not require any change to the basic word order; instead, the low (L) tone of the final syllable in the utterance is replaced with a falling (HL) tone. Additionally, optional particles (e.g. কি-ki, না-na, etc.) are often encliticised onto the first or last word of a yes-no question.

Wh-questions are formed by fronting the wh-word to focus position, which is typically the first or second word in the utterance.


Nouns and pronouns are inflected for case, including nominative, objective, genitive (possessive), and locative. [23] The case marking pattern for each noun being inflected depends on the noun's degree of animacy. When a definite article such as -টা-ṭa (singular) or -গুলো-gulo (plural) is added, as in the tables below, nouns are also inflected for number.

In most of the Bengali grammar books, cases are divided in to 6 categories and an additional possessive case (possessive form is not recognised as a type of case by Bengali grammarians). But in term of usages, cases are generally grouped in to only 4 categories.

Singular noun inflection
the student
the shoe
the student
the shoe
the student's
the shoe's
on/in the shoe
Plural noun inflection
the students
the shoes
the students
the shoes
the students'
the shoes'
on/in the shoes

When counted, nouns take one of a small set of measure words. Nouns in Bengali (Japanese is similar in this respect) cannot be counted by adding the numeral directly adjacent to the noun. An appropriate measure word (MW) must be used between the numeral and the noun. Most nouns take the generic measure word -টা-ṭa, though other measure words indicate semantic classes (e.g. -জন-jôn for humans). There is also the classifier -khana, and its diminutive form -khani, which attach only to nouns denoting something flat, long, square, or thin. These are the least common of the classifiers. [79]

Measure words
BengaliBengali transliterationLiteral translationEnglish translation
নয়টা গরুNôy-ṭa goruNine-MW cowNine cows
কয়টা বালিশKôy-ṭa balishHow many-MW pillowHow many pillows
অনেকজন লোকÔnek-jôn lokMany-MW personMany people
চার-পাঁচজন শিক্ষকCar-pãc-jôn shikkhôkFour-five-MW teacherFour to five teachers

Measuring nouns in Bengali without their corresponding measure words (e.g. আট বিড়ালaṭ biṛal instead of আটটা বিড়ালaṭ-ṭa biṛal "eight cats") would typically be considered ungrammatical. However, when the semantic class of the noun is understood from the measure word, the noun is often omitted and only the measure word is used, e.g. শুধু একজন থাকবে।Shudhu êk-jôn thakbe. (lit. "Only one-MW will remain.") would be understood to mean "Only one person will remain.", given the semantic class implicit in -জন-jôn.

In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to mass nouns.


There are two classes of verbs: finite and non-finite. Non-finite verbs have no inflection for tense or person, while finite verbs are fully inflected for person (first, second, third), tense (present, past, future), aspect (simple, perfect, progressive), and honour (intimate, familiar, and formal), but not for number. Conditional, imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of inflections on many verb roots can total more than 200.

Inflectional suffixes in the morphology of Bengali vary from region to region, along with minor differences in syntax.

Bengali differs from most Indo-Aryan Languages in the zero copula, where the copula or connective be is often missing in the present tense. [57] Thus, "he is a teacher" is সে শিক্ষকse shikkhôk, (literally "he teacher"). [80] In this respect, Bengali is similar to Russian and Hungarian. Romani grammar is also the closest to Bengali grammar. [81]


Sources of modern literary Bengali words
67% native
25% Sanskrit reborrowings
8% indigenous and foreign loans Bengali Vocabulary2.gif
Sources of modern literary Bengali words
  67% native
  25% Sanskrit reborrowings
  8% indigenous and foreign loans

Bengali has as many as 100,000 separate words, of which 50,000 are considered Tadbhavas, 21,100 are Tatsamas and the remainder loanwords from Austroasiatic and other foreign languages.

However, these figures do not take into account the large proportion of archaic or highly technical words that are very rarely used. Furthermore, different dialects use more Persian and Arabic vocabulary especially in different areas of Bangladesh and Muslim majority areas of West Bengal. Hindus, on the other hand, use more Sanskrit vocabulary than Muslims. While standard Bengali is based on the Nadia dialect of spoken in the Hindu majority states of West Bengal, about 90% of Bengalis in Bangladesh (ca. 148 million) and 27% of Bengalis in West Bengal and 10% in Assam (ca. 36 million) are Muslim and speak a more "persio-arabised" version of Bengali instead of the more Sanskrit influenced Standard Nadia dialect. The productive vocabulary used in modern literary works, in fact, is made up mostly (67%) of tadbhavas, while tatsamas comprise only 25% of the total. [82] [83] Loanwords from non-Indic languages comprise the remaining 8% of the vocabulary used in modern Bengali literature.

According to Suniti Kumar Chatterji, dictionaries from the early 20th century attributed about 50% of the Bengali vocabulary to native words (i.e., naturally modified Prakrit words, corrupted forms of Aryan words, and non Indo-European languages. About 45% percent of Bengali words are unmodified Sanskrit, and the remaining words are from foreign languages. [84] Dominant in the last group was Persian, which was also the source of some grammatical forms. More recent studies suggest that the use of native and foreign words has been increasing, mainly because of the preference of Bengali speakers for the colloquial style. [84] Because of centuries of contact with Europeans, Turkic peoples, and Persians, Bengali has absorbed numerous words from foreign languages, often totally integrating these borrowings into the core vocabulary.

The most common borrowings from foreign languages come from three different kinds of contact. After close contact with several indigenous Austroasiatic languages, [85] [86] [87] [88] and later the Mughal invasion whose court language was Persian, numerous Chagatai, Arabic, and Persian words were absorbed into the lexicon. [33]

Later, East Asian travellers and lately European colonialism brought words from Portuguese, French, Dutch, and most significantly English during the colonial period.

Sample text

The following is a sample text in Bengali of Article 1 of the Universal Declaration of Human Rights:

Bengali in the Bengali alphabet

ধারা ১: সমস্ত মানুষ স্বাধীনভাবে সমান মর্যাদা এবং অধিকার নিয়ে জন্মগ্রহণ করে। তাঁদের বিবেক এবং বুদ্ধি আছে; সুতরাং সকলেরই একে অপরের প্রতি ভ্রাতৃত্বসুলভ মনোভাব নিয়ে আচরণ করা উচিত।

Bengali in phonetic Romanization

Dhara êk: Sômôstô manush shadhinbhabe sôman môrjada ebông ôdhikar niye jônmôgrôhôn kôre. Tãder bibek ebông buddhi achhe; sutôrang sôkôleri êke ôpôrer prôti bhratrittôsulôbh mônobhab niye achôrôn kôra uchit.

Bengali in the International Phonetic Alphabet

d̪ʱara ɛk | ʃɔmost̪o manuʃ ʃad̪ʱinbʱabe ʃoman mɔɾdʒad̪a eboŋ od̪ʱikaɾ nie̯e dʒɔnmoɡrohon kɔɾe | t̪ãdeɾ bibek eboŋ bud̪ʱːi atʃʰe | ʃut̪oraŋ ʃɔkoleɾi ɛke ɔporeɾ prot̪i bʱrat̪rit̪ːoʃulɔbʱ monobʱab nie̯e atʃorɔn kɔra utʃit̪


Clause 1: All human free-manner-in equal dignity and right taken birth-take do. Their reason and intelligence exist; therefore everyone-indeed one another's towards brotherhood-ly attitude taken conduct do should.


Article 1: All human beings are born free and equal in dignity and rights. They possess conscience and reason. Therefore, everyone should act in a spirit of brotherhood towards each other.

See also


  1. 1 2 3 Bengali language at Ethnologue (22nd ed., 2019)
  2. "Scheduled Languages in descending order of speaker's strength - 2011" (PDF). Registrar General and Census Commissioner of India. Archived (PDF) from the original on 14 November 2018. Retrieved 28 June 2018.
  3. "Bangla Sign Language Dictionary". Retrieved 12 September 2018.
  4. "Jharkhand gives second language status to Magahi, Angika, Bhojpuri and Maithili". The Avenue Mail. 21 March 2018. Archived from the original on 28 March 2019. Retrieved 30 April 2019.
  5. Hammarström, Harald; Forkel, Robert; Haspelmath, Martin, eds. (2017). "Bengali". Glottolog 3.0 . Jena, Germany: Max Planck Institute for the Science of Human History.
  6. Laurie Bauer, 2007, The Linguistics Student's Handbook, Edinburgh
  7. Hays, Jeffrey. "BENGALIS – Facts and Details". Archived from the original on 30 July 2017. Retrieved 4 July 2018.
  8. "The World Factbook". Central Intelligence Agency. Archived from the original on 13 February 2008. Retrieved 21 February 2018.
  9. "Summary by language size". Ethnologue. 3 October 2018. Archived from the original on 11 September 2013. Retrieved 21 February 2019.
  10. 1 2 "Bangla Bhasha Procholon Ain, 1987" বাংলা ভাষা প্রচলন আইন, ১৯৮৭ [ Bengali Language Implementation Act, 1987 ](PDF). Bangladesh Codeবাংলাদেশ কোড (in Bengali). 27 (Online ed.). Dhaka: Ministry of Law, Justice and Parliamentary Affairs, Bangladesh. Retrieved 23 May 2019.
  11. 1 2 "Bangla Language – Banglapedia". Archived from the original on 6 July 2015. Retrieved 12 September 2018.
  12. 1 2 "Article 3. The state language". The Constitution of the People's Republic of Bangladesh. Ministry of Law, The People's Republic of Bangladesh. Retrieved 23 May 2019.
  13. "National Languages Of Bangladesh". 11 June 2017. Archived from the original on 2 August 2017. Retrieved 12 September 2018.
  14. "5 Surprising Reasons the Bengali Language Is Important". 17 August 2017. Archived from the original on 26 June 2018. Retrieved 10 March 2018.
  15. "50th Report of the Commissioner for Linguistic Minorities in India (July 2012 to June 2013)" (PDF). National Commission for Linguistic Minorities. 16 July 2014. Archived from the original (PDF) on 2 January 2018. Retrieved 20 February 2018.
  16. "50th REPORT OF THE COMMISSIONER FOR LINGUISTIC MINORITIES IN INDIA" (PDF). Ministry of Minority Affairs. Archived from the original (PDF) on 8 July 2016. Retrieved 23 May 2019.
  17. "Bengali Language". Archived from the original on 26 June 2018. Retrieved 12 September 2018.
  18. "Amendment to the Draft Programme and Budget for 2000–2001 (30 C/5)" (PDF). General Conference, 30th Session, Draft Resolution. UNESCO. 1999. Archived (PDF) from the original on 21 May 2011. Retrieved 27 May 2008.
  19. "Resolution adopted by the 30th Session of UNESCO's General Conference (1999)". International Mother Language Day. UNESCO. Archived from the original on 1 June 2008. Retrieved 27 May 2008.
  20. "Bangla Script – Banglapedia". Archived from the original on 11 October 2017. Retrieved 7 November 2017.
  21. Shah 1998 , p. 11
  22. Keith 1998 , p. 187
  23. 1 2 ( Bhattacharya & 50000000 )
  24. Oberlies, Thomas (2007). "Chapter Five: Aśokan Prakrit and Pāli" Archived 7 May 2016 at the Wayback Machine . In Cardona, George; Jain, Danesh. The Indo-Aryan Languages. Routledge. p. 163. ISBN   978-1-135-79711-9.
  25. ( Sen 1996 )
  26. "Banglapedia". Retrieved 7 November 2017.
  27. "Pala dynasty – Indian dynasty". Archived from the original on 5 March 2017. Retrieved 7 November 2017.
  28. nimmi. "Pala Dynasty, Pala Empire, Pala empire in India, Pala School of Sculptures". Archived from the original on 28 October 2017. Retrieved 7 November 2017.
  29. Rabbani, AKM Golam (7 November 2017). "Politics and Literary Activities in the Bengali Language during the Independent Sultanate of Bengal". Dhaka University Journal of Linguistics. 1 (1): 151–166. Archived from the original on 11 October 2017. Retrieved 7 November 2017 via
  30. Eaton, Richard M. (1993). The Rise of Islam and the Bengal Frontier, 1204–1760. University of California. ISBN   978-0-520-20507-9. Archived from the original on 11 October 2017. Retrieved 13 July 2017.
  31. 1 2 "Bengali Language at Cornell". Department of Asian Studies. Cornell University. Archived from the original on 15 November 2012.
  32. 1 2 Ray, S Kumar. "The Bengali Language and Translation". Translation Articles. Kwintessential. Archived from the original on 25 September 2006. Retrieved 19 November 2006.
  33. 1 2 3 Thompson, Hanne-Ruth (2012). Bengali (Paperback with corrections. ed.). Amsterdam: John Benjamins Pub. Co. p. 3. ISBN   978-90-272-3819-1.
  34. "Bengali 'should be UN language'". 22 December 2009. Archived from the original on 8 November 2017. Retrieved 7 November 2017.
  35. "Kuwait restricts recruitment of male Bangladeshi workers | Dhaka Tribune". 7 September 2016. Archived from the original on 30 August 2017. Retrieved 4 December 2017.
  36. "Bahrain: Foreign population by country of citizenship, sex and migration status (worker/ family dependent) (selected countries, January 2015) – GLMM". GLMM. 20 October 2015. Archived from the original on 16 December 2017. Retrieved 4 December 2017.
  37. "Saudi Arabia". Ethnologue. Archived from the original on 23 November 2017. Retrieved 4 December 2017.
  38. "Archived copy" (PDF). Archived (PDF) from the original on 27 January 2018. Retrieved 10 February 2018.CS1 maint: archived copy as title (link)
  39. Archived 5 May 2013 at the Wayback Machine Bangla Language and Literary Society, Singapore
  40. "Languages of India". Ethnologue Report. Archived from the original on 26 December 2009. Retrieved 4 November 2006.
  41. "Language". Government of Assam. Archived from the original on 26 August 2006. Retrieved 20 June 2006.
  42. Bhattacharjee, Kishalay (30 April 2008). "It's Indian language vs Indian language". . Archived from the original on 23 July 2013. Retrieved 27 May 2008.
  43. Syed Yasir Kazmi (16 October 2009). "Pakistani Bengalis". DEMOTIX. Archived from the original on 2 January 2015. Retrieved 2 April 2013.
  44. "کراچی کے 'بنگالی پاکستانی'(Urdu)". محمد عثمان جامعی. 17 November 2003. Archived from the original on 19 November 2003. Retrieved 2 April 2013.
  45. Rafiqul Islam. "The Language Movement : An Outline". Archived from the original on 19 November 2012. Retrieved 12 May 2016.
  46. "Karachi Department of Bengali". Archived from the original on 10 May 2013. Retrieved 2 April 2013.
  47. "Statement by Hon'ble Foreign Minister on Second Bangladesh-India Track II dialogue at BRAC Centre on 07 August, 2005". Ministry of Foreign Affairs, Government of Bangladesh. Archived from the original on 18 April 2008. Retrieved 27 May 2008.
  48. "Sri Lanka". The World Factbook . Archived from the original on 22 July 2015. Retrieved 20 September 2017.
  49. "Man of the series: Nobel laureate Tagore". The Times of India . Times News Network. 3 April 2011. Archived from the original on 12 June 2018. Retrieved 20 June 2019.
  50. "Sri Lanka I-Day to have anthem in Tamil". The Hindu . 4 February 2016. Archived from the original on 4 February 2016. Retrieved 20 September 2017.
  51. "Tagore's influence on Lankan culture". Hindustan Times . 12 May 2010. Archived from the original on 11 October 2017. Retrieved 20 September 2017.
  52. Wickramasinghe, Nira (2003). Dressing the Colonised Body: Politics, Clothing, and Identity in Sri Lanka. Orient Longman. p. 26. ISBN   978-81-250-2479-8. Archived from the original on 29 September 2018. Retrieved 29 September 2018.
  53. Wickramasinghe, Kamanthi; Perera, Yoshitha. "Sri Lankan National Anthem: can it be used to narrow the gap?". The Daily Mirror (Sri Lanka) (30 March 2015). Archived from the original on 11 October 2017. Retrieved 20 September 2017.
  54. Haque, Junaidul (7 May 2011). "Rabindranath: He belonged to the world". The Daily Star (Bangladesh) . Archived from the original on 4 March 2016. Retrieved 20 September 2017.
  55. Habib, Haroon (17 May 2011). "Celebrating Rabindranath Tagore's legacy". The Hindu . Archived from the original on 13 June 2014. Retrieved 20 September 2017.
  56. Subir Bhaumik (22 December 2009). "Bengali 'should be UN language'". BBC News. Archived from the original on 3 April 2012. Retrieved 25 January 2010.
  57. 1 2 3 4 Bangla language Archived 6 July 2015 at the Wayback Machine in Asiatic Society of Bangladesh 2003
  58. 1 2 Morshed, Abul Kalam Manjoor. "Dialect". Banglapedia. Asiatic Society of Bangladesh. Archived from the original on 6 July 2015. Retrieved 17 November 2006.
  59. "Hajong". The Ethnologue Report. Archived from the original on 15 November 2006. Retrieved 19 November 2006.
  60. 1 2 Huq, Mohammad Daniul. "Chalita Bhasa". Banglapedia . Asiatic Society of Bangladesh. Archived from the original on 6 July 2015. Retrieved 17 November 2006.
  61. 1 2 "History of Bengali (Banglar itihash)". Bengal Telecommunication and Electric Company. Archived from the original on 7 July 2011. Retrieved 20 November 2006.
  62. Huq, Mohammad Daniul. "Sadhu Bhasa". Banglapedia. Asiatic Society of Bangladesh. Archived from the original on 6 July 2015. Retrieved 17 November 2006.
  63. Huq, Mohammad Daniul. "Alaler Gharer Dulal". Banglapedia. Asiatic Society of Bangladesh. Archived from the original on 6 July 2015. Retrieved 17 November 2006.
  64. 1 2 Ray, Hai & Ray 1966 , p. 89
  65. Ray, Hai & Ray 1966 , p. 80
  66. "A Bilingual Dictionary of Words and Phrases (English-Bengali)". Archived from the original on 25 October 2016. Retrieved 25 October 2016.
  67. "light – Meaning in Bengali – light in Bengali – Shabdkosh | অভিধান : English Bengali Dictionary and Translation". Archived from the original on 5 March 2017. Retrieved 25 October 2016.
  68. ( Masica 1991 , pp. 116)
  69. Sarkar, Pabitra (1985). Bangla diswar dhoni. Bhasa.
  70. ( Masica 1991 , pp. 125)
  71. Escudero Pascual Alberto (23 October 2005). "Writing Systems/ Scripts" (PDF). Primer to Localization of Software. Archived from the original (PDF) on 19 March 2009. Retrieved 20 November 2006.
  72. 1 2 Bangalah Archived 5 July 2015 at the Wayback Machine in Asiatic Society of Bangladesh 2003
  73. "". Archived from the original on 24 December 2010.
  74. "Learning International Alphabet of Sanskrit Transliteration". Sanskrit 3 – Learning transliteration. Gabriel Pradiipaka & Andrés Muni. Archived from the original on 12 February 2007. Retrieved 20 November 2006.
  75. "ITRANS – Indian Language Transliteration Package". Avinash Chopde. Archived from the original on 23 January 2013. Retrieved 20 November 2006.
  76. "Annex-F: Roman Script Transliteration" (PDF). Indian Standard: Indian Script Code for Information Interchange – ISCII. Bureau of Indian Standards. 1 April 1999. p. 32. Archived (PDF) from the original on 16 July 2012. Retrieved 20 November 2006.
  77. ( Bhattacharya 2000 , pp. 16)
  78. "Bengali". UCLA Language Materials project. University of California, Los Angeles. Archived from the original on 15 July 2007. Retrieved 20 November 2006.
  79. Boyle David, Anne (2015). Descriptive grammar of Bangla. De Gruyter. pp. 141–142.
  80. Among Bengali speakers brought up in neighbouring linguistic regions (e.g. Hindi), the lost copula may surface in utterances such as she shikkhôk hocche. This is viewed as ungrammatical by other speakers, and speakers of this variety are sometimes (humorously) referred as "hocche-Bangali".
  81. Hübschmannová, Milena (1995). "Romaňi čhib – romština: Několik základních informací o romském jazyku". Bulletin Muzea Romské Kultury. Brno (4/1995). Zatímco romská lexika je bližší hindštině, marvárštině, pandžábštině atd., v gramatické sféře nacházíme mnoho shod s východoindickým jazykem, s bengálštinou.
  82. Tatsama Archived 6 July 2015 at the Wayback Machine in Asiatic Society of Bangladesh 2003
  83. Tadbhaba Archived 6 July 2015 at the Wayback Machine in Asiatic Society of Bangladesh 2003
  84. 1 2 "Bengali language". Archived from the original on 11 October 2016. Retrieved 2 September 2016.
  85. Byomkes Chakrabarti A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, ISBN   81-7074-128-9
  86. Das, Khudiram (1998). Santhali Bangla Samashabda Abhidhan. Kolkata, India: Paschim Banga Bangla Akademi.
  87. "Archived copy" (PDF). Archived (PDF) from the original on 1 March 2017. Retrieved 16 March 2017.CS1 maint: archived copy as title (link)
  88. Das, Khudiram. Bangla Santali Bhasa Samporko (eBook).

Related Research Articles

Arabic Afro-Asiatic language

Arabic is a Semitic language that first emerged in the 1st to 4th centuries CE. It is now the lingua franca of the Arab world. It is named after the Arabs, a term initially used to describe peoples living in the area bounded by Mesopotamia in the east and the Anti-Lebanon mountains in the west, in Northwestern Arabia and in the Sinai Peninsula. The ISO assigns language codes to thirty varieties of Arabic, including its standard form, Modern Standard Arabic, which is modernized Classical Arabic. This distinction exists primarily among Western linguists; Arabic speakers themselves generally do not distinguish between Modern Standard Arabic and Classical Arabic, but rather refer to both as al-ʿarabīyah al-fuṣḥā.

Abugida Writing system

An abugida, or alphasyllabary, is a segmental writing system in which consonant–vowel sequences are written as a unit: each unit is based on a consonant letter, and vowel notation is secondary. This contrasts with a full alphabet, in which vowels have status equal to consonants, and with an abjad, in which vowel marking is absent, partial, or optional. The terms also contrast them with a syllabary, in which the symbols cannot be split into separate consonants and vowels. Abugidas include the extensive Brahmic family of scripts of Tibet, South and Southeast Asia, Semitic Ethiopic scripts, and Canadian Aboriginal syllabics.

Malayalam Language spoken in India

Malayalam is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry (Mahé) by the Malayali people. It is one of 22 scheduled languages of India spoken by nearly 2.88% of Indians. Malayalam has official language status in the state of Kerala and in the union territories of Lakshadweep and Puducherry (Mahé) and is spoken by 37 million people worldwide. Malayalam is also spoken by linguistic minorities in the neighbouring states; with significant number of speakers in the Nilgiris, Kanyakumari, and Coimbatore districts of Tamil Nadu, and Kodagu and Dakshina Kannada districts of Karnataka. Due to Malayali expatriates in the Persian Gulf, the language is also widely spoken in Gulf countries.

Telugu language Dravidian language

Telugu is a Dravidian language spoken in the Indian states of Andhra Pradesh, Telangana and the union territories of Puducherry (Yanam) by the Telugu people. It stands alongside Hindi, English and Bengali as one of the few languages with primary official language status in more than one Indian state. There are also significant linguistic minorities in states of Orissa, Karnataka, Tamil Nadu, Kerala and Maharashtra. It is one of six languages designated a classical language of India by the country's government.

The Mon language is an Austroasiatic language spoken by the Mon people, who live in Myanmar and Thailand. Mon, like the related Khmer language, but unlike most languages in mainland Southeast Asia, is not tonal. In recent years, usage of Mon has declined rapidly, especially among the younger generation. The Mon language is a recognised indigenous language in Myanmar as well as a recognised indigenous language of Thailand.

Chechen is a Northeast Caucasian language spoken by more than 1.4 million people, mostly in the Chechen Republic and by members of the Chechen diaspora throughout Europe, Russia, Jordan, Central Asia, and Georgia.

Assamese language Indo-Aryan language spoken in India

Assamese, also known as Asamiya, is an Eastern Indo-Aryan language spoken mainly in the Indian state of Assam, where it is an official language. It is the easternmost Indo-European language, spoken by over 15 million speakers, and serves as a lingua franca in the region.

Sinhala language Indo-Aryan language spoken in Sri Lanka

Sinhala, also known as Sinhalese, is an Indo-Aryan language primarily spoken by the Sinhalese people of Sri Lanka, who make up the largest ethnic group on the island, numbering about 16 million. Sinhala is also spoken as the first language by other ethnic groups in Sri Lanka, totalling about four million. Sinhala is written using the Sinhala script, which is one of the Brahmic scripts, a descendant of the ancient Indian Brahmi script closely related to the Kadamba script.

Javanese language Austronesian language

Javanese is the language of the Javanese people from the central and eastern parts of the island of Java, in Indonesia. There are also pockets of Javanese speakers on the northern coast of western Java. It is the native language of more than 98 million people.

In linguistics, a phonemic orthography is an orthography in which the graphemes correspond to the phonemes of the language. Natural languages rarely have perfectly phonemic orthographies; a high degree of grapheme-phoneme correspondence can be expected in orthographies based on alphabetic writing systems, but they differ in how complete this correspondence is. English orthography, for example, is alphabetic but highly nonphonemic; it was once mostly phonemic during the Middle English stage, when the modern spellings originated, but spoken English changed rapidly while the orthography was much more stable, resulting in the modern nonphonemic situation. However, because of their relatively recent modernizations when compared to English, the Italian, Turkish, Spanish, Finnish, Czech, Latvian and Polish orthographic systems come much closer to being consistent phonemic representations.

Meitei or Meetei language (also Manipuri, Meitheilon, Meeteilon, Meeʁteilon, Katheis a Sino-Tibetan language and the predominant language and lingua franca in the southeastern Himalayan state of Manipur, in northeastern India. It is the one of the official languages of Government of India.

Khasi is an Austroasiatic language spoken primarily in Meghalaya state in India by the Khasi people. It is also spoken by a sizeable population in Assam and Bangladesh. Khasi is part of the Austroasiatic language family, and is related to Khmer, Vietnamese and Mon languages of Southeast Asia, and the Munda and Nicobarese branches of that family, which are spoken in east–central India and in the Nicobar Islands, respectively.

The Manipuri Bishnupriya or Bishnupriya Manipuri (BPM) is an Indo-Aryan language spoken in parts of the Indian states of Assam, Tripura and others, as well as in the Sylhet region of Bangladesh, Burma, and other countries. It uses the Bengali script alphabet as its writing system.

Bengali dialects

The dialects of the Bengali language are part of the Eastern Indo-Aryan language group of the Indo-European language family widely spoken in the Bengal region of South Asia. Although the spoken dialects of Bengali are mutually intelligible with neighbouring dialects.

The phonology of Bengali, like that of its neighbouring Eastern Indo-Aryan languages, is characterised by a wide variety of diphthongs and inherent back vowels. corresponds to and developed out of the Sanskrit schwa, which is retained as such by almost all other branches of the Indo-Aryan language family.

Romanisation of Bengali is the representation of written Bengali language in the Latin script. Various romanisation systems for Bengali are used, most of which do not perfectly represent Bengali pronunciation. While different standards for romanisation have been proposed for Bengali, none has been adopted with the same degree of uniformity as Japanese or Sanskrit.

Mal Paharia is a language spoken by 51,000 of 110,000 ethnic Mal Paharia in the states of Jharkhand and West Bengal in India, and regions of Bangladesh. The language is also known as Mal Pahoria, Malto, Malti, Paharia, Parsi, and Mal Pahariya. It has been variously regarded as a Bengali–Assamese language, a dialect of Malto, and a mixed Dravidian–Indo-Aryan language. There is a generally positive attitude among speakers of the language towards it, but it is considered vulnerable as some speakers have shifted to Bengali. Mal Paharia uses the Devanagari script and rules for its writing/ reading/ speech.

Bengali alphabet abugida used in writing Bengali

The Bengali alphabet or Bengali script, is the writing system, originating in the Indian subcontinent, for the Bengali language and is the fifth most widely used writing system in the world. The script is used for other languages like Bishnupriya Manipuri and has historically been used to write Sanskrit within Bengal.

Bengali–Assamese script A type of South Asian writing system

The Bengali–Assamese script is the fifth most widely used writing system in the world. It is the basis of the Bengali, Assamese and Tirhuta as well as the alphabets for Bishnupriya Manipuri, Kokborok (Tripuri) and Meithei (Manipuri), Chakma, Santhali etc. Other languages, such as Angika, Bodo, Karbi, Maithili and Mising were once written in this script. Modern Sylheti is often written using this script as well. It was originally used to write Sanskrit, and the use of this script for Sanskrit continues in Eastern India.


Further reading