| Bengali alphabet বাংলা বর্ণমালা বা লিপি | |
|---|---|
| | |
| Script type | Abugida |
Period | 11th century to the present [1] |
| Direction | Left-to-right |
| Official script | for Bengali language and Meitei language [2] [3] |
| Region | Bengal |
| Languages | Bengali, Sanskrit, Kokborok, Kudmali, Hajong, Bishnupriya Manipuri, Meitei, Magahi [4] |
| Related scripts | |
Parent systems | Egyptian
|
Sister systems | Assamese and Tirhuta |
| ISO 15924 | |
| ISO 15924 | Beng(325),Bengali (Bangla) |
| Unicode | |
Unicode alias | Bengali |
| U+0980–U+09FF | |
| Part of a series on the |
| Culture of Bengal |
|---|
| |
| History |
| Cuisine |
| Part of a series on the |
| Culture of Bangladesh |
|---|
| |
| Brahmic scripts |
|---|
| The Brahmi script and its descendants |
The Bengali script or Bangla alphabet [a] is the standard writing system used to write the Bengali language, and has historically been used to write Sanskrit within Bengal. [6] An estimated 300 million people use this syllabic alphabet, which makes it the 5th most commonly used writing system in the world. [7] [8] It is the sole national script of Bangladesh and one of the official scripts of India, specifically used in the Indian states of West Bengal, Tripura and the Barak Valley of Assam. The script is also used for the Meitei language in Manipur, defined by the Manipur Official Language Act. [9]
From a classificatory point of view, the Bengali writing system is derived from the Brahmi script. [10] It is written from left to right. It is an abugida, i.e., its vowel graphemes are mainly realised not as independent letters, but as diacritics modifying the inherent vowel in the base letter to which they are added. There are no distinct upper and lower case letter forms, which makes it a unicameral script. The script is characterised by many conjuncts, upstrokes, downstrokes, and other features that hang from a horizontal line running along the tops of the graphemes that links them together called matra (মাত্রা [ˈmat̪ɾaˑ] 'measure'). The punctuation is all borrowed from 19th-century English, with the exception of one. [10]
The Bengali script can be divided into vowels and vowel diacritics, consonants and conjunct consonants, diacritical and other symbols, digits, and punctuation marks. Vowels and consonants are used as letters and also as diacritical marks.
The Bengali script has a total of 11 vowel graphemes, each of which is called a svaravana. [b] They represent six of the seven main vowel sounds of Bengali, along with two vowel diphthongs. All of them are used in both Bengali and Assamese languages.
The table below shows the vowels present in the modern (i.e., since the late 19th century) inventory of the Bengali abugida:
| হ্রস্ব (short) | দীর্ঘ (long) | |||||
|---|---|---|---|---|---|---|
| স্বর (vowel phoneme) | কার (vowel mark) | স্বর (vowel phoneme) | কার (vowel mark) | |||
| কন্ঠ্য (Guttural) | অ | ô / ɔ ~ o / [k] | - | আ | a /ɐ/ [l] | া |
| তালব্য (Palatal) | ই | i /i/ | ি | ঈ | ī /i/ | ী |
| ওষ্ঠ্য (Labial) | উ | u / u / [m] | ু | ঊ | ū /u/ | ূ |
| মূর্ধন্য (retroflex) | ঋ | ṛ /ri/ | ৃ | ৠ | ṝ | ৄ |
| দন্ত্য (Dental) | ঌ | ḷ /li/ | ৢ | ৡ | ḹ | ৣ |
| যুক্তস্বর (complex vowels) | ||||||
| কন্ঠ্যতালব্য (Palato-guttural) | এ | e / e / [n] | ে | ঐ | oi /oi/ | ৈ |
| কন্ঠৌষ্ঠ্য (Labioguttural) | ও | o / o ~ w / [o] [m] | ো | ঔ | ou /ou/ | ৌ |
Consonant letters are called ব্যঞ্জনবর্ণbênjônbôrṇô [a] [b] in Bengali. The names of the letters are typically just the consonant sound plus the inherent vowel অô. Since the inherent vowel is assumed and not written, most letters' names look identical to the letter itself (e.g., the name of the letter ঘ is itself ghô, not gh).
| স্পর্শ (Stop) | অনুনাসিক (Nasal) | অন্তঃস্থ (Approximant) | ঊষ্ম (Fricative) | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| বর্গীয় বর্ণ (Generic sounds) | ||||||||||||||||
| Voicing → | অঘোষ (Voiceless) | ঘোষ (Voiced) | অঘোষ (Voiceless) | ঘোষ (Voiced) | ||||||||||||
| Aspiration → | অল্পপ্রাণ (Unaspirated) | মহাপ্রাণ (Aspirated) | অল্পপ্রাণ (Unaspirated) | মহাপ্রাণ (Aspirated) | অল্পপ্রাণ (Unaspirated) | মহাপ্রাণ (Aspirated) | ||||||||||
| কন্ঠ্য (Guttural) [f] | ক | kô / k ɔ/ | খ | khô / kʰ ɔ/ | গ | gô / ɡ ɔ/ | ঘ | ghô / ɡʱ ɔ/ | ঙ | ṅô / ŋ ɔ/ | হ | hô / ɦ ɔ~ h ɔ/ [g] | ||||
| তালব্য (Palatal) [h] | চ | cô / tʃ ɔ~ tɕ ɔ/ | ছ | chô /tʃʰɔ~ tɕʰ ɔ/ | জ | jô / dʒ ɔ~ dz ɔ/ | ঝ | jhô /dʒʱɔ~dzʱɔ/ | ঞ | ñô / n ɔ/ [i] | য | yô / dʒ ɔ~ dz ɔ~ z ɔ/ [j] | শ | śô / ʃ ɔ~ ɕ ɔ~ s ɔ/ [k] | ||
| মূর্ধন্য (Retroflex) [l] | ট | ṭô / ʈ ɔ/ | ঠ | ṭhô /ʈʰɔ/ | ড | ḍô / ɖ ɔ/ | ঢ | ḍhô /ɖʱɔ/ | ণ | ṇô / n ɔ/ [m] | র | rô /ɾɔ/ [n] | ষ | ṣô / ɕ ɔ~ ʃ ɔ~ ʂ ɔ/ [k] | ||
| দন্ত্য (Dental) | ত | tô / t̪ ɔ/ | থ | thô /t̪ʰɔ/ | দ | dô / d̪ ɔ/ | ধ | dhô /d̪ʱɔ/ | ন | nô / n ɔ/ | ল | lô / l ɔ/ | স | sô / s ɔ~ ɕ ɔ~ ʃ ɔ/ [k] | ||
| ওষ্ঠ্য (Labial) | প | pô / p ɔ/ | ফ | phô /pʰɔ/ [o] | ব | bô / b ɔ/ | ভ | bhô /bʱɔ/ [p] | ম | mô / m ɔ/ | ৱ | wô /wɔ/ | ||||
| Post-reform letters | ড় | ṛô / ɽ ɔ/ | ঢ় | ṛhô /ɽʱɔ~ ɽ / [q] | য় | ẏô / j ɔ~ e̯ ɔ/ |
|---|
Clusters of up to four consonants can be orthographically represented as a typographic ligature, called a consonant conjunct (Bengali : যুক্তাক্ষর/যুক্তবর্ণyuktakṣôr/yuktôbôrṇô, or more precisely, যুক্তব্যঞ্জনyuktôbêñjôn). Typically, the first consonant in the conjunct is shown above or to the left of the following consonants. Many consonants appear in an abbreviated or compressed form when serving as part of a conjunct. Others simply take exceptional forms in conjuncts, bearing little or no resemblance to the base character.
Often, consonant conjuncts are not actually pronounced as would be implied by the pronunciation of the individual components. For example, adding লlô underneath শśô in Bengali creates the conjunct শ্ল, which is pronounced /slɔ/ (and not /ʃlɔ/) in Bengali. Many conjuncts represent Sanskrit sounds that were lost centuries before modern Bengali was ever spoken; for instance, জ্ঞjñô, which is a combination of জjô and ঞñô, is pronounced ggô/gːɔ/ in modern Bengali (which does not permit the sequence /*dʒɲ/). Thus, as conjuncts often represent combinations of sounds that cannot be easily understood from the components, the following descriptions are concerned only with the construction of the conjunct, and not the resulting pronunciation.
Some consonants fuse in such a way that one stroke of the first consonant also serves as a stroke of the next.
Some consonants are written closer to one another simply to indicate that they are in a conjunct together.
Some consonants are compressed (and often simplified) when appearing as the first member of a conjunct.
Some consonants are abbreviated when appearing in conjuncts and lose part of their basic shape.
Some consonants have forms that are used regularly but only within conjuncts.
When serving as a vowel mark, উ u, ঊ u, and ঋ ri take on many exceptional forms.
These are mainly the Brahmi-Sanskrit diacritics, phones and punctuation marks present in languages with Sanskrit influence or Brahmi-derived scripts.
| Symbol/ Graphemes | Name | Function | Romanisation | IPA transcription |
|---|---|---|---|---|
| ৎ [nc 1] | খণ্ড ত khôndô tô | Special character. Final unaspirated dental [t̪] | t | /t̪/ |
| ং [nc 2] | অনুস্বার ônusshar | Diacritic. Final velar nasal [ŋ] | ṅ | /ŋ/ |
| ঃ [nc 2] | বিসর্গ bisôrgô | Diacritic. 1. Doubles the next consonant sound without the vowel (spelling feature) in দুঃখduḥkhô[ˈd̪uɦkʰoˑ]>[ˈd̪uʔkʰoˑ]>[ˈd̪uk̚kʰoˑ] 'sorrow' 2. Final -ḥ examples: এঃeḥ, উঃuḥ 3. Silent in spellings like আন্তঃনগরantôḥnôgôr[ˈant̪ɔɦˌnoɡɔɾ]>[ˈant̪ɔˌnoɡɔɾ] 'intercity' 4. Also used as an abbreviation, e.g., কিঃমিঃ (similar to 'km' in English, for the word কিলোমিটার 'kilometre'), ডাঃ (similar to 'Dr.' in English, for ডাক্তার 'doctor'. However, in modern Bengali, using বিসর্গ bisôrgô for making abbreviations is considered grammatically wrong and the full stop is used for making abbreviations, e.g., as in কি.মি. 'km', ডা. 'Dr.'. [17] [18] | ḥ | /h/ |
| ঁ | চন্দ্রবিন্দু côndrôbindu | Diacritic. Vowel nasalisation | ◌̃ / ṃ | /◌̃/ |
| ্ | হসন্ত hôsôntô | Diacritic. Suppresses the inherent vowel [ɔ] | – | – |
| ঽ | অবগ্রহ ôbôgrôhô | Special character or sign. Used for prolonging vowel sounds E.g., শোনঽঽঽ…śônôôô… 'listennn…' (This is where the default inherited vowel sound ô in নnô is prolonged.) E.g., কিঽঽঽ?kiii? 'whaaat?' (This is where the vowel sound i which is attached with the consonant কkô is prolonged.) | - | – |
| ্য | যফলা yôphôla | Diacritic. Used with two types of pronunciation in modern Bengali depending on the location of the consonant it is used with within a syllable E.g., when the consonant it is used with is syllable-initial, it acts as the vowel /æ/, and thus, ত্যাগ is pronounced /t̪æɡ/ E.g., when the consonant with which it is used l is syllable-final, it doubles the consonant, and thus, মুখ্য is pronounced /ˈmukʰːɔ/ Notably used in transliterating English words with /æ/, e.g. ব্ল্যাক 'black', and sometimes as a diacritic to indicate non-Bengali vowels of various kinds in transliterated foreign words, e.g. the schwa indicated by a yôphôla; the French u/y/ and the German umlaut ü /y~ʏ/ as উ্যuyô; the French eu/ø~œ/ and the German umlaut ö /ø~œ/ as ও্যoyô or এ্যeyô. | ê / yô | /æ/ or /ː/ |
| ্র | রফলা rôphôla | Diacritic. [r] pronounced following a consonant phoneme. | r | /r/ |
| র্ক | রেফ reph | Diacritic. [ɾ] pronounced preceding a consonant phoneme. | r | /r/ |
| ্ব | বফলা bôphôla | Diacritic. Used in spellings only, if they were adopted from Sanskrit and has two different pronunciations depending on the location of the consonant with which it is used. E.g., when the consonant with which it is used is syllable-initial, it remains silent, and thus, স্বাধীন is pronounced /ˈʃad̪ʱin/ (and not /*ˈʃbad̪ʱin/ or *ˈʃʋad̪ʱin/). E.g., when the consonant with which it is used is syllable-final, it doubles the consonant, and thus, বিদ্বান is pronounced /ˈbid̪ːan/ and বিশ্ব is pronounced /ˈbiʃːɔ/. However, certain Sanskrit sandhis (i.e., phonetic fusions) such as ঋগ্বেদ, দিগ্বিজয়, উদ্বেগ, and উদ্বৃত্ত are pronounced /ˈriɡbed̪/, /ˈd̪iɡbidʒɔe̯/, /ˈud̪beɡ/, and /ˈud̪brittɔ/, respectively, while usage with the consonant হ defies phonological rules, e.g., আহ্বান /ˈaɦban/>[ˈau̯bʱan], জিহ্বা {{IPA|/ˈdʒiɦba/ > [ˈdʒiu̯bʱa]. Also used in transliterating Islam-related Arabic words Note: Not all instances of বbô used as the last member of a conjunct are bôphôla, e.g., in the words অম্বরômbôr, লম্বাlômba, তিব্বতtibbôt, বাল্বbalb, etc. | - | /ː/ |
| ৺ | ঈশ্বর iśbôr | Sign. Represents the name of the deity and also written before the name of a deceased person. | – | – |
| ঀ | আঞ্জী/সিদ্ধিরস্তু añji/siddhirôstu | Sign. Used at the beginning of texts as an invocation. | – | – |
The Bengali script has ten numerical digits (graphemes or symbols indicating the numbers from 0 to 9). Bengali numerals have no horizontal headstroke or মাত্রা matra.
| Hindu-Arabic numerals | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|---|
| Bengali numerals | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ |
Numbers larger than 9 are written in Bengali using a positional base 10 numeral system (the decimal system). A period or dot is used to denote the decimal separator, which separates the integral and the fractional parts of a decimal number. When writing large numbers with many digits, commas are used as delimiters to group digits, indicating the thousand (হাজার hajar), the hundred thousand or lakh (লাখ lakh or লক্ষ lôkṣô), and the ten million or hundred lakh or crore (কোটি koṭi) units. I.e., leftwards from the decimal separator, the first grouping consists of three digits, and the subsequent groupings always consist of two digits.
For example, the English number 17,557,345 will be written in traditional Bengali as ১,৭৫,৫৭,৩৪৫.
Bengali punctuation marks, apart from the downstroke দাড়ি daṛi (।), the Bengali equivalent of a full stop, have been adopted from western scripts and their usage is similar: Commas, semicolons, colons, quotation marks, etc. are the same as in English. Capital letters are absent in the Bengali script so proper names are unmarked.
An apostrophe, known in Bengali as ঊর্ধ্বকমা urdhbôkôma 'upper comma', is sometimes used to distinguish between homographs, e.g., পাটা paṭa 'plank', পাʼটা pa'ṭa 'the leg'. Alternatively a hyphen is used for the same purpose, e.g., পা-টা pa-ṭa.
Bengali text is written and read horizontally, from left to right. The consonant graphemes and the full form of vowel graphemes fit into an imaginary rectangle of uniform size (uniform width and height). The size of a consonant conjunct, regardless of its complexity, is deliberately maintained the same as that of a single consonant grapheme, so that diacritic vowel forms can be attached to it without any distortion. In a typical Bengali text, orthographic words, words as they are written, can be seen as being separated from each other by an even spacing. Graphemes within a word are also evenly spaced, but that spacing is much narrower than the spacing between words.
Unlike in purely alphabetic scripts – like Latin, Greek, and Cyrillic – for which the letter-forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called মাত্রা matra. The presence and absence of this matra can be important. For example, the letter ত tô and the numeral ৩ (3) are distinguishable only by the presence or absence of the matra, as is the case between the consonant cluster ত্র trô and the independent vowel এ e. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline).
| Grapheme | Percentage |
|---|---|
| আ | 11.32 |
| এ | 8.96 |
| র | 7.01 |
| অ | 6.63 |
| ব | 4.44 |
| ক | 4.15 |
| ল | 4.14 |
| ত | 3.83 |
| ম | 2.78 |
According to Bengali linguist Munier Chowdhury, there are about nine graphemes that are the most frequent in Bengali texts, shown with its percentage of appearance in the adjacent table. [19]
| a | ā | i | ī | u | ū | ṛ | ṝ | ḷ | ḹ | e | ai | o | au | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bengali | অ | আ | ই | ঈ | উ | ঊ | ঋ | ৠ | ঌ | ৡ | এ | ঐ | ও | ঔ |
| Odia | ଅ | ଆ | ଇ | ଈ | ଉ | ଊ | ଋ | ୠ | ଌ | ୡ | ଏ | ଐ | ଓ | ଔ |
| Devanagari | अ | आ | इ | ई | उ | ऊ | ऋ | ॠ | ऌ | ॡ | ए | ऐ | ओ | औ |
| Siddham | | | | | | | | | | | | | | |
| ka | kā | ki | kī | ku | kū | kṛ | kṝ | kḷ | kḹ | ke | kai | ko | kau | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bengali | ক | কা | কি | কী | কু | কূ | কৃ | কৄ | কৢ | কৣ | কে | কৈ | কো | কৌ |
| Odia | କ | କା | କି | କୀ | କୁ | କୂ | କୃ | କୄ | କୢ | କୣ | କେ | କୈ | କୋ | କୌ |
| Devanagari | क | का | कि | की | कु | कू | कृ | कॄ | कॢ | कॣ | के | कै | को | कौ |
| |
In the Bengali abugida, clusters of consonants are represented by different and sometimes quite irregular forms; thus, learning to read is complicated by the sheer size of the full set of letters and letter combinations, numbering about 350. Ishwar Chandra Vidyasagar introduced punctuation marks in Bengali language and wrote a book named Barnaparichay to standardize Bengali alphabets. While efforts at standardising the alphabet for the Bengali language continue in such notable centres as the Bangla Academy at Dhaka (Bangladesh) and the Pôshchimbônggô Bangla Akademi at Kolkata (West Bengal, India), it is still not quite uniform yet, as many people continue to use various archaic forms of letters, resulting in concurrent forms for the same sounds.
Romanisation of Bengali is the representation of the Bengali language in the Latin script. There are various ways of Romanization systems of Bengali, created in recent years but failed to represent the true Bengali phonetic sound. While different standards for romanisation have been proposed for Bengali, they have not been adopted with the degree of uniformity seen in languages such as Japanese or Sanskrit. [nb 2] The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation in which the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration or "IAST system", [20] "Indian languages Transliteration" or ITRANS (uses upper case alphabets suited for ASCII keyboards), [21] and the extension of IAST intended for non-Sanskrit languages of the Indian region called the National Library at Kolkata romanisation. [22]
সমস্ত
Sômôstô
[ˈʃɔmost̪oˑ
All
মানুষ
manuṣ
ˈmanuʃ
human
স্বাধীনভাবে
sbadhinbhabe
ˈʃad̪ʱinˌbʱabeˑ
free-manner-in
সমান
sôman
ˈʃoman
equal
মর্যাদা
môrjada
ˈmɔɾdʒad̪aˑ
dignity
এবং
ebôṅ
ˈeboŋ
and
অধিকার
ôdhikar
ˈod̪ʱikaɾ
right
নিয়ে
niẏe
ˈnie̯eˑ
taken
জন্মগ্রহণ
jônmôgrôhôṇ
ˈdʒɔnmoˌɡɾoɦon
birth-take
করে।
kôre.
ˈkɔɾeˑ‖
do.
তাঁদের
Tãder
ˈt̪ãd̪eɾ
Their
বিবেক
bibek
ˈbibek
reason
এবং
ebôṅ
ˈeboŋ
and
বুদ্ধি
buddhi
ˈbud̪ːʱiˑ
intelligence
আছে;
ache;
ˈatʃʰeˑ‖
exist;
সুতরাং
sutôraṅ
ˈʃut̪oɾaŋ
therefore
সকলেরই
sôkôleri
ˈʃɔkoˌleɾiˑ
everyone-indeed
একে
êke
ˈækeˑ
one
অপরের
ôpôrer
ˈɔpoɾeɾ
another's
প্রতি
prôti
ˈpɾot̪iˑ
towards
ভ্রাতৃত্বসুলভ
bhratṛtbôsulôbh
ˈbʱɾat̪ɾiˌt̪ːoʃulɔbʱ
brotherhood-ly
মনোভাব
mônobhab
ˈmonobʱab
attitude
নিয়ে
niẏe
ˈnie̯eˑ
taken
আচরণ
acôrôṇ
ˈatʃoɾɔn
conduct
করা
kôra
ˈkɔɾaˑ
do
উচিত।
ucit.
ˈutʃit̪‖]
should.
All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience. Therefore, they should act towards one another in a spirit of brotherhood.
Bengali script was added to the Unicode Standard in October 1991 with the release of version 1.0.
The Unicode block for Bengali is U+0980–U+09FF:
| Bengali [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+098x | ঀ | ঁ | ং | ঃ | অ | আ | ই | ঈ | উ | ঊ | ঋ | ঌ | এ | |||
| U+099x | ঐ | ও | ঔ | ক | খ | গ | ঘ | ঙ | চ | ছ | জ | ঝ | ঞ | ট | ||
| U+09Ax | ঠ | ড | ঢ | ণ | ত | থ | দ | ধ | ন | প | ফ | ব | ভ | ম | য | |
| U+09Bx | র | ল | শ | ষ | স | হ | ় | ঽ | া | ি | ||||||
| U+09Cx | ী | ু | ূ | ৃ | ৄ | ে | ৈ | ো | ৌ | ্ | ৎ | |||||
| U+09Dx | ৗ | ড় | ঢ় | য় | ||||||||||||
| U+09Ex | ৠ | ৡ | ৢ | ৣ | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ | ||
| U+09Fx | ৰ | ৱ | ৲ | ৳ | ৴ | ৵ | ৶ | ৷ | ৸ | ৹ | ৺ | ৻ | ৼ | ৽ | ৾ | |
| Notes | ||||||||||||||||
"Manipuri Language" means Meeteilon written in Meetei Mayek and spoken by the majority of Manipur population: Provided that the concurrent use of Bengali Script and Meetei Mayek shall be allowed in addition to English language, for a period up to 10 (ten) years from the date of commencement of this Act.
Although in modern usage Sanskrit is most commonly written or printed in Nagari, in theory it can be represented by virtually any of the main Brāhmī based scripts, and in practice it often is. Thus scripts such as Gujarati, Bangla, and Oriya, as well as the major south Indian scripts, traditionally have been and often still are used in their proper territories for writing Sanskrit.
yet it is to be noted as a fact, that the cerebral letters are not so much cerebral as they are dental in our speech. If we carefully notice our pronunciation of the letters of the ট class we will see that we articulate ট and ড, for example, almost like English T and D without turning up the tip of the tongue much away from the region of the teeth.