This section needs additional citations for verification .(June 2023) |
South Asia is home to several hundred languages, spanning the countries of Afghanistan, Bangladesh, Bhutan, India, Maldives, Nepal, Pakistan, and Sri Lanka. It is home to the fourth most spoken language in the world, Hindi–Urdu; the seventh most spoken language, Bengali; and thirteenth most spoken language, Punjabi. [note 1] Languages like Bengali, Tamil and Nepali have official/national status in more than one country of this region. The languages in the region mostly comprise Indo-Iranic and Dravidian languages, and further members of other language families like Austroasiatic, and Tibeto-Burman languages.
Geolinguistically, the Indo-Aryan, Dravidian and Munda language groups are predominantly distributed across the Indian subcontinent. The term Indic languages is also used to refer to these languages, [1] though it may be narrowed to refer only to Indo-Aryan and Dravidian languages. [2] The subcontinent is also home to a few language isolates, like Burushaski, Kusunda, Nihali, and Vedda.
Areally, the influence of the languages extend beyond the subcontinent into other neighbouring Southern Asian as well as East and Southeast Asian regions, and the extended linguistic area is known as Indosphere. More precisely, the sprachbund of Indic languages and other geopolitically-neighboring languages is known as South Asian languages (which additionally includes Eastern-Iranic and Nuristani languages, as well as Central- and Western- Tibeto-Burman linkages).
The Iranian Plateau located west to the subcontinent is home to Iranic languages, beginning with Pashto of Pashtunistan and Balochi of Balochistan in the eastern-side of the plateau. Persian is spoken in the central plateau, with varieties like Dari in Afghanistan, Farsi in Iran and Tajik in Tajikistan. Kafiri languages are spoken in pockets at the northern intersection of the plateau and the subcontinent.
Tibeto-Burman languages of the Trans-Himalayan family and Khasi–Palaungic languages of the Austroasiatic family are spoken in and beyond the regions of the Himalayan and Indo-Burman Ranges, predominantly on the Tibetan Plateau and Burma. Andamanese languages are spoken on the Andaman Islands.
Historically, Sanskrit was the lingua franca of South Asia. In the Indo-Muslim period, Persian became a connecting language throughout much of India and the Persianate world in general, influencing local languages. [3] From the colonial era onwards, English became a lingua franca to some extent, aiding those who participated in the Indian independence movement for example.
In contemporary times, English is considered the international lingua franca of the South Asian countries. Since the colonial era, the South Asian languages have absorbed significant influences from the English language, [4] [5] [6] [7] with the most-spoken South Asian language Hindustani also acquiring a new English-influenced variant known as Hinglish [8] [9] [10] [11] [12] which is spoken more in urban areas. [13]
On a subregional level, Telugu was a language of high culture in South India in precolonial times, [17] while in modern times, Punjabi and Bengali function as major transnational languages connecting the northwestern and eastern regions of India to Pakistan and Bangladesh respectively (see also Punjabiyat). [18] [19]
The official languages of Afghanistan are Pashto and Dari (Farsi), both of which are Iranic languages. Dari, an Afghan standardised register of the Persian language, is considered the lingua franca of Afghanistan and used to write Afghan literature. Tajik is spoken by people closer to Tajikistan, although officially, is regarded to be the same as Dari. Pashto is widely spoken by the Pashtun people, who mainly reside towards the south of Afghanistan on the Pakistani-Afghan border. A few Turkic languages, like Uzbek and Turkmen, are spoken near regions closer to Uzbekistan and Turkmenistan.
Standard Bengali based on the Rarhi dialect is the national language of Bangladesh. The majority of Bangladeshis speak an eastern variant of Bengali. [20] Other native languages of Bangladesh include Sylheti, Rangpuri and Chittagonian, while some ethnic minority groups also speak Tibeto-Burman, Dravidian and Austro-Asiatic languages. [20]
Dzongkha is the national language of the Kingdom of Bhutan. Other languages spoken include Brokpa, Dzala, Chali Chocangacakha, Dakpa language, Khengkha language, Nepali language, Gongduk, Nyenkha, Lhokpu, Takpa and Tshangla. [21]
Almost all the languages of Bhutan are from the Tibetic family (except Nepali, an Indo-Aryan language).
Most languages spoken in the Republic of India belong either to the Indo-Aryan (c. 74%), the Dravidian (c. 24%), the Austroasiatic (Munda) (c. 1.2%), or the Tibeto-Burman (c. 0.6%) families, with some languages of the Himalayas still unclassified. The SIL Ethnologue lists 461 living languages for the Indian Republic.
Hindustani is the most widespread language of India. The Indian census takes the widest possible definition of "Hindi" as the broad variety of the Hindi languages. The native speakers of Hindi so defined account for 39% of Indians. Bengali is the second most spoken language of South Asia, found in both Bangladesh and Indian states of West Bengal, Tripura and Assam. The International Mother Language Day was created by UNESCO to commemorate the Bengali language. [22] Other notable languages include Odia, Telugu, Punjabi, Marathi, Tamil, Urdu, Sindhi, Kannada, Pashto, Malayalam, Maithili, Meitei (Manipuri), Konkani, and Tulu.
Thirteen languages account for more than 1% of Indian population each, and between themselves for over 95%; all of them are the "scheduled languages of the Constitution".
Scheduled languages spoken by less than 1% of Indians are Santali (0.64%), Meitei (Manipuri) (0.14%), Bodo (0.13%), Dogri (0.01%, spoken in Jammu and Kashmir). The largest language that is not "scheduled" is Bhili (0.95%), followed by Gondi (0.27%), Tulu (0.17%) and Kurukh (0.099%)
Divehi is national language of Maldives, spoken by 95% of the population. Arabic is considered as the religious language, and English is medium of instruction for education and international purposes such as tourism.
Most of the languages of Nepal either fall under Indo-Aryan languages or Sino-Tibetan languages. The official language of the country is Nepali, earlier known as Gorkhali in the Kingdom of Nepal, which is part of the Indo-Aryan group and is the spoken by majority of the population.
The Indo-Aryan languages spoken in Nepal include Maithili language, Bhojpuri language and Tharu language which constitutes majority of the speakers in southern Nepal in the Terai region. [23] The Sino-Tibetan languages includes Tamang, Newari, Magar language, Gurung language, Kiranti languages and Sherpa language and are often spoken in central and northern Nepal in the hilly and mountainous regions. [23]
Pakistan is a linguistically diverse country; it has many dozens of languages spoken as first languages. [24] [25] The major languages of Pakistan broadly fall under the category Indo-Iranian languages, with western regions of Pakistan speaking Iranic languages, and eastern regions speaking Indo-Aryan languages; with the Indus River approximately dividing the families.
Other language families in Pakistan include Dravidian (Brahui spoken in Central Balochistan), Sino-Tibetan languages such as Balti and Purgi spoken in the north-east (In Baltistan region of Pakistan), Nuristani languages such as Kamkata-vari spoken in the north-west (In chitral region of Pakistan), Language Isolate Burushaski spoken in the north (In Gilgit Division), Turkic languages are also spoken in Pakistan, by Kyrgyz migrant families in the North and Uzbeks and Turkmen in Khyber Pakhtunkhwa and by refugees from Afghanistan; and Uyghur refugees from China. [26] [27]
The lingua franca in Pakistan is Urdu, a Persianised register of the Hindustani language. The most-widely spoken first language in the country is Punjabi, spoken by the Punjabi people, forming a majority in the Punjab province and Islamabad Capital Territory. Punjabi is followed by Pashto, Sindhi, Saraiki, [a] Urdu, Balochi; while more than 70 other languages like Shina, Balti, Gujarati, [28] Bengali [29] etc. are also spoken.
Sinhala and Tamil are the official languages of Sri Lanka, with English as the link language. Tamil is a South-Dravidian language, and Sinhala belongs to the Insular Indic family (along with Dhivehi of Maldives). Vedda is said to be the indigenous language of Sri Lanka before the arrival of the Indo-Aryans and Dravidians.
Hindustani is an Indo-Aryan language spoken in North India and Pakistan, and functioning as the lingua franca of the region. It is also spoken by the Deccani people. Hindustani is a pluricentric language with two standard registers, known as Hindi and Urdu which serve as official languages of India and Pakistan, respectively. Thus, it is also called Hindi–Urdu. Colloquial registers of the language fall on a spectrum between these standards. In modern times, a third variety of Hindustani with significant English influences has also appeared, which is sometimes called Hinglish or Urdish.
The Indo-Aryan languages are a branch of the Indo-Iranian languages in the Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily concentrated east of the Indus river in Bangladesh, North India, Eastern Pakistan, Sri Lanka, Maldives and Nepal. Moreover, apart from the Indian subcontinent, large immigrant and expatriate Indo-Aryan–speaking communities live in Northwestern Europe, Western Asia, North America, the Caribbean, Southeast Africa, Polynesia and Australia, along with several million speakers of Romani languages primarily concentrated in Southeastern Europe. There are over 200 known Indo-Aryan languages.
Asia is home to hundreds of languages comprising several families and some unrelated isolates. The most spoken language families on the continent include Austroasiatic, Austronesian, Japonic, Dravidian, Indo-European, Afroasiatic, Turkic, Sino-Tibetan, Kra–Dai and Koreanic. Many languages of Asia, such as Chinese, Persian, Sanskrit, Arabic, Tamil or Telugu, have a long history as a written language.
Languages spoken in the Republic of India belong to several language families, the major ones being the Indo-Aryan languages spoken by 78.05% of Indians and the Dravidian languages spoken by 19.64% of Indians; both families together are sometimes known as Indic languages. Languages spoken by the remaining 2.31% of the population belong to the Austroasiatic, Sino–Tibetan, Tai–Kadai, and a few other minor language families and isolates. According to the People's Linguistic Survey of India, India has the second highest number of languages (780), after Papua New Guinea (840). Ethnologue lists a lower number of 456.
Pakistan is a multilingual country with over 70 languages spoken as first languages. The majority of Pakistan's languages belong to the Indo-Iranian group of the Indo-European language family.
South Asian literature refers to the literature that is composed by authors in the Indian subcontinent and its diaspora. It has an extensive history with some of the earliest known pieces of literature. South Asia has many different languages that have been spoken due to its size and how long people have been inhabiting it. This has caused the region to be the most linguistically diverse region in the planet, and as well as having four language families, hundreds of languages and thousands of dialects. Many modern pieces of South Asian literature are written in English for a global audience. Many of the ancient texts of the subcontinent have been lost due to the inability to preserve verbally transmitted literature. South Asia has many significant authors that shaped the postcolonial period and response to the British establishment in the subcontinent. Modern South Asian literature has a deep focus on independence from Britain, mainly expressed in prose, this literature commonly discusses the partition of India and how different South Asian nations, religions, and cultures interact with each other. Countries to which South Asian literature's writers are linked include India, Pakistan, Bangladesh, Sri Lanka and Nepal. Works from Bhutan, Myanmar, Tibet, and the Maldives are sometimes also included.
Bishnupriya Manipuri, also known as Bishnupriya Meitei or simply as Bishnupriya, is an Indo-Aryan lect belonging to the Bengali–Assamese linguistic sub-branch. It is a creole of Bengali language and Meitei language and it still retains its pre-Bengali features. It is spoken in parts of the Indian states of Assam, Tripura and Manipur as well as in the Sylhet Division of Bangladesh. It uses the Bengali-Assamese script as its writing system. Bishnupriya Manipuri, being a member of the Eastern Indo-Aryan languages, was evolved from Magadhi Prakrit. So, its origin is associated with Magadha realm. The Government of Tripura categorised Bishnnupriya Manipuri under the "Tribal Language Cell" of the State Council of Educational Research and Training. Its speakers are also given the "Other Backward Classes" status by the Assam Government and notably, there is no legal status of the Bishnupriyas in Manipur. In the 2020s, the Bishnupriya speaking people started demanding that the Assam Government should give them the status of "indigenous people" of Assam and treat the same like other indigenous communities of the state.
Since the Iron Age in India, the native languages of the Indian subcontinent are divided into various language families, of which the Indo-Aryan and the Dravidian are the most widely spoken. There are also many languages belonging to unrelated language families such as Munda and Tibeto-Burman, spoken by smaller groups.
Ethnic groups in South Asia are ethnolinguistic groupings within the diverse populations of South Asia, including the countries of Bangladesh, Bhutan, India, Maldives, Nepal, Pakistan, and Sri Lanka. Afghanistan is variously considered to be a part of both Central Asia and South Asia, which means Afghans are not always included among South Asians, but when they are, South Asia has a total population of about 2.04 billion.
Indosphere is a term coined by the linguist James Matisoff for areas of Indian linguistic influence in the neighboring Southern Asian, Southeast Asian, and East Asian regions. It is commonly used in areal linguistics in contrast with the Sinophone languages of the Mainland Southeast Asia linguistic area of the Sinosphere. Notably, unlike terms such as Lusophone or Francophone that refer to the multinational spread and influence of a single language with multiple dialects, this term refers to all languages that are considered to originate in India, of which there are 22 recognised languages alone across several major language families, including Indo-European and Dravidian. It considers these collectively in regards to the influence of these languages on the languages of other countries, rather than from the perspective of the spread of the language only.
Afghanistan is a linguistically diverse nation, with upwards of 40 distinct languages. However, Dari and Pashto are two of the most prominent languages in the country, and have shared official status under various governments of Afghanistan. Dari, as a shared language between multiple ethnic groups in the country, has served as a historical lingua franca between different linguistic groups in the region and is the most widely understood language in the country.
The People of Assam inhabit a multi-ethnic, multi-linguistic and multi-religious society. They speak languages that belong to four main language groups: Tibeto-Burman, Indo-Aryan, Tai-Kadai, and Austroasiatic. The large number of ethnic and linguistic groups, the population composition, and the peopling process in the state has led to it being called an "India in miniature".
Colin Paul Masica was an American linguist who was professor emeritus in the Department of South Asian Languages and Civilizations and the Department of Linguistics at the University of Chicago. Besides being a specialist in Indo-Aryan languages, much of his work was on the typological convergence of languages belonging to different linguistic families in the South Asian area and beyond, more broadly on this phenomenon in general, and on possible explanations for it and implications of it in connection with both linguistic and cultural history.
The national language and official language of Bangladesh is Bangla according to the third article of the Constitution of Bangladesh. Almost 99% of Bangladeshis speak Bengali as their first language. Bengali Language Implementation Act, 1987 made it mandatory to use Bengali in all government affairs except in the cases of foreign relations. According to a 2022 census, Bengali is predominantly spoken by 99% of the country's population and it also serves as the national language of the nation. The indigenous people of northern and southeastern Bangladesh speak a variety of native languages. According to the Ethnologue, there are 36 indigenous living languages, which include 17 Tibeto-Burman, 10 Indo-Aryan, 7 Austroasiatic and 2 Dravidian languages in Bangladesh. Bangladesh has 44 indigenous languages according to Professor Shameem Reza.
The Dravidian peoples, Dravidian-speakers or Dravidians, are a collection of ethnolinguistic groups native to South Asia who speak Dravidian languages. There are around 250 million native speakers of Dravidian languages. Dravidian speakers form the majority of the population of South India and are natively found in India, Pakistan, Afghanistan, Bangladesh, the Maldives, Nepal, Bhutan and Sri Lanka. Dravidian peoples are also present in Singapore, Mauritius, Malaysia, France, South Africa, Myanmar, East Africa, the Caribbean, and the United Arab Emirates through recent migration.
Echo word is a linguistic term that refers to reduplication as a widespread areal feature in the languages of South Asia. Echo words are characterized by reduplication of a complete word or phrase, with the initial segment or syllable of the reduplicant being overwritten by a fixed segment or syllable. In most languages in which this phenomenon is present, echo words serve to express a meaning of "... and such; and things like that." In some cases the echo word may express a depreciative meaning as well.
Pakistan is an ethnically and linguistically diverse country. The major Pakistani ethnolinguistic groups include Punjabis, Pashtuns, Sindhis, Saraikis, Muhajirs, Balochs, Hindkowans/Hazarewals, Brahuis, and Kohistanis with significant numbers of Shina, Baltis, Kashmiris, Paharis, Chitralis, Torwalis, Hazaras, Burusho, Wakhis, Kalash, Siddis, Uzbeks, Nuristanis, Pamiris and various other smaller minorities.
Pakistanis are the citizens and nationals of the Islamic Republic of Pakistan. Pakistan is the fifth-most populous country, with a population of over 241.5 million, having the second-largest Muslim population as of 2023. As much as 90% of the population follows Sunni Islam. A majority of around 97% of Pakistanis are Muslims. The majority of Pakistanis natively speak languages belonging to the Indo-Iranic family.
Before British colonisation, the Persian language was the lingua franca of the Indian subcontinent and a widely used official language in North India. The language was brought into South Asia by various Turkics and Afghans and was preserved and patronized by Local Indian dynasties from the 11th century onwards, notable of which were the Ghaznavids, Sayyid Dynasty, Tughlaq dynasty, Khilji dynasty, Mughal Dynasty, Gujarat Sultanate, Bengal sultanate etc. Initially it was used by Muslim dynasties of India but later started being used by Non-Muslim empires too, For example the Sikh empire, Persian held official status in the court and the administration within these empires. It largely replaced Sanskrit as the language of politics, literature, education, and social status in the subcontinent.
The culture of South Asia, also known as Desi culture, is a mixture of several cultures in and around the Indian subcontinent. Ancient South Asian culture was primarily based in Hinduism, which itself formed as a mixture of Vedic religion and indigenous traditions, and later Buddhist influences. From the medieval era onwards, influences from the Muslim world and then Europe also became prevalent.
Telugu had become the language of high culture in southern India during the medieval period, and by the seventeenth century its status rivalled that of Sanskrit.
In Tyagaraja's time, Telugu was the language of high culture even in Tanjore, the heartland of the Tamil linguistic area.
In precolonial or early-modern South India, Telugu became the cultural language of the south, including the Tamil country, somewhat similar to the overwhelming dominance of French as the cultural language of modern Europe during roughly the same era. Therefore, Telugu predominates in the evolution of Carnatic music, and it is the practice to teach Telugu language in music colleges to those aspiring to become singers.