Linguistic demography is the statistical study of languages among all populations. Estimating the number of speakers of a given language is not straightforward, and various estimates may diverge considerably. This is first of all due to the question of defining "language" vs. "dialect". Identification of varieties as a single language or as distinct languages is often based on ethnic, cultural, or political considerations rather than mutual intelligibility. The second difficulty is multilingualism, complicating the definition of "native language". Finally, in many countries, insufficient census data add to the difficulties.
Demolinguistics is a branch of Sociology of language observing linguistic trends as affected by population distribution and redistribution and by the status of societies.
The following table compares the estimates of Comrie (1998) and Weber (1997) [1] (number of native speakers in millions). Also given are the estimates of SIL Ethnologue (2005). Comparing estimates that do not date to the same year is problematic due to the 1.14% per year growth of world population (with significant regional differences).
Language | Comrie (1998) | Weber (1997) | SIL | |
---|---|---|---|---|
1. | Mandarin Chinese | 836 | 1,100 | 1,205 (1999) |
2.-4. | Hindustani | 333 | 250 | 422 (2001) [2] |
Spanish | 332 | 300 | 322 (1995) | |
English | 322 | 300 | 309 (1984) | |
5.-6. | Arabic | 186 | 200 | 323 (2008) |
Bengali | 189 | 185 | 171 (1994) | |
7.-8. | Russian | 170 | 160 | 145 (2000) |
Portuguese | 170 | 160 | 178 (1995) | |
9. | Japanese | 125 | 125 | 122 (1985) |
10. | German | 100 | 100 | 95.4 (1994) |
This table shows that for the world's largest languages, it is impossible to give an estimate of the number of native speakers with a certainty better than maybe 10% or 20% or so.
Case studies:
There are over 250 languages indigenous to Europe, and most languages of Europe belong to the Indo-European language family. Out of a total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European language. Within Indo-European, the three largest phyla in Europe are Romance, Germanic, and Slavic; they have more than 200 million speakers each, and together account for close to 90% of Europeans.
The Demographics of Greece refer to the demography of the population that inhabits the Greek peninsula. The population of Greece was estimated by the United Nations to be 10,445,365 in 2021.
Pakistan had a population of 241,492,197 or 241.49 Million according to the final results of the 2023 Census. This figure includes Pakistan's four provinces e.g. Punjab, Sindh, KPK, Balochistan and Islamabad Capital Territory. AJK and Gilgit-Baltistan's census data is yet to be approved by CCI Council of Pakistan. Pakistan is the world's fifth most populous country.
Demographic features of the United Arab Emirates (UAE) include population density, vital statistics, immigration and emigration data, ethnicity, education levels, religions practiced, and languages spoken within the UAE.
Pakistan is a multilingual country with over 70 languages spoken as first languages. The majority of Pakistan's languages belong to the Indo-Iranian group of the Indo-European language family.
Bihari languages is a group of the Indo-Aryan languages. The Bihari languages are mainly spoken in the Indian states of Bihar, Jharkhand, Uttar Pradesh, and West Bengal, and also in Nepal. The most widely spoken languages of the Bihari group are Bhojpuri, Magahi and Maithili.
A minority language is a language spoken by a minority of the population of a territory. Such people are termed linguistic minorities or language minorities. With a total number of 196 sovereign states recognized internationally and an estimated number of roughly 5,000 to 7,000 languages spoken worldwide, the vast majority of languages are minority languages in every country in which they are spoken. Some minority languages are simultaneously also official languages, such as Irish in Ireland or the numerous indigenous languages of Bolivia. Likewise, some national languages are often considered minority languages, insofar as they are the national language of a stateless nation.
Peru has many languages in use, with its official languages being Spanish, Quechua and Aymara. Spanish has been in the country since it began being taught in the time of José Pardo instead of the country's Native languages, especially the languages in the Andes. In the beginning of the 21st century, it was estimated that in this multilingual country, about 50 very different and popular languages are spoken: which reduces to 44 languages if dialects are considered variants of the same language. The majority of these languages are Indigenous, but the most common language is Spanish, the main language that about 94.4% of the population speaks. Spanish is followed by the country's Indigenous languages, especially all types of Quechua and Aymara (1.7%), who also have co-official status according to Article 48 of the Constitution of Peru, as well as the languages of the Amazon and the Peruvian Sign Language. In urban areas of the country, especially the coastal region, most people are monolingual and only speak Spanish, while in many rural areas of the country, especially in the Amazon, multilingual populations are prevalent.
Afghanistan is a linguistically diverse nation, with upwards of 40 distinct languages. However, Dari and Pashto are two of the most prominent languages in the country, and have shared official status under various governments of Afghanistan. Dari, as a shared language between multiple ethnic groups in the country, has served as a historical lingua franca between different linguistic groups in the region and is the most widely understood language in the country. Pashto is also widely spoken in the region; but the language does not have a diverse multi-ethnic population like Dari, and the language is not as commonly spoken by non-Pashtuns. Dari and Pashto are also "relatives", as both are Iranian languages.
Polish dialects are regional vernacular varieties of the Polish language.
Thailand is home to 71 living languages, with the majority of people speaking languages of the Southwestern Tai family, and the national language being Central Thai. Lao is spoken along the borders with the Lao PDR, Karen languages are spoken along the border with Myanmar, Khmer is spoken near Cambodia and Malay is spoken in the south near Malaysia. Sixty-two 'domestic' languages are officially recognized, and international languages spoken in Thailand, primarily by international workers, expatriates and business people, include Burmese, Karen, English, Chinese, Japanese, and Vietnamese, among others.
The geographical distribution of speakers of Macedonian refers to the total number of native speakers of Macedonian, an East South Slavic language that serves as the official language of North Macedonia. Estimates of the number of native and second language speakers of Macedonian varies; the number of native speakers in the country ranges from 1,344,815 according to the 2002 census in North Macedonia to 1,476,500 per linguistic database Ethnologue in 2016. Estimates of the total number of speakers in the world include 3.5 million people. Macedonian is studied and spoken as a second language by all ethnic minorities in the country.
The Tharu or Tharuhat languages are any of the Indo-Aryan languages spoken by the Tharu people of the Terai region in Nepal, and neighboring regions of Uttarakhand, Uttar Pradesh and Bihar in India.
The Warumungu language is spoken by the Warumungu people in Australia's Northern Territory. In addition to spoken language, the Warumungu have a highly developed sign language.
The Punjabi dialects and languages or Greater Panjabic are a series of dialects and languages spoken around the Punjab region of Pakistan and India with varying degrees of official recognition. They have sometimes been referred to as the Greater Punjabi macrolanguage. Punjabi may also be considered as a pluricentric language with more than one standard variety.
Many countries and national censuses currently enumerate or have previously enumerated their populations by languages, native language, home language, level of knowing language or a combination of these characteristics.