Linguistic demography is the statistical study of languages among all populations. Estimating the number of speakers of a given language is not straightforward, and various estimates may diverge considerably. This is first of all due to the question of defining "language" vs. "dialect". Identification of varieties as a single language or as distinct languages is often based on ethnic, cultural, or political considerations rather than mutual intelligibility. The second difficulty is multilingualism, complicating the definition of "native language". Finally, in many countries, insufficient census data add to the difficulties.
Demolinguistics is a branch of the sociology of language that observes linguistic trends as they are affected by population distribution and redistribution and by the status of societies.
The following table compares the estimates of Comrie (1998) and Weber (1997) [1] (number of native speakers in millions). Also given are the estimates of SIL Ethnologue (2005). Comparing estimates that do not date to the same year is problematic due to the 1.14% per year growth of world population (with significant regional differences).
Rank | Language | Comrie (1998) | Weber (1997) | SIL (year) |
---|---|---|---|---|
1. | Mandarin Chinese | 836 | 1,100 | 1,205 (1999) |
2.-4. | Hindustani | 333 | 250 | 422 (2001) [2] |
2.-4. | Spanish | 332 | 300 | 322 (1995) |
2.-4. | English | 322 | 300 | 309 (1984) |
5.-6. | Arabic | 186 | 200 | 323 (2008) |
5.-6. | Bengali | 189 | 185 | 171 (1994) |
7.-8. | Russian | 170 | 160 | 145 (2000) |
7.-8. | Portuguese | 170 | 160 | 178 (1995) |
9. | Japanese | 125 | 125 | 122 (1985) |
10. | German | 100 | 100 | 95.4 (1994) |
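To see how much the year mismatch can matter, the following sketch rescales an estimate from its own year to a common reference year, using compound growth at the 1.14% world rate quoted above. The function name `project_estimate` and the choice of 1998 as the reference year are illustrative assumptions. Because growth differs widely by region, a single world rate is only a rough stand-in for any individual language.

```python
WORLD_GROWTH_RATE = 0.0114  # 1.14% per year, the world figure quoted in the text


def project_estimate(millions, from_year, to_year, rate=WORLD_GROWTH_RATE):
    """Rescale a speaker estimate between years, assuming constant compound growth."""
    return millions * (1 + rate) ** (to_year - from_year)


# SIL figures from the table: (millions of native speakers, year of estimate)
sil_estimates = {
    "English": (309, 1984),
    "Spanish": (322, 1995),
    "Japanese": (122, 1985),
}

REFERENCE_YEAR = 1998  # assumed reference year (matches the Comrie column)

for language, (millions, year) in sil_estimates.items():
    adjusted = project_estimate(millions, year, REFERENCE_YEAR)
    print(f"{language}: {millions} ({year}) -> {adjusted:.0f} ({REFERENCE_YEAR})")
```

Under this assumption the 1984 English figure grows by roughly 17% over the 14 years to 1998, which is of the same order as the disagreement between the sources themselves.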
This table shows that, even for the world's largest languages, the number of native speakers cannot be estimated with an accuracy better than roughly 10% to 20%.
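One way to quantify the disagreement between the three sources is the range of the estimates divided by their mean. The sketch below applies this to a few rows of the table. The measure and the name `relative_spread` are illustrative choices, not a method taken from the sources, and the figures are compared as published, without adjusting for their different years.

```python
# Native-speaker estimates in millions: (Comrie 1998, Weber 1997, SIL)
estimates = {
    "Mandarin Chinese": (836, 1100, 1205),
    "Hindustani": (333, 250, 422),
    "Arabic": (186, 200, 323),
    "Japanese": (125, 125, 122),
    "German": (100, 100, 95.4),
}


def relative_spread(values):
    """Return (max - min) / mean, a simple measure of disagreement."""
    mean = sum(values) / len(values)
    return (max(values) - min(values)) / mean


for language, values in estimates.items():
    print(f"{language}: {relative_spread(values):.1%}")
```

With these figures the measure is a few percent for Japanese and German but above 30% for Mandarin Chinese, Hindustani and Arabic.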
Case studies:
Ethnologue: Languages of the World is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensive catalogue of languages. It was first issued in 1951, and is now published by SIL International, an American evangelical Christian non-profit organization.
The Demographics of Greece refer to the demography of the population that inhabits the Greek peninsula. The population of Greece was estimated by the United Nations to be 10,445,365 in 2021.
Pakistan had a population of 241,492,197 according to the final results of the 2023 Census. This figure includes Pakistan's four provinces (Punjab, Sindh, Khyber Pakhtunkhwa and Balochistan) and the Islamabad Capital Territory. Census data for Azad Jammu and Kashmir (AJK) and Gilgit-Baltistan have yet to be approved by the Council of Common Interests (CCI). Pakistan is the world's fifth most populous country.
Pakistan is a multilingual country with over 70 languages spoken as first languages. The majority of Pakistan's languages belong to the Indo-Iranian group of the Indo-European language family.
Bihari languages are a group of the Indo-Aryan languages. The Bihari languages are mainly spoken in the Indian states of Bihar, Jharkhand, Uttar Pradesh, and West Bengal, and also in Nepal. The most widely spoken languages of the Bihari group are Bhojpuri, Magahi and Maithili.
A minority language is a language spoken by a minority of the population of a territory. Such people are termed linguistic minorities or language minorities. With a total number of 196 sovereign states recognized internationally and an estimated number of roughly 5,000 to 7,000 languages spoken worldwide, the vast majority of languages are minority languages in every country in which they are spoken. Some minority languages are simultaneously also official languages, such as Irish in Ireland or the numerous indigenous languages of Bolivia. Likewise, some national languages are often considered minority languages, insofar as they are the national language of a stateless nation.
Kenya is a multilingual country. The two official languages of Kenya, Swahili and English, are widely spoken as lingua francas; when second-language speakers are included, however, Swahili is more widely spoken than English. Swahili is a Bantu language native to East Africa, and English is inherited from British colonial rule.
Peru has many languages in use, with its official languages being Spanish, Quechua and Aymara. Spanish began to be taught in place of the country's Native languages, especially those of the Andes, in the time of José Pardo. At the beginning of the 21st century, it was estimated that about 50 very different languages were spoken in this multilingual country, a figure that falls to 44 if dialects are counted as variants of the same language. The majority of these languages are Indigenous, but the most common language is Spanish, the main language of about 94.4% of the population. Spanish is followed by the country's Indigenous languages, especially all varieties of Quechua and Aymara (1.7%), which also have co-official status according to Article 48 of the Constitution of Peru, as well as the languages of the Amazon and Peruvian Sign Language. In urban areas of the country, especially in the coastal region, most people are monolingual and speak only Spanish, while in many rural areas, especially in the Amazon, multilingual populations are prevalent.
Afghanistan is a linguistically diverse nation, with upwards of 40 distinct languages. Persian and Pashto are the two most prominent languages in the country and have shared official status under various governments of Afghanistan. Persian, as a language shared between multiple ethnic groups, has served as a historical lingua franca between different linguistic groups in the region and is the most widely understood language in the country. Pashto is also widely spoken in the region, but unlike Persian it does not have a diverse multi-ethnic speaker population and is not as commonly spoken by non-Pashtuns. Persian and Pashto are also related, as both are Iranian languages.
Polish dialects are regional vernacular varieties of the Polish language.
Thailand is home to 51 living indigenous languages and 24 living non-indigenous languages, with the majority of people speaking languages of the Southwestern Tai family, and the national language being Central Thai. Lao is spoken along the borders with the Lao PDR, Karen languages are spoken along the border with Myanmar, Khmer is spoken near Cambodia and Malay is spoken in the south near Malaysia. Sixty-two 'domestic' languages are officially recognized, and international languages spoken in Thailand, primarily by international workers, expatriates and business people, include Burmese, Karen, English, Chinese, Japanese, and Vietnamese, among others.
The geographical distribution of speakers of Macedonian refers to the total number of native speakers of Macedonian, an East South Slavic language that serves as the official language of North Macedonia. Estimates of the number of native and second-language speakers of Macedonian vary; the number of native speakers in the country ranges from 1,344,815 according to the 2002 census in North Macedonia to 1,476,500 according to the linguistic database Ethnologue in 2016. Estimates of the total number of speakers worldwide include a figure of 3.5 million. Macedonian is studied and spoken as a second language by all ethnic minorities in the country.
The Tharu or Tharuhat languages are any of the Indo-Aryan languages spoken by the Tharu people of the Terai region in Nepal, and neighboring regions of Uttarakhand, Uttar Pradesh and Bihar in India.
The Warumungu language is spoken by the Warumungu people in Australia's Northern Territory. In addition to spoken language, the Warumungu have a highly developed sign language.
The Punjabi dialects and languages or Greater Punjabi are a series of dialects and languages spoken around the Punjab region of Pakistan and India with varying degrees of official recognition. They have sometimes been referred to as the Greater Punjabi macrolanguage. Punjabi may also be considered as a pluricentric language with more than one standard variety.
Many countries and national censuses currently enumerate, or have previously enumerated, their populations by language, native language, home language, level of language proficiency, or a combination of these characteristics.