List of languages by total number of speakers

Last updated

Current distribution of human language families Human Language Families Updated.jpg
Current distribution of human language families

This is a list of languages by total number of speakers.

Contents

It is difficult to define what constitutes a language as opposed to a dialect. For example, while Arabic is sometimes considered a single language centred on Modern Standard Arabic, other authors consider its mutually unintelligible varieties separate languages. [1] Similarly, Chinese is sometimes viewed as a single language because of a shared culture and common literary language, but sometimes considered multiple languages. [2] Conversely, colloquial registers of Hindi and Urdu are almost completely mutually intelligible and are sometimes classified as one language, Hindustani. Rankings of languages should therefore be used with caution, as it is not possible to devise a coherent set of linguistic criteria for distinguishing languages in a dialect continuum. [3]

There is no single criterion for how much knowledge is sufficient to be counted as a second-language (L2) speaker. For example, English has about 450 million native speakers but depending on the criterion chosen can be said to have as many as two billion speakers. [4]

There are also difficulties in obtaining reliable counts of speakers, which vary over time because of population change and language shift. In some areas, there are no reliable census data, the data are not current, or the census may not record languages spoken or may record them ambiguously. Speaker populations may be exaggerated for political reasons, or speakers of minority languages may be underreported in favor of a national language. [5]

Ethnologue (2025)

Ethnologue lists the following languages as having 50 million or more total speakers. [6] This section does not include entries that Ethnologue identifies as macrolanguages encompassing several varieties, such as Arabic, Lahnda, Persian, Malay, Pashto, and Chinese.

Most spoken languages, Ethnologue, 2025 [6]
Language Family BranchNumbers of speakers
(millions)
First-
language

(L1)
Second-
language

(L2)
Total
(L1+L2)
English
(excl. creole languages)
Indo-European Germanic 3901,1381,528
Mandarin Chinese
(incl. Standard Chinese but excl. other varieties)
Sino-Tibetan Sinitic 9901941,184
Hindi
(excl. Urdu)
Indo-European Indo-Aryan 345264609
Spanish
(excl. creole languages)
Indo-European Romance 48474558
Modern Standard Arabic
(excl. dialects)
Afro-Asiatic Semitic 0 [a] 335335
French
(excl. creole languages)
Indo-European Romance 74238312
Bengali Indo-European Indo-Aryan 24243284
Portuguese
(excl. creole languages)
Indo-European Romance 25017267
Russian Indo-European Balto-Slavic 145108253
Indonesian Austronesian Malayo-Polynesian 75177252
Urdu
(excl. Hindi)
Indo-European Indo-Aryan 78168246
Standard German Indo-European Germanic 7658134
Japanese Japonic 1242126
Nigerian Pidgin English Creole Krio 5116121
Egyptian Arabic
(excl. other Arabic dialects)
Afro-Asiatic Semitic 8435119
Marathi Indo-European Indo-Aryan 831699
Vietnamese Austroasiatic Vietic 861197
Telugu Dravidian South-Central 831396
Hausa Afro-Asiatic Chadic 583694
Turkish Turkic Oghuz 85691
Western Punjabi
(excl. Eastern Punjabi)
Indo-European Indo-Aryan 90
Swahili Niger–Congo Bantu 48387
Tagalog [b] Austronesian Malayo-Polynesian 335487
Tamil Dravidian South 79886
Yue Chinese
(incl. Cantonese)
Sino-Tibetan Sinitic 85186
Wu Chinese
(incl. Shanghainese)
Sino-Tibetan Sinitic 83<183
Iranian Persian
(excl. other Persian dialects)
Indo-European Iranian 651783
Korean Koreanic 81<182
Thai Kra–Dai Zhuang–Tai 274471
Javanese Austronesian Malayo-Polynesian 69
Italian Indo-European Romance 63366
Gujarati Indo-European Indo-Aryan 58562
Levantine Arabic
(excl. other Arabic dialects)
Afro-Asiatic Semitic 58360
Amharic Afro-Asiatic Semitic 352560
Kannada Dravidian South 441559
Bhojpuri Indo-European Indo-Aryan 53<153
Sudanese Arabic
(excl. other Arabic dialects)
Afro-Asiatic Semitic 411152

The World Factbook (2022)

The World Factbook , produced by the US Central Intelligence Agency (CIA), estimates the ten most spoken languages (L1 + L2) in 2022 as follows: [8]

Most spoken languages, CIA, 2022 [8]
LanguagePercentage of world population (2022)
English 18.8%
Mandarin Chinese 13.8%
Hindi 7.5%
Spanish 6.9%
French 3.4%
Arabic 3.4%
Bengali 3.4%
Russian 3.2%
Portuguese 3.2%
Urdu 2.9%

See also

Explanatory notes

  1. Modern Standard Arabic (MSA) is not an L1. Arabic speakers first learn their respective local dialect. MSA is acquired through formal education. [7]
  2. Tagalog and Filipino are defined as two different languages in the ISO 639 standard. Ethnologue considers that Filipino is a standardized variety of the Tagalog language with no speakers.

References

  1. Kaye, Alan S.; Rosenhouse, Judith (1997). "Arabic Dialects and Maltese". In Hetzron, Robert (ed.). The Semitic Languages. Routledge. pp. 263–311. ISBN   978-0-415-05767-7.
  2. Norman, Jerry (1988). Chinese. Cambridge University Press. p. 2. ISBN   978-0-521-29653-3.
  3. Paolillo, John C.; Das, Anupam (31 March 2006). "Evaluating language statistics: the Ethnologue and beyond" (PDF). UNESCO Institute of Statistics. pp. 3–5. Retrieved 17 November 2018.
  4. Crystal, David (March 2008). "Two thousand million?". English Today. 24: 3–6. doi: 10.1017/S0266078408000023 . S2CID   145597019.
  5. Crystal, David (1988). The Cambridge Encyclopedia of Language . Cambridge University Press. pp.  286–287. ISBN   978-0-521-26438-9.
  6. 1 2 "What are the top 200 most spoken languages?". Ethnologue. 2025. Retrieved 8 March 2025.
  7. Arabic, Standard at Ethnologue (27th ed., 2024) Closed Access logo transparent.svg
  8. 1 2 "Most spoken languages in the World". The World Factbook . CIA . Retrieved 2022-01-01.