This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these template messages)
|
Part of a series on |
Indo-European topics |
---|
This is a list of languages in the Indo-European language family. It contains a large number of individual languages, together spoken by roughly half the world's population.
The Indo-European languages include some 449 (SIL estimate, 2018 edition [1] ) languages spoken by about 3.5 billion people or more (roughly half of the world population). Most of the major languages belonging to language branches and groups in Europe, and western and southern Asia, belong to the Indo-European language family. This is thus the biggest language family in the world by number of mother tongue speakers (but not by number of languages: by this measure it is only the 3rd or 5th biggest). Eight of the top ten biggest languages, by number of native speakers, are Indo-European. One of these languages, English, is the de facto world lingua franca, with an estimate of over one billion second language speakers.
Indo-European language family has 10 known branches or subfamilies, of which eight are living and two are extinct. Most of the subfamilies or linguistic branches in this list contain many subgroups and individual languages. The relationships between these branches (how they are related to one another and branched from the ancestral proto-language) are a matter of further research and not yet fully known. There are some individual Indo-European languages that are unclassified within the language family; they are not yet classified in a branch and could constitute a separate branch.
The 449 Indo-European languages identified in the SIL estimate, 2018 edition, [1] are mostly living languages. If all the known extinct Indo-European languages are added, they number more than 800 or close to one thousand. This list includes all known Indo-European languages, living and extinct.
The distinction between a language and a dialect is not clear-cut and simple: in many areas there is a dialect continuum, with transitional dialects and languages. Further, there is no agreed standard criterion for what amount of differences in vocabulary, grammar, pronunciation and prosody are required to constitute a separate language, as opposed to a mere dialect. Mutual intelligibility can be considered, but there are closely related languages that are also mutual intelligible to some degree, even if it is an asymmetric intelligibility. Or there may be cases where between three dialects, A, B, and C, A and B are mutually intelligible, B and C are mutually intelligible, but A and C are not. In such circumstances grouping the three dielects becomes impossible. Because of this, in this list, several dialect groups and some individual dialects of languages are shown (in italics), especially if a language is or was spoken by a large number of people and over a large land area, but also if it has or had divergent dialects.
The ancestral population and language, Proto-Indo-Europeans that spoke Proto-Indo-European, are estimated to have lived about 4500 BCE (6500 BP). At some point in time, starting about 4000 BCE (6000 BP), this population expanded through migration and cultural influence. This started a complex process of population blend or population replacement, acculturation and language change of peoples in many regions of western and southern Eurasia. [2] This process gave origin to many languages and branches of this language family.
By around 1000 BCE, there were many millions of Indo-European speakers, and they lived in a vast geographical area which covered most of western and southern Eurasia (including western Central Asia).
In the following two millennia the number of speakers of Indo-European languages increased even further.
Indo-European languages continued to be spoken in large land areas, although most of western Central Asia and Asia Minor were lost to other language families (mainly Turkic) due to Turkic expansion, conquests and settlement (after the middle of the first millennium AD and the beginning and middle of the second millennium AD respectively) and also to Mongol invasions and conquests (which changed Central Asia ethnolinguistic composition). Another land area lost to non-Indo-European languages was today's Hungary, due to Magyar/Hungarian (Uralic language speakers) conquest and settlement.
However, from about AD 1500 onwards, Indo-European languages expanded their territories to North Asia (Siberia), through Russian expansion, and North America, South America, Australia and New Zealand as the result of the age of European discoveries and European conquests through the expansions of the Portuguese, Spanish, French, English and the Dutch. (These peoples had the biggest continental or maritime empires in the world and their countries were major powers.)
The contact between different peoples and languages, especially as a result of European colonization, also gave origin to the many pidgins, creoles and mixed languages that are mainly based in Indo-European languages (many of which are spoken in island groups and coastal regions).
Although all Indo-European languages descend from a common ancestor called Proto-Indo-European, the kinship between the subfamilies or branches (large groups of more closely related languages within the language family), that descend from other more recent proto-languages, is not the same because there are subfamilies that are closer or further, and they did not split-off at the same time, the affinity or kinship of Indo-European subfamilies or branches between themselves is still an unresolved and controversial issue and being investigated.
However, there is some consensus that Anatolian was the first group of Indo-European (branch) to split-off from all the others and Tocharian was the second in which that happened. [3]
Using a mathematical analysis borrowed from evolutionary biology, Donald Ringe and Tandy Warnow propose the following tree of Indo-European branches: [4]
David W. Anthony, following the methodology of Donald Ringe and Tandy Warnow, proposes the following sequence: [4]
Protolanguages that developed into the Indo-European languages
The following is a list of protolanguages of known Indo-European subfamilies and deeper branches.
The list below follows Donald Ringe, Tandy Warnow and Ann Taylor classification tree for Indo-European branches. [5] quoted in Anthony, David W. (2007), The Horse, the Wheel and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World, Princeton University Press.
Transitional Iranian-Indo-Aryan [75] [76] (older name: Kafiri) (according to some scholars [77] [78] there is the possibility that the older name "Kapisi" that was synonymal of Kambojas, related to the ancient Kingdom of Kapisa, in modern-day Kapisa Province, changed to "Kafiri" and came to be confused and assimilated with "kafiri", meaning "infidel" in Arabic and used in Islam)
Indo-European languages whose relationship to other languages in the family is unclear
Unclassified languages that may have been Indo-European or members of other language families (?)
The Baltic languages are a branch of the Indo-European language family spoken natively or as a second language by a population of about 6.5–7.0 million people mainly in areas extending east and southeast of the Baltic Sea in Europe. Together with the Slavic languages, they form the Balto-Slavic branch of the Indo-European family.
There are over 250 languages indigenous to Europe, and most belong to the Indo-European language family. Out of a total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European language. The three largest phyla of the Indo-European language family in Europe are Romance, Germanic, and Slavic; they have more than 200 million speakers each, and together account for close to 90% of Europeans.
The Indo-European languages are a language family native to the overwhelming majority of Europe, the Iranian plateau, and the northern Indian subcontinent. Some European languages of this family—English, French, Portuguese, Russian, Dutch, and Spanish—have expanded through colonialism in the modern period and are now spoken across several continents. The Indo-European family is divided into several branches or sub-families, of which there are eight groups with languages still alive today: Albanian, Armenian, Balto-Slavic, Celtic, Germanic, Hellenic, Indo-Iranian, and Italic/Romance; and another nine subdivisions that are now extinct.
The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic, spoken during the Early Middle Ages, which in turn is thought to have descended from the earlier Proto-Balto-Slavic language, linking the Slavic languages to the Baltic languages in a Balto-Slavic group within the Indo-European family.
The Semitic languages are a branch of the Afroasiatic language family. They include Arabic, Amharic, Aramaic, Hebrew, and numerous other ancient and modern languages. They are spoken by more than 330 million people across much of West Asia, North Africa, the Horn of Africa, Malta, and in large immigrant and expatriate communities in North America, Europe, and Australasia. The terminology was first used in the 1780s by members of the Göttingen school of history, who derived the name from Shem, one of the three sons of Noah in the Book of Genesis.
Asia is home to hundreds of languages comprising several families and some unrelated isolates. The most spoken language families on the continent include Austroasiatic, Austronesian, Japonic, Dravidian, Indo-European, Afroasiatic, Turkic, Sino-Tibetan, Kra–Dai and Koreanic. Many languages of Asia, such as Chinese, Sanskrit, Arabic, or Tamil, have a long history as a written language.
A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated varieties may not be. This is a typical occurrence with widely spread languages and language families around the world, when these languages did not spread recently. Some prominent examples include the Indo-Aryan languages across large parts of India, varieties of Arabic across north Africa and southwest Asia, the Turkic languages, the Chinese languages or dialects, and parts of the Romance, Germanic and Slavic families in Europe. Terms used in older literature include dialect area and L-complex.
An isogloss, also called a heterogloss, is the geographic boundary of a certain linguistic feature, such as the pronunciation of a vowel, the meaning of a word, or the use of some morphological or syntactic feature. Major dialects are typically demarcated by bundles of isoglosses, such as the Benrath line that distinguishes High German from the other West Germanic languages and the La Spezia–Rimini Line that divides the Northern Italian languages and Romance languages west of Italy from Central Italian dialects and Romance languages east of Italy. However, an individual isogloss may or may not have any coterminus with a language border. For example, the front-rounding of /y/ cuts across France and Germany, while the /y/ is absent from Italian and Spanish words that are cognates with the /y/-containing French words.
The South Slavic languages are one of three branches of the Slavic languages. There are approximately 30 million speakers, mainly in the Balkans. These are separated geographically from speakers of the other two Slavic branches by a belt of German, Hungarian and Romanian speakers.
A regional language is a language spoken in a region of a sovereign state, whether it be a small area, a federated state or province or some wider area.
The Iranian languages, alternately called the Iranic languages, are a branch of the Indo-Iranian languages in the Indo-European language family that are spoken natively by the Iranian peoples, predominantly in the Iranian Plateau.
There have been many languages spoken in the Iberian Peninsula.
The Eastern Iranian languages are a subgroup of the Iranian languages, having emerged during the Middle Iranian era. The Avestan language is often classified as early Eastern Iranian. As opposed to the Middle-era Western Iranian dialects, the Middle-era Eastern Iranian dialects preserve word-final syllables.
The official language of Greece is Greek, spoken by 99% of the population. In addition, a number of non-official, minority languages and some Greek dialects are spoken as well. The most common foreign languages learned by Greeks are English, German, French and Italian.
The Indo-European migrations are hypothesized migrations of Proto-Indo-European language (PIE) speakers, and subsequent migrations of people speaking derived Indo-European languages, which took place approx. 4000 to 1000 BCE, potentially explaining how these languages came to be spoken across a large area of Eurasia, spanning from the Indian subcontinent and Iranian plateau to Atlantic Europe, in a process of cultural diffusion.
The Albanian–Romanian linguistic relationship is a subject of historical linguistic research applied to the Albanian and Romanian languages. It has also been studied to understand the ethnogenesis of both peoples. The common phonological, morphological and syntactical features of the two languages have been studied for more than a century. Both languages are part of the Balkan sprachbund but there are certain elements shared only by Albanian and Romanian and its close relatives descended from Common Romanian. Aside from Latin, and from shared Greek, Slavic and Turkish elements, other characteristics and words are attributed to the Paleo-Balkan linguistic base: Illyrian, Thracian, Dacian and/or Thraco-Illyrian, Daco-Thracian. Similarities between Romanian and Albanian are not limited to their common Balkan features and the assumed substrate words: the two languages share calques and proverbs, and display analogous phonetic changes.