The Austroasiatic languages are a large language family spoken throughout Mainland Southeast Asia, South Asia and East Asia. These languages are natively spoken by the majority of the population in Vietnam and Cambodia, and by minority populations scattered throughout parts of Thailand, Laos, India, Myanmar, Malaysia, Bangladesh, Nepal, and southern China. Approximately 117 million people speak an Austroasiatic language, of whom more than two-thirds are Vietnamese speakers. Of the Austroasiatic languages, only Vietnamese, Khmer, and Mon have lengthy, established presences in the historical record. Only two are presently considered to be the national languages of sovereign states: Vietnamese in Vietnam, and Khmer in Cambodia. The Mon language is a recognized indigenous language in Myanmar and Thailand, while the Wa language is a "recognized national language" in the de facto autonomous Wa State within Myanmar. Santali is one of the 22 scheduled languages of India. The remainder of the family's languages are spoken by minority groups and have no official status.
The Kra–Dai languages are a language family in mainland Southeast Asia, southern China, and northeastern India. All languages in the family are tonal, including Thai and Lao, the national languages of Thailand and Laos, respectively. Around 93 million people speak Kra–Dai languages; 60% of those speak Thai. Ethnologue lists 95 languages in the family, with 62 of these being in the Tai branch.
Vietnamese is an Austroasiatic language spoken primarily in Vietnam, where it is the national and official language. Vietnamese is spoken natively by around 85 million people, several times as many as the rest of the Austroasiatic family combined. It is the native language of the Vietnamese (Kinh) people, as well as a second or first language for other ethnic groups in Vietnam, and it is also used by the Vietnamese diaspora around the world.
Asia is home to hundreds of languages comprising several families and some unrelated isolates. The most spoken language families on the continent include Austroasiatic, Austronesian, Japonic, Dravidian, Indo-European, Afroasiatic, Turkic, Sino-Tibetan, Kra–Dai and Koreanic. Many languages of Asia, such as Chinese, Sanskrit, Arabic, Tamil or Telugu, have long histories as written languages.
A sprachbund, also known as a linguistic area, area of linguistic convergence, or diffusion area, is a group of languages that share areal features resulting from geographical proximity and language contact. The languages may be genetically unrelated, or only distantly related, but the sprachbund characteristics might give a false appearance of relatedness.
The Hmong–Mien languages are a highly tonal language family of southern China and northern Southeast Asia. They are spoken in mountainous areas of southern China, including Guizhou, Hunan, Yunnan, Sichuan, Guangxi, Guangdong and Hubei provinces; the speakers of these languages are predominantly "hill people", in contrast to the neighboring Han Chinese, who have settled the more fertile river valleys.
The Austric languages are a proposed language family that includes the Austronesian languages spoken in Taiwan, Maritime Southeast Asia, the Pacific Islands, and Madagascar, as well as Kra–Dai and Austroasiatic languages spoken in Mainland Southeast Asia and South Asia. A genetic relationship between these language families is seen as plausible by some scholars, but remains unproven.
Indosphere is a term coined by the linguist James Matisoff for areas of Indian linguistic influence in the neighboring Southern Asian, Southeast Asian, and East Asian regions. It is commonly used in areal linguistics in contrast with the Sinophone languages of the Sinosphere, the Mainland Southeast Asia linguistic area. Notably, unlike terms such as Lusophone or Francophone, which refer to the multinational spread and influence of a single language with multiple dialects, this term refers to all languages that are considered to originate in India, which alone has 22 recognised languages across several major language families, including Indo-European and Dravidian. It considers these languages collectively with regard to their influence on the languages of other countries, rather than only from the perspective of the spread of a single language.
The peopling of Thailand refers to the process by which the ethnic groups that comprise the population of present-day Thailand came to inhabit the region.
The Austro-Tai languages, sometimes also Austro-Thai languages, are a proposed language family that comprises the Austronesian languages and Kra–Dai languages.
Primarily in Austroasiatic languages, a minor syllable is a reduced syllable that is followed by a full tonic or stressed syllable in a typical word. The minor syllable may contain a reduced vowel, as in colloquial Khmer, or no vowel at all, as in the Mlabri words for 'navel' and 'underneath', and Khasi kyndon 'rule', syrwet 'sign', kylla 'transform', symboh 'seed' and tyngkai 'conserve'.
Sino-Austronesian or Sino-Tibetan-Austronesian is a proposed language family suggested by Laurent Sagart in 1990. Using reconstructions of Old Chinese, Sagart argued that the Austronesian languages are related to the Sinitic languages phonologically, lexically and morphologically. Sagart later accepted the Sino-Tibetan languages as a valid group and extended his proposal to include the rest of Sino-Tibetan. He also placed the Tai–Kadai languages within the Austronesian family as a sister branch of Malayo-Polynesian. The proposal has been largely rejected by other linguists, who argue that the similarities between Austronesian and Sino-Tibetan more likely arose from contact than from a genetic relationship.
Proto-Tai is the reconstructed proto-language of all the Tai languages, including modern Lao, Shan, Tai Lü, Tai Dam, Ahom, Northern Thai, Standard Thai, Bouyei, and Zhuang. The Proto-Tai language is not directly attested by any surviving texts, but has been reconstructed using the comparative method.
There have been various classification schemes for Southeast Asian languages.
Proto-Hmong–Mien (PHM), also known as Proto-Miao–Yao, is the reconstructed ancestor of the Hmong–Mien languages. Lower-level reconstructions include Proto-Hmongic and Proto-Mienic.
The Mainland Southeast Asia linguistic area is a sprachbund including languages of the Sino-Tibetan, Hmong–Mien, Kra–Dai, Austronesian and Austroasiatic families spoken in an area stretching from Thailand to China. Neighbouring languages across these families, though presumed unrelated, often have similar typological features, which are believed to have spread by diffusion. James Matisoff referred to this area as the "Sinosphere", contrasted with the "Indosphere", but viewed it as a zone of mutual influence in the ancient period.
The East Asian languages are a language family proposed by Stanley Starosta in 2001. The proposal has since been adopted by George van Driem and others.
Proto-Kra–Dai is the proposed reconstructed ancestor of the Kra–Dai languages.
Ilia Peiros is a Russian linguist who specializes in the historical linguistics of East Asia. Peiros is a well-known scholar in the Moscow School of Comparative Linguistics, known for its work on long-range comparative linguistics. Peiros is affiliated with the Santa Fe Institute in New Mexico, United States, and is a former faculty member at the University of Melbourne.
The Old Yue language is an unattested, unclassified language, or group(s) of various languages, spoken in ancient southern China and northern Vietnam circa 700 BCE or later. It can refer to Yue, which was spoken in the realm of Yue during the Spring and Autumn period. It can also refer to the different languages spoken by the Baiyue. The languages they spoke may have been of Kra–Dai, Hmong–Mien, Austronesian, Austroasiatic and other origins.