Concepticon

Producer: Max Planck Institute for the Science of Human History (Germany)
Languages: English
Cost: Free
Disciplines: Linguistics
Website: concepticon.clld.org

Concepticon is an open-source [1] online lexical database of linguistic concept lists (word lists). It links the concept labels (word list glosses) used in these lists to concept sets (standardized word meanings). [2] [3]


It is part of the Cross-Linguistic Linked Data (CLLD) project, which is hosted by the Max Planck Institute for the Science of Human History in Jena, Germany. [4] Version 1.0 was released in 2016.
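The linking of concept labels to concept sets described above can be pictured as a simple lookup. The following is a minimal sketch, not Concepticon's actual data format or API: the `ConceptSet` class, the identifiers `CS-1` and `CS-2`, the list names, and the glosses are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConceptSet:
    """A standardized word meaning that several concept lists can point to."""
    id: str     # hypothetical placeholder, not a real Concepticon identifier
    gloss: str  # standardized label for the meaning


# Hypothetical concept sets (illustrative only).
WATER = ConceptSet("CS-1", "WATER")
FIRE = ConceptSet("CS-2", "FIRE")
HAND = ConceptSet("CS-3", "HAND")

# Each concept list maps its own glosses (concept labels) to concept sets.
# The list names and glosses are invented for this sketch.
LINKS = {
    "list_a": {"water": WATER, "fire": FIRE, "hand": HAND},
    "list_b": {"water (fresh)": WATER, "fire, flame": FIRE},
}


def shared_concept_sets(first, second, links=LINKS):
    """Return the sorted glosses of concept sets linked from both lists."""
    a = set(links[first].values())
    b = set(links[second].values())
    return sorted(cs.gloss for cs in a & b)


print(shared_concept_sets("list_a", "list_b"))
```

Because differently worded glosses resolve to the same concept set, two lists can be compared even when their labels do not match literally.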

Concept

Concept lists in the Concepticon include:

See also

Related Research Articles

Chadic languages: Branch of the Afroasiatic languages

The Chadic languages form a branch of the Afroasiatic language family. They are spoken in parts of the Sahel. They include 150 languages spoken across northern Nigeria, southern Niger, southern Chad, the Central African Republic, and northern Cameroon. The most widely spoken Chadic language is Hausa, a lingua franca of much of inland Eastern West Africa.

Glottochronology is the part of lexicostatistics which involves comparative linguistics and deals with the chronological relationship between languages.

The Swadesh list is a classic compilation of tentatively universal concepts for the purposes of lexicostatistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness of those languages. The Swadesh list is named after linguist Morris Swadesh. It is used in lexicostatistics and glottochronology. Because there are several different lists, some authors also refer to "Swadesh lists".

South Halmahera–West New Guinea languages: Subgroup of the Austronesian language family

The South Halmahera–West New Guinea (SHWNG) languages are a branch of the Malayo-Polynesian languages, found in the islands and along the shores of the Halmahera Sea in the Indonesian province of North Maluku and of Cenderawasih Bay in the provinces of Papua and West Papua. There are 38 languages.

The Grass languages are a group of languages in the Ramu language family. The group is accepted by Foley (2018), but not by Glottolog. The languages are spoken in East Sepik Province, Papua New Guinea, with a small number of speakers also located just across the provincial border in Madang Province.

The Kho-Bwa languages, also known as Bugunish and Kamengic, are a small family of languages spoken in Arunachal Pradesh, northeast India. The name Kho-Bwa was originally proposed by George van Driem (2001). It is based on the reconstructed words *kho ("water") and *bwa ("fire"). Blench (2011) suggests the name Kamengic, from the Kameng area of Arunachal Pradesh, or alternatively Bugun–Mey, after its two main members. Alternatively, Anderson (2014) refers to Kho-Bwa as Northeast Kamengic.

Tegali language: Rashad language spoken in Sudan

Tegali is a Kordofanian language in the Rashad family, which is thought by some to belong to the hypothetical Niger–Congo phylum. It is spoken in South Kordofan state, Sudan.

Extinct languages of the Marañón River basin

The Marañón River basin, at a low point in the Andes which made it an attractive location for trade between the Inca Empire and the Amazon basin, once harbored numerous languages which have been poorly attested or not attested at all. Those of the middle reaches of the river, above the Amazon basin, were replaced in historical times by Aguaruna, a Jivaroan language from the Amazon which is still spoken there. The languages further upriver are difficult to identify, due to lack of data. The region was multilingual at the time of the Conquest, and the people largely switched to Spanish rather than to Quechua, though Quechua also expanded during Colonial times.

Ngkolmpu Kanum, or Ngkontar, is part of a dialect chain in the Yam family spoken by the Kanum people of New Guinea. The Ngkâlmpw (Ngkontar) and moribund Bädi varieties have limited mutual intelligibility and may be considered distinct languages.

The Leipzig–Jakarta list of 100 words is used by linguists to test the degree of chronological separation of languages by comparing words that are resistant to borrowing. The Leipzig–Jakarta list became available in 2009.

Yelmek, also rendered Jelmek or Jelmik, is a language of the proposed Trans-Fly – Bulaka River family in West Papua.

Kĕnaboi is an extinct unclassified language of Negeri Sembilan, Malaysia, that may be a language isolate or an Austroasiatic language belonging to the Aslian branch. It is attested in what appear to be two dialects, based on word lists of about 250 lexical items, presumably collected around 1870–90.

Duli is an extinct Adamawa language of northern Cameroon.

Parsi has been used as a name for several languages of South Asia and Iran, some of them spurious:

The Automated Similarity Judgment Program (ASJP) is a collaborative project applying computational approaches to comparative linguistics using a database of word lists. The database is open access and consists of 40-item basic-vocabulary lists for well over half of the world's languages. It is continuously being expanded. In addition to isolates and languages of demonstrated genealogical groups, the database includes pidgins, creoles, mixed languages, and constructed languages. Words of the database are transcribed into a simplified standard orthography (ASJPcode). The database has been used to estimate dates at which language families have diverged into daughter languages by a method related to but still different from glottochronology, to determine the homeland (Urheimat) of a proto-language, to investigate sound symbolism, to evaluate different phylogenetic methods, and several other purposes.

Glottolog is a bibliographic database of the world's lesser-known languages, developed and maintained first at the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany. Its main curators include Harald Hammarström and Martin Haspelmath.

The Cross-Linguistic Linked Data (CLLD) project coordinates over a dozen linguistics databases covering the languages of the world. It is hosted by the Department of Linguistic and Cultural Evolution at the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany.

Lexibank is a linguistics database managed by the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany. The database consists of over 100 standardized wordlists (datasets) that are independently curated.

Johann-Mattis List is a German scientist. He is known for his work on quantitative comparative linguistics. List is currently Senior Scientist at the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany.

References

  1. Adding concept lists to Concepticon: A guide for beginners
  2. List, Johann-Mattis and Cysouw, Michael and Forkel, Robert. 2016. Concepticon: A Resource for the Linking of Concept Lists. In Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis (eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation, 2393–2400. European Language Resources Association (ELRA).
  3. List, Johann-Mattis (2018). "Exporting Sublists from a Wordlist with LingPy and Concepticon," in Computer-Assisted Language Comparison in Practice. https://calc.hypotheses.org/58
  4. Haspelmath, Martin. "Max Planck diversity linguistics redux: Welcome to 'Linguistic and Cultural Evolution' in Jena". Hypotheses: Diversity Linguistics Comment (blog). Retrieved 28 March 2015.