The LINGUIST List is an online resource for the academic field of linguistics. It was founded by Anthony Aristar in early 1990 at the University of Western Australia, [1] and is used as a reference by the National Science Foundation in the United States. [2] Its main and oldest feature is the premoderated electronic mailing list, with subscribers all over the world.
Between 1991 and 2013, the service was run by Anthony Aristar and Helen Aristar-Dry. In 1991, it moved from Australia to Texas A&M University, and Eastern Michigan University was established as the main editing site.[ citation needed ] By 1994, there were over 5,000 subscribers. [3] From 14 October through 6 November 1996, it held its first on-line conference, Geometric and Thematic Structure in Binding, devoted to the Binding Theory and opened by the keynote address by Howard Lasnik. [4] LINGUIST List moved from Texas A&M to its own site in 1997. Wayne State University in Michigan was established as the second editing site in 1998, but in 2006 all its operations moved to nearby Eastern Michigan University. In 2013, Aristar-Dry and Aristar retired from Eastern Michigan University and Damir Cavar became the moderator and director of operations. In 2014 Malgorzata E. Cavar became the second moderator. In 2014, LINGUIST List was moved to Indiana University and it has been hosted at the Department of Linguistics since then, with Damir Cavar and Malgorzata E. Cavar as the co-directors of the resource operations.[ citation needed ]
The LINGUIST List is funded by its donations from supporting publishers, institutions and its subscribers during the fund drive month each spring. Some LINGUIST List projects were funded by grants from the National Science Foundation.[ citation needed ] In recent years it has become a site for research into linguistic infrastructure on the web, and has received numerous grants from the National Science Foundation to do this work. [5] [ failed verification ]
The LINGUIST List hosts two mailing lists LINGUIST and LINGLITE:
The LINGUIST List mailing lists are free and open for subscription using a web interface. [8]
Everybody can submit postings to The LINGUIST List lists without being subscribed or in any way a registered member. [9] A web interface is used to submit postings to the lists. [10]
The LINGUIST List has been one of the resources for the creation of the new ISO 639-3 language identification standard (aiming to classify all known languages with an alpha-3 language code). [11] While the Ethnologue was used as the resource for natural languages currently in use, Linguist List has provided the information on historic varieties, ancient languages, international auxiliary languages and constructed languages.
The LINGUIST List has also received grants for
The EMELD project [17] was the instigator of the GOLD ontology, the furthest advanced of the current attempts to build an ontology for the morphosyntax of linguistic data. [18] It has also produced a phonetics ontology, based upon Peter Ladefoged's and Ian Maddieson's The Sounds of the World's Languages .
Some projects emerged from funded or internal activities at LINGUIST List:
Ethnologue: Languages of the World is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensive catalogue of languages. It was first issued in 1951, and is now published by SIL International, an American evangelical Christian non-profit organization.
Corpus linguistics is an empirical method for the study of language by way of a text corpus. Corpora are balanced, often stratified collections of authentic, "real world", text of speech or writing that aim to represent a given linguistic variety. Today, corpora are generally machine-readable data collections.
SIL Global is an evangelical Christian nonprofit organization whose main purpose is to study, develop and document languages, especially those that are lesser-known, in order to expand linguistic knowledge, promote literacy, translate the Christian Bible into local languages, and aid minority language development.
ISO 639-3:2007, Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages, is an international standard for language codes in the ISO 639 series. It defines three-letter codes for identifying languages. The standard was published by International Organization for Standardization (ISO) on 1 February 2007.
Northern Ndebele, also called Ndebele, isiNdebele saseNyakatho, Zimbabwean Ndebele, Sindebele or North Ndebele, associated with the term Matabele, is a Bantu language spoken by the Northern Ndebele people which belongs to the Nguni group of languages.
Language documentation is a subfield of linguistics which aims to describe the grammar and use of human languages. It aims to provide a comprehensive record of the linguistic practices characteristic of a given speech community. Language documentation seeks to create as thorough a record as possible of the speech community for both posterity and language revitalization. This record can be public or private depending on the needs of the community and the purpose of the documentation. In practice, language documentation can range from solo linguistic anthropological fieldwork to the creation of vast online archives that contain dozens of different languages, such as FirstVoices or OLAC.
The Pangloss Collection is a digital library whose objective is to store and facilitate access to audio recordings in endangered languages of the world. Developed by the LACITO centre of CNRS in Paris, the collection provides free online access to documents of connected, spontaneous speech, in otherwise little-documented languages of all continents.
Geospatial metadata is a type of metadata applicable to geographic data and information. Such objects may be stored in a geographic information system (GIS) or may simply be documents, data-sets, images or other objects, services, or related items that exist in some other native environment but whose features may be appropriate to describe in a (geographic) metadata catalog.
The EXtensible Cross-Linguistic Automatic Information Machine (EXCLAIM) was an integrated tool for cross-language information retrieval (CLIR), created at the University of California, Santa Cruz in early 2006, with some support for more than a dozen languages. The lead developers were Justin Nuger and Jesse Saba Kirchner.
This page is a list of lists of languages.
Linguistic categories include
Language resource management – Lexical markup framework, produced by ISO/TC 37, is the ISO standard for natural language processing (NLP) and machine-readable dictionary (MRD) lexicons. The scope is standardization of principles and methods relating to language resources in the contexts of multilingual communication.
Glottolog is an open-access online bibliographic database of the world's languages. In addition to listing linguistic materials describing individual languages, the database also contains the most up-to-date language affiliations based on the work of expert linguists.
Living Languages is an international non-profit organisation which was established to advance the sustainability of the world's Indigenous languages.
In natural language processing, linguistics, and neighboring fields, Linguistic Linked Open Data (LLOD) describes a method and an interdisciplinary community concerned with creating, sharing, and (re-)using language resources in accordance with Linked Data principles. The Linguistic Linked Open Data Cloud was conceived and is being maintained by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation, but has been a point of focal activity for several W3C community groups, research projects, and infrastructure efforts since then.
The Endangered Languages Project (ELP) is a worldwide collaboration between indigenous language organizations, linguists, institutions of higher education, and key industry partners to strengthen endangered languages. The foundation of the project is a website, which launched in June 2012.
The Catalogue of Endangered Languages (ELCat) is a major resource for information on the endangered languages of the world. It is available to the public via the Endangered Languages Project website.
Helen Aristar-Dry is an American linguist who currently serves as the series editor for SpringerBriefs in Linguistics. Most notably, from 1991 to 2013 she co-directed The LINGUIST List with Anthony Aristar. She has served as principal investigator or co-Principal Investigator on over $5,000,000 worth of research grants from the National Science Foundation and the National Endowment for the Humanities. She retired as Professor of English Language and Literature from Eastern Michigan University in 2013.
Andrea Berez-Kroeker is a documentary linguist and professor in the Department of Linguistics at the University of Hawaiʻi at Mānoa. She is the director of the Kaipuleohone archive of endangered languages. She is an expert on the practices of reproducibility and management of data in the field of linguistics.
In linguistics and language technology, a language resource is a "[composition] of linguistic material used in the construction, improvement and/or evaluation of language processing applications, (...) in language and language-mediated research studies and applications."