In linguistics and language technology, a language resource is a "[composition] of linguistic material used in the construction, improvement and/or evaluation of language processing applications, (...) in language and language-mediated research studies and applications." [1]
According to Bird & Simons (2003), [2] this includes
In a narrower sense, the term language resource is applied specifically to resources that are available in digital form, in which case it is understood as "encompassing (a) data sets (textual, multimodal/multimedia and lexical data, grammars, language models, etc.) in machine readable form, and (b) tools/technologies/services used for their processing and management". [1]
As of May 2020, no widely used standard typology of language resources has been established (current proposals include the LREMap, [3] METASHARE, [4] and, for data, the LLOD classification). Important classes of language resources include
A major concern of the language resource community has been to develop infrastructures and platforms to present, discuss and disseminate language resources. Selected contributions in this regard include:
Standards and best practices for language resources are the subject of several community groups and standardization efforts, including