This article needs additional citations for verification .(February 2017) |
The ISLRN or International Standard Language Resource Number is Persistent Unique Identifier for Language Resources. [1]
On November 18, 2013, [2] 12 major organisations (see list below) from the fields Language Resources and Technologies, Computational Linguistics, and Digital Humanities held a cooperation meeting in Paris (France) and agreed to announce the establishment of the International Standard Language Resource Number (ISLRN), to be assigned to each Language Resource.
Among the 12 organisations, 4 institutions constitute the ISLRN Steering Committee (ST)
The Joint Research Centre(JRC), [3] the [European Commission]'s in-house science service, was the first organisation to adopt the ISLRN initiative and requested. 2500 resources and tools have already been allocated an ISLRN. These resources include written data (Annotated corpus, Annotated text, List of misspelled word, Terminological database, Treebank, Wordnet, etc.) and speech corpora (Synthesised Speech, Transcripts and Audiovisual Recordings, Conversational Speech, Folk Sayings, etc.)
Providing Language Resources with unique names and identifiers using a standardized nomenclature ensures the identification of each Language Resources and streamlines the citation with proper references in activities within Human Language Technology as well as in documents and scientific publications. Such unique identifier also enhances the reproducibility, [4] an essential feature of scientific work.
The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.
An International Standard Serial Number (ISSN) is an eight-digit serial number used to uniquely identify a serial publication, such as a magazine. The ISSN is especially helpful in distinguishing between serials with the same title. ISSNs are used in ordering, cataloging, interlibrary loans, and other practices in connection with serial literature.
A digital object identifier (DOI) is a persistent identifier or handle used to uniquely identify various objects, standardized by the International Organization for Standardization (ISO). DOIs are an implementation of the Handle System; they also fit within the URI system. They are widely used to identify academic, professional, and government information, such as journal articles, research reports, data sets, and official publications. DOIs have also been used to identify other types of information resources, such as commercial videos.
UniProt is a freely accessible database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from the research literature. It is maintained by the UniProt consortium, which consists of several European bioinformatics organisations and a foundation from Washington, DC, United States.
The European Language Resources Association (ELRA) is a not-for-profit organisation established under the law of the Grand Duchy of Luxembourg. Its seat is in Luxembourg and its headquarters is in Paris, France.
Digital Earth is the name given to a concept by former US vice president Al Gore in 1998, describing a virtual representation of the Earth that is georeferenced and connected to the world's digital knowledge archives.
Linguistic categories include
ISO/TC 37 is a technical committee within the International Organization for Standardization (ISO) that prepares standards and other documents concerning methodology and principles for terminology and language resources.
Central Scientific Instruments Organisation is a Chandigarh, India-based national laboratory dedicated to research, design and development of scientific, and industrial instruments. It is one of the constituent laboratories of the Council of Scientific and Industrial Research (CSIR), an Indian industrial research and development organisation situated at Chandigarth's Sector 30C.
The International Generic Sample Number or IGSN is a persistent identifier for sample. As an active persistent identifier it can be resolved through the Handle System. The system is used in production by the System for Earth Sample Registration (SESAR), Geoscience Australia, Commonwealth Scientific and Industrial Research Organisation Mineral Resources, Australian Research Data Commons (ARDC), University of Bremen MARUM, German Research Centre for Geosciences (GFZ), IFREMER Institut Français de Recherche pour l'Exploitation de la Mer, Korea Institute of Geoscience & Mineral Resources (KIGAM), and University of Kiel. Other organisations are preparing the introduction of the IGSN.
The International Standard Name Identifier (ISNI) is an identifier system for uniquely identifying the public identities of contributors to media content such as books, television programmes, and newspaper articles. Such an identifier consists of 16 digits. It can optionally be displayed as divided into four blocks.
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
The UNESCO Institute for Statistics (UIS) is the statistical office of UNESCO and is the UN depository for cross-nationally comparable statistics on education, science and technology, culture, and communication.
ISO/IEC 19788Information technology – Learning, education and training – Metadata for learning resources is a multi-part standard prepared by subcommittee SC 36 of the joint technical committee ISO/IEC JTC 1, Information Technology for Learning, Education and Training.
The Science and Technology Information Center (STIC) is an Ethiopian organisation which provides information to support scientific and technological (S&T) activities in the country. STIC has published information on the financing of research and development and on the nature and progress of innovative projects, and in 2014 was planning to introduce bibliometric monitoring of publications in S&T. The center has also provided information and communications technology facilities including a digital library, a patent information system, an automated personnel management system, and a S&T-related database.
In natural language processing, linguistics, and neighboring fields, Linguistic Linked Open Data (LLOD) describes a method and an interdisciplinary community concerned with creating, sharing, and (re-)using language resources in accordance with Linked Data principles. The Linguistic Linked Open Data Cloud was conceived and is being maintained by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation, but has been a point of focal activity for several W3C community groups, research projects, and infrastructure efforts since then.
Joseph Mariani is a French computer science researcher and pioneer in the field of speech processing.
Dafydd Gibbon is a British emeritus professor of English and General Linguistics at Bielefeld University in Germany, specialising in computational linguistics, the lexicography of spoken languages, applied phonetics and phonology. He is particularly concerned with endangered languages and has received awards from the Ivory Coast, Nigeria and Poland.
In linguistics and language technology, a language resource is a "[composition] of linguistic material used in the construction, improvement and/or evaluation of language processing applications, (...) in language and language-mediated research studies and applications."