CLARIN

Last updated
Common Language Resources and Technology Infrastructure
AbbreviationCLARIN
Formation2012-02-29 [1]
Type ERIC, IGO
HeadquartersUtrecht, Netherlands
Region served
Europe and beyond
Membership
Member countries: Austria, Belgium, Bulgaria, Cyprus, Czech Republic, Croatia, Denmark, Estonia, Finland, Germany, Greece, Hungary, Iceland, Italy, Latvia, Lithuania, Netherlands, Norway, Poland, Portugal, Slovenia, South Africa, Spain, Sweden; Observers: Switzerland, United Kingdom
Official language
English
Executive Director
Darja Fišer [2]
Main organ
General Assembly, Board of Directors, National Coordinators Forum
Website www.clarin.eu

Common Language Resources and Technology Infrastructure (CLARIN ERIC) is a European Research Infrastructure Consortium founded in 2012. It comprises national consortia in and outside the European Union, consisting of institutes such as universities, research centres, libraries and public archives. The goal of the consortium is providing access to digital language data collections, to digital tools, and training material for researchers to work with the language resources.

Contents

Objectives

The goal of CLARIN is to make various digital language resources and tools from all over Europe and beyond accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences and to support scholars who want to engage in data-driven research, contributing to a multilingual European Research Area. [3] CLARIN is one of the research infrastructures selected for the European Research Infrastructures Roadmap by ESFRI [4] the European Strategy Forum on Research Infrastructures. In the 2016 ESFRI Roadmap CLARIN was listed as a landmark.

One of the missions of the CLARIN Knowledge Infrastructure is to ensure that the knowledge and expertise available throughout the CLARIN infrastructure is accessible in an organised way, easy to navigate for the CLARIN community and for the Social Sciences and Humanities research community at large. A cornerstone of the CLARIN knowledge sharing ecosystem are the Knowledge Centres, [5] institutions with expertise in one or more aspects of the domain covered by the CLARIN community.

Services and resources

CLARIN offers a wide range of services, [6] some of which are integrated in the European Open Science Cloud (EOSC). [7] They include:

Some of the resources accessible through CLARIN infrastructure include:

Governance

The General Assembly [11] represents the members of CLARIN ERIC and is the highest decision-making body of CLARIN ERIC. It is assisted by an international Scientific Advisory Board. [12] The day-to-day management is in the hands of the Board of Directors [13] chaired by the Executive Director Darja Fišer [14] and supported by the CLARIN Office. [15]

The largest effort towards the further integration of data, tools and expertise stems from the activities in the national consortia. The National Coordinators’ Forum is responsible for the coordination of the collaboration across countries.

Membership

The majority of operations, services and centres of the CLARIN infrastructure is provided and funded by the CLARIN ERIC membership (member, observers and linked third parties). Members and observers can be countries or intergovernmental organisations. They set up a national consortium, typically consisting of universities, research institutions, libraries and public archives [16] ).

Below is the list of current CLARIN ERIC members, [17] observers and third parties:

MembersNational Consortia (NC)
Austria Digital Humanities Austria
Belgium CLARIN-BE
Bulgaria CLaDA-BG
Croatia HR-CLARIN
CyprusCLARIN-CY
Czech Republic LINDAT/CLARIAH-CZ
Denmark CLARIN-DK
Estonia CLARIN Estonia
Finland FIN-CLARIN
Germany CLARIN-D
Greece clarin:el
Hungary HunCLARIN
Iceland CLARIN Iceland
Italy CLARIN-IT
Latvia CLARIN-LV
Lithuania CLARIN-LT
The Netherlands CLARIAH-NL Archived 2021-02-25 at the Wayback Machine
Norway CLARINO
Poland CLARIN PL Archived 2019-03-12 at the Wayback Machine
Portugal PORTULAN CLARIN
Slovenia CLARIN.SI
South Africa SADiLaR
Spain CLARIAH-ES
Sweden SWE-CLARIN
ObserversNational Consortia (NC)
Switzerland CLARIN-CH
United Kingdom CLARIN-UK

Related Research Articles

E-Science or eScience is computationally intensive science that is carried out in highly distributed network environments, or science that uses immense data sets that require grid computing; the term sometimes includes technologies that enable distributed collaboration, such as the Access Grid. The term was created by John Taylor, the Director General of the United Kingdom's Office of Science and Technology in 1999 and was used to describe a large funding initiative starting in November 2000. E-science has been more broadly interpreted since then, as "the application of computer technology to the undertaking of modern scientific investigation, including the preparation, experimentation, data collection, results dissemination, and long-term storage and accessibility of all materials generated through the scientific process. These may include data modeling and analysis, electronic/digitized laboratory notebooks, raw and fitted data sets, manuscript production and draft versions, pre-prints, and print and/or electronic publications." In 2014, IEEE eScience Conference Series condensed the definition to "eScience promotes innovation in collaborative, computationally- or data-intensive research across all disciplines, throughout the research lifecycle" in one of the working definitions used by the organizers. E-science encompasses "what is often referred to as big data [which] has revolutionized science... [such as] the Large Hadron Collider (LHC) at CERN... [that] generates around 780 terabytes per year... highly data intensive modern fields of science...that generate large amounts of E-science data include: computational biology, bioinformatics, genomics" and the human digital footprint for the social sciences.

The Biomedical Informatics Research Network, commonly referred among analysts as “BIRN” is a national proposed project to assist biomedical researchers in their bioscience investigations through data sharing and online collaborations. BIRN provides data-sharing infrastructure, advisory services from a single source and software tools and techniques. This national initiative is funded by NIH Grants, the National Center for Research Resources and the National Institute of General Medical Sciences (NIGMS), a component of the United States National Institutes of Health (NIH).

<span class="mw-page-title-main">Deborah McGuinness</span>

Deborah Louise McGuinness is an American computer scientist and researcher at Rensselaer Polytechnic Institute (RPI). She is a professor of Computer, Cognitive and Web Sciences, Industrial and Systems Engineering, and an endowed chair in the Tetherless World Constellation, a multidisciplinary research institution within RPI that focuses on the study of theories, methods and applications of the World Wide Web. Her fields of expertise include interdisciplinary data integration, artificial intelligence, specifically in knowledge representation and reasoning, description logics, the semantic web, explanation, and trust.

The Center for Research Libraries is a consortium of North American universities, colleges, and independent research libraries, based on a buy-in concept for membership of the consortia. The consortium acquires and preserves traditional and digital resources for research and teaching and makes them available to member institutions through interlibrary loan and electronic delivery. It also gathers and analyzes data pertaining to the preservation of physical and digital resources, and fosters the sharing of expertise, in order to assist member libraries in maintaining their collections.

The European Social Survey (ESS) is a social scientific endeavour to map the attitudes, beliefs and behaviour patterns of the various populations in Europe.
ESS is listed in the Registry of Research Data Repositories re3data.org.

<span class="mw-page-title-main">Cherenkov Telescope Array</span>

The Cherenkov Telescope Array or CTA is a multinational, worldwide project to build a new generation of ground-based gamma-ray instrument in the energy range extending from some tens of GeV to about 300 TeV. It is proposed as an open observatory and will consist of two arrays of imaging atmospheric Cherenkov telescopes (IACTs), a first array at the Northern Hemisphere with emphasis on the study of extragalactic objects at the lowest possible energies, and a second array at the Southern Hemisphere, which is to cover the full energy range and concentrate on galactic sources. The physics program of CTA goes beyond high energy astrophysics into cosmology and fundamental physics.

<span class="mw-page-title-main">MeerKAT</span> 64 antenna radio telescope. South Africa (launched 2018)

MeerKAT, originally the Karoo Array Telescope, is a radio telescope consisting of 64 antennas in the Meerkat National Park, in the Northern Cape of South Africa. In 2003, South Africa submitted an expression of interest to host the Square Kilometre Array (SKA) Radio Telescope in Africa, and the locally designed and built MeerKAT was incorporated into the first phase of the SKA. MeerKAT was launched in 2018.

<span class="mw-page-title-main">Extreme Light Infrastructure</span>

The Extreme Light Infrastructure is a research organization with the world's largest collection of high power-lasers. ELI operates several high-power, high-repetition-rate laser systems which enable the research of physical, chemical, materials, and medical sciences.

<span class="mw-page-title-main">Public Population Project in Genomics</span>

P3G (Public Population Project in Genomicsand Society) is a not-for-profit international consortium dedicated to facilitating collaboration between researchers and biobanks working in the area of human population genomics. P3G is member-based and composed of experts from the different disciplines in the areas of and related to genomics, including epidemiology, law, ethics, technology, biomolecular science, etc. P3G and its members are committed to a philosophy of information sharing with the goal of supporting researchers working in areas that will improve the health of people around the world.

The Archaeology Data Service (ADS) is an open access digital archive for archaeological research outputs. It is located in The King's Manor, at the University of York. Originally intended to curate digital outputs from archaeological researchers based in the UK's Higher Education sector, the ADS also holds archive material created under the auspices of national and local government as well as in the commercial archaeology sector. The ADS carries out research, most of which focuses on resource discovery, cross-searching and interoperability with other relevant archives in the UK, Europe and the United States of America.

A library consortium is any cooperative association of libraries that coordinates resources and/or activities on behalf of its members, whether they are academic, public, school or special libraries, and/or information centers. Library consortia have been created to service specific regions or geographic areas, e.g., local, state, regional, national or international. Many libraries commonly belong to multiple consortia. The goal of a library consortium is to amplify the capabilities and effectiveness of its member libraries through collective action, including, but not limited to, print or electronic resource sharing, reducing costs through group purchases of resources, and hosting professional development opportunities. The “bedrock principle upon which consortia operate is that libraries can accomplish more together than alone.”

Open scientific data or open research data is a type of open data focused on publishing observations and results of scientific activities available for anyone to analyze and reuse. A major purpose of the drive for open data is to allow the verification of scientific claims, by allowing others to look at the reproducibility of results, and to allow data from many sources to be integrated to give new knowledge.

<span class="mw-page-title-main">Australian Research Data Commons</span>

The Australian Research Data Commons (ARDC) is a limited company, formed on 1 July 2018 by combining the Australian National Data Service (ANDS), Nectar and Research Data Services (RDS). Its purpose is to enable Australian researchers and industry access to nationally significant eInfrastructure, skills platforms, and data collections.

A data infrastructure is a digital infrastructure promoting data sharing and consumption.

ELIXIR is an initiative that allows life science laboratories across Europe to share and store their research data as part of an organised network. Its goal is to bring together Europe's research organisations and data centres to help coordinate the collection, quality control and storage of large amounts of biological data produced by life science experiments. ELIXIR aims to ensure that biological data is integrated into a federated system easily accessible by the scientific community.

A European Research Infrastructure Consortium (ERIC) is a full juridical person and a corporation under European Union law. With a membership of at least one European Union member state and two EU member or associated states, it has legal personality and full legal capacity recognized in all Member States. Currently there are 25 ERICs established.

<span class="mw-page-title-main">Walther von Hahn</span> German linguist and computer scientist

Walther von Hahn is a German linguist and computer scientist. From 1977 to 2007, von Hahn taught Computer Science and Linguistics at Universität Hamburg.

The European Holocaust Research Infrastructure (EHRI) is an international digital infrastructure and community. It is a joint undertaking of Holocaust historians, archivists, and specialists in digital humanities. Through the development of heritage archives into research infrastructures and by connecting the knowledge of heritage archives and making that knowledge relevant for research, EHRI aims to support Holocaust research, commemoration and education. EHRI is coordinated by the Netherlands-based NIOD Institute for War, Holocaust and Genocide Studies, and is directed by Reto Speck and Karel Berkhoff.

<span class="mw-page-title-main">Open Science Infrastructure</span>

Open Science Infrastructure is an information infrastructure that supports the open sharing of scientific productions such as publications, datasets, metadata or code. In November 2021 the Unesco recommendation on Open Science describe it as "shared research infrastructures that are needed to support open science and serve the needs of different communities".

References

  1. Press release of the European Commission
  2. Darja Fišer Appointed Executive Director of CLARIN ERIC
  3. CLARIN Value Proposition
  4. European Commission definition of Research Infrastructures (RIs)
  5. CLARIN Knowledge Centres
  6. CLARIN services
  7. European Open Science Cloud
  8. Virtual Language Observatory
  9. The Language Resource Switchboard
  10. Depositing Services
  11. CLARIN General Assembly
  12. CLARIN Scientific Advisory Board
  13. CLARIN Board of Directors
  14. "View Contact | CLARIN ERIC".
  15. CLARIN Office
  16. Overview of CLARIN centres
  17. Participating consortia in CLARIN