UK Data Service

Last updated

UK Data Service
FormationOctober 2012 (consolidating previous services)
TypeGovernment-funded research infrastructure
PurposeAdvancing social science research and teaching through access to data, documentation, guidance and support
Headquarters University of Essex, Colchester
Region served
UK and abroad
Director
Prof Matthew Woollard
Affiliations ESRC, UK Data Archive
Website ukdataservice.ac.uk

The UK Data Service is the largest digital repository for quantitative and qualitative social science and humanities research data in the United Kingdom. The organisation is funded by the UK government through the Economic and Social Research Council and is led by the UK Data Archive at the University of Essex, in partnership with other universities.

Contents

The service is free to data owners and free at the point of use to non-commercial data users.

History

The organisation developed from the UK Data Archive, established at Essex University in 1967. [1] Its funder, the Economic and Social Research Council (part of UK Research and Innovation) established the UK Data Service in its current form in October 2012. [2] Besides the UK Data Archive, the partners are: [1]

Rapid response to COVID-19

The UK Data Service enabled continued research access to data throughout the pandemic and the first COVID-19 datasets were made available in July 2020. Within six months of the pandemic they held their first online COVID-19 data dive with researchers and policy makers across the UK and beyond.[ citation needed ]

The organisation frequently advise at a national and international level on data governance, ethics and confidentiality. They advised the Cabinet office on its development of the draft data ethics framework, which influenced the implementation of the Digital Economy Act 2017, and are often called upon to input into government White Papers around the use of data for research.[ citation needed ]

In October 2013, the UK Data Service received additional funding to coordinate the Administrative Data Research Network (ADRN), the predecessor to ADR-UK, designed to streamline research access to data routinely collected by UK government departments and other agencies. [3] [4]

UK Data Service is listed in the Registry of Research Data Repositories, re3data.org. [5] It is part of the "trusted research environment" provided by the Office for National Statistics. [6]

Data access

The UK Data Service is commissioned to provide data access and support for researchers from all sectors including higher education, central and local government, foundations, charities and business. There are currently more than 6,000 datasets available from a variety of sources. [7] Key data types include: [8]

Access to the data catalogue, documentation and guides are available free of charge. Registration may be required to download data, and its use subject to licensing requirements specified by data owners. Most data are available under a standard End User Licence, but data are available along a spectrum ranging from 'open' to 'secure'. [9] The latter requires specific approval and training to ensure the appropriate level of security for highly detailed and sensitive data. [10]

Data skills training and guidance

Their comprehensive learning resources in their Learning hub, training events and on-demand webinars relate to several innovative aspects of data-intensive social science research, alongside the foundations of research using quantitative and qualitative data. Whether new to research, new undergraduates or teachers; veteran data analysts, career researchers or data managers; there is likely to be some training or best practice guidance that will be useful.

From understanding more about how to use large national surveys, Census or qualitative data, through to the many innovations, including those in modelling, simulation, big data, web-scraping, social media and more, we continue to enable researchers to access, manage and explore data.

Experts in research data management

The organisation publishes policies, procedures and protocols to help other national and international organisations, researchers and data providers to manage research data more effectively. They promote and maintain metadata standards for describing collections and data in the social sciences and their approach to data preparation and curation, and the standards they use, mean that data can be accessed now and in the future.

Through the UK Data Archive they continue to be at the forefront of developing international standards for data processing, both quantitative and qualitative.

Experts in data quality and replicability

As part of their commitment to making sure research is replicable, their work with DataCite and the British Library helps researchers and data depositors cite data correctly, thereby helping demonstrate real value and impact to the data collections they hold. They ensure data follows the FAIR principles of being Findable, Accessible, Interoperable and Reusable, thereby enhancing the quality and replicability of the data. Their guidance to improve data citation across social science, along with their easy-to-use citation tool helps researchers cite data as they would other sources.

Social science research benefits from accountability and transparency, which can usefully be underpinned by high quality and trustworthy data. Rigorous data curation practices are still sometimes viewed as a dark art, and easy-to-use tools to correct and clean numeric data are not widely used, despite awareness of the desire to make data FAIR. Their innovative, open source QAMyData tool provides a health check for numeric data and helps repository staff check, clean and document data.

Digital preservation and archiving

The UK Data Service follows a policy of active preservation to ensure the authenticity, reliability and logical integrity of all digital resources while providing usable versions for research, teaching or learning, in perpetuity. [11]

The UK Data Service is based around a functional model, which in turn is based on the Open Archival Information System or OAIS (an ISO standard). This means that the UK Data Service works with standards for archiving digital materials to build trust relationships: researchers must trust that archivists are giving them the ‘right’ data, and data owners and producers must trust that the archivists are not damaging the integrity of their data. [12]

The UK Data Service adheres to the UK Data Archive Preservation Policy, which codifies long-standing archival practice standards. This policy conforms to the OAIS reference model, with additions and alterations specific to the materials held within the UK Data Service collection. The policy holds strict requirements for digital preservation activities, together with how these requirements can best be achieved in keeping with regulatory requirements, archival best practice, information security and funding constraints.

Data sharing and reuse

The UK Data Service encourages data sharing and reuse as a means to extend the inherent value in primary data for replicating research results as well as for additional analysis and teaching use. [13] To this end, it supports the ESRC's Research Data Policy, which requires researchers funded by the research council to commit to a structured data management plan to enable data produced in the course of research to be deposited and archived for future sharing and reuse. [14] To support researchers in developing robust data management plans, the UK Data Service makes a toolkit of resources available in formats designed for researchers as well as those responsible for teaching data management skills.

Experts in safe research using controlled data

Information security is at the heart of the UK Data Service and flows through everything they do. Lead partner, the UK Data Archive at the University of Essex, was the first university to be awarded ISO27001 certification, an international standard of information security. Working closely with HMRC DataLab and the Office for National Statistics Secure Research Services, they developed safe research protocols including the Five Safes Framework to enable secure research access to data while protecting confidentiality.

Accredited in 2020 by the UK Statistic Authority under the Digital Economy Act 2017, they continue to provide a more streamlined pathway for researchers to securely access controlled data they need for research.

UK Data Service partners have a long history of contributing to best practice in trustworthy repositories. Also in 2020, they helped to develop CoreTrustSeal, which sets out international requirements for trusted data repositories. This demonstrates their expertise to perform all of the relevant activities relating to providing long-term access to data of value to social science researchers.

They also advised on the creation of the SafePod Network (SPN) launched in 2021 that enables wider geographical research access to sensitive data.

Structure and governance

The Economic and Social Research Council (ESRC) in its current form established the UK Data Service in October 2012, after originally setting up and funding the organisation in earlier forms since 1967. The governance structure is managed by the ESRC. [15]

Additional funding from other UK Research and Innovation (UKRI) councils includes the Engineering and Physical Sciences Research Council (EPSRC) that enables specific projects such as the Smart Energy Research Lab (SERL), and European based projects and investments such as the Consortium of European Social Science Data Archives (CESSDA) and Synergies for Europe’s Research Infrastructure in the Social Sciences (SERISS).

Advisory committee

Following the establishment of UKRI in 2018, the new national body that brought together all UK Research Councils, Innovate UK and Research England, the strategic oversight and provision to the ESRC of independent assurance for the UK Data Service has been undergoing change.

A new advisory committee for the UK Data Service was established in 2019 and meets twice a year. The advisory committee for the UK Data Service is the independent body with responsibility to ensure that the investment is developed, managed and maintained in a manner that maximises its benefit as a long-term scientific resource of importance both nationally and internationally. As an independent body, advisory committee members act as critical friends in relation to the overall shape, academic direction and relevance to policy and practice of the UK Data Service, advising on its general strategy in order for it to achieve its overall objectives. Scientific and/methodological advice may also be sought from external, independent sources.

Related Research Articles

In library and archival science, digital preservation is a formal process to ensure that digital information of continuing value remains accessible and usable in the long term. It involves planning, resource allocation, and application of preservation methods and technologies, and combines policies, strategies and actions to ensure access to reformatted and "born-digital" content, regardless of the challenges of media failure and technological change. The goal of digital preservation is the accurate rendering of authenticated content over time.

The Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC) is a digital archive of records of some of the many small cultures and languages of the world. They digitise reel-to-reel field tapes, have a mass data store and use international standards for metadata description. PARADISEC is part of the worldwide community of language archives. PARADISEC's main motivation is to ensure that unique recordings of small languages are preserved for the future, and that researchers consider the future accessibility of their materials for other researchers, community members, or anyone who has an interest in such materials.

The Economic and Social Research Council (ESRC), formerly the Social Science Research Council (SSRC), is part of UK Research and Innovation (UKRI). UKRI is a non-departmental public body (NDPB) funded by the UK government. ESRC provides funding and support for research and training in the social sciences. It is the UK's largest organisation for funding research on economic and social issues.

ESDS International was a Jisc/ESRC funded service which provided the UK academic community with free online access to the major databanks produced by international governmental organisations such as the World Bank, International Monetary Fund and the United Nations. The service also supported the use of these databanks in teaching and research through the provision of a helpdesk for user queries, comprehensive documentation and training.

<span class="mw-page-title-main">UK Data Archive</span>

The UK Data Archive is a national centre of expertise in data archiving in the United Kingdom. It houses the largest collection of social sciences and population digital data in the UK. It is certified under CoreTrustSeal as a trusted digital repository. It is also certified under the international ISO 27001 standard for information security. Located in Colchester, the UK Data Archive is a specialist department of the University of Essex, co-located with the Institute for Social and Economic Research (ISER). It is primarily funded by the Economic and Social Research Council (ESRC) and the University of Essex.

Research data archiving is the long-term storage of scholarly research data, including the natural sciences, social sciences, and life sciences. The various academic journals have differing policies regarding how much of their data and methods researchers are required to store in a public archive, and what is actually archived varies widely between different disciplines. Similarly, the major grant-giving institutions have varying attitudes towards public archiving of data. In general, the tradition of science has been for publications to contain sufficient information to allow fellow researchers to replicate and therefore test the research. In recent years this approach has become increasingly strained as research in some areas depends on large datasets which cannot easily be replicated independently.

The National E-Infrastructure Service (NES), formerly the National Grid Service, was an organisation for UK academics and researchers from 2004 through 2011. It was funded by two governmental bodies, Engineering and Physical Sciences Research Council (EPSRC) and the Joint Information Systems Committee (JISC).

The Archaeology Data Service (ADS) is an open access digital archive for archaeological research outputs. It is located in The King's Manor, at the University of York. Originally intended to curate digital outputs from archaeological researchers based in the UK's Higher Education sector, the ADS also holds archive material created under the auspices of national and local government as well as in the commercial archaeology sector. The ADS carries out research, most of which focuses on resource discovery, cross-searching and interoperability with other relevant archives in the UK, Europe and the United States of America.

ESDS Qualidata is a specialist service of the Economic and Social Data Service (ESDS), led by the UK Data Archive at the University of Essex, jointly funded by the Joint Information Systems Committee (JISC) and the Economic and Social Research Council (ESRC). The service provides access to a wide range of qualitative data from the social sciences as well as user-support, promoting the increased use of secondary analysis in social research and related learning and teaching resources.

The Economic and Social Data Service (ESDS) is a national data archiving and dissemination service that was founded in January 2003. It is funded jointly by the Economic and Social Research Council (ESRC) and Jisc. In July 2012, the ESRC announced it will become a partner in the UK Data Service, to be established as of October 1st of 2012.

ESDS Longitudinal is a specialist service of the Economic and Social Data Service (ESDS), led by the UK Data Archive at the University of Essex and the ESRC United Kingdom Longitudinal Studies Centre (ULSC), jointly funded by the Joint Information Systems Committee (JISC) and the Economic and Social Research Council (ESRC).

<span class="mw-page-title-main">Berlin School of Library and Information Science</span>

The Berlin School of Library and Information Science at Humboldt Universität zu Berlin offers study programmes at three levels: bachelors, masters, and doctoral. It is the only institute in Germany with a doctoral programme and the right to award doctorates. Research methods are also an integral part of the pre-doctoral curriculum.

<span class="mw-page-title-main">Dryad (repository)</span>

Dryad is an international open-access repository of research data, especially data underlying scientific and medical publications. Dryad is a curated general-purpose repository that makes data discoverable, freely reusable, and citable. The scientific, educational, and charitable mission of Dryad is to provide the infrastructure for and promote the re-use of scholarly research data.

A data management plan or DMP is a formal document that outlines how data are to be handled both during a research project, and after the project is completed. The goal of a data management plan is to consider the many aspects of data management, metadata generation, data preservation, and analysis before the project begins; this may lead to data being well-managed in the present, and prepared for preservation in the future.

The Open Identity Exchange (OIX) is a membership organisation that works to accelerate the adoption of digital identity services based on open standards. It is a non-profit organisation and is technology agnostic. It is collaborative, and works across the private and public sectors.

The Registry of Research Data Repositories (re3data.org) is an open science tool that offers researchers, funding organizations, libraries, and publishers an overview of existing international repositories for research data.

Data publishing is the act of releasing research data in published form for use by others. It is a practice consisting in preparing certain data or data set(s) for public use thus to make them available to everyone to use as they wish. This practice is an integral part of the open science movement. There is a large and multidisciplinary consensus on the benefits resulting from this practice.

<span class="mw-page-title-main">Inter-university Consortium for Political and Social Research</span> Organization of research institutions

The Inter-university Consortium for Political and Social Research (ICPSR) is an American political science and social science research consortium, based at the University of Michigan. It was founded in 1962. An integral part of the infrastructure of social science research, ICPSR maintains and provides access to a vast archive of social science data for research and instruction.

Data preservation is the act of conserving and maintaining both the safety and integrity of data. Preservation is done through formal activities that are governed by policies, regulations and strategies directed towards protecting and prolonging the existence and authenticity of data and its metadata. Data can be described as the elements or units in which knowledge and information is created, and metadata are the summarizing subsets of the elements of data; or the data about the data. The main goal of data preservation is to protect data from being lost or destroyed and to contribute to the reuse and progression of the data.

The National Centre for Social Research (NatCen) is a registered charity and is the largest independent social research institute in the UK. The research charity was founded in 1969 by Sir Roger Jowell and Gerald Hoinville with the aim of carrying out rigorous social policy research to improve society.

References

  1. 1 2 "Host organisations". UK Data Service. Retrieved 23 September 2022.
  2. "Governance and funding". UK Data Service. Retrieved 23 September 2022.
  3. Essex receives £5 million for new Big Data Network centre, 10 October 2013, University of Essex, retrieved 8 July 2014
  4. Administrative Data Research Network website, retrieved 8 July 2014
  5. "UK Data Service Entry in re3data.org". www.re3data.org. Retrieved 21 July 2014.
  6. "About the Secure Research Service". Office for National Statistics. Retrieved 23 September 2022.
  7. UK Data Service: About our data, retrieved 11 November 2013
  8. UK Data Service: Key data, retrieved 11 November 2013
  9. UK Data Service: How to access data, retrieved 11 November 2013
  10. UK Data Service: About Secure Access, retrieved 11 November 2013
  11. The longstanding culture in the social sciences of making data accessible is one to value, LSE Impact Blog, retrieved 11 November 2013
  12. The longstanding culture in the social sciences of making data accessible is one to value, LSE Impact Blog, retrieved 11 November 2013
  13. UK Data Service: Prepare and manage data, retrieved 11 November 2013
  14. ESRC Research Data Policy, September 2010, retrieved 11 November 2013
  15. Introducing the UK Data Service, StatsLife, Royal Statistical Society, 15 April 2013, retrieved 11 November 2013