CORE (research service)

Last updated

CORE (Connecting Repositories)
Commercial?No
Type of projectOpen Access, Repositories, Harvesting
Location Open University
CountryUnited Kingdom
Key peoplePetr Knoth
Website core.ac.uk OOjs UI icon edit-ltr-progressive.svg

CORE (Connecting Repositories) is a global open access indexing service developed at the Knowledge Media Institute (KMi) of The Open University, United Kingdom. It systematically indexes metadata and full text content from institutional and subject repositories, as well as open access and hybrid journals, to provide a comprehensive access to the world’s scholarly works.

Contents

CORE’s strongly supports the open access movement in scholarly communication by improving the discoverability, accessibility, and reuse of research outputs at scale.

The service underpins applications in a wide variety of areas including in systematic reviews, open access compliance monitoring, reproducibility of research software, training of LLMs, and analytics for funders and institutions.

As of August 2025, CORE indexes over 400 million scholarly resources from repositories [1] globally, a measure of its reach and comprehensiveness within the scholarly ecosystem.

This represents a growth of 47% from an earlier report in 2023.The data is made available via a range of services, which currently reach over 30 million monthly users, highlighting CORE’s role as a major hub for open scientific knowledge.

Service description

There are existing commercial academic search systems, such as Google Scholar, which provide search and access level services, but do not support programmable machine access to the content. This is seen with the use of an API or data dumps, and limits the further reuse of the open access content (e.g., text and data mining). There are three access levels to content: [2]

The programmable machine access is the main feature that distinguishes CORE from Google Scholar and formerly Microsoft Academic Search.

History

The first version of CORE was created in 2011 by Petr Knoth with the aim to make it easier to access and text mine very large amounts of research publications. [3] The value of the aggregation was first demonstrated by developing a content recommendation system for research papers, following the ideas of literature-based discovery introduced by Don R. Swanson. Since its start, CORE has received financial support from a range of funders including Jisc and the European Commission. CORE aggregates from across the world; in 2017, it was calculated that it reached documents from 102 countries in 52 languages. [4] It has the status of the UK's national aggregator of open access content, aggregating metadata and full-text outputs from both UK publishers' databases as well as institutional and subject repositories. [5] [6]

CORE operates as a one step search tool for UK's open access research outputs, facilitating discoverability, use and reuse. The importance of the service has been widely recognised by Jisc, which suggested that CORE should preserve the required resources to sustain its operation and explore an international sustainability model. [7] CORE is now one of the Repository Shared Services projects, along with Sherpa Services, [8] IRUS, [9] Jisc Publications Router [10] and OpenDOAR.

In 2018, CORE said it was the world's largest aggregator of open access research papers. [11] Based on the open access fundamental principles, as they were described in the Budapest Open Access Initiative, its open access content not only must be openly available to download and read, but it must also allow its reuse, both by humans and machines. As a result, there was a need to exploit the content reuse, which could be made possible with the implementation of a technical infrastructure. The CORE project started with the goal of connecting metadata and full-text outputs offering, through content aggregation, value-added services, and by opening new opportunities in the research process.

As of November 2025, CORE provided access to 431 million metadata records of scholarly papers, with an estimated 323M free to read full text links and 46M full texts hosted directly by CORE. [12]

CORE makes its data available freely over the CORE API to everyone, including unauthenticated users. [13] Higher rate limits are available to academics from CORE member institutions and paying users. [14] CORE does not claim ownership of any individual metadata record or underlying work received from third-party repositories. In 2025, CORE announced that it is reviewing its data-licensing framework. Future versions of CORE will distinguish between factual metadata, which will be made openly reusable under a suitable licence, such as CC0 or ODC-0, and expressive content (e.g. abstracts or full texts), which will be made available under conditional access for lawful text- and data-mining purposes. [15]

Programmable access to CORE data

CORE data can be accessed through an API or downloaded as a pre-processed and semantically enriched data dump. [16]

Searching CORE

CORE provides searchable access to a collection of over 125 million open access harvested research outputs. All outputs can be accessed and downloaded free of cost and have limited re-use restrictions. One can search the CORE content using a faceted search. CORE also provides a cross-repository content recommendation system based on full-texts. The collection of the harvested outputs is available either by looking at the latest additions [17] or by browsing [18] the collection at the date of harvesting. The CORE search engine was selected by an author on Jisc in 2013 as one of the top 10 search engines [19] for open access research, facilitating access to academic papers. [20] [21]

Analytical use of CORE data

The availability of data aggregated and enriched by CORE provides opportunities for the development of new analytical services for research literature. These can be used, for example, to monitor growth and trends in research, validate compliance with open access mandates and to develop new automatic metrics for evaluating research excellence.

According to the Registry of Open Access Repositories, the number of funders increased from 22 units in 2007 to 34 in 2010 and then to 67 in 2015, while the number of institutional full-text and open access mandates picked up from 137 units in 2007 to 430 in 2015. [22]

Applications

CORE offers eight applications:

See also

References

  1. "Powerful dataset of CORE". core.ac.uk. Retrieved 24 November 2025.
  2. Knoth, Petr; Zdrahal, Zdenek (2012). "CORE: Three access levels to underpin open access". D-Lib Magazine . Vol. 18, no. 11. doi: 10.1045/november2012-knoth . ISSN   1082-9873.
  3. "OUs full text search system makes huge leaps in widening access to academic papers". 24 October 2012. Archived from the original on 17 June 2022. Retrieved 19 December 2014.
  4. "CORE franchit le cap des 5 millions de documents en texte intégral indexés et en libre accès" (in French).
  5. "CORE melds UK repositories". Times of Higher Education. 13 October 2011. Retrieved 11 November 2014.
  6. "UK's first open access full-text search engine to aid research". The Research Centre. 3 October 2011. Archived from the original on 9 January 2015. Retrieved 19 December 2014.
  7. Jacobs, Neil; Ferguson, Nicky (2014). Bringing the UK's open access research together: Barriers on the Berlin road to open access (PDF). Jisc.
  8. "SHERPA Services" . Retrieved 20 January 2015.
  9. "IRUS UK" . Retrieved 20 January 2015.
  10. "Jisc Publications Router". Archived from the original on 9 January 2015. Retrieved 20 January 2015.
  11. Notay, Balviar; Knoth, Petr; Pontika, Nancy (1 June 2018). "CORE becomes the world's largest aggregator". Jisc scholarly communications blog.
  12. "Powerful dataset of CORE". core.ac.uk. Retrieved 24 November 2025.
  13. "CORE API Documentation". api.core.ac.uk. Retrieved 24 November 2025.
  14. "CORE Membership for data providers". core.ac.uk. Retrieved 24 November 2025.
  15. "Terms & Conditions". core.ac.uk. Retrieved 24 November 2025.
  16. "CORE Services". core.ac.uk. Retrieved 20 June 2021.
  17. "CORE Latest Additions". Archived from the original on 15 May 2013. Retrieved 20 January 2015.
  18. "CORE Browsing". Archived from the original on 12 January 2015. Retrieved 20 January 2015.
  19. "Ten Search Engines for researchers that go beyond Google". Jisc Inform. Summer 2013. Archived from the original on 24 December 2014. Retrieved 17 November 2014.
  20. "OU widens access to academic papers". Archived from the original on 9 January 2015. Retrieved 17 November 2014.
  21. Else, Holly (14 August 2014). "'Dismal' start for Access to Research initiative". Times Higher Education. Retrieved 15 November 2014.
  22. Pontika, Nancy; Knoth, Petr; Cancellieri, Matteo; Pearce, Samuel (2016). "Developing Infrastructure to Support Closer Collaboration of Aggregators with Open Repositories". LIBER Quarterly. 25 (4): 172–188. doi: 10.18352/lq.10138 . ISSN   1435-5205. OCLC   1005985574.
  23. "CORE API" . Retrieved 6 March 2018.
  24. "CORE Dataset" . Retrieved 6 March 2018.
  25. "CORE Recommender" . Retrieved 6 March 2018.
  26. "CORE Repository Dashboard" . Retrieved 6 March 2018.
  27. Pontika, Nancy; Knoth, Petr; Cancellieri, Matteo; Pearce, Samuel (8 March 2016). "Developing Infrastructure to Support Closer Collaboration of Aggregators with Open Repositories". LIBER Quarterly. 25 (4): 183. doi: 10.18352/lq.10138 . ISSN   1435-5205.
  28. "CORE Analytics Dashboard" . Retrieved 6 March 2018.
  29. "CORE Search" . Retrieved 6 March 2018.
  30. "CORE Publisher Connector" . Retrieved 6 March 2018.