OpenSIGLE

The OpenSIGLE repository provides open access to the bibliographic records of the former SIGLE database. The creation of the OpenSIGLE archive was decided by several major European scientific and technical information (STI) centres, members of the former European network EAGLE (European Association for Grey Literature Exploitation), which collected and disseminated grey literature. OpenSIGLE was developed by the French INIST-CNRS, with assistance from the German FIZ Karlsruhe and the Dutch Grey Literature Network Service (GreyNet). OpenSIGLE is hosted on an INIST-CNRS server at Nancy. Part of the open access movement, OpenSIGLE is referenced by the international Directory of Open Access Repositories.

History of OpenSIGLE

SIGLE (System for Information on Grey Literature in Europe) was a unique multidisciplinary bibliographic database dedicated to grey literature. Up to 15 European partners participated in SIGLE, mostly national libraries or important research libraries. Created in 1980 and produced from 1984 onwards by EAGLE (European Association for Grey Literature Exploitation), the database was last available through STN International and on CD-ROM via SilverPlatter/Ovid Technologies, until input ceased in 2005. Together with other former EAGLE members, INIST decided to make the data publicly available on an open access platform. The OpenSIGLE website went live in December 2007.

OpenSIGLE is indexed by Google and Google Scholar, integrated in the portal of the WorldWideScience Alliance and included in the bookmarks of national libraries and research institutes.

Implementation of OpenSIGLE

OpenSIGLE was developed on the MIT DSpace platform, version 1.3.2. The database subsequently migrated to DSpace version 1.4. It is available under the Creative Commons Attribution Non-commercial No Derivatives (CC-BY-NC-ND) License.

OpenSIGLE metadata

DSpace uses a qualified Dublin Core metadata set that is less detailed than the SIGLE metadata received from the former SIGLE operating agent DPC (FIZ Karlsruhe). The FIZ Karlsruhe XML records were written in the SIGLE format and supplemented with some server-related fields.

Several specific fields from the source format were merged into one field for OpenSIGLE. For example, in the SIGLE record the English title could be either in the field for the original title or in the field for the English title. In the OpenSIGLE metadata, the English title appears systematically in the field labelled "Title".
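The title merge described above can be sketched as follows. This is only an illustration, not the conversion code used for OpenSIGLE: the dictionary keys `english_title` and `original_title` are hypothetical names, and the rule that a non-empty English-title field takes precedence is an assumption.

```python
def merge_title(record: dict) -> str:
    """Return the value for the single "Title" field of a converted record.

    Assumption: if the (hypothetical) english_title key is filled, it wins;
    otherwise the original_title key is taken to hold the English title.
    """
    english = (record.get("english_title") or "").strip()
    original = (record.get("original_title") or "").strip()
    return english or original


# A record whose English title sits in the original-title field.
only_original = {"original_title": "Grey literature in Europe"}
# A record with a separate English title alongside the original one.
both_fields = {"original_title": "Graue Literatur", "english_title": " Grey literature "}

print(merge_title(only_original))
print(merge_title(both_fields))
```

Keeping the rule in one small function makes it easy to apply the same choice to every record during a batch conversion.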

Other fields were defined differently to fit with the metadata set. Some qualified fields were added to the metadata set used by DSpace without disturbing the OAI compliance: conference title, report number and availability statement.

The most significant change was a simplification in the document type information. The original SIGLE format distinguished between document type and literature indicator, but diverging conversion practices led to inconsistencies. OpenSIGLE proposes a simplified list of the principal document types. [1]

OpenSIGLE content

DSpace organizes the contents of a repository into communities and collections. INIST decided to use two types of top-level communities: the member countries and the SIGLE subject categories. Each country or subject category holds a collection of records. Some minor, less used subject categories were regrouped into one collection. In a mass upload on DSpace, each record (or item) can be attributed to only one community or collection, so the first classification code of each record was chosen. Since the files of each member country are processed separately, the country community can also be declared for each record.
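The rule of attributing each record to a single collection by its first classification code can be sketched as below. The codes, the two-character category prefix and the collection names are all hypothetical placeholders chosen for illustration; the actual mapping used by INIST is not given here.

```python
from typing import List, Optional

# Hypothetical mapping from a category prefix to a collection name.
CATEGORY_TO_COLLECTION = {
    "05": "collection-05",
    "06": "collection-06",
}
# Hypothetical single collection for the regrouped minor categories.
REGROUPED_COLLECTION = "collection-other"


def choose_collection(classification_codes: List[str]) -> Optional[str]:
    """Pick one collection for a record from its first classification code.

    Assumption: the first two characters of a code identify the subject
    category. Returns None when the record carries no code at all.
    """
    if not classification_codes:
        return None
    first_code = classification_codes[0].strip()
    return CATEGORY_TO_COLLECTION.get(first_code[:2], REGROUPED_COLLECTION)


print(choose_collection(["05A", "06B"]))  # only the first code is used
print(choose_collection(["99Z"]))         # unmapped category is regrouped
```

Only the first code decides the collection; any further codes remain searchable through the subject field but do not affect placement.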

Unlike in the CD-ROM version, the document type is no longer searchable in OpenSIGLE. Instead, INIST chose to display this information in the list of results, along with the title, the authors and the publication date. This is not a feature of the basic version of DSpace, but similar practices were observed in other repositories (see ERA 2006 and Glasgow 2006).

The SIGLE classification scheme with its 246 subject sub-categories can be searched through the subject field, either by code or by wording. A dedicated help page, accessible at any time, lists the complete classification scheme with both the codes and their descriptions. As mentioned above, the subject areas were reduced to 15 entries for the organization of the database in collections and for browsing purposes.

For OpenSIGLE, INIST chose the latest stable version of the software available at the time, DSpace 1.3.2. One of the new features in this version is support for a multilingual user interface (cf. DSpace system documentation 2006). This feature was developed further by a LIS student, and OpenSIGLE can now be used with interfaces in English (the main version), French, German and Italian. These are the four most representative languages in the database. The help pages and the "About" information are available in English and French only, since they must be translated separately.

Because document delivery was very important for the SIGLE database, INIST decided to add to each record an order form, which facilitates contact with the holder of the document (a former EAGLE member), and information about the document’s availability. In addition, INIST gives updated information for each participating centre on each of the "Countries" pages.

OpenSIGLE functionalities and perspectives

With the migration to the DSpace platform, the look and presentation of the former SIGLE records have changed.

Some data, such as the language or the document type, are no longer searchable but are still displayed, even in the list of results. The principal characteristics of the SIGLE database have been preserved or even improved. Access to the full text will be facilitated through an order form for document delivery and, for some records, hopefully through links to the electronic version in the future. Since the records are organized in collections based on the subject categories, and the OAI Protocol for Metadata Harvesting (OAI-PMH) treats collections as sets, selective harvesting by subject will be possible.
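Selective harvesting by set can be illustrated with a minimal sketch that builds an OAI-PMH `ListRecords` request. The `verb`, `metadataPrefix` and `set` arguments are defined by the OAI-PMH specification; the base URL and the set identifier below are placeholders, not the real OpenSIGLE endpoint or one of its collections.

```python
from urllib.parse import urlencode


def list_records_url(base_url: str, set_spec: str,
                     metadata_prefix: str = "oai_dc") -> str:
    """Build an OAI-PMH ListRecords request restricted to one set."""
    query = urlencode({
        "verb": "ListRecords",
        "metadataPrefix": metadata_prefix,  # oai_dc = unqualified Dublin Core
        "set": set_spec,                    # one collection exposed as a set
    })
    return f"{base_url}?{query}"


# Placeholder endpoint and set identifier, for illustration only.
url = list_records_url("https://repository.example.org/oai/request", "hdl_1234_5")
print(url)
# → https://repository.example.org/oai/request?verb=ListRecords&metadataPrefix=oai_dc&set=hdl_1234_5
```

A harvester would first issue a `ListSets` request to discover which set identifiers the repository exposes, then request only the subject collections it needs.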

More generally, OpenSIGLE seems to be the first migration of an important traditional bibliographic database into an OAI (Open Archives Initiative) compliant environment. Some factors facilitated this migration, e.g. the mapping of the metadata from a very detailed format to a simpler one. The whole project benefited largely from INIST-CNRS's previous experience with DSpace, in particular its knowledge of record import. Still, OpenSIGLE provided INIST-CNRS with new experience of mass uploads on an open source platform.

Perspectives for the future developments of the OpenSIGLE archive are:

At the 12th International Conference on Grey Literature at Prague [2] in December 2010, INIST-CNRS presented a new project called OpenGrey. [3] OpenGrey is a new website with OAI-PMH support, improved research facilities and export of records. OpenGrey also includes recent records and links to the full text. At the Prague conference, INIST and GreyNet called on former SIGLE members and new partners to contribute to OpenGrey. In 2011 OpenSIGLE changed its platform and its name. OpenGrey provides new features and new content.

OpenSIGLE and GreyNet

For the past 15 years, GreyNet has sought to serve researchers and authors in the field of grey literature. To further this end, GreyNet has signed on to the OpenSIGLE repository and in so doing seeks to preserve and make openly available research results originating in the International Conference Series on Grey Literature. GreyNet, together with INIST-CNRS, has designed the format for a metadata record, which encompasses standardized PDF attachments of the full-text conference preprints, PowerPoint presentations, abstracts and biographical notes.

In 2010, OpenSIGLE provides open access to some 200 conference papers on grey literature, from 1995 to 2009. Twenty-one full-text papers from the Second International Conference on Grey Literature, held in Washington, D.C., on November 2–3, 1995, were added in March 2010. GreyNet purchased permission last year from Emerald to make openly accessible the papers published in the GL Conference Proceedings from 1994 to 2000. These earlier collections are added to the more recent collections in the OpenSIGLE Repository. The work involved relies on the efforts of INIST-CNRS as service provider and GreyNet as data provider. By autumn 2010, it is anticipated that all of the papers in the International Conference Series on Grey Literature will be fully accessible via the OpenSIGLE Repository.

OpenSIGLE participates in the WorldWideScience global science gateway.

Related Research Articles

Open Archives Initiative

The Open Archives Initiative (OAI) was an informal organization, in the circle around the colleagues Herbert Van de Sompel, Carl Lagoze, Michael L. Nelson and Simeon Warner, to develop and apply technical interoperability standards for archives to share catalogue information (metadata). The group got together in the late 1990s and was active for around twenty years. OAI coordinated in particular three specification activities: OAI-PMH, OAI-ORE and ResourceSync. All along, the group worked towards building a "low-barrier interoperability framework" for archives containing digital content, to allow people to harvest metadata. Such sets of metadata have since been harvested to provide "value-added services", often by combining different data sets.

CiteSeerX is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science.

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.

An institutional repository (IR) is an archive for collecting, preserving, and disseminating digital copies of the intellectual output of an institution, particularly a research institution. Academics also use their IRs for archiving published works to increase their visibility and collaboration with other academics. However, most of the outputs produced by universities are not effectively accessed and shared by researchers and other stakeholders. As a result, academics should be involved in the implementation and development of an IR project so that they can learn the benefits and purpose of building an IR.

DSpace: Repository software package

DSpace is an open source repository software package typically used for creating open access repositories for scholarly and/or published digital content. While DSpace shares some feature overlap with content management systems and document management systems, the DSpace repository software serves a specific need as a digital archives system, focused on the long-term storage, access and preservation of digital content. The optional DSpace registry lists almost three thousand repositories all over the world.

Grey literature: Documents and research not produced for commercial or academic journal purposes

Grey literature is materials and research produced by organizations outside of the traditional commercial or academic publishing and distribution channels. Common grey literature publication types include reports, working papers, government documents, white papers and evaluations. Organizations that produce grey literature include government departments and agencies, civil society or non-governmental organizations, academic centres and departments, and private companies and consultants.

An Open Archival Information System is an archive, consisting of an organization of people and systems, that has accepted the responsibility to preserve information and make it available for a Designated Community. The OAIS model can be applied to various archives, e.g., open access, closed, restricted, "dark", or proprietary.

Public Knowledge Project: Metadata reservation project for e-journals

The Public Knowledge Project (PKP) is a non-profit research initiative that is focused on the importance of making the results of publicly funded research freely available through open access policies, and on developing strategies for making this possible including software solutions. It is a partnership between the Faculty of Education at the University of British Columbia, the Canadian Centre for Studies in Publishing at Simon Fraser University, the University of Pittsburgh, Ontario Council of University Libraries, the California Digital Library and the School of Education at Stanford University. It seeks to improve the scholarly and public quality of academic research through the development of innovative online environments.

DPubS, developed by Cornell University Library and Penn State University Libraries, is a free open access publication management software. DPubS arose out of Project Euclid, an electronic publishing platform for journals in mathematics and statistics. DPubS is free software released under Educational Community License.

BASE (search engine): Academic search engine

BASE is a multi-disciplinary search engine to scholarly internet resources, created by Bielefeld University Library in Bielefeld, Germany. It is based on free and open-source software such as Apache Solr and VuFind. It harvests OAI metadata from institutional repositories and other academic digital libraries that implement the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), and then normalizes and indexes the data for searching. In addition to OAI metadata, the library indexes selected web sites and local data collections, all of which can be searched via a single search interface.

EPrints is a free and open-source software package for building open access repositories that are compliant with the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). It shares many of the features commonly seen in document management systems, but is primarily used for institutional repositories and scientific journals. EPrints has been developed at the University of Southampton School of Electronics and Computer Science and released under the GPL-3.0-or-later license.

PREservation Metadata: Implementation Strategies (PREMIS) is the de facto digital preservation metadata standard.

A digital library, also called an online library, an internet library, a digital repository, a library without walls, or a digital collection is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital media formats or a library accessible through the internet. Objects can consist of digitized content like print or photographs, as well as originally produced digital content like word processor files or social media posts. In addition to storing content, digital libraries provide means for organizing, searching, and retrieving the content contained in the collection. Digital libraries can vary immensely in size and scope, and can be maintained by individuals or organizations. The digital content may be stored locally, or accessed remotely via computer networks. These information retrieval systems are able to exchange information with each other through interoperability and sustainability.

AGRIS is a global public domain database with more than 12 million structured bibliographical records on agricultural science and technology. It became operational in 1975 and the database was maintained by Coherence in Information for Agricultural Research for Development, and its content is provided by more than 150 participating institutions from 65 countries. The AGRIS Search system allows scientists, researchers and students to perform sophisticated searches using keywords from the AGROVOC thesaurus, specific journal titles or names of countries, institutions, and authors.

Grey Literature Network Service

GreyNet International, the Grey Literature Network Service, is an independent organization founded in 1992. It is dedicated to research, publication, open access, education, and bringing public awareness to grey literature. Grey literature is often defined as "information produced and distributed on all levels of government, academics, business and industry in electronic and print formats not controlled by commercial publishing i.e. where publishing is not the primary activity of the producing body."

The Grey Literature International Steering Committee (GLISC) was established in 2006 after the 7th International Conference on Grey Literature (GL7) held in Nancy (France) on 5–6 December 2005.

The “System for Information on Grey Literature in Europe” (SIGLE) was established in 1980, two years after a seminar on grey literature organised by the European Commission in York (UK). Operated by a network of national information or document supply centres active in collecting and promoting grey literature, SIGLE was an online, pan-European electronic bibliographic database and document delivery system.

The “European Association for Grey Literature Exploitation” (EAGLE) was created in 1985 by European scientific and technical information centres and libraries in order to produce the bibliographic database “System for Information on Grey Literature in Europe” (SIGLE).

An open repository or open-access repository is a digital platform that holds research output and provides free, immediate and permanent access to research results for anyone to use, download and distribute. To facilitate open access, such repositories must be interoperable according to the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). Search engines harvest the content of open access repositories, constructing a worldwide database of research available free of charge.

The Open Knowledge Repository is the official open-access repository of the World Bank and features research content about development. It was launched in 2012, alongside the World Bank's Open Access Policy and its adoption of the Creative Commons Attribution license for all research and knowledge products that it publishes, which collectively made the World Bank the first international organization to completely embrace open access. The repository collects the intellectual output of the World Bank in digital form, disseminates it, and preserves it long-term.

References