The Registry Interchange Format - Collections and Services (RIF-CS) is an XML vocabulary for representing metadata about data collections and related entities based on ISO 2146. [1]
It is a machine-readable format to describe metadata about data collections. Similar to how MARC standards are used by library systems to describe books, RIF-CS is used to describe data collections. For example, a RIF-CS record can describe a spreadsheet containing experimental results: it might contain the title, description, creator, keywords, date the experiment was conducted and a URL to obtain the actual spreadsheet. Another RIF-CS record can describe the person who created the spreadsheet: it might contain their name, address and contact email.
RIF-CS documents, containing RIF-CS records, are exchanged between computer systems. For example, a university can send the RIF-CS documents to a national database, where it can be indexed and searched. People can download RIF-CS records from a national database to use the information to find out more about the data and/or the researcher.
The metadata describes the entity as well relationships between the entities. RIF-CS represents metadata about the following types of entities, which it calls registry objects: [2]
These entities are based on the entities defined in ISO 2146. [3]
The registry objects contain either embedded information or links to other registry objects. Some information is represented directly in the registry object; for example: names, descriptions and dates. Other information is represented by a link to another registry object: for example, the creator for a collection (research data) is a link to another registry object that describes the party (a researcher).
RIF-CS is used in Australian universities and industry as a format for submitting metadata to the Australian National Data Service (ANDS) Research Data Australia service. [4]
In that context, collections represent collections of research data; parties represent people and groups; activities represent research projects or programs; services represent ways to access the research data collections.
RIF-CS is only a representation of the metadata. To exchange RIF-CS a protocol, such as OAI-PMH, is used.
This example RIF-CS document contains a collection record and a party record.
<?xml version="1.0"?><registryObjectsxmlns="https://ands.org.au/standards/rif-cs/registryObjects"><registryObjectgroup="G1"><key>urn:uuid:7d9127c5-b97e-4474-87a3-68f2c219c609</key><originatingSource>https://www.example.com/thales/source</originatingSource><collectiontype="dataset"dateModified="2013-03-14T15:09:26Z"><identifiertype="local">urn:uuid:dec9b127-0d24-4d12-aaf2-a71e9ed29f75</identifier><nametype="primary"><namePart>Testdata</namePart></name><nametype="alternative"><namePart>Datafortesting</namePart></name><descriptiontype="full">Datausedfortesting.</description><subjecttype="local">test</subject><location><address><electronictype="email"><value>admin@example.edu</value></electronic></address></location><location><address><electronictype="url"><value>https://www.example.edu/data/1234</value></electronic></address></location><coverage><temporal><datetype="dateFrom"dateFormat="W3CDTF">2012</date><datetype="dateTo"dateFormat="W3CDTF">2013</date></temporal></coverage><relatedInfo><identifiertype="uri">https://www.example.com/foobar</identifier></relatedInfo><relatedObject><key>urn:uuid:30fda6cd-5b08-4376-a1b7-2ee40144d9da</key><relationtype="hasCollector"/></relatedObject></collection></registryObject><registryObjectgroup="G1"><key>urn:uuid:30fda6cd-5b08-4376-a1b7-2ee40144d9da</key><originatingSource>https://www.example.com/thales/source</originatingSource><partytype="person"dateModified="2013-03-14T07:14:25Z"><identifiertype="ORCID">https://orcid.org/0000-0002-9501-6966</identifier><nametype="primary"><nameParttype="title">Dr</namePart><nameParttype="given">John</namePart><nameParttype="family">Citizen</namePart></name></party></registryObject></registryObjects>
The Dublin Core, also known as the Dublin Core Metadata Element Set (DCMES), is a set of fifteen main metadata items for describing digital or physical resources. The Dublin Core Metadata Initiative (DCMI) is responsible for formulating the Dublin Core; DCMI is a project of the Association for Information Science and Technology (ASIS&T), a non-profit organization.
An identifier is a name that identifies either a unique object or a unique class of objects, where the "object" or class may be an idea, physical countable object, or physical noncountable substance. The abbreviation Id often refers to identity, identification, or an identifier. An identifier may be a word, number, letter, symbol, or any combination of those.
A Universally Unique IDentifier (UUID) is a 128-bit label used for information in computer systems. The term Globally Unique IDentifier (GUID) is also used, mostly in Microsoft systems.
The Geography Markup Language (GML) is the XML grammar defined by the Open Geospatial Consortium (OGC) to express geographical features. GML serves as a modeling language for geographic systems as well as an open interchange format for geographic transactions on the Internet. Key to GML's utility is its ability to integrate all forms of geographic information, including not only conventional "vector" or discrete objects, but coverages and sensor data.
An International Standard Serial Number (ISSN) is an eight-digit serial number used to uniquely identify a serial publication, such as a magazine. The ISSN is especially helpful in distinguishing between serials with the same title. ISSNs are used in ordering, cataloging, interlibrary loans, and other practices in connection with serial literature.
A digital object identifier (DOI) is a persistent identifier or handle used to uniquely identify various objects, standardized by the International Organization for Standardization (ISO). DOIs are an implementation of the Handle System; they also fit within the URI system. They are widely used to identify academic, professional, and government information, such as journal articles, research reports, data sets, and official publications. DOIs have also been used to identify other types of information resources, such as commercial videos.
A metadata registry is a central location in an organization where metadata definitions are stored and maintained in a controlled method.
NIEMOpen, frequently referred to as NIEM, originated as an XML-based information exchange framework from the United States, but has transitioned to an OASISOpen Project. This initiative formalizes NIEM's designation as an official standard in national and international policy and procurement. NIEMOpen's Project Governing Board recently approved the first standard under this new project; the Conformance Targets Attribute Specification (CTAS) Version 3.0. A full collection of NIEMOpen standards are anticipated by end of year 2024.
A unique identifier (UID) is an identifier that is guaranteed to be unique among all identifiers used for those objects and for a specific purpose. The concept was formalized early in the development of computer science and information systems. In general, it was associated with an atomic data type.
This article describes the technical specifications of the OpenDocument office document standard, as developed by the OASIS industry consortium. A variety of organizations developed the standard publicly and make it publicly accessible, meaning it can be implemented by anyone without restriction. The OpenDocument format aims to provide an open alternative to proprietary document formats.
Security Assertion Markup Language 2.0 (SAML 2.0) is a version of the SAML standard for exchanging authentication and authorization identities between security domains. SAML 2.0 is an XML-based protocol that uses security tokens containing assertions to pass information about a principal between a SAML authority, named an Identity Provider, and a SAML consumer, named a Service Provider. SAML 2.0 enables web-based, cross-domain single sign-on (SSO), which helps reduce the administrative overhead of distributing multiple authentication tokens to the user.
The Open Packaging Conventions (OPC) is a container-file technology initially created by Microsoft to store a combination of XML and non-XML files that together form a single entity such as an Open XML Paper Specification (OpenXPS) document. OPC-based file formats combine the advantages of leaving the independent file entities embedded in the document intact and resulting in much smaller files compared to normal use of XML.
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
Location awareness refers to devices that can passively or actively determine their location. Navigational instruments provide location coordinates for vessels and vehicles. Surveying equipment identifies location with respect to a well-known location wireless communications device.
An Archival Resource Key (ARK) is a multi-purpose URL suited to being a persistent identifier for information objects of any type. It is widely used by libraries, data centers, archives, museums, publishers, and government agencies to provide reliable references to scholarly, scientific, and cultural objects. In 2019 it was registered as a Uniform Resource Identifier (URI).
In computing, Open Data Protocol (OData) is an open protocol that allows the creation and consumption of queryable and interoperable REST APIs in a simple and standard way. Microsoft initiated OData in 2007. Versions 1.0, 2.0, and 3.0 are released under the Microsoft Open Specification Promise. Version 4.0 was standardized at OASIS, with a release in March 2014. In April 2015 OASIS submitted OData v4 and OData JSON Format v4 to ISO/IEC JTC 1 for approval as an international standard. In December 2016, ISO/IEC published OData 4.0 Core as ISO/IEC 20802-1:2016 and the OData JSON Format as ISO/IEC 20802-2:2016.
ISO 2146 is an ISO standard defining an information model for "registry services for libraries and related organisations". Operating at a higher level than item-level standards such as MARC, it takes as principal elements parties, collections, services and activities
In computer science, a U-form is an abstract data type comprising a collection of attribute–value pairs associated with a universally-unique identifier (UUID). A U-form essentially comprises an associative array augmented with a UUID and with keys limited to strings.
The Entertainment Identifier Registry, or EIDR, is a global unique identifier system for a broad array of audiovisual objects, including motion pictures, television, and radio programs. The identification system resolves an identifier to a metadata record that is associated with top-level titles, edits, DVDs, encodings, clips, and mashups. EIDR also provides identifiers for video service providers, such as broadcast and cable networks.
ISO/IEC 19788Information technology – Learning, education and training – Metadata for learning resources is a multi-part standard prepared by subcommittee SC 36 of the joint technical committee ISO/IEC JTC 1, Information Technology for Learning, Education and Training.