Publishing Requirements for Industry Standard Metadata

Last updated December 09, 2024

The Publishing Requirements for Industry Standard Metadata (PRISM)^[1] for the Internet, computing, and computer science, is a specification that defines a set of XML metadata vocabularies for syndicating, aggregating, post-processing and multi-purposing content.

PRISM provides a framework for the interchange and preservation of content and metadata, a collection of elements to describe that content, and a set of controlled vocabularies listing the values for those elements. PRISM can be XML, RDF/XML, or XMP and incorporates Dublin Core elements. PRISM can be thought of as a set of XML tags used to contain the metadata of articles and even tag article content.

PRISM conforms to the World Wide Web standard for Namespaces. PRISM namespaces are PRISM (prism:), PRISM Usage Rights (pur:), Dublin Core (dc: and dcterms:), PRISM Inline Metadata (pim:), PRISM Rights Language (prl:), PRISM Aggregator Message (pam:), and PRISM Controlled Vocabulary (pcv:). PRISM incorporated existing industry standards such as Dublin Core and XHTML in order to leverage work that had already been done in the publishing industry. New elements were created only when required and assigned to PRISM specific namespaces.

Overview

PRISM consists of three specifications. The PRISM Specification, itself, provides a definition for the overall PRISM framework. A second specification, the PRISM Aggregator Message (PAM) Schema/DTD, is a standard format for publishers to use for delivery of content to websites, aggregators, and syndicators. PAM is available as an XML DTD and an XML schema (XSD). Both PAM formats provide a simple, flexible model for transmitting content and PRISM metadata. The third, and newest, specification provides an XML schema (XSD) for the capture of content usage rights metadata. This Guide to PRISM Usage Rights utilizes the elements found in PRISM’s Usage Rights Namespace to allow users to comprehensively capture and relay rights metadata for text and media content.

Background

In 1999, Idealliance contracted Linda Burman to found the PRISM Working Group to address emerging publisher requirements for a metadata standard to facilitate “agile” content for search, digital asset management, content aggregation. Since that time, individuals from more than 50 Idealliance member companies have participated in the development of the specifications.

PRISM is an Idealliance specification but is available free of charge. Idealliance (International Digital Enterprise Alliance) is a not-for-profit membership organization. Its mission is to advance user-driven, cross-industry solutions for all publishing and content-related processes by developing standards, fostering business alliances, and identifying best practices.

Many organizations use PRISM because it provides a common metadata standard across platforms, media types and business units. Organizations who are involved in any type of content creation, categorization, management, aggregation and distribution, both commercially and within intranet and extranet frameworks can use the PRISM standards.

The PRISM Working Group is open to all Idealliance members and includes: Adobe Systems, Hachette Filipacchi Media, Hearst, L.A. Burman Associates, LexisNexis, The McGraw-Hill Companies, Reader’s Digest, Source Interlink Media Companies, Time Inc., The Nature Publishing Group, and U.S. News & World Report.

Usage and Applications

PRISM can be incorporated into other standards and at this time, the PRISM Working Group is only aware of PRISM incorporation with RSS 1.0. See RSS 1.0^[2] and the RSS 1.0 PRISM Module^[3] for more information.

The PRISM specification defines a set of metadata vocabularies. PRISM metadata may be expressed in a different syntax depending on the specific use-case scenario. Currently PRISM metadata can be encoded XML, XML/RDF, or as XMP. Each of these expressions of PRISM metadata is called a profile.

Profile 1 is for the expression of PRISM metadata in XML. An example is the XML PRISM Aggregator Message (PAM).
Profile 2 is for the expression of PRISM metadata in XML/RDF such as for expressing PRISM metadata in RSS feeds.
Profile 3 is for embedding PRISM metadata in media objects such as digital images or PDFs using XMP technology.

PRISM describes many components of print, online, mobile, and multimedia content including the following:

Who created, contributed to, and owns the rights to the content?
What locations, organizations, topics, people, and/or events it covers, the media it contains, and under what conditions it may be reproduced?
When it was published? (cover date, post date, volume, number), withdrawn?
Where it can be republished, and the original platform on which it appeared?
How it can be reused?

Common PRISM Usage

Syndication to partners
Content aggregation
Content repurposing
Resource discovery and search optimization
Multiple platform and channel distribution
Content archiving
Capture rights usage information
Creation of feeds, such as RSS
Standalone services
Embedded descriptions, such as XMP
Web publishing

Related Research Articles

The Dublin Core vocabulary, also known as the Dublin Core Metadata Terms (DCMT), is a general purpose metadata vocabulary for describing resources of any type. It was first developed for describing web content in the early days of the World Wide Web. The Dublin Core Metadata Initiative (DCMI) is responsible for maintaining the Dublin Core vocabulary.

Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. The World Wide Web Consortium's XML 1.0 Specification of 1998 and several other related specifications—all of them free open standards—define XML.

RSS is a web feed that allows users and applications to access updates to websites in a standardized, computer-readable format. Subscribing to RSS feeds can allow a user to keep track of many different websites in a single news aggregator, which constantly monitors sites for new content, removing the need for the user to manually check them. News aggregators can be built into a browser, installed on a desktop computer, or installed on a mobile device.

XSD, a recommendation of the World Wide Web Consortium (W3C), specifies how to formally describe the elements in an Extensible Markup Language (XML) document. It can be used by programmers to verify each piece of item content in a document, to assure it adheres to the description of the element it is placed in.

The Geography Markup Language (GML) is the XML grammar defined by the Open Geospatial Consortium (OGC) to express geographical features. GML serves as a modeling language for geographic systems as well as an open interchange format for geographic transactions on the Internet. Key to GML's utility is its ability to integrate all forms of geographic information, including not only conventional "vector" or discrete objects, but coverages and sensor data.

On the World Wide Web, a web feed is a data format used for providing users with frequently updated content. Content distributors syndicate a web feed, thereby allowing users to subscribe a channel to it by adding the feed resource address to a news aggregator client. Users typically subscribe to a feed by manually entering the URL of a feed or clicking a link in a web browser or by dragging the link from the web browser to the aggregator, thus "RSS and Atom files provide news updates from a website in a simple form for your computer."

The International Press Telecommunications Council (IPTC), based in London, United Kingdom, is a consortium of the world's major news agencies, other news providers and news industry vendors and acts as the global standards body of the news media.

An XML schema is a description of a type of XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntactical constraints imposed by XML itself. These constraints are generally expressed using some combination of grammatical rules governing the order of elements, Boolean predicates that the content must satisfy, data types governing the content of elements and attributes, and more specialized rules such as uniqueness and referential integrity constraints.

Learning Object Metadata is a data model, usually encoded in XML, used to describe a learning object and similar digital resources used to support learning. The purpose of learning object metadata is to support the reusability of learning objects, to aid discoverability, and to facilitate their interoperability, usually in the context of online learning management systems (LMS).

RDF Schema (Resource Description Framework Schema, variously abbreviated as RDFS, RDF(S), RDF-S, or RDF/S) is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

The Extensible Metadata Platform (XMP) is an ISO standard, originally created by Adobe Systems Inc., for the creation, processing and interchange of standardized and custom metadata for digital documents and data sets.

Catalogue Service for the Web (CSW), sometimes seen as Catalogue Service - Web, is a standard for exposing a catalogue of geospatial records in XML on the Internet. The catalogue is made up of records that describe geospatial data, geospatial services, and related resources.

RDFa or Resource Description Framework in Attributes is a W3C Recommendation that adds a set of attribute-level extensions to HTML, XHTML and various XML-based document types for embedding rich metadata within Web documents. The Resource Description Framework (RDF) data-model mapping enables its use for embedding RDF subject-predicate-object expressions within XHTML documents. It also enables the extraction of RDF model triples by compliant user agents.

The AgMES initiative was developed by the Food and Agriculture Organization (FAO) of the United Nations and aims to encompass issues of semantic standards in the domain of agriculture with respect to description, resource discovery, interoperability, and data exchange for different types of information resources.

The Metadata Encoding and Transmission Standard (METS) is a metadata standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language of the World Wide Web Consortium (W3C). The standard is maintained as part of the MARC standards of the Library of Congress, and is being developed as an initiative of the Digital Library Federation (DLF).

Web syndication technologies were preceded by metadata standards such as the Meta Content Framework (MCF) and the Resource Description Framework (RDF), as well as by 'push' specifications such as Channel Definition Format (CDF). Early web syndication standards included Information and Content Exchange (ICE) and RSS. More recent specifications include Atom and GData.

The Office Open XML file formats are a set of file formats that can be used to represent electronic office documents. There are formats for word processing documents, spreadsheets and presentations as well as specific formats for material such as mathematical formulas, graphics, bibliographies etc.

XHTML+RDFa is an extended version of the XHTML markup language for supporting RDF through a collection of attributes and processing rules in the form of well-formed XML documents. XHTML+RDFa is one of the techniques used to develop Semantic Web content by embedding rich semantic markup. Version 1.1 of the language is a superset of XHTML 1.1, integrating the attributes according to RDFa Core 1.1. In other words, it is an RDFa support through XHTML Modularization.

<span class="mw-page-title-main">Asset Description Metadata Schema</span>

The Asset Description Metadata Schema (ADMS) is a common metadata vocabulary to describe standards, so-called interoperability assets, on the Web.

References

↑ PRISM Metadata Standard
↑ "RDF Site Summary (RSS) 1.0". Archived from the original on 2013-01-12. Retrieved 2009-12-09.
↑ "RDF Site Summary 1.0 Modules: PRISM". Archived from the original on 2009-11-24. Retrieved 2009-12-09.

Publishing Requirements for Industry Standard Metadata

Contents

Overview

Background

Usage and Applications

See also

Related Research Articles

References

Further reading