International Press Telecommunications Council

Last updated

The International Press Telecommunications Council (IPTC), based in London, United Kingdom, is a consortium of the world's major news agencies, other news providers and news industry vendors and acts as the global standards body of the news media.

Contents

Currently more than 50 companies and organizations from the news industry are members of the IPTC, including global players like Associated Press (AP), Agence France-Presse (AFP), Deutsche Presse-Agentur (dpa), BBC, Getty Images, Press Association (PA), Reuters and The New York Times .

IPTC aims at simplifying the distribution of information. To achieve this technical standards are developed to improve the management and exchange of information between content providers, intermediaries and consumers. IPTC is committed to open standards and makes all standards freely available to its members and the wider community.

The IPTC was established in 1965 by a group of news organisations including the Alliance Européenne des Agences de Presse (EANA), American Newspaper Publishers Association (then ANPA, now NAA), Fédération Internationale des Editeurs de Journaux (now WAN-IFRA) and the North American News Agencies (a joint committee of Associated Press, Canadian Press and United Press International) to safeguard the telecommunications interests of the world's press.

IPTC Standards

Photo metadata

The IPTC defined a set of metadata properties that can be applied to images, part of a broader standard developed in the early 1990s and known as the IPTC Information Interchange Model (IIM). Embedded IIM image information is often referred to as an "IPTC header". This basic set of metadata fields is branded as IPTC Core, in 2008 a second set branded as IPTC Extension was developed.

The Extensible Metadata Platform (XMP) has largely superseded IIM's image file header structure, but the properties of the IPTC Core are synchronized between the technical structures of XMP and IIM by a vast majority of imaging software.

Because of its nearly universal acceptance among photographers even amateurs this is by far IPTC's most widely used standard.

IPTC NewsML-G2-Standards Family

Using XML and the W3C's model for the Semantic Web, in 2008 IPTC launched a new series of standards that feature modular construction and many opportunities for embedding metadata. This allows developers to build applications that use only the parts of the IPTC NewsML-G2-Standards that are required by the customer, and that reduce programming costs by re-using XML and metadata modules. Further the NewsML-G2-Standards extend the scope of exchange formats beyond news content, they include event data and well organised information about persons, organisations, points of interest, geopolitical areas or abstract concepts.

These members of the family of standards are available now: NewsML-G2, EventsML-G2 and SportsML-G2. Although they have different roles within the news industry they share much of the date model and the same XML coding.

NewsML-G2

NewsML-G2 acts as an envelope and organizer so that news providers can create single news items text, photos, video or anything else and bundle them into concise cohesive packages than can be automatically processed by web CMSs or newsroom systems. The content and relationships of the individual news items can be described using a rich set of metadata. [1]

NewsML-G2 is not a text or image mark-up format. For example, it does not contain paragraph or headline tagging. Rather, it is an envelope and organizer that does not, by itself, act as a content format. For text article formatting, IPTC mentions News Industry Text Format (NITF) or XHTML as common examples. [2]

NewsML-G2 is used by several large international news agencies, including Agence France-Presse, Associated Press, Austria Presse Agentur, Deutsche Presse-Agentur, Fourth Estate, Thomson Reuters, and others. [3] [4]

EventsML-G2

EventsML-G2 optimises the sharing of event information such as meetings, sports events, elections, and even rocket launches — virtually any announced activity that can be broadly classified as an "event." Using XML and metadata, EventsML-G2 allows news providers to create rich descriptions of events plus information about planned coverage, and then share that information with web sites, databases, newsrooms or other customers.

EventsML-G2 is not a calendaring standard, but rather a way for news agencies to share event information to populate their own calendars. The events need not have fixed starting or ending dates, nor do they even need to have a fixed length.

SportsML-G2

SportsML-G2 is a convenient way to share sports statistics in a concise, unambiguous way. All major sports are supported, and certain sports that are known for rich or especially complex statistics (such as baseball) can use special add-on modules. SportsML-G2 has found wide usage outside the news industry; among the user communities are sports teams, fantasy sports leagues, sports betting firms and sports historians. SportsML-G2 is the newer sibling of SportsML.

Version 2.0 of SportsML-G2 was released in October 2008.

NewsCodes

NewsCodes is the brand name of the set of Controlled Vocabularies created and maintained by IPTC. One group of NewsCodes can be used to classify news content - regardless of the media type - and another set is used to support the functionality of IPTC's news exchange format standards.

Media Topics

The Media Topics is the IPTC's recommended controlled vocabulary for classifying media content by topic. Around 1,200 terms in a hierarchy describe the subject of news content in a generic, international way. The IPTC created Media Topics in 2010 as a successor to the IPTC'S previous subject taxonomy, SubjectCodes.

The Media Topics vocabulary is actively maintained by the IPTC NewsCodes Working Group. Updates are released every two to three months including new terms and clarifications to existing terms.

Translations of the Media Topics vocabulary are available in 12 languages and language variants: [5] American English (en-US), Arabic (ar), Brazilian Portuguese (pt-BR), British English (en-GB), Danish (dk), French (fr), German (de), Norwegian (no), Portuguese (pt-PT), Simplified Chinese (zh-Hans), Spanish (es) and Swedish (se).

NewsML 1

NewsML 1 is an XML standard developed by the IPTC to provide a media-independent, structural enveloping framework for multi-media news.

It has also been adopted as Japanese Industrial Standards (JIS) X7201:2005.

Although superseded in 2008 by NewsML-G2, NewsML 1 (as the original NewsML is now known) remains popular outside North America (especially in Asia) and IPTC plans to support this standard as NewsML 1.x indefinitely.

ninjs

ninjs standardizes the representation of news in JSON - a data interchange format. The format covers the data interchange by APIs and documents in content management systems like search engines.

NITF - News Industry Text Format

News Industry Text Format (NITF) is an XML specification published by the IPTC that is designed to standardize the content and structure of text-based news articles.

RightsML

RightsML is IPTC's Rights Expression Language for news media and is based on ODRL.

rNews

rNews is a standard for using semantic markup to annotate online news. It defines a data model for embedding machine-readable publishing metadata in web documents and a set of suggested implementations. Some properties of this standard have been adopted by Schema.org for its metadata schema.

SportsML

SportsML is an XML news exchange standard for sharing sports statistics in a concise, unambiguous way. All major sports are supported, and certain sports that are known for rich or especially complex statistics (such as baseball) can use special add-on modules. SportsML has found wide usage outside the news industry; among the user communities are sports teams, fantasy sports leagues, sports betting firms and sports historians.

In June 2008, the IPTC approved SportsML-G2 as a parallel standard to SportsML. Although SportsML-G2 is built from standardized XML modules that are used by other IPTC G2 standards, both standards effectively accomplish the same task.

IPTC 7901

IPTC 7901 is still a widely used format for plain text news, despite being 30 years old. It is a close relative of the Newspaper Association of America North American standard ANPA-1312, and uses similar control and other special characters to create a file that can be used to drive computerized news editing systems, photo typesetters or even teleprinter machines.

Related Research Articles

<span class="mw-page-title-main">XML</span> Markup language by the W3C for encoding of data

Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. The World Wide Web Consortium's XML 1.0 Specification of 1998 and several other related specifications—all of them free open standards—define XML.

<span class="mw-page-title-main">Exif</span> Metadata standard in digital images

Exchangeable image file format is a standard that specifies formats for images, sound, and ancillary tags used by digital cameras, scanners and other systems handling image and sound files recorded by digital cameras. The specification uses the following existing encoding formats with the addition of specific metadata tags: JPEG lossy coding for compressed image files, TIFF Rev. 6.0 for uncompressed image files, and RIFF WAV for audio files. It does not support JPEG 2000 or GIF encoded images.

The Open Digital Rights Language (ODRL) is a policy expression language that provides a flexible and interoperable information model, vocabulary, and encoding mechanisms for representing statements about the usage of content and services. ODRL became an endorsed W3C Recommendation in 2018.

The Darwin Information Typing Architecture (DITA) specification defines a set of document types for authoring and organizing topic-oriented information, as well as a set of mechanisms for combining, extending, and constraining document types. It is an open standard that is defined and maintained by the OASIS DITA Technical Committee.

News Industry Text Format (NITF) is an XML specification designed to standardize the content and structure of individual text news articles.

The Extensible Metadata Platform (XMP) is an ISO standard, originally created by Adobe Systems Inc., for the creation, processing and interchange of standardized and custom metadata for digital documents and data sets.

RDFa or Resource Description Framework in Attributes is a W3C Recommendation that adds a set of attribute-level extensions to HTML, XHTML and various XML-based document types for embedding rich metadata within Web documents. The Resource Description Framework (RDF) data-model mapping enables its use for embedding RDF subject-predicate-object expressions within XHTML documents. It also enables the extraction of RDF model triples by compliant user agents.

The Open Packaging Conventions (OPC) is a container-file technology initially created by Microsoft to store a combination of XML and non-XML files that together form a single entity such as an Open XML Paper Specification (OpenXPS) document. OPC-based file formats combine the advantages of leaving the independent file entities embedded in the document intact and resulting in much smaller files compared to normal use of XML.

<span class="mw-page-title-main">FastPictureViewer</span>

FastPictureViewer is a freemium image viewer for Windows XP and later. Its aim is to facilitate quick review, rating and annotation of large quantities of digital images in the early steps of the digital workflow, with an emphasis on simplicity and speed. As an app with a freemium license, a basic version is available cost-free for personal, non-profit or educational uses, while a commercial license is required for the professional version with additional features. The basic version starts as a full version trial.

The Information Interchange Model (IIM) is a file structure and set of metadata attributes that can be applied to text, images and other media types. It was developed in the early 1990s by the International Press Telecommunications Council (IPTC) to expedite the international exchange of news among newspapers and news agencies.

IPTC 7901 is a news service text markup specification published by the International Press Telecommunications Council that was designed to standardize the content and structure of text news articles. It was formally approved in 1979, and is still the world's most common way of transmitting news articles to newspapers, web sites and broadcasters from news services.

ANPA-1312 is a 7-bit news agency text markup specification published by the Newspaper Association of America, designed to standardize the content and structure of text news articles.

<span class="mw-page-title-main">Metadata</span> Data about data

Metadata is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:

Extensible HyperText Markup Language (XHTML) is part of the family of XML markup languages. It mirrors or extends versions of the widely used HyperText Markup Language (HTML), the language in which Web pages are formulated.

<span class="mw-page-title-main">EPUB</span> E-book file format

EPUB is an e-book file format that uses the ".epub" file extension. The term is short for electronic publication and is sometimes styled ePub. EPUB is supported by many e-readers, and compatible software is available for most smartphones, tablets, and computers. EPUB is a technical standard published by the International Digital Publishing Forum (IDPF). It became an official standard of the IDPF in September 2007, superseding the older Open eBook (OEB) standard.

The Office Open XML file formats are a set of file formats that can be used to represent electronic office documents. There are formats for word processing documents, spreadsheets and presentations as well as specific formats for material such as mathematical formulae, graphics, bibliographies etc.

The Publishing Requirements for Industry Standard Metadata (PRISM) for the Internet, computing, and computer science, is a specification that defines a set of XML metadata vocabularies for syndicating, aggregating, post-processing and multi-purposing content.

The Quake Markup Language (QuakeML) is a flexible, extensible and modular XML representation of seismological data which is intended to cover a broad range of fields of application in modern seismology.

References

  1. "IPTC - NewsML-G2 - Introduction". IPTC. Retrieved 27 June 2013.
  2. "7.6 The Content" (PDF). NewsML-G2 Implementation Guide (5 ed.). IPTC. 2012. pp. 49–51.
  3. "IPTC - NewsML-G2 - Who's using it". IPTC. Retrieved 27 June 2013.
  4. "dpa stellt Produktion auf Datenformat NewsML-G2 um". Deutsche Presse-Agentur (in German). 8 October 2013. Retrieved 26 October 2014.
  5. "NewsCodes guidelines". iptc.org. IPTC. Retrieved 20 April 2021.