Geospatial metadata

Geospatial metadata (also geographic metadata) is a type of metadata applicable to geographic data and information. The objects it describes may be stored in a geographic information system (GIS), or may be documents, data sets, images, services, or related items that exist in some other native environment but whose features are appropriate to describe in a geographic metadata catalog (also known as a data directory or data inventory).

Definition

ISO 19115-1:2014 "Geographic Information – Metadata" [1] from ISO/TC 211, the industry standard for geospatial metadata, describes its scope as follows:

[This standard] provides information about the identification, the extent, the quality, the spatial and temporal aspects, the content, the spatial reference, the portrayal, distribution, and other properties of digital geographic data and services. [1]

ISO 19115-1:2014 also provides for non-digital media:

Though this part of ISO 19115 is applicable to digital data and services, its principles can be extended to many other types of resources such as maps, charts, and textual documents as well as non-geographic data. [1]

The U.S. Federal Geographic Data Committee (FGDC) describes geospatial metadata as follows:

A metadata record is a file of information, usually presented as an XML document, which captures the basic characteristics of a data or information resource. It represents the who, what, when, where, why and how of the resource. Geospatial metadata commonly document geographic digital data such as Geographic Information System (GIS) files, geospatial databases, and earth imagery but can also be used to document geospatial resources including data catalogs, mapping applications, data models and related websites. Metadata records include core library catalog elements such as Title, Abstract, and Publication Data; geographic elements such as Geographic Extent and Projection Information; and database elements such as Attribute Label Definitions and Attribute Domain Values. [2]

History

The growing appreciation of the value of geospatial metadata through the 1980s and 1990s led to a number of initiatives to collect metadata in a variety of formats within agencies, communities of practice, and countries or groups of countries. For example, NASA's "DIF" metadata format was developed during an Earth Science and Applications Data Systems Workshop in 1987 [3] and formally approved for adoption in 1988. Similarly, the U.S. FGDC developed its geospatial metadata standard over the period 1992–1994. [4] The Spatial Information Council of Australia and New Zealand (ANZLIC), [5] a combined body representing spatial data interests in Australia and New Zealand, released version 1 of its "metadata guidelines" in 1996. [6]

ISO/TC 211 undertook the task of harmonizing the range of formal and de facto standards over the approximate period 1999–2002, resulting in the release of ISO 19115 "Geographic Information – Metadata" in 2003 and a subsequent revision, ISO 19115-1, in 2014. As of 2011, individual countries, communities of practice, and agencies had started recasting their previously used metadata standards as "profiles" or recommended subsets of ISO 19115, occasionally with additional metadata elements included as formal extensions to the ISO standard.

The growth in popularity of Internet technologies and data formats such as Extensible Markup Language (XML) during the 1990s led to the development of mechanisms for exchanging geographic metadata on the web. In 2004, the Open Geospatial Consortium released version 3.1 of Geography Markup Language (GML), an XML grammar for expressing geospatial features and corresponding metadata.

With the growth of the Semantic Web in the 2000s, the geospatial community began to develop ontologies for representing semantic geospatial metadata. Examples include the Hydrology and Administrative ontologies developed by the Ordnance Survey in the United Kingdom.

ISO 19115: Geographic information – Metadata

ISO 19115 is a standard of the International Organization for Standardization (ISO). [7] The standard is part of the ISO geographic information suite of standards (19100 series). ISO 19115 and its parts define how to describe geographical information and associated services, including contents, spatio-temporal extent, data quality, access and rights to use.

The objective of this International Standard is to provide a clear procedure for the description of digital geographic data-sets so that users will be able to determine whether the data in a holding will be of use to them and how to access the data. By establishing a common set of metadata terminology, definitions and extension procedures, this standard promotes the proper use and effective retrieval of geographic data. [8]

ISO 19115 was revised (as ISO 19115-1:2014) to accommodate the growing use of the internet for metadata management, to add many new categories of metadata elements (referred to as codelists), and to allow the extent of metadata use to be limited temporally or by user. [9]

ISO 19139 Geographic information Metadata XML schema implementation

ISO 19139:2012 [10] provides the XML implementation schema for ISO 19115 specifying the metadata record format and may be used to describe, validate, and exchange geospatial metadata prepared in XML. [11]

The standard is part of the ISO geographic information suite of standards (19100 series). It provides a spatial metadata eXtensible Markup Language (smXML) encoding, an XML schema implementation derived from ISO 19115, Geographic information – Metadata. The metadata includes information about the identification, constraints, extent, quality, spatial and temporal reference, distribution, lineage, and maintenance of the digital geographic data set.
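As a rough illustration of what such an XML encoding looks like in practice, the following Python sketch reads a title, abstract and geographic bounding box from a small ISO 19139-style fragment using only the standard library. The record content is invented, the fragment omits elements a schema-valid record would require, and the function name `summarize_record` is hypothetical; only the `gmd` and `gco` namespace URIs and element names are taken from the published schema.

```python
import xml.etree.ElementTree as ET

# Namespace URIs used by the ISO 19139 (2005/2007) XML schemas.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Invented, deliberately incomplete record: enough to show the nesting,
# not enough to validate against the full schema.
SAMPLE_RECORD = """
<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd"
                 xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation>
        <gmd:CI_Citation>
          <gmd:title>
            <gco:CharacterString>Example river network</gco:CharacterString>
          </gmd:title>
        </gmd:CI_Citation>
      </gmd:citation>
      <gmd:abstract>
        <gco:CharacterString>Invented dataset used for illustration.</gco:CharacterString>
      </gmd:abstract>
      <gmd:extent>
        <gmd:EX_Extent>
          <gmd:geographicElement>
            <gmd:EX_GeographicBoundingBox>
              <gmd:westBoundLongitude><gco:Decimal>-10.5</gco:Decimal></gmd:westBoundLongitude>
              <gmd:eastBoundLongitude><gco:Decimal>2.0</gco:Decimal></gmd:eastBoundLongitude>
              <gmd:southBoundLatitude><gco:Decimal>49.0</gco:Decimal></gmd:southBoundLatitude>
              <gmd:northBoundLatitude><gco:Decimal>61.0</gco:Decimal></gmd:northBoundLatitude>
            </gmd:EX_GeographicBoundingBox>
          </gmd:geographicElement>
        </gmd:EX_Extent>
      </gmd:extent>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>
"""

BBOX_FIELDS = (
    "westBoundLongitude",
    "eastBoundLongitude",
    "southBoundLatitude",
    "northBoundLatitude",
)


def summarize_record(xml_text):
    """Return the title, abstract and bounding box of one metadata record."""
    root = ET.fromstring(xml_text)
    ident = root.find("gmd:identificationInfo/gmd:MD_DataIdentification", NS)
    title = ident.findtext(
        "gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString",
        namespaces=NS,
    )
    abstract = ident.findtext("gmd:abstract/gco:CharacterString", namespaces=NS)
    box = ident.find(".//gmd:EX_GeographicBoundingBox", NS)
    bbox = {
        name: float(box.findtext(f"gmd:{name}/gco:Decimal", namespaces=NS))
        for name in BBOX_FIELDS
    }
    return {"title": title, "abstract": abstract, "bbox": bbox}


summary = summarize_record(SAMPLE_RECORD)
print(summary["title"])
print(summary["bbox"])
```

Real records are considerably larger, so a production reader would also need to handle missing elements and multiple identification or extent blocks.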

Metadata directories

Metadata directories are also known as metadata catalogues or data directories. Examples include NASA's Global Change Master Directory (GCMD), the FGDC metadata gateway, the Australian Spatial Data Directory (ASDD), and various European and Canadian initiatives.

Geospatial metadata tools

There are many proprietary GIS or geospatial products that support metadata viewing and editing on GIS resources. For example, ESRI's ArcGIS Desktop, SOCET GXP, Autodesk's AutoCAD Map 3D 2008, Arcitecta's Mediaflux and Intergraph's GeoMedia support geospatial metadata extensively.

GIS Inventory is a free web-based tool that provides a very simple interface for creating geospatial metadata. Participants create a profile and document their data layers through a survey-style interface. The GIS Inventory produces metadata that is compliant with the Federal Content Standard for Digital Geospatial Metadata (CSDGM). The GIS Inventory is also capable of ingesting already completed metadata through document upload and web server connectivity. Through the GIS Inventory web services, metadata are automatically shared with US Federal agencies.

GeoNetwork opensource is a comprehensive Free and Open Source Software solution to manage and publish geospatial metadata and services based on international metadata and catalog standards. The software is part of the Open Source Geospatial Foundation's software stack.

GeoCat Bridge allows users to edit, validate and directly publish metadata from ArcGIS Desktop to GeoNetwork (and generic CSW catalogs) and publishes data as map services on GeoServer. Several metadata profiles are supported.

pycsw is an OGC CSW server implementation written in Python. pycsw fully implements the OpenGIS Catalogue Service Implementation Specification (Catalogue Service for the Web). The project is certified OGC Compliant, and is an OGC Reference Implementation.
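To give a sense of how a client talks to a CSW catalog such as pycsw, the sketch below builds two key-value-pair request URLs for CSW 2.0.2: one asking for the service capabilities and one asking for a single record in the ISO metadata schema. It only constructs URLs and sends nothing over the network. The endpoint `https://catalog.example.org/csw`, the record identifier and the helper name `csw_url` are assumptions made for illustration.

```python
from urllib.parse import urlencode

# Hypothetical catalog endpoint; substitute the address of a real CSW server.
ENDPOINT = "https://catalog.example.org/csw"

# Output schema URI that asks the server for ISO 19139 (gmd) records.
ISO_SCHEMA = "http://www.isotc211.org/2005/gmd"


def csw_url(endpoint, request, **params):
    """Build a CSW 2.0.2 key-value-pair GET URL for the given request type."""
    query = {"service": "CSW", "version": "2.0.2", "request": request}
    query.update(params)
    return f"{endpoint}?{urlencode(query)}"


# Ask the server to describe itself and the operations it supports.
capabilities_url = csw_url(ENDPOINT, "GetCapabilities")

# Ask for one full record; the identifier here is invented.
record_url = csw_url(
    ENDPOINT,
    "GetRecordById",
    id="example-record-001",
    elementSetName="full",
    outputSchema=ISO_SCHEMA,
)

print(capabilities_url)
print(record_url)
```

The resulting URLs could be fetched with any HTTP client; the responses are XML documents whose exact content depends on the server and the profiles it supports.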

Other tools include CATMDEdit, terraCatalog, ArcCatalog, ArcGIS Server Portal, GeoNetwork opensource, IME, M3CAT, MetaD, MetaGenie, Parcs Canada Metadata Editor, Mapit/CADit, and NOKIS Editor.

References

  1. International Organization for Standardization (1 April 2014). "ISO 19115-1:2014(en)". ISO. Retrieved 1 April 2016.
  2. "Geospatial Metadata – Federal Geographic Data Committee". www.fgdc.gov. Retrieved 1 April 2016.
  3. Gene Major and Lola Olsen: "A short history of the DIF". On GCMD website, visited 16 October 2006 Archived 17 October 2006 at the Wayback Machine
  4. MIT Libraries Guide: "Federal Geographic Data Committee (FGDC) Metadata". On MIT Libraries website, visited 16 October 2006 Archived 18 October 2006 at the Wayback Machine
  5. "ANZLIC Metadata Profile Guidelines version 1.2 July 2011" (PDF). ANZLIC. 2011. Archived from the original (PDF) on 18 March 2016. Retrieved 11 April 2011. ANZLIC[:] The Spatial Information Council of Australia and New Zealand (formerly known as the Australia New Zealand Land Information Council)
  6. ANZLIC Metadata Guidelines: Core metadata elements for geographic data in Australia and New Zealand, Version 2 (February 2001)
  7. ISO 19115 Geographic Information – Metadata. International Organization for Standardization (ISO), Geneva, 2003
  8. "ISO 19115 Metadata Factsheet" (PDF). AG Outreach. Retrieved 22 November 2012.
  9. "NASA Metadata and the New ISO 19115-1 Capabilities - NASA ISO for EOSDIS - Earthdata Wiki". wiki.earthdata.nasa.gov. Retrieved 1 April 2016.
  10. International Organization for Standardization (15 December 2012). "ISO/TS 19139-2:2012(en)". ISO. Retrieved 1 April 2016.
  11. "ISO 19139 Geographic information Metadata XML schema implementation", Marine Metadata Interoperability Project. Archived 9 August 2010 at the Wayback Machine

ANZLIC Metadata Profile Version 1.2 (viewed July 2011)