Georelational data model

Last updated

A georelational data model is a geographic data model that represents geographic features as an interrelated set of spatial and attribute data. The georelational model was the dominant form of vector file format during the 1980s and 1990s, including the Esri coverage and Shapefile. [1]

Contents

History

The second era in the history of GIS, starting in the mid-1970s, was characterized by the rise of the first general-purpose GIS software programs (rather than the bespoke systems created in the 1960s and early 1970s). Each of these programs also created its own data file structures, primarily focused on finding innovative ways to store the spatial or geometric aspect of the data in the most efficient and error-free way. One example of this was the POLYVRT software and data structure (1973) from the Harvard Laboratory for Computer Graphics and Spatial Analysis, which inspired the Arc/INFO Coverage format. [2] :105 In experimental GIS software such as ODYSSEY, attribute data was only handled in a rudimentary way. Meanwhile, the relational database was quickly becoming the most promising software for managing non-spatial data, and several nascent GIS software companies chose to adopt it into their systems, especially Esri.

Although there were exceptions such as the object-oriented data models in Smallworld GIS (1989) and Intergraph's experimental TIGRIS, georelational data dominated the GIS industry until the rise of spatial databases in the late 1990s. Most of them are obsolete, although the Shapefile is still in common (if decreasing) use.

Georelational formats

In any vector data structure, the core unit is an object (either a geographic feature or a sample location for a field) that has a location in space (of 0, 1, 2, or 3 dimension) and a set of attributes. In the georelational model, these are stored as separate files: a geometry file that is usually custom-designed by a software developer for use in a particular program, and an attribute table that follows relational database principles; often, the latter is adopted directly from an existing relational database management system software.

Examples of commonly-used georelational data formats include:

ARC/INFO Coverage (Esri 1981-2005)
The name ARC/INFO literally reflected the georelational design of the software and the coverage format. The ARC model or Coverage was the topological vector data structure developed by ESRI, based on earlier structures developed at Harvard such as POLYVRT. INFO was a relational database developed by Henco Software, Inc. (originally for financial management) that was licensed by ESRI. In the Coverage structure, each point, line, or polygon had an identification number, which could be joined to the row in the INFO table with the same primary key, as in a relational table join. In an ARC/INFO workspace (=directory/folder), all of the INFO tables were stored in a separate directory from the directories for the ARC data for each coverage. [3] To process attribute data, the user had to leave the ARC program and start the INFO program. During the 1990s, Esri added support for other commercial RDBMS software for the attribute data.
MGE (Intergraph 1989-2000)
During the 1980s, Intergraph was an industry leader on workstation CAD with its IGDS software, including Microstation (developed by Bentley Systems). When it developed MGE (Modular GIS Environment), its first flagship GIS product, it directly incorporated the Microstation software as its interactive environment, and the Microstation Design File (.dgn, a non-topological vector graphics file format) for storing graphics. The associated attribute table could be stored in any RDBMS supported on Intergraph UNIX workstations, Informix being one of the most common. [4] An ID attached to each object in the design file enabled a relational join to the rows in the attribute table.
Shapefile (Esri 1992–present)
As the GIS industry grew to incorporate more casual users, the inherent complexity of the coverage data structure became a concern. When Esri released ArcView GIS 2.0 in 1992, it introduced the new shapefile format for vector data. This was a much simpler data model, eliminating features such as topology, but was still a georelational design. A shape-"file" actually consisted of several files, including at the very least a .shp file to store the geometry, and a .dbf file for the attributes, the latter directly adopting the dBase format that was the dominant microcomputer database at the time (despite it being a proprietary trade secret, the .dbf format had been legally reverse-engineered by the xBase community and published). Rather than using a relational join to connect the two files, the shapefile merely uses file order: the first shape matches the first attribute row, and so on. [5]

See also

Related Research Articles

<span class="mw-page-title-main">Esri</span> Geospatial software & SaaS company

Environmental Systems Research Institute, Inc., doing business as Esri, is an American multinational geographic information system (GIS) software company headquartered in Redlands, California. It is best known for its ArcGIS products. With a 40% market share, Esri is the world's leading supplier of GIS software, web GIS and geodatabase management applications.

A coverage is the digital representation of some spatio-temporal phenomenon. ISO 19123 provides the definition:

<span class="mw-page-title-main">Geometric primitive</span> Basic shapes represented in vector graphics

In vector computer graphics, CAD systems, and geographic information systems, geometric primitive is the simplest geometric shape that the system can handle. Sometimes the subroutines that draw the corresponding objects are called "geometric primitives" as well. The most "primitive" primitives are point and straight line segment, which were all that early vector graphics systems had.

A GIS file format is a standard for encoding geographical information into a computer file, as a specialized type of file format for use in geographic information systems (GIS) and other geospatial applications. Since the 1970s, dozens of formats have been created based on various data models for various purposes. They have been created by government mapping agencies, GIS software vendors, standards bodies such as the Open Geospatial Consortium, informal user communities, and even individual developers.

<span class="mw-page-title-main">TerraLib</span> Geographic information system software library

TerraLib is an open-source geographic information system (GIS) software library. It extends object-relational database management systems (DBMS) to handle spatiotemporal data types.

A GIS software program is a computer program to support the use of a geographic information system, providing the ability to create, store, manage, query, analyze, and visualize geographic data, that is, data representing phenomena for which location is important. The GIS software industry encompasses a broad range of commercial and open-source products that provide some or all of these capabilities within various information technology architectures.

ArcSDE is a server-software sub-system that aims to enable the usage of Relational Database Management Systems for spatial data. The spatial data may then be used as part of a geodatabase.

<span class="mw-page-title-main">Shapefile</span> Geospatial vector data format

The shapefile format is a geospatial vector data format for geographic information system (GIS) software. It is developed and regulated by Esri as a mostly open specification for data interoperability among Esri and other GIS software products. The shapefile format can spatially describe vector features: points, lines, and polygons, representing, for example, water wells, rivers, and lakes. Each item usually has attributes that describe it, such as name or temperature.

<span class="mw-page-title-main">ArcGIS</span> Geographic information system maintained by Esri

ArcGIS is a family of client, server and online geographic information system (GIS) software developed and maintained by Esri.

ArcInfo is a full-featured geographic information system produced by Esri, and is the highest level of licensing in the ArcGIS Desktop product line. It was originally a command-line based system. The command-line processing abilities are now available through the GUI of the ArcGIS Desktop product.

<span class="mw-page-title-main">GDAL</span> Translator library for raster and vector geospatial data formats

The Geospatial Data Abstraction Library (GDAL) is a computer software library for reading and writing raster and vector geospatial data formats, and is released under the permissive X/MIT style free software license by the Open Source Geospatial Foundation. As a library, it presents a single abstract data model to the calling application for all supported formats. It may also be built with a variety of useful command line interface utilities for data translation and processing. Projections and transformations are supported by the PROJ library.

MapInfo Pro is a desktop geographic information system (GIS) software developed by Precisely, used for mapping and location analysis. It was formerly developed by Pitney Bowes Software and the MapInfo Corporation.

ArcMap is the former main component of Esri's ArcGIS suite of geospatial processing programs. Used primarily to view, edit, create, and analyze geospatial data. ArcMap allows the user to explore data within a data set, symbolize features accordingly, and create maps. This is done through two distinct sections of the program, the table of contents and the data frame. In October 2020, it was announced that there are no plans to release 10.9 in 2021, and that ArcMap would no longer be supported after March 1, 2026. Esri is encouraging their users to transition to ArcGIS Pro.

A geographic data model, geospatial data model, or simply data model in the context of geographic information systems, is a mathematical and digital structure for representing phenomena over the Earth. Generally, such data models represent various aspects of these phenomena by means of geographic data, including spatial locations, attributes, change over time, and identity. For example, the vector data model represents geography as collections of points, lines, and polygons, and the raster data model represent geography as cell matrices that store numeric values. Data models are implemented throughout the GIS ecosystem, including the software tools for data management and spatial analysis, data stored in a variety of GIS file formats, specifications and standards, and specific designs for GIS installations.

The following tables compare general and technical information for a number of GIS vector file format. Please see the individual products' articles for further information. Unless otherwise specified in footnotes, comparisons are based on the stable versions without any add-ons, extensions or external programs.

<span class="mw-page-title-main">Geospatial topology</span> Type of spatial relationship

Geospatial topology is the study and application of qualitative spatial relationships between geographic features, or between representations of such features in geographic information, such as in geographic information systems (GIS). For example, the fact that two regions overlap or that one contains the other are examples of topological relationships. It is thus the application of the mathematics of topology to GIS, and is distinct from, but complementary to the many aspects of geographic information that are based on quantitative spatial measurements through coordinate geometry. Topology appears in many aspects of geographic information science and GIS practice, including the discovery of inherent relationships through spatial query, vector overlay and map algebra; the enforcement of expected relationships as validation rules stored in geospatial data; and the use of stored topological relationships in applications such as network analysis. Spatial topology is the generalization of geospatial topology for non-geographic domains, e.g., CAD software.

The Esri TIN format is a popular yet proprietary geospatial vector data format for geographic information system (GIS) software for storing elevation data as a triangulated irregular network. It is developed and regulated by Esri, US. The Esri TIN format can spatially describe elevation information including breaking edge features. Each points and triangle can carry a tag information. A TIN stored in this file format can have any shape, cover multiple regions and contain holes.

The Harvard Laboratory for Computer Graphics and Spatial Analysis pioneered early cartographic and architectural computer applications that led to integrated geographic information systems (GIS). Some of the Laboratory's influential programs included SYMAP, SYMVU, GRID, CALFORM, and POLYVRT. The Laboratory's Odyssey project created a geographic information system that served as a milestone in the development of integrated mapping systems. The Laboratory influenced numerous computer graphic, mapping and architectural systems such as Intergraph, Computervision, and Esri.

Vector overlay is an operation in a geographic information system (GIS) for integrating two or more vector spatial data sets. Terms such as polygon overlay, map overlay, and topological overlay are often used synonymously, although they are not identical in the range of operations they include. Overlay has been one of the core elements of spatial analysis in GIS since its early development. Some overlay operations, especially Intersect and Union, are implemented in all GIS software and are used in a wide variety of analytical applications, while others are less common.

A Geodatabase is a proprietary GIS file format developed in the late 1990s by Esri to represent, store, and organize spatial datasets within a geographic information system. A geodatabase is both a logical data model and the physical implementation of that logical model in several proprietary file formats released during the 2000s. The geodatabase design is based on the spatial database model for storing spatial data in relational and object-relational databases. Given the dominance of Esri in the GIS industry, the term "geodatabase" is used by some as a generic trademark for any spatial database, regardless of platform or design.

References

  1. Wade, T. and Sommer, S. eds. A to Z GIS
  2. Chrisman, Nick (2006). Charting the Unknown: How Computer Mapping at Harvard Became GIS. Esri Press. ISBN   978-1-58948-118-3.
  3. Kennedy, Michael (2006). Introducing Geographic Information Systems with ArcGIS. Wiley. pp. 220–235. ISBN   978-0-471-79229-1.
  4. Korte, George B. (1994). The GIS Book (3rd ed.). OnWord Press. p. 63.
  5. ESRI (July 1998). "ESRI Shapefile Technical Description" (PDF). Retrieved 2007-07-04.{{cite journal}}: Cite journal requires |journal= (help)