Georeferencing

Georeferencing or georegistration is a type of coordinate transformation that binds a digital raster image or vector database that represents a geographic space (usually a scanned map or aerial photograph) to a spatial reference system, thus locating the digital data in the real world. [1] [2] It is thus the geographic form of image registration. The term can refer to the mathematical formulas used to perform the transformation, the metadata stored alongside or within the image file to specify the transformation, or the process of manually or automatically aligning the image to the real world to create such metadata. The most common result is that the image can be visually and analytically integrated with other geographic data in geographic information systems and remote sensing software.

A number of mathematical methods are available, but the process typically involves identifying several sample ground control points with known locations on the image and the ground, then using curve fitting techniques to generate a parametric (or piecewise parametric) formula to transform the rest of the image. [3] Once the parameters of the formula are stored, the image may be transformed dynamically at drawing time, or resampled to generate a georeferenced raster GIS file or orthophoto.

The term georeferencing has also been used to refer to other types of transformation from general expressions of geographic location (geocodes) to coordinate measurements, [4] but most of these other methods are more commonly called geocoding. Because of this ambiguity, some prefer the term georegistration for the image transformation. [5] :141–143 Occasionally, this process has been called rubbersheeting, but that term is more commonly applied to a very similar process applied to vector GIS data. [5] :240

Motivation

Mathematics

Graphical view of the affine transformation.

The registration of an image to a geographic space is essentially the transformation from an input coordinate system (the inherent coordinates of pixels in the image, based on row and column number) to an output coordinate system, a spatial reference system of the user's choice, such as the geographic coordinate system or a particular Universal Transverse Mercator zone. It is thus the extension of the typical task of curve fitting a relationship between two variables to four dimensions. The goal is to have a pair of functions of the form:

$$x_{out} = f(x_{in}, y_{in}), \qquad y_{out} = g(x_{in}, y_{in})$$

such that for every pixel in the image ($x_{in}$ and $y_{in}$ being its column and row number, respectively), a corresponding real-world coordinate $(x_{out}, y_{out})$ can be calculated.

Several types of functions are available in most GIS and remote sensing software for georeferencing. [6] Just as the simplest type of two-dimensional curve is a straight line, the simplest form of coordinate transformation is a linear transformation, the most common type being the affine transformation: [7] :171

$$x_{out} = A\,x_{in} + B\,y_{in} + C$$
$$y_{out} = D\,x_{in} + E\,y_{in} + F$$

where $x_{in}$ and $y_{in}$ are the pixel column and row, $(x_{out}, y_{out})$ is the corresponding output coordinate, and A–F are constant coefficients set for the entire image. These formulas allow an image to be moved (the C and F coefficients specify the desired location of the top left corner of the image), scaled (without rotation, the A and E coefficients specify the size of each cell, or spatial resolution), and rotated. [8] :115 In the last case, if the cell size is r in both the x and y directions and the image is to be rotated α degrees counter-clockwise, then $|A| = |E| = r\,|\cos\alpha|$ and $|B| = |D| = r\,|\sin\alpha|$, with the signs depending on the axis conventions in use. The world file developed by Esri is a commonly used sidecar file that specifies these six coefficients for image georeferencing.
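As an illustration, the following minimal Python sketch applies an affine transformation of the form x = A·col + B·row + C, y = D·col + E·row + F to pixel coordinates. The function names `pixel_to_world` and `parse_world_file` and the sample coefficient values are hypothetical. The parser assumes the usual world file line order (A, D, B, E, C, F), and the example assumes a north-up image, for which E is negative because row numbers increase downward.

```python
def pixel_to_world(col, row, coeffs):
    """Apply x = A*col + B*row + C and y = D*col + E*row + F."""
    A, B, C, D, E, F = coeffs
    x = A * col + B * row + C
    y = D * col + E * row + F
    return x, y


def parse_world_file(text):
    """Read six numbers in world file line order (A, D, B, E, C, F)
    and return them reordered as (A, B, C, D, E, F)."""
    values = [float(token) for token in text.split()]
    if len(values) != 6:
        raise ValueError("a world file must contain exactly six numbers")
    A, D, B, E, C, F = values
    return (A, B, C, D, E, F)


# Hypothetical north-up image: 10 m cells, top-left pixel at (500000, 4200000).
example = "10.0\n0.0\n0.0\n-10.0\n500000.0\n4200000.0\n"
coeffs = parse_world_file(example)

print(pixel_to_world(0, 0, coeffs))     # (500000.0, 4200000.0)
print(pixel_to_world(100, 50, coeffs))  # (501000.0, 4199500.0)
```

Moving 100 columns to the right adds 1000 m to x, and moving 50 rows down subtracts 500 m from y, as expected for 10 m cells.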

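Higher-order polynomial transformations are also commonly used. For example, a second-order polynomial transformation has the general form:

$$x_{out} = a_0 + a_1 x_{in} + a_2 y_{in} + a_3 x_{in} y_{in} + a_4 x_{in}^2 + a_5 y_{in}^2$$
$$y_{out} = b_0 + b_1 x_{in} + b_2 y_{in} + b_3 x_{in} y_{in} + b_4 x_{in}^2 + b_5 y_{in}^2$$

where $x_{in}$ and $y_{in}$ are the pixel column and row and $a_0$–$a_5$ and $b_0$–$b_5$ are twelve constant coefficients. The second-order terms (and third-order terms in a third-order polynomial) allow for the variable warping of the image, which is especially useful for removing the inherent distortion in aerial photographs.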
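A second-order polynomial can be evaluated in the same way as the affine case, only with more terms. The sketch below is illustrative: the term ordering (constant, column, row, column·row, column², row²), the function names, and the coefficient values are all assumptions made for the example, not a convention taken from any particular software package.

```python
def poly2_terms(col, row):
    """Return the six terms of a second-order polynomial in (col, row)."""
    return (1.0, col, row, col * row, col * col, row * row)


def apply_poly2(col, row, x_coeffs, y_coeffs):
    """Evaluate both second-order polynomials for one pixel.

    x_coeffs and y_coeffs each hold six coefficients in the same
    order as the terms returned by poly2_terms.
    """
    terms = poly2_terms(col, row)
    x = sum(a * t for a, t in zip(x_coeffs, terms))
    y = sum(b * t for b, t in zip(y_coeffs, terms))
    return x, y


# With all second-order coefficients set to zero, the polynomial
# reduces to an affine transformation (hypothetical values).
x_affine = (1000.0, 10.0, 0.0, 0.0, 0.0, 0.0)
y_affine = (2000.0, 0.0, -10.0, 0.0, 0.0, 0.0)
print(apply_poly2(3, 4, x_affine, y_affine))  # (1030.0, 1960.0)

# A small col**2 coefficient stretches the image progressively
# toward the right-hand edge, which a single affine cannot do.
x_warped = (1000.0, 10.0, 0.0, 0.0, 0.01, 0.0)
print(apply_poly2(100, 0, x_warped, y_affine))
```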

In addition to global parametric formulas, piecewise formulas can also be used, which transform different parts of the image in different ways. A common example is a thin plate spline transformation. [9]

The GCP method

It is very rare that a user would specify the parameters for the transformation directly. Instead, most GIS and remote sensing software provides an interactive environment for visually aligning the image to the destination coordinate system. The most common method for doing this is to create a series of ground control points (GCPs). [7] :170 A ground control point is a location that can be identified on both the image and the ground, so that it has precise coordinates in both the image coordinate system ($x_{in}$ = pixel column, $y_{in}$ = pixel row) and the ground coordinate system ($x_{out}$, $y_{out}$). Easily visible locations that can be precisely located are preferred as GCPs, such as a road intersection or the corner of a building. When very high accuracy registration is required, it is common to place or paint high-contrast markers on the ground at survey control monuments before the photography is taken, and to use GNSS-measured coordinates for the output. In most software, GCPs are entered by pointing at the location on the image, then pointing at the same location on a vector base map or orthophoto that is already in the desired coordinate system. Each point can then be moved and adjusted to improve accuracy.

With a minimal set of GCPs, the known coordinates can be entered into the mathematical equations for the desired type of transformation, which can then be solved using linear algebra to determine the coefficients and derive the formulas to use for the entire grid. [8] :116 For example, the linear affine transformation above has six unknown coefficients, so six equations with known image and ground coordinates are needed to derive them. Because each ground control point supplies two equations (one for x and one for y), this requires three ground control points. [7] :171 The second-order polynomial requires a minimum of six ground control points, and so on.
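The minimal case can be sketched in a few lines of Python. Assuming the affine form x = A·col + B·row + C, y = D·col + E·row + F, three GCPs give two 3×3 linear systems that share the same matrix. The helper names, the Gaussian elimination routine, and the sample points below are illustrative assumptions rather than the method of any specific GIS package.

```python
def solve_linear(matrix, rhs):
    """Solve a small square system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [[float(v) for v in matrix[i]] + [float(rhs[i])] for i in range(n)]
    for k in range(n):
        pivot = max(range(k, n), key=lambda i: abs(aug[i][k]))
        if abs(aug[pivot][k]) < 1e-12:
            raise ValueError("control points are collinear or duplicated")
        aug[k], aug[pivot] = aug[pivot], aug[k]
        for i in range(k + 1, n):
            factor = aug[i][k] / aug[k][k]
            for j in range(k, n + 1):
                aug[i][j] -= factor * aug[k][j]
    solution = [0.0] * n
    for i in reversed(range(n)):
        known = sum(aug[i][j] * solution[j] for j in range(i + 1, n))
        solution[i] = (aug[i][n] - known) / aug[i][i]
    return solution


def affine_from_three_gcps(gcps):
    """Derive (A, B, C, D, E, F) from exactly three (col, row, x, y) tuples."""
    if len(gcps) != 3:
        raise ValueError("exactly three ground control points are required")
    design = [[col, row, 1.0] for col, row, _, _ in gcps]
    A, B, C = solve_linear(design, [x for _, _, x, _ in gcps])
    D, E, F = solve_linear(design, [y for _, _, _, y in gcps])
    return (A, B, C, D, E, F)


# Hypothetical GCPs: (pixel column, pixel row, ground x, ground y).
gcps = [
    (0, 0, 500000.0, 4200000.0),
    (100, 0, 501000.0, 4200000.0),
    (0, 100, 500000.0, 4199000.0),
]
print(affine_from_three_gcps(gcps))
```

The three points must not lie on a single line in the image; otherwise the system has no unique solution, which the sketch reports as an error.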

The entered GCPs are rarely perfectly located and are even more rarely perfectly representative of the distortion in the rest of the image, but the algebraic solution, which appears to be a perfect match, masks any error. To avoid this, it is common to create many more than the minimal required set (creating an overdetermined system), and use least squares regression to derive a set of function parameters that most closely matches the points. [8] :116 This is almost never a perfect match, so the discrepancy between each GCP location and the location predicted by the functions can be measured and summarized as a root-mean-square error (RMSE). A lower RMSE thus means that the transformation formulas more closely match the GCPs.
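The overdetermined case can be sketched as follows, assuming an affine model and using the normal equations solved by Cramer's rule, which is adequate for a three-parameter fit with modest coordinate values. The function names and the five sample GCPs (with small deliberate errors) are hypothetical, and production software may use a more numerically robust solver.

```python
import math


def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve3(m, v):
    """Solve a 3x3 system with Cramer's rule."""
    d = det3(m)
    if abs(d) < 1e-12:
        raise ValueError("control points are collinear or too few")
    return [det3([[v[i] if j == k else m[i][j] for j in range(3)]
                  for i in range(3)]) / d
            for k in range(3)]


def fit_affine_least_squares(gcps):
    """Least-squares (A, B, C, D, E, F) from three or more (col, row, x, y) tuples."""
    rows = [(col, row, 1.0) for col, row, _, _ in gcps]
    normal = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    rhs_x = [sum(r[i] * g[2] for r, g in zip(rows, gcps)) for i in range(3)]
    rhs_y = [sum(r[i] * g[3] for r, g in zip(rows, gcps)) for i in range(3)]
    A, B, C = solve3(normal, rhs_x)
    D, E, F = solve3(normal, rhs_y)
    return (A, B, C, D, E, F)


def rmse(gcps, coeffs):
    """Root-mean-square distance between predicted and entered ground positions."""
    A, B, C, D, E, F = coeffs
    total = 0.0
    for col, row, x, y in gcps:
        dx = A * col + B * row + C - x
        dy = D * col + E * row + F - y
        total += dx * dx + dy * dy
    return math.sqrt(total / len(gcps))


# Five hypothetical GCPs in a local grid (metres), each slightly misplaced.
gcps = [
    (0, 0, 1000.3, 2000.0),
    (100, 0, 2000.0, 1999.6),
    (0, 100, 999.8, 1000.2),
    (100, 100, 2000.1, 1000.0),
    (50, 50, 1499.7, 1500.4),
]
coeffs = fit_affine_least_squares(gcps)
print(coeffs)
print(rmse(gcps, coeffs))
```

With only three GCPs the same code returns an RMSE of essentially zero regardless of how badly the points were placed, which is the masking effect described above.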

Once the function parameters are determined, the transformation functions can be used to transform every pixel of the image to its real-world location. Two options are usually available for making this transformation permanent. One option is to save the parameters themselves as a form of metadata, either in the header of the image file itself (e.g., GeoTIFF), or in a sidecar file stored alongside the image file (e.g., a world file). With this metadata, the software can perform the transformation dynamically as it displays the image, so that it appears to align with other data in the desired coordinate system. The alternative method is rectification, in which the image is resampled to create a new raster grid that is natively tied to the coordinate system. Rectification was traditionally the only option, until the computing power became available for the intense calculations of dynamic coordinate transformations; even now, drawing and analysis performance is better with a rectified image.
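Rectification is usually implemented by inverse mapping: for each cell of the new grid, the software computes the cell's ground coordinate, converts it back to a position in the source image, and copies a value from there. The sketch below assumes an affine transformation, nearest-neighbour resampling (one of several possible resampling methods), and that integer pixel coordinates refer to pixel centres. All names and the tiny sample image are hypothetical.

```python
import math


def make_world_to_pixel(coeffs):
    """Return a function that inverts x = A*col + B*row + C, y = D*col + E*row + F."""
    A, B, C, D, E, F = coeffs
    det = A * E - B * D
    if det == 0:
        raise ValueError("affine transformation is not invertible")

    def world_to_pixel(x, y):
        dx, dy = x - C, y - F
        col = (E * dx - B * dy) / det
        row = (A * dy - D * dx) / det
        return col, row

    return world_to_pixel


def rectify_nearest(image, coeffs, x_min, y_max, cell, n_cols, n_rows, nodata=None):
    """Resample image (a list of rows) onto a north-up grid.

    The output grid has its top-left corner at (x_min, y_max) and
    square cells of size cell; cells outside the source get nodata.
    """
    world_to_pixel = make_world_to_pixel(coeffs)
    src_rows, src_cols = len(image), len(image[0])
    out = []
    for j in range(n_rows):
        y = y_max - (j + 0.5) * cell          # centre of output row j
        out_row = []
        for i in range(n_cols):
            x = x_min + (i + 0.5) * cell      # centre of output column i
            col, row = world_to_pixel(x, y)
            c = math.floor(col + 0.5)         # nearest source column
            r = math.floor(row + 0.5)         # nearest source row
            if 0 <= r < src_rows and 0 <= c < src_cols:
                out_row.append(image[r][c])
            else:
                out_row.append(nodata)
        out.append(out_row)
    return out


# Hypothetical 2x2 image whose columns run north and rows run east
# (x = row, y = col), so rectifying it rotates the pixel values.
image = [[1, 2],
         [3, 4]]
coeffs = (0.0, 1.0, 0.0, 1.0, 0.0, 0.0)
print(rectify_nearest(image, coeffs, x_min=-0.5, y_max=1.5, cell=1.0, n_cols=2, n_rows=2))
# [[2, 4], [1, 3]]
```

Iterating over output cells rather than source pixels guarantees that every cell of the new grid receives exactly one value, with no gaps or overlaps.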

Software implementations

See also

References

  1. "What does "georeferenced" mean?". www.usgs.gov. U.S. Geological Survey. Retrieved 2022-01-04.
  2. Yao, Xiaobai A. (2020-01-01), "Georeferencing and Geocoding", in Kobayashi, Audrey (ed.), International Encyclopedia of Human Geography (Second Edition), Oxford: Elsevier, pp. 111–117, doi:10.1016/b978-0-08-102295-5.10548-7, ISBN 978-0-08-102296-2, S2CID 241797395, retrieved 2022-01-04
  3. Hackeloeer, A.; Klasing, K.; Krisp, J.M.; Meng, L. (2014). "Georeferencing: a review of methods and applications". Annals of GIS. 20 (1): 61–69. doi:10.1080/19475683.2013.868826. S2CID 38306705.
  4. Leidner, J.L. (2017). "Georeferencing: From Texts to Maps". International Encyclopedia of Geography: People, the Earth, Environment and Technology. vi: 2897–2907. doi:10.1002/9781118786352.wbieg0160. ISBN 9780470659632.
  5. Longley, Paul A.; Goodchild, Michael F.; Maguire, David J.; Rhind, David W. (2011). Geographic Information Systems & Science (3rd ed.). Wiley.
  6. "Overview of georeferencing". ArcGIS Pro Documentation. Esri. Retrieved 8 January 2023.
  7. Bolstad, Paul (2019). GIS Fundamentals: A First Text on Geographic Information Systems. Ann Arbor, MI: XanEdu. ISBN 978-1-59399-552-2.
  8. Chang, Kang-tsung (2014). Introduction to Geographic Information Systems (7th ed.). McGraw-Hill. pp. 50–57. ISBN 978-0-07-352290-6.
  9. "16.3 Georeferencer". QGIS 3.22 documentation. OSGEO. Retrieved 8 January 2023.

Further reading