Cost distance analysis

Last updated July 02, 2023

In spatial analysis and geographic information systems, cost distance analysis or cost path analysis is a method for determining one or more optimal routes of travel through unconstrained (two-dimensional) space.^[1] The optimal solution is that which minimizes the total cost of the route, based on a field of cost density (cost per linear unit) that varies over space due to local factors. It is thus based on the fundamental geographic principle of Friction of distance. It is an optimization problem with multiple deterministic algorithm solutions, implemented in most GIS software.

The various problems, algorithms, and tools of cost distance analysis operate over an unconstrained two-dimensional space, meaning that a path could be of any shape. Similar cost optimization problems can also arise in a constrained space, especially a one-dimensional linear network such as a road or telecommunications network. Although they are similar in principle, the problems in network space require very different (usually simpler) algorithms to solve, largely adopted from graph theory. The collection of GIS tools for solving these problems are called network analysis.

History

Humans seem to have an innate desire to travel with minimal effort and time. Historic, even ancient, roads show patterns similar to what modern computational algorithms would generate, traveling straight across flat spaces, but curving around mountains, canyons, and thick vegetation.

However, it was not until the 20th century that geographers developed theories to explain this route optimization, and algorithms to reproduce it. In 1957, during the Quantitative revolution in Geography, with its propensity to adopt principles or mathematical formalisms from the "hard" sciences (known as social physics), William Warntz used refraction as an analogy for how minimizing travel cost will make transportation routes change direction at the boundary between two landscapes with very different friction of distance (e.g., emerging from a forest into a prairie).^[2] His principle of "parsimonious movement," changing direction to minimize cost, was widely accepted, but the refraction analogy and mathematics (Snell's law) was not, largely because it does not scale well to normally complex geographic situations.^[3]

Warntz and others then adopted another analogy that proved much more successful in the common situation where travel cost varies continuously over space, by comparing it to terrain.^[4] They compared the cost rate (i.e., cost per unit distance, the inverse of velocity if the cost is time) to the slope of a terrain surface (i.e., elevation change per unit distance), both being mathematical derivatives of an accumulated function or field: total elevation above a vertical datum (sea level) in the case of terrain. Integrating the cost rate field from a given starting point would create an analogous surface of total accumulated cost of travel from that point. In the same way that a stream follows the path of least resistance downhill, the streamline on the cost accumulation surface from any point "down" to the source will be the minimum-cost path.^[5]^[6] Additional lines of research in the 1960s further developed the nature of the cost rate field as a manifestation of the concept of friction of distance, studying how it was affected by various geographic features.^[7]

At the time, this solution was only theoretical, lacking the data and computing power for the continuous solution. Raster GIS provided the first feasible platform for implementing the theoretical solution by converting the continuous integration into a discrete summation procedure. Dana Tomlin implemented cost distance analysis in his Map Analysis Package by 1986, and Ronald Eastman added it to IDRISI by 1989, with a more efficient "pushbroom" cost accumulation algorithm.^[8] Douglas (1994) further refined the accumulation algorithm, which is basically what is implemented in most current GIS software.^[9]

Cost raster

The primary data set used in cost distance analysis is the cost raster, sometimes called the cost-of-passage surface,^[9] the friction image,^[8] the cost-rate field, or cost surface. In most implementations, this is a raster grid, in which the value of each cell represents the cost (i.e., expended resources, such as time, money, or energy) of a route crossing the cell in a horizontal or vertical direction.^[10] It is thus a discretization of a field of cost rate (cost per linear unit), a spatially intensive property. This cost is a manifestation of the principle of friction of distance.

A number of different types of cost may be relevant in a given routing problem:

Travel cost, the resource expenditure required to move across the cell, usually time or energy/fuel.
Construction cost, the resources (usually monetary) required to build the infrastructure that makes travel possible, such as roads, pipes, and cables. While some construction costs are constant (e.g., paving material), others are spatially variant, such as property acquisition and excavation.
Environmental impacts, the negative effects on the natural or human environment caused by the infrastructure or the travel along it. For example, building an expressway through a residential neighborhood or a wetland would incur a high political cost (in the form of environmental impact assessments, protests, lawsuits, etc.).

Some of these costs are easily quantifiable and measurable, such as transit time, fuel consumption, and construction costs, thus naturally lending themselves to computational solutions. That said, there may be significant uncertainty in predicting the cost prior to implementing the route. Other costs are much more difficult to measure due to their qualitative or subjective nature, such as political protest or ecological impact; these typically require operationalization through the creation of a scale.^[11]

In many situations, multiple types of cost may be simultaneously relevant, and the total cost is a combination of them. Because different costs are expressed in different units (or, in the case of scales, no units at all), they usually cannot be directly summed, but must be combined by creating an index. A common type of index is created by scaling each factor to a consistent range (say, [0,1]), then combining them using weighted linear combination. An important part of the creation of an index model like this is Calibration (statistics), adjusting the parameters of the formula(s) to make the modeled relative cost match real-world costs, using methods such as the Analytic hierarchy process. The index model formula is typically implemented in a raster GIS using map algebra tools from raster grids representing each cost factor, resulting in a single cost raster grid.

Directional cost

One limitation of the traditional method is that the cost field is isotropic or omni-directional: the cost at a given location does not depend on the direction of traversal. This is appropriate in many situations, but not others. For example, if one is flying in a windy location, an airplane flying in the direction of the wind incurs a much lower cost than an airplane flying against it. Some research has been done on extending cost distance analysis algorithms to incorporate directional cost, but it is not yet widely implemented in GIS software.^[12] IDRISI has some support for anisotropy.^[1]

Least-cost-path algorithm

The most common cost distance task is to determine the single path through the space between a given source location and a destination location that has the least total accumulated cost. The typical solution algorithm is a discrete raster implementation of the cost integration strategy of Warntz and Lindgren,^[5] which is a deterministic (NP-complete) optimization.^[10]

Inputs: cost field raster, source location, destination location (most implementations can solve for multiple sources and destinations simultaneously)
Accumulation: Starting at the source location compute the lowest total cost needed to reach every other cell in the grid. Although there are several algorithms, such as those published by Eastman and Douglas,^[8]^[9] they generally follow a similar strategy.^[13] This process also creates, as an important byproduct, a second raster grid usually called the backlink grid (Esri) or movement direction grid (GRASS), in which each cell has a direction code (0-7) representing which of its eight neighbors had the lowest cost.
1. Find a cell that is adjacent to at least one cell that already has an accumulated cost assigned (initially, this is only the source cell)
2. Determine which neighbor has the lowest accumulated cost. Encode the direction from the target to the lowest-cost neighbor in the backlink grid.
3. Add the cost of the target cell (or an average of the costs of the target and neighbor cells) to the neighbor accumulated cost, to create the accumulated cost of the target cell. If the neighbor is diagonal, the local cost is multiplied by ${\sqrt {2}}$
4. The algorithm must also take into account that indirect routes may have lower cost, often using a hash table to keep track of temporary cost values along the expanding fringe of computation that can be reconsidered.
5. Repeat the procedure until all cells are assigned.
Drain: In keeping with the terrain analogy, trace the optimal route from the given destination back to the source like a stream draining away from a location. At its most basic, this is accomplished by starting at the destination cell, moving in the direction indicated in the backlink grid, then repeating for the next cell, and so on until the source is reached. Recent software adds some improvements, such as looking across three or more cells to recognize straight lines at angles other than the eight neighbor directions. For example, the r.walk function in GRASS can recognize the "knight's move" (one cell straight, then one cell diagonal) and draw a straight line bypassing the middle cell.

Corridor analysis

A slightly different version of the least-cost path problem, which could be considered a fuzzy version of it, is to look for corridors more than one cell in width, thus providing some flexibility in applying the results. Corridors are commonly used in transportation planning and in wildlife management.

The solution to this problem is to compute, for every cell in the study space, the total accumulated cost of the optimal path between a given source and destination that passes through that cell. Thus, every cell in the optimal path derived above would have the same minimum value. Cells near this path would be reached by paths deviating only slightly from the optimal path, so they would have relatively low cost values, collectively forming a corridor with fuzzy edges as more distant cells have increasing cost values.

The algorithm to derive this corridor field is created by generating two cost accumulation grids: one using the source as described above. Then the algorithm is repeated, but using the destination as the source. Then these two grids are added using map algebra. This works because for each cell, the optimal source-destination path passing through that cell is the optimal path from that cell to the source, added to the optimal path from that cell to the destination. This can be accomplished using the cost accumulation tool above, along with a map algebra tool, although ArcGIS provides a Corridor tool that automates the process.

Cost-based allocation

Another use of the cost accumulation algorithm is to partition space among multiple sources, with each cell assigned to the source it can reach with the lowest cost, creating a series of regions in which each source is the "nearest". In the terrain analogy, these would correspond to watersheds (one could thus call these "cost-sheds," but this term is not in common usage). They are directly related to a voronoi diagram, which is essentially an allocation over a space with constant cost. They are also conceptually (if not computationally) similar to location-allocation tools for network analysis.

A cost-based allocation can be created using two methods. The first is to use a modified version of the cost accumulation algorithm, which substitutes the backlink grid for an allocation grid, in which each cell is assigned the same source identifier of its lowest-cost neighbor, causing the domain of each source to gradually grow until they meet each other. This is the approach taken in ArcGIS Pro.^[14] The second solution is to first run the basic accumulation algorithm, then use the backlink grid to determine the source into which each cell "flows." GRASS GIS uses this approach; in fact, the same tool is used as for computing watersheds from terrain.^[15]

Implementations

Cost distance tools are available in most raster GIS software:

GRASS GIS (often bundled into QGIS), with separate accumulation (r.cost) and drain (r.walk) functions
ArcGIS Desktop and ArcGIS Pro, with separate accumulation (Cost Distance) and drain (Cost Path) geoprocessing tools, as well as Corridor generation. Recently, starting with ArcGIS Pro version 2.5, a new set of cost distance tools was introduced, using more advanced algorithms with more flexible options.^[16]
TerrSet (formerly Idrisi) has several tools, implementing a variety of algorithms to solve different kinds of cost distance problems, including anisotropic (directional) cost.^[17]

Related Research Articles

A geographic information system (GIS) consists of integrated computer hardware and software that store, manage, analyze, edit, output, and visualize geographic data. Much of this often happens within a spatial database, however, this is not essential to meet the definition of a GIS. In a broader sense, one may consider such a system also to include human users and support staff, procedures and workflows, the body of knowledge of relevant concepts and methods, and institutional organizations.

<span class="mw-page-title-main">Dijkstra's algorithm</span> Graph search algorithm

Dijkstra's algorithm is an algorithm for finding the shortest paths between nodes in a weighted graph, which may represent, for example, road networks. It was conceived by computer scientist Edsger W. Dijkstra in 1956 and published three years later.

A* is a graph traversal and path search algorithm, which is used in many fields of computer science due to its completeness, optimality, and optimal efficiency. One major practical drawback is its $space complexity, as it stores all generated nodes in memory. Thus, in practical travel-routing systems, it is generally outperformed by algorithms that can pre-process the graph to attain better performance, as well as memory-bounded approaches; however, A* is still the best solution in many cases.$

Esri is an American multinational geographic information system (GIS) software company. It is best known for its ArcGIS products. With a 43% market share, Esri is the world's leading supplier of GIS software, web GIS and geodatabase management applications. The company is headquartered in Redlands, California.

A GIS file format is a standard for encoding geographical information into a computer file, as a specialized type of file format for use in geographic information systems (GIS) and other geospatial applications. Since the 1970s, dozens of formats have been created based on various data models for various purposes. They have been created by government mapping agencies, GIS software vendors, standards bodies such as the Open Geospatial Consortium, informal user communities, and even individual developers.

A transport network, or transportation network, is a network or graph in geographic space, describing an infrastructure that permits and constrains movement or flow. Examples include but are not limited to road networks, railways, air routes, pipelines, aqueducts, and power lines. The digital representation of these networks, and the methods for their analysis, is a core part of spatial analysis, geographic information systems, public utilities, and transport engineering. Network analysis is an application of the theories and algorithms of graph theory and is a form of proximity analysis.

A GIS software program is a computer program to support the use of a geographic information system, providing the ability to create, store, manage, query, analyze, and visualize geographic data, that is, data representing phenomena for which location is important. The GIS software industry encompasses a broad range of commercial and open-source products that provide some or all of these capabilities within various information technology architectures.

ArcSDE is a server-software sub-system that aims to enable the usage of Relational Database Management Systems for spatial data. The spatial data may then be used as part of a geodatabase.

<span class="mw-page-title-main">JUMP GIS</span>

Java Unified Mapping Program (JUMP) is a Java based vector and raster GIS and programming framework. Current development continues under the OpenJUMP name.

<span class="mw-page-title-main">Spatial analysis</span> Formal techniques which study entities using their topological, geometric, or geographic properties

Spatial analysis is any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques using different analytic approaches, especially spatial statistics. It may be applied in fields as diverse as astronomy, with its studies of the placement of galaxies in the cosmos, or to chip fabrication engineering, with its use of "place and route" algorithms to build complex wiring structures. In a more restricted sense, spatial analysis is geospatial analysis, the technique applied to structures at the human scale, most notably in the analysis of geographic data. It may also be applied to genomics, as in transcriptomics data.

gvSIG, geographic information system (GIS), is a desktop application designed for capturing, storing, handling, analyzing and deploying any kind of referenced geographic information in order to solve complex management and planning problems. gvSIG is known for having a user-friendly interface, being able to access the most common formats, both vector and raster ones. It features a wide range of tools for working with geographic-like information.

ArcInfo is a full-featured geographic information system produced by Esri, and is the highest level of licensing in the ArcGIS Desktop product line. It was originally a command-line based system. The command-line processing abilities are now available through the GUI of the ArcGIS Desktop product.

Geomorphometry, or geomorphometrics, is the science and practice of measuring the characteristics of terrain, the shape of the surface of the Earth, and the effects of this surface form on human and natural geography. It gathers various mathematical, statistical and image processing techniques that can be used to quantify morphological, hydrological, ecological and other aspects of a land surface. Common synonyms for geomorphometry are geomorphological analysis, terrain morphometry, terrain analysis, and land surface analysis. Geomorphometrics is the discipline based on the computational measures of the geometry, topography and shape of the Earth's horizons, and their temporal change. This is a major component of geographic information systems (GIS) and other software tools for spatial analysis.

Map algebra is an algebra for manipulating geographic data, primarily fields. Developed by Dr. Dana Tomlin and others in the late 1970s, it is a set of primitive operations in a geographic information system (GIS) which allows one or more raster layers ("maps") of similar dimensions to produce a new raster layer (map) using mathematical or other operations such as addition, subtraction etc.

An Esri grid is a raster GIS file format developed by Esri, which has two formats:

A proprietary binary format, also known as an ARC/INFO GRID, ARC GRID and many other variations
A non-proprietary ASCII format, also known as an ARC/INFO ASCII GRID

In geographic information systems (GIS) and spatial analysis, buffer analysis is the determination of a zone around a geographic feature containing locations that are within a specified distance of that feature, the buffer zone. A buffer is likely the most commonly used tool within the proximity analysis methods.

A geographic data model, geospatial data model, or simply data model in the context of geographic information systems, is a mathematical and digital structure for representing phenomena over the Earth. Generally, such data models represent various aspects of these phenomena by means of geographic data, including spatial locations, attributes, change over time, and identity. For example, the vector data model represents geography as collections of points, lines, and polygons, and the raster data model represent geography as cell matrices that store numeric values. Data models are implemented throughout the GIS ecosystem, including the software tools for data management and spatial analysis, data stored in a variety of GIS file formats, specifications and standards, and specific designs for GIS installations.

Proximity analysis is a class of spatial analysis tools and algorithms that employ geographic distance as a central principle. Distance is fundamental to geographic inquiry and spatial analysis, due to principles such as the friction of distance, Tobler's first law of geography, and Spatial autocorrelation, which are incorporated into analytical tools. Proximity methods are thus used in a variety of applications, especially those that involve movement and interaction.

<span class="mw-page-title-main">David Mount</span> American computer scientist

David Mount is a professor at the University of Maryland, College Park department of computer science whose research is in computational geometry.

Geographic information systems (GIS) play a constantly evolving role in geospatial intelligence (GEOINT) and United States national security. These technologies allow a user to efficiently manage, analyze, and produce geospatial data, to combine GEOINT with other forms of intelligence collection, and to perform highly developed analysis and visual production of geospatial data. Therefore, GIS produces up-to-date and more reliable GEOINT to reduce uncertainty for a decisionmaker. Since GIS programs are Web-enabled, a user can constantly work with a decision maker to solve their GEOINT and national security related problems from anywhere in the world. There are many types of GIS software used in GEOINT and national security, such as Google Earth, ERDAS IMAGINE, GeoNetwork opensource, and Esri ArcGIS.

References

1 2 de Smith, Michael, Paul Longley, Michael Goodchild (2018) Cost Distance, Geospatial Analysis, 6th Edition
↑ Warntz, William (1957). "Transportation, Social Physics, and the Law of Refraction". The Professional Geographer. 9 (4): 2–7. doi:10.1111/j.0033-0124.1957.094_2.x.
↑ Bunge, William (1966). Theoretical Geography. Lund, Sweden: Berlingsta Botryckeriet. p. 128.
↑ Warntz, William (1965) "A note on surfaces and paths and applications to geographical problems, IMaGe Discussion Paper #6, Ann Arbor: Michigan Inter-University Community of Mathematical Geographers
1 2 Lindgren, Ernesto S. (1967). "Proposed solution for the minimum path problem". Harvard Papers in Theoretical Geography, Geography and the Properties of Surface Series. 4.
↑ Lindgren, Ernesto S. (1967). "A minimum path problem reconsidered". Harvard Papers in Theoretical Geography, Geography and the Properties of Surface Series. 28.
↑ Huff, David L.; Jenks, George F. (1968). "Graphic interpretation of the friction of distance in gravity models". Annals of the Association of American Geographers. 58 (4): 814. doi:10.1111/j.1467-8306.1968.tb01670.x.
1 2 3 Eastman J R (1989) Pushbroom algorithms for calculating distances in raster grids. Proceedings, AutoCarto 9, pp.288-97
1 2 3 Douglas, David H. (1994). "Least-cost Path in GIS Using an Accumulated Cost Surface and Slopelines". Cartographica. 31 (3): 37–51. doi:10.3138/D327-0323-2JUT-016M.
1 2 Bolstad, Paul (2008). GIS Fundamentals: A First Text on Geographic Information Systems (3rd ed.). Eider Press. pp. 404–408. ISBN 978-0-9717647-2-9.
↑ G.H. Pirie (2009) Distance, in Rob Kitchin, Nigel Thrift (eds.) International Encyclopedia of Human Geography, Elsevier, Pages 242-251. doi:10.1016/B978-008044910-4.00265-0
↑ Collischonn, Walter; Pilar, Jorge Victor (2000). "A direction dependent least-cost-path algorithm for roads and canals". International Journal of Geographical Information Science. 14 (4): 397–407. doi:10.1080/13658810050024304. S2CID 37823291.
↑ "How cost distance tools work". ArcGIS Pro Documentation. Esri. Retrieved 29 December 2020.
↑ "Cost Allocation (Spatial Analyst)". ArcGIS Pro Documentation. Esri. Retrieved 30 December 2020.
↑ "r.watershed". GRASS GIS Documentation. Retrieved 30 December 2020.
↑ "An overview of the Distance toolset". ArcGIS Pro Documentation. Esri. Retrieved 29 December 2020.
↑ Eastman, J. Ronald, TerrSet Manual, p.115, 227, 356

External links

Distance toolset documentation for Esri ArcGIS Pro
Cost Surface tools in GRASS GIS

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[desmith-1] 1 2 de Smith, Michael, Paul Longley, Michael Goodchild (2018) Cost Distance, Geospatial Analysis, 6th Edition

[warntz1957-2] Warntz, William (1957). "Transportation, Social Physics, and the Law of Refraction". The Professional Geographer. 9 (4): 2–7. doi:10.1111/j.0033-0124.1957.094_2.x.

[bunge1966-3] Bunge, William (1966). Theoretical Geography. Lund, Sweden: Berlingsta Botryckeriet. p. 128.

[warntz1965-4] Warntz, William (1965) "A note on surfaces and paths and applications to geographical problems, IMaGe Discussion Paper #6, Ann Arbor: Michigan Inter-University Community of Mathematical Geographers

[lindgren1967-5] 1 2 Lindgren, Ernesto S. (1967). "Proposed solution for the minimum path problem". Harvard Papers in Theoretical Geography, Geography and the Properties of Surface Series. 4.

[lindgren1969-6] Lindgren, Ernesto S. (1967). "A minimum path problem reconsidered". Harvard Papers in Theoretical Geography, Geography and the Properties of Surface Series. 28.

[huff1968-7] Huff, David L.; Jenks, George F. (1968). "Graphic interpretation of the friction of distance in gravity models". Annals of the Association of American Geographers. 58 (4): 814. doi:10.1111/j.1467-8306.1968.tb01670.x.

[eastman1989-8] 1 2 3 Eastman J R (1989) Pushbroom algorithms for calculating distances in raster grids. Proceedings, AutoCarto 9, pp.288-97

[douglas1994-9] 1 2 3 Douglas, David H. (1994). "Least-cost Path in GIS Using an Accumulated Cost Surface and Slopelines". Cartographica. 31 (3): 37–51. doi:10.3138/D327-0323-2JUT-016M.

[bolstad-10] 1 2 Bolstad, Paul (2008). GIS Fundamentals: A First Text on Geographic Information Systems (3rd ed.). Eider Press. pp. 404–408. ISBN 978-0-9717647-2-9.

[Pirie2009-11] G.H. Pirie (2009) Distance, in Rob Kitchin, Nigel Thrift (eds.) International Encyclopedia of Human Geography, Elsevier, Pages 242-251. doi:10.1016/B978-008044910-4.00265-0

[collischonn2000-12] Collischonn, Walter; Pilar, Jorge Victor (2000). "A direction dependent least-cost-path algorithm for roads and canals". International Journal of Geographical Information Science. 14 (4): 397–407. doi:10.1080/13658810050024304. S2CID 37823291.

[esricostdoc-13] "How cost distance tools work". ArcGIS Pro Documentation. Esri. Retrieved 29 December 2020.

[14] "Cost Allocation (Spatial Analyst)". ArcGIS Pro Documentation. Esri. Retrieved 30 December 2020.

[15] "r.watershed". GRASS GIS Documentation. Retrieved 30 December 2020.

[16] "An overview of the Distance toolset". ArcGIS Pro Documentation. Esri. Retrieved 29 December 2020.

[17] Eastman, J. Ronald, TerrSet Manual, p.115, 227, 356

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]