Highway dimension

Last updated

The highway dimension is a graph parameter modelling transportation networks, such as road networks or public transportation networks. It was first formally defined by Abraham et al. [1] based on the observation by Bast et al. [2] [3] that any road network has a sparse set of "transit nodes", such that driving from a point A to a sufficiently far away point B along the shortest route will always pass through one of these transit nodes. It has also been proposed that the highway dimension captures the properties of public transportation networks well (at least according to definitions 1 and 2 below), given that longer routes using busses, trains, or airplanes will typically be serviced by larger transit hubs (stations and airports). This relates to the spoke–hub distribution paradigm in transport topology optimization.

Contents

Definitions

Several definitions of the highway dimension exist. [4] Each definition of the highway dimension uses a hitting set of a certain set of shortest paths: given a graph with edge lengths , let contain every vertex set such that induces a shortest path between some vertex pair of , according to the edge lengths . To measure the highway dimension we determine the "sparseness" of a hitting set of a subset of in a local area of the graph, for which we define a ball of radius around a vertex to be the set of vertices at distance at most from in according to the edge lengths . In the context of low highway dimension graphs, the vertices of a hitting set for the shortest paths are called hubs.

Definition 1

The original definition [1] of the highway dimension measures the sparseness of a hub set of shortest paths contained within a ball of radius :

The highway dimension of is the smallest integer such that for any radius and any node there is a hitting set of size at most for all shortest paths of length more than for which .

A variant of this definition uses balls of radius for some constant . Choosing a constant greater than 4 implies additional structural properties of graphs of bounded highway dimension, which can be exploited algorithmically. [5]

Definition 2

A subsequent definition [6] of the highway dimension measures the sparseness of a hub set of shortest paths intersecting a ball of radius :

The highway dimension of is the smallest integer such that for any radius and any node there is a hitting set of size at most for all shortest paths of length more than and at most for which .

This definition is weaker than the first, i.e., every graph of highway dimension also has highway dimension , but not vice versa. [5]

Definition 3

For the third definition [7] of the highway dimension we introduce the notion of a "witness path": for a given radius , a shortest path has an -witness path if has length more than and can be obtained from by adding at most one vertex to either end of (i.e., has at most 2 vertices more than and these additional vertices are incident to ). Note that may be shorter than but is contained in , which has length more than .

The highway dimension of is the smallest integer such that for any radius and any node there is a hitting set of size at most for all shortest paths that have an -witness path with .

This definition is stronger than the above, i.e., every graph of highway dimension also has highway dimension , but cannot be bounded in terms of . [5]

Shortest path cover

A notion closely related to the highway dimension is that of a shortest path cover, [1] where the order of the quantifiers in the definition is reversed, i.e., instead of a hub set for each ball, there is a one hub set , which is sparse in every ball:

Given a radius , an -shortest path cover of is a hitting set for all shortest paths in of length more than and at most . The -shortest path cover is locally -sparse if any node the ball contains at most vertices of , i.e., .

Every graph of bounded highway dimension (according to any of the above definitions) also has a locally -sparse -shortest path cover for every , but not vice versa. [4] For algorithmic purposes it is often more convenient to work with one hitting set for each radius , which makes shortest path covers an important tool for algorithms on graphs of bounded highway dimension.

Relation to other graph parameters

The highway dimension combines structural and metric properties of graphs, and is thus incomparable to common structural and metric parameters. In particular, for any graph it is possible to choose edge lengths such that the highway dimension is , [5] while at the same time some graphs with very simple structure such as trees can have arbitrary large highway dimension. This implies that the highway dimension parameter is incomparable to structural graph parameters such as treewidth, cliquewidth, or minor-freeness. On the other hand, a star with unit edge lengths has highway dimension (according to definitions 1 and 2 above) but unbounded doubling dimension, while a grid graph with unit edge lengths has constant doubling dimension but highway dimension . [1] This means that the highway dimension according to definitions 1 and 2 is also incomparable to the doubling dimension. Any graph of bounded highway dimension according to definition 3 above, also has bounded doubling dimension. [7]

Computing the highway dimension

Computing the highway dimension of a given graph is NP-hard. [5] Assuming that all shortest paths are unique (which can be done by slightly perturbing the edge lengths), an -approximation can be computed in polynomial time, [6] given that the highway dimension of the graph is . It is not known whether computing the highway dimension is fixed-parameter tractable (FPT), however there are hardness results indicating that this is likely not the case. [8] In particular, these results imply that, under standard complexity assumptions, an FPT algorithm can neither compute the highway dimension bottom-up (from the smallest value to the largest) nor top-down (from the largest value to the smallest).

Algorithms exploiting the highway dimension

Shortest path algorithms

Some heuristics to compute shortest paths, such as the Reach, Contraction Hierarchies, Transit Nodes, and Hub Labelling algorithms, can be formally proven to run faster than other shortest path algorithms (e.g. Dijkstra's algorithm) on graphs of bounded highway dimension according to definition 3 above. [7]

Approximations for NP-hard problems

A crucial property that can be exploited algorithmically for graphs of bounded highway dimension is that vertices that are far from the hubs of a shortest path cover are clustered into so-called towns: [5]

Given a radius , an -shortest path cover of , and a vertex at distance more than from , the set of vertices at distance at most from according to the edge lengths is called a town. The set of all vertices not lying in any town is called the sprawl.

It can be shown that the diameter of every town is at most , while the distance between a town and any vertex outside it is more than . Furthermore, the distance from any vertex in the sprawl to some hub of is at most .

Based on this structure, Feldmann et al. [5] defined the towns decomposition, which recursively decomposes the sprawl into towns of exponentially growing values . For a graph of bounded highway dimension (according to definition 1 above) this decomposition can be used to find a metric embedding into a graph of bounded treewidth that preserves distances between vertices arbitrarily well. Due to this embedding it is possible to obtain quasi-polynomial time approximation schemes (QPTASs) for various problems such as Travelling Salesman (TSP), Steiner Tree, k-Median, and Facility Location. [5]

For clustering problems such as k-Median, k-Means, and Facility Location, faster polynomial-time approximation schemes (PTASs) are known for graphs of bounded highway dimension according to definition 1 above. [9] For network design problems such as TSP and Steiner Tree it is not known how to obtain a PTAS.

For the k-Center problem, it is not known whether a PTAS exists for graphs of bounded highway dimension, however it is NP-hard to compute a ()-approximation on graphs of highway dimension , [10] which implies that any ()-approximation algorithm needs at least double exponential time in the highway dimension, unless P=NP. [10] On the other hand, it was shown that a parameterized -approximation algorithm with a runtime of exists for k-Center where is the highway dimension according to any of the above definitions. [10] When using definition 1 above, a parameterized approximation scheme (PAS) is known to exist when using and as parameters. [11]

For the Capacitated k-Center problem there is no PAS parameterized by and the highway dimension , unless FPT=W[1]. [12] This is notable, since typically (i.e., for all the problems mentioned above), if there is an approximation scheme for metrics of low doubling dimension, then there is also one for graphs of bounded highway dimension. But for Capacitated k-Center there is a PAS parameterized by and the doubling dimension. [12]

Related Research Articles

<span class="mw-page-title-main">Dijkstra's algorithm</span> Graph search algorithm

Dijkstra's algorithm is an algorithm for finding the shortest paths between nodes in a weighted graph, which may represent, for example, road networks. It was conceived by computer scientist Edsger W. Dijkstra in 1956 and published three years later.

In computer science, the Floyd–Warshall algorithm is an algorithm for finding shortest paths in a directed weighted graph with positive or negative edge weights. A single execution of the algorithm will find the lengths of shortest paths between all pairs of vertices. Although it does not return details of the paths themselves, it is possible to reconstruct the paths with simple modifications to the algorithm. Versions of the algorithm can also be used for finding the transitive closure of a relation , or widest paths between all pairs of vertices in a weighted graph.

<span class="mw-page-title-main">Steiner tree problem</span> On short connecting networks with added vertices

In combinatorial mathematics, the Steiner tree problem, or minimum Steiner tree problem, named after Jakob Steiner, is an umbrella term for a class of problems in combinatorial optimization. While Steiner tree problems may be formulated in a number of settings, they all require an optimal interconnect for a given set of objects and a predefined objective function. One well-known variant, which is often used synonymously with the term Steiner tree problem, is the Steiner tree problem in graphs. Given an undirected graph with non-negative edge weights and a subset of vertices, usually referred to as terminals, the Steiner tree problem in graphs requires a tree of minimum weight that contains all terminals and minimizes the total weight of its edges. Further well-known variants are the Euclidean Steiner tree problem and the rectilinear minimum Steiner tree problem.

In computer science, a topological sort or topological ordering of a directed graph is a linear ordering of its vertices such that for every directed edge (u,v) from vertex u to vertex v, u comes before v in the ordering. For instance, the vertices of the graph may represent tasks to be performed, and the edges may represent constraints that one task must be performed before another; in this application, a topological ordering is just a valid sequence for the tasks. Precisely, a topological sort is a graph traversal in which each node v is visited only after all its dependencies are visited. A topological ordering is possible if and only if the graph has no directed cycles, that is, if it is a directed acyclic graph (DAG). Any DAG has at least one topological ordering, and algorithms are known for constructing a topological ordering of any DAG in linear time. Topological sorting has many applications, especially in ranking problems such as feedback arc set. Topological sorting is possible even when the DAG has disconnected components.

In the mathematical field of graph theory, the distance between two vertices in a graph is the number of edges in a shortest path connecting them. This is also known as the geodesic distance or shortest-path distance. Notice that there may be more than one shortest path between two vertices. If there is no path connecting the two vertices, i.e., if they belong to different connected components, then conventionally the distance is defined as infinite.

In graph theory, the metric dimension of a graph G is the minimum cardinality of a subset S of vertices such that all other vertices are uniquely determined by their distances to the vertices in S. Finding the metric dimension of a graph is an NP-hard problem; the decision version, determining whether the metric dimension is less than a given value, is NP-complete.

<span class="mw-page-title-main">Dominating set</span> Subset of a graphs nodes such that all other nodes link to at least one

In graph theory, a dominating set for a graph G is a subset D of its vertices, such that any vertex of G is either in D, or has a neighbor in D. The domination numberγ(G) is the number of vertices in a smallest dominating set for G.

<span class="mw-page-title-main">Convex polytope</span> Convex hull of a finite set of points in a Euclidean space

A convex polytope is a special case of a polytope, having the additional property that it is also a convex set contained in the -dimensional Euclidean space . Most texts use the term "polytope" for a bounded convex polytope, and the word "polyhedron" for the more general, possibly unbounded object. Others allow polytopes to be unbounded. The terms "bounded/unbounded convex polytope" will be used below whenever the boundedness is critical to the discussed issue. Yet other texts identify a convex polytope with its boundary.

<span class="mw-page-title-main">Szemerédi regularity lemma</span>

Szemerédi's regularity lemma is one of the most powerful tools in extremal graph theory, particularly in the study of large dense graphs. It states that the vertices of every large enough graph can be partitioned into a bounded number of parts so that the edges between different parts behave almost randomly.

Set packing is a classical NP-complete problem in computational complexity theory and combinatorics, and was one of Karp's 21 NP-complete problems. Suppose one has a finite set S and a list of subsets of S. Then, the set packing problem asks if some k subsets in the list are pairwise disjoint.

In graph theory, reachability refers to the ability to get from one vertex to another within a graph. A vertex can reach a vertex if there exists a sequence of adjacent vertices which starts with and ends with .

In graph theory, a connected graph is k-edge-connected if it remains connected whenever fewer than k edges are removed.

In computer science, the Hopcroft–Karp algorithm is an algorithm that takes a bipartite graph as input and produces a maximum-cardinality matching as output — a set of as many edges as possible with the property that no two edges share an endpoint. It runs in time in the worst case, where is set of edges in the graph, is set of vertices of the graph, and it is assumed that . In the case of dense graphs the time bound becomes , and for sparse random graphs it runs in time with high probability.

In graph theory, the planar separator theorem is a form of isoperimetric inequality for planar graphs, that states that any planar graph can be split into smaller pieces by removing a small number of vertices. Specifically, the removal of vertices from an n-vertex graph can partition the graph into disjoint subgraphs each of which has at most vertices.

In graph theory, the metric k-center problem is a combinatorial optimization problem studied in theoretical computer science. Given n cities with specified distances, one wants to build k warehouses in different cities and minimize the maximum distance of a city to a warehouse. In graph theory, this means finding a set of k vertices for which the largest distance of any point to its closest vertex in the k-set is minimum. The vertices must be in a metric space, providing a complete graph that satisfies the triangle inequality.

In graph theory, the tree-depth of a connected undirected graph is a numerical invariant of , the minimum height of a Trémaux tree for a supergraph of . This invariant and its close relatives have gone under many different names in the literature, including vertex ranking number, ordered chromatic number, and minimum elimination tree height; it is also closely related to the cycle rank of directed graphs and the star height of regular languages. Intuitively, where the treewidth of a graph measures how far it is from being a tree, this parameter measures how far a graph is from being a star.

In computer science, the method of contraction hierarchies is a speed-up technique for finding the shortest-path in a graph. The most intuitive applications are car-navigation systems: a user wants to drive from to using the quickest possible route. The metric optimized here is the travel time. Intersections are represented by vertices, the road sections connecting them by edges. The edge weights represent the time it takes to drive along this segment of the road. A path from to is a sequence of edges ; the shortest path is the one with the minimal sum of edge weights among all possible paths. The shortest path in a graph can be computed using Dijkstra's algorithm but, given that road networks consist of tens of millions of vertices, this is impractical. Contraction hierarchies is a speed-up method optimized to exploit properties of graphs representing road networks. The speed-up is achieved by creating shortcuts in a preprocessing phase which are then used during a shortest-path query to skip over "unimportant" vertices. This is based on the observation that road networks are highly hierarchical. Some intersections, for example highway junctions, are "more important" and higher up in the hierarchy than for example a junction leading into a dead end. Shortcuts can be used to save the precomputed distance between two important junctions such that the algorithm doesn't have to consider the full path between these junctions at query time. Contraction hierarchies do not know about which roads humans consider "important", but they are provided with the graph as input and are able to assign importance to vertices using heuristics.

In network theory, the Wiener connector is a means of maximizing efficiency in connecting specified "query vertices" in a network. Given a connected, undirected graph and a set of query vertices in a graph, the minimum Wiener connector is an induced subgraph that connects the query vertices and minimizes the sum of shortest path distances among all pairs of vertices in the subgraph. In combinatorial optimization, the minimum Wiener connector problem is the problem of finding the minimum Wiener connector. It can be thought of as a version of the classic Steiner tree problem, where instead of minimizing the size of the tree, the objective is to minimize the distances in the subgraph.

The vertex k-center problem is a classical NP-hard problem in computer science. It has application in facility location and clustering. Basically, the vertex k-center problem models the following real problem: given a city with facilities, find the best facilities where to build fire stations. Since firemen must attend any emergency as quickly as possible, the distance from the farthest facility to its nearest fire station has to be as small as possible. In other words, the position of the fire stations must be such that every possible fire is attended as quickly as possible.

A central problem in algorithmic graph theory is the shortest path problem. One of the generalizations of the shortest path problem is known as the single-source-shortest-paths (SSSP) problem, which consists of finding the shortest paths from a source vertex to all other vertices in the graph. There are classical sequential algorithms which solve this problem, such as Dijkstra's algorithm. In this article, however, we present two parallel algorithms solving this problem.

References

  1. 1 2 3 4 Abraham, Ittai; Fiat, Amos; Goldberg, Andrew V.; Werneck, Renato F. (2010-01-17). Highway Dimension, Shortest Paths, and Provably Efficient Algorithms. Society for Industrial and Applied Mathematics. pp. 782–793. doi:10.1137/1.9781611973075.64. ISBN   978-0-89871-701-3. S2CID   9330775.
  2. Bast, Holger; Funke, Stefan; Matijevic, Domagoj; Sanders, Peter; Schultes, Dominik (2007-01-06), Applegate, David; Stølting Brodal, Gerth (eds.), "In Transit to Constant Time Shortest-Path Queries in Road Networks", 2007 Proceedings of the Ninth Workshop on Algorithm Engineering and Experiments (ALENEX), Philadelphia, PA: Society for Industrial and Applied Mathematics, pp. 46–59, doi: 10.1137/1.9781611972870.5 , ISBN   978-1-61197-287-0 , retrieved 2023-11-10
  3. Bast, Holger; Funke, Stefan; Matijevic, Domagoj; Demetrescu, Camil; Goldberg, Andrew; Johnson, David (2006). "TRANSIT: Ultrafast Shortest-Path Queries with Linear-Time Preprocessing". The Shortest Path Problem: Ninth DIMACS Implementation Challenge.
  4. 1 2 Blum, Johannes (2019). "Hierarchy of Transportation Network Parameters and Hardness Results". Proceedings of the 14th International Symposium on Parameterized and Exact Computation (IPEC 2019). Schloss-Dagstuhl - Leibniz Zentrum für Informatik. doi:10.4230/LIPIcs.IPEC.2019.4. S2CID   166228480.
  5. 1 2 3 4 5 6 7 8 9 Feldmann, Andreas Emil; Fung, Wai Shing; Könemann, Jochen; Post, Ian (January 2018). "A $(1+\varepsilon)$-Embedding of Low Highway Dimension Graphs into Bounded Treewidth Graphs". SIAM Journal on Computing. 47 (4): 1667–1704. doi:10.1137/16M1067196. ISSN   0097-5397. S2CID   11339698.
  6. 1 2 3 Abraham, Ittai; Delling, Daniel; Fiat, Amos; Goldberg, Andrew V.; Werneck, Renato F. (2011). "VC-Dimension and Shortest Path Algorithms". In Aceto, Luca; Henzinger, Monika; Sgall, Jiří (eds.). Automata, Languages and Programming. Lecture Notes in Computer Science. Vol. 6755. Berlin, Heidelberg: Springer. pp. 690–699. doi:10.1007/978-3-642-22006-7_58. ISBN   978-3-642-22006-7.
  7. 1 2 3 Abraham, Ittai; Delling, Daniel; Fiat, Amos; Goldberg, Andrew V.; Werneck, Renato F. (2016-12-08). "Highway Dimension and Provably Efficient Shortest Path Algorithms". Journal of the ACM. 63 (5): 41:1–41:26. doi:10.1145/2985473. ISSN   0004-5411. S2CID   1943037.
  8. Blum, Johannes; Disser, Yann; Feldmann, Andreas Emil; Gupta, Siddharth; Zych-Pawlewicz, Anna (2022). "On Sparse Hitting Sets: From Fair Vertex Cover to Highway Dimension". Proceedings of 17th International Symposium on Parameterized and Exact Computation (IPEC 2022). Schloss-Dagstuhl - Leibniz Zentrum für Informatik. doi:10.4230/LIPIcs.IPEC.2022.5.
  9. Feldmann, Andreas Emil; Saulpic, David (2021-12-01). "Polynomial time approximation schemes for clustering in low highway dimension graphs". Journal of Computer and System Sciences. 122: 72–93. doi:10.1016/j.jcss.2021.06.002. ISSN   0022-0000.
  10. 1 2 3 Feldmann, Andreas Emil (2019-03-01). "Fixed-Parameter Approximations for k-Center Problems in Low Highway Dimension Graphs". Algorithmica. 81 (3): 1031–1052. doi:10.1007/s00453-018-0455-0. ISSN   1432-0541.
  11. Becker, Amariah; Klein, Philip N.; Saulpic, David (2018). "Polynomial-Time Approximation Schemes for k-center, k-median, and Capacitated Vehicle Routing in Bounded Highway Dimension". Proceedings of the 26th Annual European Symposium on Algorithms (ESA 2018). Schloss-Dagstuhl - Leibniz Zentrum für Informatik. doi:10.4230/LIPIcs.ESA.2018.8.
  12. 1 2 3 Feldmann, Andreas Emil; Vu, Tung Anh (2022). Bekos, Michael A.; Kaufmann, Michael (eds.). "Generalized $$k$$-Center: Distinguishing Doubling and Highway Dimension". Graph-Theoretic Concepts in Computer Science. Lecture Notes in Computer Science. Cham: Springer International Publishing: 215–229. doi:10.1007/978-3-031-15914-5_16. ISBN   978-3-031-15914-5.