Mega-Merger

Last updated May 07, 2021

Mega-merger is a distributed algorithm aimed at solving the election problem in generic connected undirected graph.^[1]^[2]

Introduction

Mega-Merger was developed by Robert Gray Gallager at MIT in 1983. It applies a distributed divide and conquer approach mixed with a rank-based conquer strategy. The algorithm is usually presented through a village-city analogy. Each node in the graph indicates a village, while the edges that connect them are the roads and a rooted spanning tree in a sub-graph is a city. The whole graph is then a mega-city. Mega-Merger pushes villages to bind together to form cities according to each other's rank and edges. Cities are then formed by alliances or by conquering/absorption.

Pre-requisites

Mega-Merger builds a minimum spanning tree over connected graphs provided:

Total reliability: No message is lost in transmission.
UI (unique initiator): A single node starts the protocol.
Bi-directional communications channels: Each edge is bi-directional, communications can travel in both directions.

No further restrictions are necessary.

Algorithm

The algorithm assigns to each village a name and a rank, the former usually unique. The latter states the number of friendly mergers that the city has gone through, and the larger it is, the more powerful a city is considered. Moreover, to each edge is assigned a weight: each village/city $C$ has a minimum-weight edge $e_{merge}(C,C')$ also called merge link, that is the edge whose traversal has minimum cost.

The algorithm proceeds in consecutive stages until a mega-city is formed. Each city C computes its own merge link and sends a request for merging across $e_{merge}(C,C')$ . The request is handled by $C'$ in the following ways:

Friendly merge: $rank(C)=rank(C')\land e_{merge}(C,C')=e_{merge}(C',C)$ : If the cities share the same merge link and have same rank, a friendly merge occurs, and the two cities merge into one. A new name is picked for the newly created city, a ruling village is picked and the path from the previous ruler to the node in the merge link is re-oriented such that it leads to the new leader. The new city also has its rank increased by one. Notice as this is the only way two cities can increase each other's rank.
Absorption: $rank(C)<rank(C')$ : If the requesting city has a lower rank, the city in the receiving end enacts an absorption process: $C$ is absorbed like in the friendly merge, but loses its name and the resulting city has the rank of $C'$ .
Suspension: ${\textstyle rank(C)=rank(C')\land e_{merge}(C,C')\neq e_{merge}(C',C)\lor rank(C)>rank(C')}$ : In such cases $C'$ freezes the request: it waits to either be absorbed by rule 2 or to merge and increase its rank above the one of $C$ in order to be able to enact rule 1 and absorb $C$ .

Outside messages

No nodes in the graph have a list of villages belonging to their village, hence each time a city wants to look for edges leading outside of it, it has to adopt an ask-reply protocol. The city ruler sends a broadcast message through its spanning tree, and each node $x$ receiving it sends requests to its neighbors, excluding the edges to its child(ren) and parent. The response protocol is as follows:

$x.city=y.city$ : clearly the edge is an intra-edge in $C$ . $x$ and $y$ exchange negative responses.
$x.city\neq y.city\land x.rank<y.rank$ : $x$ is asking to a city of higher rank. By rule 2 we can assert that no absorption occurs, and $y$ indeed belongs to another city.
$x.city\neq y.city\land x.rank>y.rank$ : in this case $y$ will delay the answer as by rule 3.

Properties

Mega-Merger holds several properties:

Monotonic rank: Each city $C$ , mega-city excluded, will eventually rise in rank. By rule 1 $C$ could friendly merge, raising its rank by $1$ ; by rule 2 and 3 $C$ will have a merge link (by hypothesis $C$ is not the mega-city) it will either ask a higher-rank city $C'$ , getting absorbed and increasing its rank, or wait until $C'$ reaches its level and operate a friendly merge.
$rank(C)=K\implies |C|\geq 2^{k}$ : we have a level increase each time a friendly merge is operated. We compute by induction: on the base case, $rank(C)=0$ , exactly one village is in $C$ . On the inductive case, two cities $C',C''s.t.rank(C')=rank(C'')=k$ operate a friendly merge, hence $|C'-C''|=|C'|+|C''|=2^{k}+2^{k}=2^{k+1}$ by inductive hypothesis.
$\max {rank(C)}\leq \log n$ : by the previous rule cities are built up on an exponential base $2$ , hence the inverse $\log _{2}n$ .
Deadlock prevention: Mega-Merger doesn't incur in any deadlock. As shown by rule 3 a city $C$ can wait for a lower-rank city to answer on merge link $e$ : in order to incur in a deadlock such city $C'$ would have to wait on $C''$ , and $C''$ on $C'''$ , and so on until a cycle is detected on $C^{n}$ waiting on $C$ on a merge link $e'$ . But by hypothesis $e$ is the merge-link of $C$ , hence such chain cannot exist. The other deadlock-inducing situation is a request from $C$ to $C'$ where $C'$ has a different merge link than $C$ . Still, as shown by monotonic rank either $C'$ will grow its rank to absorb $C$ , or will consume all its merge links to be the only city in the graph with $C$ . Trivially in such case the two merge links would coincide and $C'$ would be forced into absorption by rule 2.

Termination

Termination is granted by deadlock prevention and total reliability.

Cost

The cost analysis has two components, the stage-cost and the stage upper-bound. A city $C$ enacts a stage by requesting a merge link from its villages and applying one of the above rules according to the desired situation. We can divide this stage in five steps:

Broadcast request for merge link to the $\leq n$ nodes in the tree.
Each node forwards an $outside?$ message to its $\leq n$ neighbors and waits for their $\leq n$ answers.
The nodes then send the answers back to the city ruler by convergecast for a total of $\leq n$ messages.
The root then decides on a merge link and sends a message to the elected node. Trivially this message will need to travel $height(Tree)\leq n$ nodes.

These five phases of request, outside discovery, communication and delivery have a total cost of $n+2n+n+n\leq 5n$ . As for the wasted messages in the $outside?$ between internal nodes, each node $x$ has at most $deg(x)-2$ internal edges, or $deg(x)-1$ if $x$ is a leaf, for a total of $2m-n$ wasted internal messages.

Now for the number of stages. By the previously presented property on the cities size, each city of level $k$ has $\geq 2^{k}$ , hence the largest reachable rank is $log_{2}{n}$ . Since cities can merge/be absorbed only once per stage, we have a total of $2m+n+5n\log {n}$ total messages.

Correctness

Mega-Merger creates a minimum spanning tree by merging sub-trees through the minimum cost path, i.e. the merge link. By definition of minimum spanning tree, a minimum spanning tree is a set of minimum spanning trees connected through minimum-cost paths. By construction Mega-Merger forwards a request through its merge-link, and that sooner or later that edge is going to be part of the tree by deadlock prevention.

Related Research Articles

The travelling salesman problem asks the following question: "Given a list of cities and the distances between each pair of cities, what is the shortest possible route that visits each city exactly once and returns to the origin city?" It is an NP-hard problem in combinatorial optimization, important in theoretical computer science and operations research.

A minimum spanning tree (MST) or minimum weight spanning tree is a subset of the edges of a connected, edge-weighted undirected graph that connects all the vertices together, without any cycles and with the minimum possible total edge weight. That is, it is a spanning tree whose sum of edge weights is as small as possible. More generally, any edge-weighted undirected graph has a minimum spanning forest, which is a union of the minimum spanning trees for its connected components.

Kruskal's algorithm finds a minimum spanning forest of an undirected edge-weighted graph. If the graph is connected, it finds a minimum spanning tree. It is a greedy algorithm in graph theory as in each step it adds the next lowest-weight edge that will not form a cycle to the minimum spanning forest.

Depth-first search (DFS) is an algorithm for traversing or searching tree or graph data structures. The algorithm starts at the root node and explores as far as possible along each branch before backtracking.

Hypergraph Generalization of graph theory

In mathematics, a hypergraph is a generalization of a graph in which an edge can join any number of vertices. In contrast, in an ordinary graph, an edge connects exactly two vertices.

In mathematics, random graph is the general term to refer to probability distributions over graphs. Random graphs may be described simply by a probability distribution, or by a random process which generates them. The theory of random graphs lies at the intersection between graph theory and probability theory. From a mathematical perspective, random graphs are used to answer questions about the properties of typical graphs. Its practical applications are found in all areas in which complex networks need to be modeled – many random graph models are thus known, mirroring the diverse types of complex networks encountered in different areas. In a mathematical context, random graph refers almost exclusively to the Erdős–Rényi random graph model. In other contexts, any graph model may be referred to as a random graph.

In optimization theory, maximum flow problems involve finding a feasible flow through a flow network that obtains the maximum possible flow rate.

In the mathematical theory of matroids, a graphic matroid is a matroid whose independent sets are the forests in a given finite undirected graph. The dual matroids of graphic matroids are called co-graphic matroids or bond matroids. A matroid that is both graphic and co-graphic is called a planar matroid; these are exactly the graphic matroids formed from planar graphs.

In computer science, a disjoint-set data structure, also called a union–find data structure or merge–find set, is a data structure that stores a collection of disjoint (non-overlapping) sets. Equivalently, it stores a partition of a set into disjoint subsets. It provides operations for adding new sets, merging sets, and finding a representative member of a set. The last operation allows to find out efficiently if any two elements are in the same or different sets.

The distributed minimum spanning tree (MST) problem involves the construction of a minimum spanning tree by a distributed algorithm, in a network where nodes communicate by message passing. It is radically different from the classical sequential problem, although the most basic approach resembles Borůvka's algorithm. One important application of this problem is to find a tree that can be used for broadcasting. In particular, if the cost for a message to pass through an edge in a graph is significant, a MST can minimize the total cost for a source process to communicate with all the other processes in the network.

In distributed computing, leader election is the process of designating a single process as the organizer of some task distributed among several computers (nodes). Before the task is begun, all network nodes are either unaware which node will serve as the "leader" of the task, or unable to communicate with the current coordinator. After a leader election algorithm has been run, however, each node throughout the network recognizes a particular, unique node as the task leader.

In computer science and graph theory, Karger's algorithm is a randomized algorithm to compute a minimum cut of a connected graph. It was invented by David Karger and first published in 1993.

In graph theory, Edmonds' algorithm or Chu–Liu/Edmonds' algorithm is an algorithm for finding a spanning arborescence of minimum weight . It is the directed analog of the minimum spanning tree problem. The algorithm was proposed independently first by Yoeng-Jin Chu and Tseng-Hong Liu (1965) and then by Jack Edmonds (1967).

In graph theory, a random geometric graph (RGG) is the mathematically simplest spatial network, namely an undirected graph constructed by randomly placing N nodes in some metric space and connecting two nodes by a link if and only if their distance is in a given range, e.g. smaller than a certain neighborhood radius, r.

In statistics, single-linkage clustering is one of several methods of hierarchical clustering. It is based on grouping clusters in bottom-up fashion, at each step combining two clusters that contain the closest pair of elements not yet belonging to the same cluster as each other.

Capacitated minimum spanning tree is a minimal cost spanning tree of a graph that has a designated root node $and satisfies the capacity constraint . The capacity constraint ensures that all subtrees incident on the root node have no more than nodes. If the tree nodes have weights, then the capacity constraint may be interpreted as follows: the sum of weights in any subtree should be no greater than . The edges connecting the subgraphs to the root node are called gates . Finding the optimal solution is NP-hard.$

The expected linear time MST algorithm is a randomized algorithm for computing the minimum spanning forest of a weighted graph with no isolated vertices. It was developed by David Karger, Philip Klein, and Robert Tarjan. The algorithm relies on techniques from Borůvka's algorithm along with an algorithm for verifying a minimum spanning tree in linear time. It combines the design paradigms of divide and conquer algorithms, greedy algorithms, and randomized algorithms to achieve expected linear performance.

In graph theory, the Stoer–Wagner algorithm is a recursive algorithm to solve the minimum cut problem in undirected weighted graphs with non-negative weights. It was proposed by Mechthild Stoer and Frank Wagner in 1995. The essential idea of this algorithm is to shrink the graph by merging the most intensive vertices, until the graph only contains two combined vertex sets. At each phase, the algorithm finds the minimum $- cut for two vertices and chosen at its will. Then the algorithm shrinks the edge between and to search for non - cuts. The minimum cut found in all phases will be the minimum weighted cut of the graph.$

Yo-Yo is a distributed algorithm aimed at minimum finding and leader election in generic connected undirected graph. Unlike Mega-Merger it has a trivial termination and cost analysis.

$-dimensional hypercube is a network topology for parallel computers with processing elements. The topology allows for an efficient implementation of some basic communication primitives such as Broadcast, All-Reduce, and Prefix sum. The processing elements are numbered through . Each processing element is adjacent to processing elements whose numbers differ in one and only one bit. The algorithms described in this page utilize this structure efficiently.$

References

↑ Gallager, Robert (1983). "A distributed algorithm for minimum spanning tree" (PDF). Massachusetts Institute of Technology.
↑ Awerbuch, Baruch (1987). "Optimal Distributed Algorithm for Minimum Weight Spanning Tree, Counting, Leader Election and Other Problems" (PDF). SIAM Journal on Computing.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Gallager, Robert (1983). "A distributed algorithm for minimum spanning tree" (PDF). Massachusetts Institute of Technology.

[2] Awerbuch, Baruch (1987). "Optimal Distributed Algorithm for Minimum Weight Spanning Tree, Counting, Leader Election and Other Problems" (PDF). SIAM Journal on Computing.

[1]

[2]