Biased random walk on a graph

Last updated

In network science, a biased random walk on a graph is a time path process in which an evolving variable jumps from its current state to one of various potential new states; unlike in a pure random walk, the probabilities of the potential new states are unequal.

Contents

Biased random walks on a graph provide an approach for the structural analysis of undirected graphs in order to extract their symmetries when the network is too complex or when it is not large enough to be analyzed by statistical methods. The concept of biased random walks on a graph has attracted the attention of many researchers and data companies over the past decade especially in the transportation and social networks. [1]

Model

There have been written many different representations of the biased random walks on graphs based on the particular purpose of the analysis. A common representation of the mechanism for undirected graphs is as follows: [2]

On an undirected graph, a walker takes a step from the current node, to node Assuming that each node has an attribute the probability of jumping from node to is given by:

where represents the topological weight of the edge going from to

In fact, the steps of the walker are biased by the factor of which may differ from one node to another. [3]

Depending on the network, the attribute can be interpreted differently. It might be implied as the attraction of a person in a social network, it might be betweenness centrality or even it might be explained as an intrinsic characteristic of a node. In case of a fair random walk on graph is one for all the nodes.

In case of shortest paths random walks [4] is the total number of the shortest paths between all pairs of nodes that pass through the node . In fact the walker prefers the nodes with higher betweenness centrality which is defined as below:

Based on the above equation, the recurrence time to a node in the biased walk is given by: [5]

Applications

There are a variety of applications using biased random walks on graphs. Such applications include control of diffusion, [6] advertisement of products on social networks, [7] explaining dispersal and population redistribution of animals and micro-organisms, [8] community detections, [9] wireless networks, [10] and search engines. [11]

See also

Related Research Articles

In graph theory, a clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together. Evidence suggests that in most real-world networks, and in particular social networks, nodes tend to create tightly knit groups characterised by a relatively high density of ties; this likelihood tends to be greater than the average probability of a tie randomly established between two nodes.

<span class="mw-page-title-main">Centrality</span> Degree of connectedness within a graph

In graph theory and network analysis, indicators of centrality assign numbers or rankings to nodes within a graph corresponding to their network position. Applications include identifying the most influential person(s) in a social network, key infrastructure nodes in the Internet or urban networks, super-spreaders of disease, and brain networks. Centrality concepts were first developed in social network analysis, and many of the terms used to measure centrality reflect their sociological origin.

<span class="mw-page-title-main">Assortativity</span> Tendency for similar nodes to be connected

Assortativity, or assortative mixing, is a preference for a network's nodes to attach to others that are similar in some way. Though the specific measure of similarity may vary, network theorists often examine assortativity in terms of a node's degree. The addition of this characteristic to network models more closely approximates the behaviors of many real world networks.

<span class="mw-page-title-main">Random geometric graph</span> In graph theory, the mathematically simplest spatial network

In graph theory, a random geometric graph (RGG) is the mathematically simplest spatial network, namely an undirected graph constructed by randomly placing N nodes in some metric space and connecting two nodes by a link if and only if their distance is in a given range, e.g. smaller than a certain neighborhood radius, r.

Different definitions have been given for the dimension of a complex network or graph. For example, metric dimension is defined in terms of the resolving set for a graph. Dimension has also been defined based on the box covering method applied to graphs. Here we describe the definition based on the complex network zeta function. This generalises the definition based on the scaling property of the volume with distance. The best definition depends on the application.

An important question in statistical mechanics is the dependence of model behaviour on the dimension of the system. The shortcut model was introduced in the course of studying this dependence. The model interpolates between discrete regular lattices of integer dimension.

<span class="mw-page-title-main">Network science</span> Academic field

Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, cognitive and semantic networks, and social networks, considering distinct elements or actors represented by nodes and the connections between the elements or actors as links. The field draws on theories and methods including graph theory from mathematics, statistical mechanics from physics, data mining and information visualization from computer science, inferential modeling from statistics, and social structure from sociology. The United States National Research Council defines network science as "the study of network representations of physical, biological, and social phenomena leading to predictive models of these phenomena."

<span class="mw-page-title-main">Closeness centrality</span>

In a connected graph, closeness centrality of a node is a measure of centrality in a network, calculated as the reciprocal of the sum of the length of the shortest paths between the node and all other nodes in the graph. Thus, the more central a node is, the closer it is to all other nodes.

<span class="mw-page-title-main">Google matrix</span> Stochastic matrix representing links between entities

A Google matrix is a particular stochastic matrix that is used by Google's PageRank algorithm. The matrix represents a graph with edges representing links between pages. The PageRank of each page can then be generated iteratively from the Google matrix using the power method. However, in order for the power method to converge, the matrix must be stochastic, irreducible and aperiodic.

<span class="mw-page-title-main">Katz centrality</span>

In graph theory, the Katz centrality or alpha centrality of a node is a measure of centrality in a network. It was introduced by Leo Katz in 1953 and is used to measure the relative degree of influence of an actor within a social network. Unlike typical centrality measures which consider only the shortest path between a pair of actors, Katz centrality measures influence by taking into account the total number of walks between a pair of actors.

<span class="mw-page-title-main">Betweenness centrality</span> Measure of a graphs centrality, based on shortest paths

In graph theory, betweenness centrality is a measure of centrality in a graph based on shortest paths. For every pair of vertices in a connected graph, there exists at least one shortest path between the vertices such that either the number of edges that the path passes through or the sum of the weights of the edges is minimized. The betweenness centrality for each vertex is the number of these shortest paths that pass through the vertex.

Random walk closeness centrality is a measure of centrality in a network, which describes the average speed with which randomly walking processes reach a node from other nodes of the network. It is similar to the closeness centrality except that the farness is measured by the expected length of a random walk rather than by the shortest path.

<span class="mw-page-title-main">Hyperbolic geometric graph</span>

A hyperbolic geometric graph (HGG) or hyperbolic geometric network (HGN) is a special type of spatial network where (1) latent coordinates of nodes are sprinkled according to a probability density function into a hyperbolic space of constant negative curvature and (2) an edge between two nodes is present if they are close according to a function of the metric (typically either a Heaviside step function resulting in deterministic connections between vertices closer than a certain threshold distance, or a decaying function of hyperbolic distance yielding the connection probability). A HGG generalizes a random geometric graph (RGG) whose embedding space is Euclidean.

<span class="mw-page-title-main">Multidimensional network</span> Networks with multiple kinds of relations

In network theory, multidimensional networks, a special type of multilayer network, are networks with multiple kinds of relations. Increasingly sophisticated attempts to model real-world systems as multidimensional networks have yielded valuable insight in the fields of social network analysis, economics, urban and international transport, ecology, psychology, medicine, biology, commerce, climatology, physics, computational neuroscience, operations management, and finance.

<span class="mw-page-title-main">Disparity filter algorithm of weighted network</span>

Disparity filter is a network reduction algorithm to extract the backbone structure of undirected weighted network. Many real world networks such as citation networks, food web, airport networks display heavy tailed statistical distribution of nodes' weight and strength. Disparity filter can sufficiently reduce the network without destroying the multi-scale nature of the network. The algorithm is developed by M. Angeles Serrano, Marian Boguna and Alessandro Vespignani.

<span class="mw-page-title-main">Temporal network</span> Network whose links change over time

A temporal network, also known as a time-varying network, is a network whose links are active only at certain points in time. Each link carries information on when it is active, along with other possible characteristics such as a weight. Time-varying networks are of particular relevance to spreading processes, like the spread of information and disease, since each link is a contact opportunity and the time ordering of contacts is included.

<span class="mw-page-title-main">Efficiency (network science)</span>

In network science, the efficiency of a network is a measure of how efficiently it exchanges information and it is also called communication efficiency. The underlying idea is that the more distant two nodes are in the network, the less efficient their communication will be. The concept of efficiency can be applied to both local and global scales in a network. On a global scale, efficiency quantifies the exchange of information across the whole network where information is concurrently exchanged. The local efficiency quantifies a network's resistance to failure on a small scale. That is the local efficiency of a node characterizes how well information is exchanged by its neighbors when it is removed.

Maximal entropy random walk (MERW) is a popular type of biased random walk on a graph, in which transition probabilities are chosen accordingly to the principle of maximum entropy, which says that the probability distribution which best represents the current state of knowledge is the one with largest entropy. While standard random walk chooses for every vertex uniform probability distribution among its outgoing edges, locally maximizing entropy rate, MERW maximizes it globally by assuming uniform probability distribution among all paths in a given graph.

A set of networks that satisfies given structural characteristics can be treated as a network ensemble. Brought up by Ginestra Bianconi in 2007, the entropy of a network ensemble measures the level of the order or uncertainty of a network ensemble.

In network science, the network entropy is a disorder measure derived from information theory to describe the level of randomness and the amount of information encoded in a graph. It is a relevant metric to quantitatively characterize real complex networks and can also be used to quantify network complexity

References

  1. Roberta Sinatra; Jesús Gómez-Gardeñes; Renaud Lambiotte; Vincenzo Nicosia; Vito Latora (March 2011). "Maximal-entropy random walks in complex networks with limited information". Physical Review E. 83 (3): 030103. arXiv: 1007.4936 . Bibcode:2011PhRvE..83c0103S. doi:10.1103/PhysRevE.83.030103. PMID   21517435. S2CID   6984660.
  2. J. Gómez-Gardeñes; V. Latora (Dec 2008). "Entropy rate of diffusion processes on complex networks". Physical Review E. 78 (6): 065102. arXiv: 0712.0278 . Bibcode:2008PhRvE..78f5102G. doi:10.1103/PhysRevE.78.065102. PMID   19256892. S2CID   14100937.
  3. R. Lambiotte; R. Sinatra; J.-C. Delvenne; T.S. Evans; M. Barahona; V. Latora (Dec 2010). "Flow graphs: interweaving dynamics and structure". Physical Review E. 84 (1): 017102. arXiv: 1012.1211 . Bibcode:2011PhRvE..84a7102L. doi:10.1103/PhysRevE.84.017102. PMID   21867345. S2CID   2286264.
  4. Blanchard, P; Volchenkov, D (2008). Mathematical Analysis of Urban Spatial Networks. Springer. doi:10.1007/978-3-540-87829-2. ISBN   978-3-540-87828-5 via ResearchGate.
  5. Volchenkov D; Blanchard P (2011). Fair and biased random walks on undirected graphs and related entropies. Birkhäuser. p. 380. ISBN   978-0-8176-4903-6.
  6. Chung, Zhao, Fan, Wenbo (2010). "PageRank and random walks on graphs". Fete of Combinatorics and Computer Science. Bolyai Society Mathematical Studies. Vol. 20. pp. 43–62. CiteSeerX   10.1.1.157.7116 . doi:10.1007/978-3-642-13580-4_3. ISBN   978-3-642-13579-8. S2CID   3207094.
  7. Adal, K.M. (June 2010). "Biased random walk based routing for mobile ad hoc networks". 2010 International Conference on Intelligent and Advanced Systems. pp. 1–6. doi:10.1109/ICIAS.2010.5716181. ISBN   978-1-4244-6623-8. S2CID   16113377.
  8. Kakajan Komurov; Michael A. White; Prahlad T. Ram (Aug 2010). "Use of Data-Biased Random Walks on Graphs for the Retrieval of Context-Specific Networks from Genomic Data". PLOS Comput Biol. 6 (8): e1000889. Bibcode:2010PLSCB...6E0889K. doi:10.1371/journal.pcbi.1000889. PMC   2924243 . PMID   20808879.
  9. J.K. Ochab; Z. Burda (Jan 2013). "Maximal entropy random walk in community detection". The European Physical Journal Special Topics. 216: 73–81. arXiv: 1208.3688 . Bibcode:2013EPJST.216...73O. doi:10.1140/epjst/e2013-01730-6. S2CID   56409069.
  10. Beraldi, Roberto (Apr 2009). "Biased Random Walks in Uniform Wireless Networks". IEEE Transactions on Mobile Computing. 8 (4): 500–513. doi:10.1109/TMC.2008.151. S2CID   13521325.
  11. Da-Cheng Nie; Zi-Ke Zhang; Qiang Dong; Chongjing Sun; Yan Fu (July 2014). "Information Filtering via Biased Random Walk on Coupled Social Network". The Scientific World Journal. 2014: 829137. doi: 10.1155/2014/829137 . PMC   4132410 . PMID   25147867.