Link-centric preferential attachment

Last updated

In mathematical modeling of social networks, link-centric preferential attachment [1] [2] is a node's propensity to re-establish links to nodes it has previously been in contact with in time-varying networks. [3] This preferential attachment model relies on nodes keeping memory of previous neighbors up to the current time. [1] [4]

Contents

Background

In real social networks individuals exhibit a tendency to re-connect with past contacts (ex. family, friends, co-workers, etc.) rather than strangers. In 1970, Mark Granovetter examined this behaviour in the social networks of a group of workers and identified tie strength, a characteristic of social ties describing the frequency of contact between two individuals. From this comes the idea of strong and weak ties, [5] where an individual's strong ties are those she has come into frequent contact with. Link-centric preferential attachment aims to explain the mechanism behind strong and weak ties as a stochastic reinforcement process for old ties in agent-based modeling where nodes have long-term memory.

Examples

In a simple model for this mechanism, a node's propensity to establish a new link can be characterized solely by , the number of contacts it has had in the past. The probability for a node with n social ties to establish a new social tie could then be simply given by [4]

where c is an offset constant. The probability for a node to re-connect with old ties is then

Figure 1. shows an example of this process: in the first step nodes A and C connect to node B, giving B a total of two social ties. With c = 1, in the next step B has a probability P(2) = 1/(2 + 1) = 1/3 to create a new tie with D, whereas the probability to reconnect with A or C is twice that at 2/3.

More complex models may take into account other variables, such as frequency of contact, contact and intercontact duration, as well as short term memory effects. [1]

Effects on the spreading of contagions / weakness of strong ties

Understanding the evolution of a network's structure and how it can influence dynamical processes has become an important part of modeling the spreading of contagions. [6] [7] In models of social and biological contagion spreading on time-varying networks link-centric preferential attachment can alter the spread of the contagion to the entire population. Compared to the classic rumour spreading process where nodes are memory-less, link-centric preferential attachment can cause not only a slower spread of the contagion but also one less diffuse. In these models an infected node's chances of connecting to new contacts diminishes as their size of their social circle grows leading to a limiting effect on the growth of n. The result is strong ties with a node's early contacts and consequently the weakening of the diffusion of the contagion. [1] [4]

See also

Related Research Articles

Scale-free network Network whose degree distribution follows a power law

A scale-free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction P(k) of nodes in the network having k connections to other nodes goes for large values of k as

Small-world network Mathematical graph where most nodes can be reached by a small number of steps

A small-world network is a type of mathematical graph in which most nodes are not neighbors of one another, but the neighbors of any given node are likely to be neighbors of each other and most nodes can be reached from every other node by a small number of hops or steps. Specifically, a small-world network is defined to be a network where the typical distance L between two randomly chosen nodes grows proportionally to the logarithm of the number of nodes N in the network, that is:

Barabási–Albert model

The Barabási–Albert (BA) model is an algorithm for generating random scale-free networks using a preferential attachment mechanism. Several natural and human-made systems, including the Internet, the world wide web, citation networks, and some social networks are thought to be approximately scale-free and certainly contain few nodes with unusually high degree as compared to the other nodes of the network. The BA model tries to explain the existence of such nodes in real networks. The algorithm is named for its inventors Albert-László Barabási and Réka Albert and is a special case of an earlier and more general model called Price's model.

In complex network theory, the fitness model is a model of the evolution of a network: how the links between nodes change over time depends on the fitness of nodes. Fitter nodes attract more links at the expense of less fit nodes.

In applied probability theory, the Simon model is a class of stochastic models that results in a power-law distribution function. It was proposed by Herbert A. Simon to account for the wide range of empirical distributions following a power-law. It models the dynamics of a system of elements with associated counters. In this model the dynamics of the system is based on constant growth via addition of new elements as well as incrementing the counters at a rate proportional to their current values.

Network science

Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, cognitive and semantic networks, and social networks, considering distinct elements or actors represented by nodes and the connections between the elements or actors as links. The field draws on theories and methods including graph theory from mathematics, statistical mechanics from physics, data mining and information visualization from computer science, inferential modeling from statistics, and social structure from sociology. The United States National Research Council defines network science as "the study of network representations of physical, biological, and social phenomena leading to predictive models of these phenomena."

Behavioral contagion is a form of social contagion involving the spread of behaviour through a group. It refers to the propensity for a person to copy a certain behavior of others who are either in the vicinity, or whom they have been exposed to. The term was originally used by Gustave Le Bon in his 1895 work The Crowd: A Study of the Popular Mind to explain undesirable aspects of behavior of people in crowds. In the digital age, behavioural contagion is also concerned with the spread of online behaviour and information. A variety of behavioral contagion mechanisms were incorporated in models of collective human behavior.

Evolving network

Evolving networks are networks that change as a function of time. They are a natural extension of network science since almost all real world networks evolve over time, either by adding or removing nodes or links over time. Often all of these processes occur simultaneously, such as in social networks where people make and lose friends over time, thereby creating and destroying edges, and some people become part of new social networks or leave their networks, changing the nodes in the network. Evolving network concepts build on established network theory and are now being introduced into studying networks in many diverse fields.

Hierarchical network model

Hierarchical network models are iterative algorithms for creating networks which are able to reproduce the unique properties of the scale-free topology and the high clustering of the nodes at the same time. These characteristics are widely observed in nature, from biology to language to some social networks.

Price's model is a mathematical model for the growth of citation networks. It was the first model which generalized the Simon model to be used for networks, especially for growing networks. Price's model belongs to the broader class of network growing models whose primary target is to explain the origination of networks with strongly skewed degree distributions. The model picked up the ideas of the Simon model reflecting the concept of rich get richer, also known as the Matthew effect. Price took the example of a network of citations between scientific papers and expressed its properties. His idea was that the way an old vertex gets new edges should be proportional to the number of existing edges the vertex already has. This was referred to as cumulative advantage, now also known as preferential attachment. Price's work is also significant in providing the first known example of a scale-free network. His ideas were used to describe many real-world networks such as the Web.

Temporal network Network whose links change over time

A temporal network, also known as a time-varying network, is a network whose links are active only at certain points in time. Each link carries information on when it is active, along with other possible characteristics such as a weight. Time-varying networks are of particular relevance to spreading processes, like the spread of information and disease, since each link is a contact opportunity and the time ordering of contacts is included.

Targeted immunization strategies

Targeted immunization strategies are approaches designed to increase the immunization level of populations and decrease the chances of epidemic outbreaks. Though often in regards to use in healthcare practices and the administration of vaccines to prevent biological epidemic outbreaks, these strategies refer in general to immunization schemes in complex networks, biological, social or artificial in nature. Identification of at-risk groups and individuals with higher odds of spreading the disease often plays an important role in these strategies.

In network science, preferential attachment means that nodes of a network tend to connect to those nodes which have more links. If the network is growing and new nodes tend to connect to existing ones with linear probability in the degree of the existing nodes then preferential attachment leads to a scale-free network. If this probability is sub-linear then the network’s degree distribution is stretched exponential and hubs are much smaller than in a scale-free network. If this probability is super-linear then almost all nodes are connected to a few hubs. According to Kunegis, Blattner, and Moser several online networks follow a non-linear preferential attachment model. Communication networks and online contact networks are sub-linear while interaction networks are super-linear. The co-author network among scientists also shows the signs of sub-linear preferential attachment.

Bianconi–Barabási model

The Bianconi–Barabási model is a model in network science that explains the growth of complex evolving networks. This model can explain that nodes with different characteristics acquire links at different rates. It predicts that a node's growth depends on its fitness and can calculate the degree distribution. The Bianconi–Barabási model is named after its inventors Ginestra Bianconi and Albert-László Barabási. This model is a variant of the Barabási–Albert model. The model can be mapped to a Bose gas and this mapping can predict a topological phase transition between a "rich-get-richer" phase and a "winner-takes-all" phase.

Hub (network science) Node with a number of links that greatly exceeds the average

In network science, a hub is a node with a number of links that greatly exceeds the average. Emergence of hubs is a consequence of a scale-free property of networks. While hubs cannot be observed in a random network, they are expected to emerge in scale-free networks. The uprise of hubs in scale-free networks is associated with power-law distribution. Hubs have a significant impact on the network topology. Hubs can be found in many real networks, such as the brain or the Internet.

In a scale-free network the degree distribution follows a power law function. In some empirical examples this power-law fits the degree distribution well only in the high degree region, however for small degree nodes the empirical degree-distribution deviates from it. See for example the network of scientific citations. This deviation of the observed degree-distribution from the theoretical prediction at the low-degree region is often referred as low-degree saturation.

The initial attractiveness is a possible extension of the Barabási–Albert model. The Barabási–Albert model generates scale-free networks where the degree distribution can be described by a pure power law. However, the degree distribution of most real life networks cannot be described by a power law solely. The most common discrepancies regarding the degree distribution found in real networks are the high degree cut-off and the low degree cut-off. The inclusion of initial attractiveness in the Barabási–Albert model addresses the low-degree cut-off phenomenon.

Physicists often use various lattices to apply their favorite models in them. For instance, the most favorite lattice is perhaps the square lattice. There are 14 Bravais space lattice where every cell has exactly the same number of nearest, next nearest, nearest of next nearest etc neighbors and hence they are called regular lattice. Often physicists and mathematicians study phenomena which require disordered lattice where each cell do not have exactly the same number of neighbors rather the number of neighbors can vary wildly. For instance, if one wants to study the spread of disease, viruses, rumors etc then the last thing one would look for is the square lattice. In such cases a disordered lattice is necessary. One way of constructing a disordered lattice is by doing the following.

Mediation-driven attachment model

In the scale-free network theory, a mediation-driven attachment (MDA) model appears to embody a preferential attachment rule tacitly rather than explicitly. According to MDA rule, a new node first picks a node from the existing network at random and connect itself not with that but with one of the neighbors also picked at random.

Configuration model

In network science, the configuration model is a method for generating random networks from given degree sequence. It is widely used as a reference model for real-life social networks, because it allows the user to incorporate arbitrary degree distributions.

References

  1. 1 2 3 4 Vestergaard, Christian L.; Genois, Mathieu; Barrat, Alain (October 9, 2014). "How memory generates hetergeneous dynamics in temporal networks". Physical Review E. 90 (4): 042805. arXiv: 1409.1805 . Bibcode:2014PhRvE..90d2805V. doi:10.1103/PhysRevE.90.042805. PMID   25375547. S2CID   16022001.
  2. Barabasi, Albert-Laszlo; Albert, Reka (October 15, 1999). "Emergence of Scaling in Random Networks". Science. 286 (5439): 509–512. arXiv: cond-mat/9910332 . Bibcode:1999Sci...286..509B. doi:10.1126/science.286.5439.509. PMID   10521342. S2CID   524106.
  3. Perra, Nicola; Goncalves, Bruno; Pastor-Satorras, Romualdo; Vespignani, Alessandro (June 25, 2012). "Activity driven modeling of time-varying networks". Scientific Reports. 2: 469. arXiv: 1203.5351 . Bibcode:2012NatSR...2E.469P. doi:10.1038/srep00469. PMC   3384079 . PMID   22741058.
  4. 1 2 3 Karsai, Marton (February 10, 2014). "Time-varying networks and the weakness of strong ties". Scientific Reports. 4: 4001. arXiv: 1303.5966 . Bibcode:2014NatSR...4E4001K. doi:10.1038/srep04001. PMC   3918922 . PMID   24510159.
  5. Granovetter, Mark (1973). "The strength of weak ties". American Journal of Sociology. 78 (6): 1360–1380. doi:10.1086/225469. S2CID   59578641.
  6. Newman, M. E. J. (July 26, 2002). "Spread of epidemic disease on networks". Physical Review E. 66 (16128): 016128. arXiv: cond-mat/0205009 . Bibcode:2002PhRvE..66a6128N. doi:10.1103/PhysRevE.66.016128. PMID   12241447. S2CID   15291065.
  7. Kamp, Christel; Moslonka-Lefebvre, Mathieu; Alizon, Samuel (December 13, 2013). "Epidemic Spread on Weighted Networks". PLOS Computational Biology. 9 (1371): e1003352. Bibcode:2013PLSCB...9E3352K. doi: 10.1371/journal.pcbi.1003352 . PMC   3861041 . PMID   24348225.