This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The relative importance of structure and dynamics on node influence in reversible spreading processes

Junyi Qu School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China    Ming Tang [email protected] School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China Shanghai Key Laboratory of Multidimensional Information Processing, East China Normal University, Shanghai 200241, China    Ying Liu [email protected] School of Computer Science, Southwest Petroleum University, Chengdu 610500, China    Shuguang Guan [email protected] School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
Abstract

The reversible spreading processes with repeated infection widely exist in nature and human society, such as gonorrhea propagation and meme spreading. Identifying influential spreaders is an important issue in the reversible spreading dynamics on complex networks, which has been given much attention. Except for structural centrality, the nodes’ dynamical states play a significant role in their spreading influence in the reversible spreading processes. By integrating the number of outgoing edges and infection risks of node’s neighbors into structural centrality, a new measure for identifying influential spreaders is articulated which considers the relative importance of structure and dynamics on node influence. The number of outgoing edges and infection risks of neighbors represent the positive effect of the local structural characteristic and the negative effect of the dynamical states of nodes in identifying influential spreaders, respectively. We find that an appropriate combination of these two characteristics can greatly improve the accuracy of the proposed measure in identifying the most influential spreaders. Notably, compared with the positive effect of the local structural characteristic, slightly weakening the negative effect of dynamical states of nodes can make the proposed measure play the best performance. Quantitatively understanding the relative importance of structure and dynamics on node influence provides a significant insight into identifying influential nodes in the reversible spreading processes.

pacs:
87.19.X-,89.75.Hc,87.23.Ge

I Introduction

Identifying influential spreaders is an important topic in the spreading dynamics on complex networks Koschützki et al. (2005); Lü et al. (2016); Pei et al. (2020). Identifying these nodes can help us to formulate precise marketing strategies Leskovec et al. (2007), control the spread of public opinions Bovet and Makse (2019); Lin et al. (2018), prevent the catastrophic disruptions in power girds Motter and Lai (2002); Albert et al. (2004) and control the outbreak of epidemics Pastor-Satorras and Vespignani (2002); Scarpino and Petri (2019); Zhou and Liu (2008).

In recent years, there have been a lot of researches devoted to identifying the most influential spreaders Morone and Makse (2015); Pei et al. (2018); Hu et al. (2018); Lokhov and Saad (2017); Zheng et al. (2021). As the network structure has a significant impact on the spreading processes, some structural centralities are used to measure the spreading influence of a node or node set, such as degree, closeness centrality, betweenness centrality and kk-shell index Lü et al. (2016); Pei et al. (2020).

However, the complex interplays between network structure and spreading dynamics sometimes make the structural centralities be unable to effectively identify the influential nodes accurately in complex networks. Poux-Médard et al. (2020); Erkol et al. (2020); Aral and Dhillon (2018). Integrating the dynamical states of nodes in the spreading process can promote the accuracy of measures in identifying the most influential spreaders Klemm et al. (2012). Most of previous researches focus on irreversible spreading dynamics which has a final state. But, some spreading processes are reversible with repeated infections, such as meme spreading on online social networks Gleeson et al. (2014), gonorrhea spreading propagation on sexual intercourse networks Pastor-Satorras and Vespignani (2001a) and the rise and fall of stock prices on financial stock market networks Stavroglou et al. (2019). These reversible processes can be described by the SIS spreading dynamics Barzel and Barabási (2013); Pastor-Satorras and Vespignani (2001b, c), where a node transfers between infection state and recovery state. Qu etet alal. proposed a single-node control model to evaluate the influence of nodes and a structure-dynamics combined centrality to identify influential nodes in the reversible spreading system Qu et al. (2020). They found that taking the neighbors’ centrality and dynamical states into account can identify influential nodes more accurately in real-world networks than the benchmark centralities.

Although the interplay between network structure and dynamics determines the influence of nodes, the relative importance of the two factors in identifying influential nodes is still unknown. By considering the positive effects of the neighbors’ local structures and the negative effects of the neighbors’ dynamical states, a new structure-dynamics combined centrality is proposed to identify influential spreaders and evaluate the relative importance of structure and dynamics on the determining node influence. In this new measure, the number of outgoing edges and the infection risks of neighbors are used to represent the local structures and dynamical states of the neighbors respectively. Simulation results on real-world networks show that considering neighbors’ local structures and dynamical states can help improving the accuracy of the index in identifying influential spreaders. Notably, compared with the number of outgoing edges of neighbors, slightly weakening the impact of neighbors’ infection risks can greatly promote the accuracy of the index to identify influential spreaders. There is an optimal range, within which the local structures and dynamical states of neighbors enhance the accuracy of the influence ranking index the most.

The rest of the paper is organized as follows. Section II introduces a single-node control model to quantify the influence of a node in SIS spreading processes. In Section III, based on the neighbors’ local structures and dynamical states , we propose a node centrality index in the SIS spreading dynamics. Section IV shows the numerical simulation results and analyses on real-world networks. Section V is the conclusion and discussion.

II Single-node control model in the SIS spreading processes

In our previous research, we have proposed a single-node control method to evaluate the influence of nodes in the reversible spreading system Qu et al. (2020). In SIS spreading processes, a node has only two possible states: susceptible (S) state and infected (I) state. An infected node transmits a disease to each of its direct susceptible neighbors with rate β\beta and recovers to susceptible state with rate μ\mu. After a long-time transient process, the proportion of infected nodes in the network will be stable, which is recorded as φ0\varphi_{0}. Then we artificially set one node to be in the infected state constantly. This will cause a change (improvement) in the proportion of infected nodes in the entire network. After certain time steps, the proportion of the I-state nodes reaches a new steady state and the proportion of infected nodes is recorded as φ\varphi. We define Δφ=φφ0\Delta\varphi=\varphi-\varphi_{0} as the spreading influence, also called the spreading efficiency of the controlled node. The larger the Δφ\Delta\varphi is, the higher influence of the controlled node has.

In simulations, we use the synchronous updating method Shu et al. (2016). The recovery rate is μ=0.1\mu=0.1 and the effective transmission rate λ=β/μ\lambda=\beta/\mu should not be too large because when it is too large, the propagation can easily spread to the entire network regardless of the node under control Liu et al. (2015). In that case the difference between the controlled nodes is diminished, and the influence of each node is indistinguishable. Meanwhile, the effective transmission rate λ\lambda should not be too small. If it is too small, the propagation will be limited to the local neighbourhood of the controlled node and cannot affect the entire network. We obtain the SIS epidemic threshold λc\lambda_{c} using the susceptibility Ferreira et al. (2012); Shu et al. (2015); Xu et al. (2019)

χ(λ)=N<ρ2><ρ>2<ρ>,\chi(\lambda)=N\frac{<\rho^{2}>-<\rho>^{2}}{<\rho>}, (1)

and choose an appropriate transmission rate λ>λc\lambda>\lambda_{c}. Under this effective transmission rate, the influence of nodes can be distinguished.

III The proposed centrality NSRC

It has been pointed out that compared with degree and kk-shell index, considering the contribution of neighbors to the initial spreader can significantly improve the accuracy of ranking measures in the SIR-like spreading processes Liu et al. (2016). If a neighbor jj of node ii only has neighbors that are also the direct neighbor of node ii, when we control node ii, the contribution of neighbor jj is less because it cannot assist node ii to spread the message to the entire network.

Assume that the network G=(V,E)G=(V,E) is unweighted and undirected composed of |V|=N|V|=N nodes and |E|=M|E|=M edges. The adjacency matrix of the network is denoted as A=(aij)\textbf{A}=(a_{ij}), where aij=1a_{ij}=1 means there is a direct connection between node ii and node jj, otherwise aij=0a_{ij}=0. We further consider the two-step neighbor ll of node ii connected to the node ii through the nearest neighbour node jj. If aij=1a_{ij}=1, ajl=1a_{jl}=1 and ail1a_{il}\neq 1, then we define bjl=1b_{jl}=1 to represent an outgoing edge of node jj. The edge between the nearest neighbour node jj and node ll is the outgoing edge of node jj relevant to node ii. For each of the neighbor node jj, there is kjout=lΓjbjlk_{j}^{out}=\sum_{l\in\Gamma_{j}}{b_{jl}}. Take the network in Fig. 1 for example. Node 1 has four neighbors {2,3,4,5}\{2,3,4,5\} and the outgoing edges of its nearest neighbour node 2 are the edges that connect node 2 and the node set {6,7,8,9}\{6,7,8,9\}. The outgoing edges of nodes 3, 4, 5 connect to the node sets {10,11}\{10,11\}, {12,13,14}\{12,13,14\} and {15,16}\{15,16\} respectively. Considering the characteristics of SIS spreading processes, we introduce the number of outgoing edges of neighbors into the node influential index with a weighting parameter aa. The idea is the more outgoing edges the neighbor nodes have, the more they help the considered node to spread information or message to the entire network Liu et al. (2016).

In the SIS spreading dynamics, when the system reaches a steady state, a node can still be infected repeatedly. In order to maximize the spreading to the entire network, we consider the risk that a node is infected. In Fig. 1, the infection risk of node 3 is 0.5. When node 1 is controlled and spreads out through node 3, because node 3 has a risk of infection, the influence of node 1 will be reduced. As the infection risk has a negative impact on the influence of origin node, the term with ρ\rho has a minus sign. Parameter bb is an adjustable weight indicating the relative importance of local dynamical states (i.e., infection risks).

Refer to caption
Figure 1: Illustration of calculating the NSRC. The decimal number labeled to the node number is the probability of being in the infected state. The NSRC of node 1 is calculated as C(1)=k1+a(k2out+k3out+k4out+k5out)b(k2outρ2+k3outρ3+k4outρ4+k5outρ5)=4+a(4+2+3+2)b(1.6+1+1.5+0.6)=4+11a4.7bC(1)=k_{1}+a(k_{2}^{out}+k_{3}^{out}+k_{4}^{out}+k_{5}^{out})-b(k_{2}^{out}\rho_{2}+k_{3}^{out}\rho_{3}+k_{4}^{out}\rho_{4}+k_{5}^{out}\rho_{5})=4+a(4+2+3+2)-b(1.6+1+1.5+0.6)=4+11a-4.7b

As considering neighbors’ local structures and dynamical states can effectively improve the accuracy of ranking measure for node influence on them Qu et al. (2020), we here investigate how to quantitatively evaluate the importance of these two factors. We propose an index to rank node influence in SIS reversible spreading processes called neighboring local structures and infection risks coupled centrality (NSRC), which is defined as

C(i)=ki+ajΓikjoutbjΓikjoutρj,C(i)=k_{i}+a\sum_{j\in\Gamma_{i}}{k_{j}^{out}}-b\sum_{j\in\Gamma_{i}}{k_{j}^{out}\rho_{j}}, (2)

where kik_{i} is the benchmark centrality (i. e., degree), jΓij\in\Gamma_{i} is the direct neighbor of node ii. The second item on the right side of Eq. (6) represents the local structures of the neighbor node jj, which is the number of outgoing edges through all neighbors of node ii. The third item represents the contribution of dynamical state of neighbor node jj, which is the infection risk of node jj. In the spreading process, when the system reaches its steady state, a node can still be infected repeatedly. In order to maximize the spread of infection to the entire network, we consider controlling a node ii in I state. Suppose the neighboring node jj is in the I state with infection probability ρj\rho_{j} in steady state. In simulations, if the neighboring node jj is in I state at this time step, node ii will have no effect on node jj and the influence of node ii will be reduced. As the ρj\rho_{j} has a negative impact on the influence of node ii, the term with ρj\rho_{j} has a minus sign. For simplicity, only the one-step neighbors of node ii and their outgoing edges are considered. aa and bb are two weight parameters, which represent contribution of the the local structures and the dynamical states of the neighbors respectively. The larger aa indicates the higher importance of the local structures and the larger bb indicates that the neighbors’ infections are more important. The combination of aa and bb can be coordinated by their weights which will be discussed in detail later.

In order to measure the accuracy of NSRC in identifying influential spreaders, the following two evaluation methods are used. The imprecision function is defined as

ε(p)=1ϕ(p)ϕeff(p),\varepsilon(p)=1-\frac{\phi(p)}{\phi_{eff}(p)}, (3)

where pp is the fraction of network size NN. ϕ(p)\phi(p) is the average spreading efficiency of pNpN nodes with the highest NSRC, and ϕeff(p)\phi_{eff}(p) is the average spreading efficiency of the pNpN nodes with the highest spreading efficiency. When pp is fixed, a smaller ε(p)\varepsilon(p) value indicates a more accurate centrality in identifying influential spreaders.

The imprecision function reflects the accuracy of the NSRC in identifying the most influential nodes. In order to measure the ability of NSRC in ranking all nodes in the network, the Kendall’s τ\tau correlation coefficient is used, which is given by Kendall (1938)

τ=i<jsgn[(xixj)(yiyj)]12N(N1),\tau=\frac{\sum_{i<j}\operatorname{sgn}[(x_{i}-x_{j})(y_{i}-y_{j})]}{\frac{1}{2}N(N-1)}, (4)

where sgn(x)\operatorname{sgn}(x) is a sign function. sgn(x)\operatorname{sgn}(x) returns 1 if x>0x>0, -1 if x<0x<0, and 0 if x=0x=0, respectively. NN is the number of nodes in the list. xix_{i} and xjx_{j} are the ranking orders of node ii and node jj in sorting sequence 1. yiy_{i} and yjy_{j} are the ranking orders of node ii and node jj in sorting sequence 2. If node ii and node jj have a concordant order in sorting sequence 1 and 2, (xixj)(yiyj)>0(x_{i}-x_{j})(y_{i}-y_{j})>0. If node ii and node jj have a discordant order in sorting sequence 1 and 2, (xixj)(yiyj)<0(x_{i}-x_{j})(y_{i}-y_{j})<0. If node ii and node jj have a same order in sorting sequence 1 or 2, (xixj)(yiyj)=0(x_{i}-x_{j})(y_{i}-y_{j})=0. We rank the nodes by the proposed NSRC as the sorting sequence 1 and rank the nodes by the simulated spreading efficiency as sorting sequence 2. We compute the correlation coefficient of sorting sequence 1 and 2. The larger correlation coefficient is, the more concordant the centrality and the spreading efficiency are. For an explicit comparison, we calculate the improved τ\tau ratio of the NSRC over the reference centrality, which is

η={τcτ0τ0τ0>0;τcτ0τ0τ0<0;0τ0=0,\eta=\left\{\begin{aligned} \frac{\tau_{c}-\tau_{0}}{\tau_{0}}&&{\tau_{0}>0};\\ \frac{\tau_{c}-\tau_{0}}{-\tau_{0}}&&{\tau_{0}<0};\\ 0&&{\tau_{0}=0},\end{aligned}\right. (5)

where τc\tau_{c} is the correlation coefficient between the NSRC and spreading efficiency, τ0\tau_{0} is the correlation coefficient between the neighboring dynamical information centrality (NDIC) and spreading efficiency.

IV Numerical simulations

We apply the NSRC to eight real-world networks as listed in Table 1. The real-world networks are: Netsci (collaboration network of network scientists), Hamster (friendships and family network of a website), Router (the router network collected by the Rocketfuel Project), Blog (communication network between users on the MSN website), PGP (an encrypted communication network), CA_hep (collaboration network of arxiv in high-energy physics theory), Astro (collaboration network of astrophysics scientists) and Email (email communication network of University College London) Newman (2006); Spring et al. (2002); Boguna et al. (2004); Newman (2001); Kitsak et al. (2010).

Network NN EE <k><k> kmaxk_{max} HkH_{k} rr CC λc\lambda_{c} λ\lambda
Net379 379 914 4.8 34 1.663 -0.082 0.741 0.20 0.28
Hamster 2000 16097 16.1 273 2.719 0.023 0.540 0.024 0.04
Router 5022 6258 2.5 16 5.503 -0.138 0.012 0.10 0.20
Blog 3982 6803 3.4 189 4.038 -0.133 0.284 0.10 0.20
PGP 10680 24340 4.6 206 4.153 0.240 0.226 0.06 0.12
CA_Hep 8638 24806 5.7 65 2.261 0.239 0.482 0.08 014
Astro 14845 119625 16.1 360 2.820 0.228 0.670 0.02 0.10
Email 12625 20362 3.2 576 34.249 -0.387 0.109 0.01 0.06
Table 1: Characteristics of the real-world networks studied in this paper. Theses characteristics include the number of nodes (NN), number of edges (EE), average degree (<k><k>), maximum degree (kmaxk_{max}), degree heterogeneity (HkH_{k}), degree assortativity (rr), clustering coefficient(CC), epidemic threshold (λc\lambda_{c}) and disease-transmission rate λ\lambda used in the SIS spreading processes.

IV.1 The relative importance of neighbors’ local structures and dynamical states

Next we focus on the most influential spreaders which are very critical in the SIS spreading processes. We investigate the relative importance of the neighbors’ local structures and dynamical states based on the top 5% nodes and top 20% nodes ranked by the NSRC. Figures 2 and 3 show the the imprecisions of the NSRC to identify the top 5% and 20% nodes under different values of the weight parameters aa and bb respectively. In the Fig. 2, where the top 5% influential spreaders are considered, and the blue areas in which the imprecision of NSRC is relatively small correspond to the parameter range aba\geq b in the lower right part of the diagram. In the Netsci, Router, Blog, Email and PGP networks respectively, there is an optimal area at the lower part of the diagonal a=ba=b. In the Hamster, CA_Hep and Astro networks respectively, the imprecision is relatively small in the area of a<ba<b. There is a saturation effect in this area where the imprecision does not change much with the change of aa and bb. When we consider the top ranked 20% nodes by the NSRC, the results shown in Fig. 3 are similar to that of the top ranked 5% nodes shown in Fig. 2. In the Netsci and Email networks, there are optimal areas near the diagonal while in the other six networks, the optimal area is the lower right part of a>ba>b. This implies that for the proposed index NSRC based on neighbors’ local structures and dynamical states to be more accurate, the weight of the infection risks bb should not exceed the weight of the neighbors’ local structures aa.

Refer to caption
Figure 2: The imprecision of NSRC as a function of aa and bb in eight real-world networks for p=5%p=5\%. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).
Refer to caption
Figure 3: The imprecision of NSRC as a function of aa and bb in eight real-world networks for p=20%p=20\%. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).

To further explore the impact of the local structures and dynamical states of neighbor nodes on the accuracy of the proposed centrality, we fix the value of the parameter aa and investigate the imprecisions of NSRC as a function of the parameter bb. Figure 4 shows that when the weight parameter aa of the neighbors’ local structure is fixed, the imprecision function has an optimal value or saturation effect with the change of bb. When a=0.2a=0.2, in the Blog, Email and PGP networks, the imprecisions are lowest at around b=0.1,b=0.2,b=0.2b=0.1,b=0.2,b=0.2 respectively. In the other five networks, the imprecisions increase with the increase of bb, and there is a saturation effect when ba=0.2b\leq a=0.2. The results for a=0.4a=0.4 and a=0.8a=0.8 support the same conclusion, as shown in Fig. 4. These results indicate that the same conclusion also applies when a=0.4a=0.4 and a=0.8a=0.8 in Fig. 4. These results indicate that the parameters aa and bb have a synergistic effect in determining the accuracy of the centrality based on the neighbors’ local structures and dynamical states. When the accuracy is the highest, the optimal parameter bb changes with the parameter aa.

Refer to caption
Figure 4: The imprecision of NSRC as a function of bb in eight real-world networks for p=5%p=5\%. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).

Through the experimental results in eight real-world networks, it is found that regardless of the value of the parameter aa, the imprecision function will have a minimum value if bb is not greater than aa. To understand the above phenomenon, we can rewrite the centrality index as

C(i)=ki+ajΓikjout(1baρj).C(i)=k_{i}+a\sum_{j\in\Gamma_{i}}{k_{j}^{out}(1-\frac{b}{a}\rho_{j})}. (6)

In the form of Eq. (6), we can discuss the relationship between the items more conveniently. The optimal area of b<ab<a indicates that although both the positive effect of the outgoing edges and the negative effect of the infection risks of neighbors’ have a significant contribution to the accuracy of the ranking measure, the relative importance of them determines the ranking accuracy. Specifically speaking, compared to the positive effect of the number of neighbors’ outgoing edges, slightly weakening the negative effect of neighbors’ infection risks can get better identification results.

The above results can be explained as follows. For a considered node, if two of its neighbors have the same number of outgoing edges, a higher infection risk of its neighbor ρj\rho_{j} means a limited impact towards the spreading processes. However, a larger ρj\rho_{j} often implies that node jj tends to be closely connected to the considered node and its neighbor set (i.e., node i and its neighbor node jj having more common neighbors) Boguná et al. (2013); Castellano and Pastor-Satorras (2012). For example, as there are more common neighbors between node 1 and its neighbor node 3, the node 3 has a higher infection risk (i. e., infection probability) than neighbor node 5. When the considered node 1 is controlled as an I-state node, the neighbor node with more common neighbors can enhance the influence of this neighbor node’s outgoing propagation to a certain extent due to the echo chamber effect of local close structures Zhang et al. (2014); Chen et al. (2018); Wang et al. (2015). Combining the above two effects, an appropriate parameter combination of bb being slightly less than aa can effectively and quantitatively distinguish the impact of neighbors with different infection risks. For example, for node 3 and node 5 in Fig. 1, under the echo chamber effect, the infection risk increases from 0.5 to 0.8, and 0.3 to 0.65, respectively. When b=0b=0, the infection risks of these two nodes is not considered. Since the number of outgoing edges is the same, the two nodes have the same contribution to the spreading capacity of the controlled node 1. When b=1b=1, although the infection risk of node 5 is significantly greater than that of node 3, the negative effect of high infection risk towards the spreading range of the node 1 is overvalued due to the enhancement of the echo chamber effect of local close structures. But in fact we need to consider the impact of the echo chamber effect and appropriately decrease the weight of the neighbor’s infection risk, thus the value of bb should be less than aa.

IV.2 Compared with NDIC

In our previous study, we proposed a similar ranking index based on neighboring dynamical information (NDIC) Qu et al. (2020). Compared with the NDIC, the newly proposed NSRC classifies the neighbors’ edges and especially addresses the impact of outgoing edges. For a better illustration, we compared the two indicators by the Kendall’s τ\tau correlation and calculated the improved τ\tau ratio η\eta of the new index over the existing NDIC. It has been discussed in Ref. Qu et al. (2020) that when a=0.2a=0.2, the accuracy of NDIC is the highest or a saturation effect appears when b=0.5b=0.5.

Refer to caption
Figure 5: The improved ratio η\eta of NSRC over NDIC as a function of pp. The weight parameters of aa and bb are both a=0.2,b=0.5a=0.2,b=0.5. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).
Refer to caption
Figure 6: The improved ratio η\eta of NSRC over NDIC as a function of pp. The weight parameters of aa and bb for NSRC and NDIC are a=0.2,b=0.1a=0.2,b=0.1 and a=0.2,b=0.5a=0.2,b=0.5 respectively. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).

In this combination of aa and bb, the increased correlation ratio η\eta of NSRC over NDIC is greater than 0 in most cases as shown in Fig. 5. We also see that the NSRC displays a greater increase in ranking accuracy than the NDIC in the all networks, when an appropriate combination of parameters is fixed, as shown in Fig. 6. If we choose an optimal parameter combination of a=0.4a=0.4 and b=0.2b=0.2, the improved τ\tau ratio of NSRC over NDIC is much more significant as shown in Fig. 7. In the Router network, the improved τ\tau ratio η\eta is as high as 90%. In the Blog network, η\eta is close to 50%. In the other six networks, η\eta is up to 10% to 30%.

Refer to caption
Figure 7: The improved ratio η\eta of NSRC over NDIC as a function of pp. The weight parameters of aa and bb for NSRC and NDIC are a=0.4,b=0.2a=0.4,b=0.2 and a=0.2,b=0.5a=0.2,b=0.5 respectively. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).

Next, we investigate the dependence of η\eta on the transmission rate λ\lambda. The transmission rate should not be too large when distinguishing the node influence by using the single-node control model. Because when λ\lambda is too large, a node with low centrality can also spread the disease to the entire network. In this case the influence of nodes cannot be distinguished. We thus use the 1 to 3 times of the epidemic threshold as the transmission rate Liu et al. (2015). The improved ratio η\eta of NSRC over NDIC as a function of transmission rate λ\lambda in eight real-world networks are shown in Fig. 8. The weight parameters are set to be a=0.4,b=0.2a=0.4,b=0.2 and a=0.2,b=0.5a=0.2,b=0.5 for NSRC and NDIC respectively. In the Netsci, Router, and Blog networks, the improved τ\tau ratio is about 80%. In the rest five networks, τ\tau is around 10% to 30%. These results indicate that the ranking accuracy of NSRC is better than that of NDIC at various transmission rates.

Refer to caption
Figure 8: The improved ratio η\eta of NSRC over NDIC as a function of λ\lambda in eight real-world networks. The imprecisions for p=5%p=5\%(red column) and 20%20\%(blue column) are compared in each network. The real-world networks are Nstsci (a), Hamster (b), Router (c), Blog (d), CA_Hep (e), Email (f), PGP (g), Astro (h).

V Conclusions

In this paper, we quantitatively analyzed the relative importance of the neighbors’ local structures and dynamical states in ranking node influence in the SIS-like spreading processes. We proposed a centrality index NSRC that combines the positive effect of the neighbors’ outgoing edges and the negative effect of the neighbors’ infection risks with two weighting factors aa and bb. A large number of neighbors’ outgoing edges have a positive effect on the spreading influence of a node, while the high infection risks of the neighbor nodes reduce the influence of the considered node. A larger value of aa or bb means a stronger positive local structure effect or negative local dynamical effect. Through a large number of simulations on eight real-world networks, we found that when the positive and negative effects are taken appropriately, under appropriate values of a/ba/b, the proposed centrality index NSRC can improve the ranking accuracy significantly.

Specifically, to identify the most influential nodes accurately, the weight of the neighbors’ dynamical states bb should not be greater than the weight of the neighbors’ local structures aa. Although the high infection risk of a neighbor node can inhibit the spreading capacity of the controlled node, the close local structure with more common neighbors between the considered node and its neighbor node can induce the echo chamber effect, and thus slightly promote the spreading processes. Therefore, an optimal combination of parameters with b<ab<a can make the NSRC evaluate the spreading influence of a node more accurately. This is helpful to distinguish the impact of the neighboring nodes having the same local structures but different infection probabilities. By analyzing the simulation results of real-world networks, it is found that when the outgoing edges of neighbor nodes are taken into account, the accuracy of NSRC is greatly improved over the centrality measure NDIC. After further optimizing the parameters, the improved ranking correlation of NSRC over NDIC can be as high as 10% to 100%. Compared with the degree centrality of the neighbors’, the neighbors’ outgoing edges can be more accurate in reflecting the importance of neighbors’ local structures in the spreading processes. Further more, when we change the transmission rate λ\lambda, the NSRC has a higher accuracy than the existing index in ranking influential spreaders in the SIS-like spreading processes.

In the reversible processes, we found that considering the positive effects of neighbors’ outgoing edges and the negative effects of neighbors’ infection risks can effectively improve the accuracy of indexes in identifying influential spreaders. This deepens our understanding of identifying influential spreaders in the reversible processes. The benchmark centrality used in NSRC in our paper is degree, which can be substituted by any other benchmark centralities, such as kk-shell index centrality Kitsak et al. (2010) and the collective influence index Morone and Makse (2015). The proposed ranking index NSRC can be applied in other reversible dynamics and dynamical systems with steady state such as synchronization processes Barzel and Barabási (2013) and cascading failures Lin et al. (2020).

Acknowledgements.
This work was supported by the National Natural Science Foundation of China (Grant Nos. 11875132, 11975099, 82161148012, 11835003, 61802321), the Natural Science Foundation of Shanghai (Grant No. 18ZR1411800), and the Science and Technology Commission of Shanghai Municipality (Grant No. 14DZ2260800).

References

  • Koschützki et al. (2005) D. Koschützki, K. A. Lehmann, L. Peeters, S. Richter, D. Tenfelde-Podehl, and O. Zlotowski, in Network Analysis (Springer, 2005), pp. 16–61.
  • Lü et al. (2016) L. Lü, D. Chen, X. Ren, Q. Zhang, Y. Zhang, and T. Zhou, Physics Reports 650, 1 (2016).
  • Pei et al. (2020) S. Pei, J. Wang, F. Morone, and H. A. Makse, Journal of Complex Networks 8, cnz029 (2020).
  • Leskovec et al. (2007) J. Leskovec, L. A. Adamic, and B. A. Huberman, ACM Transactions on the Web (TWEB) 1, 5 (2007).
  • Bovet and Makse (2019) A. Bovet and H. A. Makse, Nature Communications 10, 1 (2019).
  • Lin et al. (2018) Y.-T. Lin, X.-P. Han, B.-K. Chen, J. Zhou, and B.-H. Wang, Frontiers of Physics 13, 1 (2018).
  • Motter and Lai (2002) A. E. Motter and Y. Lai, Physical Review E 66, 065102 (2002).
  • Albert et al. (2004) R. Albert, I. Albert, and G. L. Nakarado, Physical Review E 69, 025103 (2004).
  • Pastor-Satorras and Vespignani (2002) R. Pastor-Satorras and A. Vespignani, Physical Review E 65, 036104 (2002).
  • Scarpino and Petri (2019) S. V. Scarpino and G. Petri, Nature Communications 10, 1 (2019).
  • Zhou and Liu (2008) J. Zhou and Z.-H. Liu, Frontiers of Physics 3, 331 (2008).
  • Morone and Makse (2015) F. Morone and H. A. Makse, Nature 524, 65 (2015).
  • Pei et al. (2018) S. Pei, F. Morone, and H. A. Makse, in Complex spreading phenomena in social systems (Springer, 2018), pp. 125–148.
  • Hu et al. (2018) Y. Hu, S. Ji, Y. Jin, L. Feng, H. E. Stanley, and S. Havlin, Proceedings of the National Academy of Sciences 115, 7468 (2018).
  • Lokhov and Saad (2017) A. Y. Lokhov and D. Saad, Proceedings of the National Academy of Sciences 114, E8138 (2017).
  • Zheng et al. (2021) K. Zheng, Y. Liu, Y. Wang, and W. Wang, arXiv preprint arXiv:2101.02335 (2021).
  • Poux-Médard et al. (2020) G. Poux-Médard, R. Pastor-Satorras, and C. Castellano, Physical Review Research 2, 023332 (2020).
  • Erkol et al. (2020) Ş. Erkol, D. Mazzilli, and F. Radicchi, Physical Review E 102, 042307 (2020).
  • Aral and Dhillon (2018) S. Aral and P. S. Dhillon, Nature Human Behaviour 2, 375 (2018).
  • Klemm et al. (2012) K. Klemm, M. Á. Serrano, V. M. Eguíluz, and M. San Miguel, Scientific Reports 2, 1 (2012).
  • Gleeson et al. (2014) J. P. Gleeson, J. A. Ward, K. P. O’sullivan, and W. T. Lee, Physical Review Letters 112, 048701 (2014).
  • Pastor-Satorras and Vespignani (2001a) R. Pastor-Satorras and A. Vespignani, Physical Review E 63, 066117 (2001a).
  • Stavroglou et al. (2019) S. K. Stavroglou, A. A. Pantelous, H. E. Stanley, and K. M. Zuev, Proceedings of the National Academy of Sciences 116, 10646 (2019).
  • Barzel and Barabási (2013) B. Barzel and A. Barabási, Nature Physics 9, 673 (2013).
  • Pastor-Satorras and Vespignani (2001b) R. Pastor-Satorras and A. Vespignani, Physical Review Letters 86, 3200 (2001b).
  • Pastor-Satorras and Vespignani (2001c) R. Pastor-Satorras and A. Vespignani, Physical Review E 63, 066117 (2001c).
  • Qu et al. (2020) J. Qu, M. Tang, Y. Liu, and S. Guan, Chaos, Solitons & Fractals 140, 110197 (2020).
  • Shu et al. (2016) P. Shu, W. Wang, M. Tang, P. Zhao, and Y. Zhang, Chaos: An Interdisciplinary Journal of Nonlinear Science 26, 063108 (2016).
  • Liu et al. (2015) Y. Liu, M. Tang, T. Zhou, and Y. Do, Scientific Reports 5, 9602 (2015).
  • Ferreira et al. (2012) S. C. Ferreira, C. Castellano, and R. Pastor-Satorras, Physical Review E 86, 041125 (2012).
  • Shu et al. (2015) P. Shu, W. Wang, M. Tang, and Y. Do, Chaos: An Interdisciplinary Journal of Nonlinear Science 25, 063104 (2015).
  • Xu et al. (2019) Y. Xu, M. Tang, Y. Liu, Y. Zou, and Z. Liu, Chaos: An Interdisciplinary Journal of Nonlinear Science 29, 103141 (2019).
  • Liu et al. (2016) Y. Liu, M. Tang, T. Zhou, and Y. Do, Physica A: Statistical Mechanics and its Applications 452, 289 (2016).
  • Kendall (1938) M. G. Kendall, Biometrika 30, 81 (1938).
  • Newman (2006) M. E. Newman, Physical Review E 74, 036104 (2006).
  • Spring et al. (2002) N. Spring, R. Mahajan, and D. Wetherall, ACM SIGCOMM Computer Communication Review 32, 133 (2002).
  • Boguna et al. (2004) M. Boguna, R. Pastorsatorras, A. Diazguilera, and A. Arenas, Physical Review E 70, 056122 (2004).
  • Newman (2001) M. E. Newman, Proceedings of the National Academy of Sciences 98, 404 (2001).
  • Kitsak et al. (2010) M. Kitsak, L. K. Gallos, S. Havlin, F. Liljeros, L. Muchnik, H. E. Stanley, and H. A. Makse, Nature Physics 6, 888 (2010).
  • Boguná et al. (2013) M. Boguná, C. Castellano, and R. Pastor-Satorras, Physical Review Letters 111, 068701 (2013).
  • Castellano and Pastor-Satorras (2012) C. Castellano and R. Pastor-Satorras, Scientific Reports 2, 1 (2012).
  • Zhang et al. (2014) H. Zhang, J. Xie, M. Tang, and Y. Lai, Chaos: An Interdisciplinary Journal of Nonlinear Science 24, 043106 (2014).
  • Chen et al. (2018) X. Chen, R. Wang, M. Tang, S. Cai, H. E. Stanley, and L. A. Braunstein, New Journal of Physics 20, 013007 (2018).
  • Wang et al. (2015) W. Wang, M. Tang, H. Zhang, and Y. Lai, Physical Review E 92, 012820 (2015).
  • Lin et al. (2020) Z. Lin, M. Feng, M. Tang, Z. Liu, C. Xu, P. M. Hui, and Y. Lai, Nature Communications 11, 1 (2020).