University of Tehran, Tehran, Iran.
Yazd University, Yazd, Iran.
Nonlinear Dynamics Psychol Life Sci. 2021 Apr;25(2):127-155.
The diffusion process in networks is studied with the objective of identifying the dynamics and for predicting the behavior of network entities. Social media plays an important role in people's lives. Diffusion processes, as one of the most important branches of social media analysis, have their presence in various domains such as information spreading, diffusion of innovation, idea dissemination, and product acceptance to identify user's pattern and their behavior in social media networks. Users are not limited to one social network and are engaged in multiple social media such as Twitter, Instagram, Telegram, and Facebook. This fact has created new phenomena in social network analysis, called multiplex network analysis. Thus, the scope of diffusion process analysis has been transferred from single layer networks to multiplex networks. Diffusion process analysis can be studied at both infrastructure-level and diffusion-level; at infrastructure-level, the structural network's properties such as clustering coefficient and degree centrality are being studied; and in diffusion-level the diffusion network's properties such as diffusion depth and seed nodes are being studied. On the other hand, a reliable analysis requires complete information on both infrastructure and diffusion networks. However, complete data is not accessible forever, this fact is due to some limitations such as crawling big data, gathering social media policies, and user privacy. Incomplete data can lead to poor analysis, so in this work we, first of all, investigate the impact of missing data in both infrastructure and diffusion networks, the impact of random and non-random missing infrastructure data on nine diffusion network's properties such as number of infected nodes, number of infected edges, diffusion length and number of seed nodes. Secondly, based on the multiplex diffusion tree, we introduce a new model named as MLC-tree for an incomplete diffusion network. Finally, we evaluate our model on both synthetic and real social networks; these results show that the MLC-tree can decrease the relative error more than 50 percent while missing 20 to 80 percent of complete data.
研究网络中的扩散过程旨在识别网络实体的动态并预测其行为。社交媒体在人们的生活中扮演着重要的角色。扩散过程作为社交媒体分析的最重要分支之一,存在于信息传播、创新扩散、思想传播和产品接受等各个领域,以识别用户在社交媒体网络中的模式和行为。用户不仅限于一个社交媒体,他们还参与多个社交媒体,如 Twitter、Instagram、Telegram 和 Facebook。这一事实在社交网络分析中创造了一种新现象,称为多重网络分析。因此,扩散过程分析的范围已从单层网络转移到多重网络。扩散过程分析可以在基础设施层和扩散层进行研究;在基础设施层,研究结构网络的属性,如聚类系数和度中心度;在扩散层,研究扩散网络的属性,如扩散深度和种子节点。另一方面,可靠的分析需要基础设施和扩散网络的完整信息。然而,完整的数据并不总是可用的,这是由于一些限制,如大数据爬行、收集社交媒体政策和用户隐私。不完整的数据可能导致分析不佳,因此在这项工作中,我们首先调查了基础设施和扩散网络中缺失数据的影响,随机和非随机缺失基础设施数据对九个扩散网络属性的影响,如感染节点的数量、感染边的数量、扩散长度和种子节点的数量。其次,基于多重扩散树,我们引入了一种新的模型,称为 MLC 树,用于不完整的扩散网络。最后,我们在合成和真实社交网络上评估了我们的模型;这些结果表明,当缺失 20%至 80%的完整数据时,MLC 树可以将相对误差降低 50%以上。