IEEE Trans Cybern. 2017 Apr;47(4):1078-1089. doi: 10.1109/TCYB.2016.2537366. Epub 2016 Mar 21.
Diffusion on social networks refers to the process where opinions are spread via the connected nodes. Given a set of observed information cascades, one can infer the underlying diffusion process for social network analysis. The independent cascade model (IC model) is a widely adopted diffusion model where a node is assumed to be activated independently by any one of its neighbors. In reality, how a node will be activated also depends on how its neighbors are connected and activated. For instance, the opinions from the neighbors of the same social group are often similar and thus redundant. In this paper, we extend the IC model by considering that: 1) the information coming from the connected neighbors are similar and 2) the underlying redundancy can be modeled using a dynamic structural diversity measure of the neighbors. Our proposed model assumes each node to be activated independently by different communities (or components) of its parent nodes, each weighted by its effective size. An expectation maximization algorithm is derived to infer the model parameters. We compare the performance of the proposed model with the basic IC model and its variants using both synthetic data sets and a real-world data set containing news stories and Web blogs. Our empirical results show that incorporating the community structure of neighbors and the structural diversity measure into the diffusion model significantly improves the accuracy of the model, at the expense of only a reasonable increase in run-time.
社交网络中的扩散是指通过连接节点传播意见的过程。给定一组观察到的信息级联,可以推断出社交网络分析的基础扩散过程。独立级联模型(IC 模型)是一种广泛采用的扩散模型,其中假设节点可以由其任意一个邻居独立激活。在现实中,节点如何被激活也取决于其邻居的连接和激活方式。例如,来自同一社交群体的邻居的意见往往相似,因此存在冗余。在本文中,我们通过考虑以下因素扩展了 IC 模型:1)来自连接邻居的信息相似,2)可以使用邻居的动态结构多样性度量来建模潜在的冗余。我们提出的模型假设每个节点由其父节点的不同社区(或组件)独立激活,每个社区由其有效大小加权。我们推导出了一种期望最大化算法来推断模型参数。我们使用合成数据集和包含新闻故事和 Web 博客的真实世界数据集,将提出的模型与基本 IC 模型及其变体进行了比较。我们的实验结果表明,将邻居的社区结构和结构多样性度量纳入扩散模型中,以牺牲合理的运行时间增加为代价,显著提高了模型的准确性。