College of Information and Electrical Engineering, China Agricultural University, Beijing, China.
School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.
PLoS One. 2022 Apr 12;17(4):e0266598. doi: 10.1371/journal.pone.0266598. eCollection 2022.
Although social media has highly facilitated people's daily communication and dissemination of information, it has unfortunately been an ideal hotbed for the breeding and dissemination of Internet rumors. Therefore, automatically monitoring rumor dissemination in the early stage is of great practical significance. However, the existing detection methods fail to take full advantage of the semantics of the microblog information propagation graph. To address this shortcoming, this study models the information transmission network of a microblog as a heterogeneous graph with a variety of semantic information and then constructs a Microblog-HAN, which is a graph-based rumor detection model, to capture and aggregate the semantic information using attention layers. Specifically, after the initial textual and visual features of posts are extracted, the node-level attention mechanism combines neighbors of the microblog nodes to generate three groups of node embeddings with specific semantics. Moreover, semantic-level attention fuses different semantics to obtain the final node embedding of the microblog, which is then used as a classifier's input. Finally, the classification results of whether the microblog is a rumor or not are obtained. The experimental results on two real-world microblog rumor datasets, Weibo2016 and Weibo2021, demonstrate that the proposed Microblog-HAN can detect microblog rumors with an accuracy of over 92%, demonstrating its superiority over the most existing methods in identifying rumors from the view of the whole information transmission graph.
虽然社交媒体极大地方便了人们的日常交流和信息传播,但不幸的是,它也成为了互联网谣言滋生和传播的理想温床。因此,早期自动监测谣言传播具有重要的现实意义。然而,现有的检测方法未能充分利用微博信息传播图的语义。针对这一不足,本研究将微博信息传输网络建模为具有多种语义信息的异构图,然后构建基于图的微博-HAN(Microblog-Heterogeneous Attention Network)谣言检测模型,使用注意力层捕获和聚合语义信息。具体来说,在提取帖子的初始文本和视觉特征后,节点级注意力机制结合微博节点的邻居生成具有特定语义的三组节点嵌入。此外,语义级注意力融合不同的语义,以获得微博的最终节点嵌入,然后将其作为分类器的输入。最后,得到微博是否为谣言的分类结果。在两个真实的微博谣言数据集 Weibo2016 和 Weibo2021 上的实验结果表明,所提出的微博-HAN 可以以超过 92%的准确率检测微博谣言,从整个信息传播图的角度来看,证明了其在识别谣言方面优于大多数现有方法的优越性。