Wu Sirong, Deng Yuhui, Liu Junjie, Luo Xi, Sun Gengchen
Guangdong Provincial Key Laboratoryof Interdisciplinary Research and Application for Data Science, Beijing NormalUniversity-Hong Kong Baptist University United International College, Zhuhai,Guangdong Province, China.
Department of Statistics and Data Science, Faculty ofScience and Technology, Beijing Normal University-Hong Kong Baptist UniversityUnited International College, Zhuhai, Guangdong Province, China.
PLoS One. 2025 Apr 7;20(4):e0320333. doi: 10.1371/journal.pone.0320333. eCollection 2025.
The rapid propagation of rumors on social media can give rise to various social issues, underscoring the necessity of swift and automated rumor detection. Existing studies typically identify rumors based on their textual or static propagation structural information, without considering the dynamic changes in the structure of rumor propagation over time. In this paper, we propose the Temporal Tree Transformer model, which simultaneously considers text, propagation structure, and temporal changes. By analyzing observing the growth of propagation tree structures in different time windows, we use Gated Recurrent Unit (GRU) to encode these trees to obtain better representations for the classification task. We evaluate our model's performance using the PHEME dataset. In most existing studies, information leakage occurs when conversation threads from all events are randomly divided into training and test sets. We perform Leave-One-Event-Out (LOEO) cross-validation, which better reflects real-world scenarios. The experimental results show that our model achieves state-of-the-art accuracy 75.84% and Macro F1 score of 71.98%, respectively. These results demonstrate that extracting temporal features from propagation structures leads to improved model generalization.
谣言在社交媒体上的迅速传播会引发各种社会问题,这凸显了快速自动检测谣言的必要性。现有研究通常基于文本或静态传播结构信息来识别谣言,而没有考虑谣言传播结构随时间的动态变化。在本文中,我们提出了时态树变换器模型,该模型同时考虑了文本、传播结构和时间变化。通过分析观察不同时间窗口中传播树结构的增长情况,我们使用门控循环单元(GRU)对这些树进行编码,以获得更好的分类任务表示。我们使用PHEME数据集评估模型的性能。在大多数现有研究中,当将所有事件的对话线程随机分为训练集和测试集时会发生信息泄露。我们执行留一事件法(LOEO)交叉验证,这能更好地反映现实世界的情况。实验结果表明,我们的模型分别达到了75.84%的最新准确率和71.98%的宏F1分数。这些结果表明,从传播结构中提取时间特征可提高模型的泛化能力。