Qiao Boyu, Zhou Wei, Li Kun, Li Shilong, Hu Songlin
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7302-7315. doi: 10.1109/TNNLS.2024.3396192. Epub 2025 Apr 4.
Social bot detection is essential for maintaining the safety and integrity of online social networks (OSNs). Graph neural networks (GNNs) have emerged as a promising solution. Mainstream GNN-based social bot detection methods learn rich user representations by recursively performing message passing along user-user interaction edges, where users are treated as nodes and their relationships as edges. However, these methods face challenges when detecting advanced bots interacting with genuine accounts. Interaction with real accounts results in the graph structure containing camouflaged and unreliable edges. These unreliable edges interfere with the differentiation between bot and human representations, and the iterative graph encoding process amplifies this unreliability. In this article, we propose a social Bot detection method based on Edge Confidence Evaluation (BECE). Our model incorporates an edge confidence evaluation module that assesses the reliability of the edges and identifies the unreliable edges. Specifically, we design features for edges based on the representation of user nodes and introduce parameterized Gaussian distributions to map the edge embeddings into a latent semantic space. We optimize these embeddings by minimizing Kullback-Leibler (KL) divergence from the standard distribution and evaluate their confidence based on edge representation. Experimental results on three real-world datasets demonstrate that BECE is effective and superior in social bot detection. Additionally, experimental results on six widely used GNN architectures demonstrate that our proposed edge confidence evaluation module can be used as a plug-in to improve detection performance.
社交机器人检测对于维护在线社交网络(OSN)的安全和完整性至关重要。图神经网络(GNN)已成为一种很有前景的解决方案。主流的基于GNN的社交机器人检测方法通过沿着用户 - 用户交互边递归地执行消息传递来学习丰富的用户表示,其中用户被视为节点,他们之间的关系被视为边。然而,这些方法在检测与真实账户交互的高级机器人时面临挑战。与真实账户的交互导致图结构包含伪装且不可靠的边。这些不可靠的边干扰了机器人和人类表示之间的区分,并且迭代图编码过程放大了这种不可靠性。在本文中,我们提出了一种基于边置信度评估(BECE)的社交机器人检测方法。我们的模型包含一个边置信度评估模块,该模块评估边的可靠性并识别不可靠的边。具体来说,我们基于用户节点的表示为边设计特征,并引入参数化高斯分布将边嵌入映射到潜在语义空间。我们通过最小化与标准分布的库尔贝克 - 莱布勒(KL)散度来优化这些嵌入,并基于边表示评估它们的置信度。在三个真实世界数据集上的实验结果表明BECE在社交机器人检测中是有效且优越的。此外,在六种广泛使用的GNN架构上的实验结果表明,我们提出的边置信度评估模块可以用作插件来提高检测性能。