Kong Xiangjie, Zhang Wenyi, Wang Hui, Hou Mingliang, Chen Xin, Yan Xiaoran, Das Sajal K
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):7931-7944. doi: 10.1109/TNNLS.2024.3414326. Epub 2025 May 6.
Attribute graph anomaly detection aims to identify nodes that significantly deviate from the majority of normal nodes, and has received increasing attention due to the ubiquity and complexity of graph-structured data in various real-world scenarios. However, current mainstream anomaly detection methods are primarily designed for centralized settings, which may pose privacy leakage risks in certain sensitive situations. Although federated graph learning offers a promising solution by enabling collaborative model training in distributed systems while preserving data privacy, a practical challenge arises as each client typically possesses a limited amount of graph data. Consequently, naively applying federated graph learning directly to anomaly detection tasks in distributed environments may lead to suboptimal performance results. We propose a federated graph anomaly detection framework via contrastive self-supervised learning (CSSL) [federated CSSL anomaly detection framework (FedCAD)] to address these challenges. FedCAD updates anomaly node information between clients via federated learning (FL) interactions. First, FedCAD uses pseudo-label discovery to determine the anomaly node of the client preliminarily. Second, FedCAD employs a local anomaly neighbor embedding aggregation strategy. This strategy enables the current client to aggregate the neighbor embeddings of anomaly nodes from other clients, thereby amplifying the distinction between anomaly nodes and their neighbor nodes. Doing so effectively sharpens the contrast between positive and negative instance pairs within contrastive learning, thus enhancing the efficacy and precision of anomaly detection through such a learning paradigm. Finally, the efficiency of FedCAD is demonstrated by experimental results on four real graph datasets.
属性图异常检测旨在识别与大多数正常节点有显著偏差的节点,由于图结构数据在各种现实场景中的普遍性和复杂性,该技术受到了越来越多的关注。然而,当前主流的异常检测方法主要是为集中式设置设计的,在某些敏感情况下可能会带来隐私泄露风险。虽然联邦图学习通过在分布式系统中实现协作模型训练同时保护数据隐私提供了一个有前景的解决方案,但由于每个客户端通常拥有有限数量的图数据,因此出现了一个实际挑战。因此,在分布式环境中直接将联邦图学习天真地应用于异常检测任务可能会导致次优的性能结果。我们提出了一种通过对比自监督学习(CSSL)的联邦图异常检测框架[联邦CSSL异常检测框架(FedCAD)]来应对这些挑战。FedCAD通过联邦学习(FL)交互在客户端之间更新异常节点信息。首先,FedCAD使用伪标签发现来初步确定客户端的异常节点。其次,FedCAD采用局部异常邻居嵌入聚合策略。该策略使当前客户端能够聚合来自其他客户端的异常节点的邻居嵌入,从而放大异常节点与其邻居节点之间的区别。这样做有效地增强了对比学习中正负实例对之间的对比度,从而通过这种学习范式提高了异常检测的功效和精度。最后,通过在四个真实图数据集上的实验结果证明了FedCAD的效率。