Chang Ze, Cai Yunfei, Liu Xiao Fan, Xie Zhenping, Liu Yuan, Zhan Qianyi
School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China.
Department of Media and Communication, City University of Hong Kong, Hong Kong SAR, China.
Sensors (Basel). 2024 Dec 24;25(1):1. doi: 10.3390/s25010001.
With the rapid development of blockchain technology, fraudulent activities have significantly increased, posing a major threat to the personal assets of blockchain users. The blockchain transaction network formed during user transactions can be represented as a graph consisting of nodes and edges, making it suitable for a graph data structure. Fraudulent nodes in the transaction network are referred to as anomalous nodes. In recent years, the mainstream method for detecting anomalous nodes in graphs has been the use of graph data mining techniques. However, anomalous nodes typically constitute only a small portion of the transaction network, known as the minority class, while the majority of nodes are normal nodes, referred to as the majority class. This discrepancy in sample sizes results in class imbalance data, where models tend to overfit the features of the majority class and neglect those of the minority class. This issue presents significant challenges for traditional graph data mining techniques. In this paper, we propose a novel graph neural network method to overcome class imbalance issues by improving the Graph Attention Network (GAT) and incorporating ensemble learning concepts. Our method combines GAT with a subtree attention mechanism and two ensemble learning methods: Bootstrap Aggregating (Bagging) and Categorical Boosting (CAT), called SGAT-BC. We conducted experiments on four real-world blockchain transaction datasets, and the results demonstrate that SGAT-BC outperforms existing baseline models.
随着区块链技术的快速发展,欺诈活动显著增加,对区块链用户的个人资产构成了重大威胁。用户交易过程中形成的区块链交易网络可以表示为由节点和边组成的图,这使其适用于图数据结构。交易网络中的欺诈节点被称为异常节点。近年来,检测图中异常节点的主流方法是使用图数据挖掘技术。然而,异常节点通常只占交易网络的一小部分,即少数类,而大多数节点是正常节点,即多数类。样本大小的这种差异导致了类不平衡数据,在这种情况下,模型往往会过度拟合多数类的特征而忽略少数类的特征。这个问题给传统的图数据挖掘技术带来了重大挑战。在本文中,我们提出了一种新颖的图神经网络方法,通过改进图注意力网络(GAT)并结合集成学习概念来克服类不平衡问题。我们的方法将GAT与子树注意力机制以及两种集成学习方法:自助聚合(Bagging)和分类提升(CAT)相结合,称为SGAT-BC。我们在四个真实世界的区块链交易数据集上进行了实验,结果表明SGAT-BC优于现有的基线模型。