Guo Wenjie, Du Wenbiao, Yang Xiuqi, Xue Jingfeng, Wang Yong, Han Weijie, Hu Jingjing
School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100811, China.
School of Space Information, Space Engineering University, Beijing 100084, China.
Sensors (Basel). 2025 Jan 10;25(2):374. doi: 10.3390/s25020374.
While deep learning techniques have been extensively employed in malware detection, there is a notable challenge in effectively embedding malware features. Current neural network methods primarily capture superficial characteristics, lacking in-depth semantic exploration of functions and failing to preserve structural information at the file level. Motivated by the aforementioned challenges, this paper introduces MalHAPGNN, a novel framework for malware detection that leverages a hierarchical attention pooling graph neural network based on enhanced call graphs. Firstly, to ensure semantic richness, a Bidirectional Encoder Representations from Transformers-based (BERT) attribute-enhanced function embedding method is proposed for the extraction of node attributes in the function call graph. Subsequently, this work designs a hierarchical graph neural network that integrates attention mechanisms and pooling operations, complemented by function node sampling and structural learning strategies. This framework delivers a comprehensive profile of malicious code across semantic, syntactic, and structural dimensions. Extensive experiments conducted on the Kaggle and VirusShare datasets have demonstrated that the proposed framework outperforms other graph neural network (GNN)-based malware detection methods.
虽然深度学习技术已广泛应用于恶意软件检测,但在有效嵌入恶意软件特征方面存在显著挑战。当前的神经网络方法主要捕捉表面特征,缺乏对功能的深入语义探索,并且无法在文件级别保留结构信息。受上述挑战的启发,本文介绍了MalHAPGNN,这是一种用于恶意软件检测的新颖框架,它利用基于增强调用图的分层注意力池化图神经网络。首先,为确保语义丰富性,提出了一种基于变换器的双向编码器表示(BERT)属性增强函数嵌入方法,用于在函数调用图中提取节点属性。随后,这项工作设计了一种分层图神经网络,该网络集成了注意力机制和池化操作,并辅以函数节点采样和结构学习策略。该框架从语义、句法和结构维度提供了恶意代码的全面概况。在Kaggle和VirusShare数据集上进行的大量实验表明,所提出的框架优于其他基于图神经网络(GNN)的恶意软件检测方法。