Deep learning based Sequential model for malware analysis using Windows exe API Calls.

Authors

Catak Ferhat Ozgur, Yazı Ahmet Faruk, Elezaj Ogerta, Ahmed Javed

Affiliations

Department of Information Security and Communication Technology, NTNU Norwegian University of Science and Technology, Gjøvik, Norway.

TUBITAK Bilgem Cyber Security Institute, Kocaeli, Turkey.

Publication information

PeerJ Comput Sci. 2020 Jul 27;6:e285. doi: 10.7717/peerj-cs.285. eCollection 2020.

DOI:10.7717/peerj-cs.285

PMID:33816936

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7924690/

Abstract

Malware has grown increasingly diverse in architecture and features. This growth in capability poses a severe threat and opens new research directions in malware detection. This study focuses on metamorphic malware, the most advanced member of the malware family. Anti-virus applications that rely on traditional signature-based methods find it nearly impossible to detect metamorphic malware, which in turn makes this type of malware difficult to classify. Recent literature on malware detection and classification addresses this problem through malware behavior. The main goal of this paper is to develop a method that classifies malware by type based on its behavior. We began by building a new dataset of API calls made on the Windows operating system, which represent the behavior of malicious software. The malware types in the dataset are Adware, Backdoor, Downloader, Dropper, Spyware, Trojan, Virus, and Worm. The classifier used in this study is LSTM (Long Short-Term Memory), a method widely used for sequential data. The classifier achieves accuracy of up to 95% with an $F_1$-score of 0.83, which is quite satisfactory. We also ran experiments on binary and multi-class malware datasets to show the classification performance of the LSTM model. Another significant contribution of this paper is the new API-call-based dataset for Windows operating systems. To the best of our knowledge, no such dataset was available before our research. The dataset is available on GitHub so that the malware detection research community can benefit from it and contribute further to this domain.
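A sequence model such as an LSTM takes fixed-length integer sequences as input, so API call traces are typically turned into index sequences first. The following is a minimal sketch of that preprocessing step, not the authors' pipeline: the function names, the reserved `PAD`/`UNK` indices, the `max_len` value, and the two example traces are all assumptions made for illustration.

```python
from collections import Counter

PAD, UNK = 0, 1  # assumed reserved indices: padding and unseen API names


def build_vocab(sequences, min_count=1):
    """Map each API name to an integer index, most frequent names first."""
    counts = Counter(name for seq in sequences for name in seq)
    vocab = {}
    for name, count in counts.most_common():
        if count >= min_count:
            vocab[name] = len(vocab) + 2  # indices 0 and 1 are reserved
    return vocab


def encode(seq, vocab, max_len):
    """Convert API names to indices, then truncate or right-pad to max_len."""
    ids = [vocab.get(name, UNK) for name in seq[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


# Illustrative traces only; they are not taken from the paper's dataset.
traces = [
    ["LdrLoadDll", "NtCreateFile", "NtWriteFile", "NtClose"],
    ["LdrLoadDll", "RegOpenKeyExW", "NtClose"],
]
vocab = build_vocab(traces)
encoded = [encode(t, vocab, max_len=5) for t in traces]
unseen = encode(["LdrLoadDll", "SomeUnseenCall"], vocab, max_len=5)
```

Each row of `encoded` has the same length and could be fed to an embedding layer followed by an LSTM. Names missing from the vocabulary map to `UNK`, so traces from new samples can still be encoded.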

Similar articles

Sparse attention with residual pyramidal depthwise separable convolutional based malware detection with optimization mechanism.

Sci Rep. 2024 Oct 18;14(1):24414. doi: 10.1038/s41598-024-76193-4.

Channel Features and API Frequency-Based Transformer Model for Malware Identification.

Sensors (Basel). 2024 Jan 17;24(2):580. doi: 10.3390/s24020580.

An ensemble approach for imbalanced multiclass malware classification using 1D-CNN.

PeerJ Comput Sci. 2023 Nov 14;9:e1677. doi: 10.7717/peerj-cs.1677. eCollection 2023.

Artificial Intelligence Algorithms for Malware Detection in Android-Operated Mobile Devices.

Sensors (Basel). 2022 Mar 15;22(6):2268. doi: 10.3390/s22062268.

Windows malware detection based on static analysis with multiple features.

PeerJ Comput Sci. 2023 Apr 21;9:e1319. doi: 10.7717/peerj-cs.1319. eCollection 2023.

MDABP: A Novel Approach to Detect Cross-Architecture IoT Malware Based on PaaS.

Sensors (Basel). 2023 Mar 13;23(6):3060. doi: 10.3390/s23063060.

Deep-Hook: A trusted deep learning-based framework for unknown malware detection and classification in Linux cloud environments.

Neural Netw. 2021 Dec;144:648-685. doi: 10.1016/j.neunet.2021.09.019. Epub 2021 Oct 2.

Convolution neural network with batch normalization and inception-residual modules for Android malware classification.

Sci Rep. 2022 Aug 17;12(1):13996. doi: 10.1038/s41598-022-18402-6.

An Efficient DenseNet-Based Deep Learning Model for Malware Detection.

Entropy (Basel). 2021 Mar 15;23(3):344. doi: 10.3390/e23030344.

Cited by

Machine learning techniques for imbalanced multiclass malware classification through adaptive feature selection.

PeerJ Comput Sci. 2025 Mar 25;11:e2752. doi: 10.7717/peerj-cs.2752. eCollection 2025.

A malware classification method based on directed API call relationships.

PLoS One. 2025 Mar 17;20(3):e0299706. doi: 10.1371/journal.pone.0299706. eCollection 2025.

A Survey on ML Techniques for Multi-Platform Malware Detection: Securing PC, Mobile Devices, IoT, and Cloud Environments.

Sensors (Basel). 2025 Feb 13;25(4):1153. doi: 10.3390/s25041153.

Channel Features and API Frequency-Based Transformer Model for Malware Identification.

Sensors (Basel). 2024 Jan 17;24(2):580. doi: 10.3390/s24020580.

An ensemble approach for imbalanced multiclass malware classification using 1D-CNN.

PeerJ Comput Sci. 2023 Nov 14;9:e1677. doi: 10.7717/peerj-cs.1677. eCollection 2023.

A Kullback-Liebler divergence-based representation algorithm for malware detection.

PeerJ Comput Sci. 2023 Sep 22;9:e1492. doi: 10.7717/peerj-cs.1492. eCollection 2023.

Windows malware detection based on static analysis with multiple features.

PeerJ Comput Sci. 2023 Apr 21;9:e1319. doi: 10.7717/peerj-cs.1319. eCollection 2023.

Malware homology determination using visualized images and feature fusion.

PeerJ Comput Sci. 2021 Apr 15;7:e494. doi: 10.7717/peerj-cs.494. eCollection 2021.

Data augmentation based malware detection using convolutional neural networks.

PeerJ Comput Sci. 2021 Jan 22;7:e346. doi: 10.7717/peerj-cs.346. eCollection 2021.

References

Long short-term memory.

Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.
