Saudi Aramco Cybersecurity Chair, Dhahran, Saudi Arabia.
Department of Networks and Communications, College of Computer Science and Information Technology (CCSIT), Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia.
Comput Intell Neurosci. 2022 May 9;2022:1615528. doi: 10.1155/2022/1615528. eCollection 2022.
For the enormous growth and the hysterical impact of undocumented malicious software, otherwise known as Zero-Day malware, specialized practices were joined to implement systems capable of detecting these kinds of software to avert possible disastrous consequences. Owing to the nature of developed Zero-Day malware, distinct evasion tactics are used to remain stealth. Hence, there is a need for advance investigations of the methods that can identify such kind of malware. Machine learning (ML) is among the promising techniques for such type of predictions, while the sandbox provides a safe environment for such experiments. After thorough literature review, carefully chosen ML techniques are proposed for the malware detection, under Cuckoo sandboxing (CS) environment. The proposed system is coined as Zero-Day Vigilante (ZeVigilante) to detect the malware considering both static and dynamic analyses. We used adequate datasets for both analyses incorporating sufficient samples in contrast to other studies. Consequently, the processed datasets are used to train and test several ML classifiers including Random Forest (RF), Neural Networks (NN), Decision Tree (DT), k-Nearest Neighbor (kNN), Naïve Bayes (NB), and Support Vector Machine (SVM). It is observed that RF achieved the best accuracy for both static and dynamic analyses, 98.21% and 98.92%, respectively.
由于未记录的恶意软件(也称为零日恶意软件)的巨大增长和歇斯底里的影响,专门的实践被加入进来,以实施能够检测这些软件的系统,以避免可能的灾难性后果。由于开发的零日恶意软件的性质,使用了不同的规避策略来保持隐蔽。因此,需要对可以识别此类恶意软件的方法进行预先调查。机器学习 (ML) 是此类预测的有前途的技术之一,而沙盒为此类实验提供了安全的环境。在彻底的文献回顾之后,在 Cuckoo 沙盒 (CS) 环境下为恶意软件检测提出了精心挑选的 ML 技术。该系统被称为零日 vigilant (ZeVigilante),用于同时考虑静态和动态分析来检测恶意软件。与其他研究相比,我们在两种分析中都使用了足够的数据集,包含了足够的样本。因此,处理后的数据集用于训练和测试几种 ML 分类器,包括随机森林 (RF)、神经网络 (NN)、决策树 (DT)、k-最近邻 (kNN)、朴素贝叶斯 (NB) 和支持向量机 (SVM)。观察到 RF 在静态和动态分析中分别实现了最佳的准确性,分别为 98.21%和 98.92%。