Huang Wanwei, Tian Haobin, Wang Sunan, Zhang Chaoqin, Zhang Xiaohui
College of Software Engineering, Zhengzhou University of Light Industry, Zhengzhou, Henan, China.
Electronic & Communication Engineering, Shenzhen Polytechnic School, Shenzhen, Guangdong, China.
PeerJ Comput Sci. 2024 Jul 16;10:e2176. doi: 10.7717/peerj-cs.2176. eCollection 2024.
In the context of the 5G network, the proliferation of access devices results in heightened network traffic and shifts in traffic patterns, and network intrusion detection faces greater challenges. A feature selection algorithm is proposed for network intrusion detection systems that uses an improved binary pigeon-inspired optimizer (SABPIO) algorithm to tackle the challenges posed by the high dimensionality and complexity of network traffic, resulting in complex models, reduced accuracy, and longer detection times. First, the raw dataset is pre-processed by uniquely one-hot encoded and standardized. Next, feature selection is performed using SABPIO, which employs simulated annealing and the population decay factor to identify the most relevant subset of features for subsequent review and evaluation. Finally, the selected subset of features is fed into decision trees and random forest classifiers to evaluate the effectiveness of SABPIO. The proposed algorithm has been validated through experimentation on three publicly available datasets: UNSW-NB15, NLS-KDD, and CIC-IDS-2017. The experimental findings demonstrate that SABPIO identifies the most indicative subset of features through rational computation. This method significantly abbreviates the system's training duration, enhances detection rates, and compared to the use of all features, minimally reduces the training and testing times by factors of 3.2 and 0.3, respectively. Furthermore, it enhances the F1-score of the feature subset selected by CPIO and Boost algorithms when compared to CPIO and XGBoost, resulting in improvements ranging from 1.21% to 2.19%, and 1.79% to 4.52%.
在5G网络环境下,接入设备的激增导致网络流量增加和流量模式的转变,网络入侵检测面临着更大的挑战。针对网络入侵检测系统提出了一种特征选择算法,该算法使用改进的二进制鸽启发式优化器(SABPIO)算法来应对网络流量的高维度和复杂性所带来的挑战,这些挑战会导致模型复杂、准确性降低以及检测时间延长。首先,通过唯一的一键编码和标准化对原始数据集进行预处理。接下来,使用SABPIO进行特征选择,该算法采用模拟退火和种群衰减因子来识别最相关的特征子集,以供后续审查和评估。最后,将选定的特征子集输入决策树和随机森林分类器,以评估SABPIO的有效性。该算法已通过在三个公开可用数据集上进行实验得到验证:UNSW-NB15、NLS-KDD和CIC-IDS-2017。实验结果表明,SABPIO通过合理计算识别出最具指示性的特征子集。该方法显著缩短了系统的训练时长,提高了检测率,与使用所有特征相比,分别将训练和测试时间最少缩短了3.2倍和0.3倍。此外,与CPIO和XGBoost相比,它提高了CPIO和Boost算法所选特征子集的F1分数,提高幅度在1.21%至2.19%以及1.79%至4.52%之间。