Tahaei Hamid, Liu Anqi, Forooghikian Hamid, Gheisari Mehdi, Zaki Faiz, Anuar Nor Badrul, Fang Zhaoxi, Huang Longjun
Institute of Artificial Intelligence, Shaoxing University, Shaoxing, China.
School of Computing and Data Science, Xiamen University Malaysia, Selangor, Malaysia.
PeerJ Comput Sci. 2025 May 8;11:e2873. doi: 10.7717/peerj-cs.2873. eCollection 2025.
The rapid deployment of millions of connected devices brings significant security challenges to the Internet of Things (IoT). IoT devices are typically resource-constrained and designed for specific tasks, from which new security challenges are introduced. As such, IoT device identification has garnered substantial attention and is regarded as an initial layer of cybersecurity. One of the major steps in distinguishing IoT devices involves leveraging machine learning (ML) techniques on device network flows known as device fingerprinting. Numerous studies have proposed various solutions that incorporate ML and feature selection (FS) algorithms with different degrees of accuracy. Yet, the domain needs a comparative analysis of the accuracy of different classifiers and FS algorithms to comprehend their true capabilities in various datasets. This article provides a comprehensive performance evaluation of several reputable classifiers being used in the literature. The study evaluates the efficacy of filter-and wrapper-based FS methods across various ML classifiers. Additionally, we implemented a Binary Green Wolf Optimizer (BGWO) and compared its performance with that of traditional ML classifiers to assess the potential of this binary meta-heuristic algorithm. To ensure the robustness of our findings, we evaluated the effectiveness of each classifier and FS method using two widely utilized datasets. Our experiments demonstrated that BGWO effectively reduced the feature set by 85.11% and 73.33% for datasets 1 and 2, respectively, while achieving classification accuracies of 98.51% and 99.8%, respectively. The findings of this study highlight the strong capabilities of BGWO in reducing both the feature dimensionality and accuracy gained through classification. Furthermore, it demonstrates the effectiveness of wrapper methods in the reduction of feature sets.
数百万联网设备的快速部署给物联网(IoT)带来了重大的安全挑战。物联网设备通常资源受限且专为特定任务设计,由此引入了新的安全挑战。因此,物联网设备识别受到了广泛关注,并被视为网络安全的初始层面。区分物联网设备的主要步骤之一是在称为设备指纹识别的设备网络流上利用机器学习(ML)技术。众多研究提出了各种结合ML和特征选择(FS)算法的解决方案,其准确率各不相同。然而,该领域需要对不同分类器和FS算法的准确率进行比较分析,以了解它们在各种数据集中的真实能力。本文对文献中使用的几种著名分类器进行了全面的性能评估。该研究评估了基于过滤和包装的FS方法在各种ML分类器中的有效性。此外,我们实现了一种二进制灰狼优化器(BGWO),并将其性能与传统ML分类器的性能进行比较,以评估这种二进制元启发式算法的潜力。为确保我们研究结果的稳健性,我们使用两个广泛使用的数据集评估了每个分类器和FS方法的有效性。我们的实验表明,对于数据集1和数据集2,BGWO分别有效地将特征集减少了85.11%和73.33%,同时分类准确率分别达到了98.51%和99.8%。本研究结果突出了BGWO在降低特征维度和提高分类准确率方面的强大能力。此外,它还证明了包装方法在减少特征集方面的有效性。