Department of Electrical and Information Engineering, University of Nairobi, Nairobi 30197, Kenya.
Biomed Res Int. 2021 Aug 28;2021:2555622. doi: 10.1155/2021/2555622. eCollection 2021.
Feature selection is the process of decreasing the number of features in a dataset by removing redundant, irrelevant, and randomly class-corrected data features. By applying feature selection on large and highly dimensional datasets, the redundant features are removed, reducing the complexity of the data and reducing training time. The objective of this paper was to design an optimizer that combines the well-known metaheuristic population-based optimizer, the grey wolf algorithm, and the gradient descent algorithm and test it for applications in feature selection problems. The proposed algorithm was first compared against the original grey wolf algorithm in 23 continuous test functions. The proposed optimizer was altered for feature selection, and 3 binary implementations were developed with final implementation compared against the two implementations of the binary grey wolf optimizer and binary grey wolf particle swarm optimizer on 6 medical datasets from the UCI machine learning repository, on metrics such as accuracy, size of feature subsets, -measure, accuracy, precision, and sensitivity. The proposed optimizer outperformed the three other optimizers in 3 of the 6 datasets in average metrics. The proposed optimizer showed promise in its capability to balance the two objectives in feature selection and could be further enhanced.
特征选择是通过删除冗余、不相关和随机类别校正的数据特征来减少数据集特征数量的过程。通过在大型和高维数据集上应用特征选择,可以去除冗余特征,降低数据的复杂性并减少训练时间。本文的目的是设计一个优化器,该优化器结合了著名的基于群体的元启发式优化算法,灰狼算法和梯度下降算法,并将其应用于特征选择问题。首先,将所提出的算法与原始灰狼算法在 23 个连续测试函数上进行了比较。针对特征选择对所提出的优化器进行了修改,并针对 6 个来自 UCI 机器学习存储库的医学数据集开发了 3 个二进制实现,最后与二进制灰狼优化器和二进制灰狼粒子群优化器的两种实现进行比较,比较指标包括准确性、特征子集大小、F1 度量、准确性、精度和灵敏度。在所比较的 6 个数据集的 3 个数据集中,所提出的优化器在平均指标方面优于其他 3 个优化器。所提出的优化器在特征选择中平衡两个目标的能力显示出了一定的潜力,并且可以进一步增强。