Akinola Olatunji O, Ezugwu Absalom E, Agushaka Jeffrey O, Zitar Raed Abu, Abualigah Laith
School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa.
Sorbonne Center of Artificial Intelligence, Sorbonne University-Abu Dhabi, 38044 Abu Dhabi, United Arab Emirates.
Neural Comput Appl. 2022;34(22):19751-19790. doi: 10.1007/s00521-022-07705-4. Epub 2022 Aug 30.
Selecting relevant feature subsets is vital in machine learning, and multiclass feature selection is harder to perform since most classifications are binary. The feature selection problem aims at reducing the feature set dimension while maintaining the performance model accuracy. Datasets can be classified using various methods. Nevertheless, metaheuristic algorithms attract substantial attention to solving different problems in optimization. For this reason, this paper presents a systematic survey of literature for solving multiclass feature selection problems utilizing metaheuristic algorithms that can assist classifiers selects optima or near optima features faster and more accurately. Metaheuristic algorithms have also been presented in four primary behavior-based categories, i.e., evolutionary-based, swarm-intelligence-based, physics-based, and human-based, even though some literature works presented more categorization. Further, lists of metaheuristic algorithms were introduced in the categories mentioned. In finding the solution to issues related to multiclass feature selection, only articles on metaheuristic algorithms used for multiclass feature selection problems from the year 2000 to 2022 were reviewed about their different categories and detailed descriptions. We considered some application areas for some of the metaheuristic algorithms applied for multiclass feature selection with their variations. Popular multiclass classifiers for feature selection were also examined. Moreover, we also presented the challenges of metaheuristic algorithms for feature selection, and we identified gaps for further research studies.
在机器学习中,选择相关特征子集至关重要,而多类特征选择更难执行,因为大多数分类都是二分类。特征选择问题旨在在保持性能模型准确性的同时降低特征集维度。数据集可以使用各种方法进行分类。然而,元启发式算法在解决优化中的不同问题时引起了广泛关注。因此,本文对利用元启发式算法解决多类特征选择问题的文献进行了系统综述,这些算法可以帮助分类器更快、更准确地选择最优或接近最优的特征。元启发式算法也被分为基于四种主要行为的类别,即基于进化的、基于群体智能的、基于物理的和基于人类的,尽管一些文献提出了更多的分类。此外,还介绍了上述类别中的元启发式算法列表。在寻找与多类特征选择相关问题的解决方案时,我们仅回顾了2000年至2022年用于多类特征选择问题的元启发式算法的文章,涉及它们的不同类别和详细描述。我们考虑了一些应用于多类特征选择的元启发式算法及其变体的应用领域。还研究了用于特征选择的流行多类分类器。此外,我们还提出了元启发式算法在特征选择方面的挑战,并确定了进一步研究的差距。