Yildirim Suna, Alatas Bilal
Data Processing Department, Secretary general of Special Provincial Administration, Elazig, Turkey.
Department of Software Engineering, Firat (Euphrates) University, Elazig, Turkey.
PeerJ Comput Sci. 2024 Sep 6;10:e2307. doi: 10.7717/peerj-cs.2307. eCollection 2024.
Classification rule mining represents a significant field of machine learning, facilitating informed decision-making through the extraction of meaningful rules from complex data. Many classification methods cannot simultaneously optimize both explainability and different performance metrics at the same time. Metaheuristic optimization-based solutions, inspired by natural phenomena, offer a potential paradigm shift in this field, enabling the development of interpretable and scalable classifiers. In contrast to classical methods, such rule extraction-based solutions are capable of classification by taking multiple purposes into consideration simultaneously. To the best of our knowledge, although there are limited studies on metaheuristic based classification, there is not any method that optimize more than three objectives while increasing the explainability and interpretability for classification task. In this study, data sets are treated as the search space and metaheuristics as the many-objective rule discovery strategy and study proposes a metaheuristic many-objective optimization-based rule extraction approach for the first time in the literature. Chaos theory is also integrated to the optimization method for performance increment and the proposed chaotic rule-based SPEA2 algorithm enables the simultaneous optimization of four different success metrics and automatic rule extraction. Another distinctive feature of the proposed algorithm is that, in contrast to classical random search methods, it can mitigate issues such as correlation and poor uniformity between candidate solutions through the use of a chaotic random search mechanism in the exploration and exploitation phases. The efficacy of the proposed method is evaluated using three distinct data sets, and its performance is demonstrated in comparison with other classical machine learning results.
分类规则挖掘是机器学习的一个重要领域,它通过从复杂数据中提取有意义的规则来促进明智的决策。许多分类方法无法同时优化可解释性和不同的性能指标。受自然现象启发的基于元启发式优化的解决方案在该领域提供了一种潜在的范式转变,能够开发可解释且可扩展的分类器。与传统方法不同,这种基于规则提取的解决方案能够通过同时考虑多个目标来进行分类。据我们所知,尽管关于基于元启发式的分类的研究有限,但尚无任何方法能在增加分类任务的可解释性和可解读性的同时优化超过三个目标。在本研究中,将数据集视为搜索空间,将元启发式方法视为多目标规则发现策略,并且首次在文献中提出了一种基于元启发式多目标优化的规则提取方法。混沌理论也被集成到优化方法中以提高性能,所提出的基于混沌规则的SPEA2算法能够同时优化四个不同的成功指标并自动提取规则。所提出算法的另一个显著特点是,与传统随机搜索方法不同,它可以在探索和利用阶段通过使用混沌随机搜索机制来缓解候选解决方案之间的相关性和均匀性差等问题。使用三个不同的数据集评估了所提出方法的有效性,并与其他经典机器学习结果进行比较展示了其性能。