Park Jaegyun, Park Min-Woo, Kim Dae-Won, Lee Jaesung
School of Computer Science and Engineering, Chung-Ang University, 221, Heukseok-Dong, Dongjak-Gu, Seoul 06974, Korea.
Entropy (Basel). 2020 Aug 10;22(8):876. doi: 10.3390/e22080876.
Multilabel feature selection is an effective preprocessing step for improving multilabel classification accuracy, because it highlights discriminative features for multiple labels. Recently, multi-population genetic algorithms have gained significant attention with regard to feature selection studies. This is owing to their enhanced search capability when compared to that of traditional genetic algorithms that are based on communication among multiple populations. However, conventional methods employ a simple communication process without adapting it to the multilabel feature selection problem, which results in poor-quality final solutions. In this paper, we propose a new multi-population genetic algorithm, based on a novel communication process, which is specialized for the multilabel feature selection problem. Our experimental results on 17 multilabel datasets demonstrate that the proposed method is superior to other multi-population-based feature selection methods.
多标签特征选择是提高多标签分类准确率的有效预处理步骤,因为它突出了多个标签的判别性特征。最近,多群体遗传算法在特征选择研究方面受到了广泛关注。这是因为与基于多个群体之间通信的传统遗传算法相比,它们具有更强的搜索能力。然而,传统方法采用的是简单的通信过程,没有使其适应多标签特征选择问题,这导致最终解决方案的质量较差。在本文中,我们基于一种新颖的通信过程提出了一种新的多群体遗传算法,该算法专门针对多标签特征选择问题。我们在17个多标签数据集上的实验结果表明,所提出的方法优于其他基于多群体的特征选择方法。