College of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060, China.
College of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060, China; Shenzhen Key Lab. of Advanced Machine Learning and Applications, Shenzhen University, Shenzhen, 518060, China; Guangdong Key Lab. of Intelligent Information Process, Shenzhen University, Shenzhen, 518060, China.
Neural Netw. 2023 May;162:258-270. doi: 10.1016/j.neunet.2023.02.045. Epub 2023 Mar 7.
Multi-label Active Learning (MLAL) is an effective method to improve the performance of the classifier on multi-label problems with less annotation effort by allowing the learning system to actively select high-quality examples (example-label pairs) for labeling. Existing MLAL algorithms mainly focus on designing reasonable algorithms to evaluate the potential values (as previously mentioned quality) of the unlabeled data. These manually designed methods may show totally different results on various types of datasets due to the defect of the methods or the particularity of the datasets. In this paper, instead of manually designing an evaluation method, we propose a deep reinforcement learning (DRL) model to explore a general evaluation method on several seen datasets and eventually apply it to unseen datasets based on a meta framework. In addition, a self-attention mechanism along with a reward function is integrated into the DRL structure to address the label correlation and data imbalanced problems in MLAL. Comprehensive experiments show that our proposed DRL-based MLAL method is able to produce comparable results as compared with other methods reported in the literature.
多标签主动学习(MLAL)是一种有效的方法,可以通过允许学习系统主动选择高质量的示例(示例-标签对)进行标注,从而减少标注工作量,提高多标签问题的分类器性能。现有的 MLAL 算法主要侧重于设计合理的算法来评估未标记数据的潜在值(如前所述的质量)。由于方法的缺陷或数据集的特殊性,这些手动设计的方法在各种类型的数据集上可能会产生完全不同的结果。在本文中,我们不是手动设计评估方法,而是提出了一个深度强化学习(DRL)模型,在几个已见数据集上探索一种通用的评估方法,最终基于元框架将其应用于未见数据集。此外,我们还将自注意力机制和奖励函数集成到 DRL 结构中,以解决 MLAL 中的标签相关性和数据不平衡问题。综合实验表明,我们提出的基于 DRL 的 MLAL 方法能够产生与文献中报道的其他方法相当的结果。