IEEE Trans Cybern. 2022 Jun;52(6):4459-4471. doi: 10.1109/TCYB.2020.3027509. Epub 2022 Jun 16.
Multi-label learning deals with training examples each represented by a single instance while associated with multiple class labels. Due to the exponential number of possible label sets to be considered by the predictive model, it is commonly assumed that label correlations should be well exploited to design an effective multi-label learning approach. On the other hand, class-imbalance stands as an intrinsic property of multi-label data which significantly affects the generalization performance of the multi-label predictive model. For each class label, the number of training examples with positive labeling assignment is generally much less than those with negative labeling assignment. To deal with the class-imbalance issue for multi-label learning, a simple yet effective class-imbalance aware learning strategy called cross-coupling aggregation (COCOA) is proposed in this article. Specifically, COCOA works by leveraging the exploitation of label correlations as well as the exploration of class-imbalance simultaneously. For each class label, a number of multiclass imbalance learners are induced by randomly coupling with other labels, whose predictions on the unseen instance are aggregated to determine the corresponding labeling relevancy. Extensive experiments on 18 benchmark datasets clearly validate the effectiveness of COCOA against state-of-the-art multi-label learning approaches especially in terms of imbalance-specific evaluation metrics.
多标签学习处理的训练示例每个都由单个实例表示,同时与多个类别标签相关联。由于预测模型需要考虑的可能标签集的数量呈指数增长,因此通常假设应该充分利用标签相关性来设计有效的多标签学习方法。另一方面,类不平衡是多标签数据的固有特性,这会显著影响多标签预测模型的泛化性能。对于每个类别标签,具有正标记分配的训练示例数量通常远少于具有负标记分配的训练示例数量。为了解决多标签学习中的类不平衡问题,本文提出了一种简单而有效的称为交叉耦合聚合(COCOA)的类不平衡感知学习策略。具体来说,COCOA 通过同时利用标签相关性的利用和类不平衡的探索来工作。对于每个类别标签,通过随机与其他标签耦合来诱导多个多类不平衡学习者,其对未见过的实例的预测被聚合以确定相应的标记相关性。在 18 个基准数据集上进行的广泛实验清楚地验证了 COCOA 相对于最先进的多标签学习方法的有效性,尤其是在不平衡特定评估指标方面。