Xie Ming-Kun, Huang Sheng-Jun
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3676-3687. doi: 10.1109/TPAMI.2021.3059290. Epub 2022 Jun 3.
Partial multi-label learning (PML) deals with problems where each instance is assigned with a candidate label set, which contains multiple relevant labels and some noisy labels. Recent studies usually solve PML problems with the disambiguation strategy, which recovers ground-truth labels from the candidate label set by simply assuming that the noisy labels are generated randomly. In real applications, however, noisy labels are usually caused by some ambiguous contents of the example. Based on this observation, we propose a partial multi-label learning approach to simultaneously recover the ground-truth information and identify the noisy labels. The two objectives are formalized in a unified framework with trace norm and l norm regularizers. Under the supervision of the observed noise-corrupted label matrix, the multi-label classifier and noisy label identifier are jointly optimized by incorporating the label correlation exploitation and feature-induced noise model. Furthermore, by mapping each bag to a feature vector, we extend PML-NI method into multi-instance multi-label learning by identifying noisy labels based on ambiguous instances. A theoretical analysis of generalization bound and extensive experiments on multiple data sets from various real-world tasks demonstrate the effectiveness of the proposed approach.
部分多标签学习(PML)处理的问题是,每个实例都被分配一个候选标签集,其中包含多个相关标签和一些噪声标签。最近的研究通常使用消歧策略来解决PML问题,该策略通过简单地假设噪声标签是随机生成的,从候选标签集中恢复真实标签。然而,在实际应用中,噪声标签通常是由示例的一些模糊内容引起的。基于这一观察,我们提出了一种部分多标签学习方法,以同时恢复真实信息并识别噪声标签。这两个目标在一个带有迹范数和l范数正则化器的统一框架中被形式化。在观察到的噪声损坏标签矩阵的监督下,通过结合标签相关性利用和特征诱导噪声模型,对多标签分类器和噪声标签识别器进行联合优化。此外,通过将每个包映射到一个特征向量,我们通过基于模糊实例识别噪声标签,将PML-NI方法扩展到多实例多标签学习。对泛化界的理论分析以及在来自各种实际任务的多个数据集上进行的大量实验证明了所提出方法的有效性。