Xia Xiaobo, Han Bo, Wang Nannan, Deng Jiankang, Li Jiatong, Mao Yinian, Liu Tongliang
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3047-3058. doi: 10.1109/TPAMI.2022.3180545. Epub 2023 Feb 3.
The noise transition matrix T, reflecting the probabilities that true labels flip into noisy ones, is of vital importance to model label noise and build statistically consistent classifiers. The traditional transition matrix is limited to model closed-set label noise, where noisy training data have true class labels within the noisy label set. It is unfitted to employ such a transition matrix to model open-set label noise, where some true class labels are outside the noisy label set. Therefore, when considering a more realistic situation, i.e., both closed-set and open-set label noises occur, prior works will give unbelievable solutions. Besides, the traditional transition matrix is mostly limited to model instance-independent label noise, which may not perform well in practice. In this paper, we focus on learning with the mixed closed-set and open-set noisy labels. We address the aforementioned issues by extending the traditional transition matrix to be able to model mixed label noise, and further to the cluster-dependent transition matrix to better combat the instance-dependent label noise in real-world applications. We term the proposed transition matrix as the cluster-dependent extended transition matrix. An unbiased estimator (i.e., extended T-estimator) has been designed to estimate the cluster-dependent extended transition matrix by only exploiting the noisy data. Comprehensive experiments validate that our method can better cope with realistic label noise, following its more robust performance than the prior state-of-the-art label-noise learning methods.
噪声转移矩阵T反映了真实标签翻转成噪声标签的概率,对于建模标签噪声和构建统计上一致的分类器至关重要。传统的转移矩阵仅限于对封闭集标签噪声进行建模,其中噪声训练数据在噪声标签集中具有真实类别标签。使用这样的转移矩阵来对开放集标签噪声进行建模是不合适的,在开放集标签噪声中,一些真实类别标签在噪声标签集之外。因此,当考虑更现实的情况,即封闭集和开放集标签噪声都出现时,先前的工作会给出不可信的解决方案。此外,传统的转移矩阵大多限于对实例独立的标签噪声进行建模,这在实际中可能表现不佳。在本文中,我们专注于使用混合的封闭集和开放集噪声标签进行学习。我们通过扩展传统的转移矩阵来解决上述问题,使其能够对混合标签噪声进行建模,并进一步扩展到聚类相关的转移矩阵,以更好地应对实际应用中实例相关的标签噪声。我们将提出的转移矩阵称为聚类相关扩展转移矩阵。已经设计了一种无偏估计器(即扩展T估计器),仅通过利用噪声数据来估计聚类相关扩展转移矩阵。全面的实验验证了我们的方法能够更好地应对现实的标签噪声,其性能比先前的最先进标签噪声学习方法更稳健。