Estimating Per-Class Statistics for Label Noise Learning.

Authors

Luo Wenshui, Chen Shuo, Liu Tongliang, Han Bo, Niu Gang, Sugiyama Masashi, Tao Dacheng, Gong Chen

Publication information

IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):305-322. doi: 10.1109/TPAMI.2024.3466182. Epub 2024 Dec 4.

DOI: 10.1109/TPAMI.2024.3466182
PMID: 39312440

Abstract

Real-world data may contain a considerable amount of noisily labeled examples, which usually mislead the training algorithm and result in degraded classification performance on test data. Therefore, Label Noise Learning (LNL) was proposed, of which one popular research trend focused on estimating the critical statistics (e.g., sample mean and sample covariance), to recover the clean data distribution. However, existing methods may suffer from the unreliable sample selection process or can hardly be applied to multi-class cases. Inspired by the centroid estimation theory, we propose Per-Class Statistic Estimation (PCSE), which establishes the quantitative relationship between the clean (first-order and second-order) statistics and the corresponding noisy statistics for every class. This relationship is further utilized to induce a generative classifier for model inference. Unlike existing methods, our approach does not require sample selection from the instance level. Moreover, our PCSE can serve as a general post-processing strategy applicable to various popular networks pre-trained on the noisy dataset for boosting their classification performance. Theoretically, we prove that the estimated statistics converge to their ground-truth values as the sample size increases, even if the estimated label transition matrix is biased. Empirically, we conducted intensive experiments on various binary and multi-class datasets, and the results demonstrate that PCSE achieves more precise statistic estimation as well as higher classification accuracy when compared with state-of-the-art methods in LNL.
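
The abstract does not state the PCSE equations, so the following is only a minimal sketch of the general idea of relating noisy per-class statistics to clean ones. It assumes class-conditional label noise, under which the feature distribution of each noisy class is a mixture of the clean class distributions, so noisy first and second moments are linear combinations of the clean ones and can be recovered by solving a linear system. The function names (`estimate_clean_moments`, `gaussian_predict`), the matrix `Q` with `Q[j, i] = P(clean = i | noisy = j)`, and the synthetic data are all assumptions made for illustration, not the authors' method. In the demo, `Q` is computed from the clean labels, which would not be available in practice and would have to be estimated.

```python
import numpy as np

def estimate_clean_moments(X, noisy_y, Q):
    """Recover per-class clean means and covariances from noisy-class moments.

    Assumption (not from the paper): Q[j, i] = P(clean = i | noisy = j), and
    features depend on the noisy label only through the clean label.
    """
    k, d = Q.shape[0], X.shape[1]
    groups = [X[noisy_y == j] for j in range(k)]
    noisy_mean = np.stack([g.mean(axis=0) for g in groups])      # (k, d)
    noisy_second = np.stack([g.T @ g / len(g) for g in groups])  # (k, d, d)
    # Noisy moments = Q @ clean moments, so solve the linear system for the clean ones.
    clean_mean = np.linalg.solve(Q, noisy_mean)
    clean_second = np.linalg.solve(Q, noisy_second.reshape(k, -1)).reshape(k, d, d)
    # Covariance = uncentered second moment minus outer product of the mean.
    clean_cov = clean_second - np.einsum("ki,kj->kij", clean_mean, clean_mean)
    return clean_mean, clean_cov

def gaussian_predict(X, means, covs, priors, ridge=1e-6):
    """Generative (Gaussian) classifier built from the estimated statistics."""
    scores = []
    for mu, cov, p in zip(means, covs, priors):
        cov = cov + ridge * np.eye(cov.shape[0])  # small ridge for numerical stability
        diff = X - mu
        _, logdet = np.linalg.slogdet(cov)
        maha = np.einsum("ni,ij,nj->n", diff, np.linalg.inv(cov), diff)
        scores.append(np.log(p) - 0.5 * (logdet + maha))
    return np.argmax(np.stack(scores, axis=1), axis=1)

# Synthetic two-class demo (all values are illustrative assumptions).
rng = np.random.default_rng(0)
n = 20000
true_means = np.array([[-2.0, 0.0], [2.0, 0.0]])
clean_y = np.repeat([0, 1], n)
X = true_means[clean_y] + rng.standard_normal((2 * n, 2))

# Class-conditional flips: 30% of class 0 and 20% of class 1 get the wrong label.
flip_prob = np.array([0.3, 0.2])
flip = rng.random(2 * n) < flip_prob[clean_y]
noisy_y = np.where(flip, 1 - clean_y, clean_y)

# Demo only: Q is read off the clean labels; in practice it must be estimated.
Q = np.array([[np.mean(clean_y[noisy_y == j] == i) for i in range(2)]
              for j in range(2)])
noisy_prior = np.bincount(noisy_y, minlength=2) / len(noisy_y)
clean_prior = noisy_prior @ Q  # P(clean = i) = sum_j P(noisy = j) Q[j, i]

clean_mean, clean_cov = estimate_clean_moments(X, noisy_y, Q)
pred = gaussian_predict(X, clean_mean, clean_cov, clean_prior)
acc = np.mean(pred == clean_y)
print("estimated clean means:\n", clean_mean)
print("accuracy against clean labels:", acc)
```

Note that no individual example is selected or relabeled here; only class-level statistics are corrected, which mirrors the abstract's point about avoiding instance-level sample selection. With small samples or a poorly estimated `Q`, the recovered covariance may fail to be positive definite, which this sketch does not handle beyond a small ridge term.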

Similar articles

1
Estimating Per-Class Statistics for Label Noise Learning.
IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):305-322. doi: 10.1109/TPAMI.2024.3466182. Epub 2024 Dec 4.
2
Class-Wise Denoising for Robust Learning Under Label Noise.
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2835-2848. doi: 10.1109/TPAMI.2022.3178690. Epub 2023 Feb 3.
3
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning.
IEEE Trans Pattern Anal Mach Intell. 2024 Jun;46(6):4398-4409. doi: 10.1109/TPAMI.2024.3355425. Epub 2024 May 7.
4
Adaptive estimation of instance-dependent noise transition matrix for learning with instance-dependent label noise.
Neural Netw. 2025 Aug;188:107464. doi: 10.1016/j.neunet.2025.107464. Epub 2025 Apr 12.
5
Combating Medical Label Noise through more precise partition-correction and progressive hard-enhanced learning.
Comput Methods Programs Biomed. 2025 Jun;265:108734. doi: 10.1016/j.cmpb.2025.108734. Epub 2025 Mar 29.
6
A Parametrical Model for Instance-Dependent Label Noise.
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14055-14068. doi: 10.1109/TPAMI.2023.3301876. Epub 2023 Nov 3.
7
Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise.
Med Image Comput Comput Assist Interv. 2024 Oct;15011:37-47. doi: 10.1007/978-3-031-72120-5_4. Epub 2024 Oct 3.
8
Harnessing Side Information for Classification Under Label Noise.
IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3178-3192. doi: 10.1109/TNNLS.2019.2938782. Epub 2019 Sep 25.
9
Dynamic Loss for Robust Learning.
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14420-14434. doi: 10.1109/TPAMI.2023.3311636. Epub 2023 Nov 3.
10
Typicality- and instance-dependent label noise-combating: a novel framework for simulating and combating real-world noisy labels for endoscopic polyp classification.
Vis Comput Ind Biomed Art. 2024 May 6;7(1):10. doi: 10.1186/s42492-024-00162-x.