• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自适应校正对有噪声标签进行稳健的常识推理

Robust Commonsense Reasoning Against Noisy Labels Using Adaptive Correction.

作者信息

Yang Xu, Deng Cheng, Wei Kun, Tao Dacheng

出版信息

IEEE Trans Cybern. 2024 Jul;54(7):4138-4149. doi: 10.1109/TCYB.2023.3339629. Epub 2024 Jul 11.

DOI:10.1109/TCYB.2023.3339629
PMID:38150342
Abstract

Commonsense reasoning based on knowledge graphs (KGs) is a challenging task that requires predicting complex questions over the described textual contexts and relevant knowledge about the world. However, current methods typically assume clean training scenarios with accurately labeled samples, which are often unrealistic. The training set can include mislabeled samples, and the robustness to label noises is essential for commonsense reasoning methods to be practical, but this problem remains largely unexplored. This work focuses on commonsense reasoning with mislabeled training samples and makes several technical contributions: 1) we first construct diverse augmentations from knowledge and model, and offer a simple yet effective multiple-choice alignment method to divide the training samples into clean, semi-clean, and unclean parts; 2) we design adaptive label correction methods for the semi-clean and unclean samples to exploit the supervised potential of noisy information; and 3) finally, we extensively test these methods on noisy versions of commonsense reasoning benchmarks (CommonsenseQA and OpenbookQA). Experimental results show that the proposed method can significantly enhance robustness and improve overall performance. Furthermore, the proposed method is generally applicable to multiple existing commonsense reasoning frameworks to boost their robustness. The code is available at https://github.com/xdxuyang/CR_Noisy_Labels.

摘要

基于知识图谱(KGs)的常识推理是一项具有挑战性的任务,它需要根据所描述的文本上下文和关于世界的相关知识来预测复杂问题。然而,当前的方法通常假设训练场景是干净的,样本标签准确无误,而这往往不切实际。训练集可能包含错误标注的样本,对于常识推理方法而言,对标签噪声的鲁棒性是其具备实用性的关键,但这个问题在很大程度上仍未得到充分探索。这项工作聚焦于带有错误标注训练样本的常识推理,并做出了几项技术贡献:1)我们首先从知识和模型构建多样化的增强方法,并提供一种简单而有效的多项选择对齐方法,将训练样本划分为干净、半干净和不干净的部分;2)我们为半干净和不干净的样本设计自适应标签校正方法,以挖掘噪声信息的监督潜力;3)最后,我们在常识推理基准(常识问答和开放书本问答)的噪声版本上广泛测试这些方法。实验结果表明,所提出的方法能够显著增强鲁棒性并提高整体性能。此外,所提出的方法通常适用于多个现有的常识推理框架,以提升它们的鲁棒性。代码可在https://github.com/xdxuyang/CR_Noisy_Labels获取。

相似文献

1
Robust Commonsense Reasoning Against Noisy Labels Using Adaptive Correction.使用自适应校正对有噪声标签进行稳健的常识推理
IEEE Trans Cybern. 2024 Jul;54(7):4138-4149. doi: 10.1109/TCYB.2023.3339629. Epub 2024 Jul 11.
2
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning.不良标签:关于评估和增强标签噪声学习的稳健视角
IEEE Trans Pattern Anal Mach Intell. 2024 Jun;46(6):4398-4409. doi: 10.1109/TPAMI.2024.3355425. Epub 2024 May 7.
3
Generative Reasoning Integrated Label Noise Robust Deep Image Representation Learning.生成式推理集成标签噪声鲁棒深度图像表示学习
IEEE Trans Image Process. 2023;32:4529-4542. doi: 10.1109/TIP.2023.3293776. Epub 2023 Aug 10.
4
Learning With Noisy Labels Over Imbalanced Subpopulations.在不平衡子群体上使用噪声标签进行学习。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6544-6555. doi: 10.1109/TNNLS.2024.3389676. Epub 2025 Apr 4.
5
S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.S-CUDA:用于医学图像分割的自清洁无监督域适应
Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12.
6
Leveraging Symbolic Knowledge Bases for Commonsense Natural Language Inference Using Pattern Theory.利用符号知识库和模式理论进行常识自然语言推理。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13185-13202. doi: 10.1109/TPAMI.2023.3287837. Epub 2023 Oct 3.
7
The plausibility machine commonsense (PMC) dataset: A massively crowdsourced human-annotated dataset for studying plausibility in large language models.似真性机器常识(PMC)数据集:一个用于研究大语言模型中似真性的大规模众包人工标注数据集。
Data Brief. 2024 Aug 24;57:110869. doi: 10.1016/j.dib.2024.110869. eCollection 2024 Dec.
8
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense.CRIC:一个用于视觉与常识组合推理的视觉问答数据集。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5561-5578. doi: 10.1109/TPAMI.2022.3210780. Epub 2023 Apr 3.
9
Knockoffs-SPR: Clean Sample Selection in Learning With Noisy Labels.仿冒品-SPR:有噪声标签学习中的干净样本选择
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3242-3256. doi: 10.1109/TPAMI.2023.3338268. Epub 2024 Apr 3.
10
Configurable Graph Reasoning for Visual Relationship Detection.可配置图推理在视觉关系检测中的应用。
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):117-129. doi: 10.1109/TNNLS.2020.3027575. Epub 2022 Jan 5.