Yang Xu, Deng Cheng, Wei Kun, Tao Dacheng
IEEE Trans Cybern. 2024 Jul;54(7):4138-4149. doi: 10.1109/TCYB.2023.3339629. Epub 2024 Jul 11.
Commonsense reasoning based on knowledge graphs (KGs) is a challenging task that requires predicting complex questions over the described textual contexts and relevant knowledge about the world. However, current methods typically assume clean training scenarios with accurately labeled samples, which are often unrealistic. The training set can include mislabeled samples, and the robustness to label noises is essential for commonsense reasoning methods to be practical, but this problem remains largely unexplored. This work focuses on commonsense reasoning with mislabeled training samples and makes several technical contributions: 1) we first construct diverse augmentations from knowledge and model, and offer a simple yet effective multiple-choice alignment method to divide the training samples into clean, semi-clean, and unclean parts; 2) we design adaptive label correction methods for the semi-clean and unclean samples to exploit the supervised potential of noisy information; and 3) finally, we extensively test these methods on noisy versions of commonsense reasoning benchmarks (CommonsenseQA and OpenbookQA). Experimental results show that the proposed method can significantly enhance robustness and improve overall performance. Furthermore, the proposed method is generally applicable to multiple existing commonsense reasoning frameworks to boost their robustness. The code is available at https://github.com/xdxuyang/CR_Noisy_Labels.
基于知识图谱(KGs)的常识推理是一项具有挑战性的任务,它需要根据所描述的文本上下文和关于世界的相关知识来预测复杂问题。然而,当前的方法通常假设训练场景是干净的,样本标签准确无误,而这往往不切实际。训练集可能包含错误标注的样本,对于常识推理方法而言,对标签噪声的鲁棒性是其具备实用性的关键,但这个问题在很大程度上仍未得到充分探索。这项工作聚焦于带有错误标注训练样本的常识推理,并做出了几项技术贡献:1)我们首先从知识和模型构建多样化的增强方法,并提供一种简单而有效的多项选择对齐方法,将训练样本划分为干净、半干净和不干净的部分;2)我们为半干净和不干净的样本设计自适应标签校正方法,以挖掘噪声信息的监督潜力;3)最后,我们在常识推理基准(常识问答和开放书本问答)的噪声版本上广泛测试这些方法。实验结果表明,所提出的方法能够显著增强鲁棒性并提高整体性能。此外,所提出的方法通常适用于多个现有的常识推理框架,以提升它们的鲁棒性。代码可在https://github.com/xdxuyang/CR_Noisy_Labels获取。