ExpMRC：机器阅读理解的可解释性评估

ExpMRC: explainability evaluation for machine reading comprehension.

作者信息

Cui Yiming, Liu Ting, Che Wanxiang, Chen Zhigang, Wang Shijin

机构信息

Research Center for SCIR, Harbin Institute of Technology, Harbin 150001, China.

State Key Laboratory of Cognitive Intelligence, iFLYTEK Research, Beijing 100010, China.

出版信息

Heliyon. 2022 Apr 19;8(4):e09290. doi: 10.1016/j.heliyon.2022.e09290. eCollection 2022 Apr.

DOI:10.1016/j.heliyon.2022.e09290

PMID:35497046

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9048090/

Abstract

Achieving human-level performance on some Machine Reading Comprehension (MRC) datasets is no longer challenging with the help of powerful Pre-trained Language Models (PLMs). However, it is necessary to provide both answer prediction and its explanation to further improve the MRC system's reliability, especially for real-life applications. In this paper, we propose a new benchmark called ExpMRC for evaluating the textual explainability of the MRC systems. ExpMRC contains four subsets, including SQuAD, CMRC 2018, RACE, and C, with additional annotations of the answer's evidence. The MRC systems are required to give not only the correct answer but also its explanation. We use state-of-the-art PLMs to build baseline systems and adopt various unsupervised approaches to extract both answer and evidence spans without human-annotated evidence spans. The experimental results show that these models are still far from human performance, suggesting that the ExpMRC is challenging. Resources (data and baselines) are available through https://github.com/ymcui/expmrc.

摘要

借助强大的预训练语言模型（PLM），在某些机器阅读理解（MRC）数据集上实现人类水平的性能已不再具有挑战性。然而，为了进一步提高MRC系统的可靠性，特别是对于实际应用而言，有必要同时提供答案预测及其解释。在本文中，我们提出了一个名为ExpMRC的新基准，用于评估MRC系统的文本可解释性。ExpMRC包含四个子集，包括SQuAD、CMRC 2018、RACE和C，并带有答案证据的附加注释。MRC系统不仅需要给出正确答案，还需要给出其解释。我们使用最先进的PLM来构建基线系统，并采用各种无监督方法来提取答案和证据跨度，而无需人工标注的证据跨度。实验结果表明，这些模型仍远未达到人类性能，这表明ExpMRC具有挑战性。可通过https://github.com/ymcui/expmrc获取资源（数据和基线）。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/441b/9048090/acaf86a408f9/gr001.jpg

相似文献

ExpMRC: explainability evaluation for machine reading comprehension.ExpMRC：机器阅读理解的可解释性评估

Heliyon. 2022 Apr 19;8(4):e09290. doi: 10.1016/j.heliyon.2022.e09290. eCollection 2022 Apr.

Multilingual multi-aspect explainability analyses on machine reading comprehension models.基于机器阅读理解模型的多语言多维度可解释性分析

iScience. 2022 Mar 31;25(5):104176. doi: 10.1016/j.isci.2022.104176. eCollection 2022 May 20.

On solving textual ambiguities and semantic vagueness in MRC based question answering using generative pre-trained transformers.基于生成式预训练变换器解决基于机器阅读理解的问答中的文本歧义与语义模糊问题。

PeerJ Comput Sci. 2023 Jul 24;9:e1422. doi: 10.7717/peerj-cs.1422. eCollection 2023.

Clinical concept and relation extraction using prompt-based machine reading comprehension.基于提示的机器阅读理解的临床概念和关系抽取。

J Am Med Inform Assoc. 2023 Aug 18;30(9):1486-1493. doi: 10.1093/jamia/ocad107.

HCT: Chinese Medical Machine Reading Comprehension Question-Answering via Hierarchically Collaborative Transformer.HCT：基于层次协作 Transformer 的中文医学机器阅读理解问答

IEEE J Biomed Health Inform. 2024 May;28(5):3055-3066. doi: 10.1109/JBHI.2024.3368288. Epub 2024 May 6.

BioADAPT-MRC: adversarial learning-based domain adaptation improves biomedical machine reading comprehension task.BioADAPT-MRC：基于对抗学习的领域自适应提高生物医学机器阅读理解任务。

Bioinformatics. 2022 Sep 15;38(18):4369-4379. doi: 10.1093/bioinformatics/btac508.

Integrate Candidate Answer Extraction with Re-Ranking for Chinese Machine Reading Comprehension.将候选答案提取与重排序相结合用于中文机器阅读理解。

Entropy (Basel). 2021 Mar 8;23(3):322. doi: 10.3390/e23030322.

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.用于医疗保健应用的高效机器阅读理解：上下文提取方法的算法开发与验证

JMIR Form Res. 2024 Mar 25;8:e52482. doi: 10.2196/52482.

Visualizing attention zones in machine reading comprehension models.可视化机器阅读理解模型中的注意力区域。

STAR Protoc. 2022 Jun 16;3(3):101481. doi: 10.1016/j.xpro.2022.101481. eCollection 2022 Sep 16.

: a novel resource for question answering on scholarly articles.一种用于学术文章问答的新型资源。

Int J Digit Libr. 2022;23(3):289-301. doi: 10.1007/s00799-022-00329-y. Epub 2022 Jul 20.

引用本文的文献

Multilingual multi-aspect explainability analyses on machine reading comprehension models.基于机器阅读理解模型的多语言多维度可解释性分析

iScience. 2022 Mar 31;25(5):104176. doi: 10.1016/j.isci.2022.104176. eCollection 2022 May 20.

本文引用的文献

Multilingual multi-aspect explainability analyses on machine reading comprehension models.基于机器阅读理解模型的多语言多维度可解释性分析

iScience. 2022 Mar 31;25(5):104176. doi: 10.1016/j.isci.2022.104176. eCollection 2022 May 20.

XAI-Explainable artificial intelligence.可解释人工智能

Sci Robot. 2019 Dec 18;4(37). doi: 10.1126/scirobotics.aay7120.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

ExpMRC：机器阅读理解的可解释性评估

ExpMRC: explainability evaluation for machine reading comprehension.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献