基于机器阅读理解模型的多语言多维度可解释性分析

Multilingual multi-aspect explainability analyses on machine reading comprehension models.

作者信息

Cui Yiming, Zhang Wei-Nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin

机构信息

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, Harbin 150001, China.

State Key Laboratory of Cognitive Intelligence, iFLYTEK Research, Beijing 100083, China.

出版信息

iScience. 2022 Mar 31;25(5):104176. doi: 10.1016/j.isci.2022.104176. eCollection 2022 May 20.

DOI:10.1016/j.isci.2022.104176

PMID:35465050

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9019247/

Abstract

Achieving human-level performance on some of the machine reading comprehension (MRC) datasets is no longer challenging with the help of powerful pre-trained language models (PLMs). However, the internal mechanism of these artifacts remains unclear, placing an obstacle to further understand these models. This paper focuses on conducting a series of analytical experiments to examine the relations between the multi-head self-attention and the final MRC system performance, revealing the potential explainability in PLM-based MRC models. To ensure the robustness of the analyses, we perform our experiments in a multilingual way on top of various PLMs. We discover that passage-to-question and passage understanding attentions are the most important ones in the question answering process, showing strong correlations to the final performance than other parts. Through comprehensive visualizations and case studies, we also observe several general findings on the attention maps, which can be helpful to understand how these models solve the questions.

摘要

借助强大的预训练语言模型（PLM），在一些机器阅读理解（MRC）数据集上实现人类水平的性能已不再具有挑战性。然而，这些模型的内部机制仍不清楚，这为进一步理解这些模型设置了障碍。本文专注于进行一系列分析实验，以检验多头自注意力与最终MRC系统性能之间的关系，揭示基于PLM的MRC模型中的潜在可解释性。为确保分析的稳健性，我们在各种PLM之上以多语言方式进行实验。我们发现，篇章到问题的注意力和篇章理解注意力在问答过程中最为重要，与最终性能的相关性比其他部分更强。通过全面的可视化和案例研究，我们还在注意力图上观察到了几个一般性的发现，这有助于理解这些模型是如何解决问题的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7e0/9019247/e4a1e34a8fe2/fx1.jpg

相似文献

Multilingual multi-aspect explainability analyses on machine reading comprehension models.基于机器阅读理解模型的多语言多维度可解释性分析

iScience. 2022 Mar 31;25(5):104176. doi: 10.1016/j.isci.2022.104176. eCollection 2022 May 20.

ExpMRC: explainability evaluation for machine reading comprehension.ExpMRC：机器阅读理解的可解释性评估

Heliyon. 2022 Apr 19;8(4):e09290. doi: 10.1016/j.heliyon.2022.e09290. eCollection 2022 Apr.

On solving textual ambiguities and semantic vagueness in MRC based question answering using generative pre-trained transformers.基于生成式预训练变换器解决基于机器阅读理解的问答中的文本歧义与语义模糊问题。

PeerJ Comput Sci. 2023 Jul 24;9:e1422. doi: 10.7717/peerj-cs.1422. eCollection 2023.

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.用于医疗保健应用的高效机器阅读理解：上下文提取方法的算法开发与验证

JMIR Form Res. 2024 Mar 25;8:e52482. doi: 10.2196/52482.

A self-supervised language model selection strategy for biomedical question answering.一种用于生物医学问答的自监督语言模型选择策略。

J Biomed Inform. 2023 Oct;146:104486. doi: 10.1016/j.jbi.2023.104486. Epub 2023 Sep 16.

Visualizing attention zones in machine reading comprehension models.可视化机器阅读理解模型中的注意力区域。

STAR Protoc. 2022 Jun 16;3(3):101481. doi: 10.1016/j.xpro.2022.101481. eCollection 2022 Sep 16.

A Unified Machine Reading Comprehension Framework for Cohort Selection.队列选择的统一机器阅读理解框架。

IEEE J Biomed Health Inform. 2022 Jan;26(1):379-387. doi: 10.1109/JBHI.2021.3095478. Epub 2022 Jan 17.

Integrate Candidate Answer Extraction with Re-Ranking for Chinese Machine Reading Comprehension.将候选答案提取与重排序相结合用于中文机器阅读理解。

Entropy (Basel). 2021 Mar 8;23(3):322. doi: 10.3390/e23030322.

: a novel resource for question answering on scholarly articles.一种用于学术文章问答的新型资源。

Int J Digit Libr. 2022;23(3):289-301. doi: 10.1007/s00799-022-00329-y. Epub 2022 Jul 20.

UDDIPOK: A reading comprehension based question answering dataset in Bangla language.UDDIPOK：一个基于阅读理解的孟加拉语问答数据集。

Data Brief. 2023 Feb 2;47:108933. doi: 10.1016/j.dib.2023.108933. eCollection 2023 Apr.

引用本文的文献

Visualizing attention zones in machine reading comprehension models.可视化机器阅读理解模型中的注意力区域。

STAR Protoc. 2022 Jun 16;3(3):101481. doi: 10.1016/j.xpro.2022.101481. eCollection 2022 Sep 16.

ExpMRC: explainability evaluation for machine reading comprehension.ExpMRC：机器阅读理解的可解释性评估

Heliyon. 2022 Apr 19;8(4):e09290. doi: 10.1016/j.heliyon.2022.e09290. eCollection 2022 Apr.

本文引用的文献

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型，转而使用可解释模型。

Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.

ExpMRC: explainability evaluation for machine reading comprehension.ExpMRC：机器阅读理解的可解释性评估

Heliyon. 2022 Apr 19;8(4):e09290. doi: 10.1016/j.heliyon.2022.e09290. eCollection 2022 Apr.

Improved image classification explainability with high-accuracy heatmaps.通过高精度热图提高图像分类的可解释性。

iScience. 2022 Feb 15;25(3):103933. doi: 10.1016/j.isci.2022.103933. eCollection 2022 Mar 18.

Definitions, methods, and applications in interpretable machine learning.可解释机器学习中的定义、方法和应用。

Proc Natl Acad Sci U S A. 2019 Oct 29;116(44):22071-22080. doi: 10.1073/pnas.1900654116. Epub 2019 Oct 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于机器阅读理解模型的多语言多维度可解释性分析

Multilingual multi-aspect explainability analyses on machine reading comprehension models.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献