• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EXSEQREG:使用形态特征进行命名实体识别的案例研究,解释基于序列的 NLP 任务的区域。

EXSEQREG: Explaining sequence-based NLP tasks with regions with a case study using morphological features for named entity recognition.

机构信息

Computer Engineering Department, Boğaziçi University, Istanbul, Turkey.

出版信息

PLoS One. 2020 Dec 30;15(12):e0244179. doi: 10.1371/journal.pone.0244179. eCollection 2020.

DOI:10.1371/journal.pone.0244179
PMID:33378340
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7773252/
Abstract

The state-of-the-art systems for most natural language engineering tasks employ machine learning methods. Despite the improved performances of these systems, there is a lack of established methods for assessing the quality of their predictions. This work introduces a method for explaining the predictions of any sequence-based natural language processing (NLP) task implemented with any model, neural or non-neural. Our method named EXSEQREG introduces the concept of region that links the prediction and features that are potentially important for the model. A region is a list of positions in the input sentence associated with a single prediction. Many NLP tasks are compatible with the proposed explanation method as regions can be formed according to the nature of the task. The method models the prediction probability differences that are induced by careful removal of features used by the model. The output of the method is a list of importance values. Each value signifies the impact of the corresponding feature on the prediction. The proposed method is demonstrated with a neural network based named entity recognition (NER) tagger using Turkish and Finnish datasets. A qualitative analysis of the explanations is presented. The results are validated with a procedure based on the mutual information score of each feature. We show that this method produces reasonable explanations and may be used for i) assessing the degree of the contribution of features regarding a specific prediction of the model, ii) exploring the features that played a significant role for a trained model when analyzed across the corpus.

摘要

用于大多数自然语言工程任务的最先进系统采用机器学习方法。尽管这些系统的性能有所提高,但缺乏评估其预测质量的既定方法。这项工作介绍了一种用于解释任何基于序列的自然语言处理 (NLP) 任务的预测的方法,该方法可以使用任何模型(神经或非神经)来实现。我们的方法名为 EXSEQREG,它引入了与潜在对模型重要的特征相关联的预测的区域的概念。区域是与单个预测相关联的输入句子中位置的列表。许多 NLP 任务都与所提出的解释方法兼容,因为可以根据任务的性质形成区域。该方法对由模型使用的特征的仔细去除引起的预测概率差异进行建模。该方法的输出是重要值的列表。每个值表示相应特征对预测的影响。该方法使用基于神经网络的命名实体识别 (NER) 标记器和土耳其语及芬兰语数据集进行了演示。提出了一种定性分析解释的方法。结果通过基于每个特征的互信息得分的过程进行验证。我们表明,该方法产生了合理的解释,可用于:i)评估特征对于模型的特定预测的贡献程度,ii)探索在分析语料库时对经过训练的模型起重要作用的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/cbe035658d3b/pone.0244179.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/f74e964e7473/pone.0244179.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/264d5a88d879/pone.0244179.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/fb6006c873bb/pone.0244179.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/c9464b8b4119/pone.0244179.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/c91699400372/pone.0244179.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/e1526feaef40/pone.0244179.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/3c78b4ef557f/pone.0244179.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/cbe035658d3b/pone.0244179.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/f74e964e7473/pone.0244179.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/264d5a88d879/pone.0244179.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/fb6006c873bb/pone.0244179.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/c9464b8b4119/pone.0244179.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/c91699400372/pone.0244179.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/e1526feaef40/pone.0244179.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/3c78b4ef557f/pone.0244179.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b24d/7773252/cbe035658d3b/pone.0244179.g008.jpg

相似文献

1
EXSEQREG: Explaining sequence-based NLP tasks with regions with a case study using morphological features for named entity recognition.EXSEQREG:使用形态特征进行命名实体识别的案例研究,解释基于序列的 NLP 任务的区域。
PLoS One. 2020 Dec 30;15(12):e0244179. doi: 10.1371/journal.pone.0244179. eCollection 2020.
2
Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别
AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.
3
Entity recognition from clinical texts via recurrent neural network.基于循环神经网络的临床文本实体识别。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67. doi: 10.1186/s12911-017-0468-7.
4
Candidate region aware nested named entity recognition.候选区域感知嵌套命名实体识别。
Neural Netw. 2021 Oct;142:340-350. doi: 10.1016/j.neunet.2021.02.019. Epub 2021 Mar 16.
5
Improving Clinical Named-Entity Recognition with Transfer Learning.利用迁移学习改进临床命名实体识别
Stud Health Technol Inform. 2018;252:182-187.
6
Biomedical and clinical English model packages for the Stanza Python NLP library.适用于Stanza Python自然语言处理库的生物医学和临床英语模型包。
J Am Med Inform Assoc. 2021 Aug 13;28(9):1892-1899. doi: 10.1093/jamia/ocab090.
7
Transformers-sklearn: a toolkit for medical language understanding with transformer-based models.Transformer-sklearn:一个基于 Transformer 的模型的医学语言理解工具包。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):90. doi: 10.1186/s12911-021-01459-0.
8
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
9
Effects of data and entity ablation on multitask learning models for biomedical entity recognition.数据和实体消融对生物医学实体识别多任务学习模型的影响。
J Biomed Inform. 2022 Jun;130:104062. doi: 10.1016/j.jbi.2022.104062. Epub 2022 Apr 9.
10
Active learning for ontological event extraction incorporating named entity recognition and unknown word handling.结合命名实体识别和未知词处理的本体事件抽取的主动学习
J Biomed Semantics. 2016 Apr 27;7:22. doi: 10.1186/s13326-016-0059-z. eCollection 2016.

引用本文的文献

1
Data driven identification of international cutting edge science and technologies using SpaCy.利用 SpaCy 进行数据驱动的国际前沿科学技术识别。
PLoS One. 2022 Oct 12;17(10):e0275872. doi: 10.1371/journal.pone.0275872. eCollection 2022.
2
Year 2020 (with COVID): Observation of Scientific Literature on Clinical Natural Language Processing.2020 年(含新冠疫情):临床自然语言处理相关科学文献观察
Yearb Med Inform. 2021 Aug;30(1):257-263. doi: 10.1055/s-0041-1726528. Epub 2021 Sep 3.

本文引用的文献

1
On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation.关于通过逐层相关性传播对非线性分类器决策进行逐像素解释
PLoS One. 2015 Jul 10;10(7):e0130140. doi: 10.1371/journal.pone.0130140. eCollection 2015.