• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EUSKOR:巴斯克语端到端共指消解系统。

EUSKOR: End-to-end coreference resolution system for Basque.

机构信息

Computer Languages and Systems Department, University of the Basque Country, Donostia-San Sebastian, Spain.

Computer Architecture and Technology Department, University of the Basque Country, Donostia-San Sebastian, Spain.

出版信息

PLoS One. 2019 Sep 12;14(9):e0221801. doi: 10.1371/journal.pone.0221801. eCollection 2019.

DOI:10.1371/journal.pone.0221801
PMID:31513627
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6742394/
Abstract

This paper describes the process of adapting the Stanford Coreference resolution module to the Basque language, taking into account the characteristics of the language. The module has been integrated in a linguistic analysis pipeline obtaining an end-to-end coreference resolution system for the Basque language. The adaptation process explained can benefit and facilitate other languages with similar characteristics in the implementation of their coreference resolution systems. During the experimentation phase, we have demonstrated that language-specific features have a noteworthy effect on coreference resolution, obtaining a gain in CoNLL score of 7.07 with respect to the baseline system. We have also analysed the effect that preprocessing has in coreference resolution, comparing the results obtained with automatic mentions versus gold mentions. When gold mentions are provided, the results increase 11.5 points in CoNLL score in comparison with results obtained when automatic mentions are used. The contribution of each sieve is analysed concluding that morphology is essential for agglutinative languages to obtain good performance in coreference resolution. Finally, an error analysis of the coreference resolution system is presented which have revealed our system's weak points and help to determine the improvements of the system. As a result of the error analysis, we have enriched the Basque coreference resolution adding new two sieves, obtaining an improvement of 0.24 points in CoNLL F1 when automatic mentions are used and of 0.39 points when the gold mentions are provided.

摘要

本文描述了将斯坦福共指解析模块适配到巴斯克语的过程,考虑到语言的特点。该模块已经集成到语言分析管道中,得到了巴斯克语的端到端共指解析系统。所解释的适配过程可以为具有类似特征的其他语言在实现其共指解析系统时提供帮助和便利。在实验阶段,我们已经证明语言特定特征对共指解析有显著影响,与基线系统相比,共指得分提高了 7.07 分。我们还分析了预处理对共指解析的影响,比较了自动提及与黄金提及的结果。当提供黄金提及时,与使用自动提及相比,共指得分提高了 11.5 分。分析了每个筛子的贡献,得出结论,形态学对于黏着语在共指解析中获得良好性能至关重要。最后,对共指解析系统进行了错误分析,揭示了系统的弱点,并有助于确定系统的改进。作为错误分析的结果,我们通过添加两个新的筛子来丰富巴斯克语的共指解析,当使用自动提及时,共指 F1 提高了 0.24 分,当提供黄金提及时,共指 F1 提高了 0.39 分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02b/6742394/27f84948d8b1/pone.0221801.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02b/6742394/656e6df5c6a9/pone.0221801.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02b/6742394/27f84948d8b1/pone.0221801.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02b/6742394/656e6df5c6a9/pone.0221801.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02b/6742394/27f84948d8b1/pone.0221801.g002.jpg

相似文献

1
EUSKOR: End-to-end coreference resolution system for Basque.EUSKOR:巴斯克语端到端共指消解系统。
PLoS One. 2019 Sep 12;14(9):e0221801. doi: 10.1371/journal.pone.0221801. eCollection 2019.
2
A categorical analysis of coreference resolution errors in biomedical texts.生物医学文本中指代消解错误的分类分析。
J Biomed Inform. 2016 Apr;60:309-18. doi: 10.1016/j.jbi.2016.02.015. Epub 2016 Feb 27.
3
Minimalistic Approach to Coreference Resolution in Lithuanian Medical Records.立陶宛语病历中指代消解的极简方法。
Comput Math Methods Med. 2019 Mar 20;2019:9079840. doi: 10.1155/2019/9079840. eCollection 2019.
4
Using domain knowledge and domain-inspired discourse model for coreference resolution for clinical narratives.利用领域知识和领域启发的语篇模型解决临床叙述中的共指消解问题。
J Am Med Inform Assoc. 2013 Mar-Apr;20(2):356-62. doi: 10.1136/amiajnl-2011-000767. Epub 2012 Jul 10.
5
Coreferential Relations in Basque: The Annotation Process.巴斯克语中的共指关系:标注过程。
J Psycholinguist Res. 2018 Apr;47(2):325-342. doi: 10.1007/s10936-018-9559-6.
6
Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation.预测提及的共指分区评分:参考实现。
Proc Conf Assoc Comput Linguist Meet. 2014 Jun;2014:30-35. doi: 10.3115/v1/P14-2006.
7
Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.生物共指消解评分系统(Bio-SCoRes):一种用于生物医学文本共指消解的混合架构
PLoS One. 2016 Mar 2;11(3):e0148538. doi: 10.1371/journal.pone.0148538. eCollection 2016.
8
A supervised framework for resolving coreference in clinical records.一种用于解决临床记录中共指消解问题的有监督框架。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):875-82. doi: 10.1136/amiajnl-2012-000810. Epub 2012 May 19.
9
Distinguished representation of identical mentions in bio-entity coreference resolution.生物实体共指消解中相同提及的出色表示。
BMC Med Inform Decis Mak. 2022 Apr 30;22(1):116. doi: 10.1186/s12911-022-01862-1.
10
Evaluating the state of the art in coreference resolution for electronic medical records.评估电子病历中核心参考解析的最新技术水平。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):786-91. doi: 10.1136/amiajnl-2011-000784. Epub 2012 Feb 24.

本文引用的文献

1
Coreferential Relations in Basque: The Annotation Process.巴斯克语中的共指关系:标注过程。
J Psycholinguist Res. 2018 Apr;47(2):325-342. doi: 10.1007/s10936-018-9559-6.
2
Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation.预测提及的共指分区评分:参考实现。
Proc Conf Assoc Comput Linguist Meet. 2014 Jun;2014:30-35. doi: 10.3115/v1/P14-2006.