Suppr超能文献

评估电子病历中核心参考解析的最新技术水平。

Evaluating the state of the art in coreference resolution for electronic medical records.

机构信息

Department of Information Studies, University at Albany, SUNY, Albany, New York 12222, USA.

出版信息

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):786-91. doi: 10.1136/amiajnl-2011-000784. Epub 2012 Feb 24.

Abstract

BACKGROUND

The fifth i2b2/VA Workshop on Natural Language Processing Challenges for Clinical Records conducted a systematic review on resolution of noun phrase coreference in medical records. Informatics for Integrating Biology and the Bedside (i2b2) and the Veterans Affair (VA) Consortium for Healthcare Informatics Research (CHIR) partnered to organize the coreference challenge. They provided the research community with two corpora of medical records for the development and evaluation of the coreference resolution systems. These corpora contained various record types (ie, discharge summaries, pathology reports) from multiple institutions.

METHODS

The coreference challenge provided the community with two annotated ground truth corpora and evaluated systems on coreference resolution in two ways: first, it evaluated systems for their ability to identify mentions of concepts and to link together those mentions. Second, it evaluated the ability of the systems to link together ground truth mentions that refer to the same entity. Twenty teams representing 29 organizations and nine countries participated in the coreference challenge.

RESULTS

The teams' system submissions showed that machine-learning and rule-based approaches worked best when augmented with external knowledge sources and coreference clues extracted from document structure. The systems performed better in coreference resolution when provided with ground truth mentions. Overall, the systems struggled in solving coreference resolution for cases that required domain knowledge.

摘要

背景

第五届 i2b2/VA 自然语言处理挑战临床记录研讨会对医疗记录中的名词短语共指消解问题进行了系统综述。整合生物学和床边信息学(i2b2)和退伍军人事务部(VA)医疗保健信息学研究联盟(CHIR)合作组织了本次共指挑战。他们为研究社区提供了两份医疗记录语料库,用于开发和评估共指消解系统。这些语料库包含来自多个机构的各种记录类型(例如,出院小结、病理报告)。

方法

本次共指挑战为社区提供了两个已注释的真实语料库,并通过两种方式评估系统的共指消解能力:首先,评估系统识别概念提及和将这些提及联系起来的能力。其次,评估系统将指称同一实体的真实提及联系起来的能力。二十支代表 29 个组织和九个国家的团队参加了共指挑战。

结果

团队系统提交的结果表明,机器学习和基于规则的方法在结合外部知识源和从文档结构中提取的共指线索时效果最佳。当提供真实提及时,系统在共指消解方面的表现更好。总体而言,系统在解决需要领域知识的共指消解问题时遇到了困难。

相似文献

1

引用本文的文献

本文引用的文献

1
MCORES: a system for noun phrase coreference resolution for clinical records.MCORES:用于临床记录中名词短语共指消解的系统。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):906-12. doi: 10.1136/amiajnl-2011-000591. Epub 2012 Mar 14.
3
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
4
Anaphoric relations in the clinical narrative: corpus creation.临床叙述中的回指关系:语料库创建。
J Am Med Inform Assoc. 2011 Jul-Aug;18(4):459-65. doi: 10.1136/amiajnl-2011-000108. Epub 2011 Apr 1.
5
Extracting medication information from clinical text.从临床文本中提取药物信息。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):514-8. doi: 10.1136/jamia.2010.003947.
7
Recognizing obesity and comorbidities in sparse data.在稀疏数据中识别肥胖及合并症。
J Am Med Inform Assoc. 2009 Jul-Aug;16(4):561-70. doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.
9
Identifying patient smoking status from medical discharge records.从医疗出院记录中识别患者的吸烟状况。
J Am Med Inform Assoc. 2008 Jan-Feb;15(1):14-24. doi: 10.1197/jamia.M2408. Epub 2007 Oct 18.
10
Evaluating the state-of-the-art in automatic de-identification.评估自动去识别技术的最新进展。
J Am Med Inform Assoc. 2007 Sep-Oct;14(5):550-63. doi: 10.1197/jamia.M2444. Epub 2007 Jun 28.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验