临床笔记中指代消解的无限混合模型。

An Infinite Mixture Model for Coreference Resolution in Clinical Notes.

作者信息

Liu Sijia, Liu Hongfang, Chaudhary Vipin, Li Dingcheng

机构信息

University at Buffalo, the State University of New York, Buffalo, NY;

Mayo Clinic, Rochester, MN.

出版信息

AMIA Jt Summits Transl Sci Proc. 2016 Jul 22;2016:428-37. eCollection 2016.

PMID:27595047

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5009297/

Abstract

It is widely acknowledged that natural language processing is indispensable to process electronic health records (EHRs). However, poor performance in relation detection tasks, such as coreference (linguistic expressions pertaining to the same entity/event) may affect the quality of EHR processing. Hence, there is a critical need to advance the research for relation detection from EHRs. Most of the clinical coreference resolution systems are based on either supervised machine learning or rule-based methods. The need for manually annotated corpus hampers the use of such system in large scale. In this paper, we present an infinite mixture model method using definite sampling to resolve coreferent relations among mentions in clinical notes. A similarity measure function is proposed to determine the coreferent relations. Our system achieved a 0.847 F-measure for i2b2 2011 coreference corpus. This promising results and the unsupervised nature make it possible to apply the system in big-data clinical setting.

摘要

人们普遍认为，自然语言处理对于处理电子健康记录（EHR）不可或缺。然而，在关系检测任务（如共指消解，即与同一实体/事件相关的语言表达）方面表现不佳，可能会影响电子健康记录处理的质量。因此，迫切需要推进从电子健康记录中进行关系检测的研究。大多数临床共指消解系统基于监督机器学习或基于规则的方法。对人工标注语料库的需求阻碍了此类系统在大规模场景中的应用。在本文中，我们提出了一种使用确定性采样的无限混合模型方法，以解决临床笔记中提及内容之间的共指关系。提出了一种相似性度量函数来确定共指关系。我们的系统在i2b2 2011共指语料库上的F值为0.847。这一有前景的结果以及无监督的特性使得该系统能够应用于大数据临床场景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d987/5009297/5e0d80129f36/2370111f1.jpg

相似文献

An Infinite Mixture Model for Coreference Resolution in Clinical Notes.临床笔记中指代消解的无限混合模型。

AMIA Jt Summits Transl Sci Proc. 2016 Jul 22;2016:428-37. eCollection 2016.

Evaluating the state of the art in coreference resolution for electronic medical records.评估电子病历中核心参考解析的最新技术水平。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):786-91. doi: 10.1136/amiajnl-2011-000784. Epub 2012 Feb 24.

Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.科罗拉多生物医学期刊文章丰富注释全文（CRAFT）语料库中的共指标注与消解

BMC Bioinformatics. 2017 Aug 17;18(1):372. doi: 10.1186/s12859-017-1775-9.

A classification approach to coreference in discharge summaries: 2011 i2b2 challenge.一种用于出院小结中核心参照的分类方法：2011 i2b2 挑战赛。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):897-905. doi: 10.1136/amiajnl-2011-000734. Epub 2012 Apr 13.

A supervised framework for resolving coreference in clinical records.一种用于解决临床记录中共指消解问题的有监督框架。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):875-82. doi: 10.1136/amiajnl-2012-000810. Epub 2012 May 19.

Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.从非结构化临床记录中提取症状的任务定义、标注数据集和监督自然语言处理模型。

J Biomed Inform. 2020 Feb;102:103354. doi: 10.1016/j.jbi.2019.103354. Epub 2019 Dec 12.

Coreference resolution: a review of general methodologies and applications in the clinical domain.共指消解：综述临床领域的通用方法及应用。

J Biomed Inform. 2011 Dec;44(6):1113-22. doi: 10.1016/j.jbi.2011.08.006. Epub 2011 Aug 12.

Lexical patterns, features and knowledge resources for coreference resolution in clinical notes.临床笔记中用于指代消解的词汇模式、特征和知识资源。

J Biomed Inform. 2012 Oct;45(5):901-12. doi: 10.1016/j.jbi.2012.02.012. Epub 2012 Mar 17.

Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.生物共指消解评分系统（Bio-SCoRes）：一种用于生物医学文本共指消解的混合架构

PLoS One. 2016 Mar 2;11(3):e0148538. doi: 10.1371/journal.pone.0148538. eCollection 2016.

Machine learning-based coreference resolution of concepts in clinical documents.基于机器学习的临床文档中概念的共指消解。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):883-7. doi: 10.1136/amiajnl-2011-000774. Epub 2012 May 12.

引用本文的文献

Unsupervised probabilistic models for sequential Electronic Health Records.无监督概率模型在连续电子健康记录中的应用。

J Biomed Inform. 2022 Oct;134:104163. doi: 10.1016/j.jbi.2022.104163. Epub 2022 Aug 28.

Clinical Text Data in Machine Learning: Systematic Review.机器学习中的临床文本数据：系统综述

JMIR Med Inform. 2020 Mar 31;8(3):e17984. doi: 10.2196/17984.

Making Sense of Big Textual Data for Health Care: Findings from the Section on Clinical Natural Language Processing.理解医疗保健领域的大文本数据：临床自然语言处理部分的研究结果。

Yearb Med Inform. 2017 Aug;26(1):228-234. doi: 10.15265/IY-2017-027. Epub 2017 Sep 11.

Correlating Lab Test Results in Clinical Notes with Structured Lab Data: A Case Study in HbA1c and Glucose.将临床记录中的实验室检查结果与结构化实验室数据相关联：糖化血红蛋白（HbA1c）和血糖的案例研究

AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:221-228. eCollection 2017.

A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text.一种基于主题模型的从生物医学文本中进行药物相互作用分类的框架。

AMIA Annu Symp Proc. 2017 Feb 10;2016:789-798. eCollection 2016.

本文引用的文献

ClearTK 2.0: Design Patterns for Machine Learning in UIMA.ClearTK 2.0：UIMA中机器学习的设计模式

LREC Int Conf Lang Resour Eval. 2014 May;2014:3289-3293.

Exploiting the potential of large databases of electronic health records for research using rapid search algorithms and an intuitive query interface.利用电子健康记录的大型数据库的潜力，通过快速搜索算法和直观的查询界面进行研究。

J Am Med Inform Assoc. 2014 Mar-Apr;21(2):292-8. doi: 10.1136/amiajnl-2013-001847. Epub 2013 Nov 22.

A rule based solution to co-reference resolution in clinical text.基于规则的临床文本共指消解解决方案。

J Am Med Inform Assoc. 2013 Sep-Oct;20(5):891-7. doi: 10.1136/amiajnl-2011-000770. Epub 2012 Oct 11.

Coreference analysis in clinical notes: a multi-pass sieve with alternate anaphora resolution modules.临床记录中的共指分析：一种带有交替回指解析模块的多遍筛选方法。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):867-74. doi: 10.1136/amiajnl-2011-000766. Epub 2012 Jun 16.

A supervised framework for resolving coreference in clinical records.一种用于解决临床记录中共指消解问题的有监督框架。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):875-82. doi: 10.1136/amiajnl-2012-000810. Epub 2012 May 19.

A classification approach to coreference in discharge summaries: 2011 i2b2 challenge.一种用于出院小结中核心参照的分类方法：2011 i2b2 挑战赛。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):897-905. doi: 10.1136/amiajnl-2011-000734. Epub 2012 Apr 13.

Lexical patterns, features and knowledge resources for coreference resolution in clinical notes.临床笔记中用于指代消解的词汇模式、特征和知识资源。

J Biomed Inform. 2012 Oct;45(5):901-12. doi: 10.1016/j.jbi.2012.02.012. Epub 2012 Mar 17.

Evaluating the state of the art in coreference resolution for electronic medical records.评估电子病历中核心参考解析的最新技术水平。

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):786-91. doi: 10.1136/amiajnl-2011-000784. Epub 2012 Feb 24.

Coreference resolution: a review of general methodologies and applications in the clinical domain.共指消解：综述临床领域的通用方法及应用。

J Biomed Inform. 2011 Dec;44(6):1113-22. doi: 10.1016/j.jbi.2011.08.006. Epub 2011 Aug 12.

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统（cTAKES）：架构、组件评估和应用。

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

临床笔记中指代消解的无限混合模型。

An Infinite Mixture Model for Coreference Resolution in Clinical Notes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献