混合方法提高临床文档信息获取：概念、断言和关系识别。

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

机构信息

LIMSI-CNRS, Orsay Cedex, France.

出版信息

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):588-93. doi: 10.1136/amiajnl-2011-000154. Epub 2011 May 19.

DOI:10.1136/amiajnl-2011-000154

PMID:21597105

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3168313/

Abstract

OBJECTIVE

This paper describes the approaches the authors developed while participating in the i2b2/VA 2010 challenge to automatically extract medical concepts and annotate assertions on concepts and relations between concepts.

DESIGN

The authors'approaches rely on both rule-based and machine-learning methods. Natural language processing is used to extract features from the input texts; these features are then used in the authors' machine-learning approaches. The authors used Conditional Random Fields for concept extraction, and Support Vector Machines for assertion and relation annotation. Depending on the task, the authors tested various combinations of rule-based and machine-learning methods.

RESULTS

The authors'assertion annotation system obtained an F-measure of 0.931, ranking fifth out of 21 participants at the i2b2/VA 2010 challenge. The authors' relation annotation system ranked third out of 16 participants with a 0.709 F-measure. The 0.773 F-measure the authors obtained on concept extraction did not make it to the top 10.

CONCLUSION

On the one hand, the authors confirm that the use of only machine-learning methods is highly dependent on the annotated training data, and thus obtained better results for well-represented classes. On the other hand, the use of only a rule-based method was not sufficient to deal with new types of data. Finally, the use of hybrid approaches combining machine-learning and rule-based approaches yielded higher scores.

摘要

目的

本文描述了作者在参与 i2b2/VA 2010 挑战赛时所开发的方法，这些方法用于自动提取医学概念，并对概念以及概念之间的关系进行断言标注。

设计

作者的方法依赖于基于规则和基于机器学习的方法。自然语言处理用于从输入文本中提取特征；然后，这些特征被用于作者的机器学习方法中。作者使用条件随机场进行概念提取，使用支持向量机进行断言和关系标注。根据任务的不同，作者测试了基于规则和基于机器学习的方法的各种组合。

结果

作者的断言标注系统在 i2b2/VA 2010 挑战赛的 21 个参赛队伍中排名第五，获得了 0.931 的 F 值。作者的关系标注系统在 16 个参赛队伍中排名第三，获得了 0.709 的 F 值。作者在概念提取方面获得的 0.773 的 F 值没有进入前 10 名。

结论

一方面，作者证实仅使用机器学习方法高度依赖于标注的训练数据，因此对于代表性良好的类别获得了更好的结果。另一方面，仅使用基于规则的方法不足以处理新类型的数据。最后，使用结合了机器学习和基于规则的方法的混合方法产生了更高的分数。

相似文献

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):588-93. doi: 10.1136/amiajnl-2011-000154. Epub 2011 May 19.

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):601-6. doi: 10.1136/amiajnl-2011-000163. Epub 2011 Apr 20.

A flexible framework for deriving assertions from electronic medical records.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):568-73. doi: 10.1136/amiajnl-2011-000152. Epub 2011 Jul 1.

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

Enhancing clinical concept extraction with distributional semantics.

J Biomed Inform. 2012 Feb;45(1):129-40. doi: 10.1016/j.jbi.2011.10.007. Epub 2011 Nov 7.

A knowledge discovery and reuse pipeline for information extraction in clinical notes.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):574-9. doi: 10.1136/amiajnl-2011-000302. Epub 2011 Jul 7.

MITRE system for clinical assertion status classification.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):563-7. doi: 10.1136/amiajnl-2011-000164. Epub 2011 Apr 22.

Using machine learning for concept extraction on clinical documents from multiple data sources.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):580-7. doi: 10.1136/amiajnl-2011-000155. Epub 2011 Jun 27.

Automatic extraction of relations between medical concepts in clinical texts.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):594-600. doi: 10.1136/amiajnl-2011-000153.

Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.

J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.

引用本文的文献

Chinese Clinical Named Entity Recognition with ALBERT and MHA Mechanism.

Evid Based Complement Alternat Med. 2022 May 23;2022:2056039. doi: 10.1155/2022/2056039. eCollection 2022.

DI++: A deep learning system for patient condition identification in clinical notes.

Artif Intell Med. 2022 Jan;123:102224. doi: 10.1016/j.artmed.2021.102224. Epub 2021 Dec 2.

Clinical Concept Extraction with Lexical Semantics to Support Automatic Annotation.

Int J Environ Res Public Health. 2021 Oct 9;18(20):10564. doi: 10.3390/ijerph182010564.

Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies.

J Biomed Semantics. 2020 Nov 16;11(1):14. doi: 10.1186/s13326-020-00231-z.

Differentiating Sense through Semantic Interaction Data.

AMIA Annu Symp Proc. 2017 Feb 10;2016:1238-1247. eCollection 2016.

Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning.

Biomed Inform Insights. 2016 May 22;8:11-8. doi: 10.4137/BII.S38308. eCollection 2016.

Automated Assessment of Medical Students' Clinical Exposures according to AAMC Geriatric Competencies.

AMIA Annu Symp Proc. 2014 Nov 14;2014:375-84. eCollection 2014.

Learning to identify treatment relations in clinical text.

AMIA Annu Symp Proc. 2014 Nov 14;2014:282-8. eCollection 2014.

Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text.

J Am Med Inform Assoc. 2015 Apr;22(e1):e162-76. doi: 10.1136/amiajnl-2014-002954. Epub 2014 Oct 21.

"Big data" and the electronic health record.

Yearb Med Inform. 2014 Aug 15;9(1):97-104. doi: 10.15265/IY-2014-0003.

本文引用的文献

High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge.

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):524-7. doi: 10.1136/jamia.2010.003939.

An overview of MetaMap: historical perspective and recent advances.

J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.

Rule-based information extraction from patients' clinical data.

J Biomed Inform. 2009 Oct;42(5):923-36. doi: 10.1016/j.jbi.2009.07.007. Epub 2009 Jul 29.

Machine learning and rule-based approaches to assertion classification.

J Am Med Inform Assoc. 2009 Jan-Feb;16(1):109-15. doi: 10.1197/jamia.M2950. Epub 2008 Oct 24.

Semantic structuring of and information extraction from medical documents using the UMLS.

Methods Inf Med. 2008;47(5):425-34. doi: 10.3414/me0508.

Lessons extracting diseases from discharge summaries.

AMIA Annu Symp Proc. 2007 Oct 11;2007:478-82.

Extracting information from textual documents in the electronic health record: a review of recent research.

Yearb Med Inform. 2008:128-44.

Comparing natural language processing tools to extract medical problems from narrative text.

AMIA Annu Symp Proc. 2005;2005:525-9.

Identifying important concepts from medical documents.

J Biomed Inform. 2006 Dec;39(6):668-79. doi: 10.1016/j.jbi.2006.02.001. Epub 2006 Mar 2.

Automated encoding of clinical documents based on natural language processing.

J Am Med Inform Assoc. 2004 Sep-Oct;11(5):392-402. doi: 10.1197/jamia.M1552. Epub 2004 Jun 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

混合方法提高临床文档信息获取：概念、断言和关系识别。

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

机构信息

LIMSI-CNRS, Orsay Cedex, France.

出版信息

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):588-93. doi: 10.1136/amiajnl-2011-000154. Epub 2011 May 19.

DOI:10.1136/amiajnl-2011-000154

PMID:21597105

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3168313/

Abstract

OBJECTIVE

DESIGN

RESULTS

CONCLUSION

摘要

目的

本文描述了作者在参与 i2b2/VA 2010 挑战赛时所开发的方法，这些方法用于自动提取医学概念，并对概念以及概念之间的关系进行断言标注。

混合方法提高临床文档信息获取：概念、断言和关系识别。

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

机构信息

出版信息

OBJECTIVE

DESIGN

RESULTS

CONCLUSION

目的

设计

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

混合方法提高临床文档信息获取：概念、断言和关系识别。

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

机构信息

出版信息

OBJECTIVE

DESIGN

RESULTS

CONCLUSION

目的

设计

结果

结论