临床文本中医用概念间关系的自动提取。

Automatic extraction of relations between medical concepts in clinical texts.

机构信息

Human Language Technology Research Institute, University of Texas at Dallas, Richardson, Texas 75083-0688, USA.

出版信息

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):594-600. doi: 10.1136/amiajnl-2011-000153.

DOI:10.1136/amiajnl-2011-000153

PMID:21846787

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3168312/

Abstract

OBJECTIVE

A supervised machine learning approach to discover relations between medical problems, treatments, and tests mentioned in electronic medical records.

MATERIALS AND METHODS

A single support vector machine classifier was used to identify relations between concepts and to assign their semantic type. Several resources such as Wikipedia, WordNet, General Inquirer, and a relation similarity metric inform the classifier.

RESULTS

The techniques reported in this paper were evaluated in the 2010 i2b2 Challenge and obtained the highest F1 score for the relation extraction task. When gold standard data for concepts and assertions were available, F1 was 73.7, precision was 72.0, and recall was 75.3. F1 is defined as 2PrecisionRecall/(Precision+Recall). Alternatively, when concepts and assertions were discovered automatically, F1 was 48.4, precision was 57.6, and recall was 41.7.

DISCUSSION

Although a rich set of features was developed for the classifiers presented in this paper, little knowledge mining was performed from medical ontologies such as those found in UMLS. Future studies should incorporate features extracted from such knowledge sources, which we expect to further improve the results. Moreover, each relation discovery was treated independently. Joint classification of relations may further improve the quality of results. Also, joint learning of the discovery of concepts, assertions, and relations may also improve the results of automatic relation extraction.

CONCLUSION

Lexical and contextual features proved to be very important in relation extraction from medical texts. When they are not available to the classifier, the F1 score decreases by 3.7%. In addition, features based on similarity contribute to a decrease of 1.1% when they are not available.

摘要

目的

采用有监督机器学习方法，发现电子病历中提到的医疗问题、治疗方法和检测之间的关系。

材料和方法

采用单支持向量机分类器来识别概念之间的关系，并为其分配语义类型。该分类器使用了 Wikipedia、WordNet、General Inquirer 等多种资源以及关系相似性度量标准。

结果

本文报道的技术在 2010 年 i2b2 挑战赛中进行了评估，在关系提取任务中获得了最高的 F1 分数。当有概念和断言的黄金标准数据时，F1 为 73.7，精度为 72.0，召回率为 75.3。F1 的定义为 2PrecisionRecall/(Precision+Recall)。或者，当自动发现概念和断言时，F1 为 48.4，精度为 57.6，召回率为 41.7。

讨论

尽管为本文提出的分类器开发了丰富的特征集，但从 UMLS 等医学本体中进行的知识挖掘很少。未来的研究应纳入从这些知识源中提取的特征，我们预计这将进一步提高结果。此外，每个关系发现都是独立处理的。关系的联合分类可能会进一步提高结果的质量。此外，概念、断言和关系的发现的联合学习也可能会提高自动关系提取的结果。

结论

词汇和上下文特征在从医学文本中提取关系时非常重要。当分类器无法获得这些特征时，F1 分数会降低 3.7%。此外，当无法获得基于相似性的特征时，F1 分数会降低 1.1%。

相似文献

Automatic extraction of relations between medical concepts in clinical texts.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):594-600. doi: 10.1136/amiajnl-2011-000153.

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

A flexible framework for deriving assertions from electronic medical records.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):568-73. doi: 10.1136/amiajnl-2011-000152. Epub 2011 Jul 1.

MITRE system for clinical assertion status classification.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):563-7. doi: 10.1136/amiajnl-2011-000164. Epub 2011 Apr 22.

A knowledge discovery and reuse pipeline for information extraction in clinical notes.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):574-9. doi: 10.1136/amiajnl-2011-000302. Epub 2011 Jul 7.

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):601-6. doi: 10.1136/amiajnl-2011-000163. Epub 2011 Apr 20.

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):588-93. doi: 10.1136/amiajnl-2011-000154. Epub 2011 May 19.

Automatic discourse connective detection in biomedical text.

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):800-8. doi: 10.1136/amiajnl-2011-000775. Epub 2012 Jun 28.

A supervised framework for resolving coreference in clinical records.

J Am Med Inform Assoc. 2012 Sep-Oct;19(5):875-82. doi: 10.1136/amiajnl-2012-000810. Epub 2012 May 19.

Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):557-62. doi: 10.1136/amiajnl-2011-000150. Epub 2011 May 12.

引用本文的文献

Enhancing pre-trained language model by answering natural questions for event extraction.

Front Artif Intell. 2025 Apr 24;8:1520290. doi: 10.3389/frai.2025.1520290. eCollection 2025.

Large Language Models in Biomedical and Health Informatics: A Review with Bibliometric Analysis.

J Healthc Inform Res. 2024 Sep 14;8(4):658-711. doi: 10.1007/s41666-024-00171-8. eCollection 2024 Dec.

Clinical Decision Support and Natural Language Processing in Medicine: Systematic Literature Review.

J Med Internet Res. 2024 Sep 30;26:e55315. doi: 10.2196/55315.

End-to-End -ary Relation Extraction for Combination Drug Therapies.

Proc (IEEE Int Conf Healthc Inform). 2023 Jun;2023:72-80. doi: 10.1109/ichi57859.2023.00021. Epub 2023 Dec 11.

BIR: Biomedical Information Retrieval System for Cancer Treatment in Electronic Health Record Using Transformers.

Sensors (Basel). 2023 Nov 23;23(23):9355. doi: 10.3390/s23239355.

Natural language processing to assess the epidemiology of delirium-suggestive behavioural disturbances in critically ill patients.

Crit Care Resusc. 2023 Oct 18;23(2):144-153. doi: 10.51893/2021.2.oa1. eCollection 2021 Jun.

Plant disease prescription recommendation based on electronic medical records and sentence embedding retrieval.

Plant Methods. 2023 Aug 26;19(1):91. doi: 10.1186/s13007-023-01070-6.

A Study of Factors Influencing the Volume of Responses to Posts in Physician Online Community.

Healthcare (Basel). 2023 Apr 29;11(9):1275. doi: 10.3390/healthcare11091275.

BERT-based Transfer Learning in Sentence-level Anatomic Classification of Free-Text Radiology Reports.

Radiol Artif Intell. 2023 Feb 15;5(2):e220097. doi: 10.1148/ryai.220097. eCollection 2023 Mar.

Using Artificial Intelligence Technology to Solve the Electronic Health Service by Processing the Online Case Information.

J Healthc Eng. 2021 Nov 26;2021:9637018. doi: 10.1155/2021/9637018. eCollection 2021.

本文引用的文献

A flexible framework for deriving assertions from electronic medical records.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):568-73. doi: 10.1136/amiajnl-2011-000152. Epub 2011 Jul 1.

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Proc AMIA Symp. 2001:17-21.

The Unified Medical Language System.

Methods Inf Med. 1993 Aug;32(4):281-91. doi: 10.1055/s-0038-1634945.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

临床文本中医用概念间关系的自动提取。

Automatic extraction of relations between medical concepts in clinical texts.

机构信息

Human Language Technology Research Institute, University of Texas at Dallas, Richardson, Texas 75083-0688, USA.

出版信息

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):594-600. doi: 10.1136/amiajnl-2011-000153.

DOI:10.1136/amiajnl-2011-000153

PMID:21846787

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3168312/

Abstract

OBJECTIVE

A supervised machine learning approach to discover relations between medical problems, treatments, and tests mentioned in electronic medical records.

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

摘要

目的

采用有监督机器学习方法，发现电子病历中提到的医疗问题、治疗方法和检测之间的关系。

临床文本中医用概念间关系的自动提取。

Automatic extraction of relations between medical concepts in clinical texts.

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料和方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

临床文本中医用概念间关系的自动提取。

Automatic extraction of relations between medical concepts in clinical texts.

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料和方法

结果

讨论

结论