Linguistic approach for identification of medication names and related information in clinical narratives.

Author affiliation

UFR SMBH Léonard de Vinci, Université Paris 13, 93017 Bobigny Cedex, France.

Publication information

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):549-54. doi: 10.1136/jamia.2010.004036.

DOI: 10.1136/jamia.2010.004036

PMID: 20819862

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2995681/

Abstract

BACKGROUND

Pharmacotherapy is an integral part of any medical care process and plays an important role in the medical history of most patients. Information on medication is crucial for several tasks, such as pharmacovigilance, medical decision-making, and biomedical research.

OBJECTIVES

Within a narrative text, medication-related information can be buried among other, non-relevant data. Specific methods, such as those provided by text mining, must be designed to access this information; designing such a method is the objective of this study.

METHODS

The authors designed a system that analyzes narrative clinical documents to extract medication occurrences and medication-related information. The system also attempts to deduce medications not covered by the dictionaries used.
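The abstract does not describe the system's internals, so the following is only a minimal sketch of the general idea: look up medication names in a dictionary, attach nearby dosage and frequency expressions, and guess at unlisted medications. The lexicon, the regular expressions, the 40-character window, and the "unknown word followed by a dosage" heuristic are all assumptions made for illustration, not the authors' rules, and `zorvaxin` in the usage note is an invented placeholder name.

```python
import re

# Hypothetical toy lexicon; a real system would load full drug dictionaries.
DRUG_LEXICON = {"aspirin", "metformin", "lisinopril"}

# Illustrative patterns only (assumptions, not the authors' grammar).
DOSAGE_RE = re.compile(r"\b\d+(?:\.\d+)?\s?(?:mg|mcg|g|ml|units?)\b", re.IGNORECASE)
FREQUENCY_RE = re.compile(
    r"\b(?:once daily|twice daily|daily|bid|tid|qid|every \d+ hours)\b",
    re.IGNORECASE,
)
WORD_RE = re.compile(r"[A-Za-z][A-Za-z-]*")


def extract_medications(sentence):
    """Return one dict per medication mention found in the sentence."""
    results = []
    for match in WORD_RE.finditer(sentence):
        token = match.group(0)
        rest = sentence[match.end():]
        in_lexicon = token.lower() in DRUG_LEXICON
        # Crude guess at unlisted drugs: a word of four or more letters
        # that is immediately followed by a dosage expression.
        inferred = (
            not in_lexicon
            and len(token) >= 4
            and DOSAGE_RE.match(rest.lstrip()) is not None
        )
        if not (in_lexicon or inferred):
            continue
        # Look for attributes in a short window after the mention.
        window = rest[:40]
        dosage = DOSAGE_RE.search(window)
        frequency = FREQUENCY_RE.search(window)
        results.append({
            "name": token,
            "source": "lexicon" if in_lexicon else "inferred",
            "dosage": dosage.group(0) if dosage else None,
            "frequency": frequency.group(0) if frequency else None,
        })
    return results
```

Calling `extract_medications("Patient was started on aspirin 81 mg daily and zorvaxin 20 mg bid.")` would return one entry for `aspirin` from the lexicon and one for `zorvaxin` marked as inferred. The inference heuristic is deliberately naive and would also flag ordinary words that happen to precede a dosage, which is one reason real systems need richer linguistic context.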

RESULTS

The system's results were evaluated within the framework of the I2B2 NLP challenge held in 2009. The system achieved an F-measure of 0.78 and ranked 7th out of 20 participating teams (the highest F-measure was 0.86). It performed well on the annotation and extraction of medication names, frequency, dosage, and mode of administration (F-measure over 0.81), but poorly on duration and reasons (F-measures of 0.36 and 0.29, respectively). The performance of the system was stable between the training and test sets.
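For readers unfamiliar with the metric, the sketch below computes an F-measure as the harmonic mean of precision and recall. It assumes the balanced (F1) form; the abstract does not state which variant or matching criteria the challenge used, nor the underlying precision and recall, so the counts in the usage note are hypothetical.

```python
def f_measure(true_positives, false_positives, false_negatives):
    """Balanced F-measure (F1): harmonic mean of precision and recall."""
    predicted = true_positives + false_positives
    expected = true_positives + false_negatives
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / expected if expected else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

With hypothetical counts of 78 correct extractions, 22 spurious ones, and 22 missed ones, precision and recall are both 0.78, so `f_measure(78, 22, 22)` is also approximately 0.78. Many other precision/recall combinations would give the same score.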
