Building longitudinal medication dose data using medication information extracted from clinical notes in electronic health records.

Affiliations

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Publication information

J Am Med Inform Assoc. 2021 Mar 18;28(4):782-790. doi: 10.1093/jamia/ocaa291.

DOI:10.1093/jamia/ocaa291

PMID:33338223

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7973457/

Abstract

OBJECTIVE

To develop an algorithm for building longitudinal medication dose datasets using information extracted from clinical notes in electronic health records (EHRs).

MATERIALS AND METHODS

We developed an algorithm that converts medication information extracted using natural language processing (NLP) into a usable format and builds longitudinal medication dose datasets. We evaluated the algorithm on 2 medications extracted from clinical notes of Vanderbilt's EHR and externally validated the algorithm using clinical notes from the MIMIC-III clinical care database.
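
The conversion step described here can be illustrated with a minimal sketch. It is not the implementation in the "EHR" R package: the `Mention` schema, the function names, and the rule that dose intake is strength times dose amount (with a missing amount treated as 1) and daily dose is dose intake times intakes per day are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Mention:
    """One medication mention as an NLP system might emit it (hypothetical schema)."""
    patient_id: str
    note_date: str                    # ISO date of the clinical note, e.g. "2020-01-05"
    strength_mg: Optional[float]      # e.g. 0.5 for "0.5 mg"
    dose_amount: Optional[float]      # units per intake, e.g. 2 for "2 tablets"
    intakes_per_day: Optional[float]  # e.g. 2 for "twice daily"


def dose_intake(m: Mention) -> Optional[float]:
    """Amount taken at one time, in mg. Assumption: a missing amount means 1 unit."""
    if m.strength_mg is None:
        return None
    amount = 1.0 if m.dose_amount is None else m.dose_amount
    return m.strength_mg * amount


def daily_dose(m: Mention) -> Optional[float]:
    """Total amount per day, in mg. Undefined when the frequency is unknown."""
    intake = dose_intake(m)
    if intake is None or m.intakes_per_day is None:
        return None
    return intake * m.intakes_per_day


def build_longitudinal(mentions: List[Mention]) -> List[dict]:
    """Return one row per mention, ordered by patient and then by note date."""
    ordered = sorted(mentions, key=lambda m: (m.patient_id, m.note_date))
    return [
        {
            "patient_id": m.patient_id,
            "note_date": m.note_date,
            "dose_intake_mg": dose_intake(m),
            "daily_dose_mg": daily_dose(m),
        }
        for m in ordered
    ]
```

A real pipeline would also have to pair extracted entities that belong to the same mention and resolve conflicting or redundant mentions within a note; the sketch assumes that pairing has already been done.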

RESULTS

For the evaluation using Vanderbilt's EHR data, the performance of our algorithm was excellent; F1-measures were ≥0.98 for both dose intake and daily dose. For the external validation using MIMIC-III, the algorithm achieved F1-measures ≥0.85 for dose intake and ≥0.82 for daily dose.
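
For readers unfamiliar with the metric, the F1-measure is the harmonic mean of precision and recall. The sketch below shows one way it could be computed against a gold standard. Representing each dose record as a hashable tuple and requiring exact matches are assumptions made here, not details taken from the study.

```python
from typing import Hashable, Iterable, Tuple


def precision_recall_f1(
    predicted: Iterable[Hashable], gold: Iterable[Hashable]
) -> Tuple[float, float, float]:
    """Compare predicted dose records with gold-standard records by exact match."""
    predicted_set, gold_set = set(predicted), set(gold)
    true_positives = len(predicted_set & gold_set)
    precision = true_positives / len(predicted_set) if predicted_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical records: (patient_id, note_date, daily_dose_mg)
gold = [("p1", "2020-01-05", 1.0), ("p1", "2020-02-01", 4.0),
        ("p2", "2020-01-10", 2.0), ("p2", "2020-03-15", 3.0)]
predicted = [("p1", "2020-01-05", 1.0), ("p1", "2020-02-01", 4.0),
             ("p2", "2020-01-10", 2.0), ("p2", "2020-03-15", 6.0)]
precision, recall, f1 = precision_recall_f1(predicted, gold)
```

In this made-up example three of the four predicted records match the gold standard, so precision, recall, and F1 all equal 0.75.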

DISCUSSION

Our algorithm addresses the challenge of building longitudinal medication dose data using information extracted from clinical notes. Overall performance was excellent, but the algorithm can perform poorly when incorrect information is extracted by NLP systems. Although it performed reasonably well when applied to the external data source, its performance was worse due to differences in the way the drug information was written. The algorithm is implemented in the R package, "EHR," and the extracted data from Vanderbilt's EHRs along with the gold standards are provided so that users can reproduce the results and help improve the algorithm.

CONCLUSION

Our algorithm for building longitudinal dose data provides a straightforward way to use EHR data for medication-based studies. The external validation results suggest its potential for applicability to other systems.

Similar articles

medExtractR: A targeted, customizable approach to medication extraction from electronic health records.

J Am Med Inform Assoc. 2020 Mar 1;27(3):407-418. doi: 10.1093/jamia/ocz207.

Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records.

Health Serv Res. 2023 Dec;58(6):1292-1302. doi: 10.1111/1475-6773.14210. Epub 2023 Aug 3.

A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.

Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.

Identifying Information Gaps in Electronic Health Records by Using Natural Language Processing: Gynecologic Surgery History Identification.

J Med Internet Res. 2022 Jan 28;24(1):e29015. doi: 10.2196/29015.

Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source.

BMC Med Inform Decis Mak. 2019 Jul 25;19(1):143. doi: 10.1186/s12911-019-0864-2.

A method for cohort selection of cardiovascular disease records from an electronic health record system.

Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.

Integrating existing natural language processing tools for medication extraction from discharge summaries.

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):528-31. doi: 10.1136/jamia.2010.003855.

Medication Extraction from Electronic Clinical Notes in an Integrated Health System: A Study on Aspirin Use in Patients with Nonvalvular Atrial Fibrillation.

Clin Ther. 2015 Sep;37(9):2048-2058.e2. doi: 10.1016/j.clinthera.2015.07.002. Epub 2015 Jul 29.

Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0).

Drug Saf. 2019 Jan;42(1):99-111. doi: 10.1007/s40264-018-0762-z.

Cited by

Using electronic health records for clinical pharmacology research: Challenges and considerations.

Clin Transl Sci. 2024 Jul;17(7):e13871. doi: 10.1111/cts.13871.

Sensitivity of estimated tacrolimus population pharmacokinetic profile to assumed dose timing and absorption in real-world data and simulated data.

Br J Clin Pharmacol. 2022 Jun;88(6):2863-2874. doi: 10.1111/bcp.15218. Epub 2022 Jan 27.

A natural language processing pipeline to synthesize patient-generated notes toward improving remote care and chronic disease management: a cystic fibrosis case study.

JAMIA Open. 2021 Sep 29;4(3):ooab084. doi: 10.1093/jamiaopen/ooab084. eCollection 2021 Jul.
