• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用临床数据仓库中的特定信息提取来复制药物趋势研究。

Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse.

机构信息

Computer Science, Unviversity of Würzburg, Am Hubland, Würzburg, 97074, Germany.

Comprehensive Heart Failure Center, University and University Hospital Hospital of Würzburg, Am Schwarzenberg 15, Würzburg, 97078, Germany.

出版信息

BMC Med Inform Decis Mak. 2019 Jan 18;19(1):15. doi: 10.1186/s12911-018-0729-0.

DOI:10.1186/s12911-018-0729-0
PMID:30658633
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6339317/
Abstract

BACKGROUND

Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach of information extraction (IE) demands a high developmental effort, we used ad hoc IE instead. This technique queries information and extracts it on the fly from texts contained in the CDW.

METHODS

We present a generalizable approach of ad hoc IE for pharmacotherapy (medications and their daily dosage) presented in hospital discharge letters. We added import and query features to the CDW system, like error tolerant queries to deal with misspellings and proximity search for the extraction of the daily dosage. During the data integration process in the CDW, negated, historical and non-patient context data are filtered. For the replication studies, we used a drug list grouped by ATC (Anatomical Therapeutic Chemical Classification System) codes as input for queries to the CDW.

RESULTS

We achieve an F1 score of 0.983 (precision 0.997, recall 0.970) for extracting medication from discharge letters and an F1 score of 0.974 (precision 0.977, recall 0.972) for extracting the dosage. We replicated three published medical trend studies for hypertension, atrial fibrillation and chronic kidney disease. Overall, 93% of the main findings could be replicated, 68% of sub-findings, and 75% of all findings. One study could be completely replicated with all main and sub-findings.

CONCLUSION

A novel approach for ad hoc IE is presented. It is very suitable for basic medical texts like discharge letters and finding reports. Ad hoc IE is by definition more limited than conventional IE and does not claim to replace it, but it substantially exceeds the search capabilities of many CDWs and it is convenient to conduct replication studies fast and with high quality.

摘要

背景

药物趋势研究显示了多年来药物的变化,并可通过临床数据仓库 (CDW) 进行复制。即使在今天,许多患者信息,如电子病历中的药物数据,仍以自由文本的形式存储。由于传统的信息提取 (IE) 方法需要大量的开发工作,因此我们使用了特定于任务的 IE 方法。该技术从 CDW 中包含的文本中实时查询和提取信息。

方法

我们提出了一种可用于从医院出院记录中提取药物治疗(药物及其每日剂量)信息的特定于任务的 IE 通用方法。我们向 CDW 系统添加了导入和查询功能,例如容错查询来处理拼写错误,以及近邻搜索来提取每日剂量。在 CDW 中的数据集成过程中,过滤了否定、历史和非患者上下文数据。对于复制研究,我们使用按 ATC(解剖治疗化学分类系统)代码分组的药物列表作为查询 CDW 的输入。

结果

我们从出院记录中提取药物的 F1 得分为 0.983(精度 0.997,召回率 0.970),提取剂量的 F1 得分为 0.974(精度 0.977,召回率 0.972)。我们复制了三项已发表的医学趋势研究,分别为高血压、心房颤动和慢性肾病。总体而言,93%的主要发现可以复制,68%的次要发现可以复制,75%的所有发现可以复制。一项研究可以用所有主要和次要发现完全复制。

结论

提出了一种新的特定于任务的 IE 方法。它非常适合基本的医疗文本,如出院记录和发现报告。特定于任务的 IE 从定义上讲比传统的 IE 更有限,并不声称要取代它,但它大大超过了许多 CDW 的搜索能力,并且方便快速、高质量地进行复制研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/3c4a005fb6a2/12911_2018_729_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/b2ed1f23c59e/12911_2018_729_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/638f2d6e6349/12911_2018_729_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/3e2bf9986c25/12911_2018_729_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/8b9b41247546/12911_2018_729_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/624784f7fdaf/12911_2018_729_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/813392305f41/12911_2018_729_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/3c4a005fb6a2/12911_2018_729_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/b2ed1f23c59e/12911_2018_729_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/638f2d6e6349/12911_2018_729_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/3e2bf9986c25/12911_2018_729_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/8b9b41247546/12911_2018_729_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/624784f7fdaf/12911_2018_729_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/813392305f41/12911_2018_729_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb0a/6339317/3c4a005fb6a2/12911_2018_729_Fig7_HTML.jpg

相似文献

1
Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse.利用临床数据仓库中的特定信息提取来复制药物趋势研究。
BMC Med Inform Decis Mak. 2019 Jan 18;19(1):15. doi: 10.1186/s12911-018-0729-0.
2
Ad Hoc Information Extraction for Clinical Data Warehouses.临床数据仓库的临时信息提取
Methods Inf Med. 2018 May;57(1):e22-e29. doi: 10.3414/ME17-02-0010. Epub 2018 May 25.
3
Finding Needles in the Haystack: Identifying Patients with Rare Subtype of Multiple Myeloma Supported by a Data Warehouse and Information Extraction.大海捞针:借助数据仓库和信息提取识别多发性骨髓瘤罕见亚型患者
Stud Health Technol Inform. 2018;253:160-164.
4
Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse.改进全文搜索引擎:否定检测和家族病史背景对在生物医学数据仓库中识别病例的重要性。
J Am Med Inform Assoc. 2017 May 1;24(3):607-613. doi: 10.1093/jamia/ocw144.
5
Semantic integration of medication data into the EHOP Clinical Data Warehouse.将用药数据语义整合到EHOP临床数据仓库中。
Stud Health Technol Inform. 2015;210:702-6.
6
A Fast Healthcare Interoperability Resources (FHIR) layer implemented over i2b2.基于 i2b2 实现的快速医疗互操作性资源(FHIR)层。
BMC Med Inform Decis Mak. 2017 Aug 14;17(1):120. doi: 10.1186/s12911-017-0513-6.
7
Extending the Query Language of a Data Warehouse for Patient Recruitment.扩展用于患者招募的数据仓库查询语言。
Stud Health Technol Inform. 2017;243:152-156.
8
The Georges Pompidou University Hospital Clinical Data Warehouse: A 8-years follow-up experience.乔治·蓬皮杜大学医院临床数据仓库:8年随访经验
Int J Med Inform. 2017 Jun;102:21-28. doi: 10.1016/j.ijmedinf.2017.02.006. Epub 2017 Feb 16.
9
Extracting Structured Genotype Information from Free-Text HLA Reports Using a Rule-Based Approach.基于规则的方法从 HLA 报告的自由文本中提取结构化基因型信息。
J Korean Med Sci. 2020 Mar 30;35(12):e78. doi: 10.3346/jkms.2020.35.e78.
10
Information Extraction from Echocardiography Reports for a Clinical Follow-up Study-Comparison of Extracted Variables Intended for General Use in a Data Warehouse with Those Intended Specifically for the Study.用于临床随访研究的超声心动图报告信息提取——数据仓库中通用提取变量与专门用于该研究的提取变量的比较
Methods Inf Med. 2019 Nov;58(4-05):140-150. doi: 10.1055/s-0039-3402069. Epub 2020 Jan 30.

引用本文的文献

1
Structured, Harmonized, and Interoperable Integration of Clinical Routine Data to Compute Heart Failure Risk Scores.临床常规数据的结构化、协调化和可互操作集成以计算心力衰竭风险评分。
Life (Basel). 2022 May 18;12(5):749. doi: 10.3390/life12050749.
2
Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.电子健康记录能否用于研究抗癫痫药物的疗效和癫痫相关变量?当前方法综述。
Seizure. 2021 Feb;85:138-144. doi: 10.1016/j.seizure.2020.11.011. Epub 2021 Jan 13.

本文引用的文献

1
Ad Hoc Information Extraction for Clinical Data Warehouses.临床数据仓库的临时信息提取
Methods Inf Med. 2018 May;57(1):e22-e29. doi: 10.3414/ME17-02-0010. Epub 2018 May 25.
2
Antihypertensive Medications and Change in Stages of Chronic Kidney Disease.抗高血压药物与慢性肾脏病分期的变化
Int J Chronic Dis. 2018 Feb 25;2018:1382705. doi: 10.1155/2018/1382705. eCollection 2018.
3
Drug treatment of hypertension in Sweden in relation to sex, age, and comorbidity.瑞典高血压的药物治疗与性别、年龄和合并症有关。
J Clin Hypertens (Greenwich). 2018 Jan;20(1):106-114. doi: 10.1111/jch.13149. Epub 2017 Dec 8.
4
Clinical information extraction applications: A literature review.临床信息提取应用:文献综述。
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.
5
Current Trends of Hypertension Treatment in the United States.美国高血压治疗的现状。
Am J Hypertens. 2017 Oct 1;30(10):1008-1014. doi: 10.1093/ajh/hpx085.
6
Analysis of free text in electronic health records for identification of cancer patient trajectories.电子健康记录中自由文本的分析用于识别癌症患者轨迹。
Sci Rep. 2017 Apr 7;7:46226. doi: 10.1038/srep46226.
7
Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse.改进全文搜索引擎:否定检测和家族病史背景对在生物医学数据仓库中识别病例的重要性。
J Am Med Inform Assoc. 2017 May 1;24(3):607-613. doi: 10.1093/jamia/ocw144.
8
Increased use of oral anticoagulants in patients with atrial fibrillation: temporal trends from 2005 to 2015 in Denmark.丹麦 2005 年至 2015 年期间心房颤动患者口服抗凝剂使用增加:时间趋势。
Eur Heart J. 2017 Mar 21;38(12):899-906. doi: 10.1093/eurheartj/ehw658.
9
Non-vitamin K antagonist oral anticoagulation usage according to age among patients with atrial fibrillation: Temporal trends 2011-2015 in Denmark.丹麦 2011-2015 年非维生素 K 拮抗剂口服抗凝药物在房颤患者中的使用情况:按年龄分组的时间趋势
Sci Rep. 2016 Aug 11;6:31477. doi: 10.1038/srep31477.
10
1,500 scientists lift the lid on reproducibility.1500名科学家揭开了可重复性的盖子。
Nature. 2016 May 26;533(7604):452-4. doi: 10.1038/533452a.