• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将电子病历中的结构化和非结构化数据相结合,创建临床定义的电子病历衍生队列。

Combining structured and unstructured data in EMRs to create clinically-defined EMR-derived cohorts.

机构信息

Centre for Translational Data Science, The University of Sydney, Sydney, Australia.

Northern Clinical School, The University of Sydney, Sydney, Australia.

出版信息

BMC Med Inform Decis Mak. 2021 Mar 8;21(1):91. doi: 10.1186/s12911-021-01441-w.

DOI:10.1186/s12911-021-01441-w
PMID:33685456
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7938556/
Abstract

BACKGROUND

There have been few studies describing how production EMR systems can be systematically queried to identify clinically-defined populations and limited studies utilising free-text in this process. The aim of this study is to provide a generalisable methodology for constructing clinically-defined EMR-derived patient cohorts using structured and unstructured data in EMRs.

METHODS

Patients with possible acute coronary syndrome (ACS) were used as an exemplar. Cardiologists defined clinical criteria for patients presenting with possible ACS. These were mapped to data tables within the production EMR system creating seven inclusion criteria comprised of structured data fields (orders and investigations, procedures, scanned electrocardiogram (ECG) images, and diagnostic codes) and unstructured clinical documentation. Data were extracted from two local health districts (LHD) in Sydney, Australia. Outcome measures included examination of the relative contribution of individual inclusion criteria to the identification of eligible encounters, comparisons between inclusion criterion and evaluation of consistency of data extracts across years and LHDs.

RESULTS

Among 802,742 encounters in a 5 year dataset (1/1/13-30/12/17), the presence of an ECG image (54.8% of encounters) and symptoms and keywords in clinical documentation (41.4-64.0%) were used most often to identify presentations of possible ACS. Orders and investigations (27.3%) and procedures (1.4%), were less often present for identified presentations. Relevant ICD-10/SNOMED CT codes were present for 3.7% of identified encounters. Similar trends were seen when the two LHDs were examined separately, and across years.

CONCLUSIONS

Clinically-defined EMR-derived cohorts combining structured and unstructured data during cohort identification is a necessary prerequisite for critical validation work required for development of real-time clinical decision support and learning health systems.

摘要

背景

描述如何系统地查询生产电子病历 (EMR) 系统以识别临床定义人群的研究较少,利用这一过程中的自由文本进行的研究也有限。本研究的目的是提供一种可推广的方法,用于使用 EMR 中的结构化和非结构化数据构建临床定义的 EMR 衍生患者队列。

方法

以可能的急性冠状动脉综合征 (ACS) 患者为例。心脏病专家为可能出现 ACS 的患者定义了临床标准。这些标准被映射到生产 EMR 系统中的数据表中,创建了七个包含结构化数据字段(医嘱和检查、程序、扫描心电图 (ECG) 图像和诊断代码)和非结构化临床文档的纳入标准。数据从澳大利亚悉尼的两个地方卫生区 (LHD) 中提取。结果测量包括检查各个纳入标准对确定合格就诊的相对贡献,以及对纳入标准的比较和对跨年份和 LHD 的数据提取的一致性的评估。

结果

在 5 年数据集(2013 年 1 月 1 日至 2017 年 12 月 30 日)中的 802,742 次就诊中,最常用于识别可能 ACS 表现的是 ECG 图像的存在(54.8%的就诊)和临床文档中的症状和关键字(41.4-64.0%)。对于确定的表现,医嘱和检查(27.3%)和程序(1.4%)不太常见。确定的就诊中,相关的 ICD-10/SNOMED CT 代码存在 3.7%。当分别检查两个 LHD 以及跨年份时,也出现了类似的趋势。

结论

在队列识别过程中结合结构化和非结构化数据进行临床定义的 EMR 衍生队列是开发实时临床决策支持和学习健康系统所需的关键验证工作的必要前提。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/fe9d2315c1bf/12911_2021_1441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/6f1a17f2f812/12911_2021_1441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/cd084a4953e3/12911_2021_1441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/9443fadaa356/12911_2021_1441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/b2c7064c3b4c/12911_2021_1441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/fe9d2315c1bf/12911_2021_1441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/6f1a17f2f812/12911_2021_1441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/cd084a4953e3/12911_2021_1441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/9443fadaa356/12911_2021_1441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/b2c7064c3b4c/12911_2021_1441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f54/7938556/fe9d2315c1bf/12911_2021_1441_Fig5_HTML.jpg

相似文献

1
Combining structured and unstructured data in EMRs to create clinically-defined EMR-derived cohorts.将电子病历中的结构化和非结构化数据相结合,创建临床定义的电子病历衍生队列。
BMC Med Inform Decis Mak. 2021 Mar 8;21(1):91. doi: 10.1186/s12911-021-01441-w.
2
Novel methodology to measure pre-procedure antimicrobial prophylaxis: integrating text searches with structured data from the Veterans Health Administration's electronic medical record.一种新颖的方法来测量术前抗菌预防措施:将文本搜索与退伍军人健康管理局电子病历中的结构化数据相结合。
BMC Med Inform Decis Mak. 2020 Jan 30;20(1):15. doi: 10.1186/s12911-020-1031-5.
3
Building a Cohort of Transgender and Nonbinary Patients from the Electronic Medical Record.从电子病历中建立跨性别和非二进制患者队列。
LGBT Health. 2024 May-Jun;11(4):310-316. doi: 10.1089/lgbt.2022.0107. Epub 2023 Dec 28.
4
A hybrid solution for extracting structured medical information from unstructured data in medical records via a double-reading/entry system.一种通过双读/录入系统从病历中的非结构化数据提取结构化医学信息的混合解决方案。
BMC Med Inform Decis Mak. 2016 Aug 30;16(1):114. doi: 10.1186/s12911-016-0357-5.
5
Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records.电子病历中索赔数据对成人先天性心脏病的识别和分类的准确性。
Arch Cardiovasc Dis. 2019 Jan;112(1):31-43. doi: 10.1016/j.acvd.2018.07.002. Epub 2019 Jan 3.
6
A rule-based electronic phenotyping algorithm for detecting clinically relevant cardiovascular disease cases.一种用于检测临床相关心血管疾病病例的基于规则的电子表型分析算法。
BMC Res Notes. 2017 Jul 14;10(1):281. doi: 10.1186/s13104-017-2600-2.
7
Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study.利用临床记录的自然语言处理技术识别HIV感染者中的精神疾病和药物使用情况:回顾性队列研究
JMIR Med Inform. 2021 Mar 10;9(3):e23456. doi: 10.2196/23456.
8
An eClinical trial system for cancer that integrates with clinical pathways and electronic medical records.癌症电子临床试验系统,与临床路径和电子病历集成。
Clin Trials. 2012 Aug;9(4):408-17. doi: 10.1177/1740774512445912. Epub 2012 May 17.
9
Interventions to improve the use of EMRs in primary health care: a systematic review and meta-analysis.改善基层医疗保健中电子病历使用情况的干预措施:一项系统评价与荟萃分析。
BMJ Health Care Inform. 2019 May;26(1):0. doi: 10.1136/bmjhci-2019-000023.
10
Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.电子病历的自动化数据提取:数据挖掘构建胃肠病学临床试验入组研究数据库的有效性。
Ups J Med Sci. 2022 Jan 27;127. doi: 10.48101/ujms.v127.8260. eCollection 2022.

引用本文的文献

1
Migraine headache (MH) classification using machine learning methods with data augmentation.使用机器学习方法并结合数据增强技术进行偏头痛(MH)分类。
Sci Rep. 2024 Mar 2;14(1):5180. doi: 10.1038/s41598-024-55874-0.
2
Usability Study of an Electronic Medical Record From the Nurse Practitioners' Practice: A Qualitative Study Using the Think-Aloud Technique.来自执业护士实践的电子病历可用性研究:一项采用出声思考技术的定性研究。
Cureus. 2023 Jul 9;15(7):e41603. doi: 10.7759/cureus.41603. eCollection 2023 Jul.
3
Validation and Improvement of a Convolutional Neural Network to Predict the Involved Pathology in a Head and Neck Surgery Cohort.

本文引用的文献

1
Derivation and validation of a computable phenotype for acute decompensated heart failure in hospitalized patients.基于住院患者的急性失代偿性心力衰竭的可计算表型的推导和验证。
BMC Med Inform Decis Mak. 2020 May 7;20(1):85. doi: 10.1186/s12911-020-1092-5.
2
Considerations for Improving the Portability of Electronic Health Record-Based Phenotype Algorithms.提高基于电子健康记录的表型算法便携性的考量因素。
AMIA Annu Symp Proc. 2020 Mar 4;2019:755-764. eCollection 2019.
3
Defining Major Depressive Disorder Cohorts Using the EHR: Multiple Phenotypes Based on ICD-9 Codes and Medication Orders.
验证和改进卷积神经网络以预测头颈部手术队列中的受累病理学。
Int J Environ Res Public Health. 2022 Sep 26;19(19):12200. doi: 10.3390/ijerph191912200.
4
Harnessing Electronic Medical Records in Cardiovascular Clinical Practice and Research.利用电子病历进行心血管临床实践和研究。
J Cardiovasc Transl Res. 2023 Jun;16(3):546-556. doi: 10.1007/s12265-022-10313-1. Epub 2022 Sep 14.
5
Objective risk assessment vs standard care for acute coronary syndromes-The Australian GRACE Risk tool Implementation Study (AGRIS): a process evaluation.急性冠状动脉综合征的客观风险评估与标准治疗——澳大利亚GRACE风险工具实施研究(AGRIS):一项过程评估
BMC Health Serv Res. 2022 Mar 22;22(1):380. doi: 10.1186/s12913-022-07750-8.
使用电子健康记录定义重度抑郁症队列:基于ICD - 9编码和用药医嘱的多种表型
Neurol Psychiatry Brain Res. 2020 Jun;36:18-26. doi: 10.1016/j.npbr.2020.02.002. Epub 2020 Feb 21.
4
An overview of clinical decision support systems: benefits, risks, and strategies for success.临床决策支持系统概述:益处、风险及成功策略。
NPJ Digit Med. 2020 Feb 6;3:17. doi: 10.1038/s41746-020-0221-y. eCollection 2020.
5
Artificial intelligence approaches using natural language processing to advance EHR-based clinical research.利用自然语言处理技术的人工智能方法来推进基于电子健康记录的临床研究。
J Allergy Clin Immunol. 2020 Feb;145(2):463-469. doi: 10.1016/j.jaci.2019.12.897. Epub 2019 Dec 26.
6
Deep learning in clinical natural language processing: a methodical review.深度学习在临床自然语言处理中的应用:系统综述。
J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.
7
The use of structured data elements to identify ASCVD patients with statin-associated side effects: Insights from the Department of Veterans Affairs.使用结构化数据元素识别接受他汀类药物治疗的 ASCVD 患者的副作用:来自美国退伍军人事务部的见解。
J Clin Lipidol. 2019 Sep-Oct;13(5):797-803.e1. doi: 10.1016/j.jacl.2019.08.002. Epub 2019 Aug 9.
8
Validation of an Electronic Medical Record-Based Algorithm for Identifying Posttraumatic Stress Disorder in U.S. Veterans.基于电子病历的美国退伍军人创伤后应激障碍识别算法的验证。
J Trauma Stress. 2019 Apr;32(2):226-237. doi: 10.1002/jts.22399.
9
Identification of urinary tract infections using electronic health record data.利用电子健康记录数据识别尿路感染。
Am J Infect Control. 2019 Apr;47(4):371-375. doi: 10.1016/j.ajic.2018.10.009. Epub 2018 Dec 4.
10
Using Electronic Health Records To Generate Phenotypes For Research.利用电子健康记录生成用于研究的表型。
Curr Protoc Hum Genet. 2019 Jan;100(1):e80. doi: 10.1002/cphg.80. Epub 2018 Dec 5.