• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过数据溯源评估结构化电子健康记录数据与参考术语的一致性以及数据完整性。

Assessing the harmonization of structured electronic health record data to reference terminologies and data completeness through data provenance.

作者信息

Marsolo Keith, Curtis Lesley, Qualls Laura, Xu Jennifer, Zhang Yinghong, Phillips Thomas, Hill C Larry, Sanders Gretchen, Maro Judith C, Kiernan Daniel, Draper Christine, Coughlin Kevin, Dutcher Sarah K, Hernández-Muñoz José J, Falconer Monique

机构信息

Department of Population Health Sciences Duke University School of Medicine Durham North Carolina USA.

Duke Clinical Research Institute Duke University School of Medicine Durham North Carolina USA.

出版信息

Learn Health Syst. 2024 Oct 21;9(2):e10468. doi: 10.1002/lrh2.10468. eCollection 2025 Apr.

DOI:10.1002/lrh2.10468
PMID:40247903
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12000768/
Abstract

INTRODUCTION

(1) Assess the harmonization of structured electronic health record data (laboratory results and medications) to reference terminologies and characterize the severity of issues. (2) Identify issues of data completeness by comparing complementary data domains, stratifying by time, care setting, and provenance.

METHODS

Queries were distributed to 3 Data Partners (DP). Using harmonization queries, we examined the top 200 laboratory results and medications by volume, identifying outliers and computing summary statistics. The completeness queries looked at 4 conditions of interest and related clinical concepts. Counts were generated for each condition, stratified by year, encounter type, and provenance. We analyzed trends over time within and across DPs.

RESULTS

We found that the median number of codes associated with a given laboratory/medication name (and vice versa) generally met expectations, though there were DP-specific issues that resulted in outliers. In addition, there were drastic differences in the percentage of patients with a given concept depending on provenance.

CONCLUSIONS

The harmonization queries surfaced several mapping errors, as well as issues with overly specific codes and records with "null" codes. The completeness queries demonstrated having access to multiple types of data provenance provides more robust results compared with any single provenance type. Harmonization errors between source data and reference terminologies may not be widespread but do exist within CDMs, affecting tens of thousands or even millions of records. Provenance information can help identify potential completeness issues with EHR data, but only if it is represented in the CDM and then populated by DPs.

摘要

引言

(1)评估结构化电子健康记录数据(实验室检查结果和用药情况)与参考术语的一致性,并描述问题的严重程度。(2)通过比较互补数据域,按时间、护理环境和来源进行分层,识别数据完整性问题。

方法

向3个数据合作伙伴(DP)分发查询。使用一致性查询,我们按数量检查了前200项实验室检查结果和用药情况,识别异常值并计算汇总统计数据。完整性查询关注4种感兴趣的病症及相关临床概念。针对每种病症生成计数,并按年份、就诊类型和来源进行分层。我们分析了各DP内部和之间随时间的趋势。

结果

我们发现,与给定实验室检查/用药名称相关的代码中位数(反之亦然)总体上符合预期,不过存在特定于DP的问题导致出现异常值。此外,根据来源不同,患有特定概念病症的患者百分比存在巨大差异。

结论

一致性查询揭示了几个映射错误,以及过于具体的代码和带有“空”代码的记录所存在的问题。完整性查询表明,与任何单一来源类型相比,获取多种类型的数据来源能提供更可靠的结果。源数据与参考术语之间的一致性错误可能并不普遍,但在临床数据模型(CDM)中确实存在,影响到数万甚至数百万条记录。来源信息有助于识别电子健康记录数据潜在的完整性问题,但前提是它要在CDM中体现并由DP填充。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/23014f564707/LRH2-9-e10468-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/961dc095b2f8/LRH2-9-e10468-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/21be02b807eb/LRH2-9-e10468-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/1c258f916f7f/LRH2-9-e10468-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/23014f564707/LRH2-9-e10468-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/961dc095b2f8/LRH2-9-e10468-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/21be02b807eb/LRH2-9-e10468-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/1c258f916f7f/LRH2-9-e10468-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e88b/12000768/23014f564707/LRH2-9-e10468-g003.jpg

相似文献

1
Assessing the harmonization of structured electronic health record data to reference terminologies and data completeness through data provenance.通过数据溯源评估结构化电子健康记录数据与参考术语的一致性以及数据完整性。
Learn Health Syst. 2024 Oct 21;9(2):e10468. doi: 10.1002/lrh2.10468. eCollection 2025 Apr.
2
How the provenance of electronic health record data matters for research: a case example using system mapping.电子健康记录数据的来源对研究为何重要:一个使用系统映射的案例
EGEMS (Wash DC). 2014 Apr 16;2(1):1058. doi: 10.13063/2327-9214.1058. eCollection 2014.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
An alternative database approach for management of SNOMED CT and improved patient data queries.一种用于管理医学系统命名法临床术语(SNOMED CT)及改进患者数据查询的替代数据库方法。
J Biomed Inform. 2015 Oct;57:350-7. doi: 10.1016/j.jbi.2015.08.016. Epub 2015 Aug 21.
5
A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization.用于语义协调的SNOMED CT概念内容密度的比较分析。
Artif Intell Med. 2015 May;64(1):29-40. doi: 10.1016/j.artmed.2015.03.002. Epub 2015 Apr 2.
6
Comparative Analysis and Data Provenance for 1,113 Bacterial Genome Assemblies.对 1113 个细菌基因组组装的比较分析和数据溯源。
mSphere. 2022 Jun 29;7(3):e0007722. doi: 10.1128/msphere.00077-22. Epub 2022 May 2.
7
Provenance Information for Biomedical Data and Workflows: Scoping Review.生物医学数据和工作流程的出处信息:范围综述。
J Med Internet Res. 2024 Aug 23;26:e51297. doi: 10.2196/51297.
8
Methodological Issues in Using a Common Data Model of COVID-19 Vaccine Uptake and Important Adverse Events of Interest: Feasibility Study of Data and Connectivity COVID-19 Vaccines Pharmacovigilance in the United Kingdom.使用新冠疫苗接种和重要关注不良事件通用数据模型的方法学问题:英国数据与连接性新冠疫苗药物警戒可行性研究
JMIR Form Res. 2022 Aug 22;6(8):e37821. doi: 10.2196/37821.
9
Harmonization process for the identification of medical events in eight European healthcare databases: the experience from the EU-ADR project.在八个欧洲医疗保健数据库中识别医疗事件的协调过程:来自 EU-ADR 项目的经验。
J Am Med Inform Assoc. 2013 Jan 1;20(1):184-92. doi: 10.1136/amiajnl-2012-000933. Epub 2012 Sep 6.
10
Assessing the Effect of Electronic Health Record Data Quality on Identifying Patients With Type 2 Diabetes: Cross-Sectional Study.评估电子健康记录数据质量对识别2型糖尿病患者的影响:横断面研究。
JMIR Med Inform. 2024 Aug 27;12:e56734. doi: 10.2196/56734.

本文引用的文献

1
Systematic data quality assessment of electronic health record data to evaluate study-specific fitness: Report from the PRESERVE research study.电子健康记录数据的系统数据质量评估以评估特定研究适用性:PRESERVE研究报告
PLOS Digit Health. 2024 Jun 27;3(6):e0000527. doi: 10.1371/journal.pdig.0000527. eCollection 2024 Jun.
2
Electronic health record data quality assessment and tools: a systematic review.电子健康记录数据质量评估及工具:系统综述。
J Am Med Inform Assoc. 2023 Sep 25;30(10):1730-1740. doi: 10.1093/jamia/ocad120.
3
Now is the time to fix the evidence generation system.
现在是时候修复证据生成系统了。
Clin Trials. 2023 Feb;20(1):3-12. doi: 10.1177/17407745221147689. Epub 2023 Jan 17.
4
Mis-mappings between a producer's quantitative test codes and LOINC codes and an algorithm for correcting them.生产商的定量测试代码与 LOINC 代码之间的映射错误及纠正算法。
J Am Med Inform Assoc. 2023 Jan 18;30(2):301-307. doi: 10.1093/jamia/ocac215.
5
The SHOnet learning health system: Infrastructure for continuous learning in pediatric rehabilitation.SHOnet学习健康系统:儿科康复持续学习的基础设施。
Learn Health Syst. 2022 Feb 15;6(3):e10305. doi: 10.1002/lrh2.10305. eCollection 2022 Jul.
6
Real-world data: Assessing electronic health records and medical claims data to support regulatory decision-making for drug and biological products.真实世界数据:评估电子健康记录和医疗理赔数据以支持药品和生物制品的监管决策。
Pharmacoepidemiol Drug Saf. 2022 Jul;31(7):717-720. doi: 10.1002/pds.5444. Epub 2022 May 3.
7
Developing real-world evidence from real-world data: Transforming raw data into analytical datasets.从真实世界数据中生成真实世界证据:将原始数据转化为分析数据集。
Learn Health Syst. 2021 Oct 14;6(1):e10293. doi: 10.1002/lrh2.10293. eCollection 2022 Jan.
8
Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use.基于预期用途,制定一种系统的方法来评估临床数据二次使用中的数据质量。
Learn Health Syst. 2021 May 3;6(1):e10264. doi: 10.1002/lrh2.10264. eCollection 2022 Jan.
9
Increasing trust in real-world evidence through evaluation of observational data quality.通过评估观察性数据质量来增加对真实世界证据的信任。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2251-2257. doi: 10.1093/jamia/ocab132.
10
The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment.国家 COVID 队列协作组织(N3C):原理、设计、基础设施和部署。
J Am Med Inform Assoc. 2021 Mar 1;28(3):427-443. doi: 10.1093/jamia/ocaa196.