• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于评估电子病历数据以供研究使用的数据质量评估框架。

Data quality assessment framework to assess electronic medical record data for use in research.

作者信息

Reimer Andrew P, Milinovich Alex, Madigan Elizabeth A

机构信息

Frances Payne Bolton School of Nursing, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106, United States; Cleveland Clinic, 10900 Euclid Avenue, Cleveland, OH 44195, United States.

Cleveland Clinic, 10900 Euclid Avenue, Cleveland, OH 44195, United States.

出版信息

Int J Med Inform. 2016 Jun;90:40-7. doi: 10.1016/j.ijmedinf.2016.03.006. Epub 2016 Mar 24.

DOI:10.1016/j.ijmedinf.2016.03.006
PMID:27103196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4845906/
Abstract

INTRODUCTION

The proliferation and use of electronic medical records (EMR) in the clinical setting now provide a rich source of clinical data that can be leveraged to support research on patient outcomes, comparative effectiveness, and health systems research. Once the large volume and variety of data that robust clinical EMRs provide is aggregated, the suitability of the data for research purposes must be addressed. Therefore, the purpose of this paper is two-fold. First, we present a stepwise framework capable of guiding initial data quality assessment when matching multiple data sources regardless of context or application. Then, we demonstrate a use case of initial analysis of a longitudinal data repository of electronic health record data that illustrates the first four steps of the framework, and report results.

METHODS

A six-step data quality assessment framework is proposed and described that includes the following data quality assessment steps: (1) preliminary analysis, (2) documentation-longitudinal concordance, (3) breadth, (4) data element presence, (5) density, and (6) prediction. The six-step framework was applied to the Transport Data Mart-a data repository that contains over 28,000 records for patients that underwent interhospital transfer that includes EMRs from the sending hospitalization, transport, and receiving hospitalization.

RESULTS

There were a total of 9557 log entries of which 8139 were successfully matched to corresponding hospital encounters. 2832 were successfully mapped to both the sending and receiving hospital encounters (resulting in a 93% automatic matching rate), with 590 including air medical transport EMR data representing a complete case for testing. Results from Step 2 indicate that once records are identified and matched, there appears to be relatively limited drop-off of additional records when the criteria for matching increases, indicating the a proportion of records consistently contain nearly complete data. Measures of central tendency used in Step 3 and 4 exhibit a right skewness suggesting that a small proportion of records contain the highest number of repeated measures for the measured variables.

CONCLUSIONS

The proposed six-step data quality assessment framework is useful in establishing the metadata for a longitudinal data repository that can be replicated by other studies. There are practical issues that need to be addressed including the data quality assessments-with the most prescient being the need to establish data quality metrics for benchmarking acceptable levels of EMR data inclusiveness through testing and application.

摘要

引言

电子病历(EMR)在临床环境中的广泛应用和使用,如今提供了丰富的临床数据来源,可用于支持关于患者预后、比较疗效以及卫生系统研究。一旦汇总了强大的临床电子病历所提供的大量且多样的数据,就必须解决这些数据用于研究目的的适用性问题。因此,本文的目的有两个。首先,我们提出一个逐步框架,能够在匹配多个数据源时指导初始数据质量评估,无论其背景或应用如何。然后,我们展示一个电子健康记录数据纵向存储库初始分析的用例,该用例说明了框架的前四个步骤,并报告结果。

方法

提出并描述了一个六步数据质量评估框架,包括以下数据质量评估步骤:(1)初步分析,(2)文档 - 纵向一致性,(3)广度,(4)数据元素存在情况,(5)密度,以及(6)预测。该六步框架应用于运输数据集市——一个数据存储库,其中包含超过28000条接受院间转运患者的记录,包括来自转出医院、转运过程以及转入医院的电子病历。

结果

总共有9557条日志条目,其中8139条成功匹配到相应的医院就诊记录。2832条成功映射到转出和转入医院的就诊记录(自动匹配率为93%),其中590条包括空中医疗转运电子病历数据,可作为完整案例进行测试。步骤2的结果表明,一旦识别并匹配记录,当匹配标准提高时,额外记录的减少似乎相对有限,这表明一定比例的记录始终包含几乎完整的数据。步骤3和4中使用的集中趋势度量呈现右偏态,表明一小部分记录包含所测量变量的重复测量次数最多。

结论

所提出的六步数据质量评估框架有助于为纵向数据存储库建立元数据,其他研究可对其进行复制。存在一些实际问题需要解决,包括数据质量评估——其中最具先见之明的是需要通过测试和应用建立数据质量指标,以基准化电子病历数据可接受的包容水平。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/2f8c3257ba3f/nihms774712f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/9e190f748ccc/nihms774712f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/18affdd19c1f/nihms774712f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/2f8c3257ba3f/nihms774712f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/9e190f748ccc/nihms774712f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/18affdd19c1f/nihms774712f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d64/4845906/2f8c3257ba3f/nihms774712f3.jpg

相似文献

1
Data quality assessment framework to assess electronic medical record data for use in research.用于评估电子病历数据以供研究使用的数据质量评估框架。
Int J Med Inform. 2016 Jun;90:40-7. doi: 10.1016/j.ijmedinf.2016.03.006. Epub 2016 Mar 24.
2
Development of electronic medical records for clinical and research purposes: the breast cancer module using an implementation framework in a middle income country- Malaysia.为临床和研究目的开发电子病历:在中等收入国家马来西亚使用实施框架的乳腺癌模块。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):402. doi: 10.1186/s12859-018-2406-9.
3
Hospices' use of electronic medical records for quality assessment and performance improvement programs.临终关怀机构在质量评估和绩效改进项目中对电子病历的使用。
J Pain Symptom Manage. 2014 Oct;48(4):582-9. doi: 10.1016/j.jpainsymman.2013.11.010. Epub 2014 Mar 15.
4
A method for cohort selection of cardiovascular disease records from an electronic health record system.一种从电子健康记录系统中选择心血管疾病记录队列的方法。
Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.
5
Developing and implementing an interoperable document-based electronic health record.开发并实施一个基于文档的可互操作电子健康记录系统。
Stud Health Technol Inform. 2013;192:1169.
6
Development and validation of method for defining conditions using Chinese electronic medical record.利用中国电子病历定义条件的方法的开发与验证
BMC Med Inform Decis Mak. 2016 Aug 20;16:110. doi: 10.1186/s12911-016-0348-6.
7
Documentation and coding of ED patient encounters: an evaluation of the accuracy of an electronic medical record.急诊患者诊疗记录与编码:电子病历准确性评估
Am J Emerg Med. 2006 Oct;24(6):664-78. doi: 10.1016/j.ajem.2006.02.005.
8
Impacts of structuring the electronic health record: Results of a systematic literature review from the perspective of secondary use of patient data.构建电子健康记录的影响:从患者数据二次使用角度进行的系统文献综述结果
Int J Med Inform. 2017 Jan;97:293-303. doi: 10.1016/j.ijmedinf.2016.10.004. Epub 2016 Oct 4.
9
Usability and Safety in Electronic Medical Records Interface Design: A Review of Recent Literature and Guideline Formulation.电子病历界面设计中的可用性与安全性:近期文献综述与指南制定
Hum Factors. 2015 Aug;57(5):805-34. doi: 10.1177/0018720815576827. Epub 2015 Mar 23.
10
Description and comparison of documentation of nursing assessment between paper-based and electronic systems in Australian aged care homes.澳大利亚老年护理院内基于纸质系统和电子系统的护理评估文档的描述和比较。
Int J Med Inform. 2013 Sep;82(9):789-97. doi: 10.1016/j.ijmedinf.2013.05.002. Epub 2013 Jun 17.

引用本文的文献

1
Data quality assessment in healthcare, dimensions, methods and tools: a systematic review.医疗保健中的数据质量评估:维度、方法与工具——一项系统综述
BMC Med Inform Decis Mak. 2025 Aug 9;25(1):296. doi: 10.1186/s12911-025-03136-y.
2
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
3
Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System.将人群健康数据整合到OMOP通用数据模型中:以内罗毕城市卫生和人口监测系统的COVID-19血清学监测数据为例
Front Digit Health. 2025 Jan 28;7:1423621. doi: 10.3389/fdgth.2025.1423621. eCollection 2025.
4
Assessing the capture of sociodemographic information in electronic medical records to inform clinical decision making.评估电子病历中社会人口学信息的获取情况,以为临床决策提供依据。
PLoS One. 2025 Jan 17;20(1):e0317599. doi: 10.1371/journal.pone.0317599. eCollection 2025.
5
Enhancing Data Completeness in Early Detection Pathway of Prostate Cancer: Integration of a Dashboard-Driven Feedback Tool to Improve Quality of Care.提高前列腺癌早期检测途径中的数据完整性:整合仪表盘驱动的反馈工具以改善医疗质量。
J Clin Med. 2024 Dec 11;13(24):7529. doi: 10.3390/jcm13247529.
6
Development and initial validation of a data quality evaluation tool in obstetrics real-world data through HL7-FHIR interoperable Bayesian networks and expert rules.通过HL7-FHIR可互操作贝叶斯网络和专家规则开发并初步验证产科真实世界数据中的数据质量评估工具
JAMIA Open. 2024 Jul 27;7(3):ooae062. doi: 10.1093/jamiaopen/ooae062. eCollection 2024 Oct.
7
Development of a quantitative index system for evaluating the quality of electronic medical records in disease risk intelligent prediction.开发用于评估疾病风险智能预测中电子病历质量的定量指标体系。
BMC Med Inform Decis Mak. 2024 Jun 24;24(1):178. doi: 10.1186/s12911-024-02533-z.
8
Assessing readiness to use electronic health record data for outcome ascertainment in clinical trials - A case study.评估使用电子健康记录数据进行临床试验结局判定的准备情况——一个案例研究。
Contemp Clin Trials. 2024 Jul;142:107572. doi: 10.1016/j.cct.2024.107572. Epub 2024 May 11.
9
A Standard Framework for Evaluating Large Health Care Data and Related Resources.用于评估大型医疗保健数据和相关资源的标准框架。
MMWR Suppl. 2024 May 9;73(3):1-13. doi: 10.15585/mmwr.su7303a1.
10
The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset.澳大利亚初级保健数据中的 OMOP 通用数据模型:构建一个高质量、可用于研究的协调数据集。
PLoS One. 2024 Apr 18;19(4):e0301557. doi: 10.1371/journal.pone.0301557. eCollection 2024.

本文引用的文献

1
NATIONAL INCIDENCE OF MEDICAL TRANSFER: PATIENT CHARACTERISTICS AND REGIONAL VARIATION.医疗转运的全国发病率:患者特征与地区差异
J Health Hum Serv Adm. 2016 Spring;38(4):509-28.
2
Structured data quality reports to improve EHR data quality.用于提高电子健康记录(EHR)数据质量的结构化数据质量报告。
Int J Med Inform. 2015 Dec;84(12):1094-8. doi: 10.1016/j.ijmedinf.2015.09.008. Epub 2015 Oct 9.
3
Developing a fully integrated medical transport record to support comparative effectiveness research for patients undergoing medical transport.开发一个完全集成的医疗运输记录,以支持对接受医疗运输的患者进行比较效果研究。
EGEMS (Wash DC). 2013 Dec 18;1(3):1024. doi: 10.13063/2327-9214.1024. eCollection 2013.
4
Recommendations for the use of operational electronic health record data in comparative effectiveness research.关于在比较效果研究中使用电子健康记录操作数据的建议。
EGEMS (Wash DC). 2013 Oct 8;1(1):1018. doi: 10.13063/2327-9214.1018. eCollection 2013.
5
Improving record linkage performance in the presence of missing linkage data.在存在缺失链接数据的情况下提高记录链接性能。
J Biomed Inform. 2014 Dec;52:43-54. doi: 10.1016/j.jbi.2014.01.016. Epub 2014 Feb 10.
6
Defining and measuring completeness of electronic health records for secondary use.定义和衡量电子健康记录在二次使用中的完整性。
J Biomed Inform. 2013 Oct;46(5):830-6. doi: 10.1016/j.jbi.2013.06.010. Epub 2013 Jun 29.
7
Caveats for the use of operational electronic health record data in comparative effectiveness research.使用操作性电子健康记录数据进行比较有效性研究的注意事项。
Med Care. 2013 Aug;51(8 Suppl 3):S30-7. doi: 10.1097/MLR.0b013e31829b1dbd.
8
Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research.电子健康记录数据质量评估的方法和维度:为临床研究提供可重用性。
J Am Med Inform Assoc. 2013 Jan 1;20(1):144-51. doi: 10.1136/amiajnl-2011-000681. Epub 2012 Jun 25.
9
The methods of comparative effectiveness research.比较疗效研究的方法。
Annu Rev Public Health. 2012 Apr;33:425-45. doi: 10.1146/annurev-publhealth-031811-124610. Epub 2012 Jan 3.
10
Quantifying the longitudinal value of healthcare record collections for pharmacoepidemiology.量化用于药物流行病学的医疗记录集的纵向价值。
AMIA Annu Symp Proc. 2011;2011:1318-25. Epub 2011 Oct 22.