• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子健康记录数据中纵向观测的统计质量评估方法及其在 VA 百万老兵计划中的应用。

A statistical quality assessment method for longitudinal observations in electronic health record data with an application to the VA million veteran program.

机构信息

Department of Veterans Affairs, Cooperative Studies Program Palo Alto Coordinating Center, 701B North Shoreline Blvd, Mountain View, CA, 94043, USA.

Department of Medicine, Stanford University School of Medicine, 1265 Welch Road, Stanford, CA, 94305-5464, USA.

出版信息

BMC Med Inform Decis Mak. 2021 Oct 20;21(1):289. doi: 10.1186/s12911-021-01643-2.

DOI:10.1186/s12911-021-01643-2
PMID:34670548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8529838/
Abstract

BACKGROUND

To describe an automated method for assessment of the plausibility of continuous variables collected in the electronic health record (EHR) data for real world evidence research use.

METHODS

The most widely used approach in quality assessment (QA) for continuous variables is to detect the implausible numbers using prespecified thresholds. In augmentation to the thresholding method, we developed a score-based method that leverages the longitudinal characteristics of EHR data for detection of the observations inconsistent with the history of a patient. The method was applied to the height and weight data in the EHR from the Million Veteran Program Data from the Veteran's Healthcare Administration (VHA). A validation study was also conducted.

RESULTS

The receiver operating characteristic (ROC) metrics of the developed method outperforms the widely used thresholding method. It is also demonstrated that different quality assessment methods have a non-ignorable impact on the body mass index (BMI) classification calculated from height and weight data in the VHA's database.

CONCLUSIONS

The score-based method enables automated and scaled detection of the problematic data points in health care big data while allowing the investigators to select the high-quality data based on their need. Leveraging the longitudinal characteristics in EHR will significantly improve the QA performance.

摘要

背景

描述一种自动化方法,用于评估电子健康记录(EHR)数据中用于真实世界证据研究的连续变量的合理性。

方法

质量评估(QA)中最常用的连续变量方法是使用预设阈值检测不合理的数字。除了阈值方法之外,我们还开发了一种基于评分的方法,利用 EHR 数据的纵向特征来检测与患者病史不一致的观察值。该方法应用于退伍军人医疗保健管理局(VHA)百万退伍军人计划数据中的 EHR 中的身高和体重数据。还进行了一项验证研究。

结果

开发的方法的接收者操作特征(ROC)指标优于广泛使用的阈值方法。还表明,不同的质量评估方法对 VHA 数据库中身高和体重数据计算的体重指数(BMI)分类有不可忽视的影响。

结论

基于评分的方法能够在医疗保健大数据中自动、规模化地检测有问题的数据点,同时允许研究人员根据自己的需求选择高质量的数据。利用 EHR 的纵向特征将显著提高 QA 性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c4/8529838/bd173fa037d8/12911_2021_1643_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c4/8529838/1c480560d538/12911_2021_1643_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c4/8529838/bd173fa037d8/12911_2021_1643_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c4/8529838/1c480560d538/12911_2021_1643_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c4/8529838/bd173fa037d8/12911_2021_1643_Fig2_HTML.jpg

相似文献

1
A statistical quality assessment method for longitudinal observations in electronic health record data with an application to the VA million veteran program.电子健康记录数据中纵向观测的统计质量评估方法及其在 VA 百万老兵计划中的应用。
BMC Med Inform Decis Mak. 2021 Oct 20;21(1):289. doi: 10.1186/s12911-021-01643-2.
2
Big Data in the Veterans Health Administration: A Nursing Informatics Perspective.退伍军人健康管理局的大数据:护理信息学视角。
J Nurs Scholarsh. 2021 May;53(3):288-295. doi: 10.1111/jnu.12631. Epub 2021 Mar 10.
3
A clustering approach for detecting implausible observation values in electronic health records data.一种用于检测电子健康记录数据中不合理观测值的聚类方法。
BMC Med Inform Decis Mak. 2019 Jul 23;19(1):142. doi: 10.1186/s12911-019-0852-6.
4
Dynamic ElecTronic hEalth reCord deTection (DETECT) of individuals at risk of a first episode of psychosis: a case-control development and validation study.动态电子健康记录检测(DETECT)对首发精神病风险个体的识别:一项病例对照研究。
Lancet Digit Health. 2020 May;2(5):e229-e239. doi: 10.1016/S2589-7500(20)30024-8. Epub 2020 Mar 26.
5
Automating Electronic Health Record Data Quality Assessment.自动化电子健康记录数据质量评估。
J Med Syst. 2023 Feb 13;47(1):23. doi: 10.1007/s10916-022-01892-2.
6
Measuring Exposure to Incarceration Using the Electronic Health Record.使用电子健康记录测量监禁暴露。
Med Care. 2019 Jun;57 Suppl 6 Suppl 2(Suppl 6 2):S157-S163. doi: 10.1097/MLR.0000000000001049.
7
Impacts of an Electronic Health Record Transition on Veterans Health Administration Health Professions Trainee Experience.电子健康记录过渡对退伍军人健康管理局卫生专业实习生体验的影响。
J Gen Intern Med. 2023 Oct;38(Suppl 4):1031-1039. doi: 10.1007/s11606-023-08283-4. Epub 2023 Oct 5.
8
Leveraging Electronic Health Care Record Information to Measure Pressure Ulcer Risk in Veterans With Spinal Cord Injury: A Longitudinal Study Protocol.利用电子医疗记录信息评估脊髓损伤退伍军人的压疮风险:一项纵向研究方案。
JMIR Res Protoc. 2017 Jan 19;6(1):e3. doi: 10.2196/resprot.5948.
9
Depression Quality of Care: Measuring Quality over Time Using VA Electronic Medical Record Data.抑郁症护理质量:利用退伍军人事务部电子病历数据随时间衡量质量。
J Gen Intern Med. 2016 Apr;31 Suppl 1(Suppl 1):36-45. doi: 10.1007/s11606-015-3563-4.
10
Automated safety event monitoring using electronic medical records in a clinical trial setting: Validation study using the VA NEPHRON-D trial.在临床试验环境中使用电子病历进行自动化安全事件监测:使用 VA NEPHRON-D 试验进行验证研究。
Clin Trials. 2019 Feb;16(1):81-89. doi: 10.1177/1740774518813121. Epub 2018 Nov 16.

引用本文的文献

1
Data quality assessment in healthcare, dimensions, methods and tools: a systematic review.医疗保健中的数据质量评估:维度、方法与工具——一项系统综述
BMC Med Inform Decis Mak. 2025 Aug 9;25(1):296. doi: 10.1186/s12911-025-03136-y.
2
Challenges for Data Quality in the Clinical Data Life Cycle: Systematic Review.临床数据生命周期中数据质量面临的挑战:系统评价
J Med Internet Res. 2025 Apr 23;27:e60709. doi: 10.2196/60709.
3
Electronic Health Record Data Quality and Performance Assessments: Scoping Review.电子健康记录数据质量和性能评估:范围综述。

本文引用的文献

1
Quality assessment of real-world data repositories across the data life cycle: A literature review.贯穿数据生命周期的真实世界数据存储库质量评估:文献综述。
J Am Med Inform Assoc. 2021 Jul 14;28(7):1591-1599. doi: 10.1093/jamia/ocaa340.
2
A Rule-Based Data Quality Assessment System for Electronic Health Record Data.基于规则的数据质量评估系统在电子健康记录数据中的应用。
Appl Clin Inform. 2020 Aug;11(4):622-634. doi: 10.1055/s-0040-1715567. Epub 2020 Sep 23.
3
Comprehensive comparative effectiveness and safety of first-line antihypertensive drug classes: a systematic, multinational, large-scale analysis.
JMIR Med Inform. 2024 Nov 6;12:e58130. doi: 10.2196/58130.
4
Predicting post-liver transplant outcomes in patients with acute-on-chronic liver failure using Expert-Augmented Machine Learning.使用专家增强机器学习预测慢加急性肝衰竭患者肝移植术后结局。
Am J Transplant. 2023 Dec;23(12):1908-1921. doi: 10.1016/j.ajt.2023.08.022. Epub 2023 Aug 30.
5
Optimization of the Electronic Health Record for Research.用于研究的电子健康记录的优化
Ann Surg Open. 2023 Jun 13;4(2):e297. doi: 10.1097/AS9.0000000000000297. eCollection 2023 Jun.
6
Data quality control in longitudinal epidemiologic studies: conditional studentized residuals from linear mixed effects models for outlier detection in the setting of pediatric chronic kidney disease.纵向流行病学研究中的数据质量控制:小儿慢性肾脏病背景下线性混合效应模型条件学生化残差在异常值检测中的应用。
Ann Epidemiol. 2023 Sep;85:38-44. doi: 10.1016/j.annepidem.2023.07.005. Epub 2023 Jul 16.
7
DQAgui: a graphical user interface for the MIRACUM data quality assessment tool.DQAgui:MIRACUM 数据质量评估工具的图形用户界面。
BMC Med Inform Decis Mak. 2022 Aug 11;22(1):213. doi: 10.1186/s12911-022-01961-z.
一线降压药类别全面比较效果和安全性:系统的、多国的、大规模分析。
Lancet. 2019 Nov 16;394(10211):1816-1826. doi: 10.1016/S0140-6736(19)32317-7. Epub 2019 Oct 24.
4
Feasibility analysis of conducting observational studies with the electronic health record.电子健康记录开展观察性研究的可行性分析。
BMC Med Inform Decis Mak. 2019 Oct 28;19(1):202. doi: 10.1186/s12911-019-0939-0.
5
Incrementally Transforming Electronic Medical Records into the Observational Medical Outcomes Partnership Common Data Model: A Multidimensional Quality Assurance Approach.逐步将电子病历转化为观察性医疗结局伙伴关系通用数据模型:一种多维质量保证方法。
Appl Clin Inform. 2019 Oct;10(5):794-803. doi: 10.1055/s-0039-1697598. Epub 2019 Oct 23.
6
Developing real-world comparators for clinical trials in chemotherapy-refractory patients with gastric cancer or gastroesophageal junction cancer.为化疗耐药的胃癌或胃食管交界癌患者的临床试验开发真实世界对照。
Gastric Cancer. 2020 Jan;23(1):133-141. doi: 10.1007/s10120-019-01008-9. Epub 2019 Sep 23.
7
First-line treatment of essential hypertension: A real-world analysis across four antihypertensive treatment classes.原发性高血压的一线治疗:四大降压治疗类别中的真实世界分析。
J Clin Hypertens (Greenwich). 2019 May;21(5):627-634. doi: 10.1111/jch.13531. Epub 2019 Apr 13.
8
A Data Quality Assessment Guideline for Electronic Health Record Data Reuse.电子健康记录数据复用的数据质量评估指南
EGEMS (Wash DC). 2017 Sep 4;5(1):14. doi: 10.5334/egems.218.
9
A Comparison of Data Quality Assessment Checks in Six Data Sharing Networks.六个数据共享网络中数据质量评估检查的比较
EGEMS (Wash DC). 2017 Jun 12;5(1):8. doi: 10.5334/egems.223.
10
Population trends in the 10-year incidence and prevalence of diabetic retinopathy in the UK: a cohort study in the Clinical Practice Research Datalink 2004-2014.英国 10 年内糖尿病视网膜病变发病率和患病率的人口趋势:2004-2014 年临床实践研究数据链接中的队列研究。
BMJ Open. 2017 Feb 28;7(2):e014444. doi: 10.1136/bmjopen-2016-014444.