Reducing Bias Due to Outcome Misclassification for Epidemiologic Studies Using EHR-derived Probabilistic Phenotypes.

Affiliation

From the Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, Pennsylvania.

Publication information

Epidemiology. 2020 Jul;31(4):542-550. doi: 10.1097/EDE.0000000000001193.

DOI: 10.1097/EDE.0000000000001193
PMID: 32282406
Abstract

Epidemiologic studies using electronic health record (EHR)-derived phenotypes as outcomes are subject to bias due to phenotyping error. In the case of dichotomous phenotypes, existing methods for misclassified outcomes can be used to reduce bias. In this article, we present a bias correction approach for EHR-derived probabilistic phenotypes: continuous predicted probabilities of the outcome of interest. This approach makes use of correction factors that can be computed by hand and do not require specialized software. We used simulation studies to investigate the performance of the proposed approach under a variety of scenarios for accuracy of the probabilistic phenotype, strength of the outcome/exposure association, and prevalence of the outcome of interest. Across all scenarios investigated, the proposed approach substantially reduced bias in association parameter estimates relative to a naive approach. We demonstrate the application of this approach to a study of pediatric type 2 diabetes using data from the PEDSnet network of children's hospitals. This straightforward correction factor can substantially reduce bias and improve the validity of EHR-based epidemiology.
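The abstract does not state the correction factor itself, so the following is only a minimal sketch of the general idea, not the authors' estimator. It assumes a simplified setting: a risk-difference scale, a binary exposure, and non-differential phenotyping error (the phenotype depends on the true outcome only). Under those assumptions, E[P | X] = mu0 + (mu1 - mu0) * Pr(Y = 1 | X), where mu1 and mu0 are the mean phenotype values among true cases and true non-cases, so the naive risk difference is attenuated by the factor (mu1 - mu0) and can be rescaled by hand. All names, distributions, and numeric settings below are hypothetical choices made for illustration.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is reproducible


def mean(values):
    return sum(values) / len(values)


# Simulate tuples of (exposure x, true outcome y, probabilistic phenotype p).
# Every numeric setting here is an illustrative assumption.
rows = []
for _ in range(100_000):
    x = 1 if rng.random() < 0.5 else 0
    y = 1 if rng.random() < (0.10 + 0.10 * x) else 0  # true risk difference = 0.10
    # The phenotype depends on y only (non-differential error):
    # mean 2/3 among true cases, mean 1/6 among true non-cases.
    p = rng.betavariate(4, 2) if y else rng.betavariate(1, 5)
    rows.append((x, y, p))


def risk_difference(data, column):
    """Mean of `column` among exposed minus mean among unexposed."""
    exposed = [r[column] for r in data if r[0] == 1]
    unexposed = [r[column] for r in data if r[0] == 0]
    return mean(exposed) - mean(unexposed)


true_rd = risk_difference(rows, 1)   # uses the true outcome (unknown in practice)
naive_rd = risk_difference(rows, 2)  # treats the phenotype as if it were the outcome

# Hypothetical validation subset: assume the true outcome is known
# (for example, from chart review) for the first 5,000 patients.
validation = rows[:5000]
mu1 = mean([p for _, y, p in validation if y == 1])
mu0 = mean([p for _, y, p in validation if y == 0])

# Rescale the naive estimate by the attenuation factor (mu1 - mu0).
corrected_rd = naive_rd / (mu1 - mu0)

print(f"true RD      = {true_rd:.3f}")
print(f"naive RD     = {naive_rd:.3f}")
print(f"corrected RD = {corrected_rd:.3f}")
```

In this toy setup the naive estimate is pulled toward zero by roughly the factor (mu1 - mu0), and dividing by that factor moves it back toward the true value. The article concerns association parameters more generally, so its correction factors may take a different form.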

Similar articles

1
Reducing Bias Due to Outcome Misclassification for Epidemiologic Studies Using EHR-derived Probabilistic Phenotypes.
Epidemiology. 2020 Jul;31(4):542-550. doi: 10.1097/EDE.0000000000001193.
2
3
An augmented estimation procedure for EHR-based association studies accounting for differential misclassification.
J Am Med Inform Assoc. 2020 Feb 1;27(2):244-253. doi: 10.1093/jamia/ocz180.
4
Leveraging error-prone algorithm-derived phenotypes: Enhancing association studies for risk factors in EHR data.
J Biomed Inform. 2024 Sep;157:104690. doi: 10.1016/j.jbi.2024.104690. Epub 2024 Jul 14.
5
Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification.
Biometrics. 2022 Mar;78(1):214-226. doi: 10.1111/biom.13400. Epub 2020 Dec 3.
6
Inflation of type I error rates due to differential misclassification in EHR-derived outcomes: Empirical illustration using breast cancer recurrence.
Pharmacoepidemiol Drug Saf. 2019 Feb;28(2):264-268. doi: 10.1002/pds.4680. Epub 2018 Oct 30.
7
A cost-effective chart review sampling design to account for phenotyping error in electronic health records (EHR) data.
J Am Med Inform Assoc. 2021 Dec 28;29(1):52-61. doi: 10.1093/jamia/ocab222.
8
Studying pediatric health outcomes with electronic health records using Bayesian clustering and trajectory analysis.
J Biomed Inform. 2021 Jan;113:103654. doi: 10.1016/j.jbi.2020.103654. Epub 2020 Dec 11.
9
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
10
SAT: a Surrogate-Assisted Two-wave case boosting sampling method, with application to EHR-based association studies.
J Am Med Inform Assoc. 2022 Apr 13;29(5):918-927. doi: 10.1093/jamia/ocab267.

Cited by

1
Sensitivity Analysis for Binary Outcome Misclassification in Randomization Tests via Integer Programming.
J Comput Graph Stat. 2025 Apr 17. doi: 10.1080/10618600.2025.2461222.
2
Malaria Incidence in US Military Families Is Related to Service Member's Birthplace.
Open Forum Infect Dis. 2025 Aug 11;12(8):ofaf479. doi: 10.1093/ofid/ofaf479. eCollection 2025 Aug.
3
Optimal Surrogate-Assisted Sampling for Cost-Efficient Validation of Electronic Health Record Outcomes.
Stat Med. 2025 May;44(10-12):e70095. doi: 10.1002/sim.70095.
4
Leveraging undecided cases in chart-reviewed phenotypes to enhance EHR-based association studies.
J Biomed Inform. 2025 Jun;166:104839. doi: 10.1016/j.jbi.2025.104839. Epub 2025 Apr 30.
5
Synthetic surrogates improve power for genome-wide association studies of partially missing phenotypes in population biobanks.
Nat Genet. 2024 Jul;56(7):1527-1536. doi: 10.1038/s41588-024-01793-9. Epub 2024 Jun 13.
6
Development and validation of an electronic health records-based opioid use disorder algorithm by expert clinical adjudication among patients with prescribed opioids.
Pharmacoepidemiol Drug Saf. 2023 May;32(5):577-585. doi: 10.1002/pds.5591. Epub 2023 Jan 4.
7
Machine learning approaches for electronic health records phenotyping: a methodical review.
J Am Med Inform Assoc. 2023 Jan 18;30(2):367-381. doi: 10.1093/jamia/ocac216.
8
Core concepts in pharmacoepidemiology: Validation of health outcomes of interest within real-world healthcare databases.
Pharmacoepidemiol Drug Saf. 2023 Jan;32(1):1-8. doi: 10.1002/pds.5537. Epub 2022 Sep 14.
9
Why Is the Electronic Health Record So Challenging for Research and Clinical Care?
Methods Inf Med. 2021 May;60(1-02):32-48. doi: 10.1055/s-0041-1731784. Epub 2021 Jul 19.