• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

选择和表型错误分类情况下电子健康记录数据的偏差减少与推断:三个案例研究

Bias reduction and inference for electronic health record data under selection and phenotype misclassification: three case studies.

作者信息

Beesley Lauren J, Mukherjee Bhramar

机构信息

University of Michigan, Department of Biostatistics.

出版信息

medRxiv. 2020 Dec 23:2020.12.21.20248644. doi: 10.1101/2020.12.21.20248644.

DOI:10.1101/2020.12.21.20248644
PMID:33398299
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7781342/
Abstract

Electronic Health Records (EHR) are not designed for population-based research, but they provide access to longitudinal health information for many individuals. Many statistical methods have been proposed to account for selection bias, missing data, phenotyping errors, or other problems that arise in EHR data analysis. However, addressing multiple sources of bias simultaneously is challenging. Recently, we developed a methodological framework (R package, ) for jointly handling both selection bias and phenotype misclassification in the EHR setting that leverages external data sources. These methods assume factors related to selection and misclassification are fully observed, but these factors may be poorly understood and partially observed in practice. As a follow-up to the methodological work, we explore how these methods perform for three real-world case studies. In all three examples, we use individual patient-level data collected through the University of Michigan Health System and various external population-based data sources. In case study (a), we explore the impact of these methods on estimated associations between gender and cancer diagnosis. In case study (b), we compare corrected associations between previously identified genetic loci and age-related macular degeneration with gold standard external estimates. In case study (c), we evaluate these methods for modeling the association of COVID-19 outcomes and potential risk factors. These case studies illustrate how to utilize diverse auxiliary information to achieve less biased inference in EHR-based research.

摘要

电子健康记录(EHR)并非为基于人群的研究而设计,但它们能为许多个体提供纵向健康信息。人们已经提出了许多统计方法来处理EHR数据分析中出现的选择偏倚、数据缺失、表型错误或其他问题。然而,同时解决多种偏倚来源具有挑战性。最近,我们开发了一个方法框架(R包, ),用于在EHR环境中联合处理选择偏倚和表型错误分类,该框架利用了外部数据源。这些方法假设与选择和错误分类相关的因素是完全可观测的,但在实际中,这些因素可能理解不足且部分可观测。作为方法学工作的后续,我们探讨这些方法在三个实际案例研究中的表现。在所有三个例子中,我们使用通过密歇根大学医疗系统收集的个体患者层面数据以及各种基于人群的外部数据源。在案例研究(a)中,我们探讨这些方法对性别与癌症诊断之间估计关联的影响。在案例研究(b)中,我们将先前确定的基因位点与年龄相关性黄斑变性之间的校正关联与金标准外部估计值进行比较。在案例研究(c)中,我们评估这些方法对COVID-19结局与潜在风险因素之间关联进行建模的情况。这些案例研究说明了如何利用多样的辅助信息在基于EHR的研究中实现偏差较小的推断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/e7c3a727f65f/nihpp-2020.12.21.20248644-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/da8db120a58e/nihpp-2020.12.21.20248644-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/48b608b6be1a/nihpp-2020.12.21.20248644-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/dee9d2f4e350/nihpp-2020.12.21.20248644-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/0eb3e0d5106a/nihpp-2020.12.21.20248644-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/e7c3a727f65f/nihpp-2020.12.21.20248644-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/da8db120a58e/nihpp-2020.12.21.20248644-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/48b608b6be1a/nihpp-2020.12.21.20248644-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/dee9d2f4e350/nihpp-2020.12.21.20248644-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/0eb3e0d5106a/nihpp-2020.12.21.20248644-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe7b/7781342/e7c3a727f65f/nihpp-2020.12.21.20248644-f0005.jpg

相似文献

1
Bias reduction and inference for electronic health record data under selection and phenotype misclassification: three case studies.选择和表型错误分类情况下电子健康记录数据的偏差减少与推断:三个案例研究
medRxiv. 2020 Dec 23:2020.12.21.20248644. doi: 10.1101/2020.12.21.20248644.
2
Case studies in bias reduction and inference for electronic health record data with selection bias and phenotype misclassification.具有选择偏倚和表型错分的电子健康记录数据的偏差减少和推断的案例研究。
Stat Med. 2022 Dec 10;41(28):5501-5516. doi: 10.1002/sim.9579. Epub 2022 Sep 21.
3
Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification.基于电子健康记录的关联研究的统计推断:处理选择偏倚和结局错误分类。
Biometrics. 2022 Mar;78(1):214-226. doi: 10.1111/biom.13400. Epub 2020 Dec 3.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
6
Addressing Information Biases Within Electronic Health Record Data to Improve the Examination of Epidemiologic Associations With Diabetes Prevalence Among Young Adults: Cross-Sectional Study.解决电子健康记录数据中的信息偏差,以改善对年轻人糖尿病患病率的流行病学关联的检查:横断面研究。
JMIR Med Inform. 2024 Oct 1;12:e58085. doi: 10.2196/58085.
7
Prevalence estimation by joint use of big data and health survey: a demonstration study using electronic health records in New York city.大数据与健康调查联合应用的流行率估计:以纽约市电子健康记录为例的示范研究。
BMC Med Res Methodol. 2020 Apr 6;20(1):77. doi: 10.1186/s12874-020-00956-6.
8
PIE: A prior knowledge guided integrated likelihood estimation method for bias reduction in association studies using electronic health records data.PIE:一种基于先验知识的集成似然估计方法,用于减少使用电子健康记录数据的关联研究中的偏差。
J Am Med Inform Assoc. 2018 Mar 1;25(3):345-352. doi: 10.1093/jamia/ocx137.
9
An augmented estimation procedure for EHR-based association studies accounting for differential misclassification.基于电子健康记录的关联研究的增强估计程序,考虑到差异误诊。
J Am Med Inform Assoc. 2020 Feb 1;27(2):244-253. doi: 10.1093/jamia/ocz180.
10
Adult patient access to electronic health records.成年患者获取电子健康记录。
Cochrane Database Syst Rev. 2021 Feb 26;2(2):CD012707. doi: 10.1002/14651858.CD012707.pub2.

本文引用的文献

1
Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification.基于电子健康记录的关联研究的统计推断:处理选择偏倚和结局错误分类。
Biometrics. 2022 Mar;78(1):214-226. doi: 10.1111/biom.13400. Epub 2020 Dec 3.
2
Characteristics Associated With Racial/Ethnic Disparities in COVID-19 Outcomes in an Academic Health Care System.在学术医疗体系中,与新冠疫情结果的种族/民族差异相关的特征。
JAMA Netw Open. 2020 Oct 1;3(10):e2025197. doi: 10.1001/jamanetworkopen.2020.25197.
3
Population-scale longitudinal mapping of COVID-19 symptoms, behaviour and testing.
大规模人群中 COVID-19 症状、行为和检测的纵向图谱绘制。
Nat Hum Behav. 2020 Sep;4(9):972-982. doi: 10.1038/s41562-020-00944-2. Epub 2020 Aug 26.
4
Risk factors for positive and negative COVID-19 tests: a cautious and in-depth analysis of UK biobank data.COVID-19 检测呈阳性和阴性的风险因素:对英国生物银行数据的谨慎和深入分析。
Int J Epidemiol. 2020 Oct 1;49(5):1454-1467. doi: 10.1093/ije/dyaa134.
5
False Negative Tests for SARS-CoV-2 Infection - Challenges and Implications.新型冠状病毒2型感染的假阴性检测——挑战与影响
N Engl J Med. 2020 Aug 6;383(6):e38. doi: 10.1056/NEJMp2015897. Epub 2020 Jun 5.
6
Test-Negative Designs: Differences and Commonalities with Other Case-Control Studies with "Other Patient" Controls.阴性对照设计:与其他以“其他患者”为对照的病例对照研究的差异和共同点。
Epidemiology. 2019 Nov;30(6):838-844. doi: 10.1097/EDE.0000000000001088.
7
Extending inferences from a randomized trial to a target population.将随机试验的推论扩展至目标人群。
Eur J Epidemiol. 2019 Aug;34(8):719-722. doi: 10.1007/s10654-019-00533-2. Epub 2019 Jun 19.
8
Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative.多癌种多基因风险评分在表型全基因组研究中的关联:密歇根基因组倡议的结果。
Am J Hum Genet. 2018 Jun 7;102(6):1048-1061. doi: 10.1016/j.ajhg.2018.04.001. Epub 2018 May 17.
9
PIE: A prior knowledge guided integrated likelihood estimation method for bias reduction in association studies using electronic health records data.PIE:一种基于先验知识的集成似然估计方法,用于减少使用电子健康记录数据的关联研究中的偏差。
J Am Med Inform Assoc. 2018 Mar 1;25(3):345-352. doi: 10.1093/jamia/ocx137.
10
A General Framework for Considering Selection Bias in EHR-Based Studies: What Data Are Observed and Why?基于电子健康记录的研究中考虑选择偏倚的通用框架:观察到了哪些数据以及原因是什么?
EGEMS (Wash DC). 2016 Aug 31;4(1):1203. doi: 10.13063/2327-9214.1203. eCollection 2016.