• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于利用电子健康记录探索全基因组和全表型组关联研究中的抽样和观察过程偏差的分析框架。

An analytic framework for exploring sampling and observation process biases in genome and phenome-wide association studies using electronic health records.

作者信息

Beesley Lauren J, Fritsche Lars G, Mukherjee Bhramar

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Stat Med. 2020 Jun 30;39(14):1965-1979. doi: 10.1002/sim.8524. Epub 2020 Mar 20.

DOI:10.1002/sim.8524
PMID:32198773
Abstract

Large-scale association analyses based on observational health care databases such as electronic health records have been a topic of increasing interest in the scientific community. However, challenges due to nonprobability sampling and phenotype misclassification associated with the use of these data sources are often ignored in standard analyses. The extent of the bias introduced by ignoring these factors is not well-characterized. In this paper, we develop an analytic framework for characterizing the bias expected in disease-gene association studies based on electronic health records when disease status misclassification and the sampling mechanism are ignored. Through a sensitivity analysis approach, this framework can be used to obtain plausible values for parameters of interest given summary results from standard analysis. We develop an online tool for performing this sensitivity analysis. Simulations demonstrate promising properties of the proposed method. We apply our approach to study bias in disease-gene association studies using electronic health record data from the Michigan Genomics Initiative, a longitudinal biorepository effort within The University Michigan health system.

摘要

基于诸如电子健康记录等观察性医疗保健数据库的大规模关联分析,一直是科学界日益关注的话题。然而,在标准分析中,常常忽略了与使用这些数据源相关的非概率抽样和表型错误分类所带来的挑战。忽略这些因素所引入的偏差程度尚未得到充分描述。在本文中,我们开发了一个分析框架,用于刻画在忽略疾病状态错误分类和抽样机制的情况下,基于电子健康记录的疾病 - 基因关联研究中预期的偏差。通过敏感性分析方法,该框架可用于根据标准分析的汇总结果获得感兴趣参数的合理值。我们开发了一个用于执行这种敏感性分析的在线工具。模拟结果表明了所提出方法的良好特性。我们应用我们的方法,利用密歇根基因组计划的电子健康记录数据,研究疾病 - 基因关联研究中的偏差,该计划是密歇根大学健康系统内的一项纵向生物样本库项目。

相似文献

1
An analytic framework for exploring sampling and observation process biases in genome and phenome-wide association studies using electronic health records.一种用于利用电子健康记录探索全基因组和全表型组关联研究中的抽样和观察过程偏差的分析框架。
Stat Med. 2020 Jun 30;39(14):1965-1979. doi: 10.1002/sim.8524. Epub 2020 Mar 20.
2
Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification.基于电子健康记录的关联研究的统计推断:处理选择偏倚和结局错误分类。
Biometrics. 2022 Mar;78(1):214-226. doi: 10.1111/biom.13400. Epub 2020 Dec 3.
3
Exploring various polygenic risk scores for skin cancer in the phenomes of the Michigan genomics initiative and the UK Biobank with a visual catalog: PRSWeb.探索密歇根基因组倡议和英国生物库表型中用于皮肤癌的多种多基因风险评分:PRSWeb。
PLoS Genet. 2019 Jun 13;15(6):e1008202. doi: 10.1371/journal.pgen.1008202. eCollection 2019 Jun.
4
Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative.多癌种多基因风险评分在表型全基因组研究中的关联:密歇根基因组倡议的结果。
Am J Hum Genet. 2018 Jun 7;102(6):1048-1061. doi: 10.1016/j.ajhg.2018.04.001. Epub 2018 May 17.
5
Phenotype validation in electronic health records based genetic association studies.基于电子健康记录的基因关联研究中的表型验证
Genet Epidemiol. 2017 Dec;41(8):790-800. doi: 10.1002/gepi.22080. Epub 2017 Oct 11.
6
Case studies in bias reduction and inference for electronic health record data with selection bias and phenotype misclassification.具有选择偏倚和表型错分的电子健康记录数据的偏差减少和推断的案例研究。
Stat Med. 2022 Dec 10;41(28):5501-5516. doi: 10.1002/sim.9579. Epub 2022 Sep 21.
7
LabWAS: Novel findings and study design recommendations from a meta-analysis of clinical labs in two independent biobanks.LabWAS:来自两个独立生物库中临床实验室的荟萃分析的新发现和研究设计建议。
PLoS Genet. 2020 Nov 11;16(11):e1009077. doi: 10.1371/journal.pgen.1009077. eCollection 2020 Nov.
8
A framework for understanding selection bias in real-world healthcare data.一个用于理解真实世界医疗数据中选择偏倚的框架。
J R Stat Soc Ser A Stat Soc. 2024 May 2;187(3):606-635. doi: 10.1093/jrsssa/qnae039. eCollection 2024 Aug.
9
Phenome-wide association studies (PheWASs) for functional variants.针对功能变异的全表型组关联研究(PheWASs)。
Eur J Hum Genet. 2015 Apr;23(4):523-9. doi: 10.1038/ejhg.2014.123. Epub 2014 Jul 30.
10
Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.系统比较电子病历数据的表型全基因组关联研究和全基因组关联研究数据。
Nat Biotechnol. 2013 Dec;31(12):1102-10. doi: 10.1038/nbt.2749.

引用本文的文献

1
Quantitative bias analysis for mismeasured variables in health research: a review of software tools.健康研究中测量错误变量的定量偏差分析:软件工具综述
BMC Med Res Methodol. 2025 Aug 1;25(1):187. doi: 10.1186/s12874-025-02635-w.
2
Exploring beyond diagnoses in electronic health records to improve discovery: a review of the phenome-wide association study.探索电子健康记录中的诊断之外的信息以改善发现:全表型关联研究综述
JAMIA Open. 2025 Feb 28;8(1):ooaf006. doi: 10.1093/jamiaopen/ooaf006. eCollection 2025 Feb.
3
A framework for understanding selection bias in real-world healthcare data.
一个用于理解真实世界医疗数据中选择偏倚的框架。
J R Stat Soc Ser A Stat Soc. 2024 May 2;187(3):606-635. doi: 10.1093/jrsssa/qnae039. eCollection 2024 Aug.
4
Characterising the genetic architecture of changes in adiposity during adulthood using electronic health records.利用电子健康记录描述成年期体脂变化的遗传结构。
Nat Commun. 2024 Jul 10;15(1):5801. doi: 10.1038/s41467-024-49998-0.
5
ADDRESSING SELECTION BIAS AND MEASUREMENT ERROR IN COVID-19 CASE COUNT DATA USING AUXILIARY INFORMATION.利用辅助信息解决新冠疫情病例数数据中的选择偏倚和测量误差问题。
Ann Appl Stat. 2023 Dec;17(4):2903-2923. doi: 10.1214/23-aoas1744. Epub 2023 Oct 30.
6
Biases in Electronic Health Records Data for Generating Real-World Evidence: An Overview.用于生成真实世界证据的电子健康记录数据中的偏差:概述
J Healthc Inform Res. 2023 Nov 14;8(1):121-139. doi: 10.1007/s41666-023-00153-2. eCollection 2024 Mar.
7
Patterns of item nonresponse behaviour to survey questionnaires are systematic and associated with genetic loci.项目对调查问卷的非应答行为模式是系统的,并与遗传基因座相关。
Nat Hum Behav. 2023 Aug;7(8):1371-1387. doi: 10.1038/s41562-023-01632-7. Epub 2023 Jun 29.
8
A Query Engine for Self-controlled Case Series, with an application to COVID-19 EHR data.一种用于自我对照病例系列的查询引擎及其在COVID-19电子健康记录数据中的应用。
AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:350-359. eCollection 2023.
9
Detecting and Adjusting for Hidden Biases due to Phenotype Misclassification in Genome-Wide Association Studies.在全基因组关联研究中检测并校正因表型错误分类导致的潜在偏差
medRxiv. 2023 Jan 18:2023.01.17.23284670. doi: 10.1101/2023.01.17.23284670.
10
ExPRSweb: An online repository with polygenic risk scores for common health-related exposures.ExPRSweb:一个在线存储库,包含常见健康相关暴露的多基因风险评分。
Am J Hum Genet. 2022 Oct 6;109(10):1742-1760. doi: 10.1016/j.ajhg.2022.09.001. Epub 2022 Sep 23.