• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在利用电子健康记录的基因组研究中,Cox回归增强了检测基因型与表型关联的效能。

Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record.

作者信息

Hughey Jacob J, Rhoades Seth D, Fu Darwin Y, Bastarache Lisa, Denny Joshua C, Chen Qingxia

机构信息

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA.

出版信息

BMC Genomics. 2019 Nov 4;20(1):805. doi: 10.1186/s12864-019-6192-1.

DOI:10.1186/s12864-019-6192-1
PMID:31684865
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6829851/
Abstract

BACKGROUND

The growth of DNA biobanks linked to data from electronic health records (EHRs) has enabled the discovery of numerous associations between genomic variants and clinical phenotypes. Nonetheless, although clinical data are generally longitudinal, standard approaches for detecting genotype-phenotype associations in such linked data, notably logistic regression, do not naturally account for variation in the period of follow-up or the time at which an event occurs. Here we explored the advantages of quantifying associations using Cox proportional hazards regression, which can account for the age at which a patient first visited the healthcare system (left truncation) and the age at which a patient either last visited the healthcare system or acquired a particular phenotype (right censoring).

RESULTS

In comprehensive simulations, we found that, compared to logistic regression, Cox regression had greater power at equivalent Type I error. We then scanned for genotype-phenotype associations using logistic regression and Cox regression on 50 phenotypes derived from the EHRs of 49,792 genotyped individuals. Consistent with the findings from our simulations, Cox regression had approximately 10% greater relative sensitivity for detecting known associations from the NHGRI-EBI GWAS Catalog. In terms of effect sizes, the hazard ratios estimated by Cox regression were strongly correlated with the odds ratios estimated by logistic regression.

CONCLUSIONS

As longitudinal health-related data continue to grow, Cox regression may improve our ability to identify the genetic basis for a wide range of human phenotypes.

摘要

背景

与电子健康记录(EHR)数据相关联的DNA生物样本库的发展,使得人们发现了基因组变异与临床表型之间的众多关联。尽管如此,虽然临床数据通常是纵向的,但在这种关联数据中检测基因型-表型关联的标准方法,尤其是逻辑回归,并未自然地考虑随访期的变化或事件发生的时间。在此,我们探讨了使用Cox比例风险回归来量化关联的优势,该方法可以考虑患者首次就诊于医疗系统的年龄(左删失)以及患者最后一次就诊于医疗系统或获得特定表型的年龄(右删失)。

结果

在全面的模拟中,我们发现,与逻辑回归相比,Cox回归在同等I型错误水平下具有更大的检验效能。然后,我们使用逻辑回归和Cox回归对来自49792名基因分型个体的EHR中的50种表型进行了基因型-表型关联扫描。与我们模拟的结果一致,Cox回归在检测来自NHGRI-EBI全基因组关联研究目录中的已知关联时,相对灵敏度大约高10%。在效应大小方面,Cox回归估计的风险比与逻辑回归估计的比值比高度相关。

结论

随着与健康相关的纵向数据不断增加,Cox回归可能会提高我们识别广泛人类表型遗传基础的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/177829b37324/12864_2019_6192_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/15894a342a33/12864_2019_6192_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/0cec2603c3b4/12864_2019_6192_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/fd177d2f515d/12864_2019_6192_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/177829b37324/12864_2019_6192_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/15894a342a33/12864_2019_6192_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/0cec2603c3b4/12864_2019_6192_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/fd177d2f515d/12864_2019_6192_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b604/6829851/177829b37324/12864_2019_6192_Fig4_HTML.jpg

相似文献

1
Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record.在利用电子健康记录的基因组研究中,Cox回归增强了检测基因型与表型关联的效能。
BMC Genomics. 2019 Nov 4;20(1):805. doi: 10.1186/s12864-019-6192-1.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
4
The clinical effectiveness and cost-effectiveness of enzyme replacement therapy for Gaucher's disease: a systematic review.戈谢病酶替代疗法的临床疗效和成本效益:一项系统评价。
Health Technol Assess. 2006 Jul;10(24):iii-iv, ix-136. doi: 10.3310/hta10240.
5
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
10
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.

引用本文的文献

1
Analysis of follow-up data in large biobank cohorts: a review of methodology.大型生物样本库队列中的随访数据分析:方法学综述
Front Genet. 2025 Jun 25;16:1534726. doi: 10.3389/fgene.2025.1534726. eCollection 2025.
2
BAYESIAN VARIABLE SELECTION IN A COX PROPORTIONAL HAZARDS MODEL WITH THE "SUM OF SINGLE EFFECTS" PRIOR.具有“单效应之和”先验的Cox比例风险模型中的贝叶斯变量选择
ArXiv. 2025 Jun 6:arXiv:2506.06233v1.
3
Neighborhood-level social determinants of suicidality in youth with schizophrenia: An EHR-based study.精神分裂症青少年自杀倾向的社区层面社会决定因素:一项基于电子健康记录的研究。

本文引用的文献

1
The "All of Us" Research Program.“All of Us”研究计划。
N Engl J Med. 2019 Aug 15;381(7):668-676. doi: 10.1056/NEJMsr1809937.
2
The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.NHGRI-EBI GWAS Catalog 于 2019 年发布的已发表全基因组关联研究、靶向基因芯片和汇总统计数据
Nucleic Acids Res. 2019 Jan 8;47(D1):D1005-D1012. doi: 10.1093/nar/gky1120.
3
gwasurvivr: an R package for genome-wide survival analysis.gwasurvivr:一个用于全基因组生存分析的 R 包。
Schizophr Res. 2025 Jul;281:74-81. doi: 10.1016/j.schres.2025.04.035. Epub 2025 May 3.
4
Improving efficiency of fitting Cox proportional hazards models for time-to-event outcomes in genome-wide association studies (GWAS).提高全基因组关联研究(GWAS)中用于事件发生时间结局的Cox比例风险模型的拟合效率。
Bioinform Adv. 2023 Oct 13;3(1):vbad148. doi: 10.1093/bioadv/vbad148. eCollection 2023.
5
Local and Bayesian Survival FDR Estimations to Identify Reliable Associations in Whole Genome of Bread Wheat.基于局部和贝叶斯生存 FDR 估计的全基因组小麦可靠关联分析。
Int J Mol Sci. 2023 Sep 12;24(18):14011. doi: 10.3390/ijms241814011.
6
ADuLT: An efficient and robust time-to-event GWAS.成人:一种高效稳健的事件时间 GWAS 分析方法。
Nat Commun. 2023 Sep 9;14(1):5553. doi: 10.1038/s41467-023-41210-z.
7
The phenotype-genotype reference map: Improving biobank data science through replication.表型-基因型参考图谱:通过复制提高生物库数据科学。
Am J Hum Genet. 2023 Sep 7;110(9):1522-1533. doi: 10.1016/j.ajhg.2023.07.012. Epub 2023 Aug 21.
8
Cox regression is robust to inaccurate EHR-extracted event time: an application to EHR-based GWAS.Cox 回归对 EHR 提取的事件时间不准确具有稳健性:基于 EHR 的 GWAS 的应用。
Bioinformatics. 2022 Apr 12;38(8):2297-2306. doi: 10.1093/bioinformatics/btac086.
9
Accounting for age of onset and family history improves power in genome-wide association studies.考虑发病年龄和家族史可提高全基因组关联研究的效能。
Am J Hum Genet. 2022 Mar 3;109(3):417-432. doi: 10.1016/j.ajhg.2022.01.009. Epub 2022 Feb 8.
10
Two-step hypothesis testing to detect gene-environment interactions in a genome-wide scan with a survival endpoint.两步假设检验法检测生存终点全基因组扫描中的基因-环境交互作用。
Stat Med. 2022 Apr 30;41(9):1644-1657. doi: 10.1002/sim.9319. Epub 2022 Jan 24.
Bioinformatics. 2019 Jun 1;35(11):1968-1970. doi: 10.1093/bioinformatics/bty920.
4
Profiling and Leveraging Relatedness in a Precision Medicine Cohort of 92,455 Exomes.在一个包含 92455 个外显子组的精准医学队列中进行相关分析和利用。
Am J Hum Genet. 2018 May 3;102(5):874-889. doi: 10.1016/j.ajhg.2018.03.012.
5
The Influence of Big (Clinical) Data and Genomics on Precision Medicine and Drug Development.大数据和基因组学对精准医学和药物研发的影响。
Clin Pharmacol Ther. 2018 Mar;103(3):409-418. doi: 10.1002/cpt.951. Epub 2018 Feb 5.
6
Two-sample tests for survival data from observational studies.观察性研究中生存数据的双样本检验。
Lifetime Data Anal. 2018 Jul;24(3):509-531. doi: 10.1007/s10985-017-9408-1. Epub 2017 Aug 28.
7
Bayesian analysis of genetic association across tree-structured routine healthcare data in the UK Biobank.英国生物银行中跨树形结构常规医疗数据的遗传关联贝叶斯分析。
Nat Genet. 2017 Sep;49(9):1311-1318. doi: 10.1038/ng.3926. Epub 2017 Jul 31.
8
A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design.用于队列和病例队列设计的全基因组关联研究的Cox回归与逻辑回归比较。
Eur J Hum Genet. 2017 Jun;25(7):854-862. doi: 10.1038/ejhg.2017.78. Epub 2017 May 3.
9
SurvivalGWAS_SV: software for the analysis of genome-wide association studies of imputed genotypes with "time-to-event" outcomes.生存全基因组关联研究-结构变异:用于分析具有“事件发生时间”结局的插补基因型全基因组关联研究的软件。
BMC Bioinformatics. 2017 May 19;18(1):265. doi: 10.1186/s12859-017-1683-z.
10
Conditional screening for ultra-high dimensional covariates with survival outcomes.基于生存结局的超高维协变量条件筛选
Lifetime Data Anal. 2018 Jan;24(1):45-71. doi: 10.1007/s10985-016-9387-7. Epub 2016 Dec 8.