Improving efficiency of fitting Cox proportional hazards models for time-to-event outcomes in genome-wide association studies (GWAS).

Author information

Gebski Val, Silva S Sandun M, Byth Karen, Jenkins Alicia, Keech Anthony

Affiliation

NHMRC Clinical Trials Centre, University of Sydney, Camperdown, NSW 1450, Australia.

Publication information

Bioinform Adv. 2023 Oct 13;3(1):vbad148. doi: 10.1093/bioadv/vbad148. eCollection 2023.

DOI:10.1093/bioadv/vbad148

PMID:37928342

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10625458/

Abstract

SUMMARY

Technologies identifying single nucleotide polymorphisms (SNPs) in DNA sequencing yield an avalanche of data requiring analysis and interpretation. Standard methods may require many weeks of processing time. The use of statistical methods requiring data sorting, high-dimensional matrix inversions and replication in subsets of the data on multiple outcomes exacerbates these times.

A method is proposed which reduces the computational time in problems with time-to-event outcomes and hundreds of thousands or millions of SNPs, using Cox-Snell residuals obtained after fitting the Cox proportional hazards model (CPHM) to a fixed set of concomitant variables. This yields coefficients for the SNP effect from a Cox-Snell adjusted Poisson model and shows high concordance with the adjusted CPHM model.

The method is illustrated with a sample of 10 000 SNPs from a genome-wide association study in a diabetic population. The gain in processing efficiency using the proposed method based on Poisson modelling can be as high as 62%. This could result in a saving of over three weeks of processing time if 5 million SNPs require analysis. The method involves only a single predictor variable (the SNP), offering a simpler, computationally more stable approach to examining and identifying SNP patterns associated with the outcome(s), allowing faster development of genetic signatures. Use of deviance residuals from the CPHM model to screen SNPs demonstrates a large discordance rate at a 0.2% threshold of concordance. This rate is 15 times larger than that based on the Cox-Snell residuals from the Cox-Snell adjusted Poisson model.

AVAILABILITY AND IMPLEMENTATION

The method is simple to implement as the procedures are available in most statistical packages. The approach involves obtaining Cox-Snell residuals from a CPHM model, fitted to a binary time-to-event outcome, for the factors which need to be common when assessing each SNP. Each SNP is then fitted as a predictor of the outcome of interest using a Poisson model with the Cox-Snell residuals as the exposure variable.
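
The second step of the procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the Cox-Snell residuals r_i have already been obtained from a Cox model fitted once to the common covariates, and it fits, for one SNP with genotype g_i and event indicator d_i, a Poisson model with mean r_i * exp(a + b * g_i) by Newton-Raphson. The function name `fit_poisson_snp`, the variable names and the toy data are all hypothetical.

```python
import math


def fit_poisson_snp(events, cox_snell, genotypes, max_iter=50, tol=1e-10):
    """Fit events_i ~ Poisson(cox_snell_i * exp(a + b * genotype_i)).

    The Cox-Snell residual plays the role of the exposure, so it enters
    the mean multiplicatively. Returns (a, b, se_b), where b is the
    SNP coefficient and se_b its model-based standard error.
    Assumes at least one event, positive total exposure and a genotype
    that varies across subjects.
    """
    # Start from the intercept-only fit: total events / total exposure.
    a = math.log(sum(events) / sum(cox_snell))
    b = 0.0
    for _ in range(max_iter):
        # Score vector (u_a, u_b) and information matrix entries.
        u_a = u_b = i_aa = i_ab = i_bb = 0.0
        for d, r, g in zip(events, cox_snell, genotypes):
            mu = r * math.exp(a + b * g)  # fitted Poisson mean
            u_a += d - mu
            u_b += (d - mu) * g
            i_aa += mu
            i_ab += mu * g
            i_bb += mu * g * g
        # Newton step: solve the 2x2 system (information) * step = score.
        det = i_aa * i_bb - i_ab * i_ab
        step_a = (i_bb * u_a - i_ab * u_b) / det
        step_b = (i_aa * u_b - i_ab * u_a) / det
        a += step_a
        b += step_b
        if abs(step_a) + abs(step_b) < tol:
            break
    # Variance of b is the (b, b) entry of the inverse information,
    # evaluated at the last iterate before the final (negligible) step.
    se_b = math.sqrt(i_aa / det)
    return a, b, se_b


# Toy data (hypothetical): event indicators, Cox-Snell residuals taken
# as given from a previously fitted Cox model, and one SNP coded 0/1.
events = [1, 0, 1, 1, 0, 1, 0, 0]
cox_snell = [0.5, 0.2, 1.1, 0.9, 0.3, 1.5, 0.4, 0.3]
genotypes = [0, 0, 1, 1, 0, 1, 0, 1]

a_hat, b_hat, se_b = fit_poisson_snp(events, cox_snell, genotypes)
print(f"SNP coefficient: {b_hat:.4f} (SE {se_b:.4f})")
```

In a genome-wide scan, a function like this would be called once per SNP while reusing the same residual vector, so only a two-parameter model is fitted each time. In practice the same fit is available from the Poisson regression routine of a standard statistical package, with the residuals supplied as the exposure.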

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d118/10625458/68297fc56e41/vbad148f1.jpg
