Variable selection with Group LASSO approach: Application to Cox regression with frailty model.

Author information

Utazirubanda Jean Claude, Leon Tomas, Ngom Papa

Affiliations

LMA, Université Cheikh Anta Diop, Dakar, Senegal.

School of Public Health, University of California, Berkeley, USA.

Publication information

Commun Stat Simul Comput. 2021;50(3):881-901. doi: 10.1080/03610918.2019.1571605. Epub 2018 Feb 28.

Abstract

In the analysis of survival outcomes supplemented with both clinical information and high-dimensional gene expression data, the traditional Cox proportional hazards model fails to meet some emerging needs in biomedical research. First, the number of covariates is generally much larger than the sample size. Second, predicting an outcome from individual gene expression is inadequate because multiple biological processes and functional pathways regulate phenotypic expression. Third, the Cox model assumes that populations are homogeneous, implying that all individuals have the same risk of death, which is rarely true because of unmeasured risk factors among populations. In this paper we propose group LASSO with gamma-distributed frailty for variable selection in Cox regression, extending previous scholarship to account for heterogeneity among group structures related to exposure and susceptibility. The consistency property of the proposed method is established. This method is appropriate for addressing a wide variety of research questions, from genetics to air pollution. Analyses of simulated and real-world data show promising performance by group LASSO compared with other methods, including group SCAD and group MCP. Future research directions include expanding the use of frailty with adaptive group LASSO and sparse group LASSO methods.
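As a rough illustration of the group LASSO idea named in the abstract, the sketch below shows the penalty term and the block soft-thresholding step that makes whole groups of coefficients drop to zero together. It is not the authors' estimation procedure: the function names are hypothetical, the square-root-of-group-size weights are a common convention assumed here rather than taken from the paper, and the Cox partial likelihood and the gamma frailty term are omitted entirely.

```python
import math


def group_lasso_penalty(beta, groups, lam):
    """Return lam * sum_k sqrt(p_k) * ||beta_k||_2.

    `groups` is a list of index lists, one per covariate group.
    The sqrt(p_k) weight is an assumed convention, not from the paper.
    """
    total = 0.0
    for idx in groups:
        norm = math.sqrt(sum(beta[j] ** 2 for j in idx))
        total += math.sqrt(len(idx)) * norm
    return lam * total


def group_soft_threshold(beta_group, threshold):
    """Proximal operator of threshold * ||.||_2 (block soft-thresholding).

    The whole group is set to zero when its Euclidean norm does not
    exceed the threshold; otherwise it is shrunk toward zero.
    """
    norm = math.sqrt(sum(b * b for b in beta_group))
    if norm <= threshold:
        return [0.0] * len(beta_group)
    scale = 1.0 - threshold / norm
    return [scale * b for b in beta_group]
```

A group whose coefficients are jointly small is removed as a unit, which is how a penalty of this kind selects pathways or gene sets rather than individual genes.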

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9fe1/8261624/f2aa8dcc854d/nihms-1012306-f0001.jpg
