Gradient lasso for Cox proportional hazards model.

Author information

Sohn Insuk, Kim Jinseog, Jung Sin-Ho, Park Changyi

Affiliation

Department of Biostatistics & Bioinformatics, Duke University, NC 27705, USA.

Publication information

Bioinformatics. 2009 Jul 15;25(14):1775-81. doi: 10.1093/bioinformatics/btp322. Epub 2009 May 15.

DOI:10.1093/bioinformatics/btp322

PMID:19447787

Abstract

MOTIVATION

There has been increasing interest in expressing a survival phenotype (e.g. time to cancer recurrence or death), or its distribution, in terms of the expression data of a subset of genes. Because gene expression data are high-dimensional, however, fitting a prediction model such as Cox's proportional hazards model runs into a serious collinearity problem. To avoid the collinearity problem, several methods based on penalized Cox proportional hazards models have been proposed. However, those methods suffer from severe computational problems, such as slow or even failed convergence, because model fitting requires high-dimensional matrix inversions. We propose to implement penalized Cox regression with a lasso penalty via the gradient lasso algorithm, which yields faster convergence to the global optimum than other algorithms do. Moreover, the gradient lasso algorithm is guaranteed to converge to the optimum under mild regularity conditions. Hence, our gradient lasso algorithm can be a useful tool in developing a prediction model based on high-dimensional covariates, including gene expression data.
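To make the idea of an inversion-free lasso fit concrete, the following is a minimal Python sketch. It is not the authors' glcoxph implementation: it assumes the constrained form of the lasso (the sum of absolute coefficients bounded by `lam`), uses a plain conditional-gradient (Frank-Wolfe) update as a simplified stand-in for the gradient lasso algorithm, assumes no tied event times, and omits any refinements the published algorithm may use. All function names, the step-size rule, and the simulated data are illustrative assumptions.

```python
import numpy as np

def cox_neg_log_partial_likelihood(beta, X, time, event):
    """Negative log partial likelihood of the Cox model (assumes no tied times)."""
    eta = X @ beta
    order = np.argsort(-time)                  # descending time: risk set = prefix
    eta_s = eta[order]
    event_s = np.asarray(event, dtype=float)[order]
    log_risk = np.logaddexp.accumulate(eta_s)  # log of cumulative sum of exp(eta)
    return -np.sum((eta_s - log_risk) * event_s)

def cox_gradient(beta, X, time, event):
    """Gradient of the negative log partial likelihood with respect to beta."""
    eta = X @ beta
    order = np.argsort(-time)
    Xs = X[order]
    event_s = np.asarray(event, dtype=float)[order]
    w = np.exp(eta[order] - eta.max())         # shift cancels in the ratio below
    risk_mean = np.cumsum(w[:, None] * Xs, axis=0) / np.cumsum(w)[:, None]
    return -((Xs - risk_mean) * event_s[:, None]).sum(axis=0)

def frank_wolfe_l1_cox(X, time, event, lam, n_iter=200):
    """Conditional-gradient fit under the constraint sum(|beta|) <= lam.

    Each iteration needs only one gradient evaluation; no matrix is inverted.
    """
    beta = np.zeros(X.shape[1])
    for k in range(n_iter):
        g = cox_gradient(beta, X, time, event)
        j = np.argmax(np.abs(g))               # coordinate with the steepest slope
        vertex = np.zeros_like(beta)
        vertex[j] = -lam * np.sign(g[j])       # vertex of the L1 ball to move toward
        step = 2.0 / (k + 2.0)                 # standard Frank-Wolfe step size
        beta = (1.0 - step) * beta + step * vertex
    return beta

# Simulated data (hypothetical): two prognostic covariates out of thirty.
rng = np.random.default_rng(0)
n, p, lam = 200, 30, 2.0
X = rng.standard_normal((n, p))
true_beta = np.zeros(p)
true_beta[:2] = [1.5, -1.5]
failure = rng.exponential(1.0 / np.exp(X @ true_beta))
censor = rng.exponential(5.0, size=n)
observed = np.minimum(failure, censor)
event = (failure <= censor).astype(float)

beta_hat = frank_wolfe_l1_cox(X, observed, event, lam)
print("L1 norm of estimate:", np.abs(beta_hat).sum())
print("largest coefficients at indices:", np.argsort(-np.abs(beta_hat))[:2])
```

Because every iterate is a convex combination of points inside the L1 ball, the estimate stays feasible throughout, and the per-iteration cost is dominated by one pass over the data rather than by a high-dimensional matrix inversion.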

RESULTS

Simulation studies showed that the prediction model fitted by gradient lasso recovers the prognostic genes. In addition, results from diffuse large B-cell lymphoma datasets and the Norway/Stanford breast cancer dataset indicate that our method is very competitive with the popular existing methods of Park and Hastie and of Goeman in computational time, prediction and selectivity.

AVAILABILITY

R package glcoxph is available at http://datamining.dongguk.ac.kr/R/glcoxph.

Similar articles

L1 penalized estimation in the Cox proportional hazards model.

Biom J. 2010 Feb;52(1):70-84. doi: 10.1002/bimj.200900028.

Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.

Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6.

High-dimensional Cox models: the choice of penalty as part of the model building process.

Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.

Assessment of survival prediction models based on microarray data.

Bioinformatics. 2007 Jul 15;23(14):1768-74. doi: 10.1093/bioinformatics/btm232. Epub 2007 May 7.

Predicting survival from microarray data--a comparative study.

Bioinformatics. 2007 Aug 15;23(16):2080-7. doi: 10.1093/bioinformatics/btm305. Epub 2007 Jun 6.

Partial Cox regression analysis for high-dimensional microarray gene expression data.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i208-15. doi: 10.1093/bioinformatics/bth900.

Variable selection for proportional odds model.

Stat Med. 2007 Sep 10;26(20):3771-81. doi: 10.1002/sim.2833.

Survival analysis of microarray expression data by transformation models.

Comput Biol Chem. 2005 Apr;29(2):91-4. doi: 10.1016/j.compbiolchem.2005.02.001.

Dimension reduction methods for microarrays with application to censored survival data.

Bioinformatics. 2004 Dec 12;20(18):3406-12. doi: 10.1093/bioinformatics/bth415. Epub 2004 Jul 15.

Cited by

Integrated network pharmacology and experiments to reveal the anti-inflammatory mechanism of Qinghuo Rougan Formula in uveitis.

Front Mol Biosci. 2025 Jul 11;12:1632027. doi: 10.3389/fmolb.2025.1632027. eCollection 2025.

Importance of CD8 Tex cell-associated gene signatures in the prognosis and immunology of osteosarcoma.

Sci Rep. 2024 Apr 29;14(1):9769. doi: 10.1038/s41598-024-60539-z.

CRYL1 is a Potential Prognostic Biomarker of Clear Cell Renal Cell Carcinoma Correlated with Immune Infiltration and Cuproptosis.

Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241237439. doi: 10.1177/15330338241237439.

Estimation of Norm Penalized Models: A Statistical Treatment.

Comput Stat Data Anal. 2024 Apr;192. doi: 10.1016/j.csda.2023.107902. Epub 2023 Dec 6.

The solution surface of the Li-Stephens haplotype copying model.

Algorithms Mol Biol. 2023 Aug 9;18(1):12. doi: 10.1186/s13015-023-00237-z.

Improving the Post-Operative Prediction of BCR-Free Survival Time with mRNA Variables and Machine Learning.

Cancers (Basel). 2023 Feb 17;15(4):1276. doi: 10.3390/cancers15041276.

Cancer Med. 2023 Feb;12(4):5071-5087. doi: 10.1002/cam4.5247. Epub 2022 Sep 26.

Predicting Patient-Reported Outcomes Following Surgery Using Machine Learning.

Am Surg. 2023 Jan;89(1):31-35. doi: 10.1177/00031348221109478. Epub 2022 Jun 18.

Fitting and Cross-Validating Cox Models to Censored Big Data With Missing Values Using Extensions of Partial Least Squares Regression Models.

Front Big Data. 2021 Nov 1;4:684794. doi: 10.3389/fdata.2021.684794. eCollection 2021.

A risk scoring tool for predicting Kenyan women at high risk of contraceptive discontinuation.

Contracept X. 2020 Oct 29;2:100045. doi: 10.1016/j.conx.2020.100045. eCollection 2020.
