• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加速失效时间模型的正则化估计

Regularized estimation for the accelerated failure time model.

作者信息

Cai T, Huang J, Tian L

机构信息

Department of Biostatistics, Harvard University, Boston, Massachusetts 02115, USA.

出版信息

Biometrics. 2009 Jun;65(2):394-404. doi: 10.1111/j.1541-0420.2008.01074.x.

DOI:10.1111/j.1541-0420.2008.01074.x
PMID:18573133
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3073158/
Abstract

In the presence of high-dimensional predictors, it is challenging to develop reliable regression models that can be used to accurately predict future outcomes. Further complications arise when the outcome of interest is an event time, which is often not fully observed due to censoring. In this article, we develop robust prediction models for event time outcomes by regularizing the Gehan's estimator for the accelerated failure time (AFT) model (Tsiatis, 1996, Annals of Statistics 18, 305-328) with least absolute shrinkage and selection operator (LASSO) penalty. Unlike existing methods based on the inverse probability weighting and the Buckley and James estimator (Buckley and James, 1979, Biometrika 66, 429-436), the proposed approach does not require additional assumptions about the censoring and always yields a solution that is convergent. Furthermore, the proposed estimator leads to a stable regression model for prediction even if the AFT model fails to hold. To facilitate the adaptive selection of the tuning parameter, we detail an efficient numerical algorithm for obtaining the entire regularization path. The proposed procedures are applied to a breast cancer dataset to derive a reliable regression model for predicting patient survival based on a set of clinical prognostic factors and gene signatures. Finite sample performances of the procedures are evaluated through a simulation study.

摘要

在存在高维预测变量的情况下,开发能够准确预测未来结果的可靠回归模型具有挑战性。当感兴趣的结果是事件时间时,会出现进一步的复杂情况,由于删失,事件时间往往无法完全观测到。在本文中,我们通过使用最小绝对收缩和选择算子(LASSO)惩罚对加速失效时间(AFT)模型(Tsiatis,1996年,《统计学年鉴》18卷,305 - 328页)的Gehan估计器进行正则化,来开发针对事件时间结果的稳健预测模型。与基于逆概率加权以及Buckley和James估计器(Buckley和James,1979年,《生物统计学》66卷,429 - 436页)的现有方法不同,所提出的方法不需要关于删失的额外假设,并且总能得到一个收敛的解。此外,即使AFT模型不成立,所提出的估计器也能导致一个用于预测的稳定回归模型。为了便于自适应选择调谐参数,我们详细介绍了一种用于获得整个正则化路径的高效数值算法。所提出的方法应用于一个乳腺癌数据集,以基于一组临床预后因素和基因特征推导一个用于预测患者生存的可靠回归模型。通过模拟研究评估了这些方法的有限样本性能。

相似文献

1
Regularized estimation for the accelerated failure time model.加速失效时间模型的正则化估计
Biometrics. 2009 Jun;65(2):394-404. doi: 10.1111/j.1541-0420.2008.01074.x.
2
Marginal hazards regression for retrospective studies within cohort with possibly correlated failure time data.对队列中具有可能相关的失效时间数据的回顾性研究进行边际风险回归。
Biometrics. 2009 Jun;65(2):405-14. doi: 10.1111/j.1541-0420.2008.01077.x. Epub 2008 May 19.
3
Estimating survival and association in a semicompeting risks model.在半竞争风险模型中估计生存率和关联性。
Biometrics. 2008 Mar;64(1):180-8. doi: 10.1111/j.1541-0420.2007.00872.x. Epub 2007 Jul 23.
4
Clustering in the presence of scatter.存在散点情况下的聚类
Biometrics. 2009 Jun;65(2):341-52. doi: 10.1111/j.1541-0420.2008.01064.x. Epub 2008 May 30.
5
Consistent estimation of the expected Brier score in general survival models with right-censored event times.在具有右删失事件时间的一般生存模型中对预期Brier评分进行一致估计。
Biom J. 2006 Dec;48(6):1029-40. doi: 10.1002/bimj.200610301.
6
Cluster detection based on spatial associations and iterated residuals in generalized linear mixed models.基于广义线性混合模型中空间关联和迭代残差的聚类检测
Biometrics. 2009 Jun;65(2):353-60. doi: 10.1111/j.1541-0420.2008.01069.x. Epub 2008 May 11.
7
Inference for clustered inhomogeneous spatial point processes.聚类非均匀空间点过程的推断
Biometrics. 2009 Jun;65(2):423-30. doi: 10.1111/j.1541-0420.2008.01070.x. Epub 2008 May 18.
8
Double inverse-weighted estimation of cumulative treatment effects under nonproportional hazards and dependent censoring.非比例风险和相依删失下累积治疗效果的双重逆加权估计
Biometrics. 2011 Mar;67(1):29-38. doi: 10.1111/j.1541-0420.2010.01449.x.
9
Partly functional temporal process regression with semiparametric profile estimating functions.具有半参数轮廓估计函数的部分功能时间过程回归
Biometrics. 2009 Jun;65(2):431-40. doi: 10.1111/j.1541-0420.2008.01071.x. Epub 2008 May 10.
10
Diagnosis of random-effect model misspecification in generalized linear mixed models for binary response.二元响应广义线性混合模型中随机效应模型误设的诊断
Biometrics. 2009 Jun;65(2):361-8. doi: 10.1111/j.1541-0420.2008.01103.x.

引用本文的文献

1
EFFICIENT ESTIMATION OF THE MAXIMAL ASSOCIATION BETWEEN MULTIPLE PREDICTORS AND A SURVIVAL OUTCOME.多个预测因素与生存结局之间最大关联的有效估计
Ann Stat. 2023 Oct;51(5):1965-1988. doi: 10.1214/23-aos2313. Epub 2023 Dec 14.
2
Predictive signature development based on maximizing the area between receiver operating characteristic curves.基于最大化接收者操作特征曲线之间面积的预测特征开发。
Stat Med. 2022 Nov 20;41(26):5242-5257. doi: 10.1002/sim.9565. Epub 2022 Aug 31.
3
A regularized estimation approach for case-cohort periodic follow-up studies with an application to HIV vaccine trials.

本文引用的文献

1
Doubly penalized buckley-james method for survival data with high-dimensional covariates.用于具有高维协变量生存数据的双重惩罚Buckley-James方法
Biometrics. 2008 Mar;64(1):132-40. doi: 10.1111/j.1541-0420.2007.00877.x. Epub 2007 Aug 3.
2
Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO.通过使用偏最小二乘法和套索算法的加速失效时间模型从微阵列数据预测患者生存率。
Biometrics. 2007 Mar;63(1):259-71. doi: 10.1111/j.1541-0420.2006.00660.x.
3
Path consistent model selection in additive risk model via Lasso.
一种用于病例-队列定期随访研究的正则化估计方法及其在 HIV 疫苗试验中的应用。
Biom J. 2020 Sep;62(5):1176-1191. doi: 10.1002/bimj.201900180. Epub 2020 Feb 20.
4
Marginal screening for high-dimensional predictors of survival outcomes.生存结局高维预测因子的边际筛选
Stat Sin. 2019 Oct;29(4):2105-2139. doi: 10.5705/ss.202017.0298.
5
Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.基于蛋白质表达数据的泛癌生存预测的贝叶斯数据整合和变量选择。
Biometrics. 2020 Mar;76(1):316-325. doi: 10.1111/biom.13132. Epub 2019 Oct 3.
6
A new joint screening method for right-censored time-to-event data with ultra-high dimensional covariates.一种针对具有超高维协变量的右删失事件发生时间数据的新型联合筛选方法。
Stat Methods Med Res. 2020 Jun;29(6):1499-1513. doi: 10.1177/0962280219864710. Epub 2019 Jul 30.
7
INFERENCE FOR LOW-DIMENSIONAL COVARIATES IN A HIGH-DIMENSIONAL ACCELERATED FAILURE TIME MODEL.高维加速失效时间模型中低维协变量的推断
Stat Sin. 2019 Apr;29(2):877-894. doi: 10.5705/ss.202016.0449.
8
Extreme learning machine Cox model for high-dimensional survival analysis.极端学习机 Cox 模型在高维生存分析中的应用。
Stat Med. 2019 May 30;38(12):2139-2156. doi: 10.1002/sim.8090. Epub 2019 Jan 10.
9
Lasso regularization for left-censored Gaussian outcome and high-dimensional predictors.左截断高斯结局和高维预测因子的套索正则化。
BMC Med Res Methodol. 2018 Dec 4;18(1):159. doi: 10.1186/s12874-018-0609-4.
10
Robust sparse accelerated failure time model for survival analysis.用于生存分析的稳健稀疏加速失效时间模型。
Technol Health Care. 2018;26(S1):55-63. doi: 10.3233/THC-174141.
基于套索法的相加风险模型中的路径一致性模型选择
Stat Med. 2007 Sep 10;26(20):3753-70. doi: 10.1002/sim.2834.
4
Variable selection for proportional odds model.比例优势模型的变量选择
Stat Med. 2007 Sep 10;26(20):3771-81. doi: 10.1002/sim.2833.
5
Regularized estimation in the accelerated failure time model with high-dimensional covariates.具有高维协变量的加速失效时间模型中的正则化估计。
Biometrics. 2006 Sep;62(3):813-20. doi: 10.1111/j.1541-0420.2006.00562.x.
6
Application of the time-dependent ROC curves for prognostic accuracy with multiple biomarkers.将时间依赖性ROC曲线应用于多种生物标志物的预后准确性评估。
Biometrics. 2006 Mar;62(1):279-87. doi: 10.1111/j.1541-0420.2005.00441.x.
7
Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.高维小样本情况下的惩罚Cox回归分析及其在微阵列基因表达数据中的应用
Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6.
8
Iterative partial least squares with right-censored data analysis: a comparison to other dimension reduction techniques.带有删失数据分析的迭代偏最小二乘法:与其他降维技术的比较
Biometrics. 2005 Mar;61(1):17-24. doi: 10.1111/j.0006-341X.2005.040304.x.
9
Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival.伤口反应基因表达特征在预测乳腺癌生存中的稳健性、可扩展性和整合性。
Proc Natl Acad Sci U S A. 2005 Mar 8;102(10):3738-43. doi: 10.1073/pnas.0409462102. Epub 2005 Feb 8.
10
A gene-expression signature as a predictor of survival in breast cancer.一种作为乳腺癌生存预测指标的基因表达特征。
N Engl J Med. 2002 Dec 19;347(25):1999-2009. doi: 10.1056/NEJMoa021967.