基于伪观测的 AUC 损失的多类型数据生存堆叠。

Survival stacking with multiple data types using pseudo-observation-based-AUC loss.

机构信息

Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Solna, Sweden.

出版信息

J Biopharm Stat. 2022 Nov 2;32(6):858-870. doi: 10.1080/10543406.2022.2041655. Epub 2022 May 15.

DOI:10.1080/10543406.2022.2041655

Abstract

There have been many strategies to adapt machine learning algorithms to account for right censored observations in survival data in order to build more accurate risk prediction models. These adaptions have included pre-processing steps such as pseudo-observation transformation of the survival outcome or inverse probability of censoring weighted (IPCW) bootstrapping of the observed binary indicator of an event prior to a time point of interest. These pre-processing steps allow existing or newly developed machine learning methods, which were not specifically developed with time-to-event data in mind, to be applied to right censored survival data for predicting the risk of experiencing an event. Stacking or ensemble methods can improve on risk predictions, but in general, the combination of pseudo-observation-based algorithms, IPCW bootstrapping, IPC weighting of the methods directly, and methods developed specifically for survival has not been considered in the same ensemble. In this paper, we propose an ensemble procedure based on the area under the pseudo-observation-based-time-dependent ROC curve to optimally stack predictions from any survival or survival adapted algorithm. The real application results show that our proposed method can improve on single survival based methods such as survival random forest or on other strategies that use a pre-processing step such as inverse probability of censoring weighted bagging or pseudo-observations alone.

摘要

已经有许多策略可以使机器学习算法适应生存数据中的右删失观测值，以构建更准确的风险预测模型。这些自适应方法包括预处理步骤，例如对生存结局进行伪观测转换，或者在感兴趣的时间点之前对事件的观测二元指示符进行逆概率 censoring 加权（Inverse Probability of Censoring Weighting，IPCW）引导。这些预处理步骤允许现有的或新开发的机器学习方法（这些方法不是专门为时间事件数据开发的）应用于右删失生存数据，以预测经历事件的风险。堆叠或集成方法可以提高风险预测，但一般来说，基于伪观测的算法、IPCW 引导、方法的直接 IPC 加权以及专门为生存开发的方法的组合尚未在同一集成中考虑。在本文中，我们提出了一种基于基于伪观测的时间相关 ROC 曲线下面积的集成程序，以最优地堆叠任何生存或生存适应算法的预测。实际应用结果表明，我们提出的方法可以改进基于生存的单一方法，如生存随机森林，或者改进其他使用预处理步骤（如逆概率 censoring 加权套袋或伪观测）的策略。

相似文献

Survival stacking with multiple data types using pseudo-observation-based-AUC loss.基于伪观测的 AUC 损失的多类型数据生存堆叠。

J Biopharm Stat. 2022 Nov 2;32(6):858-870. doi: 10.1080/10543406.2022.2041655. Epub 2022 May 15.

Adapting machine learning techniques to censored time-to-event health record data: A general-purpose approach using inverse probability of censoring weighting.使机器学习技术适用于删失的事件发生时间健康记录数据：一种使用删失加权逆概率的通用方法。

J Biomed Inform. 2016 Jun;61:119-31. doi: 10.1016/j.jbi.2016.03.009. Epub 2016 Mar 16.

Comparison of baseline covariate adjustment methods for restricted mean survival time.限制平均生存时间的基线协变量调整方法比较。

Contemp Clin Trials. 2024 Mar;138:107440. doi: 10.1016/j.cct.2024.107440. Epub 2024 Jan 14.

Regression modeling of restricted mean survival time for left-truncated right-censored data.左截断右删失数据的限制平均生存时间的回归建模。

Stat Med. 2022 Jul 20;41(16):3003-3021. doi: 10.1002/sim.9399. Epub 2022 Mar 28.

Correcting for dependent censoring in routine outcome monitoring data by applying the inverse probability censoring weighted estimator.通过应用逆概率删失加权估计量对常规结局监测数据中的依存删失进行校正。

Stat Methods Med Res. 2018 Feb;27(2):323-335. doi: 10.1177/0962280216628900. Epub 2016 Mar 17.

Model selection for survival individualized treatment rules using the jackknife estimator.利用刀切估计量进行生存个体化治疗规则的模型选择。

BMC Med Res Methodol. 2022 Dec 22;22(1):328. doi: 10.1186/s12874-022-01811-6.

Deep learning for survival outcomes.用于生存结果的深度学习。

Stat Med. 2020 Jul 30;39(17):2339-2349. doi: 10.1002/sim.8542. Epub 2020 Apr 13.

Calibration plots for multistate risk predictions models.多状态风险预测模型的校准图。

Stat Med. 2024 Jun 30;43(14):2830-2852. doi: 10.1002/sim.10094. Epub 2024 May 8.

On the role of Volterra integral equations in self-consistent, product-limit, inverse probability of censoring weighted, and redistribution-to-the-right estimators for the survival function.在生存函数的自洽、乘积限、逆概率删失加权和重分配到右估计中，沃尔泰拉积分方程的作用。

Lifetime Data Anal. 2024 Jul;30(3):649-666. doi: 10.1007/s10985-024-09623-0. Epub 2024 Mar 21.

Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA？

Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.

引用本文的文献

A novel non-negative Bayesian stacking modeling method for Cancer survival prediction using high-dimensional omics data.一种使用高维组学数据进行癌症生存预测的新型非负贝叶斯堆叠建模方法。

BMC Med Res Methodol. 2024 May 3;24(1):105. doi: 10.1186/s12874-024-02232-3.

Differential network connectivity analysis for microbiome data adjusted for clinical covariates using jackknife pseudo-values.基于 Jackknife 伪值调整临床协变量的微生物组数据的差异网络连通性分析。

BMC Bioinformatics. 2024 Mar 18;25(1):117. doi: 10.1186/s12859-024-05689-7.

Comparison of models for stroke-free survival prediction in patients with CADASIL.CADASIL 患者无卒中生存预测模型的比较。

Sci Rep. 2023 Dec 17;13(1):22443. doi: 10.1038/s41598-023-49552-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于伪观测的 AUC 损失的多类型数据生存堆叠。

Survival stacking with multiple data types using pseudo-observation-based-AUC loss.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献