Enhancing Cure Rate Analysis Through Integration of Machine Learning Models: A Comparative Study.

Authors

Aselisewine Wisdom, Pal Suvra

Affiliations

Department of Mathematics, University of Texas at Arlington, Texas, USA 76019.

Division of Data Science, College of Science, University of Texas at Arlington, Arlington, TX 76019, United States.

Publication information

Stat Comput. 2024 Aug;34(4). doi: 10.1007/s11222-024-10456-y. Epub 2024 Jun 25.

DOI: 10.1007/s11222-024-10456-y

PMID: 39776468

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11706543/

Abstract

Cure rate models have been studied extensively in medicine, reliability, and finance. Combining machine learning (ML) with cure models is emerging as a promising strategy to improve predictive accuracy and to gain insight into the mechanisms that influence the probability of cure. The existing literature has explored the benefits of incorporating a single ML algorithm into cure models, but no comprehensive study compares the performance of several ML algorithms in this context. This paper seeks to fill that gap. Specifically, we focus on the well-known mixture cure model and examine the incorporation of five distinct ML algorithms: extreme gradient boosting, neural networks, support vector machines, random forests, and decision trees. To make the comparison more robust, we also include cure models with logistic and spline-based regression. For parameter estimation, we formulate an expectation maximization algorithm. A comprehensive simulation study is conducted across diverse scenarios to compare the models on the accuracy and precision of estimates for different quantities of interest, along with the predictive accuracy of cure. The results from both the simulation study and the analysis of real cutaneous melanoma data indicate that incorporating ML models into cure models contributes to ongoing efforts to improve the accuracy of cure rate estimation.
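The abstract names the mixture cure model and an expectation maximization algorithm without giving formulas. As a rough illustration only, the sketch below assumes the standard mixture cure formulation, in which the population survival function is S_pop(t) = (1 - pi) + pi * S_u(t), where pi is the probability of being uncured and S_u is the survival function of the uncured. It computes the usual E-step quantity: the conditional probability that a subject is uncured given the observed data. All function names are hypothetical, the exponential latency distribution is an assumption made for the example and not the paper's choice, and pi is passed in as a plain number where the paper would obtain it from an ML classifier or a logistic or spline-based regression.

```python
import math


def exponential_survival(t, rate):
    """Survival function of the uncured group (illustrative assumption only)."""
    return math.exp(-rate * t)


def population_survival(pi_uncured, surv_uncured):
    """Mixture cure model: S_pop(t) = (1 - pi) + pi * S_u(t)."""
    return (1.0 - pi_uncured) + pi_uncured * surv_uncured


def e_step_weight(delta, pi_uncured, surv_uncured):
    """Conditional probability that a subject is uncured.

    delta = 1: the event was observed, so the subject is uncured for certain.
    delta = 0: the subject is censored, so the weight is the uncured share
               of the population survival at the censoring time.
    """
    if delta == 1:
        return 1.0
    return pi_uncured * surv_uncured / population_survival(pi_uncured, surv_uncured)


# Hypothetical subjects: (observed time, event indicator, uncured probability).
subjects = [(1.5, 1, 0.7), (4.0, 0, 0.7), (9.0, 0, 0.3)]
rate = 0.4  # hypothetical hazard rate for the uncured group

weights = [
    e_step_weight(delta, pi, exponential_survival(t, rate))
    for t, delta, pi in subjects
]
print(weights)
```

In a full EM iteration these weights would act as soft labels: the incidence part (the classifier for cured versus uncured) and the latency part (the survival distribution of the uncured) are refit against them, and the two steps repeat until the estimates stabilize.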
