Pal Suvra, Peng Yingwei, Aselisewine Wisdom
Department of Mathematics, University of Texas at Arlington, TX, 76019, USA.
Department of Public Health Sciences, Queen's University, Kingston, Ontario, K7L 3N6, Canada.
Comput Stat. 2024 Jul;39(5):2743-2769. doi: 10.1007/s00180-023-01389-7. Epub 2023 Jul 15.
We consider interval censored data with a cured subgroup that arises from longitudinal followup studies with a heterogeneous population where a certain proportion of subjects is not susceptible to the event of interest. We propose a two component mixture cure model, where the first component describing the probability of cure is modeled by a support vector machine-based approach and the second component describing the survival distribution of the uncured group is modeled by a proportional hazard structure. Our proposed model provides flexibility in capturing complex effects of covariates on the probability of cure unlike the traditional models that rely on modeling the cure probability using a generalized linear model with a known link function. For the estimation of model parameters, we develop an expectation maximization-based estimation algorithm. We conduct simulation studies and show that our proposed model performs better in capturing complex effects of covariates on the cure probability when compared to the traditional logit link-based two component mixture cure model. This results in more accurate (smaller bias) and precise (smaller mean square error) estimates of the cure probabilities, which in-turn improves the predictive accuracy of the latent cured status. We further show that our model's ability to capture complex covariate effects also improves the estimation results corresponding to the survival distribution of the uncured. Finally, we apply the proposed model and estimation procedure to an interval censored data on smoking cessation.
我们考虑来自具有异质性总体的纵向随访研究中的区间删失数据,其中存在一个治愈亚组,该总体中有一定比例的受试者对感兴趣的事件不敏感。我们提出了一种双组分混合治愈模型,其中描述治愈概率的第一组分采用基于支持向量机的方法建模,描述未治愈组生存分布的第二组分采用比例风险结构建模。与依赖于使用具有已知连接函数的广义线性模型对治愈概率进行建模的传统模型不同,我们提出的模型在捕捉协变量对治愈概率的复杂影响方面具有灵活性。对于模型参数的估计,我们开发了一种基于期望最大化的估计算法。我们进行了模拟研究,结果表明,与传统的基于逻辑连接的双组分混合治愈模型相比,我们提出的模型在捕捉协变量对治愈概率的复杂影响方面表现更好。这导致对治愈概率的估计更准确(偏差更小)且更精确(均方误差更小),进而提高了潜在治愈状态的预测准确性。我们进一步表明,我们的模型捕捉复杂协变量效应的能力也改善了与未治愈者生存分布相对应的估计结果。最后,我们将提出的模型和估计程序应用于戒烟的区间删失数据。