Liang Fu-Wen, Chan Wenyaw, Swartz Michael D, Dabaja Bouthaina S
Department of Public Health, College of Health Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan.
Center for Big Data Research, Kaohsiung Medical University, Kaohsiung, Taiwan.
BMC Med Res Methodol. 2025 May 14;25(1):132. doi: 10.1186/s12874-025-02580-8.
Finite mixture models have been recently applied in time-to-event data to identify subgroups with distinct hazard functions, yet they often assume differing covariate effects on failure times across latent classes but homogeneous covariate distributions. This study aimed to develop a method for analyzing time-to-event data while accounting for unobserved heterogeneity within a mixture modeling framework.
A joint model was developed to incorporate latent survival trajectories and observed information for the joint analysis of time-to-event outcomes, correlated discrete and continuous covariates, and a latent class variable. It assumed covariate effects on survival times and covariate distributions vary across latent classes. Unobservable trajectories were identified by estimating the probability of belonging to a particular class based on observed information. This method was applied to a Hodgkin lymphoma study, identifying four distinct classes in terms of long-term survival and distributions of prognostic factors.
Results from simulation studies and the Hodgkin lymphoma study demonstrated the superiority of our joint model compared with the conventional survival model. Four unobserved subgroups were identified, each characterized by distinct survival parameters and varying distributions of prognostic factors. A notable decreasing trend in the incidence of second malignancy over time was noted, along with different effects of second malignancy and relapse on survival across subgroups, providing deeper insights into disease progression over time.
The proposed joint model effectively identifies latent subgroups, revealing unobserved heterogeneity in survival outcomes and prognostic factors. Its flexibility enables more precise estimation of survival trajectories, with broad applicability in survival analysis.
有限混合模型最近已应用于生存时间数据,以识别具有不同风险函数的亚组,但它们通常假设潜在类别对失败时间的协变量效应不同,而协变量分布是同质的。本研究旨在开发一种在混合建模框架内考虑未观察到的异质性的同时分析生存时间数据的方法。
开发了一个联合模型,纳入潜在生存轨迹和观察信息,用于对生存时间结局、相关离散和连续协变量以及一个潜在类别变量进行联合分析。它假设潜在类别对生存时间的协变量效应和协变量分布各不相同。通过根据观察信息估计属于特定类别的概率来识别不可观察的轨迹。该方法应用于一项霍奇金淋巴瘤研究,从长期生存和预后因素分布方面识别出四个不同的类别。
模拟研究和霍奇金淋巴瘤研究的结果表明,我们的联合模型优于传统生存模型。识别出四个未观察到的亚组,每个亚组具有不同的生存参数和不同的预后因素分布。随着时间的推移,观察到第二原发性恶性肿瘤的发病率有明显下降趋势,同时第二原发性恶性肿瘤和复发对各亚组生存的影响不同,这为随时间推移的疾病进展提供了更深入的见解。
所提出的联合模型有效地识别了潜在亚组,揭示了生存结局和预后因素中未观察到的异质性。其灵活性能够更精确地估计生存轨迹,在生存分析中具有广泛的适用性。