Department of Biostatistics and Data Science, The University of Texas Health Science Center at Houston, Houston, Texas, USA.
Department of Biostatistics and Bioinformatics, Duke University Medical Center, Durham, North Carolina, USA.
Stat Med. 2022 Jul 10;41(15):2894-2907. doi: 10.1002/sim.9392. Epub 2022 Mar 28.
The joint model for longitudinal and survival data improves time-to-event predictions by including longitudinal outcome variables in addition to baseline covariates. However, in practice, joint models may be limited by parametric assumptions in both the longitudinal and survival submodels. In addition, computational difficulties arise when considering multiple longitudinal outcomes due to the large number of random effects to be integrated out in the full likelihood. In this article, we discuss several recent machine learning methods for incorporating multivariate longitudinal data for time-to-event prediction. The presented methods use functional data analysis or convolutional neural networks to model the longitudinal data, both of which scale well to multiple longitudinal outcomes. In addition, we propose a novel architecture based on the transformer neural network, named TransformerJM, which jointly models longitudinal and time-to-event data. The prognostic abilities of each model are assessed and compared through both simulation and real data analysis on Alzheimer's disease datasets. Specifically, the models were evaluated based on their ability to dynamically update predictions as new longitudinal data becomes available. We showed that TransformerJM improves upon the predictive performance of existing methods across different scenarios.
联合纵向和生存数据模型通过在基线协变量之外加入纵向结局变量来改善事件时间预测。然而,在实践中,联合模型可能受到纵向和生存子模型中参数假设的限制。此外,当考虑多个纵向结局时,由于在全似然中需要整合大量随机效应,因此会出现计算困难。在本文中,我们讨论了几种最近的机器学习方法,用于将多元纵向数据纳入事件时间预测。所提出的方法使用功能数据分析或卷积神经网络来对纵向数据进行建模,这两种方法都可以很好地扩展到多个纵向结局。此外,我们提出了一种基于转换器神经网络的新架构,名为 TransformerJM,它联合对纵向数据和事件时间数据进行建模。通过对阿尔茨海默病数据集的模拟和真实数据分析,评估并比较了每个模型的预后能力。具体来说,评估了这些模型根据新的纵向数据可用情况动态更新预测的能力。我们表明,TransformerJM 在不同场景下均提高了现有方法的预测性能。