The Department of Epidemiology and Biostatistics, University of Western Ontario, London, ON, Canada.
The Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
Biom J. 2020 Oct;62(6):1371-1393. doi: 10.1002/bimj.201900112. Epub 2020 Mar 20.
In clinical research and practice, landmark models are commonly used to predict the risk of an adverse future event, using patients' longitudinal biomarker data as predictors. However, these data are often observable only at intermittent visits, making their measurement times irregularly spaced and unsynchronized across different subjects. This poses challenges to conducting dynamic prediction at any post-baseline time. A simple solution is the last-value-carry-forward method, but this may result in bias for the risk model estimation and prediction. Another option is to jointly model the longitudinal and survival processes with a shared random effects model. However, when dealing with multiple biomarkers, this approach often results in high-dimensional integrals without a closed-form solution, and thus the computational burden limits its software development and practical use. In this article, we propose to process the longitudinal data by functional principal component analysis techniques, and then use the processed information as predictors in a class of flexible linear transformation models to predict the distribution of residual time-to-event occurrence. The measurement schemes for multiple biomarkers are allowed to be different within subject and across subjects. Dynamic prediction can be performed in a real-time fashion. The advantages of our proposed method are demonstrated by simulation studies. We apply our approach to the African American Study of Kidney Disease and Hypertension, predicting patients' risk of kidney failure or death by using four important longitudinal biomarkers for renal functions.
在临床研究和实践中,常用 landmark 模型来预测未来不良事件的风险,使用患者的纵向生物标志物数据作为预测因子。然而,这些数据通常只能在间歇性就诊时观察到,导致其测量时间在不同个体之间不规律地间隔且不同步。这对在任何基线后时间进行动态预测提出了挑战。一种简单的解决方案是最后观测值传递法,但这可能导致风险模型估计和预测的偏差。另一种选择是使用具有共享随机效应模型的纵向和生存过程联合模型。然而,当处理多个生物标志物时,这种方法通常会导致高维积分而没有闭式解,因此计算负担限制了其软件开发和实际应用。在本文中,我们提出通过功能主成分分析技术来处理纵向数据,然后将处理后的信息用作一类灵活线性变换模型的预测因子,以预测剩余事件发生时间的分布。允许在个体内和个体间对多个生物标志物的测量方案有所不同。可以实时进行动态预测。通过模拟研究证明了我们提出的方法的优势。我们将我们的方法应用于非洲裔美国人肾脏病和高血压研究,通过使用四个重要的肾功能纵向生物标志物来预测患者发生肾衰竭或死亡的风险。