Sheikh Md Tuhin, Ibrahim Joseph G, Gelfond Jonathan A, Sun Wei, Chen Ming-Hui
Department of Statistics, University of Connecticut, Storrs, CT, USA.
Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
Stat Modelling. 2021 Feb;21(1-2):72-94. doi: 10.1177/1471082X20944620. Epub 2020 Sep 25.
This research is motivated from the data from a large Selenium and Vitamin E Cancer Prevention Trial (SELECT). The prostate specific antigens (PSAs) were collected longitudinally, and the survival endpoint was the time to low-grade cancer or the time to high-grade cancer (competing risks). In this article, the goal is to model the longitudinal PSA data and the time-to-prostate cancer (PC) due to low- or high-grade. We consider the low-grade and high-grade as two competing causes of developing PC. A joint model for simultaneously analysing longitudinal and time-to-event data in the presence of multiple causes of failure (or competing risk) is proposed within the Bayesian framework. The proposed model allows for handling the missing causes of failure in the SELECT data and implementing an efficient Markov chain Monte Carlo sampling algorithm to sample from the posterior distribution via a novel reparameterization technique. Bayesian criteria, ΔDIC, and ΔWAIC, are introduced to quantify the gain in fit in the survival sub-model due to the inclusion of longitudinal data. A simulation study is conducted to examine the empirical performance of the posterior estimates as well as ΔDIC and ΔWAIC and a detailed analysis of the SELECT data is also carried out to further demonstrate the proposed methodology.
本研究的动机来自于一项大型硒和维生素E癌症预防试验(SELECT)的数据。纵向收集前列腺特异性抗原(PSA),生存终点为低级别癌症发生时间或高级别癌症发生时间(竞争风险)。在本文中,目标是对纵向PSA数据以及低级别或高级别前列腺癌(PC)发生时间进行建模。我们将低级别和高级别视为发生PC的两个竞争原因。在贝叶斯框架内,提出了一种用于在存在多种失败原因(或竞争风险)的情况下同时分析纵向数据和事件发生时间数据的联合模型。所提出的模型允许处理SELECT数据中缺失的失败原因,并通过一种新颖的重新参数化技术实现一种有效的马尔可夫链蒙特卡罗抽样算法,以便从后验分布中抽样。引入贝叶斯准则、ΔDIC和ΔWAIC来量化由于纳入纵向数据而在生存子模型中拟合度的提高。进行了一项模拟研究,以检验后验估计以及ΔDIC和ΔWAIC的实证性能,并对SELECT数据进行了详细分析,以进一步证明所提出的方法。