Huang Yangxin, Yan Chunning, Xing Dongyuan, Zhang Nanhua, Chen Henian
a Department of Epidemiology and Biostatistics , College of Public Health, University of South Florida , Tampa , Florida , USA.
J Biopharm Stat. 2015;25(4):670-94. doi: 10.1080/10543406.2014.920866.
In longitudinal studies it is often of interest to investigate how a repeatedly measured marker in time is associated with a time to an event of interest. This type of research question has given rise to a rapidly developing field of biostatistics research that deals with the joint modeling of longitudinal and time-to-event data. Normality of model errors in longitudinal model is a routine assumption, but it may be unrealistically obscuring important features of subject variations. Covariates are usually introduced in the models to partially explain between- and within-subject variations, but some covariates such as CD4 cell count may be often measured with substantial errors. Moreover, the responses may encounter nonignorable missing. Statistical analysis may be complicated dramatically based on longitudinal-survival joint models where longitudinal data with skewness, missing values, and measurement errors are observed. In this article, we relax the distributional assumptions for the longitudinal models using skewed (parametric) distribution and unspecified (nonparametric) distribution placed by a Dirichlet process prior, and address the simultaneous influence of skewness, missingness, covariate measurement error, and time-to-event process by jointly modeling three components (response process with missing values, covariate process with measurement errors, and time-to-event process) linked through the random-effects that characterize the underlying individual-specific longitudinal processes in Bayesian analysis. The method is illustrated with an AIDS study by jointly modeling HIV/CD4 dynamics and time to viral rebound in comparison with potential models with various scenarios and different distributional specifications.
在纵向研究中,经常会关注随时间重复测量的标志物如何与感兴趣事件的发生时间相关联。这类研究问题催生了一个快速发展的生物统计学研究领域,该领域处理纵向数据和事件发生时间数据的联合建模。纵向模型中模型误差的正态性是一个常规假设,但它可能不切实际地掩盖了个体变异的重要特征。通常在模型中引入协变量以部分解释个体间和个体内的变异,但一些协变量,如CD4细胞计数,可能经常存在大量测量误差。此外,响应可能会出现不可忽略的缺失值。基于观察到具有偏态、缺失值和测量误差的纵向数据的纵向-生存联合模型,统计分析可能会显著复杂化。在本文中,我们使用偏态(参数)分布和由狄利克雷过程先验设定的未指定(非参数)分布来放宽纵向模型的分布假设,并通过对通过随机效应链接的三个组件(具有缺失值的响应过程、具有测量误差的协变量过程和事件发生时间过程)进行联合建模,来解决偏态、缺失值、协变量测量误差和事件发生时间过程的同时影响,这些随机效应表征了贝叶斯分析中潜在的个体特定纵向过程。通过对HIV/CD4动态和病毒反弹时间进行联合建模,并与具有各种情景和不同分布规格的潜在模型进行比较,用一项艾滋病研究对该方法进行了说明。