Chen Jiaqing, Huang Yangxin, Wang Qing
Department of Statistics, College of Science, Wuhan University of Technology, Wuhan, China.
College of Public Health, University of South Florida, Tampa, Florida, USA.
Stat Med. 2023 Nov 30;42(27):4972-4989. doi: 10.1002/sim.9896. Epub 2023 Sep 5.
Joint models and statistical inference for longitudinal and survival data have been an active area of statistical research and have mostly coupled a longitudinal biomarker-based mixed-effects model with normal distribution and an event time-based survival model. In practice, however, the following issues may standout: (i) Normality of model error in longitudinal models is a routine assumption, but it may be unrealistically violating data features of subject variations. (ii) Data collected are often featured by the mixed types of multiple longitudinal outcomes which are significantly correlated, ignoring their correlation may lead to biased estimation. Additionally, a parametric model specification may be inflexible to capture the complicated patterns of longitudinal data. (iii) Missing observations in the longitudinal data are often encountered; the missing measures are likely to be informative (nonignorable) and ignoring this phenomenon may result in inaccurate inference. Multilevel item response theory (MLIRT) models have been increasingly used to analyze the multiple longitudinal data of mixed types (ie, continuous and categorical) in clinical studies. In this article, we develop an MLIRT-based semiparametric joint model with skew-t distribution that consists of an extended MLIRT model for the mixed types of multiple longitudinal data and a Cox proportional hazards model, linked through random-effects. A Bayesian approach is employed for joint modeling. Simulation studies are conducted to assess performance of the proposed models and method. A real example from primary biliary cirrhosis clinical study is analyzed to estimate parameters in the joint model and also evaluate sensitivity of parameter estimates for various plausible nonignorable missing data mechanisms.
纵向和生存数据的联合模型与统计推断一直是统计学研究的一个活跃领域,大多将基于纵向生物标志物的正态分布混合效应模型与基于事件时间的生存模型相结合。然而,在实际应用中,可能会出现以下问题:(i)纵向模型中模型误差的正态性是一个常规假设,但它可能不符合实际,违反了个体变异的数据特征。(ii)收集到的数据通常具有多种显著相关的纵向结果的混合类型,忽略它们之间的相关性可能会导致有偏估计。此外,参数模型设定可能不够灵活,无法捕捉纵向数据的复杂模式。(iii)纵向数据中经常会遇到缺失观测值;缺失的测量值很可能是信息性的(不可忽略的),忽略这一现象可能会导致不准确的推断。多级项目反应理论(MLIRT)模型已越来越多地用于分析临床研究中混合类型(即连续型和分类型)的多个纵向数据。在本文中,我们开发了一种基于MLIRT的半参数联合模型,该模型具有偏态t分布,由用于多种混合类型纵向数据的扩展MLIRT模型和Cox比例风险模型组成,通过随机效应进行链接。采用贝叶斯方法进行联合建模。进行模拟研究以评估所提出的模型和方法的性能。对原发性胆汁性肝硬化临床研究的一个真实例子进行分析,以估计联合模型中的参数,并评估参数估计对各种合理的不可忽略缺失数据机制的敏感性。