Department of Statistics and Clinical Studies, NHS Blood and Transplant, Bristol, UK.
Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
Stat Med. 2021 Apr 15;40(8):1917-1929. doi: 10.1002/sim.8879. Epub 2021 Jan 19.
In patient follow-up studies, events of interest may take place between periodic clinical assessments and so the exact time of onset is not observed. Such events are known as "bounded" or "interval-censored." Methods for handling such events can be categorized as either (i) applying multiple imputation (MI) strategies or (ii) taking a full likelihood-based (LB) approach. We focused on MI strategies, rather than LB methods, because of their flexibility. We evaluated MI strategies for bounded event times in a competing risks analysis, examining the extent to which interval boundaries, features of the data distribution and substantive analysis model are accounted for in the imputation model. Candidate imputation models were predictive mean matching (PMM); log-normal regression with postimputation back-transformation; normal regression with and without restrictions on the imputed values and Delord and Genin's method based on sampling from the cumulative incidence function. We used a simulation study to compare MI methods and one LB method when data were missing at random and missing not at random, also varying the proportion of missing data, and then applied the methods to a hematopoietic stem cell transplantation dataset. We found that cumulative incidence and median event time estimation were sensitive to model misspecification. In a competing risks analysis, we found that it is more important to account for features of the data distribution than to restrict imputed values based on interval boundaries or to ensure compatibility with the substantive analysis by sampling from the cumulative incidence function. We recommend MI by type 1 PMM.
在患者随访研究中,感兴趣的事件可能发生在定期临床评估之间,因此无法观察到发病的确切时间。此类事件被称为“有界”或“区间删失”。处理此类事件的方法可分为(i)应用多重插补(MI)策略或(ii)采用完全似然(LB)方法。我们专注于 MI 策略,而不是 LB 方法,因为 MI 策略更加灵活。我们在竞争风险分析中评估了有界事件时间的 MI 策略,考察了插补模型在多大程度上考虑了区间边界、数据分布特征和实质性分析模型。候选插补模型包括预测均值匹配(PMM);对数正态回归后进行插补值的反转换;正常回归,包括和不包括对插补值的限制以及 Delord 和 Genin 基于累积发生率函数抽样的方法。我们使用模拟研究比较了 MI 方法和一种 LB 方法在随机缺失和非随机缺失数据时的情况,同时还改变了缺失数据的比例,然后将这些方法应用于造血干细胞移植数据集。我们发现,累积发生率和中位事件时间估计对模型的误设定很敏感。在竞争风险分析中,我们发现,更重要的是要考虑数据分布的特征,而不是基于区间边界限制插补值或通过从累积发生率函数中抽样来确保与实质性分析兼容。我们推荐使用类型 1 PMM 的 MI。