Department of Psychology, University of Turin, Turin, Italy.
Unit of Cancer Epidemiology, "Città della Salute e della Scienza" Hospital-University of Turin and CPO Piemonte, Turin, Italy.
J Eval Clin Pract. 2021 Feb;27(1):34-41. doi: 10.1111/jep.13376. Epub 2020 Feb 26.
RATIONALE, AIMS, AND OBJECTIVES: Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later.
The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later.
Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates.
Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.
原理、目的和目标:缺失数据是纵向研究中的一个挑战。本研究的目的是使用部分缺失数据且 2 年后完整补全的真实数据集,比较多元正态插补和完全条件指定方法的性能。
使用的数据来自一项正在进行的、随访 5 年的随机对照试验。在某个时间点,我们观察到一些存在缺失数据的患者和一些因尚未符合特定随访条件而未被观察到的患者。对未观察到的数据和缺失数据都进行了插补。将插补的未观察到的数据与 2 年后获得的相应真实信息进行了比较。
两种插补方法在准确性测量上表现相似,产生的估计值最小偏差。
尽管存在大量具有间歇性缺失数据的重复测量和数据的非正态多元分布,但两种方法都表现良好,无法确定哪种方法更好。