纵向数据中缺失复合结局的多重填补

Multiple Imputation of Missing Composite Outcomes in Longitudinal Data.

作者信息

O'Keeffe Aidan G, Farewell Daniel M, Tom Brian D M, Farewell Vernon T

机构信息

Department of Statistical Science, University College London, Gower St., London, WC1E 6BT UK.

Institute of Primary Care and Public Health, Cardiff University School of Medicine, Neuadd Meirionnydd, Heath Park, Cardiff, CF14 4YS UK.

出版信息

Stat Biosci. 2016;8(2):310-332. doi: 10.1007/s12561-016-9146-z. Epub 2016 Apr 5.

DOI:10.1007/s12561-016-9146-z

PMID:27729945

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5035329/

Abstract

In longitudinal randomised trials and observational studies within a medical context, a composite outcome-which is a function of several individual patient-specific outcomes-may be felt to best represent the outcome of interest. As in other contexts, missing data on patient outcome, due to patient drop-out or for other reasons, may pose a problem. Multiple imputation is a widely used method for handling missing data, but its use for composite outcomes has been seldom discussed. Whilst standard multiple imputation methodology can be used directly for the composite outcome, the distribution of a composite outcome may be of a complicated form and perhaps not amenable to statistical modelling. We compare direct multiple imputation of a composite outcome with separate imputation of the components of a composite outcome. We consider two imputation approaches. One approach involves modelling each component of a composite outcome using standard likelihood-based models. The other approach is to use linear increments methods. A linear increments approach can provide an appealing alternative as assumptions concerning both the missingness structure within the data and the imputation models are different from the standard likelihood-based approach. We compare both approaches using simulation studies and data from a randomised trial on early rheumatoid arthritis patients. Results suggest that both approaches are comparable and that for each, separate imputation offers some improvement on the direct imputation of a composite outcome.

摘要

在医学背景下的纵向随机试验和观察性研究中，复合结局（它是几个个体患者特定结局的函数）可能被认为最能代表感兴趣的结局。与其他情况一样，由于患者退出或其他原因导致的患者结局数据缺失可能会带来问题。多重填补是处理缺失数据的一种广泛使用的方法，但其在复合结局中的应用很少被讨论。虽然标准的多重填补方法可以直接用于复合结局，但复合结局的分布可能形式复杂，也许不适用于统计建模。我们比较了复合结局的直接多重填补与复合结局各组成部分的单独填补。我们考虑两种填补方法。一种方法是使用基于标准似然模型对复合结局的每个组成部分进行建模。另一种方法是使用线性增量法。线性增量法可以提供一种有吸引力的替代方法，因为关于数据中的缺失结构和填补模型的假设与基于标准似然的方法不同。我们使用模拟研究和一项针对早期类风湿性关节炎患者的随机试验数据对这两种方法进行了比较。结果表明这两种方法具有可比性，并且对于每种方法，单独填补在复合结局的直接填补基础上都有一定改进。