Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom.
Centre for Artificial Intelligence in Medicine, University of Cambridge, Cambridge, United Kingdom.
PLoS One. 2021 Aug 6;16(8):e0253425. doi: 10.1371/journal.pone.0253425. eCollection 2021.
Statistical models for outcome prediction are central to traumatic brain injury research and critical to baseline risk adjustment. Glasgow coma score (GCS) and pupil reactivity are crucial covariates in all such models but may be measured at multiple time points between the time of injury and hospital and are subject to a variable degree of unreliability and/or missingness. Imputation of missing data may be undertaken using full multiple imputation or by simple substitution of measurements from other time points. However, it is unknown which strategy is best or which time points are more predictive. We evaluated the pseudo-R2 of logistic regression models (dichotomous survival) and proportional odds models (Glasgow Outcome Score-extended) using different imputation strategies on the The Collaborative European NeuroTrauma Effectiveness Research in Traumatic Brain Injury (CENTER-TBI) study dataset. Substitution strategies were easy to implement, achieved low levels of missingness (<< 10%) and could outperform multiple imputation without the need for computationally costly calculations and pooling multiple final models. While model performance was sensitive to imputation strategy, this effect was small in absolute terms and clinical relevance. A strategy of using the emergency department discharge assessments and working back in time when these were missing generally performed well. Full multiple imputation had the advantage of preserving time-dependence in the models: the pre-hospital assessments were found to be relatively unreliable predictors of survival or outcome. The predictive performance of later assessments was model-dependent. In conclusion, simple substitution strategies for imputing baseline GCS and pupil response can perform well and may be a simple alternative to full multiple imputation in many cases.
统计模型在创伤性脑损伤研究中至关重要,是预后预测的核心,对基线风险调整也非常关键。格拉斯哥昏迷评分(GCS)和瞳孔反应是所有此类模型中的关键协变量,但可能在受伤和住院之间的多个时间点进行测量,并且存在不同程度的不可靠性和/或缺失。缺失数据可以通过完全多重插补或简单替代其他时间点的测量值进行插补。然而,目前尚不清楚哪种策略最好,或者哪些时间点更具预测性。我们使用不同的插补策略对协同欧洲神经创伤效应研究在创伤性脑损伤(CENTER-TBI)研究数据集进行了逻辑回归模型(二项生存)和比例优势模型(格拉斯哥结局评分扩展)的伪 R2 评估。替代策略易于实施,缺失率低(<<10%),并且可以优于多重插补,而无需进行计算成本高的计算和汇集多个最终模型。虽然模型性能对插补策略敏感,但这种效果在绝对值和临床相关性方面很小。使用急诊科出院评估并在缺失时回溯时间的策略通常表现良好。完全多重插补的优点是在模型中保留时间依赖性:发现院前评估是生存或预后的相对不可靠预测因子。后期评估的预测性能取决于模型。总之,插补基线 GCS 和瞳孔反应的简单替代策略可以表现良好,并且在许多情况下可能是完全多重插补的简单替代方法。