School of Public Health and Preventive Medicine, Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Victoria.
School of Public Health and Preventive Medicine, Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Victoria; School of Public Health, Department of Health Policy and Management, Curtin University, Western Australia.
J Thorac Cardiovasc Surg. 2017 May;153(5):1128-1135.e3. doi: 10.1016/j.jtcvs.2016.11.028. Epub 2016 Nov 19.
To compare the impact of different variable selection methods in multiple regression to develop a parsimonious model for predicting postoperative outcomes of patients undergoing cardiac surgery.
Data from 84,135 patients in the Australian and New Zealand Society of Cardiac and Thoracic Surgeons registry between 2001 and 2014 were analyzed. Primary outcome was 30-day-mortality. Mixed-effect logistic regressions were used to build the model. Missing values were imputed by the use of multiple imputations. The following 5 variable selection methods were compared: bootstrap receiver-operative characteristic (ROC), bootstrap Akaike information criteria, bootstrap Bayesian information criteria, and stepwise forward and stepwise backward methods. The final model's prediction performance was evaluated by the use of Frank Harrell's calibration curve and using a multifold cross-validation approach.
Stepwise forward and backward methods selected same set of 21 variables into the model with the area under the ROC (AUC) of 0.8490. The bootstrap ROC method selected 13 variables with AUC of 0.8450. Bootstrap Bayesian information criteria and Akaike information criteria respectively selected 16 (AUC: 0.8470) and 23 (AUC: 0.8491) variables. Bootstrap ROC model was selected as the final model which showed very good discrimination and calibration power.
Clinical suitability in terms of parsimony and prediction performance can be achieved substantially by using the bootstrap ROC method for the development of risk prediction models.
比较多元回归中不同变量选择方法对心脏手术后患者术后结局预测模型的影响,以建立一个简洁的模型。
分析了 2001 年至 2014 年间澳大利亚和新西兰心胸外科协会注册中心 84135 例患者的数据。主要结局为 30 天死亡率。采用混合效应逻辑回归建立模型。使用多重插补法处理缺失值。比较了以下 5 种变量选择方法:Bootstrap 接收者操作特征(ROC)、Bootstrap Akaike 信息准则、Bootstrap Bayesian 信息准则和逐步向前法和逐步向后法。使用 Frank Harrell 的校准曲线和多次交叉验证方法评估最终模型的预测性能。
逐步向前和向后法选择了相同的 21 个变量进入模型,ROC 曲线下面积(AUC)为 0.8490。Bootstrap ROC 法选择了 13 个变量,AUC 为 0.8450。Bootstrap Bayesian 信息准则和 Akaike 信息准则分别选择了 16 个(AUC:0.8470)和 23 个(AUC:0.8491)变量。选择 Bootstrap ROC 模型作为最终模型,该模型具有很好的区分度和校准能力。
使用 Bootstrap ROC 方法开发风险预测模型,可以在保持简洁和预测性能的前提下,在临床适用性方面取得很大进展。