Dr. Foster Unit at Imperial College, Department of Primary Care and Public Health, Imperial College London, London, W6 8RP, UK.
Health Care Manag Sci. 2011 Sep;14(3):267-78. doi: 10.1007/s10729-011-9159-6. Epub 2011 May 10.
We have previously described a system for monitoring a number of healthcare outcomes using case-mix adjustment models. It is desirable to automate the model fitting process in such a system if monitoring covers a large number of outcome measures or subgroup analyses. Our aim was to compare the performance of three different variable selection strategies: "manual", "automated" backward elimination and re-categorisation, and including all variables at once, irrespective of their apparent importance, with automated re-categorisation. Logistic regression models for predicting in-hospital mortality and emergency readmission within 28 days were fitted to an administrative database for 78 diagnosis groups and 126 procedures from 1996 to 2006 for National Health Services hospital trusts in England. The performance of models was assessed with Receiver Operating Characteristic (ROC) c statistics, (measuring discrimination) and Brier score (assessing the average of the predictive accuracy). Overall, discrimination was similar for diagnoses and procedures and consistently better for mortality than for emergency readmission. Brier scores were generally low overall (showing higher accuracy) and were lower for procedures than diagnoses, with a few exceptions for emergency readmission within 28 days. Among the three variable selection strategies, the automated procedure had similar performance to the manual method in almost all cases except low-risk groups with few outcome events. For the rapid generation of multiple case-mix models we suggest applying automated modelling to reduce the time required, in particular when examining different outcomes of large numbers of procedures and diseases in routinely collected administrative health data.
我们之前描述了一种使用病例组合调整模型监测多项医疗保健结果的系统。如果监测涵盖大量结果测量或亚组分析,那么在这种系统中自动完成模型拟合过程是很理想的。我们的目的是比较三种不同变量选择策略的性能:“手动”、“自动”逐步回归和重新分类,以及自动重新分类时不考虑其明显重要性而同时包含所有变量。使用逻辑回归模型对英格兰国家卫生服务医院信托 1996 年至 2006 年期间的 78 个诊断组和 126 种手术进行了住院内死亡率和 28 天内急诊再入院的预测。使用接收者操作特征(ROC)c 统计量(衡量区分度)和 Brier 评分(评估预测准确性的平均值)评估模型性能。总体而言,诊断和手术的区分度相似,死亡率的区分度始终优于急诊再入院。Brier 评分总体上较低(显示出更高的准确性),手术的评分低于诊断,急诊再入院的评分在 28 天内除外。在三种变量选择策略中,除了低风险组结果事件较少外,自动程序在几乎所有情况下的性能都与手动方法相似。对于快速生成多个病例组合模型,我们建议应用自动化建模来减少所需的时间,特别是在定期收集的行政健康数据中检查大量手术和疾病的不同结果时。