Hadorn D C, Draper D, Rogers W H, Keeler E B, Brook R H
RAND, Santa Monica, CA 90406.
Stat Med. 1992 Feb 28;11(4):475-89. doi: 10.1002/sim.4780110409.
Mortality prediction models hold substantial promise as tools for patient management, quality assessment, and, perhaps, health care resource allocation planning. Yet relatively little is known about the predictive validity of these models. We report here a comparison of the cross-validation performance of seven statistical models of patient mortality: (1) ordinary-least-squares (OLS) regression predicting 0/1 death status six months after admission; (2) logistic regression; (3) Cox regression; (4-6) three unit-weight models derived from the logistic regression, and (7) a recursive partitioning classification technique (CART). We calculated the following performance statistics for each model in both a learning and test sample of patients, all of whom were drawn from a nationally representative sample of 2558 Medicare patients with acute myocardial infarction: overall accuracy in predicting six-month mortality, sensitivity and specificity rates, positive and negative predictive values, and per cent improvement in accuracy rates and error rates over model-free predictions (i.e., predictions that make no use of available independent variables). We developed ROC curves based on logistic regression, the best unit-weight model, the single best predictor variable, and a series of CART models generated by varying the misclassification cost specifications. In our sample, the models reduced model-free error rates at the patient level by 8-22 per cent in the test sample. We found that the performance of the logistic regression models was marginally superior to that of other models. The areas under the ROC curves for the best models ranged from 0.61 to 0.63. Overall predictive accuracy for the best models may be adequate to support activities such as quality assessment that involve aggregating over large groups of patients, but the extent to which these models may be appropriately applied to patient-level resource allocation planning is less clear.
死亡率预测模型作为患者管理、质量评估以及或许还有医疗资源分配规划的工具,具有巨大的前景。然而,人们对这些模型的预测有效性了解相对较少。我们在此报告对七种患者死亡率统计模型交叉验证性能的比较:(1)普通最小二乘法(OLS)回归,预测入院六个月后的0/1死亡状态;(2)逻辑回归;(3)Cox回归;(4 - 6)从逻辑回归派生的三种单位权重模型;以及(7)递归划分分类技术(CART)。我们在患者的学习样本和测试样本中为每个模型计算了以下性能统计数据,所有患者均来自2558名患有急性心肌梗死的医疗保险患者的全国代表性样本:预测六个月死亡率的总体准确性、敏感度和特异度、阳性和阴性预测值,以及与无模型预测(即不使用可用自变量的预测)相比准确率和错误率的提高百分比。我们基于逻辑回归、最佳单位权重模型、单个最佳预测变量以及通过改变误分类成本规格生成的一系列CART模型绘制了ROC曲线。在我们的样本中,这些模型在测试样本中将患者层面的无模型错误率降低了8% - 22%。我们发现逻辑回归模型的性能略优于其他模型。最佳模型的ROC曲线下面积范围为0.61至0.63。最佳模型的总体预测准确性可能足以支持诸如涉及大量患者汇总的质量评估等活动,但这些模型在多大程度上可适用于患者层面的资源分配规划尚不清楚。