Steyerberg Ewout W, Eijkemans Marinus J C, Boersma Eric, Habbema J D F
Department of Public Health, Center for Clinical Decision Sciences, Ee2093, Erasmus MC, University Medical Center Rotterdam, PO Box 1738, 3000 DR Rotterdam, The Netherlands.
J Clin Epidemiol. 2005 Apr;58(4):383-90. doi: 10.1016/j.jclinepi.2004.07.008.
Models that predict mortality after acute myocardial infarction (AMI) contain different predictors and are based on different populations. We studied the agreement and validity of predictions for individual patients.
We compared predictions from five predictive logistic regression models for short-term mortality after AMI. Three models were developed previously, and two models were developed in the GUSTO-I data, where all five models were applied (n =40,830, 7.0% 30-day mortality). Agreement was studied with weighted kappa statistics of categorized predictions. Validity was assessed by comparing observed frequencies with predictions (indicating calibration) and by the area under the receiver operating characteristic curve (AUC), indicating discriminative ability.
The predictions from the five models varied considerably for individual patients, with low agreement between most (kappa <0.6). Risk predictions from the three previously developed models were on average too high, which could be corrected by re-calibration of the model intercept. The AUC ranged from 0.76-0.78 and increased to 0.78-0.79 with re-estimated regression coefficients that were optimal for the GUSTO-I patients. The two more detailed GUSTO-I based models performed better (AUC approximately 0.82).
Models with different predictors may have a similar validity while the agreement between predictions for individual patients is poor. The main concerns in the applicability of predictive models for AMI should relate to the selected predictors and average calibration.
预测急性心肌梗死(AMI)后死亡率的模型包含不同的预测因素,且基于不同的人群。我们研究了对个体患者预测的一致性和有效性。
我们比较了五个预测急性心肌梗死后短期死亡率的逻辑回归模型的预测结果。三个模型是先前开发的,另外两个模型是在GUSTO-I数据中开发的,所有五个模型都在该数据中应用(n = 40830,30天死亡率为7.0%)。通过对分类预测的加权kappa统计量研究一致性。通过将观察频率与预测结果进行比较(以表明校准情况)以及通过受试者工作特征曲线下面积(AUC)来评估有效性,AUC表明判别能力。
五个模型对个体患者的预测差异很大,大多数模型之间的一致性较低(kappa < 0.6)。先前开发的三个模型的风险预测平均过高,可通过重新校准模型截距来校正。AUC范围为0.76 - 0.78,重新估计对GUSTO-I患者最优的回归系数后,AUC增加到0.78 - 0.79。基于GUSTO-I的另外两个更详细的模型表现更好(AUC约为0.82)。
具有不同预测因素的模型可能具有相似的有效性,但个体患者预测之间的一致性较差。急性心肌梗死预测模型适用性的主要关注点应与所选预测因素和平均校准有关。