Livingston B M, MacKirdy F N, Howie J C, Jones R, Norrie J D
Department of Public Health, University of Glasgow, Scotland.
Crit Care Med. 2000 Jun;28(6):1820-7. doi: 10.1097/00003246-200006000-00023.
To assess and compare the performance of five severity of illness scoring systems used commonly for intensive care unit (ICU) patients in the United Kingdom. The five models analyzed were versions II and III of the Acute Physiology and Chronic Health Evaluation (APACHE) system, a version of APACHE II using United Kingdom (UK)-derived coefficients (UK APACHE II), version II of the Simplified Acute Physiology Score (SAPS), and version II of the Mortality Probability Model, computed at admission (MPM0) and after 24 hrs in the ICU (MPM24).
A 2-yr prospective cohort study of consecutive admissions to intensive care units.
A total of 22 general ICUs in Scotland
A total of 13,291 admissions to the study, which after prospectively agreed exclusions left a total of 10,393 patients for the analysis.
Death or survival at hospital discharge.
All the models showed reasonable discrimination using the area under the receiver operating characteristic curve (APACHE III, 0.845; APACHE II, 0.805; UKAPACHE II, 0.809; SAPS II, 0.843; MPM0, 0.785; MPM24, 0.799). The levels of observed mortality were significantly different than that predicted by all models, using the Hosmer-Lemeshow goodness-of-fit C test (p < .001), with the results of the test being confirmed by calibration curves. When excluding patients discharged in the first 24 hrs to allow for comparisons using the same patient group, APACHE III, MPM24, and SAPS II (APACHE III, 0.795; MPM24, 0.791; SAPS II, 0.784) showed significantly better discrimination than APACHE II, UK APACHE II, and MPM0 (APACHE II, 0.763; UK APACHE II, 0.756; MPM0, 0.741). However, calibration changed little for all models with observed mortality still significantly different from that predicted by the scoring systems (p < .001). For equivalent data sets, APACHE II demonstrated superior calibration to all the models using the chi-squared value from the Hosmer-Lemeshow test for both populations (APACHE III, 366; APACHE II, 67; UKAPACHE II, 237; SAPS II, 142; MPM0, 452; MPM24, 101).
SAPS II demonstrated the best overall performance, but the superior calibration of APACHE II makes it the most appropriate model for comparisons of mortality rates in different ICUs. The significance of the Hosmer-Lemeshow C test in all the models suggest that new logistic regression coefficients should be generated and the systems retested before they could be used with confidence in Scottish ICUs.
评估并比较英国重症监护病房(ICU)患者常用的五种疾病严重程度评分系统的性能。分析的五种模型分别是急性生理与慢性健康评估(APACHE)系统的第二版和第三版、使用源自英国(UK)的系数的APACHE II版本(UK APACHE II)、简化急性生理评分(SAPS)的第二版以及死亡概率模型的第二版,分别在入院时(MPM0)和入住ICU 24小时后(MPM24)进行计算。
对连续入住重症监护病房的患者进行为期2年的前瞻性队列研究。
苏格兰的22个普通ICU。
共有13291例患者纳入本研究,经过前瞻性的排除标准后,共有10393例患者用于分析。
出院时的死亡或存活情况。
所有模型使用受试者工作特征曲线下面积显示出合理的区分度(APACHE III为0.845;APACHE II为0.805;UKAPACHE II为0.809;SAPS II为0.843;MPM0为0.785;MPM24为0.799)。使用Hosmer-Lemeshow拟合优度C检验,观察到的死亡率水平与所有模型预测的死亡率水平显著不同(p <.001),校准曲线证实了该检验结果。当排除在最初24小时内出院的患者以便使用相同患者组进行比较时,APACHE III、MPM24和SAPS II(APACHE III为0.795;MPM24为0.791;SAPS II为0.784)显示出比APACHE II、UK APACHE II和MPM0(APACHE II为0.763;UK APACHE II为0.756;MPM为0.741)更好的区分度。然而,所有模型的校准变化不大,观察到的死亡率仍然与评分系统预测的死亡率显著不同(p <.001)。对于等效数据集,使用针对两个人群的Hosmer-Lemeshow检验的卡方值,APACHE II显示出比所有模型更好的校准(APACHE III为366;APACHE II为67;UKAPACHE II为2);SAPS II为142;MPM0为452;MPM24为101)。
SAPS II表现出最佳的总体性能,但APACHE II的优越校准使其成为不同ICU死亡率比较的最合适模型。所有模型中Hosmer-Lemeshow C检验的显著性表明,在苏格兰ICU中可以放心使用之前,应生成新的逻辑回归系数并对系统进行重新测试。