Surgical Outcomes and Applied Research Program, University of Colorado School of Medicine, Aurora, CO; Adult and Child Center for Health Outcomes Research and Delivery Science, University of Colorado School of Medicine, Aurora, CO; Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO.
Surgical Outcomes and Applied Research Program, University of Colorado School of Medicine, Aurora, CO; Department of Surgery, University of Colorado School of Medicine, Aurora, CO.
Surgery. 2021 Oct;170(4):1184-1194. doi: 10.1016/j.surg.2021.02.033. Epub 2021 Apr 16.
The universal Surgical Risk Preoperative Assessment System (SURPAS) prediction models for postoperative adverse outcomes have good accuracy for estimating risk in broad surgical populations and for surgical specialties. The accuracy in individual operations has not yet been assessed. The objective of this study was to evaluate the Surgical Risk Preoperative Assessment System in predicting adverse outcomes for selected individual operations.
The SURPAS models were applied to the top 2 most frequent common procedural terminology codes in 9 surgical specialties and 5 additional common general surgical operations in the 2009 to 2018 database of the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP). Goodness of fit statistics were estimated, including c-indices for discrimination, Hosmer-Lemeshow graphs and P values for calibration, overall observed versus expected event rates, and Brier scores.
The total sample size was 2,020,172, which represented 29% of the 6.9 million operations in the ACS NSQIP database. Average c-indices across 12 outcomes were acceptable (≥0.70) for 13 (56.5%) of the 23 operations. Overall observed-to-expected rates were similar for mortality and overall morbidity across the 23 operations. Hosmer-Lemeshow graphs over quintiles of risk comparing observed-to-expected rates of mortality and overall morbidity were similar for 52% and 70% of operations, respectively. Model performance was better in less complex operations and those done in patients with lower preoperative risk.
SURPAS displayed accuracy in estimating postoperative adverse events for some of the 23 operations studied, but not all. In the procedures where SURPAS was not accurate, developing disease or operation-specific risk models might be appropriate.
通用外科手术风险术前评估系统(SURPAS)预测模型对于术后不良结局的预测具有很好的准确性,可广泛用于评估外科手术人群和各外科专业的风险。但尚未评估其在单个手术中的准确性。本研究的目的是评估 SURPAS 在预测特定单个手术不良结局中的准确性。
将 SURPAS 模型应用于美国外科医师学会全国外科质量改进计划(ACS NSQIP)数据库中 9 个外科专业和 5 个普通普外科中前 2 个最常见的常见程序术语代码,在 2009 年至 2018 年期间的 2,020,172 例中进行评估。估计拟合优度统计数据,包括区分的 C 指数、Hosmer-Lemeshow 图和校准的 P 值、总体观察到的与预期事件发生率以及 Brier 评分。
总样本量为 2,020,172 例,占 ACS NSQIP 数据库中 690 万例手术的 29%。在 12 个结果中,13 个(56.5%)手术的平均 C 指数可接受(≥0.70)。在 23 个手术中,死亡率和总发病率的总体观察到的与预期的比率相似。风险五分位数的 Hosmer-Lemeshow 图比较死亡率和总发病率的观察到的与预期的比率,分别有 52%和 70%的手术相似。SURPAS 在较简单的手术和术前风险较低的患者中表现出更好的预测效果。
SURPAS 在评估所研究的 23 种手术中的一些手术的术后不良事件的准确性,但并非全部。在 SURPAS 不精确的手术中,开发疾病或手术特异性风险模型可能是合适的。