Laboratory for Experimental Oncology and Radiobiology, Center for Experimental and Molecular Medicine, Cancer Center Amsterdam and Amsterdam Gastroenterology and Metabolism, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, the Netherlands.
Oncode Institute, Utrecht, the Netherlands.
JAMA Netw Open. 2021 Apr 1;4(4):e217737. doi: 10.1001/jamanetworkopen.2021.7737.
Quality improvement programs for colorectal cancer surgery have been introduced with benchmarking based on quality indicators, such as mortality. Detailed (pre)operative characteristics may offer relevant information for proper case-mix correction.
To investigate the added value of machine learning to predict quality indicators for colorectal cancer surgery and identify previously unrecognized predictors of 30-day mortality based on a large, nationwide colorectal cancer registry that collected extensive data on comorbidities.
DESIGN, SETTING, AND PARTICIPANTS: All patients who underwent resection for primary colorectal cancer registered in the Dutch ColoRectal Audit between January 1, 2011, and December 31, 2016, were included. Multiple machine learning models (multivariable logistic regression, elastic net regression, support vector machine, random forest, and gradient boosting) were made to predict quality indicators. Model performance was compared with conventionally used scores. Risk factors were identified by logistic regression analyses and Shapley additive explanations (ie, SHAP values). Statistical analysis was performed between March 1 and September 30, 2020.
The primary outcome of this cohort study was 30-day mortality. Prediction models were trained on a training set by performing 5-fold cross-validation, and outcomes were measured by the area under the receiver operating characteristic curve on the test set. Machine learning was further used to identify risk factors, measured by odds ratios and SHAP values.
This cohort study included 62 501 records, most patients were male (35 116 [56.2%]), were aged 61 to 80 years (41 560 [66.5%]), and had an American Society of Anesthesiology score of II (35 679 [57.1%]). A 30-day mortality rate of 2.7% (n = 1693) was found. The area under the curve of the best machine learning model for 30-day mortality (0.82; 95% CI, 0.79-0.85) was significantly higher than the American Society of Anesthesiology score (0.74; 95% CI, 0.71-0.77; P < .001), Charlson Comorbidity Index (0.66; 95% CI, 0.63-0.70; P < .001), and preoperative score to predict postoperative mortality (0.73; 95% CI, 0.70-0.77; P < .001). Hypertension, myocardial infarction, chronic obstructive pulmonary disease, and asthma were comorbidities with a high risk for increased mortality. Machine learning identified specific risk factors for a complicated course, intensive care unit admission, prolonged hospital stay, and readmission. Laparoscopic surgery was associated with a decreased risk for all adverse outcomes.
This study found that machine learning methods outperformed conventional scores to predict 30-day mortality after colorectal cancer surgery, identified specific patient groups at risk for adverse outcomes, and provided directions to optimize benchmarking in clinical audits.
已经引入了针对结直肠癌手术的质量改进计划,并基于死亡率等质量指标进行基准测试。详细的(术前)特征可能为适当的病例组合校正提供相关信息。
利用机器学习来预测结直肠癌手术的质量指标,并基于一个大型全国性结直肠癌登记处,该登记处收集了大量合并症数据,确定 30 天死亡率的先前未被识别的预测因素。
设计、设置和参与者:纳入了 2011 年 1 月 1 日至 2016 年 12 月 31 日期间在荷兰 ColoRectal Audit 接受原发性结直肠癌切除术的所有患者。使用多种机器学习模型(多变量逻辑回归、弹性网回归、支持向量机、随机森林和梯度提升)来预测质量指标。使用传统使用的评分比较模型性能。通过逻辑回归分析和 Shapley 附加解释(即 SHAP 值)识别风险因素。统计分析于 2020 年 3 月 1 日至 9 月 30 日进行。
本队列研究的主要结局为 30 天死亡率。通过 5 折交叉验证在训练集上训练预测模型,并通过测试集上的接收者操作特征曲线下面积来测量结果。机器学习还用于确定风险因素,以优势比和 SHAP 值衡量。
本队列研究共纳入 62501 例记录,大多数患者为男性(35116[56.2%]),年龄在 61 至 80 岁之间(41560[66.5%]),美国麻醉医师协会评分 II 级(35679[57.1%])。发现 30 天死亡率为 2.7%(n=1693)。用于 30 天死亡率的最佳机器学习模型的曲线下面积(0.82;95%CI,0.79-0.85)明显高于美国麻醉医师协会评分(0.74;95%CI,0.71-0.77;P<0.001)、Charlson 合并症指数(0.66;95%CI,0.63-0.70;P<0.001)和术前评分预测术后死亡率(0.73;95%CI,0.70-0.77;P<0.001)。高血压、心肌梗死、慢性阻塞性肺疾病和哮喘是死亡率增加的高风险合并症。机器学习确定了特定的风险因素,这些风险因素与复杂的病程、重症监护病房入院、住院时间延长和再入院有关。腹腔镜手术与所有不良结局的风险降低相关。
本研究发现,机器学习方法优于传统评分来预测结直肠癌手术后的 30 天死亡率,确定了具有不良结局风险的特定患者群体,并为临床审计中的基准测试优化提供了方向。