Department of Surgical Gastroenterology and Transplantation, Rigshospitalet; Center for Surgical Translational and Artificial Intelligence Research, Rigshospitalet.
Harris Orthopedics Laboratory, Massachusetts General Hospital, Boston, MA, USA.
Lancet Digit Health. 2021 Aug;3(8):e471-e485. doi: 10.1016/S2589-7500(21)00084-4. Epub 2021 Jun 29.
Early detection of postoperative complications, including organ failure, is pivotal in the initiation of targeted treatment strategies aimed at attenuating organ damage. In an era of increasing health-care costs and limited financial resources, identifying surgical patients at a high risk of postoperative complications and providing personalised precision medicine-based treatment strategies provides an obvious pathway for reducing patient morbidity and mortality. We aimed to leverage deep learning to create, through training on structured electronic health-care data, a multilabel deep neural network to predict surgical postoperative complications that would outperform available models in surgical risk prediction.
In this retrospective study, we used data on 58 input features, including demographics, laboratory values, and 30-day postoperative complications, from the American College of Surgeons (ACS) National Surgical Quality Improvement Program database, which collects data from 722 hospitals from around 15 countries. We queried the entire adult (≥18 years) database for patients who had surgery between Jan 1, 2012, and Dec 31, 2018. We then identified all patients who were treated at a large midwestern US academic medical centre, excluded them from the base dataset, and reserved this independent group for final model testing. We then randomly created a training set and a validation set from the remaining cases. We developed three deep neural network models with increasing numbers of input variables and so increasing levels of complexity. Output variables comprised mortality and 18 different postoperative complications. Overall morbidity was defined as any of 16 postoperative complications. Model performance was evaluated on the test set using the area under the receiver operating characteristic curve (AUC) and compared with previous metrics from the ACS-Surgical Risk Calculator (ACS-SRC). We evaluated resistance to changes in the underlying patient population on a subset of the test set, comprising only patients who had emergency surgery. Results were also compared with the Predictive OpTimal Trees in Emergency Surgery Risk (POTTER) calculator.
5 881 881 surgical patients, with 2941 unique Current Procedural Terminology codes, were included in this study, with 4 694 488 in the training set, 1 173 622 in the validation set, and 13 771 in the test set. The mean AUCs for the validation set were 0·864 (SD 0·053) for model 1, 0·871 (0·055) for model 2, and 0·882 (0·053) for model 3. The mean AUCs for the test set were 0·859 (SD 0·063) for model 1, 0·863 (0·064) for model 2, and 0·874 (0·061) for model 3. The mean AUCs of each model outperformed previously published performance metrics from the ACS-SRC, with a direct correlation between increasing model complexity and performance. Additionally, when tested on a subgroup of patients who had emergency surgery, our models outperformed previously published POTTER metrics.
We have developed unified prediction models, based on deep neural networks, for predicting surgical postoperative complications. The models were generally superior to previously published surgical risk prediction tools and appeared robust to changes in the underlying patient population. Deep learning could offer superior approaches to surgical risk prediction in clinical practice.
The Novo Nordisk Foundation.
早期发现术后并发症,包括器官衰竭,对于启动针对减轻器官损伤的靶向治疗策略至关重要。在医疗保健成本不断增加和有限的财务资源的时代,确定术后并发症风险较高的外科患者,并提供个性化的基于精准医学的治疗策略,为降低患者的发病率和死亡率提供了明显的途径。我们旨在利用深度学习,通过在结构化电子医疗数据上进行训练,创建一个多标签深度神经网络,以预测外科术后并发症,该模型在外科风险预测方面的表现优于现有模型。
在这项回顾性研究中,我们使用了来自美国外科医师学会(ACS)国家外科质量改进计划数据库的 58 个输入特征,包括人口统计学、实验室值和 30 天术后并发症的数据,该数据库收集了来自 15 个国家的 722 家医院的数据。我们从 2012 年 1 月 1 日至 2018 年 12 月 31 日期间对整个成人(≥18 岁)数据库进行了查询。然后,我们确定了所有在中西部美国一家大型学术医疗中心接受治疗的患者,并将其从基础数据库中排除,保留这个独立的小组用于最终的模型测试。然后,我们从剩余的病例中随机创建了一个训练集和一个验证集。我们开发了三个具有越来越多输入变量的深度神经网络模型,因此具有越来越复杂的水平。输出变量包括死亡率和 18 种不同的术后并发症。总体发病率定义为任何 16 种术后并发症。使用接受者操作特征曲线(AUC)下的面积在测试集上评估模型性能,并与 ACS-Surgical Risk Calculator(ACS-SRC)的先前指标进行比较。我们在测试集的一个子集上评估了对基础患者人群变化的抵抗力,该子集仅包括接受紧急手术的患者。结果还与 Predictive OpTimal Trees in Emergency Surgery Risk(POTTER)计算器进行了比较。
共有 5881881 例外科患者,有 2941 个独特的当前程序术语代码,其中 4694488 例患者在训练集中,1173622 例患者在验证集中,13771 例患者在测试集中。验证集的平均 AUC 为模型 1 的 0.864(SD 0.053),模型 2 的 0.871(0.055),模型 3 的 0.882(0.053)。测试集的平均 AUC 为模型 1 的 0.859(SD 0.063),模型 2 的 0.863(0.064),模型 3 的 0.874(0.061)。每个模型的平均 AUC 均优于 ACS-SRC 先前发布的性能指标,并且模型的复杂性与性能之间存在直接相关性。此外,当在接受紧急手术的患者亚组上进行测试时,我们的模型优于先前发布的 POTTER 指标。
我们已经开发了基于深度神经网络的统一预测模型,用于预测外科术后并发症。这些模型总体上优于以前发表的外科风险预测工具,并且似乎对基础患者人群的变化具有鲁棒性。深度学习在临床实践中可能提供优越的外科风险预测方法。
诺和诺德基金会。