Department of Strategy and Innovation, Memorial Sloan Kettering Cancer Center, New York, New York.
Adobe Inc, San Jose, California.
JAMA Surg. 2021 Apr 1;156(4):315-321. doi: 10.1001/jamasurg.2020.6361.
Accurate surgical scheduling affects patients, clinical staff, and use of physical resources. Although numerous retrospective analyses have suggested a potential for improvement, the real-world outcome of implementing a machine learning model to predict surgical case duration appears not to have been studied.
To assess accuracy and real-world outcome from implementation of a machine learning model that predicts surgical case duration.
DESIGN, SETTING, AND PARTICIPANTS: This randomized clinical trial was conducted on 2 surgical campuses of a cancer specialty center. Patients undergoing colorectal and gynecology surgery at Memorial Sloan Kettering Cancer Center who were scheduled more than 1 day before surgery between April 7, 2018, and June 25, 2018, were included. The randomization process included 29 strata (11 gynecological surgeons at 2 campuses and 7 colorectal surgeons at a single campus) to ensure equal chance of selection for each surgeon and each campus. Patients undergoing more than 1 surgery during the study's timeframe were enrolled only once. Data analyses took place from July 2018 to November 2018.
Cases were assigned to machine learning-assisted surgical predictions 1 day before surgery and compared with a control group.
The primary outcome measure was accurate prediction of the duration of each scheduled surgery, measured by (arithmetic) mean (SD) error and mean absolute error. Effects on patients and systems were measured by start time delay of following cases, the time between cases, and the time patients spent in presurgical area.
A total of 683 patients were included (mean [SD] age, 55.8 [13.8] years; 566 women [82.9%]); 72 were excluded. Of the 683 patients included, those assigned to the machine learning algorithm had significantly lower mean (SD) absolute error (control group, 59.3 [72] minutes; intervention group, 49.5 [66] minutes; difference, -9.8 minutes; P = .03) compared with the control group. Mean start-time delay for following cases (patient wait time in a presurgical area), dropped significantly: 62.4 minutes (from 70.2 minutes to 7.8 minutes) and 16.7 minutes (from 36.9 minutes to 20.2 minutes) for patients receiving colorectal and gynecology surgery, respectively. The overall mean (SD) reduction of wait time was 33.1 minutes per patient (from 49.4 minutes to 16.3 minutes per patient). Improved accuracy did not adversely inflate time between cases (surgeon wait time). There was marginal improvement (1.5 minutes, from a mean of 70.6 to 69.1 minutes) in time between the end of cases and start of to-follow cases using the predictive model, compared with the control group. Patients spent a mean of 25.2 fewer minutes in the facility before surgery (173.3 minutes vs 148.1 minutes), indicating a potential benefit vis-à-vis available resources for other patients before and after surgery.
Implementing machine learning-generated predictions for surgical case durations may improve case duration accuracy, presurgical resource use, and patient wait time, without increasing surgeon wait time between cases.
ClinicalTrials.gov Identifier: NCT03471377.
准确的手术安排会影响患者、临床医护人员和物理资源的使用。虽然许多回顾性分析表明有改进的潜力,但实际实施机器学习模型来预测手术持续时间的结果似乎尚未得到研究。
评估机器学习模型预测手术持续时间的准确性和实际结果。
设计、地点和参与者:这是一项在癌症专科中心的 2 个外科校区进行的随机临床试验。2018 年 4 月 7 日至 6 月 25 日期间,在纪念斯隆凯特琳癌症中心接受结直肠和妇科手术、手术前 1 天以上安排的患者被纳入研究。随机化过程包括 29 个分层(2 个校区的 11 名妇科外科医生和单个校区的 7 名结直肠外科医生),以确保每个外科医生和每个校区都有平等的机会被选中。在研究期间接受超过 1 次手术的患者仅被纳入 1 次。数据分析于 2018 年 7 月至 2018 年 11 月进行。
手术病例在手术前 1 天被分配给机器学习辅助手术预测,并与对照组进行比较。
主要结局指标是测量每个预定手术持续时间的准确性,方法是(算术)平均(SD)误差和平均绝对误差。通过后续病例的开始时间延迟、病例之间的时间以及患者在术前区域的时间来衡量对患者和系统的影响。
共纳入 683 例患者(平均[SD]年龄,55.8[13.8]岁;566 名女性[82.9%]);72 例被排除。在纳入的 683 例患者中,与对照组相比,接受机器学习算法的患者的绝对误差(控制组为 59.3[72]分钟;干预组为 49.5[66]分钟;差异,-9.8 分钟;P=0.03)显著降低。后续病例的平均开始时间延迟(患者在术前区域的等待时间)显著下降:接受结直肠和妇科手术的患者分别从 70.2 分钟降至 7.8 分钟和 36.9 分钟降至 20.2 分钟。每位患者的总体平均(SD)等待时间减少了 33.1 分钟(从 49.4 分钟降至每位患者 16.3 分钟)。准确性的提高并没有增加病例之间的时间(外科医生的等待时间)。与对照组相比,使用预测模型后,病例结束和后续病例开始之间的时间仅略有改善(1.5 分钟,从平均 70.6 分钟增加到 69.1 分钟)。使用机器学习生成的预测来预测手术持续时间,可能会提高手术持续时间的准确性、术前资源的利用和患者的等待时间,而不会增加外科医生在病例之间的等待时间。
实施机器学习生成的手术持续时间预测可能会提高手术持续时间的准确性、术前资源的利用和患者的等待时间,而不会增加外科医生在病例之间的等待时间。
ClinicalTrials.gov 标识符:NCT03471377。