Hung Andrew J, Chen Jian, Che Zhengping, Nilanon Tanachat, Jarc Anthony, Titus Micha, Oh Paul J, Gill Inderbir S, Liu Yan
1 Catherine & Joseph Aresty Department of Urology, Center for Robotic Simulation & Education, USC Institute of Urology, University of Southern California , Los Angeles, California.
2 USC Machine Learning Center, Viterbi School of Engineering, University of Southern California , Los Angeles, California.
J Endourol. 2018 May;32(5):438-444. doi: 10.1089/end.2018.0035. Epub 2018 Mar 20.
Surgical performance is critical for clinical outcomes. We present a novel machine learning (ML) method of processing automated performance metrics (APMs) to evaluate surgical performance and predict clinical outcomes after robot-assisted radical prostatectomy (RARP).
We trained three ML algorithms utilizing APMs directly from robot system data (training material) and hospital length of stay (LOS; training label) (≤2 days and >2 days) from 78 RARP cases, and selected the algorithm with the best performance. The selected algorithm categorized the cases as "Predicted as expected LOS (pExp-LOS)" and "Predicted as extended LOS (pExt-LOS)." We compared postoperative outcomes of the two groups (Kruskal-Wallis/Fisher's exact tests). The algorithm then predicted individual clinical outcomes, which we compared with actual outcomes (Spearman's correlation/Fisher's exact tests). Finally, we identified five most relevant APMs adopted by the algorithm during predicting.
The "Random Forest-50" (RF-50) algorithm had the best performance, reaching 87.2% accuracy in predicting LOS (73 cases as "pExp-LOS" and 5 cases as "pExt-LOS"). The "pExp-LOS" cases outperformed the "pExt-LOS" cases in surgery time (3.7 hours vs 4.6 hours, p = 0.007), LOS (2 days vs 4 days, p = 0.02), and Foley duration (9 days vs 14 days, p = 0.02). Patient outcomes predicted by the algorithm had significant association with the "ground truth" in surgery time (p < 0.001, r = 0.73), LOS (p = 0.05, r = 0.52), and Foley duration (p < 0.001, r = 0.45). The five most relevant APMs, adopted by the RF-50 algorithm in predicting, were largely related to camera manipulation.
To our knowledge, ours is the first study to show that APMs and ML algorithms may help assess surgical RARP performance and predict clinical outcomes. With further accrual of clinical data (oncologic and functional data), this process will become increasingly relevant and valuable in surgical assessment and training.
手术表现对临床结果至关重要。我们提出了一种新颖的机器学习(ML)方法来处理自动性能指标(APM),以评估机器人辅助根治性前列腺切除术(RARP)后的手术表现并预测临床结果。
我们使用直接来自机器人系统数据(训练材料)和78例RARP病例的住院时间(LOS;训练标签)(≤2天和>2天)训练了三种ML算法,并选择了性能最佳的算法。所选算法将病例分为“预测为预期住院时间(pExp-LOS)”和“预测为延长住院时间(pExt-LOS)”。我们比较了两组的术后结果(Kruskal-Wallis检验/ Fisher精确检验)。然后,该算法预测个体临床结果,并将其与实际结果进行比较(Spearman相关性检验/ Fisher精确检验)。最后,我们确定了算法在预测过程中采用的五个最相关的APM。
“随机森林-50”(RF-50)算法性能最佳,预测LOS的准确率达到87.2%(73例为“pExp-LOS”,5例为“pExt-LOS”)。“pExp-LOS”组在手术时间(3.7小时对4.6小时,p = 0.007)、LOS(2天对4天,p = 0.02)和导尿管留置时间(9天对14天,p = 0.02)方面优于“pExt-LOS”组。算法预测的患者结果与手术时间(p < 0.001,r = 0.73)、LOS(p = 0.05,r = 0.52)和导尿管留置时间(p < 0.001,r = 0.45)的“真实情况”有显著关联。RF-50算法在预测中采用的五个最相关的APM在很大程度上与摄像头操作有关。
据我们所知,我们的研究首次表明APM和ML算法可能有助于评估RARP手术表现并预测临床结果。随着临床数据(肿瘤学和功能数据)的进一步积累,这一过程在手术评估和培训中将变得越来越相关和有价值。