Khan Danyal Z, Luengo Imanol, Barbarisi Santiago, Addis Carole, Culshaw Lucy, Dorward Neil L, Haikka Pinja, Jain Abhiney, Kerr Karen, Koh Chan Hee, Layard Horsfall Hugo, Muirhead William, Palmisciano Paolo, Vasey Baptiste, Stoyanov Danail, Marcus Hani J
1Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London.
2Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London.
J Neurosurg. 2021 Nov 5;137(1):51-58. doi: 10.3171/2021.6.JNS21923. Print 2022 Jul 1.
Surgical workflow analysis involves systematically breaking down operations into key phases and steps. Automatic analysis of this workflow has potential uses for surgical training, preoperative planning, and outcome prediction. Recent advances in machine learning (ML) and computer vision have allowed accurate automated workflow analysis of operative videos. In this Idea, Development, Exploration, Assessment, Long-term study (IDEAL) stage 0 study, the authors sought to use Touch Surgery for the development and validation of an ML-powered analysis of phases and steps in the endoscopic transsphenoidal approach (eTSA) for pituitary adenoma resection, a first for neurosurgery.
The surgical phases and steps of 50 anonymized eTSA operative videos were labeled by expert surgeons. Forty videos were used to train a combined convolutional and recurrent neural network model by Touch Surgery. Ten videos were used for model evaluation (accuracy, F1 score), comparing the phase and step recognition of surgeons to the automatic detection of the ML model.
The longest phase was the sellar phase (median 28 minutes), followed by the nasal phase (median 22 minutes) and the closure phase (median 14 minutes). The longest steps were step 5 (tumor identification and excision, median 17 minutes); step 3 (posterior septectomy and removal of sphenoid septations, median 14 minutes); and step 4 (anterior sellar wall removal, median 10 minutes). There were substantial variations within the recorded procedures in terms of video appearances, step duration, and step order, with only 50% of videos containing all 7 steps performed sequentially in numerical order. Despite this, the model was able to output accurate recognition of surgical phases (91% accuracy, 90% F1 score) and steps (76% accuracy, 75% F1 score).
In this IDEAL stage 0 study, ML techniques have been developed to automatically analyze operative videos of eTSA pituitary surgery. This technology has previously been shown to be acceptable to neurosurgical teams and patients. ML-based surgical workflow analysis has numerous potential uses-such as education (e.g., automatic indexing of contemporary operative videos for teaching), improved operative efficiency (e.g., orchestrating the entire surgical team to a common workflow), and improved patient outcomes (e.g., comparison of surgical techniques or early detection of adverse events). Future directions include the real-time integration of Touch Surgery into the live operative environment as an IDEAL stage 1 (first-in-human) study, and further development of underpinning ML models using larger data sets.
手术工作流程分析涉及将手术系统地分解为关键阶段和步骤。对该工作流程进行自动分析在手术训练、术前规划和结果预测方面具有潜在用途。机器学习(ML)和计算机视觉的最新进展使得能够对手术视频进行准确的自动工作流程分析。在这项理念、开发、探索、评估、长期研究(IDEAL)0期研究中,作者试图使用Touch Surgery开发并验证一种基于ML的垂体腺瘤切除内镜经蝶窦入路(eTSA)阶段和步骤分析方法,这在神经外科领域尚属首次。
50份匿名的eTSA手术视频的手术阶段和步骤由专家外科医生进行标注。40份视频用于通过Touch Surgery训练一个结合卷积神经网络和循环神经网络的模型。10份视频用于模型评估(准确率、F1分数),将外科医生的阶段和步骤识别与ML模型的自动检测进行比较。
最长的阶段是蝶鞍阶段(中位数28分钟),其次是鼻腔阶段(中位数22分钟)和闭合阶段(中位数14分钟)。最长的步骤是步骤5(肿瘤识别与切除,中位数17分钟);步骤3(后鼻中隔切除术和蝶窦分隔去除,中位数14分钟);以及步骤4(前蝶鞍壁切除,中位数10分钟)。在记录的手术过程中,视频外观、步骤持续时间和步骤顺序存在很大差异,只有50%的视频包含按数字顺序依次执行的所有7个步骤。尽管如此,该模型能够准确识别手术阶段(准确率91%,F-1分数90%)和步骤(准确率76%,F-1分数75%)。
在这项IDEAL 0期研究中,已开发出ML技术来自动分析eTSA垂体手术的手术视频。此前已证明该技术为神经外科团队和患者所接受。基于ML的手术工作流程分析有许多潜在用途,如教育(例如,为教学对当代手术视频进行自动索引)、提高手术效率(例如,使整个手术团队遵循共同的工作流程)以及改善患者预后(例如,比较手术技术或早期发现不良事件)。未来的方向包括作为IDEAL 1期(人体首次)研究将Touch Surgery实时集成到实际手术环境中,以及使用更大的数据集进一步开发基础ML模型。