Faculty of Data and Decision Sciences, Technion - Israel Institute of Technology, Haifa, 3200003, Israel.
Department of Surgery, University of Wisconsin-Madison School of Medicine and Public Health, 600 Highland Ave, Madison, WI, 53792, USA.
Int J Comput Assist Radiol Surg. 2024 Jul;19(7):1349-1357. doi: 10.1007/s11548-024-03158-3. Epub 2024 May 15.
In this paper, we present a novel approach to the automatic evaluation of open surgery skills using depth cameras. This work is intended to show that depth cameras achieve similar results to RGB cameras, which is the common method in the automatic evaluation of open surgery skills. Moreover, depth cameras offer advantages such as robustness to lighting variations, camera positioning, simplified data compression, and enhanced privacy, making them a promising alternative to RGB cameras.
Experts and novice surgeons completed two simulators of open suturing. We focused on hand and tool detection and action segmentation in suturing procedures. YOLOv8 was used for tool detection in RGB and depth videos. Furthermore, UVAST and MSTCN++ were used for action segmentation. Our study includes the collection and annotation of a dataset recorded with Azure Kinect.
We demonstrated that using depth cameras in object detection and action segmentation achieves comparable results to RGB cameras. Furthermore, we analyzed 3D hand path length, revealing significant differences between experts and novice surgeons, emphasizing the potential of depth cameras in capturing surgical skills. We also investigated the influence of camera angles on measurement accuracy, highlighting the advantages of 3D cameras in providing a more accurate representation of hand movements.
Our research contributes to advancing the field of surgical skill assessment by leveraging depth cameras for more reliable and privacy evaluations. The findings suggest that depth cameras can be valuable in assessing surgical skills and provide a foundation for future research in this area.
本文提出了一种使用深度相机自动评估开放手术技能的新方法。本工作旨在表明深度相机可以达到与 RGB 相机相似的效果,这是自动评估开放手术技能的常用方法。此外,深度相机具有抗光照变化、相机定位、简化数据压缩和增强隐私等优势,是 RGB 相机的一种有前途的替代方案。
专家和新手外科医生完成了两个开放缝合模拟器。我们专注于缝合过程中的手和工具检测以及动作分割。YOLOv8 用于 RGB 和深度视频中的工具检测。此外,还使用了 UVAST 和 MSTCN++ 进行动作分割。我们的研究包括使用 Azure Kinect 记录的数据集的收集和注释。
我们证明了在目标检测和动作分割中使用深度相机可以获得与 RGB 相机相当的结果。此外,我们分析了 3D 手路径长度,发现专家和新手外科医生之间存在显著差异,强调了深度相机在捕捉手术技能方面的潜力。我们还研究了相机角度对测量精度的影响,突出了 3D 相机在提供更准确的手部运动表示方面的优势。
我们的研究通过利用深度相机进行更可靠和隐私评估,为推进手术技能评估领域做出了贡献。研究结果表明,深度相机在评估手术技能方面具有重要价值,为该领域的未来研究提供了基础。