Frey Sébastien, Facente Federica, Wei Wen, Ekmekci Ezem Sura, Séjor Eric, Baqué Patrick, Durand Matthieu, Delingette Hervé, Bremond François, Berthet-Rayne Pierre, Ayache Nicholas
Université Côte d'Azur, Nice, France.
Department of General Surgery, Pasteur 2 Hospital, University Hospital of Nice, Nice, France.
J Robot Surg. 2025 Mar 31;19(1):131. doi: 10.1007/s11701-025-02284-7.
The accurate recognition of surgical instruments is essential for the advancement of intraoperative artificial intelligence (AI) systems. In this study, we assessed the YOLOv8 model's efficacy in identifying robotic and laparoscopic instruments in robot-assisted abdominal surgeries. Specifically, we evaluated its ability to detect, classify, and segment seven different types of surgical instruments. A diverse dataset was compiled from four public and private sources, encompassing over 7,400 frames and 17,175 annotations that represent a variety of surgical contexts and instruments. YOLOv8 was trained and tested on these datasets, achieving a mean average precision of 0.77 for binary detection and 0.72 for multi-instrument classification. Optimal performance was observed when the training set of a specific instrument reached 1300 instances. The model also demonstrated excellent segmentation accuracy, achieving a mean Dice score of 0.91 and a mean intersection over union of 0.86, with Monopolar Curved Scissors yielding the highest accuracy. Notably, YOLOv8 exhibited superior recognition performance for robotic instruments compared to laparoscopic tools, a difference likely attributed to the greater representation of robotic instruments in the training set. Furthermore, the model's rapid inference speed of 1.12 milliseconds per frame highlights its suitability for real-time clinical applications. These findings confirm YOLOv8's potential for precise and efficient recognition of surgical instruments using a comprehensive multi-source dataset.
准确识别手术器械对于术中人工智能(AI)系统的发展至关重要。在本研究中,我们评估了YOLOv8模型在机器人辅助腹部手术中识别机器人和腹腔镜器械的效果。具体而言,我们评估了其检测、分类和分割七种不同类型手术器械的能力。从四个公共和私人来源汇编了一个多样化的数据集,包含超过7400帧和17175个注释,代表了各种手术场景和器械。在这些数据集上对YOLOv8进行了训练和测试,二元检测的平均精度为0.77,多器械分类的平均精度为0.72。当特定器械的训练集达到1300个实例时,观察到最佳性能。该模型还展示了出色的分割精度,平均Dice评分为0.91,平均交并比为0.86,单极弯剪刀的精度最高。值得注意的是,与腹腔镜工具相比,YOLOv8对机器人器械表现出卓越的识别性能,这种差异可能归因于训练集中机器人器械的代表性更强。此外,该模型每帧1.12毫秒的快速推理速度突出了其适用于实时临床应用的特点。这些发现证实了YOLOv8使用综合多源数据集精确高效识别手术器械的潜力。