Hu Ran, Tang Ming, Jin Yi, Xiang Lelang, Li Keyu, Peng Bing, Liu Jie, Qin Dian, Liang Long, Li Yichuan, Liu Linxun, Wang Chunrong, Xiong Yong, Dai Peilin, Li Ang, Wang Xin
Department of General Surgery, Division of Pancreatic Surgery, West China Hospital of Sichuan University, Chengdu, China.
West China School of Medicine, West China Hospital of Sichuan University, Chengdu, China.
Int J Surg. 2025 Aug 1;111(8):5048-5057. doi: 10.1097/JS9.0000000000002562. Epub 2025 May 23.
Surgical videos offer rich intraoperative data critical for surgical education, quality control, and skill assessment. However, their increasing volume and complexity render manual review impractical. To address this challenge, this study aims to develop an intelligent platform capable of automatically recognizing and visualizing intraoperative surgical instrument usage, and to evaluate its clinical utility in real-world surgical settings.
Surgical videos from 21 medical centers in China, covering five surgical types, were collected to develop a generalized artificial intelligence (AI) model for automated surgical instrument recognition. The model was deployed on the SurgSmart platform which features clinically oriented functions. A multicenter survey involving 30 surgeons was conducted to assess the clinical value of the platform.
A total of 1261 surgical videos were collected, from which 96 324 images were extracted and annotated with 268 828 labels. The developed model achieved a mean Average Precision of 80.31% for recognizing 21 surgical instruments. Based on this model, four core functions were implemented on SurgSmart: Rapid Review Mode, Surgical Instrument Report, Surgical Instrument Heatmap, and Surgical Teaching Mode. All participating surgeons reported a high level of satisfaction and acknowledged the clinical relevance of these functionalities.
A universally applicable surgical instrument recognition model was developed and deployed on SurgSmart to enable the visualization of intraoperative instrument usage, demonstrating promising clinical potential for automated surgical video analysis and enhanced intraoperative data interpretation.