Gazis Athanasios, Schizas Dimitrios, Kykalos Stylianos, Karaiskos Pantelis, Loukas Constantinos
Laboratory of Medical Physics, Medical School, National and Kapodistrian University of Athens, Mikras Asias 75, 11527, Athens, Attiki, Greece.
1st Department of Surgery, Laikon General Hospital, National and Kapodistrian University of Athens, Agiou Thoma 17, 11527, Athens, Attiki, Greece.
Int J Comput Assist Radiol Surg. 2025 Sep 18. doi: 10.1007/s11548-025-03518-7.
While significant progress has been made in skill assessment for minimally invasive procedures, objective evaluation methods for open surgery remain limited. This paper presents a deep learning framework for assessing technical surgical skills using egocentric video data from open surgery training.
Our dataset comprises 201 videos and corresponding hand kinematics data from three fundamental training tasks: knot tying (KT), continuous suturing (CS), and interrupted suturing (IS), performed by 20 participants. Each video was annotated by two experts using a modified OSATS scale (KT: five criteria, total score range 5-25; CS/IS: seven criteria, total score range 7-35). We evaluate three temporal architectures (LSTM, TCN, and Transformer), each using ResNet50 as the backbone for spatial feature extraction, and assess them under several training strategies: single-task learning, feature concatenation, pretraining, and multi-task learning with integrated kinematic data. Performance metrics were the mean absolute error (MAE) and Spearman's rank correlation coefficient (ρ), both computed with respect to total score prediction.
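The two metrics above are standard for score regression; as a minimal pure-Python sketch (the paper's exact evaluation code is not given here, and this version omits tie correction in the rank computation for brevity), they can be computed as:

```python
def mean_absolute_error(predicted, actual):
    """MAE between predicted and expert total scores."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)

def spearman_rho(predicted, actual):
    """Spearman rank correlation coefficient (no tie correction)."""
    def ranks(values):
        # Rank 1 = smallest value; assumes no ties.
        order = sorted(range(len(values)), key=lambda i: values[i])
        r = [0] * len(values)
        for rank, idx in enumerate(order, start=1):
            r[idx] = rank
        return r

    n = len(predicted)
    d2 = sum((rp - ra) ** 2
             for rp, ra in zip(ranks(predicted), ranks(actual)))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

# Illustrative OSATS-style total scores (hypothetical values):
mae = mean_absolute_error([20, 15, 25, 10], [21, 14, 24, 12])   # 1.25
rho = spearman_rho([20, 15, 25, 10], [21, 14, 24, 12])          # 1.0
```

In practice a library routine such as `scipy.stats.spearmanr` would be used, which also handles ties.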
The Transformer-based models consistently outperformed LSTM and TCN across all tasks. The multi-task Transformer incorporating prediction of task completion time achieved the lowest MAE (KT: 1.92, CS: 2.81, IS: 2.89) and ρ = 0.84-0.90. It also demonstrated promising capabilities for early skill assessment, predicting the total score from partial observations, particularly for the simpler tasks. Additionally, we show that models trained on consensus expert ratings outperform those trained on individual annotations, highlighting the value of multi-rater ground truth.
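The abstract does not state the exact multi-task objective; a common formulation sums a score-regression loss with a weighted auxiliary loss on completion time, as in this minimal sketch (the weight value is hypothetical):

```python
def multitask_loss(score_pred, score_true, time_pred, time_true,
                   aux_weight=0.5):
    """Joint MSE loss: total-score regression plus an auxiliary
    completion-time regression term, scaled by aux_weight
    (a hypothetical hyperparameter, not from the paper)."""
    def mse(pred, true):
        return sum((p - t) ** 2 for p, t in zip(pred, true)) / len(pred)

    return mse(score_pred, score_true) + aux_weight * mse(time_pred, time_true)
```

The auxiliary target acts as a regularizer: completion time correlates with skill, so sharing a representation for both predictions can improve score regression.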
This research provides a foundation for objective, automated assessment of open surgical skills, with potential to improve the efficiency and standardization of surgical training.