基于视频的术中手术技能评估的时空注意力。

Spatial-temporal attention for video-based assessment of intraoperative surgical skill.

机构信息

Department of Computer Science, Whiting School of Engineering, Johns Hopkins University, Baltimore, 21218, USA.

Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, 21218, USA.

出版信息

Sci Rep. 2024 Nov 6;14(1):26912. doi: 10.1038/s41598-024-77176-1.

DOI:10.1038/s41598-024-77176-1

PMID:39506003

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11541759/

Abstract

Accurate, unbiased, and reproducible assessment of skill is a vital resource for surgeons throughout their career. The objective in this research is to develop and validate algorithms for video-based assessment of intraoperative surgical skill. Algorithms to classify surgical video into expert or novice categories provide a summative assessment of skill, which is useful for evaluating surgeons at discrete time points in their training or certification of surgeons. Using a spatial-temporal neural network architecture, we tested the hypothesis that explicit supervision of spatial attention supervised by instrument tip locations improves the algorithm's generalizability to unseen dataset. The best performing model had an area under the receiver operating characteristic curve (AUC) of 0.88. Augmenting the network with supervision of spatial attention improved specificity of its predictions (with small changes in sensitivity and AUC) and led to improved measures of discrimination when tested with unseen dataset. Our findings show that explicit supervision of attention learned from images using instrument tip locations can improve performance of algorithms for objective video-based assessment of surgical skill.

摘要

准确、无偏且可重现的技能评估是外科医生整个职业生涯中的宝贵资源。本研究的目的是开发和验证基于视频的手术技能评估算法。将手术视频分类为专家或新手类别的算法提供了技能的总结性评估，这对于在培训或认证外科医生的离散时间点评估外科医生很有用。使用时空神经网络架构，我们检验了这样一个假设，即通过器械尖端位置进行的空间注意力的明确监督可以提高算法对未见数据集的泛化能力。表现最佳的模型的接收者操作特征曲线下面积（AUC）为 0.88。使用器械尖端位置从图像中学习的注意力的增强网络提高了其预测的特异性（敏感性和 AUC 略有变化），并在使用未见数据集进行测试时提高了区分度的度量。我们的研究结果表明，使用器械尖端位置从图像中学习的注意力的明确监督可以提高基于视频的手术技能客观评估算法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df41/11541759/058422a9ef3c/41598_2024_77176_Fig1_HTML.jpg

相似文献

Spatial-temporal attention for video-based assessment of intraoperative surgical skill.

Sci Rep. 2024 Nov 6;14(1):26912. doi: 10.1038/s41598-024-77176-1.

Video-based assessment of intraoperative surgical skill.

Int J Comput Assist Radiol Surg. 2022 Oct;17(10):1801-1811. doi: 10.1007/s11548-022-02681-5. Epub 2022 May 30.

Objective assessment of intraoperative technical skill in capsulorhexis using videos of cataract surgery.

Int J Comput Assist Radiol Surg. 2019 Jun;14(6):1097-1105. doi: 10.1007/s11548-019-01956-8. Epub 2019 Apr 11.

Video-based surgical skill assessment using 3D convolutional neural networks.

Int J Comput Assist Radiol Surg. 2019 Jul;14(7):1217-1225. doi: 10.1007/s11548-019-01995-1. Epub 2019 May 18.

Expert surgeons and deep learning models can predict the outcome of surgical hemorrhage from 1 min of video.

Sci Rep. 2022 May 17;12(1):8137. doi: 10.1038/s41598-022-11549-2.

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.

JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.

Does Robotic Surgical Simulator Performance Correlate With Surgical Skill?

J Surg Educ. 2017 Nov-Dec;74(6):1052-1056. doi: 10.1016/j.jsurg.2017.05.011. Epub 2017 Jun 13.

Development and Validation of a 3-Dimensional Convolutional Neural Network for Automatic Surgical Skill Assessment Based on Spatiotemporal Video Analysis.

JAMA Netw Open. 2021 Aug 2;4(8):e2120786. doi: 10.1001/jamanetworkopen.2021.20786.

Objective assessment of robotic surgical skill using instrument contact vibrations.

Surg Endosc. 2016 Apr;30(4):1419-31. doi: 10.1007/s00464-015-4346-z. Epub 2015 Jul 23.

Evaluation of Deep Learning Models for Identifying Surgical Actions and Measuring Performance.

JAMA Netw Open. 2020 Mar 2;3(3):e201664. doi: 10.1001/jamanetworkopen.2020.1664.

引用本文的文献

Evaluating the generalizability of video-based assessment of intraoperative surgical skill in capsulorhexis.

Int J Comput Assist Radiol Surg. 2025 May 22. doi: 10.1007/s11548-025-03406-0.

本文引用的文献

An American Board of Surgery Pilot of Video Assessment of Surgeon Technical Performance in Surgery.

Ann Surg. 2023 Apr 1;277(4):591-595. doi: 10.1097/SLA.0000000000005804. Epub 2023 Jan 16.

Artificial Intelligence Methods and Artificial Intelligence-Enabled Metrics for Surgical Education: A Multidisciplinary Consensus.

J Am Coll Surg. 2022 Jun 1;234(6):1181-1192. doi: 10.1097/XCS.0000000000000190. Epub 2022 May 11.

Video-based assessment of intraoperative surgical skill.

Int J Comput Assist Radiol Surg. 2022 Oct;17(10):1801-1811. doi: 10.1007/s11548-022-02681-5. Epub 2022 May 30.

Expert surgeons and deep learning models can predict the outcome of surgical hemorrhage from 1 min of video.

Sci Rep. 2022 May 17;12(1):8137. doi: 10.1038/s41598-022-11549-2.

A Unified Framework on Generalizability of Clinical Prediction Models.

Front Artif Intell. 2022 Apr 29;5:872720. doi: 10.3389/frai.2022.872720. eCollection 2022.

The association between video-based assessment of intraoperative technical performance and patient outcomes: a systematic review.

Surg Endosc. 2022 Nov;36(11):7938-7948. doi: 10.1007/s00464-022-09296-6. Epub 2022 May 12.

Machine learning for technical skill assessment in surgery: a systematic review.

NPJ Digit Med. 2022 Mar 3;5(1):24. doi: 10.1038/s41746-022-00566-0.

Gesture Recognition in Robotic Surgery With Multimodal Attention.

IEEE Trans Med Imaging. 2022 Jul;41(7):1677-1687. doi: 10.1109/TMI.2022.3147640. Epub 2022 Jun 30.

Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review.

NPJ Digit Med. 2022 Jan 10;5(1):2. doi: 10.1038/s41746-021-00549-7.

An Auxiliary Tasks Based Framework for Automated Medical Skill Assessment with Limited Data.

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:1613-1617. doi: 10.1109/EMBC46164.2021.9630498.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于视频的术中手术技能评估的时空注意力。

Spatial-temporal attention for video-based assessment of intraoperative surgical skill.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献