TRandAugment：一种用于从视频中识别手术活动的时间随机增强策略。

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos.

机构信息

Altair Robotics Lab, University of Verona, 37134, Verona, Italy.

ICube, University of Strasbourg, CNRS, 67000, Strasbourg, France.

出版信息

Int J Comput Assist Radiol Surg. 2023 Sep;18(9):1665-1672. doi: 10.1007/s11548-023-02864-8. Epub 2023 Mar 22.

DOI:10.1007/s11548-023-02864-8

PMID:36944845

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10491694/

Abstract

PURPOSE

Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities.

METHODS

This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN.

RESULTS

The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1-6% over previous state-of-the-art methods, that uses manually designed augmentations.

CONCLUSION

This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos.

摘要

目的

从术中手术视频中自动识别手术活动对于开发计算机辅助干预的智能支持系统至关重要。当前最先进的识别方法基于深度学习，其中数据增强已显示出提高这些方法泛化能力的潜力。这促使人们研究用于图像分类和对象检测的数据集的自动化和简化增强策略。将此类增强方法扩展到视频并不简单，因为需要考虑时间维度。此外，由于手术视频由多个相互关联且持续时间长的活动组成，因此它们带来了额外的挑战。

方法

这项工作提出了一种新的简化增强方法，称为 TRandAugment，专门针对长时间的手术视频设计，它将每个视频视为时间片段的集合，并对每个片段应用一致但随机的变换。所提出的增强方法用于训练一个端到端的时空模型，该模型由一个 CNN（ResNet50）和一个 TCN 组成。

结果

所提出的方法在两个手术视频数据集，即 Bypass40 和 CATARACTS 以及两个任务，即手术阶段和步骤识别上的有效性得到了证明。与使用手动设计的增强方法的先前最先进的方法相比，TRandAugment 提高了 1-6%的性能。

结论

本文提出了一种用于长时间手术视频的简化自动化增强方法。该方法已在不同的数据集和任务上进行了验证，表明为长时间手术视频设计时间增强方法的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7542/10491694/ee9703e3ce16/11548_2023_2864_Fig1_HTML.jpg

相似文献

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos.TRandAugment：一种用于从视频中识别手术活动的时间随机增强策略。

Int J Comput Assist Radiol Surg. 2023 Sep;18(9):1665-1672. doi: 10.1007/s11548-023-02864-8. Epub 2023 Mar 22.

Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures.多任务时频卷积网络联合识别胃旁路手术中的手术阶段和步骤。

Int J Comput Assist Radiol Surg. 2021 Jul;16(7):1111-1119. doi: 10.1007/s11548-021-02388-z. Epub 2021 May 19.

Weakly Supervised Temporal Convolutional Networks for Fine-Grained Surgical Activity Recognition.用于细粒度手术活动识别的弱监督时间卷积网络

IEEE Trans Med Imaging. 2023 Sep;42(9):2592-2602. doi: 10.1109/TMI.2023.3262847. Epub 2023 Aug 31.

Smart data augmentation for surgical tool detection on the surgical tray.用于手术托盘上手术工具检测的智能数据增强

Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:4407-4410. doi: 10.1109/EMBC.2017.8037833.

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。

JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.

Assisted phase and step annotation for surgical videos.辅助手术视频的阶段和步骤标注。

Int J Comput Assist Radiol Surg. 2020 Apr;15(4):673-680. doi: 10.1007/s11548-019-02108-8. Epub 2020 Feb 10.

Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.使用增强卷积和递归神经网络监测手术视频中的工具使用情况。

Med Image Anal. 2018 Jul;47:203-218. doi: 10.1016/j.media.2018.05.001. Epub 2018 May 9.

SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network.SV-RCNet：基于递归卷积网络的手术视频工作流程识别

IEEE Trans Med Imaging. 2018 May;37(5):1114-1126. doi: 10.1109/TMI.2017.2787657.

Global-local multi-stage temporal convolutional network for cataract surgery phase recognition.用于白内障手术阶段识别的全局-局部多阶段时间卷积网络。

Biomed Eng Online. 2022 Nov 30;21(1):82. doi: 10.1186/s12938-022-01048-w.

SF-TMN: SlowFast temporal modeling network for surgical phase recognition.SF-TMN：用于手术阶段识别的慢快时变建模网络。

Int J Comput Assist Radiol Surg. 2024 May;19(5):871-880. doi: 10.1007/s11548-024-03095-1. Epub 2024 Mar 21.

本文引用的文献

Semi-supervised learning with progressive unlabeled data excavation for label-efficient surgical workflow recognition.基于渐进式未标记数据挖掘的半监督学习在标签高效手术流程识别中的应用。

Med Image Anal. 2021 Oct;73:102158. doi: 10.1016/j.media.2021.102158. Epub 2021 Jul 8.

SAGES consensus recommendations on an annotation framework for surgical video.外科手术视频标注框架的 SAGES 共识建议

Surg Endosc. 2021 Sep;35(9):4918-4929. doi: 10.1007/s00464-021-08578-9. Epub 2021 Jul 6.

Int J Comput Assist Radiol Surg. 2021 Jul;16(7):1111-1119. doi: 10.1007/s11548-021-02388-z. Epub 2021 May 19.

Multi-task recurrent convolutional network with correlation loss for surgical video analysis.基于相关损失的多任务递归卷积网络在手术视频分析中的应用。

Med Image Anal. 2020 Jan;59:101572. doi: 10.1016/j.media.2019.101572. Epub 2019 Oct 10.

Surgical data science for next-generation interventions.面向下一代干预措施的外科数据科学。

Nat Biomed Eng. 2017 Sep;1(9):691-696. doi: 10.1038/s41551-017-0132-7.

Weakly supervised convolutional LSTM approach for tool tracking in laparoscopic videos.基于弱监督卷积 LSTM 的腹腔镜视频中工具跟踪方法。

Int J Comput Assist Radiol Surg. 2019 Jun;14(6):1059-1067. doi: 10.1007/s11548-019-01958-6. Epub 2019 Apr 9.

CATARACTS: Challenge on automatic tool annotation for cataRACT surgery.白内障：白内障手术自动工具标注挑战。

Med Image Anal. 2019 Feb;52:24-41. doi: 10.1016/j.media.2018.11.008. Epub 2018 Nov 16.

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos.EndoNet：腹腔镜视频识别任务的深度架构。

IEEE Trans Med Imaging. 2017 Jan;36(1):86-97. doi: 10.1109/TMI.2016.2593957. Epub 2016 Jul 22.

LapOntoSPM: an ontology for laparoscopic surgeries and its application to surgical phase recognition.LapOntoSPM：一种用于腹腔镜手术的本体及其在手术阶段识别中的应用。

Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1427-34. doi: 10.1007/s11548-015-1222-1. Epub 2015 Jun 11.

Toward increased autonomy in the surgical OR: needs, requests, and expectations.朝着手术 OR 中的自主性提升：需求、请求和期望。

Surg Endosc. 2013 May;27(5):1681-8. doi: 10.1007/s00464-012-2656-y. Epub 2012 Dec 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

TRandAugment：一种用于从视频中识别手术活动的时间随机增强策略。

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献