

Application of deep learning for semantic segmentation in robotic prostatectomy: Comparison of convolutional neural networks and visual transformers.

Affiliations

Department of Urology, Kangnam Sacred Heart Hospital, Hallym University College of Medicine, Seoul, Korea.

STARLABS Corp., Seoul, Korea.

Publication information

Investig Clin Urol. 2024 Nov;65(6):551-558. doi: 10.4111/icu.20240159.

Abstract

PURPOSE

Semantic segmentation is a fundamental part of the surgical application of deep learning. Traditionally, segmentation in vision tasks has been performed using convolutional neural networks (CNNs), but the transformer architecture has recently been introduced and widely investigated. We aimed to investigate the performance of deep learning models in segmentation in robot-assisted radical prostatectomy (RARP) and identify which of the architectures is superior for segmentation in robotic surgery.

MATERIALS AND METHODS

Intraoperative images were obtained during RARP. The dataset was randomly split into training and validation sets. Segmentation of the surgical instruments, bladder, prostate, vas, and seminal vesicle was performed using three CNN models (DeepLabv3, MANet, and U-Net++) and three transformer models (SegFormer, BEiT, and DPT), and their performances were analyzed.
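The random train/validation split described above can be sketched as follows. The filenames and the 80/20 ratio are illustrative assumptions; the abstract does not state the actual split ratio or dataset size.

```python
import random

# Hypothetical frame filenames standing in for the intraoperative images.
paths = [f"frame_{i:04d}.png" for i in range(100)]

random.seed(0)         # fixed seed so the split is reproducible
random.shuffle(paths)  # randomize order before splitting

split = int(0.8 * len(paths))  # assumed 80/20 ratio (not stated in the paper)
train, val = paths[:split], paths[split:]
print(len(train), len(val))  # → 80 20
```

Shuffling before slicing ensures the validation set is a random sample rather than the last frames of a single procedure.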

RESULTS

Overall segmentation performance during RARP varied across model architectures. Among the CNN models, DeepLabv3 achieved a mean Dice score of 0.938, MANet 0.944, and U-Net++ 0.930. Among the transformer models, SegFormer attained a mean Dice score of 0.919, BEiT 0.916, and DPT 0.940. The CNN models outperformed the transformer models in segmenting the prostate, vas, and seminal vesicle.
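The mean Dice scores reported above use the standard overlap metric, defined as 2|A∩B| / (|A| + |B|) for a predicted mask A and a ground-truth mask B. A minimal pure-Python sketch on toy flattened binary masks (the mask values are illustrative, not drawn from the study):

```python
def dice_score(pred, target):
    """Dice coefficient for two binary masks: 2|A∩B| / (|A| + |B|)."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Convention: two empty masks count as a perfect match.
    return 2.0 * intersection / total if total > 0 else 1.0

# Toy flattened masks (hypothetical values):
pred   = [1, 1, 0, 1, 0, 0, 1, 1]
target = [1, 0, 0, 1, 1, 0, 1, 1]
print(dice_score(pred, target))  # → 0.8
```

A per-class mean Dice, as reported in the study, averages this score over the segmented structures (instruments, bladder, prostate, vas, seminal vesicle).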

CONCLUSIONS

Deep learning models provided accurate segmentation of the surgical instruments and anatomical structures observed during RARP. Both CNN and transformer models showed reliable predictions in the segmentation task; however, CNN models may be more suitable than transformer models for organ segmentation and may be more applicable in unusual cases. Further research with large datasets is needed.


Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e76e/11543645/b01525644c04/icu-65-551-g001.jpg
