Spatio-temporal layers based intra-operative stereo depth estimation network via hierarchical prediction and progressive training.

Affiliation

Politecnico di Milano, Department of Electronics, Information and Bioengineering, Milano, 20133, Italy.

Publication information

Comput Methods Programs Biomed. 2024 Feb;244:107937. doi: 10.1016/j.cmpb.2023.107937. Epub 2023 Nov 22.

DOI:10.1016/j.cmpb.2023.107937

PMID:38006707

Abstract

BACKGROUND AND OBJECTIVE

The safety of robotic surgery can be enhanced through augmented vision or artificial constraints on the robot's motion. Intra-operative depth estimation is the cornerstone of these applications because it provides precise position information about the surgical scene in 3D space. High-quality depth estimation of endoscopic scenes remains an important open problem, and advances in deep learning offer new ways to address it.

METHODS

In this paper, a deep learning-based approach is proposed to recover 3D information from intra-operative scenes. To this end, a fully 3D encoder-decoder network integrating spatio-temporal layers is designed. It adopts hierarchical prediction and progressive learning to improve prediction accuracy and shorten training time.

RESULTS

Our network achieves a depth estimation accuracy of MAE 2.55 ± 1.51 mm and RMSE 5.23 ± 1.40 mm on 8 surgical videos with a resolution of 1280×1024, outperforming six other state-of-the-art methods trained on the same data.
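For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how MAE and RMSE are commonly computed for a predicted depth map against ground truth. It is not the authors' evaluation code: the function name `depth_errors`, the flat-list representation, and the convention that a ground-truth value of `None` or `<= 0` marks an invalid pixel are all assumptions made for illustration. The abstract does not say how the ± spread was aggregated, so that step is left out.

```python
import math


def depth_errors(pred_mm, gt_mm):
    """Return (MAE, RMSE) in mm over pixels with valid ground truth.

    pred_mm, gt_mm: equal-length sequences of per-pixel depths in mm.
    Assumption (hypothetical convention): a ground-truth value of None
    or <= 0 means "no valid measurement" and the pixel is skipped.
    """
    diffs = [p - g for p, g in zip(pred_mm, gt_mm) if g is not None and g > 0]
    if not diffs:
        raise ValueError("no valid ground-truth pixels")
    # Mean absolute error: average magnitude of the per-pixel error.
    mae = sum(abs(d) for d in diffs) / len(diffs)
    # Root mean squared error: penalizes large errors more heavily.
    rmse = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    return mae, rmse


# Toy example with four pixels; the third has no valid ground truth.
pred = [50.0, 62.0, 71.0, 80.0]
gt = [52.0, 60.0, 0.0, 84.0]
mae, rmse = depth_errors(pred, gt)
print(f"MAE = {mae:.3f} mm, RMSE = {rmse:.3f} mm")
```

Because squaring weights large errors more heavily, RMSE is never smaller than MAE on the same set of pixels, which is consistent with the reported 5.23 mm versus 2.55 mm.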

CONCLUSIONS

Our network achieves promising depth estimation performance in intra-operative scenes using stereo images, which allows its integration into robot-assisted surgery to enhance safety.

Similar articles

Spatio-temporal layers based intra-operative stereo depth estimation network via hierarchical prediction and progressive training.

Comput Methods Programs Biomed. 2024 Feb;244:107937. doi: 10.1016/j.cmpb.2023.107937. Epub 2023 Nov 22.

FRSR: Framework for real-time scene reconstruction in robot-assisted minimally invasive surgery.

Comput Biol Med. 2023 Sep;163:107121. doi: 10.1016/j.compbiomed.2023.107121. Epub 2023 Jun 3.

Motion Decoupling Network for Intra-Operative Motion Estimation Under Occlusion.

IEEE Trans Med Imaging. 2023 Oct;42(10):2924-2935. doi: 10.1109/TMI.2023.3268774. Epub 2023 Oct 2.

Details preserved unsupervised depth estimation by fusing traditional stereo knowledge from laparoscopic images.

Healthc Technol Lett. 2019 Nov 13;6(6):154-158. doi: 10.1049/htl.2019.0063. eCollection 2019 Dec.

Surgical-DINO: adapter learning of foundation models for depth estimation in endoscopic surgery.

Int J Comput Assist Radiol Surg. 2024 Jun;19(6):1013-1020. doi: 10.1007/s11548-024-03083-5. Epub 2024 Mar 8.

Dynamic surface reconstruction in robot-assisted minimally invasive surgery based on neural radiance fields.

Int J Comput Assist Radiol Surg. 2024 Mar;19(3):519-530. doi: 10.1007/s11548-023-03016-8. Epub 2023 Sep 28.

A spatio-temporal network for video semantic segmentation in surgical videos.

Int J Comput Assist Radiol Surg. 2024 Feb;19(2):375-382. doi: 10.1007/s11548-023-02971-6. Epub 2023 Jun 22.

Recovering dense 3D point clouds from single endoscopic image.

Comput Methods Programs Biomed. 2021 Jun;205:106077. doi: 10.1016/j.cmpb.2021.106077. Epub 2021 Apr 3.

RT-ViT: Real-Time Monocular Depth Estimation Using Lightweight Vision Transformers.

Sensors (Basel). 2022 May 19;22(10):3849. doi: 10.3390/s22103849.

Multi-Scale Spatio-Temporal Feature Extraction and Depth Estimation from Sequences by Ordinal Classification.

Sensors (Basel). 2020 Apr 1;20(7):1979. doi: 10.3390/s20071979.

Cited by

Non-rigid scene reconstruction of deformable soft tissue with monocular endoscopy in minimally invasive surgery.

Int J Comput Assist Radiol Surg. 2024 Dec;19(12):2433-2443. doi: 10.1007/s11548-024-03149-4. Epub 2024 May 6.
