• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SHADeS:通过非朗伯体图像分解实现自监督单目深度估计

SHADeS: self-supervised monocular depth estimation through non-Lambertian image decomposition.

作者信息

Daher Rema, Vasconcelos Francisco, Stoyanov Danail

机构信息

Department of Computer Science, UCL Hawkes Institute, University College London, Gower Street, London, WC1E 6BT, UK.

出版信息

Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1255-1263. doi: 10.1007/s11548-025-03371-8. Epub 2025 May 13.

DOI:10.1007/s11548-025-03371-8
PMID:40358844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12167237/
Abstract

PURPOSE

Visual 3D scene reconstruction can support colonoscopy navigation. It can help in recognising which portions of the colon have been visualised and characterising the size and shape of polyps. This is still a very challenging problem due to complex illumination variations, including abundant specular reflections. We investigate how to effectively decouple light and depth in this problem.

METHODS

We introduce a self-supervised model that simultaneously characterises the shape and lighting of the visualised colonoscopy scene. Our model estimates shading, albedo, depth, and specularities (SHADeS) from single images. Unlike previous approaches (IID (Li et al. IEEE J Biomed Health Inform https://doi.org/10.1109/JBHI.2024.3400804 , 2024)), we use a non-Lambertian model that treats specular reflections as a separate light component. The implementation of our method is available at https://github.com/RemaDaher/SHADeS .

RESULTS

We demonstrate on real colonoscopy images (Hyper Kvasir) that previous models for light decomposition (IID) and depth estimation (MonoViT, ModoDepth2) are negatively affected by specularities. In contrast, SHADeS can simultaneously produce light decomposition and depth maps that are robust to specular regions. We also perform a quantitative comparison on phantom data (C3VD) where we further demonstrate the robustness of our model.

CONCLUSION

Modelling specular reflections improves depth estimation in colonoscopy. We propose an effective self-supervised approach that uses this insight to jointly estimate light decomposition and depth. Light decomposition has the potential to help with other problems, such as place recognition within the colon.

摘要

目的

可视化三维场景重建可支持结肠镜检查导航。它有助于识别结肠镜已观察到的结肠部分,并确定息肉的大小和形状。由于复杂的光照变化,包括大量的镜面反射,这仍然是一个极具挑战性的问题。我们研究如何在这个问题中有效地分离光线和深度。

方法

我们引入了一种自监督模型,该模型同时对可视化结肠镜检查场景的形状和光照进行表征。我们的模型从单张图像中估计阴影、反照率、深度和镜面反射(SHADeS)。与之前的方法(IID(Li等人,《IEEE生物医学与健康信息学杂志》https://doi.org/10.1109/JBHI.2024.3400804,2024年))不同,我们使用一种非朗伯模型,将镜面反射视为一个单独的光分量。我们方法的实现可在https://github.com/RemaDaher/SHADeS获取。

结果

我们在真实的结肠镜检查图像(Hyper Kvasir)上证明,之前用于光分解(IID)和深度估计(MonoViT、ModoDepth2)的模型会受到镜面反射的负面影响。相比之下,SHADeS可以同时生成对镜面反射区域具有鲁棒性的光分解图和深度图。我们还对体模数据(C3VD)进行了定量比较,进一步证明了我们模型的鲁棒性。

结论

对镜面反射进行建模可改善结肠镜检查中的深度估计。我们提出了一种有效的自监督方法,利用这一见解联合估计光分解和深度。光分解有潜力帮助解决其他问题,例如结肠内的位置识别。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/d86f222fb5d4/11548_2025_3371_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/b7368141be65/11548_2025_3371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/31d4156ef650/11548_2025_3371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/e4b515a2ab62/11548_2025_3371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/1daa6254c33f/11548_2025_3371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/ee4ae80a2982/11548_2025_3371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/d86f222fb5d4/11548_2025_3371_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/b7368141be65/11548_2025_3371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/31d4156ef650/11548_2025_3371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/e4b515a2ab62/11548_2025_3371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/1daa6254c33f/11548_2025_3371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/ee4ae80a2982/11548_2025_3371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfb1/12167237/d86f222fb5d4/11548_2025_3371_Fig6_HTML.jpg

相似文献

1
SHADeS: self-supervised monocular depth estimation through non-Lambertian image decomposition.SHADeS:通过非朗伯体图像分解实现自监督单目深度估计
Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1255-1263. doi: 10.1007/s11548-025-03371-8. Epub 2025 May 13.
2
Leveraging a realistic synthetic database to learn Shape-from-Shading for estimating the colon depth in colonoscopy images.利用逼真的合成数据库学习从阴影中估计结肠镜图像中结肠深度的形状。
Comput Med Imaging Graph. 2024 Jul;115:102390. doi: 10.1016/j.compmedimag.2024.102390. Epub 2024 May 3.
3
Self-supervised monocular depth estimation for high field of view colonoscopy cameras.用于高视野结肠镜检查相机的自监督单目深度估计
Front Robot AI. 2023 Jul 25;10:1212525. doi: 10.3389/frobt.2023.1212525. eCollection 2023.
4
Unsupervised colonoscopic depth estimation by domain translations with a Lambertian-reflection keeping auxiliary task.基于 Lambertian 反射保持辅助任务的域变换的无监督结肠镜深度估计。
Int J Comput Assist Radiol Surg. 2021 Jun;16(6):989-1001. doi: 10.1007/s11548-021-02398-x. Epub 2021 May 17.
5
SfMDiffusion: self-supervised monocular depth estimation in endoscopy based on diffusion models.SfMDiffusion:基于扩散模型的内窥镜自监督单目深度估计
Int J Comput Assist Radiol Surg. 2025 May;20(5):971-979. doi: 10.1007/s11548-025-03333-0. Epub 2025 Feb 24.
6
Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy.结肠镜检查中基于跨任务一致性的多任务学习以改善深度估计。
Med Image Anal. 2025 Jan;99:103379. doi: 10.1016/j.media.2024.103379. Epub 2024 Nov 4.
7
Specular reflections and the perception of shape.镜面反射与形状感知。
J Vis. 2004 Sep 23;4(9):798-820. doi: 10.1167/4.9.10.
8
MonoLoT: Self-Supervised Monocular Depth Estimation in Low-Texture Scenes for Automatic Robotic Endoscopy.MonoLoT:用于自动机器人内窥镜的低纹理场景下的自监督单目深度估计。
IEEE J Biomed Health Inform. 2024 Oct;28(10):6078-6091. doi: 10.1109/JBHI.2024.3423791. Epub 2024 Oct 3.
9
Image Intrinsic-Based Unsupervised Monocular Depth Estimation in Endoscopy.基于图像内在特征的内窥镜无监督单目深度估计
IEEE J Biomed Health Inform. 2024 May 14;PP. doi: 10.1109/JBHI.2024.3400804.
10
Self-Supervised monocular depth and ego-Motion estimation in endoscopy: Appearance flow to the rescue.内窥镜中单目深度和自我运动估计的自监督学习:外观流来救援。
Med Image Anal. 2022 Apr;77:102338. doi: 10.1016/j.media.2021.102338. Epub 2021 Dec 25.

本文引用的文献

1
WS-SfMLearner: self-supervised monocular depth and ego-motion estimation on surgical videos with unknown camera parameters.WS-SfMLearner:在相机参数未知的手术视频上进行自监督单目深度和自我运动估计。
J Med Imaging (Bellingham). 2025 Mar;12(2):025003. doi: 10.1117/1.JMI.12.2.025003. Epub 2025 Apr 30.
2
Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model.Diff-DTI:使用特征增强联合扩散模型的快速扩散张量成像
IEEE J Biomed Health Inform. 2024 Dec 30;PP. doi: 10.1109/JBHI.2024.3523532.
3
Image Intrinsic-Based Unsupervised Monocular Depth Estimation in Endoscopy.
基于图像内在特征的内窥镜无监督单目深度估计
IEEE J Biomed Health Inform. 2024 May 14;PP. doi: 10.1109/JBHI.2024.3400804.
4
Monocular endoscopy images depth estimation with multi-scale residual fusion.单目内窥镜图像的多尺度残差融合深度估计。
Comput Biol Med. 2024 Feb;169:107850. doi: 10.1016/j.compbiomed.2023.107850. Epub 2023 Dec 17.
5
A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence.一种用于修复内镜镜面反射的时间学习方法及其对图像匹配的影响。
Med Image Anal. 2023 Dec;90:102994. doi: 10.1016/j.media.2023.102994. Epub 2023 Oct 4.
6
Colonoscopy 3D video dataset with paired depth from 2D-3D registration.具有通过二维到三维配准得到的配对深度信息的结肠镜检查三维视频数据集。
Med Image Anal. 2023 Dec;90:102956. doi: 10.1016/j.media.2023.102956. Epub 2023 Sep 7.
7
Self-Supervised monocular depth and ego-Motion estimation in endoscopy: Appearance flow to the rescue.内窥镜中单目深度和自我运动估计的自监督学习:外观流来救援。
Med Image Anal. 2022 Apr;77:102338. doi: 10.1016/j.media.2021.102338. Epub 2021 Dec 25.
8
EndoSLAM dataset and an unsupervised monocular visual odometry and depth estimation approach for endoscopic videos.内镜 SLAM 数据集和一种用于内镜视频的无监督单目视觉里程计和深度估计方法。
Med Image Anal. 2021 Jul;71:102058. doi: 10.1016/j.media.2021.102058. Epub 2021 Apr 15.
9
HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy.HyperKvasir,一个用于胃肠道内镜的全面多类图像和视频数据集。
Sci Data. 2020 Aug 28;7(1):283. doi: 10.1038/s41597-020-00622-y.