• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的单目深度估计研究综述。

Monocular Depth Estimation Using Deep Learning: A Review.

机构信息

Department of Computer Engineering and Mathematics, University of Rovira i Virgili, 43007 Tarragona, Spain.

Department of Electrical and Computer Engineering, University of California, Riverside, CA 92521, USA.

出版信息

Sensors (Basel). 2022 Jul 18;22(14):5353. doi: 10.3390/s22145353.

DOI:10.3390/s22145353
PMID:35891033
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9325018/
Abstract

In current decades, significant advancements in robotics engineering and autonomous vehicles have improved the requirement for precise depth measurements. Depth estimation (DE) is a traditional task in computer vision that can be appropriately predicted by applying numerous procedures. This task is vital in disparate applications such as augmented reality and target tracking. Conventional monocular DE (MDE) procedures are based on depth cues for depth prediction. Various deep learning techniques have demonstrated their potential applications in managing and supporting the traditional ill-posed problem. The principal purpose of this paper is to represent a state-of-the-art review of the current developments in MDE based on deep learning techniques. For this goal, this paper tries to highlight the critical points of the state-of-the-art works on MDE from disparate aspects. These aspects include input data shapes and training manners such as supervised, semi-supervised, and unsupervised learning approaches in combination with applying different datasets and evaluation indicators. At last, limitations regarding the accuracy of the DL-based MDE models, computational time requirements, real-time inference, transferability, input images shape and domain adaptation, and generalization are discussed to open new directions for future research.

摘要

在最近几十年,机器人工程和自动驾驶汽车领域的重大进展提高了对精确深度测量的要求。深度估计(Depth Estimation,DE)是计算机视觉中的一项传统任务,可以通过应用多种程序来进行适当的预测。这项任务在增强现实和目标跟踪等不同应用中至关重要。传统的单目深度估计(Monocular DE,MDE)程序基于深度线索进行深度预测。各种深度学习技术已证明它们在管理和支持传统不适定问题方面的潜在应用。本文的主要目的是代表基于深度学习技术的 MDE 最新进展的最新综述。为此,本文试图从不同方面突出 MDE 最新工作的关键点。这些方面包括输入数据的形状和训练方式,例如监督、半监督和无监督学习方法,以及结合不同的数据集和评估指标的应用。最后,讨论了基于深度学习的 MDE 模型的准确性、计算时间要求、实时推断、可转移性、输入图像形状和域自适应以及泛化方面的局限性,为未来的研究开辟了新的方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/58af259c0146/sensors-22-05353-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f3d3ec0b3ecf/sensors-22-05353-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f84652aa74cb/sensors-22-05353-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/ac2e501838c8/sensors-22-05353-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f34c4549a2da/sensors-22-05353-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/658e472c0938/sensors-22-05353-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/26e3d993bdbb/sensors-22-05353-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/53f0bcf4bac7/sensors-22-05353-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f3cb08378d11/sensors-22-05353-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/8aad7d2b433b/sensors-22-05353-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/264bed94f867/sensors-22-05353-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/7875f9f88642/sensors-22-05353-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/7874ed15de19/sensors-22-05353-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/875e63d85dfe/sensors-22-05353-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/64576a5417fe/sensors-22-05353-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/2c0fd91a4662/sensors-22-05353-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/58af259c0146/sensors-22-05353-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f3d3ec0b3ecf/sensors-22-05353-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f84652aa74cb/sensors-22-05353-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/ac2e501838c8/sensors-22-05353-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f34c4549a2da/sensors-22-05353-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/658e472c0938/sensors-22-05353-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/26e3d993bdbb/sensors-22-05353-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/53f0bcf4bac7/sensors-22-05353-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/f3cb08378d11/sensors-22-05353-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/8aad7d2b433b/sensors-22-05353-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/264bed94f867/sensors-22-05353-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/7875f9f88642/sensors-22-05353-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/7874ed15de19/sensors-22-05353-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/875e63d85dfe/sensors-22-05353-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/64576a5417fe/sensors-22-05353-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/2c0fd91a4662/sensors-22-05353-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af65/9325018/58af259c0146/sensors-22-05353-g016.jpg

相似文献

1
Monocular Depth Estimation Using Deep Learning: A Review.基于深度学习的单目深度估计研究综述。
Sensors (Basel). 2022 Jul 18;22(14):5353. doi: 10.3390/s22145353.
2
Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review.基于深度学习的单目深度估计方法——最新综述。
Sensors (Basel). 2020 Apr 16;20(8):2272. doi: 10.3390/s20082272.
3
Online supervised attention-based recurrent depth estimation from monocular video.基于在线监督注意力机制的单目视频递归深度估计
PeerJ Comput Sci. 2020 Nov 23;6:e317. doi: 10.7717/peerj-cs.317. eCollection 2020.
4
Self-Supervised Adversarial Training of Monocular Depth Estimation Against Physical-World Attacks.
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):9084-9101. doi: 10.1109/TPAMI.2024.3412632. Epub 2024 Nov 6.
5
Monocular Depth Estimation via Self-Supervised Self-Distillation.通过自监督自蒸馏进行单目深度估计
Sensors (Basel). 2024 Jun 24;24(13):4090. doi: 10.3390/s24134090.
6
Self-supervised recurrent depth estimation with attention mechanisms.基于注意力机制的自监督循环深度估计
PeerJ Comput Sci. 2022 Jan 31;8:e865. doi: 10.7717/peerj-cs.865. eCollection 2022.
7
SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.SFA-MDEN:基于语义特征辅助的双通道单目深度估计网络。
Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
8
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation.基于立体视觉的深度估计深度学习技术研究综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):1738-1764. doi: 10.1109/TPAMI.2020.3032602. Epub 2022 Mar 4.
9
Recovering dense 3D point clouds from single endoscopic image.从单张内窥镜图像中恢复密集三维点云。
Comput Methods Programs Biomed. 2021 Jun;205:106077. doi: 10.1016/j.cmpb.2021.106077. Epub 2021 Apr 3.
10
Image Intrinsic-Based Unsupervised Monocular Depth Estimation in Endoscopy.基于图像内在特征的内窥镜无监督单目深度估计
IEEE J Biomed Health Inform. 2024 May 14;PP. doi: 10.1109/JBHI.2024.3400804.

引用本文的文献

1
Human-like monocular depth biases in deep neural networks.深度神经网络中类似人类的单眼深度偏差。
PLoS Comput Biol. 2025 Aug 19;21(8):e1013020. doi: 10.1371/journal.pcbi.1013020. eCollection 2025 Aug.
2
SfMDiffusion: self-supervised monocular depth estimation in endoscopy based on diffusion models.SfMDiffusion:基于扩散模型的内窥镜自监督单目深度估计
Int J Comput Assist Radiol Surg. 2025 May;20(5):971-979. doi: 10.1007/s11548-025-03333-0. Epub 2025 Feb 24.
3
MonoDFNet: Monocular 3D Object Detection with Depth Fusion and Adaptive Optimization.

本文引用的文献

1
PSOWNNs-CNN: A Computational Radiology for Breast Cancer Diagnosis Improvement Based on Image Processing Using Machine Learning Methods.PSOWNNs-CNN:一种基于图像处理和机器学习方法的计算机放射学乳腺癌诊断改进算法。
Comput Intell Neurosci. 2022 May 11;2022:5667264. doi: 10.1155/2022/5667264. eCollection 2022.
2
Self-supervised recurrent depth estimation with attention mechanisms.基于注意力机制的自监督循环深度估计
PeerJ Comput Sci. 2022 Jan 31;8:e865. doi: 10.7717/peerj-cs.865. eCollection 2022.
3
Stereoscopic portable hybrid gamma imaging for source depth estimation.
MonoDFNet:基于深度融合与自适应优化的单目3D目标检测
Sensors (Basel). 2025 Jan 27;25(3):760. doi: 10.3390/s25030760.
4
Effect of Depth Band Replacement on Red, Green and Blue Image for Deep Learning Weed Detection.深度波段替换对用于深度学习杂草检测的红、绿、蓝图像的影响
Sensors (Basel). 2024 Dec 30;25(1):161. doi: 10.3390/s25010161.
5
Residual Vision Transformer and Adaptive Fusion Autoencoders for Monocular Depth Estimation.用于单目深度估计的残差视觉Transformer和自适应融合自动编码器
Sensors (Basel). 2024 Dec 26;25(1):80. doi: 10.3390/s25010080.
6
Fusing Events and Frames with Coordinate Attention Gated Recurrent Unit for Monocular Depth Estimation.基于坐标注意力门控循环单元融合事件与帧用于单目深度估计
Sensors (Basel). 2024 Dec 4;24(23):7752. doi: 10.3390/s24237752.
7
Impact of Perception Errors in Vision-Based Detection and Tracking Pipelines on Pedestrian Trajectory Prediction in Autonomous Driving Systems.基于视觉的检测与跟踪管道中的感知误差对自动驾驶系统中行人轨迹预测的影响
Sensors (Basel). 2024 Aug 5;24(15):5066. doi: 10.3390/s24155066.
8
Monocular Absolute Depth Estimation from Motion for Small Unmanned Aerial Vehicles by Geometry-Based Scale Recovery.基于几何尺度恢复的小型无人机运动单目绝对深度估计
Sensors (Basel). 2024 Jul 13;24(14):4541. doi: 10.3390/s24144541.
9
Synthetic Data Enhancement and Network Compression Technology of Monocular Depth Estimation for Real-Time Autonomous Driving System.用于实时自动驾驶系统的单目深度估计的合成数据增强与网络压缩技术
Sensors (Basel). 2024 Jun 28;24(13):4205. doi: 10.3390/s24134205.
10
LNMVSNet: A Low-Noise Multi-View Stereo Depth Inference Method for 3D Reconstruction.LNMVSNet:一种用于三维重建的低噪声多视图立体深度推理方法
Sensors (Basel). 2024 Apr 9;24(8):2400. doi: 10.3390/s24082400.
立体便携混合γ成像用于源深度估计。
Phys Med Biol. 2021 Feb 12;66(4):045031. doi: 10.1088/1361-6560/abd955.
4
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer.迈向稳健的单目深度估计:混合数据集以实现零样本跨数据集迁移。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1623-1637. doi: 10.1109/TPAMI.2020.3019967. Epub 2022 Feb 3.
5
Pupil Localisation and Eye Centre Estimation Using Machine Learning and Computer Vision.使用机器学习和计算机视觉进行瞳孔定位和眼中心估计。
Sensors (Basel). 2020 Jul 6;20(13):3785. doi: 10.3390/s20133785.
6
Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review.基于深度学习的单目深度估计方法——最新综述。
Sensors (Basel). 2020 Apr 16;20(8):2272. doi: 10.3390/s20082272.
7
Miniaturized 3D Depth Sensing-Based Smartphone Light Field Camera.基于微型3D深度感知的智能手机光场相机。
Sensors (Basel). 2020 Apr 9;20(7):2129. doi: 10.3390/s20072129.
8
Review of Stereo Matching Algorithms Based on Deep Learning.基于深度学习的立体匹配算法综述。
Comput Intell Neurosci. 2020 Mar 23;2020:8562323. doi: 10.1155/2020/8562323. eCollection 2020.
9
Dense Depth Estimation in Monocular Endoscopy With Self-Supervised Learning Methods.基于自监督学习方法的单目内窥镜下密集深度估计。
IEEE Trans Med Imaging. 2020 May;39(5):1438-1447. doi: 10.1109/TMI.2019.2950936. Epub 2019 Nov 1.
10
Deep Ordinal Regression Network for Monocular Depth Estimation.用于单目深度估计的深度序数回归网络
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2018 Jun;2018:2002-2011. doi: 10.1109/CVPR.2018.00214. Epub 2018 Dec 17.