Progressive Hard-Mining Network for Monocular Depth Estimation.

Publication information

IEEE Trans Image Process. 2018 Aug;27(8):3691-3702. doi: 10.1109/TIP.2018.2821979.

DOI:10.1109/TIP.2018.2821979
PMID:29698202
Abstract

Depth estimation from a monocular RGB image is a challenging computer vision task because no reliable cues are available as prior knowledge. Most existing monocular depth estimation methods, including various geometric and network-learning approaches, lack an effective mechanism to preserve the cross-border details of depth maps, which is nevertheless very important for improving performance. In this paper, we propose a novel end-to-end progressive hard-mining network (PHN) framework to address this problem. Specifically, we construct a hard-mining objective function and intra-scale and inter-scale refinement subnetworks to accurately localize and refine the hard-mining regions. The intra-scale refining block recursively recovers details of depth maps from different semantic features in the same receptive field, while the inter-scale block favors a complementary interaction among multi-scale depth cues of different receptive fields. To further reduce the uncertainty of the network, we design a difficulty-aware refinement loss function to guide the depth learning process, which adaptively focuses on mining the hard regions where accumulated errors easily occur. All three modules collaborate to progressively reduce error propagation in the depth learning process and thereby boost the performance of monocular depth estimation to some extent. We conduct comprehensive evaluations on several public benchmark data sets (NYU Depth V2, KITTI, and Make3D). The experimental results demonstrate the superiority of the proposed PHN framework over other state-of-the-art methods for the monocular depth estimation task.
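
The abstract does not give the form of the difficulty-aware refinement loss, so the following is only an illustrative sketch of the general idea of focusing a depth loss on hard pixels. It is not the PHN loss. The function name `difficulty_weighted_l1`, the `gamma` parameter, and the weighting rule (relative absolute error raised to a power) are all assumptions made for this example.

```python
from typing import Sequence


def difficulty_weighted_l1(
    pred: Sequence[float],
    target: Sequence[float],
    gamma: float = 1.0,
    eps: float = 1e-8,
) -> float:
    """Weighted mean absolute depth error over a flattened depth map.

    Hypothetical weighting rule (an assumption, not taken from the paper):
    each pixel's weight is its absolute error relative to the largest error,
    raised to `gamma`. With gamma = 0 every weight is 1 and the result is the
    plain mean absolute error; larger gamma concentrates the loss on the
    pixels with the largest errors (the "hard" regions).
    """
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and equally long")

    errors = [abs(p - t) for p, t in zip(pred, target)]
    max_err = max(errors)
    if max_err < eps:
        return 0.0  # prediction already matches the target everywhere

    weights = [(e / max_err) ** gamma for e in errors]
    return sum(w * e for w, e in zip(weights, errors)) / sum(weights)


if __name__ == "__main__":
    pred = [1.0, 2.0, 3.0, 4.0]
    target = [1.0, 2.0, 3.0, 6.0]  # only the last pixel is wrong
    print(difficulty_weighted_l1(pred, target, gamma=0.0))  # 0.5 (plain L1)
    print(difficulty_weighted_l1(pred, target, gamma=1.0))  # 2.0 (hard pixel dominates)
```

In a training framework the weights would normally be treated as constants when computing gradients, so that the network is not rewarded for changing the weighting itself.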


Similar articles

1. Progressive Hard-Mining Network for Monocular Depth Estimation.
   IEEE Trans Image Process. 2018 Aug;27(8):3691-3702. doi: 10.1109/TIP.2018.2821979.
2. Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement.
   IEEE Trans Image Process. 2021;30:8811-8822. doi: 10.1109/TIP.2021.3120670. Epub 2021 Oct 27.
3. SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.
   Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
4. Joint Task-Recursive Learning for RGB-D Scene Understanding.
   IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2608-2623. doi: 10.1109/TPAMI.2019.2926728. Epub 2019 Jul 10.
5. Monocular Depth Estimation Using Multi-Scale Continuous CRFs as Sequential Deep Networks.
   IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1426-1440. doi: 10.1109/TPAMI.2018.2839602. Epub 2018 May 22.
6. Deep Monocular Depth Estimation Based on Content and Contextual Features.
   Sensors (Basel). 2023 Mar 8;23(6):2919. doi: 10.3390/s23062919.
7. Unsupervised Monocular Depth Estimation via Recursive Stereo Distillation.
   IEEE Trans Image Process. 2021;30:4492-4504. doi: 10.1109/TIP.2021.3072215. Epub 2021 Apr 27.
8. Laplacian Pyramid Neural Network for Dense Continuous-Value Regression for Complex Scenes.
   IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):5034-5046. doi: 10.1109/TNNLS.2020.3026669. Epub 2021 Oct 27.
9. Deep Ordinal Regression Network for Monocular Depth Estimation.
   Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2018 Jun;2018:2002-2011. doi: 10.1109/CVPR.2018.00214. Epub 2018 Dec 17.
10. Monocular Depth Estimation via Self-Supervised Self-Distillation.
    Sensors (Basel). 2024 Jun 24;24(13):4090. doi: 10.3390/s24134090.