• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过全局和局部预测融合实现深度单目深度估计

Deep Monocular Depth Estimation via Integration of Global and Local Predictions.

作者信息

Kim Youngjung, Jung Hyungjoo, Min Dongbo, Sohn Kwanghoon

出版信息

IEEE Trans Image Process. 2018 May 15. doi: 10.1109/TIP.2018.2836318.

DOI:10.1109/TIP.2018.2836318
PMID:29994769
Abstract

Recent works on machine learning have greatly advanced the accuracy of single image depth estimation. However, the resulting depth images are still over-smoothed and perceptually unsatisfying. This paper casts depth prediction from single image as a parametric learning problem. Specifically, we propose a deep variational model that effectively integrates heterogeneous predictions from two convolutional neural networks (CNNs), named global and local networks. They have contrasting network architecture and are designed to capture depth information with complementary attributes. These intermediate outputs are then combined in the integration network based on the variational framework. By unrolling the optimization steps of Split Bregman (SB) iterations in the integration network, our model can be trained in an end-to-end manner. This enables one to simultaneously learn an efficient parameterization of the CNNs and hyper-parameter in the variational method. Finally, we offer a new dataset of 0.22 million RGB-D images captured by Microsoft Kinect v2. Our model generates realistic and discontinuity-preserving depth prediction without involving any low-level segmentation or superpixels. Intensive experiments demonstrate the superiority of the proposed method in a range of RGB-D benchmarks including both indoor and outdoor scenarios.

摘要

近期关于机器学习的研究极大地提高了单图像深度估计的准确性。然而,生成的深度图像仍然过度平滑,在感知上不能令人满意。本文将单图像深度预测视为一个参数学习问题。具体而言,我们提出了一种深度变分模型,该模型有效地整合了来自两个卷积神经网络(CNN)(即全局网络和局部网络)的异构预测。它们具有截然不同的网络架构,并旨在通过互补属性来捕捉深度信息。然后,这些中间输出基于变分框架在整合网络中进行组合。通过在整合网络中展开分裂布雷格曼(SB)迭代的优化步骤,我们的模型可以以端到端的方式进行训练。这使得人们能够同时学习CNN的有效参数化以及变分方法中的超参数。最后,我们提供了一个由微软Kinect v2捕获的22万张RGB-D图像的新数据集。我们的模型生成逼真且保留不连续性的深度预测,而无需涉及任何低级分割或超像素。大量实验证明了该方法在包括室内和室外场景在内的一系列RGB-D基准测试中的优越性。

相似文献

1
Deep Monocular Depth Estimation via Integration of Global and Local Predictions.通过全局和局部预测融合实现深度单目深度估计
IEEE Trans Image Process. 2018 May 15. doi: 10.1109/TIP.2018.2836318.
2
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.利用单目图像的深度进行目标检测和语义分割。
IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.
3
RT-ViT: Real-Time Monocular Depth Estimation Using Lightweight Vision Transformers.RT-ViT:基于轻量级视觉Transformer 的实时单目深度估计。
Sensors (Basel). 2022 May 19;22(10):3849. doi: 10.3390/s22103849.
4
Deep Monocular Depth Estimation Based on Content and Contextual Features.基于内容和上下文特征的深度单目深度估计。
Sensors (Basel). 2023 Mar 8;23(6):2919. doi: 10.3390/s23062919.
5
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
6
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.利用深度卷积神经场从单目图像中学习深度。
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.
7
Convolution-Based Encoding of Depth Images for Transfer Learning in RGB-D Scene Classification.基于卷积的深度图像编码在 RGB-D 场景分类中的迁移学习。
Sensors (Basel). 2021 Nov 28;21(23):7950. doi: 10.3390/s21237950.
8
Monocular Depth Estimation Using Multi-Scale Continuous CRFs as Sequential Deep Networks.使用多尺度连续条件随机场作为序列深度网络的单目深度估计
IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1426-1440. doi: 10.1109/TPAMI.2018.2839602. Epub 2018 May 22.
9
Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance.基于迁移学习和表面法向导引的卓越单目深度估计。
Sensors (Basel). 2020 Aug 27;20(17):4856. doi: 10.3390/s20174856.
10
Lung tumor segmentation in 4D CT images using motion convolutional neural networks.使用运动卷积神经网络进行 4D CT 图像中的肺部肿瘤分割。
Med Phys. 2021 Nov;48(11):7141-7153. doi: 10.1002/mp.15204. Epub 2021 Sep 13.

引用本文的文献

1
Fusing Events and Frames with Coordinate Attention Gated Recurrent Unit for Monocular Depth Estimation.基于坐标注意力门控循环单元融合事件与帧用于单目深度估计
Sensors (Basel). 2024 Dec 4;24(23):7752. doi: 10.3390/s24237752.
2
Computational Large Field-of-View RGB-D Integral Imaging System.计算型大视场 RGB-D 积分成像系统。
Sensors (Basel). 2021 Nov 8;21(21):7407. doi: 10.3390/s21217407.