• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过深度神经网络嵌入焦距从单张图像中学习深度

Learning Depth from Single Images with Deep Neural Network Embedding Focal Length.

作者信息

He Lei, Wang Guanghui, Hu Zhanyi

出版信息

IEEE Trans Image Process. 2018 May 17. doi: 10.1109/TIP.2018.2832296.

DOI:10.1109/TIP.2018.2832296
PMID:29994526
Abstract

Learning depth from a single image, as an important issue in scene understanding, has attracted a lot of attention in the past decade. The accuracy of the depth estimation has been improved from conditional Markov random fields, non-parametric methods, to deep convolutional neural networks most recently. However, there exist inherent ambiguities in recovering 3D from a single 2D image. In this paper, we first prove the ambiguity between the focal length and monocular depth learning, and verify the result using experiments, showing that the focal length has a great influence on accurate depth recovery. In order to learn monocular depth by embedding the focal length, we propose a method to generate synthetic varying-focal-length dataset from fixed-focal-length datasets, and a simple and effective method is implemented to fill the holes in the newly generated images. For the sake of accurate depth recovery, we propose a novel deep neural network to infer depth through effectively fusing the middle-level information on the fixed-focal-length dataset, which outperforms the state-of-the-art methods built on pretrained VGG. Furthermore, the newly generated varying-focallength dataset is taken as input to the proposed network in both learning and inference phases. Extensive experiments on the fixed- and varying-focal-length datasets demonstrate that the learned monocular depth with embedded focal length is significantly improved compared to that without embedding the focal length information.

摘要

从单张图像中学习深度作为场景理解中的一个重要问题,在过去十年中受到了广泛关注。深度估计的准确性已经从条件马尔可夫随机场、非参数方法,发展到最近的深度卷积神经网络。然而,从单张二维图像恢复三维存在固有的模糊性。在本文中,我们首先证明了焦距与单目深度学习之间的模糊性,并通过实验验证了结果,表明焦距对准确的深度恢复有很大影响。为了通过嵌入焦距来学习单目深度,我们提出了一种从固定焦距数据集生成合成变焦距数据集的方法,并实现了一种简单有效的方法来填补新生成图像中的空洞。为了实现准确的深度恢复,我们提出了一种新颖的深度神经网络,通过有效融合固定焦距数据集上的中级信息来推断深度,该方法优于基于预训练VGG构建的现有方法。此外,在学习和推理阶段,新生成的变焦距数据集都作为所提出网络的输入。在固定焦距和变焦距数据集上进行的大量实验表明,与未嵌入焦距信息的情况相比,嵌入焦距后学习到的单目深度有显著提高。

相似文献

1
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length.通过深度神经网络嵌入焦距从单张图像中学习深度
IEEE Trans Image Process. 2018 May 17. doi: 10.1109/TIP.2018.2832296.
2
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.利用深度卷积神经场从单目图像中学习深度。
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.
3
Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review.基于深度学习的单目深度估计方法——最新综述。
Sensors (Basel). 2020 Apr 16;20(8):2272. doi: 10.3390/s20082272.
4
Depth Estimation from Light Field Geometry Using Convolutional Neural Networks.基于卷积神经网络的光场几何深度估计
Sensors (Basel). 2021 Sep 10;21(18):6061. doi: 10.3390/s21186061.
5
Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.基于3D卷积神经网络的实时3D手部姿态估计
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.
6
3D Hand Pose Estimation Using Synthetic Data and Weakly Labeled RGB Images.基于合成数据和弱标注 RGB 图像的三维手姿估计。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3739-3753. doi: 10.1109/TPAMI.2020.2993627. Epub 2021 Oct 1.
7
Deep Monocular Depth Estimation via Integration of Global and Local Predictions.通过全局和局部预测融合实现深度单目深度估计
IEEE Trans Image Process. 2018 May 15. doi: 10.1109/TIP.2018.2836318.
8
Deep Monocular Depth Estimation Based on Content and Contextual Features.基于内容和上下文特征的深度单目深度估计。
Sensors (Basel). 2023 Mar 8;23(6):2919. doi: 10.3390/s23062919.
9
SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.SFA-MDEN:基于语义特征辅助的双通道单目深度估计网络。
Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
10
Convolution-Based Encoding of Depth Images for Transfer Learning in RGB-D Scene Classification.基于卷积的深度图像编码在 RGB-D 场景分类中的迁移学习。
Sensors (Basel). 2021 Nov 28;21(23):7950. doi: 10.3390/s21237950.

引用本文的文献

1
Nested DWT-Based CNN Architecture for Monocular Depth Estimation.基于嵌套 DWT 的单目深度估计卷积神经网络架构。
Sensors (Basel). 2023 Mar 13;23(6):3066. doi: 10.3390/s23063066.
2
Three-dimensional imaging through turbid media using deep learning: NIR transillumination imaging of animal bodies.利用深度学习透过浑浊介质进行三维成像:动物体的近红外透射成像
Biomed Opt Express. 2021 Apr 23;12(5):2873-2887. doi: 10.1364/BOE.420337. eCollection 2021 May 1.
3
A Residual Network and FPGA Based Real-Time Depth Map Enhancement System.一种基于残差网络和现场可编程门阵列的实时深度图增强系统。
Entropy (Basel). 2021 Apr 28;23(5):546. doi: 10.3390/e23050546.
4
Real-Time Single Image Depth Perception in the Wild with Handheld Devices.使用手持设备在自然环境中进行实时单图像深度感知
Sensors (Basel). 2020 Dec 22;21(1):15. doi: 10.3390/s21010015.
5
A comparative study on polyp classification using convolutional neural networks.基于卷积神经网络的息肉分类比较研究。
PLoS One. 2020 Jul 30;15(7):e0236452. doi: 10.1371/journal.pone.0236452. eCollection 2020.
6
Fast Depth Estimation in a Single Image Using Lightweight Efficient Neural Network.基于轻量级高效神经网络的单图像快速深度估计。
Sensors (Basel). 2019 Oct 13;19(20):4434. doi: 10.3390/s19204434.