Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance.

Affiliations

Department of Mechanical Engineering and Automation, School of Mechanical and Aerospace Engineering, Jilin University, Changchun 130022, China.

Research Center for Space Optical Engineering, Harbin Institute of Technology, P.O. Box 307, Harbin 150001, China.

Publication information

Sensors (Basel). 2020 Aug 27;20(17):4856. doi: 10.3390/s20174856.

DOI:10.3390/s20174856
PMID:32867293
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7506624/
Abstract

Accurately sensing the surrounding 3D scene is indispensable for drones and robots performing path planning and navigation. This paper proposes a novel monocular depth estimation method that first uses a lighter-weight convolutional neural network (CNN) to predict a coarse depth image and then refines that image with surface normal guidance. Specifically, the coarse depth prediction network is designed as a pre-trained encoder-decoder architecture that describes the 3D structure of the scene. For surface normal estimation, the network is designed as a two-stream encoder-decoder structure that hierarchically merges red-green-blue-depth (RGB-D) images to capture more accurate geometric boundaries. With fewer network parameters and a simpler learning structure, the method produces more detailed depth maps than existing methods. Moreover, 3D point cloud maps reconstructed from the predicted depth images confirm that the framework can be conveniently adopted as a component of a monocular simultaneous localization and mapping (SLAM) system.
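The abstract relies on two geometric links: a depth image can be reconstructed into a 3D point cloud, and surface normals are tied to the local shape of that depth. The following is a minimal sketch of both, not the paper's networks. It assumes a standard pinhole camera model; the function names and the intrinsics `fx`, `fy`, `cx`, `cy` are illustrative and do not come from the paper. Pure Python is used so the example has no third-party dependencies.

```python
import math


def backproject(depth, fx, fy, cx, cy):
    """Map a depth image (list of rows of depths) to camera-frame 3D points.

    Assumes a pinhole camera with focal lengths fx, fy and principal
    point (cx, cy). Pixels with non-positive depth become None.
    """
    points = []
    for v, row in enumerate(depth):
        out_row = []
        for u, z in enumerate(row):
            if z <= 0:
                out_row.append(None)  # invalid or missing depth
            else:
                out_row.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
        points.append(out_row)
    return points


def surface_normal(points, u, v):
    """Estimate the unit normal at pixel (u, v) from forward differences.

    Takes the cross product of the vectors to the right-hand and lower
    neighbours, then flips the result so it faces the camera (negative z).
    Returns None at the image border or where any point is invalid.
    """
    if v + 1 >= len(points) or u + 1 >= len(points[v]):
        return None
    p, right, below = points[v][u], points[v][u + 1], points[v + 1][u]
    if p is None or right is None or below is None:
        return None
    dx = tuple(a - b for a, b in zip(right, p))
    dy = tuple(a - b for a, b in zip(below, p))
    n = (
        dx[1] * dy[2] - dx[2] * dy[1],
        dx[2] * dy[0] - dx[0] * dy[2],
        dx[0] * dy[1] - dx[1] * dy[0],
    )
    length = math.sqrt(sum(c * c for c in n))
    if length == 0:
        return None
    n = tuple(c / length for c in n)
    return tuple(-c for c in n) if n[2] > 0 else n


# Usage: a flat wall 2 units in front of the camera.
depth = [[2.0] * 3 for _ in range(3)]
cloud = backproject(depth, fx=1.0, fy=1.0, cx=1.0, cy=1.0)
normal = surface_normal(cloud, 0, 0)  # points back toward the camera
```

For a fronto-parallel plane the estimated normal points straight back along the optical axis. A learned refinement stage such as the one the abstract describes would use predicted normals as guidance rather than these finite differences.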
