• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于深度补全的学习引导卷积网络。

Learning Guided Convolutional Network for Depth Completion.

作者信息

Tang Jie, Tian Fei-Peng, Feng Wei, Li Jian, Tan Ping

出版信息

IEEE Trans Image Process. 2021;30:1116-1129. doi: 10.1109/TIP.2020.3040528. Epub 2020 Dec 15.

DOI:10.1109/TIP.2020.3040528
PMID:33290217
Abstract

Dense depth perception is critical for autonomous driving and other robotics applications. However, modern LiDAR sensors only provide sparse depth measurement. It is thus necessary to complete the sparse LiDAR data, where a synchronized guidance RGB image is often used to facilitate this completion. Many neural networks have been designed for this task. However, they often naïvely fuse the LiDAR data and RGB image information by performing feature concatenation or element-wise addition. Inspired by the guided image filtering, we design a novel guided network to predict kernel weights from the guidance image. These predicted kernels are then applied to extract the depth image features. In this way, our network generates content-dependent and spatially-variant kernels for multi-modal feature fusion. Dynamically generated spatially-variant kernels could lead to prohibitive GPU memory consumption and computation overhead. We further design a convolution factorization to reduce computation and memory consumption. The GPU memory reduction makes it possible for feature fusion to work in multi-stage scheme. We conduct comprehensive experiments to verify our method on real-world outdoor, indoor and synthetic datasets. Our method produces strong results. It outperforms state-of-the-art methods on the NYUv2 dataset and ranks 1st on the KITTI depth completion benchmark at the time of submission. It also presents strong generalization capability under different 3D point densities, various lighting and weather conditions as well as cross-dataset evaluations. The code will be released for reproduction.

摘要

密集深度感知对于自动驾驶和其他机器人应用至关重要。然而,现代激光雷达传感器仅提供稀疏的深度测量。因此,有必要对稀疏的激光雷达数据进行补全,通常会使用同步的引导RGB图像来辅助完成这一补全过程。许多神经网络已被设计用于此任务。然而,它们通常只是简单地通过执行特征拼接或逐元素相加来融合激光雷达数据和RGB图像信息。受引导图像滤波的启发,我们设计了一种新颖的引导网络,从引导图像中预测核权重。然后将这些预测的核应用于提取深度图像特征。通过这种方式,我们的网络为多模态特征融合生成依赖于内容且空间变化的核。动态生成的空间变化核可能导致过高的GPU内存消耗和计算开销。我们进一步设计了一种卷积分解方法来减少计算和内存消耗。GPU内存的减少使得特征融合能够以多阶段方案运行。我们进行了全面的实验,在真实世界的室外、室内和合成数据集上验证了我们的方法。我们的方法取得了很好的结果。它在NYUv2数据集上优于现有方法,在提交时在KITTI深度补全基准测试中排名第一。在不同的3D点密度、各种光照和天气条件以及跨数据集评估下,它也表现出强大的泛化能力。代码将发布以供复现。

相似文献

1
Learning Guided Convolutional Network for Depth Completion.用于深度补全的学习引导卷积网络。
IEEE Trans Image Process. 2021;30:1116-1129. doi: 10.1109/TIP.2020.3040528. Epub 2020 Dec 15.
2
An Adaptive Fusion Algorithm for Depth Completion.深度补全的自适应融合算法。
Sensors (Basel). 2022 Jun 18;22(12):4603. doi: 10.3390/s22124603.
3
Learning Steering Kernels for Guided Depth Completion.用于引导深度补全的学习导向内核
IEEE Trans Image Process. 2021;30:2850-2861. doi: 10.1109/TIP.2021.3055629. Epub 2021 Feb 12.
4
A Comprehensive Survey of Depth Completion Approaches.深度完成方法综述。
Sensors (Basel). 2022 Sep 14;22(18):6969. doi: 10.3390/s22186969.
5
HMS-Net: Hierarchical Multi-scale Sparsity-invariant Network for Sparse Depth Completion.HMS-Net:用于稀疏深度补全的分层多尺度稀疏不变网络
IEEE Trans Image Process. 2019 Dec 31. doi: 10.1109/TIP.2019.2960589.
6
Real-time depth completion based on LiDAR-stereo for autonomous driving.基于激光雷达-立体视觉的自动驾驶实时深度补全
Front Neurorobot. 2023 Apr 18;17:1124676. doi: 10.3389/fnbot.2023.1124676. eCollection 2023.
7
Guided Depth Completion with Instance Segmentation Fusion in Autonomous Driving Applications.自动驾驶应用中的实例分割融合引导深度补全。
Sensors (Basel). 2022 Dec 7;22(24):9578. doi: 10.3390/s22249578.
8
PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation.PLIN:用于伪激光雷达点云插值的网络。
Sensors (Basel). 2020 Mar 12;20(6):1573. doi: 10.3390/s20061573.
9
Confidence Propagation through CNNs for Guided Sparse Depth Regression.通过卷积神经网络进行置信传播以实现引导式稀疏深度回归
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2423-2436. doi: 10.1109/TPAMI.2019.2929170. Epub 2019 Jul 17.
10
Multi-Task Foreground-Aware Network with Depth Completion for Enhanced RGB-D Fusion Object Detection Based on Transformer.基于Transformer的具有深度补全功能的多任务前景感知网络用于增强RGB-D融合目标检测
Sensors (Basel). 2024 Apr 8;24(7):2374. doi: 10.3390/s24072374.

引用本文的文献

1
GeometryFormer: Semi-Convolutional Transformer Integrated with Geometric Perception for Depth Completion in Autonomous Driving Scenes.GeometryFormer:集成几何感知的半卷积Transformer,用于自动驾驶场景中的深度补全
Sensors (Basel). 2024 Dec 18;24(24):8066. doi: 10.3390/s24248066.
2
A Transformer-Based Image-Guided Depth-Completion Model with Dual-Attention Fusion Module.一种基于Transformer的具有双注意力融合模块的图像引导深度补全模型。
Sensors (Basel). 2024 Sep 27;24(19):6270. doi: 10.3390/s24196270.
3
Robust detection and refinement of saliency identification.
显著性识别的稳健检测与优化
Sci Rep. 2024 May 14;14(1):11076. doi: 10.1038/s41598-024-61105-3.
4
SPNet: Structure preserving network for depth completion.SPNet:用于深度完成的结构保持网络。
PLoS One. 2023 Jan 24;18(1):e0280886. doi: 10.1371/journal.pone.0280886. eCollection 2023.
5
A Critical Review of Deep Learning-Based Multi-Sensor Fusion Techniques.深度学习的多传感器融合技术综述。
Sensors (Basel). 2022 Dec 1;22(23):9364. doi: 10.3390/s22239364.
6
ClueDepth Grasp: Leveraging positional clues of depth for completing depth of transparent objects.线索深度抓取:利用深度位置线索来完成透明物体的深度信息。
Front Neurorobot. 2022 Nov 8;16:1041702. doi: 10.3389/fnbot.2022.1041702. eCollection 2022.
7
A Comprehensive Survey of Depth Completion Approaches.深度完成方法综述。
Sensors (Basel). 2022 Sep 14;22(18):6969. doi: 10.3390/s22186969.
8
SGSNet: A Lightweight Depth Completion Network Based on Secondary Guidance and Spatial Fusion.SGSNet:一种基于二次引导和空间融合的轻量级深度补全网络。
Sensors (Basel). 2022 Aug 25;22(17):6414. doi: 10.3390/s22176414.
9
An Adaptive Fusion Algorithm for Depth Completion.深度补全的自适应融合算法。
Sensors (Basel). 2022 Jun 18;22(12):4603. doi: 10.3390/s22124603.
10
DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization.DiT-SLAM:基于隐式深度表示和紧密耦合图优化的实时密集视觉惯性同步定位与地图构建
Sensors (Basel). 2022 Apr 28;22(9):3389. doi: 10.3390/s22093389.