Adaptive Context-Aware Multi-Modal Network for Depth Completion.

Authors

Zhao Shanshan, Gong Mingming, Fu Huan, Tao Dacheng

Publication information

IEEE Trans Image Process. 2021;30:5264-5276. doi: 10.1109/TIP.2021.3079821. Epub 2021 May 31.

DOI: 10.1109/TIP.2021.3079821
PMID: 34033540
Abstract

Depth completion aims to recover a dense depth map from sparse depth data and the corresponding single RGB image. The observed pixels provide significant guidance for recovering the depth of the unobserved pixels. However, because the depth data are sparse, the standard convolution operation used by most existing methods is not effective at modeling the observed contexts that carry depth values. To address this issue, we propose to adopt graph propagation to capture the observed spatial contexts. Specifically, we first construct multiple graphs at different scales from the observed pixels. Since the graph structure varies from sample to sample, we then apply an attention mechanism to the propagation, which encourages the network to model the contextual information adaptively. Furthermore, considering the multi-modality of the input data, we apply the graph propagation to the two modalities separately to extract multi-modal representations. Finally, we introduce a symmetric gated fusion strategy to exploit the extracted multi-modal features effectively. The proposed strategy preserves the original information of one modality and also absorbs complementary information from the other by learning adaptive gating weights. Our model, named Adaptive Context-Aware Multi-Modal Network (ACMNet), achieves state-of-the-art performance on two benchmarks, KITTI and NYU-v2, while having fewer parameters than the latest models. Our code is available at: https://github.com/sshan-zhao/ACMNet.
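
The three ideas named in the abstract (a graph built from observed pixels, attention-weighted propagation over that graph, and symmetric gated fusion of the two modalities) can be illustrated with a small, dependency-free sketch. This is not the authors' implementation, which is in the linked repository. Everything concrete here is an assumption made for illustration: the k-nearest-neighbor graph built from pixel coordinates, the scaled dot-product attention scores, the residual update, the sigmoid gate computed from the concatenated features, the reuse of one graph for both modalities, and every function and variable name. It also uses a single scale, whereas the abstract describes graphs at several scales.

```python
import math
import random

def knn_graph(coords, k):
    """For each observed pixel, return the indices of its k nearest observed pixels."""
    neighbors = []
    for i, (yi, xi) in enumerate(coords):
        others = [j for j in range(len(coords)) if j != i]  # no self-loops
        others.sort(key=lambda j: (coords[j][0] - yi) ** 2 + (coords[j][1] - xi) ** 2)
        neighbors.append(others[:k])
    return neighbors

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention_propagate(feats, neighbors):
    """One propagation step with softmax attention over each node's neighbors."""
    scale = math.sqrt(len(feats[0]))
    updated, all_weights = [], []
    for i, nbrs in enumerate(neighbors):
        scores = [dot(feats[i], feats[j]) / scale for j in nbrs]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Residual update (assumed): own feature plus attention-weighted neighbor sum.
        updated.append([
            feats[i][c] + sum(w * feats[j][c] for w, j in zip(weights, nbrs))
            for c in range(len(feats[i]))
        ])
        all_weights.append(weights)
    return updated, all_weights

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gated_branch(own, other, gate_w):
    """Keep `own` unchanged and add a per-channel gated share of `other`."""
    joint = own + other  # list concatenation: the gate sees both modalities
    gates = [sigmoid(dot(joint, row)) for row in gate_w]  # one gate per channel
    return [o + g * t for o, g, t in zip(own, gates, other)]

def symmetric_gated_fusion(f_depth, f_rgb, w_depth, w_rgb):
    """Apply the same gating scheme in both directions, one output per modality."""
    fused_depth = [gated_branch(d, r, w_depth) for d, r in zip(f_depth, f_rgb)]
    fused_rgb = [gated_branch(r, d, w_rgb) for d, r in zip(f_depth, f_rgb)]
    return fused_depth, fused_rgb

# Toy data: random pixel positions, features, and gate weights (all hypothetical).
random.seed(0)
n, channels, k = 20, 4, 3
coords = [(random.uniform(0, 64), random.uniform(0, 64)) for _ in range(n)]
depth_feats = [[random.gauss(0, 1) for _ in range(channels)] for _ in range(n)]
rgb_feats = [[random.gauss(0, 1) for _ in range(channels)] for _ in range(n)]
w_depth = [[random.gauss(0, 0.1) for _ in range(2 * channels)] for _ in range(channels)]
w_rgb = [[random.gauss(0, 0.1) for _ in range(2 * channels)] for _ in range(channels)]

neighbors = knn_graph(coords, k)
depth_ctx, depth_attn = attention_propagate(depth_feats, neighbors)
rgb_ctx, _ = attention_propagate(rgb_feats, neighbors)
fused_depth, fused_rgb = symmetric_gated_fusion(depth_ctx, rgb_ctx, w_depth, w_rgb)
print(len(fused_depth), len(fused_depth[0]))
```

In this sketch the attention weights depend on the features of each sample, so the aggregation adapts to whatever graph the observed pixels happen to form. The fusion is symmetric in the sense that each branch keeps its own features intact and only adds a gated contribution from the other branch.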

Similar articles

1. Adaptive Context-Aware Multi-Modal Network for Depth Completion.
   IEEE Trans Image Process. 2021;30:5264-5276. doi: 10.1109/TIP.2021.3079821. Epub 2021 May 31.
2. Structure-Aware Cross-Modal Transformer for Depth Completion.
   IEEE Trans Image Process. 2024;33:1016-1031. doi: 10.1109/TIP.2024.3355807. Epub 2024 Jan 30.
3. HMS-Net: Hierarchical Multi-scale Sparsity-invariant Network for Sparse Depth Completion.
   IEEE Trans Image Process. 2019 Dec 31. doi: 10.1109/TIP.2019.2960589.
4. DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection.
   IEEE Trans Image Process. 2021;30:7012-7024. doi: 10.1109/TIP.2020.3028289. Epub 2021 Aug 10.
5. Learning Depth with Convolutional Spatial Propagation Network.
   IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2361-2379. doi: 10.1109/TPAMI.2019.2947374. Epub 2019 Oct 15.
6. MSTGC: Multi-Channel Spatio-Temporal Graph Convolution Network for Multi-Modal Brain Networks Fusion.
   IEEE Trans Neural Syst Rehabil Eng. 2023;31:2359-2369. doi: 10.1109/TNSRE.2023.3275608. Epub 2023 May 23.
7. Multi-Modal Graph Learning for Disease Prediction.
   IEEE Trans Med Imaging. 2022 Sep;41(9):2207-2216. doi: 10.1109/TMI.2022.3159264. Epub 2022 Aug 31.
8. Cross-Modal Object Tracking via Modality-Aware Fusion Network and a Large-Scale Dataset.
   IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6981-6994. doi: 10.1109/TNNLS.2024.3406189. Epub 2025 Apr 8.
9. An Adaptive Fusion Algorithm for Depth Completion.
   Sensors (Basel). 2022 Jun 18;22(12):4603. doi: 10.3390/s22124603.
10. Confidence Propagation through CNNs for Guided Sparse Depth Regression.
    IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2423-2436. doi: 10.1109/TPAMI.2019.2929170. Epub 2019 Jul 17.

Cited by

1. GAC-Net: A Geometric-Attention Fusion Network for Sparse Depth Completion from LiDAR and Image.
   Sensors (Basel). 2025 Sep 4;25(17):5495. doi: 10.3390/s25175495.
2. A Smartphone-Based Non-Destructive Multimodal Deep Learning Approach Using pH-Sensitive Pitaya Peel Films for Real-Time Fish Freshness Detection.
   Foods. 2025 May 19;14(10):1805. doi: 10.3390/foods14101805.
3. GeometryFormer: Semi-Convolutional Transformer Integrated with Geometric Perception for Depth Completion in Autonomous Driving Scenes.
   Sensors (Basel). 2024 Dec 18;24(24):8066. doi: 10.3390/s24248066.
4. A Transformer-Based Image-Guided Depth-Completion Model with Dual-Attention Fusion Module.
   Sensors (Basel). 2024 Sep 27;24(19):6270. doi: 10.3390/s24196270.
5. Real-time depth completion based on LiDAR-stereo for autonomous driving.
   Front Neurorobot. 2023 Apr 18;17:1124676. doi: 10.3389/fnbot.2023.1124676. eCollection 2023.
6. SPNet: Structure preserving network for depth completion.
   PLoS One. 2023 Jan 24;18(1):e0280886. doi: 10.1371/journal.pone.0280886. eCollection 2023.
7. A Critical Review of Deep Learning-Based Multi-Sensor Fusion Techniques.
   Sensors (Basel). 2022 Dec 1;22(23):9364. doi: 10.3390/s22239364.
8. ClueDepth Grasp: Leveraging positional clues of depth for completing depth of transparent objects.
   Front Neurorobot. 2022 Nov 8;16:1041702. doi: 10.3389/fnbot.2022.1041702. eCollection 2022.
9. A Comprehensive Survey of Depth Completion Approaches.
   Sensors (Basel). 2022 Sep 14;22(18):6969. doi: 10.3390/s22186969.
10. SGSNet: A Lightweight Depth Completion Network Based on Secondary Guidance and Spatial Fusion.
    Sensors (Basel). 2022 Aug 25;22(17):6414. doi: 10.3390/s22176414.