
Discriminative Cross-Modal Transfer Learning and Densely Cross-Level Feedback Fusion for RGB-D Salient Object Detection.

Publication information

IEEE Trans Cybern. 2020 Nov;50(11):4808-4820. doi: 10.1109/TCYB.2019.2934986. Epub 2019 Aug 30.

Abstract

This article addresses two key issues in RGB-D salient object detection based on the convolutional neural network (CNN). 1) How to bridge the gap between the "data-hungry" nature of CNNs and the insufficient labeled training data in the depth modality? 2) How to take full advantage of the complementary information between the two modalities? To solve the first problem, we model depth-induced saliency detection as a CNN-based cross-modal transfer learning problem. Instead of directly adopting the RGB CNN as initialization, we additionally train a modality classification network (MCNet) to encourage discriminative modality-specific representations by minimizing the modality classification loss. To solve the second problem, we propose a densely cross-level feedback topology, in which the cross-modal complements are combined at each level and then densely fed back to all shallower layers for sufficient cross-level interactions. Compared to traditional two-stream frameworks, the proposed one can better explore, select, and fuse cross-modal cross-level complements. Experiments show significant and consistent improvements of the proposed CNN framework over other state-of-the-art methods.
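The densely cross-level feedback topology can be sketched as follows: fuse RGB and depth features at each level, then feed each level's fused result back to every shallower level. This is a minimal illustrative sketch only; the function names, the element-wise sum as the fusion operator, and the equal spatial sizes across levels are all simplifying assumptions, not the paper's exact operations.

```python
import numpy as np

def fuse_modalities(rgb_feat, depth_feat):
    """Combine cross-modal complements at one level.
    Assumption: a simple element-wise sum stands in for the
    paper's learned fusion."""
    return rgb_feat + depth_feat

def dense_feedback_fusion(rgb_feats, depth_feats):
    """Fuse the two modalities at every level, then densely feed
    each fused result back to all shallower levels (level 0 is the
    shallowest)."""
    n = len(rgb_feats)
    fused = [fuse_modalities(r, d) for r, d in zip(rgb_feats, depth_feats)]
    refined = []
    for i in range(n):
        # shallower level i receives feedback from all deeper levels j > i
        feedback = sum((fused[j] for j in range(i + 1, n)),
                       np.zeros_like(fused[i]))
        refined.append(fused[i] + feedback)
    return refined

# Toy features: 3 levels, each a 4x4 map (equal sizes assumed for brevity;
# a real network would upsample deeper maps before feedback).
levels = 3
rgb = [np.full((4, 4), i + 1.0) for i in range(levels)]
depth = [np.full((4, 4), 0.5) for _ in range(levels)]
out = dense_feedback_fusion(rgb, depth)
print([o[0, 0] for o in out])  # shallowest level accumulates the most feedback
```

Note how the shallowest level aggregates complements from every deeper level, which is the "dense" part of the topology; a traditional two-stream framework would fuse only once, level by level, without this feedback.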

