• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过模态感知解码器改进RGB-D显著目标检测

Improving RGB-D Salient Object Detection via Modality-Aware Decoder.

作者信息

Song Mengke, Song Wenfeng, Yang Guowei, Chen Chenglizhao

出版信息

IEEE Trans Image Process. 2022;31:6124-6138. doi: 10.1109/TIP.2022.3205747. Epub 2022 Sep 22.

DOI:10.1109/TIP.2022.3205747
PMID:36112559
Abstract

Most existing RGB-D salient object detection (SOD) methods are primarily focusing on cross-modal and cross-level saliency fusion, which has been proved to be efficient and effective. However, these methods still have a critical limitation, i.e., their fusion patterns - typically the combination of selective characteristics and its variations, are too highly dependent on the network's non-linear adaptability. In such methods, the balances between RGB and D (Depth) are formulated individually considering the intermediate feature slices, but the relation at the modality level may not be learned properly. The optimal RGB-D combinations differ depending on the RGB-D scenarios, and the exact complementary status is frequently determined by multiple modality-level factors, such as D quality, the complexity of the RGB scene, and degree of harmony between them. Therefore, given the existing approaches, it may be difficult for them to achieve further performance breakthroughs, as their methodologies belong to some methods that are somewhat less modality sensitive. To conquer this problem, this paper presents the Modality-aware Decoder (MaD). The critical technical innovations include a series of feature embedding, modality reasoning, and feature back-projecting and collecting strategies, all of which upgrade the widely-used multi-scale and multi-level decoding process to be modality-aware. Our MaD achieves competitive performance over other state-of-the-art (SOTA) models without using any fancy tricks in the decoder's design. Codes and results will be publicly available at https://github.com/MengkeSong/MaD.

摘要

大多数现有的RGB-D显著目标检测(SOD)方法主要聚焦于跨模态和跨层次的显著特征融合,这已被证明是高效且有效的。然而,这些方法仍存在一个关键局限性,即它们的融合模式——通常是选择性特征及其变体的组合,过于高度依赖网络的非线性适应性。在这类方法中,RGB和D(深度)之间的平衡是在考虑中间特征切片的基础上单独制定的,但模态层面的关系可能无法被正确学习。最优的RGB-D组合因RGB-D场景而异,确切的互补状态通常由多个模态层面的因素决定,如D的质量、RGB场景的复杂性以及它们之间的和谐程度。因此,鉴于现有方法,它们可能难以实现进一步的性能突破,因为它们的方法属于一些对模态不太敏感的方法。为解决这个问题,本文提出了模态感知解码器(MaD)。关键的技术创新包括一系列特征嵌入、模态推理以及特征反向投影和收集策略,所有这些都将广泛使用的多尺度和多层次解码过程升级为模态感知的。我们的MaD在解码器设计中不使用任何奇特技巧的情况下,相对于其他现有最先进(SOTA)模型实现了具有竞争力的性能。代码和结果将在https://github.com/MengkeSong/MaD上公开提供。

相似文献

1
Improving RGB-D Salient Object Detection via Modality-Aware Decoder.通过模态感知解码器改进RGB-D显著目标检测
IEEE Trans Image Process. 2022;31:6124-6138. doi: 10.1109/TIP.2022.3205747. Epub 2022 Sep 22.
2
CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection.CIR-Net:用于RGB-D显著目标检测的跨模态交互与优化
IEEE Trans Image Process. 2022;31:6800-6815. doi: 10.1109/TIP.2022.3216198. Epub 2022 Oct 28.
3
3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond.用于RGB-D显著目标检测及其他应用的3D卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):4309-4323. doi: 10.1109/TNNLS.2022.3202241. Epub 2024 Feb 29.
4
CDNet: Complementary Depth Network for RGB-D Salient Object Detection.CDNet:用于RGB-D显著目标检测的互补深度网络。
IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.
5
Middle-Level Feature Fusion for Lightweight RGB-D Salient Object Detection.用于轻量级RGB-D显著目标检测的中级特征融合
IEEE Trans Image Process. 2022;31:6621-6634. doi: 10.1109/TIP.2022.3214092. Epub 2022 Oct 26.
6
Quality-Aware Selective Fusion Network for V-D-T Salient Object Detection.用于视频-深度-文本显著目标检测的质量感知选择性融合网络
IEEE Trans Image Process. 2024;33:3212-3226. doi: 10.1109/TIP.2024.3393365. Epub 2024 May 6.
7
UTDNet: A unified triplet decoder network for multimodal salient object detection.UTDNet:一种用于多模态显著目标检测的统一三元解码器网络。
Neural Netw. 2024 Feb;170:521-534. doi: 10.1016/j.neunet.2023.11.051. Epub 2023 Nov 24.
8
Depth-Quality-Aware Salient Object Detection.深度质量感知显著目标检测
IEEE Trans Image Process. 2021;30:2350-2363. doi: 10.1109/TIP.2021.3052069. Epub 2021 Jan 27.
9
Data-Level Recombination and Lightweight Fusion Scheme for RGB-D Salient Object Detection.用于RGB-D显著目标检测的数据级重组与轻量级融合方案
IEEE Trans Image Process. 2021;30:458-471. doi: 10.1109/TIP.2020.3037470. Epub 2020 Nov 23.
10
ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection.ASIF-Net:用于 RGB-D 显著目标检测的注意力导向交织融合网络。
IEEE Trans Cybern. 2021 Jan;51(1):88-100. doi: 10.1109/TCYB.2020.2969255. Epub 2020 Dec 22.

引用本文的文献

1
Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection.用于RGB-T显著目标检测的小波驱动多波段特征融合
Sensors (Basel). 2024 Dec 20;24(24):8159. doi: 10.3390/s24248159.