• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于RGB-D显著目标检测及其他应用的连体网络

Siamese Network for RGB-D Salient Object Detection and Beyond.

作者信息

Fu Keren, Fan Deng-Ping, Ji Ge-Peng, Zhao Qijun, Shen Jianbing, Zhu Ce

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Apr 16;PP. doi: 10.1109/TPAMI.2021.3073689.

DOI:10.1109/TPAMI.2021.3073689
PMID:33861691
Abstract

Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information and design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed training process. Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture. In this paper, we propose two effective components: joint learning (JL), and densely cooperative fusion (DCF). The JL module provides robust saliency feature learning by exploiting cross-modal commonality via a Siamese network, while the DCF module is introduced for complementary feature discovery. Comprehensive experiments using 5 popular metrics show that the designed framework yields a robust RGB-D saliency detector with good generalization. As a result, JL-DCF significantly advances the SOTAs by an average of ~2.0% (F-measure) across 7 challenging datasets. In addition, we show that JL-DCF is readily applicable to other related multi-modal detection tasks, including RGB-T SOD and video SOD, achieving comparable or better performance.

摘要

现有的RGB-D显著目标检测(SOD)模型通常将RGB和深度视为独立信息,并设计单独的网络从各自中提取特征。这样的方案很容易受到有限训练数据量的限制,或者过度依赖精心设计的训练过程。受RGB和深度模态在区分显著目标时实际上存在一定共性这一观察结果的启发,设计了一种新颖的联合学习与密集协作融合(JL-DCF)架构,通过一个共享的网络主干(即暹罗架构)从RGB和深度输入中进行学习。在本文中,我们提出了两个有效组件:联合学习(JL)和密集协作融合(DCF)。JL模块通过暹罗网络利用跨模态共性来提供强大的显著特征学习,而DCF模块则用于互补特征发现。使用5种流行指标进行的综合实验表明,所设计的框架产生了一个具有良好泛化能力的强大RGB-D显著检测器。结果,JL-DCF在7个具有挑战性的数据集上平均将当前最优方法(SOTAs)显著提高了约2.0%(F值)。此外,我们表明JL-DCF很容易应用于其他相关的多模态检测任务,包括RGB-T SOD和视频SOD,实现了可比或更好的性能。

相似文献

1
Siamese Network for RGB-D Salient Object Detection and Beyond.用于RGB-D显著目标检测及其他应用的连体网络
IEEE Trans Pattern Anal Mach Intell. 2021 Apr 16;PP. doi: 10.1109/TPAMI.2021.3073689.
2
Looking at Boundary: Siamese Densely Cooperative Fusion for Salient Object Detection.审视边界:用于显著目标检测的暹罗密集协作融合
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3580-3593. doi: 10.1109/TNNLS.2021.3113657. Epub 2023 Jul 6.
3
UTDNet: A unified triplet decoder network for multimodal salient object detection.UTDNet:一种用于多模态显著目标检测的统一三元解码器网络。
Neural Netw. 2024 Feb;170:521-534. doi: 10.1016/j.neunet.2023.11.051. Epub 2023 Nov 24.
4
CDNet: Complementary Depth Network for RGB-D Salient Object Detection.CDNet:用于RGB-D显著目标检测的互补深度网络。
IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.
5
SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.SLMSF-Net:用于RGB-D显著目标检测的语义定位与多尺度融合网络
Sensors (Basel). 2024 Feb 8;24(4):1117. doi: 10.3390/s24041117.
6
Data-Level Recombination and Lightweight Fusion Scheme for RGB-D Salient Object Detection.用于RGB-D显著目标检测的数据级重组与轻量级融合方案
IEEE Trans Image Process. 2021;30:458-471. doi: 10.1109/TIP.2020.3037470. Epub 2020 Nov 23.
7
Dynamic Selective Network for RGB-D Salient Object Detection.基于动态选择网络的 RGB-D 显著目标检测
IEEE Trans Image Process. 2021;30:9179-9192. doi: 10.1109/TIP.2021.3123548. Epub 2021 Nov 10.
8
RGB-D Salient Object Detection With Ubiquitous Target Awareness.基于无处不在目标感知的 RGB-D 显著目标检测。
IEEE Trans Image Process. 2021;30:7717-7731. doi: 10.1109/TIP.2021.3108412. Epub 2021 Sep 10.
9
Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection.基于绝对和相对深度信息的 RGB-D 显著目标检测网络
Sensors (Basel). 2023 Mar 30;23(7):3611. doi: 10.3390/s23073611.
10
ICNet: Information Conversion Network for RGB-D Based Salient Object Detection.ICNet:基于RGB-D的显著目标检测的信息转换网络。
IEEE Trans Image Process. 2020 Mar 4. doi: 10.1109/TIP.2020.2976689.

引用本文的文献

1
CSANet: Context-Spatial Awareness Network for RGB-T Urban Scene Understanding.CSANet:用于RGB-T城市场景理解的上下文空间感知网络
J Imaging. 2025 Jun 9;11(6):188. doi: 10.3390/jimaging11060188.
2
RGB-D salient object detection via convolutional capsule network based on feature extraction and integration.基于特征提取与融合的卷积胶囊网络实现RGB-D显著目标检测
Sci Rep. 2023 Oct 17;13(1):17652. doi: 10.1038/s41598-023-44698-z.
3
Continuous Prediction of Web User Visual Attention on Short Span Windows Based on Gaze Data Analytics.
基于眼动数据分析的短窗网页用户视觉注意力持续预测。
Sensors (Basel). 2023 Feb 18;23(4):2294. doi: 10.3390/s23042294.
4
Multi-Object Tracking Algorithm for RGB-D Images Based on Asymmetric Dual Siamese Networks.基于非对称双孪生网络的 RGB-D 图像多目标跟踪算法。
Sensors (Basel). 2020 Nov 25;20(23):6745. doi: 10.3390/s20236745.