• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于Transformer学习用于RGB-D协同显著目标检测的隐式类别知识

Learning Implicit Class Knowledge for RGB-D Co-Salient Object Detection With Transformers.

作者信息

Zhang Ni, Han Junwei, Liu Nian

出版信息

IEEE Trans Image Process. 2022;31:4556-4570. doi: 10.1109/TIP.2022.3185550. Epub 2022 Jul 18.

DOI:10.1109/TIP.2022.3185550
PMID:35763477
Abstract

RGB-D co-salient object detection aims to segment co-occurring salient objects when given a group of relevant images and depth maps. Previous methods often adopt separate pipeline and use hand-crafted features, being hard to capture the patterns of co-occurring salient objects and leading to unsatisfactory results. Using end-to-end CNN models is a straightforward idea, but they are less effective in exploiting global cues due to the intrinsic limitation. Thus, in this paper, we alternatively propose an end-to-end transformer-based model which uses class tokens to explicitly capture implicit class knowledge to perform RGB-D co-salient object detection, denoted as CTNet. Specifically, we first design adaptive class tokens for individual images to explore intra-saliency cues and then develop common class tokens for the whole group to explore inter-saliency cues. Besides, we also leverage the complementary cues between RGB images and depth maps to promote the learning of the above two types of class tokens. In addition, to promote model evaluation, we construct a challenging and large-scale benchmark dataset, named RGBD CoSal1k, which collects 106 groups containing 1000 pairs of RGB-D images with complex scenarios and diverse appearances. Experimental results on three benchmark datasets demonstrate the effectiveness of our proposed method.

摘要

RGB-D共显著目标检测旨在在给定一组相关图像和深度图时分割同时出现的显著目标。先前的方法通常采用单独的流程并使用手工制作的特征,难以捕捉同时出现的显著目标的模式,导致结果不尽人意。使用端到端的卷积神经网络(CNN)模型是一个直接的想法,但由于其固有的局限性,它们在利用全局线索方面效果较差。因此,在本文中,我们提出了一种基于端到端Transformer的模型,该模型使用类别令牌来明确捕捉隐式类别知识,以执行RGB-D共显著目标检测,称为CTNet。具体来说,我们首先为单个图像设计自适应类别令牌以探索内部显著线索,然后为整个组开发通用类别令牌以探索相互显著线索。此外,我们还利用RGB图像和深度图之间的互补线索来促进上述两种类别令牌的学习。此外,为了促进模型评估,我们构建了一个具有挑战性的大规模基准数据集,名为RGBD CoSal1k,它收集了106组包含1000对具有复杂场景和多样外观的RGB-D图像。在三个基准数据集上的实验结果证明了我们提出的方法的有效性。

相似文献

1
Learning Implicit Class Knowledge for RGB-D Co-Salient Object Detection With Transformers.基于Transformer学习用于RGB-D协同显著目标检测的隐式类别知识
IEEE Trans Image Process. 2022;31:4556-4570. doi: 10.1109/TIP.2022.3185550. Epub 2022 Jul 18.
2
An Iterative Co-Saliency Framework for RGBD Images.基于 RGBD 图像的迭代协同显著图框架。
IEEE Trans Cybern. 2019 Jan;49(1):233-246. doi: 10.1109/TCYB.2017.2771488. Epub 2017 Nov 21.
3
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
4
RGB-D salient object detection: A survey.RGB-D显著目标检测:一项综述。
Comput Vis Media (Beijing). 2021;7(1):37-69. doi: 10.1007/s41095-020-0199-z. Epub 2021 Jan 7.
5
Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection.基于绝对和相对深度信息的 RGB-D 显著目标检测网络
Sensors (Basel). 2023 Mar 30;23(7):3611. doi: 10.3390/s23073611.
6
Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection.用于RGB-D显著目标检测的全局引导跨模态跨尺度网络
Sensors (Basel). 2023 Aug 17;23(16):7221. doi: 10.3390/s23167221.
7
CDNet: Complementary Depth Network for RGB-D Salient Object Detection.CDNet:用于RGB-D显著目标检测的互补深度网络。
IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.
8
RGB-D Salient Object Detection With Ubiquitous Target Awareness.基于无处不在目标感知的 RGB-D 显著目标检测。
IEEE Trans Image Process. 2021;30:7717-7731. doi: 10.1109/TIP.2021.3108412. Epub 2021 Sep 10.
9
Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.利用未标记的 RGB 图像提升 RGB-D 显著度检测。
IEEE Trans Image Process. 2022;31:1107-1119. doi: 10.1109/TIP.2021.3139232. Epub 2022 Jan 12.
10
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection.DMRA:用于 RGB-D 显著度检测的深度诱导多尺度递归注意网络。
IEEE Trans Image Process. 2022;31:2321-2336. doi: 10.1109/TIP.2022.3154931. Epub 2022 Mar 11.

引用本文的文献

1
Link prediction of heterogeneous complex networks based on an improved embedding learning algorithm.基于改进嵌入学习算法的异质复杂网络链接预测
PLoS One. 2025 Jan 7;20(1):e0315507. doi: 10.1371/journal.pone.0315507. eCollection 2025.
2
SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.SLMSF-Net:用于RGB-D显著目标检测的语义定位与多尺度融合网络
Sensors (Basel). 2024 Feb 8;24(4):1117. doi: 10.3390/s24041117.
3
Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection.
基于绝对和相对深度信息的 RGB-D 显著目标检测网络
Sensors (Basel). 2023 Mar 30;23(7):3611. doi: 10.3390/s23073611.