Zha Zheng-Jun, Wang Chong, Liu Dong, Xie Hongtao, Zhang Yongdong
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2398-2408. doi: 10.1109/TNNLS.2020.2967471. Epub 2020 Feb 13.
High-level semantic knowledge, in addition to low-level visual cues, is crucial for co-saliency detection. This article proposes a novel end-to-end deep learning approach for robust co-saliency detection that simultaneously learns a high-level groupwise semantic representation and deep visual features of a given image group. Inter-image interaction at the semantic level and the complementarity between group semantics and visual features are exploited to boost the inference of co-salient regions. Specifically, the proposed approach consists of a co-category learning branch and a co-saliency detection branch. The former learns a groupwise semantic vector using the co-category association of an image group as supervision, while the latter infers precise co-salient maps from the ensemble of group-semantic knowledge and deep visual cues. The group-semantic vector augments the visual features at multiple scales and acts as top-down semantic guidance for the bottom-up inference of co-saliency. Moreover, we develop a pyramidal attention (PA) module that endows the network with the capability to concentrate on important image patches and suppress distractions. The co-category learning and co-saliency detection branches are jointly optimized in a multitask learning manner, further improving the robustness of the approach. We construct a new large-scale co-saliency data set, COCO-SEG, to facilitate research on co-saliency detection. Extensive experimental results on COCO-SEG and the widely used Cosal2015 benchmark demonstrate the superiority of the proposed approach over state-of-the-art methods.
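The core augmentation step described in the abstract — tiling a groupwise semantic vector spatially and fusing it with visual features at multiple scales — can be illustrated with a minimal sketch. This is not the authors' implementation; the function name, tensor shapes, and channel-wise concatenation as the fusion operation are all assumptions for illustration.

```python
import numpy as np

def augment_with_group_semantics(feature_map, group_vec):
    """Fuse a groupwise semantic vector with a visual feature map.

    feature_map: (C, H, W) visual features at one scale.
    group_vec:   (S,) groupwise semantic vector shared by the image group.
    Returns an (C + S, H, W) map: the vector is tiled over all spatial
    positions and concatenated channel-wise (one plausible fusion choice).
    """
    S = group_vec.shape[0]
    _, H, W = feature_map.shape
    tiled = np.broadcast_to(group_vec[:, None, None], (S, H, W))
    return np.concatenate([feature_map, tiled], axis=0)

# Hypothetical multi-scale features for one image of the group,
# augmented by the same group-semantic vector at every scale.
rng = np.random.default_rng(0)
feats = [rng.random((64, s, s)) for s in (8, 16, 32)]
group_vec = rng.random(10)
augmented = [augment_with_group_semantics(f, group_vec) for f in feats]
```

Because the same vector is injected at every scale, it acts as the top-down guidance the abstract describes, while the per-image feature maps carry the bottom-up visual cues.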