Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction.

Author information

Zhao Xiaoqi, Pang Youwei, Zhang Lihe, Lu Huchuan

Publication information

IEEE Trans Image Process. 2022;31:7350-7362. doi: 10.1109/TIP.2022.3222641. Epub 2022 Nov 30.

DOI:10.1109/TIP.2022.3222641

Abstract

The depth map offers color independence, illumination invariance and location discrimination, so it can provide important supplementary information for extracting salient objects in complex environments. However, high-quality depth sensors are expensive and cannot be widely deployed, while general-purpose depth sensors produce noisy and sparse depth information, which introduces irreversible interference into depth-based networks. In this paper, we propose a novel multi-task and multi-modal filtered transformer (MMFT) network for RGB-D salient object detection (SOD). Specifically, we unify three complementary tasks: depth estimation, salient object detection and contour estimation. The multi-task mechanism encourages the model to learn task-aware features from the auxiliary tasks. In this way, the depth information can be completed and purified. Moreover, we introduce a multi-modal filtered transformer (MFT) module, which is equipped with three modality-specific filters to generate the transformer-enhanced feature for each modality. The proposed model works in a depth-free style during the testing phase. Experiments show that it not only significantly surpasses depth-based RGB-D SOD methods on multiple datasets, but also precisely predicts a high-quality depth map and salient contour at the same time. In addition, the resulting depth map can help existing RGB-D SOD methods obtain significant performance gains.
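
The abstract states that three tasks are trained jointly but does not give the loss functions. As a rough illustration only, the sketch below combines one loss per task into a weighted sum. The choice of binary cross-entropy for saliency and contour, L1 for depth, and the weights `w_sal`, `w_con`, `w_dep` are all assumptions made for this example and are not taken from the paper.

```python
import math


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy over flat lists of probabilities and 0/1 labels."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(pred)


def l1(pred, target):
    """Mean absolute error over flat lists of depth values."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def joint_loss(sal_pred, sal_gt, con_pred, con_gt, dep_pred, dep_gt,
               w_sal=1.0, w_con=1.0, w_dep=1.0):
    """Weighted sum of the three task losses (weights are hypothetical)."""
    return (w_sal * bce(sal_pred, sal_gt)
            + w_con * bce(con_pred, con_gt)
            + w_dep * l1(dep_pred, dep_gt))


if __name__ == "__main__":
    # Toy 2-pixel example: saliency and contour are perfect, depth is off.
    loss = joint_loss([1.0, 0.0], [1, 0], [0.0, 1.0], [0, 1], [0.5, 0.5], [1.0, 0.0])
    print(f"joint loss: {loss:.4f}")
```

Because all three terms are summed into a single scalar, one backward pass would update any shared encoder with gradients from every task, which is the general idea behind learning task-aware features from auxiliary tasks.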

Similar articles

RGB-D Salient Object Detection With Ubiquitous Target Awareness.

IEEE Trans Image Process. 2021;30:7717-7731. doi: 10.1109/TIP.2021.3108412. Epub 2021 Sep 10.

Swin Transformer-Based Edge Guidance Network for RGB-D Salient Object Detection.

Sensors (Basel). 2023 Oct 29;23(21):8802. doi: 10.3390/s23218802.

UTDNet: A unified triplet decoder network for multimodal salient object detection.

Neural Netw. 2024 Feb;170:521-534. doi: 10.1016/j.neunet.2023.11.051. Epub 2023 Nov 24.

DMGNet: Depth mask guiding network for RGB-D salient object detection.

Neural Netw. 2024 Dec;180:106751. doi: 10.1016/j.neunet.2024.106751. Epub 2024 Sep 24.

Quality-Aware Selective Fusion Network for V-D-T Salient Object Detection.

IEEE Trans Image Process. 2024;33:3212-3226. doi: 10.1109/TIP.2024.3393365. Epub 2024 May 6.

CDNet: Complementary Depth Network for RGB-D Salient Object Detection.

IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.

Siamese Network for RGB-D Salient Object Detection and Beyond.

IEEE Trans Pattern Anal Mach Intell. 2021 Apr 16;PP. doi: 10.1109/TPAMI.2021.3073689.

EM-Trans: Edge-Aware Multimodal Transformer for RGB-D Salient Object Detection.

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3175-3188. doi: 10.1109/TNNLS.2024.3358858. Epub 2025 Feb 6.

Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection.

Sensors (Basel). 2023 Mar 30;23(7):3611. doi: 10.3390/s23073611.

Cited by

Dynamic Knowledge Distillation with Noise Elimination for RGB-D Salient Object Detection.

Sensors (Basel). 2022 Aug 18;22(16):6188. doi: 10.3390/s22166188.
