Suppr超能文献

显著目标检测、深度估计和轮廓提取的联合学习

Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction.

作者信息

Zhao Xiaoqi, Pang Youwei, Zhang Lihe, Lu Huchuan

出版信息

IEEE Trans Image Process. 2022;31:7350-7362. doi: 10.1109/TIP.2022.3222641. Epub 2022 Nov 30.

Abstract

Benefiting from color independence, illumination invariance and location discrimination attributed by the depth map, it can provide important supplemental information for extracting salient objects in complex environments. However, high-quality depth sensors are expensive and can not be widely applied. While general depth sensors produce the noisy and sparse depth information, which brings the depth-based networks with irreversible interference. In this paper, we propose a novel multi-task and multi-modal filtered transformer (MMFT) network for RGB-D salient object detection (SOD). Specifically, we unify three complementary tasks: depth estimation, salient object detection and contour estimation. The multi-task mechanism promotes the model to learn the task-aware features from the auxiliary tasks. In this way, the depth information can be completed and purified. Moreover, we introduce a multi-modal filtered transformer (MFT) module, which equips with three modality-specific filters to generate the transformer-enhanced feature for each modality. The proposed model works in a depth-free style during the testing phase. Experiments show that it not only significantly surpasses the depth-based RGB-D SOD methods on multiple datasets, but also precisely predicts a high-quality depth map and salient contour at the same time. And, the resulted depth map can help existing RGB-D SOD methods obtain significant performance gain.

摘要

得益于深度图所具有的颜色独立性、光照不变性和位置辨别能力,它能够为在复杂环境中提取显著物体提供重要的补充信息。然而,高质量的深度传感器价格昂贵,无法广泛应用。而普通深度传感器产生的深度信息噪声大且稀疏,这给基于深度的网络带来了不可逆转的干扰。在本文中,我们提出了一种用于RGB-D显著目标检测(SOD)的新型多任务多模态滤波变压器(MMFT)网络。具体来说,我们统一了三个互补任务:深度估计、显著目标检测和轮廓估计。多任务机制促使模型从辅助任务中学习任务感知特征。通过这种方式,深度信息可以得到完善和净化。此外,我们引入了一个多模态滤波变压器(MFT)模块,它配备了三个特定模态的滤波器,为每个模态生成变压器增强特征。所提出的模型在测试阶段以无深度的方式工作。实验表明,它不仅在多个数据集上显著超越了基于深度的RGB-D SOD方法,而且同时能够精确预测高质量的深度图和显著轮廓。并且,生成的深度图可以帮助现有的RGB-D SOD方法获得显著的性能提升。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验