• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新思考RGB-D显著目标检测:模型、数据集和大规模基准

Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks.

作者信息

Fan Deng-Ping, Lin Zheng, Zhang Zhao, Zhu Menglong, Cheng Ming-Ming

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2075-2089. doi: 10.1109/TNNLS.2020.2996406. Epub 2021 May 3.

DOI:10.1109/TNNLS.2020.2996406
PMID:32491986
Abstract

The use of RGB-D information for salient object detection (SOD) has been extensively explored in recent years. However, relatively few efforts have been put toward modeling SOD in real-world human activity scenes with RGB-D. In this article, we fill the gap by making the following contributions to RGB-D SOD: 1) we carefully collect a new Salient Person (SIP) data set that consists of ~1 K high-resolution images that cover diverse real-world scenes from various viewpoints, poses, occlusions, illuminations, and background s; 2) we conduct a large-scale (and, so far, the most comprehensive) benchmark comparing contemporary methods, which has long been missing in the field and can serve as a baseline for future research, and we systematically summarize 32 popular models and evaluate 18 parts of 32 models on seven data sets containing a total of about 97k images; and 3) we propose a simple general architecture, called deep depth-depurator network (DNet). It consists of a depth depurator unit (DDU) and a three-stream feature learning module (FLM), which performs low-quality depth map filtering and cross-modal feature learning, respectively. These components form a nested structure and are elaborately designed to be learned jointly. DNet exceeds the performance of any prior contenders across all five metrics under consideration, thus serving as a strong model to advance research in this field. We also demonstrate that DNet can be used to efficiently extract salient object masks from real scenes, enabling effective background-changing application with a speed of 65 frames/s on a single GPU. All the saliency maps, our new SIP data set, the DNet model, and the evaluation tools are publicly available at https://github.com/DengPingFan/D3NetBenchmark.

摘要

近年来,利用RGB-D信息进行显著目标检测(SOD)已得到广泛探索。然而,在使用RGB-D对真实人类活动场景中的SOD进行建模方面所做的工作相对较少。在本文中,我们通过对RGB-D SOD做出以下贡献来填补这一空白:1)我们精心收集了一个新的显著人物(SIP)数据集,该数据集由约1000张高分辨率图像组成,这些图像涵盖了来自不同视角、姿势、遮挡、光照和背景的各种真实世界场景;2)我们进行了一项大规模(也是迄今为止最全面的)基准测试,比较当代方法,该基准测试在该领域长期缺失,可作为未来研究的基线,并且我们系统地总结了32种流行模型,并在包含总共约97k张图像的七个数据集上对32个模型的18个部分进行了评估;3)我们提出了一种简单的通用架构,称为深度深度净化器网络(DNet)。它由一个深度净化器单元(DDU)和一个三流特征学习模块(FLM)组成,分别执行低质量深度图滤波和跨模态特征学习。这些组件形成一个嵌套结构,并经过精心设计以联合学习。DNet在所有考虑的五个指标上均超过了任何先前的竞争者,从而成为推动该领域研究的强大模型。我们还证明,DNet可用于从真实场景中高效提取显著目标掩码,以65帧/秒的速度在单个GPU上实现有效的背景更改应用。所有显著图、我们的新SIP数据集、DNet模型和评估工具均可在https://github.com/DengPingFan/D3NetBenchmark上公开获取。

相似文献

1
Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks.重新思考RGB-D显著目标检测:模型、数据集和大规模基准
IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2075-2089. doi: 10.1109/TNNLS.2020.2996406. Epub 2021 May 3.
2
CDNet: Complementary Depth Network for RGB-D Salient Object Detection.CDNet:用于RGB-D显著目标检测的互补深度网络。
IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.
3
Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.利用未标记的 RGB 图像提升 RGB-D 显著度检测。
IEEE Trans Image Process. 2022;31:1107-1119. doi: 10.1109/TIP.2021.3139232. Epub 2022 Jan 12.
4
Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection.用于RGB-D显著目标检测的分层交替交互网络
IEEE Trans Image Process. 2021;30:3528-3542. doi: 10.1109/TIP.2021.3062689. Epub 2021 Mar 11.
5
ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection.ASIF-Net:用于 RGB-D 显著目标检测的注意力导向交织融合网络。
IEEE Trans Cybern. 2021 Jan;51(1):88-100. doi: 10.1109/TCYB.2020.2969255. Epub 2020 Dec 22.
6
MCD-Net: Toward RGB-D Video Inpainting in Real-World Scenes.MCD-Net:面向真实场景中的RGB-D视频修复
IEEE Trans Image Process. 2024;33:1095-1108. doi: 10.1109/TIP.2024.3358675. Epub 2024 Feb 5.
7
RGB-D salient object detection: A survey.RGB-D显著目标检测:一项综述。
Comput Vis Media (Beijing). 2021;7(1):37-69. doi: 10.1007/s41095-020-0199-z. Epub 2021 Jan 7.
8
SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.SLMSF-Net:用于RGB-D显著目标检测的语义定位与多尺度融合网络
Sensors (Basel). 2024 Feb 8;24(4):1117. doi: 10.3390/s24041117.
9
MobileSal: Extremely Efficient RGB-D Salient Object Detection.MobileSal:高效 RGB-D 显著目标检测。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):10261-10269. doi: 10.1109/TPAMI.2021.3134684. Epub 2022 Nov 7.
10
Dynamic Selective Network for RGB-D Salient Object Detection.基于动态选择网络的 RGB-D 显著目标检测
IEEE Trans Image Process. 2021;30:9179-9192. doi: 10.1109/TIP.2021.3123548. Epub 2021 Nov 10.

引用本文的文献

1
Cross-Modal Weakly Supervised RGB-D Salient Object Detection with a Focus on Filamentary Structures.关注丝状结构的跨模态弱监督RGB-D显著目标检测
Sensors (Basel). 2025 May 9;25(10):2990. doi: 10.3390/s25102990.
2
LRNet: lightweight attention-oriented residual fusion network for light field salient object detection.LRNet:用于光场显著目标检测的轻量级注意力导向残差融合网络
Sci Rep. 2024 Oct 29;14(1):26030. doi: 10.1038/s41598-024-76874-0.
3
SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.
SLMSF-Net:用于RGB-D显著目标检测的语义定位与多尺度融合网络
Sensors (Basel). 2024 Feb 8;24(4):1117. doi: 10.3390/s24041117.
4
Lightweight Cross-Modal Information Mutual Reinforcement Network for RGB-T Salient Object Detection.用于RGB-T显著目标检测的轻量级跨模态信息相互增强网络
Entropy (Basel). 2024 Jan 31;26(2):130. doi: 10.3390/e26020130.
5
Swin Transformer-Based Edge Guidance Network for RGB-D Salient Object Detection.基于Swin Transformer的RGB-D显著目标检测边缘引导网络
Sensors (Basel). 2023 Oct 29;23(21):8802. doi: 10.3390/s23218802.
6
RGB-D salient object detection via convolutional capsule network based on feature extraction and integration.基于特征提取与融合的卷积胶囊网络实现RGB-D显著目标检测
Sci Rep. 2023 Oct 17;13(1):17652. doi: 10.1038/s41598-023-44698-z.
7
Exploring Focus and Depth-Induced Saliency Detection for Light Field.探索光场的聚焦和深度诱导显著目标检测
Entropy (Basel). 2023 Sep 15;25(9):1336. doi: 10.3390/e25091336.
8
Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection.用于RGB-D显著目标检测的全局引导跨模态跨尺度网络
Sensors (Basel). 2023 Aug 17;23(16):7221. doi: 10.3390/s23167221.
9
Dynamic Sweep Experiments on a Heterogeneous Phase Composite System Based on Branched-Preformed Particle Gel in High Water-Cut Reservoirs after Polymer Flooding.聚合物驱后高含水油藏中基于分支预成型颗粒凝胶的非均相复合体系动态驱替实验
Gels. 2023 Apr 25;9(5):364. doi: 10.3390/gels9050364.
10
Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection.基于绝对和相对深度信息的 RGB-D 显著目标检测网络
Sensors (Basel). 2023 Mar 30;23(7):3611. doi: 10.3390/s23073611.