• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不确定性启发的RGB-D显著目标检测

Uncertainty Inspired RGB-D Saliency Detection.

作者信息

Zhang Jing, Fan Deng-Ping, Dai Yuchao, Anwar Saeed, Saleh Fatemeh, Aliakbarian Sadegh, Barnes Nick

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5761-5779. doi: 10.1109/TPAMI.2021.3073564. Epub 2022 Aug 4.

DOI:10.1109/TPAMI.2021.3073564
PMID:33856982
Abstract

We propose the first stochastic framework to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection models treat this task as a point estimation problem by predicting a single saliency map following a deterministic learning pipeline. We argue that, however, the deterministic solution is relatively ill-posed. Inspired by the saliency data labeling process, we propose a generative architecture to achieve probabilistic RGB-D saliency detection which utilizes a latent variable to model the labeling variations. Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution. The generator model is an encoder-decoder saliency network. To infer the latent variable, we introduce two different solutions: i) a Conditional Variational Auto-encoder with an extra encoder to approximate the posterior distribution of the latent variable; and ii) an Alternating Back-Propagation technique, which directly samples the latent variable from the true posterior distribution. Qualitative and quantitative results on six challenging RGB-D benchmark datasets show our approach's superior performance in learning the distribution of saliency maps. The source code is publicly available via our project page: https://github.com/JingZhang617/UCNet.

摘要

我们提出了首个随机框架,通过从数据标注过程中学习,将不确定性用于RGB-D显著目标检测。现有的RGB-D显著目标检测模型通过遵循确定性学习流程预测单个显著图,将此任务视为点估计问题。然而,我们认为确定性解决方案相对不适定。受显著数据标注过程的启发,我们提出一种生成架构,以实现概率性RGB-D显著目标检测,该架构利用一个潜在变量对标注变化进行建模。我们的框架包括两个主要模型:1)一个生成器模型,它将输入图像和潜在变量映射到随机显著预测;2)一个推理模型,它通过从真实或近似后验分布中对潜在变量进行采样来逐步更新该变量。生成器模型是一个编码器-解码器显著网络。为了推断潜在变量,我们引入了两种不同的解决方案:i)一种条件变分自编码器,带有一个额外的编码器来近似潜在变量的后验分布;ii)一种交替反向传播技术,它直接从真实后验分布中对潜在变量进行采样。在六个具有挑战性的RGB-D基准数据集上的定性和定量结果表明,我们的方法在学习显著图分布方面具有卓越性能。源代码可通过我们的项目页面公开获取:https://github.com/JingZhang617/UCNet。

相似文献

1
Uncertainty Inspired RGB-D Saliency Detection.不确定性启发的RGB-D显著目标检测
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5761-5779. doi: 10.1109/TPAMI.2021.3073564. Epub 2022 Aug 4.
2
An Energy-Based Prior for Generative Saliency.用于生成显著性的基于能量的先验。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13100-13116. doi: 10.1109/TPAMI.2023.3290610. Epub 2023 Oct 3.
3
Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.利用未标记的 RGB 图像提升 RGB-D 显著度检测。
IEEE Trans Image Process. 2022;31:1107-1119. doi: 10.1109/TIP.2021.3139232. Epub 2022 Jan 12.
4
Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB-D Images.用于RGB-D图像显著性预测的深度多模态融合自动编码器
Comput Intell Neurosci. 2021 May 5;2021:6610997. doi: 10.1155/2021/6610997. eCollection 2021.
5
CDNet: Complementary Depth Network for RGB-D Salient Object Detection.CDNet:用于RGB-D显著目标检测的互补深度网络。
IEEE Trans Image Process. 2021;30:3376-3390. doi: 10.1109/TIP.2021.3060167. Epub 2021 Mar 9.
6
CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection.CIR-Net:用于RGB-D显著目标检测的跨模态交互与优化
IEEE Trans Image Process. 2022;31:6800-6815. doi: 10.1109/TIP.2022.3216198. Epub 2022 Oct 28.
7
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection.DMRA:用于 RGB-D 显著度检测的深度诱导多尺度递归注意网络。
IEEE Trans Image Process. 2022;31:2321-2336. doi: 10.1109/TIP.2022.3154931. Epub 2022 Mar 11.
8
3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond.用于RGB-D显著目标检测及其他应用的3D卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):4309-4323. doi: 10.1109/TNNLS.2022.3202241. Epub 2024 Feb 29.
9
RGB-'D' Saliency Detection With Pseudo Depth.基于伪深度的 RGB-D 显著度检测。
IEEE Trans Image Process. 2019 May;28(5):2126-2139. doi: 10.1109/TIP.2018.2882156. Epub 2018 Nov 19.
10
Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency.用于预测RGB-D显著图的分层多模态自适应融合(HMAF)网络
Comput Intell Neurosci. 2020 Nov 20;2020:8841681. doi: 10.1155/2020/8841681. eCollection 2020.

引用本文的文献

1
Semisupervised 3D segmentation of pancreatic tumors in positron emission tomography/computed tomography images using a mutual information minimization and cross-fusion strategy.使用互信息最小化和交叉融合策略对正电子发射断层扫描/计算机断层扫描图像中的胰腺肿瘤进行半监督3D分割。
Quant Imaging Med Surg. 2024 Feb 1;14(2):1747-1765. doi: 10.21037/qims-23-1153. Epub 2024 Jan 23.
2
Bilateral adaptive graph convolutional network on CT based Covid-19 diagnosis with uncertainty-aware consensus-assisted multiple instance learning.基于 CT 的新冠诊断双侧自适应图卷积网络,具有不确定性感知共识辅助多实例学习。
Med Image Anal. 2023 Feb;84:102722. doi: 10.1016/j.media.2022.102722. Epub 2022 Dec 15.
3
Effective multiscale deep learning model for COVID19 segmentation tasks: A further step towards helping radiologist.
用于COVID-19分割任务的有效多尺度深度学习模型:向帮助放射科医生迈进的又一步。
Neurocomputing (Amst). 2022 Aug 14;499:63-80. doi: 10.1016/j.neucom.2022.05.009. Epub 2022 May 12.
4
A Novel Approach to Automated 3D Spalling Defects Inspection in Railway Tunnel Linings Using Laser Intensity and Depth Information.基于激光强度和深度信息的铁路隧道衬砌自动 3D 剥落缺陷检测新方法。
Sensors (Basel). 2021 Aug 25;21(17):5725. doi: 10.3390/s21175725.