• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度空间上下文的显著性检测长短期记忆卷积网络。

A Deep Spatial Contextual Long-Term Recurrent Convolutional Network for Saliency Detection.

出版信息

IEEE Trans Image Process. 2018 Jul;27(7):3264-3274. doi: 10.1109/TIP.2018.2817047.

DOI:10.1109/TIP.2018.2817047
PMID:29641405
Abstract

Traditional saliency models usually adopt hand-crafted image features and human-designed mechanisms to calculate local or global contrast. In this paper, we propose a novel computational saliency model, i.e., deep spatial contextual long-term recurrent convolutional network (DSCLRCN), to predict where people look in natural scenes. DSCLRCN first automatically learns saliency related local features on each image location in parallel. Then, in contrast with most other deep network based saliency models which infer saliency in local contexts, DSCLRCN can mimic the cortical lateral inhibition mechanisms in human visual system to incorporate global contexts to assess the saliency of each image location by leveraging the deep spatial long short-term memory (DSLSTM) model. Moreover, we also integrate scene context modulation in DSLSTM for saliency inference, leading to a novel deep spatial contextual LSTM (DSCLSTM) model. The whole network can be trained end-to-end and works efficiently when testing. Experimental results on two benchmark datasets show that DSCLRCN can achieve state-of-the-art performance on saliency detection. Furthermore, the proposed DSCLSTM model can significantly boost the saliency detection performance by incorporating both global spatial interconnections and scene context modulation, which may uncover novel inspirations for studies on them in computational saliency models.

摘要

传统的显著度模型通常采用手工制作的图像特征和人为设计的机制来计算局部或全局对比度。在本文中,我们提出了一种新颖的计算显著度模型,即深度空间上下文长期递归卷积网络(DSCLRCN),用于预测人们在自然场景中看哪里。DSCLRCN 首先在每个图像位置上自动并行学习与显著度相关的局部特征。然后,与大多数其他基于深度网络的显著度模型不同,DSCLRCN 可以模拟人类视觉系统中的皮层侧抑制机制,通过利用深度空间长短期记忆(DSLSTM)模型来整合全局上下文,以评估每个图像位置的显著度。此外,我们还在 DSLSTM 中集成了场景上下文调制,用于显著度推断,从而得到一种新颖的深度空间上下文 LSTM(DSCLSTM)模型。整个网络可以端到端训练,在测试时效率很高。在两个基准数据集上的实验结果表明,DSCLRCN 可以在显著度检测方面达到最先进的性能。此外,所提出的 DSCLSTM 模型通过整合全局空间相互连接和场景上下文调制,可以显著提高显著度检测性能,这可能为计算显著度模型中对它们的研究提供新的启示。

相似文献

1
A Deep Spatial Contextual Long-Term Recurrent Convolutional Network for Saliency Detection.基于深度空间上下文的显著性检测长短期记忆卷积网络。
IEEE Trans Image Process. 2018 Jul;27(7):3264-3274. doi: 10.1109/TIP.2018.2817047.
2
Video Salient Object Detection via Fully Convolutional Networks.基于全卷积网络的视频显著目标检测
IEEE Trans Image Process. 2018;27(1):38-49. doi: 10.1109/TIP.2017.2754941.
3
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
4
A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.一种用于视频显著性预测的时空循环神经网络。
IEEE Trans Image Process. 2021;30:572-587. doi: 10.1109/TIP.2020.3036749. Epub 2020 Nov 24.
5
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
6
Contextual encoder-decoder network for visual saliency prediction.上下文编解码网络的视觉显著性预测。
Neural Netw. 2020 Sep;129:261-270. doi: 10.1016/j.neunet.2020.05.004. Epub 2020 May 8.
7
DeepFix: A Fully Convolutional Neural Network for Predicting Human Eye Fixations.DeepFix:一种用于预测人眼注视点的全卷积神经网络。
IEEE Trans Image Process. 2017 Sep;26(9):4446-4456. doi: 10.1109/TIP.2017.2710620.
8
End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network.通过深度上下文时空网络实现的端到端视频显著目标检测
IEEE Trans Neural Netw Learn Syst. 2021 Apr;32(4):1691-1702. doi: 10.1109/TNNLS.2020.2986823. Epub 2021 Apr 2.
9
Deep Visual Attention Prediction.深度视觉注意力预测。
IEEE Trans Image Process. 2018 May;27(5):2368-2378. doi: 10.1109/TIP.2017.2787612. Epub 2017 Dec 27.
10
Co-Saliency-Enhanced Deep Recurrent Convolutional Networks for Human Fall Detection in E-Healthcare.用于电子医疗保健中人体跌倒检测的协同显著性增强深度循环卷积网络
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:1572-1575. doi: 10.1109/EMBC.2018.8512586.

引用本文的文献

1
DeSa COVID-19: Deep salient COVID-19 image-based quality assessment.DeSa COVID-19:基于深度显著特征的COVID-19图像质量评估
J King Saud Univ Comput Inf Sci. 2022 Nov;34(10):9501-9512. doi: 10.1016/j.jksuci.2021.11.013. Epub 2021 Dec 6.
2
Intelligent auxiliary system for music performance under edge computing and long short-term recurrent neural networks.基于边缘计算和长短时记忆递归神经网络的音乐演奏智能辅助系统。
PLoS One. 2023 May 8;18(5):e0285496. doi: 10.1371/journal.pone.0285496. eCollection 2023.
3
Characteristics of Visual Saliency Caused by Character Feature for Reconstruction of Saliency Map Model.
基于字符特征的视觉显著性特性用于显著性图模型重建
Vision (Basel). 2021 Oct 19;5(4):49. doi: 10.3390/vision5040049.
4
Expertise Classification of Soccer Goalkeepers in Highly Dynamic Decision Tasks: A Deep Learning Approach for Temporal and Spatial Feature Recognition of Fixation Image Patch Sequences.高动态决策任务中足球守门员的专业技能分类:一种用于注视图像块序列时空特征识别的深度学习方法。
Front Sports Act Living. 2021 Jul 26;3:692526. doi: 10.3389/fspor.2021.692526. eCollection 2021.
5
Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB-D Images.用于RGB-D图像显著性预测的深度多模态融合自动编码器
Comput Intell Neurosci. 2021 May 5;2021:6610997. doi: 10.1155/2021/6610997. eCollection 2021.
6
COCO-Search18 fixation dataset for predicting goal-directed attention control.COCO-Search18 数据集用于预测目标导向注意力控制。
Sci Rep. 2021 Apr 22;11(1):8776. doi: 10.1038/s41598-021-87715-9.
7
Ranking-Based Salient Object Detection and Depth Prediction for Shallow Depth-of-Field.基于排序的显著目标检测和浅层景深的深度预测。
Sensors (Basel). 2021 Mar 5;21(5):1815. doi: 10.3390/s21051815.
8
Correspondence between Monkey Visual Cortices and Layers of a Saliency Map Model Based on a Deep Convolutional Neural Network for Representations of Natural Images.基于深度卷积神经网络的自然图像表示的显著性映射模型的猴子视觉皮层与层之间的对应关系。
eNeuro. 2021 Feb 9;8(1). doi: 10.1523/ENEURO.0200-20.2020. Print 2021 Jan-Feb.
9
Salience Models: A Computational Cognitive Neuroscience Review.显著性模型:计算认知神经科学综述
Vision (Basel). 2019 Oct 25;3(4):56. doi: 10.3390/vision3040056.
10
DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning.DeepNitro:基于深度学习的蛋白质硝化和亚硝化位点预测。
Genomics Proteomics Bioinformatics. 2018 Aug;16(4):294-306. doi: 10.1016/j.gpb.2018.04.007. Epub 2018 Sep 27.