• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度视觉注意力预测。

Deep Visual Attention Prediction.

出版信息

IEEE Trans Image Process. 2018 May;27(5):2368-2378. doi: 10.1109/TIP.2017.2787612. Epub 2017 Dec 27.

DOI:10.1109/TIP.2017.2787612
PMID:29990140
Abstract

In this paper, we aim to predict human eye fixation with view-free scenes based on an end-to-end deep learning architecture. Although convolutional neural networks (CNNs) have made substantial improvement on human attention prediction, it is still needed to improve the CNN-based attention models by efficiently leveraging multi-scale features. Our visual attention network is proposed to capture hierarchical saliency information from deep, coarse layers with global saliency information to shallow, fine layers with local saliency response. Our model is based on a skip-layer network structure, which predicts human attention from multiple convolutional layers with various reception fields. Final saliency prediction is achieved via the cooperation of those global and local predictions. Our model is learned in a deep supervision manner, where supervision is directly fed into multi-level layers, instead of previous approaches of providing supervision only at the output layer and propagating this supervision back to earlier layers. Our model thus incorporates multi-level saliency predictions within a single network, which significantly decreases the redundancy of previous approaches of learning multiple network streams with different input scales. Extensive experimental analysis on various challenging benchmark data sets demonstrate our method yields the state-of-the-art performance with competitive inference time.

摘要

在本文中,我们旨在基于端到端深度学习架构,预测基于视图的场景中的人眼注视。尽管卷积神经网络(CNN)在人类注意力预测方面取得了重大进展,但仍需要通过有效地利用多尺度特征来改进基于 CNN 的注意力模型。我们的视觉注意网络旨在从具有全局显著性信息的深层、粗粒度层到具有局部显著性响应的浅层、细粒度层中捕获分层显著信息。我们的模型基于跳过层网络结构,从具有不同感受野的多个卷积层预测人类注意力。最终的显著度预测是通过全局和局部预测的合作来实现的。我们的模型以深度监督的方式进行学习,其中直接在多层级提供监督,而不是以前的方法只在输出层提供监督并将这种监督传播回早期层。我们的模型因此在单个网络中结合了多层次的显著度预测,这显著减少了学习具有不同输入尺度的多个网络流的先前方法的冗余性。在各种具有挑战性的基准数据集上进行的广泛实验分析表明,我们的方法在具有竞争力的推断时间下实现了最先进的性能。

相似文献

1
Deep Visual Attention Prediction.深度视觉注意力预测。
IEEE Trans Image Process. 2018 May;27(5):2368-2378. doi: 10.1109/TIP.2017.2787612. Epub 2017 Dec 27.
2
DeepFix: A Fully Convolutional Neural Network for Predicting Human Eye Fixations.DeepFix:一种用于预测人眼注视点的全卷积神经网络。
IEEE Trans Image Process. 2017 Sep;26(9):4446-4456. doi: 10.1109/TIP.2017.2710620.
3
Personalized Saliency and Its Prediction.个性化显著性及其预测。
IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2975-2989. doi: 10.1109/TPAMI.2018.2866563. Epub 2018 Aug 23.
4
CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion.基于卷积神经网络的跨视图迁移和多视图融合的 RGB-D 显著目标检测。
IEEE Trans Cybern. 2018 Nov;48(11):3171-3183. doi: 10.1109/TCYB.2017.2761775. Epub 2017 Oct 31.
5
Attention-based fusion network for human eye-fixation prediction in 3D images.用于3D图像中人眼注视预测的基于注意力的融合网络。
Opt Express. 2019 Nov 11;27(23):34056-34066. doi: 10.1364/OE.27.034056.
6
Visual Saliency Detection Based on Multiscale Deep CNN Features.基于多尺度深度卷积神经网络特征的显著目标检测
IEEE Trans Image Process. 2016 Nov;25(11):5012-5024. doi: 10.1109/TIP.2016.2602079. Epub 2016 Aug 24.
7
Saliency-driven system models for cell analysis with deep learning.基于深度学习的细胞分析的显著驱动系统模型。
Comput Methods Programs Biomed. 2019 Dec;182:105053. doi: 10.1016/j.cmpb.2019.105053. Epub 2019 Aug 26.
8
Learning to Predict Eye Fixations via Multiresolution Convolutional Neural Networks.通过多分辨率卷积神经网络学习预测眼动。
IEEE Trans Neural Netw Learn Syst. 2018 Feb;29(2):392-404. doi: 10.1109/TNNLS.2016.2628878. Epub 2016 Nov 29.
9
Two-Stage Learning to Predict Human Eye Fixations via SDAEs.基于 SDAEs 的两阶段学习预测人类眼动注视。
IEEE Trans Cybern. 2016 Feb;46(2):487-98. doi: 10.1109/TCYB.2015.2404432. Epub 2015 Feb 27.
10
A novel biomedical image indexing and retrieval system via deep preference learning.一种基于深度偏好学习的新型生物医学图像索引和检索系统。
Comput Methods Programs Biomed. 2018 May;158:53-69. doi: 10.1016/j.cmpb.2018.02.003. Epub 2018 Feb 6.

引用本文的文献

1
Attention mechanism based CNN-LSTM hybrid deep learning model for atmospheric ozone concentration prediction.基于注意力机制的CNN-LSTM混合深度学习模型用于大气臭氧浓度预测。
Sci Rep. 2025 Jul 1;15(1):21260. doi: 10.1038/s41598-025-05877-2.
2
Scene categorization by Hessian-regularized active perceptual feature selection.基于黑塞正则化主动感知特征选择的场景分类
Sci Rep. 2025 Jan 4;15(1):739. doi: 10.1038/s41598-024-84181-x.
3
DeSa COVID-19: Deep salient COVID-19 image-based quality assessment.DeSa COVID-19:基于深度显著特征的COVID-19图像质量评估
J King Saud Univ Comput Inf Sci. 2022 Nov;34(10):9501-9512. doi: 10.1016/j.jksuci.2021.11.013. Epub 2021 Dec 6.
4
Translating AI to Clinical Practice: Overcoming Data Shift with Explainability.将人工智能转化为临床实践:用可解释性克服数据偏移。
Radiographics. 2023 May;43(5):e220105. doi: 10.1148/rg.220105.
5
Continuous Prediction of Web User Visual Attention on Short Span Windows Based on Gaze Data Analytics.基于眼动数据分析的短窗网页用户视觉注意力持续预测。
Sensors (Basel). 2023 Feb 18;23(4):2294. doi: 10.3390/s23042294.
6
Multiscale Cascaded Attention Network for Saliency Detection Based on ResNet.基于 ResNet 的多尺度级联注意力网络的显著目标检测
Sensors (Basel). 2022 Dec 16;22(24):9950. doi: 10.3390/s22249950.
7
Fuzzy Color Aura Matrices for Texture Image Segmentation.用于纹理图像分割的模糊彩色光环矩阵
J Imaging. 2022 Sep 8;8(9):244. doi: 10.3390/jimaging8090244.
8
A multichannel optical computing architecture for advanced machine vision.一种用于先进机器视觉的多通道光学计算架构。
Light Sci Appl. 2022 Aug 18;11(1):255. doi: 10.1038/s41377-022-00945-y.
9
A Bio-Inspired Endogenous Attention-Based Architecture for a Social Robot.一种基于生物启发的内源性注意的社交机器人架构。
Sensors (Basel). 2022 Jul 13;22(14):5248. doi: 10.3390/s22145248.
10
Rice bacterial blight resistant cultivar selection based on visible/near-infrared spectrum and deep learning.基于可见/近红外光谱和深度学习的水稻抗白叶枯病品种筛选
Plant Methods. 2022 Apr 15;18(1):49. doi: 10.1186/s13007-022-00882-2.