• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

受视网膜启发的模型增强视觉显著性预测。

Retina-Inspired Models Enhance Visual Saliency Prediction.

作者信息

Shen Gang, Ma Wenjun, Zhai Wen, Lv Xuefei, Chen Guangyao, Tian Yonghong

机构信息

Smart Tower Co., Ltd., Beijing 100089, China.

State Unclear Electric Power Planning Design & Research Institute Co., Ltd., Beijing 100095, China.

出版信息

Entropy (Basel). 2025 Apr 18;27(4):436. doi: 10.3390/e27040436.

DOI:10.3390/e27040436
PMID:40282671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12026020/
Abstract

Biologically inspired retinal preprocessing improves visual perception by efficiently encoding and reducing entropy in images. In this study, we introduce a new saliency prediction framework that combines a retinal model with deep neural networks (DNNs) using information theory ideas. By mimicking the human retina, our method creates clearer saliency maps with lower entropy and supports efficient computation with DNNs by optimizing information flow and reducing redundancy. We treat saliency prediction as an information maximization problem, where important regions have high information and low local entropy. Tests on several benchmark datasets show that adding the retinal model boosts the performance of various bottom-up saliency prediction methods by better managing information and reducing uncertainty. We use metrics like mutual information and entropy to measure improvements in accuracy and efficiency. Our framework outperforms state-of-the-art models, producing saliency maps that closely match where people actually look. By combining neurobiological insights with information theory-using measures like Kullback-Leibler divergence and information gain-our method not only improves prediction accuracy but also offers a clear, quantitative understanding of saliency. This approach shows promise for future research that brings together neuroscience, entropy, and deep learning to enhance visual saliency prediction.

摘要

受生物启发的视网膜预处理通过有效编码和降低图像熵来改善视觉感知。在本研究中,我们引入了一种新的显著性预测框架,该框架利用信息论思想将视网膜模型与深度神经网络(DNN)相结合。通过模仿人类视网膜,我们的方法创建了具有更低熵的更清晰的显著性图,并通过优化信息流和减少冗余来支持DNN的高效计算。我们将显著性预测视为一个信息最大化问题,其中重要区域具有高信息和低局部熵。在几个基准数据集上的测试表明,添加视网膜模型通过更好地管理信息和减少不确定性,提高了各种自底向上显著性预测方法的性能。我们使用互信息和熵等指标来衡量准确性和效率的提高。我们的框架优于现有模型,生成的显著性图与人们实际注视的位置非常匹配。通过将神经生物学见解与信息论相结合——使用诸如库尔贝克-莱布勒散度和信息增益等度量——我们的方法不仅提高了预测准确性,还提供了对显著性的清晰、定量理解。这种方法为未来将神经科学、熵和深度学习结合起来以增强视觉显著性预测的研究展现出了前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/179db8b09cc4/entropy-27-00436-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/8f7d7dcd71a3/entropy-27-00436-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/b3be3a48b271/entropy-27-00436-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/179db8b09cc4/entropy-27-00436-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/8f7d7dcd71a3/entropy-27-00436-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/b3be3a48b271/entropy-27-00436-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1c/12026020/179db8b09cc4/entropy-27-00436-g003.jpg

相似文献

1
Retina-Inspired Models Enhance Visual Saliency Prediction.受视网膜启发的模型增强视觉显著性预测。
Entropy (Basel). 2025 Apr 18;27(4):436. doi: 10.3390/e27040436.
2
Saliency-driven system models for cell analysis with deep learning.基于深度学习的细胞分析的显著驱动系统模型。
Comput Methods Programs Biomed. 2019 Dec;182:105053. doi: 10.1016/j.cmpb.2019.105053. Epub 2019 Aug 26.
3
Maximal entropy random walk for region-based visual saliency.基于区域的视觉显著度的最大熵随机游走。
IEEE Trans Cybern. 2014 Sep;44(9):1661-72. doi: 10.1109/TCYB.2013.2292054.
4
Video saliency incorporating spatiotemporal cues and uncertainty weighting.视频显著度融合时空线索和不确定性加权。
IEEE Trans Image Process. 2014 Sep;23(9):3910-21. doi: 10.1109/TIP.2014.2336549. Epub 2014 Jul 16.
5
Visual Saliency via Embedding Hierarchical Knowledge in a Deep Neural Network.通过在深度神经网络中嵌入分层知识实现视觉显著性
IEEE Trans Image Process. 2020 Aug 19;PP. doi: 10.1109/TIP.2020.3016464.
6
Deep learning-based image deconstruction method with maintained saliency.基于深度学习的保显著度图像分解方法。
Neural Netw. 2022 Nov;155:224-241. doi: 10.1016/j.neunet.2022.08.015. Epub 2022 Aug 23.
7
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis.医学成像中基于显著性的可解释深度学习:架起视觉可解释性与统计定量分析之间的桥梁。
BioData Min. 2024 Jun 22;17(1):18. doi: 10.1186/s13040-024-00370-4.
8
Revisiting Video Saliency Prediction in the Deep Learning Era.深度学习时代的视频显著度预测再探讨。
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):220-237. doi: 10.1109/TPAMI.2019.2924417. Epub 2020 Dec 4.
9
Learning Saliency From Single Noisy Labelling: A Robust Model Fitting Perspective.从单噪标签中学习显著度:稳健的模型拟合视角。
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2866-2873. doi: 10.1109/TPAMI.2020.3046486. Epub 2021 Jul 1.
10
Clinical validation of saliency maps for understanding deep neural networks in ophthalmology.用于理解眼科中深度神经网络的显著图的临床验证。
Med Image Anal. 2022 Apr;77:102364. doi: 10.1016/j.media.2022.102364. Epub 2022 Jan 22.

本文引用的文献

1
CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection.CAVER:用于双模态显著目标检测的跨模态视图混合变换器
IEEE Trans Image Process. 2023;32:892-904. doi: 10.1109/TIP.2023.3234702. Epub 2023 Jan 23.
2
Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection.学习选择性互注意力和对比用于 RGB-D 显著检测。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9026-9042. doi: 10.1109/TPAMI.2021.3122139. Epub 2022 Nov 7.
3
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model.
基于长短期记忆网络的显著性注意力模型预测人眼注视点
IEEE Trans Image Process. 2018 Jun 29. doi: 10.1109/TIP.2018.2851672.
4
Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.将深度神经网络与人类视觉物体识别的时空皮层动力学进行比较,揭示了层级对应关系。
Sci Rep. 2016 Jun 10;6:27755. doi: 10.1038/srep27755.
5
SUN: A Bayesian framework for saliency using natural statistics.SUN:一种使用自然统计的显著性贝叶斯框架。
J Vis. 2008 Dec 16;8(7):32.1-20. doi: 10.1167/8.7.32.
6
Virtual Retina: a biological retina model and simulator, with contrast gain control.虚拟视网膜:一种具有对比度增益控制的生物视网膜模型与模拟器。
J Comput Neurosci. 2009 Apr;26(2):219-49. doi: 10.1007/s10827-008-0108-4. Epub 2008 Aug 1.
7
Sensory coding in the vertebrate retina: towards an adaptive control of visual sensitivity.脊椎动物视网膜中的感觉编码:迈向视觉敏感度的自适应控制。
Network. 1996 May;7(2):317-23. doi: 10.1088/0954-898X/7/2/012.
8
Multiple neural spike train data analysis: state-of-the-art and future challenges.多神经脉冲序列数据分析:现状与未来挑战。
Nat Neurosci. 2004 May;7(5):456-61. doi: 10.1038/nn1228.
9
Adaptation of retinal processing to image contrast and spatial scale.视网膜处理对图像对比度和空间尺度的适应。
Nature. 1997 Mar 6;386(6620):69-73. doi: 10.1038/386069a0.