• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

递归神经网络可以解释生物视觉中速度和精度的灵活交易。

Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.

机构信息

Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom.

Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands.

出版信息

PLoS Comput Biol. 2020 Oct 2;16(10):e1008215. doi: 10.1371/journal.pcbi.1008215. eCollection 2020 Oct.

DOI:10.1371/journal.pcbi.1008215
PMID:33006992
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7556458/
Abstract

Deep feedforward neural network models of vision dominate in both computational neuroscience and engineering. The primate visual system, by contrast, contains abundant recurrent connections. Recurrent signal flow enables recycling of limited computational resources over time, and so might boost the performance of a physically finite brain or model. Here we show: (1) Recurrent convolutional neural network models outperform feedforward convolutional models matched in their number of parameters in large-scale visual recognition tasks on natural images. (2) Setting a confidence threshold, at which recurrent computations terminate and a decision is made, enables flexible trading of speed for accuracy. At a given confidence threshold, the model expends more time and energy on images that are harder to recognise, without requiring additional parameters for deeper computations. (3) The recurrent model's reaction time for an image predicts the human reaction time for the same image better than several parameter-matched and state-of-the-art feedforward models. (4) Across confidence thresholds, the recurrent model emulates the behaviour of feedforward control models in that it achieves the same accuracy at approximately the same computational cost (mean number of floating-point operations). However, the recurrent model can be run longer (higher confidence threshold) and then outperforms parameter-matched feedforward comparison models. These results suggest that recurrent connectivity, a hallmark of biological visual systems, may be essential for understanding the accuracy, flexibility, and dynamics of human visual recognition.

摘要

深度前馈神经网络模型在计算神经科学和工程领域占据主导地位。相比之下,灵长类动物的视觉系统包含丰富的递归连接。递归信号流使有限的计算资源能够随时间重复利用,从而可能提高物理上有限的大脑或模型的性能。在这里,我们展示了:(1)在大规模视觉识别任务中,与参数匹配的前馈卷积模型相比,递归卷积神经网络模型在自然图像上的表现更好。(2)设置置信度阈值,在该阈值处,递归计算终止并做出决策,从而能够灵活地在速度和准确性之间进行权衡。在给定的置信度阈值下,该模型在识别困难的图像上花费更多的时间和精力,而不需要更深层次计算的额外参数。(3)模型对图像的反应时间比几个参数匹配和最先进的前馈模型对同一图像的反应时间预测更好。(4)在置信度阈值内,递归模型的行为与前馈控制模型相似,即在大致相同的计算成本(浮点运算的平均数量)下达到相同的准确性。然而,递归模型可以运行更长时间(更高的置信度阈值),并且优于参数匹配的前馈比较模型。这些结果表明,递归连接是生物视觉系统的一个标志,对于理解人类视觉识别的准确性、灵活性和动态性可能至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/1fc2430b2593/pcbi.1008215.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/894ec4a6f5d9/pcbi.1008215.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/a0f120532bce/pcbi.1008215.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/62d07d3dcc55/pcbi.1008215.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/21834bd3e3b5/pcbi.1008215.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/546ca7fb76b5/pcbi.1008215.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/fa7ef3d171aa/pcbi.1008215.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/53165cb915f2/pcbi.1008215.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/21674226283e/pcbi.1008215.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/1fc2430b2593/pcbi.1008215.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/894ec4a6f5d9/pcbi.1008215.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/a0f120532bce/pcbi.1008215.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/62d07d3dcc55/pcbi.1008215.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/21834bd3e3b5/pcbi.1008215.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/546ca7fb76b5/pcbi.1008215.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/fa7ef3d171aa/pcbi.1008215.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/53165cb915f2/pcbi.1008215.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/21674226283e/pcbi.1008215.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/1fc2430b2593/pcbi.1008215.g009.jpg

相似文献

1
Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.递归神经网络可以解释生物视觉中速度和精度的灵活交易。
PLoS Comput Biol. 2020 Oct 2;16(10):e1008215. doi: 10.1371/journal.pcbi.1008215. eCollection 2020 Oct.
2
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
3
Recurrent Connections in the Primate Ventral Visual Stream Mediate a Trade-Off Between Task Performance and Network Size During Core Object Recognition.灵长类动物腹侧视觉流中的重复连接在核心物体识别过程中,在任务表现和网络规模之间进行权衡。
Neural Comput. 2022 Jul 14;34(8):1652-1675. doi: 10.1162/neco_a_01506.
4
A neurocomputational model of decision and confidence in object recognition task.一种用于对象识别任务的决策和信心的神经计算模型。
Neural Netw. 2024 Jul;175:106318. doi: 10.1016/j.neunet.2024.106318. Epub 2024 Apr 12.
5
Beyond the feedforward sweep: feedback computations in the visual cortex.超越前馈扫掠:视觉皮层中的反馈计算。
Ann N Y Acad Sci. 2020 Mar;1464(1):222-241. doi: 10.1111/nyas.14320. Epub 2020 Feb 28.
6
Visual perception of liquids: Insights from deep neural networks.液体的视觉感知:来自深度神经网络的见解。
PLoS Comput Biol. 2020 Aug 19;16(8):e1008018. doi: 10.1371/journal.pcbi.1008018. eCollection 2020 Aug.
7
Going in circles is the way forward: the role of recurrence in visual inference.兜圈子是前进的道路:在视觉推理中再现的作用。
Curr Opin Neurobiol. 2020 Dec;65:176-193. doi: 10.1016/j.conb.2020.11.009. Epub 2020 Dec 3.
8
Capsule networks as recurrent models of grouping and segmentation.胶囊网络作为分组和分割的递归模型。
PLoS Comput Biol. 2020 Jul 21;16(7):e1008017. doi: 10.1371/journal.pcbi.1008017. eCollection 2020 Jul.
9
Structurally-constrained encoding framework using a multi-voxel reduced-rank latent model for human natural vision.使用多体素降秩潜在模型进行人类自然视觉的结构约束编码框架。
J Neural Eng. 2024 Jul 26;21(4). doi: 10.1088/1741-2552/ad6184.
10
Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition.循环卷积神经网络:生物物体识别的更佳模型。
Front Psychol. 2017 Sep 12;8:1551. doi: 10.3389/fpsyg.2017.01551. eCollection 2017.

引用本文的文献

1
Recurrence affects the geometry of visual representations across the ventral visual stream in the human brain.复发会影响人类大脑腹侧视觉通路中视觉表征的几何结构。
PLoS Biol. 2025 Aug 25;23(8):e3003354. doi: 10.1371/journal.pbio.3003354. eCollection 2025 Aug.
2
High-level visual representations in the human brain are aligned with large language models.人类大脑中的高级视觉表征与大语言模型相一致。
Nat Mach Intell. 2025;7(8):1220-1234. doi: 10.1038/s42256-025-01072-0. Epub 2025 Aug 7.
3
Computational basis of hierarchical and counterfactual information processing.

本文引用的文献

1
Recurrence is required to capture the representational dynamics of the human visual system.为了捕捉人类视觉系统的表示动态,需要进行再现。
Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.
2
Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior.证据表明,循环回路对于腹侧流执行核心物体识别行为至关重要。
Nat Neurosci. 2019 Jun;22(6):974-983. doi: 10.1038/s41593-019-0392-5. Epub 2019 Apr 29.
3
Recurrent computations for visual pattern completion.
分层与反事实信息处理的计算基础。
Nat Hum Behav. 2025 Jun 11. doi: 10.1038/s41562-025-02232-3.
4
End-to-end topographic networks as models of cortical map formation and human visual behaviour.作为皮质图谱形成和人类视觉行为模型的端到端地形网络
Nat Hum Behav. 2025 Jun 6. doi: 10.1038/s41562-025-02220-7.
5
Self-Attention-Based Contextual Modulation Improves Neural System Identification.基于自注意力的上下文调制改善神经系统识别。
ArXiv. 2025 Feb 28:arXiv:2406.07843v3.
6
Unraveling the complexity of rat object vision requires a full convolutional network and beyond.剖析大鼠物体视觉的复杂性需要一个全卷积网络及其他技术。
Patterns (N Y). 2025 Jan 17;6(2):101149. doi: 10.1016/j.patter.2024.101149. eCollection 2025 Feb 14.
7
An image-computable model of speeded decision-making.一种快速决策的图像可计算模型。
Elife. 2025 Feb 28;13:RP98351. doi: 10.7554/eLife.98351.
8
RTify: Aligning Deep Neural Networks with Human Behavioral Decisions.RTify:使深度神经网络与人类行为决策保持一致
ArXiv. 2024 Dec 26:arXiv:2411.03630v2.
9
Benchmarking the speed-accuracy tradeoff in object recognition by humans and neural networks.比较人类和神经网络在目标识别中速度与准确性的权衡
J Vis. 2025 Jan 2;25(1):4. doi: 10.1167/jov.25.1.4.
10
Teaching deep networks to see shape: Lessons from a simplified visual world.教深度网络识别形状:简化视觉世界的经验教训。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012019. doi: 10.1371/journal.pcbi.1012019. eCollection 2024 Nov.
视觉模式完成的反复计算。
Proc Natl Acad Sci U S A. 2018 Aug 28;115(35):8835-8840. doi: 10.1073/pnas.1719397115. Epub 2018 Aug 13.
4
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
5
Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition.循环卷积神经网络:生物物体识别的更佳模型。
Front Psychol. 2017 Sep 12;8:1551. doi: 10.3389/fpsyg.2017.01551. eCollection 2017.
6
Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.深度神经网络:一种用于模拟生物视觉和大脑信息处理的新框架。
Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.
7
Representational Distance Learning for Deep Neural Networks.深度神经网络的代表性距离学习
Front Comput Neurosci. 2016 Dec 27;10:131. doi: 10.3389/fncom.2016.00131. eCollection 2016.
8
Representational Dynamics of Facial Viewpoint Encoding.面部视角编码的表征动态
J Cogn Neurosci. 2017 Apr;29(4):637-651. doi: 10.1162/jocn_a_01070. Epub 2016 Oct 28.
9
Extensive training leads to temporal and spatial shifts of cortical activity underlying visual category selectivity.广泛的训练会导致视觉类别选择性背后的皮层活动在时间和空间上发生变化。
Neuroimage. 2016 Jul 1;134:22-34. doi: 10.1016/j.neuroimage.2016.03.066. Epub 2016 Apr 6.
10
Using goal-driven deep learning models to understand sensory cortex.利用目标驱动的深度学习模型理解感觉皮层。
Nat Neurosci. 2016 Mar;19(3):356-65. doi: 10.1038/nn.4244.