Suppr
超能文献

递归神经网络可以解释生物视觉中速度和精度的灵活交易。

Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.

机构信息

Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom.

Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands.

出版信息

PLoS Comput Biol. 2020 Oct 2;16(10):e1008215. doi: 10.1371/journal.pcbi.1008215. eCollection 2020 Oct.

DOI:10.1371/journal.pcbi.1008215

PMID:33006992

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7556458/

Abstract

Deep feedforward neural network models of vision dominate in both computational neuroscience and engineering. The primate visual system, by contrast, contains abundant recurrent connections. Recurrent signal flow enables recycling of limited computational resources over time, and so might boost the performance of a physically finite brain or model. Here we show: (1) Recurrent convolutional neural network models outperform feedforward convolutional models matched in their number of parameters in large-scale visual recognition tasks on natural images. (2) Setting a confidence threshold, at which recurrent computations terminate and a decision is made, enables flexible trading of speed for accuracy. At a given confidence threshold, the model expends more time and energy on images that are harder to recognise, without requiring additional parameters for deeper computations. (3) The recurrent model's reaction time for an image predicts the human reaction time for the same image better than several parameter-matched and state-of-the-art feedforward models. (4) Across confidence thresholds, the recurrent model emulates the behaviour of feedforward control models in that it achieves the same accuracy at approximately the same computational cost (mean number of floating-point operations). However, the recurrent model can be run longer (higher confidence threshold) and then outperforms parameter-matched feedforward comparison models. These results suggest that recurrent connectivity, a hallmark of biological visual systems, may be essential for understanding the accuracy, flexibility, and dynamics of human visual recognition.

摘要

深度前馈神经网络模型在计算神经科学和工程领域占据主导地位。相比之下，灵长类动物的视觉系统包含丰富的递归连接。递归信号流使有限的计算资源能够随时间重复利用，从而可能提高物理上有限的大脑或模型的性能。在这里，我们展示了：（1）在大规模视觉识别任务中，与参数匹配的前馈卷积模型相比，递归卷积神经网络模型在自然图像上的表现更好。（2）设置置信度阈值，在该阈值处，递归计算终止并做出决策，从而能够灵活地在速度和准确性之间进行权衡。在给定的置信度阈值下，该模型在识别困难的图像上花费更多的时间和精力，而不需要更深层次计算的额外参数。（3）模型对图像的反应时间比几个参数匹配和最先进的前馈模型对同一图像的反应时间预测更好。（4）在置信度阈值内，递归模型的行为与前馈控制模型相似，即在大致相同的计算成本（浮点运算的平均数量）下达到相同的准确性。然而，递归模型可以运行更长时间（更高的置信度阈值），并且优于参数匹配的前馈比较模型。这些结果表明，递归连接是生物视觉系统的一个标志，对于理解人类视觉识别的准确性、灵活性和动态性可能至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8550/7556458/894ec4a6f5d9/pcbi.1008215.g001.jpg

相似文献

Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.

PLoS Comput Biol. 2020 Oct 2;16(10):e1008215. doi: 10.1371/journal.pcbi.1008215. eCollection 2020 Oct.

Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.

Recurrent Connections in the Primate Ventral Visual Stream Mediate a Trade-Off Between Task Performance and Network Size During Core Object Recognition.

Neural Comput. 2022 Jul 14;34(8):1652-1675. doi: 10.1162/neco_a_01506.

A neurocomputational model of decision and confidence in object recognition task.

Neural Netw. 2024 Jul;175:106318. doi: 10.1016/j.neunet.2024.106318. Epub 2024 Apr 12.

Beyond the feedforward sweep: feedback computations in the visual cortex.

Ann N Y Acad Sci. 2020 Mar;1464(1):222-241. doi: 10.1111/nyas.14320. Epub 2020 Feb 28.

Visual perception of liquids: Insights from deep neural networks.

PLoS Comput Biol. 2020 Aug 19;16(8):e1008018. doi: 10.1371/journal.pcbi.1008018. eCollection 2020 Aug.

Going in circles is the way forward: the role of recurrence in visual inference.

Curr Opin Neurobiol. 2020 Dec;65:176-193. doi: 10.1016/j.conb.2020.11.009. Epub 2020 Dec 3.

Capsule networks as recurrent models of grouping and segmentation.

PLoS Comput Biol. 2020 Jul 21;16(7):e1008017. doi: 10.1371/journal.pcbi.1008017. eCollection 2020 Jul.

Structurally-constrained encoding framework using a multi-voxel reduced-rank latent model for human natural vision.

J Neural Eng. 2024 Jul 26;21(4). doi: 10.1088/1741-2552/ad6184.

Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition.

Front Psychol. 2017 Sep 12;8:1551. doi: 10.3389/fpsyg.2017.01551. eCollection 2017.

引用本文的文献

Recurrence affects the geometry of visual representations across the ventral visual stream in the human brain.

PLoS Biol. 2025 Aug 25;23(8):e3003354. doi: 10.1371/journal.pbio.3003354. eCollection 2025 Aug.

High-level visual representations in the human brain are aligned with large language models.

Nat Mach Intell. 2025;7(8):1220-1234. doi: 10.1038/s42256-025-01072-0. Epub 2025 Aug 7.

Computational basis of hierarchical and counterfactual information processing.

Nat Hum Behav. 2025 Jun 11. doi: 10.1038/s41562-025-02232-3.

End-to-end topographic networks as models of cortical map formation and human visual behaviour.

Nat Hum Behav. 2025 Jun 6. doi: 10.1038/s41562-025-02220-7.

Self-Attention-Based Contextual Modulation Improves Neural System Identification.

ArXiv. 2025 Feb 28:arXiv:2406.07843v3.

Unraveling the complexity of rat object vision requires a full convolutional network and beyond.

Patterns (N Y). 2025 Jan 17;6(2):101149. doi: 10.1016/j.patter.2024.101149. eCollection 2025 Feb 14.

An image-computable model of speeded decision-making.

Elife. 2025 Feb 28;13:RP98351. doi: 10.7554/eLife.98351.

RTify: Aligning Deep Neural Networks with Human Behavioral Decisions.

ArXiv. 2024 Dec 26:arXiv:2411.03630v2.

Benchmarking the speed-accuracy tradeoff in object recognition by humans and neural networks.

J Vis. 2025 Jan 2;25(1):4. doi: 10.1167/jov.25.1.4.

Teaching deep networks to see shape: Lessons from a simplified visual world.

PLoS Comput Biol. 2024 Nov 11;20(11):e1012019. doi: 10.1371/journal.pcbi.1012019. eCollection 2024 Nov.

本文引用的文献

Recurrence is required to capture the representational dynamics of the human visual system.

Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.

Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior.

Nat Neurosci. 2019 Jun;22(6):974-983. doi: 10.1038/s41593-019-0392-5. Epub 2019 Apr 29.

Recurrent computations for visual pattern completion.

Proc Natl Acad Sci U S A. 2018 Aug 28;115(35):8835-8840. doi: 10.1073/pnas.1719397115. Epub 2018 Aug 13.

Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.

Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition.

Front Psychol. 2017 Sep 12;8:1551. doi: 10.3389/fpsyg.2017.01551. eCollection 2017.

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.

Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.

Representational Distance Learning for Deep Neural Networks.

Front Comput Neurosci. 2016 Dec 27;10:131. doi: 10.3389/fncom.2016.00131. eCollection 2016.

Representational Dynamics of Facial Viewpoint Encoding.

J Cogn Neurosci. 2017 Apr;29(4):637-651. doi: 10.1162/jocn_a_01070. Epub 2016 Oct 28.

Extensive training leads to temporal and spatial shifts of cortical activity underlying visual category selectivity.

Neuroimage. 2016 Jul 1;134:22-34. doi: 10.1016/j.neuroimage.2016.03.066. Epub 2016 Apr 6.

Using goal-driven deep learning models to understand sensory cortex.

Nat Neurosci. 2016 Mar;19(3):356-65. doi: 10.1038/nn.4244.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

递归神经网络可以解释生物视觉中速度和精度的灵活交易。

Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译