Mechanisms of human dynamic object recognition revealed by sequential deep neural networks.

Affiliations

Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.

Amsterdam Brain & Cognition (ABC), University of Amsterdam, Amsterdam, Netherlands.

Publication information

PLoS Comput Biol. 2023 Jun 9;19(6):e1011169. doi: 10.1371/journal.pcbi.1011169. eCollection 2023 Jun.

DOI: 10.1371/journal.pcbi.1011169

PMID: 37294830

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10306191/

Abstract

Humans can quickly recognize objects in a dynamically changing world. This ability is showcased by the fact that observers succeed at recognizing objects in rapidly changing image sequences, at up to 13 ms/image. To date, the mechanisms that govern dynamic object recognition remain poorly understood. Here, we developed deep learning models for dynamic recognition and compared different computational mechanisms, contrasting feedforward and recurrent, single-image and sequential processing as well as different forms of adaptation. We found that only models that integrate images sequentially via lateral recurrence mirrored human performance (N = 36) and were predictive of trial-by-trial responses across image durations (13-80 ms/image). Importantly, models with sequential lateral-recurrent integration also captured how human performance changes as a function of image presentation durations, with models processing images for a few time steps capturing human object recognition at shorter presentation durations and models processing images for more time steps capturing human object recognition at longer presentation durations. Furthermore, augmenting such a recurrent model with adaptation markedly improved dynamic recognition performance and accelerated its representational dynamics, thereby predicting human trial-by-trial responses using fewer processing resources. Together, these findings provide new insights into the mechanisms rendering object recognition so fast and effective in a dynamic visual world.
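
The mechanism the abstract favors, sequential integration of images via lateral recurrence with optional adaptation, can be illustrated with a toy sketch. The NumPy code below is not the authors' model: the single-layer architecture, the function name `run_sequence`, the parameters `steps_per_image`, `alpha`, and `beta`, and the exponential suppression rule are all assumptions chosen for illustration.

```python
import numpy as np


def relu(x):
    """Rectified linear activation."""
    return np.maximum(x, 0.0)


def run_sequence(images, w_in, w_lat, steps_per_image=2, alpha=0.9, beta=0.0):
    """Run one lateral-recurrent layer over a sequence of images.

    images: array of shape (n_images, n_features), one feature vector per image
    w_in:   feedforward weights, shape (n_units, n_features)
    w_lat:  lateral (within-layer) recurrent weights, shape (n_units, n_units)
    steps_per_image: recurrent time steps spent on each image (hypothetical
        stand-in for presentation duration)
    alpha:  decay of the suppression state (assumed adaptation time constant)
    beta:   adaptation strength; 0 disables adaptation
    Returns activations of shape (n_images * steps_per_image, n_units).
    """
    n_units = w_in.shape[0]
    r = np.zeros(n_units)  # unit activations, carried across images
    s = np.zeros(n_units)  # suppression state tracking recent activity
    history = []
    for x in images:
        for _ in range(steps_per_image):
            # Suppression is a running average of each unit's past activation.
            s = alpha * s + (1.0 - alpha) * r
            # Current input plus lateral input from the previous time step,
            # reduced by the unit's own accumulated suppression.
            r = relu(w_in @ x + w_lat @ r - beta * s)
            history.append(r)
    return np.stack(history)


# Toy usage: 3 random "images" with 8 features, 4 units, 2 time steps per image.
rng = np.random.default_rng(0)
images = rng.random((3, 8))
w_in = rng.normal(scale=0.3, size=(4, 8))
w_lat = rng.normal(scale=0.1, size=(4, 4))
activations = run_sequence(images, w_in, w_lat, steps_per_image=2, beta=0.5)
print(activations.shape)  # (6, 4)
```

Because `r` is never reset between images, activity evoked by one image carries into the processing of the next, which is the sense of "sequential" used here. Setting `beta` above zero makes the response to a repeated input decline over time, a simple form of adaptation.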

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/0ae53d5f501e/pcbi.1011169.g001.jpg
