• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序贯深度神经网络揭示的人类动态物体识别机制

Mechanisms of human dynamic object recognition revealed by sequential deep neural networks.

机构信息

Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.

Amsterdam Brain & Cognition (ABC), University of Amsterdam, Amsterdam, Netherlands.

出版信息

PLoS Comput Biol. 2023 Jun 9;19(6):e1011169. doi: 10.1371/journal.pcbi.1011169. eCollection 2023 Jun.

DOI:10.1371/journal.pcbi.1011169
PMID:37294830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10306191/
Abstract

Humans can quickly recognize objects in a dynamically changing world. This ability is showcased by the fact that observers succeed at recognizing objects in rapidly changing image sequences, at up to 13 ms/image. To date, the mechanisms that govern dynamic object recognition remain poorly understood. Here, we developed deep learning models for dynamic recognition and compared different computational mechanisms, contrasting feedforward and recurrent, single-image and sequential processing as well as different forms of adaptation. We found that only models that integrate images sequentially via lateral recurrence mirrored human performance (N = 36) and were predictive of trial-by-trial responses across image durations (13-80 ms/image). Importantly, models with sequential lateral-recurrent integration also captured how human performance changes as a function of image presentation durations, with models processing images for a few time steps capturing human object recognition at shorter presentation durations and models processing images for more time steps capturing human object recognition at longer presentation durations. Furthermore, augmenting such a recurrent model with adaptation markedly improved dynamic recognition performance and accelerated its representational dynamics, thereby predicting human trial-by-trial responses using fewer processing resources. Together, these findings provide new insights into the mechanisms rendering object recognition so fast and effective in a dynamic visual world.

摘要

人类能够快速识别动态变化世界中的物体。这一能力体现在观察者能够成功识别快速变化的图像序列中的物体,其速度可达每幅图像 13 毫秒。迄今为止,动态物体识别的机制仍未被充分理解。在这里,我们开发了用于动态识别的深度学习模型,并比较了不同的计算机制,包括前馈和循环、单图像和序列处理以及不同形式的适应。我们发现,只有通过侧向循环序列地整合图像的模型才能模拟人类的表现(N=36),并且能够预测跨图像持续时间的逐次反应(13-80 毫秒/图像)。重要的是,具有序列侧向循环整合的模型还捕捉到了人类表现如何随图像呈现持续时间而变化,其中处理几个时间步长的模型在较短的呈现持续时间内捕获人类物体识别,而处理更多时间步长的模型在较长的呈现持续时间内捕获人类物体识别。此外,通过适应来增强这种循环模型显著提高了动态识别性能,并加速了其表示动态,从而使用更少的处理资源来预测人类逐次反应。总之,这些发现为在动态视觉世界中使物体识别如此快速和有效的机制提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/cac14186f622/pcbi.1011169.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/0ae53d5f501e/pcbi.1011169.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/e2dab62d4acd/pcbi.1011169.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/8907ca621a64/pcbi.1011169.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/e73203941f9c/pcbi.1011169.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/cc52a165f96d/pcbi.1011169.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/410911b3d090/pcbi.1011169.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/cac14186f622/pcbi.1011169.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/0ae53d5f501e/pcbi.1011169.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/e2dab62d4acd/pcbi.1011169.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/8907ca621a64/pcbi.1011169.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/e73203941f9c/pcbi.1011169.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/cc52a165f96d/pcbi.1011169.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/410911b3d090/pcbi.1011169.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dc0/10306191/cac14186f622/pcbi.1011169.g007.jpg

相似文献

1
Mechanisms of human dynamic object recognition revealed by sequential deep neural networks.基于序贯深度神经网络揭示的人类动态物体识别机制
PLoS Comput Biol. 2023 Jun 9;19(6):e1011169. doi: 10.1371/journal.pcbi.1011169. eCollection 2023 Jun.
2
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
3
Capturing the objects of vision with neural networks.用神经网络捕捉视觉对象。
Nat Hum Behav. 2021 Sep;5(9):1127-1144. doi: 10.1038/s41562-021-01194-6. Epub 2021 Sep 20.
4
Beyond core object recognition: Recurrent processes account for object recognition under occlusion.超越核心物体识别:递归过程解释了遮挡下的物体识别。
PLoS Comput Biol. 2019 May 15;15(5):e1007001. doi: 10.1371/journal.pcbi.1007001. eCollection 2019 May.
5
Recurrent Connections in the Primate Ventral Visual Stream Mediate a Trade-Off Between Task Performance and Network Size During Core Object Recognition.灵长类动物腹侧视觉流中的重复连接在核心物体识别过程中,在任务表现和网络规模之间进行权衡。
Neural Comput. 2022 Jul 14;34(8):1652-1675. doi: 10.1162/neco_a_01506.
6
Visual Object Recognition: Do We (Finally) Know More Now Than We Did?视觉物体识别:我们(终于)比以前知道得更多了吗?
Annu Rev Vis Sci. 2016 Oct 14;2:377-396. doi: 10.1146/annurev-vision-111815-114621. Epub 2016 Aug 3.
7
The attentive reconstruction of objects facilitates robust object recognition.对物体的细致重建有助于实现可靠的物体识别。
PLoS Comput Biol. 2024 Jun 13;20(6):e1012159. doi: 10.1371/journal.pcbi.1012159. eCollection 2024 Jun.
8
Common Object Representations for Visual Production and Recognition.常见的视觉生成和识别的目标表示。
Cogn Sci. 2018 Nov;42(8):2670-2698. doi: 10.1111/cogs.12676. Epub 2018 Aug 20.
9
The Spatiotemporal Neural Dynamics of Object Recognition for Natural Images and Line Drawings.自然图像和线条图的对象识别的时空神经动力学。
J Neurosci. 2023 Jan 18;43(3):484-500. doi: 10.1523/JNEUROSCI.1546-22.2022. Epub 2022 Dec 19.
10
Comparison of Object Recognition Behavior in Human and Monkey.人类与猴子物体识别行为的比较
J Neurosci. 2015 Sep 2;35(35):12127-36. doi: 10.1523/JNEUROSCI.0573-15.2015.

引用本文的文献

1
Maintenance and transformation of representational formats during working memory prioritization.工作记忆优先化过程中表象格式的维持和转换。
Nat Commun. 2024 Sep 19;15(1):8234. doi: 10.1038/s41467-024-52541-w.
2
Volleyball training video classification description using the BiLSTM fusion attention mechanism.基于双向长短期记忆融合注意力机制的排球训练视频分类描述
Heliyon. 2024 Jul 16;10(15):e34735. doi: 10.1016/j.heliyon.2024.e34735. eCollection 2024 Aug 15.
3
Feature binding is slow: Temporal integration explains apparent ultrafast binding.

本文引用的文献

1
The neuroconnectionist research programme.神经连接主义研究计划。
Nat Rev Neurosci. 2023 Jul;24(7):431-450. doi: 10.1038/s41583-023-00705-w. Epub 2023 May 30.
2
Neural representational geometry underlies few-shot concept learning.神经表象几何是少样本概念学习的基础。
Proc Natl Acad Sci U S A. 2022 Oct 25;119(43):e2200800119. doi: 10.1073/pnas.2200800119. Epub 2022 Oct 17.
3
On the Necessity of Recurrent Processing during Object Recognition: It Depends on the Need for Scene Segmentation.论物体识别过程中循环处理的必要性:这取决于场景分割的需求。
特征绑定很慢:时间整合解释了明显的超快绑定。
J Vis. 2024 Aug 1;24(8):3. doi: 10.1167/jov.24.8.3.
4
Memorability shapes perceived time (and vice versa).易记性塑造感知时间(反之亦然)。
Nat Hum Behav. 2024 Jul;8(7):1296-1308. doi: 10.1038/s41562-024-01863-2. Epub 2024 Apr 22.
5
Empirically Identifying and Computationally Modeling the Brain-Behavior Relationship for Human Scene Categorization.从经验上识别和计算人类场景分类的大脑-行为关系。
J Cogn Neurosci. 2023 Nov 1;35(11):1879-1897. doi: 10.1162/jocn_a_02043.
J Neurosci. 2021 Jul 21;41(29):6281-6289. doi: 10.1523/JNEUROSCI.2851-20.2021.
4
An ecologically motivated image dataset for deep learning yields better models of human vision.一个受生态学启发的图像数据集,可用于深度学习,从而更好地模拟人类视觉。
Proc Natl Acad Sci U S A. 2021 Feb 23;118(8). doi: 10.1073/pnas.2011417118.
5
Going in circles is the way forward: the role of recurrence in visual inference.兜圈子是前进的道路:在视觉推理中再现的作用。
Curr Opin Neurobiol. 2020 Dec;65:176-193. doi: 10.1016/j.conb.2020.11.009. Epub 2020 Dec 3.
6
Capturing human categorization of natural images by combining deep networks and cognitive models.通过将深度网络和认知模型相结合来捕捉人类对自然图像的分类。
Nat Commun. 2020 Oct 27;11(1):5418. doi: 10.1038/s41467-020-18946-z.
7
Fast Recurrent Processing via Ventrolateral Prefrontal Cortex Is Needed by the Primate Ventral Stream for Robust Core Visual Object Recognition.腹外侧前额叶皮层的快速重复处理是灵长类动物腹侧流进行稳健核心视觉物体识别所必需的。
Neuron. 2021 Jan 6;109(1):164-176.e5. doi: 10.1016/j.neuron.2020.09.035. Epub 2020 Oct 19.
8
Incorporating intrinsic suppression in deep neural networks captures dynamics of adaptation in neurophysiology and perception.将内在抑制纳入深度神经网络可捕捉神经生理学和感知中的适应动态。
Sci Adv. 2020 Oct 14;6(42). doi: 10.1126/sciadv.abd4205. Print 2020 Oct.
9
Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision.递归神经网络可以解释生物视觉中速度和精度的灵活交易。
PLoS Comput Biol. 2020 Oct 2;16(10):e1008215. doi: 10.1371/journal.pcbi.1008215. eCollection 2020 Oct.
10
Array programming with NumPy.使用 NumPy 进行数组编程。
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.