Visual number sense for real-world scenes shared by deep neural networks and humans.

Authors

Wencheng Wu, Ge Yingxi, Zuo Zhentao, Chen Lin, Qin Xu, Zuxiang Liu

Affiliations

AHU-IAI AI Joint Laboratory, Anhui University, Hefei, 230601, China.

Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China.

Publication details

Heliyon. 2023 Jul 24;9(8):e18517. doi: 10.1016/j.heliyon.2023.e18517. eCollection 2023 Aug.

DOI:10.1016/j.heliyon.2023.e18517

PMID:37560656

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10407052/

Abstract

Recently, visual number sense has been identified from deep neural networks (DNNs). However, whether DNNs have the same capacity for real-world scenes, rather than the simple geometric figures that are often tested, is unclear. In this study, we explore the number perception of scenes using AlexNet and find that numerosity can be represented by the pattern of group activation of the category layer units. The global activation of these units increases with the number of objects in the scene, and the variations in their activation decrease accordingly. By decoding the numerosity from this pattern, we reveal that the embedding coefficient of a scene determines the likelihood of potential objects to contribute to numerical perception. This was demonstrated by the more optimized performance for pictures with relatively high embedding coefficients in both DNNs and humans. This study for the first time shows that a distinct feature in visual environments, revealed by DNNs, can modulate human perception, supported by a group-coding mechanism.
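
The group-coding idea in the abstract can be illustrated with a toy sketch. The abstract does not specify the decoder or the exact measure of variation, so everything below is an assumption made for illustration: the activations are synthetic stand-ins rather than AlexNet outputs, the helper names (`summarize`, `fit_linear_decoder`, `toy_activations`) are hypothetical, and the readout is a plain one-feature least-squares fit rather than the authors' method.

```python
import random
import statistics


def summarize(activations):
    """Return (mean, population standard deviation) across the units of one layer."""
    return statistics.fmean(activations), statistics.pstdev(activations)


def fit_linear_decoder(features, targets):
    """Ordinary least squares for targets ~ slope * feature + intercept."""
    mean_x = statistics.fmean(features)
    mean_y = statistics.fmean(targets)
    sxx = sum((x - mean_x) ** 2 for x in features)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(features, targets))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def toy_activations(n_objects, rng, n_units=1000):
    """Synthetic stand-in for category-layer activations (NOT real network output).

    Constructed by assumption so that the mean rises and the spread falls
    as n_objects grows, mirroring the trend described in the abstract.
    """
    return [rng.gauss(0.5 * n_objects, 1.0 / n_objects) for _ in range(n_units)]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
numerosities = [1, 2, 3, 4, 5, 6]

# One (global activation, variation) pair per toy "scene".
summaries = [summarize(toy_activations(n, rng)) for n in numerosities]
means = [m for m, _ in summaries]
spreads = [s for _, s in summaries]

# Read numerosity out of the global activation alone.
slope, intercept = fit_linear_decoder(means, numerosities)
decoded = [slope * m + intercept for m in means]
```

With real data, `toy_activations` would be replaced by the category-layer output of a pretrained network for each scene image, and the readout would be evaluated on held-out scenes rather than on the scenes used to fit it.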

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63e3/10407052/aa5cf8d5567e/gr1.jpg
