• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

特征失明:理解和建模视觉物体识别的挑战。

Feature blindness: A challenge for understanding and modelling visual object recognition.

机构信息

School of Psychological Sciences, University of Bristol, Bristol, United Kingdom.

出版信息

PLoS Comput Biol. 2022 May 13;18(5):e1009572. doi: 10.1371/journal.pcbi.1009572. eCollection 2022 May.

DOI:10.1371/journal.pcbi.1009572
PMID:35560155
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9132323/
Abstract

Humans rely heavily on the shape of objects to recognise them. Recently, it has been argued that Convolutional Neural Networks (CNNs) can also show a shape-bias, provided their learning environment contains this bias. This has led to the proposal that CNNs provide good mechanistic models of shape-bias and, more generally, human visual processing. However, it is also possible that humans and CNNs show a shape-bias for very different reasons, namely, shape-bias in humans may be a consequence of architectural and cognitive constraints whereas CNNs show a shape-bias as a consequence of learning the statistics of the environment. We investigated this question by exploring shape-bias in humans and CNNs when they learn in a novel environment. We observed that, in this new environment, humans (i) focused on shape and overlooked many non-shape features, even when non-shape features were more diagnostic, (ii) learned based on only one out of multiple predictive features, and (iii) failed to learn when global features, such as shape, were absent. This behaviour contrasted with the predictions of a statistical inference model with no priors, showing the strong role that shape-bias plays in human feature selection. It also contrasted with CNNs that (i) preferred to categorise objects based on non-shape features, and (ii) increased reliance on these non-shape features as they became more predictive. This was the case even when the CNN was pre-trained to have a shape-bias and the convolutional backbone was frozen. These results suggest that shape-bias has a different source in humans and CNNs: while learning in CNNs is driven by the statistical properties of the environment, humans are highly constrained by their previous biases, which suggests that cognitive constraints play a key role in how humans learn to recognise novel objects.

摘要

人类主要依靠物体的形状来识别物体。最近,有人认为卷积神经网络(CNN)也可以表现出形状偏向,只要它们的学习环境中存在这种偏向。这导致了一种观点,即 CNN 提供了形状偏向和更一般的人类视觉处理的良好机制模型。然而,人类和 CNN 表现出形状偏向的原因也可能非常不同,即人类的形状偏向可能是由于结构和认知限制所致,而 CNN 表现出形状偏向则是由于学习环境的统计数据所致。我们通过在新环境中探索人类和 CNN 的形状偏向来研究这个问题。我们观察到,在这种新环境中,人类(i)专注于形状而忽略了许多非形状特征,即使非形状特征更具诊断性,(ii)仅根据多个预测特征中的一个进行学习,(iii)当全局特征(例如形状)不存在时无法学习。这种行为与没有先验的统计推断模型的预测形成对比,表明形状偏向在人类特征选择中起着重要作用。它还与 CNN 形成对比,(i)更愿意基于非形状特征对物体进行分类,(ii)随着这些非形状特征变得更具预测性,它们对这些非形状特征的依赖度增加。即使 CNN 经过预训练具有形状偏向并且卷积骨干网络被冻结,也是如此。这些结果表明,人类和 CNN 中的形状偏向有不同的来源:在 CNN 中,学习是由环境的统计特性驱动的,而人类则受到其先前偏见的高度限制,这表明认知限制在人类学习识别新物体的方式中起着关键作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/fc5c3c9fa800/pcbi.1009572.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/01c0b5cca78e/pcbi.1009572.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/0b3114c35e65/pcbi.1009572.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/60f4b7183fbb/pcbi.1009572.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/d54494f602b2/pcbi.1009572.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/e7582a077169/pcbi.1009572.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/00ab0a3167c2/pcbi.1009572.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/fc5c3c9fa800/pcbi.1009572.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/01c0b5cca78e/pcbi.1009572.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/0b3114c35e65/pcbi.1009572.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/60f4b7183fbb/pcbi.1009572.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/d54494f602b2/pcbi.1009572.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/e7582a077169/pcbi.1009572.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/00ab0a3167c2/pcbi.1009572.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec32/9132323/fc5c3c9fa800/pcbi.1009572.g007.jpg

相似文献

1
Feature blindness: A challenge for understanding and modelling visual object recognition.特征失明:理解和建模视觉物体识别的挑战。
PLoS Comput Biol. 2022 May 13;18(5):e1009572. doi: 10.1371/journal.pcbi.1009572. eCollection 2022 May.
2
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.
3
A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations.一种新颖的特征混淆方法揭示了卷积神经网络学习空间关系的能力。
Neural Netw. 2023 Oct;167:400-414. doi: 10.1016/j.neunet.2023.08.021. Epub 2023 Aug 18.
4
Hiding a plane with a pixel: examining shape-bias in CNNs and the benefit of building in biological constraints.用像素隐藏飞机:研究 CNN 中的形状偏差和构建生物约束的好处。
Vision Res. 2020 Sep;174:57-68. doi: 10.1016/j.visres.2020.04.013. Epub 2020 Jun 28.
5
Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.检查人类枕颞叶皮层和卷积神经网络中对象身份和非身份特征的编码强度。
J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.
6
Transfer of Learning in the Convolutional Neural Networks on Classifying Geometric Shapes Based on Local or Global Invariants.基于局部或全局不变量的卷积神经网络在几何形状分类中的学习迁移
Front Comput Neurosci. 2021 Feb 19;15:637144. doi: 10.3389/fncom.2021.637144. eCollection 2021.
7
Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.通过在卷积神经网络中加入对模糊的鲁棒性来改进对人类视觉的建模。
Nat Commun. 2024 Mar 5;15(1):1989. doi: 10.1038/s41467-024-45679-0.
8
From photos to sketches - how humans and deep neural networks process objects across different levels of visual abstraction.从照片到素描——人类和深度神经网络如何在不同层次的视觉抽象中处理对象。
J Vis. 2022 Feb 1;22(2):4. doi: 10.1167/jov.22.2.4.
9
Human shape representations are not an emergent property of learning to classify objects.人形表示不是学习分类对象的涌现属性。
J Exp Psychol Gen. 2023 Dec;152(12):3380-3402. doi: 10.1037/xge0001440. Epub 2023 Sep 11.
10
Do Humans and Deep Convolutional Neural Networks Use Visual Information Similarly for the Categorization of Natural Scenes?人类和深度卷积神经网络是否以类似的方式使用视觉信息对自然场景进行分类?
Cogn Sci. 2021 Jun;45(6):e13009. doi: 10.1111/cogs.13009.

引用本文的文献

1
An image-computable model of speeded decision-making.一种快速决策的图像可计算模型。
Elife. 2025 Feb 28;13:RP98351. doi: 10.7554/eLife.98351.
2
Teaching deep networks to see shape: Lessons from a simplified visual world.教深度网络识别形状:简化视觉世界的经验教训。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012019. doi: 10.1371/journal.pcbi.1012019. eCollection 2024 Nov.

本文引用的文献

1
Controlling the spatial dimensions of visual stimuli in online experiments.在线实验中视觉刺激的空间维度控制。
J Vis. 2021 Aug 2;21(8):19. doi: 10.1167/jov.21.8.19.
2
An ecologically motivated image dataset for deep learning yields better models of human vision.一个受生态学启发的图像数据集,可用于深度学习,从而更好地模拟人类视觉。
Proc Natl Acad Sci U S A. 2021 Feb 23;118(8). doi: 10.1073/pnas.2011417118.
3
What do adversarial images tell us about human vision?对抗图像能告诉我们人类视觉什么?
Elife. 2020 Sep 2;9:e55978. doi: 10.7554/eLife.55978.
4
Hiding a plane with a pixel: examining shape-bias in CNNs and the benefit of building in biological constraints.用像素隐藏飞机:研究 CNN 中的形状偏差和构建生物约束的好处。
Vision Res. 2020 Sep;174:57-68. doi: 10.1016/j.visres.2020.04.013. Epub 2020 Jun 28.
5
Deep learning and cognitive science.深度学习与认知科学。
Cognition. 2020 Oct;203:104365. doi: 10.1016/j.cognition.2020.104365. Epub 2020 Jun 17.
6
A deep learning framework for neuroscience.深度学习在神经科学中的应用框架。
Nat Neurosci. 2019 Nov;22(11):1761-1770. doi: 10.1038/s41593-019-0520-2. Epub 2019 Oct 28.
7
Deep Learning: The Good, the Bad, and the Ugly.深度学习:好的、坏的和丑的。
Annu Rev Vis Sci. 2019 Sep 15;5:399-426. doi: 10.1146/annurev-vision-091718-014951. Epub 2019 Aug 8.
8
A new framework for understanding vision from the perspective of the primary visual cortex.从初级视皮层的角度理解视觉的新框架。
Curr Opin Neurobiol. 2019 Oct;58:1-10. doi: 10.1016/j.conb.2019.06.001. Epub 2019 Jul 1.
9
Mental labour.脑力劳动。
Nat Hum Behav. 2018 Dec;2(12):899-908. doi: 10.1038/s41562-018-0401-9. Epub 2018 Sep 3.
10
Deep Neural Networks as Scientific Models.深度神经网络作为科学模型。
Trends Cogn Sci. 2019 Apr;23(4):305-317. doi: 10.1016/j.tics.2019.01.009. Epub 2019 Feb 19.