视觉中的杂讯中有个“U”：人类视觉中杂讯容忍背后强大稀疏编码的证据。

There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.

作者信息

Cox Patrick H, Riesenhuber Maximilian

机构信息

Department of Neuroscience, Georgetown University Medical Center, Washington, DC 20007.

Department of Neuroscience, Georgetown University Medical Center, Washington, DC 20007

出版信息

J Neurosci. 2015 Oct 21;35(42):14148-59. doi: 10.1523/JNEUROSCI.1211-15.2015.

DOI:10.1523/JNEUROSCI.1211-15.2015

PMID:26490856

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4683683/

Abstract

UNLABELLED

The ability to recognize objects in clutter is crucial for human vision, yet the underlying neural computations remain poorly understood. Previous single-unit electrophysiology recordings in inferotemporal cortex in monkeys and fMRI studies of object-selective cortex in humans have shown that the responses to pairs of objects can sometimes be well described as a weighted average of the responses to the constituent objects. Yet, from a computational standpoint, it is not clear how the challenge of object recognition in clutter can be solved if downstream areas must disentangle the identity of an unknown number of individual objects from the confounded average neuronal responses. An alternative idea is that recognition is based on a subpopulation of neurons that are robust to clutter, i.e., that do not show response averaging, but rather robust object-selective responses in the presence of clutter. Here we show that simulations using the HMAX model of object recognition in cortex can fit the aforementioned single-unit and fMRI data, showing that the averaging-like responses can be understood as the result of responses of object-selective neurons to suboptimal stimuli. Moreover, the model shows how object recognition can be achieved by a sparse readout of neurons whose selectivity is robust to clutter. Finally, the model provides a novel prediction about human object recognition performance, namely, that target recognition ability should show a U-shaped dependency on the similarity of simultaneously presented clutter objects. This prediction is confirmed experimentally, supporting a simple, unifying model of how the brain performs object recognition in clutter.

SIGNIFICANCE STATEMENT

The neural mechanisms underlying object recognition in cluttered scenes (i.e., containing more than one object) remain poorly understood. Studies have suggested that neural responses to multiple objects correspond to an average of the responses to the constituent objects. Yet, it is unclear how the identities of an unknown number of objects could be disentangled from a confounded average response. Here, we use a popular computational biological vision model to show that averaging-like responses can result from responses of clutter-tolerant neurons to suboptimal stimuli. The model also provides a novel prediction, that human detection ability should show a U-shaped dependency on target-clutter similarity, which is confirmed experimentally, supporting a simple, unifying account of how the brain performs object recognition in clutter.

摘要

未标注

在杂乱环境中识别物体的能力对人类视觉至关重要，但其潜在的神经计算仍知之甚少。先前在猴子颞下皮质进行的单神经元电生理记录以及对人类物体选择性皮质的功能磁共振成像研究表明，对成对物体的反应有时可以很好地描述为对组成物体反应的加权平均值。然而，从计算的角度来看，如果下游区域必须从混淆的平均神经元反应中解开未知数量的单个物体的身份，那么尚不清楚如何解决杂乱环境中物体识别的挑战。另一种观点是，识别基于对杂乱具有鲁棒性的神经元亚群，即，在存在杂乱的情况下不显示反应平均，而是显示鲁棒的物体选择性反应。在这里，我们表明，使用皮质中物体识别的HMAX模型进行的模拟可以拟合上述单神经元和功能磁共振成像数据，表明类似平均的反应可以理解为物体选择性神经元对次优刺激反应的结果。此外，该模型展示了如何通过对选择性对杂乱具有鲁棒性的神经元进行稀疏读出实现物体识别。最后，该模型对人类物体识别性能做出了一个新的预测，即目标识别能力应该对同时呈现的杂乱物体的相似度呈现U形依赖。这一预测得到了实验证实，支持了一个关于大脑如何在杂乱环境中进行物体识别的简单统一模型。

意义声明

杂乱场景（即包含多个物体）中物体识别的神经机制仍知之甚少。研究表明，对多个物体的神经反应对应于对组成物体反应的平均值。然而，尚不清楚如何从混淆的平均反应中解开未知数量物体的身份。在这里，我们使用一个流行的计算生物视觉模型来表明，类似平均的反应可能是耐杂乱神经元对次优刺激反应的结果。该模型还提供了一个新的预测，即人类检测能力应该对目标 - 杂乱相似度呈现U形依赖，这一预测得到了实验证实，支持了一个关于大脑如何在杂乱环境中进行物体识别的简单统一解释。

相似文献

There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.

J Neurosci. 2015 Oct 21;35(42):14148-59. doi: 10.1523/JNEUROSCI.1211-15.2015.

Clutter modulates the representation of target objects in the human occipitotemporal cortex.

J Cogn Neurosci. 2014 Mar;26(3):490-500. doi: 10.1162/jocn_a_00505. Epub 2013 Oct 21.

What response properties do individual neurons need to underlie position and clutter "invariant" object recognition?

J Neurophysiol. 2009 Jul;102(1):360-76. doi: 10.1152/jn.90745.2008. Epub 2009 May 13.

Category selectivity in the ventral visual pathway confers robustness to clutter and diverted attention.

Curr Biol. 2007 Dec 4;17(23):2067-72. doi: 10.1016/j.cub.2007.10.043. Epub 2007 Nov 8.

Multiple object response normalization in monkey inferotemporal cortex.

J Neurosci. 2005 Sep 7;25(36):8150-64. doi: 10.1523/JNEUROSCI.2058-05.2005.

Representation of contextually related multiple objects in the human ventral visual pathway.

J Cogn Neurosci. 2013 Aug;25(8):1261-9. doi: 10.1162/jocn_a_00406. Epub 2013 Apr 22.

Interaction between Scene and Object Processing Revealed by Human fMRI and MEG Decoding.

J Neurosci. 2017 Aug 9;37(32):7700-7710. doi: 10.1523/JNEUROSCI.0582-17.2017. Epub 2017 Jul 7.

Invariant visual object recognition: biologically plausible approaches.

Biol Cybern. 2015 Oct;109(4-5):505-35. doi: 10.1007/s00422-015-0658-2. Epub 2015 Sep 3.

Unsupervised changes in core object recognition behavior are predicted by neural plasticity in inferior temporal cortex.

Elife. 2021 Jun 11;10:e60830. doi: 10.7554/eLife.60830.

Suppressed Sensory Response to Predictable Object Stimuli throughout the Ventral Visual Stream.

J Neurosci. 2018 Aug 22;38(34):7452-7461. doi: 10.1523/JNEUROSCI.3421-17.2018. Epub 2018 Jul 20.

引用本文的文献

Unraveling the complexity of rat object vision requires a full convolutional network and beyond.

Patterns (N Y). 2025 Jan 17;6(2):101149. doi: 10.1016/j.patter.2024.101149. eCollection 2025 Feb 14.

On the ability of standard and brain-constrained deep neural networks to support cognitive superposition: a position paper.

Cogn Neurodyn. 2024 Dec;18(6):3383-3400. doi: 10.1007/s11571-023-10061-1. Epub 2024 Feb 4.

How the mind sees the world.

Nat Hum Behav. 2020 Nov;4(11):1100-1101. doi: 10.1038/s41562-020-00973-x.

Nonlinear Processing of Shape Information in Rat Lateral Extrastriate Cortex.

J Neurosci. 2019 Feb 27;39(9):1649-1670. doi: 10.1523/JNEUROSCI.1938-18.2018. Epub 2019 Jan 7.

Representation of multiple objects in macaque category-selective areas.

Nat Commun. 2018 May 2;9(1):1774. doi: 10.1038/s41467-018-04126-7.

Perceptual category learning and visual processing: An exercise in computational cognitive neuroscience.

Neural Netw. 2017 May;89:31-38. doi: 10.1016/j.neunet.2017.02.010. Epub 2017 Mar 6.

Can (should) theories of crowding be unified?

J Vis. 2016 Dec 1;16(15):10. doi: 10.1167/16.15.10.

本文引用的文献

Neural population coding of multiple stimuli.

J Neurosci. 2015 Mar 4;35(9):3825-41. doi: 10.1523/JNEUROSCI.4097-14.2015.

Face features and face configurations both contribute to visual crowding.

Atten Percept Psychophys. 2015 Feb;77(2):508-19. doi: 10.3758/s13414-014-0786-0.

Large-scale, high-resolution neurophysiological maps underlying FMRI of macaque temporal lobe.

J Neurosci. 2013 Sep 18;33(38):15207-19. doi: 10.1523/JNEUROSCI.1248-13.2013.

The distributed representation of random and meaningful object pairs in human occipitotemporal cortex: the weighted average as a general rule.

Neuroimage. 2013 Apr 15;70:37-47. doi: 10.1016/j.neuroimage.2012.12.023. Epub 2012 Dec 22.

Constructing scenes from objects in human occipitotemporal cortex.

Nat Neurosci. 2011 Sep 4;14(10):1323-9. doi: 10.1038/nn.2903.

Probabilistic, positional averaging predicts object-level crowding effects with letter-like stimuli.

J Vis. 2010 Aug 1;10(10):14. doi: 10.1167/10.10.14.

Robust selectivity to two-object images in human visual cortex.

Curr Biol. 2010 May 11;20(9):872-9. doi: 10.1016/j.cub.2010.03.050. Epub 2010 Apr 22.

Crowding changes appearance.

Curr Biol. 2010 Mar 23;20(6):496-501. doi: 10.1016/j.cub.2010.01.023. Epub 2010 Mar 4.

A summary-statistic representation in peripheral vision explains visual crowding.

J Vis. 2009 Nov 19;9(12):13.1-18. doi: 10.1167/9.12.13.

Attention and biased competition in multi-voxel object representations.

Proc Natl Acad Sci U S A. 2009 Dec 15;106(50):21447-52. doi: 10.1073/pnas.0907330106. Epub 2009 Dec 1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

视觉中的杂讯中有个“U”：人类视觉中杂讯容忍背后强大稀疏编码的证据。

There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.

作者信息

Cox Patrick H, Riesenhuber Maximilian

机构信息

Department of Neuroscience, Georgetown University Medical Center, Washington, DC 20007.

Department of Neuroscience, Georgetown University Medical Center, Washington, DC 20007

出版信息

J Neurosci. 2015 Oct 21;35(42):14148-59. doi: 10.1523/JNEUROSCI.1211-15.2015.

DOI:10.1523/JNEUROSCI.1211-15.2015

PMID:26490856

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4683683/

Abstract

UNLABELLED

SIGNIFICANCE STATEMENT

摘要

视觉中的杂讯中有个“U”：人类视觉中杂讯容忍背后强大稀疏编码的证据。

There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.

作者信息

机构信息

出版信息

UNLABELLED

SIGNIFICANCE STATEMENT

未标注

意义声明

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

视觉中的杂讯中有个“U”：人类视觉中杂讯容忍背后强大稀疏编码的证据。

There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.

作者信息

机构信息

出版信息

UNLABELLED

SIGNIFICANCE STATEMENT

未标注

意义声明

相似文献

引用本文的文献

本文引用的文献