增加神经网络的鲁棒性可以提高与猕猴 V1 本征谱、空间频率偏好和预测能力的匹配度。

Increasing neural network robustness improves match to macaque V1 eigenspectrum, spatial frequency preference and predictivity.

机构信息

Department of Psychology, Stanford University, Stanford, California, United States of America.

Wu Tsai Neurosciences Institute, Stanford University, Stanford, California, United States of America.

出版信息

PLoS Comput Biol. 2022 Jan 7;18(1):e1009739. doi: 10.1371/journal.pcbi.1009739. eCollection 2022 Jan.

DOI:10.1371/journal.pcbi.1009739

PMID:34995280

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8775238/

Abstract

Task-optimized convolutional neural networks (CNNs) show striking similarities to the ventral visual stream. However, human-imperceptible image perturbations can cause a CNN to make incorrect predictions. Here we provide insight into this brittleness by investigating the representations of models that are either robust or not robust to image perturbations. Theory suggests that the robustness of a system to these perturbations could be related to the power law exponent of the eigenspectrum of its set of neural responses, where power law exponents closer to and larger than one would indicate a system that is less susceptible to input perturbations. We show that neural responses in mouse and macaque primary visual cortex (V1) obey the predictions of this theory, where their eigenspectra have power law exponents of at least one. We also find that the eigenspectra of model representations decay slowly relative to those observed in neurophysiology and that robust models have eigenspectra that decay slightly faster and have higher power law exponents than those of non-robust models. The slow decay of the eigenspectra suggests that substantial variance in the model responses is related to the encoding of fine stimulus features. We therefore investigated the spatial frequency tuning of artificial neurons and found that a large proportion of them preferred high spatial frequencies and that robust models had preferred spatial frequency distributions more aligned with the measured spatial frequency distribution of macaque V1 cells. Furthermore, robust models were quantitatively better models of V1 than non-robust models. Our results are consistent with other findings that there is a misalignment between human and machine perception. They also suggest that it may be useful to penalize slow-decaying eigenspectra or to bias models to extract features of lower spatial frequencies during task-optimization in order to improve robustness and V1 neural response predictivity.

摘要

任务优化卷积神经网络（CNNs）与腹侧视觉流表现出惊人的相似性。然而，人类无法察觉的图像干扰会导致 CNN 做出错误的预测。通过研究对图像干扰具有鲁棒性或不具有鲁棒性的模型的表示，我们深入了解了这种脆弱性。理论表明，系统对这些干扰的鲁棒性可能与神经网络响应集合的特征谱的幂律指数有关，其中幂律指数更接近且大于一的系统对输入干扰的敏感性更低。我们表明，老鼠和猕猴初级视觉皮层（V1）中的神经反应服从该理论的预测，其特征谱的幂律指数至少为一。我们还发现，模型表示的特征谱的衰减速度比神经生理学观察到的要慢，并且鲁棒模型的特征谱衰减速度略快，幂律指数高于非鲁棒模型。特征谱的缓慢衰减表明，模型响应中的大量方差与精细刺激特征的编码有关。因此，我们研究了人工神经元的空间频率调谐，发现它们中的很大一部分更喜欢高空间频率，并且鲁棒模型的空间频率分布更接近猕猴 V1 细胞的测量空间频率分布。此外，鲁棒模型是比非鲁棒模型更好的 V1 模型。我们的结果与其他发现一致，即人类和机器感知之间存在不匹配。它们还表明，在任务优化过程中，可以通过惩罚缓慢衰减的特征谱或偏向模型提取较低空间频率的特征来提高鲁棒性和 V1 神经反应可预测性，这可能是有用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0bd/8775238/c48f5d167f45/pcbi.1009739.g001.jpg

相似文献

Increasing neural network robustness improves match to macaque V1 eigenspectrum, spatial frequency preference and predictivity.增加神经网络的鲁棒性可以提高与猕猴 V1 本征谱、空间频率偏好和预测能力的匹配度。

PLoS Comput Biol. 2022 Jan 7;18(1):e1009739. doi: 10.1371/journal.pcbi.1009739. eCollection 2022 Jan.

Deep convolutional models improve predictions of macaque V1 responses to natural images.深度卷积模型提高了猕猴 V1 对自然图像反应的预测。

PLoS Comput Biol. 2019 Apr 23;15(4):e1006897. doi: 10.1371/journal.pcbi.1006897. eCollection 2019 Apr.

Convolutional neural network models applied to neuronal responses in macaque V1 reveal limited nonlinear processing.卷积神经网络模型应用于猕猴 V1 的神经元反应揭示了有限的非线性处理。

J Vis. 2024 Jun 3;24(6):1. doi: 10.1167/jov.24.6.1.

Dynamics of spatial frequency tuning in macaque V1.猕猴初级视皮层中空间频率调谐的动力学

J Neurosci. 2002 Mar 1;22(5):1976-84. doi: 10.1523/JNEUROSCI.22-05-01976.2002.

Convolutional neural network models of V1 responses to complex patterns.V1对复杂模式反应的卷积神经网络模型。

J Comput Neurosci. 2019 Feb;46(1):33-54. doi: 10.1007/s10827-018-0687-7. Epub 2018 Jun 5.

A theory of direction selectivity for macaque primary visual cortex.猴初级视皮层方向选择性的理论。

Proc Natl Acad Sci U S A. 2021 Aug 10;118(32). doi: 10.1073/pnas.2105062118.

A neuronal network model of primary visual cortex explains spatial frequency selectivity.初级视觉皮层的神经网络模型解释了空间频率选择性。

J Comput Neurosci. 2009 Apr;26(2):271-87. doi: 10.1007/s10827-008-0110-x. Epub 2008 Jul 31.

Anatomy and Physiology of Macaque Visual Cortical Areas V1, V2, and V5/MT: Bases for Biologically Realistic Models.猕猴视皮质区 V1、V2 和 V5/MT 的解剖和生理学：生物逼真模型的基础。

Cereb Cortex. 2020 May 18;30(6):3483-3517. doi: 10.1093/cercor/bhz322.

Spatial and temporal frequency selectivity of neurones in visual cortical areas V1 and V2 of the macaque monkey.猕猴视觉皮层V1和V2区神经元的空间和时间频率选择性

J Physiol. 1985 Aug;365:331-63. doi: 10.1113/jphysiol.1985.sp015776.

Complexity and diversity in sparse code priors improve receptive field characterization of Macaque V1 neurons.稀疏编码先验的复杂性和多样性提高了猕猴 V1 神经元感受野的表征。

PLoS Comput Biol. 2021 Oct 25;17(10):e1009528. doi: 10.1371/journal.pcbi.1009528. eCollection 2021 Oct.

引用本文的文献

Butterfly effects in perceptual development: a review of the 'adaptive initial degradation' hypothesis.知觉发展中的蝴蝶效应：“适应性初始退化”假说综述

Dev Rev. 2024 Mar;71. doi: 10.1016/j.dr.2024.101117. Epub 2024 Jan 19.

A simplified minimodel of visual cortical neurons.视觉皮层神经元的简化微模型。

Nat Commun. 2025 Jul 1;16(1):5724. doi: 10.1038/s41467-025-61171-9.

The cortical critical power law balances energy and information in an optimal fashion.皮质临界功率定律以最优方式平衡能量和信息。

Proc Natl Acad Sci U S A. 2025 May 27;122(21):e2418218122. doi: 10.1073/pnas.2418218122. Epub 2025 May 23.

J Vis. 2024 Jun 3;24(6):1. doi: 10.1167/jov.24.6.1.

How well do models of visual cortex generalize to out of distribution samples?视觉皮层模型对分布外样本的泛化能力如何？

PLoS Comput Biol. 2024 May 31;20(5):e1011145. doi: 10.1371/journal.pcbi.1011145. eCollection 2024 May.

Diverse task-driven modeling of macaque V4 reveals functional specialization towards semantic tasks.猴 V4 的多样化任务驱动建模揭示了其在语义任务上的功能专业化。

PLoS Comput Biol. 2024 May 23;20(5):e1012056. doi: 10.1371/journal.pcbi.1012056. eCollection 2024 May.

A unifying framework for functional organization in early and higher ventral visual cortex.早期和高级腹侧视觉皮层功能组织的统一框架。

Neuron. 2024 Jul 17;112(14):2435-2451.e7. doi: 10.1016/j.neuron.2024.04.018. Epub 2024 May 10.

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.通过在卷积神经网络中加入对模糊的鲁棒性来改进对人类视觉的建模。

Nat Commun. 2024 Mar 5;15(1):1989. doi: 10.1038/s41467-024-45679-0.

Model metamers reveal divergent invariances between biological and artificial neural networks.模型同型揭示了生物神经网络和人工神经网络之间的不同不变性。

Nat Neurosci. 2023 Nov;26(11):2017-2034. doi: 10.1038/s41593-023-01442-0. Epub 2023 Oct 16.

A Unifying Principle for the Functional Organization of Visual Cortex.视觉皮层功能组织的统一原则

bioRxiv. 2023 May 18:2023.05.18.541361. doi: 10.1101/2023.05.18.541361.

本文引用的文献

Unsupervised neural network models of the ventral visual stream.腹侧视觉流的无监督神经网络模型。

Proc Natl Acad Sci U S A. 2021 Jan 19;118(3). doi: 10.1073/pnas.2014196118.

Understanding the development of amblyopia using macaque monkey models.利用猕猴模型了解弱视的发展。

Proc Natl Acad Sci U S A. 2019 Dec 26;116(52):26217-26223. doi: 10.1073/pnas.1902285116. Epub 2019 Dec 23.

High-dimensional geometry of population responses in visual cortex.群体视觉皮层反应的高维几何结构。

Nature. 2019 Jul;571(7765):361-365. doi: 10.1038/s41586-019-1346-5. Epub 2019 Jun 26.

Deep convolutional models improve predictions of macaque V1 responses to natural images.深度卷积模型提高了猕猴 V1 对自然图像反应的预测。

PLoS Comput Biol. 2019 Apr 23;15(4):e1006897. doi: 10.1371/journal.pcbi.1006897. eCollection 2019 Apr.

Potential downside of high initial visual acuity.高初始视力的潜在缺点。

Proc Natl Acad Sci U S A. 2018 Oct 30;115(44):11333-11338. doi: 10.1073/pnas.1800901115. Epub 2018 Oct 15.

Convolutional neural network-based encoding and decoding of visual object recognition in space and time.基于卷积神经网络的视觉目标在空间和时间上的识别的编解码。

Neuroimage. 2018 Oct 15;180(Pt A):253-266. doi: 10.1016/j.neuroimage.2017.07.018. Epub 2017 Jul 16.

Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.将深度神经网络与人类视觉物体识别的时空皮层动力学进行比较，揭示了层级对应关系。

Sci Rep. 2016 Jun 10;6:27755. doi: 10.1038/srep27755.

Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。

IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.

Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。

J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.

Deep supervised, but not unsupervised, models may explain IT cortical representation.深度监督模型而非无监督模型可能解释IT皮层表征。

PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

增加神经网络的鲁棒性可以提高与猕猴 V1 本征谱、空间频率偏好和预测能力的匹配度。

Increasing neural network robustness improves match to macaque V1 eigenspectrum, spatial frequency preference and predictivity.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献