通过在卷积神经网络中加入对模糊的鲁棒性来改进对人类视觉的建模。

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.

机构信息

Department of Psychology, Vanderbilt Vision Research Center, Vanderbilt University, Nashville, TN, USA.

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.

出版信息

Nat Commun. 2024 Mar 5;15(1):1989. doi: 10.1038/s41467-024-45679-0.

DOI:10.1038/s41467-024-45679-0

PMID:38443349

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10915141/

Abstract

Whenever a visual scene is cast onto the retina, much of it will appear degraded due to poor resolution in the periphery; moreover, optical defocus can cause blur in central vision. However, the pervasiveness of blurry or degraded input is typically overlooked in the training of convolutional neural networks (CNNs). We hypothesized that the absence of blurry training inputs may cause CNNs to rely excessively on high spatial frequency information for object recognition, thereby causing systematic deviations from biological vision. We evaluated this hypothesis by comparing standard CNNs with CNNs trained on a combination of clear and blurry images. We show that blur-trained CNNs outperform standard CNNs at predicting neural responses to objects across a variety of viewing conditions. Moreover, blur-trained CNNs acquire increased sensitivity to shape information and greater robustness to multiple forms of visual noise, leading to improved correspondence with human perception. Our results provide multi-faceted neurocomputational evidence that blurry visual experiences may be critical for conferring robustness to biological visual systems.

摘要

每当视觉场景投射到视网膜上时，由于周边分辨率较差，其中大部分会显得退化; 此外，光学散焦会导致中央视力模糊。然而，在卷积神经网络 (CNN) 的训练中，通常会忽略输入模糊或退化的普遍性。我们假设，缺乏模糊的训练输入可能导致 CNN 过度依赖对象识别的高空间频率信息，从而导致与生物视觉的系统偏差。我们通过将标准 CNN 与在清晰和模糊图像组合上训练的 CNN 进行比较来评估这一假设。我们表明，在预测各种观察条件下物体的神经反应方面，模糊训练的 CNN 优于标准 CNN。此外，模糊训练的 CNN 对形状信息的敏感性增加，对多种形式的视觉噪声的鲁棒性增强，从而提高了与人类感知的一致性。我们的研究结果提供了多方面的神经计算证据，表明模糊的视觉体验对于赋予生物视觉系统鲁棒性可能至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0cda/10915141/2269e82707fd/41467_2024_45679_Fig1_HTML.jpg

相似文献

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.

Nat Commun. 2024 Mar 5;15(1):1989. doi: 10.1038/s41467-024-45679-0.

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.

bioRxiv. 2023 Jul 31:2023.07.29.551089. doi: 10.1101/2023.07.29.551089.

Convolutional neural networks trained with a developmental sequence of blurry to clear images reveal core differences between face and object processing.

J Vis. 2021 Nov 1;21(12):6. doi: 10.1167/jov.21.12.6.

Human peripheral blur is optimal for object recognition.

Vision Res. 2022 Nov;200:108083. doi: 10.1016/j.visres.2022.108083. Epub 2022 Jul 10.

Does training with blurred images bring convolutional neural networks closer to humans with respect to robust object recognition and internal representations?

Front Psychol. 2023 Feb 15;14:1047694. doi: 10.3389/fpsyg.2023.1047694. eCollection 2023.

A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.

Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.

Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision.

J Vis. 2021 Sep 1;21(10):14. doi: 10.1167/jov.21.10.14.

A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations.

Neural Netw. 2023 Oct;167:400-414. doi: 10.1016/j.neunet.2023.08.021. Epub 2023 Aug 18.

Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.

J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.

Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.

Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

引用本文的文献

Potential role of developmental experience in the emergence of the parvo-magno distinction.

Commun Biol. 2025 Jul 3;8(1):987. doi: 10.1038/s42003-025-08382-4.

Fast and robust visual object recognition in young children.

Sci Adv. 2025 Jul 4;11(27):eads6821. doi: 10.1126/sciadv.ads6821. Epub 2025 Jul 2.

Transferable polychromatic optical encoder for neural networks.

Nat Commun. 2025 Jul 1;16(1):5623. doi: 10.1038/s41467-025-61338-4.

Configural processing as an optimized strategy for robust object recognition in neural networks.

Commun Biol. 2025 Mar 7;8(1):386. doi: 10.1038/s42003-025-07672-1.

Unraveling the complexity of rat object vision requires a full convolutional network and beyond.

Patterns (N Y). 2025 Jan 17;6(2):101149. doi: 10.1016/j.patter.2024.101149. eCollection 2025 Feb 14.

Convolutional neural network models applied to neuronal responses in macaque V1 reveal limited nonlinear processing.

J Vis. 2024 Jun 3;24(6):1. doi: 10.1167/jov.24.6.1.

本文引用的文献

Does training with blurred images bring convolutional neural networks closer to humans with respect to robust object recognition and internal representations?

Front Psychol. 2023 Feb 15;14:1047694. doi: 10.3389/fpsyg.2023.1047694. eCollection 2023.

Increasing neural network robustness improves match to macaque V1 eigenspectrum, spatial frequency preference and predictivity.

PLoS Comput Biol. 2022 Jan 7;18(1):e1009739. doi: 10.1371/journal.pcbi.1009739. eCollection 2022 Jan.

Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.

PLoS Biol. 2021 Dec 9;19(12):e3001418. doi: 10.1371/journal.pbio.3001418. eCollection 2021 Dec.

Convolutional neural networks trained with a developmental sequence of blurry to clear images reveal core differences between face and object processing.

J Vis. 2021 Nov 1;21(12):6. doi: 10.1167/jov.21.12.6.

Limits to visual representational correspondence between convolutional neural networks and the human brain.

Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.

A map of object space in primate inferotemporal cortex.

Nature. 2020 Jul;583(7814):103-108. doi: 10.1038/s41586-020-2350-5. Epub 2020 Jun 3.

THINGS: A database of 1,854 object concepts and more than 26,000 naturalistic object images.

PLoS One. 2019 Oct 15;14(10):e0223792. doi: 10.1371/journal.pone.0223792. eCollection 2019.

Recurrence is required to capture the representational dynamics of the human visual system.

Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.

Evolving Images for Visual Neurons Using a Deep Generative Network Reveals Coding Principles and Neuronal Preferences.

Cell. 2019 May 2;177(4):999-1009.e10. doi: 10.1016/j.cell.2019.04.005.

Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior.

Nat Neurosci. 2019 Jun;22(6):974-983. doi: 10.1038/s41593-019-0392-5. Epub 2019 Apr 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过在卷积神经网络中加入对模糊的鲁棒性来改进对人类视觉的建模。

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献