经噪声训练的深度神经网络能有效预测人类视觉及其对挑战性图像的神经反应。

Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.

机构信息

Psychology Department and Vanderbilt Vision Research Center, Vanderbilt University, Nashville, Tennessee, United States of America.

出版信息

PLoS Biol. 2021 Dec 9;19(12):e3001418. doi: 10.1371/journal.pbio.3001418. eCollection 2021 Dec.

DOI:10.1371/journal.pbio.3001418

PMID:34882676

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8659651/

Abstract

Deep neural networks (DNNs) for object classification have been argued to provide the most promising model of the visual system, accompanied by claims that they have attained or even surpassed human-level performance. Here, we evaluated whether DNNs provide a viable model of human vision when tested with challenging noisy images of objects, sometimes presented at the very limits of visibility. We show that popular state-of-the-art DNNs perform in a qualitatively different manner than humans-they are unusually susceptible to spatially uncorrelated white noise and less impaired by spatially correlated noise. We implemented a noise training procedure to determine whether noise-trained DNNs exhibit more robust responses that better match human behavioral and neural performance. We found that noise-trained DNNs provide a better qualitative match to human performance; moreover, they reliably predict human recognition thresholds on an image-by-image basis. Functional neuroimaging revealed that noise-trained DNNs provide a better correspondence to the pattern-specific neural representations found in both early visual areas and high-level object areas. A layer-specific analysis of the DNNs indicated that noise training led to broad-ranging modifications throughout the network, with greater benefits of noise robustness accruing in progressively higher layers. Our findings demonstrate that noise-trained DNNs provide a viable model to account for human behavioral and neural responses to objects in challenging noisy viewing conditions. Further, they suggest that robustness to noise may be acquired through a process of visual learning.

摘要

用于目标分类的深度神经网络 (DNN) 被认为提供了最有前途的视觉系统模型，并声称它们已经达到甚至超过了人类水平的性能。在这里，我们评估了当使用具有挑战性的嘈杂物体图像进行测试时，DNN 是否为人类视觉提供了可行的模型，这些图像有时在可见度的极限呈现。我们表明，流行的最先进的 DNN 以与人类不同的方式表现-它们对空间上不相关的白噪声异常敏感，而对空间上相关的噪声的干扰较小。我们实施了噪声训练程序，以确定噪声训练的 DNN 是否表现出更稳健的响应，更能匹配人类的行为和神经表现。我们发现，经过噪声训练的 DNN 提供了与人类性能更好的定性匹配；此外，它们可以可靠地预测图像的人类识别阈值。功能神经影像学显示，经过噪声训练的 DNN 与在早期视觉区域和高级物体区域中发现的特定于模式的神经表示更匹配。对 DNN 的层特异性分析表明，噪声训练导致整个网络的广泛修改，在逐渐更高的层中获得了更大的噪声稳健性收益。我们的研究结果表明，经过噪声训练的 DNN 为解释人类在具有挑战性的嘈杂观察条件下对物体的行为和神经反应提供了可行的模型。此外，它们表明对噪声的稳健性可能是通过视觉学习过程获得的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/8659651/df8743df085d/pbio.3001418.g001.jpg

相似文献

Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.

PLoS Biol. 2021 Dec 9;19(12):e3001418. doi: 10.1371/journal.pbio.3001418. eCollection 2021 Dec.

Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.

J Neurosci. 2023 Mar 8;43(10):1731-1741. doi: 10.1523/JNEUROSCI.1424-22.2022. Epub 2023 Feb 9.

Deep Convolutional Neural Networks Outperform Feature-Based But Not Categorical Models in Explaining Object Similarity Judgments.

Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.

The developmental trajectory of object recognition robustness: Children are like small adults but unlike big deep neural networks.

J Vis. 2023 Jul 3;23(7):4. doi: 10.1167/jov.23.7.4.

Biological convolutions improve DNN robustness to noise and generalisation.

Neural Netw. 2022 Apr;148:96-110. doi: 10.1016/j.neunet.2021.12.005. Epub 2021 Dec 17.

Noise increases the correspondence between artificial and human vision.

PLoS Biol. 2021 Dec 10;19(12):e3001477. doi: 10.1371/journal.pbio.3001477. eCollection 2021 Dec.

Are Deep Neural Networks Adequate Behavioral Models of Human Visual Perception?

Annu Rev Vis Sci. 2023 Sep 15;9:501-524. doi: 10.1146/annurev-vision-120522-031739. Epub 2023 Mar 31.

Improving robustness of a deep learning-based lung-nodule classification model of CT images with respect to image noise.

Phys Med Biol. 2021 Dec 7;66(24). doi: 10.1088/1361-6560/ac3d16.

Deep problems with neural network models of human vision.

Behav Brain Sci. 2022 Dec 1;46:e385. doi: 10.1017/S0140525X22002813.

Brain-like illusion produced by Skye's Oblique Grating in deep neural networks.

PLoS One. 2024 Feb 23;19(2):e0299083. doi: 10.1371/journal.pone.0299083. eCollection 2024.

引用本文的文献

Machine Learning Techniques for Simulating Human Psychophysical Testing of Low-Resolution Phosphene Face Images in Artificial Vision.

Adv Sci (Weinh). 2025 Apr;12(15):e2405789. doi: 10.1002/advs.202405789. Epub 2025 Feb 22.

Perceptual Expertise and Attention: An Exploration using Deep Neural Networks.

bioRxiv. 2024 Oct 16:2024.10.15.617743. doi: 10.1101/2024.10.15.617743.

Convolutional neural network models applied to neuronal responses in macaque V1 reveal limited nonlinear processing.

J Vis. 2024 Jun 3;24(6):1. doi: 10.1167/jov.24.6.1.

Human-Unrecognizable Differential Private Noised Image Generation Method.

Sensors (Basel). 2024 May 16;24(10):3166. doi: 10.3390/s24103166.

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks.

Nat Commun. 2024 Mar 5;15(1):1989. doi: 10.1038/s41467-024-45679-0.

Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions.

PLoS Biol. 2023 Dec 13;21(12):e3002366. doi: 10.1371/journal.pbio.3002366. eCollection 2023 Dec.

Model metamers reveal divergent invariances between biological and artificial neural networks.

Nat Neurosci. 2023 Nov;26(11):2017-2034. doi: 10.1038/s41593-023-01442-0. Epub 2023 Oct 16.

Spatial transformation of multi-omics data unlocks novel insights into cancer biology.

Elife. 2023 Sep 5;12:RP87133. doi: 10.7554/eLife.87133.

Graph convolutional network-based feature selection for high-dimensional and low-sample size data.

Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad135.

Guiding visual attention in deep convolutional neural networks based on human eye movements.

Front Neurosci. 2022 Sep 13;16:975639. doi: 10.3389/fnins.2022.975639. eCollection 2022.

本文引用的文献

Limits to visual representational correspondence between convolutional neural networks and the human brain.

Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.

A map of object space in primate inferotemporal cortex.

Nature. 2020 Jul;583(7814):103-108. doi: 10.1038/s41586-020-2350-5. Epub 2020 Jun 3.

Recurrence is required to capture the representational dynamics of the human visual system.

Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.

Evolving Images for Visual Neurons Using a Deep Generative Network Reveals Coding Principles and Neuronal Preferences.

Cell. 2019 May 2;177(4):999-1009.e10. doi: 10.1016/j.cell.2019.04.005.

Neural population control via deep image synthesis.

Science. 2019 May 3;364(6439). doi: 10.1126/science.aav9436.

Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior.

Nat Neurosci. 2019 Jun;22(6):974-983. doi: 10.1038/s41593-019-0392-5. Epub 2019 Apr 29.

Bayesian analysis of retinotopic maps.

Elife. 2018 Dec 6;7:e40224. doi: 10.7554/eLife.40224.

Mid-level visual features underlie the high-level categorical organization of the ventral stream.

Proc Natl Acad Sci U S A. 2018 Sep 18;115(38):E9015-E9024. doi: 10.1073/pnas.1719616115. Epub 2018 Aug 31.

Recurrent computations for visual pattern completion.

Proc Natl Acad Sci U S A. 2018 Aug 28;115(35):8835-8840. doi: 10.1073/pnas.1719397115. Epub 2018 Aug 13.

Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

经噪声训练的深度神经网络能有效预测人类视觉及其对挑战性图像的神经反应。

Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献