• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物褶皱提高了 DNN 对噪声和泛化的鲁棒性。

Biological convolutions improve DNN robustness to noise and generalisation.

机构信息

School of Psychological Science, University of Bristol, 12a Priory Road, Bristol BS8 1TU, UK.

School of Psychological Science, University of Bristol, 12a Priory Road, Bristol BS8 1TU, UK.

出版信息

Neural Netw. 2022 Apr;148:96-110. doi: 10.1016/j.neunet.2021.12.005. Epub 2021 Dec 17.

DOI:10.1016/j.neunet.2021.12.005
PMID:35114495
Abstract

Deep Convolutional Neural Networks (DNNs) have achieved superhuman accuracy on standard image classification benchmarks. Their success has reignited significant interest in their use as models of the primate visual system, bolstered by claims of their architectural and representational similarities. However, closer scrutiny of these models suggests that they rely on various forms of shortcut learning to achieve their impressive performance, such as using texture rather than shape information. Such superficial solutions to image recognition have been shown to make DNNs brittle in the face of more challenging tests such as noise-perturbed or out-of-distribution images, casting doubt on their similarity to their biological counterparts. In the present work, we demonstrate that adding fixed biological filter banks, in particular banks of Gabor filters, helps to constrain the networks to avoid reliance on shortcuts, making them develop more structured internal representations and more tolerance to noise. Importantly, they also gained around 20-35% improved accuracy when generalising to our novel out-of-distribution test image sets over standard end-to-end trained architectures. We take these findings to suggest that these properties of the primate visual system should be incorporated into DNNs to make them more able to cope with real-world vision and better capture some of the more impressive aspects of human visual perception such as generalisation.

摘要

深度卷积神经网络 (DNN) 在标准图像分类基准测试中取得了超人的准确性。它们的成功重新激发了人们对将其用作灵长类视觉系统模型的浓厚兴趣,这一说法得到了它们在架构和表示方面相似性的支持。然而,对这些模型的更仔细审查表明,它们依赖于各种形式的捷径学习来实现令人印象深刻的性能,例如使用纹理而不是形状信息。这种对图像识别的表面解决方案已被证明使 DNN 在面对更具挑战性的测试(例如噪声干扰或分布外图像)时变得脆弱,这使人对它们与生物对应物的相似性产生了怀疑。在本工作中,我们证明了添加固定的生物滤波器组(特别是 Gabor 滤波器组)有助于限制网络避免依赖捷径,从而使它们开发出更具结构的内部表示,并对噪声具有更高的容忍度。重要的是,与标准的端到端训练架构相比,当它们推广到我们新颖的分布外测试图像集时,它们的准确性也提高了约 20-35%。我们认为这些灵长类视觉系统的特性应该被纳入 DNN 中,以使它们更能够应对现实世界的视觉,并更好地捕捉人类视觉感知的一些更令人印象深刻的方面,例如泛化。

相似文献

1
Biological convolutions improve DNN robustness to noise and generalisation.生物褶皱提高了 DNN 对噪声和泛化的鲁棒性。
Neural Netw. 2022 Apr;148:96-110. doi: 10.1016/j.neunet.2021.12.005. Epub 2021 Dec 17.
2
Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.经噪声训练的深度神经网络能有效预测人类视觉及其对挑战性图像的神经反应。
PLoS Biol. 2021 Dec 9;19(12):e3001418. doi: 10.1371/journal.pbio.3001418. eCollection 2021 Dec.
3
Divergences in color perception between deep neural networks and humans.深度神经网络与人类在颜色感知上的差异。
Cognition. 2023 Dec;241:105621. doi: 10.1016/j.cognition.2023.105621. Epub 2023 Sep 14.
4
Deep Convolutional Neural Networks Outperform Feature-Based But Not Categorical Models in Explaining Object Similarity Judgments.在解释物体相似性判断方面,深度卷积神经网络的表现优于基于特征的模型,但不优于分类模型。
Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.
5
The role of capacity constraints in Convolutional Neural Networks for learning random versus natural data.容量限制在用于学习随机数据与自然数据的卷积神经网络中的作用。
Neural Netw. 2023 Apr;161:515-524. doi: 10.1016/j.neunet.2023.01.011. Epub 2023 Feb 4.
6
Improving robustness of a deep learning-based lung-nodule classification model of CT images with respect to image noise.提高基于深度学习的 CT 图像肺结节分类模型对图像噪声鲁棒性。
Phys Med Biol. 2021 Dec 7;66(24). doi: 10.1088/1361-6560/ac3d16.
7
The developmental trajectory of object recognition robustness: Children are like small adults but unlike big deep neural networks.客体识别稳健性的发展轨迹:儿童似小大人而非大深度神经网络。
J Vis. 2023 Jul 3;23(7):4. doi: 10.1167/jov.23.7.4.
8
Deep neural networks and image classification in biological vision.深度神经网络与生物视觉中的图像分类。
Vision Res. 2022 Aug;197:108058. doi: 10.1016/j.visres.2022.108058. Epub 2022 Apr 26.
9
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
10
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.

引用本文的文献

1
Covariant spatio-temporal receptive fields for spiking neural networks.用于脉冲神经网络的协变时空感受野
Nat Commun. 2025 Sep 5;16(1):8231. doi: 10.1038/s41467-025-63493-0.
2
Deep neural networks have an inbuilt Occam's razor.深度神经网络具有内在的奥卡姆剃刀原理。
Nat Commun. 2025 Jan 14;16(1):220. doi: 10.1038/s41467-024-54813-x.
3
Neuroscientific insights about computer vision models: a concise review.神经科学对视知觉模型的启示:简要综述。
Biol Cybern. 2024 Dec;118(5-6):331-348. doi: 10.1007/s00422-024-00998-9. Epub 2024 Oct 9.
4
A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations.一种新颖的特征混淆方法揭示了卷积神经网络学习空间关系的能力。
Neural Netw. 2023 Oct;167:400-414. doi: 10.1016/j.neunet.2023.08.021. Epub 2023 Aug 18.
5
The neuroconnectionist research programme.神经连接主义研究计划。
Nat Rev Neurosci. 2023 Jul;24(7):431-450. doi: 10.1038/s41583-023-00705-w. Epub 2023 May 30.
6
From photos to sketches - how humans and deep neural networks process objects across different levels of visual abstraction.从照片到素描——人类和深度神经网络如何在不同层次的视觉抽象中处理对象。
J Vis. 2022 Feb 1;22(2):4. doi: 10.1167/jov.22.2.4.