卷积架构是从头开始与皮质对齐的。

Convolutional architectures are cortex-aligned de novo.

作者信息

Kazemian Atlas, Elmoznino Eric, Bonner Michael F

机构信息

Department of Cognitive Science, Johns Hopkins University, Baltimore, MD, USA.

University of Montreal and MILA - Quebec AI Institute.

出版信息

bioRxiv. 2025 Jul 29:2024.05.10.593623. doi: 10.1101/2024.05.10.593623.

DOI:10.1101/2024.05.10.593623

PMID:40766564

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12324244/

Abstract

What underlies the emergence of cortex-aligned representations in deep neural network models of vision? Earlier work suggested that shared architectural constraints were a major factor, but the success of widely varied architectures after pre-training raises critical questions about the importance of architectural constraints. Here we show that in wide networks with minimal training, architectural inductive biases have a prominent role. We examined networks with varied architectures but no pre-training and quantified their ability to predict image representations in the visual cortices of monkeys and humans. We found that cortex-aligned representations emerge in convolutional architectures that combine two key manipulations of dimensionality: compression in the spatial domain, through pooling, and expansion in the feature domain by increasing the number of channels. We further show that the inductive biases of convolutional architectures are critical for obtaining performance gains from feature expansion-dimensionality manipulations were relatively ineffective in other architectures and in convolutional models with targeted lesions. Our findings suggest that the architectural constraints of convolutional networks are sufficiently close to the constraints of biological vision to allow many aspects of cortical visual representation to emerge even before synaptic connections have been tuned through experience.

摘要

在视觉深度神经网络模型中，与皮层对齐的表征是如何出现的？早期的研究表明，共享的架构约束是一个主要因素，但预训练后广泛多样的架构取得成功，引发了关于架构约束重要性的关键问题。在这里，我们表明，在经过最少训练的宽网络中，架构归纳偏差起着突出的作用。我们研究了各种架构但未经过预训练的网络，并量化了它们预测猴子和人类视觉皮层中图像表征的能力。我们发现，与皮层对齐的表征出现在结合了两种关键维度操作的卷积架构中：通过池化在空间域进行压缩，以及通过增加通道数量在特征域进行扩展。我们进一步表明，卷积架构的归纳偏差对于从特征扩展中获得性能提升至关重要——维度操作在其他架构以及具有靶向损伤的卷积模型中相对无效。我们的研究结果表明，卷积网络的架构约束与生物视觉的约束足够接近，以至于即使在通过经验调整突触连接之前，皮层视觉表征的许多方面也能出现。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

卷积架构是从头开始与皮质对齐的。

Convolutional architectures are cortex-aligned de novo.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

卷积架构是从头开始与皮质对齐的。

Convolutional architectures are cortex-aligned de novo.

作者信息

机构信息

出版信息

相似文献

本文引用的文献